Question 1

What is the difference between the Human, Developer, and Agentic tiers?

Accepted Answer

The tiers are priced by usage pattern, not seat count. Human covers interactive use where a person is driving each request. Developer covers automation and pipelines where a human initiates runs but does not sit in the loop. Agentic covers production systems running continuously without human initiation, where flat-rate pricing cannot be sustained and dedicated infrastructure is required.

Question 2

How does Agentic tier pricing work?

Accepted Answer

Agentic is provisioned capacity, not a flat rate. The monthly minimum is a retainer that reserves dedicated GPU infrastructure for your workloads. Usage scales with your agents up to an agreed ceiling. Capacity is reserved exclusively for your account; your agents are never queued behind other customers. Accounts that consistently use significantly below their provisioned capacity are moved to the Developer tier.

Question 3

Does Marigold train on my data?

Accepted Answer

No. Inference requests are not retained for training, fine-tuning, or any other purpose. Outputs are stored briefly for retrieval and then deleted. Agentic tier accounts can obtain a GDPR data processing agreement.

Question 4

What models does Marigold support?

Accepted Answer

The hosted registry includes Qwen2.5 instruct variants (1.5B, 7B, 14B), Mistral 7B Instruct, PaliGemma 3B for image-to-text, CLIP for image and text embedding, the facebook/mms-tts family for text-to-speech in English, Welsh, French, German, Spanish, Finnish, and Dutch, plus depth estimation and segmentation models.

Question 5

Is there a free tier?

Accepted Answer

Yes. Guest access requires no account. Rate limits apply per IP across text-embedding and instruct model types. Paid plans lift rate limits and add GPU model access.

Question 6

Where does my data go?

Accepted Answer

Inference runs on private AWS infrastructure in London (UK) by default. Data does not leave that region. No third-party model provider receives your inputs.

Question 7

What is the OpenAI-compatible endpoint?

Accepted Answer

POST /v1/chat/completions accepts the same request shape as the OpenAI Chat Completions API. Switch the base URL and API key; the model parameter maps to a Marigold model name. No other code changes are required.

Priced by how you use it.

Plans

How the tiers compare

Frequently asked

Know what inference costs before you build.

Join the waitlist

Usage pattern	Human	Developer	Agentic
IDE and interactive use	Yes	Yes	Yes
Scripted automation	Limited	Yes	Yes
Continuous agent loops	No	No	Yes
GPU model access	No	Yes	Yes
Dedicated capacity	No	No	Yes
Pricing model	Flat monthly	Flat monthly	Provisioned
Data processing agreement	No	No	Yes