Pricing
Flat monthly inference. No token counting.
Three tiers. One fixed rate each. No per-token billing regardless of usage.
A single monthly amount covers all inference across the model registry. No metered billing, no usage alerts, no end-of-month reconciliation. Your data stays in your chosen region and is never used for training.
Per-token billing passes infrastructure cost variance directly to the caller. A flat monthly rate decouples your costs from your usage patterns and makes inference budgeting a one-line entry.
Plans
Developer
£19
per month
- Full model registry access
- CPU inference, rate-limited
- Single API key
- Output storage included
- Email support
Team
£79
per month
- Full model registry access
- CPU and GPU inference
- Up to five API keys
- Output storage included
- Eval surface included
- Priority support
Pro
£299
per month
- Everything in Team
- Dedicated capacity
- Custom model onboarding
- Unlimited API keys
- GDPR data processing agreement
- Dedicated account support
- SLA available
Guest access is available with no account required; rate limits apply. It is sufficient to evaluate the API and compare model outputs across the registry.
Request access

Comparison
What changes with flat-rate pricing
The differences between flat monthly pricing and per-token billing go beyond cost: model choice, data jurisdiction, and training policy differ as well.
| Criterion | Marigold | GPT-4.1-mini | Claude Haiku |
|---|---|---|---|
| Pricing model | Flat monthly | Per token | Per token |
| Cost predictability | Fixed | Variable | Variable |
| Model choice | Open-weight registry | GPT-4.1-mini only | Claude Haiku only |
| Data jurisdiction | UK / EU (your region) | US (OpenAI) | US / EU (Anthropic) |
| Training on prompts | Never | Opt-out required | No |
| Bring your own weights | Yes (Pro) | No | No |
Frequently asked
Does Marigold train on my data?
No. Inference requests are not retained for training, fine-tuning, or any other purpose. Outputs are stored briefly for retrieval and then deleted. The Pro tier includes a GDPR data processing agreement that sets this out contractually.
What models does Marigold support?
The hosted registry includes Qwen2.5 instruct variants (1.5B, 7B, 14B), Mistral 7B Instruct, PaliGemma 3B for image-to-text, CLIP for image and text embedding, the facebook/mms-tts family for text-to-speech in English, Welsh, French, German, Spanish, Finnish, and Dutch, plus depth estimation and segmentation models. Custom model onboarding is available on the Pro tier. See the full registry.
Is there a free tier?
Yes. Guest access requires no account. Rate limits apply per IP across text-embedding and instruct model types. Paid plans lift all rate limits and add GPU model access.
Where does my data go?
Inference runs on private AWS infrastructure in London (UK) by default. Data does not leave that region. No third-party model provider receives your inputs. Output references are purged after a configurable retention window.
What is the OpenAI-compatible endpoint?
POST /v1/chat/completions accepts the same request shape as the OpenAI Chat Completions API. Switch the base URL and API key; the model parameter maps to a Marigold model name.
The developer page has a before-and-after example.
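As a minimal sketch of what "same request shape" means, the snippet below assembles a standard Chat Completions request body. The base URL and model id are illustrative placeholders, not documented Marigold values; check your account and the registry for the real ones.

```python
import json
from urllib import request

def build_chat_request(base_url, api_key, model, messages):
    """Assemble an OpenAI-shaped Chat Completions request for a Marigold model.

    base_url, api_key, and model are placeholders: substitute the values
    from your Marigold account and the registry's model names.
    """
    url = base_url.rstrip("/") + "/v1/chat/completions"
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = {
        "model": model,      # maps to a Marigold model name
        "messages": messages,
    }
    return request.Request(
        url, data=json.dumps(body).encode(), headers=headers, method="POST"
    )

# Illustrative only: the host and model id below are assumptions.
req = build_chat_request(
    "https://api.marigold.example",   # placeholder base URL
    "YOUR_API_KEY",
    "qwen2.5-7b-instruct",            # see the registry for exact model names
    [{"role": "user", "content": "Say hello."}],
)
# urllib.request.urlopen(req) would send it; the JSON response mirrors
# the OpenAI Chat Completions response shape.
```

If you already use the official `openai` Python client, the same switch is typically just constructing the client with a custom `base_url` and your Marigold API key.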
Can I run Qwen or Mistral via Marigold?
Yes. Qwen2.5 instruct variants and Mistral 7B Instruct are in the hosted registry. Submit via the inference API or the OpenAI-compatible endpoint. See the IDE setup guide if you want to use these models inside Cursor, Continue, or Aider.
Is Marigold GDPR-compliant?
The infrastructure is designed for UK and EU data residency. Inference does not leave your chosen region. Pro tier accounts can obtain a signed data processing agreement. Marigold does not act as data controller for inference inputs; your organisation remains controller.
What is the difference between Marigold and self-hosting a model directly?
Self-hosting requires provisioning compute, managing model weights, implementing an inference API, and maintaining all of it. Marigold provides the typed async API, weight caching, and an eval surface as a managed layer on private AWS infrastructure. You control the deployment region and the model selection.
Know what inference costs before you build.
Paid plans are in limited release. Leave your email and we will reach out when capacity opens.
Join the waitlist
No spam. One email when your tier opens.