Available API Models
This page lists all models accessible via the Schatzi AI REST API using API key authentication. All models are hosted on Swiss infrastructure, ensuring your data never leaves Switzerland.
Some models are available in both the Chat UI and the API, while others are API-exclusive. API-exclusive models are not visible in the Chat UI.
To use a specific model, pass its exact display name or ID as the model parameter in your API requests.
Model Overview
The following table provides a complete comparison of all models available via the API.
Pricing is subject to change at our discretion.
Models by Category
Swiss LLM (AI Act Compliant)
- Apertus Swiss LLM - Large
Ideal for multilingual services, government agencies, and R&D teams. Compliant with the AI Act and respectful of privacy and intellectual property.
- Capabilities: Chat, Multi-lingual, Swiss LLM
- Pricing: Input ... / Output ...
- Availability: Also available in Chat UI
- Apertus Swiss LLM - Small
Optimized for multilingual dialogue use cases.
- Capabilities: Chat, Multi-lingual, Swiss LLM
- Pricing: Input ... / Output ...
- Availability: API exclusive
Vision & Document Analysis
- Document Analysis - Medium
Optimized for compact and efficient vision-language tasks.
- Capabilities: Document Analysis, Chat, Vision, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Document Analysis - Small
Optimized for handling text and image input and generating text output.
- Capabilities: Document Analysis, Chat, Vision
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Document Analysis - Xtra Small
Optimized for compact and efficient vision-language model tasks.
- Capabilities: Document Analysis, Chat, Vision
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Llama 4 Maverick multi modal - Small
Optimized for text and multimodal experiences.
- Capabilities: Document Analysis, Chat, Vision, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Document Analysis & OCR - Small (DeepSeek OCR)
Specialized for optical character recognition and document understanding. Excels at converting documents to structured text/markdown and table extraction.
- Capabilities: Document Analysis, Vision
- Pricing: Input ... / Output ...
- Availability: API exclusive
- inference-miner-u25
Vision-language model optimized for document analysis and parsing.
- Capabilities: Vision, Document Analysis
- Pricing: Input ... / Output ...
- Availability: API exclusive
Reasoning & Problem-Solving
- Fast Reasoning & Instruction Following - Small
Optimized for reasoning and instruction-following capabilities.
- Capabilities: Thinking, Chat, Data Analysis, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Reasoning & Problem Solving - Small
Optimized for thinking, reasoning, and reasoning chat completions.
- Capabilities: Thinking, Chat, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Reasoning & Agent tasks - Large
Optimized for powerful reasoning, agentic tasks, and versatile developer use cases.
- Capabilities: Data Analysis, Chat, Thinking, Agent, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Reasoning & Problem Solving - Medium
Optimized for thinking and reasoning.
- Capabilities: Thinking, Chat, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Reasoning & Problem Solving - Xtra Large
Optimized for reasoning chat completions.
- Capabilities: Thinking, Chat, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Reasoning & Tool Use - Large (GLM-4.5 Air)
Mixture-of-Experts model with hybrid reasoning, strong tool/function calling, and code generation.
- Capabilities: Thinking, Chat, Function Calling
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Chat & Document Analysis & Reasoning - Large
Large-scale model delivering frontier-level performance across complex tasks with advanced multilingual capabilities.
- Capabilities: Document Analysis, Chat, Vision, Function Calling, Reasoning
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Chat, Document Analysis, Coding & Reasoning - Xtra Large
Multi-modal model optimized for chat, document analysis, coding, and reasoning.
- Capabilities: Chat, Document Analysis, Coding, Thinking, Data Analysis, Vision, Function Calling
- Pricing: Input ... / Output ...
- Availability: Also available in Chat UI
- Chat, Vision, Document Analysis & Reasoning - Medium
Best-in-class multi-modal model optimized for chat, vision, document analysis, coding, and reasoning.
- Capabilities: Chat, Vision, Document Analysis, Coding, Thinking, Function Calling
- Pricing: Input ... / Output ...
- Availability: Also available in Chat UI
- Chat, Document Analysis & Agent tasks - Xtra Large
Very large-scale model delivering frontier-level performance. Optimized for powerful reasoning and agentic tasks.
- Capabilities: Chat, Document Analysis, Agent, Coding, Thinking, Web Search, Vision, Function Calling
- Pricing: Input ... / Output ...
- Availability: Also available in Chat UI
Multilingual
- Llama 3.3 Multi-lingual - Medium
Optimized for multilingual dialogue use cases.
- Capabilities: Chat, Multi-lingual
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Chat & Function Calling - Small (Granite 3.1)
Long-context model optimized for instruction following, RAG, and function calling. Supports 12 languages.
- Capabilities: Chat, Function Calling, Multi-lingual
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Chat & Document Analysis - Xtra Xtra Large
Optimized for multilingual dialogue use cases.
- Capabilities: Chat, Multi-lingual
- Pricing: Input ... / Output ...
- Availability: API exclusive
- Chat, Multi-lingual, Coding & function calling - Small
Versatile small model for chat, coding, and multilingual tasks.
- Capabilities: Chat, Multi-lingual, Coding, Function Calling
- Pricing: Input ... / Output ...
- Availability: Also available in Chat UI
Chat & General Purpose
- Search, Chat & Analysis - Small
Optimized for web search and chat. Suitable for content creation and storytelling.
- Capabilities: Web Search, Chat, Vision
- Pricing: Input ... / Output ...
- Availability: API exclusive
Quick Start
You can interact with any of the available models using a standard REST request. Here is an example using the Apertus Swiss LLM - Large model:
curl https://backend.schatziai.ch/v1/chat/completions \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{"model": "Apertus Swiss LLM - Large", "messages": [{"role": "user", "content": "Hello"}]}'
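For callers working in Python rather than curl, the same request can be sketched with the standard library alone. The URL, headers, and JSON body mirror the curl example; the helper names build_chat_request and send_request are illustrative and not part of any Schatzi AI SDK. The response schema is not described on this page, so the sketch returns the decoded JSON as-is instead of reading assumed fields.

```python
import json
import urllib.request

# Base URL taken from the curl example above.
API_BASE = "https://backend.schatziai.ch/v1"


def build_chat_request(api_key: str, model: str, user_message: str) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl example.

    `model` must be the exact display name or ID listed on this page.
    Serializing with json.dumps keeps names containing spaces, '&', or
    parentheses intact without manual quoting.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
    }
    return urllib.request.Request(
        f"{API_BASE}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_request(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send the request and return the decoded JSON body unchanged.

    Hypothetical helper: it makes no assumption about the response fields.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

With a valid key, send_request(build_chat_request("sk-your-api-key", "Apertus Swiss LLM - Large", "Hello")) would issue the call. Building the request separately from sending it makes it possible to inspect the payload without touching the network.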
Listing Models Programmatically
To retrieve a list of models available to your specific account and authentication level, use the following endpoint:
GET /v1/models
This endpoint returns the model IDs, capabilities, and current pricing for all models you are authorized to use. For full details on the request and response format, see API Endpoints.
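A minimal Python sketch of calling this endpoint follows. It rests on two assumptions that this page does not confirm: that the endpoint accepts the same Bearer header as the Quick Start example, and that the response uses an OpenAI-style envelope of the form {"data": [{"id": ...}, ...]}. Check both against API Endpoints before relying on them. The names build_models_request and extract_model_ids are illustrative.

```python
import json
import urllib.request

# Base URL taken from the Quick Start curl example.
API_BASE = "https://backend.schatziai.ch/v1"


def build_models_request(api_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for /v1/models.

    ASSUMPTION: the endpoint uses the same Bearer authentication as the
    chat completions example.
    """
    return urllib.request.Request(
        f"{API_BASE}/models",
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def extract_model_ids(response_body: dict) -> list[str]:
    """Collect model IDs from a decoded response body.

    ASSUMPTION: the body looks like {"data": [{"id": "..."}, ...]}.
    Entries without an "id" key are skipped rather than raising.
    """
    entries = response_body.get("data", [])
    return [
        entry["id"]
        for entry in entries
        if isinstance(entry, dict) and "id" in entry
    ]


def list_model_ids(api_key: str, timeout: float = 30.0) -> list[str]:
    """Fetch the model list and return the IDs (performs a network call)."""
    with urllib.request.urlopen(build_models_request(api_key), timeout=timeout) as response:
        return extract_model_ids(json.load(response))
```

Any ID returned this way can then be passed as the model parameter in later requests, which avoids hard-coding display names that may change.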