Model Profiles
Schatzi AI hosts all models exclusively on Swiss infrastructure, ensuring your data never leaves Switzerland. This page provides detailed technical specifications, capabilities, and pricing for each available model to help you select the optimal solution for your business requirements.
For guidance on selecting the right model for your use case, see /ai-models/choosing-model. To understand how token pricing works, visit /subscription-billing/understanding-tokens.
Apertus Swiss LLM - Large
Description
Designed for organizations requiring maximum transparency and regulatory compliance, this 70B parameter model serves as a reliable foundation for multilingual services, government applications, and research initiatives. With fully documented training methodologies and strict adherence to AI Act requirements, it prioritizes data privacy and intellectual property protection while delivering market-leading performance.
Specifications
- Context window: 65,536 tokens
- Max output tokens: Not specified
- Vision support: No
- Reasoning mode: No
- Function calling: No
- Streaming: Yes
Ideal Use Cases
- Government service automation and citizen support
- Regulatory compliance documentation and reporting
- Academic research and R&D documentation analysis
- Multilingual public sector chatbots
- Legal document review with transparency requirements
- Cross-border administrative processes
Strengths
- Complete training transparency and methodology documentation
- Full AI Act compliance for regulated industries
- Robust multilingual capabilities for European languages
- 70B parameter scale delivering high performance
- Guaranteed Swiss data sovereignty
Limitations
- No vision or image analysis capabilities
- No function calling support for tool integration
- Higher per-token cost compared to smaller alternatives
Pricing
- Input: ... per million tokens
- Output: ... per million tokens
When to Use
Choose this model when regulatory compliance, training transparency, and data sovereignty are non-negotiable requirements. It is specifically engineered for government agencies, research institutions, and organizations handling sensitive information that must remain within Swiss jurisdiction while maintaining full auditability of AI decision-making processes.
When to Choose a Different Model
For applications requiring vision capabilities or image analysis, use Chat & Document Analysis - Large or Chat, Document Analysis & Agent tasks - Xtra Large. If your workflow requires function calling or autonomous agent capabilities, select Reasoning & Agent tasks - Large or Chat, Document Analysis & Agent tasks - Xtra Large instead.
Chat & Document Analysis - Medium
Description
A versatile workhorse designed for rapid document processing and conversational applications. This model delivers instant responses with robust contextual understanding across all major European languages, making it ideal for businesses operating in multilingual environments that require efficient text processing without unnecessary complexity.
Specifications
- Context window: 128,000 tokens
- Max output tokens: Not specified
- Vision support: No
- Reasoning mode: No
- Function calling: No
- Streaming: Yes
Ideal Use Cases
- High-volume document processing and summarization
- Multilingual customer support automation
- Content localization and translation workflows
- Internal knowledge base queries and retrieval
- Batch report analysis and extraction
- Email drafting and business correspondence
Strengths
- Extensive 128K context window for large documents
- Highly cost-effective pricing structure
- Fast response times with streaming support
- Comprehensive European language coverage
- Reliable performance for text-centric workflows
Limitations
- No vision or multimodal capabilities
- No function calling for external tool integration
- Not optimized for complex multi-step reasoning
Pricing
- Input: ... per million tokens
- Output: ... per million tokens
When to Use
Deploy this model for processing large documents or high-volume text analysis where cost efficiency and speed matter more than advanced reasoning or multimodal capabilities. It excels at straightforward document analysis, chat applications, and any workflow where the 128K context window can accommodate entire reports or extensive conversation histories.
When to Choose a Different Model
For vision requirements or image-rich document processing, select Search, Chat & Analysis - Small or Chat & Document Analysis - Large. For agentic workflows requiring function calling or complex reasoning, choose Reasoning & Agent tasks - Large or Chat, Document Analysis & Agent tasks - Xtra Large.
Reasoning & Agent tasks - Large
Description
Engineered for sophisticated analytical workflows and autonomous agent applications. This model excels at complex problem-solving, logical inference, and structured tool use, making it the preferred choice for developers building intelligent automation systems that require step-by-step reasoning capabilities.
Specifications
- Context window: 32,768 tokens
- Max output tokens: Not specified
- Vision support: No
- Reasoning mode: Yes
- Function calling: Yes
- Streaming: Yes
Ideal Use Cases
- Autonomous agent development and deployment
- Complex data analysis and business logic automation
- Multi-step reasoning and decision support systems
- API integration and external tool orchestration
- Workflow automation requiring structured thinking
- Technical problem-solving and debugging assistance
Strengths
- Native reasoning capabilities for complex analysis
- Full function calling support for tool integration
- Developer-friendly architecture for agent building
- Strong analytical performance with streaming
- Optimized for agentic task completion
Limitations
- No vision support for image analysis
- Smaller context window (32K) compared to document-focused models
- May be excessive for simple chat-only applications
Pricing
- Input: ... per million tokens
- Output: ... per million tokens
When to Use
Select this model when building AI agents, automation workflows, or applications requiring structured reasoning and external tool integration. It is specifically designed for scenarios where the model must break down complex problems, reason through multiple steps, and interact with external systems via function calls.
When to Choose a Different Model
For vision-enabled applications or image analysis, use Chat & Document Analysis - Large or Chat, Document Analysis & Agent tasks - Xtra Large. For simpler chat interactions without reasoning requirements, Chat & Document Analysis - Medium offers better cost efficiency and a larger context window.
Search, Chat & Analysis - Small
Description
A compact yet capable multimodal model optimized for creative applications and web-enabled research. Combining vision capabilities with conversational fluency, it serves content creators and researchers who need to analyze visual content alongside text, while maintaining the agility required for interactive storytelling and artistic projects.
Specifications
- Context window: Not specified
- Max output tokens: Not specified
- Vision support: Yes
- Reasoning mode: No
- Function calling: No
- Streaming: Yes
Ideal Use Cases
- Visual content analysis and image description
- Creative storytelling and narrative development
- Web research with visual source analysis
- Artistic project support and inspiration
- Image-based documentation review
- Content creation workflows combining text and visuals
Strengths
- Vision capabilities for image analysis and description
- Optimized for creative and artistic tasks
- Web search integration for current information
- Fast streaming responses for interactive use
- Balanced performance for small-scale applications
Limitations
- No function calling support for tool integration
- No dedicated reasoning mode for complex analysis
- Context window specifications not defined
Pricing
- Input: ... per million tokens
- Output: ... per million tokens
When to Use
Choose this model for creative projects requiring image analysis, storytelling, or when combining visual and textual content generation. It is particularly well-suited for marketing teams, content creators, and researchers who need to analyze visual assets while maintaining conversational fluidity.
When to Choose a Different Model
For complex agent tasks requiring function calling or advanced reasoning, use Reasoning & Agent tasks - Large or Chat, Document Analysis & Agent tasks - Xtra Large. For large-scale document processing with defined context requirements, Chat & Document Analysis - Medium offers a specified 128K context window.
Chat & Document Analysis - Large
Description
A large-scale solution delivering frontier-level performance across complex enterprise tasks. With advanced multilingual capabilities and vision support, it adapts dynamically to query complexity while handling extensive documents and visual content with precision, making it suitable for high-stakes business applications.
Specifications
- Context window: Not specified
- Max output tokens: Not specified
- Vision support: Yes
- Reasoning mode: No
- Function calling: No
- Streaming: Yes
Ideal Use Cases
- Enterprise document analysis with visual elements
- Complex multilingual projects and localization
- High-stakes content review and validation
- Large-scale report generation with charts and graphs
- Image-rich documentation analysis
- Advanced customer support with visual troubleshooting
Strengths
- Vision capabilities for comprehensive document analysis
- Advanced multilingual support for global operations
- Frontier-level performance on complex tasks
- Handles mixed visual and textual content
- Reliable streaming for real-time applications
Limitations
- No function calling capability for external integrations
- No dedicated reasoning mode for step-by-step analysis
- Premium pricing tier compared to medium alternatives
Pricing
- Input: ... per million tokens
- Output: ... per million tokens
When to Use
Deploy this model for complex document analysis requiring vision capabilities, especially when handling image-rich documents, scanned materials, or requiring top-tier multilingual performance. It bridges the gap between pure text models and full agent systems, offering visual understanding without the complexity of function calling.
When to Choose a Different Model
For agent-based workflows requiring function calling or reasoning, select Chat, Document Analysis & Agent tasks - Xtra Large. For cost-effective high-volume text processing without vision requirements, Chat & Document Analysis - Medium offers superior value with a defined 128K context window.
Chat, Document Analysis & Agent tasks - Xtra Large
Description
Our most capable model, designed for the most demanding enterprise applications. This very large-scale system combines vision, reasoning, and agentic capabilities with advanced multilingual support, enabling sophisticated automation across complex document workflows, analytical tasks, and integrated search operations.
Specifications
- Context window: Not specified
- Max output tokens: Not specified
- Vision support: Yes
- Reasoning mode: Yes
- Function calling: Yes
- Streaming: Yes
Ideal Use Cases
- End-to-end enterprise automation pipelines
- Complex agent orchestration with visual inputs
- Vision-enabled document analysis and extraction
- Advanced reasoning workflows with tool use
- Multilingual agent deployment at scale
- Integrated search, analysis, and action systems
Strengths
- Comprehensive capability set combining vision, reasoning, and function calling
- State-of-the-art performance across all modalities
- Advanced multilingual support for global deployment
- Full agentic functionality for autonomous operations
- Enterprise-grade reliability and streaming performance
Limitations
- Highest cost per token in the portfolio
- May be excessive for simple or single-purpose tasks
- Resource intensive for basic applications
Pricing
- Input: ... per million tokens
- Output: ... per million tokens
When to Use
Select this model for mission-critical applications requiring the full spectrum of AI capabilities—vision, reasoning, and tool use—in a single integrated solution. It is specifically engineered for enterprises building sophisticated automation systems that must process visual documents, reason through complex scenarios, and execute actions via function calls.
When to Choose a Different Model
For cost-sensitive applications not requiring all capabilities, consider Chat & Document Analysis - Large for vision without function calling, or Reasoning & Agent tasks - Large for reasoning and function calling without vision. For pure text processing, Chat & Document Analysis - Medium provides excellent value.
Model capabilities, context windows, and pricing are subject to change as we continuously improve our Swiss-hosted infrastructure. For the latest specifications and availability, refer to the model comparison page or check your usage dashboard.