Model Variants
Some AI inference providers offer special variants of models. These models can have different features such as a larger context size. They may incur different costs associated with requests as well.
When AI Gateway makes these models available they will be highlighted on the model detail page with a Model Variants section in the relevant provider card providing an overview of the feature set and linking to more detail.
Model variants sometimes rely on preview or beta features offered by the inference provider. Their ongoing availability can therefore be less predictable than that of a stable model feature. Check the provider's site for the latest information.
AI Gateway automatically enables the 1M token context window for Claude Sonnet 4 and 4.5 models. No configuration is required.
- Learn more: Announcement, Context windows docs
- Pricing: Requests that exceed 200K tokens are charged at premium rates. See pricing details.
Was this helpful?