AI Gateway
OpenAI's most capable embedding model for both English and non-English tasks.
import { embed } from 'ai';

const result = await embed({
  model: 'openai/text-embedding-3-large',
  value: 'Sunny day at the beach',
});
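A common next step after embedding text is to compare two embeddings by cosine similarity. The sketch below is illustrative only: it assumes the result of `embed` exposes the vector as a plain `number[]` (for example `result.embedding`), and it uses toy three-dimensional vectors in place of real model output. The function name `cosine` and the sample vectors are hypothetical, not part of the page above.

```typescript
// Hypothetical helper: cosine similarity between two equal-length vectors.
// Returns a value in [-1, 1]; returns 0 if either vector has zero length.
function cosine(a: number[], b: number[]): number {
  if (a.length === 0 || a.length !== b.length) {
    throw new Error('Vectors must be non-empty and of equal length');
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) {
    return 0;
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Toy vectors standing in for real embeddings (assumption: real ones
// would come from calls like the embed() example above).
const beach = [0.9, 0.1, 0.0];
const coast = [0.8, 0.2, 0.1];
const invoice = [0.0, 0.1, 0.9];

// Related texts should score higher than unrelated ones.
console.log(cosine(beach, coast) > cosine(beach, invoice));
```

In practice the inputs would be the vectors returned for two different `value` strings, and a higher score would suggest closer meaning.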
Azure
Input Tokens $0.13/M
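For rough budgeting, the listed rate of $0.13 per million input tokens can be turned into a simple estimate. This is a minimal sketch: the helper name `estimateInputCostUsd` and the token count in the example are hypothetical, and actual billing may differ from this arithmetic.

```typescript
// Rate taken from the listing above: $0.13 per million input tokens.
const USD_PER_MILLION_INPUT_TOKENS = 0.13;

// Hypothetical helper: estimated input cost in USD for a given token count.
function estimateInputCostUsd(inputTokens: number): number {
  if (!Number.isFinite(inputTokens) || inputTokens < 0) {
    throw new RangeError('inputTokens must be a non-negative finite number');
  }
  return (inputTokens / 1_000_000) * USD_PER_MILLION_INPUT_TOKENS;
}

// Example with an assumed workload of 2.5 million input tokens.
const estimatedCost = estimateInputCostUsd(2_500_000);
console.log(`Estimated cost: $${estimatedCost.toFixed(4)}`);
```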
OpenAI
GPT-3.5 Turbo
OpenAI's most capable and cost-effective model in the GPT-3.5 family, optimized for chat but also well suited to traditional completion tasks.
GPT-3.5 Turbo Instruct
Capabilities similar to GPT-3-era models. Compatible with the legacy Completions endpoint, not Chat Completions.
GPT-4 Turbo
gpt-4-turbo from OpenAI has broad general knowledge and domain expertise allowing it to follow complex instructions in natural language and solve difficult problems accurately. It has a knowledge cutoff of April 2023 and a 128,000 token context window.
GPT-4.1
GPT-4.1 is OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.
GPT-4.1 mini
GPT-4.1 mini provides a balance between intelligence, speed, and cost that makes it an attractive model for many use cases.
GPT-4.1 nano
GPT-4.1 nano is the fastest, most cost-effective GPT-4.1 model.
GPT-4o
GPT-4o from OpenAI has broad general knowledge and domain expertise allowing it to follow complex instructions in natural language and solve difficult problems accurately. It matches GPT-4 Turbo performance with a faster and cheaper API.
GPT-4o mini
GPT-4o mini from OpenAI is their most advanced and cost-efficient small model. It is multi-modal (accepting text or image inputs and outputting text) and has higher intelligence than gpt-3.5-turbo but is just as fast.
GPT-5
GPT-5 is OpenAI's flagship language model. It excels at complex reasoning, broad real-world knowledge, and code-intensive and multi-step agentic tasks.
GPT-5-Codex
GPT-5-Codex is a version of GPT-5 optimized for agentic coding tasks in Codex or similar environments.
GPT-5 mini
GPT-5 mini is a cost-optimized model that excels at reasoning and chat tasks. It offers an optimal balance between speed, cost, and capability.
GPT-5 nano
GPT-5 nano is a high-throughput model that excels at simple instruction or classification tasks.
GPT-5 pro
GPT-5 pro uses more compute to think harder and provide consistently better answers. Since GPT-5 pro is designed to tackle tough problems, some requests may take several minutes to finish.
gpt-oss-120b
Extremely capable general-purpose LLM with strong, controllable reasoning capabilities.
gpt-oss-20b
A compact, open-weight language model optimized for low-latency and resource-constrained environments, including local and edge deployments.
o1
o1 is OpenAI's flagship reasoning model, designed for complex problems that require deep thinking. It provides strong reasoning capabilities with improved accuracy for complex multi-step tasks.
o3
OpenAI's o3 is their most powerful reasoning model, setting new state-of-the-art benchmarks in coding, math, science, and visual perception. It excels at complex queries requiring multi-faceted analysis, with particular strength in analyzing images, charts, and graphics.
o3-mini
o3-mini is OpenAI's most recent small reasoning model, providing high intelligence at the same cost and latency targets as o1-mini.
o4-mini
OpenAI's o4-mini delivers fast, cost-efficient reasoning with exceptional performance for its size, particularly excelling in math (best-performing on AIME benchmarks), coding, and visual tasks.
text-embedding-3-small
OpenAI's improved, more performant version of their ada embedding model.
text-embedding-ada-002
OpenAI's legacy text embedding model.