What GPT-5.3 Codex can do and the modalities it supports.
Performance scores for GPT-5.3 Codex across standard benchmarks.
| Benchmark | Score | Date |
|---|---|---|
| SWE-Bench Pro | 56.8 | February 5, 2026 |
| Terminal-Bench 2.0 | 77.3 | February 5, 2026 |
| OSWorld-Verified | 64.7 | February 5, 2026 |
| MMLU | 93 | February 5, 2026 |
| MMLU Pro | 83 | February 5, 2026 |
| GPQA | 81 | February 5, 2026 |
| IFEval | 94 | February 5, 2026 |
| SimpleQA | 58 | February 5, 2026 |
| HLE | 36 | February 5, 2026 |
Token pricing for GPT-5.3 Codex API usage.
$1.75/M
per million tokens
$14.00/M
per million tokens
Available only via Responses API. Supports prompt caching. Batch API pricing available at 50% discount. Configurable reasoning effort: low, medium, high, xhigh.
Alternative models that compete with GPT-5.3 Codex.
| Model | Family | Context Window | Price (In / Out per M) | Status |
|---|---|---|---|---|
| Claude Sonnet 4.6 | Claude | 200K | $3.00/M / $15.00/M | Current |
| Gemini 2.5 Pro | Gemini | 1.0M | $1.25/M / $10.00/M | Current |