GPT-5.1
Models
OpenAI's latest frontier model with improved reasoning
Solid performer with strong multimodal capabilities
Metrics
elo: 1,380
provider: OpenAI
price input: $5.00
price output: $15.00
context window: 128,000
Score Breakdown
code: 91
math: 90
reasoning: 91
intelligence: 92
Scoring Methodology
intelligence: 30% weight
Overall reasoning and task completion ability
Source: LMArena ELO, Artificial Analysis Intelligence Index, HuggingFace MMLU-Pro
math: 20% weight
Mathematical reasoning and problem solving
Source: MATH benchmark, GSM8K, HuggingFace MATH-Lvl5
code: 20% weight
Code generation, understanding, and debugging
Source: HumanEval, MBPP, SWE-bench
reasoning: 15% weight
Multi-step logical reasoning
Source: ARC-Challenge, BBH, MMLU-Pro, HuggingFace BBH
instruction_following: 15% weight
Ability to follow complex instructions accurately
Source: HuggingFace IFEval
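The page lists category weights but does not state how they are combined into a single number. As a minimal sketch, assuming a plain weighted average, the weights above could be applied as follows. The function name `weighted_score` is hypothetical, and renormalizing over the categories that actually have a score is an assumption made here because the Score Breakdown shows no instruction_following value.

```python
# Category weights as listed under Scoring Methodology.
WEIGHTS = {
    "intelligence": 0.30,
    "math": 0.20,
    "code": 0.20,
    "reasoning": 0.15,
    "instruction_following": 0.15,
}


def weighted_score(scores, weights=WEIGHTS):
    """Weighted average of category scores.

    Assumption: categories without a score are skipped and the remaining
    weights are renormalized so they sum to 1. The page does not say how
    missing categories are handled.
    """
    used = {name: w for name, w in weights.items() if name in scores}
    if not used:
        raise ValueError("no scored categories match the weights")
    total_weight = sum(used.values())
    return sum(scores[name] * w for name, w in used.items()) / total_weight


# Scores from the Score Breakdown section (instruction_following is absent).
gpt_5_1 = {"code": 91, "math": 90, "reasoning": 91, "intelligence": 92}
print(round(weighted_score(gpt_5_1), 2))  # about 91.12
```

With the four listed scores the renormalized average comes to roughly 91.12. This is an illustration of the arithmetic only, not a figure published on the page.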
Related Signals
Gemini 3 Pro Takes LMArena Lead
Models · 1d ago
Gemini 3 Pro has reached 1490 ELO on LMArena, surpassing Claude Opus 4.5 and GPT-5.1 to claim the top position in human preference rankings.
Data Sources
Last updated: December 24, 2025