Overall score: 90
Grok 4.1
xAI's flagship model with strong real-time knowledge
Rapid improvement in reasoning benchmarks
Metrics
ELO: 1,350
Provider: xAI
Price (input): $2.00
Price (output): $10.00
Context window: 256,000 tokens
Score Breakdown
code: 90
math: 89
reasoning: 90
intelligence: 91
Scoring Methodology
intelligence (30% weight)
Overall reasoning and task completion ability
Source: LMArena ELO, Artificial Analysis Intelligence Index, HuggingFace MMLU-Pro
math (20% weight)
Mathematical reasoning and problem solving
Source: MATH benchmark, GSM8K, HuggingFace MATH-Lvl5
code (20% weight)
Code generation, understanding, and debugging
Source: HumanEval, MBPP, SWE-bench
reasoning (15% weight)
Multi-step logical reasoning
Source: ARC-Challenge, BBH, MMLU-Pro, HuggingFace BBH
instruction_following (15% weight)
Ability to follow complex instructions accurately
Source: HuggingFace IFEval
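
The category weights above can be combined into a single number as a weighted mean. The following is a minimal sketch, not the site's confirmed formula. It assumes the overall score is a plain weighted average of the category scores, and that a category with no published score (instruction_following does not appear in the Score Breakdown) is dropped and the remaining weights are renormalized. The names `WEIGHTS`, `SCORES`, and `weighted_score` are hypothetical.

```python
# Weights as listed under Scoring Methodology.
WEIGHTS = {
    "intelligence": 0.30,
    "math": 0.20,
    "code": 0.20,
    "reasoning": 0.15,
    "instruction_following": 0.15,
}

# Category scores as listed under Score Breakdown.
# instruction_following is not listed there, so it is left out here.
SCORES = {"code": 90, "math": 89, "reasoning": 90, "intelligence": 91}


def weighted_score(scores, weights=WEIGHTS):
    """Weighted mean over the categories present in `scores`.

    Assumption (not stated on the page): weights of missing categories
    are dropped and the remaining weights are renormalized to sum to 1.
    """
    used = {name: w for name, w in weights.items() if name in scores}
    total_weight = sum(used.values())
    if total_weight == 0:
        raise ValueError("no scored categories to combine")
    return sum(scores[name] * w for name, w in used.items()) / total_weight


overall = weighted_score(SCORES)
print(f"{overall:.1f}")
```

With the four published category scores, this renormalized mean comes out at roughly 90.1. A different treatment of the missing category, such as counting it as zero, would give a different number, so the result depends on the assumption stated above.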
Data Sources
Last updated: December 24, 2025