Grok 4.1

Models

xAI's flagship model with strong real-time knowledge

Rapid improvement in reasoning benchmarks

Metrics

elo1,350

providerxAI

price input$2.00

price output$10.00

context window256,000

code90

math89

reasoning90

intelligence91

intelligence30% weight

Overall reasoning and task completion ability

Source: LMArena ELO, Artificial Analysis Intelligence Index, HuggingFace MMLU-PRO

math20% weight

Mathematical reasoning and problem solving

Source: MATH benchmark, GSM8K, HuggingFace MATH-Lvl5

code20% weight

Code generation, understanding, and debugging

Source: HumanEval, MBPP, SWE-bench

reasoning15% weight

Multi-step logical reasoning

Source: ARC-Challenge, BBH, MMLU-Pro, HuggingFace BBH

instruction_following15% weight

Ability to follow complex instructions accurately

Source: HuggingFace IFEval

Last updated: December 24, 2025