The Data Intelligence Index is a comprehensive evaluation of frontier AI models on data-centric intelligence. As models and Agents become more powerful, we need a systematic way to measure their performance across diverse data challenges, from querying databases to debugging SQL in production.
We assess 6 models across 6 aspects: DB Querying, BI Analysis & Data Manipulation, DB Application Debugging, Human-centric Interaction, Multi-modal Querying, and Data Science Code Translation. This provides a single view of both performance and cost efficiency across data-centric intelligence. For methodology details on this index, see the blog page.
Data Intelligence Index
Evaluating frontier AI on data-centric intelligence across 6 aspects: DB querying, BI analysis & data manipulation, DB application debugging, human-centric interaction, multi-modal querying, and data science code translation.
Model Profiles
Hover legend to highlightScore by Aspect
Representative score (%) per aspect · sorted by indexCost vs Performance
Average cost per task vs Data Intelligence IndexBenchmark Details
Key Findings
Top model averages under 50%. SQL debugging peaks at 40.9%, human-centric interaction at just 28.8%.
Opus 4.6 wins overall, ranking 1st on DB querying, BI analysis, debugging, and code translation.
Kimi 2.5 leads Vision (47.0% vs Opus 43.8%) at a fraction of the cost. Best accuracy-per-dollar.
Human-centric interaction: Even Opus 4.6 achieves just 28.1% and Qwen3 Coder drops to 18.4%.
Opus dominates overall, but Kimi leads multi-modal, and Qwen3 matches Opus on DB querying at 1/17th cost.