Transparency is fundamental. This page documents how SWEN collects, processes and presents AI model benchmark data. Our sources are public, our process is automated and our data is updated daily.
By Luis Fernando Roquette • Last revised: June 01, 2026
SWEN has no commercial relationship with any AI provider. We receive no payment to position models. Rankings reflect exclusively the data from the sources listed below.
All data sources are public and linked. Our sync code is documented. Anyone can verify the data against the original sources.
Intelligence Index and benchmarks are automatically synced every 6 hours via Artificial Analysis. New models are imported in the same window. Pricing and technical specs are enriched weekly.
SWEN aggregates data from 4 specialized sources, each contributing different evaluation dimensions:
URL: lmarena.ai
What we collect: ELO score per model, ranking, vote count
Frequency: Daily
Source methodology: LMArena (formerly LMSYS Chatbot Arena) operates a human voting platform where users compare anonymous responses from two models and pick the better one. The ELO system, analogous to chess rankings, computes a relative rating based on millions of cumulative votes. It is widely considered the most reliable industry benchmark because it reflects real human preference, not synthetic metrics.
URL: artificialanalysis.ai
What we collect: Intelligence Index (composite score 0-100), Coding Index, Math Index, MMLU Pro, GPQA Diamond, MATH-500, AIME 2025, LiveCodeBench, SWE Bench Verified, speed (tokens/s), latency (TTFT)
Frequency: Daily via API v2
Source methodology: The Intelligence Index combines 10 different evaluations into a composite score. Artificial Analysis runs each model against standardized evaluation datasets and measures both quality (accuracy) and performance (speed, latency). Speed and latency data are measured on proprietary infrastructure under controlled conditions.
URL: livebench.ai
What we collect: Global Average, Reasoning, Coding, Math, Data Analysis, Language scores (0-100)
Frequency: Daily
Source methodology: LiveBench is a self-updating benchmark that generates new questions periodically, reducing the risk of contamination (when models memorize answers from the training dataset). Questions are categorized across 6 dimensions and automatically evaluated against verified answer keys.
URL: openrouter.ai
What we collect: Price per million tokens (input/output), context window, max output tokens, supported modalities (text, image, audio, video), tool calling support, reasoning capability, model description
Frequency: Weekly via public API (no authentication)
Source methodology: OpenRouter is an AI API aggregator offering unified access to 300+ models. Pricing data reflects values from the original providers (OpenAI, Anthropic, Google, etc.) with OpenRouter markup. Prices shown on SWEN are values reported by OpenRouter, not direct provider prices.
If you find incorrect or outdated data, or have suggestions to improve our methodology, please contact us:
Verified corrections are applied within 24 hours. We especially appreciate contributions from researchers, developers and AI professionals.
SWEN aggregates data from specialized sources (LMArena, LiveBench, Artificial Analysis). The Intelligence Index and benchmarks are synced automatically every 6 hours via the Artificial Analysis API.
The Intelligence Index and benchmarks are automatically synced every 6 hours via the Artificial Analysis API. New models are imported in the same window. Pricing and technical specs (context window, vision support) are enriched weekly via sync-model-metadata.
Yes. The data is aggregated from public sources and properly attributed. For commercial use or API integration, please contact us. We plan to offer a public API soon.
If you find outdated or incorrect data, send an email to contato@swen.ia.br with the model name and suggested correction. We verify and update within 24 hours.