Question 1

How was this comparison made?

Accepted Answer

The SWEN editorial team evaluated each participant across 4 weighted criteria, including Raciocínio Avançado, Ciência e Matemática, Programação. Scores range from 0 to 10 per criterion, multiplied by each criterion's weight to produce the total score.

Question 2

Who won?

Accepted Answer

Claude Opus 4.7 achieved the highest total score of 90/100.

Question 3

Can results change?

Accepted Answer

Yes. Comparisons are updated when new versions of models/tools are released or when relevant data changes. The last update date is shown above.

Criterion	Weight	Claude Opus 4.7	o3
Raciocínio Avançado	x35	96.0	89.0
Ciência e Matemática	x25	89.0	84.0
Programação	x20	95.0	83.0
Custo-Benefício	x20	70.0	88.0

o3 vs Claude Opus 4.7: Batalha de Raciocínio em 2026

Results

Claude Opus 4.7

o3

Evaluation Criteria

Conclusion

Recommendation

FAQ

How was this comparison made?

Who won?

Can results change?