Locator: 49209AI.
Chatbots are tracked here.
The new study – developed by researchers from New York University Stern School of Business and Goodfin, an AI-powered wealth-management platform – evaluated 23 large language models on their ability to answer multiple choice and essay questions on mock CFA Level III exams. They found frontier reasoning models, including o4-mini, Gemini 2.5 Pro, and Claude Opus, were able to use “chain-of-thought prompting” to successfully pass.
These are the three platforms that were tested:
- Google's Gemini 2.5 Pro
- Anthropic's Claude Opus
- OpenAI's o4-mini.
So, what's "o4-mini"?
O4-mini is not the full version of ChatGPT, but rather a smaller, cost-efficient model developed by OpenAI that is now integrated into ChatGPT and the API to offer fast and capable performance, especially for coding and visual tasks. ChatGPT users can select and use O4-mini, and it was released to all users, including free-tier users, in April 2025.
Chatbots by market share:
