The Million Dollar Way (The Bakken Oil Blog): Testing AI -- CFA -- September 24, 2025

Wednesday, September 24, 2025

Testing AI -- CFA -- September 24, 2025

Locator: 49209AI.

Chatbots are tracked here.

The new study – developed by researchers from New York University Stern School of Business and Goodfin, an AI-powered wealth-management platform – evaluated 23 large language models on their ability to answer multiple choice and essay questions on mock CFA Level III exams. They found frontier reasoning models, including o4-mini, Gemini 2.5 Pro, and Claude Opus, were able to use “chain-of-thought prompting” to successfully pass.

These are the three platforms that were tested:

Google's Gemini 2.5 Pro
Anthropic's Claude Opus
OpenAI's o4-mini.

So, what's "o4-mini"?

O4-mini is not the full version of ChatGPT, but rather a smaller, cost-efficient model developed by OpenAI that is now integrated into ChatGPT and the API to offer fast and capable performance, especially for coding and visual tasks. ChatGPT users can select and use O4-mini, and it was released to all users, including free-tier users, in April 2025.

Chatbots by market share:

The Million Dollar Way (The Bakken Oil Blog)

Pages

Wednesday, September 24, 2025

Testing AI -- CFA -- September 24, 2025

Williston, ND

Total Pageviews