OpenAI releases its new o3-mini reasoning model free

On Thursday, Microsoft announced that it’s rolling OpenAI’s reasoning model o1 out to its Copilot users, and now OpenAI is releasing a new reasoning model, o3-mini, to people who use the free version of ChatGPT. This will mark the first time that the vast majority of people will have access to one of OpenAI’s reasoning models, which were formerly restricted to its paid Pro and Plus bundles.

Reasoning models use a “chain of thought” technique to generate responses, essentially working through a problem presented to the model step by step. Using this method, the model can find mistakes in its process and correct them before giving an answer. This typically results in more thorough and accurate responses, but it also causes the models to pause before answering, sometimes leading to lengthy wait times. OpenAI claims that o3-mini responds 24% faster than o1-mini.

These types of models are most effective at solving complex problems, so if you have any PhD-level math problems you’re cracking away at, you can try them out. Alternatively, if you’ve had issues with getting previous models to respond properly to your most advanced prompts, you may want to try out this new reasoning model on them. To try out o3-mini, simply select “Reason” when you start a new prompt on ChatGPT.

Although reasoning models possess new capabilities, they come at a cost. OpenAI’s o1-mini is 20 times more expensive to run than its equivalent non-reasoning model, GPT-4o mini. The company says its new model, o3-mini, costs 63% less than o1-mini per input token. However, at $1.10 per million input tokens, it is still about seven times more expensive to run than GPT-4o mini.
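The pricing figures in the paragraph above can be checked against each other with a little arithmetic. The sketch below uses only the numbers quoted there ($1.10 per million input tokens, 63% less than o1-mini, and o1-mini at 20 times GPT-4o mini) to back out the implied prices. The derived values are rough implications of rounded figures, not official list prices, and the function name is purely illustrative.

```python
# Figures quoted in the article (input tokens only).
O3_MINI_PRICE = 1.10        # USD per million input tokens
O3_MINI_DISCOUNT = 0.63     # o3-mini costs 63% less than o1-mini
O1_MINI_VS_4O_MINI = 20     # o1-mini is 20x the cost of GPT-4o mini


def implied_prices():
    """Back out the prices implied by the article's rounded figures."""
    # If o3-mini is 63% cheaper, it costs 37% of the o1-mini price.
    o1_mini = O3_MINI_PRICE / (1 - O3_MINI_DISCOUNT)
    # o1-mini is stated to be 20x GPT-4o mini.
    gpt_4o_mini = o1_mini / O1_MINI_VS_4O_MINI
    # How many times more expensive o3-mini is than GPT-4o mini.
    ratio = O3_MINI_PRICE / gpt_4o_mini
    return o1_mini, gpt_4o_mini, ratio


if __name__ == "__main__":
    o1_mini, gpt_4o_mini, ratio = implied_prices()
    print(f"Implied o1-mini price:     ${o1_mini:.2f} per million input tokens")
    print(f"Implied GPT-4o mini price: ${gpt_4o_mini:.2f} per million input tokens")
    print(f"o3-mini vs GPT-4o mini:    {ratio:.1f}x")
```

The ratio works out to roughly 7.4, which is consistent with the "about seven times" figure.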

This new model is coming right after the DeepSeek release that shook the AI world less than two weeks ago. DeepSeek’s new model performs just as well as top OpenAI models, but the Chinese company claims it cost roughly $6 million to train, as opposed to the estimated cost of over $100 million for training OpenAI’s GPT-4. (It’s worth noting that a lot of people are interrogating this claim.) 

Additionally, DeepSeek’s reasoning model costs $0.55 per million input tokens, half the price of o3-mini, so OpenAI still has a way to go to bring down its costs. It’s estimated that reasoning models also have much higher energy costs than other types, given the larger number of computations they require to produce an answer.
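To make the price gap concrete, here is a minimal sketch that compares input-token costs at the two quoted rates. The 50-million-token workload is a hypothetical figure chosen for illustration, and the calculation ignores output-token pricing, which the article does not give.

```python
# Input-token prices quoted in the article, in USD per million tokens.
PRICES_PER_MILLION = {
    "o3-mini": 1.10,
    "DeepSeek reasoning model": 0.55,
}


def input_cost_usd(tokens, price_per_million):
    """Cost in USD of sending `tokens` input tokens at the given rate."""
    return tokens / 1_000_000 * price_per_million


if __name__ == "__main__":
    workload_tokens = 50_000_000  # hypothetical workload, for illustration only
    for name, price in PRICES_PER_MILLION.items():
        cost = input_cost_usd(workload_tokens, price)
        print(f"{name}: ${cost:.2f}")
```

At these rates the hypothetical workload costs twice as much on o3-mini as on DeepSeek’s model, matching the "half the price" comparison.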

This new wave of reasoning models presents new safety challenges as well. OpenAI used a technique called deliberative alignment to train its o-series models, basically having them reference OpenAI’s internal policies at each step of their reasoning to make sure they weren’t ignoring any rules.

But the company has found that o3-mini, like the o1 model, is significantly better than non-reasoning models at jailbreaking and “challenging safety evaluations”—essentially, it’s much harder to control a reasoning model given its advanced capabilities. o3-mini is the first model to score as “medium risk” on model autonomy, a rating given because it’s better than previous models at specific coding tasks—indicating “greater potential for self-improvement and AI research acceleration,” according to OpenAI. That said, the model is still bad at real-world research. If it were better at that, it would be rated as high risk, and OpenAI would restrict the model’s release.



from MIT Technology Review https://ift.tt/zwC798p
via IFTTT
