AI model for demanding tasks

Software masters complex chains of reasoning

New software from OpenAI masters complex chains of reasoning.

Published 9/26/24

Author SDA

Other languages German

ChatGPT developer OpenAI presented a new AI model in September that can solve more complex tasks than previous chatbots. The software, called o1, spends more time “thinking” before giving an answer. The artificial intelligence tries out different approaches and recognizes and corrects its own mistakes, explains OpenAI in a blog post.

This is having an effect on mathematics and software programming. For example, the o1 model solved 83 percent of the tasks in the test for the International Mathematical Olympiad. The current ChatGPT-4o only achieved 13 percent. At the same time, the new model still lacks many of ChatGPT's useful functions. For example, it cannot search for information on the web and does not support the uploading of files.

o1 also invents answers

The documents also show that the new model knowingly gave the wrong answer in 0.38 percent of cases in a test selection of 100,000 queries. This mainly happened when OpenAI o1 was asked to refer to articles, websites or books.

However, in many cases this was not possible without access to the Internet search. So the software itself invented plausible-looking examples. However, the software only ever wanted to fulfill the wishes of the users. The so-called “hallucinations”, in which AI software simply invents information, are generally an unsolved problem.