{"id":2843,"date":"2024-07-19T10:40:17","date_gmt":"2024-07-19T10:40:17","guid":{"rendered":"https:\/\/howtogeek.blog\/pt\/?p=2843"},"modified":"2024-07-19T10:40:17","modified_gmt":"2024-07-19T10:40:17","slug":"openai-unveils-gpt-4o-mini-its-cheapest-small-ai-model-yet-pt","status":"publish","type":"post","link":"https:\/\/howtogeek.blog\/pt\/openai-unveils-gpt-4o-mini-its-cheapest-small-ai-model-yet-pt\/","title":{"rendered":"OpenAI unveils GPT-4o Mini, it\u2019s cheapest small AI model yet"},"content":{"rendered":"<p>Another AI model has joined the sea of AI models and this one\u2019s from OpenAI yet again. OpenAI previously released GPT-4o this year which was supposed to be more efficient than GPT-4. But it was still quite expensive and could rack up quite a bill, especially for developers who need to call the AI model through the API for their apps repeatedly throughout the day.<\/p>\n<p>As a result, developers were turning to cheaper small AI models from competitors, like Gemini 1.5 Flash or Claude 3 Haiku. <\/p>\n<p>Now, OpenAI is releasing GPT-4o mini, their most cost-efficient model yet, with which they\u2019re also entering the small AI model space. While the GPT-4o mini is their cheapest model yet, it is not achieving that low cost by cutting back on intelligence; it is smarter than their existing GPT-3.5 Turbo model.<\/p>\n<p>According to OpenAI, GPT-4o mini scored 82% in MMLU (Measuring Massive Multitask Language Understanding), outperforming many models; these are the respective scores of different models: GPT-3.5 Turbo (70%), Claude 3 Haiku (75.2%) and Gemini 1.5 Flash (78.9%). GPT-4o scored 88.7% on this benchmark, with Gemini Ultra boasting the highest score \u2013 90% (these are not small AI models, though). <\/p>\n<figure class=\"wp-block-image\"><img alt=\"\" class=\"wp-image\" decoding=\"async\" height=\"647\" loading=\"lazy\" src=\"https:\/\/cdn.howtogeek.blog\/wp-content\/uploads\/2024\/07\/image-37-1.webp\" title=\"\" width=\"1150\"\/><figcaption>Source: OpenAI<\/figcaption><\/figure>\n<p>GPT-4o mini is being rolled out to ChatGPT Free, Team and Plus users as well as developers today. For ChatGPT users, it has essentially replaced GPT-3.5; GPT-4o mini will be the model the conversation defaults to once you run out of free GPT-4o queries. Developers will still have the option to use GPT-3.5 through the API, but it will eventually be dropped. ChatGPT Enterprise users will get access to GPT-4o mini next week.<\/p>\n<p>As mentioned above, the focus of GPT-4o mini is to help developers find a low cost and latency model for their app that\u2019s also capable. Compared to other small models, GPT-4o mini excels at reasoning tasks in both text and vision, mathematical reasoning and coding tasks, as well as multimodal reasoning. <\/p>\n<p>It currently supports both text and vision in the API, with support for text, image, video, and audio inputs and outputs on the roadmap for the future. <\/p>\n<p>GPT-4o mini has a context window of 128K tokens for input and 16K tokens for output per request, with its knowledge going up to October 2023. It can also handle non-English text rather cost-effectively. <\/p>\n","protected":false},"excerpt":{"rendered":"<p>Another AI model has joined the sea of AI models and this one\u2019s from OpenAI yet again. OpenAI previously released GPT-4o this year which was supposed to be more efficient than GPT-4. But it was still quite expensive and could rack up quite a bill, especially for developers who need to call the AI model [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[182,183,130],"class_list":["post-2843","post","type-post","status-publish","format-standard","hentry","category-how-to","tag-artificial-intelligence","tag-chatgpt","tag-microsoft"],"acf":[],"_links":{"self":[{"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/posts\/2843","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/comments?post=2843"}],"version-history":[{"count":1,"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/posts\/2843\/revisions"}],"predecessor-version":[{"id":2844,"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/posts\/2843\/revisions\/2844"}],"wp:attachment":[{"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/media?parent=2843"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/categories?post=2843"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/howtogeek.blog\/pt\/wp-json\/wp\/v2\/tags?post=2843"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}