
ChatGPT now interprets photos better than an art critic and an investigator combined

ChatGPT’s recent image-generation capabilities have challenged our previous perception of artificial intelligence. The recently announced GPT-4o model shows a noteworthy ability to interpret images with high accuracy and re-create them with viral effects, such as those inspired by Studio Ghibli. It has even mastered text in AI-generated images, which was previously difficult for AI. Now, OpenAI is launching two new models that can dissect images for cues and gather far more information, including details that a human glance might miss.

OpenAI announced two new models earlier this week that raise ChatGPT’s capabilities. The new o3 model, which OpenAI calls its “most powerful reasoning model,” improves on existing interpretation and perception abilities, with gains in “coding, math, science, visual perception, and more,” the company claims. Meanwhile, o4-mini is a smaller and faster model for “cost-efficient reasoning” in the same areas. The news follows OpenAI’s recent launch of the GPT-4.1 class of models, which brings faster processing and deeper context.

ChatGPT is now “thinking with images”

With improvements to their reasoning abilities, both models can now incorporate images into their reasoning process, which makes them capable of “thinking with images,” OpenAI announced. With this change, both models can integrate images into their chain of thought. Going beyond basic image analysis, the o3 and o4-mini models can investigate images more closely and even manipulate them through actions such as cropping, zooming, flipping, or enhancing details to pick up any visual cues that could improve ChatGPT’s ability to provide solutions.
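To make those image manipulations concrete, here is a minimal sketch of cropping, flipping, and enlarging applied to a toy image. It is only an illustration of the operations themselves and makes no claim about how OpenAI's models carry them out internally. The image is assumed to be a small grid of grayscale values, and the function names `crop`, `flip_horizontal`, and `zoom` are hypothetical helpers written for this example.

```python
from typing import List

# Assumption: a toy grayscale image stored as rows of pixel values,
# standing in for a real image format.
Image = List[List[int]]


def crop(img: Image, top: int, left: int, height: int, width: int) -> Image:
    """Keep only the height x width region whose top-left corner is (top, left)."""
    return [row[left:left + width] for row in img[top:top + height]]


def flip_horizontal(img: Image) -> Image:
    """Mirror the image left to right."""
    return [row[::-1] for row in img]


def zoom(img: Image, factor: int) -> Image:
    """Enlarge by an integer factor by repeating each pixel (nearest neighbor)."""
    zoomed: Image = []
    for row in img:
        wide_row = [px for px in row for _ in range(factor)]
        zoomed.extend(list(wide_row) for _ in range(factor))
    return zoomed


image = [
    [0, 1, 2, 3],
    [4, 5, 6, 7],
    [8, 9, 10, 11],
]

region = crop(image, top=1, left=1, height=2, width=2)  # isolate a detail
mirrored = flip_horizontal(region)                       # mirror that detail
enlarged = zoom(region, 2)                               # blow it up 2x
```

In this sketch, cropping first and enlarging second mirrors the idea of isolating a small detail before looking at it more closely.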

With the announcement, the models are said to blend visual and textual reasoning, which can be combined with other ChatGPT features such as web search, data analysis, and code generation. This is expected to become the basis for more advanced AI agents with multimodal analysis.

Among other practical applications, you can expect to submit pictures of many kinds of items, from flow charts and scribbled handwritten notes to images of real-world objects, and expect ChatGPT to understand them more deeply and produce better output, even without a descriptive text prompt. With this, OpenAI is inching closer to Google’s Gemini, which offers an impressive ability to interpret the real world through live video.

Despite the bold claims, OpenAI is limiting access to paid members only, presumably to prevent its GPUs from “melting” again as it struggles to keep pace with the demand for the new reasoning features. As of now, the o3, o4-mini, and o4-mini-high models will be exclusively available to ChatGPT Plus, Pro, and Team members, while Enterprise and Education tier users get access in one week. Meanwhile, Free users will have limited access to o4-mini when they select the “Think” button in the prompt bar.



2025-04-17 07:43:00
