ChatGPT Images 2.0 can reason, search the web and generate up to eight coherently related images

OpenAI has presented its new image generation model ChatGPT Images 2.0which comes with new processing capabilities, greater precision and improvements when generating text, in addition to being able to search for content on the Internet and verify its results thanks to its reasoning, generating up to eight related images at a time.

The company has unveiled the successor to ChatGPT Images and has referred to this model as a “radical change” in terms of following detailed instructionsthe placement of elements and the precise relationship between objects.

Specifically, OpenAI has launched ChatGPT Images 2.0 as a “state-of-the-art model” capable of performing complex visual tasks and producing “accurate and ready-to-use” images, as it shared in a statement on its blog.

This is because it not only allows more sophisticated images to be conceptualized, but also generates them more efficiently, following the instructions from users with more fidelity, retaining requested details and rendering the more subtle elements that often cause failures, such as small texts, icons, user interfaces on a computer or dense compositions with a lot of detail, with a resolution up to 2K.

Likewise, it has highlighted notable improvements in the ability to generate content in different formats and, above all, when it comes to represent dense text. Additionally, it is now more precise creating images in any languagenot only in English, and uses “visual and world knowledge” to fill in the missing information.

Following this line, he has also detailed that his multilingual understanding has also improved beyond languages ​​with the Latin alphabet, so it generates better results in languages ​​like Japanese, Korean, Chinese, Hindi and Bengali.

REASONING AND WEB SEARCH CAPABILITY

Another aspect to highlight is that, for the first time for an image generation model, OpenAI has introduced reasoning ability. As a result, ChatGPT Images 2.0 can look for real information on the webuse this information to create different images from a single indication and, finally, Check your results to see if they are correct.

As the technology company has explained, this capacity allows the model to simplify the process between the idea and the image by acting as a visual assistant, “especially when accuracy, up-to-date information, consistency and visual cohesion are critical.”

That is, based on the content shared by users and that found on the web, the model identifies which data is important, structures it and transforms this information into graphic materials autonomously meaningful.

This feature is useful, for example, when generating educational graphic content or visual summariessince the model can synthesize information by itself, write a story, and present it with a clear structure and strong visual flow.

Thus, users can request a set of images consistent with each other, getting up to eight results at once. For example, for the creation of a comic with character continuity, an infographic or precise maps.

MORE REALISTIC IMAGES

OpenAI has also highlighted that Images 2.0 has also Improved realistic image generation. For example, when recreating a photograph, include the small imperfections common in these images to provide more realism.

Likewise, in other still images such as pixel art or manga, it has greater consistency in texture, lighting or composition. It also offers greater format flexibility with aspect ratios up to 3:1 and 1:3, for poster content, mobile or computer screens.

As a result, users will get improved, sharper images of ‘collages’, mangas written in japanese consistently, video game prototypes or photographs with a concrete realistic style. They can also generate advertising material and storyboards.

With all this, ChatGPT Images 2.0 is now available for all ChatGPT and Codex users, although advanced analysis functions are only available to users subscribed to the ChatGPT Plus, Pro and Business versions. Likewise, the new model is also available in the API.

By Editor

One thought on “ChatGPT Images 2.0 can reason, search the web and generate up to eight coherently related images”

Leave a Reply