The ChatGPT Images 2.0 model is a major step forward in image generation capabilities, including significantly enhanced world knowledge, instruction following, and generating detail and complexity such as dense text. The new thinking mode capability introduced along with the model adds reasoning and tool use to the image generation process, allowing the system to integrate live web search data, generate multiple images from a single prompt, and use our reasoning stack to turn a basic prompt into a well-researched and thought-through final image.
The core safety stack we are using with ChatGPT Images 2.0 and thinking mode is based on the same foundations as our ChatGPT Images 1.5 safety stack, with additional safeguards to address new risks that emerge as models become more capable.