{"id":4188,"date":"2026-04-22T07:36:29","date_gmt":"2026-04-22T07:36:29","guid":{"rendered":"https:\/\/unum.codes\/?p=4188"},"modified":"2026-04-22T07:36:31","modified_gmt":"2026-04-22T07:36:31","slug":"chatgpts-new-images-2-0-model-is-surprisingly-good-at-generating-text","status":"publish","type":"post","link":"https:\/\/unum.codes\/index.php\/2026\/04\/22\/chatgpts-new-images-2-0-model-is-surprisingly-good-at-generating-text\/","title":{"rendered":"ChatGPT\u2019s new Images 2.0 model is surprisingly good at generating text"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\" id=\"speakable-summary\">It used to be easy enough to distinguish between human-made and AI-generated imagery \u2014 just two years ago, you couldn\u2019t use image models to&nbsp;<a href=\"https:\/\/techcrunch.com\/2024\/03\/21\/why-is-ai-so-bad-at-spelling\/\">create a menu for a Mexican restaurant<\/a>&nbsp;without inventing new culinary delights like \u201cenchuita,\u201d \u201cchuriros,\u201d \u201cburrto,\u201d and \u201cmargartas.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Now, when I ask the brand new ChatGPT Images 2.0 model for a menu of Mexican food, it creates something that could immediately be used in a restaurant without customers noticing that something\u2019s off. (However, ceviche priced at $13.50 might make me question the quality of the fish.)<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/04\/6e744049-12b8-49ba-8d2a-66e7326c0169.png?w=453\" alt=\"\" class=\"wp-image-3114706\"\/><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong>ChatGPT Images 2.0<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">For comparison, here\u2019s the result I got from DALL-E 3 two years ago (at the time, ChatGPT did not generate images):<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img decoding=\"async\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/04\/Screenshot-2024-03-19-at-11.05.24-AM.webp?w=652\" alt=\"\" class=\"wp-image-3114711\"\/><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong>Microsoft Designer (DALL-E 3)<\/figcaption><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">AI image generators have&nbsp;<a href=\"https:\/\/techcrunch.com\/2024\/03\/21\/why-is-ai-so-bad-at-spelling\/\">historically struggled to spell<\/a>&nbsp;because they generally used diffusion models, which work by reconstructing images from noise.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cThe diffusion models [\u2026] are reconstructing a given input,\u201d Asmelash Teka Hadgu, founder and CEO of Lesan AI,&nbsp;<a href=\"https:\/\/techcrunch.com\/2024\/03\/21\/why-is-ai-so-bad-at-spelling\/\">told TechCrunch<\/a>&nbsp;in 2024. \u201cWe can assume writings on an image are a very, very tiny part, so the image generator learns the patterns that cover more of these pixels.\u201d<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Researchers have since explored other mechanisms for image generation, like&nbsp;<a href=\"https:\/\/aws.amazon.com\/what-is\/autoregressive-models\/\" target=\"_blank\" rel=\"noreferrer noopener\">autoregressive models<\/a>, which make predictions about what an image should look like and function more like an LLM.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Unfortunately, OpenAI declined to answer a question in a press briefing this week about what kind of model is powering ChatGPT Images 2.0. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The company did, however, explain that the new model has \u201cthinking capabilities,\u201d which give it the ability to search the web, make multiple images from one prompt, and double-check its creations \u2014 this allows Images 2.0 to create marketing assets in various sizes, as well as multi-paneled comic strips.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">OpenAI also says that Images has a stronger understanding of non-Latin text rendering in languages like Japanese, Korean, Hindi, and Bengali. The model\u2019s knowledge cuts off in December 2025, which could impact how accurately it can generate certain prompts involving recent news.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cImages 2.0 brings an unprecedented level of specificity and fidelity to image creation. It can not only conceptualize more sophisticated images, but it actually brings that vision to life e\ufb00ectively, able to follow instructions, preserve requested details, and render the fine-grained elements that often break image models: small text, iconography, UI elements, dense compositions, and subtle stylistic constraints, all at up to 2K resolution,\u201d OpenAI said in a press release.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">These capabilities mean that image generation isn\u2019t as rapid as typing a question to ChatGPT, but generating something complex like a multi-paneled comic still takes just a few minutes.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">All ChatGPT and Codex users will be able to access Images 2.0 starting Tuesday; paid users will be able to generate more advanced outputs. The company will also make the gpt-image-2\u00a0<a href=\"https:\/\/openai.com\/api\/pricing\/\" target=\"_blank\" rel=\"noreferrer noopener\">API available<\/a>, with pricing dependent on the quality and resolution of outputs. <\/p>\n","protected":false},"excerpt":{"rendered":"<p>It used to be easy enough to distinguish between human-made and AI-generated imagery \u2014 just two years ago, you couldn\u2019t use image models to&nbsp;create a menu for a Mexican restaurant&nbsp;without inventing new culinary delights like \u201cenchuita,\u201d \u201cchuriros,\u201d \u201cburrto,\u201d and \u201cmargartas.\u201d Now, when I ask the brand new ChatGPT Images 2.0 model for a menu of&#8230;<\/p>\n","protected":false},"author":1,"featured_media":4189,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[19],"tags":[],"class_list":["post-4188","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-soltech-news"],"_links":{"self":[{"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/posts\/4188","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/comments?post=4188"}],"version-history":[{"count":1,"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/posts\/4188\/revisions"}],"predecessor-version":[{"id":4190,"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/posts\/4188\/revisions\/4190"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/media\/4189"}],"wp:attachment":[{"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/media?parent=4188"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/categories?post=4188"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/unum.codes\/index.php\/wp-json\/wp\/v2\/tags?post=4188"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}