Skip to main content

OpenAI’s new image generator aims to be practical enough for designers and advertisers

OpenAI has released a new image generator that’s designed less for typical surrealist AI art and more for highly controllable and practical creation of visuals—a sign that OpenAI thinks its tools are ready for use in fields like advertising and graphic design. 

The image generator, which is now part of the company’s GPT-4o model, was promised by OpenAI last May but wasn’t released. Requests for generated images on ChatGPT were filled by an older image generator called DALL-E. OpenAI has been tweaking the new model since then and will now release it over the coming weeks to all tiers of users starting today, replacing the older one. 

The new model makes progress on technical issues that have plagued AI image generators for years. While most have been great at creating fantastical images or realistic deepfakes, they’ve been terrible at something called binding, which refers to the ability to identify certain objects correctly and put them in their proper place (like a sign that says “hot dogs” properly placed above a food cart, not somewhere else in the image). 

It was only a few years ago that models started to succeed at things like “Put the red cube on top of the blue cube,” a feature that is essential for any creative professional use of AI. Generators also struggle with text generation, typically creating distorted jumbles of letter shapes that look more like captchas than readable text.

Example images from OpenAI show progress here. The model is able to generate 12 discrete graphics within a single image—like a cat emoji or a lightning bolt—and place them in proper order. Another shows four cocktails accompanied by recipe cards with accurate, legible text. More images show comic strips with text bubbles, mock advertisements, and instructional diagrams. The model also allows you to upload images to be modified, and it will be available in the video generator Sora as well as in GPT-4o. 

It’s “a new tool for communication,” says Gabe Goh, the lead designer on the generator at OpenAI. Kenji Hata, a researcher at OpenAI who also worked on the tool, puts it a different way: “I think the whole idea is that we’re going away from, like, beautiful art.” It can still do that, he clarifies, but it will do more useful things too. “You can actually make images work for you,” he says, “and not just just look at them.”

It’s a clear sign that OpenAI is positioning the tool to be used more by creative professionals: think graphic designers, ad agencies, social media managers, or illustrators. But in entering this domain, OpenAI has two paths, both difficult. 

One, it can target the skilled professionals who have long used programs like Adobe Photoshop, which is also investing heavily in AI tools that can fill images with generative AI. 

“Adobe really has a stranglehold on this market, and they’re moving fast enough that I don’t know how compelling it is for people to switch,” says David Raskino, the cofounder and chief technical officer of Irreverent Labs, which works on AI video generation. 

The second option is to target casual designers who have flocked to tools like Canva (which has also been investing in AI). This is an audience that may not have ever needed technically demanding software like Photoshop but would use more casual design tools to create visuals. To succeed here, OpenAI would have to lure people away from platforms built for design in hopes that the speed and quality of its own image generator would make the switch worth it (at least for part of the design process). 

It’s also possible the tool will simply be used as many image generators are now: to create quick visuals that are “good enough” to accompany social media posts. But with OpenAI planning massive investments, including participation in the $500 billion Stargate project to build new data centers at unprecedented scale, it’s hard to imagine that the image generator won’t play some ambitious moneymaking role. 

Regardless, the fact that OpenAI’s new image generator has pushed through notable technical hurdles has raised the bar for other AI companies. Clearing those hurdles likely required lots of very specific data, Raskino says, like millions of images in which text is properly displayed at lots of different angles and orientations. Now competing image generators will have to match those achievements to keep up.

“The pace of innovation should increase here,” Raskino says.



from MIT Technology Review https://ift.tt/S3zPyCk
via IFTTT

Comments

Popular posts from this blog

Roundtables: Unveiling the 10 Breakthrough Technologies of 2025

Recorded on January 3, 202 5 Unveiling the 10 Breakthrough Technologies of 2025 Speakers: Amy Nordrum , executive editor, and Charlotte Jee , news editor. Each year, MIT Technology Review publishes an annual list of the top ten breakthrough technologies that will have the greatest impact on how we live and work in the future. This year, the 10 Breakthrough Technologies list was unveiled live by our editors. Hear from  MIT Technology Review  executive editor Amy Nordrum and news editor Charlotte Jee as they share an unveiling of the list of the 10 breakthrough technologies. Related Coverage The 10 Breakthrough Technologies of 2025 3 things that didn’t make the 10 Breakthrough Technologies of 2025 list The 10 Breakthrough Technologies of 2024 from MIT Technology Review https://ift.tt/0Xert49 via IFTTT

Why scientists want to help plants capture more carbon dioxide

This article is from The Spark, MIT Technology Review’s weekly climate newsletter. To receive it in your inbox every Wednesday,  sign up here. Hello hello!  This week in The Spark, we’re taking a look back at one of my favorite sessions from our ClimateTech conference last week, from a chapter we called “Cleaning Your Plate.”  In the session, I sat down with Pamela Ronald, a plant geneticist at the University of California, Davis. She’s been working for years on helping rice survive floods, and now she’s turning her attention to using advanced genetics for carbon removal on farmland.  Genetics and plants Scientists have a wide range of tools at their disposal to influence how plants grow. From standard genetic engineering to more sophisticated gene editing tools like CRISPR, we have more power than ever to influence what traits we want in crops.  But genetic tweaking isn’t anything new. “Virtually everything we eat has been improved using some sort o...

The Animation Guild ratifies a contract with big studios, without AI demands such as letting members opt out of using AI or having AI train on their work (Gene Maddaus/Variety)

Gene Maddaus / Variety : The Animation Guild ratifies a contract with big studios, without AI demands such as letting members opt out of using AI or having AI train on their work   —  The Animation Guild has ratified its contract with the major studios, despite concerns from some about protections against artificial intelligence. from Techmeme https://ift.tt/4pCbZY7 via IFTTT