Normal view
-
TechCrunch News
- A stealth AI model beat DALL-E and Midjourney on a popular benchmark β its creator just landed $30M
OpenAIβs new AI image generator is potent and bound to provoke
The arrival of OpenAI's DALL-E 2 in the spring of 2022 marked a turning point in AI, when text-to-image generation suddenly became accessible to a select group of users, creating a community of digital explorers who experienced wonder and controversy as the technology automated the act of visual creation.
But like many early AI systems, DALL-E 2 struggled with consistent text rendering, often producing garbled words and phrases within images. It also had limitations in following complex prompts with multiple elements, sometimes missing key details or misinterpreting instructions. These shortcomings left room for improvement that OpenAI would address in subsequent iterations, such as DALL-E 3 in 2023.
On Tuesday, OpenAI announced new multimodal image-generation capabilities that are directly integrated into its GPT-4o AI language model, making it the default image generator within the ChatGPT interface. The integration, called "4o Image Generation" (which we'll call "4o IG" for short), allows the model to follow prompts more accurately (with better text rendering than DALL-E 3) and respond to chat context for image modification instructions.
Β© OpenAI
ChatGPTβs image-generation feature gets an upgrade
Viral AI company DeepSeek releases new image model family
DeepSeek, the viral AI company, has released a new set of multimodal AI models that it claims can outperform OpenAIβs DALL-E 3. The models, which are available for download from the AI dev platform Hugging Face, are part of a new model family that DeepSeek is calling Janus-Pro. They range in size from 1 billion [β¦]
Β© 2024 TechCrunch. All rights reserved. For personal use only.
-
TechCrunch News
- Microsoft rolls back its Bing Image Creator model after users complain of degraded quality
Microsoft rolls back its Bing Image Creator model after users complain of degraded quality
Ahead of the holidays, Microsoft said it was upgrading the AI model behind Bing Image Creator, the AI-powered image editing tool built into the companyβs Bing search engine. Microsoft promised that the new model β the latest version of OpenAIβs DALL-E 3 model, code-named PR16 β would allow users to create images βtwice as fast [β¦]
Β© 2024 TechCrunch. All rights reserved. For personal use only.
Google DeepMind unveils a new video model to rival Sora
Google DeepMind, Googleβs flagship AI research lab, wants to beat OpenAI at the video-generation game β and it might just, at least for a little while. On Monday, DeepMind announced Veo 2, a next-gen video-generating AI and the successor to Veo, which powers a growing number of products across Googleβs portfolio. Veo 2 can create [β¦]
Β© 2024 TechCrunch. All rights reserved. For personal use only.
X says its new image generator, Aurora, will launch for all users within the week
X, the Elon Musk-owned social network previously known as Twitter, quietly added a new image generator to itsΒ GrokΒ assistant this past Saturday. Then, it removed it. Now, itβs bringing it back β and officially announcing it. The image generator, called Aurora, was developed by Muskβs AI company, xAI, and trained on billions of examples from the [β¦]
Β© 2024 TechCrunch. All rights reserved. For personal use only.
Elon Muskβs X gains a new image generator, Aurora
X, the Elon Musk-owned social network previously known as Twitter, has added a new image generator to its Grok assistant. However, after going live for a few hours on Saturday, the product seemed to disappear for some users. Just like the first image generator X added to Grok in October, this one, called Aurora, appears [β¦]
Β© 2024 TechCrunch. All rights reserved. For personal use only.