Stability AI partnered with Fireworks AI, an API platform, to make the new models available.
Photo Credit: Stability AI
The new Stable Diffusion 3 models feature a Multimodal Diffusion Transformer (MMDiT) architecture
Stable Diffusion 3 and Stable Diffusion 3 Turbo models were unveiled in preview in February. Now, Stability AI is finally making the artificial intelligence (AI) text-to-image models available for some users. The company will let developers access the AI model through the Stability AI Developer Platform API. It has partnered with the API platform Fireworks AI to bring the models to the public. Notably, the next-generation AI image models by the AI firm come with improved text understanding and spelling capabilities.
Stability AI announced the limited availability of the AI models via a post in its newsroom, and said, “As revealed in the Stable Diffusion 3 research paper, this model is equal to or outperforms state-of-the-art text-to-image generation systems such as DALL-E 3 and Midjourney v6 in typography and prompt adherence, based on human preference evaluations.”
The new text-to-image models have two noteworthy upgrades. First, its understanding of the prompt text has improved. It can now understand the contextual knowledge within the prompt better and can generate images which are closer to what the user desires. It also has improved spelling capabilities. This will help when a user wants to generate an image with written words in it. The company highlighted earlier that the AI will take a closer look at what's being written and offer better output. Overall image quality is also expected to be improved.
These new AI models will also be open-sourced in the near future, at least to some extent. The company said that it will make the model weights available for self-hosting with a Stability AI Membership soon. Stability AI also explained that it used a new Multimodal Diffusion Transformer (MMDiT) architecture for the model.
Apart from the AI image generators, Stability AI also invited a limited number of users to participate in the early release of its Stable Assistant which is currently in beta. The AI assistant is powered by Stable Diffusion 3, and Stable LM 2 12B which adds conversational capabilities. It can generate images from conversations, generate content, as well as improve content to match the generated image. Currently, it is not known when the company might release the new AI image models to all members.
Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.
Engineers Turn Lobster Shells Into Robot Parts That Lift, Grip and Swim
Strongest Solar Flare of 2025 Sends High-Energy Radiation Rushing Toward Earth
Raat Akeli Hai: The Bansal Murders OTT Release: When, Where to Watch the Nawazuddin Siddiqui Murder Mystery
Bison Kaalamaadan Is Now Streaming: Know All About the Tamil Sports Action Drama