Stable Diffusion 3, Turbo Models Are Now Available via Stability AI Developer Platform API

Stability AI partnered with Fireworks AI, an API platform, to make the new models available.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 18 April 2024 12:01 IST

Stable Diffusion 3, Turbo Models Are Now Available via Stability AI Developer Platform API

Photo Credit: Stability AI

The new Stable Diffusion 3 models feature a Multimodal Diffusion Transformer (MMDiT) architecture

Highlights

Stability AI said it will soon make the model weights available
Stable Diffusion 3 models have an improved text understanding
It also invited participants for the early release of Stable Assistant

Stable Diffusion 3 and Stable Diffusion 3 Turbo models were unveiled in preview in February. Now, Stability AI is finally making the artificial intelligence (AI) text-to-image models available for some users. The company will let developers access the AI model through the Stability AI Developer Platform API. It has partnered with the API platform Fireworks AI to bring the models to the public. Notably, the next-generation AI image models by the AI firm come with improved text understanding and spelling capabilities.

Stability AI announced the limited availability of the AI models via a post in its newsroom, and said, “As revealed in the Stable Diffusion 3 research paper, this model is equal to or outperforms state-of-the-art text-to-image generation systems such as DALL-E 3 and Midjourney v6 in typography and prompt adherence, based on human preference evaluations.”

The new text-to-image models have two noteworthy upgrades. First, its understanding of the prompt text has improved. It can now understand the contextual knowledge within the prompt better and can generate images which are closer to what the user desires. It also has improved spelling capabilities. This will help when a user wants to generate an image with written words in it. The company highlighted earlier that the AI will take a closer look at what's being written and offer better output. Overall image quality is also expected to be improved.

AI Models Can Now Compete in This Bizarre 'Miss AI' Influencer Pageant

These new AI models will also be open-sourced in the near future, at least to some extent. The company said that it will make the model weights available for self-hosting with a Stability AI Membership soon. Stability AI also explained that it used a new Multimodal Diffusion Transformer (MMDiT) architecture for the model.

Apart from the AI image generators, Stability AI also invited a limited number of users to participate in the early release of its Stable Assistant which is currently in beta. The AI assistant is powered by Stable Diffusion 3, and Stable LM 2 12B which adds conversational capabilities. It can generate images from conversations, generate content, as well as improve content to match the generated image. Currently, it is not known when the company might release the new AI image models to all members.

Is the Samsung Galaxy Z Flip 5 the best foldable phone you can buy in India right now? We discuss the company's new clamshell-style foldable handset on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.

Affiliate links may be automatically generated - see our ethics statement for details.

Comments

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Further reading: Stable Diffusion, Stability AI, AI, Artificial intelligence, AI image generator

Akash Dutta Email Akash Dutta

Akash Dutta is a Senior Sub Editor at Gadgets 360. He is particularly interested in the social impact of technological developments and loves reading about emerging fields such as AI, metaverse, and fediverse. In hi... more »