The feature is powered by Google’s Gemini Omni, Google’s new AI model that is claimed to be capable of generating anything from any input.
Photo Credit: Google
Omni is Google's first video generation model that combines multimodal input in a single prompt
Google is rolling out its Gemini Avatar feature to a wider group of paid Gemini subscribers. It allows users to create an AI-powered digital version of themselves that can be used to generate videos and other content within Gemini. The feature is powered by Google's Gemini Omni, Google's new AI model that is claimed to be capable of generating anything from any input. First discovered during an APK teardown in March, Gemini Avatar is designed to replicate a user's appearance and voice.
The avatar feature can be accessed through the Gemini app by navigating to Settings > Avatar (via Android Authority). Once enabled, users can create a personalised AI avatar mimicking their facial features, voice, and mannerisms. It can also be summoned within Gemini conversations using specific commands.
As per Google, the setup process is fairly straightforward. Users must first complete a camera-based training session where they hold their phone at eye level and follow on-screen prompts to capture their face from multiple angles. This enables Gemini to generate a detailed model of their facial structure.
The second stage involves voice training. Gemini asks users to read a series of random phrases and numbers aloud to the system. The company says this allows the system to analyse their speech patterns, tone, accent, and cadence. Once the setup phase is complete, the avatar is linked to the user's Google account and becomes available across Gemini experiences.
To ensure that the face and voice scans are accepted, users should ensure that their phone is held at eye level and the lighting is not too dim or too bright. Further, they are advised to make sure that their eyes, nose, and mouth are visible. The company allows glasses to be included as part of the avatar, but hats, sunglasses, and masks are not allowed. Lastly, there should not be any people or images of faces in the user's background.
Google has also announced several safeguards to address potential risks with realistic AI avatars. To begin with, users must be at least 18 years old to create an avatar. The account owner must also be physically present throughout the enrolment process. The company further emphasised that all videos generated using Gemini Avatar include Google's SynthID watermark. The invisible watermark is embedded directly into generated content and can be detected using Google tools. Google says this will help viewers identify AI-generated videos and reduce the risk of misuse.
The company has yet to announce whether the feature will also be available to free Gemini users.
Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.
Tomb Raider: Legacy of Atlantis Gets AI Disclosure on Steam, Crystal Dynamics Clarifies AI Use