Google Teases Computer Vision, Conversational Capabilities of Gemini AI Ahead of Google I/O Event

In a teaser video shared by Google, Gemini is shown to access the camera and describe the surroundings.

Advertisement
Written by Akash Dutta, Edited by Manas Mitul | Updated: 14 May 2024 19:31 IST
Highlights
  • Google’s Gemini also appears to be better at holding a conversation
  • Google I/O event is scheduled for 10:30pm IST on May 14
  • Google can announce new AI features for Gemini during the event

Google is also expected to unveil Android 15 and Wear OS 5 during the event

Photo Credit: X/Google

Google shared a video on its social media platforms on Monday, teasing new capabilities of its artificial intelligence (AI)-powered chatbot Gemini. The video was released just a day before the company's annual developer-focused Google I/O event. It is believed that the tech giant could make several announcements around AI and unveil new features and possibly new AI models. Besides that, the centre-stage is likely to be taken by Android 15 and Wear OS 5, which could be unveiled during the event.

In a short video posted on X (formerly known as Twitter), the official account of Google teased new capabilities of its in-house AI chatbot. The 50 second-long video highlighted marked improvements in its speech, giving Gemini a more emotive voice and modulations that gives it a more human-like appearance. Further, the video highlighted new computer vision capabilities. The AI could pick up on the visuals on the screen and analyse it.

Advertisement

Gemini could also access the camera of the smartphone, a capability it does not possess at present. The user was moving the camera across the space and asked the AI to describe what it saw. Almost without any time lag, the chatbot could describe the setting as a stage and when prompted, could even recognise the Google I/O logo and share information around it.

The video shared no further details about the AI, and instead asked people to watch the event to know more. There are some questions that might be answered during the event such as whether Google is using a new large language model (LLM) for computer vision or if it an upgraded version of Gemini 1.5 Pro. Further, Google may also reveal what else can the AI do with its computer vision. Notably, there are rumours that the tech giant might introduce Gems, which are considered to be chatbot agents that can be designed for particular tasks, similar to OpenAI's GPTs.

Advertisement

While Google's event is expected to introduce new features to Gemini, OpenAI held its Spring Update event on Monday and unveiled its latest GPT-4o AI model that added features to ChatGPT, similar to the video shared by Google. The new AI model allows it to have a conversational speech, computer vision, real-time language translation, and more.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Vivo T5 Pro 5G Confirmed to Launch in India Soon With These Features
  2. Google Pixel 10 Users Can Now Play Steam Games Offline via GameNative
  3. Samsung Galaxy S26 FE Geekbench Listing Reveals Benchmark Figures
  1. Band Melam OTT Release: Know Where to Watch the Telugu Romantic Musical Film
  2. Microsoft Releases New AI Models That Can Generate Images, Audio and Transcribe Text
  3. Redmi K Pad 2, New Redmi Laptops Tipped to Launch Alongside Redmi K90 Ultra
  4. Google Pixel 10 Users Can Now Play Steam Games Offline via GameNative 0.9.0
  5. Circle Unveils cirBTC Token to Expand Bitcoin’s Role in DeFi Ecosystem
  6. Honor 600 Series Could Launch Soon as Company Starts Teasing Debut of a New Phone
  7. Microsoft AI Chief Wants to Deliver State-of-the-Art AI Models by 2027: Report
  8. Infinix GT 50 Pro Leak Shows Design, Cooling, Gaming Features Ahead of Anticipated Launch
  9. Samsung Galaxy Z Fold 8, Galaxy Z Flip 8 to Stick With Older M13 OLED Panels: Report
  10. Crypto Hack Losses Drop to $168.6 Million in Q1 2026 Despite Ongoing Risks
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.