Meta Used Public Instagram, Facebook Posts to Train Its New AI Assistant

Meta also said it did not use private chats on its messaging services as training data for the AI model.

Advertisement
By Reuters | Updated: 29 September 2023 14:58 IST
Highlights
  • Meta introduced its AI assistant at the Meta Connect annual conference
  • The company excluded private posts shared only with family and friends
  • Tech firms have faced criticism over AI training without permission

Meta made the assistant using a custom model based on the powerful Llama 2

Photo Credit: Reuters

Meta Platforms used public Facebook and Instagram posts to train parts of its new Meta AI virtual assistant, but excluded private posts shared only with family and friends in an effort to respect consumers' privacy, the company's top policy executive told Reuters in an interview.

Meta also did not use private chats on its messaging services as training data for the model and took steps to filter private details from public datasets used for training, said Meta President of Global Affairs Nick Clegg, speaking on the sidelines of the company's annual Connect conference this week.

"We've tried to exclude datasets that have a heavy preponderance of personal information," Clegg said, adding that the "vast majority" of the data used by Meta for training was publicly available.

Advertisement

He cited LinkedIn as an example of a website whose content Meta deliberately chose not to use because of privacy concerns.

Advertisement

Clegg's comments come as tech companies including Meta, OpenAI and Alphabet's Google have been criticized for using information scraped from the internet without permission to train their AI models, which ingest massive amounts of data in order to summarize information and generate imagery.

The companies are weighing how to handle the private or copyrighted materials vacuumed up in that process that their AI systems may reproduce, while facing lawsuits from authors accusing them of infringing copyrights.

Advertisement

Meta AI was the most significant product among the company's first consumer-facing AI tools unveiled by CEO Mark Zuckerberg on Wednesday at Meta's annual products conference, Connect. This year's event was dominated by talk of artificial intelligence, unlike past conferences which focused on augmented and virtual reality.

Meta made the assistant using a custom model based on the powerful Llama 2 large language model that the company released for public commercial use in July, as well as a new model called Emu that generates images in response to text prompts, it said.

Advertisement

The product will be able to generate text, audio and imagery and will have access to real-time information via a partnership with Microsoft's Bing search engine.

The public Facebook and Instagram posts that were used to train Meta AI included both text and photos, Clegg said.

Those posts were used to train Emu for the image generation elements of the product, while the chat functions were based on Llama 2 with some publicly available and annotated datasets added, a Meta spokesperson told Reuters.

Interactions with Meta AI may also be used to improve the features going forward, the spokesperson said.

Clegg said Meta imposed safety restrictions on what content the Meta AI tool could generate, like a ban on the creation of photo-realistic images of public figures.

On copyrighted materials, Clegg said he was expecting a "fair amount of litigation" over the matter of "whether creative content is covered or not by existing fair use doctrine," which permits the limited use of protected works for purposes such as commentary, research and parody.

"We think it is, but I strongly suspect that's going to play out in litigation," Clegg said.

Some companies with image-generation tools facilitate the reproduction of iconic characters like Mickey Mouse, while others have paid for the materials or deliberately avoided including them in training data.

OpenAI, for instance, signed a six-year deal with content provider Shutterstock this summer to use the company's image, video and music libraries for training.

Asked whether Meta had taken any such steps to avoid the reproduction of copyrighted imagery, a Meta spokesperson pointed to new terms of service barring users from generating content that violates privacy and intellectual property rights.

© Thomson Reuters 2023


From the launch of the Infinix GT 10 Pro to Amazon's latest mega-sale, we discuss the most noteworthy technology news events of the week on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Further reading: Meta, Meta AI, Facebook, Instagram, AI, OpenAI
Advertisement

Related Stories

Popular Mobile Brands
  1. The Madras Mystery OTT Release: Know All About This Nazriya Nazim Thriller
  1. Busy Weekend for ISS as Progress 93 Docks and Cygnus XL Prepares for Launch
  2. NASA’s X-59 Quiet Supersonic Jet Prepares for First Flight, to Fly Without the Sonic Boom
  3. The Bad Guys 2 OTT Release: Know All About This Animated Comedy Movie
  4. The Rip OTT Release: When and Where to Watch the Matt Damon, Ben Affleck Thriller
  5. Kurukshetra: The Great War of Mahabharata Animated Series Is Coming to This OTT Platform Very Soon
  6. Astronomers Predict 90 Percent Chance of Spotting an Exploding Black Hole in Next Decade
  7. DNA Cassette Tapes Could Transform the Future of Digital Storage
  8. Researchers Create Metal That Resists Cracking in Deep Space Cold
  9. The Madras Mystery OTT Release: This Nazriya Nazim Thriller Will Soon Arrive on This Platform
  10. The Treasure Hunters OTT Release: Know When and Where to Watch Manisha Rani's Game Show Online
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.