Facebook, Twitter Behavioural Data Fraught With Biases: Study

Advertisement
By Press Trust of India | Updated: 28 November 2014 15:09 IST
Using social media such as Twitter and Facebook to gather data on human behaviour may be fraught with biases, scientists say.

Behavioural scientists use social media to quickly and cheaply gather huge amounts of data about what people are thinking and doing but researchers at Carnegie Mellon University in the US and McGill University in Canada have found that those massive datasets may be misleading. Carnegie Mellon's Juergen Pfeffer and McGill's Derek

Ruths said that scientists need to find ways of correcting for the biases inherent in the information gathered from Twitter and other social media, or to at least acknowledge the shortcomings of that data. It is not an insignificant problem, researchers noted that thousands of research papers each year are now based on data gleaned from social media, a source of data that barely existed even five years ago.

"Not everything that can be labelled as 'Big Data' is automatically great," Pfeffer said. He said that many researchers think - or hope - that if they gather a large enough dataset they can overcome any biases or distortion that might lurk there.

Advertisement

Despite researchers' attempts to generalise their study results to a broad population, social media sites often have substantial population biases; generating the random samples that give surveys their power to accurately reflect attitudes and behaviour is problematic, scientists said.

Advertisement

Instagram, for instance, has special appeal to adults between the ages of 18 and 29, African-Americans, Latinos, women and urban dwellers, while Pinterest is dominated by women between the ages of 25 and 34 with average household incomes of $100,000. Yet Ruths and Pfeffer said researchers seldom acknowledge, much less correct, these built-in sampling biases.

Other questions about data sampling may never be resolved because social media sites use proprietary algorithms to create or filter their data streams and those algorithms are subject to change without warning.

Advertisement

Most researchers are left in the dark, though others with special relationships to the sites may get a look at the site's inner workings. The rise of these "embedded researchers," Ruths and Pfeffer said, in turn is creating a divided social media research community.

In an article published in the journal Science, researchers also noted that not all "people" on these sites are even people. Some are professional writers or public relations representatives, who post on behalf of celebrities or corporations, others are simply phantom accounts. Some "followers" can be bought.

Advertisement

The social media sites try to hunt down and eliminate such bogus accounts - half of all Twitter accounts created in 2013 have already been deleted - but a lone researcher may have difficulty detecting those accounts within a dataset, according to Ruths and Pfeffer.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: Facebook, Social, Twitter
Advertisement

Related Stories

Popular Mobile Brands
  1. Oppo K14x 5G With 6,500mAh Battery Goes on Sale in India: See Price, Offers
  2. Vivo X300 FE Reportedly Bags IMDA and TUV Certifications Ahead of Launch
  3. Samsung Galaxy S26+ Reportedly Listed for Sale Online Ahead of Launch
  4. Apple to Reportedly Launch Low-Cost MacBook in 'Playful Colors' in March
  5. AI Impact Summit: From Registration to Schedule, All You Need to Know
  6. Samsung Galaxy A27 5G Lands on IMEI Database, Could Launch Soon
  7. Lava Bold N2 Will Be Launched in India on This Date: See Expected Specs
  8. Anthropic's First Indian Office in Bengaluru Is Now Open
  9. Oppo Find X10 Series Could Debut This Year With This iPhone-Like Feature
  1. X Building Smart 'Cashtags' to Let Users Check Cryptocurrency Prices in Real-Time
  2. Samsung Galaxy A27 5G Listing on IMEI Database Suggests a Galaxy A26 Successor Is on the Way
  3. Anthropic Inaugurates First Indian Office in Bengaluru, Starts Hiring Local Talent
  4. Apple Tipped to Adopt Samsung's Privacy Display Technology for MacBook Models by 2029
  5. Oppo Find X10 Series Tipped to Launch in H2 2026 With Built-In Magnets for Wireless Charging
  6. AMD and TCS to Co-Develop Helios AI Data Centre Architecture, Deliver 200MW Data Centre Blueprint
  7. Tecno Spark 50 4G Tipped to Launch Globally Soon; Design, Colourways, Key Features Leaked
  8. Lava Bold N2 India Launch Date Revealed; Will Be Exclusively Available via Amazon
  9. Government Green Lights Rs. 10,000 Crore Fund of Funds 2.0 Under the Startup India Mission
  10. Samsung’s 'Wide' Galaxy Z Fold Design Revealed via Leaked One UI 9 Animations
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.