ChatGPT Is a Data Privacy Nightmare. if You’ve Ever Posted Online, You Ought to Be Concerned

If you’ve ever written a blog post or product review, or commented on an article online, there’s a good chance this information was consumed by ChatGPT.

Advertisement
By The Conversation | Updated: 8 February 2023 13:12 IST
Highlights
  • Google unveiled its own conversational AI called Bard
  • The problem with chatbots is it’s fuelled by our personal data
  • The more data the model is trained on, the better it gets

ChatGPT was launched by OpenAI in November 2022

Photo Credit: Pixabay

ChatGPT has taken the world by storm. Within two months of its release it reached 100 million active users, making it the fastest-growing consumer application ever launched. Users are attracted to the tool's advanced capabilities – and concerned by its potential to cause disruption in various sectors. A much less discussed implication is the privacy risks ChatGPT poses to each and every one of us. Just yesterday, Google unveiled its own conversational AI called Bard, and others will surely follow. Technology companies working on AI have well and truly entered an arms race.

The problem is it's fuelled by our personal data.

300 billion words. How many are yours? ChatGPT is underpinned by a large language model that requires massive amounts of data to function and improve. The more data the model is trained on, the better it gets at detecting patterns, anticipating what will come next and generating plausible text.

Advertisement

OpenAI, the company behind ChatGPT, fed the tool some 300 billion words systematically scraped from the internet: books, articles, websites and posts – including personal information obtained without consent.

If you've ever written a blog post or product review, or commented on an article online, there's a good chance this information was consumed by ChatGPT.

Advertisement

So why is that an issue? The data collection used to train ChatGPT is problematic for several reasons.

First, none of us were asked whether OpenAI could use our data. This is a clear violation of privacy, especially when data are sensitive and can be used to identify us, our family members, or our location.

Advertisement

Even when data are publicly available their use can breach what we call textual integrity. This is a fundamental principle in legal discussions of privacy. It requires that individuals' information is not revealed outside of the context in which it was originally produced.

Also, OpenAI offers no procedures for individuals to check whether the company stores their personal information, or to request it be deleted. This is a guaranteed right in accordance with the European General Data Protection Regulation (GDPR) – although it's still under debate whether ChatGPT is compliant with GDPR requirements.

Advertisement

This “right to be forgotten” is particularly important in cases where the information is inaccurate or misleading, which seems to be a regular occurrence with ChatGPT.

Moreover, the scraped data ChatGPT was trained on can be proprietary or copyrighted. For instance, when I prompted it, the tool produced the first few paragraphs of Peter Carey's novel “True History of the Kelly Gang” – a copyrighted text.

Finally, OpenAI did not pay for the data it scraped from the internet. The individuals, website owners and companies that produced it were not compensated. This is particularly noteworthy considering OpenAI was recently valued at $29 billion (roughly Rs. 2,39,700 crore), more than double its value in 2021.

OpenAI has also just announced ChatGPT Plus, a paid subscription plan that will offer customers ongoing access to the tool, faster response times and priority access to new features. This plan will contribute to expected revenue of $1 billion (roughly Rs. 8,300 crore) by 2024.

None of this would have been possible without data – our data – collected and used without our permission.

A flimsy privacy policy Another privacy risk involves the data provided to ChatGPT in the form of user prompts. When we ask the tool to answer questions or perform tasks, we may inadvertently hand over sensitive information and put it in the public domain.

For instance, an attorney may prompt the tool to review a draft divorce agreement, or a programmer may ask it to check a piece of code. The agreement and code, in addition to the outputted essays, are now part of ChatGPT's database. This means they can be used to further train the tool, and be included in responses to other people's prompts.

Beyond this, OpenAI gathers a broad scope of other user information. According to the company's privacy policy, it collects users' IP address, browser type and settings, and data on users' interactions with the site – including the type of content users engage with, features they use and actions they take.

It also collects information about users' browsing activities over time and across websites. Alarmingly, OpenAI states it may share users' personal information with unspecified third parties, without informing them, to meet their business objectives.

Time to rein it in? Some experts believe ChatGPT is a tipping point for AI – a realisation of technological development that can revolutionise the way we work, learn, write and even think. Its potential benefits notwithstanding, we must remember OpenAI is a private, for-profit company whose interests and commercial imperatives do not necessarily align with greater societal needs.

The privacy risks that come attached to ChatGPT should sound a warning. And as consumers of a growing number of AI technologies, we should be extremely careful about what information we share with such tools.


Samsung's Galaxy S23 series of smartphones was launched earlier this week and the South Korean firm's high-end handsets have seen a few upgrades across all three models. What about the increase in pricing? We discuss this and more on Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: ChatGPT, OpenAI
Advertisement

Related Stories

Popular Mobile Brands
  1. Huawei MatePad SE 11 Set to Launch at This Price in India
  2. OnePlus Nord 6 Could Launch in India at This Price
  3. Huawei Teases MatePad 11.5 Price in India Ahead of Launch
  4. OpenAI's Faster GPT-5.4 Mini and Nano AI Models Are Here: Details
  5. Vivo X300 Ultra, Vivo X300s Will Feature This New Colour Technology
  6. Oppo A6s 5G With 6,500mAh Battery Launched in India: See Price
  7. Jio Users Can Get Free Incoming SMS Abroad Using Wi-Fi Calling
  1. Russia Plans Venera-D Mission to Venus in 2036 With Lander, Orbiter, and Balloon Probe
  2. Realme C100i Spotted on NBTC Certification Database as Key Features Surface Online via Retailer Listings
  3. Huawei MatePad SE 11 Price in India Revealed as Company Confirms Imminent Launch in the Country
  4. Marshall Bromley 450 Launched in India With 360-Degree Sound, Up to 40-Hour Battery Life: Price, Features
  5. Oppo Find X9s Pro Reportedly Bags 3C Certification Ahead of Launch in China: Expected Specifications
  6. Itel Unveils Zeno AI Weaver Voice Recorder in India With Up to 40 Hours Recording Capacity, Live Transcription
  7. UK Parliamentary Committee Seeks Temporary Ban on Crypto Donations Over Foreign Influence Risks
  8. Laalo: Krishna Sada Sahaayate Out on OTT: Know Where to Watch it Online
  9. Google’s Personal Intelligence Is Now Rolling Out to More Users
  10. Dreame L40 Ultra AE Robot Vacuum With 19,000Pa Vormax Suction Launched in India, Dreame D20 Ultra Tags Along
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.