Discover more: Artificial Intelligence, ChatGPT, copyright, Digital Media, Free Reads

After artists and coders, news outlets up against OpenAI for using their articles to train ChatGPT

This issue has also been flagged by the creative industry, amid increasing use of ChatGPT and focus on artificial intelligence

Sarasvati NT

Published

February 20, 2023

In the latest criticism of ChatGPT, news organisations have called out OpenAI for using their articles to train the Artificial Intelligence (AI) software without any agreement covering that use, Bloomberg reported. News outlets such as the Wall Street Journal and CNN have said that OpenAI must pay to license their content for AI training purposes.

Jason Conti, general counsel for News Corp’s Dow Jones unit, which publishes the Wall Street Journal, told Bloomberg that the news firm’s work should be used for AI training only after a license has been acquired from the company, and that there is currently no such deal between OpenAI and Dow Jones. Similarly, an anonymous source at CNN said that this use violates the network’s terms of service and that the company plans to reach out to OpenAI to discuss the matter further.

There is not much clarity over how data from the internet is used for Machine Learning or training an AI tool. This has raised concerns of copyright infringement in the creative industry. We delve deeper into this in our report on Google’s MusicLM and the copyright issues associated with the AI model.


Who revealed the news sources first?

Francesco Marconi, a computational journalist who has previously worked with the Wall Street Journal, tweeted on February 15 that ChatGPT is trained using a large number of news sources. Marconi obtained a list of 20 news sources from ChatGPT itself by using the prompt: “Which specific news sources was chatGPT trained on? Provide a list of the top news sources in your database.”

Here’s the prompt I used: "Which specific news sources was chatGPT trained on? Provide a list of the top news sources in your database."

— Francesco Marconi (@fpmarconi) February 15, 2023

In addition to the Wall Street Journal and CNN, the list included major publishers such as the New York Times, Reuters, Al Jazeera, BBC News, The Guardian, Associated Press, The Economist and Bloomberg, among others. Marconi stated that if there is no agreement with these publishers, the training may amount to a violation of the publishers’ terms of service. Given the long list of news sources the AI bot mentioned, OpenAI could face a wave of legal challenges if other publishers follow suit.

Why it matters:

The use of ChatGPT sparked discussions about the AI tool plagiarising papers, reports and other reference materials in the publishing sector. By claiming compensation for their work being used for AI training, news outlets now join the band of artists and coders who are already suing companies for scraping their creations without a deal. The developments in this space will be worth noting to understand the challenges that companies will face while using data on the web for machine learning and for training AI tools.

Lawsuits against AI systems in the news:

In early February, Getty Images sued Stability AI, creator of Stable Diffusion, an open-source AI model that generates images from text prompts, for “brazen infringement of Getty Images’ intellectual property on a staggering scale”. Getty Images has claimed that the AI company copied over 12 million images from its database without permission or compensation “as part of its efforts to build a competing business”.

In 2022, programmers and copyright lawyers filed a class action lawsuit against Microsoft, GitHub and OpenAI, alleging that GitHub Copilot uses “long sections of licensed code” without crediting the original coders. Copilot is a GitHub product that works as an AI-based coding assistant and is trained on large “public repositories” of code from the web, many of which are licensed.

This post is released under a CC-BY-SA 4.0 license. Please feel free to republish on your site, with attribution and a link. Adaptation and rewriting, though allowed, should be true to the original.


Written By Sarasvati NT

Curious about the intersection of technology with education, caste and welfare rights. For story tips, please feel free to reach out at sarasvati@medianama.com
