In its motion to dismiss parts of The New York Times' lawsuit against it, Microsoft says that large language models (LLMs) are the essence of what copyright law considers transformative and fair use. The New York Times filed the lawsuit against Microsoft and OpenAI in December last year, arguing that the two companies trained their artificial intelligence (AI) models by infringing on millions of its copyrighted articles. OpenAI has since disputed this claim, stating that the publication manipulated prompts to produce regurgitations of its articles.
“Nowhere does The Times allege that anyone other than its legal team would actually do any of this, and certainly not on a scale that merits the doomsday futurology it pushes,” Microsoft said, comparing the copyright concerns the publication has raised about AI to those the Motion Picture Association of America once raised about the videocassette recorder (VCR).
Microsoft seeks to dismiss the following claims:
That Microsoft is liable for end-user copyright infringement:
Despite seeking to hold Microsoft liable for end-user infringement, The New York Times has not alleged that any such infringement actually took place. Microsoft says that while the publication insinuates that GPT models recite large portions of its content with minimal prompting, it buries the admission that its prompts each comprised a short snippet from the beginning of a New York Times article, often verbatim paragraphs. Microsoft adds that the complaint does not explain why any person already in possession of these articles would want to feed parts of them into ChatGPT as prompts.
The publication also fails to adequately claim that Microsoft contributed to end-user copyright infringement. A contributory infringer is “one who, with knowledge of the infringing activity, induces, causes or materially contributes to the infringing conduct of another.” The complaint does not allege that Microsoft encouraged users to use its products for infringement, or that it promoted or induced infringement. Further, to make such a claim The New York Times would have to establish actual knowledge of specific acts of infringement. Since no such acts have been alleged, Microsoft argues that it has no knowledge of end users carrying out infringing activities with its AI models, and so cannot be liable for contributing to such infringement.
That Microsoft violated copyright management information provisions under the Digital Millennium Copyright Act:
Copyright management information (CMI) is information about a copyrighted work, its creator, its owner, or the use of the work that is conveyed in connection with the work (such as its title or the copyright owner’s name). To make a claim of CMI violation, a person has to allege that the CMI was “intentionally removed or altered” from a copy of the work. Further, it must be established that the offending party removed CMI “knowing, or having reasonable grounds to know, that it will induce, enable, facilitate, or conceal an infringement” of copyright. Microsoft says that over the past 18 months, many plaintiffs have failed to adequately plead this claim against generative AI tools. It breaks the alleged CMI violation down into two parts:
- Alleged infringement in training data: Microsoft says that the point of CMI is to inform the public that something is copyrighted. Training takes place out of public view, and training data itself is not disseminated. The company argues that even the New York Times’ complaint acknowledges that Microsoft believes its conduct of training AI language models on copyrighted content is protected under the “fair use” doctrine of copyright law. The New York Times has not alleged that Microsoft’s belief in its fair use rights is unreasonable. The allegation of CMI removal from training data is also not substantiated, with no actual examples of CMI removal being mentioned in The New York Times’ complaint.
- Alleged infringing output: Here, Microsoft argues that getting a GPT model to generate The New York Times’ content would require the user to already know its origins. Further, CMI removal claims lie only against works that are “substantially or entirely reproduced.” The New York Times’ claims about search results generated by the GPT-based Bing tools vaguely refer to “copies” and “derivatives of Times Works” without explaining what precisely was copied. Elsewhere in its complaint, The New York Times has also admitted that these results are answers to user queries and may include extensive paraphrasing and direct quotes. Microsoft says that the mere non-inclusion of CMI with “paraphrases” or “quotes” does not constitute the removal of CMI from a substantially identical copy of a work.
That the GPT models compete with the New York Times by creating the same or similar content:
The New York Times claims that Microsoft’s AI models threaten high-quality journalism and hinder the publication’s ability to grow its subscriber base. It argues that the models threaten the reputation of its product review website Wirecutter by falsely attributing recommendations to it. On Wirecutter’s website, recommendations contain hyperlinks to purchase the recommended products, which can earn Wirecutter commissions. Since GPT tools do not include these links, The New York Times argues that this amounts to “misappropriation of commercial referrals.” Microsoft dismisses this as mere speculation, saying there isn’t a single real-world example of such an occurrence.
Also read:
- OpenAI Responds To The New York Times Copyright Lawsuit Calling It Meritless
- OpenAI Sued For Copyright Infringement By The Intercept, Raw Story And AlterNet
- Elon Musk Sues OpenAI For Violating Founding Agreement Of Developing Artificial General Intelligence For Public Benefit