{"id":178891,"date":"2023-11-06T21:36:45","date_gmt":"2023-11-06T21:36:45","guid":{"rendered":"https:\/\/www.musicbusinessworldwide.com\/?p=178891"},"modified":"2023-11-06T21:36:45","modified_gmt":"2023-11-06T21:36:45","slug":"did-anthropic-just-reveal-how-it-will-try-to-beat-universals-landmark-music-copyright-lawsuit","status":"publish","type":"post","link":"https:\/\/www.musicbusinessworldwide.com\/did-anthropic-just-reveal-how-it-will-try-to-beat-universals-landmark-music-copyright-lawsuit\/","title":{"rendered":"Did Anthropic just reveal how it will try to beat Universal&#8217;s landmark music copyright lawsuit?"},"content":{"rendered":"<p>Last month brought news of a copyright dispute that could signal a seismic shift in the dynamics between the generative AI space and the music industry.<\/p>\n<p><a class=\"link-relationship\" style=\"background: 0px 0px; color: unset !important; text-decoration: none; border-bottom: 1px dashed #ff7d00;\" title=\"Companies &gt; Universal Music Publishing Group [365 articles]\" href=\"https:\/\/www.musicbusinessworldwide.com\/companies\/universal-music-group\/universal-music-publishing-group\/\">Universal Music Publishing Group sued<\/a>\u00a0<a href=\"https:\/\/www.musicbusinessworldwide.com\/google-to-invest-up-to-2bn-in-ai-company-anthropic-which-is-currently-being-sued-for-copyright-infringement-by-universal-music-group\/\" target=\"_blank\" rel=\"noopener\">multi-billion-dollar-backed<\/a> AI company <strong>Anthropic<\/strong> for the alleged \u201csystematic and widespread infringement of their copyrighted song lyrics\u201d via its chatbot Claude.<\/p>\n<p><a href=\"https:\/\/www.musicbusinessworldwide.com\/files\/2023\/10\/UMG-lawsuit.pdf\" target=\"_blank\" rel=\"noopener\">The suit<\/a>, filed by UMPG along with co-plaintiffs <a class=\"link-relationship\" style=\"background: 0px 0px; color: unset !important; text-decoration: none; border-bottom: 1px dashed #ff7d00;\" title=\"Companies &gt; Concord [302 articles]\" href=\"https:\/\/www.musicbusinessworldwide.com\/companies\/concord\/\">Concord Music Group<\/a> and ABKCO, claims that \u201cin the process of building and operating AI models, Anthropic unlawfully copies and disseminates vast amounts of copyrighted works \u2014 including the lyrics to myriad musical compositions owned or controlled by Publishers\u201d.<\/p>\n      <div class=\"mb-advert__incontent\">      <div class=\"mb-advert mb-advert__tweeny hidden-xs hidden-ms hidden-sm\" data-loaded=\"no\" data-sizes=\"992 1200 1440\" data-name=\"628x90 Sponsor banner #5 (992+1200+1440)\" data-params=\"dfp_sponsor5_628\" id=\"dfp_sponsor5_628\"><\/div>      <div class=\"mb-advert mb-advert__banner mb-advert__banner--inline hidden-xs hidden-sm hidden-md hidden-lg\" data-loaded=\"no\" data-sizes=\"480\" data-name=\"468x60 Sponsor banner #5 (480)\" data-params=\"dfp_sponsor5_468\" id=\"dfp_sponsor5_468\"><\/div>      <div class=\"mb-advert mb-advert__mobile mb-advert__mobile--inline hidden-ms hidden-md hidden-lg\" data-loaded=\"no\" data-sizes=\"320 768\" data-name=\"300x50 Sponsor banner #5 (320+768)\" data-params=\"dfp_sponsor5_300\" id=\"dfp_sponsor5_300\"><\/div>      <\/div>      \n<p>UMPG et al&#8217;s <a href=\"https:\/\/www.musicbusinessworldwide.com\/ai-company-anthropic-amazon-sued-universal-music-group\/\" target=\"_blank\" rel=\"noopener\">lawsuit<\/a> seeks potentially tens of millions of dollars in damages from Anthropic, but perhaps more significant is that the outcome of the case could set a major legal precedent for AI companies&#8217; use of copyrighted lyrics on their platforms.<\/p>\n<p>We won&#8217;t know that outcome for some time yet, but details published within a filing from Anthropic with the US Copyright Office last week could be an early indicator of the stance the AI firm is planning to take in its copyright battle with the publishers.<\/p>\n      <div class=\"mb-advert__incontent\">      <div class=\"mb-advert mb-advert__spu\" data-loaded=\"no\" data-name=\"300x250 Sponsor MPU #1\" data-params=\"dfp_spu1\" id=\"dfp_spu1\"><\/div>      <\/div>      \n<p>Back in August, The United States Copyright Office (USCO) <a href=\"https:\/\/www.copyright.gov\/newsnet\/2023\/1017.html\" target=\"_blank\" rel=\"noopener\">issued<\/a> a notice of inquiry (NOI) in the Federal Register on the topic of copyright and AI and alongside that announced a<a href=\"https:\/\/www.copyright.gov\/ai\/docs\/Federal-Register-Document-Artificial-Intelligence-and-Copyright-NOI.pdf\" target=\"_blank\" rel=\"noopener\">\u00a0study<\/a> around copyright law and policy issues raised by artificial intelligence systems.<\/p>\n<p>In order to inform the study and &#8220;help assess whether legislative or <strong>regulatory<\/strong> <strong>steps<\/strong> in this area are warranted&#8221;, the USCO asked for written comment on these issues, &#8220;including those involved in the use of copyrighted works to train AI models, the appropriate levels of transparency and disclosure with respect to the use of copyrighted works, and the legal status of AIgenerated outputs&#8221;.<\/p>\n<p>Amongst the companies that submitted written responses as part of the study include tech giants like <strong>Meta<\/strong>, <strong>Google<\/strong> and <strong>Adobe<\/strong>, as well as prominent AI firms like <strong>Stability<\/strong> <strong>AI<\/strong> and <strong>Anthropic<\/strong>.<\/p>\n<p><em>The Verge<\/em> <a href=\"https:\/\/www.theverge.com\/2023\/11\/4\/23946353\/generative-ai-copyright-training-data-openai-microsoft-google-meta-stabilityai\" target=\"_blank\" rel=\"noopener\">has published<\/a> a roundup of some of the key arguments put forward by these companies regarding the relationship between copyrighted content and the training of datasets used by generative AI.<\/p>\n      <div class=\"mb-advert__incontent\">      <div class=\"mb-advert mb-advert__spu\" data-loaded=\"no\" data-name=\"300x250 Sponsor MPU #2\" data-params=\"dfp_spu2\" id=\"dfp_spu2\"><\/div>      <\/div>      \n<p>According to UMPG <em>et al&#8217;s<\/em> lawsuit last month, which you can <a class=\"link-internal\" href=\"https:\/\/www.musicbusinessworldwide.com\/files\/2023\/10\/UMG-lawsuit.pdf\" target=\"_blank\" rel=\"noopener\">read in full here<\/a>, Anthropic infringes the music companies\u2019 copyrights by \u201cscraping and ingesting massive amounts of text from the internet and potentially other sources, and then using that vast corpus to train its AI models and generate output based on this copied text\u201d.<\/p>\n<p>Anthropic explains in its recent USCO filing, which you can <a href=\"https:\/\/www.musicbusinessworldwide.com\/files\/2023\/11\/Anthropic.pdf\">read here <\/a>(and which we must stress is not connected to last month&#8217;s lawsuit), that its Claude chatbot &#8220;is trained using data from publicly available information on the Internet as of <strong>December 2022<\/strong>, non-public datasets that we commercially obtain from third parties, data that our users or companies hired to provide data labeling and creation services voluntarily create and provide, and data we generate internally&#8221;.<\/p>\n<p>The company also claims that it &#8220;operates its crawling system transparently,&#8221; which it claims, &#8220;means website operators can easily identify Anthropic visits and signal their preferences to Anthropic&#8221;.<\/p>\n<p>Furthermore, Anthropic says that Claude is trained using &#8220;Constitutional AI&#8221; which, it explains, means that its &#8220;model chooses the best output based on a clearly defined, explicit set of values-based instructions&#8221; by the user.<\/p>\n<p>It adds: &#8220;We have worked to incorporate respect for copyright into the design of Claude in a foundational way. We don\u2019t believe users should be able to create outputs using Claude that infringe copyrighted works. That is not an intended or permitted use of this technology, and we take steps to prevent it.&#8221;<\/p>\n<p>Here are some of Anthropic&#8217;s arguments about the relationship between generative AI and copyright law:<\/p>\n<hr \/>\n<h6>1. Anthropic argues that training LLMs\u00a0 using copyrighted material is<strong> &#8216;fair use&#8217;<\/strong><\/h6>\n<p>Anthropic tells the USCO that &#8220;the way Claude was trained qualifies as a quintessentially <strong>lawful<\/strong> use of materials&#8221;.<\/p>\n<p>Citing the US Copyright Act, the company argues that &#8220;copyright protects particular expressions, but does not extend &#8216;to any idea, procedure, process, system, method of operation, concept, principle, or discovery&#8217;.&#8221;<\/p>\n<blockquote><p>&#8220;The way Claude was trained qualifies as a quintessentially <strong>lawful<\/strong> use of materials.&#8221;<\/p><\/blockquote>\n<p>Anthropic adds: &#8220;For Claude, as discussed above, the training process makes copies of information for the purposes of performing a statistical analysis of the data.<\/p>\n<p>&#8220;The copying is merely an intermediate step, extracting unprotectable elements about the entire corpus of works, in order to create new outputs. In this way, the use of the original copyrighted work is non-expressive; that is, it is not re-using the copyrighted expression to communicate it to users.<\/p>\n<p>&#8220;To the extent copyrighted works are used in training data, it is for analysis (of statistical relationships between words and concepts) that is unrelated to any expressive purpose of the work.<\/p>\n<p>&#8220;This sort of transformative use has been recognized as lawful in the past and should continue to be considered lawful in this case.&#8221;<\/p>\n<p>Anthropic also cites various cases, which you can see on page 7 of its USCO filing <a href=\"https:\/\/www.musicbusinessworldwide.com\/files\/2023\/11\/Anthropic.pdf\">here<\/a>, that it argues, &#8220;have allowed copying works in order to create tools for searching across those works and to perform statistical analysis&#8221;.&#8221;<\/p>\n<p>The filing adds: &#8220;The training process for Claude fits neatly within these same paradigms and is fair use. Training uses works in a highly transformative, non-expressive way; rather than replicating and expressing the pre-existing work itself.<\/p>\n<p>&#8220;As discussed above, Claude is intended to help users produce new, distinct works and thus serves a different purpose from the pre-existing work.&#8221;<\/p>\n<hr \/>\n<h6>2. Anthropic does not believe that &#8220;direct, <strong>collective<\/strong>, or <strong>compulsory&#8221;<\/strong> licensing &#8220;is necessary per se&#8221; when it comes to training large language models.<\/h6>\n<p>Two of the questions Anthropic submitted written answers to were: &#8220;Is <strong>direct<\/strong>, <strong>collective<\/strong>, or <strong>compulsory<\/strong> licensing of copyrighted material practicable\/economically feasible for training LLMs?&#8221;<\/p>\n<p>Anthropic argues that &#8220;because training LLMs is a<strong> fair use<\/strong>, [it does] not believe that licensing is necessary per se&#8221;.<\/p>\n<blockquote><p>&#8220;Because training LLMs is a<strong> fair use<\/strong>, we do not believe that licensing is necessary per see.&#8221;<\/p><\/blockquote>\n<p>The company adds: &#8220;To be sure, for a variety of reasons, developers may choose to procure special access to or use of particular datasets as part of commercial transactions.<\/p>\n<p>&#8220;However, a regime that always requires licensing for use of material in training would be inappropriate; it would, at a minimum, effectively lock up access to the vast majority of works, since most works are not actively managed and licensed in any way.&#8221;<\/p>\n<p>Anthropic claims further that &#8220;as a public benefit corporation,&#8221; it is &#8220;open to engaging in further discussion of appropriate permission regimes&#8221;, but says that &#8220;policymakers should be aware of the significant practical challenges that a collective licensing regime would entail&#8221;.<\/p>\n<p>Adds Anthopic: &#8220;Licensing training data still raises many questions and potential problems from both policy and practical perspectives given that models can be trained on substantial volumes of works.<\/p>\n<p>&#8220;Requiring a license for non-expressive use of copyrighted works to train LLMs effectively means impeding use of ideas, facts, and other non-copyrightable material.&#8221;<\/p>\n<hr \/>\n<h6>3. Anthropic suggests that users could be liable for generative AI outputs that infringe copyrights<\/h6>\n<p>The response to this question to the USCO&#8217;s study might form a part of Anthropic&#8217;s defense in its legal dispute against UMG.<\/p>\n<p><strong>Question 25<\/strong> asks: &#8220;Who should be liable for generative AI outputs that may infringe copyrights?&#8221;<\/p>\n<p>According to Anthropic: &#8220;Generally, responsibility for a particular output will rest with the person who entered the prompt to generate it. That is, it is the user who engages in the relevant &#8216;volitional conduct&#8217; to generate the output and thus will usually be the relevant actor for purposes of assessing direct infringement.&#8221;<\/p>\n<blockquote><p>&#8220;Generally, responsibility for a particular output will rest with the person who entered the prompt to generate it.&#8221;<\/p><\/blockquote>\n<p>Anthropic adds: &#8220;At the same time, courts also have tools to adjudicate whether a service provider (or others involved in development of an LLM) face secondary liability for the user\u2019s conduct.<\/p>\n<p>&#8220;While merely offering an LLM service (including doing so commercially) would not in and of itself generate liability, courts are well-equipped to examine particular circumstances where a service provider meets the relevant thresholds for secondary liability &#8211; i.e., whether the provider knows and materially contributes to the infringement; has the right and ability to control the act and directly financially benefits; or induces the infringement by clearly promoting use of its tool for infringing purposes.&#8221;<\/p>\n<p>Anthropic explains further: &#8220;Claude employs a range of measures to inhibit the production of infringing outputs, including terminating accounts of repeat infringers or violators if we become aware of their infringing activities.<\/p>\n<p>&#8220;We look forward to continued collaboration with content creators and others to ensure these measures to combat such uses are robust.&#8221;<\/p>\n<hr \/>\n<p>If Anthropic does choose to use this user liability argument in the suit filed by UMPG, it might only get it so far.<\/p>\n<p>That&#8217;s because one of the issues alleged in UMPG, Concord and ABKCO&#8217;s complaint iss that Anthropic\u2019s AI models generate output containing the publishing companies\u2019 lyrics \u201ceven when the models are not specifically asked to do so\u201d.<\/p>\n<p>The lawsuit claims that the Claude chatbot responds to various prompts that don\u2019t specifically ask for the copyrighted lyrics \u201cby generating output that nevertheless copies Publishers\u2019 lyrics\u201d.<\/p>\n<p>Examples of such requests include asking the chatbot to \u201cwrite a song about a certain topic, provide chord progressions for a given musical composition, or write poetry or short fiction in the style of a certain artist or songwriter\u201d.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Anthropic responds to the Copyright Office\u2019s Notice of Inquiry on Copyright and Artificial Intelligence <\/p>\n","protected":false},"author":15,"featured_media":178088,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[7],"tags":[131248],"class_list":["post-178891","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","tag-anthropic"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/posts\/178891","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/users\/15"}],"replies":[{"embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/comments?post=178891"}],"version-history":[{"count":0,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/posts\/178891\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/media\/178088"}],"wp:attachment":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/media?parent=178891"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/categories?post=178891"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/tags?post=178891"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}