{"id":222156,"date":"2025-01-29T20:53:10","date_gmt":"2025-01-29T20:53:10","guid":{"rendered":"https:\/\/www.musicbusinessworldwide.com\/?p=222156"},"modified":"2025-01-29T21:34:47","modified_gmt":"2025-01-29T21:34:47","slug":"openai-valued-at-157bn-and-facing-multiple-copyright-lawsuits-says-chinas-deepseek-may-have-used-its-data-without-permission","status":"publish","type":"post","link":"https:\/\/www.musicbusinessworldwide.com\/openai-valued-at-157bn-and-facing-multiple-copyright-lawsuits-says-chinas-deepseek-may-have-used-its-data-without-permission\/","title":{"rendered":"OpenAI, valued at $157bn and facing multiple copyright infringement lawsuits, says China\u2019s DeepSeek may have used its data to train rival AI model without permission"},"content":{"rendered":"<p>The sudden arrival of China\u2019s DeepSeek AI chatbot has thrown the US\u2019s AI industry into what <a href=\"https:\/\/www.forbes.com\/sites\/dereksaul\/2025\/01\/29\/deepseek-panic-live-updates-openai-and-microsoft-reportedly-probing-deepseek-used-their-data-for-training\/\"><em>Forbes<\/em> calls<\/a> a \u201cpanic.\u201d<\/p>\n<p>AI-related companies\u2019 stocks fell sharply on Monday, with chipmaker <strong>Nvidia<\/strong> leading the way to a <span style=\"color: #ff0000;\"><strong>17%<\/strong><\/span> stock price drop, wiping out nearly <strong>USD $600 billion<\/strong> of the company\u2019s market cap. (A rebound on Tuesday clawed back some of those losses.)<\/p>\n<p>The reason for the panic? DeepSeek appears to have created a chatbot with similar capabilities to those of US-made chatbots, but using a fraction of the resources and capital needed to develop the US models. The company says it spent just <strong>$5.6 million<\/strong> developing the chatbot, compared to the <a href=\"https:\/\/www.wired.com\/story\/openai-ceo-sam-altman-the-age-of-giant-ai-models-is-already-over\/\" target=\"_blank\" rel=\"noopener\">more than <strong>$100 million<\/strong><\/a> that OpenAI spent developing ChatGPT-4.<\/p>\n<p><span style=\"font-weight: 400;\">      <div class=\"mb-advert__incontent\">      <div class=\"mb-advert mb-advert__tweeny hidden-xs hidden-ms hidden-sm\" data-loaded=\"no\" data-sizes=\"992 1200 1440\" data-name=\"628x90 Sponsor banner #5 (992+1200+1440)\" data-params=\"dfp_sponsor5_628\" id=\"dfp_sponsor5_628\"><\/div>      <div class=\"mb-advert mb-advert__banner mb-advert__banner--inline hidden-xs hidden-sm hidden-md hidden-lg\" data-loaded=\"no\" data-sizes=\"480\" data-name=\"468x60 Sponsor banner #5 (480)\" data-params=\"dfp_sponsor5_468\" id=\"dfp_sponsor5_468\"><\/div>      <div class=\"mb-advert mb-advert__mobile mb-advert__mobile--inline hidden-ms hidden-md hidden-lg\" data-loaded=\"no\" data-sizes=\"320 768\" data-name=\"300x50 Sponsor banner #5 (320+768)\" data-params=\"dfp_sponsor5_300\" id=\"dfp_sponsor5_300\"><\/div>      <\/div>      <\/span><\/p>\n<p>Now it appears that DeepSeek may have accomplished this through intellectual property theft. According to news reports, <strong>OpenAI<\/strong> \u2013 maker of the ChatGPT chatbot that triggered the global AI craze a few years ago \u2013 is investigating whether DeepSeek violated its intellectual property in creating its R1 artificial intelligence model.<\/p>\n<p>OpenAI was reportedly notified of the possible violation by its key investor, <strong>Microsoft<\/strong>, according to <em>Bloomberg<\/em>, which <a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2025-01-29\/microsoft-probing-if-deepseek-linked-group-improperly-obtained-openai-data\" target=\"_blank\" rel=\"noopener\">first reported<\/a> on the matter. The AI company then blocked DeepSeek\u2019s access to ChatGPT, <a href=\"https:\/\/www.ft.com\/content\/a0dfedd1-5255-4fa9-8ccc-1fe01de87ea6\" target=\"_blank\" rel=\"noopener\">according to<\/a> the <em>Financial Times<\/em>.<\/p>\n<p>For those following the growing battle between copyright owners and AI developers, this seems like a clear case of tables turning: OpenAI has been dragged into court by the likes of <em>The New York Times<\/em>, comedian-writer <a href=\"https:\/\/www.musicbusinessworldwide.com\/sarah-silverman-sues-openai-and-meta-over-alleged-copyright-infringement-in-generative-ai-training1\/\" target=\"_blank\" rel=\"noopener\"><strong>Sarah Silverman<\/strong><\/a>, <em>Game of Thrones<\/em> author <a href=\"https:\/\/www.musicbusinessworldwide.com\/george-r-r-martin-john-grisham-and-other-writers-sue-openai-for-copyright-infringement\/\" target=\"_blank\" rel=\"noopener\"><strong>George R.R. Martin<\/strong><\/a>, and German music licensing organization <a href=\"https:\/\/www.musicbusinessworldwide.com\/openai-sued-by-gema-in-germany-for-unlicensed-use-of-song-lyrics\/\" target=\"_blank\" rel=\"noopener\"><strong>GEMA<\/strong><\/a>. They all accuse the company of training its AI on copyrighted content without permission.<\/p>\n<p><span style=\"font-weight: 400;\">      <div class=\"mb-advert__incontent\">      <div class=\"mb-advert mb-advert__spu\" data-loaded=\"no\" data-name=\"300x250 Sponsor MPU #1\" data-params=\"dfp_spu1\" id=\"dfp_spu1\"><\/div>      <\/div>      <\/span><\/p>\n<p>In an interview with Fox News, President <strong>Donald Trump<\/strong>\u2019s artificial intelligence czar, <strong>David Sacks<\/strong>, said there\u2019s \u201csubstantial evidence\u201d that DeepSeek used \u201cdistillation\u201d to develop its AI technology.<\/p>\n<p><a href=\"https:\/\/innodata.com\/what-is-knowledge-distillation-in-ai\/\" target=\"_blank\" rel=\"noopener\">Distillation<\/a> is a process in which a smaller, more efficient AI model is trained to mimic the outputs of a larger, less efficient model in order to replicate its behavior. The technique is used to create AI services that are cheaper to develop, and require less processing power and energy to run.<\/p>\n<p>However, in this instance, it appears DeepSeek used ChatGPT-4 to train its own, smaller model, a violation of OpenAI\u2019s terms of service, <em>FT<\/em> reported.<\/p>\n<p>\u201cThe issue is when you [take it out of the platform and] are doing it to create your own model for your own purposes,\u201d an unnamed individual close to OpenAI said.<\/p>\n<p><span style=\"font-weight: 400;\">      <div class=\"mb-advert__incontent\">      <div class=\"mb-advert mb-advert__spu\" data-loaded=\"no\" data-name=\"300x250 Sponsor MPU #2\" data-params=\"dfp_spu2\" id=\"dfp_spu2\"><\/div>      <\/div>      <\/span><\/p>\n<p>Some experts say it could be hard to stop AI developers from using distillation to piggyback on the achievements of other AI developers, because the practice is widespread.<\/p>\n<p>\u201cIt is a very common practice for start-ups and academics to use outputs from human-aligned commercial LLMs, like ChatGPT, to train another model,\u201d <strong>Ritwik Gupta<\/strong>, a PhD candidate in AI at the University of California, Berkeley, told <em>FT<\/em>.<\/p>\n<p>\u201cIt is not surprising to me that DeepSeek supposedly would be doing the same. If they were, stopping this practice precisely may be difficult.\u201d<\/p>\n<p>\u201cWe know [People\u2019s Republic of China]-based companies \u2013 and others \u2013 are constantly trying to distill the models of leading US AI companies,\u201d OpenAI said in a statement.<\/p>\n<p>\u201cWe engage in countermeasures to protect our IP, including a careful process for which frontier capabilities to include in released models, and believe\u2009.\u2009.\u2009. it is critically important that we are working closely with the US government to best protect the most capable models from efforts by adversaries and competitors to take US technology.\u201d<\/p>\n<hr \/>\n<p>That marks a striking change of tone from the one OpenAI has taken when defending itself against accusations of IP theft. In response to the suit brought by Sarah Silverman and other authors, OpenAI <a href=\"https:\/\/www.musicbusinessworldwide.com\/openais-response-to-sarah-silvermans-lawsuit-shows-music-rightsholders-could-be-in-for-a-tough-fight-over-copyright-and-ai\/\" target=\"_blank\" rel=\"noopener\">indicated<\/a> it plans to defend itself by arguing that using copyrighted works to train AI should be considered a \u201cfair use\u201d exemption to US copyright laws.<\/p>\n<p>The \u201cfair use\u201d argument has been made by other AI developers, <a href=\"https:\/\/www.musicbusinessworldwide.com\/did-anthropic-just-reveal-how-it-will-try-to-beat-universals-landmark-music-copyright-lawsuit\/\" target=\"_blank\" rel=\"noopener\">notably <strong>Anthropic<\/strong><\/a> (sued by <strong>Universal Music Group<\/strong> (UMG), <strong>Concord<\/strong>, and <strong>ABKCO<\/strong> for allegedly violating copyrights on lyrics) and <a href=\"https:\/\/www.musicbusinessworldwide.com\/as-suno-and-udio-admit-training-ai-with-unlicensed-music-record-industry-says-theres-nothing-fair-about-stealing-an-artists-lifes-work\/\" target=\"_blank\" rel=\"noopener\"><strong>Suno<\/strong> and <strong>Udio<\/strong><\/a>, two AI music-making apps, sued by the three music majors, that are banking so heavily on the fair-use argument that they all but admitted to using UMG, <strong>Warner Music<\/strong>, and <strong>Sony Music<\/strong> content without permission in developing their apps.<\/p>\n<p>In the wake of DeepSeek\u2019s release of R1, and the ensuing market chaos, AI developers may well be rethinking their liberal stance on intellectual property.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>DeepSeek spent far less money on developing a chatbot than US AI companies, but it may have done so by stealing OpenAI&#8217;s IP<\/p>\n","protected":false},"author":46,"featured_media":216731,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[7],"tags":[133582,133581,130934],"class_list":["post-222156","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","tag-david-sacks","tag-deepseek","tag-openai"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/posts\/222156","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/users\/46"}],"replies":[{"embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/comments?post=222156"}],"version-history":[{"count":0,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/posts\/222156\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/media\/216731"}],"wp:attachment":[{"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/media?parent=222156"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/categories?post=222156"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.musicbusinessworldwide.com\/wp-json\/wp\/v2\/tags?post=222156"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}