{"id":126700,"date":"2023-02-23T07:00:00","date_gmt":"2023-02-23T13:00:00","guid":{"rendered":"http:\/\/www.thelocalvoice.net\/oxford\/?p=126700"},"modified":"2023-02-22T13:42:05","modified_gmt":"2023-02-22T19:42:05","slug":"can-artificial-intelligence-plagiarize","status":"publish","type":"post","link":"https:\/\/www.thelocalvoice.net\/oxford\/can-artificial-intelligence-plagiarize\/","title":{"rendered":"<strong>Can Artificial Intelligence Plagiarize?<\/strong>"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\"><em>University of Mississippi professor collaborates with Penn State to study copying, paraphrasing in bots such as ChatGPT<\/em><\/h2>\n\n\n\n<p>Since the launch of\u00a0<strong>ChatGPT\u00a0<\/strong>in November, the online tool has gained a record-breaking 100 million active users. Its technology, which automatically generates text for its users based on prompts, is highly sophisticated. But are there ethical concerns?<\/p>\n\n\n\n<p>A <strong>University of Mississippi<\/strong> professor has co-authored a\u00a0paper, led by collaborators at <strong>Penn State University<\/strong>, showing that artificial intelligence-driven language models, possibly including ChatGPT, are guilty of plagiarism \u2013 in more ways than one.<\/p>\n\n\n\n<p>&#8220;My co-authors and I started to think, if people use this technology to write essays, grant proposals, patent applications, we need to care about possibilities for plagiarism,&#8221; said Thai Le, assistant professor of computer and information science in the\u00a0<strong>School of Engineering<\/strong>. &#8220;We decided to investigate whether these models display plagiarism behaviors.&#8221;<\/p>\n\n\n\n<p>The study, which is the first of its kind, evaluated <strong>OpenAI&#8217;s GPT-2<\/strong>, a precursor to ChatGPT&#8217;s current technology. They tested three separate criteria for plagiarism: direct copying of content, paraphrasing and copying ideas from text without proper attribution.<\/p>\n\n\n\n<p>To do this, they created a method to automatically detect plagiarism and tested it against GPT-2&#8217;s training data, which is &#8220;memorized&#8221; in part and reproduced by the technology. Much of this data, which is publicly available online, is scraped from the internet without informing content owners.<\/p>\n\n\n\n<p>By comparing 210,000 generated texts to the 8 million GPT-2 pre-training documents, the team found evidence of all three types of plagiarism in the language models they tested. Their paper explains that GPT-2 can &#8220;exploit and reuse words, sentences and even core ideas in the generated texts.&#8221;<\/p>\n\n\n\n<p>Furthermore, the team hypothesizes that the larger the model size and associated training data, the greater the possibility of plagiarism.<\/p>\n\n\n\n<p>&#8220;People pursue large language models because the larger the model gets, generation abilities increase,&#8221; said <strong>Jooyoung Lee<\/strong>, first author and an information sciences and technology doctoral student at Penn State. &#8220;At the same time, they are jeopardizing the originality and creativity of the content within the training corpus. This is an important finding.&#8221;<\/p>\n\n\n\n<p>The scientists believe that this automatic plagiarism detection method could be applied to later versions of <strong>OpenAI<\/strong> technology, such as those used by ChatGPT.<\/p>\n\n\n\n<p>The research team will present their findings at the\u00a02023 <strong>ACM Web Conference<\/strong>, set for April 30-May 4 in <strong>Austin, Texas<\/strong>.<\/p>\n\n\n\n<p><strong>Robert Cummings<\/strong>, associate professor of writing and rhetoric at <strong>Ole Miss<\/strong>, has\u00a0given advice\u00a0to higher education professionals about ChatGPT&#8217;s implications in the classroom. A collaborator with Le in other AI-related research, Cummings suggests that users should be pragmatic when referencing material gained from language models.<\/p>\n\n\n\n<p>&#8220;We have to be careful about what ideas are ours and what are borrowed,&#8221; Cummings said. &#8220;Pre-ChatGPT, I&#8217;d Google something as part of my research, and it would be sourced. If I was looking for general knowledge, I&#8217;d consult Wikipedia.<\/p>\n\n\n\n<p>&#8220;Now, it&#8217;s important to designate what came from ChatGPT and put it off to the side as unsourced ideas.&#8221;<\/p>\n\n\n\n<p>Le acknowledges the importance of finding solutions to these ethical issues, whether that be on the user side or on the side of scientific advancement.<\/p>\n\n\n\n<p>&#8220;There are many important philosophical questions related to this technology,&#8221; he said. &#8220;Computer science researchers will continue to think of ways to improve these language models to change the way they generate text in such a way that they would not plagiarize.&#8221;<\/p>\n\n\n\n<p>This material is based upon work supported by the <strong>National Science Foundation<\/strong> under Grant Nos. 1934782 and 2114824.<\/p>\n\n\n\n<p><em>By\u00a0Erin Garrett<\/em><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2014\/06\/TheLocalVoiceLigature-25web.jpg\"><img data-recalc-dims=\"1\" decoding=\"async\" width=\"25\" height=\"16\" src=\"https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2014\/06\/TheLocalVoiceLigature-25web.jpg?resize=25%2C16\" alt=\"\" class=\"wp-image-14544\"\/><\/a><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><a href=\"https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2023\/02\/Thai-Le-750.jpg\"><img data-recalc-dims=\"1\" fetchpriority=\"high\" decoding=\"async\" width=\"640\" height=\"960\" src=\"https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2023\/02\/Thai-Le-750.jpg?resize=640%2C960\" alt=\"\" class=\"wp-image-126701\" srcset=\"https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2023\/02\/Thai-Le-750.jpg?resize=683%2C1024&amp;ssl=1 683w, https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2023\/02\/Thai-Le-750.jpg?resize=200%2C300&amp;ssl=1 200w, https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2023\/02\/Thai-Le-750.jpg?w=750&amp;ssl=1 750w\" sizes=\"(max-width: 640px) 100vw, 640px\" \/><\/a><figcaption class=\"wp-element-caption\">Thai Le. Photo by Thomas Graning\/Ole Miss Digital Imaging Services<\/figcaption><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>University of Mississippi professor collaborates with Penn State to study copying, paraphrasing in bots such as ChatGPT Since<\/p>\n","protected":false},"author":123462,"featured_media":126702,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[17687],"tags":[24389,24206,24387,5,7067,24388,24386,4,24385,24390,655],"class_list":["post-126700","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-university-of-mississippi","tag-acm-web-conference","tag-chatgpt","tag-jooyoung-lee","tag-mississippi","tag-ole-miss","tag-openai","tag-openais-gpt-2","tag-oxford","tag-penn-state-university","tag-robert-cummings","tag-university-of-mississippi"],"jetpack_featured_media_url":"https:\/\/i0.wp.com\/www.thelocalvoice.net\/oxford\/wp-content\/uploads\/2023\/02\/ChatGPT.jpg?fit=1200%2C800&ssl=1","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/posts\/126700","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/users\/123462"}],"replies":[{"embeddable":true,"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/comments?post=126700"}],"version-history":[{"count":0,"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/posts\/126700\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/media\/126702"}],"wp:attachment":[{"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/media?parent=126700"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/categories?post=126700"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.thelocalvoice.net\/oxford\/wp-json\/wp\/v2\/tags?post=126700"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}