{"id":23419,"date":"2024-03-13T08:03:16","date_gmt":"2024-03-13T06:03:16","guid":{"rendered":"https:\/\/www.wjst.de\/blog\/?p=23419"},"modified":"2024-03-13T08:04:47","modified_gmt":"2024-03-13T06:04:47","slug":"how-does-ai-recognize-ai-text","status":"publish","type":"post","link":"https:\/\/www.wjst.de\/blog\/sciencesurf\/2024\/03\/how-does-ai-recognize-ai-text\/","title":{"rendered":"How does AI  recognize AI text"},"content":{"rendered":"<p>The <a href=\"https:\/\/www.semrush.com\/blog\/how-to-detect-ai-written-content-and-plagiarism\/\">Semrush blog<\/a> has a nice summary<\/p>\n<blockquote><p>By analyzing two main characteristics of the text:\u00a0perplexity\u00a0and\u00a0<a href=\"https:\/\/machinelearning.wtf\/terms\/burstiness\/\" target=\"_blank\" rel=\"noreferrer noopener\">burstiness<\/a>. In other words, how predictable or unpredictable it sounds to the reader, as well as how varied or uniform the sentences are.<\/p><\/blockquote>\n<p><a href=\"https:\/\/www.techslang.com\/perplexity-in-nlp-definition-pros-and-cons\/#:~:text=Perplexity%20is%20a%20statistical%20measure,the%20model%20predicts%20the%20text\">Perplexity is<\/a><\/p>\n<blockquote><p>a statistical measure of how confidently a language model predicts a text sample. In other words, it quantifies how \u201csurprised\u201d the model is when it sees new data. The lower the perplexity, the better the model predicts the text.<\/p><\/blockquote>\n<p><a href=\"https:\/\/machinelearning.wtf\/terms\/burstiness\/\">Burstiness is<\/a><\/p>\n<blockquote><p>is the intermittent increases and decreases in activity or frequency of an event. One of measures of burstiness is the Fano factor \u2014a ratio between the variance and mean of counts. In natural language processing, burstiness has a slightly more specific definition&#8230; A word is more likely to occur again in a document if it has already appeared in the document. Importantly, the burstiness of a word and its semantic content are positively correlated; words that are more informative are also more bursty.<\/p><\/blockquote>\n<p>Or lets call it <a href=\"https:\/\/www.wired.com\/story\/how-to-spot-generative-ai-text-chatgpt\/\">entropy?<\/a>\u00a0So we now have some criteria<\/p>\n<ul>\n<li style=\"list-style-type: none;\">\n<ul>\n<li>AI texts are more uniform and\u00a0 more predictable and often repetitive with<\/li>\n<li>lack of depth and personality<\/li>\n<li>Sometimes plagiarism checker may recognize &#8220;learned&#8221; AI phrases. Sometimes reference checkers will find &#8220;hallucinated&#8221; references<\/li>\n<li>Incorrect content and outdated information in contrast needs human experts<\/li>\n<li>An obvious, yet underappreciated downside: &#8220;<a href=\"https:\/\/www.nature.com\/articles\/s42254-024-00713-4\">AI texts have nothing to say<\/a>&#8221; &#8211; &#8220;<a href=\"https:\/\/www.nature.com\/articles\/s42254-024-00713-4\">clich\u00e9d nothingness<\/a>&#8220;.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>Well, there appear now also\u00a0 <a href=\"https:\/\/pubpeer.com\/publications\/01593D32DC9AD348EA64D7BF9ABE0C?\">AI prologue sentences<\/a> in scientific literature for example like &#8220;Certainly! Here is&#8230;&#8221;<\/p>\n\n<p>&nbsp;<\/p>\n<div class=\"bottom-note\">\n  <span class=\"mod1\">CC-BY-NC Science Surf , accessed 28.04.2026<\/span>\n <\/div>","protected":false},"excerpt":{"rendered":"<p>The Semrush blog has a nice summary By analyzing two main characteristics of the text:\u00a0perplexity\u00a0and\u00a0burstiness. In other words, how predictable or unpredictable it sounds to the reader, as well as how varied or uniform the sentences are. Perplexity is a statistical measure of how confidently a language model predicts a text sample. In other words, &hellip; <a href=\"https:\/\/www.wjst.de\/blog\/sciencesurf\/2024\/03\/how-does-ai-recognize-ai-text\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">How does AI  recognize AI text<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20,9],"tags":[3358,3330],"class_list":["post-23419","post","type-post","status-publish","format-standard","hentry","category-note-worthy","category-computer-software","tag-ai","tag-text"],"_links":{"self":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/23419","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/comments?post=23419"}],"version-history":[{"count":2,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/23419\/revisions"}],"predecessor-version":[{"id":23421,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/23419\/revisions\/23421"}],"wp:attachment":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/media?parent=23419"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/categories?post=23419"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/tags?post=23419"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}