{"id":24060,"date":"2024-09-19T20:24:53","date_gmt":"2024-09-19T18:24:53","guid":{"rendered":"https:\/\/www.wjst.de\/blog\/?p=24060"},"modified":"2024-09-19T20:25:18","modified_gmt":"2024-09-19T18:25:18","slug":"remarkable-i-dont-want-to-be-part-of-this-scene-anymore","status":"publish","type":"post","link":"https:\/\/www.wjst.de\/blog\/sciencesurf\/2024\/09\/remarkable-i-dont-want-to-be-part-of-this-scene-anymore\/","title":{"rendered":"Remarkable : I don&#8217;t want to be part of this scene anymore"},"content":{"rendered":"<p>From the creator of <a href=\"https:\/\/github.com\/rspeer\/wordfreq\/blob\/master\/SUNSET.md?utm_source=tldrwebdev\">wordfreq<\/a><\/p>\n<blockquote>\n<p dir=\"auto\">Generative AI has polluted the data<br \/>\nI don&#8217;t think anyone has reliable information about post-2021 language usage by humans.<br \/>\nThe open Web (via OSCAR) was one of wordfreq&#8217;s data sources. Now the Web at large is full of slop generated by large language models, written by no one to communicate nothing. Including this slop in the data skews the word frequencies.<\/p>\n<p>&nbsp;<\/p><\/blockquote>\n\n<p>&nbsp;<\/p>\n<div class=\"bottom-note\">\n  <span class=\"mod1\">CC-BY-NC Science Surf , accessed 09.06.2026<\/span>\n <\/div>","protected":false},"excerpt":{"rendered":"<p>From the creator of wordfreq Generative AI has polluted the data I don&#8217;t think anyone has reliable information about post-2021 language usage by humans. The open Web (via OSCAR) was one of wordfreq&#8217;s data sources. Now the Web at large is full of slop generated by large language models, written by no one to communicate &hellip; <a href=\"https:\/\/www.wjst.de\/blog\/sciencesurf\/2024\/09\/remarkable-i-dont-want-to-be-part-of-this-scene-anymore\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Remarkable : I don&#8217;t want to be part of this scene anymore<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5,9],"tags":[3358],"class_list":["post-24060","post","type-post","status-publish","format-standard","hentry","category-philosophy-of-science","category-computer-software","tag-ai"],"_links":{"self":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/24060","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/comments?post=24060"}],"version-history":[{"count":2,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/24060\/revisions"}],"predecessor-version":[{"id":24062,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/24060\/revisions\/24062"}],"wp:attachment":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/media?parent=24060"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/categories?post=24060"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/tags?post=24060"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}