{"id":19615,"date":"2022-03-11T08:14:56","date_gmt":"2022-03-11T06:14:56","guid":{"rendered":"https:\/\/www.wjst.de\/blog\/?p=19615"},"modified":"2022-08-02T09:28:17","modified_gmt":"2022-08-02T07:28:17","slug":"another-problem-in-ai-out-of-distribution-generalization","status":"publish","type":"post","link":"https:\/\/www.wjst.de\/blog\/sciencesurf\/2022\/03\/another-problem-in-ai-out-of-distribution-generalization\/","title":{"rendered":"Another problem in AI: Out-of-distribution generalization"},"content":{"rendered":"<p>Not sure if it is really the biggest but certainly one of the most pressing problems: Out-of-distribution generalization. It is <a href=\"https:\/\/towardsdatascience.com\/out-of-distribution-generalization-66b6f8980ef3\">explained<\/a> as<\/p>\n<blockquote><p>Imagine, for example, an AI that\u2019s trained to identify cows in images. Ideally, we\u2019d want it to learn to detect cows based on their shape and colour. But what if the cow pictures we put in the training dataset always show cows standing on grass? In that case, we have a spurious correlation between grass and cows, and if we\u2019re not careful, our AI might learn to become a grass detector rather than a cow detector.<\/p><\/blockquote>\n<p>As an epidemiologist I would have simply said, it is <a href=\"https:\/\/www.nature.com\/articles\/s41467-020-19478-2#:~:text=A%20collider%20is%20most%20simply,1a).\">colliding<\/a> or <a href=\"https:\/\/sphweb.bumc.bu.edu\/otlt\/mph-modules\/bs\/bs704-ep713_confounding-em\/BS704-EP713_Confounding-EM2.html\">confounding<\/a>, so every new field is rediscovering the same problems over and over again.<\/p>\n<p>&nbsp;<\/p>\n<p><iframe loading=\"lazy\" title=\"YouTube video player\" src=\"https:\/\/www.youtube.com\/embed\/QjXFN4UWZCg\" width=\"560\" height=\"315\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p>&nbsp;<\/p>\n<blockquote><p>Not unexpected AI just running randomly over pixels is leading to spurious association. Once shape and colour of cows has been detected, surrounding environment, like grass or stable is irrelevant. That means that after getting initial results we have to step back, simulate different lighting conditions from sunlight to lightbulb and environmental conditions from grass to slatted floor (invariance principle). Also shape and size matters &#8211; cow spots will keep to some extent size and form irrespective if it is a real animal or children toy (scaling principle). I am a bit more sceptical about including also multimodal data (eg smacking sound) as the absence of these features is no proof of non-existence while this sound can also be imitated by other animals.<\/p><\/blockquote>\n<p>And yes, <a href=\"https:\/\/projecteuclid.org\/journals\/statistical-science\/volume-21\/issue-1\/Classifier-Technology-and-the-Illusion-of-Progress\/10.1214\/088342306000000060.full\">less is more<\/a>.<\/p>\n\n<p>&nbsp;<\/p>\n<div class=\"bottom-note\">\n  <span class=\"mod1\">CC-BY-NC Science Surf , accessed 17.04.2026<\/span>\n <\/div>","protected":false},"excerpt":{"rendered":"<p>Not sure if it is really the biggest but certainly one of the most pressing problems: Out-of-distribution generalization. It is explained as Imagine, for example, an AI that\u2019s trained to identify cows in images. Ideally, we\u2019d want it to learn to detect cows based on their shape and colour. But what if the cow pictures &hellip; <a href=\"https:\/\/www.wjst.de\/blog\/sciencesurf\/2022\/03\/another-problem-in-ai-out-of-distribution-generalization\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Another problem in AI: Out-of-distribution generalization<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9],"tags":[3358,1995],"class_list":["post-19615","post","type-post","status-publish","format-standard","hentry","category-computer-software","tag-ai","tag-algorithm"],"_links":{"self":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/19615","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/comments?post=19615"}],"version-history":[{"count":8,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/19615\/revisions"}],"predecessor-version":[{"id":20282,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/posts\/19615\/revisions\/20282"}],"wp:attachment":[{"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/media?parent=19615"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/categories?post=19615"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.wjst.de\/blog\/wp-json\/wp\/v2\/tags?post=19615"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}