<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[just learning data science]]></title><description><![CDATA[I share my journey from a Data Science graduate, through many dead ends and aha moments, to a more profound understanding of the craft. I write to both help others and invite constructive criticism for my own growth.]]></description><link>https://blog.justlearningdatascience.com</link><image><url>https://substackcdn.com/image/fetch/$s_!2zUb!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F08e7a4ef-ed22-4add-9c10-666203ee6e14_1024x1024.png</url><title>just learning data science</title><link>https://blog.justlearningdatascience.com</link></image><generator>Substack</generator><lastBuildDate>Wed, 06 May 2026 11:08:57 GMT</lastBuildDate><atom:link href="https://blog.justlearningdatascience.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Maciej Gruszczyński]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[justlearningdatascience@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[justlearningdatascience@substack.com]]></itunes:email><itunes:name><![CDATA[Maciej Gruszczyński]]></itunes:name></itunes:owner><itunes:author><![CDATA[Maciej Gruszczyński]]></itunes:author><googleplay:owner><![CDATA[justlearningdatascience@substack.com]]></googleplay:owner><googleplay:email><![CDATA[justlearningdatascience@substack.com]]></googleplay:email><googleplay:author><![CDATA[Maciej Gruszczyński]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The Mystery of the Normal Distribution in Nature]]></title><description><![CDATA[Laid bare]]></description><link>https://blog.justlearningdatascience.com/p/the-mystery-of-the-normal-distribution</link><guid isPermaLink="false">https://blog.justlearningdatascience.com/p/the-mystery-of-the-normal-distribution</guid><dc:creator><![CDATA[Maciej Gruszczyński]]></dc:creator><pubDate>Tue, 11 Nov 2025 23:10:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!UCMv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="pullquote"><p><strong>&#8220;The normal distribution is the most common distribution in nature.&#8221;</strong></p></div><p>I heard this countless times in school&#8230;</p><p>The bell curve itself makes intuitive sense&#8212;there are few very short people, few very tall people and lots in the middle, right? But something deeper bothered me. My teachers were making an extraordinary claim: among infinite possible bell-shaped curves, <strong>nature consistently chooses one specific mathematical form</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UCMv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UCMv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 424w, https://substackcdn.com/image/fetch/$s_!UCMv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 848w, https://substackcdn.com/image/fetch/$s_!UCMv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 1272w, https://substackcdn.com/image/fetch/$s_!UCMv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UCMv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png" width="496" height="328.3956043956044" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:964,&quot;width&quot;:1456,&quot;resizeWidth&quot;:496,&quot;bytes&quot;:123007,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.justlearningdatascience.com/i/176683365?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UCMv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 424w, https://substackcdn.com/image/fetch/$s_!UCMv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 848w, https://substackcdn.com/image/fetch/$s_!UCMv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 1272w, https://substackcdn.com/image/fetch/$s_!UCMv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d9bbec-f269-4c3f-8cd4-bca5ae405353_1674x1108.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{equation}\nf(x) = \\frac{1}{\\sigma \\sqrt{2\\pi} } e^{-\\frac{1}{2}\\left(\\frac{x-\\mu}{\\sigma}\\right)^2}\n\\end{equation}&quot;,&quot;id&quot;:&quot;HROKYPDFMJ&quot;}" data-component-name="LatexBlockToDOM"></div><p></p><p>How could this be?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.justlearningdatascience.com/subscribe?"><span>Subscribe now</span></a></p><p>The answer lies in a remarkable property. When you <strong>add up many small components, their sum tends toward a normal distribution</strong>. If human height is the sum of many body segments, then height will be approximately normally distributed.</p><p>But there&#8217;s a catch&#8212;<strong>it won&#8217;t always work</strong>. Several conditions must be met:</p><ul><li><p>Components are independent</p></li></ul><ul><li><p>Each component must have finite variance (the distribution tail decreases sufficiently fast)</p></li><li><p>No single component can dominate the others in terms of variance (<a href="https://en.wikipedia.org/wiki/Lindeberg%27s_condition">Lindeberg&#8217;s condition</a>)</p></li><li><p>You need enough components</p></li></ul><p>Are these conditions realistic? Do they actually occur in nature?</p><h2>The Human Height Example</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pZp3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pZp3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 424w, https://substackcdn.com/image/fetch/$s_!pZp3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 848w, https://substackcdn.com/image/fetch/$s_!pZp3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 1272w, https://substackcdn.com/image/fetch/$s_!pZp3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pZp3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png" width="304" height="295.33678756476684" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/295285d1-0485-417f-8299-80ee547b15c3_772x750.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:750,&quot;width&quot;:772,&quot;resizeWidth&quot;:304,&quot;bytes&quot;:45550,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.justlearningdatascience.com/i/176683365?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pZp3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 424w, https://substackcdn.com/image/fetch/$s_!pZp3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 848w, https://substackcdn.com/image/fetch/$s_!pZp3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 1272w, https://substackcdn.com/image/fetch/$s_!pZp3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F295285d1-0485-417f-8299-80ee547b15c3_772x750.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Histogram of human height from the NHANES dataset with the normal curve overlaid. Image from &#8220;Statistical Thinking for the 21st Century&#8221; by Russell A. Poldrack</figcaption></figure></div><blockquote><p>Adult height serves as an empirical example of a normally distributed biological trait in textbooks on statistics&#185;.</p></blockquote><p>Let&#8217;s examine human height more closely. At a basic level, we might divide the body into major segments, say: foot, ankle, calf, knee, thigh, hips, belly, chest, neck, and head height. For these to sum to a normal distribution:</p><ul><li><p>they cannot influence each other</p></li><li><p>their individual variances must be fairly similar</p></li><li><p>and with so few components, each would need to be nearly normal itself</p></li></ul><p>The number of components is the key.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BlO6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BlO6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BlO6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BlO6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BlO6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BlO6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg" width="120" height="187.74725274725276" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2278,&quot;width&quot;:1456,&quot;resizeWidth&quot;:120,&quot;bytes&quot;:2067133,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.justlearningdatascience.com/i/176683365?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BlO6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BlO6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BlO6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BlO6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30571b26-64b4-44d5-9e89-c3c07c393398_2185x3418.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p></p><p>Instead of a few major segments, imagine dividing the ankle into all the bones and other tissues that comprise it. Go further&#8212;consider parts of these bones, or even smaller, unspecified fragments beyond human nomenclature. Suddenly, we have a large number of micro-components. Now the requirements become much less strict:</p><ul><li><p>still independent</p></li><li><p>but the component heights can have <strong>varied distributions, far from Normal</strong></p></li><li><p>and more different variances</p></li></ul><p>Yet, their sum will&nbsp;<strong>still converge to a normal distribution</strong>.</p><h3><strong>Central Limit Theorem</strong></h3><p>This remarkable property of the normal distribution has a name: the Central Limit Theorem (CLT).</p><p>The requirements described here apply to the <strong>Lindeberg-Feller version</strong>, but <strong>other formulations exist</strong> with <strong>different conditions</strong>. You can often <strong>trade one requirement for another</strong>&#8212;or several others. The more you remove, the fancier the new ones become.</p><p>Either way, <strong>one of the versions is apparently often satisfied in nature</strong>, and that&#8217;s why we can observe the Normal distribution so frequently.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HlTM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HlTM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HlTM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HlTM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HlTM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HlTM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg" width="270" height="359.2265625" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1703,&quot;width&quot;:1280,&quot;resizeWidth&quot;:270,&quot;bytes&quot;:306096,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.justlearningdatascience.com/i/176683365?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HlTM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HlTM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HlTM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HlTM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665a2e20-969b-4f9c-b0f9-abc64583da28_1280x1703.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">(author unknown)</figcaption></figure></div><h3>P.S.</h3><p>This post was inspired by reading <em>The Art of Statistics: Learning from Data</em> by David Spiegelhalter. As I mentioned in my previous post, I&#8217;m going back to basics. I&#8217;m on page 100 of 400 and I&#8217;m fascinated. My notebook is full of ideas for future posts. We&#8217;ll see how much my upcoming newborn will let me actually write them ;)</p><div class="captioned-button-wrap" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/p/the-mystery-of-the-normal-distribution?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="CaptionedButtonToDOM"><div class="preamble"><p class="cta-caption">Thanks for reading just learning data science! This post is public so feel free to share it.</p></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/p/the-mystery-of-the-normal-distribution?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.justlearningdatascience.com/p/the-mystery-of-the-normal-distribution?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><p>&#185; Slavskii, S.A., et al. (2021). &#8220;The limits of normal approximation for adult height.&#8221; <em>European Journal of Human Genetics</em>, 29, 1082&#8211;1091. https://www.nature.com/articles/s41431-021-00836-7</p>]]></content:encoded></item><item><title><![CDATA[Back to basics]]></title><description><![CDATA[This blog is about learning data science, and today I share my progress after a long absence.]]></description><link>https://blog.justlearningdatascience.com/p/back-to-basics</link><guid isPermaLink="false">https://blog.justlearningdatascience.com/p/back-to-basics</guid><dc:creator><![CDATA[Maciej Gruszczyński]]></dc:creator><pubDate>Sun, 17 Aug 2025 21:13:45 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b257ff5b-158c-45c4-b6ce-dfadfd4aaa93_4000x5000.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The story of my long absence began when <strong>I received the results of an A/B experiment at work and wanted to determine if the difference was statistically significant. </strong></p><p>During my studies, I took courses that included statistical tests. So I started to refresh my knowledge. I realized I could perform the test, calculate the p-value, and reject the null hypothesis, but I felt overwhelmed by the feeling that I had no idea what I was doing. The number of &#8220;why&#8221; questions was devastating. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading just learning data science! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><ul><li><p>Why this test and not another?</p></li><li><p>Why am I modeling data with probability theory at all when reality is more complex? At what point am I &#8220;paying&#8221; for simplifying real data into probability theory?</p></li><li><p>Why is the p-value usually taken as 0.05? Why sometimes 0.005, etc.?</p></li><li><p>What does the p-value tell me at all? And what does it not tell me? Why are interpretations on the internet contradictory?</p></li><li><p>Why does the p-value take into account events more extreme than those observed, if I am not interested in them?</p><p></p><p>and more popping up along the way.</p></li></ul><blockquote><p>How was I supposed to make any business decision with such a limited understanding of the decision-making tool?</p></blockquote><p>The number of these questions made me suspect that these statistical tests are worthless. When I read various materials available on the internet on this subject, I had the impression that everyone was parroting what I had learned in college, rather than asking WHY questions.</p><blockquote><p>I had an ambitious dream of finding answers to all my questions and describing them in a comprehensive blog article.</p></blockquote><p>I was tempted by the bold thesis that tests are worthless. I started reading and searching. I learned more about different types of tests, assumptions, CLT, and the history of statistical tests. Unfortunately, I was constantly struck by the impression that the sources I was reading were often contradictory. Some repeated interpretative errors that others criticized. I still couldn't find the depth and certainty I was looking for. Something didn't add up; it didn't fit together coherently, eluding me. Nevertheless, I gathered a lot of fragments related to the components of statistical tests, from which I tried to assemble a coherent whole that was easy to understand, despite its extensive nature.</p><blockquote><p>The main thesis was that at no point do we &#8220;pay&#8221; for simplifying real data to probability theory.</p></blockquote><p>When I use probability in machine learning, I can check how well such modeling works on a test set. In the case of statistical tests, I have to accept the result uncritically.</p><p>Additionally, I identified several issues that further undermine credibility. The potential for arbitrary selection of the level of statistical significance. An overwhelming amount of overinterpretation of p-values and vast confusion on the internet regarding this issue. I found &#8220;<a href="https://www.tandfonline.com/doi/epdf/10.1080/00031305.2016.1154108">The ASA&#8217;s Statement on p-Values</a>&#8221; by the American Statistical Association, from which I concluded that it is practically impossible to move from the formal definition of p-values to any useful conclusions about the phenomenon under study.</p><p><em>Privately, I was waiting for my second child's birthday while caring for my first baby. I read about statistical tests on the sidewalk. I had my phone in one hand and pushed the stroller with the other. Time was at a premium, so progress was slow. After the birth, I went into survival mode, when only work and family mattered. The space for reflection disappeared. The article dragged on for several months.</em></p><p>The children became a little more independent, and I got back into the game. In search of answers to my questions, I went back to basics. While reading about CLT, I turned to more serious sources and realized I didn't even know what a random variable was. During my studies and when using ML after graduation, it was enough for me to equate a random variable with a probability distribution. It turns out that this is not the case&#8230; My confidence in my own knowledge collapsed for a moment. ChatGPT cheered me up, saying that it's not that I don't know anything, but that I'm just going deeper.</p><p>In the meantime, I asked numerous questions on Cross Validated. Surrounded by educated, serious people, I felt very incompetent. But I believed that reading on my own was not enough. I have to bounce questions and thoughts off people who have vastly more experience. </p><p>I found it challenging to understand some of the answers because they employed concepts I had never encountered before. <strong>When I read that to understand the definition of a random variable, I needed to know the concepts of sample space, measurable space, and &#963;-algebra, I was taken aback.</strong> I don't remember ever seeing these concepts on any slides during my master's studies. They focused more on covering descriptive statistics for EDA, then quickly jumped into classifiers, regression, and neural networks. The concept of a random variable emerged in the context of Bayesian Neural Networks (BNNs) or Variational Autoencoders (VARs), but treating it as a probability distribution was sufficient for my purposes.</p><p>Concerned, I asked ChatGPT <strong>what I need to read to fill this gap</strong>. I selected three books whose tables of contents encouraged me.</p><ol><li><p>The Art of Statistics (Spiegelhalter)</p></li><li><p>Mathematical Statistics and Data Analysis (Rice)</p></li><li><p>Statistical Inference (Casella &amp; Berger)</p></li></ol><p>I will certainly not get through all of them <em>(especially since my third child is on the way and another period of &#8220;time out&#8221; is coming)</em>, but I want to fill in the fundamental gaps.</p><p>And well, we'll see what happens next :)</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading just learning data science! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[A machine learning model with no input variables]]></title><description><![CDATA[...that made it difficult for me to understand a familiar concept]]></description><link>https://blog.justlearningdatascience.com/p/a-machine-learning-model-with-no</link><guid isPermaLink="false">https://blog.justlearningdatascience.com/p/a-machine-learning-model-with-no</guid><dc:creator><![CDATA[Maciej Gruszczyński]]></dc:creator><pubDate>Mon, 29 Jan 2024 22:47:24 GMT</pubDate><enclosure url="https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1456w" sizes="100vw"><img src="https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" width="6016" height="4016" data-attrs="{&quot;src&quot;:&quot;https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:4016,&quot;width&quot;:6016,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;boy wearing gray vest and pink dress shirt holding book&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="boy wearing gray vest and pink dress shirt holding book" title="boy wearing gray vest and pink dress shirt holding book" srcset="https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1485546246426-74dc88dec4d9?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwxfHx3b3d8ZW58MHx8fHwxNzA2NTY5NTgwfDA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Wikipedia is not the best resource for Data Science and Machine Learning beginners. In contrast to academia, online courses, and books, it doesn't order the topics in a way that one builds upon another. Instead, we get unordered articles with a very high entry level.</p><p>However, at the current stage of my Data Science journey, almost 4 years after graduation, I found many Wikipedia articles in-depth and rich. At the same time, I usually struggle to break its strict way of explaining things, being frequently ashamed and dissatisfied.</p><p>Today, I will tell you my short story with the Likelihood function article on Wikipedia, from confusion to the joy of understanding and new insights.</p><div><hr></div><p>When reading through a Wikipedia article about the Likelihood function, I struggled to understand the given definition and example.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QpcA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QpcA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 424w, https://substackcdn.com/image/fetch/$s_!QpcA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 848w, https://substackcdn.com/image/fetch/$s_!QpcA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 1272w, https://substackcdn.com/image/fetch/$s_!QpcA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QpcA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png" width="610" height="295.4142857142857" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/edd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:339,&quot;width&quot;:700,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QpcA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 424w, https://substackcdn.com/image/fetch/$s_!QpcA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 848w, https://substackcdn.com/image/fetch/$s_!QpcA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 1272w, https://substackcdn.com/image/fetch/$s_!QpcA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fedd74f1c-70fc-4051-8cb5-1449ca1f4ef2_700x339.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Something was missing there for me. I remember my Data Science classes, where we covered the Likelihood function when training neural networks for classification with negative log-likelihood as a loss function or inside a Naive Bayes model. Always in the context of some datasets like ancient diabetes or iris. I used to deal with input variables (like sepal width or length) to classify data. However, I was unable to match that experience with the formulas I saw in the Wikipedia article.</p><p>So I immediately went through the coin flip example (they say an example is worth a thousand words&#8230;), but it still seemed very unfamiliar to me.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zr6A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zr6A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 424w, https://substackcdn.com/image/fetch/$s_!zr6A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 848w, https://substackcdn.com/image/fetch/$s_!zr6A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 1272w, https://substackcdn.com/image/fetch/$s_!zr6A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zr6A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png" width="586" height="792.7742857142857" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:947,&quot;width&quot;:700,&quot;resizeWidth&quot;:586,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zr6A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 424w, https://substackcdn.com/image/fetch/$s_!zr6A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 848w, https://substackcdn.com/image/fetch/$s_!zr6A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 1272w, https://substackcdn.com/image/fetch/$s_!zr6A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4f0a0b6-3199-4b58-8912-7b20c01ed943_700x947.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>After reading it several times and taking a moment to reflect, the fog finally cleared, and I understood what was going on. </p><p>In both the example and the provided definition, there are no input variables that I used to work with. And there&#8217;s absolutely nothing wrong with that! It&#8217;s a perfectly fine situation, but it was very awkward for me to get at first glance.</p><p>The thing is that the given model is a probability distribution. Or I should say, a random variable described by a probability distribution. It indeed has no inputs but can produce outputs.</p><p><strong>From the Machine Learning perspective, the model in the example could take any form. Linear function, deep neural network, decision tree, probabilistic graphical model, basically anything that produces outputs! Not only a random variable.</strong></p><p><strong>The random variable part of the definition wasn&#8217;t clear to me because I incorrectly assumed it was somehow connected to the idea of the Likelihood itself, while it was actually not. </strong></p><div><hr></div><p>To better explain what I mean, I need to go step-by-step through the definition.</p><p>Let&#8217;s unravel two provided formulas. It starts with the &#8220;probability density or mass function&#8221; according to the article.</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;x \\rightarrow f(\\theta,x)&quot;,&quot;id&quot;:&quot;AOXSVLZNBY&quot;}" data-component-name="LatexBlockToDOM"></div><p>We can read this right arrow sign as &#8220;function <em>f</em> maps <em>x</em> values to outputs&#8220; and its main goal is to introduce an assumption that <em>&#952;</em> is a fixed(constant) parameter while x remains adjustable<em>.</em></p><p>Since the article says we have a random variable X, then <em>x</em> are all possible values of this random variable, and <em>f</em> is its probability density function.<br>The most trivial probability density function (PDF) that comes to my mind is the Normal Distribution. It takes exactly two parameters: mean (&#956;) and standard deviation (&#963;). Thus, the parameters set <em>&#952;</em> simply consists of <em>&#956;</em> and <em>&#963;</em>.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!91Me!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!91Me!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 424w, https://substackcdn.com/image/fetch/$s_!91Me!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 848w, https://substackcdn.com/image/fetch/$s_!91Me!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 1272w, https://substackcdn.com/image/fetch/$s_!91Me!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!91Me!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png" width="302" height="131.18950437317784" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:298,&quot;width&quot;:686,&quot;resizeWidth&quot;:302,&quot;bytes&quot;:36684,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!91Me!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 424w, https://substackcdn.com/image/fetch/$s_!91Me!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 848w, https://substackcdn.com/image/fetch/$s_!91Me!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 1272w, https://substackcdn.com/image/fetch/$s_!91Me!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05598820-5b91-4b3d-ac8f-4ddf343eb69e_686x298.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The chart shows an example of the <em>f</em> function - the Normal Distribution probability density function.</p><p>We use this function to say what the probability of sampling a value from a particular range from the distribution is.</p><p>In the second formula, we get to the point and see the definition of the Likelihood function:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\theta \\rightarrow f(\\theta,x)&quot;,&quot;id&quot;:&quot;XKFZKKYOOK&quot;}" data-component-name="LatexBlockToDOM"></div><p>which reads &#8220;function <em>f</em> maps parameters set <em>&#952;</em> to outputs&#8221; and introduces an absolutely crucial assumption that from now on, <em>x</em> is a fixed (constant) parameter and <em>&#952; </em>is adjustable.</p><p>At this stage, the article says, &#8220;x is a realization of the random variable X&#8220;, which basically means x is a value sampled from our distribution. Let&#8217;s say we really sampled a value using an online tool or a Python script, and we got <em>1.52</em>. We treat <em>x</em> as a fixed, ground-truth value here and can&#8217;t change it. <em>&#952;</em> is what we can freely change in this formula.</p><p>Let&#8217;s select some specific <em>&#952;, </em>then<em>. </em>For our Normal Distribution, I pick mean &#956; = 1 and standard deviation &#963; = 0.5, whatever. For these parameter values, the probability density of sampling <em>1.52</em> is equal to <em>0.46</em>. <em>(Please notice I use the term probability density because the probability itself is an area under the curve for continuous distributions.)</em> </p><p>And that&#8217;s it. This is how the likelihood works. Given an observed value, we select a parameters set, and the likelihood says what is the probability of sampling this value.</p><p>The most useful thing I&#8217;m aware of we can use this for is to find what distribution parameters <em>&#952; </em>are most likely to produce the sampled value. By trying many different mean and standard deviation values, we will find the ones that are most likely to generate our sampled value <em>1.52. </em>This task is called <em>maximum likelihood estimation</em> (MLE).</p><p>Noooooow, the best thing is, our function <em>f</em> from the definition can be a Probability Density Function or Probability Mass Function (for discrete distributions) describing not only a random variable. From the machine learning perspective, it can describe any model we can think of. So let&#8217;s make things more sexy now and pick a Large Language Model (LLM) as an example.</p><p><em>X</em> now becomes our LLM. A small <em>x</em> from the formula becomes a data sample the LLM works with. And since LLMs are trained to predict the next token in a sequence, our <em>x</em> becomes a tuple (&#8220;Data Science is&#8221;, &#8221;fun&#8220;), where the first sequence is input to the model, and the second word is an expected output. The full sequence &#8220;Data Science is fun&#8220; is our ground-truth observation scraped from the internet or maybe found in a book.</p><p>LLMs are neural networks with parameters set <em>&#952; </em>consisting of weights and biases. When feeding the model with our input &#8220;Data Science is&#8221; we will get probabilities of all the possible completion tokens in the model&#8217;s dictionary. Let&#8217;s say we work with some state-of-the-art model found on the web, and it assigns a probability of 0.8 to the &#8220;fun&#8220; token. Good.</p><p>We can change the weights and the biases in some way, pass the input sequence &#8220;Data Science is&#8221; through the model again, and see if the probability of sampling &#8220;fun&#8220; has increased. At the highest level of abstraction, this is how language models learn. Yeah.</p><div><hr></div><p>From now on, I&#8217;m able to split the model being a random variable from the concept of the likelihood itself to understand both the Wikipedia&#8217;s definition and the example.</p><p>And I could've stopped here here, but my mind ran further. I swiftly categorized the potential models into two groups. Deterministic and non-deterministic ones. Deterministic models usually have some kind of input variables because when they don&#8217;t, they become constant functions, which is not very useful&#8230;</p><p>Non-deterministic models introduce some randomness. They can take no inputs, like the coin flip in Wikipedia&#8217;s example. However, we can mix them with anything from basic math operations on input variables to large operation graph modules. For example, Variational Auto Encoders consist of a deterministic encoder, decoder, and some normal distribution sampling in the middle.</p><div><hr></div><p>That&#8217;s it. <strong>I will have more respect for the coin flips now and be aware that even the ordinary coin is a serious machine-learning model&#8230; </strong></p><p>We&#8217;ll see if this new point of view will help me to get through more articles quickly. I hope it will for you. Cheers!</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for just learning data science with me! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[I’ve never truly understood the Softmax function]]></title><description><![CDATA[Did you know that this most popular and confusing softmax formula can be broken down into two simple operations?]]></description><link>https://blog.justlearningdatascience.com/p/ive-never-truly-understood-the-softmax</link><guid isPermaLink="false">https://blog.justlearningdatascience.com/p/ive-never-truly-understood-the-softmax</guid><dc:creator><![CDATA[Maciej Gruszczyński]]></dc:creator><pubDate>Tue, 23 Jan 2024 22:18:29 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!7FnH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7FnH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7FnH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7FnH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7FnH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7FnH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7FnH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg" width="1400" height="1750" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1750,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!7FnH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7FnH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7FnH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7FnH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74903bb7-6b3d-494a-96ca-cf089b6394b4_1400x1750.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@ianstauffer?utm_source=medium&amp;utm_medium=referral">Ian Stauffer</a> on <a href="https://unsplash.com/?utm_source=medium&amp;utm_medium=referral">Unsplash</a></figcaption></figure></div><p>Did you know that this most popular and confusing softmax formula can be broken down into two simple operations?</p><p>Instead of this</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Qt81!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qt81!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 424w, https://substackcdn.com/image/fetch/$s_!Qt81!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 848w, https://substackcdn.com/image/fetch/$s_!Qt81!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 1272w, https://substackcdn.com/image/fetch/$s_!Qt81!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qt81!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png" width="248" height="57.99004975124378" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:94,&quot;width&quot;:402,&quot;resizeWidth&quot;:248,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Qt81!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 424w, https://substackcdn.com/image/fetch/$s_!Qt81!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 848w, https://substackcdn.com/image/fetch/$s_!Qt81!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 1272w, https://substackcdn.com/image/fetch/$s_!Qt81!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33e560dd-8931-48d9-ad2c-c43d4dbcb364_402x94.png 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>I prefer this</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_arG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_arG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 424w, https://substackcdn.com/image/fetch/$s_!_arG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 848w, https://substackcdn.com/image/fetch/$s_!_arG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 1272w, https://substackcdn.com/image/fetch/$s_!_arG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_arG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png" width="478" height="81.23645320197045" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:138,&quot;width&quot;:812,&quot;resizeWidth&quot;:478,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!_arG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 424w, https://substackcdn.com/image/fetch/$s_!_arG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 848w, https://substackcdn.com/image/fetch/$s_!_arG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 1272w, https://substackcdn.com/image/fetch/$s_!_arG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7344a1e-0c1a-4d28-a471-da51bc2f6e8c_812x138.png 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>In the first step (1), we convert all input values into <strong>positive </strong>ones using the <strong>exponential function</strong> property.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!l3ue!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!l3ue!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 424w, https://substackcdn.com/image/fetch/$s_!l3ue!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 848w, https://substackcdn.com/image/fetch/$s_!l3ue!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 1272w, https://substackcdn.com/image/fetch/$s_!l3ue!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!l3ue!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png" width="198" height="252" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33bc218e-e807-49dc-babd-b9c17945de66_198x252.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:252,&quot;width&quot;:198,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!l3ue!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 424w, https://substackcdn.com/image/fetch/$s_!l3ue!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 848w, https://substackcdn.com/image/fetch/$s_!l3ue!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 1272w, https://substackcdn.com/image/fetch/$s_!l3ue!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33bc218e-e807-49dc-babd-b9c17945de66_198x252.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In the second step (2), we <strong>normalize</strong> our positive values by their sum to fit in the <strong>range [0, 1] </strong>and make them <strong>add to 1</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5egT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png" data-component-name="Image2ToDOM"><div class="image2-inset image2-full-screen"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5egT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 424w, https://substackcdn.com/image/fetch/$s_!5egT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 848w, https://substackcdn.com/image/fetch/$s_!5egT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 1272w, https://substackcdn.com/image/fetch/$s_!5egT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5egT!,w_5760,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;full&quot;,&quot;height&quot;:466,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-fullscreen" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!5egT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 424w, https://substackcdn.com/image/fetch/$s_!5egT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 848w, https://substackcdn.com/image/fetch/$s_!5egT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 1272w, https://substackcdn.com/image/fetch/$s_!5egT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09d2aa2c-62dc-4b19-b087-689df4ed9d16_15000x4800.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Clearer now? It was much more clear for me this way!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Maciej&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[3 easy ways to stay up-to-date in Data Science]]></title><description><![CDATA[&#8230; that won&#8217;t take you hours]]></description><link>https://blog.justlearningdatascience.com/p/3-easy-ways-to-stay-up-to-date-in</link><guid isPermaLink="false">https://blog.justlearningdatascience.com/p/3-easy-ways-to-stay-up-to-date-in</guid><dc:creator><![CDATA[Maciej Gruszczyński]]></dc:creator><pubDate>Tue, 23 Jan 2024 22:14:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-sQg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1><strong>1. Follow Twitter Accounts connected with Data Science</strong></h1><p>Many of the most skilled people publish both cutting-edge and basics every day, and you have free access to their knowledge and results.</p><p>Wouldn&#8217;t it boost your learning to work with the best Google, DeepMind, PyTorch, or OpenAI researchers, engineers, PhDs, CTOs, and CEOs? Get closer this way.</p><p>You don&#8217;t even need to look for them &#8212; just follow a few of your favorite libraries and Twitter recommendations will lead you further.</p><p>LinkedIn will do the job too.</p><p>Below is the first page of the 62 accounts I follow:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-sQg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-sQg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 424w, https://substackcdn.com/image/fetch/$s_!-sQg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 848w, https://substackcdn.com/image/fetch/$s_!-sQg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 1272w, https://substackcdn.com/image/fetch/$s_!-sQg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-sQg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png" width="597" height="859" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:859,&quot;width&quot;:597,&quot;resizeWidth&quot;:597,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!-sQg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 424w, https://substackcdn.com/image/fetch/$s_!-sQg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 848w, https://substackcdn.com/image/fetch/$s_!-sQg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 1272w, https://substackcdn.com/image/fetch/$s_!-sQg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8bc11ab-dbb0-4ef1-9764-9c3f18cee993_597x859.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Social media addiction is your friend in this case.</p><h1><strong>2. Subscribe to your favorite libraries&#8217; release notes</strong></h1><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2vrV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2vrV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 424w, https://substackcdn.com/image/fetch/$s_!2vrV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 848w, https://substackcdn.com/image/fetch/$s_!2vrV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 1272w, https://substackcdn.com/image/fetch/$s_!2vrV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2vrV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png" width="1155" height="47" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:47,&quot;width&quot;:1155,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!2vrV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 424w, https://substackcdn.com/image/fetch/$s_!2vrV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 848w, https://substackcdn.com/image/fetch/$s_!2vrV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 1272w, https://substackcdn.com/image/fetch/$s_!2vrV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd4d619c-8351-47e4-b857-d8e1413a66f1_1155x47.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>This kind of message in my mailbox is a golden nugget. I can&#8217;t find words to express how cool it is.</p><p>The amount of hard work compressed in this message is enormous every time. It&#8217;s like your own army of researchers, developers, and a copywriter who research their ideas, publish papers, implement them, and document precisely.</p><p>All you need to do is open a finely crafted tl;dr message with a summary of SOTA methods selection and ready-to-use implementations.</p><p>Here&#8217;s how to subscribe to GitHub release notes:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Fw3Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 424w, https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 848w, https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 1272w, https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png" width="337" height="395" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:395,&quot;width&quot;:337,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 424w, https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 848w, https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 1272w, https://substackcdn.com/image/fetch/$s_!Fw3Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc5306a-bc1c-4637-bf4a-fa3265c2cc08_337x395.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And here&#8217;s a link to the transformers releases: <a href="https://github.com/huggingface/transformers/releases">https://github.com/huggingface/transformers/releases</a></p><h1><strong>3. Medium.com subscription</strong></h1><p>This one is the best at the beginning of your adventure. It helps you gain missing basic knowledge very quickly.</p><p>Medium&#8217;s algorithm selects the best articles and messages for you every day. After a few months of reading one article per day, you will be a different person. Make it a routine.</p><p>This is what it looks like:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oBWd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oBWd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 424w, https://substackcdn.com/image/fetch/$s_!oBWd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 848w, https://substackcdn.com/image/fetch/$s_!oBWd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 1272w, https://substackcdn.com/image/fetch/$s_!oBWd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oBWd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png" width="1370" height="1376" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1376,&quot;width&quot;:1370,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!oBWd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 424w, https://substackcdn.com/image/fetch/$s_!oBWd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 848w, https://substackcdn.com/image/fetch/$s_!oBWd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 1272w, https://substackcdn.com/image/fetch/$s_!oBWd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff859b41d-5230-4dfd-b62e-f221cddb8862_1370x1376.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That&#8217;s it. Share your ways in the comments so we can all become better at what we are passionate about.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Maciej&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Hello]]></title><description><![CDATA[Why this, why now]]></description><link>https://blog.justlearningdatascience.com/p/hello</link><guid isPermaLink="false">https://blog.justlearningdatascience.com/p/hello</guid><dc:creator><![CDATA[Maciej Gruszczyński]]></dc:creator><pubDate>Tue, 23 Jan 2024 22:09:41 GMT</pubDate><enclosure url="https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://blog.justlearningdatascience.com/subscribe?"><span>Subscribe now</span></a></p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1456w" sizes="100vw"><img src="https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080" width="3024" height="4032" data-attrs="{&quot;src&quot;:&quot;https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:4032,&quot;width&quot;:3024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;person waving reflecting shadow on teal wall paint&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="person waving reflecting shadow on teal wall paint" title="person waving reflecting shadow on teal wall paint" srcset="https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 424w, https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 848w, https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1272w, https://images.unsplash.com/photo-1531592937781-344ad608fabf?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3wzMDAzMzh8MHwxfHNlYXJjaHwzfHxoZWxsb3xlbnwwfHx8fDE3MDY0NzY4MTB8MA&amp;ixlib=rb-4.0.3&amp;q=80&amp;w=1080 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Photo by <a href="https://unsplash.com/@yoyoqua">Ioana Cristiana</a> on <a href="https://unsplash.com">Unsplash</a></figcaption></figure></div><h3>Why this, why now </h3><p>Hi, my name is Maciek. I dedicate this space mostly to my Data Science hobby/job. It started in 2019 when I decided to enter a Data Science master's degree at the Wroclaw University of Science and Technology in Poland. I was a little bit afraid to drop the web app development and switch to a more scientific area, but it was worth it.</p><p>I successfully graduated, having a lot of fun from crazy projects, new friends, and genuine satisfaction from completing the most demanding specialization I could choose.</p><p>My first serious Data Science position was at Surfer. A small startup that grew rapidly then. I faced real business problems and had a chance to challenge my graduate skills. Combined with great company culture, people, and expertise, we keep successfully shipping broadly used ML solutions to this day.</p><p>In the meantime, I keep becoming more aware of stuff I don&#8217;t know, or I covered carelessly during my studies. I want to fill all the gaps and go beyond. Writing short articles seems to be a very helpful tool.</p><p>I have some dreams but limited time in life. As a happy husband and father living outside any greater city, I can&#8217;t afford a Ph.D. degree for now. However, I can sit tight in my room in the evening, read some text about a topic I don&#8217;t profoundly understand yet, and share some precious insights with the public.</p><h3>What kind of community am I looking to build here</h3><p>I believe there are Data Scientists who will find something for themselves in the dump of my experiences and aha moments. I include a pinch of emotional load in all my texts to hopefully make someone feel &#8220;Oh, I was thinking the same. I understand this guy. I had the same doubts about it. I wasn&#8217;t alone then!&#8220;</p><p>At the same time, I&#8217;d love to welcome constructive criticism from experts. It&#8217;s necessary for me (us?) to make sure I keep moving toward the truth.</p><p>Maybe you are one of those people?</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://blog.justlearningdatascience.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Maciej&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-button-wrap" data-attrs="{&quot;url&quot;:&quot;https://substack.com/refer/maciejgruszczyski?utm_source=substack&amp;utm_context=post&amp;utm_content=140979621&amp;utm_campaign=writer_referral_button&quot;,&quot;text&quot;:&quot;Start a Substack&quot;}" data-component-name="CaptionedButtonToDOM"><div class="preamble"><p class="cta-caption">Start writing today. Use the button below to create your Substack and connect your publication with Maciej&#8217;s Substack</p></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://substack.com/refer/maciejgruszczyski?utm_source=substack&amp;utm_context=post&amp;utm_content=140979621&amp;utm_campaign=writer_referral_button&quot;,&quot;text&quot;:&quot;Start a Substack&quot;,&quot;hasDynamicSubstitutions&quot;:false}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://substack.com/refer/maciejgruszczyski?utm_source=substack&amp;utm_context=post&amp;utm_content=140979621&amp;utm_campaign=writer_referral_button"><span>Start a Substack</span></a></p></div>]]></content:encoded></item></channel></rss>