I’ve never truly understood the Softmax function
Did you know that this most popular and confusing softmax formula can be broken down into two simple operations?
Instead of this
I prefer this
In the first step (1), we convert all input values into positive ones using the exponential function property.
In the second step (2), we normalize our positive values by their sum to fit in the range [0, 1] and make them add to 1.
Clearer now? It was much more clear for me this way!