Viet-Anh on Software Logo

What is: Softmax?

Year2000
Data SourceCC BY-SA - https://paperswithcode.com

The Softmax output function transforms a previous layer's output into a vector of probabilities. It is commonly used for multiclass classification. Given an input vector xx and a weighting vector ww we have:

P(y=jx)=exTwjk=1KexTwkP(y=j \mid{x}) = \frac{e^{x^{T}w_{j}}}{\sum^{K}_{k=1}e^{x^{T}wk}}