Softmax Classifier

AKA Multinomial Logistic Regression, Cross-entropy loss

Want to interpret raw classifier scores as probabilities

Softmax Function

$$s = f(x_i; W)$$

$$P(Y = k \mid X = x_i) = \frac{e^{s_k}}{\sum_j e^{s_j}}$$

$$L_i = -\log P(Y = y_i \mid X = x_i)$$
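A minimal sketch of the softmax function and the per-example loss in plain Python. The score values and the helper names `softmax` and `softmax_loss` are illustrative assumptions, not part of the notes. Subtracting the maximum score before exponentiating is a common numerical-stability trick that does not change the resulting probabilities.

```python
import math


def softmax(scores):
    """Turn raw class scores s into probabilities e^{s_k} / sum_j e^{s_j}."""
    # Shifting by the max score leaves the ratios unchanged but avoids overflow in exp.
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_loss(scores, correct_class):
    """Per-example loss L_i = -log P(Y = y_i | X = x_i)."""
    probs = softmax(scores)
    return -math.log(probs[correct_class])


# Hypothetical scores for three classes; class 0 is assumed to be the correct label.
scores = [3.2, 5.1, -1.7]
probs = softmax(scores)
loss = softmax_loss(scores, correct_class=0)

print("probabilities:", [round(p, 2) for p in probs])
print("loss:", round(loss, 2))
```

The loss is small when the correct class gets probability near 1 and grows without bound as that probability approaches 0.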

Maximum Likelihood Estimation: Choose weights to maximize the likelihood of the observed data
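A short derivation sketch of why maximum likelihood leads to this loss, assuming the $N$ training examples are independent ($N$ is a symbol introduced here) and writing $L_i = -\log P(Y = y_i \mid X = x_i)$ for the per-example loss. Taking the log turns the product into a sum, and flipping the sign turns maximization into minimization:

$$
\begin{aligned}
W^{*} &= \arg\max_W \prod_{i=1}^{N} P(Y = y_i \mid X = x_i) \\
      &= \arg\max_W \sum_{i=1}^{N} \log P(Y = y_i \mid X = x_i) \\
      &= \arg\min_W \sum_{i=1}^{N} L_i
\end{aligned}
$$

So minimizing the total cross-entropy loss is the same as maximizing the likelihood of the observed labels.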
