Blog

1. On word embeddings - Part 2: Approximating the Softmax
2. On word embeddings - Part 1
3. An overview of gradient descent optimization algorithms