Hyper-v
The spelled-out intro to language modeling: building makemore.
The spelled-out intro to language modeling: building makemore. Part 2: MLP
#spelledout #intro #language #modeling #building #makemore
“Andrej Karpathy”
We implement a multilayer perceptron (MLP) character-level language model. In this video we also introduce many basics of machine learning (e.g. model training, learning rate tuning, hyperparameters, evaluation, train/dev/test splits, under/overfitting, etc.).
Links:
– makemore on github:…
source
To see the full content, share this page by clicking one of the buttons below |