LLM from Scratch Demo

124M parameter LLM running on CPU

10 500
0.1 2