nanogpt
https://github.com/karpathy/nanogpt
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
not yet supported
1 Subscriber
Help out
- Issues
- Question about batch_size and gradient_accumulation_steps
- About alignment ?
- i think the prefix found in checkpoints is coming from how the model is structured ...
- Running multi GPU inference
- How to inference nanoGPT ckpt.pt with c?
- add a barrier after eval in DDP
- README.md has "<3" characters in the install section
- Fix Readme <3
- Finetuning on Downstream Tasks: Eval zero-shot perplexities on standard evals (e.g. LAMBADA? HELM? etc.)
- Second Dropout layer not present in nn.MultiheadAttention implementation but Karpathy has it in his?
- Docs
- not yet supported