llm.c
https://github.com/karpathy/llm.c
Cuda
LLM training in simple, raw C/CUDA
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Cuda not yet supported
2 Subscribers
Help out
- Issues
  - Updated a few variables to use exact width integer types
  - Is there a final bin file that can input some question and output some answer?
  - test_gpt2.cu correctness bounds tune per-parameter
  - inf loss at big batch
  - clang-tidy
  - Improve tanh derivative in backward gelu
  - Possible bugs in the data loading functions
  - When will llama and other frameworks be supported?
  - Modified version of ademeure's fused gelu_forward kernel
  - How to do Inference on the trained weight of GPT 2 model after finishing the training on CPU using train_gpt2.py and train_gpt2 ?
- Docs
  - Cuda not yet supported