llm.c
https://github.com/karpathy/llm.c
Cuda
LLM training in simple, raw C/CUDA
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Cuda not yet supported
2 Subscribers
Help out
- Issues
  - Add `repkv_backward_kernel2` and `repkv_kernel2` (llama3 branch)
  - examples for popular models
  - Why CUDA when we can SYCL
  - [build failed]Compiler encountered an internal error
  - ~2x perf improvement beating PyTorch (cublasLt, TF32, CUDA graphs, kernel fusion, etc…)
  - Support older CUDA GPU hardware by default
  - Should we use 'T' instead of 'BT' here? I'm not sure either
  - Removed a few unnecessary heap allocations
  - Second matmul for fully custom attention
  - init from scratch
- Docs
  - Cuda not yet supported