kedro
https://github.com/kedro-org/kedro
Python
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported2 Subscribers
Add a CodeTriage badge to kedro
Help out
- Issues
- Make Kedro CLI utilities functions public
- Implement prototype test setup for benchmarking AI coding assistants on Kedro tasks
- Rich logging integration mishandles Kedro node brackets and markup escaping
- Phase 2 user testing for Kedro Builder (iterate on key feedback + gather early usage signals)
- [DataCatalog2.0]: `catalog.to_config()` outputs empty version for datasets with `versioned=True` set via `config.yml`
- Monitor response times to community issues
- `logging.yaml` Is Not Automatically Detected With Alternative File Extension
- Collecting Real User Feedback on LLM Prompts
- Better manage nodes dependencies when nodes do not share a dataset
- Pipeline/node validation is not catching discrepancy in node output.
- Docs
- Python not yet supported