luigi
https://github.com/spotify/luigi
Python
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported5 Subscribers
Add a CodeTriage badge to luigi
Help out
- Issues
- tasks left pending because dependency run by other worker
- When initially entering data without object id
- BigQueryLoadTask doesn't fully support UTF-8
- Return non-zero exit code on task failure
- Luigi registers twice classes after re-executing definition of class, and this causes problems when yielding tasks
- Gzip LocalTarget behavior on Windows
- CPU & Memory consumption seems very high when running the local scheduler
- Add a UTC date to the marker document and venv/ to the .gitignore
- Fix string and byte error in contrib/hadoop.py
- Why query in ImportIntoHiveTableTask use time as default partition setting?
- Docs
- Python not yet supported