luigi
https://github.com/spotify/luigi
Python
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
5 Subscribers
Help out
- Issues
  - Luigi registers twice classes after re-executing definition of class, and this causes problems when yielding tasks
  - Gzip LocalTarget behavior on Windows
  - CPU & Memory consumption seems very high when running the local scheduler
  - Add a UTC date to the marker document and venv/ to the .gitignore
  - Fix string and byte error in contrib/hadoop.py
  - Why query in ImportIntoHiveTableTask use time as default partition setting?
  - Adding Oracle Target to the contrib package
  - Notes on luigi.Task input() behavior
  - AttributeError: 'LocalTarget' object has no attribute 'is_tmp'
  - Adding configurable BigQuery ProjectID on contrib jobs
- Docs
  - Python not yet supported