luigi
https://github.com/spotify/luigi
Python
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more, and it comes with built-in Hadoop support.
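To make "dependency resolution" concrete, here is a minimal sketch of the general idea using only the Python standard library. It is not Luigi's API or implementation. The task names and the `resolve_order` helper are hypothetical, chosen purely for illustration. It assumes Python 3.9 or later for `graphlib`.

```python
from graphlib import TopologicalSorter

# Hypothetical task names (not from Luigi); each key maps to the set of
# tasks that must finish before it can run.
dependencies = {
    "report": {"aggregate"},
    "aggregate": {"extract_a", "extract_b"},
    "extract_a": set(),
    "extract_b": set(),
}


def resolve_order(deps):
    """Return one valid run order in which every task follows its dependencies."""
    return list(TopologicalSorter(deps).static_order())


order = resolve_order(dependencies)
print(order)
```

The two extract tasks have no mutual dependency, so their relative order may vary, but both always precede `aggregate`, which always precedes `report`.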
5 Subscribers
Help out
- Issues
  - Time measurement with tqdm not working while yielding dynamic dependencies
  - Fix blocking worker queue when scheduling in parallel
  - Exceptions in output()/complete() emit DEPENDENCY_MISSING events
  - Pass token_credential to BlockBlobService
  - update hive.py
  - Incorrect return type in on_failure() causes batch emails to silently fail
  - ExternalProgramTask: stream_for_searching_tracking_url is not flushed, causing program output to be lost.
  - A task fails if the subsequent task is already completed
  - Import config parser into TOML parser
  - No way to control working directory on remote host when running commands via luigi.contrib.ssh.RemoteContext