luigi
https://github.com/spotify/luigi
Python
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more. It also comes with built-in Hadoop support.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
5 Subscribers
Help out
- Issues
  - ExternalProgramTask: stream_for_searching_tracking_url is not flushed, causing program output to be lost.
  - A task fails if the subsequent task is already completed
  - Import config parser into TOML parser
  - No way to control working directory on remote host when running commands via luigi.contrib.ssh.RemoteContext
  - WrapperTask triggers many unnecessary complete() calls
  - Signature of the bulk_complete method is unclear
  - Repetitive complete() calls because of the check_unfulfilled_deps setting
  - Populate python_requires setup.py field
  - [History] Same task not being appended
  - Fails complete pipeline when encounters `RuntimeError`
- Docs
  - Python not yet supported