distributed
https://github.com/dask/distributed
Python
A distributed task scheduler for Dask
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
38 Subscribers
Help out
- Issues
  - AttributeError when trying to perform a distributed read_parquet (maybe serialization issues)
  - AttributeError: 'ZarrStore' object has no attribute '_append_dim'
  - Data loss with `DataFrame.set_index(.., shuffle="disk")`
  - Simplifying serialization of status messages
  - Huge memory leak and processes do not restart automatically
  - Cluster lifecycle management
  - WARNING - Memory use is high but worker has no data to store to disk
  - Semaphore gets released too often
  - Persistent client connection
  - KeyError: pickle-protocol when running df.npartitions
- Docs
  - Python not yet supported