dask
https://github.com/dask/dask
Python
Parallel computing with task scheduling
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
34 Subscribers
Help out
- Issues
  - Poor scheduling with `flox`, leading to high memory usage and eventual failure
  - Incorrect shape computation with getitem and structured numpy array
  - Preserving divisions when reading/loading dataframes with structs containing multiple fields
  - `vindex` as outer indexer: memory and time performance
  - Hash join transfer with error cannot pickle '_contextvars.ContextVar' object
  - ``new_dd_object``'s array logic always assumes the metadata is ``numpy``
  - Ensure that repack collections only return tuple if necessary
  - dask.bag.Bag.to_dataframe behavior change in 2024.3.0 - setting dtype to string rather than object by default
  - Array API in Dask
  - Feedback - DataFrame query planning
- Docs
  - Python not yet supported