dask
https://github.com/dask/dask
Python
Parallel computing with task scheduling
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported34 Subscribers
View all SubscribersAdd a CodeTriage badge to dask
Help out
- Issues
- Filesystem transactions
- Priority propagation
- Design: nested _meta
- Adding multiple columns inside the same call doesn't work properly, while it does on Pandas
- Applying ufuncs to dask scalars do not return dask scalars.
- `read_parquet` graph transfer grows more than lienearly with number of partitions
- BUG: Inline makes N^2 calls to subs.
- Default npartitions for parquet with fastparquet and parquet with pyarrow are incorrect
- ValueError of DataFrame.merge
- Docs: Exactly which numpy slicing features does an array-like need to support to be used with Dask.from_array
- Docs
- Python not yet supported