dask
https://github.com/dask/dask
Python
Parallel computing with task scheduling
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported34 Subscribers
View all SubscribersAdd a CodeTriage badge to dask
Help out
- Issues
- Read_parquet is slower than expected with S3
- Raise NotImplementedError on seriesgroupby.cov()
- GroupBy.cov() when dask SeriesGroupBy object returns unexpected response
- Dataframe returning NaN for Rolling Window when window-range > window frame
- Retries on read/to csv
- Dataframe .loc slicing can ignore partitions
- Automatic retries beyond `read_parquet` / `to_parquet`
- Spark compatibility with pandas extension dtypes
- BUG/DOC: threads_per_worker does not limit all thread types
- Reading parquet hangs when used parquet folder as input but works when I give list of files
- Docs
- Python not yet supported