modin
https://github.com/modin-project/modin
Python
Modin: Scale your Pandas workflows by changing a single line of code
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported5 Subscribers
Add a CodeTriage badge to modin
Help out
- Issues
- `BenchmarkMode` does not materialize partitions after rebalancing
- PERF: avoid reorder_labels in take_2d_label_or_positional
- PERF: it's possible to build range index in `read_parquet` using metadata of parquet files
- Modin antipattern of `__getitem__` usage
- Better Temp file management
- PERF: `mul` operator does not partition numpy arrays
- CI, BUG: modin-spreadsheets version needs to be bumped
- Docs: Getting started redownloads dataset instead of reading downloaded data set
- PERF: Default to pandas in worker process instead of main one
- PERF: Always use all available partitions when chunking data with min partition size 1, e.g. in rebalance_partitions
- Docs
- Python not yet supported