cleanlab
https://github.com/cleanlab/cleanlab
Python
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported0 Subscribers
Add a CodeTriage badge to cleanlab
Help out
- Issues
- add CI test that runs the main datalab code without any optional dependencies installed
- replace Datalab load/save from pickle to more secure format
- Add FAQ: how do I run Datalab on an unlabeled dataset?
- consider cleanvision's near duplicates too in Datalab near duplicates analysis
- table of issue types should distinguish dataset-level issues vs datapoint-level issues
- Update to latest macOS version in CI workflow
- Follow-Up: Revert macOS CI Environment to Latest Version Once Python Compatibility Is Resolved
- Perf object detection
- Fix: Sphinx doctest internal/task.py
- Doctests are failing for some functions
- Docs
- Python not yet supported