marker
https://github.com/datalab-to/marker
Python
Convert PDF to markdown + JSON quickly with high accuracy
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported0 Subscribers
Add a CodeTriage badge to marker
Help out
- Issues
- For unraid ?
- Table in wrong position if there is more than one table in the same page
- how to load models from local directory
- Batch processing python
- Perpendicular headlines in tables fail
- Line breaks within cells are recognized as multiple lines, resulting in incomplete data
- Incorrect output for pdf forms
- Consider using pre-commit to autoformat and lint code
- Recommended preprocessors for scans that need dewarping
- How to specify langauges for each pdf seperately in new version?
- Docs
- Python not yet supported