marker
https://github.com/datalab-to/marker
Python
Convert PDF to markdown + JSON quickly with high accuracy
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported0 Subscribers
Add a CodeTriage badge to marker
Help out
- Issues
- Bug report — Schema extraction mismatch (Playground vs API)
- CPU only makes machine unresponsive
- How to add option to marker page range of pdf
- AttributeError: 'PdfDocument' object has no attribute 'name'
- Progress bar counts non-pdf files
- Few Concerns on the Markdown Generation - Overlapping image/table/text boxes and Different output while using Surya
- how to convert latex to text, for GT generation?
- reader order conver to onnx
- Need docker deployment with Fast API enabled option
- Unable to run.
- Docs
- Python not yet supported