marker
https://github.com/datalab-to/marker
Python
Convert PDF to markdown + JSON quickly with high accuracy
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
0 Subscribers
Help out
- Issues
  - Error when running marker in docker-compose
  - WIP feat: accept binary PDF instead of just path to PDF
  - Gemini API exhausted, need some pause mechanism or something
  - More options for `marker_chunk_convert`
  - I encountered a ModuleNotFoundError: No module named 'marker.settings' while trying to use marker_gui, Could you help me resolve this?
  - 'gbk' codec can't decode byte 0xb3 in position 1470: illegal multibyte sequence
  - Strange characters in OCR of table numbers
  - Marker does not reencode images when --OpenAIService_openai_image_format is specified
  - [BUG: Breaking] Marker cannot process files with signature lines
  - Add strikethrough support
- Docs
  - Python not yet supported