marker
https://github.com/datalab-to/marker
Python
Convert PDF to markdown + JSON quickly with high accuracy
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
Python not yet supported
0 Subscribers
Help out
- Issues
  - How can i force the library to OCR the detected images?
  - Images Missing in Markdown Output When Using `marker` Command for Multiple PDFs
  - feat: update document filepath type
  - TypeError: unsupported operand type(s) for |: '_GenericAlias' and 'NoneType'
  - 'PdfConverter' object has no attribute 'artifact_dict'
  - Maker with mol
  - How to get text without html tag from RecognitionPredictor?
  - feat: Update Gemini model references and add rate limiting
  - Dependency Conflict: `aiohttp` and `rich` versions cause issues with other libraries (e.g., `chromadb`, `typer`)
  - [BUG: Breaking]concurrent.futures.process.BrokenProcessPool: A process in the process pool was terminated abruptly while the future was running or pending.
- Docs
  - Python not yet supported