POSS is a server for real-time OCR processing of PDF files. It streams results page by page, so text can be extracted and processed on the fly while the rest of the file is still being handled, which suits applications that need quick, continuous handling of PDF documents. POSS handles both text-based and image-based PDFs and returns structured text for each page as it is processed. It is lightweight, flexible, built with scalability in mind, and intended for integration into document workflows.
POSS uses Poetry to manage its dependencies.
curl -sSL https://install.python-poetry.org | python3 -
poetry install
poetry run pip install "magic-pdf[full]" --extra-index-url https://wheels.myhloli.com
poetry run uvicorn server:app --host 0.0.0.0 --reload
See client.py for an example of how to send PDF files to the server.
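For orientation, here is a minimal sketch of what such a client could look like using only the Python standard library. It is not a copy of client.py. The route (`/ocr`), the form field name (`file`), and the response format (one JSON object per line, one line per page) are all assumptions made for illustration, so check server.py and client.py for the real values. Port 8000 is the uvicorn default.

```python
import json
import os
import urllib.request
import uuid
from typing import Iterable, Iterator, Tuple

# Hypothetical values: the real route and field name are defined in server.py.
SERVER_URL = "http://localhost:8000/ocr"
FILE_FIELD = "file"


def build_multipart(filename: str, data: bytes, field: str = FILE_FIELD) -> Tuple[bytes, str]:
    """Encode one file as a multipart/form-data body; return (body, content_type)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return head + data + tail, f"multipart/form-data; boundary={boundary}"


def iter_pages(lines: Iterable[bytes]) -> Iterator[dict]:
    """Parse newline-delimited JSON (assumed response format), skipping blank lines."""
    for raw in lines:
        raw = raw.strip()
        if raw:
            yield json.loads(raw)


def send_pdf(path: str, url: str = SERVER_URL) -> Iterator[dict]:
    """POST a PDF and yield each page result as the server sends it."""
    with open(path, "rb") as f:
        body, content_type = build_multipart(os.path.basename(path), f.read())
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request) as response:
        # The response object is iterable line by line, so pages can be
        # consumed incrementally instead of waiting for the whole document.
        yield from iter_pages(response)
```

With the server running, `for page in send_pdf("example.pdf"): print(page)` would print each page result as it arrives, provided the assumptions above match the actual server.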
- Use Docker to deploy
- LocalImageWriter
- Interactive web page
- Window OCR and chunking
This project is licensed under the MIT License - see the LICENSE file for details.