Skip to content

Howe829/poss

Repository files navigation

POSS:PDF OCR Stream Server

img.png POSS is a server designed for efficient, real-time OCR processing of PDF files. By streaming each page, it enables on-the-fly text extraction and processing, making it ideal for applications requiring quick and continuous handling of PDF documents. POSS leverages powerful OCR tools to handle both text and image-based PDFs, providing accurate and structured text outputs, page by page, as the files are processed. Perfect for integration into document workflows, POSS is lightweight, flexible, and built with scalability in mind.

Install

poss uses poetry to manage the dependencies

curl -sSL https://install.python-poetry.org | python3 -
poetry insall
poetry run pip install magic-pdf[full] --extra-index-url  https://wheels.myhloli.com

Run server

uvicorn server:app --host 0.0.0.0 --reload

Use client

Please refer the client.py for how to send pdf files to the server.

Todo

  • Use docker to deploy
  • LocalImageWriter
  • Interactive web page
  • Window ocr and chunking

License

This project is licensed under the MIT License - see the LICENSE file for details.

Releases

No releases published

Packages

No packages published