POSS: PDF OCR Stream Server

POSS is a server for real-time OCR of PDF files. It processes a PDF one page at a time and streams each page's extracted text as soon as that page is ready, so downstream applications can start working with the output before the whole file has been processed. POSS handles both text-based and image-based PDFs and returns structured text page by page. It is lightweight, designed with scalability in mind, and intended for integration into document workflows.
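The page-by-page streaming idea can be sketched with a Python generator. This is only an illustration of the concept, not the code in server.py: `ocr_page` is a hypothetical stand-in for a real OCR call, pages are represented as plain strings, and the one-JSON-object-per-line output format is an assumption.

```python
import json
from typing import Callable, Iterable, Iterator


def ocr_page(page: str) -> str:
    """Hypothetical stand-in for a real OCR call; it only normalizes whitespace."""
    return " ".join(page.split())


def stream_pages(
    pages: Iterable[str], ocr: Callable[[str], str] = ocr_page
) -> Iterator[str]:
    """Yield one JSON line per page as soon as that page has been processed."""
    for number, page in enumerate(pages, start=1):
        # Each result is emitted immediately instead of waiting for the full document.
        yield json.dumps({"page": number, "text": ocr(page)}) + "\n"


if __name__ == "__main__":
    for line in stream_pages(["  Hello   world ", "second\npage"]):
        print(line, end="")
```

Because `stream_pages` is a generator, a consumer can act on the first page while later pages are still being processed.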

Install

POSS uses Poetry to manage its dependencies.

curl -sSL https://install.python-poetry.org | python3 -

poetry install
poetry run pip install "magic-pdf[full]" --extra-index-url https://wheels.myhloli.com

Run server

uvicorn server:app --host 0.0.0.0 --reload

Use client

Please refer to client.py for how to send PDF files to the server.
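When consuming a streamed response, network chunks rarely line up with record boundaries. The sketch below shows one way a client might reassemble per-page results. It assumes, purely for illustration, that the server emits newline-delimited JSON; the actual response format is defined by server.py and client.py, and `iter_page_results` is a hypothetical helper, not part of POSS.

```python
import json
from typing import Iterable, Iterator


def iter_page_results(chunks: Iterable[bytes]) -> Iterator[dict]:
    """Reassemble newline-delimited JSON records from arbitrarily split byte chunks."""
    buffer = b""
    for chunk in chunks:
        buffer += chunk
        # Emit every complete line currently in the buffer; keep the remainder.
        while b"\n" in buffer:
            line, buffer = buffer.split(b"\n", 1)
            if line.strip():
                yield json.loads(line)
    # Handle a final record that was not terminated by a newline.
    if buffer.strip():
        yield json.loads(buffer)


if __name__ == "__main__":
    # Simulated network chunks that split records at awkward positions.
    simulated = [b'{"page": 1, "te', b'xt": "a"}\n{"page": 2,', b' "text": "b"}\n']
    for result in iter_page_results(simulated):
        print(result["page"], result["text"])
```

With an HTTP library that exposes the response body as an iterator of byte chunks, that iterator could be passed directly as `chunks`.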

Todo

Use Docker to deploy
LocalImageWriter
Interactive web page
Window OCR and chunking

License

This project is licensed under the MIT License - see the LICENSE file for details.
