Tabula is a tool for liberating data tables trapped inside PDF files. This is a Docker image to run the web server inside a lightweight Linux container.
Run from Docker Hub
docker run -d -p 8080:8080 asnelling/tabula
Now open your browser to http://localhost:8080/
You may wish to tweak the docker run
command above in order to...
-
specify a specific version of Tabula
# Example: v0.9.3 docker run -d -p 8080:8080 asnelling/tabula:0.9.3
-
listen on a different port
# Example: port 8100 docker run -d -p 8100:8080 asnelling/tabula
Access the web interface at http://localhost:8100/
-
follow the logs while the container runs
docker run -p 8080:8080 asnelling/tabula
You may find more options in the official documentation
Instead of pulling a pre-built container image from asnelling/tabula, you may build your own using the Dockerfile in this repo:
git clone [email protected]:asnelling/tabula-docker.git
docker build -t mytabula tabula-docker
docker run -d -p 8080:8080 mytabula