excalibur-docker

https://github.com/camelot-dev/excalibur

Run a temporary instance of the Excalibur PDF extraction tool more easily, without fighting dependency issues.

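Currently requires --net host because the app listens on 127.0.0.1 by default (accepts connections only on the loopback interface, which under Docker's default networking means only from inside the container) instead of 0.0.0.0 (accepts connections on all interfaces). With --net host the container shares the host's network stack, so the host's localhost reaches the app. This could be changed by adding host='0.0.0.0' to app.run() here: https://github.com/camelot-dev/excalibur/blob/eb3c64feee85dcae4bde5fa1a1f595123d5281e4/excalibur/cli.py#L58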
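The change suggested above can be sketched as two small shell helpers. This is only a sketch under assumptions: the function names and the DOCKER variable are made up for illustration, the sed pattern assumes the call appears literally as `app.run(` on a single line, and the image would have to be rebuilt from the patched source before port publishing has any effect. Port 5000 is taken from the last line of this README.

```shell
#!/bin/sh
# DOCKER is an illustrative override: set DOCKER=echo to print commands instead of running them.
DOCKER="${DOCKER:-sudo docker}"

# Insert host='0.0.0.0' as the first argument of app.run( in the given file.
# Assumes the call is written literally as "app.run(" on one line.
# A backup of the original file is kept next to it with a .bak suffix.
patch_listen_address() {
    sed -i.bak "s/app\.run(/app.run(host='0.0.0.0', /" "$1"
}

# Once the app listens on 0.0.0.0, publish only port 5000
# instead of sharing the whole host network with --net host.
run_excalibur_published() {
    $DOCKER run --rm -p 5000:5000 excalibur:0.4.2
}
```

For example, `patch_listen_address excalibur/cli.py` in a checkout of the Excalibur source, followed by a rebuild of the image and `run_excalibur_published`.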

Currently the container will not die on Ctrl-C; you have to run sudo docker kill <containerid> to stop it. ¯\_(ツ)_/¯
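One way to avoid looking up the container ID is to start the container detached and under a fixed name. A minimal sketch, assuming the name `excalibur` (an arbitrary choice), illustrative function names, and an illustrative DOCKER override variable:

```shell
#!/bin/sh
# DOCKER is an illustrative override: set DOCKER=echo to print commands instead of running them.
DOCKER="${DOCKER:-sudo docker}"

# Start the container in the background (-d) under a fixed name,
# so no terminal is tied up and Ctrl-C is not needed.
start_excalibur() {
    $DOCKER run -d --rm --net host --name excalibur excalibur:0.4.2
}

# Stop it by name. docker stop sends SIGTERM first and falls back to
# SIGKILL if the container is still running after 5 seconds.
stop_excalibur() {
    $DOCKER stop -t 5 excalibur
}
```

Because the container runs with --rm, it is removed as soon as it stops. While it is running, `sudo docker logs -f excalibur` shows the app's output.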

If you want the data to stick around when the container restarts, try mounting the /home/root/excalibur/ directory in the container to the host file system.
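A bind mount along those lines might look like the sketch below. The host directory name `excalibur-data`, the function name, and the DOCKER override variable are assumptions made for illustration; only the container path comes from this README.

```shell
#!/bin/sh
# DOCKER is an illustrative override: set DOCKER=echo to print commands instead of running them.
DOCKER="${DOCKER:-sudo docker}"

run_excalibur_persistent() {
    # Host directory that will hold the data; defaults to ./excalibur-data (hypothetical name).
    data_dir="${1:-$PWD/excalibur-data}"
    mkdir -p "$data_dir"
    # -v HOST_DIR:CONTAINER_DIR bind-mounts the host directory over the container path,
    # so files written there outlive the container even with --rm.
    $DOCKER run --rm --net host -v "$data_dir:/home/root/excalibur" excalibur:0.4.2
}
```

Calling `run_excalibur_persistent` with no argument uses `excalibur-data` in the current directory; pass an absolute path to store the data elsewhere.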


```shell
sudo docker build -t excalibur:0.4.2 .

sudo docker run --rm --net host excalibur:0.4.2

# open localhost:5000 in web browser
```