I run paperless-ngx in a docker container. Have it scan my email for attachments once a day. It automatically tags the email depending on keywords found in the email and sender.
If I scan a document to import I tag it manually.
But paperless-ngx also has ocr, so it will scan the whole page and save that data. So I can search for example ‘samsung’ and it will show me all documents where Samsung is in. Even if it is not tagged.
My docker-compose:
version: "3.3"
networks:
paperless:
name: paperless
driver: bridge
ipam:
config:
- subnet: 172.36.0.0/16
services:
paperless-redis:
container_name: paperless-redis
image: docker.io/library/redis:7
restart: unless-stopped
networks:
- paperless
volumes:
- ./redis:/data
paperless-db:
container_name: paperless-db
image: docker.io/library/postgres:13
restart: unless-stopped
networks:
- paperless
volumes:
- ./db:/var/lib/postgresql/data
environment:
POSTGRES_DB: paperlessdb
POSTGRES_USER: paperless
POSTGRES_PASSWORD: super-secure-password
paperless:
container_name: paperless
image: ghcr.io/paperless-ngx/paperless-ngx:latest
restart: unless-stopped
networks:
- paperless
depends_on:
- paperless-db
- paperless-redis
ports:
- 8002:8000
healthcheck:
test: ["CMD", "curl", "-fs", "-S", "--max-time", "2", "http://localhost:8000"]
interval: 30s
timeout: 10s
retries: 5
volumes:
- ./data:/usr/src/paperless/data
- ./media:/usr/src/paperless/media
- ./export:/usr/src/paperless/export
- ./consume:/usr/src/paperless/consume
env_file: ./docker-compose.env
environment:
PAPERLESS_REDIS: redis://paperless-redis:6379
PAPERLESS_DBHOST: paperless-db
The .en file you can find on there GitHub. But the over important part is to setup a language for it.
# The default language to use for OCR. Set this to the language most of your
# documents are written in.
PAPERLESS_OCR_LANGUAGE=nld
What do you thing of Kasm Workspaces.
You can create a Desktop workspace, different diestros and environments to choose from. And you can create users so each user has his own login.