OCR Debug
Upload a document, run a conversion, inspect every bounding box and validate the extraction against the source page.
Inspect, debug and repair Docling-based RAG pipelines visually.
Docling turns PDFs into rich structured data, but structured data alone does not tell you whether the extraction is good enough for production. Debugging a RAG pipeline means chasing silent failures across OCR, chunking, embedding and retrieval — with no single place to see what went wrong.
Docling Studio is the missing inspection layer: a visual studio where each stage of the pipeline can be opened, verified and repaired before it ships.
Upload a document, run a conversion, inspect every bounding box and validate the extraction against the source page.
Walk a document through chunking, embedding and vector-store retrieval. Visualise every transformation and spot regressions before they ship.
Visual inspection for chunkless retrieval pipelines. Early build available — expect rough edges and breaking changes.
Run Docling Studio locally for fast debugging, or plug it into an external Docling Serve instance to inspect your production pipeline.
Docling embedded in the container. Standalone, zero dependencies, ideal for quick inspection and offline work.
# Local — Docling embedded
docker run -p 8000:8000 \
ghcr.io/scub-france/docling-studio:latest
Connect Docling Studio to an external Docling Serve instance. The recommended mode to inspect staging or production pipelines.
# Remote — connected to Docling Serve docker run -p 8000:8000 \ -e DOCLING_MODE=remote \ -e DOCLING_SERVE_URL=https://your-docling-serve \ ghcr.io/scub-france/docling-studio:latest
Docling Studio supports multiple vector stores through a Protocol-based adapter system — swap the backend without touching the pipeline.
First-class adapter, used in production deployments.
Graph-native retrieval adapter on the roadmap.
Add a new backend by implementing the VectorStore Protocol.
Ports and adapters isolate the pipeline from the outside world. Feature flags switch deployment modes without touching the code.
Depends on Docling, a project in the Linux Foundation AI & Data ecosystem.
Connects to any Docling Serve instance as a remote backend, locally or in production.
Open source under the MIT license, developed and maintained by SCUB.