#NDLOCR

4 articles

TechFeb 27, 20268 min

Three Months with NDLOCR: The Full Journey and What Others Built

From Docker hell to Lite + LLM correction. A retrospective on my own experimentation, plus an introduction to someone else's browser-based NDLOCR-Lite implementation.

OCR NDLOCR NDLOCR-Lite Python Docker Local LLM ONNX WebAssembly Experiment

TechDec 7, 20257 min

Japanese OCR on the Web in 2025: Limits and Lessons

From browser OCR and server-side OCR to cloud APIs and AI — a roundup of what I learned trying to implement Japanese OCR on the web, including the limits of each approach.

OCR JavaScript Tesseract.js NDLOCR Transformers.js AI Docker Google Cloud Vision PaddleOCR Japanese OCR Browser Experiment

TechDec 1, 20253 min

Solving NDLOCR Column Layout Recognition with Histogram Analysis

When Layout Parser wouldn't install and NDLOCR alone couldn't handle a 4-column vertical text book, I used PyMuPDF and histogram analysis to brute-force split the columns.

NDLOCR OCR Python PyMuPDF Experiment

TechDec 1, 20254 min

How to Successfully Build the NDLOCR Docker Image

Points where building the NDLOCR Docker image gets stuck, and how to solve them.

Docker NDLOCR OCR Windows AI CUDA Experiment