Tried to implement morphological analysis in the browser — Sudachi was too heavy, kuromoji.js had a bug. Eventually solved it with a fork and a custom loader.
A story about dates going haywire in an AI-generated PHP script, plus a breakdown of timezone pitfalls in each layer — server, PHP, MySQL, and JavaScript.
When Layout Parser wouldn't install and NDLOCR alone couldn't handle a 4-column vertical text book, I used PyMuPDF and histogram analysis to brute-force split the columns.