#Python

40 articles

TechFeb 1, 20264 min

PageIndex - tree RAG with LLM reasoning only, no vector search

I looked into PageIndex, a RAG system that builds hierarchical document trees using only LLM reasoning, without chunking or vector databases. I also consider how it fits with layout detection and OCR pipelines.

AI RAG LLM OCR Python

TechJan 28, 20264 min

JAXA Earth API for Python: a simple way to work with satellite data

A look at JAXA's Python package for Earth observation satellite data, covering installation, basic usage, and Claude Desktop integration.

Python JAXA Satellite Data API MCP

TechDec 9, 20254 min

Setting Up LLM/RAG for Work, So I Tried LoRA Training on the Side (Part 1)

Introducing a Mac mini M4 Pro to build an in-house RAG system. A plan for setting up a LoRA training environment during downtime while waiting for specs to be finalized.

AI LLM RAG LoRA Mac Apple Silicon Python ComfyUI Stable Diffusion Image Generation Experiment

TechDec 1, 20253 min

Solving NDLOCR Column Layout Recognition with Histogram Analysis

When Layout Parser wouldn't install and NDLOCR alone couldn't handle a 4-column vertical text book, I used PyMuPDF and histogram analysis to brute-force split the columns.

NDLOCR OCR Python PyMuPDF Experiment