All Articles - Page 20 | lilting channel

TechApr 23, 202613 min

Qwen3.6-27B Dense vs Qwen3.6-35B-A3B MoE on M1 Max — MLX Was 2× Faster Than Ollama

Tried Qwen3.6-27B on both Ollama and MLX. Ollama couldn't load the VL-projector-embedded GGUF, MLX ran it at 11 tok/s. On the side, running 35B-A3B under MLX was roughly 2× faster than the Ollama GGUF. Also had both models build a BBS to gauge intent handling.

LLM Local LLM Qwen Ollama MLX Apple Silicon MoE Experiment

TechApr 23, 20266 min

Math for reading AI articles: the full 5-article series

A hub for the 5-article series that organizes math symbols in AI and LLM articles for reading, not solving. Covers equations, vectors and matrices, probability and statistics, derivatives, and gradient descent with backprop, plus a reading-order guide for different backgrounds.

AI LLM 機械学習数式入門

TechApr 23, 202628 min

Gradient descent and backprop, just enough to read AI articles

Gradient descent, SGD and Adam, backpropagation, vanishing/exploding gradients with residual connections, and learning rate schedules — organized around what each piece is doing at a high level. The goal is reading training logs and model card numbers, not computing anything.

AI LLM 機械学習数式入門

TechApr 23, 202623 min

Derivatives, just enough to read AI articles

A minimum set of calculus for reading AI and LLM articles — d/dx, e, the chain rule, partial derivatives, and gradients. Focus on what the symbols are doing, not on solving the formulas.

AI LLM 機械学習数式入門

TechApr 22, 202623 min

Probability and statistics, just enough to read AI articles

A minimum set of probability and statistics for reading AI and LLM articles — conditional probability, cross-entropy, perplexity, and temperature are the main ones; rigorous Bayes and MLE derivations stay out of scope.

AI LLM 機械学習数式入門

TechApr 22, 202613 min

Auto-assigning piano fingerings from a score based on hand size via physics simulation

A browser tool that reads MusicXML and returns fingerings tuned to your hand size and biomechanical constraints. Walks through the backtracking cost minimization, the actual weight values, the academic lineage since Parncutt 1997, and why the same framework generalizes to guitar.

ブラウザ Webアプリ Web Audio API Canvas React アルゴリズム物理

TechApr 22, 202612 min

Rescuing exifr's broken GPS parsing on iPhone 17 HEIC with a browser-only fallback

iPhone 17's HEIC adds new brand identifiers to the ftyp box, pushing it past exifr's hard-coded 50-byte guard. Here's a dynamic-import fallback to ExifReader, plus Null Island filtering and iloc pre-inspection to harden browser-only photo tools.

iOS ブラウザ JavaScript Web セキュリティ

TechApr 22, 202618 min

Vectors and matrices, just enough to read AI articles

A minimum set of vectors and matrices for reading AI and LLM articles — the dot product and matrix product are the main two; determinants, inverses, and eigenvalues stay out of scope.

AI LLM 機械学習数式入門

TechApr 22, 20264 min

Microsoft Patches ASP.NET Core Privilege Escalation CVE-2026-40372

A regression in cryptographic signature validation introduced a CVSS 9.1 flaw into .NET 10.0. The Data Protection API implemented HMAC verification incompletely, opening the door to padding oracle attacks and forged authentication tokens.

Security ASP.NET Core .NET CVE Vulnerability Cryptography

TechApr 21, 202612 min

The small set of math that makes AI articles readable

A minimum set of math for reading AI, LLM, and image-generation articles — the aim isn't to derive anything, just to recognize weighted sums, S-curves, probabilities, and the 'nudge toward the answer' step of training.

AI LLM 機械学習数式入門

TechApr 21, 2026updated11 min

Qwen3.6-35B-A3B on M1 Max via Ollama 0.20.6: 27 tok/s same as 3.5, but 13× thinking tokens

Hands-on Qwen3.6-35B-A3B (23GB 4bit GGUF) on M1 Max 64GB via Ollama 0.20.6. Generation speed stays at 27 tok/s — same as Qwen3.5-35B-A3B — but the same prompt produces 13× more thinking tokens. Multi-turn behavior, persona handling, and a three-tier NSFW probe included.

LLM Local LLM Qwen Ollama Apple Silicon MoE Experiment

TechApr 21, 2026updated19 min

TRELLIS.2 trellis-mac port tested on M1 Max 64GB: setup, generation time, MPS bottlenecks

Hands-on run of trellis-mac (the CUDA-free port of TRELLIS.2) on M1 Max 64GB. Setup via uv with PyTorch 2.11.0 MPS, applied mps_compat.py patches, and recorded actual generation time vs the M4 Pro 24GB 3.5-minute reference, plus where the bottlenecks land on Apple Silicon.

AppleSilicon MPS PyTorch 3D ML 実験