A paper claims that a single binary operator eml(x, y) = exp(x) - ln(y) combined with the constant 1 can express all elementary functions — arithmetic, trig, logarithms, even pi. I read the paper and tested it in 5 languages.
Meta AI's HyperAgents performs metacognitive self-correction that optimizes improvement strategies themselves. Self-improvement appears in four non-coding domains, and strategies learned in one domain transfer to another, along with spontaneously acquired persistent memory.
A paper explains that two seemingly mysterious Transformer behaviors, heavy attention on specific tokens and unusually large activations in specific dimensions, are actually manifestations of the same mechanism.
How should memory be allocated in reasoning models? This paper explains the trade-offs among quantization, KV cache, and test-time compute, based on 1,700 experiments.
I investigated the source behind the viral claim that a Johns Hopkins study found ChatGPT lies 27% of the time, and it turns out multiple different studies have been mixed together.
Implemented all 12 text processing tools planned in the previous article. Also reorganized the category system and switched the listing UI to a table layout.
I compiled research findings and implementation specifications for adding text-processing tools to lilting.ch/lab. Based on a comparison with DevToys, it highlights the gaps and documents detailed specs for 12 candidate tools.