All tags

#Inference Optimization

11 articles

Tech9 min

TRACER trains a surrogate from LLM classification API logs and swaps in via a parity gate