Prefill, decode, and the memory wall — an optimization-first runtime for local LLM inference on consumer NVIDIA GPUs.
Demand curves, elasticity, causal inference, and agentic pricing loops: the microeconomic foundations behind modern pricing systems.