back
Get SIGNAL/NOISE in your inbox daily

TL;DR: We developed a compiler that automatically transforms LLM inference into a single megakernel — a fused GPU kernel that performs…