Under development — I'm actively building this site.

Huzama Ahmad

Huzama Ahmad

Ph.D. Candidate, KAIST AI.

I work on efficient language modeling — the architectures and systems that make large models cheaper to run at long context. Advised by Se-Young Yun in the OSI Lab. Recent work spans speculative decoding, sparse attention, and letting models control their own attention span.

All
All

BibTeX