Education
- Integrated M.S./Ph.D. in Artificial IntelligenceKorea Advanced Institute of Science & Technology (KAIST) · Seoul, South Korea
Advisor: Se-Young Yun (Optimization & Statistical Inference Lab). Research focus: efficient long-context LLMs.
- B.E. in Computer Science and EngineeringNational University of Sciences & Technology (NUST) · Islamabad, Pakistan
Thesis: IoT-Based Intelligent Manufacturing Execution System with Predictive Analysis.
Research Experience
- Graduate ResearcherKorea Advanced Institute of Science & Technology · Seoul, South Korea
Developing efficient long-context architectures within the Optimization & Statistical Inference (OSI) Lab. Earlier at the xfact Lab — investigated the causal effect of residual connections on layer redundancy in LLMs.
- Research InternRiseTech · Islamabad, Pakistan
Implemented a U-Net architecture for high-fidelity spinal segmentation and automated scoliosis classification. Developed a vision–language model to generate clinical diagnostic text from chest X-ray imaging.
Current Research
- Content-Aware Sparsity:
- Dropping distractor tokens by content, not position — cutting noise as well as compute in long-context models.
Technical Infrastructure
- GPU Cluster & HPC Infrastructure — KAIST AI
Built and migrated individual servers into a 9-node, 68-GPU Slurm-managed fleet — FreeIPA identity, Netbird WireGuard control plane, and TrueNAS-backed shared home storage. Migrated 50+ users to a unified UID namespace and enforced cgroups-v2 with NUMA-aware GPU bindings for strict per-job CPU/GPU isolation.
Awards
- Commandant Gold Medal of Excellence · National University of Sciences & Technology
- Commandant Silver Medal of Excellence · National University of Sciences & Technology
Scholarships
- Fully-Funded Ph.D. Scholarship · Korea Advanced Institute of Science & Technology
- Merit-Based Scholarship (3 years) · National University of Sciences & Technology
Teaching & Mentoring
- Teaching Assistant — Deep Learning for Natural Language Processing · KAIST
- Research Mentor — Cultural Sensitivity in Multimodal Language and Diffusion Models · KAIST
- Research Mentor — The Many Voices of ChatGPT: Exploring Language Diversity · KAIST
- Research Mentor — ChatGPT Voices · KAIST
Skills
- AI Systems & HPC:
- Distributed Training (DDP / FSDP), Large-Scale Model Optimization (SPMD), TPU / XLA Performance Engineering, Low-level GPU Programming (Triton, CUDA), HPC Cluster Management (Slurm), High-Throughput Networking (RDMA)
- AI / ML Development:
- PyTorch, JAX, Hugging Face (Transformers, Accelerate), Weights & Biases
- Core Engineering:
- Python, C++, Docker, LaTeX, Shell Scripting