Waris Gill
- Redmond, USA
- waris@vt.edu
- Github
- CV
Hi, I’m Waris Gill, a Senior Applied Scientist at Microsoft working on safety and security for generative AI systems (e.g., Azure OpenAI).
I completed my PhD in Computer Science at Virginia Tech, where I focused on interpretability techniques for distributed privacy-preserving AI systems and methods for evaluating Large Language Models in complex software engineering tasks.
During my PhD, I collaborated with industry on LLM infrastructure: semantic caching systems with Cisco (MeanCache) and embedding models for Redis (v1, v2) that are now widely used in production systems.
My work has been published at MLSys, ICSE, FSE, MSR, IPDPS, and SaTML.
news
| Feb 07, 2026 | My work, ProToken, has been accepted at MLSys 2026. ProToken provides token-level attribution for federated LLMs, enabling interpretability in distributed AI systems. |
|---|---|
| Dec 10, 2025 | My work, BinaryShield, at Microsoft has been accepted at SaTML 2026. BinaryShield enables privacy-preserving threat intelligence sharing to detect prompt injection attacks across LLM services. Microsoft filed a patent for this work. |
| May 27, 2025 | I started my internship at Microsoft as an Applied Scientist in the AI Safety team. |
| Apr 03, 2025 | My research at Redis on compact and efficient embeddings for semantic caching has been open sourced (read the paper here). The redis/langcache-embed-v1 and redis/langcache-embed-v2 models have surpassed 60K and 72K downloads on Hugging Face. Both are optimized for semantic caching in LLM services. |
| Mar 26, 2025 | Delivered a talk on TraceFL, an interpretability technique based on neuron provenance for federated learning, at the Flower AI Summit 2025–the world’s largest federated AI conference—held in London, UK. The talk is available at this link. [Slides] |
| Jan 30, 2025 | Our work on restoring Jupyter Notebooks is accepted at MSR-2025. Congratulations, Tien! |
| Jan 21, 2025 | I started my internship at Redis as a Machine Learning Engineer in the Redis AI team. |
| Dec 19, 2024 | My work, in collaboration with Cisco on MeanCache, has been accepted at IPDPS-2025. MeanCache is a semantic cache for LLM services. |
| Nov 01, 2024 | The baseline of our paper, FedDebug, for debugging malicious/faulty clients in Federated Learning is available in the Flower AI framework. Check out the code here. |
| Oct 31, 2024 | My work, TraceFL, is accepted at 𝗜𝗖𝗦𝗘-𝟮𝟬𝟮𝟱 (acceptance rate ~𝟭𝟬% [132/1219]). TraceFL addresses the open challenge of interpretability in federated learning using neuron provenance. |
| Oct 03, 2024 | Presented 𝐌𝐞𝐚𝐧𝐂𝐚𝐜𝐡𝐞, a semantic cache for LLMs, at the Amazon - Virginia Tech Initiative for Efficient and Robust ML. Selected as one of 18 participants for the poster presentation. [Poster] [Paper] |
| Aug 19, 2024 | Serving as a program committee member on the artifact evaluation track for the 47th International Conference on Software Engineering (ICSE) 2025. |
| Aug 07, 2024 | Delivered an invited talk on Achieving Debugging and Interpretability in Federated Learning Systems at Flower AI, a premier platform for federated learning. [Slides] |
| Dec 04, 2023 | Presented our paper, FedDefender, during the SE4SafeML event at FSE-2023 in San Francisco, California. |
| Sep 20, 2023 | My work at Cisco got open-sourced (Link). |
| May 22, 2023 | I started working at Cisco with Shannon and Pallavi. |
| May 14, 2023 | I received National Science Foundation (NSF) award to present our paper, FedDebug, at ICSE-2023 in Melbourne, Australia. [Slides] |