I am a senior deep learning algorithms engineer at NVIDIA. I primarily work on Dynamo - a high-throughput low-latency inference framework designed for serving generative AI and reasoning models in multi-node distributed environments.
Previously, I was the cofounder of Agora Labs - a startup that built ML infrastructure on top of neo-cloud providers. We were acquired in February 2024 by brev.dev which in turn was acquired by NVIDIA in June 2024. Before that, I studied economics and statistics at Texas A&M University and had a brief stint at Columbia University before dropping out to work on Agora.
You can find some of my writings here.