As AI capabilities surpass our own, the defining challenge shifts from pursuing a monolithic intelligence to designing a truly collaborative partner. My research builds this bridge by drawing on the source of human progress: our own collective intelligence. I work not only to build standalone AI systems that can accomplish human requests, but also to learn from human cooperation and design a new form of collective intelligence at scale, one where AI agents integrate into our society not just as powerful tools, but as true partners that co-create with humans seamlessly.
At Google DeepMind, my work involves building some of the largest and most capable models today. I was a core contributor to the training of the Gemini family of models (1.0, 1.5, and 2.5) and also helped develop the open-source Gemma models.
My passion for collaborative AI began during my PhD, where my research explored steerability, reliability, and complex problems at the intersection of LLMs and RL. My work on reinforced inference-time alignment was honored with a Best Paper Award at AAAI 2021.
You can find all my publications on my Google Scholar.
I'm always open to interesting conversations. Feel free to reach me at my personal email or work email.