Hello! I am Rithesh Kumar, an AI researcher with expertise in deep learning and generative modeling.
Currently, I am a Research Scientist and member of the Audio Research Group at Adobe Research.
Previously, I was the Technical Lead for the Overdub Research team at Descript Inc. During that time, I built and shipped 4+ text-to-speech models behind the
flagship Overdub feature, which is capable of ultra-realistic voice cloning and
text-based correction of recordings. Recently, I also led the development of the
Regenerate feature that leverages instant voice cloning technology to make bad edits sound seamless and natural.
Currently, I live in Toronto, Ontario 🇨🇦.
I completed my MSc in Computer Science (specializing in Artificial Intelligence) at the Mila lab at Université de Montréal, supervised by Yoshua Bengio. During my MSc, I had the excellent opportunity to intern at Lyrebird and Microsoft Research - Montréal.
Earlier, I graduated from SSN College of Engineering (affiliated with Anna University) with a Bachelor's degree in Computer Science and Engineering. I spent the final two years of my undergrad learning about deep learning, spending a summer at the Serre Lab at Brown University and collaborating with Prof. Yoshua Bengio at the Mila lab.