Audio foundation models & voice-to-voice

Research at Sarvam AI on audio/speech foundation models, voice-to-voice systems, and music generation.

At Sarvam AI I work on audio foundation models and voice-to-voice (speech) systems, along with music generation. The focus is on learning strong audio/speech representations and building models that listen and speak naturally.