Audio foundation models & voice-to-voice
Research at Sarvam AI on audio/speech foundation models, voice-to-voice systems, and music generation.
At Sarvam AI, I work on audio foundation models, voice-to-voice speech systems, and music generation. The focus is on learning strong audio and speech representations and on building models that listen and speak naturally.