AI Researcher (Voice)

Posted 5 months agoViewed

San FranciscoFull-TimeHuman Computing

Company:Tavus

Location:San Francisco

Languages:English

Seniority level:Senior

Skills:

PythonSoftware DevelopmentArtificial IntelligenceImage ProcessingMachine LearningPyTorchPrototypingResearch

Requirements:

Proven experience with flow matching, diffusion models, auto regressive networks in the audio domain Experience training deep learning models (medium-sized to large) Experience building streaming text-to-speech models or speech-to-speech models Strong foundations in audio modeling and ability to innovate rapidly through prototyping Knowledge of state-of-the-art architectures in representation learning (audio or image domain, face animation) Excellent programming skills and fluency in PyTorch Evidence of original research with publications in top-tier or solid second-tier venues Excited about building lifelike, expressive avatars for real-time applications

Responsibilities:

Lead research efforts on generative video and audio models Work with the Applied ML team to help productionize our research Stay relevant with the latest advancements and help create new advancements