Research Scientist

Applied AI | Bay Area

Hybrid | Full-time

As a Research Scientist, you will help lead the development of our speech synthesis technology, which is core to VoiceCare AI and our Generative AI Healthcare Agents. You will play a pivotal role in advancing our state-of-the-art speech synthesis capabilities, tailored specifically to the healthcare domain. Working alongside a dedicated and talented team, you will lead efforts to tackle challenging problems in generative speech synthesis, text-to-speech, accent modeling, and voice conversion. This position offers an opportunity to contribute to groundbreaking research and shape the future of AI-driven healthcare.

Key Responsibilities:

•Design, develop, and maintain speech synthesis infrastructure
•Collaborate with data scientists and machine learning engineers to design and implement scalable audio pipelines
•Build and maintain APIs and microservices that enable efficient and reliable audio processing
•Develop and implement robust data security and privacy measures to protect sensitive patient information
•Monitor and optimize the performance of speech synthesis systems to ensure maximum efficiency and uptime
•Work closely with product managers to understand customer needs and requirements and translate them into technical solutions
•Mentor and provide technical guidance to junior engineers

We are looking for someone with the following skills:

•Ph.D. or postdoctoral research experience in audio synthesis, speech processing, or a related field; a strong academic background in healthcare applications is preferred
•3+ years of experience in audio synthesis techniques, including TTS, STT, or voice conversion methods, with a track record of successful research projects
•Demonstrable research experience with a strong publication record in major Speech Synthesis and Speech Processing venues such as ICASSP, Interspeech, or NeurIPS
•Hands-on experience in Python, PyTorch, or TensorFlow
•Familiarity with compute platforms such as AWS for scalable model development and deployment
•Experience in audio identity embedding, accent modeling, style transfer, and multilingual audio synthesis within the context of healthcare applications
•Knowledge of attention mechanisms, diffusion models, and advanced speech signal processing techniques

At VoiceCare, you will:

•Translate customer needs into tangible outcomes
•Propel forward innovative research and development in this crucial role