
Research Scientist

Applied AI | Bay Area
Hybrid | Full-time

As a Research Scientist, you will help lead the development of our speech synthesis technology, which is core to VoiceCare AI and our Generative AI Healthcare Agents. You will play a pivotal role in advancing our state-of-the-art speech synthesis capabilities, tailored specifically to the healthcare domain. Working alongside a dedicated and talented team, you will lead efforts to tackle challenging problems in generative speech synthesis, text-to-speech, accent modeling, and voice conversion. This position offers an exciting opportunity to contribute to groundbreaking research and shape the future of AI-driven healthcare.

Key Responsibilities:

  • Design, develop, and maintain speech synthesis infrastructure
  • Collaborate with data scientists and machine learning engineers to design and implement scalable audio pipelines
  • Build and maintain APIs and microservices that enable efficient and reliable audio processing
  • Develop and implement robust data security and privacy measures to protect sensitive patient information
  • Monitor and optimize the performance of speech synthesis systems to ensure maximum efficiency and uptime
  • Work closely with product managers to understand customer needs and requirements and translate them into technical solutions
  • Mentor and provide technical guidance to junior engineers

Looking for someone who has the following skills:

  • Ph.D. or postdoctoral research experience in audio synthesis, speech processing, or a related field; a strong academic background in healthcare applications is preferred
  • 3+ years of experience in audio synthesis techniques, including TTS, SST, or voice conversion methods, with a track record of successful research projects
  • Demonstrable research experience with a strong publication record in major Speech Synthesis and Speech Processing venues such as ICASSP, Interspeech, or NeurIPS
  • Hands-on experience in Python, PyTorch, or TensorFlow
  • Familiarity with compute platforms such as AWS for scalable model development and deployment
  • Experience in audio identity embedding, accent modeling, style-transfer, and multi-language audio synthesis within the context of healthcare applications
  • Knowledge of attention mechanisms, diffusion models, and advanced speech signal processing techniques

At VoiceCare, you will:

  • Translate customer needs into tangible outcomes
  • Propel forward innovative research and development in this crucial role