Back to Careers
Research Scientist
Applied AI|Bay Area|
Hybrid | Full-time
As a Research Scientist you will help lead the development for our Speech Synthesis technology which is core to VoiceCare AI and our Generative AI Healthcare Agents. You will play a pivotal role in developing and advancing our state-of-the-art speech synthesis capabilities tailored specifically for the healthcare domain. Working alongside a dedicated and talented team, you will lead efforts to tackle challenging problems in generative speech synthesis, text-to-speech, accent modeling, and voice conversions. This position offers an exciting opportunity to contribute to groundbreaking research and shape the future of AI-driven healthcare.
Key Responsibilities:
- •Design, develop, and maintain speech synthesis infrastructure
- •Collaborate with data scientists and machine learning engineers to design and implement scalable audio pipelines
- •Build and maintain APIs and microservices that enable efficient and reliable audio processing
- •Develop and implement robust data security and privacy measures to protect sensitive patient information
- •Monitor and optimize the performance of speech synthesis systems to ensure maximum efficiency and uptime
- •Work closely with product managers to understand customer needs and requirements and translate them into technical solutions
- •Mentor and provide technical guidance to junior engineers
Looking for someone who has the following skills:
- •Ph.D. or Postdoctoral researcher in audio synthesis, speech processing, or a related field, with a strong academic background in healthcare applications preferred
- •3+ years of experience in audio synthesis techniques, including TTS, SST, or voice conversion methods, with a track record of successful research projects
- •Demonstrable research experience with a strong publication record in major Speech Synthesis and Speech Processing venues such as ICASSP, Interspeech, or NeurIPS
- •Hands-on experience in Python, PyTorch, or TensorFlow
- •Familiarity with compute platforms such AWS for scalable model development and deployment
- •Experience in audio identity embedding, accent modeling, style-transfer, and multi-language audio synthesis within the context of healthcare applications
- •Knowledge of attention mechanisms, diffusion models, and advanced speech signal processing techniques
At VoiceCare, you will:
- •Research Scientists translate customer needs into tangible outcomes
- •In this crucial role, you will propel forward innovative research and development