Research Engineer - Audio & Speech Models
Zyphra - San Francisco, California, United States
Posted Mar 17, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Was this benefit information wrong? Tell us.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Research Engineer - Audio & Speech Models San Francisco, California, United States Zyphra is an artificial intelligence company based in San Francisco, California. The Role: As a Research Engineer - Audio & Speech Models , you will be a core contributor on Zyphra's Audio Team, building the next generation of open-source autoencoders, ASR, TTS, SSL, and speech-to-speech models. You will be deeply involved in the entire model training process, from data gathering and processing to designing novel architectures and training methodologies. You'll Work Across: - Large-scale audio training runs - Performance optimization of our training stack - Audio dataset collection, processing, and evaluation - Architecture and training methodology ablations and improvements What We're Looking For / Requirements: - Strong research taste and intuition. The ability to work through a research project from conception to execution to write-up. - Strong implementation and prototyping ability (can take an idea from conception to experimentation quickly) - The ability to work well with others in a high-paced research setting - Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale Qualifications / Additional Skills: - Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models - Experience in training audio autoencoders - Understanding of signal processing, especially of audio signals - Experience with diffusion models, consistency models, or GANs - Experience with training on large-scale (multi-node) GPU clusters - Strong grasp of proper experimental methodology for running rigorous ablations
Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
Related jobs
-
Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville
-
Business Analyst (Top Secret cleared)
ICF International INC - Washington, DC
-
Engineering Project Specialist II (Full Time) - United State
Cisco - San Jose, California, US
-
Automation AI Ops Engineer
Cisco - 2 Locations