Research Engineer - Audio & Speech Models

Zyphra - San Francisco, California, United States

Posted Mar 17, 2026

Benefits

Parental leave: Not verified
Non-birth-parent leave: Not verified
Family-building benefits: Fertility benefits: Not verified
Adoption assistance: Not verified
Surrogacy assistance: Not verified
Mental health support: Not verified
Relocation assistance: Not verified
Childcare support: Not verified
Learning budget: Not verified
Verification: Not verified
Salary: Not verified
401(k) match: Not verified

Was this benefit information wrong? Tell us.

Schedule

Shift type: Not verified
Weekend work: Not verified

Application

Cover letter: Not verified
Assessment: Not verified
Deadline: Not stated

Where they hire

State eligibility is not yet verified.

About this role

Research Engineer - Audio & Speech Models San Francisco, California, United States Zyphra is an artificial intelligence company based in San Francisco, California. The Role: As a Research Engineer - Audio & Speech Models , you will be a core contributor on Zyphra's Audio Team, building the next generation of open-source autoencoders, ASR, TTS, SSL, and speech-to-speech models. You will be deeply involved in the entire model training process, from data gathering and processing to designing novel architectures and training methodologies. You'll Work Across: - Large-scale audio training runs - Performance optimization of our training stack - Audio dataset collection, processing, and evaluation - Architecture and training methodology ablations and improvements What We're Looking For / Requirements: - Strong research taste and intuition. The ability to work through a research project from conception to execution to write-up. - Strong implementation and prototyping ability (can take an idea from conception to experimentation quickly) - The ability to work well with others in a high-paced research setting - Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale Qualifications / Additional Skills: - Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models - Experience in training audio autoencoders - Understanding of signal processing, especially of audio signals - Experience with diffusion models, consistency models, or GANs - Experience with training on large-scale (multi-node) GPU clusters - Strong grasp of proper experimental methodology for running rigorous ablations

Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.

Apply at jobs.ashbyhq.com

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.

Related jobs

Systems Engineer - (Execution) - Level 3/4

Northrop Grumman - United States-Alabama-Huntsville
Business Analyst (Top Secret cleared)

ICF International INC - Washington, DC
Engineering Project Specialist II (Full Time) - United State

Cisco - San Jose, California, US
Automation AI Ops Engineer

Cisco - 2 Locations

Research Engineer - Audio & Speech Models

Benefits

Schedule

Application

Where they hire

About this role

What verified means

Related jobs

Systems Engineer - (Execution) - Level 3/4

Business Analyst (Top Secret cleared)

Engineering Project Specialist II (Full Time) - United State

Automation AI Ops Engineer