Research Engineer - Language Model Pre-Training
Zyphra - San Francisco, California, United States
Posted Mar 17, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Was this benefit information wrong? Tell us.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Research Engineer - Language Model Pre-Training San Francisco, California, United States Zyphra is an artificial intelligence company based in San Francisco, California. The Role: As a Research Engineer - Language Model Pre-Training , you'll shape our language model roadmap through end-to-end pretraining development. You will work extremely closely with our pretraining team, who will integrate your insights into our next-generation models. You'll Work Across: - Large-scale training runs and model parallelization - Performance optimization of our pretraining stack - Dataset collection, processing, and evaluation - Architecture and methodology research, including optimizer ablations What We're Looking For / Requirements: - Strong engineering aptitude for rapidly implementing reliable and robust systems - Can rapidly learn new fields and are excited to implement new ideas - Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale Qualifications / Additional Skills: - Deep expertise and intuition for solving machine learning problems and training models - Experience with training on large-scale (multi-node) GPU clusters - Deep understanding of model training pipelines - including model/data parallelism, distributed optimizers, etc. - Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing - Understanding of large-scale, highly parallel data processing pipelines - High proficiency with PyTorch and Python. - Strong ability to dive into large pre-existing codebases and rapidly get up to speed - Published machine learning research in well-respected venues is a plus - Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Math, Physics) Why
Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
Related jobs
-
Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville
-
Business Analyst (Top Secret cleared)
ICF International INC - Washington, DC
-
Engineering Project Specialist II (Full Time) - United State
Cisco - San Jose, California, US
-
Automation AI Ops Engineer
Cisco - 2 Locations