Research Engineer - Language Model Pre-Training

Zyphra - San Francisco, California, United States

Posted Mar 17, 2026

Benefits

Parental leave: Not verified
Non-birth-parent leave: Not verified
Family-building benefits:
Fertility benefits: Not verified
Adoption assistance: Not verified
Surrogacy assistance: Not verified
Mental health support: Not verified
Relocation assistance: Not verified
Childcare support: Not verified
Learning budget: Not verified
Verification: Not verified
Salary: Not verified
401(k) match: Not verified

Schedule

Shift type: Not verified
Weekend work: Not verified

Application

Cover letter: Not verified
Assessment: Not verified
Deadline: Not stated

Where they hire

State eligibility is not yet verified.

About this role

Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:
As a Research Engineer - Language Model Pre-Training, you'll shape our language model roadmap through end-to-end pretraining development. You will work extremely closely with our pretraining team, who will integrate your insights into our next-generation models.

You'll Work Across:
- Large-scale training runs and model parallelization
- Performance optimization of our pretraining stack
- Dataset collection, processing, and evaluation
- Architecture and methodology research, including optimizer ablations

What We're Looking For / Requirements:
- Strong engineering aptitude for rapidly implementing reliable and robust systems
- Can rapidly learn new fields and are excited to implement new ideas
- Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Qualifications / Additional Skills:
- Deep expertise and intuition for solving machine learning problems and training models
- Experience with training on large-scale (multi-node) GPU clusters
- Deep understanding of model training pipelines, including model/data parallelism, distributed optimizers, etc.
- Strong grasp of proper experimental methodology for running rigorous ablations and other hypothesis testing
- Understanding of large-scale, highly parallel data processing pipelines
- High proficiency with PyTorch and Python
- Strong ability to dive into large pre-existing codebases and rapidly get up to speed
- Published machine learning research in well-respected venues is a plus
- Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Math, Physics)

Why

Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.

Apply at jobs.ashbyhq.com

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.

Related jobs

Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville

Business Analyst (Top Secret cleared)
ICF International INC - Washington, DC

Engineering Project Specialist II (Full Time) - United State
Cisco - San Jose, California, US

Automation AI Ops Engineer
Cisco - 2 Locations
