Research Engineer - Agency and Reasoning
Zyphra - San Francisco, California, United States
Posted Mar 17, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Was this benefit information wrong? Tell us.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Research Engineer - Agency and Reasoning San Francisco, California, United States Zyphra is an artificial intelligence company based in San Francisco, California. The Role: As a Research Engineer - Agency and Reasoning , you will be a core contributor to Zyphra's Agency and Reasoning Team. You will be involved with performing novel research in reinforcement learning, post-training, and human preference learning, and applying your ideas at scale to our next generation of language models. What We're Looking For / Requirements: - Strong research taste and intuition - The ability to work through a research project from conception to execution to write-up - Strong implementation and prototyping skillset - A researcher who can take an idea from conception to experimentation extremely quickly - The ability to work well and cooperate with others in a high-paced research setting - Curiosity, interest, and joy in understanding intelligence. Qualifications / Additional Skills: - Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks - Experience with language-model-supervised fine-tuning and preference-learning methods, such as DPO and simPO. - Experience with context-length extension methods - A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning - Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation - Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics) - Previously published machine learning research in well-respected venues - Highly proficient with PyTorch and Python
Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
Related jobs
-
Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville
-
Business Analyst (Top Secret cleared)
ICF International INC - Washington, DC
-
Engineering Project Specialist II (Full Time) - United State
Cisco - San Jose, California, US
-
Automation AI Ops Engineer
Cisco - 2 Locations