Data Engineer - Multimodal Systems
Zyphra - San Francisco, California, United States
Posted Mar 17, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Zyphra is an artificial intelligence company based in San Francisco, California.
The Role:
As a Data Engineer - Multimodal Systems, you will be a core contributor to creating, collecting, and improving Zyphra's datasets and data pipelines across a variety of modalities. Your work will intersect with almost every team at Zyphra. You will be involved in collecting large-scale datasets and implementing and optimizing highly parallel data pipelines.
You'll Work Across:
- Large-scale data collection across a variety of modalities (text, audio, image)
- Designing and working with highly efficient, parallelized data processing pipelines across modalities
- Designing and running rigorous experimental ablations to demonstrate the impact of new data improvements
What We're Looking For / Requirements:
- Strong implementation and prototyping ability
- Can take an idea from conception to experimentation quickly
- The ability to work well with others in a high-paced research setting
- Can rapidly learn new fields and are excited to implement new ideas
- Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale.
Qualifications / Additional Skills:
- Experience collecting, handling, and processing large datasets
- Experience with parallel Python programming frameworks such as Dask
- Understanding of the state-of-the-art in dataset curation across modalities
- A generally meticulous nature and a strong interest in actually looking at data and sanity checking things
- Strong grasp of proper experimental methodology for running rigorous ablations and other
Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.