FewerJobs.

Machine Learning Research Scientist, Post-Training

Scale AI - San Francisco, CA; Seattle, WA; New York, NY

Posted Feb 11, 2025

Benefits

Parental leave
Not verified
Non-birth-parent leave
Not verified
Family-building benefits
  • Fertility benefits: Not verified
  • Adoption assistance: Not verified
  • Surrogacy assistance: Not verified
Mental health support
Not verified
Relocation assistance
Not verified
Childcare support
Not verified
Learning budget
Not verified
Verification
Not verified
Salary
Not verified (source not recorded; timestamp not recorded)
401(k) match
Not verified


Schedule

Shift type
Not verified
Weekend work
Not verified

Application

Cover letter
Not verified
Assessment
Not verified
Deadline
Not stated

Where they hire

State eligibility is not yet verified.

About this role

Scale works with the industry's leading AI labs to provide high-quality data and accelerate progress in GenAI research. We are looking for Research Scientists and Research Engineers with expertise in LLM post-training (SFT, RLHF, reward modeling). This role will focus on optimizing data curation and eval to enhance LLM capabilities in both text and multimodal modalities.

In this role, you will develop novel methods to improve the alignment and generalization of large-scale generative models. You will collaborate with researchers and engineers to define best practices in data-driven AI development. You will also partner with top foundation model labs to provide both technical and strategic input on the development of the next generation of generative AI models.

You will:
  • Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal modalities.
  • Design and experiment with new approaches to preference optimization.
  • Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness.
  • Publish research findings in top-tier AI conferences.

Ideally you'd have:
  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field.
  • Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning.
  • Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning.
  • Excellent written and verbal communication skills.
  • Published research in areas of machine learning at major conferences (NeurIPS,

Read the full description at job-boards.greenhouse.io. FewerJobs shows a source-linked preview and links to the original posting.

Apply at job-boards.greenhouse.io

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
