Applied Research Scientist

Sully.AI - US - Remote

Posted Feb 26, 2026

Benefits

Parental leave: Not verified
Non-birth-parent leave: Not verified
Fertility benefits: Not verified
Adoption assistance: Not verified
Surrogacy assistance: Not verified
Mental health support: Not verified
Relocation assistance: Not verified
Childcare support: Not verified
Learning budget: Not verified
Verification: Not verified
Salary: Not verified
401(k) match: Not verified


Schedule

Shift type: Not verified
Weekend work: Not verified

Application

Cover letter: Not verified
Assessment: Not verified
Deadline: Not stated

Where they hire

State eligibility is not yet verified.

About this role

Applied Research Scientist
US - Remote

About Us

At Sully.ai, We're Building the Most Impactful Healthcare Company on Earth

We believe that access to a great doctor is a basic human right. Today, that's not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system.

Our Mission

One Human, One Doctor. We enable our customers to staff 30% of their workforce with AI by creating a shared agent architecture for scale and efficiency. All powered by our own patented, world-class models and deployed in real-world care.

Key Responsibilities

- Build and scale automated evaluation pipelines (LLM-as-judge + human review) with clinical-grade benchmarks.

Hard Requirements

- Proven experience designing agentic processes and LLM evaluation/benchmarking frameworks.
- Strong Python and ML background (PyTorch/TensorFlow, Hugging Face, LangChain/LlamaIndex).
- Demonstrated ability to design rigorous experiments and translate findings into production.
- Track record of published research or deep applied work in LLMs and agent evaluation.
- Strong communication and technical writing skills to articulate complex findings clearly.

First-Month Focus

- Audit existing evaluation approaches for clinical and agentic tasks.
- Define initial benchmarks and build early automated pipelines.
- Partner with engineering to land first set of CI gates for accuracy, factuality, and safety.

90 Days

- Deliver a repeatable evaluation framework with automated pipelines in production.
- Demonstrate measurable improvements in robustness, hallucination reduction, or safety.
- Publish or present internal research findings that directly shape product reliability.

If you've ever said, “I want to do work

Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.

Apply at jobs.ashbyhq.com

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.

Related jobs

Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville

Business Analyst (Top Secret cleared)
ICF International INC - Washington, DC

Engineering Project Specialist II (Full Time) - United State
Cisco - San Jose, California, US

Automation AI Ops Engineer
Cisco - 2 Locations
