Applied Research Scientist
Sully.AI - US - Remote
Posted Feb 26, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Was this benefit information wrong? Tell us.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Applied Research Scientist US - Remote About Us At Sully.ai , We're Building the Most Impactful Healthcare Company on Earth We believe that access to a great doctor is a basic human right. Today, that's not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system. Our Mission One Human, One Doctor . We enable our customers to staff 30% of their workforce with AI by creating a shared agent architecture for scale and efficiency. All powered by our own patented, world-class models and deployed in real-world care. Key Responsibilities - Build and scale automated evaluation pipelines (LLM-as-judge + human review) with clinical-grade benchmarks. Hard Requirements - Proven experience designing agentic processes and LLM evaluation/benchmarking frameworks. - Strong Python and ML background (PyTorch/TensorFlow, Hugging Face, LangChain/LlamaIndex). - Demonstrated ability to design rigorous experiments and translate findings into production. - Track record of published research or deep applied work in LLMs and agent evaluation. - Strong communication and technical writing skills to articulate complex findings clearly. - First-Month Focus - Audit existing evaluation approaches for clinical and agentic tasks. - Define initial benchmarks and build early automated pipelines. - Partner with engineering to land first set of CI gates for accuracy, factuality, and safety. - 90 Days - Deliver a repeatable evaluation framework with automated pipelines in production. - Demonstrate measurable improvements in robustness, hallucination reduction, or safety. - Publish or present internal research findings that directly shape product reliability. - If you've ever said, “I want to do work
Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
Related jobs
-
Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville
-
Business Analyst (Top Secret cleared)
ICF International INC - Washington, DC
-
Engineering Project Specialist II (Full Time) - United State
Cisco - San Jose, California, US
-
Automation AI Ops Engineer
Cisco - 2 Locations