
Senior Software Development Engineer (AWS ML), Machine Learning Israel (MLIL) — FLOW sub-team (Fleet Lifecycle & Operational Workflows)

Amazon - Tel Aviv-Yafo, Tel Aviv, ISR

Posted May 11, 2026

Benefits

Parental leave
Not verified (source not recorded; timestamp not recorded)
Non-birth-parent leave
Not verified (source not recorded; timestamp not recorded)
Family-building benefits
  • Fertility benefits: Not verified
  • Adoption assistance: Not verified
  • Surrogacy assistance: Not verified
Mental health support
Not verified
Relocation assistance
Not verified
Childcare support
Not verified
Learning budget
Not verified
Verification
Not verified
Salary
Not verified


Schedule

Shift type
Not verified
Weekend work
Not verified

Application

Cover letter
Not verified
Assessment
Not verified
Deadline
Not stated

Where they hire

State eligibility is not yet verified.

About this role

Annapurna Labs designs silicon and software that accelerates innovation. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.

The MLIL FLOW team is looking for a Senior Software Development Engineer to lead the design and delivery of systems software for our next-generation ML accelerator servers. We build production software to validate, initialize, monitor, and qualify these servers, from first silicon through fleet-scale deployment. We work on the physical systems that execute ML workloads: server bring-up, hardware diagnostics, interconnect validation, power/thermal monitoring, and fleet-scale operations are our bread and butter.

Key job responsibilities
  • Lead the architecture and implementation of hardware validation and diagnostic software for new ML acceleration platforms.
  • Drive technical direction for PCIe validation, power/thermal diagnostics, and stress-testing frameworks that run across manufacturing, vetting, and production environments.
  • Own subsystems end-to-end: from design through implementation, testing, deployment, and operational excellence at fleet scale.
  • Work with Hardware, Manufacturing, and EC2 teams to create coordinated software packages that enable both qualification and rapid deployment.
  • Debug and root-cause complex hardware/software interaction failures on first silicon and production fleet returns; drive root-cause to closure.
  • Build and maintain data pipelines, dashboards, and monitoring systems for fleet health and performance benchmarking.
  • Mentor engineers, define best practices,

Read the full description at www.amazon.jobs. FewerJobs shows a source-linked preview and links to the original posting.

Apply at amazon.jobs

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
