Staff/Sr. ML Compute Efficiency Engineer
Apple - Santa Clara, United States of America
Posted Jan 23, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified last checked Jun 13, 2026
- Salary
- Not verified not verified - source not recorded; timestamp not recorded
- 401(k) match
- Listed Source: EMPLR_CONTRIB_INCOME_AMT. source Last checked Jun 13, 2026.
Was this benefit information wrong? Tell us.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
Staff/Sr. ML Compute Efficiency Engineer Santa Clara, United States of America Scaling machine learning workloads across thousands of GPUs and TPUs creates challenges that few engineers ever encounter. In Apple's Machine Learning Platform Technologies organization, we build the infrastructure that powers large-scale ML training and inference workloads, bringing together expertise in distributed systems, machine learning infrastructure, and high-performance computing. As a performance engineer in the ML Compute Efficiency team, you'll tackle ambiguous systems challenges, identify inefficiencies and build solutions that maximize accelerator utilization, reduce idle and fragmented capacity, and minimize recovery periods. This includes analyzing accelerator performance, digging into various parallelism techniques, and refining workload scheduling and orchestration across the compute fleet. Characterize ML workload behavior through profiling, benchmarks and metrics. Dive into unfamiliar codebases to prototype changes, evaluate tradeoffs, and build production-ready solutions. Design systems for efficient recovery from failures and preemptions. Create tools to identify and alert bottlenecks across applications and frameworks. Use workload-driven insights to influence next-generation hardware selection and procurement decisions. Collaborate closely with ML researchers and infrastructure engineers to address inefficiencies. Drive impact through hands-on contribution and mentorship. Minimum Qualifications: Experience with large-scale distributed systems for AI/ML workloads running on GPUs or TPUs. Strong software engineering skills with experience developing and optimizing training frameworks (e.g. PyTorch, JAX) using C/C++ or Python. Experience working on cross-functional projects with ML research and infrastructure teams. Familiarity with model architectures and various training techniques. Bachelor's degree in Computer Science or equivalent experience, with 7+ years of industry experience. Preferred
Read the full description at jobs.apple.com. FewerJobs shows a source-linked preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.
Related jobs
-
Mechanical Engineering Manager 2 - 16282
Northrop Grumman - United States-Utah-Roy
-
Senior Software Engineer, Simulation and Integration
Axcelis Technologies INC - Beverly, MA
-
Payload AI&T Lead Staff Systems Engineer
Northrop Grumman - United States-Maryland-Linthicum
-
Senior Software Engineer, Equipment Control
Axcelis Technologies INC - Beverly, MA