AI/ML Infrastructure Engineer
Zensors - San Francisco | OnSite
Posted Jun 10, 2026
Benefits
- Parental leave
- Not verified
- Non-birth-parent leave
- Not verified
- Family-building benefits
-
- Fertility benefits: Not verified
- Adoption assistance: Not verified
- Surrogacy assistance: Not verified
- Mental health support
- Not verified
- Relocation assistance
- Not verified
- Childcare support
- Not verified
- Learning budget
- Not verified
- Verification
- Not verified
- Salary
- Not verified
- 401(k) match
- Not verified
Was this benefit information wrong? Tell us.
Market context
- U.S. role benchmark (BLS OEWS)
- $116,543 U.S. median for this role
- Projected growth (BLS Employment Projections)
- +9.8% - Much faster than average
Matched to SOC 15-1252 - Software Engineering aggregate by role bucket.
Source: U.S. Bureau of Labor Statistics, OEWS, May 2024 and Employment Projections, 2024-2034.
Schedule
- Shift type
- Not verified
- Weekend work
- Not verified
Application
- Cover letter
- Not verified
- Assessment
- Not verified
- Deadline
- Not stated
Where they hire
State eligibility is not yet verified.
About this role
AI/ML Infrastructure Engineer San Francisco | OnSite The AI Infrastructure team at Zensors builds the engine that powers our visual sensing platform. We provide the tools to automate the lifecycle of our AI workflow, including model development, evaluation, optimization, deployment, and monitoring across thousands of video streams. As a Machine Learning Engineer in ML Runtime & Optimization, you will develop technologies to accelerate the training and inference of computer vision models that power smart spaces and cities. Your responsibilities will include: - Optimizing Core ML Pipelines: Identifying key bottlenecks in our current video analytics pipeline and performing in-depth analysis to ensure the best possible performance on current server and edge compute architectures. - Cross-Stack Collaboration: Collaborating closely with AI research and platform engineering teams to optimize core parallel algorithms and influence the design of our next-generation inference infrastructure. - Model Acceleration: Applying advanced model optimization techniques-such as quantization (Int8/FP16), pruning, and layer fusion-to our Vision Transformers (ViTs) and CNNs to maximize throughput and minimize latency. - Building Efficient Operators: Working across the entire ML framework/compiler stack (e.g., PyTorch, CUDA, TensorRT, and NVIDIA DeepStream) to write custom optimized ML operator libraries. - Resource Efficiency: Reducing the compute cost per video stream to enable massive scalability of our SaaS product. - Data Management: Building, improving, maintaining, and operating systems to facilitate the collection, labeling, and use of visual data for ML training. REQUIREMENTS - BS/MS or Ph.D. in Computer Science, Electrical Engineering, or a related discipline. - Strong programming skills in C/C++
Read the full description at jobs.ashbyhq.com. FewerJobs shows a preview and links to the original posting.
Apply link not verified; last-live date unavailable.
What verified means
Verified means a displayed claim has recorded source fields, a user-resolvable source, and a full check date.
Related jobs
-
Systems Engineer - (Execution) - Level 3/4
Northrop Grumman - United States-Alabama-Huntsville
-
Manufacturing Technician - Entry Level
Northrop Grumman - United States-Mississippi-Iuka
-
Staff System Architect
Northrop Grumman - United States-Illinois-Rolling Meadows
-
Manufacturing Technician - Level 2
Northrop Grumman - United States-Massachusetts-Devens