# FewerJobs export - 100 curated jobs
Generated: 2026-06-14T11:50:25.195Z
Source: https://fewerjobs.com

## Filters applied
- **q**: Cerebras Systems
- **quality_floor**: default
- **match_401k_strict**: true
- **parental_strict**: true
- **non_birth_strict**: true
- **pto_strict**: true
- **include_older**: false
- **apply_url_verified**: false
- **page**: 1
- **per_page**: 100
- **sort**: relevance

## Jobs
### System Software Engineer (Embedded) - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $175K-$275K
- Posted: 2026-02-17
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628229003
- Excerpt: Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role: As part of the Embedded Software team, you will help build the critical software foundation that powers the Cerebras Wafer Scale Engine (WSE), the world's largest AI processor. Our team owns a diverse range of embedded and system level components that enable the WSE to operate reliably at scale, including microcontroller firmware, wafer level monitoring logic, system administration services, and the Linux platform and BSP layers that keep the entire system running smoothly. This role exists at the intersection of embedded systems, platform engineering, and

### Advanced Technology: AI/ML Research Scientist - Cerebras Systems
- Location: Sunnyvale, CA; Toronto, Ontario, Canada; Vancouver, British Columbia, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-06
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7691353003
- Excerpt: About The Team: Cerebras builds wafer-scale AI processors: single chips delivering tens of PB/s of memory bandwidth and a dataflow architecture that accelerates at a granularity no multi-device system can match. The Advanced Technology Group (ATG) is Cerebras' pathfinding organization. We work ahead of product to explore new architectures, demonstrate breakthrough performance on scientific and AI workloads, and shape the technical roadmap for future Cerebras hardware and software. Our work regularly appears at top-tier venues (Supercomputing, SIAM, IEEE,

### Advanced Technology: R&D Engineer - AI/ML, HPC - Cerebras Systems
- Location: Sunnyvale, CA; Toronto, Ontario, Canada; Vancouver, British Columbia, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-06
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7691343003
- Excerpt: About The Team: Cerebras builds wafer-scale AI processors: single chips delivering tens of PB/s of memory bandwidth and a dataflow architecture that accelerates at a granularity no multi-device system can match. The Advanced Technology Group (ATG) is Cerebras' pathfinding organization. We work ahead of product to explore new architectures, demonstrate breakthrough performance on scientific and AI workloads, and shape the technical roadmap for future Cerebras hardware and software. Our work regularly appears at top-tier venues (Supercomputing,

### Mechanical Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $180K-$200K
- Posted: 2026-05-29
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7752149003
- Excerpt: The Role: As a Mechanical Engineer at Cerebras, you will lead the design of mechanical systems for our next-generation wafer-scale engine. Your responsibilities will include ensuring compliance with specifications, validating manufacturability, and delivering a high-quality product in a fast-paced environment, tackling some of the most challenging problems in the rapidly evolving AI space. In this role, you will develop mechanical infrastructure for Cerebras' custom hardware system. - Rapidly iterate on designs and analysis to inform high level systems trades and steer overall product direction. - Provide comprehensive support for

### Head of IT - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-09
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7677635003
- Excerpt: About The Role: We are hiring a Head of IT to build and run the internal technology backbone of a company that is scaling quickly and operating at the edge of AI hardware and software. This is not a steady-state IT leadership job; it is a build-and-scale role for someone who thrives when the ground is moving. You will own the systems that Cerebras employees, contractors, and executives rely on every day: laptops, identity, SaaS, networking, collaboration, endpoint security, internal support, and the IT controls that a

### Senior Mechanical Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $190K-$230K
- Posted: 2026-01-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7522557003
- Excerpt: About The Role: As a Senior Mechanical Engineer at Cerebras, you will lead the design of mechanical systems for our next-generation wafer-scale engine. Your responsibilities will include ensuring compliance with specifications, validating manufacturability, and delivering a high-quality product in a fast-paced environment, tackling some of the most challenging problems in the rapidly evolving AI space. In this role, you will develop mechanical infrastructure for Cerebras' custom hardware system. - Rapidly iterate on designs and analysis to inform high level systems trades and steer overall product direction. - Provide

### Distributed Software Engineer - Cerebras Systems
- Location: Bengaluru, Karnataka, India; Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-09
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7582638003
- Excerpt: About The Role: Cerebras Systems is a pioneer in large-scale AI supercomputers. These multi-exaflop supercomputers are deployed in some of the biggest datacenters. These supercomputers are built using our Wafer-Scale Cluster technology, a cluster of several Wafer Scale Engine (WSE) chips. The Cluster engineering team is responsible for delivering all cluster-related software. Responsibilities: - Automate bare-metal configuration of networking, OS, and application software in large clusters of Cerebras WSE, servers, and switches. - Additional push button

### Software Architect – Manufacturing Test - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $204K-$245K
- Posted: 2026-05-26
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7749004003
- Excerpt: About the Role: As the Software Architect for Cerebras' manufacturing test platform, you will lead a team of Full Stack Engineers in designing and delivering the end-to-end software systems that power manufacturing test across every stage of our product lifecycle, from individual components to complete Cerebras systems. The platform spans both cloud infrastructure and physical client-server infrastructure deployed across our manufacturing facilities, and you will own the technical vision, architecture, and roadmap across the full stack. Working closely with hardware design, test engineering, operations,

### Sr. Technical Staff - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $250K-$275K
- Posted: 2026-05-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7728797003
- Excerpt: Cerebras Systems Inc. has multiple openings for Sr. Technical Staff. Title: Sr. Technical Staff. Job Duties: - Post-silicon validation of Cerebras Wafer Scale Engines. Test and debug issues on new silicon. - Test, analyze, and characterize high-speed serial interfaces to verify compliance with hardware specifications, record performance data, and recommend design modifications to optimize functionality. - Work with the silicon and operations team to test, bring up, and run burn-in on wafer-scale systems. - Support manufacturing operations to utilize the wafer bring-up flow. Perform wafer

### Infrastructure Hardware Technical Program Manager (Server and Network Systems) - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-19
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7636808003
- Excerpt: As an Infrastructure Hardware Technical Program Manager (Server and Network Systems) on the Cluster Architecture Team, you will drive end-to-end delivery of server and network platform programs across Cerebras CS-3-based AI clusters, from requirements and vendor selection through lab bring-up, qualification, and production rollout. You will be the execution owner for multi-team programs spanning OEM/ODM partners, component vendors, internal software/runtime teams and architects, validation/QA, and deployment/operations. This role is intentionally technical: you must understand server, network, and

### Software Development Engineer in Test (Cloud) - Cerebras Systems
- Location: Bengaluru, Karnataka, India (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-11
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7730279003
- Excerpt: About The Team: The Cloud Quality team is responsible for the confidence behind every production release shipped to Cerebras Inference Cloud. We work closely with platform, infrastructure, ML systems, and product engineering teams to ensure that rapid iteration never comes at the expense of customer trust. Our environment spans distributed cloud systems, multi-region deployments, APIs, orchestration layers, and hardware-backed inference services. We are scaling quickly. The systems are growing in complexity, traffic is increasing rapidly, and release velocity remains high. We need engineers

### Senior Hardware Technical Program Manager - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $180K-$230K
- Posted: 2026-03-23
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7675820003
- Excerpt: The Role: As a Senior Hardware Technical Program Manager at Cerebras, you will spearhead operational excellence for our high-performance AI compute systems and data centers. You will own the end-to-end hardware schedule for design and engineering improvements, report on engineering issues, and define mitigation strategies. You will own the schedule, implementation, and software integration of hardware changes. You will collaborate closely with electrical and system engineering, manufacturing, supply chain, and system software to drive end-to-end schedule of improvements to our wafer-scale engine supercomputers. Your role

### Full Stack LLM Engineer - Cerebras Systems
- Location: Toronto, Ontario, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2025-07-18
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/6654277003
- Excerpt: About the Role: We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible for rapidly bringing up state-of-the-art open-source models (like LLaMA, Qwen, etc.) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI

### Sr. Member of Technical Staff - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $230K-$250K
- Posted: 2026-05-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7728796003
- Excerpt: Cerebras Systems Inc. has multiple openings for Sr. Member of Technical Staff. Title: Sr. Member of Technical Staff. Job Duties: - Design and develop software features that support system resiliency and high availability, including automated recovery mechanisms and fault-tolerant architecture across distributed environments. - Develop and maintain cloud-based deployment workflows for AI inference software using AWS tools and services to support low-latency and scalable system performance. - Develop Python-based scripts and APIs to streamline data preprocessing, inference execution, and post-processing for real-time inference tasks. -

### Advanced Technology: Compiler Engineer - Cerebras Systems
- Location: Sunnyvale, CA; Vancouver, British Columbia, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-30
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7683606003
- Excerpt: About The Team: Cerebras builds wafer-scale AI processors: single chips delivering tens of PB/s of memory bandwidth and a dataflow architecture that accelerates at a granularity no multi-device system can match. The Advanced Technology Group (ATG) is Cerebras' pathfinding organization. We work ahead of product to explore new architectures, demonstrate breakthrough performance on scientific and AI workloads, and shape the technical roadmap for future Cerebras hardware and software. Our work regularly appears at top-tier venues (Supercomputing, SIAM, IEEE, and NeurIPS) and

### Manufacturing Test Development Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $170K-$210K
- Posted: 2026-01-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7524095003
- Excerpt: About The Role: As a Test Development Engineer on our manufacturing team, you will be working with diagnostics, system design, manufacturing, and quality teams to develop test automation solutions for our products from PCBA to system level. You will also work closely with our contract manufacturing sites to fulfill a complete test automation solution for manufacturing test data, yield improvement, and traceability. Responsibilities: - Develop and design manufacturing test automation software/scripts to test Cerebras products from PCBA to system level. - Develop and implement GUI solutions

### Senior ML Systems Engineer - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-12
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7622330003
- Excerpt: About the Role: We are seeking a versatile and experienced engineer to join our SOTA Training Platform team. This team is responsible for rapidly bringing up state-of-the-art open-source models (such as LLaMA and Qwen) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for

### Performance & Reliability Engineer - Cerebras Systems
- Location: Sunnyvale, CA; Toronto, Ontario, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2025-11-25
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7528266003
- Excerpt: About The Role: Join Cerebras as a Performance & Reliability Engineer within our innovative Co-Design and Next Generation Team. Our groundbreaking CS-3 system has set new benchmarks in high-performance ML training and inference solutions. It leverages a dinner-plate-sized chip with 44 GB of on-chip memory to surpass traditional hardware capabilities. This role focuses on characterizing and optimizing the performance and reliability of state-of-the-art AI models running on Cerebras' breakthrough hardware. Responsibilities - Characterize and enhance the performance and reliability of advanced ML hardware/software

### Distributed Systems Cluster Security Software – Engineering Lead - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $140K-$240K
- Posted: 2025-03-24
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/6519874003
- Excerpt: About The Role: In this role, you will be the security czar for the Cerebras AI cluster product. Such AI clusters have hundreds of wafer-scale accelerator systems, thousands of high-end servers, and several thousand networking ports, including switches. There will also be network-attached storage, all in a large-scale datacenter. You will ensure that Cerebras' large-scale AI clusters are secured through first-principles, best-practice, security-first engineering. A Cerebras cluster involves complex HW components, networking and a vertically integrated cluster management software

### ML Research Engineer (Inference) - Cerebras Systems
- Location: Bengaluru, Karnataka, India (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7694185003
- Excerpt: About The Role: As a Research Engineer on the Inference ML team at Cerebras Systems, you will adapt today's most advanced language and vision models to run efficiently on our flagship Cerebras architecture. You'll work alongside ML researchers and engineers to design, prototype, validate, and optimize models, gaining end-to-end exposure to cutting-edge inference research on the world's fastest AI accelerator. You will focus on pushing the frontier of speculative decoding, large-model pruning and compression, sparse attention, and sparsity-driven techniques to deliver low-latency,

### Cluster UI Full Stack, Engineering Lead - Cerebras Systems
- Location: Bengaluru, Karnataka, India; Toronto, Ontario, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-28
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7608975003
- Excerpt: About the Role: In this role, you will be building a world-class, UI-based, large-scale cluster management portal. This portal will act as a one-stop shop for all operations and maintenance of Cerebras clusters, such as cluster bringup deployment (day0/1/2), job management, and health management, to name a few. Cerebras AI clusters may have thousands of wafer-scale accelerator systems, several thousand high-end servers, and several thousand networking ports, including switches. Responsibilities - Be the primary engineering face and owner

### IT SRE Team Lead - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-09
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7629636003
- Excerpt: About The Role: We are seeking an experienced IT SRE Team Lead to build and run the reliability function for Cerebras' internal technology estate. The IT SRE Team Lead will be responsible for the availability, performance, and operational quality of the systems Cerebras employees rely on every day, including identity, endpoint management, collaboration, SaaS, and internal networking. The right candidate will bring a software engineering mindset to IT operations, treating corporate infrastructure as code, with measurable SLOs, automated remediation, and a ruthless focus on eliminating toil.

### Senior Technical Program Manager – AI Infrastructure, Site Operations - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2025-12-16
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7522548003
- Excerpt: About The Role: This Sr. TPM role owns site and data center operations programs supporting Cerebras' AI Cloud and customer deployments. The position sits at Sunnyvale HQ and works closely with Hardware Engineering, Inference Engineering, and Operations leadership to ensure Cerebras systems are reliably deployed, operated, and scaled. This is a highly technical, execution-focused TPM role with strong emphasis on operational readiness, cross-functional coordination, and metrics/KPIs. Responsibilities - Own end-to-end technical programs for data center and site operations - Act as

### Security & IT General Opportunities - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-28
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7751790003
- Excerpt: About the Team: Our IT & Security team sits at the intersection of security, infrastructure, and cutting-edge AI systems. The team plays a critical role in ensuring Cerebras' environments are secure, reliable, scalable, and ready to support customers operating at the frontier of AI. We are looking for people who care deeply about operational excellence, security best practices, automation, and building systems that can support a rapidly growing organization. What You May Work On: Depending on the role and team needs, you

### CoDesign & NextGen - New College Grad - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $145K-$155K
- Posted: 2026-01-07
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7580399003
- Excerpt: About The Role: Engineers in the CoDesign and NextGen organization work at the interface of software and hardware, helping with everything from high-performance kernel design and next-generation ASIC performance modeling and tuning to new system bringup and software tuning, system robustness, and simulations. As a new engineer, you'll be expected to learn the Cerebras products and platforms end to end, starting at the base layer of programming the Wafer Scale Engine through kernel development, modeling performance for our software products and validating the work

### Applied Machine Learning Research Scientist - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-05
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7655035003
- Excerpt: About The Role: As an Applied Machine Learning Research Scientist at Cerebras, you will play a key role in turning modern machine learning techniques into scalable, high-performance systems. This role sits at the intersection of modeling and systems, focused not on publishing new algorithms but on understanding how they work and making them run effectively at scale. Your work will directly impact how large language models (LLMs) are trained, optimized, and deployed on one of the most advanced AI platforms in the

### Full Stack Engineer – Manufacturing Test - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $175K-$220K
- Posted: 2026-02-25
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628224003
- Excerpt: About the Role: As a Full Stack Engineer focusing on Cerebras' manufacturing test platform, you will design, build, and maintain a comprehensive test software solution for all stages of manufacturing - from individual components to complete Cerebras systems. You will collaborate cross-functionally with hardware design, engineering, operations, and data analytics teams to develop user interfaces and data processing frameworks that directly impact manufacturing efficiency, quality, and scalability. Responsibilities - Collaborate with hardware engineers and test developers to create frameworks that facilitate the development, validation,

### Prognostics & Health Monitoring Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $150K-$250K
- Posted: 2026-04-30
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7720309003
- Excerpt: Role Summary: Quality, reliability, and uptime are foundational to scaling Cerebras systems. We are seeking an engineer to define and build our prognostics and health monitoring (PHM) capability, developing frameworks to monitor, assess, and predict hardware health across our fleet. In this role, you will transform telemetry and operational data into actionable insights and automated responses, enabling early detection of degradation, accurate failure prediction, and proactive actions to keep systems highly available, performant, and resilient. This is a highly cross-functional role spanning reliability engineering, data science,

### Senior ML Software Engineer - Integration & Quality - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-05
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7620384003
- Excerpt: About the Role: We are looking for a Software Engineer to join the ML Integration and Quality team at Cerebras. This team sits at the intersection of machine learning infrastructure, distributed systems, and hardware/software co-design. In this role, you will help integrate and validate the software stack that powers the Cerebras AI platform, ensuring large-scale ML workloads run reliably and efficiently across our systems. You will work closely with engineers across runtime, compiler, kernel, and hardware teams to debug

### Engineering Manager, Inference ML Runtime - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-24
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7677112003
- Excerpt: About the Role: The Inference ML Engineering team at Cerebras builds the runtime, APIs, and systems that power the fastest generative AI inference platform in the world. As an Engineering Manager, Inference ML Runtime, you will lead a team responsible for designing and scaling the systems that enable seamless execution of state-of-the-art AI models on Cerebras hardware. You will operate at the intersection of machine learning, distributed systems, and high-performance runtime engineering, translating cutting-edge research into production-ready infrastructure to serve

### Senior Product Marketing Manager, AI Inference - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-24
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628238003
- Excerpt: About The Role: The AI conversation moves fast - new models ship weekly, benchmarks shift overnight, and the community's attention resets constantly. Cerebras has a massive speed advantage in inference, and this role exists to make sure that advantage is visible, understood, and top-of-mind wherever developers and AI builders are paying attention. As Senior Product Marketing Manager, you'll own realtime product marketing for Cerebras inference. You'll create high-impact technical content - blog posts, benchmark analyses, social threads - that positions Cerebras at the center

### ML Performance Benchmarking Engineer - Cerebras Systems
- Location: Toronto, Ontario, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-18
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7669166003
- Excerpt: About The Role: The Inference Core Platform group is at the heart of Cerebras' mission to deliver the world's fastest AI inference. Our team builds the foundational software and hardware infrastructure that powers low-latency, high-speed, high-throughput deployment on the Cerebras Wafer-Scale Engine (WSE). We are responsible for the full stack, from model compilation and scheduling down to custom hardware kernels and driver development. The ML Performance Benchmarking team plays a pivotal role in shaping the performance and scalability of AI inference on one of the most

### Senior Performance Engineer, Inference - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-13
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7698266003
- Excerpt: About The Role We are hiring a Senior Performance Engineer to join our Product team. You are an expert on state-of-the-art inference performance and will serve as our resident expert on how Cerebras stacks up against alternative inference providers on both price and performance. This role sits at the intersection of performance benchmarking from first principles and competitive intelligence. The role has two core pillars: - Performance Benchmarking You will build, run, and maintain reproducible benchmarks that measure Cerebras inference performance for real customer workloads. This

### Director / Senior Director, Critical Facility Operations - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-06-03
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7762028003
- Excerpt: The Role At Cerebras, we build systems that redefine the scale of compute. That ambition extends beyond silicon into the infrastructure that powers it. We are looking for a Director / Sr. Director of Critical Facility Operations to lead the environments that enable next-generation AI workloads. This leader will own availability, operational integrity, and performance across a colocation-driven data center footprint-ensuring our infrastructure operates with precision, predictability, and zero room for error. This is not a traditional facilities role. You will operate at

### Staff Inference ML Runtime Engineer - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2025-11-25
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7523546003
- Excerpt: About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our fast generative inference solution through simple APIs powered by a distributed runtime that runs on large clusters of our own hardware. Our mission is to empower enterprises, developers, and researchers to unlock the full potential of our platform, leveraging its performance, scalability, and flexibility. The team works closely with cross-functional groups, including compiler developers, cluster orchestrators, ML scientists, cloud architects, and product teams, to deliver high-impact

### Performance Engineer - Cerebras Systems
- Location: Toronto, Ontario, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2025-09-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7112048003
- Excerpt: About The Role Join Cerebras as a Performance Engineer within our innovative Runtime Team. Our groundbreaking CS-3 system, hosted by a distributed set of modern and powerful x86 machines, has set new benchmarks in high-performance ML training and inference solutions. It leverages a dinner-plate sized chip with 44GB of on-chip memory to surpass traditional hardware capabilities. This role will challenge and expand your expertise in optimizing AI applications and managing computational workloads primarily on the x86 architecture that run our Runtime driver. Responsibilities - Focus on CPU

### AI Engineer, Model Quality and Performance - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-15
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7735288003
- Excerpt: About The Role You'll own model quality and performance for Cerebras' inference offerings. You will define what "good" looks like across the models we serve, building AI-driven systems to measure it at scale, and translating those signals into artifacts our customers and product team actually use. You'll use AI agents to spin up custom eval suites per customer use case, mine trajectories for representative test data, automate the repetitive parts of release qual, and help build performance datasets and benchmarking workflows for customer use

### Lead RTL Design Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $175K-$275K
- Posted: 2025-11-13
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7527591003
- Excerpt: About The Role As a lead front-end design engineer, you will be a key part of the world-class team designing and developing the next generations of the Cerebras Wafer Scale Engine (WSE). This role requires deep expertise in RTL design and integration, with a strong focus on delivering high-performance, power-efficient, and scalable solutions. The role also requires close collaboration and management of external ASIC vendor. You will collaborate closely with the design verification, physical design, software and system teams to bring innovative semiconductor architectures from concept

### Manager, Business Operations - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-29
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7720168003
- Excerpt: About The Role This role is a high-leverage seat in that build and a deliberate apprenticeship into operating leadership. You will create the business operations, analytics, and execution system that keeps decision-ready insight flowing as the company scales. You will be embedded with operators, turning messy operational reality into durable processes, clear metrics, and repeatable operating rhythms. You will report to the Head of FP&A and work in close partnership with the COO and operations leadership. Why now Cerebras is scaling to meet accelerating demand for fast

### Manufacturing Bring-up Engineer L2 - Cerebras Systems
- Location: Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada (unspecified)
- Salary: $170K-$230K
- Posted: 2026-03-02
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628218003
- Excerpt: The Role We are seeking a highly skilled and motivated Manufacturing Bring-up Engineer to join our team. As the Manufacturing Bring-up Engineer you will support our system level bring-up process execution, implementation, and evolution in the manufacturing pipeline. This is a high visibility role that requires strong technical expertise, coordination, and collaboration to deliver our product from manufacturing to the customer. Responsibilities - Support the Cerebras manufacturing bring-up process execution to configure, test, and validate system performance prior to customer

### Senior Front End Design Engineer (Microarchitecture) - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $250K-$300K
- Posted: 2026-06-04
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7763907003
- Excerpt: About The Role As a senior front-end design engineer, you will be a key part of the world-class team designing and developing the next generations of the Cerebras Wafer Scale Engine (WSE). This role requires deep expertise in RTL design and integration, with a strong focus on delivering high-performance, power-efficient, and scalable solutions. You will collaborate closely with the design verification, physical design, software and system teams to bring innovative semiconductor architectures from concept to production, addressing the unique challenges of building WSE systems.

### Lead Full Stack Machine Learning Engineer - Cerebras Systems
- Location: Bengaluru, Karnataka, India (unspecified)
- Salary: Not disclosed
- Posted: 2026-06-10
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7767933003
- Excerpt: About The Role This team's principal responsibility is to rapidly bring up state-of-the-art open-source models, frameworks and data engineering. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications. Responsibilities - Contribute to the end-to-end bring up of frameworks for RL, inference serving, ML models on Cerebras CSX systems. -

### Senior/Staff Engineer : Post Silicon- Bring Up - Cerebras Systems
- Location: Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada (unspecified)
- Salary: $175K-$275K
- Posted: 2026-02-16
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628233003
- Excerpt: The Role: In this exciting role, you will be responsible for bring up and optimizations of Cerebras's Wafer Scale Engine (WSE). Suitable candidate will have experience delivering end to end solutions working closely with teams across chip design, system performance, software development and productization. Responsibilities: - On Wafer Scale Engines, develop and debug flows that embed well tested and deployable optimizations in production processes to reduce time and costs - Work on refining AI Systems across H/W-S/W

### Principal ML Investigator - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2025-12-12
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7560555003
- Excerpt: About The Role Cerebras is adding an ML team that can focus on a new ML effort that can align with existing teams. We are seeking a principal investigator who will partner with our ML leaders to formulate the new effort and to build up the new team and capabilities. This new team would coordinate with our current ML teams: Field ML, which works directly with customers, Applied ML, which builds new ML capabilities and applications for customers, and Core ML, which adapts ML algorithms to find

### Product Manager, Strategic Verticals - Cerebras Systems
- Location: San Francisco, California, United States (unspecified)
- Salary: Not disclosed
- Posted: 2025-09-23
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/6585289003
- Excerpt: Our customers span leading AI Native companies, Fortune 500 Enterprises, Sovereign AI and Federal programs, and leading research institutions. Our mission is to deliver the platform that unlocks the next generation of AI applications, providing the fundamentally new capability to leverage the most intelligent models at real-time serving speeds. Why Cerebras? Here at Cerebras, we have built the world's first wafer-scale compute platform and software stack, purpose-designed to accelerate generative AI by over 10-20x what is possible on legacy processors today. AI developers

### Principal Engineer, AI Inference Reliability - Cerebras Systems
- Location: Remote, California, United States; Sunnyvale CA or Toronto Canada (remote)
- Salary: Not disclosed
- Posted: 2025-10-29
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7511484003
- Excerpt: In late 2024, we launched Cerebras Inference, the fastest Generative AI inference service in the world, over 10 times faster than GPU-based hyperscale cloud inference. Since launch, we've scaled to meet the surging demand from AI labs, enterprises, and a thriving developer community. In October 2025, we announced our series G funding, raising $1.1 billion USD to accelerate the expansion of our products and services to meet global AI demand. About the team The Cerebras Inference team's mission

### Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai - Cerebras Systems
- Location: Europe; Remote, California, United States; UAE (remote)
- Salary: Not disclosed
- Posted: 2025-10-31
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7513711003
- Excerpt: About the Role: Would you like to participate in creating the fastest Generative Models inference in the world? Join the Cerebras Inference Team to participate in development of unique Software and Hardware combination that sports best inference characteristics in the market while running largest models available. Cerebras wafer scale inference platform allows running Generative models with unprecedented speed thanks to unique hardware architecture that provides fastest access to local memory, ultra-fast interconnect and huge amount

### Software Engineer, Kernel Reliability - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-05
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7647330003
- Excerpt: About The Role We're looking for a deeply technical, hands-on software engineer to join our on-field Kernel Reliability team. You'll help tackle a critical challenge: improving the reliability of our advanced compute clusters and the underlying inference, training, and internal production services. In this role, you'll work close to the code and design solutions that will scale with our rapidly growing system production and software service offerings. If you have strong fundamentals in systems, debugging, and failure analysis-and enjoy building tools and solving

### Senior Runtime Engineer - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2025-10-28
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7510635003
- Excerpt: Senior Runtime Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are building the next generation of large-scale AI systems that power training and inference workloads at unprecedented scale and efficiency. You will design and develop high-performance distributed software that orchestrates massive compute and data pipelines across heterogeneous clusters. Your work will push the limits of concurrency, throughput, and scalability-enabling efficient execution of models at massive scale. This role sits at the intersection of systems engineering and machine learning performance, demanding both architectural depth and low-level implementation skills. You will help

### Member of Technical Staff (Software Engineer) - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $170K-$175K
- Posted: 2026-05-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7728798003
- Excerpt: Member of Technical Staff (Software Engineer) Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Cerebras Systems Inc. has multiple openings for Member of Technical Staff (Software Engineer) Title : Member of Technical Staff (Software Engineer) Job Duties - Implement infrastructure to support high-performance, low-latency inference service. - Deploy and configure Kubernetes services to ensure scalability and reliability of inference workloads. - Optimize resource allocation and auto-scaling policies to handle variable inference demand while minimizing operational costs. - Integrate inference services with containerized environments using Docker and Kubernetes for orchestration. - Ensure high availability and fault tolerance by implementing

### Compute Server Platform Architect - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-18
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7635821003
- Excerpt: Compute Server Platform Architect Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a Compute / Server Platform Architect on the Cluster Architecture Team, you will own the server-side platform architecture that enables Cerebras CS3-based AI clusters (training and inference) to deliver predictable performance, scalability, and reliability. Our accelerators are network-attached, so the x86 server fleet is a first-class part of the end-to-end system: it runs critical-path runtime functions (for example orchestration, prompt caching, and IO/control services) and must be co-designed with software for token-level latency, throughput, and cost efficiency. You will

### ML Systems Performance Engineer - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-21
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7599454003
- Excerpt: ML Systems Performance Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role Engineers on the inference performance team operate at the intersection of hardware and software, driving end-to-end model inference speed and throughput. Their work spans low-level kernel performance debugging and optimization, system-level performance analysis, performance modeling and estimation, and the development of tooling for performance projection and diagnostics. Responsibilities - Build performance models (kernel-level, end-to-end) to estimate the performance of state of the art and customer ML models. - Optimize and debug our kernel micro code and compiler algorithms to elevate

### ML Software Tool Development Engineer - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-17
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7634141003
- Excerpt: ML Software Tool Development Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Responsibilities: - Lead the design and implementation of system-level debugging, validation, and observability platforms. - Develop automated systems for collecting and analyzing numerical, and execution anomalies. - Create visualization and analysis tools to enable efficient root-cause investigation. - Build frameworks for failure classification, regression detection, and anomaly monitoring. - Extend compilers, runtimes, and programming interfaces to support advanced profiling and instrumentation. - Improve system bring-up, low-level debug, and validation workflows. - Partner cross-functionally with compiler, hardware, firmware, runtime, and infrastructure teams. -

### Applied AI/ML Scientist - Cerebras Systems
- Location: UAE (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-14
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7588502003
- Excerpt: Applied AI/ML Scientist UAE Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As an Applied AI Scientist in the FieldML team, you will be responsible for developing and customizing large language models and more broadly large-scale deep learning models to solve specific customer problems. You won't just advise; you will build. You will bridge the gap between state-of-the-art research and real-world applications by helping customers harness the power of the Cerebras Wafer-Scale Engine (WSE) for their AI initiatives. We are looking for experienced AI Scientists who are passionate about the "applied" side of machine learning - those

### Network Architect - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2025-11-13
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7527601003
- Excerpt: Network Architect Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a Network Architect on the Cluster Architecture Team, you will work closely with the vendors, internal networking teams and industry peers to develop best-in-class front-end datacenter and interconnect architecture of the current and future generations of the Cerebras AI clusters. You will be responsible for developing proof-of-concept of new network designs and features enabling resilient and reliable network for AI workloads. The role will require cross-functional collaboration and interaction with diverse hardware components (e.g., network devices and the Wafer-Scale Engine) as well as software

### QA Lead (ML Integration and Quality) - Cerebras Systems
- Location: Bengaluru, Karnataka, India (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-03
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7647324003
- Excerpt: QA Lead (ML Integration and Quality) Bengaluru, Karnataka, India Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As an ML QA Lead, you ensure quality of Cerebras SW across all supported ML workloads and workflows. You will be part of MIQ (ML Integration and Quality) team that will focus on SW components feature testing, ML training accuracy and performance, pre deployment/production validation, validating customer workloads and workflows. As part of this role, you will influence the best testing practice, good debugging methodology, effective cross team communication and advocate for world-class products. Responsibilities - Drive quality of various

### Manager - Data Center Asset tracking and Accounting - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-13
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7611652003
- Excerpt: Manager - Data Center Asset tracking and Accounting Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role The Manager will be primarily responsible for tracking and recovery of Cerebras's data center infrastructure and assets globally through asset end of life. This leadership position requires an organized, highly motivated professional with the ability to drive operational and process improvements. The individual will perform a variety of tasks ranging from routine to complex analysis and play a critical part in asset tracking operations, including asset dispositions, transfers, and periodic cycle counts. This role requires comfort operating in a

### Head of Data Center Acquisition - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7728938003
- Excerpt: Head of Data Center Acquisition Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Why now Demand for Cerebras inference is high and climbing. We need ever-more power-ready data center capacity to meet demand for the world's fastest inference solution. Project facts often change as providers work through power, site, capital, design, security, and schedule issues. This work shapes customer delivery, capital use, and risk for years. The job demands a hands-on deal leader who can separate real capacity from optimistic claims and keep priority transactions on track. Role at a glance - Own the data center capacity pipeline

### AI Models, Product Manager - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-15
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7593896003
- Excerpt: AI Models, Product Manager Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Own the Future of AI Inference Cerebras powers the world's fastest AI inference. As the Product Manager for AI Models, you'll lead the strategic model portfolio that defines our product - deciding which models ship, how they perform, and how the world discovers them. You'll partner directly with leading AI labs, drive launches that shape the industry, and ensure every model on our platform delivers exceptional quality at unprecedented speed. What You'll Own Strategic Model Portfolio - Own the models roadmap: decide which frontier and open-source

### Staff Software Engineer, Inference Cloud - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2024-07-12
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/6053533003
- Excerpt: Staff Software Engineer, Inference Cloud Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Location: Sunnyvale We're hiring a Staff Engineer to own major areas of the architecture of our Inference Cloud Platform. This team owns the cloud layer behind our Inference Service, with responsibility for availability, latency, reliability, and global scale. This is a hands-on IC role for an engineer who wants to work on the hardest distributed systems problems in the stack: multi-region traffic architecture, graceful degradation under bursty AI workloads, performance at high QPS, and the operating model for a platform that has to stay

### AI Infrastructure Operations Engineer - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2024-03-06
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/5913347003
- Excerpt: AI Infrastructure Operations Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are seeking a highly skilled and experienced AI Infrastructure Operations Engineer to manage and operate our cutting-edge machine learning compute clusters. These clusters would provide the candidate an opportunity to work with the world's largest computer chip, the Wafer-Scale Engine (WSE), and the systems that harness its unparalleled power. You will play a critical role in ensuring the health, performance, and availability of our infrastructure, maximizing compute capacity, and supporting our growing AI initiatives. This role requires a deep

### Manufacturing Linux Network Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-01
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7629637003
- Excerpt: Manufacturing Linux Network Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are seeking an experienced Manufacturing Linux / Network Engineer to design, implement, and maintain robust IT and network infrastructure across our manufacturing facilities. The ideal candidate brings deep expertise in Linux systems administration (Red Hat / Rocky Linux), network security (Palo Alto firewalls), storage infrastructure, CI/CD pipelines (Jenkins), and infrastructure automation (Ansible). This role sits at the intersection of enterprise IT and plant-floor operations, and is critical to delivering the high availability, security, and performance that modern manufacturing environments demand. Responsibilities -

### Business Operations Lead, Datacenters - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-06-04
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7763752003
- Excerpt: Business Operations Lead, Datacenters Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role This is a high-leverage business operations role embedded in the Datacenter organization. You will partner directly with 5-10 senior leaders to build the operating system that keeps the organization aligned, decision-ready, and executing at speed. The job sits at the intersection of planning, operating cadence, communications, and execution. You will turn complex, fast-moving work into clear priorities, durable mechanisms, and actionable insight. This is not a staff role from the sidelines-you will be in the middle of the work, helping leaders drive performance,

### Engineering Manager, Kernel Reliability - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-08
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7582897003
- Excerpt: Engineering Manager, Kernel Reliability Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras, to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role We're looking for a deeply technical, hands-on engineering leader for our on-field Kernel Reliability team. You will lead a high performing team to tackle a critical challenge: improving the reliability of our advanced compute clusters and the underlying inference, training, and internal production services. In this role, you'll set the technical vision while staying close to the code and designing solutions that will scale to our exponentially growing system production and software service offerings. If you have proven expertise in software

### Design Validation Test - Lead/Principal Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $175K-$275K
- Posted: 2026-03-04
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7652064003
- Excerpt: Design Validation Test - Lead/Principal Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Role Summary We are seeking a hands‑on DVT Technical Lead (Individual Contributor) to own and drive the Design Validation Test (DVT) process end‑to‑end across complex electrical engineering boards and full systems. You will define validation strategy, build test plans and infrastructure, lead deep debug and root‑cause analysis (RCA), and drive closure through design changes and re‑test. The domain includes difficult power delivery technology, fast high‑speed I/O, and electro‑mechanical systems with thermal, optics, and high‑power constraints. People management is not required (mentoring is a plus).

### Site Reliability Engineer - Ops & Automation - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2025-10-14
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7497712003
- Excerpt: Site Reliability Engineer - Ops & Automation Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role We are building a high-performance SRE function to support one of the world's fastest-growing AI inference services, powered by the Wafer-Scale Engine (WSE), helping deliver infrastructure for frontier-class models from leading model builders such as OpenAI. This role offers immediate ownership of real production systems at a growing scale, direct mentorship from seasoned engineers, and close collaboration with incoming Staff SREs who will focus on long-term automation. After ~1 month of shared hands-on operations with the Staff

### Senior Quality Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $175K-$275K
- Posted: 2026-04-30
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7720337003
- Excerpt: Senior Quality Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role: We are looking for a hands-on Senior Quality Engineer to drive manufacturing quality for printed circuit board assemblies (PCBAs), subassemblies, and system-level products across our contract manufacturers and suppliers. This role is responsible for ensuring robust manufacturing processes, superior workmanship, high production yields, and effective corrective actions throughout the product lifecycle. The ideal candidate brings deep expertise in SMT manufacturing, IPC standards, electronics assembly processes, and statistical quality methods. Working closely with Manufacturing Engineering, Supplier Quality, Reliability Engineering, Design Engineering, and contract manufacturers, you

### Director, Strategic Finance - Corporate FP&A - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-21
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7742830003
- Excerpt: Director, Strategic Finance - Corporate FP&A Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role This role is a front-row seat in a company at the leading edge of AI compute. You will create the corporate FP&A operating system for the company: long-range planning, annual planning, forecasting, executive reporting, guidance support, and SG&A business partnering that keeps decision-ready insight flowing as the company scales. The scope can stretch for a proven FP&A leader seeking a sharper public-company platform or a high-potential leader ready to pull the function forward from day one. You will work across leadership

### Physical Design Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $230K-$280K
- Posted: 2026-06-10
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7764007003
- Excerpt: Physical Design Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a member of our tight knit physical design team, you will be working on the design and analysis of 3D integrated products. This role involves a combination of traditional ASIC/SoC physical design skills, packaging, power, clock and cooling analysis. You will work closely with the architecture and RTL team to do R&D on novel concepts for 3D integration. Skills and Qualifications Required - 10+ years of physical design/verification experience. - Strong knowledge of block level and full-chip physical verification methodology. - Expert at

### Senior Accountant - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-26
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7748840003
- Excerpt: Senior Accountant Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role The Senior Operations Accountant will be a key contributor to the Finance and Operations teams, reporting to the Senior Cost Accounting Manager. This role is designed for a detail-oriented professional who excels in fast-paced environments and can derive insights from complex manufacturing and inventory data. You will support the end-to-end execution of cost accounting and fixed asset processes. Responsibilities Cost Capitalization & Inventory Accounting - Execute monthly inventory reconciliations and post all related journal entries. - Perform calculations for inventory rebates, wafer manufacturing yield losses,

### Senior GL Accountant - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-04-09
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7694944003
- Excerpt: Senior GL Accountant Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are seeking a detail-oriented and experienced Senior GL Accountant to join our corporate accounting team. This role is responsible for participating and maintaining general ledger accounting operations, ensuring accurate and timely financial reporting in compliance with U.S. GAAP. Responsibilities - Perform the monthly, quarterly, and annual financial statement close processes globally in accordance with US GAAP, including preparation and review of journal entries, account reconciliations, and variance analysis for cash, prepaids, accruals, inter-company, OPEX and various other accounts. - Support the external reporting

### Kernel Engineer - Cerebras Systems
- Location: Bengaluru, Karnataka, India (unspecified)
- Salary: Not disclosed
- Posted: 2025-10-06
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7486714003
- Excerpt: Kernel Engineer Bengaluru, Karnataka, India Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a Kernel Engineer on our team, you will develop high-performance software solutions at the intersection of hardware and software, developing high-performance software for cutting-edge AI and HPC workloads. Your focus will be on implementing, optimizing, and scaling deep learning operations to fully leverage our custom, massively parallel processor architecture. You will be part of a world-class team responsible for the design, performance tuning, and validation of foundational ML and HPC kernels. This includes building a library of parallel and distributed algorithms that maximize

### Electrical Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $150K-$260K
- Posted: 2026-02-19
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628237003
- Excerpt: Electrical Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Responsibilities - Lead printed circuit board design through all development stages: from definition to implementation, bring-up, qualification, and production release. - Full responsibility for electrical specification, schematic design, components selection, and layout considerations. - Extensive lab bring-up and debugging, including developing automated benchtop setups for board characterization. - Collaborate with various design and operations teams: manufacturing operations & test engineering, supply chain, ASIC, mechanical, signal integrity, power delivery, layout, embedded & diagnostic, etc. Skills & Qualifications - B.Sc., M.Sc., or Ph.D. degree in electrical engineering, or equivalent experience.

### Director of Cost Accounting - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-19
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7579627003
- Excerpt: Director of Cost Accounting Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role The Director of Cost Accounting will lead all cost accounting, inventory, and cost-related financial activities across manufacturing, infrastructure, and AI compute operations. This role is critical to ensuring accurate costing, capitalized asset accounting, and cost transparency across both training and inference workloads. This role requires comfort operating in a fast-paced, evolving environment, where priorities may shift and processes are still being built. The successful candidate will be expected to ramp quickly, close gaps, and contribute immediately, bringing structure, judgment, and execution while helping

### Security SWE - Cerebras Systems
- Location: Sunnyvale CA or Toronto Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-11
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7647319003
- Excerpt: Security SWE Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a frontend engineer on our AI cloud, you will work on our customer-facing inference, training, and admin consoles and API experiences. In this role, you will be responsible for designing and building responsive, user-friendly frontend interfaces that provide an optimal experience for our developers, handling high traffic and throughput efficiently. Your familiarity with the latest web development frameworks and best practices, coupled with a keen eye for design and user experience, will drive team success. We're looking for talented software engineers

### Principal Engineer, Inference Cloud - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2025-09-29
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7480543003
- Excerpt: Principal Engineer, Inference Cloud Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Location: Sunnyvale We're hiring a Principal Engineer for our Inference Cloud Platform. This team owns the cloud layer behind our Inference Service, including availability, latency, reliability, and multi-region scale. This is one of the most senior IC roles on the team, for someone who can identify the highest-leverage platform problems, set direction across multiple teams, define long-term architecture, and write production code on critical paths. Many of the key decisions are ambiguous at the outset; you'll need to frame the problem, make tradeoffs, and drive execution

### Staff Kernel Optimization Engineer - Cerebras Systems
- Location: Remote, California, United States (remote)
- Salary: Not disclosed
- Posted: 2026-02-05
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7620254003
- Excerpt: Staff Kernel Optimization Engineer Remote, California, United States Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a Kernel Engineer on our team, you will develop high-performance software solutions at the intersection of hardware and software, developing high-performance software for cutting-edge AI and HPC workloads. Your focus will be on implementing, optimizing, and scaling deep learning operations to fully leverage our custom, massively parallel processor architecture. You will be part of a world-class team responsible for the design, performance tuning, and validation of foundational ML and HPC kernels. This includes building a library of parallel and distributed

### 3D Physical Design Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $150K-$270K
- Posted: 2025-04-10
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/6538655003
- Excerpt: 3D Physical Design Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a member of our tight knit physical design team, you will be working on the design and analysis of 3D integrated products. This role involves a combination of traditional ASIC/SoC physical design skills, packaging, power, clock and cooling analysis. You will work closely with the architecture and RTL team to do R&D on novel concepts for 3D integration. Skills and Qualifications Required - 10+ years of physical design/verification experience. - Strong knowledge of block level and full-chip physical verification methodology. - Expert

### Senior WAN Network Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-03-30
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7684098003
- Excerpt: Senior WAN Network Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are seeking a highly skilled WAN Network Engineer to design, implement, manage, and optimize global connectivity. The ideal candidate will have strong experience with carrier networks, routing protocols, and network security, and will play a critical role in ensuring high availability, performance, and reliability of global network services. Responsibilities - Design, deploy, and maintain WAN network across leased lines and dark fiber for low latency and 99.999% availability. - Collaborate with telecom providers and ISPs for circuit provisioning, upgrades, and issue resolution.

### Staff Site Reliability Engineer – Automation and Platform - Cerebras Systems
- Location: Remote, California, United States; Sunnyvale, CA; Toronto, Ontario, Canada (remote)
- Salary: Not disclosed
- Posted: 2025-10-03
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7487314003
- Excerpt: Staff Site Reliability Engineer – Automation and Platform Remote, California, United States; Sunnyvale, CA; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role We are building a high-performance SRE function to support one of the world's fastest-growing AI inference services, powered by the Wafer-Scale Engine (WSE). This team will help deliver world-class, ultra-reliable inference infrastructure for leading model builders such as OpenAI and other frontier labs. As a Staff SRE, you will lead the engineering effort to eliminate toil at scale by driving implementation of self-service delivery pipelines, shared observability common tooling. This role starts

### Senior Cost Accounting Manager - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-30
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7612274003
- Excerpt: Senior Cost Accounting Manager Sunnyvale, CA About The Role The Senior Cost Accounting Manager will thrive in a fast-paced, dynamic environment supporting both Finance and Operations. This role requires excellent communication skills, strong computational and analytical ability, and the capacity to quickly derive insights from high volume manufacturing and inventory-related transactions. The ideal candidate is proactive, detail-oriented, and comfortable owning end-to-end cost accounting processes. This is a high visibility role working closely with leaders in Supply Chain, Procurement and Finance. Responsibilities Cost Capitalization & Allocations - Assess, capitalize, and allocate inventory rebates.

### ASIC Architect - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-06-04
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7753254003
- Excerpt: ASIC Architect Sunnyvale, CA Responsibilities - Translate high level architecture spec to micro-architecture feature requirements - Bring up new features in the performance/power model - Perform comprehensive PPA trade-offs for new architectural features - Extract insights for new features and micro-architecture power efficiency - Profile workloads, identify bottlenecks and project competition performance for benchmarking - Engage with SW teams for end-end application level modeling at cluster level - Identify kernel level HW acceleration level opportunities Qualifications - Masters/PhD in Electrical/Computer Engineering - 10+ years of experience across performance analysis and modeling across

### Design Verification Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $120K-$240K
- Posted: 2025-11-25
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7522566003
- Excerpt: Design Verification Engineer Sunnyvale, CA Key Responsibilities - Work with architects, designers, post-silicon, and software engineers, to ensure a high-quality design that works for silicon. - Develop and implement verification strategies, detailed tests, and coverage plans based on micro-architecture. - Create verification methodologies and reusable environments, including components such as stimulus, checkers, assertions, and coverage. - Implement tests, manage regressions, gather coverage, and debug test failures. - Collaborate with cross-functional teams, including architecture, RTL design, physical design, firmware, and validation. - Analyze and debug complex issues across simulation, emulation, and silicon bring-up

### Sourcing Manager – Critical Components - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $200K-$240K
- Posted: 2026-05-04
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7656210003
- Excerpt: Sourcing Manager – Critical Components Sunnyvale, CA Job Summary The Sourcing Director - Critical Components is responsible for developing and executing global sourcing strategies to secure high-quality, cost-effective critical components and materials. This role ensures supply chain continuity, minimizes risk, and drives innovation by leveraging market analysis, supplier relationship management, and advanced negotiation tactics. The manager collaborates with cross-functional teams to align procurement activities with organizational goals, optimize procurement processes, and enhance supplier relationships. Key Responsibilities: - Strategic Sourcing: Develop and implement comprehensive sourcing strategies for critical components, aligning with long-term business

### Sr. Staff/Staff Design Verification Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $250K-$300K
- Posted: 2026-06-04
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7763735003
- Excerpt: Sr. Staff/Staff Design Verification Engineer Sunnyvale, CA Key Responsibilities - Work with architects, designers, post silicon and software engineers to ensure a high-quality design that works first silicon. - Develop and implement verification strategies, detailed tests and coverage plans based on micro-architecture. - Create verification methodologies and reusable environments, including components such as stimulus, checkers, assertions, and coverage. - Implement tests, manage regressions, gather coverage, and debug test failures. - Collaborate with cross-functional teams including architecture, RTL design, physical design, firmware, and validation. - Analyze and debug complex issues across simulation, emulation,

### Vice President, Creative & Integrated Marketing - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-25
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628240003
- Excerpt: Vice President, Creative & Integrated Marketing Sunnyvale, CA About The Role We are seeking a dynamic Vice President of Creative & Integrated Marketing to lead brand expression and orchestrate high-impact, cross-channel marketing programs that drive awareness, engagement, and pipeline growth. Reporting to the CMO, this leader will unify creative excellence with integrated go-to-market execution - ensuring that every campaign, launch, and brand moment is cohesive, differentiated, and performance-oriented. This role blends visionary brand leadership with operational rigor, connecting storytelling to measurable business outcomes. Responsibilities Creative & Brand Leadership - Own the company's

### Billings and Collections Specialist - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: Not disclosed
- Posted: 2026-02-13
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7614890003
- Excerpt: Billings and Collections Specialist Sunnyvale, CA About The Role As a Senior Billing and Collections Analyst, you'll own the full order-to-invoice cycle, turning delivered hardware, professional services, and compute usage into accurate, timely invoices that support clean financial reporting and on-time customer payment. You'll serve as a key link between Sales, Services, and Finance by leading month-end billing close activities, translating contract terms into billing in NetSuite, and managing credits and adjustments with audit-ready documentation. You'll respond to billing inquiries, resolve order-to-invoice issues before they delay payment, and identify process improvements and

### Network Engineer - Cerebras Systems
- Location: Sunnyvale, CA; Toronto, Ontario, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2026-06-03
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7762739003
- Excerpt: Network Engineer Sunnyvale, CA; Toronto, Ontario, Canada About The Role We are looking for a Network Architect to join our Cluster Engineering Team and help shape the front-end datacenter and interconnect fabric for the current and next generations of our AI clusters. You will partner closely with hardware vendors, internal networking teams, and industry peers to define best-in-class network architectures that deliver resilient, reliable, and high-throughput connectivity for large-scale AI workloads. This is a deeply technical role that spans the full stack, from host-side networking and NIC behavior up through cluster-level coordination

### AI Silicon Physical Design Engineer - Cerebras Systems
- Location: Remote, California, United States; Sunnyvale, CA (remote)
- Salary: $150K-$250K
- Posted: 2025-04-21
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/6549894003
- Excerpt: AI Silicon Physical Design Engineer Remote, California, United States; Sunnyvale, CA About The Role Join our close-knit physical design team where you'll excel in synthesizing, placing, and routing high speed designs. Experience the full spectrum of physical design and implementation, collaborating closely with the RTL team and integrating these blocks seamlessly into the full-chip architecture. Skills & Qualifications - 10+ years of physical design & physical verification experience. - Strong knowledge of block level and full-chip physical verification methodology. - Strong experience in block/subsystem timing closure. - Expert at optimizing for the

### LLM Inference Performance & Evals Engineer - Cerebras Systems
- Location: Toronto, Ontario, Canada (unspecified)
- Salary: Not disclosed
- Posted: 2025-07-24
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/6658665003
- Excerpt: LLM Inference Performance & Evals Engineer Toronto, Ontario, Canada About The Role Join the inference model team dedicated to bring up the state-of-the-art models, numerically validating and accelerating new model ideas on wafer-scale hardware. You will prototype architectural tweaks, build performance-eval pipelines, and turn hard numbers into changes that land in production. Key Responsibilities - Prototype and benchmark cutting-edge ideas: new attentions, MoE, speculative decoding, and many more innovations as they emerge. - Develop agent-driven automation that designs experiments, schedules runs, triages regressions, and drafts pull-requests. - Work closely with compiler, runtime,

### Senior Yield Enhancement Engineer - Cerebras Systems
- Location: Sunnyvale, CA (unspecified)
- Salary: $175K-$250K
- Posted: 2026-02-16
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7628236003
- Excerpt: Senior Yield Enhancement Engineer Sunnyvale, CA The Role : Senior Yield Enhancement Engineer We are seeking a highly experienced Senior VLSI Product and Test Engineer with 7+ years of relevant experience in Semiconductor Testing/Failure Analysis/Yield Enhancement. The successful candidate will look at ATE datalogs, understand the defects in detail, disposition wafers based on ATE data and drive FA/Yield enhancement using physical/optical inspection techniques used in FA. Suitable candidate will have depth in testing, characterization of silicon defects, failure modes, and experience delivering end-to-end solutions working closely with teams across chip design, fabrication,

### Senior / Staff Technical Program Manager - Datacenter Capacity Delivery (E2E) - Cerebras Systems
- Location: Europe; Remote, California, United States; Sunnyvale, CA; Toronto, Ontario, Canada (remote)
- Salary: Not disclosed
- Posted: 2026-06-03
- Apply: https://job-boards.greenhouse.io/cerebrassystems/jobs/7762063003
- Excerpt: Senior / Staff Technical Program Manager - Datacenter Capacity Delivery (E2E) Europe; Remote, California, United States; Sunnyvale, CA; Toronto, Ontario, Canada The Role The DC Delivery E2E TPM is the single-threaded owner for delivering data center capacity from forecast → site strategy → design → construction → infrastructure readiness → go-live. You will operate as the SSOT (Single Source of Truth) for delivery milestones, risks, and capacity outcomes while orchestrating cross-functional execution across internal teams and external partners. This is a frontier-scale role where ambiguity is high, timelines are compressed, and stakes

### Business System Analyst - Finance Systems - Addepar
- Location: Pune, India (unspecified)
- Salary: Not disclosed
- Posted: 2026-06-03
- Apply: https://job-boards.greenhouse.io/addepar1/jobs/8549270002
- Excerpt: Business System Analyst - Finance Systems Pune, India Business System Analyst - Finance Systems

### Analyst, Business Systems – Financial Systems - Icu Medical Inc
- Location: Chennai, Tamil Nadu, India (unspecified)
- Salary: Not disclosed
- Posted: 2026-01-28
- Apply: https://eduu.fa.us2.oraclecloud.com/hcmUI/CandidateExperience/en/sites/CX_1/requisitions/job/1029
- Excerpt: Analyst, Business Systems – Financial Systems Chennai, Tamil Nadu, India Analyst, Business Systems - Financial Systems

### Systems Engineer - Avionics Systems Safety - Honeywell
- Location: Phoenix, AZ, United States (unspecified)
- Salary: Not disclosed
- Posted: 2026-06-06
- Apply: https://icfcjb.fa.ocs.oraclecloud.com/hcmUI/CandidateExperience/en/sites/Honeywell/requisitions/job/109715
- Excerpt: Systems Engineer - Avionics Systems Safety Phoenix, AZ, United States As a Systems Engr here at Honeywell, you will be part of a team focused on systems development and system safety engineering, contributing to the advancement of Honeywell's most advanced Anthem Integrated Flight System, that is breaking new ground in Avionics design and pilot-machine interface.

### Electrical Systems Engineer - Ametek
- Location: Harleysville, PA, United States (onsite)
- Salary: Not disclosed
- Posted: 2026-06-12
- 401(k) match: listed (source-backed)
  - Source: https://www.askebsa.dol.gov/FOIA%20Files/2024/Latest/F_SCH_H_2024_Latest.zip
- Apply: https://jobs.ametek.com/job/Harleysville-Electrical-Systems-Engineer-PA-19438/1286493600/
- Excerpt: Electrical Systems Engineer Work Location (Country) United States | Remote/Onsite Onsite This is for an Electrical Systems Engineer position in the Engineering department of the AMETEK PDS Electrical Power Distribution team. As a member of this team, you will be responsible for System interface of aircraft avionics system, aircraft interface device, computing device, electrical sub-systems, development assurance (ARP4754A), validation & verification and traceability of requirements. Also, you will be responsible for systems design, system level requirements development, decomposing requirements, testing, and supporting FAA certification documentation. This role will be based in our Harleysville, PA . Position Responsibilities: Demonstrate excellence in development of electrical sub-systems or high-integrity aircraft network system / aircraft interface device / computing devices Demonstrate excellence in system design, integration & testing. Demonstrate excellence in development assurance (ARP4754A) Develop system test procedures. Perform requirements management in DOORS. Evaluates customer/operational needs to define and coordinate system performance requirements, functional requirements and regulatory requirements. Applies an interdisciplinary, collaborative approach to plan, design, develop and verify a lifecycle balanced system of systems and system solutions. Coordinates with engineering functions to define requirements related to safety, reliability, maintainability, testability, human systems integration, survivability, vulnerability, susceptibility, system

### System Engineer - Cryptographic systems - Thales
- Location: Tubize, Belgium (unspecified)
- Salary: Not disclosed
- Posted: 2025-11-07
- Apply: https://thales.wd3.myworkdayjobs.com/Careers/job/Tubize/System-Engineer---Cryptographic-systems_R0305314/apply
- Excerpt: System Engineer - Cryptographic systems Tubize, Belgium Join our team as a System Engineer specializing in cryptographic systems. You will define and design innovative solutions, collaborate with cross-functional teams, and contribute to cutting-edge cybersecurity projects. If you have a strong background in system engineering and cryptography, we want to hear from you!

### Systems Engineering Lead, Air Vehicle Systems - Anduril
- Location: Costa Mesa, California, United States (unspecified)
- Salary: $191K-$253K
- Posted: 2025-08-05
- Apply: https://boards.greenhouse.io/andurilindustries/jobs/4801617007?gh_jid=4801617007
- Excerpt: Systems Engineering Lead, Air Vehicle Systems Costa Mesa, California, United States Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and control center. As the world enters an era of strategic competition, Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years. WHY WE'RE HERE: The systems engineering team is looking for experienced Systems Engineering Lead to support a Group 5 aircraft program at Anduril. The ideal candidate will have experience leading a team of systems engineering working on air vehicle systems. Ideally they have worked with complex unmanned aerial systems (UAS), and will have a strong background in systems engineering for unmanned platforms. WHAT YOU'LL DO We are hiring a Lead Systems Engineer with a background in air vehicle systems to support ongoing development of a group 5 autonomous aircraft. - Shape customer needs into air vehicle and subsystem requirements in collaboration with multidisciplinary teams. - Develop System Engineering artifacts, including system model, to trace from stakeholder inputs through design in a digital thread. Incorporate input from system safety, software engineering, and

### Systems Engineer – Space Systems - Oceaneering International
- Location: Houston, TX, United States (unspecified)
- Salary: Not disclosed
- Posted: 2026-05-22
- Apply: https://ebfr.fa.us2.oraclecloud.com/hcmUI/CandidateExperience/en/sites/jobs/requisitions/job/10513
- Excerpt: Systems Engineer – Space Systems Houston, TX, United States Systems Engineer for NASA/Amentum JETS2 Service Contract

### HPC Systems Engineer 3 - T-Rex Solutions
- Location: Fort Meade, Maryland (unspecified)
- Salary: $130K-$150K
- Posted: 2026-06-12
- Apply: https://www.trexsolutionsllc.com/current-opportunities-at-trex/?gh_jid=8543973002
- Excerpt: HPC Systems Engineer 3 Fort Meade, Maryland T-Rex is looking for a fully cleared HPC Systems Engineer 3 to work on a program in the Fort Meade, Maryland area in support of the Intelligence Community. Responsibilities: Analyzes user's requirements, concept of operations documents, and high-level system architectures to develop system requirements specifications. Analyzes system requirements and leads design and development activities. Guides users in formulating requirements, advises alternative approaches, and conducts feasibility studies. Provides technical leadership for the integration of requirements, design, and technology. Incorporates new plans, designs and systems into ongoing operations. Develops technical documentation. Develops system Architecture and system design documentation. Guides system development and implementation planning through assessment or preparation of system engineering management plans and system integration and test plans. Interacts with the Government regarding Systems Engineering technical considerations and for associated problems, issues or conflicts. Ultimate responsibility for the technical integrity of work performed and deliverables associated with the Systems Engineering area of responsibility. Requirements: A High School Diploma or GED plus fourteen (14) years of general system engineering experience. OR A Bachelor's degree in System Engineering, Computer Science, Information Systems, Engineering Science, Engineering Management, or related discipline from an accredited college or university plus six (6) years of systems engineering experience. OR A Master's degree in a Qualified Engineering Field or a related discipline from an accredited college or university required plus four (4) years of systems engineering experience. OR A PhD in a Qualified Engineering Field or a related discipline from an accredited

---
Source-backed benefit claims include source links; other benefit values are labeled separately.