Cerebras Systems jobs

92 matches, filter-driven and evidence-linked.

Reset

24 shown of 92

Benefit evidence

Resolvable source Inferred from posting Unknown provenance

Only verified benefits

92 jobs match

24 shown on this page

Compare

CoDesign & NextGen - New College Grad

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse Comp disclosed in posting

posted 164 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+3
Description: "cerebras"Description: "systems"Employer: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "systems"
+2

Unspecified Engineering - Mid $145K-$155K Equity

CoDesign & NextGen - New College Grad Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role Engineers in the CoDesign and NextGen organization work at the interface of software and hardware helping everything from high performance kernel design, next generation ASIC performance modeling and tuning, new system bringup and software tuning, system robustness and simulations. As an new engineer you'll be expected to learn the Cerebras products and platforms end to end starting at the base layer of programming the Wafer Scale Engine through kernel development, modeling performance for our software products and validating the work

View details Apply at job-boards.greenhouse.io
Compare

Applied Machine Learning Research Scientist

Cerebras Systems - Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 106 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+3
Description: "cerebras"Description: "systems"Employer: "cerebras"
+2
Description: "cerebras"Description: "systems"Employer: "cerebras"
+2

Unspecified Engineering - Mid Salary not disclosed

Applied Machine Learning Research Scientist Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As an Applied Machine Learning Research Scientist at Cerebras, you will play a key role in turning modern machine learning techniques into scalable, high-performance systems. This role sits at the intersection of modeling and systems focused not on publishing new algorithms, but on understanding how they work and making them run effectively at scale. Your work will directly impact how large language models (LLMs) are trained, optimized, and deployed on one of the most advanced AI platforms in the

View details Apply at job-boards.greenhouse.io
Compare

Prognostics & Health Monitoring Engineer

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse Comp disclosed in posting

posted 51 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+3
Description: "systems"Description: "cerebras"Employer: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "cerebras"
+2

Unspecified Engineering - Mid $150K-$250K Equity

Prognostics & Health Monitoring Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Role Summary Quality, reliability, and uptime are foundational to scaling Cerebras systems. We are seeking an engineer to define and build our prognostics and health monitoring (PHM) capability-developing frameworks to monitor, assess, and predict hardware health across our fleet. In this role, you will transform telemetry and operational data into actionable insights and automated responses, enabling early detection of degradation, accurate failure prediction, and proactive actions to keep systems highly available, performant, and resilient. This is a highly cross-functional role spanning reliability engineering, data science,

View details Apply at job-boards.greenhouse.io
Compare

Full Stack Engineer – Manufacturing Test

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse Comp disclosed in posting

posted 115 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+3
Description: "cerebras"Description: "systems"Employer: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "systems"
+2

Unspecified Engineering - Mid $175K-$220K Equity

Full Stack Engineer – Manufacturing Test Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role As a Full Stack Engineer focusing on Cerebras' manufacturing test platform, you will design, build, and maintain a comprehensive test software solution for all stages of manufacturing - from individual components to complete Cerebras systems. You will collaborate cross-functionally with hardware design, engineering, operations, and data analytics teams to develop user interfaces and data processing frameworks that directly impact manufacturing efficiency, quality, and scalability. Responsibilities - Collaborate with hardware engineers and test developers to create frameworks that facilitate the development, validation,

View details Apply at job-boards.greenhouse.io
Compare

Senior ML Software Engineer - Integration & Quality

Cerebras Systems - Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 134 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+3
Description: "systems"Description: "cerebras"Employer: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "cerebras"
+2

Unspecified Engineering - Senior Salary not disclosed

Senior ML Software Engineer - Integration & Quality Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role We are looking for a Software Engineer to join the ML Integration and Quality team at Cerebras. This team sits at the intersection of machine learning infrastructure, distributed systems, and hardware/software co-design. In this role, you will help integrate and validate the software stack that powers the Cerebras AI platform, ensuring large-scale ML workloads run reliably and efficiently across our systems. You will work closely with engineers across runtime, compiler, kernel, and hardware teams to debug

View details Apply at job-boards.greenhouse.io
Compare

Engineering Manager, Inference ML Runtime

Cerebras Systems - Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 87 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+3
Description: "systems"Description: "cerebras"Employer: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "cerebras"
+2

Unspecified Engineering - Mid Salary not disclosed

Engineering Manager, Inference ML Runtime Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role The Inference ML Engineering team at Cerebras builds the runtime, APIs, and systems that power the fastest generative AI inference platform in the world. As an Engineering Manager, Inference ML Runtime , you will lead a team responsible for designing and scaling the systems that enable seamless execution of state-of-the-art AI models on Cerebras hardware. You will operate at the intersection of machine learning, distributed systems, and high-performance runtime engineering , translating cutting-edge research into production-ready infrastructure to serve

View details Apply at job-boards.greenhouse.io
Compare

Senior Performance Engineer, Inference

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse

posted 67 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "systems"
+1
Description: "cerebras"Description: "systems"Employer: "systems"
+1

Unspecified Engineering - Senior Salary not disclosed

Senior Performance Engineer, Inference Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are hiring a Senior Performance Engineer to join our Product team. You are an expert on state-of-the-art inference performance and will serve as our resident expert on how Cerebras stacks up against alternative inference providers on both price and performance. This role sits at the intersection of performance benchmarking from first principles and competitive intelligence. The role has two core pillars: - Performance Benchmarking You will build, run, and maintain reproducible benchmarks that measure Cerebras inference performance for real customer workloads. This

View details Apply at job-boards.greenhouse.io
Compare

Director / Senior Director, Critical Facility Operations

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse

2w ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1

Unspecified Operations - Director Plus Salary not disclosed

Director / Senior Director, Critical Facility Operations Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role At Cerebras, we build systems that redefine the scale of compute. That ambition extends beyond silicon into the infrastructure that powers it. We are looking for a Director / Sr. Director of Critical Facility Operations to lead the environments that enable next-generation AI workloads. This leader will own availability, operational integrity, and performance across a colocation-driven data center footprint-ensuring our infrastructure operates with precision, predictability, and zero room for error. This is not a traditional facilities role. You will operate at

View details Apply at job-boards.greenhouse.io
Compare

Manager, Business Operations

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse

posted 51 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "systems"
+1
Description: "systems"Description: "cerebras"Employer: "systems"
+1

Unspecified Operations - Mid Salary not disclosed

Manager, Business Operations Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role This role is a high-leverage seat in that build and a deliberate apprenticeship into operating leadership. You will create the business operations, analytics, and execution system that keeps decision-ready insight flowing as the company scales. You will be embedded with operators, turning messy operational reality into durable processes, clear metrics, and repeatable operating rhythms. You will report to the Head of FP&A and work in close partnership with the COO and operations leadership. Why now Cerebras is scaling to meet accelerating demand for fast

View details Apply at job-boards.greenhouse.io
Compare

Manufacturing Bring-up Engineer L2

Cerebras Systems - Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada

Indexed from Greenhouse Comp disclosed in posting

posted 109 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "systems"
+1
Description: "systems"Description: "cerebras"Employer: "systems"
+1

Unspecified Operations - Mid $170K-$230K Equity

Manufacturing Bring-up Engineer L2 Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role We are seeking a highly skilled and motivated Manufacturing Bring-up Engineer to join our team. As the Manufacturing Bring-up Engineer you will support our system level bring-up process execution, implementation, and evolution in the manufacturing pipeline. This is a high visibility role that requires strong technical expertise, coordination, and collaboration to deliver our product from manufacturing to the customer. Responsibilities - Support the Cerebras manufacturing bring-up process execution to configure, test, and validate system performance prior to customer

View details Apply at job-boards.greenhouse.io
Compare

Senior Front End Design Engineer (Microarchitecture)

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse Comp disclosed in posting

2w ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1

Unspecified Engineering - Senior $250K-$300K Equity

Senior Front End Design Engineer (Microarchitecture) Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a senior front-end design engineer, you will be a key part of the world-class team designing and developing the next generations of the Cerebras Wafer Scale Engine (WSE). This role requires deep expertise in RTL design and integration, with a strong focus on delivering high-performance, power-efficient, and scalable solutions. You will collaborate closely with the design verification, physical design, software and system teams to bring innovative semiconductor architectures from concept to production, addressing the unique challenges of building WSE systems.

View details Apply at job-boards.greenhouse.io
Compare

Lead Full Stack Machine Learning Engineer

Cerebras Systems - Bengaluru, Karnataka, India

Indexed from Greenhouse

1w ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1

Unspecified Engineering - Senior Salary not disclosed

Lead Full Stack Machine Learning Engineer Bengaluru, Karnataka, India Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About T he Role This teams' principal responsibility is to rapidly bring up state-of-the-art open-source models, frameworks and data engineering. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications. Responsibilities - Contribute to the end-to-end bring up of frameworks for RL, inference serving, ML models on Cerebras CSX systems. -

View details Apply at job-boards.greenhouse.io
Compare

Senior/Staff Engineer : Post Silicon- Bring Up

Cerebras Systems - Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada

Indexed from Greenhouse Comp disclosed in posting

posted 123 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "systems"
+1
Description: "cerebras"Description: "systems"Employer: "systems"
+1

Unspecified Engineering - Staff Plus $175K-$275K Equity

Senior/Staff Engineer : Post Silicon- Bring Up Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role: In this exciting role, you will be responsible for bring up and optimizations of Cerebras's Wafer Scale Engine (WSE). Suitable candidate will have experience delivering end to end solutions working closely with teams across chip design, system performance, software development and productization. Responsibilities: - On Wafer Scale Engines, develop and debug flows that embed well tested and deployable optimizations in production processes to reduce time and costs - Work on refining AI Systems across H/W-S/W

View details Apply at job-boards.greenhouse.io
Compare

Cluster UI Full Stack, Engineering Lead

Cerebras Systems - Bengaluru, Karnataka, India; Toronto, Ontario, Canada

Indexed from Greenhouse

posted 142 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1

Unspecified Engineering - Senior Salary not disclosed

Cluster UI Full Stack, Engineering Lead Bengaluru, Karnataka, India; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role In this role, you will be building a world class UI-based large-scale cluster management portal. This portal will act as one stop for all operations and maintenance of cerebras clusters - such as cluster bringup deployment (day0/1/2), job management, health management to name a few. Cerebras AI clusters may have 1000's of Wafer-scale accelerator systems, several 1000's of high-end servers, and several 1000's of networking ports including switches. Responsibilities - Be the primary engineering face and owner

View details Apply at job-boards.greenhouse.io
Compare

Product Manager, Strategic Verticals

Cerebras Systems - San Francisco, California, United States

Indexed from Greenhouse

posted 269 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "systems"
+1
Description: "systems"Description: "cerebras"Employer: "systems"
+1

Unspecified Product - Senior Salary not disclosed

Product Manager, Strategic Verticals San Francisco, California, United States Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Our customers span leading AI Native companies, Fortune 500 Enterprises, Sovereign AI and Federal programs, and leading research institutions. Our mission is to deliver the platform that unlocks the next generation of AI applications, providing the fundamentally new capability to leverage the most intelligent models at real-time serving speeds. Why Cerebras? Here at Cerebras, we have built the world's first wafer-scale compute platform and software stack, purpose-designed to accelerate generative AI by over 10-20x what is possible on legacy processors today. AI developers

View details Apply at job-boards.greenhouse.io
Compare

Principal Engineer, AI Inference Reliability

Cerebras Systems - Remote, California, United States; Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 234 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "systems"
+1
Description: "cerebras"Description: "systems"Employer: "systems"
+1

Remote Engineering - Principal Salary not disclosed

Principal Engineer, AI Inference Reliability Remote, California, United States; Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. In late 2024, we launched Cerebras Inference, the fastest Generative AI inference service in the world, over 10 times faster than GPU-based hyperscale cloud inference. Since launch, we've scaled to meet the surging demand from AI labs, enterprises, and a thriving developer community. In October 2025, we announced our series G funding, raising $1.1 billion USD to accelerate the expansion of our products and services to meet global AI demand. About the team The Cerebras Inference team's mission

View details Apply at job-boards.greenhouse.io
Compare

Software Engineer, Kernel Reliability

Cerebras Systems - Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 107 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1

Unspecified Engineering - Mid Salary not disclosed

Software Engineer, Kernel Reliability Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We're looking for a deeply technical, hands-on software engineer to join our on-field Kernel Reliability team. You'll help tackle a critical challenge: improving the reliability of our advanced compute clusters and the underlying inference, training, and internal production services. In this role, you'll work close to the code and design solutions that will scale with our rapidly growing system production and software service offerings. If you have strong fundamentals in systems, debugging, and failure analysis-and enjoy building tools and solving

View details Apply at job-boards.greenhouse.io
Compare

Senior Runtime Engineer

Cerebras Systems - Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 234 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "systems"
+1
Description: "systems"Description: "cerebras"Employer: "systems"
+1

Unspecified Engineering - Senior Salary not disclosed

Senior Runtime Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are building the next generation of large-scale AI systems that power training and inference workloads at unprecedented scale and efficiency. You will design and develop high-performance distributed software that orchestrates massive compute and data pipelines across heterogeneous clusters. Your work will push the limits of concurrency, throughput, and scalability-enabling efficient execution of models at massive scale. This role sits at the intersection of systems engineering and machine learning performance, demanding both architectural depth and low-level implementation skills. You will help

View details Apply at job-boards.greenhouse.io
Compare

Member of Technical Staff (Software Engineer)

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse Comp disclosed in posting

posted 42 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1

Unspecified Engineering - Staff Plus $170K-$175K

Member of Technical Staff (Software Engineer) Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Cerebras Systems Inc. has multiple openings for Member of Technical Staff (Software Engineer) Title : Member of Technical Staff (Software Engineer) Job Duties - Implement infrastructure to support high-performance, low-latency inference service. - Deploy and configure Kubernetes services to ensure scalability and reliability of inference workloads. - Optimize resource allocation and auto-scaling policies to handle variable inference demand while minimizing operational costs. - Integrate inference services with containerized environments using Docker and Kubernetes for orchestration. - Ensure high availability and fault tolerance by implementing

View details Apply at job-boards.greenhouse.io
Compare

Staff Inference ML Runtime Engineer

Cerebras Systems - Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 206 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "systems"
+1
Description: "cerebras"Description: "systems"Employer: "systems"
+1

Unspecified Engineering - Staff Plus Salary not disclosed

Staff Inference ML Runtime Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our fast generative inference solution through simple APIs powered by a distributed runtime that runs on large clusters of our own hardware. Our mission is to empower enterprises, developers, and researchers to unlock the full potential of our platform, leveraging its performance, scalability, and flexibility. The team works closely with cross-functional groups, including compiler developers, cluster orchestrators, ML scientists, cloud architects, and product teams, to deliver high-impact

View details Apply at job-boards.greenhouse.io
Compare

Performance Engineer

Cerebras Systems - Toronto, Ontario, Canada

Indexed from Greenhouse

posted 284 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1

Unspecified Engineering - Mid Salary not disclosed

Performance Engineer Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role Join Cerebras as a Performance Engineer within our innovative Runtime Team. Our groundbreaking CS-3 system, hosted by a distributed set of modern and powerful x86 machines, has set new benchmarks in high-performance ML training and inference solutions. It leverages a dinner-plate sized chip with 44GB of on-chip memory to surpass traditional hardware capabilities. This role will challenge and expand your expertise in optimizing AI applications and managing computational workloads primarily on the x86 architecture that run our Runtime driver. Responsibilities - Focus on CPU

View details Apply at job-boards.greenhouse.io
Compare

AI Engineer, Model Quality and Performance

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse

posted 35 days ago

Why we showed this
Description: "systems"Description: "cerebras"
+2
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1
Description: "systems"Description: "cerebras"Employer: "cerebras"
+1

Unspecified Engineering - Mid Salary not disclosed

AI Engineer, Model Quality and Performance Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role You'll own model quality and performance for Cerebras' inference offerings. You will define what "good" looks like across the models we serve, building AI-driven systems to measure it at scale, and translating those signals into artifacts our customers and product team actually use. You'll use AI agents to spin up custom eval suites per customer use case, mine trajectories for representative test data, automate the repetitive parts of release qual, and help build performance datasets and benchmarking workflows for customer use

View details Apply at job-boards.greenhouse.io
Compare

Lead RTL Design Engineer

Cerebras Systems - Sunnyvale, CA

Indexed from Greenhouse Comp disclosed in posting

posted 218 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1

Unspecified Engineering - Senior $175K-$275K Equity

Lead RTL Design Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a lead front-end design engineer, you will be a key part of the world-class team designing and developing the next generations of the Cerebras Wafer Scale Engine (WSE). This role requires deep expertise in RTL design and integration, with a strong focus on delivering high-performance, power-efficient, and scalable solutions. The role also requires close collaboration and management of external ASIC vendor. You will collaborate closely with the design verification, physical design, software and system teams to bring innovative semiconductor architectures from concept

View details Apply at job-boards.greenhouse.io
Compare

Compute Server Platform Architect

Cerebras Systems - Sunnyvale CA or Toronto Canada

Indexed from Greenhouse

posted 121 days ago

Why we showed this
Description: "cerebras"Description: "systems"
+2
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1
Description: "cerebras"Description: "systems"Employer: "cerebras"
+1

Unspecified Engineering - Staff Plus Salary not disclosed

Compute Server Platform Architect Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a Compute / Server Platform Architect on the Cluster Architecture Team, you will own the server-side platform architecture that enables Cerebras CS3-based AI clusters (training and inference) to deliver predictable performance, scalability, and reliability. Our accelerators are network-attached, so the x86 server fleet is a first-class part of the end-to-end system: it runs critical-path runtime functions (for example orchestration, prompt caching, and IO/control services) and must be co-designed with software for token-level latency, throughput, and cost efficiency. You will

View details Apply at job-boards.greenhouse.io

Take this list with you

Download the 92 matching jobs in any format - read offline, archive, or hand to an AI assistant with your resume to find the best fits.

AI-ready prompt (.txt) Markdown (.md) JSON (.json)

Or email it to me instead

The AI-ready prompt is a pre-written question you can paste into Claude, ChatGPT, Gemini, or Perplexity along with your resume. We never see your resume; this happens in your AI client of choice.

AI agent reading directly? Same data lives at /api/jobs.json?page=2&q=Cerebras+Systems&quality=all. See /llms.txt and /api/openapi.json for the full schema.

0 selected

Filters

Benefit evidence

Take this list with you