FewerJobs.

Cerebras Systems jobs

48 matches, filter-driven and evidence-linked.

Reset
Showing 48 high-confidence listings. 0 additional listings are hidden by default. Show all
24 shown of 48

Benefit evidence

Source verified Inferred from posting Unknown provenance
Only source-backed benefits
  • CoDesign & NextGen - New College Grad

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 161 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +3
    Unspecified Other - Mid $145K-$155K Equity

    CoDesign & NextGen - New College Grad Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role Engineers in the CoDesign and NextGen organization work at the interface of software and hardware helping everything from high performance kernel design, next generation ASIC performance modeling and tuning, new system bringup and software tuning, system robustness and simulations. As an new engineer you'll be expected to learn the Cerebras products and platforms end to end starting at the base layer of programming the Wafer Scale Engine through kernel development, modeling performance for our software products and validating the work

  • Applied Machine Learning Research Scientist

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 103 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +3
    Unspecified Data - Mid Salary not disclosed

    Applied Machine Learning Research Scientist Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As an Applied Machine Learning Research Scientist at Cerebras, you will play a key role in turning modern machine learning techniques into scalable, high-performance systems. This role sits at the intersection of modeling and systems focused not on publishing new algorithms, but on understanding how they work and making them run effectively at scale. Your work will directly impact how large language models (LLMs) are trained, optimized, and deployed on one of the most advanced AI platforms in the

  • Full Stack Engineer – Manufacturing Test

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 112 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +3
    Unspecified Data - Mid $175K-$220K Equity

    Full Stack Engineer – Manufacturing Test Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role As a Full Stack Engineer focusing on Cerebras' manufacturing test platform, you will design, build, and maintain a comprehensive test software solution for all stages of manufacturing - from individual components to complete Cerebras systems. You will collaborate cross-functionally with hardware design, engineering, operations, and data analytics teams to develop user interfaces and data processing frameworks that directly impact manufacturing efficiency, quality, and scalability. Responsibilities - Collaborate with hardware engineers and test developers to create frameworks that facilitate the development, validation,

  • Prognostics & Health Monitoring Engineer

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 48 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +3
    Unspecified Data - Mid $150K-$250K Equity

    Prognostics & Health Monitoring Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Role Summary Quality, reliability, and uptime are foundational to scaling Cerebras systems. We are seeking an engineer to define and build our prognostics and health monitoring (PHM) capability-developing frameworks to monitor, assess, and predict hardware health across our fleet. In this role, you will transform telemetry and operational data into actionable insights and automated responses, enabling early detection of degradation, accurate failure prediction, and proactive actions to keep systems highly available, performant, and resilient. This is a highly cross-functional role spanning reliability engineering, data science,

  • Senior ML Software Engineer - Integration & Quality

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 132 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +3
    Unspecified Engineering - Senior Salary not disclosed

    Senior ML Software Engineer - Integration & Quality Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role We are looking for a Software Engineer to join the ML Integration and Quality team at Cerebras. This team sits at the intersection of machine learning infrastructure, distributed systems, and hardware/software co-design. In this role, you will help integrate and validate the software stack that powers the Cerebras AI platform, ensuring large-scale ML workloads run reliably and efficiently across our systems. You will work closely with engineers across runtime, compiler, kernel, and hardware teams to debug

  • Engineering Manager, Inference ML Runtime

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 85 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +3
    Unspecified Engineering - Senior Salary not disclosed

    Engineering Manager, Inference ML Runtime Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role The Inference ML Engineering team at Cerebras builds the runtime, APIs, and systems that power the fastest generative AI inference platform in the world. As an Engineering Manager, Inference ML Runtime , you will lead a team responsible for designing and scaling the systems that enable seamless execution of state-of-the-art AI models on Cerebras hardware. You will operate at the intersection of machine learning, distributed systems, and high-performance runtime engineering , translating cutting-edge research into production-ready infrastructure to serve

  • posted 112 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Marketing - Senior Salary not disclosed

    Senior Product Marketing Manager, AI Inference Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role The AI conversation moves fast - new models ship weekly, benchmarks shift overnight, and the community's attention resets constantly. Cerebras has a massive speed advantage in inference, and this role exists to make sure that advantage is visible, understood, and top-of-mind wherever developers and AI builders are paying attention. As Senior Product Marketing Manager, you'll own realtime product marketing for Cerebras inference. You'll create high-impact technical content - blog posts, benchmark analyses, social threads - that positions Cerebras at the center

  • ML Performance Benchmarking Engineer

    Cerebras Systems - Toronto, Ontario, Canada
    Indexed from Greenhouse
    posted 91 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +2
    Unspecified Engineering - Mid Salary not disclosed

    ML Performance Benchmarking Engineer Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role The Inference Core Platform group is at the heart of Cerebras' mission to deliver the world's fastest AI inference. Our team builds the foundational software and hardware infrastructure that powers low-latency, high-speed, high-throughput deployment on the Cerebras Wafer-Scale Engine (WSE). We are responsible for the full stack-from model compilation and scheduling down to custom hardware kernels and driver development. The ML Performance Benchmarking team plays a pivotal role in shaping the performance and scalability of AI inference on one of the most

  • Senior Performance Engineer, Inference

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse
    posted 65 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Data - Senior Salary not disclosed

    Senior Performance Engineer, Inference Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are hiring a Senior Performance Engineer to join our Product team. You are an expert on state-of-the-art inference performance and will serve as our resident expert on how Cerebras stacks up against alternative inference providers on both price and performance. This role sits at the intersection of performance benchmarking from first principles and competitive intelligence. The role has two core pillars: - Performance Benchmarking You will build, run, and maintain reproducible benchmarks that measure Cerebras inference performance for real customer workloads. This

  • 2w ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Operations - Staff Plus Salary not disclosed

    Director / Senior Director, Critical Facility Operations Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role At Cerebras, we build systems that redefine the scale of compute. That ambition extends beyond silicon into the infrastructure that powers it. We are looking for a Director / Sr. Director of Critical Facility Operations to lead the environments that enable next-generation AI workloads. This leader will own availability, operational integrity, and performance across a colocation-driven data center footprint-ensuring our infrastructure operates with precision, predictability, and zero room for error. This is not a traditional facilities role. You will operate at

  • Staff Inference ML Runtime Engineer

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 203 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +2
    Unspecified Engineering - Mid Salary not disclosed

    Staff Inference ML Runtime Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our fast generative inference solution through simple APIs powered by a distributed runtime that runs on large clusters of our own hardware. Our mission is to empower enterprises, developers, and researchers to unlock the full potential of our platform, leveraging its performance, scalability, and flexibility. The team works closely with cross-functional groups, including compiler developers, cluster orchestrators, ML scientists, cloud architects, and product teams, to deliver high-impact

  • Performance Engineer

    Cerebras Systems - Toronto, Ontario, Canada
    Indexed from Greenhouse
    posted 282 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +2
    Unspecified Data - Mid Salary not disclosed

    Performance Engineer Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role Join Cerebras as a Performance Engineer within our innovative Runtime Team. Our groundbreaking CS-3 system, hosted by a distributed set of modern and powerful x86 machines, has set new benchmarks in high-performance ML training and inference solutions. It leverages a dinner-plate sized chip with 44GB of on-chip memory to surpass traditional hardware capabilities. This role will challenge and expand your expertise in optimizing AI applications and managing computational workloads primarily on the x86 architecture that run our Runtime driver. Responsibilities - Focus on CPU

  • AI Engineer, Model Quality and Performance

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse
    posted 33 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Data - Mid Salary not disclosed

    AI Engineer, Model Quality and Performance Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role You'll own model quality and performance for Cerebras' inference offerings. You will define what "good" looks like across the models we serve, building AI-driven systems to measure it at scale, and translating those signals into artifacts our customers and product team actually use. You'll use AI agents to spin up custom eval suites per customer use case, mine trajectories for representative test data, automate the repetitive parts of release qual, and help build performance datasets and benchmarking workflows for customer use

  • Lead RTL Design Engineer

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 216 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +2
    Unspecified Engineering - Senior $175K-$275K Equity

    Lead RTL Design Engineer Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a lead front-end design engineer, you will be a key part of the world-class team designing and developing the next generations of the Cerebras Wafer Scale Engine (WSE). This role requires deep expertise in RTL design and integration, with a strong focus on delivering high-performance, power-efficient, and scalable solutions. The role also requires close collaboration and management of external ASIC vendor. You will collaborate closely with the design verification, physical design, software and system teams to bring innovative semiconductor architectures from concept

  • Manager, Business Operations

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse
    posted 48 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Data - Senior Salary not disclosed

    Manager, Business Operations Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role This role is a high-leverage seat in that build and a deliberate apprenticeship into operating leadership. You will create the business operations, analytics, and execution system that keeps decision-ready insight flowing as the company scales. You will be embedded with operators, turning messy operational reality into durable processes, clear metrics, and repeatable operating rhythms. You will report to the Head of FP&A and work in close partnership with the COO and operations leadership. Why now Cerebras is scaling to meet accelerating demand for fast

  • Manufacturing Bring-up Engineer L2

    Cerebras Systems - Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada
    Indexed from Greenhouse Comp disclosed in posting
    posted 106 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Data - Mid $170K-$230K Equity

    Manufacturing Bring-up Engineer L2 Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role We are seeking a highly skilled and motivated Manufacturing Bring-up Engineer to join our team. As the Manufacturing Bring-up Engineer you will support our system level bring-up process execution, implementation, and evolution in the manufacturing pipeline. This is a high visibility role that requires strong technical expertise, coordination, and collaboration to deliver our product from manufacturing to the customer. Responsibilities - Support the Cerebras manufacturing bring-up process execution to configure, test, and validate system performance prior to customer

  • Senior Front End Design Engineer (Microarchitecture)

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    1w ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Data - Senior $250K-$300K Equity

    Senior Front End Design Engineer (Microarchitecture) Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a senior front-end design engineer, you will be a key part of the world-class team designing and developing the next generations of the Cerebras Wafer Scale Engine (WSE). This role requires deep expertise in RTL design and integration, with a strong focus on delivering high-performance, power-efficient, and scalable solutions. You will collaborate closely with the design verification, physical design, software and system teams to bring innovative semiconductor architectures from concept to production, addressing the unique challenges of building WSE systems.

  • Lead Full Stack Machine Learning Engineer

    Cerebras Systems - Bengaluru, Karnataka, India
    Indexed from Greenhouse
    1w ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Data - Senior Salary not disclosed

    Lead Full Stack Machine Learning Engineer Bengaluru, Karnataka, India Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About T he Role This teams' principal responsibility is to rapidly bring up state-of-the-art open-source models, frameworks and data engineering. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI applications. Responsibilities - Contribute to the end-to-end bring up of frameworks for RL, inference serving, ML models on Cerebras CSX systems. -

  • Senior/Staff Engineer : Post Silicon- Bring Up

    Cerebras Systems - Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada
    Indexed from Greenhouse Comp disclosed in posting
    posted 120 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Data - Senior $175K-$275K Equity

    Senior/Staff Engineer : Post Silicon- Bring Up Bengaluru, Karnataka, India; Sunnyvale, CA; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. The Role: In this exciting role, you will be responsible for bring up and optimizations of Cerebras's Wafer Scale Engine (WSE). Suitable candidate will have experience delivering end to end solutions working closely with teams across chip design, system performance, software development and productization. Responsibilities: - On Wafer Scale Engines, develop and debug flows that embed well tested and deployable optimizations in production processes to reduce time and costs - Work on refining AI Systems across H/W-S/W

  • Principal ML Investigator

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse
    posted 187 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Other - Staff Plus Salary not disclosed

    Principal ML Investigator Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role Cerebras is adding an ML team that can focus on a new ML effort that can align with existing teams. We are seeking a principal investigator who will partner with our ML leaders to formulate the new effort and to build up the new team and capabilities. This new team would coordinate with our current ML teams: Field ML, which works directly with customers, Applied ML, which builds new ML capabilities and applications for customers, and Core ML, which adapts ML algorithms to find

  • Product Manager, Strategic Verticals

    Cerebras Systems - San Francisco, California, United States
    Indexed from Greenhouse
    posted 267 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +2
    Unspecified Product - Senior Salary not disclosed

    Product Manager, Strategic Verticals San Francisco, California, United States Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. Our customers span leading AI Native companies, Fortune 500 Enterprises, Sovereign AI and Federal programs, and leading research institutions. Our mission is to deliver the platform that unlocks the next generation of AI applications, providing the fundamentally new capability to leverage the most intelligent models at real-time serving speeds. Why Cerebras? Here at Cerebras, we have built the world's first wafer-scale compute platform and software stack, purpose-designed to accelerate generative AI by over 10-20x what is possible on legacy processors today. AI developers

  • Principal Engineer, AI Inference Reliability

    Cerebras Systems - Remote, California, United States; Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 231 days ago

    Why we showed this

    Description: "cerebras"Description: "systems"
    +2
    Remote Engineering - Staff Plus Salary not disclosed

    Principal Engineer, AI Inference Reliability Remote, California, United States; Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. In late 2024, we launched Cerebras Inference, the fastest Generative AI inference service in the world, over 10 times faster than GPU-based hyperscale cloud inference. Since launch, we've scaled to meet the surging demand from AI labs, enterprises, and a thriving developer community. In October 2025, we announced our series G funding, raising $1.1 billion USD to accelerate the expansion of our products and services to meet global AI demand. About the team The Cerebras Inference team's mission

  • Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai

    Cerebras Systems - Europe; Remote, California, United States; UAE
    Indexed from Greenhouse
    posted 229 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Remote Engineering - Mid Salary not disclosed

    Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai Europe; Remote, California, United States; UAE Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role: Would you like to participate in creating the fastest Generative Models inference in the world? Join the Cerebras Inference Team to participate in development of unique Software and Hardware combination that sports best inference characteristics in the market while running largest models available. Cerebras wafer scale inference platform allows running Generative models with unprecedented speed thanks to unique hardware architecture that provides fastest access to local memory, ultra-fast interconnect and huge amount

  • Senior Runtime Engineer

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 232 days ago

    Why we showed this

    Description: "systems"Description: "cerebras"
    +2
    Unspecified Engineering - Senior Salary not disclosed

    Senior Runtime Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are building the next generation of large-scale AI systems that power training and inference workloads at unprecedented scale and efficiency. You will design and develop high-performance distributed software that orchestrates massive compute and data pipelines across heterogeneous clusters. Your work will push the limits of concurrency, throughput, and scalability-enabling efficient execution of models at massive scale. This role sits at the intersection of systems engineering and machine learning performance, demanding both architectural depth and low-level implementation skills. You will help

Take this list with you

Download the 48 matching jobs in any format - read offline, archive, or hand to an AI assistant with your resume to find the best fits.

Or email it to me instead

The AI-ready prompt is a pre-written question you can paste into Claude, ChatGPT, Gemini, or Perplexity along with your resume. We never see your resume; this happens in your AI client of choice.

AI agent reading directly? Same data lives at /api/jobs.json?page=2&q=Cerebras+Systems. See /llms.txt and /api/openapi.json for the full schema.

0 selected