FewerJobs.

Cerebras Systems jobs

30 matches, filter-driven and evidence-linked.

Showing 30 high-confidence listings. 0 additional listings are hidden by default.
24 shown of 30

  • System Software Engineer (Embedded)

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 116 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +3 more
    Unspecified Engineering - Mid $175K-$275K Equity

    Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

    Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference.

    Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.

    The Role: As part of the Embedded Software team, you will help build the critical software foundation that powers the Cerebras Wafer Scale Engine (WSE), the world's largest AI processor. Our team owns a diverse range of embedded and system-level components that enable the WSE to operate reliably at scale, including microcontroller firmware, wafer-level monitoring logic, system administration services, and the Linux platform and BSP layers that keep the entire system running smoothly. This role exists at the intersection of embedded systems, platform engineering, and

  • Advanced Technology: AI/ML Research Scientist

    Cerebras Systems - Sunnyvale, CA; Toronto, Ontario, Canada; Vancouver, British Columbia, Canada
    Indexed from Greenhouse
    posted 68 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +3 more
    Unspecified Data - Mid Salary not disclosed

    About The Team: Cerebras builds wafer-scale AI processors: single chips delivering tens of PB/s of memory bandwidth and a dataflow architecture that accelerates at a granularity no multi-device system can match. The Advanced Technology Group (ATG) is Cerebras' pathfinding organization. We work ahead of product to explore new architectures, demonstrate breakthrough performance on scientific and AI workloads, and shape the technical roadmap for future Cerebras hardware and software. Our work regularly appears at top-tier venues (Supercomputing, SIAM, IEEE,

  • Advanced Technology: R&D Engineer - AI/ML, HPC

    Cerebras Systems - Sunnyvale, CA; Toronto, Ontario, Canada; Vancouver, British Columbia, Canada
    Indexed from Greenhouse
    posted 68 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +3 more
    Unspecified Data - Mid Salary not disclosed

    About The Team: Cerebras builds wafer-scale AI processors: single chips delivering tens of PB/s of memory bandwidth and a dataflow architecture that accelerates at a granularity no multi-device system can match. The Advanced Technology Group (ATG) is Cerebras' pathfinding organization. We work ahead of product to explore new architectures, demonstrate breakthrough performance on scientific and AI workloads, and shape the technical roadmap for future Cerebras hardware and software. Our work regularly appears at top-tier venues (Supercomputing,

  • Mechanical Engineer

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    2w ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Data - Mid $180K-$200K Equity

    The Role: As a Mechanical Engineer at Cerebras, you will lead the design of mechanical systems for our next-generation wafer-scale engine. Your responsibilities will include ensuring compliance with specifications, validating manufacturability, and delivering a high-quality product in a fast-paced environment, tackling some of the most challenging problems in the rapidly evolving AI space. In this role, you will develop mechanical infrastructure for Cerebras' custom hardware system.

    - Rapidly iterate on designs and analysis to inform high-level systems trades and steer overall product direction.
    - Provide comprehensive support for

  • Head of IT

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse
    posted 66 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +3 more
    Unspecified Other - Staff Plus Salary not disclosed

    About The Role: We are hiring a Head of IT to build and run the internal technology backbone of a company that is scaling quickly and operating at the edge of AI hardware and software. This is not a steady-state IT leadership job. It is a build-and-scale role for someone who thrives when the ground is moving. You will own the systems that Cerebras employees, contractors, and executives rely on every day: laptops, identity, SaaS, networking, collaboration, endpoint security, internal support, and the IT controls that a

  • Senior Mechanical Engineer

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 156 days ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Data - Senior $190K-$230K Equity

    About The Role: As a Senior Mechanical Engineer at Cerebras, you will lead the design of mechanical systems for our next-generation wafer-scale engine. Your responsibilities will include ensuring compliance with specifications, validating manufacturability, and delivering a high-quality product in a fast-paced environment, tackling some of the most challenging problems in the rapidly evolving AI space. In this role, you will develop mechanical infrastructure for Cerebras' custom hardware system.

    - Rapidly iterate on designs and analysis to inform high-level systems trades and steer overall product direction.
    - Provide

  • Distributed Software Engineer

    Cerebras Systems - Bengaluru, Karnataka, India; Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 155 days ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Engineering - Mid Salary not disclosed

    About The Role: Cerebras Systems is a pioneer in large-scale AI supercomputers. These multi-exaflop supercomputers are deployed in some of the biggest datacenters and are built using our Wafer-Scale Cluster technology, a cluster of several Wafer Scale Engine (WSE) chips. The Cluster engineering team is responsible for delivering all cluster-related software.

    Responsibilities:
    - Automate bare-metal configuration of networking, OS, and application software in large clusters of Cerebras WSE, servers, and switches.
    - Additional push button

  • Software Architect – Manufacturing Test

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    2w ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Engineering - Mid $204K-$245K Equity

    About the Role: As the Software Architect for Cerebras' manufacturing test platform, you will lead a team of Full Stack Engineers in designing and delivering the end-to-end software systems that power manufacturing test across every stage of our product lifecycle - from individual components to complete Cerebras systems. The platform spans both cloud infrastructure and physical client-server infrastructure deployed across our manufacturing facilities, and you will own the technical vision, architecture, and roadmap across the full stack. Working closely with hardware design, test engineering, operations,

  • Sr. Technical Staff

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 36 days ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Engineering - Senior $250K-$275K

    Cerebras Systems Inc. has multiple openings for Sr. Technical Staff.

    Title: Sr. Technical Staff

    Job Duties:
    - Post-silicon validation of Cerebras Wafer Scale Engines. Test and debug issues on new silicon.
    - Test, analyze, and characterize high-speed serial interfaces to verify compliance with hardware specifications, record performance data, and recommend design modifications to optimize functionality.
    - Work with the silicon and operations team to test, bring up, and run burn-in on wafer-scale systems.
    - Support manufacturing operations to utilize the wafer bring-up flow. Perform wafer

  • Infrastructure Hardware Technical Program Manager (Server and Network Systems)

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    posted 114 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +4 more
    Unspecified Product - Senior Salary not disclosed

    As an Infrastructure Hardware Technical Program Manager (Server and Network Systems) on the Cluster Architecture Team, you will drive end-to-end delivery of server and network platform programs across Cerebras CS-3-based AI clusters - from requirements and vendor selection through lab bring-up, qualification, and production rollout. You will be the execution owner for multi-team programs spanning OEM/ODM partners, component vendors, internal software/runtime teams and architects, validation/QA, and deployment/operations. This role is intentionally technical: you must understand server, network, and

  • Software Development Engineer in Test (Cloud)

    Cerebras Systems - Bengaluru, Karnataka, India
    Indexed from Greenhouse
    posted 33 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +3 more
    Unspecified Data - Mid Salary not disclosed

    About The Team: The Cloud Quality team is responsible for the confidence behind every production release shipped to Cerebras Inference Cloud. We work closely with platform, infrastructure, ML systems, and product engineering teams to ensure that rapid iteration never comes at the expense of customer trust. Our environment spans distributed cloud systems, multi-region deployments, APIs, orchestration layers, and hardware-backed inference services. We are scaling quickly. The systems are growing in complexity, traffic is increasing rapidly, and release velocity remains high. We need engineers

  • Senior Hardware Technical Program Manager

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 82 days ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Product - Senior $180K-$230K Equity

    The Role: As a Senior Hardware Technical Program Manager at Cerebras, you will spearhead operational excellence for our high-performance AI compute systems and data centers. You will own the end-to-end hardware schedule for design and engineering improvements, report on engineering issues, and define mitigation strategies. You will own the schedule, implementation, and software integration of hardware changes. You will collaborate closely with electrical and system engineering, manufacturing, supply chain, and system software to drive end-to-end schedule of improvements to our wafer-scale engine supercomputers. Your role

  • Full Stack LLM Engineer

    Cerebras Systems - Toronto, Ontario, Canada
    Indexed from Greenhouse
    posted 330 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +3 more
    Unspecified Engineering - Mid Salary not disclosed

    About the Role: We are seeking a versatile and experienced engineer to join our Inference Core Model Bringup team. This team is responsible for rapidly bringing up state-of-the-art open-source models (such as LLaMA, Qwen, etc.) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bringup environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for AI

  • Sr. Member of Technical Staff

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 36 days ago

    Why we showed this

    Description: "systems"; Description: "cerebras"; +3 more
    Unspecified Engineering - Senior $230K-$250K

    Cerebras Systems Inc. has multiple openings for Sr. Member of Technical Staff.

    Title: Sr. Member of Technical Staff

    Job Duties:
    - Design and develop software features that support system resiliency and high availability, including automated recovery mechanisms and fault-tolerant architecture across distributed environments.
    - Develop and maintain cloud-based deployment workflows for AI inference software using AWS tools and services to support low-latency and scalable system performance.
    - Develop Python-based scripts and APIs to streamline data preprocessing, inference execution, and post-processing for real-time inference tasks.

  • Advanced Technology: Compiler Engineer

    Cerebras Systems - Sunnyvale, CA; Vancouver, British Columbia, Canada
    Indexed from Greenhouse
    posted 75 days ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Data - Mid Salary not disclosed

    About The Team: Cerebras builds wafer-scale AI processors: single chips delivering tens of PB/s of memory bandwidth and a dataflow architecture that accelerates at a granularity no multi-device system can match. The Advanced Technology Group (ATG) is Cerebras' pathfinding organization. We work ahead of product to explore new architectures, demonstrate breakthrough performance on scientific and AI workloads, and shape the technical roadmap for future Cerebras hardware and software. Our work regularly appears at top-tier venues (Supercomputing, SIAM, IEEE, and NeurIPS) and

  • Manufacturing Test Development Engineer

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 156 days ago

    Why we showed this

    Description: "cerebras"; Description: "systems"; +3 more
    Unspecified Data - Mid $170K-$210K Equity

    About The Role: As a Test Development Engineer on our manufacturing team, you will work with diagnostics, system design, manufacturing, and quality teams to develop test automation solutions for our products from PCBA to system level. You will also work closely with our contract manufacturing sites to fulfill a complete test automation solution for manufacturing test data, yield improvement, and traceability.

    Responsibilities:
    - Develop and design manufacturing test automation software/scripts to test Cerebras products from PCBA to system level.
    - Develop and implement GUI solutions

  • Senior ML Systems Engineer

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    posted 121 days ago

    Why we showed this

    Description: "systems", Description: "cerebras"
    +4
    Unspecified Engineering - Senior Salary not disclosed

    Senior ML Systems Engineer Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role We are seeking a versatile and experienced engineer to join our SOTA Training Platform team. This team is responsible for rapidly bringing up state-of-the-art open-source models (like LLaMA, Qwen, etc.) or customer-provided proprietary models on our Cerebras CSX systems. Success in this role requires a system-minded generalist who thrives in fast-paced bring-up environments and is comfortable working across the entire Cerebras software stack. Your work will play a critical role in achieving unprecedented levels of performance, efficiency, and scalability for

  • Performance & Reliability Engineer

    Cerebras Systems - Sunnyvale, CA; Toronto, Ontario, Canada
    Indexed from Greenhouse
    posted 201 days ago

    Why we showed this

    Description: "systems", Description: "cerebras"
    +3
    Unspecified Data - Mid Salary not disclosed

    Performance & Reliability Engineer Sunnyvale, CA; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role Join Cerebras as a Performance & Reliability Engineer within our innovative Co-Design and Next Generation Team. Our groundbreaking CS-3 system has set new benchmarks in high-performance ML training and inference solutions. It leverages a dinner-plate sized chip with 44GB of on-chip memory to surpass traditional hardware capabilities. This role focuses on characterizing and optimizing the performance and reliability of state-of-the-art AI models running on Cerebras' breakthrough hardware. Responsibilities - Characterize and enhance the performance and reliability of advanced ML hardware/software

  • Distributed Systems Cluster Security Software – Engineering Lead

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse Comp disclosed in posting
    posted 446 days ago

    Why we showed this

    Description: "systems", Description: "cerebras"
    +4
    Unspecified Data - Senior $140K-$240K

    Distributed Systems Cluster Security Software – Engineering Lead Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role In this role, you will be the security czar for Cerebras's AI cluster product. Such AI clusters have hundreds of wafer-scale accelerator systems, thousands of high-end servers, and several thousand networking ports including switches. In addition, there will be network-attached storage, all in a large-scale datacenter. You will ensure that Cerebras's large-scale AI clusters are secured through first-principles, best-practice, security-first engineering. A Cerebras cluster involves complex HW components, networking and a vertically integrated cluster management software

  • ML Research Engineer (Inference)

    Cerebras Systems - Bengaluru, Karnataka, India
    Indexed from Greenhouse
    posted 66 days ago

    Why we showed this

    Description: "cerebras", Description: "systems"
    +3
    Unspecified Data - Mid Salary not disclosed

    ML Research Engineer (Inference) Bengaluru, Karnataka, India Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role As a Research Engineer on the Inference ML team at Cerebras Systems, you will adapt today's most advanced language and vision models to run efficiently on our flagship Cerebras architecture. You'll work alongside ML researchers and engineers to design, prototype, validate, and optimize models, gaining end-to-end exposure to cutting-edge inference research on the world's fastest AI accelerator. You will focus on pushing the frontier of speculative decoding, large-model pruning and compression, sparse attention, and sparsity-driven techniques to deliver low-latency,

  • Cluster UI Full Stack, Engineering Lead

    Cerebras Systems - Bengaluru, Karnataka, India; Toronto, Ontario, Canada
    Indexed from Greenhouse
    posted 136 days ago

    Why we showed this

    Description: "cerebras", Description: "systems"
    +3
    Unspecified Data - Senior Salary not disclosed

    Cluster UI Full Stack, Engineering Lead Bengaluru, Karnataka, India; Toronto, Ontario, Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Role In this role, you will be building a world-class UI-based large-scale cluster management portal. This portal will act as a one-stop shop for all operations and maintenance of Cerebras clusters, such as cluster bring-up deployment (day 0/1/2), job management, and health management, to name a few. Cerebras AI clusters may have thousands of wafer-scale accelerator systems, several thousand high-end servers, and several thousand networking ports including switches. Responsibilities - Be the primary engineering face and owner

  • IT SRE Team Lead

    Cerebras Systems - Sunnyvale, CA
    Indexed from Greenhouse
    posted 66 days ago

    Why we showed this

    Description: "cerebras", Description: "systems"
    +3
    Unspecified Engineering - Senior Salary not disclosed

    IT SRE Team Lead Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role We are seeking an experienced IT SRE Team Lead to build and run the reliability function for Cerebras' internal technology estate. The IT SRE Team Lead will be responsible for the availability, performance, and operational quality of the systems Cerebras employees rely on every day, including identity, endpoint management, collaboration, SaaS, and internal networking. The right candidate will bring a software engineering mindset to IT operations, treating corporate infrastructure as code, with measurable SLOs, automated remediation, and a ruthless focus on eliminating toil.

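  • Senior Technical Program Manager – AI Infrastructure, Site Operations

    Cerebras Systems - Sunnyvale, CA
    posted 179 days ago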

    Why we showed this

    Description: "cerebras", Description: "systems"
    +3
    Unspecified Product - Senior Salary not disclosed

    Senior Technical Program Manager – AI Infrastructure, Site Operations Sunnyvale, CA Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About The Role This Sr. TPM role owns site and data center operations programs supporting Cerebras' AI Cloud and customer deployments. The position sits at Sunnyvale HQ and works closely with Hardware Engineering, Inference Engineering, and Operations leadership to ensure Cerebras systems are reliably deployed, operated, and scaled. This is a highly technical, execution-focused TPM role with strong emphasis on operational readiness, cross-functional coordination, and metrics/KPIs. Responsibilities - Own end-to-end technical programs for data center and site operations - Act as

  • Security & IT General Opportunities

    Cerebras Systems - Sunnyvale CA or Toronto Canada
    Indexed from Greenhouse
    2w ago

    Why we showed this

    Description: "cerebras", Description: "systems"
    +3
    Unspecified Other - Mid Salary not disclosed

    Security & IT General Opportunities Sunnyvale CA or Toronto Canada Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras , to deploy 750 megawatts of scale, transforming key workloads with ultra high-speed inference. Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation. About the Team Our IT & Security team sits at the intersection of security, infrastructure, and cutting-edge AI systems. The team plays a critical role in ensuring Cerebras' environments are secure, reliable, scalable, and ready to support customers operating at the frontier of AI. We are looking for people who care deeply about operational excellence, security best practices, automation, and building systems that can support a rapidly growing organization. What You May Work On Depending on the role and team needs, you

Take this list with you

Download the 30 matching jobs in any format - read offline, archive, or hand to an AI assistant with your resume to find the best fits.

The AI-ready prompt is a pre-written question you can paste into Claude, ChatGPT, Gemini, or Perplexity along with your resume. We never see your resume; this happens in your AI client of choice.

AI agent reading directly? Same data lives at /api/jobs.json?q=Cerebras+Systems. See /llms.txt and /api/openapi.json for the full schema.
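
For agents that prefer code, here is a minimal sketch of querying that endpoint. The page gives only a relative path, so `BASE_URL` is a hypothetical placeholder, and the helper names `jobs_url` and `fetch_jobs` are illustrative rather than part of the site. The response shape is not assumed here; consult /api/openapi.json for the actual schema.

```python
import json
from urllib.parse import urlencode, urljoin
from urllib.request import urlopen

# Hypothetical placeholder: the page only gives a relative path,
# so the real host must be substituted here.
BASE_URL = "https://example.invalid"


def jobs_url(query: str, base_url: str = BASE_URL) -> str:
    """Build the absolute URL for /api/jobs.json with the q parameter."""
    # urlencode encodes spaces as '+', matching q=Cerebras+Systems.
    return urljoin(base_url, "/api/jobs.json") + "?" + urlencode({"q": query})


def fetch_jobs(query: str, base_url: str = BASE_URL):
    """Fetch and decode the JSON payload without assuming its structure."""
    with urlopen(jobs_url(query, base_url), timeout=10) as resp:
        return json.load(resp)
```

Calling `jobs_url("Cerebras Systems")` reproduces the query string quoted above; `fetch_jobs` only works once `BASE_URL` points at the real host.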
