Baseten jobs
30 matches, filter-driven and evidence-linked.
Filters
0 active
Remote, hybrid, onsite
State
Shift type
Weekend work
Country
Cover letter
Assessment
Salary type
Equity type
Family-building benefits
Benefit evidence
- posted 33 days ago
Why we showed this
Description: "baseten"Employer: "baseten"+1
Remote Engineering - Mid Salary not disclosed EquitySRE San Francisco, California, United States, New York, Remote ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As a Site Reliability Engineer at Baseten, you'll define and codify the gold standards of day 2 operations for our ML infrastructure platform. You'll envision and build robust systems, processes, automations, and observability tooling that keep our platform reliable at scale - and that empower the broader organization to operate confidently. You'll work closely with engineering, forward-deployed and product teams: learning from recurring failure patterns, turning tribal knowledge into automated mitigations, and raising the operational floor for the entire company. EXAMPLE INITIATIVES You'll work on projects like these as part of the SRE team: - Improve Baseten SRE Practices, by instrumenting SLOs and SLIs, improving alerting and observability for all services. - Building AI-assisted tooling for incident triage and response. RESPONSIBILITIES - Own the reliability of Baseten's multi-cloud Kubernetes infrastructure, including incident response, post-mortems, and remediation tracking. - Build and maintain observability infrastructure - metrics, logging, dashboards, and alerting - as code. - Author, validate,
-
Post-Training Research Scientist
Baseten - San Francisco, California, United StatesIndexed from Ashbyposted 88 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Mid Salary not disclosed EquityPost-Training Research Scientist San Francisco, California, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE This role sits at the frontier of our research agenda. You will pursue open problems at the intersection of post-training methodology and performant inference, and then collaborate with research engineering to translate findings into production systems. A meaningful portion of your time will be dedicated to research that deepens our understanding of how models learn, alignment, and architectural efficiency - questions that may not have immediate product application. The remainder will be directed toward research that solves concrete problems for Baseten's platform and customers, who are the fastest growing AI companies in the world like Cursor, Lovable, and Notion. We are looking for someone with sharp research taste and genuine creative instinct for problem selection. Someone who can identify questions that matter, design clean experiments to answer them, and push the state of the art. The environment here is not theoretical, but rather research that can be validated with eager customers who are serving billions of
-
Software Engineer- BIS (Baseten Inference Stack)
Baseten - San Francisco, California, United StatesIndexed from Ashby1w agoWhy we showed this
Description: "baseten"Employer: "baseten"+2
Unspecified Data - Mid Salary not disclosed EquitySoftware Engineer- BIS (Baseten Inference Stack) San Francisco, California, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE Baseten's Inference Stack team builds the distributed runtime that powers large-scale LLM inference across our platform. We operate at the intersection of distributed systems, model performance, infrastructure, and developer experience. We enable customers to deploy and operate cutting-edge LLM models with industry-leading performance, scalability, reliability, and ease of use. As a Software Engineer on the Inference Stack team, you'll work across the stack - from the developer experience customers use to deploy models, the libraries used for features like tool calling and reasoning, all the way down to the systems we use to orchestrate deployments in Kubernetes and route traffic efficiently. This is an ideal role for engineers who enjoy owning systems in production, solving hard integration problems, and making complex infrastructure simple and reliable for users. EXAMPLE INITIATIVES Blog Posts https://www.baseten.co/blog/nvidia-dynamo-day-baseten-inference-stack/ https://www.baseten.co/blog/how-baseten-achieved-2x-faster-inference-with-nvidia-dynamo/ https://www.baseten.co/blog/how-baseten-multi-cloud-capacity-management-mcm-powers-cloud-self-hosted-and-hybr/#comparing-deployment-options-cloud-vs-self-hosted-vs-hybrid RESPONSIBILITIES - Develop infrastructure and orchestration systems for deploying and managing large-scale distributed LLM inference - Work across the
- 3w ago
Why we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Hr - Mid Salary not disclosed EquityExecutive Recruiter San Francisco, California, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We're hiring an Executive Recruiter to own leadership hiring across Baseten - partnering directly with our CEO, executive team, and functional leaders to bring in the VP and C-suite talent that will define how we scale. Leadership hires are among the highest-leverage decisions a company makes, and at this stage of growth, we need someone who can operate as a true strategic partner - not just a search executor. You'll shape how we define and evaluate leadership, build rigorous and confidential search processes from scratch, and close exceptional candidates who could work anywhere. This is a senior individual contributor role with outsized company impact. You'll reduce our dependence on external search firms, build Baseten's internal executive recruiting capability, and set the standard for how leadership hiring works here. RESPONSIBILITIES - Own end-to-end executive searches. Lead VP- and C-suite hiring across Engineering, GTM, Finance, Product, and Operations - from intake and role scoping through offer close. You'll run
-
Content Engineer
Baseten - San Francisco, California, United States, New York, RemoteIndexed from Ashbyposted 65 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Remote Data - Mid Salary not disclosed EquityContent Engineer San Francisco, California, United States, New York, Remote ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE This is a 3-month contract-to-hire position, with the expectation of converting to full-time for the right candidate. You'll join a small team in Baseten spearheading how we show up natively in the AI era. You'll contribute to our content strategy with a focus on identifying, experimenting, and executing written content in the highest leverage channels that can reach the strongest intent users. This position will directly shape how we show up and present ourselves in front of developers with high intent of using open source and custom models. This isn't a traditional content or writing role. You'll operate iteratively at the intersection of growth, product marketing, and community building. Ultimately, you'll be creating and running the pipeline for generating high viewership content on external sites and define our playbook of how we approach discoverability in AI engines. When applying, please submit a favorite piece of your own writing that is relevant to your work
-
Engineering Manager, Cloud Platform
Baseten - San Francisco, California, United StatesIndexed from Ashby4 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Senior Salary not disclosed EquityEngineering Manager, Cloud Platform San Francisco, California, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As the Engineering Manager for Baseten's Cloud Platform team, you will directly manage a team of cloud platform engineers responsible for building the systems and processes that keep our infrastructure scalable, reliable, and efficient - from automated deployments and monitoring to performance optimization and incident response. You are a people-first leader with a strong cloud infrastructure background. You set a high bar for reliability and operational excellence, engage credibly in technical discussions and code reviews, and know how to build a culture of ownership and accountability. You'll spend most of your time close to the work: unblocking your team, shaping technical direction on day-to-day decisions, and developing your engineers. At Baseten, we work closely with our users to understand their struggles operationalizing ML - you'll keep your team connected to that mission and translate user learnings into better infrastructure. RESPONSIBILITIES - Recruit, hire, and grow a high-performing team of cloud platform engineers; provide ongoing coaching,
- posted 53 days ago
Why we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Mid Salary not disclosed EquityGTM Engineer San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE The GTM tooling landscape is changing fast and the teams that win are the ones that adapt and iterate the fastest. This role exists to make sure Baseten is one of them. You'll design, build, and ship AI-powered workflows that scale our sales, marketing, and support functions as a competitive advantage. We have a sprawling stack with real redundancy, real gaps, and real opportunity - and we want someone who can walk in, audit what we have, identify what we're missing, and start shipping fast. You know when to reach for Clay and when to build something custom in Claude Code. You think two to three steps ahead on how the thing you're building today fits into the broader systems architecture tomorrow. And you bring a point of view - on our stack, on what we should be building, and on where AI can do something that low-code tooling simply can't. RESPONSIBILITIES - Own the
-
Post-Training Research Engineer
Baseten - San Francisco, California, United StatesIndexed from Ashbyposted 82 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Mid Salary not disclosed EquityPost-Training Research Engineer San Francisco, California, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. We are looking for an engineer with strong experience in machine learning and solid foundations in maths and computer science to join our growing Post-Training team at Baseten. Custom models are instrumental to the success of Baseten customers. By inference volume, the overwhelming majority of traffic at Baseten is to and from models that have been post-trained in some way, whether that be through reinforcement learning, supervised finetuning, a recent technique from the literature, or an in-house research technique from Baseten. The Post-Training team is responsible for the success of our customers' post-trained models, and we employ a wide array of techniques to produce models that are more efficient and higher quality than even the biggest closed source models for the customer's specific needs. Your role as a research engineer is to build the in-house tooling to support all of this. We care about training a wide spectrum of different model architectures with a variety of techniques efficiently
-
Product Manager, Developer Experience
Baseten - San Francisco, California, United StatesIndexed from Ashby3 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Engineering - Senior Salary not disclosed EquityProduct Manager, Developer Experience San Francisco, California, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. Product at Baseten Product at Baseten is a nascent function. Our company today has a strong engineering culture, is heavily customer-obsessed, and moves fast. We're building the product function now, and you'd be one of the first people who will help define it. You'll work directly with our founders and with some of the best systems and infrastructure engineers in the world, and you'll set the standard for building building great AI Infrastructure. PMs at Baseten don't sit above engineers - you earn ownership by being technical, finding the truth in front of customers, building great cross-functional relationships, and shipping great product experiences. The role Getting a model into production still takes real expertise - choosing a serving engine, sizing hardware, tuning it, wiring it into an app. We want a developer to go from "it runs on my laptop" to "it's serving production traffic" in minutes, on their own. You'll own the entire experience a developer
- posted 493 days ago
Why we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Engineering - Mid Salary not disclosed EquitySales Development Representative New York, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As a Sales Development Representative at Baseten, you will be helping build a qualified pipeline by engaging with potential customers. This includes outbound and inbound prospecting via phone, e-mail, LinkedIn, or whatever it takes to get them excited about Baseten. In this role, you will work alongside a sales and marketing team that focuses on the business needs of our customers. RESPONSIBILITIES - Build revenue pipeline by setting introductory meetings with potential businesses and key decision makers. - Diligently respond to inbound inquiries and determine potential product fit. - Help influence Baseten's product roadmap for customers and prospects. - Identify high-potential businesses and verticals and develop and execute outbound strategies to bring them to Baseten. - Stay up-to-date on market trends, competition, and industry developments. - Manage and document the progression of the sales pipeline. - Drive pre- & post-engagement at industry events (will attend multiple events in person). REQUIREMENTS - 9+ months of experience in a
-
Senior Compensation Manager
Baseten - San Francisco, California, United States, New YorkIndexed from Ashbyposted 102 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Hr - Senior Salary not disclosed EquitySenior Compensation Manager San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE Baseten is building for talent density. We believe attracting and retaining exceptional people, and ensuring they feel recognized and valued for their impact, is core to becoming the best place to work. Compensation is a critical lever in that mission. As our Compensation Manager, you will own compensation programs company-wide. You'll be a trusted advisor to senior leaders, shaping our compensation philosophy, leveling framework, and equity programs to ensure we remain competitive, principled, and performance-oriented as we scale. This role blends strategy and execution: designing clear, fair systems while moving quickly in a high-growth environment. RESPONSIBILITIES - Own and evolve Baseten's company-wide compensation strategy, philosophy, and programs. - Collaborate with leadership and HRBP to create and evolve job architecture and leveling frameworks. - Build and maintain compensation bands. Conduct regular market benchmarking to ensure comp bands and strategy remain competitive in a fast moving industry. - Partner closely with Talent to design and approve
-
Software Engineer - Internal Platform
Baseten - San Francisco, California, United States, New York, Remote, MontrealIndexed from Ashbyposted 444 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Remote Data - Mid Salary not disclosed EquitySoftware Engineer - Internal Platform San Francisco, California, United States, New York, Remote, Montreal ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As an early member of Baseten's Platform Team, you will be pivotal in building internal infrastructure to support our engineering organization. While our product provides infrastructure for AI development, your focus will be on creating robust internal systems that amplify the productivity, collaboration, and quality of work across all engineering teams by providing exceptional tooling, efficient workflows, and robust development environments. If you are passionate about elegant solutions-like streamlined monorepos, lightning-fast CI pipelines, and thoughtfully designed shared libraries-you'll thrive at Baseten. RESPONSIBILITIES - Create diverse tooling tailored to the needs of different engineering teams. - Enhance monorepo capabilities and develop project templates for consistency and efficiency. - Design and implement shared libraries focused on observability. - Improve the speed, reliability, and comprehensiveness of our CI pipelines. - Assist in designing and maintaining Terraform modules for infrastructure management. - Provide innovative solutions to enhance visibility in continuous delivery (CD) processes. -
-
Forward Deployed Engineer
Baseten - San Francisco, California, United States, New York, RemoteIndexed from Ashbyposted 807 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Remote Data - Mid Salary not disclosed EquityForward Deployed Engineer San Francisco, California, United States, New York, Remote ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As a Forward Deployed Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. You'll own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes. This role is a great fit for entrepreneurial engineers who want a front-row view into how modern companies adopt AI at scale and who enjoy working across product, software development, performance engineering, and customer-facing implementations. To be clear, this is an engineering role with hands-on coding and software development that also includes aspects of product management, technical customer success, and pre-sales solution engineering mixed in. EXAMPLE INITIATIVES Take a look at these blog posts written by members of our Forward Deployed Engineering team: - Forward Deployed Engineering on the frontier of AI - The fastest, most accurate Whisper transcription - Deploy
-
Account Executive - Industries
Baseten - San Francisco, California, United States, New YorkIndexed from Ashbyposted 86 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Sales - Mid Salary not disclosed EquityAccount Executive - Industries San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We're looking for an Enterprise Account Executive to help build and scale Baseten's Industries go-to-market motion. You'll own new business in verticals where AI adoption is accelerating but the stakes-and the buying processes-are uniquely complex: financial services, healthcare and life sciences, insurance, government, and similar industries with rigorous compliance requirements and multi-layered procurement. This is a high-autonomy role where you'll contribute directly to our Industries GTM strategy-identifying new use cases within your verticals, winning lighthouse customers that become references for their industries, and feeding signal back to product and engineering to shape our roadmap. You'll work alongside Baseten's founders, forward-deployed engineering team, and GTM leadership. WHAT YOU'LL DO - Own a revenue target and all aspects of the sales cycle from prospecting to close, including outbounding and engaging Tier 1 accounts in your assigned verticals - Drive new logo acquisition and strategic expansion within key accounts, prioritizing organizations that can serve as lighthouse
-
AI Solutions Engineer
Baseten - San Francisco, California, United States, New York, RemoteIndexed from Ashbyposted 53 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Remote Data - Mid Salary not disclosed EquityAI Solutions Engineer San Francisco, California, United States, New York, Remote ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As an AI Solutions Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. You'll own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes. This role is a great fit for entrepreneurial engineers who want a front-row view into how modern companies adopt AI at scale and who enjoy working across product, software development, performance engineering, and customer-facing implementations. To be clear, this is an engineering role with hands-on coding and software development that also includes aspects of product management, technical customer success, and pre-sales solution engineering mixed in. EXAMPLE INITIATIVES Take a look at these blog posts written by members of our Forward Deployed Engineering team: - Forward Deployed Engineering on the frontier of AI - The fastest, most accurate Whisper transcription - Deploy
-
Assistant General Counsel, Commercial
Baseten - San Francisco, California, United States, New YorkIndexed from Ashby1w agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Sales - Entry Salary not disclosed EquityAssistant General Counsel, Commercial San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE Baseten is the AI inference infrastructure that serves the traffic and endpoints behind the most innovative AI-native and enterprise products in the market. We're hiring an Assistant General Counsel, Commercial to own our commercial deal surface as we scale from a high-growth startup into an enterprise-grade AI infrastructure platform. You'll be our primary commercial contracts attorney - embedded directly in the business and partnering day-to-day with Sales, Finance, and Engineering. At Baseten, commercial and product are the same thing. What we build and sell is inference - compute, orchestration, deployment, and optimization delivered as the production backbone for our customers' products. Closing a deal here means understanding multi-tenancy, model deployment, latency and throughput SLAs, and how a customer's workload actually runs on our platform. This is a role for a commercially minded product lawyer who wants to sit at that intersection, not run a siloed contracts function. You'll own enterprise commercial contracting end
- posted 33 days ago
Why we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Senior Salary not disclosed EquityCost Analytics Lead San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We're hiring a Capacity and Infrastructure Analytics Lead to help Baseten build the analytics foundation for tracking infrastructure usage, capacity, and cloud spend across our growing AI inference platform. This is a foundational analytics role focused on understanding how compute resources are consumed, priced, and optimized across Baseten's fleet. You'll create reliable data models that bring together cloud billing exports, provider usage data, capacity data, and internal infrastructure telemetry into a unified view of cost and utilization. You'll work closely with Finance, Infrastructure, Product, and Operations to answer questions like: How efficiently are we using committed capacity? How should we forecast infrastructure needs as usage grows? RESPONSIBILITIES - Build, enhance, and maintain dashboards that track cloud cost, usage, capacity, utilization, and infrastructure efficiency across Baseten's fleet. - Ingest, clean, and model billing and usage data from multiple cloud and infrastructure providers, including sources such as cost and usage reports, provider APIs, invoices, and internal
-
Software Engineer - Model Performance
Baseten - San Francisco, California, United States, New YorkIndexed from Ashbyposted 807 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Mid Salary not disclosed EquitySoftware Engineer - Model Performance San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make significant contributions to the exciting field of LLM Inference. If you are a backend engineer who thrives on making things faster and is excited about open-source ML models, we look forward to your application. EXAMPLE INITIATIVES You'll get to work on these types of projects as part of our Model Performance team: - Baseten Embeddings Inference: The fastest embeddings solution available - The Baseten Inference Stack - Driving model performance optimization RESPONSIBILITIES - Implement, refine, and productionize cutting-edge techniques (quantization, speculative decoding, kv cache reuse, chunked prefill and LoRA) for ML model inference and infrastructure. - Deep dive into underlying codebases of TensorRT, PyTorch, TensorRT-LLM, vllm, sglang, CUDA,
-
Product Manager, Inference Platform
Baseten - San Francisco, California, United States, New YorkIndexed from Ashbyposted 72 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Engineering - Senior Salary not disclosed EquityProduct Manager, Inference Platform San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. Product at Baseten Product at Baseten is a nascent function. Our company today has a strong engineering culture, is heavily customer-obsessed, and moves fast. We're building the product function now, and you'd be one of the people who defines it. You'll work directly with our founders and with some of the best systems and AI engineers and you'll set the standard for what product looks like here. PMs at Baseten don't sit above engineers - you earn ownership by being technical, finding the truth in front of customers, building great cross-functional relationships, and just shipping great product experiences. The role Once a model is deployed, keeping it fast, reliable, and economical at scale is where production inference is won or lost. You'll own the surface that makes that happen: how deployments autoscale, how traffic is routed, how the system fails over, and how workloads scale across clusters and regions. You'll own these as products end
-
Account Executive - AI Native: Strategic
Baseten - San Francisco, California, United StatesIndexed from Ashbyposted 432 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Sales - Senior Salary not disclosed EquityAccount Executive - AI Native: Strategic San Francisco, California, United States ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We are looking for an Account Executive to join our dynamic team. This role is a great fit for those with sales experience looking to grow within a high-growth startup environment. As an Account Executive, you will prospect and close new business alongside our founders and Marketing team. You'll consult with our existing customer base and prospective customers to build meaningful relationships and shape our product and go-to-market roadmaps. RESPONSIBILITIES - Help build the foundation and processes for customer-facing teams at Baseten - Lead, negotiate, and execute new sales opportunities for Baseten - In partnership with the marketing and BDR team, develop a strong sales pipeline to support quarterly and annual sales targets - Establish and maintain relationships with key stakeholders within sales accounts - Work cross-functionally to improve the customer experience and ensure sales effectiveness REQUIREMENTS - 4-7 years of sales experience in a SaaS business - Experience in selling to technical
-
Applied AI Inference Engineer
Baseten - San Francisco, California, United States, New York, RemoteIndexed from Ashbyposted 53 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Remote Data - Mid Salary not disclosed EquityApplied AI Inference Engineer San Francisco, California, United States, New York, Remote ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE As an Applied AI Inference Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. You'll own the journey with customers from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes. This role is a great fit for entrepreneurial engineers who want a front-row view into how modern companies adopt AI at scale and who enjoy working across product, software development, performance engineering, and customer-facing implementations. To be clear, this is an engineering role with hands-on coding and software development that also includes aspects of product management, technical customer success, and pre-sales solution engineering mixed in. EXAMPLE INITIATIVES Take a look at these blog posts written by members of our Forward Deployed Engineering team: - Forward Deployed Engineering on the frontier of AI - The fastest, most accurate Whisper transcription
-
Engineering Manager, Model Library
Baseten - San Francisco, California, United States, New YorkIndexed from Ashbyposted 31 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Senior Salary not disclosed EquityEngineering Manager, Model Library San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE: You'll lead the Model Library team at Baseten - a small, high-ownership team focused on helping developers discover, evaluate, and select the right models for their specific use cases. This role is for a product-minded engineering manager who thrives in ambiguous, high-impact environments and can guide a team from early-stage product thinking through production-quality execution. You'll stay technically grounded while driving the team's work across model discovery, evaluation frameworks, and the infrastructure that powers a best-in-class model library experience. EXAMPLE INITIATIVES: - Model APIs for frontier models - Model training built for production inference - Introducing the Baseten Frontier Gateway RESPONSIBILITIES: - Lead and grow a team of engineers, including hiring, mentorship, and career development. - Own the Model Library product area end-to-end - from discovery and evaluation experiences to the APIs and tooling developers use to integrate models. - Partner with product, ML, and cross-functional stakeholders to define scope, prioritize work, and
-
Software Engineer - GPU Kernels
Baseten - San Francisco, California, United States, New YorkIndexed from Ashbyposted 331 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Mid Salary not disclosed EquitySoftware Engineer - GPU Kernels San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE We're seeking a GPU Kernel Engineer to join our team at the cutting edge of AI acceleration, where your code directly impacts the performance of state-of-the-art machine learning models. As a GPU Kernel Engineer, you'll craft the foundation that powers modern AI workloads, optimizing every microsecond of computation to enable breakthrough applications. You'll work in a fast-paced, intellectually stimulating environment where technical excellence is paramount and your contributions directly influence production systems serving millions of users across numerous products. This role offers exceptional growth potential for engineers passionate about low-level optimization and high-impact systems work. EXAMPLE INITIATIVES You'll get to work on these types of projects as part of our Model Performance team: - Baseten Embeddings Inference: The fastest embeddings solution available - The Baseten Inference Stack - Driving model performance optimization RESPONSIBILITIES Core Engineering Responsibilities - Design and implement high-performance GPU kernels for key ML operations, including matrix multiplications, attention mechanisms,
-
Engineering Manager - Model Performance
Baseten - San Francisco, California, United States, New YorkIndexed from Ashbyposted 639 days agoWhy we showed this
Description: "baseten"Employer: "baseten"+1
Unspecified Data - Senior Salary not disclosed EquityEngineering Manager - Model Performance San Francisco, California, United States, New York ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E , backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products. THE ROLE Are you passionate about advancing the frontiers of artificial intelligence while leading a team of exceptional engineers? We are looking for a Tech Lead Manager focused on ML performance and inference. This role is ideal for someone with a strong engineering background who is eager to lead and mentor a team while remaining hands-on with technology. If you thrive in a fast-paced startup environment and are excited about both leadership and technical challenges, we want to hear from you. EXAMPLE INITIATIVES You'll get to work on these types of projects as part of our Model Performance team: - Baseten Embeddings Inference: The fastest embeddings solution available - The Baseten Inference Stack - Driving model performance optimization RESPONSIBILITIES - Lead, mentor, and manage a team of engineers focused on developing and optimizing ML model inference and performance. - Oversee technical strategy and architecture decisions, driving improvements across our engineering organization. - Collaborate with
Take this list with you
Download the 30 matching jobs in any format - read offline, archive, or hand to an AI assistant with your resume to find the best fits.
Or email it to me instead
The AI-ready prompt is a pre-written question you can paste into Claude, ChatGPT, Gemini, or Perplexity along with your resume. We never see your resume; this happens in your AI client of choice.
AI agent reading directly? Same data lives at
/api/jobs.json?q=Baseten.
See /llms.txt and /api/openapi.json for the full schema.
0 selected