FewerJobs.
All jobs

Senior AI Engineer, Voice Platform

ClickUp - United States

Posted May 20, 2026

Benefits

Parental leave
Not verified
Non-birth-parent leave
Not verified
Family-building benefits
  • Fertility benefits: Not verified
  • Adoption assistance: Not verified
  • Surrogacy assistance: Not verified
Mental health support
Not verified
Relocation assistance
Not verified
Childcare support
Not verified
Learning budget
Not verified
Verification
Not verified
Salary
Not verified
401(k) match
Not verified

Was this benefit information wrong? Tell us.

Schedule

Shift type
Not verified
Weekend work
Not verified

Application

Cover letter
Not verified
Assessment
Not verified
Deadline
Not stated

Where they hire

State eligibility is not yet verified.

About this role

Senior AI Engineer, Voice Platform United States At ClickUp, we're building the future of work: the first truly converged AI workspace unifying tasks, docs, chat, calendar, and enterprise search, all supercharged by context-driven AI. We are an AI-native company. Every team member is expected to leverage AI daily, and we evaluate AI fluency as part of our hiring process. Join us and help redefine what's possible. 🚀 Role Overview You'll own and evolve the AI systems behind ClickUp's voice platform: real-time streaming transcription, intelligent reformatting, context-aware mention detection, and voice-to-action pipelines. This is a high-impact, hands-on role where you'll push the boundaries of what voice interfaces can do inside a productivity tool used by millions. Key Responsibilities - Design, build, and optimize real-time speech-to-text pipelines (streaming ASR, VAD, audio processing) - Improve transcription accuracy through context injection (user names, teams, custom vocabulary, language detection) - Develop and maintain LLM-powered post-processing (grammar correction, filler removal, mention resolution, formatting) - Build voice-to-action systems that parse natural language into structured workspace commands - Evaluate, benchmark, and integrate ASR models (Whisper, AssemblyAI, Fireworks, etc.) for cost, latency, and accuracy - Collaborate with product and platform teams to ship voice features across MAX Desktop, Mobile, Web, and Browser Extension - Explore multimodal AI capabilities (screen + voice + text) for next-gen assistant experiences Unsure if you meet all the qualifications of this job description but are deeply excited about the role? We hire based on ambition, grit, and a passion for improving the way people

Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.

Apply at jobs.ashbyhq.com

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.

Related jobs