Research Scientist - Model Team

Mirelo AI - Berlin, Germany, Tübingen, Hybrid

Posted Dec 3, 2025

Benefits

Parental leave: Not verified
Non-birth-parent leave: Not verified
Family-building benefits
  • Fertility benefits: Not verified
  • Adoption assistance: Not verified
  • Surrogacy assistance: Not verified
Mental health support: Not verified
Relocation assistance: Not verified
Childcare support: Not verified
Learning budget: Not verified
Verification: Not verified
Salary: Not verified

Schedule

Shift type: Not verified
Weekend work: Not verified

Application

Cover letter: Not verified
Assessment: Not verified
Deadline: Not stated

Where they hire

State eligibility is not yet verified.

About this role

Mirelo AI is building the next generation of creative tools by generating realistic sound, speech and music from video. We develop cutting-edge foundational generative AI models that "unmute" silent video content and create custom, hyper-realistic audio for gaming, video platforms, and creators. Our technology empowers global storytellers to transform their content. We recently closed a $41 million Seed round co-led by Andreessen Horowitz and Index Ventures with participation from Atlantic, and are rapidly expanding across Product, Engineering, Go-to-Market, and Growth.

About the Role

At Mirelo, you'll work at the centre of how we build the next generation of multimodal video-to-audio models. This role is deeply hands-on and research-heavy: with a great H100/200-per-engineer ratio you explore and build new multimodal models and push the boundaries of what's possible in music, sound, and speech generation. You'll collaborate closely across research and engineering, run focused ablations, and translate experimental results into clear next steps for the team. From data curation to deployment, you'll help shape the full lifecycle of the models that power our products and partnerships.

Key Responsibilities

  • Design, implement and train large-scale multimodal generative models for audio generation (diffusion and/or autoregressive models).
  • Explore new modeling ideas for audio generation (music, sound, speech) while taking inspiration from the language and image domains.
  • Develop and experiment with post-training for new capabilities (fine-grained control, in/out-painting, editing, …)
  • Conduct rigorous ablation studies, get actionable insights and communicate results to the team to

Read the full description at jobs.ashbyhq.com. FewerJobs shows a source-linked preview and links to the original posting.

Apply at jobs.ashbyhq.com

Apply link not verified; last-live date unavailable.

What verified means

Verified means a displayed claim has a recorded source field, a source URL when available, and a timestamp showing when FewerJobs checked or enriched the evidence.