MITO AI
Location: Madrid (Hybrid)
About MITO AI
MITO AI is a collaborative, AI-native platform reinventing how films, commercials, and music videos are made. We are building the operating system for a $300B+ global video production industry as it shifts to AI-native workflows.
Headquartered between San Francisco (team of 2) and Madrid (team of 14), MITO combines state-of-the-art AI models for image, video, and audio with professional-grade editing tools in a multiplayer, browser-based canvas. Creators and teams can also bring ideas to life by directing an AI agent that generates scripts, scenes and edits in real time.
MITO was founded by Iñaki Berenguer (Master MIT, PhD Cambridge, 5x founder and CEO, founded social photo startup Pixable acquired by SingTel, founded of CoverWallet $1.6B premium revenue and $300M exit in 4 years, founded AI infrastructure company iPronics, which has raised $50M) & Danny Saltaren (award-winning product designer at 2 tech unicorns, National Design Award recipient) and Arantxa Barcia (Art Director).
We are backed by Lightspeed Venture Partners and investors including Kibo, Kfund, Sequoia and a16z scouts, LifeX, Everywhere, 5 unicorn founders, and execs from Github and Roblox.
Role Overview
This role designs and ships the intelligence layer behind storyboarding, scene generation, multi-model orchestration, and creative assistance. You will turn raw model capability into usable creative leverage.
Key Responsibilities
AI Systems Design
* Architect orchestration across multiple AI video, image, and audio models
* Design structured prompt frameworks for scene-based generation
* Build embedding, retrieval, and context systems across projects and assets
* Create intelligent agents that assist in storytelling and editing workflows
RAG & Context Systems
* Develop retrieval systems across scripts, scenes, and creative references
* Build context-aware AI assistants for multi-scene narrative projects
* Optimize latency and cost across multi-step generation flows
Experimentation & Shipping
* Rapidly prototype features using LLMs, and frontier video models
* Design evaluation frameworks for creative quality, consistency, and coherence
* Implement feedback loops to improve outputs over time
Integration & Performance
* Build APIs in Node.js and Python to serve AI-driven functionality
* Work closely with data engineering on asset pipelines and traceability
* Ensure outputs are reliable across collaborative environments
Technical Stack
* Node.js • Python
* LLM APIs (OpenAI, Anthropic, Gemini)
* Video & Image AI models
* Vector search systems
* Docker • Cloud infrastructure
Experience with multimodal AI systems is highly valuable.
Ideal Profile
* 3+ years in applied AI, ML systems, or AI-focused full-stack engineering
* Hands-on experience with RAG, embeddings, and prompt design
* Strong intuition for where AI adds leverage vs noise
* Comfortable orchestrating multiple models into a coherent workflow
* Fast experimenter with product intuition
* Excited by creative tools, storytelling, and video systems
What Success Looks Like
* AI features feel intentional, not gimmicky
* Storyboarding and scene generation feel magical but controllable
* Multi-model orchestration is seamless
* Creative professionals feel empowered, not replaced
* MITO ships intelligent capabilities faster than competitors