Job Description
We are looking for an innovative AI/LLM Engineer to drive the advancement of our conversational AI capabilities. You'll join our Tech team, focusing on prompt engineering, multi-agent orchestration, automated testing, and deploying LLM integrations in production. You'll work closely with Backend Engineers to deliver world-class AI experiences to end users.
What will you do?
Core Development
- Design, optimize, and version prompts for production voice and chat LLM applications
- Architect and orchestrate multi-agent systems for complex conversations
- Build automated testing and validation frameworks for LLM outputs
- Implement prompt versioning, storage, and retrieval systems
System Integration & Deployment
- Collaborate with Backend Engineers to deploy and scale LLM-based systems
- Integrate LLMs with communication APIs (Twilio, WhatsApp, ElevenLabs)
- Implement RAG (Retrieval-Augmented Generation) solutions and vector search for multilingual environments
- Monitor performance metrics and conversation quality
Research & Innovation
- Research and prototype multi-agent frameworks (open-source and commercial)
- Experiment with cutting-edge conversational AI and real-time speech processing techniques
- Contribute to evolving the team's LLMOps best practices
- Continuously improve conversational quality and RAG pipelines, and reduce latency
Qualifications
Must have
- 2+ years of hands-on experience with LLMs (OpenAI or similar, including open-source models)
- Strong knowledge of prompt engineering and LLM optimization strategies
- Experience evaluating LLMs: designing and running evaluation frameworks, creating test datasets, and defining success metrics
- Familiarity with automated testing pipelines, including building CI/CD-integrated eval systems that run on every prompt change
- Experience with multi-agent architectures, from designing to developing the orchestration of complex LLM systems
- Good understanding of transformer architectures and proficiency in LLM frameworks such as LangChain, LlamaIndex, or similar
- Proficiency in Python
- Experience with RAG pipelines and vector databases
- Experience in cross-functional teams and the ability to work in fast-moving environments where you own outcomes, not just tasks. You're comfortable with ambiguity and excited by the challenge of figuring things out.
Nice to have
- Experience in the healthcare industry
- LLM integration with voice platforms (Twilio, ElevenLabs)
- Background in conversational AI, chatbots, voice assistants
- Knowledge of real-time speech processing and multi-modal systems
- Functional programming principles and advanced NLP
- Exposure to OOP stacks (.NET, PHP)
- Understanding of security and privacy in conversational AI
Additional Information
✨ What we offer:
We value a healthy work-life balance and long-term growth. Benefits vary by location, but here’s what you can expect:
Shared benefits
- 100% remote work, with the option to join our offices in Bologna or Barcelona
- Stock options plan after 6 months
- One extra day off for your birthday
- Access to iFeel – our mental wellbeing platform
Italy-specific
- €8/day meal vouchers – lunch is covered if you're in the Bologna office
- Private health coverage via Metasalute
Spain-specific
- ❤️ Comprehensive private health insurance with Adeslas
- Flexoh – versatile compensation platform
- Wellhub – gym & wellness network membership
- Language courses
How does the recruitment process work?
1. HR interview – a friendly chat to get to know you and your motivations, and to tell you more about Tuotempo, our culture, and the team.
2. Technical interview – a deep-dive with our Tech Managers, including practical discussions or small exercises focused on LLMs, multi-agent systems, prompt design, and evaluation workflows.
3. Functional interview – a conversation with our Product Managers to understand how you collaborate cross-functionally and to align on the AI product domain.