Senior DevOps Engineer (Remote) Madrid, Spain | Bucharest, Romania | Berlin, Germany | London, United Kingdom of Great Britain and Northern Ireland | Belgrade, Serbia | Barcelona, Spain | Cluj-Napoca, Romania Full Time At A5 Labs, we are committed to creating cutting-edge, AI-driven experiences that redefine industry standards. If you’ve ever played an online casino game, you may have already encountered our technology and innovation. The Role We’re seeking a Senior DevOps Engineer with expertise in MLOps and LLMOps to join our team. In this role, you will help us build and operate the infrastructure behind our poker applications, ensuring that it is secure, scalable, and efficient. You will work closely with product and engineering teams to enable a self-service approach, allowing developers to ship features faster and more reliably. From designing cloud infrastructure to automating deployments and establishing monitoring and incident management practices, you’ll be at the heart of how we scale our platform and teams.You’ll also play a key role in supporting our MLOps and LLMOps workflows, helping scale AI model deployment and experimentation across our platform.Key Objectives Design and build cloud infrastructure to run poker applications at scale, optimised for learning and exploration by recreational players worldwide. Optimise development workflows by automating builds, testing, and deployments while ensuring fast, reliable infrastructure to minimise friction and maximise developer focus.Establish and maintain robust MLOps and LLMOps workflows to support the scalable development, reliable deployment, and continuous optimisation of LLMs at scale.What you bring to the tableExperience 7+ years in DevOps / Infrastructure Engineering, including AI / ML workloads in production.Cloud & EfficiencyStrong AWS and Cloudflare skills with hands-on experience in EB, ECS, RDS, MSK / Kinesis, CloudWatch, IAM, Lambda, S3, Route 53, etc., and a proven track record in infrastructure cost optimisation.Multi-region & ScalingExperience designing highly available, scalable, multi-region systems with disaster recovery strategies and cost optimisation.Containerisation & OrchestrationHands-on experience with Docker and orchestration platforms such as ECS, EKS, or Kubernetes.Security & ReliabilityGood understanding of cloud security best practices to ensure safe and resilient systems.CI / CD & ObservabilityExperience with CI / CD pipelines, such as Bitbucket Pipelines or GitHub Actions, and observability tools like OpenTelemetry and Datadog or similar.Infrastructure as CodeProficient with Terraform or Pulumi for managing infrastructure.MLOps & LLMOpsExperience supporting ML workflows and model lifecycle management using tools like MLflow and SageMaker.Understanding of model versioning, experiment tracking, feature stores, scalable deployment, and challenges around LLM inference, fine-tuning, and performance observability.Experience setting up incident processes, participating in on-call rotations, and resolving production issues.Worked closely with engineering teams to build tailored infrastructure, provide reusable blueprints and self-service tooling, and promote DevOps best practices.What We OfferA fast-moving environment with minimal bureaucracy and quick decision-makingThe opportunity to work on cutting-edge AI products and servicesA strong focus on high-quality technical solutionsHigh autonomy and rapid feedback cyclesA great chance to learn how to play pokerRemote-friendly work cultureUnlimited vacation policyClose collaboration with engineering teams and meaningful contributions to a shared product visionThis role is part of AceGuardian, a cutting-edge team within A5 Labs. AceGuardian is focused on building advanced AI agents through reinforcement learning, game-solving, fine-tuning, and planning. These AI agents tackle challenges such as anti-cheat detection (including collusion and bots) and optimising gameplay across various games. The team operates in stealth mode and is composed of experts in AI, machine learning, and game development, all working together to revolutionise both gaming and real-world problem-solving. By joining this team, you’ll contribute to innovative projects that push the boundaries of AI in the gaming industry while working alongside some of the brightest minds in the field.J-18808-LjbffrJ-18808-Ljbffr
#J-18808-Ljbffr