Experteer Overview
For a complete understanding of this opportunity and what it takes to be a successful candidate, read on.
In this role you will provide high-level technical support for our application and manage production incidents to meet SLAs. You will act as Tier-2 escalation, collaborating with DevOps, developers, architects and product owners to keep our cloud-enabled environment resilient. You will develop and maintain monitoring, runbooks and automation to improve reliability and efficiency, and ensure smooth production transitions via CI/CD.
This role sits at the intersection of operations and product delivery and shapes how we scale reliable software in Azure.
Responsibilities
- Provide high-level technical support for the application and manage production incidents per SLAs
- Serve as Tier-2 escalation for production incidents with cross-functional collaboration
- Operate and support a cloud-native Azure environment, ensuring resilience and performance
- Maintain advanced systems to enhance reliability, monitoring and operational efficiency
- Develop scalable monitoring and alerting solutions using KQL, App Insights and Azure Monitor
- Build and maintain detailed runbooks for incident resolution
- Automate repetitive diagnostic tasks with scripts and tools
- Collaborate with development to ensure smooth production transitions via CI/CD
Requirements
- Bachelor's degree in Computer Science, Information Technology or a related field, or 3+ years of professional experience
- Strong understanding of distributed systems and microservices
- Experience with event-driven architectures and messaging systems
- Experience with container orchestration, especially Kubernetes
- Knowledge of Microsoft Azure
- Experience with Infrastructure as Code (Crossplane, Terraform)
- Proficiency with CI/CD tools (GitHub Actions, ArgoCD)
- Scripting experience (Python, Bash)
- Experience with monitoring, distributed tracing and observability tools (Azure Monitor, Grafana)