Senior Site Reliability / GitOps Engineer
Canonical is hiring a Senior Site Reliability / GitOps Engineer to join the Information Systems (IS) team. This role focuses on automation-first IT operations, infrastructure as code, and managing both private and public cloud environments. The engineer will contribute to the automation of software operations, provide feedback to developers on product behavior at scale, and collaborate with globally distributed teams to deliver reliable services for Canonical’s production environment and the Ubuntu community.
Responsibilities
* Drive the development of automation and GitOps initiatives within the team as an embedded tech lead
* Collaborate with the IS architect to align solutions with the IS architecture vision
* Design and architect services that IS can offer as products
* Develop infrastructure as code practices, increasing automation and improving IaC processes
* Automate software operations for reusability and consistency across private and public clouds, considering distributed systems complexity
* Maintain operational responsibility for Canonical's core services, networks, and infrastructure
* Develop skills in troubleshooting, capacity planning, and performance investigation
* Set up, maintain, and use observability tools (Prometheus, Grafana, Elasticsearch)
* Design and maintain monitoring and alerting
* Provide assistance and collaborate with globally distributed engineering, operations, and support peers
* Carry out larger projects with uninterrupted development time to automate manual tasks
* Share experience and best practices in design sessions, mentorship, and collaborative work
* Take final responsibility for time-critical escalations
What we are looking for
* Modern view on hosting architecture driven by infrastructure as code across private and public clouds
* Product mindset focused on developing products rather than just solutions
* Python software development experience with large projects
* Experience with Kubernetes or other container orchestration systems
* Proven ability to manage and deploy cloud infrastructure with code
* Practical knowledge of Linux networking, routing, and firewalls
* Familiarity with various forms of Linux storage (e.g., Ceph, databases)
* Hands-on experience administering enterprise Linux servers
* Extensive knowledge of cloud computing concepts and technologies
* Bachelor’s degree or higher, preferably in computer science or related engineering field
* Clear and effective communication in English (email, chat, video/voice calls, and in-person)
* Motivated, able to troubleshoot from kernel to web, and willing to ask for help when needed
* Flexible, quick to learn, and open to new things
* Willingness to adapt to fast-changing environments
* Comfortable working within distributed teams
* Passion for and familiarity with open source, especially Ubuntu or Debian
What we offer
* Distributed work environment with twice-yearly in-person team sprints
* Personal learning and development budget of USD 2,000 per year
* Annual compensation review
* Recognition rewards
* Annual holiday leave
* Maternity and paternity leave
* Team Member Assistance Program & Wellness Platform
* Opportunity to travel to new locations to meet colleagues
* Travel upgrades and benefits for long-haul company events
About Canonical
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the publisher of Ubuntu, Canonical supports a world where software is open, collaborative, and scalable. We recruit on a global basis and value excellence in every role. Most colleagues work remotely, with occasional in-person team events.
Canonical is committed to fostering a workplace free from discrimination. We value diversity of experience and perspective and provide fair consideration to all applicants.
Seniority level
* Mid-Senior level
Employment type
* Full-time
Job function
* Engineering and Information Technology
Industries
* Software Development