Changing the world starts with small steps. At Zoetis Spain SL, we strive to make a positive impact on society through innovative solutions in animal health.
Our mission is to improve the lives of animals and people by delivering high-quality products, services, and expertise. We aim to be a leader in the industry by fostering a culture of innovation, collaboration, and excellence.
Job Title: Site Reliability Engineer
Role Overview:
A Site Reliability Engineer at Zoetis Spain SL works on improving all aspects of our platform and has an impact across the whole organization. They are a blend of systems engineers and software developers who solve scalability issues with software and implement the best production engineering and security practices.
Responsibilities:
* Evolving our infrastructure platform by building self-service components used by all engineering teams and millions of users worldwide.
* Collaborating with Product and Infrastructure teams to architect and develop world-class infrastructure components.
* Designing and implementing tooling to enhance the availability, scalability, observability, and latency of our services used by internal customers.
* Promoting reliability awareness, helping teams adopt reliability principles, and reviewing observability implementations or architectures.
* Defining SLIs, SLOs, and SLAs as part of the service lifecycle.
* Sharing responsibility for on-call duties related to platform services.
* Resolving issues in our highly available platform and automating solutions to prevent future incidents.
* Participating in recruiting to grow our engineering team.
Candidate Profile:
* Strong understanding of Unix, networking (OSI model), containers, monitoring, logging, and CAP theorem (bonus).
* Proficiency in at least one programming language, with the ability to learn others.
* Automates processes to reduce toil.
* Effective and asynchronous communicator.
* Concerned with the impact on the company, team, and personal growth.
* Values diversity, humility, and a bit of humor.
* Prefers iterative actions over waiting for perfection.
* Prioritizes simplicity over complexity (KISS principle).
* Skilled at identifying and resolving bottlenecks.
* Comfortable communicating in English within an international team.
* Enjoys influencing and educating other teams on best practices and simplifying setups.
Potential Projects:
* Improving Kubernetes setup (AWS EKS).
* Enhancing network policies, service mesh, and related infrastructure.
* Developing our monitoring platform.
* Expanding distributed tracing capabilities with OTLP + Tempo.
* Scaling Loki logging platform.
* Maintaining our repository and CI/CD pipelines.
Why Join Us:
We're a company full of motivated, happy people. Here are some reasons why it's great to be part of our high-performance team:
* Excellent salary conditions.
* Flexible work environment and hours.
* Regular team events and staff benefits like free rides.
* Personal development programs and career path opportunities.
* Access to well-being and skill development resources.
* Flexible compensation plan including tickets for meals, transport, healthcare, and childcare.
* All necessary equipment provided—just bring your talent.
* We are committed to diversity and inclusion, proud to be an equal opportunity workplace.