Site Reliability Engineer (SRE) (San Cugat del Vallés)

Vallés
Roche
Published 21 hours ago
Description

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally.

The role requires on-call availability: responding promptly to urgent issues and emergencies outside regular working hours so that critical situations are addressed quickly and effectively.

Who We Are
At Roche, we are passionate about transforming patients’ lives, and we are bold in both decision and action - we believe that good business means a better world. That is why we come to work every single day. We commit ourselves to scientific rigor, unassailable ethics, and access to medical innovations for all.

Roche is strongly committed to a diverse and inclusive workplace. We strive to build teams that represent a range of backgrounds, perspectives, and skills. Embracing diversity enables us to create a great place to work and to innovate for patients.

Roche is building a global site reliability engineering (SRE) team that will support commercial and internal solutions. This team will have the mindset of building and creating engineering solutions to solve a broad spectrum of problems.

Step into the Future of IT Infrastructure with Roche!
As a seasoned Site Reliability Engineer (SRE) at Roche, you'll leverage your deep software engineering expertise to propel our IT infrastructure to new heights of robustness, scalability, and reliability.

Your Mission
Design and maintain cutting-edge tools, scripts, and frameworks that automate repetitive tasks, streamline software deployment, and manage expansive systems with unparalleled efficiency.

Partner closely with forward-thinking development teams to architect and implement high-performance solutions that elevate system efficiency, optimize resource utilization, and enhance deployment processes for superior uptime and user satisfaction.

Your Impact
Lead the charge in incident management and response. Detect system anomalies, troubleshoot swiftly, and conduct thorough root cause analyses to prevent recurring issues.

Champion continuous improvement by refining monitoring and alerting mechanisms, conducting insightful post-incident reviews, and embedding best practices in software lifecycle management.

Your Core Responsibilities

- Reliability Mastery: Proactively monitor and maintain system reliability using tools such as Datadog, VictorOps (now Splunk On-Call), ELK, Grafana, and Prometheus.
- Uptime Guardian: Ensure optimal uptime and performance by swiftly identifying issues and responding to alerts with precision.
- Technical Troubleshooter: Use a basic understanding of architecture and design to dive deep into complex technical issues and to troubleshoot, investigate, and resolve them.
- Service Excellence: Maintain and achieve the defined SLAs, SLIs, and SLOs, ensuring service levels are consistently met or exceeded.
- Automation Innovator: Develop and deploy automation scripts (using Python or other scripting languages) to streamline operations, enhance system efficiency, and reduce manual tasks.
- Cloud Steward: Manage and maintain robust infrastructure across AWS and Azure environments, implementing best practices to ensure peak performance and reliability of cloud-based applications.
- Cross-functional Collaborator: Work closely with engineering, DevOps, security, and operations teams to drive continuous improvement and foster a culture of reliability and inclusion.
- Incident Responder: Handle requests and incidents through Jira and ServiceNow, documenting troubleshooting procedures, solutions, and lessons learned to fuel ongoing improvements.
- Adaptable Scheduling: Work on call outside normal working hours and on weekends, as scheduled, to ensure continuous support.
- Team Builder: Actively contribute to the growth and development of the SRE team's capabilities, nurturing a stronger, more inclusive, and more resilient team.

Who You Are

- Educational Background: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent professional experience.
- Certifications: Relevant industry certifications (AWS/Azure) to showcase your expertise.
- Experience: Approximately 5 years of experience in site reliability engineering, IT operations, DevOps, or related fields, or equivalent skills and experience.
- Cloud Expertise: Solid experience with AWS and/or Azure, including setting up, monitoring, and maintaining cloud resources.
- Tool Proficiency: Proficiency with monitoring and logging tools such as Datadog, Splunk On-Call, the ELK stack, Grafana, and Prometheus.
- Hands-On Skills: Hands-on experience with Jira and ServiceNow for tracking incidents, requests, and documentation.
- Scripting Knowledge: Proficiency in Python or similar scripting languages for automation purposes.
- Incident Response: Understanding of core SRE principles, along with an in-depth understanding of incident prioritization, escalation processes, and service level management (SLA/SLO/SLI).
- Troubleshooting: Proficient troubleshooting skills, especially in cloud and distributed system environments.
- Communication and Teamwork: Excellent communication, teamwork, and documentation skills, with a proactive and self-motivated approach to improving system reliability and operational efficiency.
- Diversity and Inclusion: We value and encourage candidates from diverse backgrounds and experiences, believing that diverse perspectives drive innovation and success.
- Language Requirements: Excellent spoken and written English.

Why Join Us?
By joining our team, you will be part of a dynamic environment where your contributions will directly impact the resilience and reliability of our services. You will have opportunities for professional growth and the ability to collaborate with industry leaders.

Roche is an Equal Opportunity Employer.
