Join to apply for the Senior Site Reliability Engineer – Cloud & Automation role at Sage
5 days ago Be among the first 25 applicants
Join to apply for the Senior Site Reliability Engineer – Cloud & Automation role at Sage
Tech Talent Acquisition | Sustainability & DEI Focused | Trilingual:
French, English, Spanish
We’re looking for a Senior Site Reliability Engineer (SRE) to help scale and evolve our South African Accounting platform, ensuring that it remains performant, reliable, and easy to operate as it grows. You’ll work across infrastructure, automation, and observability to streamline our delivery processes and improve platform resilience.
Built with .NET, SQL Server, Redis and AWS, our stack is transitioning to a cloud-native future. In this role, you’ll automate deployments, optimize systems performance, and guide engineering teams in best practices around infrastructure-as-code, scalability, and monitoring.
This is a great opportunity to have meaningful technical impact and contribute to how we build and run cloud-native software at scale.
/ This is a hybrid role requiring 3 days per week in our Barcelona office.
First 90 Days
* 30 days – Learn our platform architecture, deployment pipelines, and reliability goals. Pair with engineering teams and shadow key delivery and release processes.
* 60 days – Propose improvements to automation, performance, and monitoring. Start contributing code to our infrastructure and CI/CD systems.
* 90 days – Take ownership of SRE initiatives across observability, scaling, and operational tooling. Mentor engineers in SRE practices and drive reliability improvements.
Meet the Team
You’ll work closely with other SREs, product engineers, DevOps and platform teams to design scalable deployment strategies, automate operations, and monitor real-world performance. The team operates in a highly collaborative, agile environment with shared ownership and autonomy.
How Success Will Be Measured
* Increased system availability, scalability, and performance
* Reduced manual effort through automation of infrastructure and delivery processes
* Effective collaboration with engineering teams on infrastructure and reliability topics
* Strong contributions to observability and incident response tooling
* Coaching of peers on DevOps and SRE best practices
Skills You’ll Gain
* Deep experience with cloud infrastructure automation and IaC (Infrastructure as Code)
* Advanced knowledge of observability and performance monitoring
* Broader exposure to CI/CD systems and deployment strategies in modern cloud apps
* Experience working across product, infrastructure, and reliability domains
* Mentoring and leadership opportunities within engineering teams
Snapshot of Your Day-to-Day
You’ll design and maintain deployment pipelines, build internal tools to improve developer velocity, and implement monitoring and alerting strategies. You'll also help respond to production incidents, guide teams on infrastructure design, and lead SRE improvements aligned with business priorities.
Must-Have Skills
* Strong development experience in at least one language (ideally C#)
* Scripting experience with Bash or Python
* Solid understanding of AWS cloud infrastructure (e.G., CloudFormation, CDK)
* Hands-on experience with CI/CD pipelines (GitHub Actions, TeamCity, etc.)
* Familiarity with monitoring and observability tools (New Relic, CloudWatch, Grafana, DataDog)
* Experience with SQL Server and understanding of relational/non-relational DBs
* Practical knowledge of containerization and tools like Docker, AWS Fargate
* Understanding of networking, distributed systems, and performance optimization
* Application of SRE principles such as SLAs, incident response, and reliability design patterns
Tech Stack You’ll Work With
* Languages:
C#, Bash, Python
* CI/CD:
GitHub Actions, TeamCity
* Monitoring & Observability:
New Relic, Grafana, DataDog
* Infrastructure:
Docker, SQL Server, Redis
At Sage, we offer you an environment where you can grow professionally without compromising your personal well-being. Our benefits package is designed to provide stability, flexibility, and balance:
* Flexible benefits:
exchange part of your salary and make tax savings on health insurance, meal and transport vouchers, childcare, and training.
* Well-being:
Free access to the Calm app (for up to 5 users), 24/7 counselling, and emotional support from our Healthy Mind Coaches. We also offer self-care and parenting resources through the Cleo app.
* Flexible working:
flexibility of working one hour in, one hour out, shortened workdays on Fridays and during the summer, and the opportunity to work from over 40 countries for up to 10 weeks per year through our Work Away program.
* Annual leave:
23 working days of vacation, 5 paid days per year for volunteering, and 5 additional paid days annually for personal or professional development.
* Extended leave:
7 extra days of maternity leave and 5 extra days of paternity leave, on top of the legal allowance, available after one year of service.
* Financial support:
Life and disability insurance, salary advances of up to 3.5 times your net monthly pay, a €300 net marriage bonus, and access to Sage's employee stock purchase plan at a discounted rate.
Health and Safety Responsibilities
* Fostering the safety culture, by leading with your own example.
* Following established safety procedures and reporting potential hazards promptly helps maintain a secure and efficient workplace.
* Participating in safety training sessions and adhering to preventive guidelines and procedures, the objective is minimizing risks and protecting yourself and the rest of your colleagues.
Seniority level
* Seniority level Mid-Senior level
Employment type
* Employment type Full-time
Job function
* Job function Engineering and Information Technology
* Industries Software Development and IT Services and IT Consulting
Referrals increase your chances of interviewing at Sage by 2x
Get notified about new Site Reliability Engineer jobs in Barcelona, Catalonia, Spain .
Site Reliability Engineer (SRE) / Devops (Hybrid/Remote) Senior Site Reliability Engineer (100% remote-friendly within Spain) Cloud Operations Engineer AWS - Barcelona Site Reliability Engineer (m/w/d) Online Bank-ing & Brokerage Germany (based in Barcelona) Site Reliability Engineer (x/f/m) - Tech Foundations
Barcelona, Catalonia, Spain 21 minutes ago
Development Operations (DevOps) Engineer Junior Software Engineer - Global Feature Store (Machine Learning Platform) System Development Engineering, Maintenance Automation Platform
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr