Description
**About IriusRisk**
At IriusRisk, we're leading the charge in automated threat modeling and secure software design. We work with some of the world's largest banks and other major sectors to help them build secure software from the ground up. Our platform automates the threat modeling process, allowing developers to design and build secure software at scale.
**Purpose of the Role**
As a DevOps Engineer, your role is to bridge the gap between development and operations, ensuring smooth software delivery, efficient infrastructure management, and reliable system performance. You will be responsible for implementing and maintaining DevOps practices, tools, and automation processes to enhance the organization's software development lifecycle. Your goal is to enable faster and more reliable software releases, improve system scalability and stability, and foster collaboration across cross-functional teams.
- Make key decisions to develop solutions that help us achieve our goals quickly and efficiently.
- Deploy Large Language Models (LLMs) on platforms like Runpod, Vast.ai, AWS, Replicate, and Azure.
- Interpret metrics in Deep Learning Systems and optimize cost-effective solutions for production.
- Collaborate with various teams to meet their technical needs.
- Design, deploy, maintain, and improve our AWS cloud infrastructure.
- Assist the customer support team with high-level technical challenges.
**Duties & Responsibilities**
- Take part in DevOps team decisions on building new solutions that help the company reach its objectives quickly and efficiently.
- Create and deploy Large Language Models (LLMs) on platforms such as Runpod, Vast.ai, AWS, Replicate, and Azure.
- Interpret metrics in Deep Learning Systems, drawing on solid knowledge and experience.
- Create testing environments to compare candidate solutions and identify the one that best optimises the cost of inference in production.
- Work together with the other teams to help with their technical needs and requests.
- Design, deploy, maintain, and enhance our cloud infrastructure in AWS.
- Help the customer support team with highly technical challenges.
**What we offer**
100% remote working.
Excellent working environment.
Work in a highly qualified professional team with world-class references in the threat modeling sector.
Real career development opportunities within the company.
We focus on the 'Do it well' culture and are not afraid to invest in doing things right the first time.
Training and certifications related to your role.
Horizontal business culture.
92%+ employee retention rate.
**Requirements**
**Essential Skills**
- Experience working with ML, DL, and LLM in production environments.
- Deep understanding of the latest GPUs, CPUs, and hardware architectures, to help select the best possible solution for our use cases.
- Ability to understand the statistical results of tests and design the best solution based on those results.
- GNU/Linux: CentOS and/or Debian-based distros.
- Scripting and automation: Experience with Bash, Python, Go, or similar languages.
- Source Code Management: Git is a must.
- Containers and virtualization: Knowledge of Kubernetes.
- Experience with monitoring solutions (metrics, dashboards, logs, etc.).
- Knowledge of configuration management: Ansible, Chef, Puppet, or similar.
- Experience with CI/CD pipelines (Jenkins, GoCD, ArgoCD, GitHub Actions, or similar).
- Experience with Amazon Web Services: EC2, S3, RDS, IAM, VPC.
- Experience with Infrastructure as Code (Terraform, CloudFormation, Deployment Manager, or similar).
- Knowledge of how identity federation based on SAML/OAuth works (Azure AD, ADFS, PingID, Okta).
**Desired Experience**
- Bachelor's or master's degree in computer science, engineering, or a related field.
- Proven experience (3+ years) as a DevOps Engineer or in a similar role.
- Software engineering experience or background (2+ years).
**Person Requirements**
- Initiative to analyze and propose solutions to new problems and challenges.
- Fast and continuous learner. Comfortable adopting new paradigms and technologies.
- Excellent communication skills and strong team player.
- Used to working in an agile environment.
- Able to face new challenges.
- Able to be on call to provide emergency assistance to the customer support team, in the afternoons only (very low likelihood; expect 1 intervention per month). On-call availability and any extra time worked will be paid additionally.