Canonical is a leading provider of open source software and operating systems to the general enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT.
The company is founder led, profitable and growing. HPC is an important and technically challenging compute domain, with specialised tooling and a very high expectation of precision, efficiency and automation. This role is for a software engineer to join our HPC team to deliver an outstanding HPC experience - from bare metal to public cloud - as part of the broader Ubuntu platform.
We are looking for a range of skills and experience, and will work on everything from the kernel to Debian packaging, but the heart of our effort will be Python software development for automation of key software in the HPC sphere. Our focus is on delivering a world class experience for the operation of the HPC cluster itself.
You will be part of the team that delivers charms that deploy and manage the HPC cluster and provide HPC software packages.
Responsibilities
- Write high quality, rigorously designed Python software
- Collaborate proactively with a globally distributed team
- Debug issues and produce high quality code to fix them
- Contribute to technical documentation that define best practices for authoring high quality operators
- Discuss ideas and collaborate on finding good solutions
Requirements
- Bachelor's or equivalent in Computer Science, STEM or similar degree
- Experience with writing modern, maintainable Python
- Love technology and working with brilliant people
- Curious, flexible, articulate, and accountable
- Value soft skills and are passionate, enterprising, thoughtful, and self-motivated
- Interest and experience with two or more of the following: Linux, Kubernetes, Public cloud, OpenStack, LXC/LXD, Python, Go, Debian packaging
- Exceptional academic track record from both high school and preferably university
- Willingness to travel up to 4 times a year for internal events
Additional Skills
- Experience operating HPC clusters in production
- Experiences with Infiniband, RDMA, CUDA, MPI, Slurm, Lustre, and/or Singularity
What we offer
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programme
- Opportunity to travel to new locations to meet colleagues from your team and others
- Priority Pass for travel and travel upgrades for long haul company events
#J-18808-Ljbffr