Software Development Manager, LLM Inference Model Enablement, Neuron SDK
Description
AWS Utility Computing (UC) provides product innovations, from foundational services such as Amazon Elastic Compute Cloud (EC2) to new offerings that differentiate AWS’s services and features. As the SDM for the LLM Inference Model Enablement team, you will lead a team of expert AI/ML engineers to onboard and optimize state-of-the-art open-source and customer LLMs, both dense and MoE, for inference on AWS Trainium and Inferentia accelerators using the Neuron SDK. You will also drive improvements in model enablement speed and experience, while advancing inference usability and quality through inference features, infrastructure optimization, tools, and automation.
The ideal candidate will have a strong background in LLM architectures, model performance optimization, and inference techniques, including experience delivering high-performance models using distributed inference libraries. You should be able to manage demanding, fast-changing priorities and have the technical depth to understand and deliver within a vertically integrated system stack consisting of the PyTorch inference library, Neuron compiler, runtime, and collectives.
Responsibilities
* Work with senior management and technical leaders to define model enablement and performance optimization work for the latest LLMs and deliver it to customers.
* Lead the team to improve the model onboarding experience and enhance inference usability and quality for Neuron-supported models.
* Manage changing priorities as new models and technologies emerge, adapting the team's work accordingly.
* Dive deep to help the team solve technical challenges.
Basic Qualifications
* 3+ years of engineering team management experience.
* 7+ years of working directly within engineering teams.
* 3+ years of designing or architecting systems (design patterns, reliability and scaling).
* Experience partnering with product or program management teams.
Preferred Qualifications
* Experience communicating with users, other technical teams, and senior leadership to collect requirements and to describe software product features, technical designs, and product strategy.
* Experience recruiting, hiring, mentoring/coaching, and managing teams of Software Engineers to improve their skills and effectiveness.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
The base salary range for this position is USD 212,700.00 to 287,700.00 annually. Your Amazon package will include sign‑on payments and restricted stock units (RSUs). Final compensation will be determined based on experience, qualifications, and location. Amazon offers comprehensive benefits including health insurance (medical, dental, vision, prescription), basic life & AD&D insurance, EAP, mental health support, medical advice line, flexible spending accounts, adoption and surrogacy reimbursement coverage, 401(k) matching, paid time off, and parental leave. Learn more about benefits at https://amazon.jobs/en/benefits.