Jobs / Ama***
Software Development Manager (EC2 Nitro), EC2 Core Provisioning
Ama*** · Seattle, WA, United States
Visa sponsorship details are locked. Unlock company name and apply link with .
Seattle, WA, United States184,900-250,200 USD/yearlyRemote
Remuneration
184,900-250,200 USD/yearly
Location
Seattle, WA, United States
Visa sponsorship
Sponsors visa
Job summary
DESCRIPTION build, deploy, and manage applications with unparalleled flexibility and efficiency. Join our dynamic team, where we apply agentic and machine-learning solutions to one of the hardest problems in the fleet: returning broken servers to production when there is no deterministic signal of what is wrong.
Benefits
Learn more about ourAt https://amazon.jobs/en/USA, WA, Seattle - 184,900.00 - 250,200.00 USD annually
Qualifications
- We are looking for an experienced Software Development Manager (SDM) to lead this team.
- 3+ years of engineering team management experience
- 7+ years of working directly within engineering teams experience
- 3+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
- 8+ years of leading the definition and development of multi tier web services experience
- Experience partnering with product or program management teams
- PREFERRED
- Experience in communicating with users, other technical teams, and senior leadership to collect
- describe software product features, technical designs, and product strategy
- Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their
- and location.
- Amazon also offers comprehensive
Responsibilities
- Lead and inspire a team of engineers, providing guidance, mentorship, and support to foster their professional growth.
- Own the recovery decision engine that returns broken servers to sellable capacity, driving down unsellable rate and the time a host stays stuck.
- Build and operate this as a production software service — reliable, secure, and observable — running across millions of servers in every region, not a set of offline models or scripts.
- Debug complex, system-level, multi-component failures across hardware, firmware, BMC, and the provisioning and vetting stack, and turn that diagnosis into automated, repeatable recovery.
- Collaborate with hardware engineering, firmware, component owners, vetting, and provisioning teams to expand recovery coverage across platforms and drive failures upstream to their root cause so they stop recurring.
- Raise the bar on the safety of autonomous action on production-bound capacity, holding a high security and operational standard for a service that runs across all regions, including restricted environments.
- Champion best practices in software engineering, including code quality, testing, automation, and continuous integration and delivery (CI/CD).
Skills
Leadership
Degrees
Associate
Work schedule
Shift
Industry
AutomotiveEnergyInsurance
Company size
Smb