Jobs / Mad***
Sr SRE/Dev Ops Engineer
Mad*** · United States · Remote
Visa sponsorship details are locked. Unlock company name and apply link with .
United States170,000-175,000 USD/yearlyRemote
Remuneration
170,000-175,000 USD/yearly
Location
United States · Remote
Eastern Daylight Time (UTC-4)
Visa sponsorship
Sponsors visa
Job summary
ROLE DESCRIPTION Mad*** is seeking a hands-on Senior SRE / AI Platform DevOps Engineer to build, operate, and scale the infrastructure behind our AI-powered services, agents, and orchestration platforms. This role sits at the intersection of site reliability engineering, cloud infrastructure, DevOps automation, observability, and AI operations.
Benefits
Glad you asked…Comprehensive Healthcare100% Company Paid Short and Long Term Disability401k Participation and Equity GrantsContinuing Education ContributionsHSA Employer Contributions and FSA OptionsParental Leave ProgramCommuterResponsible Paid Time Off ProgramComplimentary Madison Reed Products + Discounts on Hair Color Bar ServicesCompany sponsored eventsBut wait, there's more…
Qualifications
- The ideal candidate is infrastructure-first and operationally minded, with deep experience in cloud environments, CI/CD, production monitoring, incident response, and automation.
- of the successful candidate.
- This role must be based in the United States.
- Lead reliability reviews, production readiness assessments, and infrastructure risk assessments.
- Drive improvements in system resilience, scalability, security, performance, and cost optimization.
- Champion SRE best practices across engineering teams.
- REQUIRED EXPERIENCE
- 5+ years of experience in DevOps, Site Reliability Engineering, Platform Engineering, Cloud Infrastructure, or related roles.
- Strong hands-on experience with cloud infrastructure, preferably AWS.
- Experience building and maintaining CI/CD pipelines and automated deployment workflows.
- Proficiency with infrastructure-as-code
Responsibilities
- You will own the systems and practices that ensure our AI-enabled services are reliable, secure, scalable, cost-effective, and production-ready.
- You will help operationalize AI systems by building reliable deployment workflows, telemetry pipelines, monitoring frameworks, and governance processes for models, agents, and orchestration services.
- INFRASTRUCTURE PROVISIONING & AUTOMATION
- Design, provision, and manage cloud infrastructure for AI-powered services, agents, orchestration systems, and supporting platforms.
- Automate environment setup and configuration across development, staging, and production environments.
- Build reusable infrastructure-as-code patterns that improve consistency, security, scalability, and maintainability.
- Partner with engineering teams to ensure production systems are resilient, observable, performant, and cost-efficient.
- Participate in on-call support, incident response, root cause analysis, and continuous reliability improvement.
- CI/CD & DEPLOYMENT ENGINEERING
- Build, maintain, and optimize CI/CD pipelines for services, agents, orchestration layers, and supporting infrastructure.
- Implement automated testing, validation, security, and reliability gates within deployment workflows.
- Design safe deployment patterns including blue/green deployments, canary releases, feature flags, and automated rollback mechanisms.
Degrees
Associate
Work schedule
On-callRotationShift
Industry
AutomotiveEducationEnergyHealthcareMedia
Company size
Smb