Jobs / Mad***

Sr SRE/Dev Ops Engineer

Mad*** · United States · Remote
Visa sponsorship details are locked. Unlock company name and apply link with .
United States170,000-175,000 USD/yearlyRemote
Remuneration
170,000-175,000 USD/yearly
Location
United States · Remote
Eastern Daylight Time (UTC-4)
Visa sponsorship
Sponsors visa

Job summary

ROLE DESCRIPTION Mad*** is seeking a hands-on Senior SRE / AI Platform DevOps Engineer to build, operate, and scale the infrastructure behind our AI-powered services, agents, and orchestration platforms. This role sits at the intersection of site reliability engineering, cloud infrastructure, DevOps automation, observability, and AI operations.

Benefits

Glad you asked…Comprehensive Healthcare100% Company Paid Short and Long Term Disability401k Participation and Equity GrantsContinuing Education ContributionsHSA Employer Contributions and FSA OptionsParental Leave ProgramCommuterResponsible Paid Time Off ProgramComplimentary Madison Reed Products + Discounts on Hair Color Bar ServicesCompany sponsored eventsBut wait, there's more…

Qualifications

  • The ideal candidate is infrastructure-first and operationally minded, with deep experience in cloud environments, CI/CD, production monitoring, incident response, and automation.
  • of the successful candidate.
  • This role must be based in the United States.
  • Lead reliability reviews, production readiness assessments, and infrastructure risk assessments.
  • Drive improvements in system resilience, scalability, security, performance, and cost optimization.
  • Champion SRE best practices across engineering teams.
  • REQUIRED EXPERIENCE
  • 5+ years of experience in DevOps, Site Reliability Engineering, Platform Engineering, Cloud Infrastructure, or related roles.
  • Strong hands-on experience with cloud infrastructure, preferably AWS.
  • Experience building and maintaining CI/CD pipelines and automated deployment workflows.
  • Proficiency with infrastructure-as-code

Responsibilities

  • You will own the systems and practices that ensure our AI-enabled services are reliable, secure, scalable, cost-effective, and production-ready.
  • You will help operationalize AI systems by building reliable deployment workflows, telemetry pipelines, monitoring frameworks, and governance processes for models, agents, and orchestration services.
  • INFRASTRUCTURE PROVISIONING & AUTOMATION
  • Design, provision, and manage cloud infrastructure for AI-powered services, agents, orchestration systems, and supporting platforms.
  • Automate environment setup and configuration across development, staging, and production environments.
  • Build reusable infrastructure-as-code patterns that improve consistency, security, scalability, and maintainability.
  • Partner with engineering teams to ensure production systems are resilient, observable, performant, and cost-efficient.
  • Participate in on-call support, incident response, root cause analysis, and continuous reliability improvement.
  • CI/CD & DEPLOYMENT ENGINEERING
  • Build, maintain, and optimize CI/CD pipelines for services, agents, orchestration layers, and supporting infrastructure.
  • Implement automated testing, validation, security, and reliability gates within deployment workflows.
  • Design safe deployment patterns including blue/green deployments, canary releases, feature flags, and automated rollback mechanisms.

Degrees

Associate

Work schedule

On-callRotationShift

Industry

AutomotiveEducationEnergyHealthcareMedia

Company size

Smb