Jobs / Mad***

Sr SRE/Dev Ops Engineer

Mad*** · United States · Remote

Visa sponsorship details are locked. Unlock company name and apply link with .

United States170,000-175,000 USD/yearlyRemote

Remuneration

170,000-175,000 USD/yearly

Location

United States · Remote

Eastern Daylight Time (UTC-4)

Visa sponsorship

Sponsors visa

Job summary

ROLE DESCRIPTION Mad*** is seeking a hands-on Senior SRE / AI Platform DevOps Engineer to build, operate, and scale the infrastructure behind our AI-powered services, agents, and orchestration platforms. This role sits at the intersection of site reliability engineering, cloud infrastructure, DevOps automation, observability, and AI operations.

Benefits

Glad you asked…Comprehensive Healthcare100% Company Paid Short and Long Term Disability401k Participation and Equity GrantsContinuing Education ContributionsHSA Employer Contributions and FSA OptionsParental Leave ProgramCommuterResponsible Paid Time Off ProgramComplimentary Madison Reed Products + Discounts on Hair Color Bar ServicesCompany sponsored eventsBut wait, there's more…

Qualifications

The ideal candidate is infrastructure-first and operationally minded, with deep experience in cloud environments, CI/CD, production monitoring, incident response, and automation.
of the successful candidate.
This role must be based in the United States.
Lead reliability reviews, production readiness assessments, and infrastructure risk assessments.
Drive improvements in system resilience, scalability, security, performance, and cost optimization.
Champion SRE best practices across engineering teams.
REQUIRED EXPERIENCE
5+ years of experience in DevOps, Site Reliability Engineering, Platform Engineering, Cloud Infrastructure, or related roles.
Strong hands-on experience with cloud infrastructure, preferably AWS.
Experience building and maintaining CI/CD pipelines and automated deployment workflows.
Proficiency with infrastructure-as-code

Responsibilities

You will own the systems and practices that ensure our AI-enabled services are reliable, secure, scalable, cost-effective, and production-ready.
You will help operationalize AI systems by building reliable deployment workflows, telemetry pipelines, monitoring frameworks, and governance processes for models, agents, and orchestration services.
INFRASTRUCTURE PROVISIONING & AUTOMATION
Design, provision, and manage cloud infrastructure for AI-powered services, agents, orchestration systems, and supporting platforms.
Automate environment setup and configuration across development, staging, and production environments.
Build reusable infrastructure-as-code patterns that improve consistency, security, scalability, and maintainability.
Partner with engineering teams to ensure production systems are resilient, observable, performant, and cost-efficient.
Participate in on-call support, incident response, root cause analysis, and continuous reliability improvement.
CI/CD & DEPLOYMENT ENGINEERING
Build, maintain, and optimize CI/CD pipelines for services, agents, orchestration layers, and supporting infrastructure.
Implement automated testing, validation, security, and reliability gates within deployment workflows.
Design safe deployment patterns including blue/green deployments, canary releases, feature flags, and automated rollback mechanisms.

Degrees

Associate

Work schedule

On-callRotationShift

Industry

AutomotiveEducationEnergyHealthcareMedia

Company size

Smb