RxBenefits, Inc.
Advocacy. Expertise. Service.
Director – DevOps & Cloud Infrastructure
Location
United States
Posted
1 day ago
Salary
Not specified
Postgraduate Degree, 10 yrs exp, English, AWS, Azure, Cloud, Docker, Grafana, Jenkins, Kubernetes, Python, Terraform, Go
Job Description
• Own uptime, availability, scalability, and performance of all production systems.
• Define and manage SLOs, SLAs, error budgets, and incident response practices.
• Lead post-incident reviews and drive systemic reliability improvements.
• Implement observability standards (logging, metrics, tracing).
• Own cloud infrastructure strategy (AWS, Azure, hybrid).
• Lead infrastructure-as-code (Terraform, CloudFormation, ARM, etc.).
• Ensure disaster recovery, backup, and business continuity plans are tested and compliant.
• Monitor and optimize cloud spend through cost governance and FinOps practices.
• Own CI/CD pipelines, deployment automation, and release strategies.
• Enable safe, frequent releases (blue/green, canary, feature flags).
• Standardize DevOps tooling and platform capabilities across teams.
• Partner with Engineering to remove friction and increase delivery velocity.
• Plan and manage the rollout of dashboards, availability management, and reporting.
• Work with Product Engineering teams to define non-functional requirements (NFRs), including instrumentation and logging.
• Embed security into DevOps practices (DevSecOps).
• Partner with Security, Legal, and Compliance on audits and certifications (SOC 2, HIPAA, HITRUST, PCI, etc.).
• Ensure sound secrets management, access controls, and timely vulnerability remediation.
• Build and lead DevOps, SRE, and Cloud Engineering teams.
• Define the DevOps operating model (centralized, embedded, hybrid).
Job Requirements
- 10+ years of hands-on experience in DevOps, SRE, cloud engineering, and infrastructure.
- 5+ years as a Director in a leadership/people management role (leading managers and/or large teams).
- Deep expertise in modern tools and practices:
  - Cloud platforms (AWS, Azure).
  - CI/CD (GitHub Actions, GitLab CI, Jenkins, ArgoCD).
  - Containers & orchestration (Kubernetes, Docker, Helm).
  - Infrastructure as Code (Terraform, Pulumi, Crossplane).
  - Monitoring/Observability (Datadog, Sumo Logic, Grafana, ELK, New Relic).
  - Scripting/automation (Python, Go, Bash).
- Strong understanding of Agile/Scrum/SAFe methodologies.
- Proven track record of building high-performance teams and driving cultural change.
- Excellent communication, strategic thinking, and cross-functional collaboration skills.
- Experience with large-scale, high-availability environments.
Benefits
- Health insurance
- 401(k) matching
- Flexible work hours
- Paid time off
- Remote work options