We protect important data other tools can’t see, from threats they can’t detect, across technologies they can’t control.
Senior Director, SRE – Cloud Infrastructure
Location
United States
Posted
28 days ago
Salary
$250K - $300K / year
Job Description
Job Requirements
- Led SRE and Infrastructure organizations at high-growth SaaS, platform, or security companies
- Strong technical leader with deep experience in cloud-native systems and a strong SRE mindset
- Strong background in Kubernetes, cloud platforms (GCP and/or AWS), and infrastructure as code (Terraform or equivalent)
- Designed or operated large-scale distributed systems, real-time data pipelines, or high-throughput platforms
- Experience owning COGS, cloud spend, and efficiency metrics, communicating tradeoffs to executives
- Comfortable operating at multiple levels: strategic planning, architectural reviews, and deep technical problem solving
- Use data and metrics to drive reliability, performance, cost optimization, and team productivity
- Proven track record of scaling teams and systems while maintaining high reliability and velocity
- Empathetic leader fostering inclusion, ownership, accountability, and psychological safety
- Thrives in fast-moving environments, comfortable navigating ambiguity and change
Benefits
- Offers Equity
- Offers Bonus
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps Engineer designing and maintaining cloud infrastructure and automation frameworks
The Staff Site Reliability Engineer will design and manage AWS infrastructure, optimize Kubernetes operations, automate workflows, and troubleshoot systems for improved reliability and performance.
Software Engineer maintaining CI/CD environment for software systems at Boeing
Lead Site Reliability Engineer
IntellumWe help large brands and fast-moving companies increase revenue and decrease support costs through education.
The Lead Software Engineer will lead the SRE team, focusing on reliability, performance optimization, security, and mentoring developers, while improving overall platform resilience.