SentinelOne
Secure your enterprise with the autonomous cybersecurity platform. Endpoint. Cloud. Identity. XDR. Now.
Engineering Manager, Site Reliability (SRE)
DevOps EngineerDevOps EngineerFull TimeRemoteTeam 1,001-5,000Since 2013H1B SponsorCompany SiteLinkedIn
Location
United States
Posted
110 days ago
Salary
$160K - $200K / year
Bachelor Degree8 yrs expEnglishAWSCloudDistributed SystemsGoogle Cloud PlatformGrafanaKubernetesPrometheusTerraform
Job Description
• Grow and lead a team of SRE professionals, including setting performance goals and measuring deliverables against key metrics, while evolving those metrics as S1 grows and needs develop
• Invest in data-driven deep triage on recurring issues, collaborating with other engineering teams to identify and address issues related to reliability, performance, and capacity
• Develop, improve, and implement processes for the full incident lifecycle, including incident management, post-incident analysis, and learning from incidents. Lead incident response efforts, including coordinating with other teams to investigate and resolve customer-impacting incidents
• Design support model for SRE regarding service maturity and service ownership, including monitoring and alerting improvements, and SLI / SLO design and implementation
• Analyze production metrics and signals to identify areas for improvement and take proactive steps to mitigate issues
• Develop and implement best practices and standards for Site Reliability Engineering, from day-to-day operations to hiring and planning
• Communicate effectively with cross-functional teams to ensure alignment on objectives and priorities. Deliver outcomes, not just stories and tasks.
Job Requirements
- 8+ years of related engineering experience, with at least 2 years in a management role
- Demonstrated experience leading technical and operational teams at various stages of maturity
- Excellent analytical and problem-solving skills
- Familiarity with modern software development methodologies, tools, and techniques, including CI/CD
- Experience working with cloud-native applications and large-scale distributed systems, including a working knowledge of technologies such as Kubernetes and Terraform/IaC, and cloud providers such as AWS or GCP
- Experience with various monitoring and alerting techniques and tools, including frameworks and concepts such as SLOs, OTel and Golden Signals as well as tooling such as Prometheus and Grafana
- Extensive experience with incident response and management at various layers of the stack across different business needs and applications, including both hands-on experience leading incidents/post-incident analysis and experience driving broader incident management initiatives
- Ability to thrive in a fast-paced, dynamic environment
Benefits
- Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA
- Unlimited PTO
- Industry-leading gender-neutral parental leave
- Paid Company Holidays
- Paid Sick Time
- Employee stock purchase program
- Disability and life insurance
- Employee assistance program
- Gym membership reimbursement
- Cell phone reimbursement
- Numerous company-sponsored events, including regular happy hours and team-building events
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer112 days ago
ContractRemoteTeam 11-50Since 2006H1B No Sponsor
Storage DevOps Engineer focusing on automation workflows in cloud environments.
AnsibleAWSAzureCloudGoogle Cloud PlatformPythonTerraform
California
Senior Cloud DevOps Engineer
KBR, Inc.We deliver science, technology and engineering solutions to governments and companies around the world.
DevOps Engineer112 days ago
Full TimeRemoteTeam 10,001+Since 1901H1B No Sponsor
Senior Cloud DevOps Engineer leading AWS migration projects at KBR
AnsibleAWSChefCloudJenkinsPuppetPythonTerraform
South Dakota
DevOps Engineer112 days ago
Full TimeRemoteTeam 11-50Since 2022H1B Sponsor
Forward Deployment Engineer translating business needs into technical architectures
AngularAWSAzureBigQueryCloudDockerEC2GraphQLJavaJavaScriptKubernetesMicroservicesNext.jsNode.jsPythonReactSpringSpring BootSpringBootTerraform
United States
DevOps Engineer112 days ago
Full TimeRemoteTeam 201-500Since 2013H1B No Sponsor
Senior DevOps Engineer managing infrastructure automation for Synack's platform
AnsibleAWSAzureCloudGoogle Cloud PlatformKafkaKubernetesLinuxMongoDBPostgresPrometheusRedisRubySaltStackSplunkTerraformUnixGo