faster builds that keep engineers in flow
Site Reliability Engineer
Location: California
Posted: 15 days ago
Salary: Not specified
Job Description
Job Requirements
- 4+ years in SRE, DevOps, or Production Engineering roles
- Experience managing Kubernetes in production
- Strong background in cloud infrastructure (GCP or AWS) and IaC (Terraform preferred)
- Solid knowledge of networking, security, and distributed systems
- Track record of improving system availability and developer productivity
- A knack for debugging complex, cross-system issues under pressure
Benefits
- Comprehensive medical, dental, and vision benefits
- 401(k)/pension
- Parental leave
- Generous vacation
More DevOps Engineer Jobs
Senior DevOps Engineer
Meazure Learning
Offering full-service assessment development, delivery, and proctoring solutions across the world
Senior DevOps Engineer optimizing cloud infrastructure and deployment processes at Meazure Learning
Senior Site Reliability Engineer - Remote EST
HiBob
HiBob helps modern, mid-size businesses transform the way they manage people, giving HR and managers all they need to connect, engage, develop, and retain top talent. Since 2015, we’ve achieved consecutive triple-digit year-over-year growth, all backed by our amazing team of Bobbers from across the globe, making us the choice HRIS of over 4000 midsize and multinational companies. Our HR platform is intuitive, data-driven, and built for the way people work today: globally, remotely, and collaboratively. Fast-growing companies across the globe such as Huel, What3words, Fiverr, and VaynerMedia rely upon Bob to help them create the best work experiences for their people.
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
DevOps Security Engineer
Knox Systems, Inc.
Knox is FedRAMP as a Service. SaaS apps achieve FedRAMP in 90 days, saving 90% in year 1 on Knox.
Hands-on DevSecOps role securing cloud-native environments at Knox.
Database Reliability Engineer – Core Team
ClickHouse
ClickHouse is an open-source, column-oriented OLAP database management system.
Database Reliability Engineer ensuring reliability and performance of ClickHouse core.