Remove complexity, add velocity.
Senior Site Reliability Engineer
Location: United States
Posted: 2 days ago
Salary: Not specified
Seniority: Senior
Job Description
Job Requirements
- 5+ years of SRE, platform engineering, or production operations experience in a SaaS environment
- Deep hands-on Kubernetes expertise; you understand the scheduler, networking, storage, and autoscaling at a level where you can debug anything
- Strong AWS fundamentals across compute (EC2, EKS), networking (VPC, NLB, Route 53), storage (S3, RDS), and IAM
- Experience defining and operating against SLOs in production; you've written error budgets, not just read about them
- Proficiency with observability tooling (Prometheus, Grafana, OpenTelemetry, Datadog, or equivalent)
- Solid scripting and automation skills in Go, Python, Bash, or similar; you automate what you touch
- Strong written communication: clear runbooks, sharp incident reports, thoughtful post-mortems
- Live within US time zones (Pacific through Eastern); this includes Canada and other regions in those time zones
Benefits
- Health insurance, dental, and vision coverage
- Equity participation in a well-funded, growing company
- Home office stipend and equipment budget
- Flexible time off and a culture that respects it
- Work directly with the engineers who built Argo CD and Kargo; you'll learn a lot here
The Senior SRE will own platform reliability, define and drive improvements against SLIs, SLOs, and SLAs for the company's SaaS platform, and design and maintain observability systems across AWS infrastructure. The role also involves participating in on-call rotations, acting as incident commander for high-severity events, and improving alerting fidelity.