ONE
Helping people save and grow their money.
Site Reliability Engineer
Location: United States
Posted: 170 days ago
Salary: $140K - $180K / year
5 yrs exp, English, AWS, Cloud, Distributed Systems, Grafana, Java, JavaScript, Kubernetes, Node.js, Prometheus, Python, Terraform, TypeScript, Go
Job Description
• Ensure stability, scalability, and security of systems powering OnePay's financial products for millions of customers
• Design, build, and maintain scalable infrastructure and tooling to improve reliability, performance, and availability across the platform
• Contribute to the evolution of the observability stack, platform libraries, cloud architecture, and CI/CD pipelines
• Develop automation and monitoring systems to detect, prevent, and remediate incidents before they impact customers
• Partner closely with product and platform engineering teams to embed reliability best practices in design, development, and deployment
• Lead root cause analysis and postmortems, driving long-term improvements in resiliency and fault tolerance
Job Requirements
- 5+ years of experience as a Software Engineer focused on building and running reliable, large-scale, distributed systems in production
- 5+ years of operational experience in observability tooling and libraries (metrics, logging, tracing); experience using Datadog or similar tools (Prometheus, Grafana)
- Proficiency in at least one programming language (Python, Go, Java, or Node.js preferred) for automation and tooling
- Proficiency in incident management, on-call rotations, and writing postmortem reports
- Excellent collaboration skills with the ability to influence and educate product engineering teams on reliability and observability best practices
- Hands-on experience with cloud platforms (AWS preferred), container orchestration (Kubernetes), and IaC tools (Terraform, Pulumi)
- Drive and proactivity; builder and executor mindset
- Familiarity with functional programming concepts and fp-ts/TypeScript is a plus
- Authorization to work in the United States (application asks about work authorization and sponsorship)
Benefits
- Competitive base salary
- Stock options
- Health benefits from Day 1
- 401(k) plan with company match
- Remote-friendly (US)
- Flexible time off (FTO)
- Opportunities for growth
- Inclusive, mission-driven culture