ONE

Helping people save and grow their money.

Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteTeam 201-500H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

170 days ago

Salary

$140K - $180K / year

5 yrs expEnglishAWSCloudDistributed SystemsGrafanaJavaJava ScriptKubernetesNode.jsPrometheusPythonTerraformType ScriptGo

Job Description

• Ensure stability, scalability, and security of systems powering OnePay's financial products for millions of customers • Design, build, and maintain scalable infrastructure and tooling to improve reliability, performance, and availability across the platform • Contribute to the evolution of observability stack, platform libraries, cloud architecture, and CI/CD pipelines • Develop automation and monitoring systems to detect, prevent, and remediate incidents before they impact customers • Partner closely with product and platform engineering teams to embed reliability best practices in design, development, and deployment • Lead root cause analysis and postmortems, driving long-term improvements in resiliency and fault tolerance

Job Requirements

  • 5+ years of experience as a Software Engineer focused on building and running reliable, large-scale, distributed systems in production
  • 5+ years of operational experience in observability tooling and libraries (metrics, logging, tracing); experience using Datadog or similar tools (Prometheus, Grafana)
  • Proficiency in at least one programming language (Python, Go, Java, or Node.js preferred) for automation and tooling
  • Proficiency in incident management, going on-call, and writing post-mortem reports
  • Excellent collaboration skills with the ability to influence and educate product engineering teams on reliability and observability best practices
  • Hands-on experience with cloud platforms (AWS preferred), container orchestration (Kubernetes), and IAC tools (Terraform, Pulumi)
  • Drive and proactivity; builder and executor mindset
  • Familiarity with functional programming concepts and fp-ts/TypeScript is a plus
  • Authorization to work in the United States (application asks about work authorization and sponsorship)

Benefits

  • Competitive base salary
  • Stock options
  • Health benefits from Day 1
  • 401(k) plan with company match
  • Remote-friendly (US)
  • Flexible time off (FTO)
  • Opportunities for growth
  • Inclusive, mission-driven culture

Related Categories

Related Job Pages

More DevOps Engineer Jobs

DevOps Engineer

Softgic

Digital and Cognitive Transformation.

DevOps Engineer171 days ago
Full TimeRemoteTeam 51-200Since 2011H1B No Sponsor

DevOps Engineer at SOFTGIC S.A.S. managing AWS EKS cloud infrastructure.

AWSDockerEC2KubernetesLinuxTerraform
United States

Principal DevOps Engineer

Veeva Systems

The Industry Cloud for Life Sciences

DevOps Engineer171 days ago
Full TimeRemoteTeam 1,001-5,000H1B Sponsor

Principal DevOps Engineer building AWS infrastructure for Veeva's life sciences cloud

AnsibleAWSCloudEC2ElasticSearchGrafanaGroovyJenkinsKubernetesPrometheusTerraform
North Carolina
$150K - $300K / year

Site Reliability Engineer

ContainIQ

Monitor Kubernetes metrics, logs, events, and traces within your cluster, instantly!

DevOps Engineer171 days ago
Full TimeRemoteTeam 1-10Since 2020H1B No Sponsor

Site Reliability Engineer for ContainIQ's cloud-native observability platform.

United States

Senior Customer Reliability Engineer

Replicated

We help software vendors ship their apps to complex customer environments using Kubernetes and Helm.

DevOps Engineer173 days ago
Full TimeRemoteTeam 51-200Since 2017H1B No Sponsor

Senior CRE supporting vendors deploying Kubernetes applications for Replicated's self-hosted distribution platform

KubernetesLinuxGo
United States
$149.5K - $192.5K / year