Gifthealth logo
Gifthealth

Seamlessly unifying access, fulfillment, and support for faster, simpler digital pharmacy care.

Lead Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 501-1,000Since 2020Company SiteLinkedIn

Location

United States

Posted

1 day ago

Salary

$123K - $154K / year

Seniority

Senior

Bachelor Degree5 yrs expExperience acceptedEnglishAWSAzureCloudDockerGoogle Cloud PlatformPrometheusRubyRuby on RailsTerraform

Job Description

• Designs, builds, and maintains reliable, scalable software systems supporting Ruby on Rails applications • Embeds reliability, performance, and operational best practices into application code and development workflows • Owns DevOps practices including CI/CD reliability, deployment strategies, and release safety • Leads incident response, debugging, and root cause analysis across application and platform layers • Implements and evolves observability (logging, metrics, tracing) within application and service code • Partners with engineering teams on architecture, capacity planning, and technical standards

Job Requirements

  • Bachelor’s degree in computer science, engineering, or related field OR equivalent professional experience in software engineering, SRE, or DevOps roles (Required)
  • Cloud platform certifications (AWS, GCP, Azure) (Preferred)
  • SRE or DevOps-focused certifications (Preferred)
  • 5+ years of experience in software engineering, SRE, or DevOps roles (Required)
  • Hands-on experience building and operating Ruby on Rails applications in production (Required)
  • Experience in owning production incidents and application-level reliability (Required)
  • Experience in high-growth or scaling engineering organizations (Preferred)
  • Experience working in regulated or customer-impact–sensitive environments (Preferred)
  • Knowledge of Ruby on Rails application architecture and production operations; software reliability engineering principles (SLOs, SLIs, error budgets); and modern DevOps and CI/CD practices (Required)
  • Knowledge of security and compliance considerations in production systems (Preferred)
  • Strong software engineering skills (Ruby and/or comparable backend languages) (Required)
  • Debugging and performance optimization of production applications skills (Required)
  • CI/CD pipelines, deployment automation, and release tooling skills (Required)
  • Monitoring and observability tooling (Datadog, New Relic, Prometheus, etc.) skills (Required)
  • Infrastructure as Code (Terraform or similar) skills (Preferred)
  • Containerization and orchestration (Docker) skills (Preferred)
  • Ability to write production-quality code that improves system reliability (Required)
  • Ability to collaborate with product and engineering teams to influence design decisions (Required)
  • Ability to troubleshoot complex, cross-system failures (Required)
  • Ability to mentor engineers on operational ownership and reliability practices (Preferred)
  • Ability to balance speed of delivery with long-term system health (Preferred)

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Intapp logo

Senior DevOps Engineer

Intapp

Intelligence Applied

DevOps Engineer1 day ago
Full TimeRemoteTeam 1,001-5,000H1B Sponsor

Senior DevOps Engineer for cloud services at Intapp

AzureDistributed SystemsDockerGrafanaKubernetesPrometheusTerraform.NET
North Carolina
Akuity logo

Senior Site Reliability Engineer

Akuity

Remove complexity, add velocity.

DevOps Engineer1 day ago
Full TimeRemoteTeam 11-50Since 2021H1B No Sponsor

The Senior SRE will be responsible for owning platform reliability, defining and driving improvements against SLIs/SLOs/SLAs for the Company SaaS platform, and designing/maintaining observability systems across AWS infrastructure. This role also involves participating in on-call rotations, acting as incident commander for high-severity events, and driving improvements to alerting fidelity.

KubernetesAWSPrometheusGrafanaDockerTerraformBashPythonGoOpenTelemetryEKSVPCS3RDSIAMRoute53NLBEC2GitOpsObservabilitySLASLOSLI
United States
CAKE.com logo

Site Reliability Engineer, SRE

CAKE.com

Deliciously simple way to run a business and empower your team 💫

DevOps Engineer1 day ago
Full TimeRemoteTeam 201-500Since 2009H1B No Sponsor

SRE managing scalable infrastructure for CAKE.com

AnsibleAWSDockerJenkinsLinuxPackerPuppetTerraformUnix
United States
Paxos logo

Staff Site Reliability Engineer, Platform Engineering

Paxos

Paxos is a regulated blockchain infrastructure company building transparent and transformative financial solutions.

DevOps Engineer1 day ago
Full TimeRemoteTeam 201-500Since 2012

Staff Site Reliability Engineer leading cloud infrastructure at Paxos

AWSCloudEC2KubernetesPostgreSQLPythonTerraformGo
United States
$210K - $240.8K / year