Temporal Technologies logo
Temporal Technologies

Build invincible apps.

Staff Software Engineer - Reliability

Software EngineerSoftware EngineerFull TimeRemoteLeadTeam 51-200Since 2018H1B SponsorCompany SiteLinkedIn

Location

United States + 1 moreAll locations: United States, Canada

Posted

1 day ago

Salary

$212K - $286K / year

Seniority

Lead

Distributed SystemsReliability EngineeringChaos TestingLoad TestingObservabilityGoKubernetesPrometheusGrafanaIncident ResponseSREPerformance Tuning

Job Description

About Us

Temporal is an open source programming model that can simplify code, make applications more reliable, and help developers focus on the important things like delivering features faster. We are on a mission to be the reliable foundation of every developer’s toolbox, and are building the team that will make that happen.
 
Our values guide us —they are present in how we show up, make decisions, and work together to make an impact. We’re curious, driven, collaborative, genuine and humble.
 
Temporal is growing and we are looking for those who share our values, challenge 'standard' thinking, and want to influence our future. If you have a passion for improving the developer experience, building world-class open-source software and communities, and want to be a part of our amazing team, we'd love to hear from you!

Summary

Join our Reliability team as a Staff Software Engineer, where you’ll own the reliability of operating Temporal Cloud end to end. You will help define and measure reliability expectations, harden systems through gamedays and chaos testing, and build the tooling and practices that make reliability visible and continuously improving across services and operational processes. We’re looking for someone who thrives in ambiguity, enjoys turning reliability goals into concrete engineering work, and can lead cross-team efforts that make systems more resilient at scale.

What You’ll Do

  • Own reliability outcomes for operating Temporal Cloud end to end, partnering across engineering, infrastructure, and product to drive measurable improvements.
  • Define, implement, and evolve reliability targets and associated practices, including alerting thresholds, operational readiness criteria, and escalation paths.
  • Plan and run gamedays to validate incident response, operational procedures, and cross-team coordination under realistic failure scenarios.
  • Build and scale a chaos testing program that exercises failure modes safely and drives remediation work that reduces real risk.
  • Define and maintain a reliability scorecard across services and key operational processes, and use it to prioritize reliability investments.
  • Lead load testing and performance testing efforts, including test design, tooling, and analysis of bottlenecks and capacity constraints.
  • Improve observability standards (metrics, logs, traces, dashboards) so reliability signals are consistent, actionable, and easy to audit.
  • Drive post-incident learning and corrective actions, ensuring fixes are durable and reduce recurrence risk over time.
  • Make system-level tradeoffs across reliability, performance, cost, and velocity, and document decisions clearly for long-term maintainability.
  • Mentor other engineers and raise the bar on reliability engineering practices across teams.

What You’ll Bring

  • Strong computer science fundamentals, especially in distributed systems, concurrency, and performance.
  • Demonstrated ability to design and build complex systems that operate reliably under high load and partial failure.
  • Experience driving reliability improvements across multiple services, not just within a single codebase.
  • Hands-on experience with at least one of: gamedays, chaos testing, load testing, or building reliability scorecards.
  • Strong judgment in ambiguous situations, including the ability to prioritize reliability work based on risk and impact.
  • Excellent communication skills, including the ability to align multiple stakeholders on reliability goals, plans, and tradeoffs.
  • A collaborative mindset and a track record of mentoring and leveling up engineering practices.

Nice to Haves

  • Experience operating multi-tenant systems and designing protections against noisy-neighbor behaviors.
  • Deep expertise in observability (metrics design, tracing strategy, dashboard standards) and alert hygiene.
  • Experience building internal platforms or tooling that enables other teams to meet reliability standards.
  • Familiarity with workflow orchestration systems or durable execution platforms.
  • Open source contributions, especially in infrastructure or distributed systems.

Compensation

  • Base Salary Range - $212,000 - $286,200, depending on qualifications and location
  • Additionally, this role is eligible to participate in Temporal's equity plan.
Compensation ranges reflect salary and commission compensation (when applicable) across several geographic markets. Employment offers carefully consider multiple factors, including prior experience, knowledge, expertise, skillset, market location, and job level assessed during the interview process.
 
Employee benefits and perks below are for full-time employees, part-time or temporary positions are excluded. 
 
U.S. Benefits 
  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more!
International Benefits

Paid Time Off (PTO) and Benefits outside the United States vary by country, and are issued in partnership with Remote.com.  Additionally, Temporal offers perks to all international employees for learning & career development, a lifestyle spending account, in-home office setup (in addition to company-issued hardware), professional memberships, work-from-home meals, and access to the Calm app for mental wellness.

Travel

Temporal is a globally distributed, collaborative team that values opportunities for in-person connection. Occasional travel may be required for company events, team offsites, and other meaningful moments that bring us together.

Additional Perks 
  • $3,600 / Year Work from Home Meals 
  • $1,800 / Year Professional Enrichment (Career Development & Professional Memberships)
  • $1,200 / Year Lifestyle Spending Account
  • $1,000 / Year In-Home Office Setup (In addition to Temporal issued equipment - laptop, monitor, keyboard, mouse, trackpad, and extension power cable at no cost to you)
  • $74 / Month Reimbursement for Internet
  • Calm App Subscription for Mental Health & Wellness
Temporal Technologies is an Equal Opportunity Employer. Temporal Technologies does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need. We embrace and celebrate differences and diversity.
 
Temporal is committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment, its services, programs, and activities. If you need to request a reasonable accommodation, please let your Recruiter know so we can assist.
 
We are not working with external recruitment agencies, thanks.

Related Job Pages

More Software Engineer Jobs

Human Interest logo

Senior Software Engineer, Builder Tools

Human Interest

Affordable, full-service 401(k) plans for SMBs.

Software Engineer1 day ago
Full TimeRemoteTeam 501-1,000Since 2015H1B Sponsor

The role involves designing and implementing fault-tolerant cloud infrastructures in AWS, along with developing CI/CD pipelines for multi-account environments. Responsibilities also include building efficiency tools for engineers and contributing to evolving engineering standards.

AWSTerraformCloudFormationCI/CDCodeBuildCodeDeployCodePipelineGitLab CI/CDGitHub ActionsInfrastructure as CodeCloud InfrastructurePythonJavaScriptTypeScriptNode.jsSQLPostgreSQLDockerKubernetesMicroservicesREST APIGraphQLGitAgileScrumTest AutomationUnit TestingIntegration TestingMonitoringLoggingAlertingSecurityComplianceDevOpsSystem DesignSoftware Architecture
United States
$185K - $220K / year
Cotiviti logo

Senior Software Engineer

Cotiviti

Enabling a high-quality and viable healthcare system

Software Engineer1 day ago
Full TimeRemoteTeam 5,001-10,000H1B Sponsor

The Senior Software Engineer is primarily responsible for the performance and stability of critical applications, implementing functionality changes and defect fixes to support new client onboarding and production issue resolution. Responsibilities include acting as a subject matter expert, analyzing and resolving software issues, documenting specifications, and mentoring other developers.

JavaOracleSQLLinuxGitJenkinsAgileSDLC
United States
$105K - $145K / year
Cerby logo

Staff Software Engineer - Tech Lead

Cerby

Identity automation—no APIs, SAML, or SCIM required. From SSO to lifecycle management, Cerby secures disconnected apps.

Software Engineer1 day ago
Full TimeRemoteTeam 51-200Since 2020H1B No Sponsor

This role is responsible for setting the technical direction, ensuring delivery excellence, and elevating engineering standards within a product squad, partnering with leadership to deliver high-impact initiatives. Key duties include owning the technical roadmap, designing scalable architectures, leading technical discovery, and championing the adoption of AI-augmented development practices.

PythonGoSaaSOIDCSAMLSCIMIAMRBACABACAPI securityJWTCORSXSSCSRFAWSSQSS3MySQLPostgreSQLRedisDynamoDBCI/CDOpen TelemetryDatadog
United States + 1 moreAll locations: United States, Canada
Coinbase logo

Software Engineer, DevSec

Coinbase

We're building an open financial system for the world.

Software Engineer1 day ago
Full TimeRemoteTeam 1,001-5,000Since 2012H1B Sponsor

Build and maintain production services (primarily in Golang) for developer security and supply-chain tooling. Deliver end-to-end features, write tests, author design docs, participate in on-call rotations, and collaborate with engineers and product to automate vulnerability detection and remediation.

Ai-Assisted DevelopmentArtifact ScanningArtifactoryAWSDockerGCPGogRPCKubernetesMongoDBPostgreSQLSbomSigstoreSlsaXray
United States
$152.4K - $179.3K / year