Site Reliability Engineer

DevOps Engineer, Full Time, Remote, Mid Level, Team 501-1,000

Location

United States, Canada

Posted

1 day ago

Salary

Not specified

Seniority

Mid Level

Job Description

Role Description

In this role, you will join Planet's Direct Access Service Infrastructure team, directly contributing to our next-generation Constellation as a Service platform. This platform represents a major new offering to our customers that goes beyond traditional cloud-based platforms and supports on-premises deployments.

You will be responsible for building, deploying, and operating critical compute software that supports end-to-end imaging operations within customer on-premises and/or cloud environments. You will use your understanding of internal compute requirements and customers' environment-specific constraints to help design, implement, and support a robust system for reproducible deployments across operating environments, ensuring the reliability, scalability, and availability of our services. To do this, you will partner closely with cross-functional engineering teams to integrate software solutions and troubleshoot distributed systems.

This is a full-time, remote position based in the United States or Canada. If you are located near an office, you are expected to work from that office 3 days per week.

Impact You'll Own

  • Build and deploy computing services and infrastructure in customer environments for a next-generation satellite operations and image processing end-to-end platform
  • Operate in a high-impact, tight-knit team to architect novel systems for air-gapped deployments at scale
  • Clarify and surface requirements from ambiguous use cases defined by cross-functional stakeholders, including internal users and external customers
  • Own operations such as deployments, service orchestration, and documentation for cross-platform stakeholders
  • Scale architecture while ensuring availability of services
  • Improve reliability and scalability by resolving edge cases, studying failure modes, and writing tests
  • Participate in on-call rotations to ensure operational excellence

Qualifications

  • Bachelor’s degree in Computer Science or similar
  • 10+ years of experience building services that leverage cloud-native infrastructure and tooling
  • Experience deploying and maintaining bare-metal and cloud Kubernetes through tools such as Talos, RKE2, Proxmox, or k3s
  • Proficiency with Terraform, Ansible, Helm, Kustomize, and/or similar IaC / GitOps tooling
  • Experience successfully building, releasing, and supporting highly available, consistently performant services
  • Knowledge of hardware- and network-level implications of on-prem compute
  • Experience with platform optimization, particularly resource optimization, management, and cluster tuning in a constrained environment
  • Ability to observe and troubleshoot distributed systems with tools such as Alloy, Prometheus, Grafana, and OpenTelemetry
  • Advanced skills in Python, Bash, and other tooling as appropriate to build services and meet product goals
  • Excellent communication skills and the ability to collaborate with cross-functional engineering teams
  • Experience working with Jira for task management and progress tracking

What Makes You Stand Out

  • Experience with CUDA-based GPU programs
  • Security expertise in sensitive environments, including implementing zero-trust architectures, hardening Kubernetes clusters, conducting security audits, and deploying workloads in air-gapped environments

Benefits

  • Comprehensive Medical, Dental, and Vision plans
  • Health Savings Account (HSA) with a company contribution
  • Generous Paid Time Off in addition to holidays and company-wide days off
  • 16 Weeks of Paid Parental Leave
  • Wellness Program and Employee Assistance Program (EAP)
  • Home Office Reimbursement
  • Monthly Phone and Internet Reimbursement
  • Tuition Reimbursement and access to LinkedIn Learning
  • Equity
  • Commuter Benefits (if local to an office)
  • Volunteering Paid Time Off
