StarCompliance

We are Reputation Guardians, on a mission to make compliance simple and easy.

Site Reliability Engineer

DevOps Engineer | Full Time | Remote | Team 201-500 | H1B No Sponsor

Location

United States

Posted

37 days ago

Salary

Not specified

Bachelor Degree, 5 yrs exp, English, AWS, Azure, Cloud, Prometheus, Python, Terraform, Go

Job Description

  • Champion Reliability by Design: Collaborate with architects and engineers to build resilient, fault-tolerant systems.
  • Observability Overhaul: Lead the charge on full-stack observability.
  • Scaling Systems: Develop and implement auto-scaling strategies, load testing plans, and capacity forecasting.
  • Progressive Delivery: Help implement and automate deployment strategies.
  • Incident Response: Create and refine on-call processes and incident response playbooks.
  • Monitoring & Tooling: Own and evolve our monitoring infrastructure.
  • Developer Empowerment: Build reusable templates and dashboards to empower dev teams.
  • Cross-functional Collaboration: Work hand-in-hand with various teams for uptime and performance accountability.

Job Requirements

  • 5+ years in SRE, DevOps, or Production Engineering roles, ideally within a SaaS or cloud-native environment.
  • Deep experience with cloud platforms (preferably Azure or AWS), and Infrastructure-as-Code tools (e.g. Terraform).
  • Hands-on experience with Azure DevOps is strongly preferred.
  • Proficiency with observability tools such as New Relic, Datadog, Prometheus, or similar.
  • Strong understanding of software deployment strategies, CI/CD pipelines, and release engineering.
  • Ability to code in at least one modern scripting or systems language (e.g., Python, PowerShell, Go, Bash).
  • Experience operating multi-tenant environments with an emphasis on security, performance, and cost optimization.
  • Excellent communicator who thrives in cross-functional settings.

Benefits

  • Equal Opportunity Employer Statement
  • Pre-employment screening due to sensitive information access
  • Rigorous background investigation

More DevOps Engineer Jobs

Site Reliability Engineer

Ivanti

Ivanti finds, heals and protects every device, everywhere – automatically.

DevOps Engineer | 37 days ago
Full Time | Remote | Team 1,001-5,000 | Since 1985 | H1B Sponsor

Site Reliability Engineer managing cloud services for Ivanti

Ansible, Apache, AWS, Azure, Cloud, ElasticSearch, Java, Jenkins, Kafka, Linux, MongoDB, NGINX, Postgres, Python, Redis, Splunk, SQL, Go, .NET
United States

DevOps Engineer / Site Reliability Engineer

Leidos

Leidos is an innovation company rapidly addressing the world’s most vexing challenges in national security and health.

DevOps Engineer | 38 days ago
Full Time | Remote | Team 10,001+ | Since 1969 | H1B Sponsor

Site Reliability Engineer designing CI/CD pipelines at Leidos

Cloud
Virginia
$87.1K - $157.5K / year

Site Reliability Architect

HHAeXchange

Better Homecare, Better Health

DevOps Engineer | 38 days ago
Full Time | Remote | Team 501-1,000 | Since 2008 | H1B Sponsor

Site Reliability Architect leading technical strategy for enterprise SaaS platform

AWS, Cloud, DNS, Google Cloud Platform, Java, Kubernetes, Python, TCP/IP, Terraform, Go
United States
$170K - $185K / year

Sr. Site Reliability Engineer (SRE)

Moonlite AI

Moonlite is building a cloud-native experience on-prem. Our software provides the control and customization enterprises need for AI.
Build Faster with Moonlite: Instantly download and deploy NIMS from NVIDIA or build your own applications with Hugging Face. Customize and deploy AI agents in one click or integrate your own with ease.
Total Control Over Your AI: Obtain the highest level of security by design for your private environments. Moonlite provides total visibility into all your resources, applications, and users.
Find Value with Your Use Case: Allocate resources in real-time as needed in your environment. Use the models that best align with your use cases. When a new model is released, test it out and power your applications with it.

DevOps Engineer | 38 days ago
Full Time | Remote | Team 10 | Since 2024

Build and operate production-grade AI infrastructure using Kubernetes, ensuring high availability, reliability, and performance. Develop custom operators and implement automation for efficient operations and monitoring.

Ansible, Bash, ELK Stack, Enterprise Storage Systems, Grafana, High-Performance Networking, Kubernetes, Linux, NVIDIA GPU Technologies, Prometheus, Python, Terraform
Indiana, Illinois
$165K - $225K / year