Underdog Fantasy

Underdog Fantasy is one of the fastest-growing fantasy sports companies on the market.

Senior Site Reliability Engineer – Infrastructure

DevOps EngineerDevOps EngineerFull TimeRemoteTeam 201-500H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

58 days ago

Salary

$160K - $240K / year

Bachelor DegreeEnglishAWSKotlinKubernetesPostgresPythonRubySwiftType ScriptGo

Job Description

• Own and maintain the incident response process, including defining procedures, tools, and best practices • Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems • Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs • Develop and implement disaster recovery plans, including regular testing and regulatory compliance • Collaborate with teams on architecture decisions to ensure high availability and scalability • Manage launch and event planning for high-traffic occasions, focusing on infrastructure preparation and capacity management (a.k.a. Launch Readiness) • Act as an internal expert and consultant for monitoring tools like Datadog and Pagerduty and infrastructure like AWS and Kubernetes • Emphasis on automation and tooling to scale our workload • Contribute across codebases in Ruby, Python, Go, TypeScript, Swift, and Kotlin as needed to support the initiatives described above.

Job Requirements

  • A strong written and verbal communicator
  • Collaborative by nature
  • Someone who enjoys using research, data, and experiments to make decisions; you believe “Hope is not a strategy.”
  • You enjoy working directly with customers (generally engineers or other people inside the company)
  • You think long-term about what is best for the business and its customers
  • You are excited to take ownership
  • You are very comfortable around an IDE, working with multiple languages, multiple web application frameworks, AWS services, Kubernetes, PostgreSQL
  • You can work independently to learn new languages/technologies as needed
  • You enjoy deploying changes to production quickly, multiple times a week if necessary

Benefits

  • Unlimited PTO (we're extremely flexible with the exception of the first few weeks before & into the NFL season)
  • 16 weeks of fully paid parental leave
  • Home office stipend
  • A connected virtual first culture with a highly engaged distributed workforce
  • 5% 401k match, FSA, company paid health, dental, vision plan options for employees and dependents

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior Site Reliability Engineer

Hashgraph

Hashgraph, formerly Swirlds Labs, is a software company home to some of the brightest minds in web3.

DevOps Engineer58 days ago
Full TimeRemoteTeam 51-200Since 2022H1B No Sponsor

Senior Site Reliability Engineer building decentralized systems at Hashgraph

AWSAzureDistributed SystemsGoogle Cloud PlatformKubernetesSolidity
United States
DevOps Engineer58 days ago
ContractRemoteTeam 51-200Since 1993H1B No Sponsor

Lead Site Reliability Engineer overseeing cloud infrastructure reliability

CloudGrafanaKubernetesPrometheusPythonTerraformGo
United States

Site Reliability Engineer L5 – Live SRE

Netflix

Where you come to do the best work of your life. Follow @WeAreNetflix on Twitter, IG, Facebook, & Youtube for more

DevOps Engineer58 days ago
Full TimeRemoteTeam 10,001+Since 1997H1B Sponsor

Site Reliability Engineer supporting live streaming events at Netflix

CloudDNSKafkaLinuxMicroservicesPythonRustSparkSQLTCP/IPUnixGo
United States
Full TimeRemoteTeam 10,001+Since 1986H1B No Sponsor

Site Reliability Engineer ensuring performance and reliability of internal services at SS&C

AWSAzureCloudGoogle Cloud PlatformGrafanaKubernetesPrometheusPythonGo
Colorado + 3 moreAll locations: Colorado, New York, Massachusetts, Missouri
$175K - $185K / year