Our mission is to enable effortless credit based on true risk.
Principal Software Engineer – Site Reliability
Location
United States
Posted
29 days ago
Salary
$195.3K - $270.4K / year
Job Description
Job Requirements
- 10+ years combined experience across Software Engineering and Site Reliability Engineering, with a balanced background in both disciplines
- Proven track record as an SRE thought leader and evangelist, driving adoption of reliability best practices across organizations
- Strong communication and mentoring skills to influence engineers across disciplines
- Proficiency in Python, Go, and JavaScript/TypeScript
- Proficiency with Infrastructure as Code (Terraform, CDK, CloudFormation, etc.)
- Experience building internal tooling from scratch in agile development environments
- Expertise with observability, distributed tracing, RUM, LCP, and performance monitoring tools (e.g., Datadog, Prometheus)
- Experience with on-call and incident management, including large-scale or ML-related incidents
- Strong background in automation and building self-healing systems
- Hands-on experience with LLM/GenAI to improve SRE efficiency and processes
- Program management skills, including the ability to propose innovative solutions, influence leadership, improve processes, and drive cross-functional projects to completion
Benefits
- Competitive compensation, including base pay, bonus opportunities, and annual equity grants that vest quarterly
- Generous 401(k) plan with Upstart matching $2 for every $1 contributed, up to $15,000 per year
- Employee Stock Purchase Plan (ESPP) with discounted stock purchase options for eligible employees
- Affordable medical, dental, and vision coverage, with multiple plan options - Upstart covers 90% to 100% of the cost depending on the plans you choose
- Health Savings Account contributions from Upstart for eligible plans
- Income protection benefits, including company-paid Basic Life, AD&D, and Short- and Long-Term Disability coverage, with options to purchase supplemental coverage
- Paid time off, sick and safe time, and company holidays
- Paid family and parental leave to support caregiving and major life moments
- Family-centered benefits through Carrot and Cleo, supporting fertility, parenthood, and caregiving
- Employee Assistance Program (EAP) offering mental health support and life-centered resources
- Financial wellness resources, including access to financial planning tools and a financial concierge service
- Annual wellness allowance to support your physical and emotional well-being and personal development, based on what matters most to you
- Annual productivity allowance to invest in relevant tools and resources you need to do your best work, no matter where you work from
- Connection and community through team events and onsites, all-company updates, and employee resource groups (ERGs)
- Onsite perks, including catered lunches and fully stocked micro-kitchens when working from one of our four offices, located in the Bay Area, Austin, Columbus, and New York City (opening Summer 2026!).
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevSecOps Delivery Manager
GuidePoint SecurityWe help organizations make smarter cybersecurity decisions that minimize risk.
DevSecOps Delivery Manager managing high-quality secure solutions at GuidePoint Security
Senior Engineer – Build and DevOps
NVIDIANVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
DevOps Engineer supporting NVIDIA’s RAPIDS project
Site Reliability Engineer
MinimalBuilding secure, reproducible environments that work the same everywhere. minimal.dev
As our first SRE you will have a hands-on role: Managing our cloud services on GCP and CloudFlare Ensuring we meet our SLOs Managing monitoring and logging systems Maintaining a strong security posture Managing our CI/CD systems Managing incident response and on-call Automating a...
Site Reliability Engineer
Ooma, Inc.Top rated business phone solution and personalized service to help your business thrive.
Site Reliability Engineer ensuring system stability and efficiency for Ooma's services