We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Staff Software Engineer, Compute
Location
United States
Posted
3 days ago
Salary
$230,000 to $275,000
Job Description
Role Description
This role offers an exciting opportunity to design and build the compute foundations that power large-scale distributed systems used by modern AI and enterprise applications. As a Staff Software Engineer focused on cloud compute infrastructure, you will develop scalable platform primitives that enable reliable, elastic, and secure execution environments. Working at the intersection of control plane and data plane architecture, you will tackle complex challenges such as autoscaling, multi-tenancy, observability, and cross-cloud orchestration. The position requires deep technical expertise, strong system design skills, and the ability to influence platform architecture across engineering teams. You will help create infrastructure abstractions that simplify development while maintaining performance, reliability, and operational excellence. This is an ideal opportunity for engineers passionate about distributed systems, cloud platforms, and building developer-focused infrastructure at scale.
Responsibilities
- Design and build managed compute primitives that power scalable execution environments for cloud-based applications and distributed systems.
- Architect and implement autoscaling systems that dynamically optimize resource allocation while maintaining reliability, performance, and safety.
- Develop and operate services on critical execution paths where performance, stability, and correctness directly impact users.
- Define and evolve architecture boundaries between open-source server components and managed cloud platform capabilities.
- Build secure integrations with cloud providers, including handling IAM boundaries, credentials management, networking constraints, and operational safeguards.
- Ensure platform observability and operational excellence through monitoring, tracing, service-level objectives (SLOs), and reliability testing.
- Lead the full lifecycle of platform features, including API design, rollout strategies, backward compatibility, and long-term maintenance.
- Provide technical leadership through architecture discussions, design documentation, code reviews, and mentorship of engineers across teams.
- Collaborate cross-functionally with teams working on server infrastructure, SDKs, security, and control plane systems to deliver cohesive platform improvements.
Qualifications
- Extensive experience designing and building distributed systems or multi-tenant platform services in production environments.
- Strong understanding of core systems engineering principles including concurrency, performance optimization, reliability engineering, and failure-mode analysis.
- Proven track record of delivering infrastructure or platform capabilities used by other developers, including APIs, control planes, or data plane services.
- Experience owning production services with responsibility for reliability, monitoring, incident response, and continuous improvement of operational quality.
- Strong written and verbal communication skills with the ability to document architectural decisions and technical trade-offs clearly.
- Experience with cloud infrastructure platforms and scalable compute systems is highly desirable.
- Familiarity with identity and access management (IAM) models and secure cross-account execution environments is a plus.
- Experience building Kubernetes controllers, managing containerized workloads, or operating heterogeneous compute fleets is beneficial.
- Proficiency in systems programming languages such as Go is advantageous, though strong architectural judgment and system design expertise are most important.
Benefits
- Competitive salary ranging from $230,000 to $275,000 based on experience and qualifications.
- Eligibility to participate in a company equity program.
- Unlimited paid time off plus 12 company holidays and 2 floating holidays.
- Comprehensive health coverage with 100% premium coverage for medical, dental, and vision plans.
- Life insurance, disability insurance, and additional financial protection benefits.
- 401(k) retirement savings plan.
- Annual stipends including $3,600 for work-from-home meals, $1,800 for professional development, and $1,200 for lifestyle spending.
- Home office setup allowance and company-provided equipment.
- Monthly internet reimbursement and additional remote work support.
- Access to wellness resources including a mental health app subscription.
- Opportunities for global collaboration and occasional travel to team offsites and company events.