Airbnb logo
Airbnb

Airbnb is a community based on connection and belonging.

Staff Software Engineer, GraphQL

Software EngineerSoftware EngineerFull TimeRemoteLeadTeam 5,001-10,000Since 2007H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

3 days ago

Salary

$204K - $255K / year

Seniority

Lead

GraphQLJavaKotlinJVMObservabilityDistributed TracingSREPerformance TuningJVM ProfilingConcurrencyMicroservicesHigh AvailabilityIncident ResponseAIOpsPlatform Engineering

Job Description

Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way.

The Community You Will Join:

  • Viaduct is a unified data access layer connecting most of Airbnb’s online data. More than 70% of Airbnb’s API traffic flows through the Viaduct platform. 
  • Residing at the center of Airbnb’s tech stack between the user-facing products and backend infrastructure, Viaduct provides a global schema & query system through a GraphQL interface. 
  • Our team mission: “Empower app developers at Airbnb by delivering a seamless and efficient developer experience. We strive to maximize productivity and spark creativity through simplified APIs, improved performance, and the cultivation of tenant team autonomy.”
  • The Viaduct team is a very tenured and experienced team, setting the best practices and next-gen architecture for Airbnb.
  • As part of the Application Platform pillar of Infrastructure, we work closely with partner infra teams (Build Infra, Service Platform, CI/CD, Reliability, Observability, Developer Platform to name a few) as well as product engineers.
  • We treat our platform as a product and follow the principles of good Platform Engineering
  • Viaduct has been running in production for over six years and the team has gained a lot of experience in operating a GraphQL platform at scale. These insights informed the major rewrite - called Viaduct Modern - which we’re in the process of launching and continue to evolve. You will join the effort to create the best developer experience of hundreds of engineers at Airbnb using our revolutionary GraphQL platform. 

Viaduct has been released as an Open Source project. Your contributions to the Viaduct platform will not only serve Airbnb-internal developers, but also any member of the Open Source community who chooses to adopt Viaduct.

The Difference You Will Make:

  • Drive platform reliability and operational excellence by designing and implementing deployment pipelines, SLO frameworks, observability tooling, performance improvements, and AI-enabled incident response automation that help maintain Viaduct's 99.99% uptime target across Airbnb's critical API traffic.
  • Contribute to runtime resiliency initiatives including resource attribution, performance regression testing, and proactive monitoring to ensure the multi-tenant GraphQL platform scales efficiently and degrades gracefully under load.
  • Architect and deliver AI-powered operational tooling that accelerates incident triage, reduces mean-time-to-mitigation, and empowers both the Viaduct team and tenant engineers with self-service debugging capabilities.
  • Shape the future of Viaduct Modern by contributing to the next-generation architecture, improving developer experience for hundreds of engineers, and establishing patterns that will be shared with the open-source community.

A Typical Day: 

  • Embrace an AI-first engineering approach, using LLM-powered agents to generate and iterate on code while you focus on problem-solving, system design, and quality oversight.
  • Investigate and resolve complex production issues by analyzing distributed traces, resource utilization patterns, and system metrics to identify root causes and implement durable fixes.
  • Design and implement observability features including span instrumentation, SLO dashboards, and fine-grained attribution for blocking time, memory, and CPU across tenant workloads.
  • Develop and iterate on tooling for deployment triage, service health monitoring, and incident response automation using LLM capabilities.
  • Lead technical design discussions and RFCs for initiatives like performance regression testing pipelines, emergency deployment workflows, and runtime resiliency improvements.
  • Partner with tenant teams to debug performance issues, provide guidance on GraphQL best practices, and enable self-service capabilities for common operational tasks.
  • Contribute to open-source Viaduct by ensuring platform improvements are generalizable and well-documented for the broader engineering community.

Your Expertise:

  • 9+ years of software engineering experience, with significant depth in backend systems, distributed architectures, and platform engineering.
  • Deep expertise in observability and monitoring, including experience designing SLO frameworks, distributed tracing systems, and metrics pipelines at scale.
  • Proven track record in reliability engineering, with hands-on experience in incident response, root cause analysis, and building systems that maintain high availability (99.99%+).
  • Strong experience with performance tuning and resource management in JVM-based systems, including profiling, garbage collection optimization, and understanding of concurrency models (blocking I/O, thread pools, coroutines in Kotlin).
  • Experience operating critical, high-traffic systems with a focus on deployment safety, automated rollbacks, and progressive delivery strategies.
  • Familiarity with GraphQL or similar API gateway/data access layer technologies
  • Experience building developer tooling and platforms, with a product mindset focused on developer experience and self-service capabilities.
  • Strong leadership and communication skills with the ability to partner effectively across infrastructure and product engineering teams.

Your Location:

This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. Click here for the up-to-date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list. If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from.

Our Commitment To Inclusion & Belonging:

Airbnb is committed to working with the broadest talent pool possible. We believe diverse ideas foster innovation and engagement, and allow us to attract creatively-led people, and to develop the best products, services and solutions. All qualified individuals are encouraged to apply.

We strive to also provide a disability inclusive application and interview process. If you are a candidate with a disability and require reasonable accommodation in order to submit an application, please contact us at: reasonableaccommodations@airbnb.com. Please include your full name, the role you’re applying for and the accommodation necessary to assist you with the recruiting process. 

We ask that you only reach out to us if you are a candidate whose disability prevents you from being able to complete our online application.

How We'll Take Care of You:

Our job titles may span more than one career level. The actual base pay is dependent upon many factors, such as: training, transferable skills, work experience, business needs and market demands. The base pay range is subject to change and may be modified in the future. This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.  

Pay Range
$204,000$255,000 USD

Job Requirements

  • 9+ years of software engineering experience, with significant depth in backend systems, distributed architectures, and platform engineering.
  • Deep expertise in observability and monitoring, including experience designing SLO frameworks, distributed tracing systems, and metrics pipelines at scale.
  • Proven track record in reliability engineering, with hands-on experience in incident response and root cause analysis.
  • Strong experience with performance tuning and resource management in JVM-based systems.
  • Familiarity with GraphQL or similar API gateway/data access layer technologies.
  • Experience building developer tooling and platforms, focused on developer experience and self-service capabilities.
  • Strong leadership and communication skills with the ability to partner effectively across teams.
  • Experience operating critical, high-traffic systems with a focus on deployment safety and automated rollbacks.
  • Ability to investigate and resolve complex production issues by analyzing distributed traces and system metrics.
  • Experience designing and implementing observability features and tooling for deployment triage.
  • Ability to lead technical design discussions and RFCs for various initiatives.
  • Contribute to open-source Viaduct by ensuring platform improvements are generalizable and well-documented.

Benefits

  • Base pay range: $204,000 — $255,000 USD.
  • Eligibility for bonus, equity, benefits, and Employee Travel Credits.

Related Job Pages

More Software Engineer Jobs

SSM Health logo

Coder II, Professional

SSM Health

Through our exceptional health care services, we reveal the healing presence of God.

Software Engineer3 days ago
RemoteTeam 10,001+H1B Sponsor

Manages assigned charge review and coding-related claim edit work queues to ensure timely and accurate charge capture. Accurately deciphers charge error reasons and plans follow-up steps.

Missouri + 3 moreAll locations: Missouri, Illinois, Oklahoma, Wisconsin
Software Engineer3 days ago
Full TimeRemoteTeam 51-200

The engineer will be responsible for designing, coding, testing, and debugging software applications, ensuring features are scalable and meet user needs while maintaining high code quality standards. They will also translate business requirements into technical specifications and actively participate in the entire software development lifecycle.

ElixirPhoenixPostgreSQLAWSReactTypeScriptNode.jsDockerGitSDLCMicroservicesAPI DevelopmentCI/CD
United States
$107K - $142K / year
Full TimeRemoteTeam 51-200Since 2018H1B Sponsor

The role involves leading the end-to-end Software Development Lifecycle, including requirements gathering, design, implementation, and operationalization of observability solutions. Responsibilities also include owning technical excellence, designing and building multi-component distributed systems, and investigating complex issues methodically.

GoKubernetesSQLAWSGCPdistributed systemsobservabilityClickHousePrometheusGrafanaLokiThanosdata pipelinesKubernetes monitoringinfrastructure-as-code
United States
$212K - $286K / year
Sound Physicians logo

Software Development Intern

Sound Physicians

We deliver uncompromising care and lasting partnerships across acute and post-acute settings.

Software Engineer3 days ago
InternshipRemoteTeam 5,001-10,000Since 2001

Software Development Intern assisting with Intelligent Automation initiatives

United States
$17 - $22 / hour