Site Reliability Engineer - Infrastructure

Infrastructure EngineerInfrastructure EngineerFull TimeRemoteTeam 11-50

Location

United States + 7 more

Posted

7 days ago

Salary

Not specified

No structured requirement data.

Job Description

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

This role involves designing and implementing architectural blueprints for our global automation platform.

Design and implement the architectural blueprints that allow our global automation platform to scale while maintaining high availability.
Define the SLIs, SLOs, and error budgets that guide our engineering teams' balance between rapid feature velocity and system stability.
Build and maintain observability pipelines using metrics, logs, and traces to provide engineers with immediate, actionable clarity on service behavior in production.
Participate in the resolution of production incidents and follow the blameless postmortem process to transform system failures into permanent technical improvements.
Cultivate an engineering environment focused on continuous learning from outages to proactively harden our platform against future regressions.
Develop and automate our CI/CD pipelines to ensure code changes are validated and deployed safely using strategies such as canary or blue/green releases.
Introduce and scale chaos engineering experiments to identify and fix infrastructure weak points before they can impact our customers.
Collaborate with developers during early design phases to ensure all new services meet our strict standards for scalability, security, and reliability.
Mentor senior engineers across the organization and represent SRE principles in technical leadership forums to ensure long-term platform health.
Participate in an on-call rotation to respond to incidents and maintain the 24/7 availability of the make.com platform.

Qualifications

6+ years of experience in Software Engineering or SRE roles, with a proven track record of technical leadership.
A thorough understanding of how to apply SLI and SLO principles to drive meaningful reliability outcomes.
A development-first mindset where you approach infrastructure challenges through the lens of a software engineer.
Significant experience in mentoring and leveling up other senior engineers within a high-growth environment.
Deep proficiency in managing and operating Linux/Unix-based infrastructure at scale.
Extensive practical knowledge of cloud providers, with a strong preference for AWS.
Expert-level experience with container orchestration, specifically running production workloads on Kubernetes.
Advanced skills in Infrastructure as Code (IaC) using tools like Terraform to maintain version-controlled environments.
Direct experience building and optimizing CI/CD pipelines and executing modern deployment strategies like canary or blue/green.
Excellent communication skills in English to collaborate effectively with our international teams.

Requirements

Proficiency in back-end technologies: Node.js, TypeScript, PostgreSQL, RabbitMQ, Redis, Elasticsearch.
Experience with front-end technologies: Angular, TypeScript, Redux, Web Components, Canvas, Nx.
Knowledge of infrastructure technologies: Amazon AWS, Docker, Kubernetes.
Familiarity with CI/CD tools: GitHub, CircleCI, ArgoCD.
Experience with monitoring tools: DataDog.
Familiarity with AI tools: Claude Code, Cursor, Gemini, GitHub Copilot.

Benefits

RSUs grant in a rapidly growing company raising its value every day.
Annual bonus.
Multinational team with 42 nationalities creating the future of automation.
Learning & Development plan (online language, professional courses, conference tickets and other trainings) & 2 learning days per year.
Notebook/Macbook and 34’’ curved monitor.
25 days of vacation, 4 sick days, Company day off 31.12.
10 care days to care for your loved ones.
Extra parental vacation (3-6 months).
RSUs grant for a newborn child.
Life insurance.
Benefit Plus Cafeteria (incl. MultiSport Card).
Remote working allowance.
Snack bar, coffee, tea, fruit and vegetable, and sweets all day - every day - available for everyone.
Wednesday lunch, and Friday break, with company-provided food and drinks, with music and lively discussion.
Flexible working hours + home office.
Company therapy pets in Prague's office (dog-friendly office).
Company 3D printer.
Team buildings, parties, and company events multiple times a year.

Job Requirements

6+ years of experience in Software Engineering or SRE roles, with a proven track record of technical leadership.
A thorough understanding of how to apply SLI and SLO principles to drive meaningful reliability outcomes.
A development-first mindset where you approach infrastructure challenges through the lens of a software engineer.
Significant experience in mentoring and leveling up other senior engineers within a high-growth environment.
Deep proficiency in managing and operating Linux/Unix-based infrastructure at scale.
Extensive practical knowledge of cloud providers, with a strong preference for AWS.
Expert-level experience with container orchestration, specifically running production workloads on Kubernetes.
Advanced skills in Infrastructure as Code (IaC) using tools like Terraform to maintain version-controlled environments.
Direct experience building and optimizing CI/CD pipelines and executing modern deployment strategies like canary or blue/green.
Excellent communication skills in English to collaborate effectively with our international teams.
Proficiency in back-end technologies: Node.js, TypeScript, PostgreSQL, RabbitMQ, Redis, Elasticsearch.
Experience with front-end technologies: Angular, TypeScript, Redux, Web Components, Canvas, Nx.
Knowledge of infrastructure technologies: Amazon AWS, Docker, Kubernetes.
Familiarity with CI/CD tools: GitHub, CircleCI, ArgoCD.
Experience with monitoring tools: DataDog.
Familiarity with AI tools: Claude Code, Cursor, Gemini, GitHub Copilot.

Benefits

RSUs grant in a rapidly growing company raising its value every day.
Annual bonus.
Multinational team with 42 nationalities creating the future of automation.
Learning & Development plan (online language, professional courses, conference tickets and other trainings) & 2 learning days per year.
Notebook/Macbook and 34’’ curved monitor.
25 days of vacation, 4 sick days, Company day off 31.12.
10 care days to care for your loved ones.
Extra parental vacation (3-6 months).
RSUs grant for a newborn child.
Life insurance.
Benefit Plus Cafeteria (incl. MultiSport Card).
Remote working allowance.
Snack bar, coffee, tea, fruit and vegetable, and sweets all day - every day - available for everyone.
Wednesday lunch, and Friday break, with company-provided food and drinks, with music and lively discussion.
Flexible working hours + home office.
Company therapy pets in Prague's office (dog-friendly office).
Company 3D printer.
Team buildings, parties, and company events multiple times a year.

Related Categories

Infrastructure Engineer

Related Job Pages

Remote Full-time Jobs (US)More US Remote Jobs

More Infrastructure Engineer Jobs

Infrastructure Engineer

Quavo Fraud & Disputes

Quavo is a leading provider of automated dispute management SaaS solutions for issuing financial institutions.

Infrastructure Engineer7 days ago

Full TimeRemoteTeam 51-200Since 2015H1B No Sponsor

Company Site LinkedIn

The Infrastructure Engineer will support internal processes and compliance by completing user requests, maintaining infrastructure, and executing company initiatives related to cloud environments. Key duties involve maintaining the Linux Operating System, managing cloud infrastructure, troubleshooting issues, and participating in an on-call rotation for maintenance.

View details: Infrastructure Engineer

United States

$65K - $85K / year

Apply

Cloud Infrastructure Engineer

Bloom

Infrastructure Engineer7 days ago

Full TimeRemoteTeam 2-10

This role involves building, deploying, optimizing, and securing cloud-based and containerized solutions. Ensure availability, scalability, and security of cloud applications and related services Focus on automation, performance, efficiency, security, and compliance Work with tea...

View details: Cloud Infrastructure Engineer

United States

Apply

Rail Infrastructure Manager

AECOM

We are the world’s trusted infrastructure consulting firm.

Infrastructure Engineer7 days ago

Full TimeRemoteTeam 10,001+Since 1990H1B Sponsor

Company Site LinkedIn

This role is critical for ensuring the safe, on-schedule, and on-budget delivery of rail infrastructure components for the High-Speed Rail Program through strategic oversight and technical leadership. Responsibilities include leading delivery of components like trackwork and civil structures, monitoring construction, and coordinating interfaces with rail systems disciplines.

View details: Rail Infrastructure Manager

United States

$150K - $277K / year

Apply

Rail Infrastructure Manager

AECOM

We are the world’s trusted infrastructure consulting firm.

Infrastructure Engineer7 days ago

Full TimeRemoteTeam 10,001+Since 1990H1B Sponsor

Company Site LinkedIn

This role is critical for ensuring the successful delivery of rail infrastructure components for the High-Speed Rail Program by providing strategic oversight, technical leadership, and coordination. Responsibilities include leading and managing the delivery of components like trackwork and civil structures, overseeing construction compliance, monitoring progress, and resolving technical and logistical issues.

View details: Rail Infrastructure Manager

United States

$150K - $277K / year

Apply

Site Reliability Engineer - Infrastructure

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Infrastructure Engineer Jobs

Infrastructure Engineer

Cloud Infrastructure Engineer

Rail Infrastructure Manager

Rail Infrastructure Manager