Site Reliability Engineer - Infrastructure
Location
United States + 7 moreAll locations: United States, United Kingdom, Canada, Germany, France, India, Chile, Czech Republic
Posted
7 days ago
Salary
Not specified
No structured requirement data.
Job Description
Role Description
This role involves designing and implementing architectural blueprints for our global automation platform.
- Design and implement the architectural blueprints that allow our global automation platform to scale while maintaining high availability.
- Define the SLIs, SLOs, and error budgets that guide our engineering teams' balance between rapid feature velocity and system stability.
- Build and maintain observability pipelines using metrics, logs, and traces to provide engineers with immediate, actionable clarity on service behavior in production.
- Participate in the resolution of production incidents and follow the blameless postmortem process to transform system failures into permanent technical improvements.
- Cultivate an engineering environment focused on continuous learning from outages to proactively harden our platform against future regressions.
- Develop and automate our CI/CD pipelines to ensure code changes are validated and deployed safely using strategies such as canary or blue/green releases.
- Introduce and scale chaos engineering experiments to identify and fix infrastructure weak points before they can impact our customers.
- Collaborate with developers during early design phases to ensure all new services meet our strict standards for scalability, security, and reliability.
- Mentor senior engineers across the organization and represent SRE principles in technical leadership forums to ensure long-term platform health.
- Participate in an on-call rotation to respond to incidents and maintain the 24/7 availability of the make.com platform.
Qualifications
- 6+ years of experience in Software Engineering or SRE roles, with a proven track record of technical leadership.
- A thorough understanding of how to apply SLI and SLO principles to drive meaningful reliability outcomes.
- A development-first mindset where you approach infrastructure challenges through the lens of a software engineer.
- Significant experience in mentoring and leveling up other senior engineers within a high-growth environment.
- Deep proficiency in managing and operating Linux/Unix-based infrastructure at scale.
- Extensive practical knowledge of cloud providers, with a strong preference for AWS.
- Expert-level experience with container orchestration, specifically running production workloads on Kubernetes.
- Advanced skills in Infrastructure as Code (IaC) using tools like Terraform to maintain version-controlled environments.
- Direct experience building and optimizing CI/CD pipelines and executing modern deployment strategies like canary or blue/green.
- Excellent communication skills in English to collaborate effectively with our international teams.
Requirements
- Proficiency in back-end technologies: Node.js, TypeScript, PostgreSQL, RabbitMQ, Redis, Elasticsearch.
- Experience with front-end technologies: Angular, TypeScript, Redux, Web Components, Canvas, Nx.
- Knowledge of infrastructure technologies: Amazon AWS, Docker, Kubernetes.
- Familiarity with CI/CD tools: GitHub, CircleCI, ArgoCD.
- Experience with monitoring tools: DataDog.
- Familiarity with AI tools: Claude Code, Cursor, Gemini, GitHub Copilot.
Benefits
- RSUs grant in a rapidly growing company raising its value every day.
- Annual bonus.
- Multinational team with 42 nationalities creating the future of automation.
- Learning & Development plan (online language, professional courses, conference tickets and other trainings) & 2 learning days per year.
- Notebook/Macbook and 34’’ curved monitor.
- 25 days of vacation, 4 sick days, Company day off 31.12.
- 10 care days to care for your loved ones.
- Extra parental vacation (3-6 months).
- RSUs grant for a newborn child.
- Life insurance.
- Benefit Plus Cafeteria (incl. MultiSport Card).
- Remote working allowance.
- Snack bar, coffee, tea, fruit and vegetable, and sweets all day - every day - available for everyone.
- Wednesday lunch, and Friday break, with company-provided food and drinks, with music and lively discussion.
- Flexible working hours + home office.
- Company therapy pets in Prague's office (dog-friendly office).
- Company 3D printer.
- Team buildings, parties, and company events multiple times a year.
Job Requirements
- 6+ years of experience in Software Engineering or SRE roles, with a proven track record of technical leadership.
- A thorough understanding of how to apply SLI and SLO principles to drive meaningful reliability outcomes.
- A development-first mindset where you approach infrastructure challenges through the lens of a software engineer.
- Significant experience in mentoring and leveling up other senior engineers within a high-growth environment.
- Deep proficiency in managing and operating Linux/Unix-based infrastructure at scale.
- Extensive practical knowledge of cloud providers, with a strong preference for AWS.
- Expert-level experience with container orchestration, specifically running production workloads on Kubernetes.
- Advanced skills in Infrastructure as Code (IaC) using tools like Terraform to maintain version-controlled environments.
- Direct experience building and optimizing CI/CD pipelines and executing modern deployment strategies like canary or blue/green.
- Excellent communication skills in English to collaborate effectively with our international teams.
- Proficiency in back-end technologies: Node.js, TypeScript, PostgreSQL, RabbitMQ, Redis, Elasticsearch.
- Experience with front-end technologies: Angular, TypeScript, Redux, Web Components, Canvas, Nx.
- Knowledge of infrastructure technologies: Amazon AWS, Docker, Kubernetes.
- Familiarity with CI/CD tools: GitHub, CircleCI, ArgoCD.
- Experience with monitoring tools: DataDog.
- Familiarity with AI tools: Claude Code, Cursor, Gemini, GitHub Copilot.
Benefits
- RSUs grant in a rapidly growing company raising its value every day.
- Annual bonus.
- Multinational team with 42 nationalities creating the future of automation.
- Learning & Development plan (online language, professional courses, conference tickets and other trainings) & 2 learning days per year.
- Notebook/Macbook and 34’’ curved monitor.
- 25 days of vacation, 4 sick days, Company day off 31.12.
- 10 care days to care for your loved ones.
- Extra parental vacation (3-6 months).
- RSUs grant for a newborn child.
- Life insurance.
- Benefit Plus Cafeteria (incl. MultiSport Card).
- Remote working allowance.
- Snack bar, coffee, tea, fruit and vegetable, and sweets all day - every day - available for everyone.
- Wednesday lunch, and Friday break, with company-provided food and drinks, with music and lively discussion.
- Flexible working hours + home office.
- Company therapy pets in Prague's office (dog-friendly office).
- Company 3D printer.
- Team buildings, parties, and company events multiple times a year.
Related Guides
Related Categories
Related Job Pages
More Infrastructure Engineer Jobs
Infrastructure Engineer
Quavo Fraud & DisputesQuavo is a leading provider of automated dispute management SaaS solutions for issuing financial institutions.
The Infrastructure Engineer will support internal processes and compliance by completing user requests, maintaining infrastructure, and executing company initiatives related to cloud environments. Key duties involve maintaining the Linux Operating System, managing cloud infrastructure, troubleshooting issues, and participating in an on-call rotation for maintenance.
This role involves building, deploying, optimizing, and securing cloud-based and containerized solutions. Ensure availability, scalability, and security of cloud applications and related services Focus on automation, performance, efficiency, security, and compliance Work with tea...
This role is critical for ensuring the safe, on-schedule, and on-budget delivery of rail infrastructure components for the High-Speed Rail Program through strategic oversight and technical leadership. Responsibilities include leading delivery of components like trackwork and civil structures, monitoring construction, and coordinating interfaces with rail systems disciplines.
This role is critical for ensuring the successful delivery of rail infrastructure components for the High-Speed Rail Program by providing strategic oversight, technical leadership, and coordination. Responsibilities include leading and managing the delivery of components like trackwork and civil structures, overseeing construction compliance, monitoring progress, and resolving technical and logistical issues.