Site Reliability Engineer
Location
United States
Posted
2 days ago
Salary
$110K - $140K / year
No structured requirement data.
Job Description
Role Description
As a Site Reliability Engineer at Qlik, you’ll sit at the heart of our cloud ecosystem, helping power the reliability, security, and scalability of Qlik and Talend Cloud services used around the world. This is your opportunity to work on systems operating at serious scale — supporting millions of transactions across a global cloud environment — while shaping how reliability engineering is done across the business.
You won’t just “keep the lights on.” You’ll design, improve, automate, and elevate how modern cloud platforms perform. If you’re motivated by complex distributed systems, Kubernetes at scale, and solving meaningful engineering challenges, this is where you’ll thrive.
What makes this role interesting?
- Solve real scale challenges: Work on reliability and performance across a global cloud platform handling millions of transactions.
- Engineer, not just operate: Build tooling, automation, alerts, and scalable infrastructure patterns that prevent problems before they happen.
- Collaborate with highly skilled teams: Partner with Global SRE, Architecture, Platform, and Domain Engineering teams to influence how infrastructure is designed from the ground up.
- Work with modern cloud-native technologies: Kubernetes, IaC, observability tooling, autoscaling, secret management, CI/CD — you’ll be hands-on with today’s most relevant technologies.
- Shape best practices: Help define and champion cloud optimization and reliability standards across the organization.
- Grow your technical influence: Act as a go-to resource for reliability, incident management, cloud engineering, and production operations.
- Continuously evolve: Stay close to emerging tools and practices, contributing to ongoing improvements in our cloud environment.
Your work will directly influence the stability and performance of services relied on by customers worldwide. You will:
- Increase reliability and availability: by implementing resilient infrastructure patterns and performance optimizations.
- Reduce incidents and recovery time: through better observability, automation, and proactive engineering.
- Strengthen scalability: by designing infrastructure that adapts seamlessly to growth.
- Improve cloud efficiency: by driving optimization best practices across AWS and Azure environments.
- Resolve complex system challenges: across infrastructure, networking, applications, and distributed systems.
On-Call Support:
- Participate in on-call duties to maintain the availability and performance of our cloud infrastructure, providing regular updates on project status and activities. This includes first-line incident response.
- Elevate engineering standards by mentoring peers and embedding reliability-first thinking into development workflows.
Qualifications
- Cloud engineering skill across AWS and/or Azure, including hands-on experience supporting production systems running on Kubernetes at scale.
- Infrastructure as Code and microservices experience, using tools such as Terraform, Crossplane or Ansible, with a strong understanding of operating distributed systems in live environments.
- Automation and engineering mindset, with proficiency in Python, Go or Bash, plus experience building and improving CI/CD pipelines and autoscaling strategies.
- Observability and incident management depth, including Prometheus, Grafana, OpenTelemetry, distributed tracing, and SIEM tooling — with the ability to turn insights into reliability improvements.
- Security and networking knowledge, including secret management (e.g., Vault, AWS SSM) and familiarity with infrastructure security and compliance best practices.
- Cloud-native tooling experience, including Helm (managing and creating charts) and exposure to modern database and ecosystem technologies such as MongoDB.
- Strong analytical thinking, with the ability to troubleshoot complex issues across infrastructure, networking, and application layers.
- Curiosity and collaboration at their core; a passion for learning, sharing ideas and insight and comfort with the on-call support rotation – experience here is also welcome.
Benefits
- Genuine career progression pathways and mentoring programs.
- Culture of innovation, technology, collaboration, and openness.
- Flexible, diverse, and international work environment.
- Giving back is a huge part of our culture. Alongside an extra “change the world” day plus another for personal development, we also highly encourage participation in our Corporate Responsibility Employee Programs.
- The anticipated base salary range for this role is $110,000.00 USD to $140,000.00 USD. Final compensation offered by Qlik will be based on factors such as the candidate’s location, job-related skills, education, experience, and other business and organizational needs.
- This position is eligible for comprehensive benefits, including - but not limited to - medical, dental, and vision coverage, life and AD&D, short and long-term disability coverage, paid time off, paid parental/maternity leave, participation in a 401(k) program that includes company match, and many other additional voluntary benefits.
Application Window
The application window is 60 days, but applicants are encouraged to apply as soon as possible. The posting will be removed before the application window closes if the position is filled.
Job Requirements
- Cloud engineering skill across AWS and/or Azure, including hands-on experience supporting production systems running on Kubernetes at scale.
- Infrastructure as Code and microservices experience, using tools such as Terraform, Crossplane or Ansible, with a strong understanding of operating distributed systems in live environments.
- Automation and engineering mindset, with proficiency in Python, Go or Bash, plus experience building and improving CI/CD pipelines and autoscaling strategies.
- Observability and incident management depth, including Prometheus, Grafana, OpenTelemetry, distributed tracing, and SIEM tooling — with the ability to turn insights into reliability improvements.
- Security and networking knowledge, including secret management (e.g., Vault, AWS SSM) and familiarity with infrastructure security and compliance best practices.
- Cloud-native tooling experience, including Helm (managing and creating charts) and exposure to modern database and ecosystem technologies such as MongoDB.
- Strong analytical thinking, with the ability to troubleshoot complex issues across infrastructure, networking, and application layers.
- Curiosity and collaboration at their core; a passion for learning, sharing ideas and insight and comfort with the on-call support rotation – experience here is also welcome.
Benefits
- Genuine career progression pathways and mentoring programs.
- Culture of innovation, technology, collaboration, and openness.
- Flexible, diverse, and international work environment.
- Giving back is a huge part of our culture. Alongside an extra “change the world” day plus another for personal development, we also highly encourage participation in our Corporate Responsibility Employee Programs.
- The anticipated base salary range for this role is $110,000.00 USD to $140,000.00 USD. Final compensation offered by Qlik will be based on factors such as the candidate’s location, job-related skills, education, experience, and other business and organizational needs.
- This position is eligible for comprehensive benefits, including - but not limited to - medical, dental, and vision coverage, life and AD&D, short and long-term disability coverage, paid time off, paid parental/maternity leave, participation in a 401(k) program that includes company match, and many other additional voluntary benefits.
- Application Window
- The application window is 60 days, but applicants are encouraged to apply as soon as possible. The posting will be removed before the application window closes if the position is filled.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
The Site Reliability Engineer leads the architecture, design, and deployment of network solutions across the client portfolio, focusing on developing specifications and implementing cloud network security architecture. Key duties involve designing, developing, installing, and maintaining software solutions for Cloud Operations efficiency, refining deployment processes, and participating in 24x7 on-call support rotations.
Our client is dedicated to serving our nation's military and Veterans. They have the honor to support federal agencies in their efforts to advance the United States health care system and improve the overall health and well-being of all those who serve or have served our country....
DevOps Engineer
Bright Vision Technologies"Retrieve the best out of you" in each process what you do.
We are looking for a skilled DevOps Engineer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth pote...
Associate DevOps Manager - EST
HiBobHiBob helps modern, mid-size businesses transform the way they manage people, giving HR and managers all they need to connect, engage, develop, and retain top talent. Since 2015, we’ve achieved consecutive triple-digit year-over-year growth, all backed by our amazing team of Bobbers from across the globe, making us the choice HRIS of over 4000 midsize and multinational companies. Our HR platform is intuitive, data-driven, and built for the way people work today: globally, remotely, and collaboratively. Fast-growing companies across the globe such as Huel, What3words, Fiverr, and VaynerMedia rely upon Bob to help them create the best work experiences for their people.
This player-coach role involves managing and mentoring one Senior DevOps Engineer while spending 60-80% of time hands-on designing, building, and operating production systems, with a focus on supporting AI-focused projects and cloud infrastructure.