nCloud Integrators
Your Success is Our Business
Site Reliability Engineer – Bilingual, Portuguese, English
Location
United States
Posted
118 days ago
Salary
Not specified
Bachelor DegreePortugueseEnglishAWSCloudDistributed SystemsJavaJava ScriptPythonRubySplunk
Job Description
• Ensure the availability and reliability of distributed systems.
• Help the L1 team to resolve the client’s infrastructure/system issues, escalations, alerts, tickets, and queries.
• Works as a bridge between DevOps and other teams in order to build maintain resilient systems.
• Conduct, coordinate and oversee post incident Root Cause Analysis / Reviews.
• Build and maintain documentation for all assigned clients / projects.
• Leverage DevOps, Agile methodology, and standards in day-to-day work.
• Adopt and propose automation of repetitive tasks to reduce/eliminate toil.
• Implement and troubleshoot using observability tools like Datadog, New Relic, Splunk, CloudWatch etc.
• Adopt and ensure the SRE practices in Team.
• Maintenance of AWS managed resources, CI/CD, IAC.
• Planning and implementing disaster recovery and backup plans for AWS cloud platforms.
• Proactively work on efficiency and capacity planning.
• Keep a proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
• Liaise and work closely with Layer-1 Oncall support, DevOps and Operations teams.
• Drive availability and reliability by defining and implementing SLI, SLO, error budget, Observability, Disaster recovery, and backup to detect and mitigate issues.
Job Requirements
- Bachelor’s degree in computer science (preferred) or equivalent management, technical, scientific discipline
- Ability to program (structured and OO) with one or more high level languages, such as Python, Java, C/C++, Ruby, and JavaScript
- Clear understanding of SRE principles and practices and Agile and DevOps methodologies.
- Experience in AWS Well-Architected framework in order to implement the scalable and reliable infrastructure.
- Great team player with flexibility to work.
- Excellent written/verbal communication and leadership skills.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Developer II
Correlated Solutions, Inc.Correlated Solutions offers non-contact measurement solutions for materials and testing using digital image correlation.
DevOps Engineer121 days ago
Full TimeRemoteTeam 11-50Since 1998H1B No Sponsor
DevOps Developer II automating patching and upgrade processes for Corelation.
JavaLinux
Site Reliability Engineer
enosixReal-time data virtualization between SAP ERP and front-end systems of engagement such as Salesforce, ServiceNow & more
DevOps Engineer122 days ago
Full TimeRemoteTeam 51-200Since 2017H1B No Sponsor
Site Reliability Engineer at enosix developing integration solutions between SAP ERP and front-end systems.
AWSAzureCloudEC2ERPPostgresTerraform
Ohio
Senior Site Reliability Engineer, SRE
Gov Services Hub"Empowering Prime Contractors, Simplifying Services"
DevOps Engineer122 days ago
ContractRemoteTeam 51-200Since 2015H1B No Sponsor
Senior Site Reliability Engineer ensuring systems reliability and scalability
AWSCloudEC2PrometheusPythonTerraform
New York
DevOps Engineer123 days ago
Full TimeRemoteTeam 201-500Since 2006H1B No Sponsor
Staff SRE at FloSports improving developer enablement and infrastructure.
AWSGoogle Cloud PlatformJavaScriptKubernetesNode.jsTerraformGo
United States