CVS Health
Bringing our heart to every moment of your health.
Staff Engineer – SRE, Retail & Pharmacy
DevOps Engineer | Full Time | Remote | Team 10,001+ | Since 1963 | H1B No Sponsor
Location
Massachusetts, Texas
Posted
37 days ago
Salary
$118.5K - $284.3K / year
Bachelor Degree | 8 yrs exp | English | AWS | Azure | Cloud | Docker | Grafana | Java | Jenkins | Kubernetes | Microservices | Prometheus | Python | Splunk
Job Description
• Implement and maintain comprehensive observability solutions, providing real-time insights into system performance and overall health
• Investigate and resolve incidents quickly during critical situations and perform root cause analysis
• Collaborate with cross-functional teams to build robust monitoring, alerting, and telemetry solutions
• Design and implement observability solutions tailored for edge computing environments
• Define and maintain Service Level Indicators (SLIs), Service Level Objectives (SLOs), and business KPIs
• Build and optimize dashboards, visualizations, and alerting systems
• Implement distributed tracing and log aggregation systems
• Collaborate with engineering teams to ensure applications and infrastructure at edge locations are designed with observability in mind
• Drive proactive identification of issues in edge facilities through advanced observability tools
• Lead incident postmortems and implement observability-driven improvements
• Develop and maintain tools, scripts, and automation to enhance observability pipelines
• Evaluate and integrate industry-standard observability tools
• Optimize observability data storage, retention, and querying
• Mentor and guide junior SREs and engineers on observability best practices
• Partner with solution, engineering, and business teams to align observability efforts with business objectives
• Stay current with emerging observability trends, tools, and methodologies
• Contribute to the development of observability standards, runbooks, and documentation
• Drive cost optimization for observability infrastructure while maintaining high-quality monitoring
Job Requirements
- 8+ years of experience in SRE, DevOps, or related technology roles
- 5+ years of experience in delivering software in a large-scale environment with reliability and resilience concepts (multi-region, multi-cloud, containerization, etc.)
- 5+ years of experience with observability and monitoring tools such as Splunk, Dynatrace, Datadog, Prometheus, Grafana, etc.
- 3+ years of experience with programming/scripting languages (e.g., Python, Java) for automation and tooling in distributed environments
- 3+ years of experience with cloud technologies (AWS, Microsoft Azure, Google Cloud)
- 3+ years of experience with source control and continuous integration tools like Git/Stash, Bitbucket, or Jenkins
- 2+ years of engineering team leadership or management experience
- Experience using customer feedback tools such as Quantum Metric, Medallia, and Adobe Analytics
- Deep understanding of microservices architecture and cloud-native technologies
- Experience in configuring, supporting, and managing Rancher, Kubernetes, and/or Docker
- Experience in Incident Management, Change Management, Infrastructure Support, and Problem Management concepts and processes
- Excellent interpersonal and communication skills, including the ability to engage technical and non-technical stakeholders
Benefits
- Affordable medical plan options
- 401(k) plan (including matching company contributions)
- Employee stock purchase plan
- No-cost programs for all colleagues including wellness screenings, tobacco cessation, and weight management programs
- Confidential counseling and financial coaching
- Paid time off
- Flexible work schedules
- Family leave
- Dependent care resources
- Colleague assistance programs
- Tuition assistance
- Retiree medical access
- Many other benefits depending on eligibility