GoodRx
Affordable healthcare for everyone.
Sr. Platform Engineer
Location
United States
Posted
3 days ago
Salary
Not specified
No structured requirement data.
Job Description
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.
Role Description
This role involves designing, implementing, and evolving highly available, scalable, and secure infrastructure on AWS and GCP.
- Apply sound architectural judgment and long-term systems thinking while balancing cost.
- Design and develop reusable platform tooling and infrastructure-as-code modules that accelerate build and release management while improving reliability and consistency across teams.
- Passionate about improving processes through automation, process engineering, artificial intelligence, and documentation.
- Partner with application engineering teams to define, improve, and enforce operational standards, deployment practices, and reliability expectations across services.
- Independently lead complex infrastructure upgrades, migrations, and performance optimization initiatives.
- Conduct structured root cause analysis for production incidents and drive systemic improvements to prevent recurrence.
- Evaluate emerging technologies and recommend improvements to platform architecture, scalability, and cost efficiency.
- Document architectural decisions, operational standards, and runbooks to improve organizational knowledge sharing.
- Provide technical guidance and mentorship to Platform Engineer team members on infrastructure design and DevOps best practices.
Qualifications
- Extensive experience architecting and operating automated Linux-based cloud infrastructure in production environments.
- Strong experience designing and operating complex CI/CD pipelines (blue/green, canary, multi-stage) in large-scale production environments.
- Deep hands-on experience with infrastructure-as-code (Terraform, CloudFormation, or equivalent), including reusable module design and environment standardization.
- Strong software engineering background in Python, Go, or similar languages, with the ability to build automation and tooling to solve complex operational challenges.
- Build and release experience with tools such as Github Actions, GitlabCI, Codefresh or similar.
- Hands-on experience using AI to speed up research, coding, troubleshooting, and documentation workflows.
- Experience designing observability systems using APM, metrics, logs, and tracing tools (Datadog, CloudWatch, SumoLogic, etc.) to proactively manage reliability.
- Deep hands-on experience operating Kubernetes clusters in production, including workload orchestration, scaling, networking, and troubleshooting.
- Strong analytical and debugging skills, with demonstrated ability to independently diagnose and resolve complex, multi-system issues.
- Excellent collaboration and communication skills, with the ability to advise senior engineering partners and clearly communicate technical trade-offs.
- Excellent time management and organizational skills.
Requirements
- Engineering teams are responsible for supporting appropriate security controls, including management, operational, and technical controls in addition to general GoodRx best practices.
- Read and adhere to the security policies and procedures, being vigilant and observant of potential security threats.
Benefits
- Medical, dental, and vision insurance
- 401(k) with a company match
- Employee Stock Purchase Plan (ESPP)
- Unlimited vacation
- 13 paid holidays
- 72 hours of sick leave
- Mental wellness and financial wellness programs
- Fertility benefits
- Generous parental leave
- Pet insurance
- Supplemental life insurance for you and your dependents
- Company-paid short-term and long-term disability
Job Requirements
- Extensive experience architecting and operating automated Linux-based cloud infrastructure in production environments.
- Strong experience designing and operating complex CI/CD pipelines (blue/green, canary, multi-stage) in large-scale production environments.
- Deep hands-on experience with infrastructure-as-code (Terraform, CloudFormation, or equivalent), including reusable module design and environment standardization.
- Strong software engineering background in Python, Go, or similar languages, with the ability to build automation and tooling to solve complex operational challenges.
- Build and release experience with tools such as Github Actions, GitlabCI, Codefresh or similar.
- Hands-on experience using AI to speed up research, coding, troubleshooting, and documentation workflows.
- Experience designing observability systems using APM, metrics, logs, and tracing tools (Datadog, CloudWatch, SumoLogic, etc.) to proactively manage reliability.
- Deep hands-on experience operating Kubernetes clusters in production, including workload orchestration, scaling, networking, and troubleshooting.
- Strong analytical and debugging skills, with demonstrated ability to independently diagnose and resolve complex, multi-system issues.
- Excellent collaboration and communication skills, with the ability to advise senior engineering partners and clearly communicate technical trade-offs.
- Excellent time management and organizational skills.
- Engineering teams are responsible for supporting appropriate security controls, including management, operational, and technical controls in addition to general GoodRx best practices.
- Read and adhere to the security policies and procedures, being vigilant and observant of potential security threats.
Benefits
- Medical, dental, and vision insurance
- 401(k) with a company match
- Employee Stock Purchase Plan (ESPP)
- Unlimited vacation
- 13 paid holidays
- 72 hours of sick leave
- Mental wellness and financial wellness programs
- Fertility benefits
- Generous parental leave
- Pet insurance
- Supplemental life insurance for you and your dependents
- Company-paid short-term and long-term disability