VGS

The World's Largest Cloud-Based Tokenization Solution

Senior Infrastructure Engineer

Infrastructure EngineerInfrastructure EngineerFull TimeRemoteTeam 201-500H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

85 days ago

Salary

$140K - $190K / year

AWSTerraformCloud FormationLinuxDockerKubernetesKafkaJavaPythonSpring FrameworkCi/cdGit OpsAPI GatewayPrometheusGrafanaOpen TelemetryGoBashNetworkingLoad Balancing

Job Description

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

We are looking for a well-versed, passionate Engineer who wants to play a key role in site reliability engineering and cloud operations of our global cloud infrastructure. You will likely be successful in this role if you identify with the following traits: attention to detail, problem solver, customer-oriented, versatile, resilient, and confident.

What you will be doing at VGS:

  • Architect and maintain scalable, reliable infrastructure: Design and optimize infrastructure for high availability, fault tolerance, and performance across distributed systems.
  • Lead incident management and root cause analysis: Own incident response processes, ensure swift resolution of issues, and drive post-incident improvements to prevent recurrences.
  • Service monitoring and automation: Build and maintain automated monitoring, alerting, and healing systems that improve system health, reduce manual intervention, and minimize downtime.
  • Performance tuning and capacity planning: Identify bottlenecks and optimization opportunities, and implement scaling strategies to handle traffic spikes and growing workloads efficiently.
  • Collaborate with cross-functional teams: Work closely with software engineers, product teams, and DevOps to enhance system reliability and delivery pipelines.
  • Improve operational processes: Champion continuous improvement initiatives in deployment, scaling, and performance testing, while advocating for the adoption of SRE best practices across the organization.
  • Mentorship and leadership: Provide technical mentorship to junior engineers, contribute to strategic decisions around infrastructure, and ensure best practices are implemented at scale.
  • Be proactive and innovative: We rely on your feedback to build a world-class product.
  • Be a part of a team that believes in the core values of transparency, collaboration, grit, and humility; in going above and beyond what is required to do the right thing for our customers and the company; and in having fun while doing all this!

Qualifications

  • Proven experience in Infrastructure/SRE roles, with a track record of managing production systems in complex, large-scale environments.
  • Strong proficiency in AWS, including infrastructure-as-code (Terraform, CloudFormation, etc.).
  • Solid understanding of cloud-native architecture, Linux Systems, microservices, Infrastructure-as-code (Terraform, CloudFormation, CDK), CI/CD (CircleCI, GitHub Actions, Argo), GitOps, Authentication and Authorization, APIs and API Gateway, Docker, Kubernetes (EKS), Kafka (MSK), Java, Spring Framework, Python, and AWS services.
  • Strong plus if you are a database wiz.
  • Expertise in monitoring and observability tools like Prometheus, Grafana, Open Telemetry, New Relic, or similar tools to measure system health and performance.
  • Programming and scripting experience in languages such as Python, Go, Bash, or other relevant languages used in automating infrastructure.
  • Solid understanding of networking, security, and load balancing in cloud-native environments.
  • Strong communication and collaboration skills, with the ability to lead cross-functional initiatives and mentor junior team members.
  • Experience with incident management and disaster recovery best practices.
  • Strong written and verbal communication skills.

Requirements

  • $140,000 - $190,000 a year

Benefits

  • Flexible work hours and flexible PTO
  • Competitive health benefits
  • VGS stock options
  • 401k plan, with employer matching 4% and immediate vesting (available only for US employees)
  • Life & disability insurance
  • Pre-tax flexible spending accounts, dependent and healthcare FSA (available only for US employees)
  • Global parental leave program
  • Employee Assistance Program
  • Home Internet reimbursement
  • New hire home office set-up allowance
  • Professional learning reimbursement

Job Requirements

  • Proven experience in Infrastructure/SRE roles, with a track record of managing production systems in complex, large-scale environments.
  • Strong proficiency in AWS, including infrastructure-as-code (Terraform, CloudFormation, etc.).
  • Solid understanding of cloud-native architecture, Linux Systems, microservices, Infrastructure-as-code (Terraform, CloudFormation, CDK), CI/CD (CircleCI, GitHub Actions, Argo), GitOps, Authentication and Authorization, APIs and API Gateway, Docker, Kubernetes (EKS), Kafka (MSK), Java, Spring Framework, Python, and AWS services.
  • Strong plus if you are a database wiz.
  • Expertise in monitoring and observability tools like Prometheus, Grafana, Open Telemetry, New Relic, or similar tools to measure system health and performance.
  • Programming and scripting experience in languages such as Python, Go, Bash, or other relevant languages used in automating infrastructure.
  • Solid understanding of networking, security, and load balancing in cloud-native environments.
  • Strong communication and collaboration skills, with the ability to lead cross-functional initiatives and mentor junior team members.
  • Experience with incident management and disaster recovery best practices.
  • Strong written and verbal communication skills.
  • $140,000 - $190,000 a year

Benefits

  • Flexible work hours and flexible PTO
  • Competitive health benefits
  • VGS stock options
  • 401k plan, with employer matching 4% and immediate vesting (available only for US employees)
  • Life & disability insurance
  • Pre-tax flexible spending accounts, dependent and healthcare FSA (available only for US employees)
  • Global parental leave program
  • Employee Assistance Program
  • Home Internet reimbursement
  • New hire home office set-up allowance
  • Professional learning reimbursement

Related Categories

Related Job Pages

More Infrastructure Engineer Jobs

Software Engineer, Infrastructure

Clever

Clever believes classrooms and our company should be diverse and inclusive. We celebrate actions that build diverse teams, include every voice, and create safe spaces for everyone to bring their authentic selves to work.

Infrastructure Engineer87 days ago
Full TimeRemote

Clever is on a mission to connect every student to a world of learning. As the leading identity platform for education, more than 111,000 schools worldwide use Clever to power secure digital learning experiences. The Infrastructure team handles platform engineering at Clever, bui...

GoAWSECSDynamoDBVPCLambdaRDSKinesisKubernetesEKSEC2
United States
$126K - $148K / year

IT Infrastructure Specialist

SoluStaff

People Powering Technology

Infrastructure Engineer88 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

IT Infrastructure Specialist supporting deployment and operation of customer environments at Symmetrio

CitrixDNSITSMMS SQL ServerSQLTCP/IPVMware
United States

Data Center Manager

RYZ Labs

RYZ Labs is a startup studio built in 2021 by three lifelong entrepreneurs. The founders of RYZ have worked at some of the world's largest tech companies and some of the most iconic consumer brands. They have lived and worked in Argentina for many years and have decades of experience in Latam. Passion for the early phases of company creation Attracting the brightest talents to build industry-defining companies in a post-pandemic world Remote and distributed teams throughout the US and Latam Use of cutting-edge technologies in cloud computing Aim to provide diverse product solutions for different industries Plans to build a large number of startups in the upcoming years Our Values and What to Expect Customer First Mentality - every decision we make should be made through the lens of the customer. Bias for Action - urgency is critical, expect that the timeline to get something done is accelerated. Ownership - step up if you see an opportunity to help, even if not your core responsibility. Humility and Respect - be willing to learn, be vulnerable, and treat everyone who interacts with RYZ with respect. Frugality - being frugal and cost-conscious helps us do more with less. Deliver Impact - get things done most efficiently. Raise our Standards - always be looking to improve our processes, our team, and our expectations. The status quo is not good enough and never should be.

Infrastructure Engineer88 days ago
Full TimeRemoteTeam 51-200

RYZ Labs is hiring for a Data Center Manager to oversee day-to-day operations, ensuring high availability and consistent performance. Experience with power, cooling (HVAC), and physical security preferred. Day/night shifts and on-call available! Lead daily data center operations ...

UPSGeneratorsPower distributionCRACCRAHCablingDCIMBMSITSMITILBudget managementVendor managementCapacity planningIncident managementChange managementPreventive maintenanceCorrective maintenance
United States + 24 moreAll locations: United States, Brazil, Colombia, Argentina, Chile, Venezuela, Bolivarian Republic Of, Bolivia, Plurinational State Of, Ecuador, French Guiana, Guyana, Paraguay, Peru, Suriname, Uruguay, Mexico, Costa Rica, El Salvador, Guatemala, Honduras, Nicaragua, Panama, Dominican Republic, Puerto Rico
Infrastructure Engineer90 days ago
Full TimeRemoteTeam 1,001-5,000

Systems and Infrastructure Engineer managing core Windows-based infrastructure at Commure

DNSFirewallsOracleSwitchingVMware
United States
$95K - $149K / year