Restaurant365

Restaurant365 is a SaaS company disrupting the restaurant industry! Our cloud-based platform provides a unique, centralized solution for accounting and back-office operations for restaurants. Restaurant365’s culture is focused on empowering team members to produce top-notch results while elevating their skills.

Site Reliability Engineer II

DevOps EngineerDevOps EngineerFull TimeRemoteTeam 501-1,000H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

44 days ago

Salary

$98.6K - $138.0K / year

Bachelor Degree2 yrs expEnglishAnsibleApacheAWSAzureCloudGrafanaLinuxNGINXPrometheusPythonTerraform

Job Description

• The Site Reliability Engineer II will be responsible for supporting, enhancing, and maintaining Restaurant365’s cloud infrastructure and applications. • Collaborate with DevOps, development, and infrastructure teams to resolve moderately complex issues, propose improvements, and strengthen the reliability, scalability, and security of our SaaS platform. • Respond to production incidents, perform triage and troubleshooting, and contribute to post-incident analysis. • Identify and automate manual processes to improve efficiency and reduce risk. • Enhance and evolve monitoring tools and platforms to improve observability. • Promote and apply best practices for reliability, scalability, and performance across engineering. • Implement and support cloud automation using Terraform, Ansible, or CloudFormation. • Work within change management protocols to provide maximum uptime for production systems. • Participate in on-call rotation, providing 24x7 support for incidents and contributing to root cause analysis. • Partner with developers, architects, vendors, and IT teams to ensure reliable system operations. • Research and remediate vulnerabilities in coordination with security teams. • Maintain documentation of infrastructure, monitoring, runbooks, and incident response procedures.

Job Requirements

  • BS in Computer Science, Information Systems, or related field (or equivalent experience).
  • 2–4 years of experience in site reliability engineering, DevOps, or cloud operations.
  • Experience with cloud platforms (Azure or AWS), including services such as AKS, ECS/EKS, Functions/Lambda, S3, and Blob storage.
  • Proficiency with infrastructure-as-code and automation (Terraform, Ansible, YAML, Python, Bash, PowerShell).
  • Strong Linux engineering skills; working knowledge of Windows administration.
  • Experience supporting production environments and participating in on-call rotations.
  • Familiarity with web servers and middleware (Nginx, Apache Tomcat).
  • Experience with CI/CD tools (GitLab, Git, or similar).
  • Strong written, oral, and interpersonal communication skills.
  • Preferred Qualifications
  • Experience with monitoring tools (Prometheus, Grafana, ELK, Site24x7, Nagios).
  • Knowledge of performance analysis and system vulnerability remediation.
  • Cloud certification (AWS or Azure) preferred.
  • Familiarity with restaurant industry SaaS platforms and customer-facing applications.

Benefits

  • Comprehensive medical benefits, 100% paid for employee
  • 401k + matching
  • Equity Option Grant
  • Unlimited PTO + Company holidays
  • Wellness initiatives

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Deployment Engineer

Mashgin

World's fastest AI powered Touchless self-checkout ecosystem. YC W15.

DevOps Engineer44 days ago
Full TimeRemoteTeam 11-50Since 2015H1B No Sponsor

Deployment Engineer handling technology and hardware installations nationwide for Mashgin

ServiceNow
New York
$105K / year
Full TimeRemote

We're looking for a Staff SRE who can own the reliability, scalability, and operational excellence of our platform. You'll work at the intersection of infrastructure and software engineering - building the systems, tooling, and practices that let our team ship confidently and ope...

United States + 1 moreAll locations: United States, Canada
$195K - $245K / year

Senior Site Reliability Engineer, Azure Red Hat OpenShift

Red Hat

The leading provider of enterprise open source solutions.

DevOps Engineer44 days ago
Full TimeRemoteTeam 10,001+Since 1993H1B Sponsor

Senior Site Reliability Engineer managing OpenShift cloud services at Red Hat

AnsibleAWSAzureChefCloudDNSJavaLinuxPrometheusPuppetPythonTCP/IPGo
California + 1 moreAll locations: California, Oregon
$139.6K - $230.2K / year
Full TimeRemote

As a Senior Site Reliability Engineer, you'll enhance system performance, reliability, and cost efficiency in a large-scale production environment, shifting manual operations to AI-assisted engineering.

AnsibleDatadogElkGrafanaKubernetesLinuxPrometheusPythonRubyTerraform
United States