Microsoft

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Principal AI Operations Engineer

Full TimeRemote

Location

United States

Posted

6 days ago

Salary

Not specified

No structured requirement data.

Job Description

We are seeking a Principal AI Operations Engineer to define the technical direction for the AI Operations group. In this role, you will: Design and architect operational systems Establish standards for branch health, CI/CD pipelines, production deployments, and on-call processes Drive reliability initiatives and maintain production health and uptime Ensure the platform meets its SLOs Be the escalation point for complex incidents Work closely with the Platform team to ensure services are operationally ready

Job Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 6+ years technical engineering experience in DevOps, SRE, or platform operations
  • 6+ years driving complex operational initiatives across teams; demonstrated success leading without authority
  • 4+ years hands-on experience with Kubernetes in production environments
  • 3+ years building and maintaining CI/CD pipelines at scale
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • Preferred Qualifications
  • Experienced with Kubernetes: cluster operations, Helm, troubleshooting, autoscaling, and production management
  • Proficiency with CI/CD platforms: Azure DevOps, GitHub Actions, or similar pipeline tooling
  • Experience with cloud platforms (Azure preferred): AKS, networking, identity management, and resource provisioning
  • Infrastructure as Code: Bicep, Terraform, or Helm chart development
  • Observability tooling: Prometheus, Grafana, OpenTelemetry, and log analytics (Kusto/KQL)

Benefits

  • The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year
  • There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year
  • Certain roles may be eligible for benefits and other compensation

Related Categories

Related Job Pages