Vivun
Empowering Solutions Teams to Redefine the Buyer Experience.
Lead Observability Engineer
Location
California
Posted
152 days ago
Salary
$185K - $205K / year
6 yrs exp, English, Grafana, Prometheus
Job Description
• Own the end-to-end observability strategy for Ava, defining the standards, tools, and patterns that ensure reliable visibility across infrastructure and agentic components.
• Design and implement correlation models that link agent behavior, LLM interactions, and SaaS telemetry into cohesive, actionable insights.
• Unify observability tooling across teams, ensuring metrics, logs, and traces flow into a central platform (e.g., Observe, Datadog, or equivalent).
• Collaborate with engineering and QA to embed observability best practices into development workflows, CI/CD, and quality gates.
• Establish enablement frameworks—documentation, dashboards, and templates—that make observability self-serve for all engineering teams.
• Partner with teammates to ensure observability aligns with infrastructure reliability, alerting, and incident response patterns.
• Contribute to performance and reliability strategy, helping define how we measure agent quality, responsiveness, and system scalability.
Job Requirements
- 6+ years of experience in SRE, DevOps, or Observability Engineering roles, including at least 2 years leading or designing observability initiatives.
- Deep knowledge of observability tooling (e.g., OpenTelemetry, Prometheus, Grafana, Datadog, Honeycomb, Observe) and distributed tracing practices.
- Experience with agentic/LLM-based systems, including tools like LangChain, Celery, OpenAI APIs, or similar orchestration frameworks.
- Strong understanding of how to instrument, trace, and correlate AI/LLM workflows with infrastructure-level telemetry.
- Proven ability to define cross-team standards, influence engineering culture, and establish scalable monitoring patterns.
- Strong collaboration and communication skills: you enable rather than dictate.
Benefits
- Competitive salary and full health benefits
- Stock options at a well-funded, pre-IPO company on a fast-growth track
- Flexible work schedules and work from anywhere at a fully remote company
- Unlimited PTO with two weeks designated as a “quiet period” each year
- An experienced team who will fight beside you in the trenches to accomplish your goals