Principal Operations Engineer

Salesforce · Enterprise · New York, NY

Salesforce is seeking a Principal Operations Engineer to lead the design and implementation of automation-first operations for its Digital Enterprise Technology organization. This role focuses on eliminating manual workflows, building automation pipelines, reducing toil, and driving the adoption of self-healing systems. The engineer will also play a key role in incident management, reliability engineering, and defining operational strategies with a customer-centric focus. The position requires extensive experience in engineering, operations, or SRE, along with a strong background in automation, observability, and distributed systems.

What you'd actually do

  1. Lead the design and implementation of automation-first operations, eliminating manual workflows across incident management, alerting, escalation, runbooks, and day-to-day operational processes
  2. Build and scale alert-to-incident automation pipelines to accelerate detection and response times
  3. Identify and prioritize high-impact toil reduction opportunities across the ecosystem
  4. Drive adoption of self-healing systems and automated remediation patterns
  5. Provide Tier 2+ advanced application support for complex production issues and lead deep-dive investigations into system failures

Skills

Required

  • 12+ years of experience in engineering, operations engineering, SRE, or related roles
  • Proven track record of automating complex operational workflows and improving reliability and operational maturity at scale
  • Deep expertise in incident management systems, observability (metrics, logging, tracing), and distributed systems and microservices
  • Strong experience with automation frameworks, scripting, Infrastructure as Code, and modern DevOps practices
  • Experience operating high-availability, customer-facing systems in enterprise environments
  • Strong written and verbal communication skills with the ability to influence senior engineering leaders and drive outcomes across teams without formal authority
  • A related technical degree is required

Nice to have

  • Experience building self-service or platform-based operational tooling
  • Background in automation-driven operations or platform engineering
  • Experience leading large-scale incident management transformations
  • Familiarity with AI/ML-driven operations (AIOps)
  • Experience in SaaS/PaaS enterprise environments
  • Salesforce ecosystem experience (Apex, LWC, APIs, etc.)

What the JD emphasized

  • automating complex operational workflows
  • improving reliability and operational maturity at scale
  • incident management systems
  • observability (metrics, logging, tracing)
  • distributed systems and microservices
  • automation frameworks, scripting, Infrastructure as Code, and modern DevOps practices
  • operating high-availability, customer-facing systems in enterprise environments