Forward Deployed Reliability Engineer

Palantir Palantir · Enterprise · New York, NY · Product Development

This role focuses on ensuring the stability and reliability of Palantir's mission-critical workflows. The engineer will be responsible for incident response, diagnosing and resolving issues, and driving product improvements based on field learnings. Key activities include automating tasks, streamlining workflows, and documenting best practices to enhance overall system resilience and quality of service.

What you'd actually do

  1. Go on-call, responding quickly and effectively to mission-critical incidents
  2. Diagnose, resolve, and proactively prevent issues encountered in the field
  3. Collaborate with internal stakeholders to increase the scalability and reliability of Foundry workflows for our customers
  4. Identify recurring pain points and inefficiencies, and take initiative to automate or streamline workflows
  5. Advocate for and implement product enhancements based on insights gleamed from the field

Skills

Required

  • Python
  • Java
  • SQL
  • parallel data processing
  • Spark job optimization
  • root cause analysis
  • documentation

Nice to have

  • scripting
  • automation
  • workflow streamlining
  • product enhancement advocacy

What the JD emphasized

  • US citizen or green card holder