Operations Platform Engineer

Meta Meta · Big Tech · Redmond, WA

Meta is looking for an Operations Platform Engineer to build and evolve the robotics test and data infrastructure for motor validation and experimentation. This role focuses on creating robust software infrastructure for motor control, testing, telemetry, and data pipelines, aiming to make systems repeatable, observable, and scalable for research and engineering teams. The position involves designing test infrastructure, standardizing test stations, defining telemetry schemas, building data pipelines, establishing observability, and collaborating with various teams to improve data quality and test throughput.

What you'd actually do

  1. Design and build motor and actuator test infrastructure, including control loops, data capture, and validation tooling
  2. Develop and standardize repeatable test stations that scale across hardware variants, labs, and teams
  3. Define and implement telemetry schemas and data contracts for robotic systems (commands, feedback, environment, failures), ensuring consistency across programs
  4. Build time-synchronized data pipelines to support debugging, replay, offline analysis, and training workflows
  5. Establish observability standards for robotic systems, including metrics, logging, diagnostics, anomaly detection, and dashboards

Skills

Required

  • software engineering
  • systems engineering
  • robotics engineering
  • working close to hardware
  • motors
  • sensors
  • actuators
  • embedded systems
  • embedded Linux environments
  • design and build test frameworks or infrastructure for physical systems
  • data ingestion pipelines for high-frequency and/or real-time telemetry
  • time sync
  • buffering
  • backpressure
  • schema evolution
  • Systems engineering fundamentals
  • APIs
  • data schemas
  • failure modes
  • reliability
  • operational discipline
  • maintainable interfaces
  • operate effectively in ambiguous, fast-moving environments with evolving requirements
  • communication and collaboration skills across hardware, software, and research disciplines

Nice to have

  • industrial robotics
  • automation
  • real-time systems
  • robotics data formats
  • replay systems
  • simulation pipelines
  • log-based debugging at scale
  • observability tooling and practices
  • metrics
  • logging
  • tracing
  • dashboards
  • alerting
  • supporting ML or research teams through infrastructure
  • data capture
  • labeling support
  • dataset generation
  • evaluation pipelines
  • bringing prototype systems into scaled, multi-team usage
  • documentation
  • onboarding
  • operational support
  • integrate AI tools to optimize/redesign workflows
  • prompt/context engineering
  • agent orchestration
  • responsible, ethical AI practices
  • risk assessment
  • bias mitigation
  • quality and accuracy reviews
  • AI skill development
  • staying current with emerging AI technologies

What the JD emphasized

  • 7+ years of experience in software engineering, systems engineering, robotics engineering, or related fields
  • 3+ years of experience working close to hardware, including motors, sensors, actuators, embedded systems, and/or embedded Linux environments
  • Proven ability to design and build test frameworks or infrastructure for physical systems (labs, manufacturing tests, reliability rigs, end-of-line, or similar)
  • Experience building data ingestion pipelines for high-frequency and/or real-time telemetry (including time sync, buffering, backpressure, and schema evolution)