Software Engineer - Moveworks

ServiceNow ServiceNow · Enterprise · Mountain View, CA +1 · Engineering

This role focuses on building and maintaining the core infrastructure and foundational functionalities for distributed systems, including key-value stores, schema-less data stores, authentication, event streaming, configuration management, rate limiting, feature flags, and traffic replay. It also involves improving observability and reliability through distributed logging, tracing, monitoring, and alerting, and optimizing performance, scalability, and latency of applications. The role requires experience with backend distributed systems, cloud infrastructure, and open-source technologies like Kafka and Elasticsearch.

What you'd actually do

  1. Architect and build out the core infrastructure and foundational functionalities such as distributed key value store, schema-less data store, authentication and authorization, event streaming, distributed configuration management and hot reloading, rate limiting and circuit breaking, system for feature flags and A/B testing, traffic capture and replay, etc.
  2. Improve observability and reliability of Moveworks systems by building and enhancing distributed logging, tracing, monitoring, and alerting infrastructure
  3. Define methodologies and metrics to measure performance of microservices and product functionalities, identify and fix bottlenecks, as well as improve overall performance and scalability of Moveworks applications
  4. Closely work with machine learning, search, product, data, and frontend teams, understand their infrastructure needs, influence infrastructure roadmap, and lead as well as execute on various projects
  5. Regularly deliver deadline sensitive work that is interdependent with other engineering teams

Skills

Required

  • designing backend distributed systems
  • building backend distributed systems
  • shipping backend distributed systems
  • maintaining backend distributed systems
  • Python
  • Golang
  • Java
  • C++
  • Docker
  • AWS
  • GCP
  • Azure
  • Kafka
  • Elasticsearch
  • Opensearch
  • Istio
  • performance optimization
  • scalability optimization
  • latency optimization
  • monitoring infrastructure

Nice to have

  • schema-less data store
  • authentication and authorization
  • event streaming
  • distributed configuration management
  • hot reloading
  • rate limiting
  • circuit breaking
  • feature flags
  • A/B testing
  • traffic capture and replay
  • distributed logging
  • distributed tracing
  • alerting infrastructure
  • computer science fundamentals

What the JD emphasized

  • 2+ years of experience designing, building, shipping, and maintaining backend distributed systems
  • startup pace
  • high degree of ownership
  • solving for performance, optimization, scalability, latency, and monitoring
  • Strong motivation, gumption, and an appetite for continuous, incremental changes and completing challenging projects fast