Site Reliability Engineer | Trading Operations

Jump Trading · Quant · Amsterdam, Netherlands · IT Infrastructure + WCW

Site Reliability Engineer for a trading firm, focusing on building and maintaining global infrastructure, including monitoring, packet analysis, and automation frameworks. Requires strong Python/Go, Linux, and streaming systems experience.

What you'd actually do

  1. Work in a hybrid Systems Engineer / Software Developer capacity on Jump’s internal systems that manage the global infrastructure and colocation footprint
  2. Architect, build and maintain streaming monitoring tooling, high performance / real-time packet and flow analysis systems, configuration management and automation frameworks
  3. Take an ops-first approach, focusing on improving the quality of the platforms for the wider team, and always strive to identify improvements and opportunities to make us more efficient
  4. Identify opportunities to make services more efficient and resilient by designing alerting and observability around team and business requirements
  5. Do deep-dive debugging sessions on low-level performance issues in complex software stacks

Skills

Required

  • Python
  • Go
  • Linux
  • streaming systems architecture and design
  • Message/streaming queues (Apache Kafka, RabbitMQ, etc.)
  • big data / columnar-style data stores (Bigtable, ClickHouse, Cassandra) and time series databases
  • version control and CI/CD systems
  • App deployment management and lifecycle (CI/CD, Kubernetes, pods/containers)
  • infrastructure challenges in a reliability capacity
  • strategic thinking skills and maturity in tackling complex problems, dealing with people, technology and processes

Nice to have

  • Rust
  • C/C++
  • Arista or Cisco hardware
  • Routing protocol knowledge
  • network design knowledge

What the JD emphasized

  • world class infrastructure
  • low-latency Wide Area Networks
  • high performance / real-time packet and flow analysis systems
  • low-level performance issues