Senior Software Development Engineer

Expedia Expedia · Hospitality · CA

Expedia Group is seeking a Senior Software Development Engineer to join their Universal Messaging Platform (UMP) team. This role will focus on designing, building, and operating high-scale, reliable communication services (Email, SMS, Push, Inbox) to enhance traveler experience and drive business growth. The engineer will lead architecture, development, and operations of mission-critical systems, write and review code, take end-to-end ownership of features, and collaborate with product and cross-functional teams. A key aspect involves safely integrating and operating AI/ML-enabled solutions to improve outcomes, though the core function is not AI/ML development itself.

What you'd actually do

  1. Design, build, and evolve scalable, reliable services and components, including system design, low-level design, API design, and data modeling to support high-traffic, customer-facing and platform experiences.
  2. Write, review, and maintain high-quality, well-tested code, driving engineering best practices in code quality, security, observability, and performance across the team.
  3. Take end-to-end ownership of complex features or services, from technical design through implementation, deployment, monitoring, and iterative improvement in production.
  4. Collaborate closely with product, design, and cross-functional engineering teams to break down ambiguous business problems into clear technical solutions that can be reused across multiple domains.
  5. Safely integrate and operate AI/ML‑enabled solutions that improve outcomes, including familiarity with AI-driven systems, tools, or workflows and applying AI/ML concepts to real world products.

Skills

Required

  • Java
  • Kotlin
  • GraphQL
  • grpc
  • Spring boot
  • distributed systems
  • RESTful APIs
  • microservices
  • cloud platforms (AWS, GCP, or Azure)
  • containerization technologies (Docker, Kubernetes)
  • CI/CD pipelines
  • infrastructure code
  • DevOps practices
  • systems that integrate with multiple services or domains
  • reliability
  • scalability
  • observability

Nice to have

  • service or multi-service architectures at scale
  • resiliency
  • fault tolerance
  • performance optimization
  • leading technical design efforts
  • influencing architectural decisions
  • driving improvements to engineering practices
  • operational excellence
  • data-driven decision making
  • defining metrics
  • logging/monitoring/analytics
  • improving system health
  • user outcomes
  • AI-driven systems, tools, or workflows
  • integrating and operating AI/ML‑enabled solutions
  • technical domains (such as distributed systems, data-intensive services, or full stack web applications)

What the JD emphasized

  • Safely integrate and operate AI/ML‑enabled solutions that improve outcomes
  • applying AI/ML concepts to real world products
  • Safely integrating and operating AI/ML‑enabled solutions that enhance product capabilities or developer productivity
  • Safely integrating and operating AI/ML‑enabled solutions that improve outcomes