Senior, Software Engineer

Walmart · Retail · Bangalore, KA, India

Senior Software Engineer (AI/ML) to design and implement AI/ML-driven solutions for intelligent observability, real-time monitoring, and autonomous remediation across large-scale systems. The role involves building and optimizing real-time inference APIs, developing data pipelines, automating MLOps, and working with Golang, distributed systems, and cloud-native technologies.

What you'd actually do

  1. Design, develop, and deploy scalable AI/ML models for anomaly detection, forecasting, and root-cause analysis.
  2. Build and optimize real-time inference APIs and services integrating ML pipelines into production.
  3. Develop data pipelines for large-scale telemetry, logs, metrics, and traces using event-driven architectures.
  4. Automate model training, evaluation, and deployment pipelines (MLOps).
  5. Continuously monitor model performance and optimize for accuracy, latency, and cost.

Skills

Required

  • software engineering
  • distributed systems
  • cloud-native technologies
  • Golang
  • Python
  • TensorFlow
  • PyTorch
  • Scikit-learn
  • time-series modeling
  • anomaly detection
  • forecasting
  • LLMs
  • RAG pipelines
  • agentic workflows
  • Kubeflow
  • MLflow
  • Vertex AI
  • SageMaker
  • Kafka
  • Pub/Sub
  • SQL
  • NoSQL
  • RESTful APIs
  • gRPC APIs
  • microservices
  • testing
  • CI/CD pipelines
  • Prometheus
  • Grafana
  • OpenTelemetry
  • distributed consensus
  • event sourcing

Nice to have

  • willingness to learn Go
  • LangChain
  • Haystack
  • Semantic Kernel

What the JD emphasized

  • AI/ML engineering
  • deploy ML models end-to-end
  • Golang
  • AI/ML models
  • real-time inference APIs
  • model training, evaluation, and deployment pipelines
  • model performance
  • AI-powered automation and observability workflows
  • distributed and fault-tolerant systems
  • multi-cloud applications
  • CI/CD
  • observability
  • real-time observability
  • AIOps
  • incident management platforms

Other signals

  • AI/ML-driven solutions
  • intelligent observability
  • real-time monitoring
  • autonomous remediation
  • AI agents
  • self-healing systems