Software Development Engineer II, Ads AI Core Infrastructure

Amazon · Big Tech · IN, KA, Bengaluru · Software Development

Software Development Engineer II role focused on building AI agent infrastructure for Amazon Advertising. The role involves designing and implementing scalable real-time data ingestion and processing pipelines to provide advertiser context to AI agents with sub-second latency. Key responsibilities include architecting data services, optimizing performance, and leveraging AI coding agents for development. The role operates at the intersection of real-time data engineering and AI agent infrastructure, aiming to deliver immediate, strategic advice to advertisers.

What you'd actually do

  1. Design and implement scalable architectures for real-time data ingestion from our data warehouse and Kafka streams, processing billions of data points daily
  2. Build Model Context Protocol (MCP) server infrastructure that delivers advertiser context with sub-second latency and minimal token consumption; MCP is an emerging standard for AI agent-data interaction
  3. Develop high-throughput data ingestion systems handling both batch and streaming data sources with 1-3 minute refresh cadences
  4. Optimize system performance to achieve near-perfect response success rates and significant token reduction versus traditional approaches
  5. Use AI coding agents like Kiro to generate technical specifications and implementation code, accelerating development from weeks to days

Skills

Required

  • Scalable system design
  • Real-time data processing
  • High-throughput data ingestion
  • Distributed systems
  • Performance optimization
  • Low-latency systems
  • Data modeling
  • Software development best practices
  • Mentoring junior engineers

Nice to have

  • Experience with AI coding agents
  • Familiarity with Model Context Protocol (MCP)
  • Experience with RAG-based embeddings
  • Knowledge of CodeAct patterns

What the JD emphasized

  • sub-second latency
  • minimal token consumption
  • real-time data ingestion
  • billions of data points
  • 1-3 minute refresh cadences
  • 30+ advertising agents and skills
  • 99.9%+ availability requirements

Other signals

  • Generative AI
  • Agentic Systems
  • Real-time Data Engineering
  • LLM Context Optimization
  • Low Latency Inference