Principal/senior Software Engineer, Experimentation Platform - Coreai

Microsoft Microsoft · Big Tech · Redmond, WA +2 · Software Engineering

This role is for a Principal/Senior Software Engineer on the Experimentation Platform team within CoreAI at Microsoft. The platform enables high-scale online experimentation to accelerate product learning and drive progress across Microsoft's AI ecosystem. The engineer will design and build services for rapid iteration in AI systems and product features, empowering others to measure impact and validate hypotheses. The role involves developing critical components of a large-scale experimentation platform, focusing on distributed systems, service reliability, and experimentation methodologies.

What you'd actually do

  1. Independently leverage AI tools and practices across the software development lifecycle (SDLC), taking responsibility for AI-generated assets and coaching team members to adopt responsible AI-assisted development practices.
  2. Lead by example to produce extensible, maintainable, well-tested, secure, and performant code; apply metrics to drive code quality and stability, and continuously improve code performance, testability, and cost-effectiveness across the team.
  3. Own and drive the architecture and design of product components, creating design specifications, and ensuring system architecture meets performance, scalability, resiliency, and disaster recovery requirements with minimal technical oversight.
  4. Collaborate with partner teams, PMs, and subject matter experts (privacy, security, SRE) to determine customer requirements, incorporate feedback, and deliver scalable, reliable features with proper end-to-end testing.
  5. Drive engineering excellence through automation, tooling improvements, security best practices, and deployment infrastructure.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Nice to have

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Experience building and operating large-scale distributed systems on cloud platforms (Azure, AWS, GCP), including design, deployment, monitoring, and troubleshooting of production workloads.
  • Experience designing and implementing service architectures with strong emphasis on scalability, reliability, fault tolerance, and cost optimization.
  • Experience using observability tools (logging, metrics, distributed tracing) to diagnose complex service issues and drive systemic reliability improvements.
  • Experience mentoring engineers, driving code reviews, and raising engineering best practices within a team.
  • Familiarity with experimentation platforms, A/B testing methodologies, and statistical analysis of product metrics.
  • Experience with AI-assisted development workflows and responsible use of AI coding tools in production environments.

What the JD emphasized

  • AI tools and practices
  • AI-generated assets
  • responsible AI-assisted development practices
  • AI systems
  • AI capabilities