Senior Software Engineer, Coreai

Microsoft Microsoft · Big Tech · Redmond, WA +3 · Software Engineering

Senior Software Engineer on the FIT training team within Microsoft's CoreAI organization, focused on building and optimizing AI infrastructure for agentic AI systems. The role involves developing scalable infrastructure for training LLMs, SLMs, and agentic models to achieve frontier-level performance, contributing to both proprietary and open-source frameworks for enterprise-grade agentic workflows.

What you'd actually do

  1. Collaboration with engineers and researchers to build and optimize training infrastructure and tools for LLMs, SLMs, multimodal, and code-specific models.
  2. Design, build and improve services with high scalability and reliability.
  3. Design and implement the services to serve the prod traffic and fulfill the security and privacy requirements.
  4. Participate in efforts to deliver and improve engineering systems and practices to ensure service quality in complex cloud environments.
  5. Contribute to the deployment and monitoring of services in production environments.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field
  • 4+ years technical engineering experience
  • coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Python, or equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check

Nice to have

  • Experience working with engineering teams to deliver large-scale software systems, preferably in AI, machine learning, graphics or related fields.
  • Thrive in a fast-paced, collaborative environment and are comfortable making progress in ambiguity.
  • Enjoy working closely with cross-functional partners and teammates in an inclusive, curious culture.
  • Have strong opinions about best investments to make in establishing the most delightful and performant AI companion engineering system.

What the JD emphasized

  • track record of continuous improvement
  • scalable infrastructure
  • enterprise-grade agentic workflows
  • agentic AI systems
  • frontier-level performance
  • LLMs, SLMs, and agentic models

Other signals

  • training infrastructure
  • agentic AI systems
  • frontier-level performance
  • LLMs, SLMs, and agentic models