Research Software Engineer - Multiple Levels- AI Frontiers

Microsoft Microsoft · Big Tech · Redmond, WA +1 · Software Engineering

Research Software Engineer focused on Generative AI and agentic applications within Microsoft Research's AI Frontiers lab. The role involves developing, improving, and exploring capabilities of AI models and agentic systems, with a focus on creating reliable digital workers that can execute entire workflows and collaborate with humans and other agents. Key areas include enhancing agent reasoning, improving robustness, reimagining workflows for an agent-native world, and creating end-to-end experiences.

What you'd actually do

  1. As Research Software Engineer in AI Frontiers, you will contribute to one of our core pillars working with researchers, engineers, and product groups to create research breakthroughs and realize your ideas in products and services used worldwide.
  2. You will create data, evaluation and training pipelines for training agentic models to reliably use computers, tools and complete tasks collaborating with other agents and humans
  3. You’ll showcase your system design thinking, understanding of both user-facing technologies and GenAI technologies to drive efficient, high-quality execution and communication, while launching innovative AI features.
  4. You will work with our Technical Program Management (PM) team to understand customer needs and our research portfolio to create compelling proof of concepts so customers can take advantage of our innovations

Skills

Required

  • Bachelor's Degree in Computer Science, Software Engineering, System Engineering or related field
  • 2+ years’ technical engineering experience with coding in languages including, but not limited to, Python
  • coding in languages including, but not limited to, Python

Nice to have

  • Master's Degree in Computer Science, Software Engineering, System Engineering, Human Computer Interaction, or related field
  • 3+ years experience working in a software engineering team and/or shipping production code
  • 4+ years of python experience
  • 2+ years’ experience working on GenAI infrastructure, products or prototypes
  • 2+ years architecting and implementing GenAI systems and/or the infrastructure for building these systems
  • Experience in developing and maintaining open-source projects
  • Experience evaluating AI models and systems including creating benchmark datasets, evaluation pipelines, running and debugging experiments, and tracking improvements
  • Experience deploying or managing applications or infrastructure pipelines on a major cloud platform (e.g., Microsoft Azure, AWS, or Google Cloud Platform)
  • Ability to collaborate, communicate effectively, and work as part of a multi-disciplinary team

What the JD emphasized

  • agent-native organizations
  • autonomous AI agents
  • agentic reasoning models
  • agent frameworks
  • agentic ecosystems
  • GenAI infrastructure
  • GenAI systems
  • architecting and implementing GenAI systems
  • evaluating AI models and systems

Other signals

  • autonomous AI agents
  • agentic reasoning models
  • agent frameworks
  • agentic ecosystems
  • GenAI infrastructure
  • GenAI systems