Research Intern - AI Frontiers - Reasoning & Agentic Models

Microsoft · Big Tech · Redmond, WA +1 · Applied Sciences

Research intern role focused on advancing agentic model capabilities, including reasoning, tool use, and end-to-end workflow execution across text and visual environments. The role involves research in reinforcement learning, novel training algorithms, synthetic environment creation, multi-agent training, and scaling laws, with a focus on foundational models and various data modalities. The internship aims to contribute to cutting-edge research and potentially ship AI technologies in products.

What you'd actually do

  1. advance the state of the art in agentic model capabilities -- creating models and agents that can reliably perform tasks across digital systems on behalf of humans, combining automation, reasoning, and interaction capabilities to execute workflows end-to-end, leveraging both text-based environments (CLI tools, APIs, scripts, MCPs) and visual environments (GUI applications).
  2. conduct cutting-edge research in AI and publish findings in top-tier venues such as NeurIPS, ICLR, ICML, and others.
  3. release open-source models and libraries (e.g. Phi-4, Phi-4-reasoning, AgentInstruct, AutoGen, MagenticOne) while also working within Microsoft's ecosystem to ship our AI technologies in multiple products.
  4. bring a demonstrated ability for technical work and a proven record of influential publications on Artificial Intelligence.
  5. collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community.

Skills

Required

  • enrollment in a PhD program in Computer Science or a related STEM field
  • deep learning
  • large language model training

Nice to have

  • language model pre-training
  • post-training
  • reinforcement learning
  • original research
  • hands-on research
  • collaborative and dynamic environment

What the JD emphasized

  • proven record of influential publications on Artificial Intelligence
  • proven publication record in top-tier conferences

Other signals

  • agentic model capabilities
  • reasoning
  • tool use
  • multi-agent
  • workflows
  • end-to-end execution
  • text-based environments
  • visual environments