Senior Software Engineer

Microsoft · Big Tech · Redmond, WA +1 · Software Engineering

Senior Software Engineer role focused on building AI-powered tooling and test infrastructure for Azure Data integration products. The role involves using LLMs for test suite generation, identifying coverage gaps, detecting flaky tests, and prioritizing regression risks. It also includes building agents for automated diagnostics, log analysis, incident triage, and developing proactive observability with intelligent alerting. The engineer will also create AI-powered diagnostic tools for support and collaborate on automated incident response.

What you'd actually do

  1. Build intelligent test infrastructure that generates, maintains, and evolves test suites as the product changes. Use LLMs to identify coverage gaps, detect flaky tests, and prioritize regression risk, reducing manual test authoring while expanding coverage across pipeline execution, data movement, and copy job scenarios.
  2. Build agents and copilot experiences that serve as the first line of diagnostics, automating log analysis, root cause suggestion, and incident triage. Replace repetitive manual workflows with intelligent automation using Python, PowerShell, and KQL, and continuously evaluate new capabilities to raise team productivity.
  3. Move beyond static dashboards to proactive observability: Power BI and Azure Data Explorer dashboards that surface anomalies automatically, generate incident summaries, and highlight trends before they become outages. Define and instrument SLIs/SLOs with intelligent alerting.
  4. Partner with the support organization to build AI-powered diagnostic tools that accelerate case resolution, shifting from static troubleshooting guides toward copilots that walk support engineers through diagnosis and auto-suggest solutions. Analyze incident patterns to drive bug prioritization and close feedback loops with engineering.
  5. Collaborate with engineering owners to build increasingly automated incident response: from assisted detection and triage today toward auto-mitigation over time. Participate in on-call duties; drive root cause analysis, postmortems, and repair item follow-through with a focus on reducing recurrence through automation.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience

Nice to have

  • Master’s degree in Computer Science (or related field) with 6+ years of experience, or Bachelor’s degree with 8+ years of relevant experience.
  • Experience building test automation frameworks and implementing CI/CD quality gates.
  • Experience in cloud service operations, incident management, and live site engineering.
  • Experience with AI/LLM tools, Copilot-style assistants, prompt engineering, or agent-based automation.
  • Experience with telemetry, dashboards, and data analysis using Azure Data Explorer (KQL) and Power BI.
  • Experience with data integration platforms such as Azure Data Factory, Fabric Pipelines, or Spark.

What the JD emphasized

  • AI-powered tooling
  • LLMs
  • agents
  • copilot experiences
  • automating log analysis
  • incident triage
  • AI-powered diagnostic tools
  • auto-mitigation

Other signals

  • AI-powered tooling
  • LLMs to identify coverage gaps
  • Build agents and copilot experiences
  • automating log analysis
  • incident triage
  • AI-powered diagnostic tools
  • accelerate case resolution
  • auto-mitigation