Principal Software Engineer

Microsoft Microsoft · Big Tech · Redmond, WA +1 · Software Engineering

This role is for a Principal Software Engineer on the Web Data Platform Team, focusing on scaling the global web scale Index and its supporting platform. The primary responsibility is to enhance the Web Crawler team's platform, which handles tens of billions of URLs daily and is crucial for Search and Recommendation scenarios. The role involves containerizing the service, building the next version of the crawler for increased throughput and global deployment, while intelligently minimizing load on websites. The system must balance discovering new web pages with maintaining document freshness and avoiding excessive crawl requests. It's a distributed platform that uses ML models for resource efficiency, and web-scale data is highlighted as critical for both training and serving use cases with LLMs.

What you'd actually do

  1. Collaborate with and guide stakeholders to confirm requirements for products or solutions.
  2. Lead architectural discussions and own the architecture.
  3. Create and test architectural proposals and develop detailed design specifications.
  4. Develop a clear testing strategy to ensure quality and prevent regression in the product.
  5. Identify dependencies and incorporate them into design documentation.

Skills

Required

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • equivalent experience

Nice to have

  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • equivalent experience

What the JD emphasized

  • hyper-scale distributed systems
  • scale our platform
  • increase the throughput
  • deployed in every region of the world
  • fine balance
  • ML models aiding effective resource usage
  • web scale data has become critical for training needs, in addition to serving use cases