AI Applied Scientist - Phd Intern, Evaluation Systems and Metrics

at Zillow · Consumer · United States · Remote

Zillow is seeking a remote PhD intern to develop cutting-edge evaluation methodologies for AI systems, focusing on creating robust, scalable metrics and frameworks for generative models across multiple modalities. The role involves research into novel metrics, self-improving assessment systems, privacy-preserving evaluation, and ethical fair housing evaluation for agentic systems.

What you'd actually do

  1. develop cutting-edge evaluation methodologies for AI systems
  2. creating robust, scalable metrics and frameworks to assess the quality, consistency, and performance of generative models across multiple modalities
  3. Novel Evaluation Metrics: Develop innovative assessment methodologies for emerging AI capabilities, focusing on consistency and quality across complex multi-modal outputs
  4. Self-Improving Assessment: Design evaluation systems that learn and adapt from feedback, automatically discovering new evaluation criteria and improving assessment quality over time
  5. Privacy-Preserving Evaluation: Design frameworks that incorporate domain-specific implementations of differential privacy to protect sensitive user information while maintaining utility for model training and assessment.
  6. Ethical Fair Housing Evaluation: Develop scalable methodologies for assessing agentic systems, ensuring compliance with fair housing standards and promoting ethical, responsible AI deployment

Skills

Required

  • PhD student in computer science, machine learning, computer vision, or a related field
  • Evaluation methodologies for AI/ML systems
  • Computer vision metrics and 3D consistency assessment
  • Generative model evaluation (text, image, video, 3D)
  • Multi-modal assessment and automated feedback systems
  • Knowledge of data privacy methods (e.g., differential privacy, federated learning, secure ML) and their application.
  • Single agent or multi-agent system evaluations
  • modern deep learning frameworks (e.g., PyTorch, Hugging Face Transformers)
  • Strong research mindset
  • motivation to publish

Nice to have

  • A record of publication in conferences, workshops, or journals is a plus

What the JD emphasized

  • strong publication record
  • publication track record

Other signals

  • developing evaluation methodologies
  • designing evaluation frameworks
  • assessing AI capabilities
  • scalable metrics
Read full job description

About the team

Are you passionate about building rigorous evaluation frameworks that advance AI systems? The Zillow AI Applied Science team develops next-generation evaluation methodologies for generative AI, computer vision, and agentic systems. We work at the intersection of research and production, designing evaluation frameworks that assess current AI capabilities and adapt as technology advances.

About the role

We are seeking remote PhD interns for Summer 2026!

As an intern, you will help develop cutting-edge evaluation methodologies for AI systems. Your research will focus on creating robust, scalable metrics and frameworks to assess the quality, consistency, and performance of generative models across multiple modalities. You may contribute in one or more of the following areas:

  • Novel Evaluation Metrics: Develop innovative assessment methodologies for emerging AI capabilities, focusing on consistency and quality across complex multi-modal outputs
  • Self-Improving Assessment: Design evaluation systems that learn and adapt from feedback, automatically discovering new evaluation criteria and improving assessment quality over time
  • Privacy-Preserving Evaluation: Design frameworks that incorporate domain-specific implementations of differential privacy to protect sensitive user information while maintaining utility for model training and assessment.
  • Ethical Fair Housing Evaluation: Develop scalable methodologies for assessing agentic systems, ensuring compliance with fair housing standards and promoting ethical, responsible AI deployment

This role has been categorized as a Remote position. “Remote” employees do not have a permanent corporate office workplace and, instead, work from a physical location of their choice, which must be identified to the Company. U.S. employees may live in any of the 50 United States, with limited exceptions.

In California, Connecticut, Maryland, Massachusetts, New Jersey, New York, Washington state, and Washington DC the standard base pay range for this role is $104,000.00 - $166,000.00 annually. This base pay range is specific to these locations and may not be applicable to other locations. In Colorado, Hawaii, Illinois, Minnesota, Nevada, Ohio, Rhode Island, and Vermont the standard base pay range for this role is $104,000.00 - $166,000.00 annually. The base pay range is specific to these locations and may not be applicable to other locations.

Who you are

  • Currently enrolled as a PhD student in computer science, machine learning, computer vision, or a related field, with strong publication record

  • Candidates should have a background in one or more of the following areas:

    • Evaluation methodologies for AI/ML systems
    • Computer vision metrics and 3D consistency assessment
    • Generative model evaluation (text, image, video, 3D)
    • Multi-modal assessment and automated feedback systems
    • Knowledge of data privacy methods (e.g., differential privacy, federated learning, secure ML) and their application.
    • Single agent or multi-agent system evaluations
  • Familiarity with modern deep learning frameworks (e.g., PyTorch, Hugging Face Transformers)

  • Strong research mindset, with motivation to publish

  • Interest in applying AI to complex, multi-stakeholder domains

  • A record of publication in conferences, workshops, or journals is a plus

​Here at Zillow - we value the experience and perspective of candidates with non-traditional backgrounds. We encourage you to apply if you have transferable skills or related experiences.

Get to know us

At Zillow, we’re reimagining how people move—through the real estate market and through their careers. As the most-visited real estate platform in the U.S., we help customers navigate buying, selling, financing and renting with greater ease and confidence. Whether you're working in tech, sales, operations, or design, you’ll be part of a company that's reshaping an industry and helping more people make home a reality.

Zillow is honored to be recognized among the best workplaces in the country. Zillow was named one of FORTUNE 100 Best Companies to Work For® in 2025, and included on the PEOPLE Companies That Care® 2025list, reflecting our commitment to creating an innovative, inclusive, and engaging culture where employees are empowered to grow.

No matter where you sit in the organization, your work will help drive innovation, support our customers, and move the industry—and your career—forward, together.

Zillow Group is an equal opportunity employer committed to fostering an inclusive, innovative environment with the best employees. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. If you have a disability or special need that requires accommodation, please contact your recruiter directly.

Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable state and local law.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.