We are conducting research on memory-related deep learning approaches that can extend the knowledge base of neural networks for personalization, lifelong learning, and handling large contexts. The team covers a broad range of topics, including LLMs, test-time training, memory controllers, and indexing structures.
Responsibilities
- Conduct research to advance the state of the art in architecture- and memory-related topics
- Consistently and sustainably advance the state of the art for your problem, including setting and executing against roadmaps over 6-month-plus timeframes
- Collaborate with cross-functional research and product teams across the globe
Qualifications
- PhD in Computer Science or a related field, with published work in machine learning, deep learning, robotics, large language models, and/or computer vision
- Proven deep learning development skills with PyTorch or TensorFlow
- Experience developing LLM algorithms or infrastructure in Python or C/C++
- Significant contributions to impactful work, such as open-source models
- Publications at peer-reviewed conferences, e.g., ICLR, ICML, NeurIPS