GenAI Data Scientist

This role focuses on developing and architecting Generative AI solutions, evaluating and selecting AI tools, building and training models using Python and open-source technologies, and designing generative models with various architectures. The role also involves training users on AI principles and model interaction, and building ML-enabled features for client business processes.

What you'd actually do

  1. Work across client teams to develop and architect Generative AI solutions using ML and GenAI
  2. Develop and promote standards across the community
  3. Evaluate and select appropriate AI tools and machine learning models for tasks, and build and train working versions of those models using Python and other open-source technologies
  4. Work with leadership and stakeholders to identify AI opportunities and promote strategy
  5. Develop and deliver training for users across the Government & Public Services landscape on the principles used to develop models and on how to interact with models to support their business processes

Skills

Required

  • Python
  • R
  • TensorFlow
  • PyTorch
  • Keras
  • Natural Language Processing (NLP)
  • Large Language Models (LLMs)
  • API solutions
  • data wrangling/cleansing
  • statistical modeling
  • programming
  • Agile development environment
  • machine learning algorithms
  • supervised learning
  • unsupervised learning
  • deep learning architectures
  • convolutional neural networks (CNNs)
  • recurrent neural networks (RNNs)
  • GenAI
  • OpenAI
  • Claude
  • Gemini
  • LangChain
  • Agents
  • Vector databases
  • Prompt Engineering
  • fine-tuning

Nice to have

  • text generation
  • image creation
  • data augmentation
  • language modeling concepts
  • AWS
  • Google Cloud
  • Azure
  • text pre-processing
  • tokenization
  • sentiment analysis
  • AI protocols and standards

What the JD emphasized

  • 6+ years of experience programming in Python or R with libraries like TensorFlow, PyTorch, or Keras
  • 5+ years of experience with Natural Language Processing (NLP) and Large Language Models (LLMs)
  • 5+ years of experience building and maintaining scalable API solutions
  • 5+ years of experience in data wrangling/cleansing, statistical modeling, and programming
  • 3+ years of experience with AI/ML, with the last 2 years focused on GenAI, including technologies like OpenAI, Claude, Gemini, LangChain, Agents, and Vector databases, and approaches like Prompt Engineering, fine-tuning, etc.

Other signals

  • Develop and architect Generative AI solutions using ML and GenAI
  • Evaluate and select appropriate AI tools and machine learning models for tasks, and build and train working versions of those models
  • Design and build generative models, selecting the most suitable architecture (e.g., GANs, VAEs) based on the desired output (text, images, code)