Senior Software Engineer, Agents

Harvey Harvey · AI Frontier · New York, NY · Engineering

This role focuses on building and optimizing agentic AI systems for legal professionals. The engineer will design environments, actions, and tools for these agents, manage context windows, make model selection decisions, and develop evaluations to improve iteration speed and task completion quality. The role involves optimizing agent performance through various techniques and working with infrastructure teams for low-latency execution and improved observability. Experience with LLM APIs, agent frameworks, and shipping user-facing products is required.

What you'd actually do

  1. Partner with customers and PMs to understand legal workflows, design practical evaluations that capture what “excellent” means, and ship agents that get the job done.
  2. Optimize agent performance through prompt engineering, model selection, tool design, skill writing, context window management, and eval harness development.
  3. Work with our model infra team to design and implement infrastructure for low-latency agent execution, including caching strategies, parallel tool calls, or subagent patterns.
  4. Improve our observability and instrumentation to profile agent behavior, identify bottlenecks, and drive optimization decisions.
  5. Stay current on new developments in agentic systems and bring those learnings back to the products we build.

Skills

Required

  • Python
  • LLM APIs
  • agent frameworks
  • shipping user-facing products
  • software engineering experience

Nice to have

  • domain-specific agents
  • iterative mindset
  • evaluations to drive quality
  • adaptable about code and frameworks
  • new best practices

What the JD emphasized

  • build the systems that make our AI agents indispensable
  • design environments and actions for agentic professional work
  • make model selection decisions
  • create optimal tools
  • develop evals that enable faster iteration loops
  • shipping impactful products
  • practical evaluations to drive task completion quality
  • customer delight
  • shipping user-facing products

Other signals

  • building agentic systems
  • shipping impactful products
  • practical evaluations to drive task completion quality
  • customer delight