Software Engineer, Account Abuse

Anthropic Anthropic · AI Frontier · San Francisco, CA · Safeguards (Trust & Safety)

Software Engineer on the Account Abuse team responsible for building systems that gather and analyze signals to prevent bad actors from abusing Anthropic's computing capacity. This involves integrating with third-party vendors, creating monitoring dashboards, and working with data scientists and policy teams to identify abuse patterns and build multi-layered defenses.

What you'd actually do

  1. Jumping into other teams’ code to identify key points to gather signals or introduce interventions with minimal impact on their systems’ stability, complexity, or overall architecture
  2. Integration with third-party data-enrichment vendors
  3. Creating monitoring dashboards, alerts, and internal admin UX
  4. Working closely with our data scientists to maintain situational awareness of our current usage patterns and trends, and with our Policy & Enforcement team to maximize the impact of their human-review availability
  5. Building robust and reliable multi-layered defenses

Skills

Required

  • Python
  • SQL
  • data analysis tools
  • communication skills
  • explain complex technical concepts to non-technical stakeholders

Nice to have

  • experience building trust and safety mechanisms for and using AI/ML systems
  • fraud-detection models
  • security monitoring tools
  • infrastructure to support these systems at scale
  • worked closely with operational teams to build custom internal tooling

What the JD emphasized

  • focus on integrity, spam, fraud, or abuse detection
  • building trust and safety mechanisms for and using AI/ML systems, such as fraud-detection models or security monitoring tools or the infrastructure to support these systems at scale