Weekday (YC W21)

We are a Y-Combinator-backed startup building your AI-powered Recruiter Agent

AI Red-Teamer – Adversarial AI Testing, English

Artificial IntelligenceArtificial IntelligencePart TimeRemoteTeam 11-50Since 2021H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

5 days ago

Salary

$50 - $111 / hour

EnglishCyber Security

Job Description

• Red-team AI models and agents by testing jailbreak attempts, prompt injections, misuse scenarios, and exploit strategies • Generate high-quality human evaluation data by annotating model failures, classifying vulnerabilities, and identifying systemic risks • Apply structured testing methodologies using taxonomies, benchmarks, and playbooks to ensure consistent evaluation • Document findings clearly and reproducibly, producing reports, datasets, and adversarial test cases that teams can act upon • Work across multiple projects, supporting different AI systems and evaluation objectives

Job Requirements

  • You have **prior red-teaming experience**, such as adversarial AI testing, cybersecurity, or socio-technical risk analysis
  • You naturally think **adversarially**, exploring ways to push systems to their limits and uncover weaknesses
  • You prefer **structured methodologies**, using frameworks and benchmarks rather than ad-hoc testing
  • You communicate risks and vulnerabilities **clearly to both technical and non-technical audiences**
  • You are comfortable **working across multiple projects and adapting to new evaluation challenges**
  • Nice-to-Have Specialties
  • Adversarial Machine Learning:** jailbreak datasets, prompt injection attacks, RLHF/DPO vulnerabilities, or model extraction techniques
  • Cybersecurity:** penetration testing, exploit development, reverse engineering
  • Socio-technical risk analysis:** harassment or misinformation testing, abuse pattern analysis
  • Creative adversarial thinking:** backgrounds in psychology, acting, writing, or other disciplines that support unconventional attack strategies

Related Job Pages

More Artificial Intelligence Jobs

VP of Delivery, AI Governance

iTmethods

iTmethods is a 20+ year enterprise technology company transforming into an AI-native platform company. We currently serve over 100 enterprise customers across multiple regulated industries. We are building an AI Governance product needed by enterprise companies in regulated industries such as financial services, defense, pharma, healthcare, and semiconductors. Reign - Enterprise AI Governance Platform. Real-time policy enforcement for every LLM call and AI agent interaction. Trust, compliance, and data sovereignty. Forge - DevOps managed services platform for regulated enterprises. BioCompute - Life Sciences AI platform for pharma and biotech. The EU AI Act high-risk obligations take effect in August, 2026. Enterprises need production-grade AI governance now. Reign delivers true runtime policy enforcement and AI governance.

Artificial Intelligence5 days ago
Full TimeRemote

We are looking for an experienced professional to lead customer delivery of AI Governance solutions to enterprise clients in multiple regulated industries. We are building an outcome-based delivery model. This is not a traditional professional services leadership role. The person...

United States
ContractRemote

The ACCESS Teacher is responsible for providing web-based instruction to high school students across the State of Alabama, which includes maintaining regular contact with facilitators and students regarding progress. Duties involve logging into the learning management system daily, responding to emails, grading assignments, monitoring participation, and posting timely announcements.

United States

Digital Growth & AI Strategy Manager

MariaDB

Set sail for a better database. Learn why 75% of Fortune 500 companies run MariaDB.

Artificial Intelligence5 days ago
Full TimeRemoteTeam 201-500Since 2009H1B No Sponsor

Marketing Manager driving AI strategies for MariaDB

PythonSQL
California
$130K - $150K / year
Full TimeRemoteTeam 10,001+Since 1892H1B Sponsor

AI Program Manager for People & Culture team transforming P&C processes with AI

PMP
United States
$136.4K - $204.6K / year