Tetrad Digital Integrity logo
Tetrad Digital Integrity

Tetrad Digital Integrity (TDI) is a cybersecurity firm built for high-consequence environments where mission, complexity, and trust intersect. Our single focus has been delivering cyber solutions to effectively manage risk & the business of cyber for 25 years! TDI does business with the federal government, which restricts employment to individuals who are either US citizens or lawful permanent residents of the United States. TDI is an Equal Opportunity Employer. Employment decisions are made based on individual qualifications, merit, and business needs. We do not discriminate in employment opportunities or practices based on race, color, religion, sex, or national origin, in accordance with applicable federal laws.

LLM Security Evaluation Expert

Location

United States

Posted

31 days ago

Salary

Not specified

Seniority

Mid Level

Job Description

Role Description

We are seeking a highly skilled LLM Security Evaluation Expert to join our team. In this role, you will be responsible for rigorously testing the security and integrity of Large Language Models (LLMs). Your primary focus will be on designing and executing sophisticated adversarial prompt attacks to identify potential vulnerabilities, assess the model's resistance to exploitation, and ensure it maintains consistent, secure behavior. This is a critical role in safeguarding our AI systems and ensuring they operate responsibly.

Responsibilities

  • Adversarial Prompt Design & Execution:
    • Develop and implement a comprehensive suite of adversarial prompts, ranging from basic to more sophisticated, targeting known and potential LLM vulnerabilities.
    • Craft prompts specifically designed to:
      • Bypass security filters and content moderation policies.
      • Induce the LLM to reveal sensitive, confidential, or proprietary information.
      • Manipulate the LLM's output to generate harmful, biased, or unintended content.
      • Test for prompt injection, jailbreaking, and other emerging attack vectors.
  • Vulnerability Assessment & Analysis:
    • Systematically test LLMs against the designed adversarial prompts.
    • Analyze LLM responses to identify successful exploits, security weaknesses, and patterns of failure.

Qualifications

  • Strong knowledge of how LLMs work, including their architecture, training processes, capabilities, and inherent limitations.
  • Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common characteristics.
  • Proven experience in crafting and refining prompts to elicit specific behaviors or bypass restrictions in LLMs.
  • Demonstrable understanding of techniques like jailbreaking, prompt injection, role-playing attacks, and exploiting model biases.
  • Strong understanding of cybersecurity principles and common attack vectors, particularly as they apply to AI/ML systems.
  • Ability to think like an attacker and anticipate potential exploits.
  • Excellent ability to analyze complex systems, identify subtle vulnerabilities, and systematically test hypotheses.
  • Clear and concise written and verbal communication skills, with the ability to document technical findings thoroughly.
  • Understanding of the ethical implications of AI security and commitment to responsible testing practices.
  • Offensive Security Certified Professional (OSCP)
  • Certified Ethical Hacker (CEH)

Preferred Qualifications

  • Prior experience in AI red teaming, penetration testing of AI/ML systems, or a dedicated LLM security research role.
  • Familiarity with specific LLM security evaluation frameworks or benchmarks (e.g., those developed by NIST, Stanford HELM, or other research institutions).
  • Knowledge of common LLM fine-tuning and alignment techniques (e.g., RLHF) and how they might impact security.
  • Contributions to the AI security community (e.g., research papers, open-source tools, conference presentations).

Requirements

  • TDI does business with the federal government, which restricts employment to individuals who are either US citizens or lawful permanent residents of the United States.

Equal Opportunity Statement

TDI is an Equal Opportunity Employer. Employment decisions are made based on individual qualifications, merit, and business needs. We do not discriminate in employment opportunities or practices based on race, color, religion, sex, or national origin, in accordance with applicable federal laws.

Job Requirements

  • Strong knowledge of how LLMs work, including their architecture, training processes, capabilities, and inherent limitations.
  • Familiarity with prominent LLM families (e.g., GPT series, Claude, Llama, PaLM) and their common characteristics.
  • Proven experience in crafting and refining prompts to elicit specific behaviors or bypass restrictions in LLMs.
  • Demonstrable understanding of techniques like jailbreaking, prompt injection, role-playing attacks, and exploiting model biases.
  • Strong understanding of cybersecurity principles and common attack vectors, particularly as they apply to AI/ML systems.
  • Ability to think like an attacker and anticipate potential exploits.
  • Excellent ability to analyze complex systems, identify subtle vulnerabilities, and systematically test hypotheses.
  • Clear and concise written and verbal communication skills, with the ability to document technical findings thoroughly.
  • Understanding of the ethical implications of AI security and commitment to responsible testing practices.
  • Offensive Security Certified Professional (OSCP)
  • Certified Ethical Hacker (CEH)
  • Preferred Qualifications
  • Prior experience in AI red teaming, penetration testing of AI/ML systems, or a dedicated LLM security research role.
  • Familiarity with specific LLM security evaluation frameworks or benchmarks (e.g., those developed by NIST, Stanford HELM, or other research institutions).
  • Knowledge of common LLM fine-tuning and alignment techniques (e.g., RLHF) and how they might impact security.
  • Contributions to the AI security community (e.g., research papers, open-source tools, conference presentations).
  • TDI does business with the federal government, which restricts employment to individuals who are either US citizens or lawful permanent residents of the United States.
  • Equal Opportunity Statement
  • TDI is an Equal Opportunity Employer. Employment decisions are made based on individual qualifications, merit, and business needs. We do not discriminate in employment opportunities or practices based on race, color, religion, sex, or national origin, in accordance with applicable federal laws.

Related Categories

Related Job Pages

More Security Engineer Jobs

OtherRemoteTeam 1-10Since 1934H1B No Sponsor

Senior Product Security Engineer focusing on AI security at Universal Music Group

Alabama + 3 moreAll locations: Alabama, Arizona, California, Colorado
$161.4K - $199.8K / year
OtherRemoteTeam 1-10Since 1934H1B No Sponsor

Senior AI Security Engineer leading enterprise AI Security program

Alabama + 3 moreAll locations: Alabama, Arizona, California, Colorado
$162.4K - $199.9K / year
Businessolver logo

Information Security Engineer

Businessolver

Benefits Technology, Powered by People

OtherRemoteTeam 1,001-5,000Since 1998H1B Sponsor

Information Security Engineer ensuring data security at Businessolver

United States
$79K - $123K / year
Anomali logo

Senior Account Executive, SIEM, Security Analytics

Anomali

Intelligence-Driven Extended Detection and Response (XDR)

OtherRemoteTeam 201-500Since 2013H1B Sponsor

Senior Account Executive focusing on field sales and account development

Florida