Data First Jobs

OP

AI Safety and Risk Analyst

Contract · In Office · Austin, Texas (USA)

$40,000–$45,000 · Posted Jun 8, 2026

Work Options
Job Type
Position Group

We are on the hunt for a dynamic, resilient, and highly motivated AI Safety Risk Analyst - Red Teaming to be a key player at the cutting edge of AI safety. In this pivotal role, you will lead efforts to identify and mitigate risks in our advanced language models, shaping the responsible development of tomorrow s AI. Unlike automated testing that efficiently covers known vulnerabilities, this role challenges you to uncover the 40 percent of risks that demand human ingenuity, psychological insight, and creative out-of-the-ordinary thinking. You will work across diverse modalities, text, image, audio, and video, to expose weaknesses, evaluate real-world harm, and stress-test our systems against emerging adversarial threats.

This is high-impact, frontier work where your insights directly influence how AI interacts with the world. We value diverse perspectives and encourage candidates from non-traditional backgrounds, whether from the arts, humanities, mental health, or education, to bring their unique skills to this critical mission.

Responsibilities

  • Creative Adversarial Testing.
  • Design and execute novel, multi-turn adversarial attacks, including emotional manipulation, roleplay, social engineering, and authority exploitation, to bypass model safeguards and surface harmful capabilities.
  • Vulnerability Assessment.
  • Evaluate model outputs for actual harm and real-world risk, not just policy violations. Apply the user risk taxonomy to prioritize testing across user types, from casual users to agentic systems.
  • Agentic and Emerging Threat Testing.
  • Probe for agentic vulnerabilities such as privilege escalation, indirect prompt injection, and scope creep in multi-authority systems, the next frontier of AI risk.
  • Data Annotation and Reporting.
  • Generate high-quality human evaluation data by annotating model failures, classifying vulnerabilities, and producing reproducible adversarial test cases that engineering and safety teams can act upon.
  • Cross-Functional Collaboration.
  • Partner with AI researchers, engineers, and domain experts to translate findings into actionable improvements. Contribute to refining our red teaming taxonomy, benchmarks, and tooling infrastructure.
  • Continuous Learning.
  • Stay current with evolving adversarial techniques, internet subcultures, and AI safety research to continuously sharpen your attack strategies.
  • Minimum Qualifications
  • Creative and Psychological Insight: Background in creative writing, humanities, mental health counseling, psychology, or special education, with a demonstrated ability to construct compelling narratives, exploit linguistic nuance, and identify psychological vulnerabilities.
  • Adversarial Mindset: A natural inclination to think like an attacker and push systems to their limits. You should find genuine satisfaction in discovering unexpected failure modes.
  • Adaptability: Comfort switching between modalities (text, image, audio, video) and rapidly adjusting to new model behaviors, testing priorities, and task types.
  • Communication Skills: Excellent written and verbal communication skills, with the ability to document and explain complex vulnerabilities clearly to both technical and non-technical audiences.
  • Resilience and Balance: Capacity to sustain well-being while engaging in psychologically demanding work. This role involves regular exposure to graphic and objectionable content, including violence, exploitation, and self-harm scenarios. Comprehensive wellness support is provided.

Preferred Qualifications

  • Prior experience in professional red teaming, trust & safety, data annotation, or socio-technical risk analysis.
  • Familiarity with large language models (LLMs) and generative AI products such as ChatGPT, Claude, or Gemini.
  • Basic technical skills in prompt engineering, encoding techniques (e.g., Base64, ROT13), or scripting to complement creative attack vectors.
  • Knowledge of AI safety concepts, including RLHF, alignment, and model evaluation frameworks.

Benefits

  • 401(k).
  • Dental Insurance.
  • Health insurance.
  • Vision insurance.
  • We are an equal-opportunity employer and value diversity, equality, inclusion, and respect for people.
  • The salary will be determined based on several factors, including, but not limited to, location, relevant education, qualifications, experience, technical skills, and business needs.

Additional Responsibilities

  • Participate in OP monthly team meetings and participate in team-building efforts.
  • Contribute to OP technical discussions, peer reviews, etc.
  • Contribute content and collaborate via the OP-Wiki/Knowledge Base.
  • Provide status reports to OP Account Management as requested.

About Us

At OP, we help you harness the power of technology for maximum impact. A technology consulting and solutions company, we offer advisory and managed services, innovative platforms, and staffing solutions across a wide range of fields including AI, cyber security, enterprise architecture, and beyond. For nearly two decades, we ve been challenging the status quo of the consulting industry, serving up fresh, ingenious thinking through a radically lean structure. Together, this strategy delivers unprecedented performance at an unparalleled pace for faster results that propel your business forward.

Mention you found this on Data First Jobs — it helps us bring you more roles like this.

AI Safety and Risk Analyst

OP

Like this role? Get carefully selected jobs like it, twice a week, straight to your inbox.

Free, no spam. Unsubscribe anytime.