Red Teaming Expert

Salary :

$30 - 40 hourly

Company : Mpathic

Job Type : Contract

Remote / Work from Home

Number of Applicants

000+

Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications

Activate JobCopilot

Job Description - Red Teaming Expert

About mpathic.ai

Keeping the human in AI. mpathic is a trusted leader in advancing quality and safety in AI systems through expert-led evaluation and human data. We partner with leading technology companies to support red teaming, trust & safety, expert annotation, and model evaluation across high-stakes domains.

About the Role

mpathic is seeking part-time, project-based Red Teaming Experts to support a red-teaming and evaluation campaign focused on AI safety and model behavior in sensitive, real-world interactions.

In this role, you will design, simulate, and evaluate conversations with AI systems to assess safety, risk, and behavioral performance. You will identify failure modes, edge cases, and policy gaps—particularly in scenarios involving distress, ambiguity, or escalation.

This role involves roleplaying and reviewing clinical scenarios with AI agents. As such, we are ideally seeking candidates who bring creative or performance-driven strengths, as these competencies enhance the realism, nuance, and emotional depth needed for AI safety testing. Examples of these can include, but are not limited to:

Theatre degrees or studies

Acting, theatre, improv, or voice-over experience

Strong writing skills, especially dialogue or scenario writing

Experience creating or inhabiting characters (e.g., performers, TTRPG roleplay, narrative designers)

Conversational design, interaction writing, or scripted roleplay experience

Participation in gaming, interactive storytelling, or digital communities where roleplay is common

What You’ll Be Working On

You will help identify, prevent, and characterize risks that emerge when users interact with AI systems.

Responsibilities may include:

Designing and executing red-teaming scenarios across diverse user behaviors

Reviewing AI-generated responses for safety, accuracy, and policy compliance

Identifying failure modes, edge cases, and behavioral risks

Assessing whether AI appropriately recognizes and responds to distress or escalation

Evaluating tone, boundaries, and appropriateness in sensitive interactions

Detecting misleading, overconfident, or unsafe responses

Evaluating multi-turn conversations for consistency and risk handling

Identifying gaps in responses, including missed signals or incomplete handling

Conducting qualitative analysis to identify behavioral patterns and system weaknesses

Documenting edge cases, failure patterns, and safety risks

Applying or contributing to evaluation rubrics, taxonomies, and frameworks

Supporting quality assurance (QA) to ensure consistency across evaluations

Collaborating with internal teams on AI safety and evaluation improvements

Participating in red teaming exercises to surface system vulnerabilities

Maintaining strict confidentiality and quality standards

What We’re Looking For

Successful candidates are detail-oriented, analytically strong, and experienced in evaluating or stress-testing AI systems in complex or high-risk scenarios.

Professional experience in one or more of the following:

LLM red teaming or AI safety evaluation

Trust & safety, content moderation, or policy enforcement

AI/ML evaluation, annotation, or QA workflows

Conversational analysis or behavioral risk assessment

Work involving sensitive or high-stakes user interactions

Strong understanding of:

AI safety principles and common failure modes

Behavioral risk, escalation patterns, and edge-case handling

Mental health sensitivity, boundaries, and responsible AI behavior

How users express distress, confusion, or harmful intent in conversation

Ability to identify:

Safety violations and policy gaps

Missed or mishandled risk signals

Unsafe, misleading, or overconfident responses

Inappropriate tone or boundary-setting

Failures in escalation, de-escalation, or resolution

Inconsistencies across multi-turn interactions

Experience with or Interest in:

Red teaming methodologies and adversarial testing

Evaluating conversational AI systems or chatbots

Developing or applying evaluation frameworks and rubrics

Understanding how AI systems perform under real user behavior

Comfort with:

Tech tools and platforms (Slack, spreadsheets, dashboards)

Evaluating AI-generated responses (no coding required, but must be tech-comfortable)

Ambiguity, iteration, and feedback-driven workflows

Willingness to:

Sign NDAs and work with sensitive or high-impact content

Nice to Have (Not Required)

Background in mental health, behavioral science, or psychology

Experience in QA, annotation, or qualitative analysis

Experience with AI systems in sensitive domains (e.g., healthcare, safety)

Familiarity with evaluation metrics or safety frameworks

Compensation

$30-60/hour, depending on experience and specific project tasks/difficulty

Original job Red Teaming Expert posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Apply Now

Auto-Apply to Similar Jobs

Share Job

Get your Resume Reviewed for Free

Automate Job Applications for Similar Jobs

Auto-Apply to Red Teaming Expert Jobs with your AI JobCopilot

Auto-Apply with AI

Similar Red Teaming Expert Jobs in the US

Get your Resume Reviewed for Free

Email address

Why are you reporting this job?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

All Job Ads are subject to GrabJobs’s Terms of Service. We allow users to flag postings that may be in violation of those terms. Job Ads may also be flagged by GrabJobs moderation team. However, no moderation system is perfect, and flagging a posting does not ensure that it will be removed.

Setup your job alert:

Frequency

By activating job alerts, I agree to GrabJobs Terms & Privacy Policy. I can unsubscribe to job alerts anytime. Skip

Red Teaming Expert

Job Description - Red Teaming Expert

Similar Red Teaming Expert Jobs in the US

Mobile Apps