cover image
ActiveFence

GenAI Analyst

Hybrid

London, United kingdom

Full Time

21-09-2025

Share this job:

Skills

Quality Assurance Attention to detail Large Language Models

Job Specifications

ActiveFence is seeking a driven, detail-focused professional to become a vital part of our team as a Generative AI Analyst. In this role, you'll dive into the cutting-edge of technology, meticulously analyzing various content infringements to secure the new wave of Generative AI tools. Your duties will include collaborating with experts in diverse fields such as Hate Speech, Misinformation, Intellectual Property and Copyright, Child Safety, among others.

Your tasks will involve writing adversarial; prompts to identify weaknesses in various AI models, including Large Language Models (LLMs), Text-to-Image, Text-to-Video, and beyond. You'll also oversee data management to guarantee the highest quality of outputs.

Responsibilities

Developing adversarial and risky prompt strategies across several areas of abuse to expose potential vulnerabilities in models.
Managing projects end-to-end, from initial planning and oversight through quality assurance to final delivery.
Handling extensive datasets across multiple languages and areas of abuse, ensuring precision and meticulous attention to detail.
Ongoing investigation into new tactics for circumventing foundational models' safety measures.
Working alongside diverse teams, engineering, product, policy, to tackle new challenges and craft forward-thinking strategies and resolutions.
Promoting a culture of knowledge exchange and continual learning within the team.

Requirements:

Must have:

Background in AI Safety and/or Responsible AI and/or AI Ethics
Familiarity with recent Generative AI models and agents is essential, though direct technical experience is not a prerequisite
Command of English at a near-native level
Attention to detail, organizational capabilities, and the capacity to juggle numerous tasks concurrently

Additional Wants:

Experience with various model types (Text-to-Text, Text-to-Image) is desirable
Prior experience with OSINT (Open Source Intelligence) will be considered an asset
A self-starter attitude, with the energy to excel in a fast-moving and variable environment

About the Company

ActiveFence is the leading provider of UGC and AI Safety solutions, delivering the industry's most robust safeguards, to protect the world's leading foundation models and AI-powered applications. Trusted by safety teams of all sizes, we help protect over three billion users from threats such as child abuse, exploitation, hate speech, and more. Our comprehensive solutions integrate deep intelligence research, AI-driven harmful content detection, and a robust moderation platform, enabling global platforms to operate safely and... Know more