Freelancer - GenAI Analyst
Job Description
About us
ActiveFence is the leading tool stack for Trust & Safety teams, worldwide. By relying on ActiveFence’s end-to-end solution, Trust & Safety teams – of all sizes – can keep users safe from the widest spectrum of online harms, unwanted content, and malicious behaviour, including child safety, disinformation, fraud, hate speech, terror, nudity, and more. Using cutting-edge AI and a team of world-class subject-matter experts to continuously collect, analyze, and contextualize data, ActiveFence ensures that in an ever-changing world, customers are always two steps ahead of bad actors. As a result, Trust & Safety teams can be proactive and provide maximum protection to users across a multitude of abuse areas, in 70+ languages.
Your tasks will involve writing adversarial prompts to identify weaknesses in various cutting-edge AI models, including Large Language Models (LLMs), Text-to-Image, Text-to-Video, Multi-Modal models, AI Agents and beyond. You’ll also manage and analyze datasets to ensure the generation of high-quality outputs and actionable insights that contribute to AI safety research.
Key Responsibilities
● Design adversarial prompts to test AI systems across multiple modalities.
● Identify, categorize, and document model weaknesses or unsafe outputs.
● Support data annotation, curation, and quality control processes.
● Summarize findings into structured reports or data templates.
Requirements:
● Proven experience with Generative AI models is essential, though direct technical experience is not a prerequisite.
● Understanding of risk taxonomies (e.g., harm categories, policy tiers).
● Command of English at a near-native level.
● Attention to detail and strong organizational skills.
● Ability to manage multiple tasks simultaneously and meet deadlines.
Additional Wants:
● Familiarity with various model types (Text-to-Text, Text-to-Image) is desirable.
● Experience with prompt injection, jailbreaks, and red-teaming techniques.
● Prior work in model evaluation, prompt engineering, or safety analysis.
● Regional expertise or cultural fluency in specific geopolitical areas.
How to apply:
Please submit your CV to: activefence.freelancerpromptswriter@applynow.io
