Apollo Research

www.apolloresearch.ai

1 Job

16 Employees

About the Company

Apollo Research is an AI safety organization. We specialize in auditing high-risk failure modes, particularly deceptive alignment, in large AI models. Our primary objective is to minimize catastrophic risks associated with advanced AI systems that may exhibit deceptive behavior, where misaligned models appear aligned in order to pursue their own objectives.

Our approach involves conducting fundamental research on interpretability and behavioral model evaluations, which we then use to audit real-world models. Ultimately, our goal is to leverage interpretability tools for model evaluations, as we believe that examining model internals in combination with behavioral evaluations offers stronger safety assurances than behavioral evaluations alone.

Listed Jobs

Company Name: Apollo Research
Job Title: Backend Software Engineer
Job Description:

Job title: Backend Software Engineer

Role Summary: Design, develop, and maintain backend systems and internal tooling to support frontier AGI safety research. Build evaluation libraries, orchestrate large‑scale agentic experiments, monitor LLM traffic, and create data pipelines for analysis and research.

Expectations: 5+ years professional software engineering experience with production Python code. Proven ability to lead feature development, own end‑to‑end solutions, and influence large codebases. Open‑source contribution, startup tech‑stack building, or significant product ownership are strong indicators.

Key Responsibilities:
- Rapidly prototype and iterate internal tools for running and analyzing thousands of LLM evaluations.
- Lead the design, implementation, and deployment of major features from concept through release.
- Collaborate with researchers to identify challenges, provide technical guidance, and debug research code.
- Define and advocate for software design best practices, code health, and reliable CI pipelines.
- Build and maintain telemetry APIs, data warehousing services, and orchestration tools for secure, parallel evaluation workflows.
- Communicate technical decisions, trade‑offs, and project status clearly to cross‑functional teams.

Required Skills:
- Advanced Python programming; experience with production‑grade backend frameworks.
- Strong architecture design, API development, and database modeling (SQL/NoSQL).
- CI/CD pipeline construction and optimization; automated testing and flaky‑test elimination.
- Telemetry, logging, and instrumentation for monitoring system reliability.
- Data warehousing and ETL pipelines for large‑scale evaluation transcripts.
- Cloud hosting (AWS preferred); familiarity with container orchestration (K8s, ECS).
- Experience with LLM evaluation, agentic systems, or cybersecurity/infosec is a bonus.
- Excellent written and verbal communication; ability to translate research requirements into technical specs.

Required Education & Certifications:
- Bachelor’s or Master’s degree in Computer Science, Software Engineering, or closely related field.
- No mandatory certifications, but relevant industry credentials (e.g., AWS Certified Developer) are advantageous.

London, United Kingdom

On site

Mid level

08-12-2025