Apollo Research
@apollo_research_ai
Londonhttps://www.apolloresearch.ai/ Technology, Information and InternetOverview
About Apollo Research
Apollo Research is an AI safety organization. We specialize in auditing high-risk failure modes, particularly deceptive alignment, in large AI models. Our primary objective is to minimize catastrophic risks associated with advanced AI systems that may exhibit deceptive behavior, where misaligned models appear aligned in order to pursue their own objectives.
Our approach involves conducting fundamental research on interpretability and behavioral model evaluations, which we then use to audit real-world models. Ultimately, our goal is to leverage interpretability tools for model evaluations, as we believe that examining model internals in combination with behavioral evaluations offers stronger safety assurances compared to behavioral evaluations alone.
Our approach involves conducting fundamental research on interpretability and behavioral model evaluations, which we then use to audit real-world models. Ultimately, our goal is to leverage interpretability tools for model evaluations, as we believe that examining model internals in combination with behavioral evaluations offers stronger safety assurances compared to behavioral evaluations alone.
Headquarters
LondonWebsite
https://www.apolloresearch.ai/Company Size
11-50 employeesIndustry
Technology, Information and InternetCompany Type
Privately HeldFounded
2023Specialties
Artificial Intelligence, Machine Learning, AI Safety, Interpretability, Model Evaluations, Audits, Research, and Policy AdvisingJobs

Finance Associate (Part time)
Apollo Research
London Area, United Kingdom

Full Stack Engineer (Monitoring)
Apollo Research
London, England, United Kingdom

Backend Engineer (Monitoring)
Apollo Research
London, England, United Kingdom

Full-stack Software Engineer
Apollo Research
London, England, United Kingdom

Backend Software Engineer
Apollo Research
London, England, United Kingdom