Jan Leike (born 1986 or 1987) is an AI alignment researcher who has worked at DeepMind and OpenAI. He joined Anthropic in May 2024.
Education
Jan Leike obtained his undergraduate degree from the University of Freiburg in Germany. After earning a master's degree in computer science, he pursued a PhD in machine learning at the Australian National University under the supervision of Marcus Hutter.
Career
Leike completed a six-month postdoctoral fellowship at the Future of Humanity Institute before joining DeepMind to focus on empirical AI safety research, where he collaborated with Shane Legg.
OpenAI
In 2021, Leike joined OpenAI. In June 2023, he and Ilya Sutskever became co-leaders of the newly introduced "superalignment" project, which aimed to solve, within four years, the problem of aligning future artificial superintelligences to ensure their safety. The project's approach centered on using relatively advanced AI systems to automate AI alignment research. At the time, Sutskever was OpenAI's Chief Scientist and Leike was its Head of Alignment. Leike was featured in Time's list of the 100 most influential people in AI in both 2023 and 2024.

In May 2024, Leike announced his resignation from OpenAI, following the departures of Ilya Sutskever, Daniel Kokotajlo, and several other AI safety employees from the company. Leike wrote that "Over the past years, safety culture and processes have taken a backseat to shiny products", and that he had "gradually lost trust" in OpenAI's leadership.
Anthropic
In May 2024, Leike joined Anthropic, an AI company founded by former OpenAI employees.