AI Watch

Welcome! This is a website to track people and organizations in the AI safety/alignment/AI existential risk communities. A position or organization being on AI Watch does not indicate an assessment that that position or organization is actually making AI safer or that the position or organization is good for the world in any way. It is mostly a sociological indication that the position or organization is associated with these communities, as well as an indication that the position or organization claims to be working on AI safety or alignment. (There are some plans to eventually introduce such assessments on AI Watch, but for now there are none.) See the code repository for the source code and data of this website.

This website is developed by Issa Rice with data contributions from Sebastian Sanchez, Amana Rice, and Vipul Naik, and has been partially funded by Vipul Naik and Mati Roy (who in July 2023 paid for the time Issa had spent answering people’s questions about AI Watch up until that point).

Last updated on 2024-09-12; see here for a full list of recent changes.

Agendas

Agenda name Associated people Associated organizations
Iterated amplification Paul Christiano, Buck Shlegeris, Dario Amodei OpenAI
Embedded agency Eliezer Yudkowsky, Scott Garrabrant, Abram Demski Machine Intelligence Research Institute
Comprehensive AI services Eric Drexler Future of Humanity Institute
Ambitious value learning Stuart Armstrong Future of Humanity Institute
Factored cognition Andreas Stuhlmüller Ought
Recursive reward modeling Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg Google DeepMind
Debate Paul Christiano OpenAI
Interpretability Christopher Olah
Inverse reinforcement learning
Preference learning
Cooperative inverse reinforcement learning
Imitation learning
Alignment for advanced machine learning systems Jessica Taylor, Eliezer Yudkowsky, Patrick LaVictoire, Andrew Critch Machine Intelligence Research Institute
Learning-theoretic AI alignment Vanessa Kosoy
Counterfactual reasoning Jacob Steinhardt

AI safety relation by subject

Note: as shown by the large number of “unknown” values, most of the positions haven’t been categorized by relation/subject so this table will only be useful in the future.

Subject Unknown AGI organization GCR organization position unrelated Total
Unknown 8525 467 64 282 1 9339
background 0 0 0 23 0 23
general 0 0 3 43 0 46
policy 0 0 0 1 0 1
popularization 0 0 0 2 0 2
software engineering 0 2 0 8 0 10
strategy 0 0 0 1 0 1
technical research 6 2 3 33 1 45
Total 8531 471 70 393 2 9467

Positions summary by year

Note: as shown by the large number of “unknown” values, most of the positions haven’t been categorized by start/end dates so this table will only be useful in the future.

Year Start date End date
Unknown 1076 6157
1986 1 0
1993 1 0
1997 3 0
1999 2 1
2000 5 0
2001 5 0
2002 62 1
2003 15 1
2004 32 4
2005 60 5
2006 37 13
2007 48 4
2008 78 8
2009 131 16
2010 181 42
2011 218 55
2012 167 75
2013 194 92
2014 263 66
2015 371 153
2016 633 229
2017 705 295
2018 863 397
2019 912 324
2020 892 317
2021 966 337
2022 809 405
2023 610 300
2024 127 170

Positions grouped by person

Showing 201 people with positions.

Name Number of organizations List of organizations
Paul Christiano 9 AI Impacts, Alignment Research Center, Future of Humanity Institute, Machine Intelligence Research Institute, Open Philanthropy, OpenAI, Ought, Redwood Research, University of California, Berkeley
Nick Bostrom 7 Centre for the Study of Existential Risk, Future of Humanity Institute, Future of Life Institute, Google DeepMind, Leverhulme Centre for the Future of Intelligence, Machine Intelligence Research Institute, University of Oxford
Stuart Russell 7 Berkeley Existential Risk Initiative, Center for Human-Compatible AI, Centre for the Study of Existential Risk, Future of Life Institute, Leverhulme Centre for the Future of Intelligence, Machine Intelligence Research Institute, University of California, Berkeley
Andrew Critch 6 Berkeley Existential Risk Initiative, Center for Applied Rationality, Center for Human-Compatible AI, Encultured AI, Machine Intelligence Research Institute, University of California, Berkeley
Dario Amodei 5 Anthropic, Cooperative AI Foundation, Google Brain, Open Philanthropy, OpenAI
Kyle Scott 5 Alignment Research Center, Berkeley Existential Risk Initiative, Center for Applied Rationality, Future of Humanity Institute, Palisade Research
Ryan Carey 5 Centre for the Study of Existential Risk, Future of Humanity Institute, Machine Intelligence Research Institute, OpenAI, Ought
Seán Ó hÉigeartaigh 5 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Future of Humanity Institute, Global Catastrophic Risk Institute, Leverhulme Centre for the Future of Intelligence
Allan Dafoe 4 Cooperative AI Foundation, Future of Humanity Institute, University of Oxford, Yale University
Bas R. Steunebrink 4 IDSIA, NNAISENSE, SUPSI, Università della Svizzera italiana
Heather Roff 4 Arizona State University, Leverhulme Centre for the Future of Intelligence, New America Foundation, University of Oxford
Jaan Tallinn 4 Berkeley Existential Risk Initiative, Centre for the Study of Existential Risk, Future of Life Institute, Machine Intelligence Research Institute
Jan Leike 4 Australian National University, Future of Humanity Institute, Google DeepMind, Machine Intelligence Research Institute
Matthijs Maas 4 Global Catastrophic Risk Institute, Global Politics of Artificial Intelligence Research Group at Yale University and University of Oxford, Hague Centre for Strategic Studies, University of Copenhagen
Miles Brundage 4 Arizona State University, Future of Humanity Institute, General AI Challenge, OpenAI
Roman Yampolskiy 4 General AI Challenge, Global Catastrophic Risk Institute, Machine Intelligence Research Institute, University of Louisville
Seth Baum 4 Centre for the Study of Existential Risk, Global Catastrophic Risk Institute, Machine Intelligence Research Institute, Social & Environmental Entrepreneurs
Adrian Weller 3 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence, University of Cambridge
Alison Gopnik 3 Center for Human-Compatible AI, Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Andrew Snyder-Beattie 3 Berkeley Existential Risk Initiative, Future of Humanity Institute, Leverhulme Centre for the Future of Intelligence
Bart Selman 3 Center for Human-Compatible AI, Cornell University, Machine Intelligence Research Institute
Ben Goldhaber 3 Center for Applied Rationality, Fund for Alignment Research, Ought
Ben Weinstein-Raun 3 Machine Intelligence Research Institute, Ought, Redwood Research
Benjamin Mann 3 Anthropic, Machine Intelligence Research Institute, OpenAI
Daniel Dewey 3 Future of Humanity Institute, Future of Life Institute, Machine Intelligence Research Institute
Daniela Amodei 3 Anthropic, Epoch, OpenAI
Elon Musk 3 Centre for the Study of Existential Risk, Future of Life Institute, OpenAI
Eric Rogstad 3 Berkeley Existential Risk Initiative, Center for Applied Rationality, Lightcone Infrastructure
Francesca Rossi 3 Future of Life Institute, Leverhulme Centre for the Future of Intelligence, University of Padova
Gillian Hadfield 3 Center for Human-Compatible AI, Cooperative AI Foundation, OpenAI
Girish Sastry 3 Future of Humanity Institute, OpenAI, Ought
Helen Toner 3 Center for Security and Emerging Technology, Future of Humanity Institute, OpenAI
Huw Price 3 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence, University of Cambridge
Jack Clark 3 Anthropic, Center for Security and Emerging Technology, OpenAI
Jacob Steinhardt 3 Center for Human-Compatible AI, Open Philanthropy, Stanford University
Janos Kramar 3 Future of Life Institute, Machine Intelligence Research Institute, University of Montreal
Jeremy Schlatter 3 Berkeley Existential Risk Initiative, Machine Intelligence Research Institute, OpenAI
Johannes Treutlein 3 Center for Human-Compatible AI, Centre for Effective Altruism, Effective Altruism Foundation
Jürgen Schmidhuber 3 IDSIA, SUPSI, Università della Svizzera italiana
Kaj Sotala 3 Foundational Research Institute, Lightcone Infrastructure, Machine Intelligence Research Institute
Katja Grace 3 AI Impacts, Future of Humanity Institute, Machine Intelligence Research Institute
Laurent Orseau 3 AgroParisTech, Google DeepMind, INRA
Lawrence Chan 3 Alignment Research Center, Center for Human-Compatible AI, Fund for Alignment Research
Malo Bourgon 3 Berkeley Existential Risk Initiative, Machine Intelligence Research Institute, Redwood Research
Mark Ring 3 IDSIA, SUPSI, Università della Svizzera italiana
Martin Rees 3 Centre for the Study of Existential Risk, Future of Life Institute, Leverhulme Centre for the Future of Intelligence
Matthew Graves 3 Center for Applied Rationality, Lightcone Infrastructure, Machine Intelligence Research Institute
Max Tegmark 3 Centre for the Study of Existential Risk, Future of Life Institute, Machine Intelligence Research Institute
Michael Cohen 3 Center for Human-Compatible AI, Future of Humanity Institute, The Australian National University
Oliver Habryka 3 Center for Applied Rationality, Lightcone Infrastructure, Machine Intelligence Research Institute
Owain Evans 3 Future of Humanity Institute, Ought, University of Oxford
Patrick LaVictoire 3 Machine Intelligence Research Institute, Quixey, University of Wisconsin–Madison
Peter Barnett 3 Center for Human-Compatible AI, Machine Intelligence Research Institute, Nonlinear
Pieter Abbeel 3 Center for Human-Compatible AI, OpenAI, University of California, Berkeley
Qiaochu Yuan 3 Berkeley Existential Risk Initiative, Center for Applied Rationality, University of California, Berkeley
Ramana Kumar 3 Data61, Machine Intelligence Research Institute, University of Cambridge
Robin Hanson 3 Future of Humanity Institute, George Mason University, Machine Intelligence Research Institute
Scott Emmons 3 Center for AI Safety, Center for Human-Compatible AI, Fund for Alignment Research
Tom Brown 3 Anthropic, Google Brain, OpenAI
Tom McGrath 3 AI Safety Camp, Future of Humanity Institute, Ought
Victoria Krakovna 3 Future of Life Institute, Google DeepMind, Machine Intelligence Research Institute
Yang Liu 3 Centre for the Study of Existential Risk, OpenAI, University of Cambridge
Adam Gleave 2 Center for Human-Compatible AI, Fund for Alignment Research
Adam Scholl 2 Center for Applied Rationality, Global Catastrophic Risk Institute
Ales Flidr 2 Centre for Effective Altruism, Future of Life Institute
Alex Tamkin 2 Anthropic, Stanford University
Alex Zhu 2 Machine Intelligence Research Institute, Nonlinear
Alexey Potapov 2 AIDEUS, ITMO University
Amanda Askell 2 Anthropic, OpenAI
Amrit Sidhu-Brar 2 Cooperative AI Foundation, Effective Altruism Foundation
Anca Dragan 2 Center for Human-Compatible AI, University of California, Berkeley
Andreas Stuhlmüller 2 Ought, Stanford University
Anna Salamon 2 Center for Applied Rationality, Machine Intelligence Research Institute
Ben Goertzel 2 CogPrime, Machine Intelligence Research Institute
Benya Fallenstein 2 Machine Intelligence Research Institute, University of Bristol
Beth Barnes 2 Center for Human-Compatible AI, Centre for the Study of Existential Risk
Blake Borgeson 2 Machine Intelligence Research Institute, Redwood Research
Brandon Perry 2 AI Safety Camp, Center for Human-Compatible AI
Brian Tomasik 2 Effective Altruism Foundation, Foundational Research Institute
Buck Shlegeris 2 Machine Intelligence Research Institute, Redwood Research
Carl Shulman 2 Future of Humanity Institute, Machine Intelligence Research Institute
Carla Zoe Cremer 2 Future of Humanity Institute, Leverhulme Centre for the Future of Intelligence
Carrick Flynn 2 Center for Security and Emerging Technology, Future of Humanity Institute
Catherine Olsson 2 Anthropic, OpenAI
Charlie Rogers-Smith 2 Palisade Research, University of Oxford
Chris Maddison 2 Google DeepMind, University of Oxford
Christine Peterson 2 Foresight Institute, Machine Intelligence Research Institute
Christopher Cundy 2 Center for Human-Compatible AI, Future of Humanity Institute
Christopher Olah 2 Google Brain, OpenAI
Connor Flexman 2 AI Impacts, Machine Intelligence Research Institute
Dan Hendrycks 2 Center for AI Safety, University of California, Berkeley
Daniel Filan 2 Center for Human-Compatible AI, Future of Humanity Institute
Daniel Kokotajlo 2 AI Impacts, Effective Altruism Foundation
Daniel Ziegler 2 OpenAI, Redwood Research
Danny Hernandez 2 Anthropic, OpenAI
David Abel 2 Brown University, Future of Humanity Institute
David Kristoffersson 2 AI Safety Camp, Future of Humanity Institute
David Krueger 2 Center for Human-Compatible AI, Future of Humanity Institute
David Lindner 2 AI Safety Camp, Center for Human-Compatible AI
David Manheim 2 Association for Long Term Existence and Resilience, Future of Humanity Institute
Demis Hassabis 2 Google DeepMind, Leverhulme Centre for the Future of Intelligence
Dmitrii Krasheninnikov 2 Center for Human-Compatible AI, University of Amsterdam
Dorsa Sadigh 2 Center for Human-Compatible AI, Stanford University
Durk Kingma 2 Google DeepMind, OpenAI
Dylan Hadfield-Menell 2 Center for Human-Compatible AI, University of California, Berkeley
Elizabeth Barnes 2 Alignment Research Center, Center for Human-Compatible AI
Ethan Perez 2 Fund for Alignment Research, New York University
Fazl Barez 2 Centre for the Study of Existential Risk, Future of Life Institute
Gina Stuessy 2 Berkeley Existential Risk Initiative, Center for Applied Rationality
Gwern Branwen 2 Center for Applied Rationality, Machine Intelligence Research Institute
Haydn Belfield 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Holden Karnofsky 2 OpenAI, Redwood Research
Ian Goodfellow 2 Google DeepMind, OpenAI
Ian McKenzie 2 Fund for Alignment Research, Ought
Jacob Hilton 2 Alignment Research Center, OpenAI
Jacob Lagerros 2 Future of Humanity Institute, Lightcone Infrastructure
Jakob Foerster 2 Center for Human-Compatible AI, OpenAI
James Miller 2 Machine Intelligence Research Institute, Smith College
James Paul Gonzales 2 Berkeley Existential Risk Initiative, Center for Human-Compatible AI
Jeffrey Ladish 2 Anthropic, Palisade Research
Jelena Luketina 2 Aalto University, Université de Montréal
Jesse Clifton 2 Cooperative AI Foundation, Effective Altruism Foundation
Jesse Galef 2 Future of Life Institute, Machine Intelligence Research Institute
Jesse Liptrap 2 Center for Applied Rationality, Machine Intelligence Research Institute
Jia Yuan Loke 2 Anthropic, Effective Altruism Foundation
Jimmy Rintjema 2 AI Impacts, Machine Intelligence Research Institute
Joar Skalse 2 Future of Humanity Institute, Oxford University
Johannes Heidecke 2 AI Safety Camp, Road to AI Safety Excellence
John Salvatier 2 AI Impacts, Future of Humanity Institute
Jon Gauthier 2 Massachusetts Institute of Technology, OpenAI
José Hernández-Orallo 2 General AI Challenge, Leverhulme Centre for the Future of Intelligence
Joseph Halpern 2 Center for Human-Compatible AI, Cornell University
Josh Jacobson 2 Alignment Research Center, Berkeley Existential Risk Initiative
Joshua Fox 2 Association for Long Term Existence and Resilience, Machine Intelligence Research Institute
Joshua Gans 2 National Bureau of Economic Research, University of Toronto
Julia Galef 2 Center for Applied Rationality, OpenAI
Jun Shern Chan 2 Center for AI Safety, Fund for Alignment Research
Justin Shovelain 2 Convergence Analysis, Machine Intelligence Research Institute
Kenzi Amodei 2 Berkeley Existential Risk Initiative, Center for Applied Rationality
Kristinn R. Thórisson 2 Center for Analysis & Design of Intelligent Agents, Icelandic Institute for Intelligent Machines
Lewis Hammond 2 Cooperative AI Foundation, Future of Humanity Institute
Linda Linsefors 2 AI Safety Camp, Machine Intelligence Research Institute
Lukas Gloor 2 Effective Altruism Foundation, Foundational Research Institute
Marcello Herreshoff 2 Google, Machine Intelligence Research Institute
Marek Havrda 2 General AI Challenge, GoodAI
Marek Rosa 2 General AI Challenge, GoodAI
Margaret Boden 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Max Daniel 2 Effective Altruism Foundation, Foundational Research Institute
Melody Guan 2 Future of Life Institute, Google Brain
Michael Blume 2 Center for Applied Rationality, Machine Intelligence Research Institute
Michael Keenan 2 Berkeley Existential Risk Initiative, Center for Applied Rationality
Michael Page 2 Center for Security and Emerging Technology, OpenAI
Michael Wellman 2 Center for Human-Compatible AI, University of Michigan
Mihaly Barasz 2 Machine Intelligence Research Institute, Nilcons
Mrinank Sharma 2 Future of Humanity Institute, University of Oxford
Murray Shanahan 2 Centre for the Study of Existential Risk, Leverhulme Centre for the Future of Intelligence
Natalia Díaz Rodríguez 2 ContinualAI, Flowers Laboratory
Neal Jean 2 Future of Humanity Institute, Ought
Neel Nanda 2 Anthropic, Center for Human-Compatible AI
Nisan Stiennon 2 Center for Human-Compatible AI, Machine Intelligence Research Institute
Olga Afanasjeva 2 General AI Challenge, GoodAI
Owen Cotton-Barratt 2 Centre for Effective Altruism, Redwood Research
Ozzie Gooen 2 Convergence Analysis, Ought
Randall C. O’Reilly 2 eCortex, University of Colorado Boulder
Rebecca Raible 2 Anthropic, Berkeley Existential Risk Initiative
Remco Zwetsloot 2 Center for Security and Emerging Technology, OpenAI
Remmelt Ellen 2 AI Safety Camp, Road to AI Safety Excellence
Reuben Stern 2 Ludwig Maximilian University of Munich, University of Wisconsin–Madison
Richard Ngo 2 OpenAI, University of Cambridge
Robert Miles 2 Nonlinear, Road to AI Safety Excellence
Robert Mushkatblat 2 Lightcone Infrastructure, Machine Intelligence Research Institute
Roger Grosse 2 Future of Humanity Institute, University of Toronto
Rosie Campbell 2 Center for Human-Compatible AI, OpenAI
Roxanne Heston 2 Center for Security and Emerging Technology, Future of Humanity Institute
Sam McCandlish 2 Anthropic, OpenAI
Sawyer Bernath 2 Berkeley Existential Risk Initiative, Fund for Alignment Research
Sergey Rodionov 2 AIDEUS, Aix-Marseille University
Smitha Milli 2 Center for Human-Compatible AI, University of California, Berkeley
Sören Mindermann 2 Center for Human-Compatible AI, Future of Humanity Institute
Stanislav Fort 2 Google DeepMind, Stanford University
Stephanie Zolayvar 2 AI Impacts, Center for Applied Rationality
Stephen Hawking 2 Centre for the Study of Existential Risk, Future of Life Institute
Steve Omohundro 2 Machine Intelligence Research Institute, Self-Aware Systems
Steven Umbrello 2 Global Catastrophic Risk Institute, Institute of Ethics and Emerging Technologies
Stuart Armstrong 2 Future of Humanity Institute, Machine Intelligence Research Institute
Tamay Besiroglu 2 Epoch, Future of Humanity Institute
Tao Lin 2 Alignment Research Center, Redwood Research
Thomas Woodside 2 Center for AI Safety, Center for Security and Emerging Technology
Timothée Lesort 2 ContinualAI, Flowers Laboratory
Timothy Telleen-Lawton 2 Anthropic, Center for Applied Rationality
Tobias Baumann 2 Foundational Research Institute, University College London
Tom Everitt 2 Australian National University, Google DeepMind
Tomasz Korbak 2 Anthropic, Fund for Alignment Research
Tsvi Benson-Tilsen 2 Center for Applied Rationality, Machine Intelligence Research Institute
Vael Gates 2 Center for Human-Compatible AI, Fund for Alignment Research
Vanessa Kosoy 2 Association for Long Term Existence and Resilience, Machine Intelligence Research Institute
Vincent Conitzer 2 Cooperative AI Foundation, Duke University
Will Grathwohl 2 Google DeepMind, OpenAI
Will Millership 2 General AI Challenge, GoodAI
Will Sawin 2 Institute for Theoretical Studies at ETH Zurich, Princeton University
Zac Kenton 2 Montreal Institute for Learning Algorithms, Ought

Positions grouped by organization

Showing 155 organizations.

Organization Number of people List of people
OpenAI 363 Evan Weiss, Shuyuan Zhang, Mada Aflak, Laura W., Yasuyoshi Sakamoto, Stewart Hall, Siyuan Fu, Ollie Jaffe, Amber Yore, Kleanthes K., Weiyi Zheng, Uğurcan Türkdoğan, Francis Z., Enoch Cheung, Pedram Keyani, John Rizzo, Tiffany C., Thomas Dimson, CJ Minott, Ofir Nachum, Allan J., Wei An Lee, Ilan Bigio, Erica T., Eric Rynerson, Will Saborio, David Carr, Daniel Kappler, Anton Tananaev, Srinivas Narayanan, Andrei Alexandru, Brydon Eastman, Ali Kamali, Tina Miranda, Hisham Elhaddad, Justin B., David Medina, David Hengky, Michelle Pokrass, Tianhao Zheng, Adam Perelman, Jan Hendrik Kirchner, Hossem Ben Ayed, Cory Decareaux, Mati Roy, Steven Bills, Akila Welihinda, Yaniv Markovski, Vishal Kuo, Eugene Wu, Chester Cho, Adam Nace, Jessica Shieh, Sully Chen, Ryan Peterson, Oleg Mürk, Bogo Giertler, Karl Whitford Pollard, Tatiana Zolotova, Chaitra A., Arun Vijayvergiya, Juston Forte, Joanne Jang, Zarina Stanik, Rob Mallery, Dave Willner, Preston Tuggle, Austin Wiseman, Atqiya Abida Anjum, Angela Jiang, Adam Goldberg, Davit Khachatryan, Rajeev Nayak, Matthew Gentzel, Lama Ahmad, Giambattista Parascandolo, Sarah Shoker, Richard Ngo, Carroll Wainwright, Anna Makanju, Elie Georges, Angie Luo, Vitchyr Pong, Victor Benito Garcia Rocha, Stefanie Biaggi, Rosie Campbell, Lukasz Kaiser, Lisa Dethridge, Vlad Ursu, Isabel Alves de Lima, Sarthak Agrawal, Johannes H., Emanuele Marchiori, Radhika Mathur, Kyle Kosic, Jason Kwon, Natalie Summers, Tabarak Khan, Nicolas Norberto Corizzo, Bob Rotsted, Jesse Han, Ishant Singh, Hannah Wong, Evan Morikawa, Sinith T., Shawn Jain, Che Chang, Zack Kass, Steven Adler, Lucas Negritto, Jonathan Gordon, Maddie Simens, Tarun Gogineni, Phuong Vu, Philippe Tillet, Bram Adams, Adam Rhodes, Julián Santoro, Tyna Eloundou, Fotios Chantzis, Dave Cummings, Mo Bavarian, Theresa Lopez, Denny Jin, Joel Lehman, Raul Puri, Joost Huizinga, Red A., Emy Parparita, Kelly Sims, Arvind Neelakantan, Tim Yanchen Wang, Rachel Lim, Jeff Clune, Tom Rubin, Fraser Kelton, Roger Xu Jiang, Aris Konstantinidis, Jian O., Jacquelyn Lau, Stanislas Polu, Tao Xu, Gretchen M. Krueger, Girish Sastry, Cullen O'Keefe, Mario Saltarelli, Benjamin Mann, Luke Miller, Frances Choi, Long Ouyang, Ife Riamah, Richard Dunn, Peter Hoeschele, Nikolas Tezak, Mor Katz, Alex Paino, Karson Elmgren, Jerry Tworek, Yi Wu, Ilge Akkaya, Fatma Tarlaci, Elynn Chen, Edgar Barraza, Danny Hernandez, Christina Hendrickson, Maxim Sokolov, Jonathan Michaux, Yuhao Wan, Janet Brown, Nancy Otero, Bianca Martin, Ben Chess, Katie Mayer, Qiming Yuan, Mateusz Litwin, Tom Brown, Clemens Winter, Amanda Askell, Janine Korovesis, Daniela Amodei, Mikhail Pavlov, Lei Zhang, Justin Wang, Jacob Hilton, Todor Markov, Ian Atha, Sue Yoon, Christopher Olah, Maddie Hall, Jacob Jackson, Taehoon Kim, Brad Lightcap, Miles Brundage, Michał Staniszewski, Arthur Petron, Matt Mochary, Jeffrey Wu, Ingmar Kanitscheider, Gillian Hadfield, Ethan Knight, Sophia Arakelyan, Christine McLeavey Payne, Nadja Rhodes, Munashe Shumba, Mira Murati, Karl Cobbe, Joshua Meier, Johannes Otterbach, Yilun Du, Xingyou (Richard) Song, Ifu Aniemeka, Holly Grimm, Hannah Davis, Susan Zhang, Suchir Balaji, Erin Grant, Sam McCandlish, Sadhika Malladi, Daniel Ziegler, Michael Petrov, Aravind Srinivas, Louis Cheong, Will Grathwohl, Hanjun Dai, Adam D’Angelo, Peter Zhokhov, Aleksandar Botev, Henrique Ponde de Oliveira Pinto, Thomas Anthony, Eric Sigler, Elena Chatziathanasiadou, Diane Yoon, Rewon Child, Manuel Sherbakoff, Maran Nelson, Julia Galef, Ryan Carey, Parnian Barekatain, Lilian Weng, Kevin Wong, Kaleo Hao, Glenn Powell, David Farhi, Remco Zwetsloot, Christy Dennison, Ashley C. Pilipiszyn, Mathew Shrwed, Adam Smets, Tasha McCauley, David Luan, Maciej Chociej, Jonathan Ward, Jonathan Raiman, Phillip Isola, Nikhil Mishra, Bowen Baker, Alex Nichol, Larissa Schiavo, Karthik Narasimhan, Joshua Achiam, Yuping Luo, Geoffrey Irving, Christos Louizos, Cathy Wu, Oleg Klimov, Brooke Chan, AlShaun Baksh, Aditya Grover, Jiaming Song, Yuhuai Wu, Jakob Foerster, Dustin Tran, Maruan Al-Shedivat, Lerrel Pinto, Kevin Frans, Jason Peng, Yang Liu, Xue Bin Peng, Trapit Bansal, Han Zhang, David Lansky, Quirin Fischer, Christopher Hesse, Matthias Plappert, Art Chaidarun, Rein Houthooft, Christopher Berner, Jean Harb, Ryan Lowe, Aleks Kamko, Jakub Pachocki, Yaroslav Bulatov, Erika Reinhardt, Shariq Hashme, Richard Chen, Danielle Buma, Peter Welinder, Bob McGrew, Michael Page, Jonathan Ho, Tim Shi, Jeremy Schlatter, Taco Cohen, Szymon Sidor, Desmond Henderson, Rachel Fong, Marie La, Louise Cabansay, Josh Tobin, Jonathan Hernandez, Jack Clark, Harri Edwards, Marika Allely, Ludwig Pettersson, Tambet Matiisen, Filip Wolski, Igor Mordatch, Catherine Olsson, Scott Gray, Dario Amodei, Craig Quiter, Zain Shah, Rafał Józefowicz, Pieter Abbeel, Linxi Fan, Kate Miltenberger, Jon Gauthier, Tyler Neylon, Paul Christiano, Marcin Andrychowicz, Jie Tang, Peter Chen, Prafulla Dhariwal, Jim Fan, Tim Salimans, Shivon Zilis, Jonas Schneider, Jeff Arnold, Eric Price, Alec Radford, Yuri Burda, Chris Clark, Ian Goodfellow, Rocky Duan, Bradly Stadie, Andrej Karpathy, Trevor Blackwell, Ilya Sutskever, Elon Musk, Sam Altman, Reid Hoffman, John Schulman, Wojciech Zaremba, Durk Kingma, Matt Krisiloff, Vicki Cheung, Greg Brockman, Lucy Qin, Jonathan Gray, Anish Athalye, Javier Gai, Holden Karnofsky, Helen Toner
Machine Intelligence Research Institute 181 Jimmy Rintjema, Protyay Shyam Chowdhury, Lisa Thiergart, Jeremy Gillen, Gretta Duleba, Peter Barnett, James Payor, Edward Kmett, Victoria Krakovna, Carson Jones, Linda Linsefors, Evan Hubinger, David Simmons, Daniel Demski, Ben Weinstein-Raun, Alex Zhu, Alex Mennen, Alex Appel, Andrew Critch, Buck Shlegeris, Nick Tarleton, Blake Borgeson, Kurt Brown, Jesse Liptrap, Benjamin Mann, Sam Eisenstat, Jeremy Schlatter, Jan Leike, Matthew Graves, Ryan Carey, Connor Flexman, Colm Ó Riain, Aaron Silverbook, Gary Drescher, Kaya Stechly, Andrew Lapinski-Barker, Robin Hanson, Jack Gallagher, Jaan Tallinn, Bart Selman, Stuart Russell, Ramana Kumar, Vanessa Kosoy, Nate Thomas, Abram Demski, Stuart Armstrong, Luke Muehlhauser, Jessica Taylor, Jed McCaleb, Jake Moskowitz, Scott Garrabrant, Jesse Galef, Matthew Fallshaw, Tsvi Benson-Tilsen, Nicolas Gagné, Elizabeth Morningstar, Lila Rieber, Vipul Naik, Nate Soares, Daniel Lewis, Rob Bensinger, Richard Neal, Robert Mushkatblat, Dávid Natingga, Nathan Clark, Moshe Looks, Kaj Sotala, James Miller, Seth Baum, Roman Yampolskiy, Randal Koene, Evan Erickson, Sebastian Nickel, Oliver Habryka, Paul Christiano, Patrick LaVictoire, Nisan Stiennon, Mihaly Barasz, Joshua Fox, Jeremy Miller, Bill Hibbard, Benya Fallenstein, Anja Heinisch, Alex Altair, Vladimir Nesov, Steve Rayhawk, Stephen Barnes, Louie Helm, Ioven Fables, Patrick Robotham, Daniel Roth, Pedro Chaves, Topher Brennan, Liron Shapira, Carl Shulman, Nickolai Leschov, Jonathan Wang, Cameron Taylor, Malo Bourgon, Kevin Fischer, Jake Miller, Gwern Branwen, Erica Edelman, Alex Vermeer, Tomer Kagan, Pejman Makhfi, Lincoln Quirk, Robert V. Brazell, Nevin Freeman, Minda Myers, Keefe Roedersheimer, Diego Caleiro, Will Newsome, Peter Scheyer, Jasen Murray, Peter de Blanc, Luke Grecki, Daniel Dewey, Abraham Wolk, Thomas Colthurst, Stanislas Sochacki, Janos Kramar, Dennis Fan, Ben Hoskin, Ben Goertzel, Jason Levin, Tim Czech, Robert Zahra, Frank Adamek, Aruna Vassar, Amy Willey, Max Tegmark, Henrik Jonsson, Harrison Willey, Edwin Evans, Anna Salamon, Zack M. Davis, Michael Blume, Kemal Eren, Michael Vassar, Andrew Rettek, Katja Grace, Justin Shovelain, Andriy Brodskyy, Andrew Hay, Vincent Fagot, Thomas McCabe, Steven Kaas, Roko Mijic, Bryan Bishop, Alyssa Vance, Peter Cheeseman, David Hart, Susan Fonseca-Klein, C. Colby Thomson, Jonas Lamis, Steve Omohundro, Bruce Klein, Allison Taguchi, Neil Jacobstein, Brian Atkins, Barney Pell, Tyler Emerson, Carolyn L. Burke, Marcello Herreshoff, Peter Thiel, Rick Schwall, Jeff Medina, Emil Gilliam, Nick Bostrom, Christine Peterson, Aubrey de Grey, Ray Kurzweil, Michael Roy Ames, Jeff Alexander, Michael Wilson, Michael Anissimov, Christian Rovner, Michael Raimondi, Eliezer Yudkowsky, Sabine Atkins
Anthropic 170 Joel Lewenstein, Nina Rimsky, Eilona Maitski, Diego Iaconelli, Coyote Codornices Marin, Chris O'Connell, Chinsin Sim, Adam Pearce, Adam Dix, Sally Aldous, Rob Greenlee, Ranell Nakayama, Rae Phillips, Patrick Ekeruo, Nicola Lau, Nicholas Marwell, Meg Tong, Mark S., Laila Rafi, Julian Williams, Jonathan Marcus, Joel Pobar, Graham Jackson, Elaine C., Connor Holloway, Christopher Chalek, Ashley Zlatinov, Akila S., Vu Bui, Vinay Rao, Tomasz Korbak, Rishi Gupta, Kyle Turman, Kei Nishimura, JB Boin, Jamie Neuwirth, Isabel Larrow, Isaac Dunn, Hunar Batra, Dana Malman Warren, Carrie Bentley, Brian Delahunty, Alfred Mountfield, Stephen Jung, Sasha de Marigny, Kate Jensen, Daniel Rosenthal, Brett Andrus, Brendan Collins, Amir Kashanchi, Zack Witten, Elena L., Dianne Na Penn, Anton Paquin, Shawn Owen, Nicholas Turner, Natalie Esperance, Marisa Gobby, Gautham Raj, Everett Katigbak, Alex Tamkin, Zubair Jandali, Tony H., Tanya Singh, Samantha Wong, Rachit Agarwal, Laura Colley, Julia Schmaltz, Josiah Burke, Jihong Kim, Jennifer Pisansky, Evan Frondorf, Emmanuel Ameisen, Dan Dascalescu, Aaron Begg, Ryan Seunghwan Kim, Ruhua Jiang, Pujaa Rajan, Paul-Frederik Schubert, Jason Clinton, Cassandra Evraets, Benoit Steiner, Avital Balwit, Robert Baden, Nathan Bailey, Joshua Batson, Jenan Wise, Ansh Radhakrishnan, Angie Lal, Yifan Wu, Sandy Banerjee, Ryan Soklaski, Nikhil Bhargava, Keri Warr, Julieann Choi, Janel Thamkul, Frances Pye, Esin Durmus, Elizabeth Edwards-Appell, Diana Jung, David Hwang, Ben Kuhn, Alex S., Adam Jermyn, Vlad G., Thompson Paine, Marina Favaro, Linh-Chi T., Justin Spahr-Summers, Gyula Lakatos, Ethan Forrest, Devi Borg, Brayden McLean, Amanda (Lipson) Kelley, Alex Silverstein, Ethan Langevin, Autumn Russell, Peter Lofgren, James Sully, Oliver Rausch, Mike Lambert, Matt Bell, Karina Nguyen, Hongbin Chen, Brian Israel, Neerav Kingsland, Landon Goldberg, Deep Ganguli, Noemí Mercado, Miranda Zhang, Da Yan, Sam Bowman, Nicholas Schiefer, Guro Khundadze, Scott Johnston, Thomas Liao, Shauna Kravec, Saurav Kadavath, Rebecca Raible, Dustin Li, Bryan Seethor, Tom Conerly, Neel Nanda, Jackson Kernion, Jia Yuan Loke, Andy Jones, Liane Lovitt, Jeffrey Ladish, Timothy Telleen-Lawton, Yuntao Bai, Dawn Drain, Anna Chen, Nelson Elhage, Kamal Ndousse, Catherine Olsson, Amanda Askell, Dario Amodei, Danny Hernandez, Benjamin Mann, Tom Brown, Sam McCandlish, Nicholas Joseph, Jared Kaplan, Jack Clark, Daniela Amodei, Zac Hatfield-Dodds, Tom Henighan, Nova DasSarma, Moumita Das, Chris Olah
Center for Security and Emerging Technology 170 Matthew Burtell, Thomas Woodside, Brendan Oliss, Lauren Kahn, Lawrence Hailes, Jenny Jun, John VerWey, Sam Bresnick, Mia Hoffmann, Cole McFaul, Brian Love, Carolina Pachón, Andrea Guerrero, Josh Goldstein, Neha Singh, Hanna Dohmen, Steph Batalis, Katherine Quinn, Vikram Venkatram, Kathleen Curlee, Christian Schoeberl, Donna Artusy, Tantum Collins, Sue Gordon, Stephanie O'Sullivan, Schuyler Moore, Santiago Mutis, Roxanne Heston, Robert Cardillo, Remco Zwetsloot, Rafay Ur Rehman Khan, Olivia Albrighton-Vanway, Michael Sulmeyer, Michael Page, Lorand Laskai, John Bansemer, Jeff Ding, Jacob Strieb, Eri Phinisee, Emelia Probasco, Emefa Addo Agawu, Elsa Kania, Darrin Gladman, Daniel Cebul, Dalila Scott, Dakota Foster, Dakota Cary, Collins Nji, Claire Perkins, Cindy Martinez, Christopher Back, Christine McNeill, Carrick Flynn, Beba Cibralic, Avonelle Davis, Aurora Johnson, Ashwin Acharya, Anna Puglisi, Amy Chao, Alan Loera, Aditi Joshi, Tina Huang, Thuy Nguyen, Ronnie Kinoshita, Nii Simmonds, Kevin Wolf, Heather Frase, Mina Narayanan, Walter Haydock, Shuvo Bardhan, Jessica Ji, Owen Daniels, Laissa A., Ella Kay, Caroline Schuerger, Sara Abdulla, Lisa Oguike, Luke Koslosky, Jack Corrigan, Channing Lee, Kyle Miller, Heeu Millie Kim, Kayla Goode, Eish Sumra, Adrienne Thompson, Shelton Fitch, Maya Gros, Ingrid Dickinson, Ali Crawford, Abelardo Cruz Osorio, Melissa Deng, Mary Hill Brooks, Lizbeth Lucero, J. Guillermo Mendoza Bazán, Filippo Fagnoni, Alex Friedland, Alan Omar Loera Martinez, Piyush Mishra, Oneeb Ul Haq Khan, Gustavo Mauricio Bastien Olvera, Will Hunt, Sean Kucer, George Klein, Diana Gehlhaus Carew, Darius Diamond, Simon Godfrey Rodriguez, Raveena Kshatriya, Max Langenkamp, Jasmine Ding, Christina Ismailos, Chris Rohlf, Bryce Farabaugh, Ashton Garriott, Andrew Lohn, Andreas Greiler-Basaldúa, Alex Barker, Katerina Sedova, Farid Nemri, Zuleirys Santana-Rodriguez, Rebecca Gelles, Jacob Feldgoise, Wyatt Hoffman, Micah Musser, Emily Weinstein, Autumn Toney, Ngor Luong, Matthew Daniels, Yiming Y., Nicolina Demakos, Emily Xue, Reginald Brothers, Charlie Wang, Alexandra Vreeman, Wenchuan Dong, Jack Clark, Melissa Flagg, Jack Lucas, Daniel Hague, Margarita Konaev, Igor Mikolic-Torreira, Catherine Aiken, Jonathan Murdick, Jennifer Melot, Huey-Meei Chang, Alexander M., Tarun Chhabra, Saif M. Khan, Saif Khan, Ryan Fedasiuk, Ilya Rahkovsky, Husanjot Chahal, Dahlia Peterson, Ben Murphy, Andrew Imbrie, Tim G. J. Rudner, Rebecca Kagan, Lynne Weil, Jamie Baker, Daniel Chou, Ben Buchanan, Benjamin Chang, William Hannas, James Dunham, Zachary Arnold, Peggy Evans, Jason Matheny, Helen Toner, Tim Hwang, Tessa Baker, Dewey Murdick
Center for Human-Compatible AI 122 Brandie Nonnecke, Henry Papadatos, Dale Reed, Ben Plaut, Bhaskar Mishra, Cameron Allen, Alexandra Souly, Tu (Alina) Trinh, Sana Pandey, Michael Cohen, Khanh Nguyen, Tiffany Wang, Jacy Reese Anthis, Brian Judge, Olivia Watkins, Niklas Lauffer, David Krueger, Leonie Richter, George Matheos, Shreyas Kapur, Nisan Stiennon, George Obaido, Erdem Biyik, Alexander Turner, Peter Barnett, Anand Siththaranjan, Yuxi Liu, Scott Emmons, Justin Svegliato, Ruairidh McLennan Battleday, Kimin Lee, James Paul Gonzales, Arnaud Fickinger, Wesley Holliday, Neel Nanda, Toni Lorente, Julia Kerley, Paria Rashidinejad, Jonathan Stray, Rafael Albert, Tom Lenaerts, Jakob Foerster, Cassidy Laidlaw, Alyssa Li Dayan, Micah Carroll, Johannes Treutlein, Jessy Lin, Harry Giles, Eric Michaud, Cynthia Chen, Charlotte Roman, Alex Gunning, Stephen Casper, Sören Mindermann, Sergei Volodin, Pedro Freire, Noor Brody, Neel Alex, Meir Friedenberg, Matthew Rahtz, Christopher Cundy, Pulkit Verma, Moritz Hardt, Brian Christian, Rediet Abebe, Nika Haghtalab, Lawrence Chan, David Lindner, Rachel Freedman, Jacob Steinhardt, Jess Riedel, Shlomi Hod, Sam Toyer, Martin Fukui, Caroline Jeanmaire, Vincent Corruble, Rohin Shah, Niko Kolodny, Brandon Perry, Michael Littman, Juliana Schroeder, Gillian Hadfield, Smitha Milli, Ken Goldberg, John Zysman, Dylan Hadfield-Menell, Demian Pouzo, Dawn Song, Daniel Filan, Charis Thompson, Alison Gopnik, Monica Gates, Marion Fourcade, Lara Buchak, Dan Hendrycks, Cody Wild, Steven Wang, Rosie Campbell, Mariano Florentino Cuéllar, Tania Lombrozo, Siddharth Srivastava, Adam Gleave, Beth Barnes, Elizabeth Barnes, Dmitrii Krasheninnikov, Andrew Critch, Mark Nitzberg, Karthika Mohan, Joseph Halpern, Bart Selman, Anca Dragan, Tom Griffiths, Stuart Russell, Satinder Singh Baveja, Pieter Abbeel, Michael Wellman, Vael Gates, Thanard Kurutach, Michael Dennis, Thomas Krendl Gilbert, Jaime Fernandez Fisac, Dorsa Sadigh
Google DeepMind 91 Stanislav Fort, Ruiqi Gao, Aditya Srikanth Veerubhotla, Yonghui Wu, Shixiang Shane Gu, Gargi Balasubramaniam, Nithya Attaluri, Thibault Sellam, Rohan Anil, Rishabh Joshi, Pierre Sermanet, Isabel Leal, Anushka Nijhawan, Abhishek Rao, Sergio Guadarrama, Roopali (Paali) V., Raphael Hoffmann, Been Kim, Azade Nova, Piyush Patil, Nidhi Vyas, Wilfried L. Bounsi, Sebastian Riedel, João Gabriel Lopes, Dmitry Nikulin, Sholto Douglas, Grace Lam, Kavya Kopparapu, Arthur Douillard, Shreya Pathak, Mehdi Jafarnia, Yousuf Khan, Paige Bailey, Krishna Haridasan, Ian Goodfellow, Blanca Huergo, Pratik Joshi, Daniel Sohn, Sridhar Thiagarajan, Ruizhe Zhao, Will Grathwohl, Yayi Zou, Shubham Agrawal, Hamze M., David Stutz, Yuzhu Dong, Sho Arora, Keerthana Gopalakrishnan, Sylvestre Rebuffi, Jennifer She, Ira Ktena, Praneet Dutta, Pauline (Luc) Luc, Behnam Neyshabur, Paul Muller, Shantanu Thakoor, Petar Veličković, Yasaman Bahri, Durk Kingma, Lila Ibrahim, Gheorghe Comanici, Vishal Maini, Hanie Sedghi, Verity Harding, Sean Legassick, John Jumper, Zachary Gleicher, Laurel Wagstaff, Gagan Bansal, Daniel J. Mankowitz, Pushmeet Kohli, Vandana Bachani, Miljan Martic, Andrew Lefrancq, Pedro A. Ortega, Koray Kavukcuoglu, Shane Legg, Mustafa Suleyman, Demis Hassabis, Tom Everitt, Victoria Krakovna, Laurent Orseau, Nick Bostrom, Chris Maddison, Jeffrey D. Sachs, Jan Leike, James Manyika, Edward W. Felten, Diane Coyle, Christiana Figueres, Thore Graepel
Future of Humanity Institute 71 Patrick Butlin, Elise Bohan, Janvi Ahuja, Matthew van der Merwe, Peter Wills, Isaac Friend, Maria Violaris, Joar Skalse, Hannah Klim, Tushant Jha, Sam Clarke, Mrinank Sharma, Karolina Milewicz, Duncan Snidal, Michael Osborne, Jacob Lagerros, Michael Cohen, Jan Brauner, Lewis Hammond, Thomas Orton, Roger Grosse, Carla Zoe Cremer, Michael Montague, David Manheim, Ondrej Bajgar, Gregory Lewis, Baobao Zhang, Ryan Carey, Helen Toner, Michael Bonsall, Tom McGrath, Jan Leike, Robin Hanson, Paul Christiano, Ben Garfinkel, Sören Mindermann, Christopher Cundy, William Saunders, Neal Jean, Girish Sastry, Tamay Besiroglu, Clare Lyle, David Kristoffersson, Piers Millett, John Salvatier, David Krueger, David Abel, Allan Dafoe, Carrick Flynn, Niel Bowerman, Kyle Scott, Andrew Snyder-Beattie, Daniel Dewey, Toby Ord, Anders Sandberg, Carl Shulman, Nick Bostrom, Vincent C. Müller, Stuart Armstrong, Sebastian Farquhar, Seán Ó hÉigeartaigh, Roxanne Heston, Owain Evans, Miles Brundage, Katja Grace, Jeffrey Ding, Jade Leung, Eric Drexler, Daniel Filan, Chelsea Guo, Cecilia Tilli
Future of Life Institute 64 Isabella Hampton, Tim Schreier, Alexandra Tsalidis, Hamza Tariq Chaudhry, Maggie Munro, Ben Eisenpress, Landon Klein, Fazl Barez, Anna Hehir, Claudia Prettner, Akhil Deo, Taylor Jones, Risto Uuk, Andrea Berman, Mark Brakel, Carlos Ignacio Gutierrez, Anna Yelizarova, David E. Nicholson, Emilia Javorsky, Alan Yan, Tucker Davey, Na Li, Jacob Beebe, Yishuai Du, Lucas Perry, Melody Guan, David Stanley, Maxim Kesin, Daniel Dewey, Ariel Conn, Meia Chita-Tegmark, Jaan Tallinn, Anthony Aguirre, Richard Mallah, Ales Flidr, Victoria Krakovna, Jesse Galef, Zara Yaqoob, William Jones, Vera Koroleva, Stuart Russell, Stephen Hawking, Saul Perlmutter, Sandra Faber, Rafael Martinez-Galarza, Peter Haas, Nick Bostrom, Morgan Freeman, Max Tegmark, Martin Rees, Kazue Evans, Janos Kramar, George Church, Frank Wilczek, Francesca Rossi, Eric Gastfriend, Erik Brynjolfsson, Elon Musk, Daniel R. Miller, Christof Koch, Chase Moores, Blake Pierson, Alan Guth, Alan Alda
Flowers Laboratory 63 Guillermo Valle, Masataka Sawayama, Eleni Nisioti, Cécile Mazon, Maxime Adolphe, Julius Taylor, Clément Moulin-Frier, Tristan Karch, Laetitia Teodorescu, Mayalen Etcheverry, Benjamin Clément, Grgur Kovac, Hélène Sauzéon, Nathalie Robin, Catherine Cattaert-Megrat, Timothée Lesort, Hugo Caselles-Dupré, Alexander Ten, Remy Portelas, Cédric Colas, Alvaro Ovalle Castaneda, Anna-Lisa Vollmer, Stéphanie Noirpoudre, Loïc Dauphin, Florian Golemo, Sébastien Forestier, William Schueller, Cem Karaoguz, Clément Masson, Adrien Matricon, Nicolas Rabault, Matthieu Lapeyre, Pierre Rouanet, Nicolas Jahier, Alexandre Gepperth, Céline Craye, Alexandra Delmas, Gennaro Raiola, Baptiste Busch, Panagiotis Papadakis, Yoan Mollard, Thibaut Munzer, Didier Roy, Freek Stulp, Thomas Degris, Jonathan Grizou, Guillaume Duceux, Louis-Charles Caron, Manuel Lopes, Olivier Mangin, Natalia Lyubova, Olivier Ly, Fabien Benureau, Paul Fudal, Thomas Cederborg, David Filliat, Adrien Baranes, Jérome Béchu, Pierre-Yves Oudeyer, Damien Caselli, Théo Segonds, Natalia Diaz Rodriguez (this list is partial)
Leverhulme Centre for the Future of Intelligence 46 Kofi Yeboah, Irene Pellegero Querol, Henry Shevlin, Flavia Saxler, Malak Sadek, Niall Donnelly, Toshie Takahashi, Matthijs M. Maas, Haydn Belfield, Rafael A. Calvo, Carla Zoe Cremer, Nick Bostrom, Neil Lawrence, Murray Shanahan, Michael A. Osborne, Martin Rees, Marta Halina, Margaret Boden, Manuela M. Veloso, Lucy Cheke, Kay Firth-Butterfield, Karina Vold, Kanta Dihal, José Hernández-Orallo, Huw Price, Heather Roff, Francesca Rossi, Demis Hassabis, David Runciman, Beth Singler, Anna Alexandrova, Andrew Snyder-Beattie, Alison Gopnik, Alan Winfield, Adrian Weller, Zoubin Ghahramani, Thomas D. Grant, Tameem Adel, Susan Gowans, Stuart Russell, Stephen John, Stephen Cave, Seán Ó hÉigeartaigh, Sarah Dillon, Rune Nyrup, Philip Pettit
Center for Applied Rationality 43 Tara Mac Aulay, Maria Eduarda Rodrigues Sampaio, Arsalaan Alam, Kyle Scott, Logan Brienne Strohl, Kathryn Schmiedicke, Xavier Prospero, Brienne Strohl, Dan Keys, Luke Raskopf, Duncan Sabien, Adom Hartell, Tsvi Benson-Tilsen, Timothy Telleen-Lawton, Stephanie Zolayvar, Qiaochu Yuan, Matthew Graves, Jordan Tyrrell, Eric Rogstad, Elizabeth Garrett, Ben Goldhaber, Adam Scholl, Jack Carroll, Eli Tyre, Michael Keenan, Michael Blume, Lauren Lee, Jesse Liptrap, Cat Lavigne, Ben Sancetta, Gina Stuessy, Pete Michaud, Eric Chisholm, Daniel Colson, Davis Kingsley, Michael Smith, Oliver Habryka, Andrew Critch, Leah Libresco, Kenzi Amodei, Julia Galef, Gwern Branwen, Anna Salamon
Effective Altruism Foundation 40 Jia Yuan Loke, Daniel Kokotajlo, Paul Knott, Mojmír Stehlík, Michael Aird, Julian Stastny, Jesse Clifton, Hadrien Pouget, Eric Chen, Emery Cooper, Anthony DiGiovanni, Ali Merali, Maxime Riché, Alexander Lyzhov, Amrit Sidhu-Brar, Linh Chi Nguyen, Ulla Wessels, Olle Häggström, Ole Martin Moen, Lucius Caviola, David Pearce, Daniel Rüthemann, Adrian Hutter, Stefan Torges, Stefan Klein, Sascha Fink, Sarah Dörpinghaus, Rajshri Jayaraman, Melinda Lohmann, Lukas Gloor, Klaus Wälde, Dina Pomeranz, Brian Tomasik, Anni Leskelä, Thomas Metzinger, Persis Eskander, Ozy Brennan, Johannes Treutlein, David Althaus, Max Daniel
Global Catastrophic Risk Institute 37 Anthony M. Barrett, Dakota Norris, Allan Suresh, Uliana Certan, Andrea Owe, Kyle L. Evanoff, McKenna Fitzgerald, Oliver Couttolenc, Jared Brown, Robert de Neufville, John Garrick, Seán Ó hÉigeartaigh, Adam Scholl, Marilyn Cotrich, Lena Wang, Jenny Mith, Matthijs Maas, Jessica Cianci, Trevor White, Roman Yampolskiy, Gary Ackerman, Caroline Zaw-Mon, Dave Denkenberger, David Denkenberger, Arden Rowell, Jianhua Xu, U. Tuncay Alparslan, Steven Umbrello, Jacob Haqq-Misra, Mark Fusco, Kaitlin Butler, Grant Wilson, Tim Maher, Matt Moretto, Kelly Hostetler, Tony Barrett, Seth Baum
GoodAI 34 Karolína H., Ryan Camilleri, Jose Solorzano, Alex Angelini, Dominik Čech, Sarka Krejcova, David Castillo, Stephanie Wendler, Reham Bukhari, Šimon Šicko, Lucia Šicková, Nicholas Guttenberg, Viktorie Knezkova, Steffen Eichler, Isabeau Premont-Schwarz, Filip Hauptfleisch, Petr Sramek, Jan Štafa, Michal Dvořák, Christine Lee, Will Millership, Lucie Krestova, Marek Havrda, Jan Feyereisl, Olga Afanasjeva, Martin Poliak, Marek Rosa, Simon Andersson, Přemek Paška, Jaroslav Vitku, Shantesh Patil, Petr Hlubuček, Joseph Davidson, Ege Atici
Center for AI Safety 32 Rebecca Rothwell, Ayush Panda, Isabelle Barrass, Zifan Wang, Matthias Hein, David Bau, Long Phan, Ayham Al-Saffar, Xuwang Yin, Aidan O'Gara, Suryansh Mehta, Corin Katzke, Marc Carauleanu, Max Kaufmann, David Lambert, Sidney Hough, Scott Emmons, Michael Chen, Mantas Mazeika, Kevin Liu, Jun Shern Chan, Dan Hendrycks, Andy Zou, Nathaniel Li, Joshua Clymer, Anders Edson, Alex Pan, Madhav Malhotra, Steven Basart, Rune Kvist, Oliver Zhang, Thomas Woodside
Fund for Alignment Research 30 Oskar Hollinsworth, Philip Quirke, Lindsay Murachver, Saad Siddiqui, Anastasiia Gaidashenko, Vael Gates, Siao Si Looi, Isabella Duan, Conor McGurk, Moritz von Knebel, Fynn Heide, Ben Goldhaber, Adrià Garriga-Alonso, Niki Howe, Adam Gleave, Sawyer Bernath, Lawrence Chan, Kellin Pelrine, Karl Berzins, Mohammad Taufeeque, Hannah Betts, Tom Tseng, Nora Belrose, Tomasz Korbak, Scott Emmons, Jun Shern Chan, Jérémy Scheurer, Ian McKenzie, Ethan Perez, Claudia Shi
Ought 29 Sarah Park, Adrian Smith, Luke Stebbing, Charlie George, James Brady, Ian McKenzie, Justin Reppert, Eli Lifland, Amanda Ngo, Aparna Ashok, Jungwon Byun, Paul Christiano, Ozzie Gooen, Owain Evans, Neal Jean, Milan Griffes, Girish Sastry, Chris Cundy, Ben Weinstein-Raun, Ben Goldhaber, Andrew Schreiber, Ben Rachbach, Zachary Miller, Zac Kenton, Tom McGrath, Noah Goodman, Ben West, Ryan Carey, Andreas Stuhlmüller
AI Safety Camp 27 Kristi Uustalu, Sebastian Kosch, Nix Goldowsky-Dill, JJ Hepburn, Cynthia Yoon, Colin Bested, Andrew Player, Tomáš Gavenčiak, Sabrina Kavanagh, Fabian Steuer, David Lindner, Brandon Perry, Tom McGrath, Nandi Schoots, Markus Salmela, Maia Pasek, Linda Linsefors, Karol Kubicki, David Kristoffersson, Remmelt Ellen, Jessica Cooper, Kristina Nemcova, Jirí Nadvorník, Anne Wissemann, Jan Kulveit, Johannes Heidecke, Sai Joseph
Centre for the Study of Existential Risk 27 Matthew Connelly, Fazl Barez, Haydn Belfield, Huw Price, Jaan Tallinn, Martin Rees, Simon Beard, Adrian Weller, Sean Holden, Stephen Hawking, Tim Crane, Max Tegmark, Murray Shanahan, Dana Scott, Stuart Russell, Elon Musk, Alison Gopnik, David Chalmers, Nick Bostrom, Margaret Boden, Ryan Carey, Martina Kunz, Seth Baum, Beth Barnes, Yang Liu, Shahar Avin, Seán Ó hÉigeartaigh
ContinualAI 27 James Smith, Andrea Cossu, Martin Mundt, Tyler Hayes, Irina Rish, Itamar Arel, Subutai Ahmad, Massimiliano Versace, Razvan Pascanu, David Lopez Paz, Eugenio Culurciello, Marc Pickett, Xu Ji, Akshita Gupta, Alec Diallo, Ayşin Sancı, Ghada Sokar, Bing Liu, Joost van de Weijer, Christopher Kanan, Tinne Tuytelaars, Timothée Lesort, Natalia Díaz Rodríguez, Davide Maltoni, German I. Parisi, Keiland Cooper, Vincenzo Lomonaco
Alignment Research Center 25 Eric Neyman, Rebecca Baron, Rae She, Ted Suzman, Amanda She, Jacob Hilton, Tao Lin, Chris Painter, Quentin Feuillade-Montixi, Megan Kinniment, Luke Miles, Lucas Sato, Haoxing Du, Emma Abele, Brian Goodrich, Hjalmar Wijk, Lawrence Chan, Aryan Bhatt, Timothy Kokotajlo, Josh Jacobson, Elizabeth Barnes, Max Hasin, Mark Xu, Kyle Scott, Paul Christiano
Berkeley Existential Risk Initiative 25 Elizabeth Cooper, Kyle Scott, Sofia Davis-Fogel, James Paul Gonzales, Jess Riedel, Andrew Critch, Sawyer Bernath, Stuart Russell, Alex Flint, Josh Jacobson, Sam Bankman-Fried, Matt Fallshaw, Colleen Gleason, Jeremy Schlatter, Jaan Tallinn, Rebecca Raible, Kenzi Amodei, Jacob Tsimerman, Qiaochu Yuan, Eric Rogstad, Andrew Snyder-Beattie, Michael Keenan, Gina Stuessy, Seán Ó hÉigeartaigh, Malo Bourgon
Redwood Research 25 Guilhermo Cutrim Costa, Tyler Storlie, Luke Sallmen, Cienna Rominger, Ryan Greenblatt, Noa Nabeshima, Tao Lin, Peter Schmidt-Nielsen, Daniel Ziegler, Aqeel Ali, Ben Weinstein-Raun, Paul Christiano, Owen Cotton-Barratt, Malo Bourgon, Holden Karnofsky, James Bregan, Claire Zabel, Blake Borgeson, Bill Zito, Ajeya Cotra, Adam Scherlis, Seraphina Nix, Royston Noronha, Buck Shlegeris, Nathaniel Thomas
General AI Challenge 24 Tomas Mikolov, Tak Lo, Ryota Kanai, Roman Yampolskiy, Rodolfo Rosini, Pavel Kordik, Ling Ge, Julian Togelius, José Hernández-Orallo, Jan Romportl, Ivan Zelinka, Ayako Fukui, Alison Lowndes, Will Millership, Olga Afanasjeva, Virginia Dignum, Miles Brundage, Marek Rosa, Marek Havrda, Jan Sekerka, Jan Pospíšil, Irakli Beridze, Frank Dignum, Danit Gal
Conjecture 23 Mihir Rege, Jan Michelfeit, Maris Sala, Beren Millidge, Daniel Braun, Adam Shimi, Myriame Honnay, Lee Sharkey, Katrina Joslin, Jonathan Low, Janko Prester, Carlos Guevara, Caelum Forder, Rachel Stockton, Andrea Miotti, Gabriel Alfour, Sid Black, Laria Reynolds, Kip Parker, Jacob Merizian, Chris Scammell, Kyle McDonell, Connor Leahy
AI Impacts 20 Aysja Johnson, Jeffrey Heninger, Zach Stein-Perlman, Harlan Stewart, Jimmy Rintjema, Richard Korzekwa, Ronny Fernandez, Katja Grace, Daniel Kokotajlo, Asya Bergal, Ronja Lutz, Tegan McCaslin, Paul Christiano, Ben Hoffman, Justis Mills, Connor Flexman, Finan Adamson, Michael Wulfsohn, John Salvatier, Stephanie Zolayvar
Epoch 19 Virginia Blanton, Josh You, Keith Wynroe, Jenny Xiao, David Atkinson, Daniela Amodei, Ben Cottier, Maria da Lama, Matthew Barnett, Ege Erdil, Tom Davidson, Tamay Besiroglu, Pablo Villalobos, Neil Thompson, Marius Hobbhahn, Lennart Heim, Jaime Sevilla, Eduardo Infante-Roldán, Anson Ho
Stanford University 17 Siddharth Karamcheti, Peter Henderson, Pratyusha Kalluri, Dorsa Sadigh, Stanislav Fort, Cody Coleman, Stefano Ermon, Michael Webb, Percy Liang, Alex Aiken, Jacob Steinhardt, Noah D. Goodman, Andreas Stuhlmüller, Aditi Raghunathan, Alex Tamkin, Thomas Icard, Ray Briggs
Cooperative AI Foundation 15 Natasha Jaques, Rebecca Eddington, David Norman, Cecilia Elena Tilli, Michelle Virgo, Akbir Khan, Vincent Conitzer, Lewis Hammond, Jesse Clifton, Ruairi Donnelly, Gillian Hadfield, Eric Horvitz, Dario Amodei, Allan Dafoe, Amrit Sidhu-Brar
Nonlinear 15 Luca De Leo, Matt Putz, Deena Englander, Aaron Bergman, Emerson Spartz, Tristan Cook, Peter Barnett, Daniel del Castillo, Chris Leong, Corey Wood, Kat Woods, Spencer Greenberg, Robert Miles, David Moss, Alex Zhu
University of California, Berkeley 15 Dan Hendrycks, Max Simchowitz, Andrew Critch, Tom Kalil, Stuart Russell, Pieter Abbeel, Smitha Milli, Dylan Hadfield-Menell, Anca Dragan, Sergey Levine, Paul Christiano, Qiaochu Yuan, Michael Janner, Frances Ding, Lydia T. Liu
Lightcone Infrastructure 13 Robert Mushkatblat, Rafe Kennedy, Jacob Lagerros, Ruben Bloom, Raymond Arnold, Kaj Sotala, Matthew Graves, Harmanas Chopra, Eric Rogstad, Ben Albert Pace, Oliver Habryka, James Babcock, Elizabeth Van Nostrand
University of Oxford 10 Charlie Rogers-Smith, Mrinank Sharma, Allan Dafoe, Ruth Fong, Chris Maddison, Owain Evans, Michael Wooldridge, Aidan Gomez, Nick Bostrom, Heather Roff
Whole Brain Architecture Initiative 9 Koji Morikawa, Hideyuki Nakashima, Hiroyuki Morikawa, Masaru Tomita, Kitano Hiroaki, Kenji Doya, Koichi Takahashi, Yutaka Matsuo, Hiroshi Yamakawa
Road to AI Safety Excellence 8 Remmelt Ellen, Trent Fowler, Erik Istre, Rupert McCallum, Robert Miles, Johannes Heidecke, Veerle de Goederen, Toon Alfrink
Theiss Research 8 Karina Torres Castro, Rebecca Bone, Rich Lew, Soraya Bernal, Sebastian Engmann, Brian Nablo, Rodrigo Duran, Jack Glover
7 Joe Collman, Angela P., Anand Srinivasan, Orpheus Lummis, Stag Lynn, Alexander Gietelink Oldenziel, Tegan McCaslin
Australian National University 7 Elliot Catt, Tom Everitt, Jan Leike, Marcus Hutter, Jarryd Martin, Gary Lea, Alan Hájek
Palisade Research 7 Timothee Chauvin, Simon Lermen, Pranav Gade, Kyle Scott, Karina Belokapov, Jeffrey Ladish, Charlie Rogers-Smith
Foundational Research Institute 6 Brian Tomasik, Max Daniel, Kaj Sotala, Caspar Oesterheld, Lukas Gloor, Tobias Baumann
Google Brain 6 Jeremy Nixon, Melody Guan, Tom Brown, Dan Mané, Dario Amodei, Christopher Olah
University of Cambridge 6 Richard Ngo, Adrian Weller, Ramana Kumar, Arif Ahmed, Huw Price, Yang Liu
Association for Long Term Existence and Resilience 5 Vanessa Kosoy, Joshua Fox, Gidon Kadosh, Edo Arad, David Manheim
Carnegie Mellon University 5 Leqi Liu, Noam Brown, Manuela Veloso, Andre Platzer, David Danks
Convergence Analysis 5 Ozzie Gooen, Claire Abu-Assal, Kristian Rönn, Andrew X Stewart, Justin Shovelain
EthicsNet 4 Aleksandra Orchowska, Remco Bloemen, Anish Mohammed, Nell Watson
Massachusetts Institute of Technology 4 Joshua Brett Tenenbaum, Jon Gauthier, Julius Adebayo, Andrew Ilyas
Montreal Institute for Learning Algorithms 4 Zac Kenton, Doina Precup, Joelle Pineau, Yoshua Bengio
Center for AI Policy 3 Jason Green-Lowe, Thomas Larsen, Jakub Kraus
Centre for Effective Altruism 3 Johannes Treutlein, Ales Flidr, Owen Cotton-Barratt
Cornell University 3 Bart Selman, Jim Babcock, Joseph Halpern
eCortex 3 Randall C. O’Reilly, Seth J. Herd, David J. Jilk
Foresight Institute 3 Christine Peterson, Mark S. Miller, Allison Duettmann
IDSIA 3 Bas R. Steunebrink, Jürgen Schmidhuber, Mark Ring
Open Philanthropy 3 Jacob Steinhardt, Dario Amodei, Paul Christiano
SUPSI 3 Jürgen Schmidhuber, Bas R. Steunebrink, Mark Ring
Università della Svizzera italiana 3 Jürgen Schmidhuber, Bas R. Steunebrink, Mark Ring
University of Toronto 3 Dami Choi, Roger Grosse, Joshua Gans
Yale University 3 Wendell Wallach, Allan Dafoe, Daniel Eth
AIDEUS 2 Sergey Rodionov, Alexey Potapov
Arizona State University 2 Heather Roff, Miles Brundage
Center for a New American Security 2 Gregory C. Allen, Paul Scharre
Center for Human Success 2 Wyatt Tessari, David Yu
Encultured AI 2 Andrew Critch, Nick Hay
Endgame 2 Hyrum Anderson, Bobby Filar
Google 2 Marcello Herreshoff, Vladimir Slepnev
Learning Intelligent Distribution Agent 2 Tamas Madl, Stan Franklin
Linköping University 2 Mikael Böörs, Tobias Wängberg
London School of Economics 2 Katie Steele, Wlodek Rabinowicz
Ludwig Maximilian University of Munich 2 Stephan Hartmann, Reuben Stern
Oregon State University 2 Alex Turner, Thomas Dietterich
UCLA School of Law 2 Richard Re, Edward Parson
University College London 2 John Shawe-Taylor, Tobias Baumann
University of Michigan 2 Michael Wellman, James M. Joyce
University of Wisconsin–Madison 2 Reuben Stern, Patrick LaVictoire
Aalto University 1 Jelena Luketina
AgroParisTech 1 Laurent Orseau
Aix-Marseille University 1 Sergey Rodionov
American University 1 Thomas Zeitzoff
Bar-Ilan University 1 Ram Rachum
Birkbeck, University of London 1 Ulrike Hahn
Broad Institute of MIT and Harvard 1 Gopal Sarma
Brown University 1 David Abel
California Institute of Technology 1 Frederick Eberhardt
Carleton University 1 Andrew MacFie
Center for Analysis & Design of Intelligent Agents 1 Kristinn R. Thórisson
CogPrime 1 Ben Goertzel
Czech Technical University 1 Vojtěch Kovařík
Data61 1 Ramana Kumar
Duke University 1 Vincent Conitzer
Electronic Frontier Foundation 1 Peter Eckersley
ETH Zurich 1 Felix Berkenkamp
George Mason University 1 Robin Hanson
Georgia Institute of Technology 1 Fuxin Li
Global Politics of Artificial Intelligence Research Group at Yale University and University of Oxford 1 Matthijs Maas
Hague Centre for Strategic Studies 1 Matthijs Maas
Harvard University 1 David Parkes
Icelandic Institute for Intelligent Machines 1 Kristinn R. Thórisson
Information Society Project 1 Rebecca Crootof
INRA 1 Laurent Orseau
Institute for Future Studies 1 H. Orri Stefánsson
Institute for Theoretical Studies at ETH Zurich 1 Will Sawin
Institute of Ethics and Emerging Technologies 1 Steven Umbrello
Internet Archive 1 Brewster Kahle
ITMO University 1 Alexey Potapov
Legal Priorities Project 1 Nick Hollman
Lingnan University 1 Jiji Zhang
Moscow Institute of Physics and Technology 1 Vladimir Shakirov
Munich Center for Mathematical Philosophy 1 Catrin Campbell-Moore
Nanyang Technological University 1 Preston Greene
NARS 1 Pei Wang
National Bureau of Economic Research 1 Joshua Gans
New America Foundation 1 Heather Roff
New York University 1 Ethan Perez
Nilcons 1 Mihaly Barasz
NNAISENSE 1 Bas R. Steunebrink
organization 1 Robert Sandler
Oxford University 1 Joar Skalse
Phenomenological AI Safety Research Institute 1 G Gordon Worley III
Princeton University 1 Will Sawin
Quebec Artificial Intelligence Institute 1 Vincent Luczkow
Quixey 1 Patrick LaVictoire
Real AI 1 Jonathan Yan
Rice University 1 Moshe Vardi
Self-Aware Systems 1 Steve Omohundro
Smith College 1 James Miller
Social & Environmental Entrepreneurs 1 Seth Baum
Sorbonne University 1 Michaël Trazzi
Susaro 1 Richard Loosemore
Teesside University 1 The Anh Han
Texas A&M University 1 Kenny Easwaran
The Australian National University 1 Michael Cohen
The Consortium on the Landscape of AI Safety 1 Alexis Carlier
The New School 1 Peter Asaro
Ulm University 1 Daniel Alexander Braun
Université de Montréal 1 Jelena Luketina
University of Alberta 1 Tor Lattimore
University of Amsterdam 1 Dmitrii Krasheninnikov
University of Arizona 1 Jenann Ismael
University of Bath 1 Joanna Bryson
University of Bristol 1 Benya Fallenstein
University of Colorado 1 Seth Herd
University of Colorado Boulder 1 Randall C. O’Reilly
University of Copenhagen 1 Matthijs Maas
University of Edinburgh 1 Angelo Frank De Bellis
University of Illinois at Chicago 1 Brian Ziebart
University of Louisville 1 Roman Yampolskiy
University of Melbourne 1 Benjamin Rubinstein
University of Montreal 1 Janos Kramar
University of New Hampshire 1 Andrew Ware
University of Padova 1 Francesca Rossi
University of Southern California 1 Stephen J. Read
University of Texas 1 Peter Stone
University of Washington 1 Daniel Weld
Washington University in St. Louis 1 Julia Haas

Individuals not affiliated with any organization

Showing 17 people.

Name Website Source
Wei Dai http://www.weidai.com/ [1], [2]
Iceman [3], [4], [5], [6], [7]
Max Harms http://raelifin.com/ [8], [7]
Jeff Kaufman https://www.jefftk.com [9], [7]
Federico Pistono http://federicopistono.org/ [10]
Chris Pasek [11], [12]
Sune Kristian Jakobsen
Hilary Greaves [13]
Sophie-Charlotte Fischer [14]
Alexey Turchin https://avturchin.livejournal.com/ [15], [16], [17], [18]
Dustin Juliano http://dustinjuliano.com/ [19], [20]
Matteo Turchetta [21]
Angela P. Schoellig [21]
Andreas Krause [21]
Jim O’Neill [22]
Gordon Irlam http://www.gordoni.com/ [23]
John Maxwell [24]

Products

This section lists AI safety-related “products”: interactive tools, websites, flowcharts, datasets, etc. Unlike documents, products tend to be interactive, are updated continually, or require inputs from the consumer.

Showing 33 products.

Name Type Creator Creation date Description
Clarifying some key hypotheses in AI alignment diagram Ben Cottier, Rohin Shah 2019-08-15 A diagram collecting several hypotheses in AI alignment and their relationships to existing research agendas.
AI Alignment Forum blog LessWrong 2.0 2018-07-10 A group blog for discussion of technical aspects of AI alignment. The forum is built using the same software as LessWrong 2.0, and is integrated with LessWrong 2.0. For creation date, see [25].
AI Safety Research Camp workshop Tom McGrath, Remmelt Ellen, Linda Linsefors, Nandi Schoots, David Kristoffersson, Chris Pasek 2018-02-01 A research camp to take place in Gran Canaria in April 2018 and in the United Kingdom in July–August 2018. Facebook group at [26]. The creation date is the date of announcement on LessWrong 2.0.
“Levels of defense” in AI safety flowchart Alexey Turchin 2017-12-12 A flowchart applying multilevel defense to AI safety. There is an accompanying post on LessWrong at [27].
AI Alignment Prize contest Zvi Mowshowitz, Vladimir Slepnev, Paul Christiano 2017-11-03 A prize for work that advances understanding in alignment of smarter-than-human artificial intelligence. Winners for the first round, as well as announcement of the second round, can be found at [24]. Winners for the second round, as well as announcement of the third round, can be found at [28].
AI Watch interactive application Issa Rice 2017-10-23 A website to track people and organizations working on AI safety.
AI Safety Open Discussion discussion group Mati Roy 2017-10-23 A Facebook discussion group about AI safety. This is an open group.
AI safety resources list Victoria Krakovna 2017-10-01 A list of resources for long-term AI safety. Seems to have been first announced at [29].
Map of the AI Safety Community graphic Søren Elverlin 2017-09-26 A pictorial map that lists organizations and individuals in the AI safety community.
Open Philanthropy Project AI Fellows Program fellowship Open Philanthropy 2017-09-12 A fellowship to support PhD students in AI and machine learning. For the creation date, see [30].
LessWrong 2.0 blog LessWrong 2.0 2017-06-18 A community blog about rationality, decision theory, AI, the rationality community, and other topics relevant to AI safety. This is a re-launch/modernization of the original LessWrong. For the launch date, the date of the welcome post [31] is used.
Road to AI Safety Excellence course Toon Alfrink 2017-06-15 A proposed course that is designed to produce AI safety researchers. It used to be called “Accelerating AI Safety Adoption in Academia” and was announced on LessWrong at [32]. The Facebook group was created on 2017-06-30 [33].
Annotated bibliography of recommended materials list Center for Human-Compatible AI 2016-12-01 An annotated and interactive bibliography of AI safety-related course materials, textbooks, videos, papers, etc.
Extinction Risk from Artificial Intelligence blog Michael Cohen 2016-06-01 A series of pages exploring arguments for and against working on AI safety. The creation date is inferred from the URLs of images (example: [34]).
AI Alignment blog Paul Christiano 2016-05-28 Paul Christiano’s blog about AI alignment.
AISafety.com Reading Group discussion group Søren Elverlin, Erik B. Jacobsen, Volkan Erdogan 2016-05-24 A weekly reading group covering topics in AI safety.
Cause prioritization app interactive application Michael Dickens, Buck Shlegeris 2016-05-18 An interactive app for quantitative cause prioritization. The app includes a section [35] on AI safety intervention. The creation date is the date of the first commit in the Git repository [36].
Arbital AI alignment domain wiki Arbital, Eliezer Yudkowsky 2016-03-04 A collection of wiki-like pages on topics in AI alignment. The creation date is the date of the launch announcement for Arbital [37]; it’s unclear when the AI alignment domain itself was created.
Introductory resources on AI safety research list Victoria Krakovna 2016-02-28 A list of readings on long-term AI safety. Mirrored at [38]. There is an updated list at [39].
AI Safety Discussion discussion group Victoria Krakovna 2016-02-21 A Facebook discussion group about AI safety. This is a closed group so one needs to request access to see posts.
Reinforce.js implementation of Stuart Armstrong’s toy control problem interactive application Gwern Branwen, FeepingCreature 2016-02-03 A live demo of Stuart Armstrong’s toy control problem [40]. gwern introduced the demo in a LessWrong comment [41].
AI Policies Wiki wiki Gordon Irlam 2015-12-14 A wiki on AI policy. The wiki creation date can be seen in the revision history of the main page [42].
The Control Problem discussion group CyberPersona 2015-08-29 A subreddit about AI safety and control. For the subreddit creation date, see [43].
AGI Failures Modes and Levels map flowchart Alexey Turchin 2015-01-01 A flowchart about failure modes of artificial general intelligence, grouped by the stage of development. There is an accompanying post on LessWrong at [18].
AGI Safety Solutions Map flowchart Alexey Turchin 2015-01-01 A flowchart on potential solutions to AI safety. There is an accompanying post on LessWrong at [44].
Intelligent Agent Foundations Forum discussion group Machine Intelligence Research Institute 2014-11-04 A forum for technical AI safety research. The source code is hosted on GitHub [45]. The timestamp on the introductory post [46] gives the launch date.
A flowchart of AI safety considerations flowchart Eliezer Yudkowsky 2014-11-02 The flowchart was posted to Eliezer Yudkowsky’s Essays (a Facebook group) and has no title.
Effective Altruism Forum blog Centre for Effective Altruism, Rethink Charity, Ryan Carey 2014-09-10 A community blog about effective altruism which often has posts about AI safety. The forum was announced on LessWrong by Ryan Carey [47].
How to study superintelligence strategy list Luke Muehlhauser 2014-07-03 A list of project ideas in superintelligence strategy.
Ordinary Ideas blog Paul Christiano 2011-12-21 Paul Christiano’s blog about “weird AI stuff” [48].
The Uncertain Future interactive application Machine Intelligence Research Institute 2009-10-01 A tool to model future technology and its effect on civilization. For more about the history of the site, see [49].
LessWrong Wiki wiki Machine Intelligence Research Institute 2009-03-12 A companion wiki to the community blog LessWrong. The wiki has pages about AI safety.
LessWrong blog Machine Intelligence Research Institute 2009-02-01 A community blog about rationality, decision theory, AI, and updates to MIRI, among other topics.