Rohin Shah

16 Podcast Episodes

“Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla” by Neel Nanda, Tom Lieberum, Matthew Rahtz, János Kramár, Geoffrey Irving, Rohin Shah, vlad_m

“Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla” by Neel Nanda, Tom Lieberum, Matthew Rahtz, János Kramár, Geoffrey Irving, Rohin Shah, vlad_m

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.Cross-posting a paper from the Goo... Read more

#154 - Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters

#154 - Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters

Can there be a more exciting and strange place to work today than a leading AI lab? Your CEO has said they're worried yo... Read more

9 Jun 2023

3hr 9mins

Similar People

Four: Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters

Four: Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters

Can there be a more exciting and strange place to work today than a leading AI lab? Your CEO has said they're worried yo... Read more

16 May 2023

3hr 9mins

Rohin Shah

Rohin Shah

Dr. Rohin Shah is a Research Scientist at DeepMind, and the editor and main contributor of the Alignment Newsletter.Feat... Read more

12 Apr 2022

1hr 37mins

Most Popular

AF - Shah and Yudkowsky on alignment failures by Rohin Shah

AF - Shah and Yudkowsky on alignment failures by Rohin Shah

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

28 Feb 2022

2hr 24mins

Rohin Shah on the State of AGI Safety Research in 2021

Rohin Shah on the State of AGI Safety Research in 2021

Rohin Shah, Research Scientist on DeepMind's technical AGI safety team, joins us to discuss: AI value alignment; how an ... Read more

2 Nov 2021

1hr 43mins

AF - [AN #168]: Four technical topics for which Open Phil is soliciting grant proposals by Rohin Shah

AF - [AN #168]: Four technical topics for which Open Phil is soliciting grant proposals by Rohin Shah

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

28 Oct 2021

12mins

AF - [AN #167]: Concrete ML safety problems and their relevance to x-risk by Rohin Shah

AF - [AN #167]: Concrete ML safety problems and their relevance to x-risk by Rohin Shah

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

20 Oct 2021

14mins

AF - [AN #166]: Is it crazy to claim we're in the most important century? by Rohin Shah

AF - [AN #166]: Is it crazy to claim we're in the most important century? by Rohin Shah

Welcome to The Nonlinear Library, where we use Text-to-Speech software to convert the best writing from the Rationalist ... Read more

8 Oct 2021

10mins

55. Rohin Shah - Effective altruism, AI safety, and learning human preferences from the state of the world

55. Rohin Shah - Effective altruism, AI safety, and learning human preferences from the state of the world

If you walked into a room filled with objects that were scattered around somewhat randomly, how important or expensive w... Read more

28 Oct 2020

51mins

“Podium: AI tools for podcasters. Generate show notes, transcripts, highlight clips, and more with AI. Try it today at https://podium.page”