SRI Seminar Series: Roger Grosse

Wed Oct 16 2024 at 12:30 pm to 02:00 pm UTC-04:00

Rotman School of Management, Room LL1065 | Toronto

Schwartz Reisman Institute
Publisher/HostSchwartz Reisman Institute
SRI Seminar Series: Roger Grosse
Advertisement
Can LLMs or agents built on top of them spontaneously “go rogue"? Join SRI Chair Roger Grosse for a special in-person discussion.
About this Event

Our weekly SRI Seminar Series welcomes for a special in-person talk that will also be broadcast online. Grosse is an associate professor of computer science at the University of Toronto, a Schwartz Reisman Chair in Technology and Society, and a founding member of the Vector Institute. Grosse’s research focuses on better understanding neural net training dynamics, with his current work exploring how understandings of deep learning can be applied to generate safe and aligned AI systems. 
In this special in-person lecture, Grosse will articulate the underlying model of how LLMs or agents built on top of them could spontaneously “go rogue.”


Talk title: "On the origin of rogue AI"

Abstract: One of the most concerning scenarios for future AI systems is that the AI autonomously carries out a malign plan not intended by any human. But how could this happen? Classical arguments for catastrophic AI risk were made in terms of idealized long-horizon planning agents which seemingly bear little relationship to current-day large language models (LLMs). In this talk, I’ll try to articulate the underlying model of how LLMs or agents built on top of them could spontaneously “go rogue.” I’ll argue that LLM pre-training, by making complex behaviours more compressible, creates smoother fitness landscapes for evolutionary searches. Such evolutionary searches could lead to tendencies such as reward hacking, consequentialism, and punishment. If this hypothesis is correct, then continued scaling of LLMs will enable a variety of catastrophic risk pathways which, up to now, have been limited to philosophical thought experiments.


About the speaker

Roger Grosse is an associate professor of Computer Science at the University of Toronto, a Schwartz Reisman Chair in Technology and Society, and a founding member of the Vector Institute. Grosse is also a member of technical staff on the Alignment Science Team at Anthropic. Grosse’s research focuses on better understanding neural net training dynamics, and uses this understanding to improve training speed, generalization, uncertainty estimation, and automatic hyperparameter tuning. His current research seeks to apply understandings of deep learning to AI alignment.


Grosse holds a Sloan Fellowship, Canada Research Chair, and Canada CIFAR AI Chair. He received a BS in symbolic systems from Stanford in 2008, a MS in computer science from Stanford in 2009, and a PhD in computer science from MIT in 2014, studying under Bill Freeman and Josh Tenenbaum. From 2014 to 2016, Grosse was a postdoctoral researcher at the University of Toronto, working with Ruslan Salakhutdinov. Along with Colorado Reed, he created Metacademy, a website which uses a dependency graph of concepts to create personalized learning plans for machine learning and related fields.




About the SRI Seminar Series

The SRI Seminar Series brings together the Schwartz Reisman community and beyond for a robust exchange of ideas that advance scholarship at the intersection of technology and society. Seminars are led by a leading or emerging scholar and feature extensive discussion.

To register for all seminar events in the Fall 2024 season, please contact us directly at [email protected].


About the Schwartz Reisman Institute for Technology and Society

The Schwartz Reisman Institute for Technology and Society is a research institute at the University of Toronto that explores the ethical and societal implications of technology. Our mission is to deepen knowledge of technologies, societies, and humanity by integrating research across traditional boundaries to build human-centred solutions.

Explore each session in advance by visiting .

Missed an event? Visit to watch previous seminars.

Advertisement

Event Venue & Nearby Stays

Rotman School of Management, Room LL1065, 95 St George St., Toronto, Canada

Tickets

CAD 0.00

Sharing is Caring:

More Events in Toronto

24th aluCine Latin Film+Media Arts Festival
Wed Oct 16 2024 at 12:00 am 24th aluCine Latin Film+Media Arts Festival

Spadina Theater

Toronto Global Forum
Wed Oct 16 2024 at 07:30 am Toronto Global Forum

Fairmont Royal York Hotel

ESO Regional User Group - Toronto, Canada
Wed Oct 16 2024 at 08:00 am ESO Regional User Group - Toronto, Canada

The Mascot Brewery - 37 Advance Rd, Toronto, ON M8Z 2S6, Canada

ReNew Canada Career Fair
Wed Oct 16 2024 at 10:00 am ReNew Canada Career Fair

Toronto Reference Library - Bram & Bluma Appel Salon (2nd Floor)

Care Centre Information Session
Wed Oct 16 2024 at 10:00 am Care Centre Information Session

Newcomer Women's Services Toronto: Employment Services, 355 Church Street, Toronto, ON, Canada

Toronto Job Fair - Toronto Career Fair
Wed Oct 16 2024 at 11:00 am Toronto Job Fair - Toronto Career Fair

Toronto

2024 Child and Teen Consumption Conference
Wed Oct 16 2024 at 02:00 pm 2024 Child and Teen Consumption Conference

York University

University of Toronto, Temerty Medicine Graduate Program Fair
Wed Oct 16 2024 at 02:00 pm University of Toronto, Temerty Medicine Graduate Program Fair

108 College St

Patina Workshop
Wed Oct 16 2024 at 04:00 pm Patina Workshop

164 Hollywood Ave

APPLICANT OPEN HOUSE Graduate Department of Pharmaceutical Sciences
Wed Oct 16 2024 at 04:30 pm APPLICANT OPEN HOUSE Graduate Department of Pharmaceutical Sciences

Leslie Dan Faculty of Pharmacy, University of Toronto

Salute @ The Axis Club | October 16th
Wed Oct 16 2024 at 06:00 pm Salute @ The Axis Club | October 16th

The Axis Club

Throne of Glass Trivia 1.1 (Part 2: books 4 - 7)
Wed Oct 16 2024 at 06:00 pm Throne of Glass Trivia 1.1 (Part 2: books 4 - 7)

142 Cumberland Street,Toronto,M5R 1A8,CA

Toronto is Happening!

Never miss your favorite happenings again!

Explore Toronto Events