HAI Seminar with Common Crawl

Wed Oct 22 2025 at 12:00 pm to 01:15 pm UTC-07:00

Gates Computer Science Building Room 119 | Stanford

Stanford Institute for Human-Centered Artificial Intelligence (HAI)
Visiting scholars share their research with the HAI community.
About this Event

Preserving Humanity's Knowledge and Making it Accessible | Addressing Challenges of Public Web Data


Visit our website to learn more about the event agenda, speakers, and other details


The Common Crawl Foundation is dedicated to preserving humanity's knowledge and making it accessible through its free public web dataset, a vital resource since 2008. As AI development accelerates, concerns have emerged about the accessibility and transparency of public web data. Three problems in particular affect open datasets: robots.txt exclusions, legal demands, and "bot defenses." Two of these are not publicly visible and are poorly understood. We will present insights from a new data product that uses Common Crawl's crawl metadata to visually explore these three problems, advocating for greater transparency and informed solutions for the future of public web data.


Details:

Time: 12:00 pm - 1:15 pm PT

Location: Gates Computer Science Building, Room 119, 353 Jane Stanford Way, Stanford, CA 94305.


Tickets

USD 0.00

