Media Summary: In “Powering Up Capability Evaluations,” ... Authors: Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb. Description: With recent progress in graphics, ... Computer Science Seminar Series, January 15, 2026: “Making Robust AI Safeguards Run Deep.”

Stephen Casper Generalized Adversarial Training - Detailed Analysis & Overview

In Lecture 16, guest lecturer Ian Goodfellow discusses ...
Authors: Vivek B.S., R. Venkatesh Babu. Description: Deep learning models have shown impressive performance across a ...
This workshop addressed the technical and institutional questions of how to safeguard human interests after AI surpasses human ...

GANs are an unsupervised learning method in which two neural networks compete iteratively. The discriminator is a typical ...
Authors: Mingyi Zhou, Jing Wu, Yipeng Liu, Shuaicheng Liu, Ce Zhu. Description: Machine learning models are vulnerable to ...
This is a 3-minute summary of the paper "..."

Photo Gallery

Stephen Casper – Generalized Adversarial Training and Testing
Stephen Casper – Powering Up Capability Evaluations [Alignment Workshop]
Stephen Casper: Problems with Evals (HAAISS 2024)
Stephen Casper - ML Researchers as Policymakers [Alignment Workshop]
Learning From Simulated and Unsupervised Images Through Adversarial Training
Stephen Casper: Problems with RLHF (HAAISS 2024)
Stephen Casper - Powering up AI Capability Evaluations with Model Tampering Attacks [Alignment Works
Making Robust AI Safeguards Run Deep – Stephen Casper
Lecture 16 | Adversarial Examples and Adversarial Training
Fast is better than free: Revisiting adversarial training (Reading Papers)
Ep 14 - Interp, latent robustness, RLHF limitations w/ Stephen Casper (PhD AI researcher, MIT)