Phillip Guo

PhillipGuo

AI & ML interests

Interp, Unlearning, Editing

Recent Activity

updated a model about 1 hour ago
PhillipGuo/gemma-2-sae-masked-gd-mc-6-fullrank
updated a dataset about 9 hours ago
PhillipGuo/wmdp-deduped-unlearn
updated a model about 17 hours ago
PhillipGuo/gemma-2-gd-mc-5-fullrank
View all activity

Organizations

Truthfulness & Deception Research Team's profile picture quirky-lats-at-mats's profile picture LLM Latent Adversarial Training's profile picture

PhillipGuo's activity