Jordan Taylor

JordanTensor
·

AI & ML interests

Mechanistic interpretability, mechanistic anomaly detection, model internals techniques and AI safety techniques generally.

Recent Activity

liked a model 11 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
updated a collection 17 days ago
Obfuscated Backdoors
updated a collection 17 days ago
Obfuscated Backdoors
View all activity

Organizations

Mechanistic  Anomaly Detection's profile picture