arxiv:2411.14257
Neel Nanda
NeelNanda
AI & ML interests
Mechanistic Interpretability
Recent Activity
authored
a paper
about 1 month ago
Do I Know This Entity? Knowledge Awareness and Hallucinations in
Language Models
Organizations
None yet
Papers
10
models
65
NeelNanda/crosscoders-gpt2-small
Updated
•
5
NeelNanda/GELU_1L512W_C4_Code
Updated
•
881
•
2
NeelNanda/gpt-neox-tokenizer-digits
Updated
•
2
NeelNanda/sparse_autoencoder
Updated
•
3
NeelNanda/redwood-attn-only-2l
Updated
•
9
NeelNanda/Othello-GPT-Transformer-Lens
Updated
NeelNanda/full_pred_log_probs
Updated
NeelNanda/SoLU_1L256W_C4_Width_Scan
Updated
•
4
NeelNanda/SoLU_1L128W_C4_Width_Scan
Updated
•
2
NeelNanda/SoLU_1L64W_C4_Width_Scan
Updated
•
3
datasets
15
NeelNanda/pile-small-tokenized-2b
Viewer
•
Updated
•
10.8M
•
1.1k
NeelNanda/pile-tokenized-10b
Viewer
•
Updated
•
10.8M
•
470
•
1
NeelNanda/openwebtext-tokenized-9b
Viewer
•
Updated
•
8.83M
•
368
NeelNanda/code-10k
Viewer
•
Updated
•
10k
•
52
•
1
NeelNanda/wiki-10k
Viewer
•
Updated
•
10k
•
38
NeelNanda/c4-code-20k
Viewer
•
Updated
•
20k
•
214
•
4
NeelNanda/c4-10k
Viewer
•
Updated
•
10k
•
75
NeelNanda/c4-tokenized-2b
Viewer
•
Updated
•
1.36M
•
123
NeelNanda/code-tokenized
Viewer
•
Updated
•
297k
•
39
NeelNanda/c4-code-tokenized-2b
Viewer
•
Updated
•
1.66M
•
65
•
1