Way to extract feature importance

#468

by avazhai - opened Jan 5

Jan 5

Hi all, I was wondering whether, after training a Geneformer model on your own data, there is a way to extract which genes contribute most to the prediction, like a SHAP feature importance analysis or something along those lines.

ctheodoris

Owner Jan 5

Thank you for your question! If you have fine-tuned the model to distinguish particular classes, you could use in silico perturbation to determine which genes contribute most to a particular class by identifying the genes whose removal shifts the prediction most toward the opposing class. Another option is to determine which genes receive the most attention, which you can do by examining the attention weights.
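To make the deletion idea concrete, here is a minimal sketch of the logic only. It does not use Geneformer's own in silico perturbation tooling. The names `deletion_importance`, `toy_predict_prob`, and the gene labels are hypothetical; in practice `predict_prob` would wrap your fine-tuned classifier and return the probability of the class of interest for one tokenized cell.

```python
from typing import Callable, Dict, List, Sequence


def deletion_importance(
    tokens: Sequence[str],
    predict_prob: Callable[[Sequence[str]], float],
) -> Dict[str, float]:
    """Score each gene by how much deleting it lowers the class probability.

    Assumes each gene appears at most once per cell, as in a
    rank-value encoding.
    """
    baseline = predict_prob(tokens)
    shifts: Dict[str, float] = {}
    for i, gene in enumerate(tokens):
        # Remove one gene and keep the order of the remaining genes.
        perturbed: List[str] = list(tokens[:i]) + list(tokens[i + 1:])
        shifts[gene] = baseline - predict_prob(perturbed)
    return shifts


# Hypothetical stand-in for a fine-tuned classifier: a fixed per-gene weight.
TOY_WEIGHTS = {"GENE_A": 0.5, "GENE_B": 0.25, "GENE_C": 0.0}


def toy_predict_prob(tokens: Sequence[str]) -> float:
    return sum(TOY_WEIGHTS.get(gene, 0.0) for gene in tokens)


cell = ["GENE_A", "GENE_B", "GENE_C"]
shifts = deletion_importance(cell, toy_predict_prob)
ranked = sorted(shifts, key=shifts.get, reverse=True)
print(ranked)  # genes ordered from largest to smallest shift
```

Genes with the largest positive shift are the ones whose removal moves the cell furthest away from the class. Averaging the shifts over many cells of that class would give a more stable ranking than a single cell.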

ctheodoris changed discussion status to closed Jan 5

avazhai

Jan 5

Thank you for the quick response! Sorry if this is a naive question, but how do I extract the attention weights from the model?

ctheodoris

Owner Jan 6

Please see this prior discussion: https://huggingface.co/ctheodoris/Geneformer/discussions/221
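Separately from whatever the linked discussion recommends, here is a rough sketch of one generic way to do this with the Hugging Face `transformers` library. It assumes the fine-tuned model loads as `BertForSequenceClassification` and that `input_ids` is one tokenized cell; `model_dir`, `extract_attentions`, and `attention_received` are hypothetical names. Depending on your `transformers` version, you may need an attention implementation that can return weights.

```python
from typing import List


def attention_received(layer_attn: List[List[List[float]]]) -> List[float]:
    """Average attention each token receives in one layer.

    layer_attn is indexed as [head][query][key] for a single cell.
    The result is averaged over all heads and all query positions.
    """
    n_heads = len(layer_attn)
    seq_len = len(layer_attn[0])
    received = [0.0] * seq_len
    for head in layer_attn:
        for query_row in head:
            for key_pos, weight in enumerate(query_row):
                received[key_pos] += weight
    return [total / (n_heads * seq_len) for total in received]


def extract_attentions(model_dir: str, input_ids: List[int]):
    """Return per-layer attention weights as nested lists [head][query][key].

    Assumption: model_dir holds a fine-tuned BERT-style classifier.
    """
    import torch
    from transformers import BertForSequenceClassification

    model = BertForSequenceClassification.from_pretrained(model_dir)
    model.eval()
    with torch.no_grad():
        out = model(input_ids=torch.tensor([input_ids]), output_attentions=True)
    # out.attentions is a tuple with one tensor per layer,
    # each shaped (batch, heads, seq_len, seq_len).
    return [layer[0].tolist() for layer in out.attentions]


# Toy example: 2 heads, 2 tokens, each query row sums to 1.
toy_layer = [
    [[0.5, 0.5], [0.25, 0.75]],
    [[1.0, 0.0], [0.5, 0.5]],
]
print(attention_received(toy_layer))  # [0.5625, 0.4375]
```

Each position in the result lines up with the same position in `input_ids`, so you can map the scores back to genes through the token dictionary. Which layers and heads to aggregate over is a judgment call, so it may be worth comparing a few choices.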
