nested import for dictionaries from parent 3b4533f verified madhavanvenkatesh commited on Sep 3, 2024
update tokenizer to defaults for 95M models for special token and input size da8cf3d verified ctheodoris commited on Sep 2, 2024
pointing dictionaries from the mtl module's init (#397) 7470753 verified ctheodoris madhavanvenkatesh commited on Aug 28, 2024
Refactored token dictionary loading and encapsulated dictionary (#398) beb62a4 verified ctheodoris madhavanvenkatesh commited on Aug 28, 2024
Refactor: Convert mask_token_id, pad_token_id, and all_special_ids to properties (#395) 2e06f1a verified ctheodoris madhavanvenkatesh commited on Aug 28, 2024
sync token_dictionary variable name w/ classifier a021deb verified ctheodoris commited on Aug 26, 2024
allow model_type valid options to take params model_type : {"Pretrained", "GeneClassifier", "CellClassifier", "MTLCellClassifier", "MTLCellClassifier-Quantized"} (#390) 47e0ef8 verified ctheodoris madhavanvenkatesh commited on Aug 21, 2024
"save_model_without_heads" is redundant (#385) de10ab0 verified ctheodoris madhavanvenkatesh commited on Aug 21, 2024
comment out "def save_model_without_heads(original_model_save_directory)"; redundant for ISP/Emb extractor (#382) 22bf20f verified ctheodoris madhavanvenkatesh commited on Aug 21, 2024
fixed bug related to dynamic ranges in dictionary with 'min' and 'max' value mismatch in optuna suggest fn (#380) fe1640b verified ctheodoris madhavanvenkatesh commited on Aug 21, 2024
update with 12L and 20L i4096 gc95M models, multitask and quantiz code 933ca80 ctheodoris commited on Aug 15, 2024
update to only have gene names as keys in gene_name_id_dict e61485e verified ctheodoris commited on Aug 11, 2024
Add function for summing of Ensembl IDs (#377) 1e18102 verified ctheodoris hchen725 commited on Aug 11, 2024
embs_df with all model embeddings (#363) 2e64874 verified ctheodoris hchen725 commited on Jul 9, 2024
Add function to get number of model embeddings (#364) c90d791 verified ctheodoris hchen725 commited on Jul 9, 2024
update perturber stats to reflect cos sim and emb_extractor to suppress warnings for non-cls 25dd1da ctheodoris commited on Jul 7, 2024
update to account for set of perturbed genes with aggregate_data eb038a6 ctheodoris commited on Jul 2, 2024
Update geneformer/emb_extractor.py (#350) 471eefc verified ctheodoris hchen725 commited on Jun 13, 2024
Upload in_silico_perturber_stats.py (#313) 8aee0ff verified ctheodoris davidjwen commited on Jun 7, 2024
Update with gene classifier, custom token dict, and str validate options (#329) 0568479 verified ctheodoris hchen725 commited on Apr 8, 2024
add option for hyperparameter tuning to cc.validate 4bddd45 Christina Theodoris commited on Mar 26, 2024
update examples for predict_eval and handle roc for 2 cell classes eeba323 Christina Theodoris commited on Mar 4, 2024
Get the gene keys and gene list keys from the token dictionary instead of medians (#304) b294421 verified ctheodoris hchen725 commited on Feb 28, 2024
Update geneformer/emb_extractor.py (#303) d6c634c verified ctheodoris hchen725 commited on Feb 26, 2024
Add option for variable input_size and to add CLS/SEP Tokens (#299) aa25cd2 verified ctheodoris hchen725 commited on Feb 22, 2024