Prioritize Cancer Driver Genes
prioritize_driver_genes(features_df, cancer_type)
the features data frame for all genes, containing the following columns:
HGNC gene symbol
the maximum metapredictor (coding) impact score for the gene
the maximum non-coding PHRED-scaled CADD score for the gene
SCNA proxy score. SCNA density (SCNA/Mb) of the minimal common region (MCR) in which the gene is located
boolean indicating whether the gene is a hotspot gene (indication of oncogenes) or subject to double-hit (indication of tumor-suppressor genes)
'phenolyzer' score for the gene
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
boolean indicating whether or not the gene takes part in this KEGG pathway
short name of the cancer type. All available cancer types
are listed in MTL_submodel_descriptions
data frame with 3 columns:
HGNC gene symbol
estimated probability for each gene in features_df
of being a
cancer driver. The probabilities are calculated using the selected (via
cancer_type
) cancer type's sub-model.
prediction based on the cancer-type-specific threshold (either 'driver' or 'non-driver')
create_features_df
for creating the features table.
drivers_df <- prioritize_driver_genes(example_features_table, 'LUAD')