MutSpec-NMF_Estimate_Signatures

Compute statistics for estimating the number of signatures to extract with MutSpec-NMF.

Input format

The input is the matrix created by the tool MutSpec-Stat.
It can be found in the MutSpec-Stat output folder: Mutational_Analysis/Figures/Input_NMF/Input_NMF_Count.txt

Output

  • Statistics from several approaches for estimating the number of signatures to extract with NMF:
    • Brunet et al. proposed taking the first number of signatures for which the cophenetic coefficient starts decreasing.
    • Hutchins et al. suggested choosing the first value at which the RSS (residual sum of squares) curve presents an inflection point.
    • Frigyesi et al. considered the smallest value at which the decrease in the RSS is lower than the decrease in the RSS obtained from random data.
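
The Brunet and Frigyesi criteria can be sketched in a few lines of Python. This is only an illustration of the selection rules, not the code used by MutSpec-NMF: the function names and all numeric values are hypothetical, and the Frigyesi rule follows one possible reading (the drop is measured from a rank to the next one, and the rank before the first "too small" drop is returned). The Hutchins criterion is left out because locating an inflection point on a short, noisy curve requires a smoothing choice this sketch does not make.

```python
def brunet_rank(ranks, cophenetic):
    """Return the first rank after which the cophenetic coefficient decreases."""
    for i in range(len(ranks) - 1):
        if cophenetic[i + 1] < cophenetic[i]:
            return ranks[i]
    # The coefficient never decreases: fall back to the largest rank tested.
    return ranks[-1]


def frigyesi_rank(ranks, rss, rss_random):
    """Return the smallest rank whose next RSS drop is below the random-data drop.

    Assumption: the drop is measured between consecutive ranks, and the lower
    rank of the pair is returned. Returns None if no such rank exists.
    """
    for i in range(len(ranks) - 1):
        drop = rss[i] - rss[i + 1]
        drop_random = rss_random[i] - rss_random[i + 1]
        if drop < drop_random:
            return ranks[i]
    return None


# Hypothetical quality measures for ranks 2 to 6 (not real MutSpec output).
ranks = [2, 3, 4, 5, 6]
cophenetic = [0.98, 0.99, 0.995, 0.97, 0.95]
rss = [1000, 700, 500, 460, 440]
rss_random = [1200, 1130, 1070, 1015, 965]

print("Brunet:", brunet_rank(ranks, cophenetic))
print("Frigyesi:", frigyesi_rank(ranks, rss, rss_random))
```

With these made-up values both rules point to 4 signatures. On real data the criteria can disagree, which is why the output figure reports all of them.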

The estimates are based on Brunet’s algorithm, computed from 50 runs for each candidate number of signatures.

The original data are shuffled so that the quality measures obtained from the actual data (Data x) can be compared with those obtained from randomized data (Data y). Curves for the actual data are drawn as solid lines, and curves for the randomized data as dashed lines.
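
As a rough illustration of what shuffling the input matrix can look like, the Python sketch below permutes the entries of each column independently. The per-column scheme is an assumption made for this example (the text above only states that the data are shuffled), and the function name and the small matrix are hypothetical.

```python
import random


def shuffle_columns(matrix, seed=0):
    """Return a copy of `matrix` with the entries of each column permuted.

    Assumption: each column is shuffled independently, which keeps the set of
    counts in every column but breaks the structure shared across rows.
    """
    rng = random.Random(seed)  # seeded so the example is reproducible
    n_rows = len(matrix)
    n_cols = len(matrix[0])
    shuffled = [row[:] for row in matrix]  # leave the original untouched
    for j in range(n_cols):
        column = [matrix[i][j] for i in range(n_rows)]
        rng.shuffle(column)
        for i in range(n_rows):
            shuffled[i][j] = column[i]
    return shuffled


# Hypothetical count matrix: rows are mutation types, columns are samples.
counts = [
    [12, 3, 40],
    [5, 30, 2],
    [9, 8, 7],
    [0, 15, 21],
]
randomized = shuffle_columns(counts, seed=42)
for row in randomized:
    print(row)
```

Running the same estimation on `randomized` gives the baseline curves against which the curves for the actual data are compared.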

Usage

Rscript R/estimateSign_Galaxy.r --input outfolder_MutSpec-Stat/Mutational_Analysis/Figures/Input_NMF/Input_NMF_Count.txt --stop 8 --cpu 8 --output estimate_signatures.png

List of parameters:

Parameter   Description
--input     Input matrix created with the tool MutSpec-Stat
--stop      Maximum number of signatures to compute (selecting a number above 8 may not work on small datasets)
--cpu       Number of CPUs
--output    Output figure
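
When the script has to be run on several MutSpec-Stat output folders, the command line can be assembled programmatically. The Python sketch below only builds and prints the commands using the four parameters listed above; the helper name, the folder names, and the output file naming are hypothetical.

```python
import shlex


def build_command(stat_folder, stop, cpu, output_png,
                  script="R/estimateSign_Galaxy.r"):
    """Build the Rscript argument list from the documented parameters."""
    # Path of the input matrix inside a MutSpec-Stat output folder.
    input_matrix = (
        stat_folder + "/Mutational_Analysis/Figures/Input_NMF/Input_NMF_Count.txt"
    )
    return [
        "Rscript", script,
        "--input", input_matrix,
        "--stop", str(stop),
        "--cpu", str(cpu),
        "--output", output_png,
    ]


# Hypothetical MutSpec-Stat output folders.
folders = ["outfolder_MutSpec-Stat", "outfolder_second_dataset"]
commands = [
    build_command(folder, stop=8, cpu=8,
                  output_png=folder + "_estimate_signatures.png")
    for folder in folders
]
for cmd in commands:
    print(shlex.join(cmd))
```

Each printed line can be pasted into a shell, or the list can be passed to `subprocess.run(cmd, check=True)` on a machine where Rscript and the MutSpec scripts are installed.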