Maximum Somatic VAF

The maximum somatic variant allele frequency (MSAF) is the highest VAF of a confirmed somatic mutation. The MSAF is often used as a surrogate for tumor fraction, especially when the ctDNA tumor fraction is high.

MSAF algorithm

The MSAF is determined using the following steps:

  1. Somatic variants determined by the TMB algorithm are used as an input (variants having Status Somatic in the {SampleID}_tmb.trace.tsv file). Variants that are not in coding regions, MNVs and variants with depth below 500 are filtered out.

  2. The remaining variants are ranked by the VAF in descending order

  3. The VAF for the highest ranked confident somatic variants is output as MSAF. "Confident somatic variants" are determined analyzing the fragment size of the reads supporting the variants. Circulating tumor DNA (ctDNA) molecules are expected to be shorter, when compared to the normal cell-free DNA (cfDNA) molecules. If the fragment sizes of the reads supporting a variant are significantly shorter than non-supporting reads (p-value < 1x10-5), the variant is considered to be a confident somatic variant.

  4. If no such variant exists, the VAF for the highest ranked COSMIC hotspot variant (with COSMIC count > 50) is output as MSAF.

  5. If no such variant exists, the VAF for the 4th highest ranked variant is output as MSAF.

MSAF algorithm in DRAGEN TSO 500 ctDNA Analysis Software

MSAF output files

The MSAF algorithm outputs results in several files:

  1. Metrics Output File, {SampleID}_MetricsOutput.tsv as Max_Somatic_AF

  2. TMB Max Somatic VAF file, {Sample_ID}.tmb.msaf.csv as MaxSomaticAF, using the same file format as the TMB Trace File.

Last updated

Was this helpful?