Only this pageAll pages
Powered by GitBook
Couldn't generate the PDF for 107 pages, generation stopped at 100.
Extend with 50 more pages.
1 of 100

DRAGEN TSO 500 and TSO 500 ctDNA Analysis Software

Loading...

DRAGEN TSO 500 Guides

DRAGEN TSO 500 v2.6

Loading...

Getting Started

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Launching Analysis

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

DRAGEN TSO 500 v2.5

Loading...

Getting Started

Loading...

Loading...

Loading...

Run Planning

Loading...

Loading...

Loading...

Loading...

Launching Analysis

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

DRAGEN TSO 500 ctDNA Guides

DRAGEN TSO 500 ctDNA v2.6

Loading...

Getting Started

Loading...

Loading...

Loading...

Loading...

Run Planning

Loading...

Loading...

Loading...

Loading...

Loading...

Launching Analysis

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

DRAGEN TSO 500 and TSO 500 ctDNA Analysis Software

DRAGEN TruSight™ Oncology 500 and DRAGEN TruSight™ Oncology 500 ctDNA Analysis Software support secondary analysis for the data generated by assays in the TruSight™ Oncology 500 (TSO 500) portfolio:

  • DRAGEN TruSight™ Oncology 500 Analysis Software supports TruSight Oncology 500 Assay and TruSight Oncology 500 High-Throughput Assay, both Research Use Only (RUO)

  • DRAGEN TruSight™ Oncology 500 ctDNA Analysis Software supports TruSight Oncology 500 ctDNA assay v1 and v2, both Research Use Only (RUO)

Depending on the version, the analysis software is available on:

  • Illumina Connected Analytics (ICA): A cloud-based secure platform for data analysis and management. Analysis setup, monitoring, and results access is facilitated via user-friendly interface of BaseSpace Sequence Hub.

  • DRAGEN server: An on-premises server that offers secondary analysis in a fraction of the time compared with a traditional CPU-based system.

  • On-board of select instruments: Enabled by a user-friendly analysis application installed on a sequencer.

This resource provides information on installation, configuration, running, troubleshooting as well as analysis algorithms of DRAGEN TruSight™ Oncology 500 and DRAGEN TruSight™ Oncology 500 ctDNA Analysis Software.

Introduction to DRAGEN TSO 500 Analysis Software v2.6.x

Scope

This resource provides information on installation, configuration, running, troubleshooting and analysis algorithms for the following software:

  • DRAGEN TruSight Oncology 500 Analysis Software v2.6.0 (for standalone DRAGEN server)

  • DRAGEN TruSight Oncology 500 Analysis Software on Illumina Connected Analytics (ICA) v2.6.0

  • DRAGEN TruSight Oncology 500 Analysis Application on NovaSeq 6000Dx v2.6.0 (uses a paired DRAGEN server)

  • DRAGEN TruSight Oncology 500 Analysis Software v2.6.1 (for standalone DRAGEN server)

The content is applicable to all software versions unless otherwise specified. The content related to setting up and running the analysis on ICA is only relevant to v2.6.0.

Overview

DRAGEN TruSight™ Oncology 500 Analysis Software supports data analysis for TruSight Oncology 500 Assay and TruSight Oncology 500 High-Throughput Assay, both Research Use Only (RUO).

The software provides local and cloud analysis for DNA and RNA libraries generated from formalin-fixed, paraffin-embedded (FFPE) tissue samples. The assays and the software are optimized to provide high sensitivity and specificity for low-frequency somatic variants across coding exons and additional regions of biological relevance in 523 genes for DNA biomarkers.

In addition, this software supports data analysis for TruSight Oncology 500 HRD (RUO), an optional add-on kit to TruSight Oncology 500, that enables detection of homologous recombination deficiency (HRD) through assessment of a genomic instability score (GIS).

TruSight Oncology 500 HRD is not available in Japan

DNA biomarkers:

  • Single nucleotide variants (SNVs)

  • Insertions

  • Deletions

  • Copy number variants (CNVs)

  • Exon-level CNVs

  • Multinucleotide variants (MNVs)

  • Genomic Instability Score (GIS Score) *

DNA Immunotherapy Biomarkers:

  • Tumor mutational burden (TMB)

  • Microsatellite instability (MSI)

RNA biomarkers (called from 55 genes):

  • Fusions

  • Splice variants

Beta features:

  • Absolute copy numbers (ACN)*

  • Loss of heterozygosity (LOH)*

  • Tumor fraction*

  • Ploidy*

Details of the regions covered by the assays can be found in the assay manifest file. Contact your local Illumina representative for more information.

*Requires TruSight Oncology 500 HRD add-on kit

Local and Cloud Deployments

Local analysis is available using a standalone DRAGEN server or an application with a user interface on NovaSeq 6000Dx. The software on the standalone DRAGEN server allows for analysis on a single DRAGEN server or splitting across multiple servers.

Cloud analysis is available on Illumina Connected Analytics with auto-launch or manual launch. Both methods are available from BCLs and FASTQs.

Instrument Compatibility

DRAGEN TruSight Oncology 500 analysis software is compatible with data generated on the Illumina instruments as summarized in the table below.

Instrument
Illumina Connected Analytics
Standalone DRAGEN Server
Paired DRAGEN server
On-board DRAGEN

NextSeq 550Dx (RUO mode)

Yes

Yes

No

N/A

NextSeq 500/550

Yes

Yes

N/A

N/A

NovaSeq 6000

Yes

Yes

N/A

N/A

NovaSeq 6000Dx (RUO mode)

Yes

Yes

Yes

N/A

NextSeq 1000/2000

Yes

Yes

N/A

No

NovaSeq X

Yes

Yes

N/A

No

Navigation of Guide

This resource provides information on installation, configuration, running, troubleshooting as well as analysis algorithms of DRAGEN TruSight Oncology 500 analysis software on Illumina Connected Analytics, standalone DRAGEN server, and the NovaSeq 6000Dx analysis application.

Installation of NovaSeq 6000Dx TSO 500 Analysis Application

Instructions to install DRAGEN TSO 500 Analysis Application on NovaSeq 6000Dx (RUO mode).

Prerequisites

  • A NovaSeq 6000Dx sequencing instrument with paired DRAGEN server v4

  • Illumina Run Manager installed by Illumina support personnel

  • TSOCombined and TSO500_HRD licenses installed by Illumina support personnel

    • TSOCombined and TSO500_HRD licenses are pre-installed in manufacturing since February 2025 and will need to be installed only for instruments purchased prior to that

  • The user installing the app must have admin privileges on Illumina Run Manager

Installation Instructions:

  1. Download the installation package provided in the email by Illumina Customer Care. The link will expire after 7 days.

It is recommended to use a command line tool like wget or curl to download the file rather than pasting the link into the web browser bar. For example:

curl -o {filename} "{link}"

wget -O {filename} '{link}'

Where the file name is the name of either DRAGEN TSO 500 Analysis Application file or DRAGEN ires file, and the link is provided by Illumina Customer Care.

  1. The installation package contains the following:

    1. DRAGEN TSO 500 Analysis Application: DRAGEN_TSO500HRD_v2.6.0-2v12.iapp

    2. DRAGEN ires: drageninstaller_3.10.17-8.el8.x86_64_prod.ires

MD5sum for the installation package contents are as follows:

  • drageninstaller_3.10.17-8.el8.x86_64_prod.ires

    • 12df06502776d8b673c73fb714dc466a

  • DRAGEN_TSO500HRD_v2.6.0-2v12.iapp

    • dc4fe5fe2fc3eca57969e978886dcedf

  1. Install the DRAGEN version using the ires:

    1. Log into Illumina Run Manager as a user with admin credentials

    2. Navigate to the top left menu to "DRAGEN" in the drop down

    3. Select "Add DRAGEN Installer" and upload the DRAGEN ires file

    4. The installation is complete once the DRAGEN version 3.10.17 is in the install version(s) list

\

Installation of 2.6.1 on Standalone DRAGEN Server

Overview

The installation script for DRAGEN TruSight Oncology 500 Analysis Software installs the following software and dependencies:

  1. DRAGEN TruSight Oncology 500 Analysis Software itself

  2. DRAGEN Software if a compatible version is not present

  3. Docker software if a compatible version is not present

  4. A script required to generate DRAGEN genome hash table

  5. A script to check that DRAGEN TruSight Oncology 500 Analysis Software is installed properly

Installation Requirements

Hardware

  • DRAGEN server v3 or v4

  • Network-attached storage (NAS) with enabled mkfifo if performing analysis for the TruSight Oncology 500 High-Throughput assay

Software

  • Linux CentOS 7.9 operating system (or later) or Oracle Linux 8 (or later), one of which is provided on the server. Oracle Linux 8 is recommended.

  • Docker Software, see table below for minimum version needed. If sufficient Docker software is not present on the server, the TSO 500 installer will install compatible Docker software.

  • DRAGEN Server Software*, see table below for minimum version needed as the host version on the server. If sufficient DRAGEN software is not present on the server, the TSO 500 installer will install compatible DRAGEN software.

Software Dependency
Compatible
Installs

Docker

20.10 or greater

Docker 20.10.15

DRAGEN Server Software*

v3.10.x, where x >=19, v4.3+

DRAGEN Software 3.10.19

*The DRAGEN Server Software version may be higher than the DRAGEN version used by the DRAGEN TSO 500 v2.6.1 pipeline (DRAGEN v3.10.17), which is provided inside the DRAGEN TSO 500 docker image.

Licenses

  • TSOCombined license

  • TSO500_HRD license (to analyze data generated with the TSO 500 HRD add-on kit)

Permissions

Illumina recommends logging in as root user for installation, but as a non-root user for running TSO 500 analysis.

  • Installing and uninstalling DRAGEN TruSight Oncology 500 Analysis Software and running the system check requires root privileges.

  • Run DRAGEN TruSight Oncology 500 Analysis Software without being logged in as a root user. Running the DRAGEN TruSight Oncology 500 Analysis Software as root is not required or recommended.

Compatibility with other DRAGEN pipelines

Software versions without multi-version compatibility referred to as single-version compatible. DRAGEN TSO 500 Analysis Software v2.6.1 will disrupt installations of single-version compatible software from the DRAGEN server. To uninstall a previous version of DRAGEN TSO 500 Analysis Software, refer to the respective guide.

Compatibility of software for co-installation with DRAGEN TSO 500 v2.6.1 on a DRAGEN server is summarized in the table below:

Software
Version
Type
Compatible

DRAGEN TSO 500

2.6.0

Single-version

No

DRAGEN TSO 500

2.5.4

Multi-version

Yes*

DRAGEN TSO 500

2.5.3 or below

Single-version

No

DRAGEN TSO 500 ctDNA

2.6.2+

Multi-version

Yes

DRAGEN TSO 500 ctDNA

2.6.1 or below

Single-version

No

DRAGEN pipelines**

4.3.6+

Multi-version

Yes

DRAGEN pipelines**

4.2 or below

Single-version

No

*DRAGEN TSO 500 Analysis Software v2.6.1 can run on a single server with another multi-version compatible DRAGEN TSO 500 Analysis Software, e.g. DRAGEN TSO 500 v2.5.4 (v2.5.4 should be installed before installing v2.6.1). DRAGEN TSO 500 Analysis Software v2.6.1 can be co-installed with multi-version compatible DRAGEN TSO 500 ctDNA Analysis Software or other DRAGEN pipelines with any order of installation.

**For example, DRAGEN Enrichment, DRAGEN Germline, DRAGEN WGS Heme v1.0.0 and others. Order of installation does not matter.

Installation Instructions

As a root user, perform the following steps to install DRAGEN TruSight Oncology 500 v2.6.1 Analysis Software:

  1. Download the installation package provided in the email from Illumina. The link expires after 7 days.

It is recommended to use a command line tool like wget or curl to download the file rather It is recommended to use a command line tool like wget or curl to download the file rather than pasting the link into the web browser bar. For example:

curl -o {filename} "{link}"

wget -O {filename} '{link}'

Where the file name is the installation script file name, and the link is provided by Illumina Customer Care.

  1. Make sure no other analysis is being performed. Installing the software while performing other analyses prevent the installer process from proceeding

  2. Copy the install script to the /staging directory to store the script in the directory.

Installation Script: install_DRAGEN_TSO500-2.6.1.run

MD5sum value: sha256:00326573934d73e16b8b9d0f7790b58da5ac8112875deb359d2944f95b95aed5

  1. Use the following command to update the run script permission: chmod +x /staging/install_DRAGEN_TSO500-2.6.1.run

  2. Use the following command to run the installation script, which runs for approximately 20 minutes:

    1. For Docker, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.6.1.run . The script installs compatible DRAGEN software and removes any previously installed versions.

    2. For Apptainer, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.6.1.run -- --noDockerInstall This will not install Apptainer, but will install the analysis software in the SIF container format and modify the software to launch analyses using Apptainer.

  3. During the installation process, you might be instructed to reboot or power cycle the system to complete the installation of the DRAGEN software. A power cycle of the system requires the server be shut down and restarted.

  4. Log out of the server and then log back in.

License Installation

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server connected to the internet:

  1. Confirm that the server is connected to the Internet, example: ping www.illumina.com

  2. Run the following command: /usr/bin/dragen_lic -i auto

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server not connected to the internet:

  1. Download and save the license file(s) to a location that is accessible from the DRAGEN server

  2. For each license file, run the command, where <license file received> is the absolute path to the license file: sudo /usr/bin/dragen_lic -i /tmp/<license file received>.bin

To check the success of license installation, run: /usr/bin/dragen_lic. Installed licenses should be in the list.

Running the System Check

After installation is complete, make sure the system functions properly by running the following command: /usr/local/bin/check_DRAGEN_TSO500-2.6.1.sh

The script checks that:

  • All required services are running

  • Proper Docker image is installed

  • DRAGEN TruSight Oncology 500 Analysis Software can successfully process a test data set

The system check script runs for approximately 25 minutes. If the script prints a failure message, contact Illumina Technical Support and provide the /staging/check_DRAGEN_TSO500_<timestamp>.tgz output file.

If using MacOS to connect to a server, an error can occur if the local settings are not in English. To resolve the error, disable the ability to set environment variables automatically in Terminal settings.

Uninstall Software

The DRAGEN TruSight Oncology 500 Analysis Software installation includes an uninstall script called uninstall_DRAGEN_TSO500-2.6.1.sh, which is located in /usr/local/bin.

Executing the uninstall script removes the following assets:

  • All DRAGEN TruSight Oncology 500 Analysis Software related scripts located in /usr/local/bin

  • Resources found in /staging/illumina/DRAGEN_TSO500

  • The dragen_tso500:2.6.1: Docker image

To uninstall the DRAGEN TruSight Oncology 500 Analysis Software, run the following command as a root user:

uninstall_DRAGEN_TSO500-2.6.1.sh

You are not required to uninstall Docker or DRAGEN software. To remove Docker, review the install instructions for your operating system in the Docker documentation.

Installation of 2.6.0 on Standalone DRAGEN Server

Overview

The installation script for DRAGEN TruSight Oncology 500 Analysis Software installs the following software and dependencies:

  1. DRAGEN TruSight Oncology 500 Analysis Software itself

  2. DRAGEN Software if a compatible version is not present

  3. Docker software if a compatible version is not present

  4. A script required to generate DRAGEN genome hash table

  5. A script to check that DRAGEN TruSight Oncology 500 Analysis Software is installed properly

Installation Requirements

Hardware

  • DRAGEN server v3 or v4

  • If performing analysis for the TruSight Oncology 500 High-Throughput assay, mkfifo needs to be enabled on the network-attached storage (NAS).

Software

  • By default Linux CentOS 7.9 operating system (or later) or Oracle Linux 8 (or later), is provided. Oracle Linux 8 is recommended.

  • Docker Software, see table below

  • DRAGEN Software, see table below

Software Dependency
Compatible
Installs

Docker

20.10 or greater

Docker 20.10.15

DRAGEN Software

v3.10.x where x is 17 or greater

DRAGEN Software 3.10.17

DRAGEN TruSight Oncology 500 v2.6.0 Analysis Software is not compatible with DRAGEN Software v4.0 or above on the same standalone DRAGEN server.

Licenses

  • TSOCombined license

  • TSO500_HRD license (to analyze data generated with the TSO 500 HRD add-on kit)

Permissions

Illumina recommends logging in as root user for installation, but as a non-root user for running TSO 500 analysis.

  • Installing and uninstalling DRAGEN TruSight Oncology 500 Analysis Software and running the system check requires root privileges.

  • Run DRAGEN TruSight Oncology 500 Analysis Software without being logged in as a root user. Running the DRAGEN TruSight Oncology 500 Analysis Software as root is not required or recommended.

Compatibility with other TruSight Oncology 500 and TruSight Oncology 500 ctDNA Analysis Software

DRAGEN TruSight Oncology 500 Analysis Software v2.6.0 can be installed on one DRAGEN server with:

  1. DRAGEN TruSight Oncology 500 ctDNA Analysis Software v2.6.0 (v3.10.17*)

  2. One prior 2.x version of DRAGEN TruSight Oncology 500 ctDNA Analysis Software (v2.1.1 (v3.10.9*), v2.5.0 (v3.10.15*), 2.6.0 (v3.10.17*), 2.6.1 (v3.10.18*))

  3. One prior 2.x version of DRAGEN TruSight Oncology 500 Analysis Software (v2.1.1 (v3.10.9*), v2.5.3 (v3.10.16*)

*DRAGEN Software version

Contrary to the prior versions, the installation scripts for DRAGEN TruSight Oncology 500 Analysis Software v2.6.0 and DRAGEN TruSight Oncology 500 ctDNA v2.6.0 do not uninstall previous versions of DRAGEN TruSight Oncology 500 Analysis Software. To uninstall a previous version of DRAGEN TruSight Oncology 500 Analysis Software, refer to the respective guide.

When installing DRAGEN TruSight Oncology 500 and DRAGEN TruSight Oncology 500 ctDNA software on the same DRAGEN server, install the software with the highest corresponding DRAGEN Software version last, as versions below v2.6.0 will overwrite with its corresponding DRAGEN Software version.

If a prior version of DRAGEN TruSight Oncology 500 Analysis Software (eg. v2.5.3) is installed after v2.6.0, re-execute the installation script for v2.6.0 to install the compatible version of DRAGEN Software without impacting other installations.

Installation Instructions

As a root user, perform the following steps to install DRAGEN TruSight Oncology 500 v2.6.0 Analysis Software:

  1. Download the installation package provided in the email from Illumina. The link expires after 7 days.

It is recommended to use a command line tool like wget or curl to download the file rather than pasting the link into the web browser bar. For example:

curl -o {filename} "{link}"

wget -O {filename} '{link}'

Where the file name is the installation script file name, and the link is provided by Illumina Customer Care.

  1. Make sure no other analysis is being performed. Installing the software while performing other analyses prevent the installer process from proceeding

  2. Copy the install script to the /staging directory to store the script in the directory.

Installation Script: install_DRAGEN_TSO500-2.6.0.run

MD5sum: 578cda2b8837845b26e2c3c020f2264c

  1. Use the following command to update the run script permission: chmod +x /staging/install_DRAGEN_TSO500-2.6.0.run

  2. Use the following command to run the installation script, which runs for approximately 20 minutes:

    1. For Docker, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.6.0.run . The script installs compatible DRAGEN software and removes any previously installed versions.

    2. For Apptainer, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.6.0.run -- --noDockerInstall This will not install Apptainer, but will install the analysis software in the SIF container format and modify the software to launch analyses using Apptainer.

  3. During the installation process, you might be instructed to reboot or power cycle the system to complete the installation of the DRAGEN software. A power cycle of the system requires the server be shut down and restarted.

  4. Log out of the server and then log back in.

License Installation

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server connected to the internet:

  1. Confirm that the server is connected to the Internet, example: ping www.illumina.com

  2. Run the following command: /opt/edico/bin/dragen_lic -i auto

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server not connected to the internet:

  1. Download and save the license file(s) to a location that is accessible from the DRAGEN server

  2. For each license file, run the command, where <license file received> is the absolute path to the license file: sudo /opt/edico/bin/dragen_lic -i /tmp/<license file received>.bin

To check the success of license installation, run: /opt/edico/bin/dragen_lic. Installed licenses should be in the list.

Running the System Check

After installation is complete, make sure the system functions properly by running the following command: /usr/local/bin/check_DRAGEN_TSO500-2.6.0.sh

The script checks that:

  • All required services are running

  • Proper Docker image is installed

  • DRAGEN TruSight Oncology 500 Analysis Software can successfully process a test data set

The system check script runs for approximately 25 minutes. If the script prints a failure message, contact Illumina Technical Support and provide the /staging/check_DRAGEN_TSO500_<timestamp>.tgz output file.

If using MacOS to connect to a server, an error can occur if the local settings are not in English. To resolve the error, disable the ability to set environment variables automatically in Terminal settings.

Uninstall Software

The DRAGEN TruSight Oncology 500 Analysis Software installation includes an uninstall script called uninstall_DRAGEN_TSO500-2.6.0.sh, which is located in /usr/local/bin.

Executing the uninstall script removes the following assets:

  • All DRAGEN TruSight Oncology 500 Analysis Software related scripts located in /usr/local/bin

  • Resources found in /staging/illumina/DRAGEN_TSO500

  • The dragen_tso500:2.6.0: Docker image

To uninstall the DRAGEN TruSight Oncology 500 Analysis Software, run the following command as a root user:

uninstall_DRAGEN_TSO500-2.6.0.sh

You are not required to uninstall Docker or DRAGEN software. To remove Docker, review the install instructions for your operating system in the Docker documentation.

Run Planning

Here are the articles in this section:

Sample Sheet Requirements

DRAGEN TSO 500 Analysis Software has optional and required fields that are required in addition to general sample sheet requirements. Follow the steps below to create a valid samplesheet.

Standard Sample Sheet Requirements

The following sample sheet requirements describe required and optional fields for DRAGEN TSO 500 Analysis Software. Depending on the deployment (standalone DRAGEN server, ICA with auto-launch, ICA with manual launch, NovaSeq 6000Dx analysis application), certain sections and required values can deviate from the standard requirements. These deviations are noted in the information below.

The analysis fails if the sample sheet requirements are not met.

Use the following steps to create a valid sample sheet.

  1. Download the sample sheet v2 template that matches the instrument & assay run.

  2. In the Sequencing Settings section, enter the following required parameters:

[Sequencing_Settings] Section

  1. In the BCL Convert Settings section, enter the following required parameters:

[BCLConvert_Settings] Section

  1. In the BCL Convert Data section, enter the following parameters for each sample.

[BCLConvert_Data] Section

  1. In the TSO 500 Data section, enter the following parameters:

TSO 500 Data Section header changes depending on the deployment:

  • Standalone DRAGEN Server and ICA with Manual Launch: TSO500S_Data

  • ICA with Auto-launch: Cloud_TSO500S_Data

  • Illumina DRAGEN TruSight Oncology 500 (HRD) Analysis Application on NovaSeq 6000Dx: TSO500HRD_Data

[TSO500S_Data] Section

To ensure a successful analysis, follow these guidelines:

  1. Avoid any blank lines at the end of the sample sheet; these can cause the analysis to fail.

  2. When running local analysis using the command line save the sample sheet in the sequencing run folder with the default name SampleSheet.csv, or choose a different name and specify the path in the command-line options.

ICA with Auto-launch: Sample Sheet Requirements

To auto-launch analysis from the sequencer run folder, ensure the StartsFromFastq and SampleSheetRequested fields are set to FALSE. To auto-launch analysis from FASTQs after BCL Convert auto-launch, StartsFromFastq and SampleSheet Requested fields must be set to TRUE

[Cloud_TSO500S_Data] Section

[Cloud_TSO500S_Settings] Section

[Cloud_Data] Section

[Cloud_Settings] Section

NovaSeq 6000Dx Analysis Application: Sample Sheet Requirements

This section describes fields specific for sample sheets for NovaSeq 6000Dx Analysis Application. For more information on DRAGEN TSO 500 Analysis Software sample sheet requirements, refer to the sections above.

Mismatches between the samples and index primers can cause incorrect results due to loss of positive sample identification. Enter sample IDs and assign indexes in the sample sheet before beginning library preparation. Record sample IDs, indexes, and plate well orientation for reference during library preparation.

[BCLConvert_Settings] Section

[TSO500HRD_Data] Section

NovaSeq 6000Dx Run Set Up

The following instructions describe steps to set up a run on NovaSeq 6000Dx Analysis Application.

Use the following steps to configure a TruSight™ Oncology 500 run in Illumina Run Manager:

  1. Go to the "Runs" section of Illumina Run Manager by selecting "Runs" on the left-hand side

  2. Enter sample data manually or by importing a sample sheet

  3. To enter sample data run manually, select “Create Run”

  4. Choose "DRAGEN TruSight™ Oncology 500 (with HRD) Analysis Application" from the "Create Run" screen to set-up and analyze runs for TruSight Oncology 500 assay with or without HRD add-on

Run Settings

  1. On the "Run Settings" screen, enter a run name with the following criteria:

    1. 1 - 40 characters.

    2. Alphanumeric characters, underscores, or dashes only.

    3. Unique across all runs on the instrument.

    The run name identifies the run from sequencing through analysis.

  2. [Optional] Enter a run description. The run description must have the following criteria:

    1. 1 - 50 characters.

    2. Alphanumeric characters or spaces only.

    3. Spaces must be preceded and followed by an alphanumeric character.

  3. Select kit used during library preparation:

    1. TruSight Oncology 500

    2. TruSight Oncology 500 High-Throughput

  4. Index adapter kit will be automatically selected based on the library prep kit selection

  5. [Optional] Enter a library tube ID.

Depending on the library prep kit selected, additional fields will be populated for run settings and are not editable. Read and index lengths will differ between library prep kit type.

Sample Data

Use the table on the "Sample Data" screen to enter sample information manually.

  1. Select lane information. Options include one to four, or all lanes.

  2. Enter a unique sample ID in the sample ID field with the following criteria:

    1. Controls should be added first.

    2. 1 - 40 characters.

    3. Alphanumeric characters, underscores, or dashes only.

    4. Underscores and dashes must be preceded and followed by an alphanumeric character.

  3. Select an index set ID for the DNA / RNA library prepared from the sample.

  4. [Optional] Enter a library name.

Depending on the options selected for index set ID, additional fields will be auto-populated for sample data and are not editable.

Sample Settings

Use the table on the "Sample Settings" screen to enter additional sample information.

  1. Enter Pair ID with the following criteria:

    1. 1 - 40 characters.

    2. Alphanumeric characters, underscores, or dashes only.

    3. Underscores and dashes must be preceded and followed by an alphanumeric character.

    4. Pairs at most one DNA and one RNA samples from the same biological sample from the same individual.

  2. Select Sample Type: DNA or RNA

  3. Enter Sample Feature: Select HRD for DNA samples with HRD probes. For all other samples, leave the field blank.

  4. [Optional] Enter a sample name with the following criteria:

    1. 1 - 50 characters.

    2. Alphanumeric characters, dashes, underscores, or spaces.

    3. Spaces, underscores, and dashes must be preceded and followed by an alphanumeric character.

  5. [Optional] Enter a sample description with the following criteria:

    1. 1 - 50 characters.

    2. Alphanumeric characters, dashes, underscores, or spaces.

    3. Spaces, underscores, and dashes must be preceded and followed by an alphanumeric character.

Additional fields will be auto-populated based on selections made in the Sample Data screen, which are not editable.

Before starting your run, review that the information entered is correct in the “Run Review” page before saving.

Sample Sheet Introduction

Overview

The sample sheet includes a list of samples and their index sequences, along with additional information required to run DRAGEN TruSight Oncology 500 Analysis Software. For example, DNA samples with the TruSight Oncology 500 HRD add-on probes must be indicated in the Sample Feature column of the sample sheet. Appropriate index adapter sequences are determined by the assay used to perform analysis.

When running analysis on a standalone DRAGEN server or on ICA, a valid sample sheet can be created by:

When running analysis using a NovaSeq 6000Dx Analysis Application, a valid sample sheet can be created by:

The run set up section of this guide includes specific instructions to plan a run and set up a valid sample sheet for each deployment of DRAGEN TruSight Oncology 500 Analysis Software.

Getting Started on Illumina Connected Analytics

Prerequisites

Illumina Connected Analytics (ICA) subscription includes access to DRAGEN TruSight Oncology 500 Analysis Software. To get started, you need:

  • An ICA account with a valid subscription

  • A positive balance of iCredits for data storage

Contact Illumina Customer Care at to obtain installation package for Illumina DRAGEN TruSight Oncology 500 (HRD) Analysis Application on NovaSeq 6000Dx

TSOCombined license has been pre-installed to DRAGEN servers in manufacturing since August 2022 and TSO500_HRD since February 2025 and additionally distributed to DRAGEN servers connected online. To generate a list of installed DRAGEN server licenses, run the following command: /usr/bin/dragen_lic. If a license is not installed, contact Illumina Customer Care at for the license.

A non-root user must be a member of the Docker group to run Docker. For more information on Docker permission requirements and alternatives to running as root, refer to the Docker documentation available on the .

DRAGEN TSO 500 Analysis Software v2.6.1 is multi-version compatible. Multi-version compatibility refers to ability to be installed on a single DRAGEN server with software running a different version of DRAGEN software. For example, multi-version compatible pipelines running DRAGEN v4.3.6 can be co-installed on a server alongside DRAGEN TSO 500 pipelines running DRAGEN v3.10.17. For more details on DRAGEN multi-version compatibility, please visit .

Contact Illumina Customer Care at to obtain the DRAGEN TruSight Oncology 500 Analysis Software installer package.

Use the following command to build the DRAGEN server hash table, which runs for approximately 60 minutes: /usr/local/bin/build-hashtable_DRAGEN_TSO500-2.6.1.sh Refer to if any errors occur.

Review license requirements, how to check which licenses are installed and how to receive a license in . Licenses can be installed before or after DRAGEN TSO 500 software installation.

Contact Customer Care at to request a license file for each of the needed licenses

TSOCombined license has been pre-installed to DRAGEN servers in manufacturing since August 2022 and TSO500_HRD since February 2025 and additionally distributed to DRAGEN servers connected online. To generate a list of installed DRAGEN server licenses, run the following command: /opt/edico/bin/dragen_lic. If a license is not installed, contact Illumina Customer Care at for the license.

A non-root user must be a member of the Docker group to run Docker. For more information on Docker permission requirements and alternatives to running as root, refer to the Docker documentation available on the .

Contact Illumina Customer Care at to obtain the DRAGEN TruSight Oncology 500 Analysis Software installer package.

Use the following command to build the DRAGEN server hash table, which runs for approximately 60 minutes: /usr/local/bin/build-hashtable_DRAGEN_TSO500-2.6.0.sh Refer to if any errors occur.

Review license requirements, how to check which licenses are installed and how to receive a license in . Licenses can be installed before or after DRAGEN TSO 500 software installation.

Contact Customer Care at to request a license file for each of the needed licenses

Sample Parameter
Required
Details
Sample Parameter
Required
Details
Sample Parameter
Required
Details
Sample Parameter
Required
Details

Refer to the following requirements to create sample sheets for running the analysis on ICA with Auto-launch. For sample sheet requirements common between deployments see . Samples sheets can be created using BaseSpace Run Planning Tool or manually by downloading and editing a sample sheet template

Refer to for this section's requirements.

Parameters
Required
Details
Parameters
Required
Details
Parameter
Required
Details
Parameter Name
Required

Refer to [ for this section's requirements.

Alternately, select Import Samples to upload sample information. Refer to for sample sheet requirements.

A sample sheet is required for each analysis with DRAGEN TruSight Oncology 500 Analysis Software. A sample sheet is a comma-separated value (*.csv) file format used by Illumina instruments, platforms, and analysis pipelines to store settings and data for sequencing and analysis. The DRAGEN TruSight Oncology 500 Analysis Software is compatible with the sample sheet v2. For general information on the sample sheet v2, refer to .

BaseSpace Run Planner (preferred), see for details

Downloading and modifying a sample sheet template following the requirements, see for details

Using the user interface of the DRAGEN TruSight Oncology 500 Analysis Application, see for details

Downloading and modifying a sample sheet template following the requirements (see for details), then importing it to Illumina Run Manager.

Refer to the for information on how to register ICA subscription and iCredits.

customercare@illumina.com
customercare@illumina.com
Docker website
page 7 of the DRAGEN v4.3.6 software release notes
customercare@illumina.com
Troubleshooting
customercare@illumina.com
customercare@illumina.com
Docker website
customercare@illumina.com
Troubleshooting
customercare@illumina.com
Licenses
Licenses

LibraryPrepKits

Required

Accepted values are: TSO500 or TSO500HT

SoftwareVersion

Required

The DRAGEN component software version. TruSight Oncology 500 2.6.0 requires 3.10.17. To ensure you are using the latest compatible version, refer to the software release notes.

AdapterRead1

Required

If using 8 bp indexes starting with UP or CP (used with TSO 500): AGATCGGAAGAGCACACGTCTGAACTCCAGTCA If using 10 bp indexes with UDP (used with TSO 500 HT): CTGTCTCTTATACACATCTCCGAGCCCACGAGAC Analysis fails if the incorrect adapter sequences are used

AdapterRead2

Required

If using 8 bp indexes starting with UP or CP (used with TSO 500): AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT If using 10 bp indexes with UDP (used with TSO 500 HT): CTGTCTCTTATACACATCTGACGCTGCCGACGA Analysis fails if the incorrect adapter sequences are used

AdapterBehavior

Required

Enter trim This indicates that the BCL Convert software trims the specified adapter sequences from each read.

MinimumTrimmedReadLength

Required

Enter 35. Reads with a length trimmed below this point are masked.

MaskShortReads

Required

Enter 35. Reads with a length trimmed below this point are masked.

Sample_ID

Required

Must match a Sample_ID listed in the TSO 500 Data section.

Index

Required

Index 1 sequence valid for Index_ID assigned to matching Sample_ID in the TSO 500 Data section.

Index2

Required

Index 2 sequence valid for Index_ID assigned to matching Sample_ID in the TSO 500 Data section.

Lane

Only for NovaSeq 6000 XP, NovaSeq 6000Dx, or NovaSeq X workflows

Indicates which lane corresponds to a given sample. Enter a single numeric value per row. Cannot be empty, i.e the analysis fails if the Lane column is present without a value in each row.

Sample_ID

Required

The unique ID to identify a sample. The sample ID is included in the output file names. Sample IDs are not case sensitive. Sample IDs must have the following characteristics: - Unique for the run. - 1–40 characters. - No spaces. - Alphanumeric characters with underscores and dashes. If you use an underscore or dash, enter an alphanumeric character before and after the underscore or dash. eg, Sample1-T5B1_022515. - Cannot be called all, default, none, unknown, undetermined, stats, or reports. - Must match a Sample_ID listed in the TSO 500 Data section. - Illumina recommends that the sample ID be based on the pair ID. Example: <Pair_ID>-DNA,<Pair_ID>-RNA. - Each sample must have a unique combination of Lane (if applicable), sample ID, and index ID or the analysis will fail.

Sample_Type

Required

Enter DNA or RNA. For HRD samples, this parameter must be DNA.

Pair_ID

Required

A unique ID that links DNA and RNA from the same biological sample from the same individual. Pair ID shares, at most, one DNA and one RNA sample per run. eg, if a Sample_ID is TestSample1-DNA for DNA and TestSample1-RNA for RNA, the Pair_ID TestSample1 will link these samples that are on different rows in the sample sheet together. If the pair ID is associated with more than one DNA or RNA sample, the analysis fails.

Sample_Feature

Required when using HRD add-on kit

Required for HRD enriched samples. For DNA samples that have undergone HRD enrichment, enter HRD in this column of the sample sheet. If the sample has not undergone HRD enrichment, leave the field empty.

Sample_Description

Not Required

Sample description must meet the following requirements: - 1–50 characters. - Alphanumeric characters with underscores, dashes and spaces. If you enter a underscore, dash, or space, enter an alphanumeric character before and after. eg, Solid-FFPE_213.

SoftwareVersion

Not Required

The TSO500S software version

StartsFromFastq

Required

Set the value to TRUE or FALSE. To auto-launch from BCL files, set to FALSE. To auto-launch from FASTQ files after auto-launch of BCL Convert, set to TRUE.

SampleSheetRequested

Required

Set the value to TRUE or FALSE.

To auto-launch from BCL files, set to FALSE. To auto-launch from FASTQ files after auto-launch of BCL Convert, set to TRUE.

Sample_ID

Not Required

The same sample ID used in the Cloud_TSO500S_Data section.

ProjectName

Not Required

The BaseSpace project name.

LibraryName

Not Required

Combination of sample ID and index values in the following format: sampleID_Index_Index2

LibraryPrepKitName

Required

The Library Prep Kit used.

IndexAdapterKitName

Not Required

The Index Adapter Kit used.

GeneratedVersion

Not Required

The cloud GSS version used to create the sample sheet. Optional if manually updating a sample sheet.

CloudWorkflow

Not Required

Ica_workflow_1

Cloud_TSO500_Pipeline

Required

This value is a universal record number (URN . The valid values are: Solid—urn:ilmn:ica:pipeline:8538a5e3-b8d2-469d-baaf-b2164e54cc51#DRAGEN_TruSight_Oncology_500_v2_6_0_2 Solid HRD —urn:ilmn:ica:pipeline:506f136e-980e-427d-ab39-f91654255bea#DRAGEN_TruSight_Oncology_500_HRD_v2_6_0_2

BCLConvert_Pipeline

Required

The value is a URN in the following format: urn:ilmn:ica:pipeline: <pipeline-ID>#<pipeline-name>

SoftwareVersion

Required

Enter the IRM iapp software version 2.6.0-2v12ui

Sample Sheet Introduction
Sample Sheet Requirements
Sample Sheet Creation in BaseSpace Run Planning tool
NovaSeq 6000Dx Run Set Up
Sample Sheet Templates
Illumina Connected Software - Sample Sheet
Sample Sheet Creation in BaseSpace Run Planner page
Sample Sheet Requirements page
Run Planning on Illumina Run Manager
Sample Sheet Requirements page
Software Registration page
Standard Sample Sheet Requirements
[TSO500_Data] Section
TSO500S_Data] Section
NovaSeq 6000Dx Analysis Application: Sample Sheet Requirements

Analysis Output

When the analysis run completes, the DRAGEN TruSight Oncology 500 Analysis Software generates an analysis output folder in a specified location.

To view analysis output, navigate to the analysis output folder and select the files that you want to view.

Single Node Analysis Output Folder Structure

Single output folder structure is as follows.

  • Logs_Intermediates

    • AdditionalSarjMetrics— Contains per pair ID calculations to support the PCT_TARGET_250X metric.

    • Annotation—Contains outputs for small variant annotation.

      • Subfolders per sample ID—Contains the aligned small variants JSON.

    • CombinedVariantOutput

      • Subfolders per pair ID—Contains the combined variant output TSV files.

      • A combined output log file.

    • Contamination

      • Subfolders per DNA sample ID—Contains the contamination metrics JSON file and output logs.

    • DnaDragenCaller

      • Subfolders per sample ID—Contains the aligned BAM and index files, small variant VCF and gVCF, copy number variant VCF, MSI JSON, exon coverage report bed, and QC outputs in CSV format.

    • DnaDragenExonCNVCaller

      • Subfolders per DNA sample ID—Contains the exon-level CNV JSON,the supporting calculation, and the QC files.

    • DnaFastqValidation—Contains the FASTQ validation output log for DNA samples.

    • FastqDownsample

      • Subfolders per RNA sample ID—Contains FASTQ files and output logs.

      • FastqDownsample output

    • FastqGeneration

    • Gis—Contains GIS-related files for HRD samples.

      • Subfolders per HRD sample ID—Contains the GIS JSON, the supporting calculation, and the QC files.

      • Also contains the annotated CNV VCF and gene level TSV file with absolute copy number and minor copy number information

    • LrAnnotation

      • Subfolders per DNA sample ID—Contains the annotated exon-level CNV JSON.

    • LrCalculator

      • Subfolders per DNA sample ID—Contains the exon-level CNV VCF.

    • MetricsOutput

      • Subfolders per pair ID—Contains the metrics output TSV files.

      • A combined output log file.

    • ResourceVerification—Contains the resource file checksum verification logs.

    • RnaAnnotation

      • Subfolders per RNA sample ID—Contains the annotated splice variant JSON.

    • RnaDragenCaller

      • Subfolders per sample ID—Contains the aligned BAM, fusion candidates CSV, exon coverage report bed and QC outputs in CSV format.

    • RnaFastqValidation—Contains the FASTQ validation output log for RNA samples.

    • RnaFusion

      • Subfolders per RNA sample ID—Contains the All Fusions CSV and Fusion Processor logs.

    • RnaQcMetrics

      • Subfolders per RNA sample ID—Contains the RNA QC metrics JSON.

    • RnaSpliceVariantCalling

      • Subfolders per RNA sample ID—Contains the splice variants VCF.

    • Run QC—Contains the Run QC metrics JSON, Intermediate Run QC metrics JSON, and log file.

    • SampleAnalysisResults

      • Subfolders per pair ID—Contains the Sample Analysis Results JSON and detailed log file.

      • SampleSheetValidation—Contains the Intermediate sample sheet and validation log.

    • Tmb

      • Subfolders per DNA sample ID—Contains the TMB metrics CSV, TMB trace TSV, and related files and logs. passing_sample_steps.json —Contains the steps passed for each sample ID. pipeline_trace.txt—Contains a summary and troubleshooting file that lists each Nextflow task executed and the status (for example, COMPLETED or FAILED). run.log—Contains a complete trace-level log file describing the Nextflow pipeline execution. run_report.html—Contains high-level run statistics (performance, usage, etc.) run_timeline.html —Contains timeline-related information about the analysis run.

  • Results

    • Metrics Output TSV (all pair IDs)

    • Pair ID—The following outputs are produced for each sample:

      • Combined Variant Output TSV

        • Metrics Output TSV

        • TMB Trace TSV

        • Small Variant Genome VCF

        • Small Variant Genome Annotated JSON

        • Copy Number Variant VCF

        • GIS JSON

        • MSI JSON

        • Large Rearrangements CNV VCF

        • Large Rearrangements CNV Annotated JSON

        • All Fusion CSV

        • Splice Variant VCF

        • Splice Variant Annotated JSON

        • Exon Coverage Report TSV

        • Gene Coverage Report TSV

Multiple Node Analysis Output Folder Structure

Multiple output folder structure is as follows.

  • Demultiplex Output

    • A Logs_Intermediates folder containing FASTQ files per sample.

  • Node(X) Output—The following outputs are produced for each node used:

    • A Logs_Intermediates folder containing step specific and component specific outputs and logs for every step/component run in the analysis pipeline for the sample run on the node.

    • A Results folder containing results only for the sample run on the node.

  • Gathered Output

    • A Logs_Intermediates folder containing step specific and component specific outputs and logs for every step/component run in each analysis pipeline on every node—this contains outputs for all samples and pairs ran across all nodes in the analysis.

    • A Results folder containing results for all samples and pairs ran across all nodes—results are organized by Pair_ID, then Sample_ID. This folder also contains summary files which contain information on all samples.

ICA Output Folder Structure

This section describes each output folder generated during analysis and where to find metric and analytic files when the pipeline is executed. The same output folder structure and content exist in ICA and BaseSpace Sequence Hub.

High-Level Folder Structure

  • Run ID

    • TSO500_Nextflow_logs

      • _manifest.json

    • Results

      • _tags.json

    • Logs_intermediates

    • Errors—This folder is only present when analysis fails

TSO500_Nextflow_logs Folder Structure

The TSO_500_Nextflow_Logs provides information related to the execution of the pipeline on ICA as a whole and for specific nodes (when an analysis is split across multiple nodes). It contains files used to execute parts of the workflow on different nodes as well as records of the nextflow execution on those nodes.

  • TSO_500_Nextflow_Logs

    • _manifest.json

Results Folder Structure

Contains the aggregated MetricsOutput.tsv file at the root level. Additionally, the Results folder contains a subfolder for each pair ID.

  • Results

    • MetricsOutput.tsv

    • Sample_1

    • Sample_2

    • Sample_<#>

    • _tags.json

The Results subfolder contains the following files:

  • Results

    • MetricsOutput.tsv

    • <Pair_id>

      • CombinedVariantOutput.tsv

      • <SampleName>_MetricsOutput.tsv

    • <DNA_Sample_id>

      • CopyNumberVariants.vcf

      • DNAMergedSmallVariants_Annotated.json.gz

      • MergedSmallVariants.genome.vcf

      • MergedSmallVariants.vcf

      • microstat_output.json

      • TMB_Trace.tsv

    • <RNA_Sample_id>

      • AllFusions.csv

      • RNA_Annotated.json.gz

      • SpliceVariants.vcf

Logs_intermediates Folder Structure

Contains folders for each submodule in the DRAGEN TSO 500 on ICA pipeline. The folders contain a copy of all the relevant files required to create the metric output files and report files, as well as the combined log files at the root level and subfolders for each sample.

  • Logs_intermediates

    • DnaDragenCaller

    • AdditionalSarjMetrics

    • CombinedVariantOutput

    • FastqGeneration

    • MetricsOutput

    • DnaDragenExonCnvCaller

    • DnaFastqValidation

    • DNACoverageReport

    • Gis

    • Tmb

    • SampleAnalysisResults

    • SampleSheetValidation

    • passing_sample_steps.json

    • RnaFusion

    • Contamination

    • Annotation

    • RnaAnnotation

    • RnaDragenCaller

    • RnaSpliceVariantCalling

    • RunQc

    • FastqDownsample

    • PassingSampleSteps

    • ResourceVerification

    • LrCalculator

    • LrAnnotation

    • RnaQcMetrics

    • RnaFastqValidation

    • RNACoverageReport

Errors Folder Structure

Contains Errors.tsv. This file contains the summary of all the errors encountered during pipeline execution.

  • Errors

    • Errors.tsv

NovaSeq 6000Dx Analysis Application Output Folder Structure

The following files and folders are created during analysis by NovaSeq 6000Dx Analysis Application:

  • analysisResults.json

  • CopyComplete.txt

  • edgeos.nextflow.config

  • inputs/

    • sampleMapping.json

    • SampleSheet.csv

    • SampleSheet.json

  • Manifest.tsv

  • params.json

  • Results/

  • workflowLogs/

    • nf-main-***.log

When the analysis run completes, the analysis application generates an analysis output in a specified location. To view analysis output, follow the steps below:

  1. On the “Completed” runs tab, select the run

  2. Review the run details page, and this will give the information to access the output folder

  3. External Location: is the input for the run

  4. Analysis Output Folder: is where the output is stored. To navigate to this page, follow the “server location” and the gds analysis output folder

  5. Navigate to the directory that contains the analysis output folder

  6. Open the folder, and then select the files that you want to view

Sample Sheet Creation in BaseSpace Run Planning tool

How to Create TSO 500 Sample Sheets in BaseSpace Run Planning tool

The sections below represent each step in the BaseSpace Run Planning tool.

Note that NovaSeq X Series has a different run set up configuration screen than other instrument platforms. TSO 500 does not support multi analysis, and in order to run TSO 500 on NovaSeq X Series, enter the appropriate Read 1, Read 2, Index 1 and Index 2 described in the instructions below.

Step 1: Run Settings

Parameter Name
Required
Description

Run Name

Required

Run Name can contain 255 alphanumeric characters, dashes, underscores, periods, and spaces; and must start with an alphanumeric, a dash or an underscore.

Run Description

Optional

Run Description can contain 255 characters except square brackets, asterisks, and commas.

Instrument Platform

Required

Choose from TSO 500 supported instruments:

  • NextSeq 500/550

  • NextSeq 1000/2000

  • NovaSeq 6000/6000Dx

  • NovaSeq X Series

Secondary Analysis

Required

  • BaseSpace/Illumina Connected Analytics (to generate sample sheet for cloud analysis)

  • Local

Read 1

Required on Instrument Platform NovaSeq X Series

  • Fill with value 101 for TSO 500 analysis

Index 1

Required on Instrument Platform NovaSeq X Series

  • Fill the value depending on the TSO 500 assay used:

  • TSO 500 HT: 10

  • TSO 500: 8

Index 2

Required on Instrument Platform NovaSeq X Series

  • Fill the value depending on the TSO 500 assay used:

  • TSO 500 HT: 10

  • TSO 500: 8

Read 2

Required on Instrument Platform NovaSeq X Series

  • Fill with value 101 for TSO 500 analysis

Sample Container ID

Optional

  • Unique Identifier for the container that holds the sample

Step 2: Configuration

Note: On NovaSeq X Series, this page is called "Configuration 1". The right hand corner of the UI displays the Read 1, Read 2, Index 1 and Index 2 entered on the previous run settings screen.

Parameter Name
Required
Description

Application*

Required

  • DRAGEN TruSight Oncology 500 Analysis Software - 2.6.0 (with HRD)

  • DRAGEN TruSight Oncology 500 Analysis Software - 2.6.0

Description

Optional

Optional text field

Library Prep Kit

Required

  • TruSight Oncology 500

  • TruSight Oncology 500 High Throughput

Index Adapter Kit

Required

TSO 500:

  • TruSight Oncology 500 (NovaSeq6000Dx with S1, S2, S4, or SP Flow Cell, NextSeq1000/2000, NovaSeqX Series)

  • TruSight Oncology 500 (NovaSeq6000, NextSeq)

TSO 500 HT:

  • TruSight Oncology 500 High Throughput (NovaSeq6000Dx with S1, S2, S4, or SP Flow Cell, NextSeq1000/2000, NovaSeqX Series)

  • TruSight Oncology 500 High Throughput (NovaSeq6000, NextSeq)

Step 3: Sample Settings

Users can manually enter sample information, or download a template file to bulk upload sample information. Users can import the completed template or a compatible sample sheet.

Parameter Name
Required
Description

Read Lengths: Read 1 and Read 2

Required Not applicable on NovaSeq X Series

Auto filled with the standard values, but can be optionally overwritten.

Override Cycles

Required on NovaSeq X Series

Entered based on Run Settings read lengths & index 1 / index 2

Lane Usage

Not applicable on NovaSeq X Series or NextSeq 1000 / 2000

Checkbox allows users to apply the same lane across samples.

Lane

Required if Lane Usage is unchecked Not applicable on NextSeq 1000 / 2000

Specify lanes for each sample. The unmarked checkbox at the top of the dropdown selects all lanes.

Pair ID

Required

The identifier used to pair DNA and RNA samples in a run. The field is mandatory whether a sample is part of a pair, or not.

To note: The Sample ID field in the generated samplesheet will be auto-filled based on the Pair ID values captured. “_dna” and “_rna” (for DNA and RNA samples respectively) will be appended to the Pair ID value to create the Sample ID.

DNA Index ID

Required

Index set ID options are based on selected Index Adapter Kit

DNA Sample Feature

Required for TSO 500 HRD

Column appears when TSO 500 HRD application is selected. Enter for HRD enriched DNA Samples

RNA Index ID

Required

Index set ID options are based on selected Index Adapter Kit

Project

Optional

Optional field to describe the associated project

Starts from Fastq

Required

True or False

If auto-launching TSO 500 from BCL files, set the value to False. If auto-launching TSO 500 from FASTQ after auto-launching BCL Convert, set the value to True.

DNA Barcode Mismatches Index 1**

DNA Barcode Mismatches Index 2**

RNA Barcode Mismatches Index 1**

RNA Barcode Mismatches Index 2**

Required on NovaSeq X

Default value is set to 1.

These fields are required by NovaSeq X and represent BCL Convert settings for index diversity checks when demultiplexing. These values are not used in TSO 500 analysis.

Step 4: Run Review

Once all details are captured and pass validation, the user can review the details on the Run Review screen. From here they can choose to edit details in previous screens or export the sample sheet. Once completed, press the Cancel button to finish run planning.

Note: once leaving this screen, the run and sample sheet will not be accessible.

For NovaSeqX Plus users, the run can be saved as a draft or as a planned run (via “Save as Draft” and “Save as Planned” buttons respectively). Either selection will save the run to the Planned Runs screen on BaseSpace. There is no option to export the sample sheet on this screen.

Planned Runs Screen (NovaSeq X Series only)

The Planned Runs screen lists all planned or drafted runs. Users can set drafted runs to planned, export the sample sheet, and edit or delete a run on this screen.

Once the run is saved as Planned, it will appear on the NovaSeq X Series instrument where it can be selected for sequencing.

Guided Examples

Please review these guided examples of analysis workflows that include a step of setting up a run in BaseSpace Run Planning tool:

Analysis Launch on Standalone DRAGEN Server

Start the DRAGEN TruSight Oncology 500 Analysis Software with the DRAGEN_TSO500-2.6.0.sh Bash script. The script is installed in the /usr/local/bin directory. The Bash script is executed on the command line and runs the software with Docker (or Apptainer if specified).

  • Path to the sequencing run or FASTQ folder. Copy the run or FASTQ folder to the DRAGEN server into the staging folder with the following recommended organization: /staging/runs/{RunID}. You can copy the run folder onto the DRAGEN server using Linux commands such as rsync. The sample sheet within the run folder is used unless otherwise specified through the command line.

Before running the analysis, confirm that the output directory for the software to write to is empty and does not include results of previous analyses.

Storage Requirements

For optimal performance, run analysis on data stored locally on the DRAGEN server. Analysis of data stored on NAS can take longer and performance can be less reliable.

The DRAGEN server provides an NVMe SSD in the /staging directory to use as the software output directory. Network-attached storage is required for long-term storage.

When running the DRAGEN TruSight Oncology 500 Analysis Software, use the default settings or set the -analysisFolder command line option to a directory in /staging to make sure the DRAGEN server processes read and write data on the NVMe SSD.

Before beginning analysis, develop a strategy to copy data from the DRAGEN server to a network‑attached storage. Delete output data on the DRAGEN server as soon as possible.

The following are the run and analysis output sizes for each sequencing system per 101 bp:

When launching the analysis, the software checks that the minimum disk space required is available. If the minimum disk space is not available, the software shows an error message and prevents analysis from starting. If disk space is exhausted during a run, the run shows an error and stops analyzing.

Moving or modifying files during an analysis may cause the analysis to fail or provide incorrect results.

Analysis Launch on ICA

Methods for Launching Analysis

Illumina Connected Analytics (ICA) supports the following methods for launching DRAGEN TruSight Oncology 500 Analysis Software.

For more information about using ICA or BaseSpace Sequence Hub, refer to the following support pages on the Illumina support site.

Command-Line Options

You can use the following command-line options with DRAGEN TruSight Oncology 500 Analysis Software.

To learn more about the input requirements, use the --help command-line option.

Note:

  • Use full paths when specifying the file paths in the command line.

  • Avoid special characters such as &, *, #, and spaces.

  • When starting from BCL files, only the run folder needs to be specified. The immediate parent directory containing the BCL files does not need to be specified.

When running the analysis software using SSH, Illumina recommends using additional software to prevent unexpected termination of analysis. Illumina recommends screen and tmux.

  1. Wait for any running DRAGEN TruSight Oncology 500 Analysis Software containers to complete before launching a new analysis. Run the following command to generate a list of running containers:docker ps

  2. Select from one of the following options:

  • Start from BCL files in the run folder with the sample sheet included in the run folder. DRAGEN_TSO500-2.6.0.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName}

  • Start from BCL files in the run folder with the sample sheet located in a folder other than the run folder. DRAGEN_TSO500.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleSheet /staging/{SampleSheetName}.csv

  • Start from BCL files in the run folder with a different sample sheet and demultiplexing only. DRAGEN_TSO500-2.6.0.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleSheet /staging/{SampleSheetName}.csv \ --demultiplexOnly

  • Start from FASTQ with the sample sheet included in the FASTQ folder and with different resources and hash table folders. DRAGEN_TSO500-2.6.0.sh \ --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources \ --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable \ --fastqFolder /staging/{FastqFolderName} \ --analysisFolder /staging/{AnalysisFolderName}

  • Start from FASTQ folder with sample sheet included in the FASTQ folder and subset of samples or pairs. DRAGEN_TSO500-2.6.0.sh \ --fastqFolder /staging/{FastqFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleOrPairIDs "Pair_1,Pair2"

Starting from BCL Files

If starting from BCL (*.bcl) files, DRAGEN TruSight Oncology 500 Analysis Software requires the run folder to contain certain files and folders. These inputs are required for Docker.

The run folder contains data from the sequencing run, make sure that the folder contains the following files:

Starting from FASTQ Files

The following inputs are required for running the DRAGEN TruSight Oncology 500 Analysis Software using FASTQ (*.fastq) files. The requirements apply to Docker.

  • Full path to an existing FASTQ folder.

  • The sample sheet is in the FASTQ folder path, or you can set the path to the sample sheet with the --sampleSheet override command line option.

Make sure there is sufficient disk space for the analysis to complete. Refer to the --help command line argument details for disk space requirements.

Use BCL Convert to produce FASTQ files for DRAGEN TruSight Oncology 500 Analysis Software. Using bcl2fastq does not produce the same results and is discouraged.

Make sure that BCL Convert is set to write UMI sequences to the read headers in the FASTQ files.

FASTQ File Organization

Store FASTQ files in individual subfolders that correspond to a specific Sample_ID. Keep file pairs together in the same folder. Alternatively, store the FASTQ files in one flat folder structure where the FASTQ files are stored in one folder.

The DRAGEN TruSight Oncology 500 Analysis Software requires separate FASTQ files per sample. Do not merge FASTQ files.

The instrument generates two FASTQ files per flow cell lane, so that there are eight FASTQ files per sample.

Sample1_S1_L001_R1_001.fastq.gz

  • Sample1 represents the Sample ID.

  • The S in S1 means sample, and the 1 in S1 is based on the order of samples in the sample sheet, so S1 is the first sample.

  • L001 represents the flow cell lane number.

  • The R in R1 means Read, so R1 refers to Read 1.

Sample Sheet Templates

Sample Sheet templates for TSO 500 v2.6.0 standalone DRAGEN server and ICA manual launch analysis can be found in the table below. For auto-launch compatible sample sheets, use BaseSpace Run Planner.

DRAGEN TSO 500 analysis software is compatible with several instruments and assay workflows (standard, XP), each of which have implications for the sample sheet.

Sample sheet templates contain all required fields, including index sequences in the proper orientation for all indexes from a given library prep kit. The templates are provided as a starting point for creating a sample sheet manually when launching analysis on a standalone DRAGEN server or on ICA using manual launch.

*Lane numbers cannot exceed what is supported by the flow cell in use.

Auto-Launch of DRAGEN TSO 500 Analysis on ICA

Auto-launch Prerequisites and Workflow

*The BaseSpace Sequence Hub setting for run monitoring and storage must be selected on the instrument to use DRAGEN TSO 500 analysis auto-launch. For information on preparing your instrument for DRAGEN TSO 500 Auto-launch, refer to the documentation for your instrument.

  1. Use BaseSpace Sequence Hub Run Planning tool or the sample sheet templates provided on the support page to create and export a sample sheet.

    1. If BaseSpace Run Planning tool is not available in your region, use the sample sheet template.

    1. Data is uploaded to BaseSpace Sequence Hub and then pushed to ICA. You can monitor the run in BaseSpace Sequence Hub.

    2. Analysis auto launches in ICA when sequencing and the upload completes. You can monitor the status of the analysis in BaseSpace Sequence Hub or ICA

    3. If necessary, you can requeue the analysis via BaseSpace Sequence Hub.

  2. View the analysis output results in either BaseSpace Sequence Hub or ICA.

To avoid invalid sample sheet configurations, Illumina recommends using BaseSpace Run Planning tool to generate sample sheets. Using an invalid sample sheet can result in failed runs and analyses.

BaseSpace Sequence Hub Requirements for ICA Auto-Launch

BaseSpace Run Planning tool is a multi-step workflow that generates a manual launch or auto-launch capable sample sheet for export and requires the following additional settings:

  • Access to BaseSpace Sequence Hub.

  • ICA Run Storage is enabled under BaseSpace Sequence Hub settings.

Requeue Analysis

You can requeue analysis of a run via the run's Summary page in BaseSpace Sequence Hub.

Minimum Storage Requirements on ICA

Guided Examples

Please review these guided examples of using DRAGEN TSO 500 Analysis Software with auto-launch on ICA:

Run on Multiple DRAGEN Servers

DRAGEN TruSight Oncology 500 Analysis Software can be used to run a subset of samples on different DRAGEN servers to decrease overall processing time. This is possible using a three stage process called scatter/gather, which consists of demultiplexing, analysis, and result gathering.

The first stage is demultiplexing. Demultiplexing runs once on the entire run folder, generates FASTQ files for each sample in the run, and then separates sample files into respective folders. Once complete, note the output directory containing the sample directories holding the FASTQ files.

The process for scattering the analysis on multiple DRAGEN servers is as follows:

  1. Determine how many DRAGEN servers are available to run.

  2. Run demultiplexing on a single DRAGEN server.

Moving or modifying files during an analysis may cause the analysis to fail or provide incorrect results.

To sequence runs on multiple DRAGEN servers using the NovaSeq 6000 XP workflow, modify the sample sheet to include a subset of the lanes. For example, on an S2 flowcell, create two modified sample sheets with one containing the samples from lane 1 and the other from lane 2. This allows only the sample sheet to be modified instead of copying files between servers. This strategy would use the start from Run Folder commands without the --demultiplexOnly option. The entire run folder would need to be copied to each analysis server as demultiplexing is performed once per server.

  1. Transfer the FASTQ folder output from the original DRAGEN server to additional servers.

    1. Logs_Intermediates/FastqGeneration.

  2. Run analysis software using the --fastqFolder option on both the original and additional DRAGEN servers.

    • Option 1 Copy the original SampleSheet.csv to each server. Then provide a subsetted list to the Bash script on each DRAGEN server with the intended samples/pairs to run.

    • Option 2 Copy and modify the SampleSheet.csv to each DRAGEN server to only contain the list of samples/pairs to run. The software verifies that all samples in the sample sheet are contained within the FASTQ folders unless the --sampleOrPairIDs command-line option is present in the analysis launch. Failure to account for these checks results in an error.

  3. Copy the results from demultiplexing and each analysis run onto a single server, and then generate the final /Results directory, which contains the aggregated results. Enter the --gather command followed by the output directories of the demultiplexing step and each individual analysis run.

Commands for Multinode Analysis

The BaseSpace Sequence Hub Run Planning tool is available, and is used to generate a valid sample sheet in v2 format for use on a TSO 500 supported sequencer for both ICA and Standalone DRAGEN Server analysis options. Filling out the form on the user interface will produce a exportable sample sheet with the required fields filled in. Refer to for descriptions of fields that appear in ICA sample sheets.

BaseSpace Run Planning tool cannot generate a valid sample sheet for NovaSeq 6000Dx Analysis Application. Refer to to create a valid sample sheet.

For more information on run planning, refer to the .

For arguments, refer to . You can start from BCL files or from the FASTQ folder produced by BCL Convert. The following requirements apply for both methods:

Run folder must be intact. Refer to for input requirements.

If the analysis output folder path is different from the default, provide the analysis output folder path. Refer to .

Sequencing System
Run Folder Output (Gb)
Analysis Output (Gb)
Minimum Disk Space (Gb)

—Stream run data directly from the instrument to ICA via a specially configured sample sheet and automatically begin DRAGEN TSO 500 analysis.

—Initiate DRAGEN TSO 500 analysis on ICA using the run files and sample sheet files in the project.

Option
Required
Description
Folder/File
Description

The FASTQ folder structure conforms to the folder structure in

For interactive run planning or to create a sample sheet for ICA Autolaunch, use to create valid sample sheets for either local or cloud analysis. To set up a run in BaseSpace run planner, refer to .

Users can visit the section to learn additional details on required fields and values as they fill-in their sample information. Use the lookup table below to select and download the sample sheet template that matches your instrument, assay, and workflow configuration:

Assay
Instrument
Assay Workflow
File

Import the sample sheet to the instrument and start the sequencing run. Refer to for sample sheet guidance.

Refer to the for information on setting up a BaseSpace Sequence Hub project.

Refer to the for more information on requeuing an analysis.

Sequencing System
Minimum Disk Space (Gb)

Refer to the for information on how to manage accounts and subscriptions.

Step
Command
BaseSpace Sequence Hub support site page
NovaSeq 6000Dx: TSO 500 Auto-launch Analysis in Cloud
NextSeq 500/550Dx: TSO 500 and Connected Insights Auto-launch Analysis in Cloud
ICA Auto-launch Sample Sheet Requirements
Sample Sheet Requirements page

NextSeq 500/550/550Dx (RUO) HO flow cell

32-55

82-85

150

NovaSeq 6000/6000Dx (RUO) SP Flow Cell

85-100

250-374

300

NovaSeq 6000/6000Dx (RUO) S1 Flow Cell

164-200

360-665

800

NovaSeq 6000/6000Dx (RUO) S2 Flow Cell

290-460

890-1600

1500

NovaSeq 6000/6000Dx (RUO) S4 Flow Cell

800-1200

2700-4100

3000

NovaSeq X 1.5B

213

352

800

NovaSeq X 10B

1100

1800

3000

NovaSeq X 25B

1800

3300

4000

NextSeq 1000/2000

41

107

150

Config folder

Configuration files

Data folder

*.bcl files

Images folder

[Optional] Raw sequencing image files.

Interop folder

Interop metric files.

Logs folder

[Optional] Sequencing system log files.

RTALogs folder

Real-Time Analysis (RTA) log files.

RunInfo.xml file

Run information.

RunParameters.xml file

Run parameters.

SampleSheet.csv file

Sample information. If you want to use a sample sheet that is not in the run folder or a sample sheet named something other than SampleSheet.csv, provide the full path.

NextSeq 500/550/550Dx (RUO) HO flow cell

350

NovaSeq 6000/6000Dx (RUO) SP Flow Cell

500

NovaSeq 6000/6000Dx (RUO) S1 Flow Cell

1100

NovaSeq 6000/6000Dx (RUO) S2 Flow Cell

2500

NovaSeq 6000/6000Dx (RUO) S4 Flow Cell

4300

NovaSeq X 1.5B

2000

NovaSeq X 10B

4300

NovaSeq X 25B

8400

NextSeq 1000/2000

350

Demultiplexing

DRAGEN_TSO500_2.6.0.sh --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable --runFolder /staging/{RunFolderName} --analysisFolder /staging/{DemultiplexAnalysisFolderName} --demultiplexOnly --sampleSheet /staging/illumina/{SampleSheetName}

Analysis (one server)

DRAGEN_TSO500_2.6.0.sh --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable --fastqFolder /staging/{DemultiplexAnalysisFolderName}/Logs_Intermediates/FastqGeneration/ --analysisFolder /staging/{Node1AnalysisFolderName} --sampleSheet /staging/illumina/{SampleSheetName} --sampleOrPairIDs Pair_1,Pair_2

Analysis (additional servers)

DRAGEN_TSO500_2.6.0.sh --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable --fastqFolder /staging/{DemultiplexAnalysisFolderName}/Logs_Intermediates/FastqGeneration/ --analysisFolder /staging/{Node1AnalysisFolderName} --sampleSheet /staging/illumina/{SampleSheetName} --sampleOrPairIDs Pair_3

Gather

DRAGEN_TSO500_2.6.0.sh --analysisFolder /Gathered_Results --resourcesFolder staging/illumina/DRAGEN_TSO500/resources --runFolder /staging/{RunFolderName}/--sampleSheet /staging/illumina/{SampleSheetName} --gather /Demultiplex_Output /Node1_Output /Node2_Output

TSO500

NextSeq 550

Standard

TSO500 + HRD

NextSeq 550

Standard

TSO500 + HRD

NovaSeq 6000

Standard

TSO500 + HRD

NovaSeq 6000Dx (in RUO mode)

Standard

TSO500 HT

NextSeq 1000 / 2000

Standard

TSO500 HT

NovaSeq 6000

Standard

TSO500 HT

NovaSeq 6000

XP*

TSO500 HT

NovaSeq 6000Dx (in RUO mode)

Standard

TSO500 HT

NovaSeq 6000Dx (in RUO mode)

XP*

TSO500 HT

NovaSeq X

Standard

TSO500 HT

NovaSeq X

XP*

Combined Variant Output

File name: {Pair_ID}_CombinedVariantOutput.tsv

The combined variant output file contains the variants and biomarkers in a single file that is based on a single sample. If using pair ID, the file is based on paired DNA and RNA samples from the same individual. The output contains the following variant types and biomarkers:

  • Small variants

  • Copy number variants (CNV) (with absolute copy number when HRD Assay is run)

  • TMB

  • MSI

  • Fusions

  • Splice variants

  • GIS (when HRD Assay is run)

  • Gene-level Loss of Heterozygosity (when HRD Assay is run)

  • Large Rearrangements

The combined variant output file also contains Analysis Details and Sequencing Run Details sections. The details of each are listed in the following table:

Analysis Details
Sequencing Run Details

- Pair ID - DNA sample ID (if DNA is run) - RNA sample ID (if RNA is run)

- Library Prep Kit - Output date - Output time - Module version - Pipeline version (Docker image version #)

- Run name - Run date - DNA sample index ID (if DNA is run) - RNA sample index ID (if RNA is run) - [HRD] Sample feature - Instrument ID - Instrument control software version - Instrument type

- RTA version - Reagent cartridge lot number

Combined variant output produces small variants with blank fields in the following situations:

  • The variant has been matched to a canonical RefSeq transcript on an overlapping gene not targeted by TruSight Oncology 500.

  • The variant is located in a region designated iSNP, indel, or Flanking in the TST500_Manifest.bed file located in the Resources folder.

Variant Filtering Rules

  • Small Variants - All variants with the FILTER field marked as PASS in the hard-filtered genome VCF are present in the combined variant output.

    • Gene information is only present for variants belonging to canonical transcripts that are within the Gene Allow List–Small Variants.

    • Transcript information is only present for variants belonging to canonical transcripts that are within the Gene Allow List–Small Variants.

  • Copy Number Variants - Copy number variants must meet the following conditions:

    • FILTER field marked as PASS.

    • ALT field is <DUP or <DEL> .

  • Fusion Variants - Fusion variants must meet the following conditions:

    • Passing variant call (KeepFusion field is true).

    • Contains at least one gene on the fusion allow list.

    • Genes separated by a dash (-) indicate that the fusion directionality could be determined. Genes separated by a slash (/) indicate that the fusion directionality could not be determined.

  • Biomarkers TMB/MSI - Always present when DNA sample is processed.

  • Splice Variants - Passing splice variants that are contained on genes EGFR, MET, and AR.

  • Biomarker GIS - Present only if TruSight Oncology 500 HRD analysis is performed

  • Loss of Heterozygosity - Present only when TruSight Oncology 500 HRD is run. Loss of heterozygosity (LOH) must meet the following condition:

    • MCN field is equal to 0

  • Large Rearrangements CNV - Large Rearrangements CNVs must meet the following conditions:

    • BRCA1 or BRCA2 contains at least one affected exon.

    • ALT field is <DUP> or <LOSS> .

DNA Expanded Metrics

DNA expanded metrics are provided for information only. They can be informative for troubleshooting but are provided without explicit specification limits and are not directly used for sample quality control. For additional guidance, contact Illumina Technical Support.

Metric
Description
Troubleshooting

TOTAL_PF_READS (count)

Total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence.

Primarily driven by data output of sequencer, quality of library and balancing of library in library pool. If TOTAL_PF_READS is in line with other samples, but coverage metrics are more may suggest non-specific enrichment.

Low values for all samples indicate a poor quality run with possible low cluster numbers or low numbers of Q30 and PF%.

A low value for an individual sample indicates poor pooling of this library into the final pool.

MEAN_FAMILY_SIZE (count)

A UMI Family is a group of reads that all have the same UMI barcode. The family size is the number of reads in family. MEAN_FAMILY_SIZE is the mean of the entire population of reads assembled into UMI families.

The mean UMI family size decreases with increased unique read numbers, and more input DNA leads to more unique reads. Conversely over sequencing of a fixed population of unique DNA molecules leads to increased family size.

As a guide, for a good run with optimal cluster density, passing specs, even sample pooling, and good quality DNA we usually observe values <10.

UMI family size = 1 is not ideal as it is harder to correct for errors.

UMI family size of 2 to 5 enables efficient error correction without wasting sequencing capacity on high percentages of duplicate reads.

MEDIAN_TARGET_COVERAGE (count)

Median depth across all the unique loci occurring in all regions of the manifest file.

Lower median target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output.

PCT_CHIMERIC_READS (%)

Chimeric reads occur when one sequencing read aligns to two distinct portions of the genome with little or no overlap. Metric is proportion of total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence.

While this can be indicative of large-scale structural rearrangement of the genome, values that are elevated above the usual baseline may indicate enrichment probe contamination during library preparation. A suggested metric USL is 8% (those that are higher might see decrease performance in small variant and tmb scores).

PCT_EXON_100X (%)

Percentage of exon bases with 100X fragment coverage. Calculated against all regions in manifest containing _exon in name.

Can be used in combination with other PCT_EXON metrics to understand under or over coverage of exons.

PCT_READ_ENRICHMENT (%)

Percentage of reads that have overlapping sequence with the target regions defined in the sample manifest.

Indicative of general enrichment performance. Reduced proportions of enriched reads may indicate issues with the enrichment proportion of the library preparation.

PCT_USABLE_UMI_READS (%)

Percentage of reads that have valid UMI sequences associated with them.

As UMI reads are sequenced at the start of each read, loss of valid UMI sequence may be cause by sequencing issues impacting the quality of base calling in this portion of the sequencing read.

MEAN_TARGET_COVERAGE (count)

Mean depth across all the unique loci defined in the manifest file.

Lower mean target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output. Large differences between the median and mean target coverage values may indicated a skewed distribution of target coverage.

PCT_ALIGNED_READS (%)

Proportion of aligned reads that are non-supplementary, non-secondary and pass QC versus aligned reads that are non-supplementary, non-secondary, mapped and pass QC.

PCT_CONTAMINATION_EST (%)

This metric should only be evaluated if the CONTAMINATION_SCORE metric exceed the USL. This metric estimates the amount of contamination in a sample. The contamination level is computed by taking 2.0* the average of the adjusted allele frequencies of all variants that were selected. The adjusted alllele frequency is either the actual allele frequency of the variant if it is less than 0.5, or 1 -allele frequency if it is greater than or equal to 0.5.

If the sample does not fail the CONTAMINATION_SCORE this metric has no intended meaning as it will be driven by statistical noise (e.g. the few variants that naturally fall outside an expected interval around 0.5 due to random chance)

High contamination estimates may be due to any of the following:

Inter-sample contamination caused by mixing of samples during extraction or library preparation.

Intra-sample contamination, due to mixing of clonally different cell populations during extraction. Large scale genomic rearrangements that cause unexpected VAFs for large numbers of variants.

PCT_TARGET_0.4X_MEAN (%)

Parentage of target (all locations in manifest) reads that have a coverage depth of greater the 0.4x the mean target coverage depth (see definition above).

Provides an indication of uniformity of coverage of the target regions in the manifest file. When trended over time reductions in this metric may indicate an issue with the enrichment process resulting in coverage bias.

PCT_TARGET_50X (%)

Percentage of target bases with 50X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_TARGET_100X (%)

Percentage of target bases with 100X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_TARGET_250X (%)

Percentage of target bases with 250X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_SOFT_CLIPPED_BASES (%)

percentage of based that were not used for alignment but retained as part of the alignment file

Soft clipped reads are used as a part of the downstream analysis for small variants calling. A higher-than-expected number could indicate a low-quality enrichment step.

PCT_Q30_BASES (%)

Average percentage of bases ≥ Q30. A prediction of the probability of an incorrect base call (Q‑score).

An indicator of sequencing run quality, low Q30 across all samples on a run could be the result of run overclustering.

ALLELE DOSAGE_RATIO (with HRD add-on)

Proprietary Myriad Genetics estimate of b-allele dosage based on b-allele noise/signal ratio. B-Allele noise is correlated with coverage; lower coverage samples will have higher noise. B-allele signal is also correlated with tumor fraction; a higher tumor fraction produces a higher signal for b-allele sites. Samples with lower tumor fraction and higher amount of noise (or lower coverage) will have higher Allele Dosage Ratio. The upper limit of the score is 50, therefore any sample with 50 Allele Dosage Ratio can be assumed to have tumor fraction close to zero and typically has a GIS = 0.

MEDIAN TARGET HRD (with HRD add-on)

Median target fragment coverage across all target positions in the genome. Coverage is the total number of non-duplicate pair alignments that overlap.

RNA Expanded Metrics

RNA expanded metrics are provided for information only. They can be informative for troubleshooting but are provided without explicit specification limits and are not directly used for sample quality control. For additional guidance, contact Illumina Technical Support.

Metric
Description
Units

PCT_CHIMERIC_READS

Percentage of reads that are aligned as two segments which map to nonconsecutive regions in the genome.

%

PCT_ON_TARGET_READS

Percentage of reads that cross any part of the target region versus total reads. A read that partially maps to a target region is counted as on target.

%

SCALED_MEDIAN_GENE_COVERAGE

Median of median base coverage of genes scaled by length. An indication of median coverage depth of genes in the panel.

Count

TOTAL_PF_READS

Total number of reads passing filter.

Count

GENE_MEDIAN_COVERAGE

The median coverage depth of all genes in the panel.

Count

GENE_ABOVE_MEDIAN_CUTOFF

Number of genes above the median coverage cutoff.

Count

PER_GENE_MEDIAN_COVERAGE

Median deduped coverage across each gene (available in Logs_Intermediates only)

Count

PCT_SOFT_CLIPPED_BASES

percentage of based that were not used for alignment but retained as part of the alignment file

%

RNA_PCT_030_BASES

Average percentage of bases ≥ Q30. A prediction of the probability of an incorrect base call (Q‑score). Troubleshooting: An indicator of sequencing run quality, low Q30 across all samples on a run could be the result of run overclustering.

%

Command-Line Options
Command-Line Options
Auto-launch
Manual launch
Illumina Connected Analytics support site page
BaseSpace Sequence Hub support site page
BaseSpace Run Planner
Sample Sheet Creation in BaseSpace Run Planner
Sample Sheet guidelines
BaseSpace Sequence Hub support site page
BaseSpace Sequence Hub support site page
Software Registration page
NovaSeq 6000Dx: TSO 500 Auto-launch Analysis in Cloud
NextSeq 500/550Dx: TSO 500 and Connected Insights Auto-launch Analysis in Cloud
Starting from BCL Files
FASTQ File Organization.
ICA Auto-launch Sample Sheet Requirements

FastQ Generation

Sequencing data stored in BCL format are demultiplexed through a process that uses the index sequences unique to each sample to assign clusters to the library from which they originated. Each cluster contains two indexes (i7 and i5 sequences, one at each end of the library fragment). The combination of those index sequences are used to demultiplex the pooled libraries.

After demultiplexing, this process generates FASTQ files, which contain the sequencing reads for each individual sample library and the associated quality scores for each base call, excluding reads from any clusters that did not pass filter.

Analysis Methods

The software processes sequencing data to perform quality control, detect variants, determine tumor mutational burden (TMB), microsatellite instability (MSI) status, and genomic instability score (GIS), and report results. The following sections describe the analysis methods used in DRAGEN TruSight Oncology 500 Analysis Software.

DRAGEN TruSight Oncology 500 Analysis Software uses the following workflows to analyze sequencing data.

  • FASTQ Generation

  • DNA Analysis

    • DNA Alignment and Realignment

    • Read Collapsing

    • Indel Realignment and Read Stitching

    • Small Variant Calling

    • Small Variant Filtering

    • Copy Number Variant (CNV) Calling

    • Phased Variant Calling

    • Variant Merging

    • Annotation

    • Tumor Mutational Burden (TMB) Scoring

    • Microsatellite Instability (MSI) Status

    • Contamination Detection

  • RNA Analysis

    • Downsampling

    • Read Trimming

    • Alignment

    • Duplicate Marking

    • Fusion Calling

    • RNA Fusion Filtering

    • Splice Variant Calling

    • Annotation

    • Fusion Merging

  • Quality Control

    • Run QC

    • DNA Sample QC

    • RNA Sample QC

Block List

The block list represents high noise regions in the panel where false positive variant calls are likely produced. As a result, all positions in the gVCF are marked as Filter=excluded_regions to indicate variant call results are not reliable in such regions.

The block list includes the following genes:

  • HLA A

  • HLA B

  • HLA C

  • KMT2B

  • KMT2C

  • KMT2D

  • chrY

  • Any position with VAF 1% occurrence in six or more of the 60 baseline samples.

--help

No

Displays a help screen with available command line options.

--analysisFolder

No

--resourcesFolder

No

Path to the resource folder location. The default location is /staging/illumina/DRAGEN_TSO500_2.6.0/resources. If not using the default location, enter the full path to the resource folder.

--runFolder

Yes

Required when --fastqFolder is not specified. Provide the full path to the local run folder.

--fastqFolder

Yes

Required when --runFolder is not specified. Provide the full path to the local FASTQ folder. Analysis starts at this location.

--user

No

Optional for Docker. Specify the user ID to be used within the Docker container.

--version

No

Displays the version of the software.

--sampleSheet

No

Provide the full path, including file name, if not provided as SampleSheet.csv in the run folder

--sampleOrPairIDs

No

Provide the comma-delimited sample or pair IDs that should be processed on this node with no spaces. For example, Pair_1,Pair_2,Sample_1.

--demultiplexOnly

No

Demultiplex to generate FASTQ only without additional analysis.

--gather

No

Follow this option for any directories with results that should be gathered into a single Results folder.

--hashtableFolder

No

Defaults to the DRAGEN hash table location created upon install. If not using the default location, enter the hash table location.

DNA Output

Small Variant gVCF

File name: {SAMPLE_ID}_hard-filtered.gvcf.gz

The small variant genome variant call file contains information on all candidate small variants evaluated, including complex variants up to 15 bp from phased variant calling across the entire TSO 500 panel.

The variant status is determined by the FILTER column in the genome VCF as follows.

Filter
Note

PASS

PASS variants.

base_quality

Site filtered because median base quality of alt reads at this locus does not meet threshold.

filtered_reads

Site filtered because the fraction of reads is too large.

fragment_length

Site filtered because absolute difference between the median fragment length of alt reads and median fragment length of ref reads at this locus exceeds threshold.

low_depth

Site filtered because the read depth is too low.

low_frac_info_reads

Site filtered because the fraction of informative reads is below threshold.

long_indel

Site filtered because the indel length is too long.

mapping_quality

Site filtered because median mapping quality of alt reads at this locus does not meet threshold.

multiallelic

Site filtered because more than two alt alleles pass tumor LOD.

no_reliable_supporting_read

Site filtered because no reliable supporting somatic read exists.

read_position

Site filtered because median of distances between start/end of read and this locus is below threshold.

str_contraction

Site filtered due to suspected PCR error where the alt allele is one repeat unit less than the reference.

too_few_supporting_reads

Site filtered because there are too few supporting reads in the tumor sample.

weak_evidence

Somatic variant score (SQ) does not meet threshold.

systematic_noise

Site filtered based on evidence of systematic noise in normal sample.

excluded_regions

Site overlaps with VC excluded regions bed.

Small Variant Annotated JSON

File name: {SAMPLE_ID}_DNAVariants_Annotated.json.gz

The small variants annotated file provides variant annotation information for all nonreference positions from the genome VCF including pass and nonpass variants.

TMB Trace

The TMB trace file provides comprehensive information on how the TMB value is calculated for a given sample. All passing small variants from the small variant filtering step are included in this file. To calculate the numerator of the TmbPerMb value in the TMB JSON, set the TSV file filter to use the IncludedInTMBNumerator with a value of True.

The TMB trace file is not intended to be used for variant inspections. The filtering statuses are exclusively set for TMB calculation purposes. Setting a filter does not translate into the classification of a variant as somatic or germline.

Column
Description

Chromosome

Chromosome

Position

Position of variant

RefCall

Reference base

AltCall

Alternate base

VAF

Variant allele frequency

Depth

Coverage of position

CytoBand

Cytoband of variant

GeneName

Name of gene if applicable. A semicolon delimited list is used for multiple genes.

VariantType

Type of the variant: SNV, insertion, deletion, MNV

CosmicIDs

Cosmic IDs, if multiple concatenated by “;”

MaxCosmicCount

Maximum Cosmic study count

AlleleCountsGnomadExome

Variant allele count in gnomAD exome database

AlleleCountsGnomadGenome

Variant allele count in gnomAD genome database

AlleleCounts1000Genomes

Variant allele count in 1000 genomes database

MaxDatabaseAlleleCounts

Maximum variant allele count over the three databases

GermlineFilterDatabase

TRUE if variant was filtered by the database filter

GermlineFilterProxi

TRUE if variant was filtered by the proxi filter

CodingVariant

TRUE if variant is in the coding region

Nonsynonymous

TRUE if variant has any transcript annotations with nonsynonymous consequences

IncludedinTMBNumberator

TRUE if variant is used in the TMB calculation

Copy Number VCF

The copy number VCF file contains CNV calls for DNA libraries of the amplification genes targeted by DRAGEN TruSight Oncology 500 Analysis Software. The CNV call indicates fold change results for each gene classified as reference, deletion, or amplification.

The value in the QUAL column of the VCF is a Phred transformation of the p-value where Q=-10xlog10(p-value). The p-value is derived from the t-test between the fold change of the gene against the rest of the genome. Higher Q-scores indicate higher confidence in the CNV call.

In the VCF notation, <DUP> indicates the detected fold change (FC) is greater than a predefined amplification cutoff. <DEL> indicates the detected FC is less than a predefined deletion cutoff for that gene. This cutoff can vary from gene to gene.

In analysis versions prior to v2.5, <DEL> calls in the VCF are marked as LowValidation. The LowValidation filter indicates that the calls have been validated only with in silico data sets and are provided as information only.

Each copy number variant is reported as a fold change on normalized read depth in a testing sample relative to the normalized read depth in diploid genomes. Given tumor purity, you can infer the ploidy of a gene in the sample from the reported fold change.

Given tumor purity X%, for a reported fold change Y, you can calculate the copy number n using the following equation:

For example, a tumor purity at 30% and a MET with fold change of 2.2x indicates that 10 copies of MET DNA are observed.

Metrics Output

Metrics Output

The MetricsOutput.tsv file contains the following quality control metrics for all samples:

  • DNA library QC metrics for:

    • Small variant calling

    • TMB

    • MSI

    • CNV

    • [HRD] GIS

  • RNA library QC metrics

  • Run QC metrics, analysis status, and contamination

This TSV file also includes expanded DNA library QC metrics per sample, based on total reads, collapsed reads, chimeric reads, and on-target reads. Analysis using RNA samples also produces RNA library QC metrics and expanded RNA library QC metrics per sample based on total reads and coverage.

The MetricsOutput.tsv file is a final combined metrics report with sample status, key analysis metrics, and metadata. Sample metrics within the report include suggested lower specification limits (LSL) and upper specification limits (USL) for each sample in the run.

RNA Output

Splice Variant VCF

The splice variant VCF contains all candidate splice variants targeted by the analysis panel identified by the RNA analysis pipeline. You can apply the following filters for each variant call:

Filter Name
Description

LowQ

Splice variant score < passing quality score threshold value of 1.

PASS

Splice variant score ≥ passing quality score threshold value of 1.

LowUniqueAlignments

All splice junction supporting reads map to a unique genomic interval near at least one of the two splice sites.

Refer to the headers in the output for more information about each column.

Splice Variant Annotated JSON

If available, each splice variant is annotated using the Illumina Annotation Engine. The following information is captured in the JSON:

  • HGNC Gene

  • Transcript

  • Exons

  • Introns

  • Canonical

  • Consequence

All Fusions CSV

The all fusions CSV file contains all candidate fusions identified by the DRAGEN RNA pipeline. Two output columns in the file describe the candidate fusions: Filter and KeepFusion.

The following table describes the semicolon-separated output found in the Filter columns. The output is either a confidence filter or information only as indicated. If none of the confidence filters are triggered, the Filter column contains the output PASS, else it contains the output FAIL.

Filter Column Output

Filter
Filter Type
Description

DOUBLE_BROKEN_EXON

Confidence filter

If both breakpoints are distant from annotated exon boundaries, the number of supporting reads do not satisfy a high threshold requirement (≥ 10 supporting reads).

LOW_MAPQ

Confidence filter

All fusion supporting read alignments at either of the breakpoints have MAPQ < 20.

LOW_UNIQUE_ALIGNMENTS

Confidence filter

All fusion supporting read alignments map to a unique genomic interval at either of the breakpoints.

LOW_SCORE

Confidence filter

The fusion candidate has probabilistic score as determined by the features of the candidate.

MIN_SUPPORT

Confidence filter

The fusion candidate has very few fusion supporting reads (< 5 supporting read pairs).

READ_THROUGH

Confidence filter

The breakpoints are cis neighbors (< 200 kbp) on the reference genome.

ANCHOR_SUPPORT

Information only

Read alignments of fusion supporting reads are not long enough (12 bp) at either of the two breakpoints.

HOMOLOGOUS

Information only

The candidate is likely a false candidate generated because the two genes involved have high gene homology.

LOW_ALT_TO_REF

Information only

The number of fusion supporting reads is < 1% of the number of reads supporting the reference transcript at either of the two breakpoints.

LOW_GENE_COVERAGE

Information only

Each breakpoint in an enriched gene has fewer than 125 bp with nonzero read coverage.

NO_COMPLETE_SPLIT_READS

Confidence filter

For every fusion-supporting split read, the total number of aligned bases across two breakpoints is less 60% of the read length.

UNENRICHED_GENE

Confidence filter

Neither of the two parent genes is in the enrichment panel.

The KeepFusion column of the output has a value of TRUE when none of the confidence filters are triggered.

Refer to the headers in the output for more information about each column.

Fusion Columns

Fusion Object Field
Source

Gene A

The gene associated with the A side of the fusion. A semicolon delimited list is used for multiple genes.

Gene B

The gene associated with the B side of the fusion. A semicolon delimited list is used for multiple genes.

Gene A Breakpoint

[Information only] The chromosome and offset of the Gene A side of the fusion.

Gene A Location

Location of the breakpoint within Gene A: - IntactExon—Matches exon boundary - BrokenExon—Inside an exon - Intronic—Within an intron - Intergenic—No gene overlap (currently excluded) If multiple genes are in Gene A, then semicolon separated list of locations. This column is used internally to identify genes to report when a breakpoint occurs in a region overlapping multiple genes. Occasionally, additional values are listed for genes that were excluded from the GeneA list.

Gene A Sense

Boolean indicating whether left/right breakpoint order suggests fusion transcript is in the same sense of Gene A. If multiple genes are in Gene A, then semicolon separated list of bools.

Gene A Strand

Strand of Gene A, + for forward, - for reverse.

Gene B Breakpoint

[Information only] The chromosome and offset of the Gene B side of the fusion.

Gene B Location

Location of the breakpoint within Gene B: - IntactExon—Matches exon boundary - BrokenExon—Inside an exon - Intronic—Within an intron - Intergenic—No gene overlap (currently excluded) If multiple genes in Gene B, then semicolon separated list of locations. This column is used internally to identify genes to report when a breakpoint occurs in a region overlapping multiple genes. Occasionally, additional values are listed for genes that were excluded from the GeneB list.

Gene B Sense

Boolean indicating whether left/right breakpoint order suggests fusion transcript is in the same sense of Gene B. If multiple genes are in Gene B, then semicolon separated list of bools.

Gene B Strand

Strand of Gene B, + for forward, - for reverse.

Score

The quality of fusion as determined by DRAGEN server.

Filter

The filter associated with the fusion as determined by the respective caller. Results from different callers are not equivalent.

Ref A Dedup

Gene A uniquely mapping reads paired across or split by the junction. Does not support fusion. Duplicate reads are not included.

Ref B Dedup

Gene B uniquely mapping reads paired across or split by the junction. Does not support fusion. Duplicate reads are not included.

Alt Split Dedup

Uniquely mapping reads split by the junction. Supports fusion. Duplicate reads are not included.

Alt Pair Dedup

Uniquely mapping reads paired across junction. Supports fusion. Duplicate reads are not included.

KeepFusion

The determination whether the fusion should be kept or dropped from the list of fusions.

Fusion Directionality Known

Whether fusion directionality is known and indicated by gene order.

When using Microsoft Excel to view this report, genes that are convertible to dates (such as MARCH1 automatically convert to dd-mm format (1 Mar) by Excel. The following are fusion allow list genes:

  • ABL1

  • AKT3

  • ALK

  • AR

  • AXL

  • BCL2

  • BRAF

  • BRCA1

  • BRCA2

  • CDK4

  • CSF1R

  • EGFR

  • EML4

  • ERBB2

  • ERG

  • ESR1

  • ETS1

  • ETV1

  • ETV4

  • ETV5

  • EWSR1

  • FGFR1

  • FGFR2

  • FGFR3

  • FGFR4

  • FLI1

  • FLT1

  • FLT3

  • JAK2

  • KDR

  • KIF5B

  • KIT

  • KMT2A

  • MET

  • MLLT3

  • MSH2

  • MYC

  • NOTCH1

  • NOTCH2

  • NOTCH3

  • NRG1

  • NTRK1

  • NTRK2

  • NTRK3

  • PAX3

  • PAX7

  • PDGFRA

  • PDGFRB

  • PIK3CA

  • PPARG

  • RAF1

  • RET

  • ROS1

  • RPS6KB1

  • TMPRSS2

Manual Launch of DRAGEN TSO 500 Analysis on ICA

How to Launch Analysis

After adding the Bundle to the Project, an example dataset becomes available in the Demo_Data folder for the Project. 

  1. Download Results: After analysis is complete, navigate to results in the configured output location.

Please see the Illumina Support Shorts for guidance on how to set up and run DRAGEN TSO 500 RUO analysis on ICA.

Analysis Parameters on ICA

To launch an analysis via the ICA user interface, configure a DRAGEN TSO 500 pipeline analysis with the following parameters.

Parameter Name
Description

User Reference

The analysis run name.

User Tags

Text labels to help index the analysis.

Notify me when task is completed

Option to receive an email notification when analysis is complete.

Output Folder

The path to the analysis output folder. The default path is the project output folder.

Entitlement Bundle

Automatically populated from the project details.

Sample Sheet

Select a sample sheet in CSV format for the analysis.

To note: Sample Sheet selection is optional if starting from a run folder, and required when submitting a FASTQ folder.

Input Folder

The run folder or FASTQ folder that contains files to analyze.

FASTQ List CSV

Do not use, this only applies to auto-launch TSO 500 analysis from FASTQs after BCL auto-launch.

Starts from FASTQ

True for analysis performed on files in the FASTQ folder. False for analysis performed on files in the run folder.

Sample or Pair IDs

Optional subset of Sample IDs or Pair IDs to analyze.

Sample List

Do not use, this only applies to auto-launch TSO 500 analysis from FASTQs after BCL auto-launch.

Storage Size

The storage size to allocate for the analysis. The default and recommended value is Large.

DNA Analysis Methods

DNA Alignment and Error Correction

DNA alignment and error correction involves aligning sequencing reads derived from DNA libraries to a reference genome and correcting errors in the sequencing reads prior to variant calling.

DRAGEN unique molecular identifier (UMI) error correction comprises three main steps:

  1. DRAGEN UMI uses its HW accelerated mapper (based on a hash table implementation) to align DNA sequences in FASTQ files to the hg19 reference genome. These alignments are not written to a BAM.

  2. The raw alignments are processed to remove errors, including errors introduced during FFPE preservation, PCR amplification, and sequencing. Reads from the same original DNA molecule are tagged with the same UMI during library preparation. The UMI allows DRAGEN to compare related reads, remove outlier signals, and collapse multiple reads into a single high-quality sequence. Read collapsing adds the following BAM tags:

    • RX/XU—UMI.

    • XV—Number of reads in the family.

    • XW—Number of reads in the duplex-family or 0 if not a duplex family.

  3. DRAGEN performs a final alignment step on the UMI-collapsed reads. These final alignments are then written to a BAM file and a corresponding BAM index file is created.

DRAGEN continues to use these final alignments as input for gene amplification (copy number) calling, small variant calling (SNV, indel, MNV, delin), microsatellite instability (MSI) status determination, and DNA library quality control.

Small Variant Calling and Filtering

DRAGEN supports calling SNVs, indels, MNVs, and delins in tumor-only samples by using mapped and aligned DNA reads from a tumor sample as input. Variants are detected via both column wise pileup analysis and local de novo assembly of haplotypes. The de novo haplotypes allow the detection of much larger insertions and deletions than possible through column wise pileup analysis only. DRAGEN insertions and deletions are validated with lengths of at least 0–25 bp and more than 25 bp can be supported. In addition, DRAGEN also uses the de novo assembly to detect SNVs, insertions, and deletions that are co-phased and part of the same haplotypes. Any such co-phased variants that are within a window of 15 bp can then be reassembled into complex variants (MNVs and delins). The tumor-only pipeline produces a VCF file containing both germline and somatic variants that can be further analyzed to identify tumor mutations. Variant calling extends ± 10 bp into introns; details of the regions covered can be found in the assay manifest file. The pipeline makes no ploidy assumptions, enabling detection of low-frequency alleles.

DRAGEN small variant calling includes the following steps:

  1. Detects regions with sufficient read coverage (callable regions).

  2. Detects regions where the reads deviate from the reference and there is a possibility of a germline or somatic call (active regions).

  3. Assembles de novograph haplotypes are assembled from reads (haplotype assembly).

  4. Extracts possible somatic or germline calls (events) from column wise pileup analysis.

  5. Calibrates read base qualities to account for FFPE noise.

  6. Computes read likelihoods for each read/haplotype pair.

  7. Performs mutation calling by summing the genotype probabilities across all reads/haplotype pairs.

  8. Performs additional filtering to improve variant calling accuracy, including using a systematic noise file. The systematic noise file indicates the statistical probability of noise at specific positions in the genome. This noise file is constructed using clean (normal) samples. Regions where noise is common (eg, difficult to map regions) have higher noise values. The small variant caller penalizes those regions to reduce the probability of making false positive calls.

Copy Number Variant Calling

The DRAGEN copy number variant caller performs amplification, reference, and deletion calling for CNV targets within the assay. It counts the coverage of each target interval on the panel, uses a preprocessed panel of normal samples to normalize target counts, corrects for GC coverage bias, and calculates scores of a CNV event from observed coverage and makes copy number calls.

Exon-Level Copy Number Variant Calling

The BRCA large rearrangement step generates segmentation of the BRCA1 and BRCA2 genes for exon-level CNV detection from the BAM file. Using the same method as CNV calling, the large rearrangement component counts coverage of each target interval of the panel, performs normalization, and calculates the fold change values for each probe across the BRCA genes. Normalization includes GC bias correction, sequencing depth, and probe efficiency using a collection of normal FFPE and genomic DNA samples. Initial segmentation is performed for each gene with circular binary segmentation. The merging of segments is then determined by amplitude, noise, and variance at adjacent segments using thresholds established with in silico data. A large rearrangement is reported for genes with more than one segment. Coordinates of the exon-level CNV and the log2 mean fold change for each of the BRCA gene segments are found in the *_DragenExonCNV.json file.

Annotation

The Illumina Annotation Engine performs annotation of small variants, CNVs, and exon-level CNVs. The inputs are gVCF files and the outputs are annotated JSON files.

The Illumina Annotation Engine processes each variant entry and annotates with available information from databases such as dbSNP, gnomAD genome and exome, 1000 genomes, ClinVar, COSMIC, RefSeq, and Ensembl. The header includes version information and general details. Each annotated variant is included as a nested dictionary structure in separate lines following the header.

The following table shows version information for each annotation database:

Tumor Mutational Burden

DRAGEN is used to compute tumor mutational burden (TMB) in coding regions where there is sufficient coverage.

The following variants are excluded from the TMB calculation:

  • Non-PASS variants.

  • Mitochondrial variants.

  • MNVs.

  • Variants that do not meet a minimum depth threshold.

  • Variants that do not meet the minimum variant allele threshold.

  • Variants that fall outside the eligible regions.

  • Tumor driver mutations. Variants with a population allele count ≥ 50 are treated as tumor driver mutations. Germline variants are not counted towards TMB. Variants are determined as germline based on a database and a proxy filter.

Variants with a population allele count ≥ 10 that are observed in either the 1000 Genomes or gnomAD databases are marked as germline. MNVs, which do not count towards TMB, may be marked as germline when all their component small variants are marked as germline. The proxy filter scans the variants surrounding a specific variant and identifies those variants with similar variant allele frequencies (VAF). If the majority of surrounding variants of similar VAF are germline, then the variant is also marked as germline.

The formula for TMB calculation is:

Outputs are captured in a _TMB_Trace.tsv file that contains information on variants used in the TMB calculation and a .tmb.json file that contains the TMB score calculation and configuration details.

Microsatellite Instability Status

DRAGEN can determine the MSI status of a sample. It uses a normal reference file, which was created from a set of normal samples. During sequencing, normal reference files are generated by tabulating read counts for each microsatellite site. The normal file contains the read count distribution for each microsatellite.

MSI calling for a tumor-only sample is performed by first tabulating tumor counts from the read alignments for each microsatellite site. Then, the Jensen-Shannon distance (JSD) is calculated between each pair of tumor and normal baseline samples. DRAGEN determines unstable sites by performing Chi-square testing of tumor JSD and normal JSD distributions. Unstable sites are called if the mean distance difference of the two JSD distributions is ≥to the distance threshold and Chi-square p-value is ≤ to the p-value threshold. Lastly, DRAGEN produces an MSI status given assessed site count, unstable site count, the percentage of unstable sites in all assessed sites, and the sum of the Jensen-Shannon distance of all the unstable sites.

Genomic Instability Score

Requires HRD add-on assay

Genomic instability score (GIS) is a whole genome signature for homologous recombination deficiency. The GIS is composed of the sum of three components: loss of heterozygosity, telomeric allele imbalance, and large-scale state transition. These components are estimated using the GIS algorithm contracted from Myriad Genetics, which uses an input of the b-allele frequency and coverage across a genome-wide single nucleotide panel. A panel of normal samples is used for both bias reduction and normalization prior to GIS estimation. Final GIS results can be found in the *.gis.json file.

Contamination Detection

The contamination analysis step detects foreign human DNA contamination using the SNP error file and pileup file that are generated during the small variant calling and the TMB trace file. The software determines whether a sample has foreign DNA using the contamination score. In contaminated samples, the variant allele frequencies in SNPs shift from the expected values of 0%, 50%, or 100%. The algorithm collects all positions that overlap with common SNPs that have variant allele frequencies of < 25% or > 75%. Then, the algorithm computes the likelihood that the positions are an error or a real mutation. The contamination score is the sum of all the log likelihood scores across the predefined SNP positions with minor allele frequency < 25% in the sample and are not likely due to CNV events.

The larger the contamination score, the more likely there is foreign DNA contamination. A sample is considered to be contaminated if the contamination score is above predefined quality threshold. The contamination score was found to be high in samples with highly rearranged genomes or HRD samples. 1% of HRD samples found to be above the threshold with no evidence for actual contamination.

Tumor fraction (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Tumor fraction is calculated as described in the User Guide, section “HRD Metrics Report” and leverages the Myriad Genetics algorithm. Tumor fraction is output in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.gis.json and Combined Variant Output file.

Ploidy (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Ploidy is calculated as described in the User Guide, section “HRD Metrics Report” and leverages the Myriad Genetics algorithm. Ploidy is output in the in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.gis.json and Combined Variant Output file.

Absolute Copy Numbers (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Absolute copy numbers are calculated by leveraging the Myriad Genetics algorithm. The algorithm segments the entire genome using the HRD panel and provides an A and B allele estimate for each segment. After the TSO 500 pipeline determines CNV calls (using the TSO 500 panel), the segment covering the gene is identified, and the A and B allele numbers of the segment overlapping the gene are reported. If the gene is within 300 kbases from the segment boundary, the estimate is unreliable and “-1” is output. Absolute copy numbers are output in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.abcn_annotated.vc f, Logs_Intermediates/Gis/SAMPLE/SAMPLE.abcn_genes.tsv and Combined Variant Output file.

Gene-Level Loss of Heterozygosity (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Gene-level loss of heterozygosity is calculated based on the minor copy number reported in the abcn_annotated.vc f. If the minor copy number is 0 then the gene is assumed to have a loss of heterozygosity. Gene-level loss of heterozygosity is output in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.abcn_genes.tsv and Combined Variant Output file.

HRD Metrics Report

The Illumina DRAGEN TruSight Oncology 500 Analysis Software allows for analysis of sequencing data generated from the TruSight Oncology 500 HRD assay. When HRD samples are analyzed new results and metrics are included in the CombinedVariantOutput and MetricsOutput files respectively. The following tables detail how these scores and QC metrics are derived.

*The GIS algorithm within the TSO500 pipeline (which does not have a cell line mode due to the TSO500 pipeline being non-configurable) is only intended for FFPE samples. Cell line samples will not accurately report GIS results as the tumor fraction (>90%) is too high to reliably distinguish tumor vs germline variants.

HRD Metrics Added to Metrics Output File

Path to the local analysis folder. The default location is /staging/DRAGEN_TSO500_2.6.0_Analysis_{timestamp}. If not using the default location, provide the full path to the local analysis folder. Folder must have sufficient space and must be on an NVMe SSD drive. For example, the /staging directory on the DRAGEN server. Refer to table in for minimum disk space requirements.

Refer to for more information.

n=[(200Y)−2(100−X)]/Xn=[(200Y)-2(100-X)]/Xn=[(200Y)−2(100−X)]/X

For troubleshooting information, refer to

Refer to for more information.

Create a Project: Project can be specific for the DRAGEN TruSight Oncology 500 pipeline or it can contain multiple Pipelines and/or Tools). For information on creating Projects, refer to the Projects section in .

ICA standard storage is used by default as soon as the Project is saved. To connect a different storage source, set it up before creating your Project. For details and options, refer to the Storage section in .

Edit Project and Add Bundle: Edit the Project and add the bundle titled, "DRAGEN TSO 500 v2.6.0 (XX)." XX is a 2-letter code designating the region from which you are launching the analysis. Adding the Bundle automatically adds the pipeline and associated resource files and datasets to the Project. For information on Bundles, refer to the Bundles section in .

 Upload the sequencing data: For information on viewing and uploading data, refer to the Data section in .

Start Analysis: In the Project, navigate to Pipelines, select the TSO 500 v2.6.0  Pipeline, and then select  "Start New Analysis". Set up the new analysis by configuring the parameters listed in the . When the required files are completed, start analysis.

For information about using pipelines, refer to .

Database
Version

Metric
Description
Metric
Description
Section in Metrics Output
DNA Analysis Methods
Troubleshooting
RNA Analysis Methods
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics support site page
Storage Requirements
table below

gnomeAD

2.1

COSMIC

v84

ClinVar

2019-02-04

dbSNP

v151

1000 Genomes Project

Phase 3 v5a

RefSeq

NCBI Homo sapiens Annotation Release 105.20201022

TMB=Filtered VariantsEligible Region Size(Mbp)TMB = {Filtered\ Variants \over Eligible\ Region\ Size (Mbp)}TMB=Eligible Region Size(Mbp)Filtered Variants​
NonsynonymousTMB=Filtered Nonsynonymous VariantsEligible Region Size(Mbp)Nonsynonymous TMB = {Filtered\ Nonsynonymous\ Variants \over Eligible\ Region\ Size (Mbp)}NonsynonymousTMB=Eligible Region Size(Mbp)Filtered Nonsynonymous Variants​

GIS Score*

Proprietary Genomic Instability Score (GIS) indicating level of genomic instability in sample genome. Combination of Loss of Heterozygosity (LOH), Telomeric allelic imbalance and Large-scale State Transitions (LST) scores. The GIS scores provided by TruSight Oncology 500 HRD show good correlation (R2= 0.98) with Myriad Genetics GIS however they are not identical (Refer to TruSight Oncology 500 HRD Product Data Sheet Doc# M-GL-00748 for more details). GIS from alternative HRD assays should be not be considered equivalent to Illumina/Myriad GIS.

PCT_TARGET_HRD_50X

Percent of HRD probe SNP panel covered by at least 50X coverage

DNA Library QC Metrics for GIS

EXCESSIVE_TF

EXCESSIVE TF indicates if there is excessive tumor content in sample. Troubleshooting: Samples with pure tumor fraction >90% are outside the design for GIS estimation (this includes pure tumor cell lines)

DNA Library QC Metrics for GIS

ALLELE_DOSAGE_RATIO

Proprietary Myriad Genetics estimate of b-allele dosage based on b-allele noise/signal ratio. B-Allele noise is correlated with coverage; lower coverage samples will have higher noise. B-allele signal is also correlated with tumor fraction; a higher tumor fraction produces a higher signal for b-allele sites. Samples with lower tumor fraction and higher amount of noise (or lower coverage) will have higher Allele Dosage Ratio. The upper limit of the score is 50, therefore any sample with 50 Allele Dosage Ratio can be assumed to have tumor fraction close to zero and typically has a GIS = 0.

DNA Expanded Metrics

MEDIAN_TARGET_HRD_COVERAGE

Median target fragment coverage across all target positions in the genome. Coverage is the total number of non-duplicate pair alignments that overlap.

DNA Expanded Metrics

Coverage Reports

The gene and exon coverage report files are tab-separated value (TSV) files with coverage values matching respectively the exons and genes for both DNA and RNA samples specified in the manifest file.

Quality Control

The software calculates several quality control metrics for runs and samples.

Run QC

The Run Metrics section of the metrics output report provides sequencing run quality metrics along with suggested values to determine if they are within an acceptable range. The overall percentage of reads passing filter is compared to a minimum threshold. For Read 1 and Read 2, the average percentage of bases ≥ Q30, which gives a prediction of the probability of an incorrect base call (Q‑score), are also compared to a minimum threshold. The following tables show run metric and quality threshold information for different systems.

The values in the Run Metrics section are listed as NA in the following situations:

  • If the analysis was started from FASTQ files.

  • If the analysis was started from BCL files and the InterOp files are missing or corrupt.

NextSeq 500/550 or NextSeq 550Dx (RUO)

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_PF_READS (%)

Total percentage of reads passing filter.

≥80.0

All

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥80.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥80.0

All

NovaSeq 6000 or NovaSeq 6000Dx (RUO)

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_PF_READS (%)

Total percentage of reads passing filter.

≥55.0

All

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥80.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥80.0

All

NextSeq 1000/2000

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_PF_READS (%)

Total percentage of reads passing filter.

≥85.0

All

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥85.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥85.0

All

NovaSeq X

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥85.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥85.0

All

DNA Sample QC

DRAGEN TruSight Oncology 500 uses QC metrics to assess the validity of analysis for DNA libraries that pass contamination quality control. If the library fails one or more quality metrics, then the corresponding variant type or biomarker is not reported, and the associated QC category in the report header displays FAIL. Additionally, a companion diagnostic result may not be available if it relies on QC passing for one or more of the following QC categories.

DNA library QC results are available in the MetricsOutput.tsv file. Refer to Metrics Output for details.

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

CONTAMINATION_SCORE

The contamination score is based on VAF distribution of SNPs.

≤ 1457

All

MEDIAN_EXON_COVERAGE

Median exon fragment coverage across all exon bases.

≥ 150

Small variant TMB

PCT_EXON_50X

Percent exon bases with 50x fragment coverage.

≥ 90.0

Small variant TMB

MEDIAN_INSERT_SIZE

The median fragment length in the sample.

≥ 70

Small variant TMB

USABLE_MSI_SITES

The number of MSI sites usable for MSI calling.

≥ 40

MSI

MEDIAN_BIN_COUNT_CNV_TARGET

The median raw bin count per CNV target.

≥ 1.0

CNV

RNA Sample QC

The input for RNA Library QC is RNA alignment. Metrics and guideline thresholds can be found in the MetricsOutput.tsv file. Refer to Metrics Output for details.

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

MEDIAN_CV_GENE_500X

The median CV for all genes with median coverage > 500x. Genes with median coverage > 500x are likely to be highly expressed. Higher CV median > 500x indicates an issue with library preparation (poor sample input and/or probes pulldown issue).

Fusion Splice

MEDIAN_INSERT_SIZE

The median fragment length in the sample.

≥ 80

Fusion Splice

TOTAL_ON_TARGET_READS

The total number of reads that map to the target regions.

≥ 9000000

Fusion Splice

GENE_MEDIAN_COVERAGE

The median deduped coverage across all genes in the RNA panel (55 genes).

N/A*

Fusion Splice

  • To avoid failing RNA samples unnecessarily, Illumina does not recommend a universal threshold to determine RNA sample quality. RNA expression varies significantly across tissue types and a small panel size (55 genes), which makes normalization challenging. Tissue-specific thresholds could be considered for normalization.

RNA Analysis Methods

Downsampling

Each sample is downsampled to 30 million RNA reads. This number represents the total number of single reads (eg, R1 + R2, from all lanes). When using the recommended sequencing configurations or plexity, the samples can have fewer reads than the downsampling limit. In these cases, the FASTQ files are left as-is.

Read Trimming

Reads are trimmed to 76 base pairs for further processing.

RNA Alignment and Fusion Detection

RNA alignment and fusion detection uses trimmed reads in FASTQ format as input. The outputs include a BAM file that contains duplicate-marked read alignments, an SJ.out.tab file that contains unannotated splice junctions, and a CSV file that contains fusion candidates.

DRAGEN aligns RNA reads in a transcript-aware mode using the human hg19 genome containing unplaced contigs (ie, chrUn_gl regions) and uses GENCODEv19 transcript annotations to identify splice sites. DRAGEN identifies and marks duplicate read alignments using start and end coordinates of alignments, which are adjusted for soft clipped reads.

Fusion and splice variant calling only use deduped fragments to score variants. DRAGEN identifies fusion candidates using chimeric split read alignments (pairs of primary and supplementary alignments) against multiple genes. DRAGEN scores and filters reads based on the various features of each candidate such as the number of supporting reads, mapping quality of supporting reads, and sequence homology between parent genes.

The DRAGEN RNA Fusion caller identifies gene fusions by searching for chimeric reads spanning two distinct parent genes. Based on the chimeric reads, DRAGEN first creates a list of fusion candidates, then scores the candidates to report the list of high confidence fusion calls from the candidate pool.

DRAGEN RNA Fusion caller performs the following steps:

  1. Generates fusion candidate generation based on split read alignment.

  2. Recruits additional evidence from fusion supporting discordant read pairs and soft-clipped reads.

  3. Computes fusion candidate features such as gene coverage, read mapping quality, alternate allele frequency, gene homology, alignment anchor length, and breakpoint distance from exon boundary.

  4. Scores and ranks the fusion candidates using a logistic regression model.

  5. Selects a final list of fusion calls based on score and other filters including number of supporting reads, unique read alignment count, read through transcripts, and fusions matching the enriched regions.

Splice Variant Calling

RNA splice variant calling is performed for RNA sample libraries. Candidate splice variants (junctions) from RNA Alignment are compared against a database of known transcripts and a splice variant baseline of non-tumor junctions generated from a set of normal FFPE samples from different tissue types. Any splice variants that match the database or baseline are filtered out unless they are in a set of junctions with known oncological function. If there is sufficient read support, the candidate splice variant is kept. This process also identifies candidate RNA fusions.

RNA Fusion Merging

Fusions identified during RNA fusion calling are merged with fusions from proximal genes identified during RNA splice variant calling. These are then annotated with gene symbols or names with respect to a static database of transcripts (GENCODE Release 19). The result of this process is a set of fusion calls that are eligible for reporting

RNA Splice Variant Annotation

The Illumina Annotation Engine annotates detected RNA splice variant calls with transcript-level changes (eg, affected exons in the transcript of a gene) with respect to RefSeq. This RefSeq database is the same RefSeq database used by the small variant annotation process.

Performance Testing

The following sections describe performance testing methods.

Analytical Performance Testing

Illumina tests the analytical performance of variant calling using an approach that covers the entire workflow including library preparation, sequencing, and secondary analysis. This approach is used to test a diverse selection of variants. When the variant calling pipeline is expanded to call a new variant class, this approach is always used.

In Silico Testing Methods

Illumina uses in silico testing to the test the ability of the software to call an expanded scope of clinically relevant variants, including rare variants. In silico testing is used as a complementary method to analytical performance testing with wet lab step to expand the scope of testing. For example, while Illumina has analytically verified the performance of the software for calling complex variants in EGFR, the in silico testing approach characterizes the ability of the software to call complex variants in other genes.

For in silico testing, variants of interest are extracted from public databases like Cosmic and ClinVar. Each variant is simulated at different VAF levels by, depending on the variant class, spiking in mutant reads into a normal FFPE background (for sequence variants) or by increasing or decreasing the coverage of exons in the normal FFPE sample (for CNVs, for example, exon-level CNVs. The simulated reads match the expected quality of typical FFPE samples, such as fragment length, error rate, and family size. After the simulation, the software processes samples with spiked-in variants and determines the results. This approach does not include library prep and sequencing of tumor FFPE samples that include the rare variants of interest. The software reports these variants, but analytical verification was not performed.

DRAGEN TruSight Oncology 500 Analysis Software includes the following features that were tested in silico for both TruSight Oncology 500 and TruSight Oncology 500 HT:

  • Complex variants in genes beyond EGFR

  • Insertions and deletions > 25 bp

  • CNV amplifications

  • CNV deletions

  • Variants in intron-exon junctions (2 bp – 10 bp into introns)

In addition, the following features were tested in silico for TruSight Oncology 500 HT:

  • Exon-level CNVs in BRCA1 and BRCA2

Beta Features

This version of DRAGEN TruSight Oncology 500 Analysis Software includes beta features which have not been verified by Illumina due to limited access to samples or lack of an appropriate orthogonal method to perform testing, and, the use of in silico testing alone is not sufficient for verification purposes.

Customers are responsible for evaluating and demonstrating performance of any beta features they choose to implement. Beta features are indicated as such in the CombinedVariantOutput.tsv file. Illumina will continue to evaluate beta features with intent to fully release upon completion of verification for each feature.

This version includes the following beta features that may be used with the TruSight Oncology 500 HRD Assay:

  • Tumor fraction (beta)

  • Ploidy (beta)

  • Absolute copy numbers (ACN) (beta)

  • Gene-level loss of heterozygosity (LOH) events (beta)

Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file.

Troubleshooting

General Troubleshooting on Standalone DRAGEN Server

Failure Type
Actions

Software

- Open the log file ./<AnalysisFolder>/Logs_Intermediates/pipeline_trace.txt. This log file displays each pipeline step run by the Nextflow workflow manager software. If a step fails, it is marked as FAILED. Each step generates log files that are stored in step-specific subfolders in the Logs_Intermediates folder. Review the log files in the relevant Logs_Intermediates folder for the step to identify potential sources of error. - Open the errors folder ./<AnalysisFolder>/errors. The workflow creates an error file, error_<NameOfFailedStep>.json, for each step that failed during analysis. For steps that fail per sample, there is a separately labeled file for each sample that failed each step error_<NameOfFailedStep>_<SampleIDIfRelevant>.json. These files contain the command and stdout and stderr from the step.

Samples

Open the combined metrics output results file ./<AnalysisFolder>/Results/<PairId>/MetricsOutput.tsv. If a sample fails an analysis step, the Pair ID that contains the sample shows the failure under FAILED_STEPS in the Analysis Status section, and COMPLETED_ALL_STEPS shows as False. If available, review the individual log files for the failed steps under ./<AnalysisFolder>/Logs_Intermediates to identify potential sources of error.

Multinode Gather

If the following error appears, check if the sample or pair ID was included multiple times during separate node analysis runs, before being gathered together. If the error exists, rerun one of the analyses without the duplicate and reattempt gathering. ERROR:Gather:Destination file ... already exists - check if the same sample ID is in multiple input folders

Sample Sheet Validation Failures

In DRAGEN TruSight Oncology 500 Analysis Software, the analysis fails if a sample sheet is invalid. If an invalid sample sheet in suspected, log files can help troubleshoot a failed analysis. Use the following steps to find the log file for the sample sheet:

  1. Navigate to the following location /<analysis_output>/Logs_Intermediates/SamplesheetValidation.

  2. Open the SamplesheetValidation-.log file

  3. Find a line with the following: SampleSheetValidationTask:NA:1 exited with return code 1 which has not been declared as a valid return code.

General troubleshooting for a failed sample sheet:

Failure Type
Action

Sample Sheet not found

Verify that SampleSheet.csv is present at the top level of the run folder with the name "SampleSheet.csv". If the sample sheet is in a different location, supply the sample sheet using the --sampleSheet option

Indexes are not valid for the sequencer and/or assay

See Valid indexes for assay and instrument combinations for correct indexes for the sequencer and assay.

Pair_ID is not unique

Pair_ID column is required in the TSO500S_Data section of the sample sheet, which pairs at most one RNA and one DNA sample together for analysis. If the sample does not have a pair, use a unique pair ID for single samples.

Sample Sheet is not in v2 format

Verify that the format of the sample sheet is v2. v1 sample sheet is not compatible with DRAGEN TruSight Oncology 500 Analysis Software.

Analysis does not run

Verify the analysis starts from the run folder, and BCLs or FASTQs are in the correct locations as outlined in Starting From BCL Files and Starting From FASTQ Files respectively.

Valid indexes for assay and instrument combinations:

Assay
Index Set ID

TSO 500

  • UP1-UP16

  • CP1-CP16 (DNA Only)

TSO 500 HT

  • UDP0001–UDP0192

Troubleshooting BCL issues:

Failure Type
Action

Lane Column without Values

Ensure that the column is completed. If lane is not applicable to the run, delete the column.

Format of v2 sample sheet is incorrect

HRD analysis is missing

Troubleshooting FASTQs issues:

Failure Type
Action

Sample_ID and/or Sample_Type is not present

Verify that the sample sheet has columns and values for Sample_ID and Sample_Type.

Unique sample IDs

Verify that the Sample_IDs are unique in the sample sheet.

Format of v2 Sample Sheet is incorrect

Verify that the following sections and fields are present in the sample sheet and follow the individual rules in Sample Sheet Requirements. [TS0500S_Data] - Sample_ID - Index_ID - Sample_Type - Pair_ID

Incorrect folder structure

Invalid FASTQ input files

If the FASTQs are invalid, start TSO 500 analysis from BCL files.

HRD analysis missing

Make sure that HRD is in the correct column in the sample sheet.

Other Troubleshooting Tips

Failure Type
Action

The output file directory contains information from previous analyses

If this issue is seen: specify a new target output folder and repeat analysis To prevent this issue: specify an empty directory before starting analysis

Single exon (single probe) genes are still reported in the CNV VCF file, but not the CNV TSV file

No action needed; software is working as expected.

Currently all single probe genes are not emitted to the Copy Number Variants section of our CombinedVariantOutput.tsv. However, you can still find these events in the cnv.vcf.gz.

Due to the single probe nature, accurate CNV calling has not been validated and as such they are emitted as REF

Troubleshooting on ICA

In addition to TSO 500 managed sample sheet validations, ICA managed TSO 500 errors include the following:

Error
Description

Failure type: ValueError: Could not find pipeline ID for app BCLConvert in sample sheet SampleSheet.csv

Installation of v2.5.4 on Standalone DRAGEN Server

Overview

The installation script for DRAGEN TruSight Oncology 500 Analysis Software installs the following software and dependencies:

  1. DRAGEN TruSight Oncology 500 Analysis Software itself

  2. DRAGEN Software if a compatible version is not present

  3. Docker software if a compatible version is not present

  4. A script required to generate DRAGEN genome hash table

  5. A script to check that DRAGEN TruSight Oncology 500 Analysis Software is installed properly

Installation Requirements

Hardware

  • DRAGEN server v3 or v4

  • Network-attached storage (NAS) with enabled mkfifo if performing analysis for the TruSight Oncology 500 High-Throughput assay

Software

The following software is required to successfully install DRAGEN TSO 500 v2.5.4:

  • Linux CentOS 7.9 operating system (or later) or Oracle Linux 8 (or later), one of which is provided on the server. Oracle Linux 8 is recommended.

  • Docker Software, see table below for minimum version needed. If sufficient Docker software is not present on the server, the TSO 500 installer will install compatible Docker software.

  • DRAGEN Server Software*, see table below for minimum version needed as the host version on the server. If sufficient DRAGEN software is not present on the server, the TSO 500 installer will install compatible DRAGEN software.

Software Dependency
Compatible
Installs

Docker

20.10 or greater

Docker 20.10.15

DRAGEN Server Software*

v3.10.x, where x >=19, v4.3+

DRAGEN Software 3.10.19

*The DRAGEN Server Software version may be higher than the DRAGEN version used by the DRAGEN TSO 500 v2.5.4 pipeline (DRAGEN v3.10.16), which is provided inside the DRAGEN TSO 500 docker image.

Licenses

  • TSOCombined license

  • TSO500_HRD license (to analyze data generated with the TSO 500 HRD add-on kit)

Permissions

Illumina recommends logging in as root user for installation, but as a non-root user for running TSO 500 analysis.

  • Installing and uninstalling DRAGEN TruSight Oncology 500 Analysis Software and running the system check requires root privileges.

  • Run DRAGEN TruSight Oncology 500 Analysis Software without being logged in as a root user. Running the DRAGEN TruSight Oncology 500 Analysis Software as root is not required or recommended.

Compatibility with other DRAGEN pipelines

Software versions without multi-version compatibility referred to as single-version compatible. DRAGEN TSO 500 Analysis Software v2.5.4 will disrupt installations of single-version compatible software from the DRAGEN server. To uninstall a previous version of DRAGEN TSO 500 Analysis Software, refer to the respective guide.

Compatibility of software for co-installation with DRAGEN TSO 500 v2.5.4 on a DRAGEN server is summarized in the table below:

Software
Version
Type
Compatible

DRAGEN TSO 500

2.6.1

Multi-version

Yes*

DRAGEN TSO 500

2.6.0

Single-version

No

DRAGEN TSO 500

2.5.3 or below

Single-version

No

DRAGEN TSO 500 ctDNA

2.6.2+

Multi-version

Yes

DRAGEN TSO 500 ctDNA

2.6.1 or below

Single-version

No

DRAGEN pipelines**

4.3.6+

Multi-version

Yes

DRAGEN pipelines**

4.2 or below

Single-version

No

*DRAGEN TSO 500 Analysis Software v2.5.4 can run on a single server with another multi-version compatible DRAGEN TSO 500 Analysis Software, e.g. DRAGEN TSO 500 v2.6.1, and should be installed before v2.6.1. DRAGEN TSO 500 Analysis Software v2.5.4 can be co-installed with multi-version compatible DRAGEN TSO 500 ctDNA Analysis Software or other DRAGEN pipelines with any order of installation.

**For example, DRAGEN Enrichment, DRAGEN Germline, DRAGEN WGS Heme v1.0.0 and others. Order of installation does not matter.

Installation Instructions

As a root user, perform the following steps to install DRAGEN TruSight Oncology 500 Analysis Software v2.5.4:

  1. Download the installation package provided in the email from Illumina. The link expires after 7 days.

It is recommended to use a command line tool like wget or curl to download the file rather than pasting the link into the web browser bar. For example:

curl -o {filename} "{link}"

wget -O {filename} '{link}'

Where the file name is the installation script file name, and the link is provided by Illumina Customer Care.

  1. Make sure no other analysis is being performed. Installing the software while performing other analyses prevent the installer process from proceeding

  2. Copy the install script to the /staging directory to store the script in the directory.

Installation Script: install_DRAGEN_TSO500-2.5.4.run

MD5sum: sha256:0d871d8b93b4e535c0c1b5614c180fdb815f80764274279881c8a25a4e153160

  1. Use the following command to update the run script permission: chmod +x /staging/install_DRAGEN_TSO500-2.5.4.run

  2. Use the following command to run the installation script (run time ~ 20 mins):

    1. For Docker, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.5.4.run. The script installs compatible DRAGEN software and removes any previously installed versions.

    2. For Apptainer, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.5.4.run -- --noDockerInstall This will not install Apptainer, but will install the analysis software in the SIF container format and modify the software to launch analyses using Apptainer.

  3. During the installation process, you might be instructed to reboot or power cycle the system to complete the installation of the DRAGEN software. A power cycle of the system requires the server be shut down and restarted.

  4. Log out of the server and then log back in.

License Installation

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server connected to the internet:

  1. Confirm that the server is connected to the Internet, example: ping www.illumina.com

  2. Run the following command: /usr/bin/dragen_lic -i auto

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server not connected to the internet:

  1. Download and save the license file(s) to a location that is accessible from the DRAGEN server

  2. For each license file, run the command, where <license file received> is the absolute path to the license file: sudo /usr/bin/dragen_lic -i /tmp/<license file received>.bin

To check the success of license installation, run: /usr/bin/dragen_lic. Installed licenses should be in the list.

Running the System Check

After installation is complete, make sure the system functions properly by running the following command: /usr/local/bin/check_DRAGEN_TSO500-2.5.4.sh

The script checks that:

  • All required services are running

  • Proper Docker image is installed

  • DRAGEN TSO 500 Analysis Software can successfully process a test data set

The system check script runs for approximately 25 minutes. If the script prints a failure message, contact Illumina Technical Support and provide the /staging/check_DRAGEN_TSO500_<timestamp>.tgz output file.

If using MacOS to connect to a server, an error can occur if the local settings are not in English. To resolve the error, disable the ability to set environment variables automatically in Terminal settings.

Uninstall Software

The DRAGEN TSO 500 Analysis Software installation includes an uninstall script called uninstall_DRAGEN_TSO500-2.5.4.sh, which is located in /usr/local/bin.

Executing the uninstall script removes the following assets:

  • All DRAGEN TSO 500 Analysis Software related scripts located in /usr/local/bin

  • Resources found in /staging/illumina/DRAGEN_TSO500

  • The dragen_tso500:2.5.4: Docker image

To uninstall the DRAGEN TSO 500 Analysis Software, run the following command as a root user:

uninstall_DRAGEN_TSO500-2.5.4.sh

You are not required to uninstall Docker or DRAGEN software. To remove Docker, review the install instructions for your operating system in the Docker documentation.

Getting Started on Illumina Connected Analytics

Prerequisites

Illumina Connected Analytics (ICA) subscription includes access to DRAGEN TruSight Oncology 500 Analysis Software. To get started, you need:

Minimum Storage Requirements on ICA

The table provides guidance on the storage needs and associated ICA storage size settings.

Sequencing System
Minimum Disk Space (Gb)
ICA Storage Size Value

NextSeq 500/550/550Dx (RUO) HO flow cell

350

Large

NovaSeq 6000/6000Dx (RUO) SP Flow Cell

500

Large

NovaSeq 6000/6000Dx (RUO) S1 Flow Cell

1100

Large

NovaSeq 6000/6000Dx (RUO) S2 Flow Cell

2,500

Large

NovaSeq 6000/6000Dx (RUO) S4 Flow Cell

4,300

Large

NovaSeq X 1.5B

2,000

Large

NovaSeq X 10B

4,300

Large

NovaSeq X 25B

8,400

XLarge

NextSeq 1000/2000

350

Large

Sample Sheet Introduction

Overview

The sample sheet includes a list of samples and their index sequences, along with additional information required to run DRAGEN TruSight Oncology 500 Analysis Software. For example, DNA samples with the TruSight Oncology 500 HRD add-on probes must be indicated in the Sample Feature column of the sample sheet. Appropriate index adapter sequences are determined by the assay used to perform analysis.

When running analysis on a standalone DRAGEN server or on ICA, a valid sample sheet can be created by:

The run set up section of this guide includes specific instructions to plan a run and set up a valid sample sheet for each deployment of DRAGEN TruSight Oncology 500 Analysis Software.

Sample Sheet Requirements

DRAGEN TSO 500 Analysis Software has optional and required fields that are required in addition to general sample sheet requirements. Follow the steps below to create a valid samplesheet.

Standard Sample Sheet Requirements

The following sample sheet requirements describe required and optional fields for DRAGEN TSO 500 Analysis Software. Depending on the deployment (standalone DRAGEN server, ICA with auto-launch, ICA with manual launch), certain sections and required values can deviate from the standard requirements. These deviations are noted in the information below.

The analysis fails if the sample sheet requirements are not met.

Use the following steps to create a valid sample sheet.

  1. Download the sample sheet v2 template that matches the instrument & assay run.

  2. In the BCL Convert Settings section, enter the following required parameters:

[BCLConvert_Settings] Section

  1. In the BCL Convert Data section, enter the following parameters for each sample.

[BCLConvert_Data] Section

  1. In the TSO 500 Data section, enter the following parameters:

TSO 500 Data Section header changes depending on the deployment:

  • Standalone DRAGEN Server and ICA with Manual Launch: TSO500S_Data

  • ICA with Auto-launch: Cloud_TSO500S_Data

[TSO500S_Data] Section

To ensure a successful analysis, follow these guidelines:

  1. Avoid any blank lines at the end of the sample sheet; these can cause the analysis to fail.

  2. When running local analysis using the command line save the sample sheet in the sequencing run folder with the default name SampleSheet.csv, or choose a different name and specify the path in the command-line options.

ICA with Auto-launch: Sample Sheet Requirements

[Cloud_TSO500S_Data] Section

[Cloud_TSO500S_Settings] Section

[Cloud_Data] Section

[Cloud_Settings] Section

Sample Sheet Creation in BaseSpace Run Planning tool

How to Create TSO 500 Sample Sheets in BaseSpace Run Planning tool

The sections below represent each step in the BaseSpace Run Planning tool.

Step 1: Run Settings

Step 2: Configuration

Step 3: Sample Settings

Users can manually enter sample information, or download a template file to bulk upload sample information. Users can import the completed template or a compatible sample sheet.

Step 4: Run Review

Once all details are captured and pass validation, the user can review the details on the Run Review screen. From here they can choose to edit details in previous screens or export the sample sheet. Once completed, press the Cancel button to finish run planning.

Note: once leaving this screen, the run and sample sheet will not be accessible.

Guided Examples

Please review these guided examples of analysis workflows that include a step of setting up a run in BaseSpace Run Planning tool:

Introduction to DRAGEN TSO 500 Analysis Software v2.5.x

Scope

This resource provides information on installation, configuration, running, troubleshooting and analysis algorithms for the following software:

  • DRAGEN TruSight Oncology 500 Analysis Software on Illumina Connected Analytics (ICA) v2.5.2

  • DRAGEN TruSight Oncology 500 Analysis Software v2.5.3 (for standalone DRAGEN server)

  • DRAGEN TruSight Oncology 500 Analysis Software v2.5.4 (for standalone DRAGEN server)

The content is applicable to all 3 software versions unless otherwise specified. The content related to setting up and running the analysis on ICA is only relevant to v2.5.2.

Overview

DRAGEN TruSight™ Oncology 500 Analysis Software supports data analysis for TruSight Oncology 500 Assay and TruSight Oncology 500 High-Throughput Assay, both Research Use Only (RUO).

The software provides local and cloud analysis for DNA and RNA libraries generated from formalin-fixed, paraffin-embedded (FFPE) tissue samples. The assays and the software are optimized to provide high sensitivity and specificity for low-frequency somatic variants across coding exons and additional regions of biological relevance in 523 genes for DNA biomarkers.

In addition, this software supports data analysis for TruSight Oncology 500 HRD (RUO), an optional add-on kit to TruSight Oncology 500, that enables detection of homologous recombination deficiency (HRD) through assessment of a genomic instability score (GIS).

TruSight Oncology 500 HRD is not available in Japan

DNA biomarkers:

  • Single nucleotide variants (SNVs)

  • Insertions

  • Deletions

  • Copy number variants (CNVs)

  • Exon-level CNVs

  • Multinucleotide variants (MNVs)

  • Genomic Instability Score (GIS Score) *

DNA Immunotherapy Biomarkers:

  • Tumor mutational burden (TMB)

  • Microsatellite instability (MSI)

RNA biomarkers (called from 55 genes):

  • Fusions

  • Splice variants

Beta features:

  • Absolute copy numbers (ACN)*

  • Loss of heterozygosity (LOH)*

  • Tumor fraction*

  • Ploidy*

Details of the regions covered by the assays can be found in the assay manifest file. Contact your local Illumina representative for more information.

*Requires TruSight Oncology 500 HRD add-on kit

Local and Cloud Deployments

Local analysis is available using a standalone DRAGEN server. The software on the standalone DRAGEN server allows for analysis on a single DRAGEN server or splitting across multiple servers.

Cloud analysis is available on Illumina Connected Analytics with auto-launch (from BCL) or manual launch (from BCLs and FASTQs), see exceptions for NextSeq 1000/2000 and NovaSeq X in the instrument compatibility tabel below.

Instrument Compatibility

DRAGEN TruSight Oncology 500 analysis software v2.5.x is compatible with data generated on the Illumina instruments as summarized in the table below.

*Among v2.5.x, only DRAGEN TruSight Oncology 500 analysis software v2.5.2 is supported on ICA

**For the data generated with NextSeq 1000/2000 and NovaSeq X, analysis can be started only from FASTQs (not from BCLs). Only manual launch is available for ICA.

Refer to for more information.

This version of the software includes results generated by features tested in silico and by beta features. Beta features have not been fully evaluated for performance, see .

Search for errors in the sample sheet validation log and compare with the guidelines and warnings in and the following tables.

Verify that the following sections and fields are present in the sample sheet and follow the individual rules in [BCLConvert_Settings] - SoftwareVersion - AdapterRead1 - AdapterRead2 - AdapterBehavior - MinimumTrimmedReadLength - MaskShortReads [BCLConvert_Data] - Sample_ID - index - index2 [TS0500S_Data] - Sample_ID - Index_ID - Sample_Type - Pair_ID - Sample Feature (Optional)

Verify that HRD is in the Sample Feature column in the sample sheet. Refer to for more information.

- Sample Feature (Optional) Verify when FASTQs were generated using the HRD add-kit (Not available in Japan), Sample Feature is added to those DNA Samples. Refer to for more information.

Verify that the FASTQ files are in the correct structure. Refer to for more information.

Action: Ensure StartsFromFastq field is in the [TSO500S_Settings] section, and it is not present in the [BCLConvert_Settings] Section. Refer to for more information.

TSOCombined license has been pre-installed to DRAGEN servers in manufacturing since August 2022 and TSO500_HRD since February 2025 and additionally distributed to DRAGEN servers connected online. To generate a list of installed DRAGEN server licenses, run the following command: /opt/edico/bin/dragen_lic. If a license is not installed, contact Illumina Customer Care at for the license.

A non-root user must be a member of the Docker group to run Docker. For more information on Docker permission requirements and alternatives to running as root, refer to the Docker documentation available on the .

DRAGEN TSO 500 Analysis Software v2.5.4 is multi-version compatible. Multi-version compatibility refers to ability to be installed on a single DRAGEN server with software running a different version of DRAGEN software. For example, multi-version compatible pipelines running DRAGEN v4.3.6 can be co-installed on a server alongside DRAGEN TSO 500 pipelines running DRAGEN v3.10.17. For more details on DRAGEN multi-version compatibility, please visit .

Contact Illumina Customer Care at to obtain the DRAGEN TruSight Oncology 500 Analysis Software installer package.

Use the following command to build the DRAGEN server hash table (run time ~ 1 hr): /usr/local/bin/build-hashtable_DRAGEN_TSO500-2.5.4.sh Refer to if any errors occur.

Review license requirements, how to check which licenses are installed and how to receive a license in . Licenses can be installed before or after DRAGEN TSO 500 software installation.

Contact Customer Care at to request a license file for each of the needed licenses

An ICA account with a valid subscription. Refer to the for information on how to register ICA subscription and iCredits.

A positive balance of iCredits for data storage. Please refer to the and for more information.

A sample sheet is required for each analysis with DRAGEN TruSight Oncology 500 Analysis Software. A sample sheet is a comma-separated value (*.csv) file format used by Illumina instruments, platforms, and analysis pipelines to store settings and data for sequencing and analysis. The DRAGEN TruSight Oncology 500 Analysis Software is compatible with the sample sheet v2. For general information on the sample sheet v2, refer to .

BaseSpace Run Planner (preferred), see for details

Downloading and modifying a sample sheet template following the requirements, see for details

Sample Parameter
Required
Details
Sample Parameter
Required
Details
Sample Parameter
Required
Details

Refer to the following requirements to create sample sheets for running the analysis on ICA with Auto-launch. For sample sheet requirements common between deployments see . Samples sheets can be created using BaseSpace Run Planning Tool or manually by downloading and editing a sample sheet template

Refer to for this section's requirements.

Parameters
Required
Details
Parameters
Required
Details
Parameter
Required
Details

The BaseSpace Sequence Hub Run Planning tool is available and is used to generate a valid sample sheet in v2 format for use on a TSO 500 supported sequencer for both ICA and Standalone DRAGEN Server analysis options. Filling out the form on the user interface will produce a exportable sample sheet with the required fields filled in. Refer to for descriptions of fields that appear in ICA sample sheets.

Parameter Name
Required
Description
Parameter Name
Required
Description
Parameter Name
Required
Description

Instrument
Illumina Connected Analytics*
Standalone DRAGEN Server
Paired DRAGEN server
On-board DRAGEN
RNA Output
Sample Sheet Requirements
customercare@illumina.com
Docker website
page 7 of the DRAGEN v4.3.6 software release notes
customercare@illumina.com
Troubleshooting
customercare@illumina.com
Software Registration page
Beta Features
Licenses
iCredits pricing page
storage requirements

SoftwareVersion

Required

The DRAGEN component software version. For DRAGEN TSO 500 v2.5.3 and v2.5.4 specify 3.10.16.

AdapterRead1

Required

If using 8 bp indexes starting with UP or CP (used with TSO 500): AGATCGGAAGAGCACACGTCTGAACTCCAGTCA If using 10 bp indexes with UDP (used with TSO 500 HT): CTGTCTCTTATACACATCTCCGAGCCCACGAGAC Analysis fails if the incorrect adapter sequences are used

AdapterRead2

Required

If using 8 bp indexes starting with UP or CP (used with TSO 500): AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT If using 10 bp indexes with UDP (used with TSO 500 HT): CTGTCTCTTATACACATCTGACGCTGCCGACGA Analysis fails if the incorrect adapter sequences are used

AdapterBehavior

Required

Enter trim This indicates that the BCL Convert software trims the specified adapter sequences from each read.

MinimumTrimmedReadLength

Required

Enter 35. Reads with a length trimmed below this point are masked.

MaskShortReads

Required

Enter 35. Reads with a length trimmed below this point are masked.

Sample_ID

Required

Must match a Sample_ID listed in the TSO 500 Data section.

Index

Required

Index 1 sequence valid for Index_ID assigned to matching Sample_ID in the TSO 500 Data section.

Index2

Required

Index 2 sequence valid for Index_ID assigned to matching Sample_ID in the TSO 500 Data section.

Lane

Only for NovaSeq 6000 XP, NovaSeq 6000Dx, or NovaSeq X workflows

Indicates which lane corresponds to a given sample. Enter a single numeric value per row. Cannot be empty, i.e the analysis fails if the Lane column is present without a value in each row.

Sample_ID

Required

The unique ID to identify a sample. The sample ID is included in the output file names. Sample IDs are not case sensitive. Sample IDs must have the following characteristics: - Unique for the run. - 1–40 characters. - No spaces. - Alphanumeric characters with underscores and dashes. If you use an underscore or dash, enter an alphanumeric character before and after the underscore or dash. eg, Sample1-T5B1_022515. - Cannot be called all, default, none, unknown, undetermined, stats, or reports. - Must match a Sample_ID listed in the TSO 500 Data section. - Illumina recommends that the sample ID be based on the pair ID. Example: <Pair_ID>-DNA,<Pair_ID>-RNA. - Each sample must have a unique combination of Lane (if applicable), sample ID, and index ID or the analysis will fail.

Sample_Type

Required

Enter DNA or RNA. For HRD samples, this parameter must be DNA.

Pair_ID

Required

A unique ID that links DNA and RNA from the same biological sample from the same individual. Pair ID shares, at most, one DNA and one RNA sample per run. eg, if a Sample_ID is TestSample1-DNA for DNA and TestSample1-RNA for RNA, the Pair_ID TestSample1 will link these samples that are on different rows in the sample sheet together. If the pair ID is associated with more than one DNA or RNA sample, the analysis fails.

Sample_Feature

Required when using HRD add-on kit

Required for HRD enriched samples. For DNA samples that have undergone HRD enrichment, enter HRD in this column of the sample sheet. If the sample has not undergone HRD enrichment, leave the field empty.

Sample_Description

Not Required

Sample description must meet the following requirements: - 1–50 characters. - Alphanumeric characters with underscores, dashes and spaces. If you enter a underscore, dash, or space, enter an alphanumeric character before and after. eg, Solid-FFPE_213.

SoftwareVersion

Not Required

The TSO500S software version

StartsFromFastq

Required

Set the value to TRUE or FALSE. To auto-launch from BCL files, set to FALSE.

Sample_ID

Not Required

The same sample ID used in the Cloud_TSO500S_Data section.

ProjectName

Not Required

The BaseSpace project name.

LibraryName

Not Required

Combination of sample ID and index values in the following format: sampleID_Index_Index2

LibraryPrepKitName

Not Required

The Library Prep Kit used.

IndexAdapterKitName

Not Required

The Index Adapter Kit used.

GeneratedVersion

Not Required

The cloud GSS version used to create the sample sheet. Optional if manually updating a sample sheet.

CloudWorkflow

Not Required

Ica_workflow_1

Cloud_TSO500_Pipeline

Required

This value is a universal record number (URN). The valid values are:

  • Solid—urn:ilmn:ica:pipeline:e8eff7ef-1683-4f63-a0ba-9af542cd39e0#DRAGEN_TSO500_RUO_TISSUE_HT_v2_5_2_1_Pipeline

  • Solid HRD —urn:ilmn:ica:pipeline:172270e9-3678-45a9-a9f4-c9c7a0a32bb8#DRAGEN_TSO500_RUO_TISSUE_HRD_v2_5_2_1_Pipeline

BCLConvert_Pipeline

Required

The value is a URN in the following format: urn:ilmn:ica:pipeline: <pipeline-ID>#<pipeline-name>

Run Name

Required

Run Name can contain 255 alphanumeric characters, dashes, underscores, periods, and spaces; and must start with an alphanumeric, a dash or an underscore.

Run Description

Optional

Run Description can contain 255 characters except square brackets, asterisks, and commas.

Instrument Platform

Required

Choose from TSO 500 supported instruments:

  • NextSeq 500/550

  • NovaSeq 6000/6000Dx

Secondary Analysis

Required

  • BaseSpace/Illumina Connected Analytics (to generate sample sheet for cloud analysis)

  • Local

Sample Container ID

Optional

  • Unique Identifier for the container that holds the sample

Application

Required

  • DRAGEN TruSight Oncology 500 Analysis Software - 2.5.x (with HRD)

  • DRAGEN TruSight Oncology 500 Analysis Software - 2.5.x

Description

Optional

Optional text field

Library Prep Kit

Required

  • TruSight Oncology 500

  • TruSight Oncology 500 High Throughput

Index Adapter Kit

Required

TSO 500:

  • TruSight Oncology 500 (NovaSeq 6000Dx, NovaSeq X, NextSeq 1000/2000)

  • TruSight Oncology 500 (NovaSeq 6000, NextSeq 550)

TSO 500 HT:

  • TruSight Oncology 500 (NovaSeq 6000Dx, NovaSeq X, NextSeq1000/2000)

  • TruSight Oncology 500 (NovaSeq 6000, NextSeq 550)

Read Lengths: Read 1 and Read 2

Required

Auto filled with the standard values, but can be optionally overwritten.

Lane Usage

Optional

Checkbox allows users to apply the same lane across samples.

Lane

Required if Lane Usage is unchecked

Specify lanes for each sample. The unmarked checkbox at the top of the dropdown selects all lanes.

Pair ID

Required

The identifier used to pair DNA and RNA samples in a run. The field is mandatory whether a sample is part of a pair, or not.

To note: The Sample ID field in the generated samplesheet will be auto-filled based on the Pair ID values captured. “_dna” and “_rna” (for DNA and RNA samples respectively) will be appended to the Pair ID value to create the Sample ID.

DNA Index ID

Required

Index set ID options are based on selected Index Adapter Kit

DNA Sample Feature

Required for TSO 500 HRD

Column appears when TSO 500 HRD application is selected. Enter for HRD enriched DNA Samples

RNA Index ID

Required

Index set ID options are based on selected Index Adapter Kit

Project

Optional

Optional field to describe the associated project

Starts from Fastq

Required

True or False

If auto-launching TSO 500 from BCL files, set the value to False.

NextSeq 550Dx (RUO mode)

Yes

Yes

No

N/A

NextSeq 500/550

Yes

Yes

N/A

N/A

NovaSeq 6000

Yes

Yes

N/A

N/A

NovaSeq 6000Dx (RUO mode)

Yes

Yes

No

N/A

NextSeq 1000/2000

Yes**

Yes**

N/A

No

NovaSeq X

Yes**

Yes**

N/A

No

Run on Multiple DRAGEN Servers

DRAGEN TruSight Oncology 500 Analysis Software can be used to run a subset of samples on different DRAGEN servers to decrease overall processing time. This is possible using a three stage process called scatter/gather, which consists of demultiplexing, analysis, and result gathering.

The first stage is demultiplexing. Demultiplexing runs once on the entire run folder, generates FASTQ files for each sample in the run, and then separates sample files into respective folders. Once complete, note the output directory containing the sample directories holding the FASTQ files.

The process for scattering the analysis on multiple DRAGEN servers is as follows:

  1. Determine how many DRAGEN servers are available to run.

  2. Run demultiplexing on a single DRAGEN server.

Moving or modifying files during an analysis may cause the analysis to fail or provide incorrect results.

To sequence runs on multiple DRAGEN servers using the NovaSeq 6000 XP workflow, modify the sample sheet to include a subset of the lanes. For example, on an S2 flowcell, create two modified sample sheets with one containing the samples from lane 1 and the other from lane 2. This allows only the sample sheet to be modified instead of copying files between servers. This strategy would use the start from Run Folder commands without the --demultiplexOnly option. The entire run folder would need to be copied to each analysis server as demultiplexing is performed once per server.

  1. Transfer the FASTQ folder output from the original DRAGEN server to additional servers.

    1. Logs_Intermediates/FastqGeneration.

  2. Run analysis software using the --fastqFolder option on both the original and additional DRAGEN servers.

    • Option 1 Copy the original SampleSheet.csv to each server. Then provide a subsetted list to the Bash script on each DRAGEN server with the intended samples/pairs to run.

    • Option 2 Copy and modify the SampleSheet.csv to each DRAGEN server to only contain the list of samples/pairs to run. The software verifies that all samples in the sample sheet are contained within the FASTQ folders unless the --sampleOrPairIDs command-line option is present in the analysis launch. Failure to account for these checks results in an error.

  3. Copy the results from demultiplexing and each analysis run onto a single server, and then generate the final /Results directory, which contains the aggregated results. Enter the --gather command followed by the output directories of the demultiplexing step and each individual analysis run.

Commands for Multinode Analysis

Step
Command

Demultiplexing

DRAGEN_TSO500.sh --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable --runFolder /staging/{RunFolderName} --analysisFolder /staging/{DemultiplexAnalysisFolderName} --demultiplexOnly --sampleSheet /staging/illumina/{SampleSheetName}

Analysis

(one server)

DRAGEN_TSO500.sh --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable --fastqFolder /staging/{DemultiplexAnalysisFolderName}/Logs_Intermediates/FastqGeneration/ --analysisFolder /staging/{Node1AnalysisFolderName} --sampleSheet /staging/illumina/{SampleSheetName} --sampleOrPairIDs Pair_1,Pair_2

Analysis (additional servers)

DRAGEN_TSO500.sh --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable --fastqFolder /staging/{DemultiplexAnalysisFolderName}/Logs_Intermediates/FastqGeneration/ --analysisFolder /staging/{Node1AnalysisFolderName} --sampleSheet /staging/illumina/{SampleSheetName} --sampleOrPairIDs Pair_3

Gather

DRAGEN_TSO500.sh --analysisFolder /Gathered_Results --resourcesFolder staging/illumina/DRAGEN_TSO500/resources --runFolder /staging/{RunFolderName}/--sampleSheet /staging/illumina/{SampleSheetName} --gather /Demultiplex_Output /Node1_Output /Node2_Output

Sample Sheet Requirements
Sample Sheet Requirements
Sample Sheet Requirements
Illumina Connected Software - Sample Sheet
Sample Sheet Creation in BaseSpace Run Planner page
Sample Sheet Requirements page
NovaSeq 6000Dx: TSO 500 Auto-launch Analysis in Cloud
NextSeq 500/550Dx: TSO 500 and Connected Insights Auto-launch Analysis in Cloud
Starting From FASTQ Files
Sample Sheet Requirements
Standard Sample Sheet Requirements
[TSO500_Data] Section
ICA Auto-launch Sample Sheet Requirements

Analysis Output

When the analysis run completes, the DRAGEN TruSight Oncology 500 Analysis Software generates an analysis output folder in a specified location.

To view analysis output, navigate to the analysis output folder and select the files that you want to view.

Single Node Analysis Output Folder Structure

Single output folder structure is as follows.

  • Logs_Intermediates

    • AdditionalSarjMetrics— Contains per pair ID calculations to support the PCT_TARGET_250X metric.

    • Annotation—Contains outputs for small variant annotation.

      • Subfolders per sample ID—Contains the aligned small variants JSON.

    • CombinedVariantOutput

      • Subfolders per pair ID—Contains the combined variant output TSV files.

      • A combined output log file.

    • Contamination

      • Subfolders per DNA sample ID—Contains the contamination metrics JSON file and output logs.

    • DnaDragenCaller

      • Subfolders per sample ID—Contains the aligned BAM and index files, small variant VCF and gVCF, copy number variant VCF, MSI JSON, and QC outputs in CSV format.

    • DnaDragenExonCNVCaller

      • Subfolders per DNA sample ID—Contains the exon-level CNV JSON,the supporting calculation, and the QC files.

    • DnaFastqValidation—Contains the FASTQ validation output log for DNA samples.

    • FastqDownsample

      • Subfolders per RNA sample ID—Contains FASTQ files and output logs.

      • FastqDownsample output

    • FastqGeneration

    • Gis—Contains GIS-related files for HRD samples.

      • Subfolders per HRD sample ID—Contains the GIS JSON, the supporting calculation, and the QC files.

      • Also contains the annotated CNV VCF and gene level TSV file with absolute copy number and minor copy number information

    • LrAnnotation

      • Subfolders per DNA sample ID—Contains the annotated exon-level CNV JSON.

    • LrCalculator

      • Subfolders per DNA sample ID—Contains the exon-level CNV VCF.

    • MetricsOutput

      • Subfolders per pair ID—Contains the metrics output TSV files.

      • A combined output log file.

    • ResourceVerification—Contains the resource file checksum verification logs.

    • RnaAnnotation

      • Subfolders per RNA sample ID—Contains the annotated splice variant JSON.

    • RnaDragenCaller

      • Subfolders per sample ID—Contains the aligned BAM, fusion candidates CSV and QC outputs in CSV format.

    • RnaFastqValidation—Contains the FASTQ validation output log for RNA samples.

    • RnaFusion

      • Subfolders per RNA sample ID—Contains the All Fusions CSV and Fusion Processor logs.

    • RnaQcMetrics

      • Subfolders per RNA sample ID—Contains the RNA QC metrics JSON.

    • RnaSpliceVariantCalling

      • Subfolders per RNA sample ID—Contains the splice variants VCF.

    • Run QC—Contains the Run QC metrics JSON, Intermediate Run QC metrics JSON, and log file.

    • SampleAnalysisResults

      • Subfolders per pair ID—Contains the Sample Analysis Results JSON and detailed log file.

      • SampleSheetValidation—Contains the Intermediate sample sheet and validation log.

    • Tmb

      • Subfolders per DNA sample ID—Contains the TMB metrics CSV, TMB trace TSV, and related files and logs. passing_sample_steps.json —Contains the steps passed for each sample ID. pipeline_trace.txt—Contains a summary and troubleshooting file that lists each Nextflow task executed and the status (for example, COMPLETED or FAILED). run.log—Contains a complete trace-level log file describing the Nextflow pipeline execution. run_report.html—Contains high-level run statistics (performance, usage, etc.) run_timeline.html —Contains timeline-related information about the analysis run.

  • Results

    • Metrics Output TSV (all pair IDs)

    • Pair ID—The following outputs are produced for each sample:

      • Combined Variant Output TSV

        • Metrics Output TSV

        • TMB Trace TSV

        • Small Variant Genome VCF

        • Small Variant Genome Annotated JSON

        • Copy Number Variant VCF

        • GIS JSON

        • MSI JSON

        • Large Rearrangements CNV VCF

        • Large Rearrangements CNV Annotated JSON

        • All Fusion CSV

        • Splice Variant VCF

        • Splice Variant Annotated JSON

Multiple Node Analysis Output Folder Structure

Multiple output folder structure is as follows.

  • Demultiplex Output

    • A Logs_Intermediates folder containing FASTQ files per sample.

  • Node(X) Output—The following outputs are produced for each node used:

    • A Logs_Intermediates folder containing step specific and component specific outputs and logs for every step/component run in the analysis pipeline for the sample run on the node.

    • A Results folder containing results only for the sample run on the node.

  • Gathered Output

    • A Logs_Intermediates folder containing step specific and component specific outputs and logs for every step/component run in each analysis pipeline on every node—this contains outputs for all samples and pairs ran across all nodes in the analysis.

    • A Results folder containing results for all samples and pairs ran across all nodes—results are organized by Pair_ID, then Sample_ID. This folder also contains summary files which contain information on all samples.

ICA Output Folder Structure

This section describes each output folder generated during analysis and where to find metric and analytic files when the pipeline is executed. The same output folder structure and content exist in ICA and BaseSpace Sequence Hub.

High-Level Folder Structure

  • Run ID

    • TSO500_Nextflow_logs

      • _manifest.json

    • Results

      • _tags.json

    • Logs_intermediates

    • Errors—This folder is only present when analysis fails

TSO500_Nextflow_logs Folder Structure

The TSO_500_Nextflow_Logs provides information related to the execution of the pipeline on ICA as a whole and for specific nodes (when an analysis is split across multiple nodes). It contains files used to execute parts of the workflow on different nodes as well as records of the nextflow execution on those nodes.

  • TSO_500_Nextflow_Logs

    • _manifest.json

Results Folder Structure

Contains the aggregated MetricsOutput.tsv file at the root level. Additionally, the Results folder contains a subfolder for each pair ID.

  • Results

    • MetricsOutput.tsv

    • Sample_1

    • Sample_2

    • Sample_<#>

    • _tags.json

The Results subfolder contains the following files:

  • Results

    • MetricsOutput.tsv

    • <Pair_id>

      • CombinedVariantOutput.tsv

      • <SampleName>_MetricsOutput.tsv

    • <DNA_Sample_id>

      • CopyNumberVariants.vcf

      • DNAMergedSmallVariants_Annotated.json.gz

      • MergedSmallVariants.genome.vcf

      • MergedSmallVariants.vcf

      • microstat_output.json

      • TMB_Trace.tsv

    • <RNA_Sample_id>

      • AllFusions.csv

      • RNA_Annotated.json.gz

      • SpliceVariants.vcf

Logs_intermediates Folder Structure

Contains folders for each submodule in the DRAGEN TSO 500 on ICA pipeline. The folders contain a copy of all the relevant files required to create the metric output files and report files, as well as the combined log files at the root level and subfolders for each sample.

  • Logs_intermediates

    • DnaDragenCaller

    • AdditionalSarjMetrics

    • CombinedVariantOutput

    • FastqGeneration

    • MetricsOutput

    • DnaDragenExonCnvCaller

    • DnaFastqValidation

    • Gis

    • Tmb

    • SampleAnalysisResults

    • SampleSheetValidation

    • passing_sample_steps.json

    • RnaFusion

    • Contamination

    • Annotation

    • RnaAnnotation

    • RnaDragenCaller

    • RnaSpliceVariantCalling

    • RunQc

    • FastqDownsample

    • PassingSampleSteps

    • ResourceVerification

    • LrCalculator

    • LrAnnotation

    • RnaQcMetrics

    • RnaFastqValidation

Errors Folder Structure

Contains Errors.tsv. This file contains the summary of all the errors encountered during pipeline execution.

  • Errors

    • Errors.tsv

Analysis Methods

The software processes sequencing data to perform quality control, detect variants, determine tumor mutational burden (TMB), microsatellite instability (MSI) status, and genomic instability score (GIS), and report results. The following sections describe the analysis methods used in DRAGEN TruSight Oncology 500 Analysis Software.

DRAGEN TruSight Oncology 500 Analysis Software uses the following workflows to analyze sequencing data.

  • FASTQ Generation

  • DNA Analysis

    • DNA Alignment and Realignment

    • Read Collapsing

    • Indel Realignment and Read Stitching

    • Small Variant Calling

    • Small Variant Filtering

    • Copy Number Variant (CNV) Calling

    • Phased Variant Calling

    • Variant Merging

    • Annotation

    • Tumor Mutational Burden (TMB) Scoring

    • Microsatellite Instability (MSI) Status

    • Contamination Detection

  • RNA Analysis

    • Downsampling

    • Read Trimming

    • Alignment

    • Duplicate Marking

    • Fusion Calling

    • RNA Fusion Filtering

    • Splice Variant Calling

    • Annotation

    • Fusion Merging

  • Quality Control

    • Run QC

    • DNA Sample QC

    • RNA Sample QC

Block List

The block list represents high noise regions in the panel where false positive variant calls are likely produced. As a result, all positions in the gVCF are marked as Filter=excluded_regions to indicate variant call results are not reliable in such regions.

The block list includes the following genes:

  • HLA A

  • HLA B

  • HLA C

  • KMT2B

  • KMT2C

  • KMT2D

  • chrY

  • Any position with VAF 1% occurrence in six or more of the 60 baseline samples.

Installation of 2.5.3 on Standalone DRAGEN Server

Overview

The installation script for DRAGEN TruSight Oncology 500 Analysis Software installs the following software and dependencies:

  1. DRAGEN TruSight Oncology 500 Analysis Software itself

  2. DRAGEN Software if a compatible version is not present

  3. Docker software if a compatible version is not present

  4. A script required to generate DRAGEN genome hash table

  5. A script to check that DRAGEN TruSight Oncology 500 Analysis Software is installed properly

Installation Requirements

Hardware

  • DRAGEN server v3 or v4

  • Network-attached storage (NAS) with enabled mkfifo if performing analysis for the TruSight Oncology 500 High-Throughput assay

Software

The following software is required to successfully install DRAGEN TSO 500 v2.5.3:

  • Linux CentOS 7.9 operating system (or later) or Oracle Linux 8 (or later), one of which is provided on the server. Oracle Linux 8 is recommended.

  • Docker Software, see table below for minimum version needed. If sufficient Docker software is not present on the server, the TSO 500 installer will install compatible Docker software.

  • DRAGEN Server Software, see table below for minimum version needed as the host version on the server. If sufficient DRAGEN software is not present on the server, the TSO 500 installer will install compatible DRAGEN software.

Software Dependency
Compatible
Installs

Docker

20.10 or greater

Docker 20.10.15

DRAGEN Software

v3.10.x where x is 16 or greater

DRAGEN Software 3.10.16

Licenses

  • TSOCombined license

  • TSO500_HRD license (to analyze data generated with the TSO 500 HRD add-on kit)

Permissions

Illumina recommends logging in as root user for installation, but as a non-root user for running TSO 500 analysis.

  • Installing and uninstalling DRAGEN TruSight Oncology 500 Analysis Software and running the system check requires root privileges.

  • Run DRAGEN TruSight Oncology 500 Analysis Software without being logged in as a root user. Running the DRAGEN TruSight Oncology 500 Analysis Software as root is not required or recommended.

Compatibility with other DRAGEN Pipelines

Compatibility of software for co-installation with DRAGEN TSO 500 v2.5.3 on a DRAGEN server is summarized in the table below:

Software
Version
Type
Compatible

DRAGEN TSO 500

2.5.2 or below

Single-version

No

DRAGEN TSO 500

2.5.4

Multi-version

No

DRAGEN TSO 500

2.6.0

Single-version

Yes*

DRAGEN TSO 500

2.6.1+

Multi-version

No

DRAGEN TSO 500 ctDNA

1.2 or below

Single-version

No

DRAGEN TSO 500 ctDNA

2.1.1, 2.5.0, 2.6.0, 2.6.1

Single-version

Yes**

DRAGEN TSO 500 ctDNA

2.6.2+

Multi-version

No

DRAGEN pipelines***

3.10.16

Single-version

Yes

DRAGEN pipelines***

Any version except 3.10.16

Single- or multi-version

No

*Install DRAGEN TSO 500 Analysis Software v2.6.0 after installing 2.5.3. If v2.5.3 is installed after v2.6.0, re-execute the installation script for v2.6.0 to install the compatible version of DRAGEN Software without impacting v2.5.3.

** When co-installing v2.5.3 with DRAGEN TSO 500 ctDNA software on the same DRAGEN server, install the software with the highest corresponding DRAGEN Software version last. Find DRAGEN software versions for DRAGEN TSO 500 ctDNA software in parentheses: v2.1.1 (v3.10.9), v2.5.0 (v3.10.15), 2.6.0 (v3.10.17), 2.6.1 (v3.10.18).

***For example, DRAGEN Enrichment, DRAGEN Germline, and others

Installation Instructions

As a root user, perform the following steps to install DRAGEN TSO 500 v2.5.3 Analysis Software:

  1. Download the installation package provided in the email from Illumina. The link expires after 7 days.

It is recommended to use a command line tool like wget or curl to download the file rather than pasting the link into the web browser bar. For example:

curl -o {filename} "{link}"

wget -O {filename} '{link}'

Where the file name is the installation script file name, and the link is provided by Illumina Customer Care.

  1. Make sure no other analysis is being performed. Installing the software while performing other analyses prevent the installer process from proceeding

  2. Copy the install script to the /staging directory to store the script in the directory.

Installation Script: install_DRAGEN_TSO500-2.5.3.run

MD5sum: sha256:ab9fae7ba58842d797ba689e702693c0fd7a5c2852dda610dce6d68152fe1f8d

  1. Use the following command to update the run script permission: chmod +x /staging/install_DRAGEN_TSO500-2.5.3.run

  2. Use the following command to run the installation script (run time ~ 20 mins):

    1. For Docker, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.5.3.run. The script installs compatible DRAGEN software and removes any previously installed versions.

    2. For Apptainer, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500-2.5.3.run -- --noDockerInstall This will not install Apptainer, but will install the analysis software in the SIF container format and modify the software to launch analyses using Apptainer.

  3. During the installation process, you might be instructed to reboot or power cycle the system to complete the installation of the DRAGEN software. A power cycle of the system requires the server be shut down and restarted.

  4. Log out of the server and then log back in.

License Installation

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server connected to the internet:

  1. Confirm that the server is connected to the Internet, example: ping www.illumina.com

  2. Run the following command: /opt/edico/bin/dragen_lic -i auto

To install a license (TSOCombined and/or TSO500_HRD) on a DRAGEN server not connected to the internet:

  1. Download and save the license file(s) to a location that is accessible from the DRAGEN server

  2. For each license file, run the command, where <license file received> is the absolute path to the license file: sudo /opt/edico/bin/dragen_lic -i /tmp/<license file received>.bin

To check the success of license installation, run: /opt/edico/bin/dragen_lic. Installed licenses should be in the list.

Running the System Check

After installation is complete, make sure the system functions properly by running the following command: /usr/local/bin/check_DRAGEN_TSO500-2.5.3.sh

The script checks that:

  • All required services are running

  • Proper Docker image is installed

  • DRAGEN TSO 500 Analysis Software can successfully process a test data set

The system check script runs for approximately 25 minutes. If the script prints a failure message, contact Illumina Technical Support and provide the /staging/check_DRAGEN_TSO500_<timestamp>.tgz output file.

If using MacOS to connect to a server, an error can occur if the local settings are not in English. To resolve the error, disable the ability to set environment variables automatically in Terminal settings.

Uninstall Software

The DRAGEN TruSight Oncology 500 Analysis Software installation includes an uninstall script called uninstall_DRAGEN_TSO500-2.5.3.sh, which is located in /usr/local/bin.

Executing the uninstall script removes the following assets:

  • All DRAGEN TruSight Oncology 500 Analysis Software related scripts located in /usr/local/bin

  • Resources found in /staging/illumina/DRAGEN_TSO500

  • The dragen_tso500:2.5.3: Docker image

To uninstall the DRAGEN TSO 500 Analysis Software, run the following command as a root user:

uninstall_DRAGEN_TSO500-2.5.3.sh

You are not required to uninstall Docker or DRAGEN software. To remove Docker, review the install instructions for your operating system in the Docker documentation.

Sample Sheet Templates

Sample Sheet templates for TSO 500 v2.5.x standalone DRAGEN server and ICA manual launch analysis can be found in the table below. For auto-launch compatible sample sheets, use BaseSpace Run Planner.

DRAGEN TSO 500 analysis software is compatible with several instruments and assay workflows (standard, XP), each of which have implications for the sample sheet.

Sample sheet templates contain all required fields, including index sequences in the proper orientation for all indexes from a given library prep kit. The templates are provided as a starting point for creating a sample sheet manually when launching analysis on a standalone DRAGEN server or on ICA using manual launch.

*Lane numbers cannot exceed what is supported by the flow cell in use.

Command-Line Options

You can use the following command-line options with DRAGEN TruSight Oncology 500 Analysis Software.

To learn more about the input requirements, use the --help command-line option.

Option
Required
Description

--help

No

Displays a help screen with available command line options.

--analysisFolder

No

--resourcesFolder

No

Path to the resource folder location. The default location is /staging/illumina/DRAGEN_TSO500/resources. If not using the default location, enter the full path to the resource folder.

--runFolder

Yes

Required when --fastqFolder is not specified. Provide the full path to the local run folder.

--fastqFolder

Yes

Required when --runFolder is not specified. Provide the full path to the local FASTQ folder. Analysis starts at this location.

--user

No

Optional for Docker. Specify the user ID to be used within the Docker container.

--version

No

Displays the version of the software.

--sampleSheet

No

Provide the full path, including file name, if not provided as SampleSheet.csv in the run folder

--sampleOrPairIDs

No

Provide the comma-delimited sample or pair IDs that should be processed on this node with no spaces. For example, Pair_1,Pair_2,Sample_1.

--demultiplexOnly

No

Demultiplex to generate FASTQ only without additional analysis.

--gather

No

Follow this option for any directories with results that should be gathered into a single Results folder.

--hashtableFolder

No

Defaults to the DRAGEN hash table location created upon install. If not using the default location, enter the hash table location.

Note:

  • Use full paths when specifying the file paths in the command line.

  • Avoid special characters such as &, *, #, and spaces.

  • When starting from BCL files, only the run folder needs to be specified. The immediate parent directory containing the BCL files does not need to be specified.

When running the analysis software using SSH, Illumina recommends using additional software to prevent unexpected termination of analysis. Illumina recommends screen and tmux.

  1. Wait for any running DRAGEN TruSight Oncology 500 Analysis Software containers to complete before launching a new analysis. Run the following command to generate a list of running containers:docker ps

  2. Select from one of the following options:

  • Start from BCL files in the run folder with the sample sheet included in the run folder. DRAGEN_TSO500.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName}

  • Start from BCL files in the run folder with the sample sheet located in a folder other than the run folder. DRAGEN_TSO500.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleSheet /staging/{SampleSheetName}.csv

  • Start from BCL files in the run folder with a different sample sheet and demultiplexing only. DRAGEN_TSO500.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleSheet /staging/{SampleSheetName}.csv \ --demultiplexOnly

  • Start from FASTQ with the sample sheet included in the FASTQ folder and with different resources and hash table folders. DRAGEN_TSO500.sh \ --resourcesFolder /staging/illumina/DRAGEN_TSO500/resources \ --hashtableFolder /staging/illumina/DRAGEN_TSO500/ref_hashtable \ --fastqFolder /staging/{FastqFolderName} \ --analysisFolder /staging/{AnalysisFolderName}

  • Start from FASTQ folder with sample sheet included in the FASTQ folder and subset of samples or pairs. DRAGEN_TSO500.sh \ --fastqFolder /staging/{FastqFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleOrPairIDs "Pair_1,Pair2"

Starting from BCL Files

For the data generated by NextSeq 1000/2000 and NextSeq X, the analysis can only be started from FASTQs and not from BCLs.

If starting from BCL (*.bcl) files, DRAGEN TruSight Oncology 500 Analysis Software requires the run folder to contain certain files and folders. These inputs are required for Docker.

The run folder contains data from the sequencing run, make sure that the folder contains the following files:

Folder/File
Description

Config folder

Configuration files

Data folder

*.bcl files

Images folder

[Optional] Raw sequencing image files.

Interop folder

Interop metric files.

Logs folder

[Optional] Sequencing system log files.

RTALogs folder

Real-Time Analysis (RTA) log files.

RunInfo.xml file

Run information.

RunParameters.xml file

Run parameters.

SampleSheet.csv file

Sample information. If you want to use a sample sheet that is not in the run folder or a sample sheet named something other than SampleSheet.csv, provide the full path.

Starting from FASTQ Files

The following inputs are required for running the DRAGEN TruSight Oncology 500 Analysis Software using FASTQ (*.fastq) files. The requirements apply to Docker.

  • Full path to an existing FASTQ folder.

  • The sample sheet is in the FASTQ folder path, or you can set the path to the sample sheet with the --sampleSheet override command line option.

Make sure there is sufficient disk space for the analysis to complete. Refer to the --help command line argument details for disk space requirements.

Use BCL Convert to produce FASTQ files for DRAGEN TruSight Oncology 500 Analysis Software. Using bcl2fastq does not produce the same results and is discouraged.

Make sure that BCL Convert is set to write UMI sequences to the read headers in the FASTQ files.

FASTQ File Organization

Store FASTQ files in individual subfolders that correspond to a specific Sample_ID. Keep file pairs together in the same folder. Alternatively, store the FASTQ files in one flat folder structure where the FASTQ files are stored in one folder.

The DRAGEN TruSight Oncology 500 Analysis Software requires separate FASTQ files per sample. Do not merge FASTQ files.

The instrument generates two FASTQ files per flow cell lane, so that there are eight FASTQ files per sample.

Sample1_S1_L001_R1_001.fastq.gz

  • Sample1 represents the Sample ID.

  • The S in S1 means sample, and the 1 in S1 is based on the order of samples in the sample sheet, so S1 is the first sample.

  • L001 represents the flow cell lane number.

  • The R in R1 means Read, so R1 refers to Read 1.

Analysis Launch on Standalone DRAGEN Server

Start the DRAGEN TruSight Oncology 500 Analysis Software with the DRAGEN_TSO500.sh Bash script. The script is installed in the /usr/local/bin directory. The Bash script is executed on the command line and runs the software with Docker (or Apptainer if specified).

  • Path to the sequencing run or FASTQ folder. Copy the run or FASTQ folder to the DRAGEN server into the staging folder with the following recommended organization: /staging/runs/{RunID}. You can copy the run folder onto the DRAGEN server using Linux commands such as rsync. The sample sheet within the run folder is used unless otherwise specified through the command line.

Before running the analysis, confirm that the output directory for the software to write to is empty and does not include results of previous analyses.

Storage Requirements

For optimal performance, run analysis on data stored locally on the DRAGEN server. Analysis of data stored on NAS can take longer and performance can be less reliable.

The DRAGEN server provides an NVMe SSD in the /staging directory to use as the software output directory. Network-attached storage is required for long-term storage.

When running the DRAGEN TruSight Oncology 500 Analysis Software, use the default settings or set the -analysisFolder command line option to a directory in /staging to make sure the DRAGEN server processes read and write data on the NVMe SSD.

Before beginning analysis, develop a strategy to copy data from the DRAGEN server to a network‑attached storage. Delete output data on the DRAGEN server as soon as possible.

The following are the run and analysis output sizes for each sequencing system per 101 bp:

Sequencing System
Run Folder Output (Gb)
Analysis Output (Gb)
Minimum Disk Space (Gb)

NextSeq 500/550/550Dx (RUO) HO flow cell

32-55

82-85

150

NovaSeq 6000/6000Dx (RUO) SP Flow Cell

85-100

250-374

300

NovaSeq 6000/6000Dx (RUO) S1 Flow Cell

164-200

360-665

800

NovaSeq 6000/6000Dx (RUO) S2 Flow Cell

290-460

890-1600

1500

NovaSeq 6000/6000Dx (RUO) S4 Flow Cell

800-1200

2700-4100

3000

NovaSeq X 1.5B

213

352

800

NovaSeq X 10B

1100

1800

3000

NovaSeq X 25B

1800

3300

4000

NextSeq 1000/2000

41

107

150

When launching the analysis, the software checks that the minimum disk space required is available. If the minimum disk space is not available, the software shows an error message and prevents analysis from starting. If disk space is exhausted during a run, the run shows an error and stops analyzing.

Moving or modifying files during an analysis may cause the analysis to fail or provide incorrect results.

Manual Launch of DRAGEN TSO 500 Analysis on ICA

How to Launch Analysis

After adding the Bundle to the Project, an example dataset becomes available in the Demo_Data folder for the Project. 

  1. Download Results: After analysis is complete, navigate to results in the configured output location.

Please see the Illumina Support Shorts for guidance on how to set up and run DRAGEN TSO 500 RUO analysis on ICA.

Analysis Parameters on ICA

To launch an analysis via the ICA user interface, configure a DRAGEN TSO 500 pipeline analysis with the following parameters.

Parameter Name
Description

User Reference

The analysis run name.

User Tags

Text labels to help index the analysis.

Notify me when task is completed

Option to receive an email notification when analysis is complete.

Output Folder

The path to the analysis output folder. The default path is the project output folder.

Entitlement Bundle

Automatically populated from the project details.

Sample Sheet

Select a sample sheet in CSV format for the analysis.

To note: Sample Sheet selection is optional if starting from a run folder, and required when submitting a FASTQ folder.

Input Folder

The run folder or FASTQ folder that contains files to analyze.

Starts from FASTQ

True for analysis performed on files in the FASTQ folder. False for analysis performed on files in the run folder.

Sample or Pair IDs

Optional subset of Sample IDs or Pair IDs to analyze.

Storage Size

The storage size to allocate for the analysis. The default and recommended value is Large.

DNA Output

Small Variant gVCF

File name: {SAMPLE_ID}_hard-filtered.gvcf.gz

The small variant genome variant call file contains information on all candidate small variants evaluated, including complex variants up to 15 bp from phased variant calling across the entire TSO 500 panel.

The variant status is determined by the FILTER column in the genome VCF as follows.

Small Variant Annotated JSON

File name: {SAMPLE_ID}_DNAVariants_Annotated.json.gz

The small variants annotated file provides variant annotation information for all nonreference positions from the genome VCF including pass and nonpass variants.

TMB Trace

The TMB trace file provides comprehensive information on how the TMB value is calculated for a given sample. All passing small variants from the small variant filtering step are included in this file. To calculate the numerator of the TmbPerMb value in the TMB JSON, set the TSV file filter to use the IncludedInTMBNumerator with a value of True.

The TMB trace file is not intended to be used for variant inspections. The filtering statuses are exclusively set for TMB calculation purposes. Setting a filter does not translate into the classification of a variant as somatic or germline.

Copy Number VCF

The copy number VCF file contains CNV calls for DNA libraries of the amplification genes targeted by DRAGEN TruSight Oncology 500 Analysis Software. The CNV call indicates fold change results for each gene classified as reference, deletion, or amplification.

The value in the QUAL column of the VCF is a Phred transformation of the p-value where Q=-10xlog10(p-value). The p-value is derived from the t-test between the fold change of the gene against the rest of the genome. Higher Q-scores indicate higher confidence in the CNV call.

In the VCF notation, <DUP> indicates the detected fold change (FC) is greater than a predefined amplification cutoff. <DEL> indicates the detected FC is less than a predefined deletion cutoff for that gene. This cutoff can vary from gene to gene.

In analysis versions prior to v2.5, <DEL> calls in the VCF are marked as LowValidation. The LowValidation filter indicates that the calls have been validated only with in silico data sets and are provided as information only.

Each copy number variant is reported as a fold change on normalized read depth in a testing sample relative to the normalized read depth in diploid genomes. Given tumor purity, you can infer the ploidy of a gene in the sample from the reported fold change.

Given tumor purity X%, for a reported fold change Y, you can calculate the copy number n using the following equation:

For example, a tumor purity at 30% and a MET with fold change of 2.2x indicates that 10 copies of MET DNA are observed.

Auto-Launch of DRAGEN TSO 500 Analysis on ICA

Auto-launch Prerequisites and Workflow

*The BaseSpace Sequence Hub setting for run monitoring and storage must be selected on the instrument to use DRAGEN TSO 500 analysis auto-launch. For information on preparing your instrument for DRAGEN TSO 500 Auto-launch, refer to the documentation for your instrument.

  1. Use BaseSpace Sequence Hub Run Planning tool or the sample sheet templates provided on the support page to create and export a sample sheet.

    1. If BaseSpace Run Planning tool is not available in your region, use the sample sheet template.

    1. Data is uploaded to BaseSpace Sequence Hub and then pushed to ICA. You can monitor the run in BaseSpace Sequence Hub.

    2. Analysis auto launches in ICA when sequencing and the upload completes. You can monitor the status of the analysis in BaseSpace Sequence Hub or ICA

    3. If necessary, you can requeue the analysis via BaseSpace Sequence Hub.

  2. View the analysis output results in either BaseSpace Sequence Hub or ICA.

To avoid invalid sample sheet configurations, Illumina recommends using BaseSpace Run Planning tool to generate sample sheets. Using an invalid sample sheet can result in failed runs and analyses.

For the data generated by NextSeq 1000/2000 and NextSeq X, only manual option for launching analysis on ICA is available. The analysis can only start from FASTQs files.

BaseSpace Sequence Hub Requirements for ICA Auto-Launch

BaseSpace Run Planning tool is a multi-step workflow that generates a manual launch or auto-launch capable sample sheet for export and requires the following additional settings:

  • Access to BaseSpace Sequence Hub.

  • ICA Run Storage is enabled under BaseSpace Sequence Hub settings.

Requeue Analysis

You can requeue analysis of a run via the run's Summary page in BaseSpace Sequence Hub.

Minimum Storage Requirements on ICA

Guided Examples

Please review these guided examples of using DRAGEN TSO 500 Analysis Software with auto-launch on ICA:

Combined Variant Output

File name: {Pair_ID}_CombinedVariantOutput.tsv

The combined variant output file contains the variants and biomarkers in a single file that is based on a single sample. If using pair ID, the file is based on paired DNA and RNA samples from the same individual. The output contains the following variant types and biomarkers:

  • Small variants

  • Copy number variants (CNV) (with absolute copy number when HRD Assay is run)

  • TMB

  • MSI

  • Fusions

  • Splice variants

  • GIS (when HRD Assay is run)

  • Gene-level Loss of Heterozygosity (when HRD Assay is run)

  • Exon-level CNVs

The combined variant output file also contains Analysis Details and Sequencing Run Details sections. The details of each are listed in the following table:

Combined variant output produces small variants with blank fields in the following situations:

  • The variant has been matched to a canonical RefSeq transcript on an overlapping gene not targeted by TruSight Oncology 500.

  • The variant is located in a region designated iSNP, indel, or Flanking in the TST500_Manifest.bed file located in the Resources folder.

Variant Filtering Rules

  • Small Variants - All variants with the FILTER field marked as PASS in the hard-filtered genome VCF are present in the combined variant output.

    • Gene information is only present for variants belonging to canonical transcripts that are within the Gene Allow List–Small Variants.

    • Transcript information is only present for variants belonging to canonical transcripts that are within the Gene Allow List–Small Variants.

  • Copy Number Variants - Copy number variants must meet the following conditions:

    • FILTER field marked as PASS.

    • ALT field is <DUP or <DEL> .

  • Fusion Variants - Fusion variants must meet the following conditions:

    • Passing variant call (KeepFusion field is true).

    • Contains at least one gene on the fusion allow list.

    • Genes separated by a dash (-) indicate that the fusion directionality could be determined. Genes separated by a slash (/) indicate that the fusion directionality could not be determined.

  • Biomarkers TMB/MSI - Always present when DNA sample is processed.

  • Splice Variants - Passing splice variants that are contained on genes EGFR, MET, and AR.

  • Biomarker GIS - Present only if TruSight Oncology 500 HRD analysis is performed

  • Loss of Heterozygosity - Present only when TruSight Oncology 500 HRD is run. Loss of heterozygosity (LOH) must meet the following condition:

    • MCN field is equal to 0

  • Exon-level CNVs - Exon-levels CNVs must meet the following conditions:

    • BRCA1 or BRCA2 contains at least one affected exon.

    • ALT field is <DUP> or <LOSS> .

RNA Output

Splice Variant VCF

The splice variant VCF contains all candidate splice variants targeted by the analysis panel identified by the RNA analysis pipeline. You can apply the following filters for each variant call:

Refer to the headers in the output for more information about each column.

Splice Variant Annotated JSON

If available, each splice variant is annotated using the Illumina Annotation Engine. The following information is captured in the JSON:

  • HGNC Gene

  • Transcript

  • Exons

  • Introns

  • Canonical

  • Consequence

All Fusions CSV

The all fusions CSV file contains all candidate fusions identified by the DRAGEN RNA pipeline. Two output columns in the file describe the candidate fusions: Filter and KeepFusion.

The following table describes the semicolon-separated output found in the Filter columns. The output is either a confidence filter or information only as indicated. If none of the confidence filters are triggered, the Filter column contains the output PASS, else it contains the output FAIL.

Filter Column Output

The KeepFusion column of the output has a value of TRUE when none of the confidence filters are triggered.

Refer to the headers in the output for more information about each column.

Fusion Columns

When using Microsoft Excel to view this report, genes that are convertible to dates (such as MARCH1 automatically convert to dd-mm format (1 Mar) by Excel. The following are fusion allow list genes:

  • ABL1

  • AKT3

  • ALK

  • AR

  • AXL

  • BCL2

  • BRAF

  • BRCA1

  • BRCA2

  • CDK4

  • CSF1R

  • EGFR

  • EML4

  • ERBB2

  • ERG

  • ESR1

  • ETS1

  • ETV1

  • ETV4

  • ETV5

  • EWSR1

  • FGFR1

  • FGFR2

  • FGFR3

  • FGFR4

  • FLI1

  • FLT1

  • FLT3

  • JAK2

  • KDR

  • KIF5B

  • KIT

  • KMT2A

  • MET

  • MLLT3

  • MSH2

  • MYC

  • NOTCH1

  • NOTCH2

  • NOTCH3

  • NRG1

  • NTRK1

  • NTRK2

  • NTRK3

  • PAX3

  • PAX7

  • PDGFRA

  • PDGFRB

  • PIK3CA

  • PPARG

  • RAF1

  • RET

  • ROS1

  • RPS6KB1

  • TMPRSS2

Analysis Launch on ICA

Methods for Launching Analysis

Illumina Connected Analytics (ICA) supports the following methods for launching DRAGEN TruSight Oncology 500 Analysis Software.

For the data generated by NextSeq 1000/2000 and NextSeq X, only manual option for launching analysis on ICA is available. The analysis can only start from FASTQs files.

For more information about using ICA or BaseSpace Sequence Hub, refer to the following support pages on the Illumina support site.

DNA Expanded Metrics

DNA expanded metrics are provided for information only. They can be informative for troubleshooting but are provided without explicit specification limits and are not directly used for sample quality control. For additional guidance, contact Illumina Technical Support.

HRD Metrics Report

The Illumina DRAGEN TruSight Oncology 500 Analysis Software allows for analysis of sequencing data generated from the TruSight Oncology 500 HRD assay. When HRD samples are analyzed new results and metrics are included in the CombinedVariantOutput and MetricsOutput files respectively. The following tables detail how these scores and QC metrics are derived.

*The GIS algorithm within the TSO500 pipeline (which does not have a cell line mode due to the TSO500 pipeline being non-configurable) is only intended for FFPE samples. Cell line samples will not accurately report GIS results as the tumor fraction (>90%) is too high to reliably distinguish tumor vs germline variants.

HRD Metrics Added to Metrics Output File

Metrics Output

Metrics Output

The MetricsOutput.tsv file contains the following quality control metrics for all samples:

  • DNA library QC metrics for:

    • Small variant calling

    • TMB

    • MSI

    • CNV

    • [HRD] GIS

  • RNA library QC metrics

  • Run QC metrics, analysis status, and contamination

This TSV file also includes expanded DNA library QC metrics per sample, based on total reads, collapsed reads, chimeric reads, and on-target reads. Analysis using RNA samples also produces RNA library QC metrics and expanded RNA library QC metrics per sample based on total reads and coverage.

The MetricsOutput.tsv file is a final combined metrics report with sample status, key analysis metrics, and metadata. Sample metrics within the report include suggested lower specification limits (LSL) and upper specification limits (USL) for each sample in the run.

TSOCombined license has been pre-installed to DRAGEN servers in manufacturing since August 2022 and TSO500_HRD since February 2025 and additionally distributed to DRAGEN servers connected online. To generate a list of installed DRAGEN server licenses, run the following command: /opt/edico/bin/dragen_lic. If a license is not installed, contact Illumina Customer Care at for the license.

A non-root user must be a member of the Docker group to run Docker. For more information on Docker permission requirements and alternatives to running as root, refer to the Docker documentation available on the .

DRAGEN TSO 500 Analysis Software v2.5.3 is single-version compatible. It does not include multi-version compatibility that refers to ability to be installed on a single DRAGEN server with software running a different version of DRAGEN software. For example, multi-version compatible pipelines running DRAGEN v4.3.6 can be co-installed on a server alongside DRAGEN TSO 500 pipelines running DRAGEN v3.10.17. For more details on DRAGEN multi-version compatibility, please visit .

Contact Illumina Customer Care at to obtain the DRAGEN TSO 500 Analysis Software installer package.

Use the following command to build the DRAGEN server hash table, which runs for approximately 60 minutes: /usr/local/bin/build-hashtable_DRAGEN_TSO500-2.5.3.sh Refer to if any errors occur.

Review license requirements, how to check which licenses are installed and how to receive a license in . Licenses can be installed before or after DRAGEN TSO 500 software installation.

Contact Customer Care at to request a license file for each of the needed licenses

For interactive run planning or to create a sample sheet for ICA Auto-launch, use to create valid sample sheets for either local or cloud analysis. To set up a run in BaseSpace run planner, refer to .

Users can visit the section to learn additional details on required fields and values as they fill-in their sample information. Use the lookup table below to select and download the sample sheet template that matches your instrument, assay, and workflow configuration:

Assay
Instrument
Assay Workflow
File

TSO500

NextSeq 550

Standard

TSO500 + HRD

NextSeq 550

Standard

TSO500 + HRD

NovaSeq 6000

Standard

TSO500 + HRD

NovaSeq 6000Dx (in RUO mode)

Standard

TSO500 HT

NovaSeq 6000

Standard

TSO500 HT

NovaSeq 6000

XP*

TSO500 HT

NovaSeq 6000Dx (in RUO mode)

Standard

TSO500 HT

NovaSeq 6000Dx (in RUO mode)

XP*

Path to the local analysis folder. The default location is /staging/DRAGEN_TSO500_Analysis_{timestamp}. If not using the default location, provide the full path to the local analysis folder. Folder must have sufficient space and must be on an NVMe SSD drive. For example, the /staging directory on the DRAGEN server. Refer to table in for minimum disk space requirements.

The FASTQ folder structure conforms to the folder structure in

For arguments, refer to . You can start from BCL files or from the FASTQ folder produced by BCL Convert. The following requirements apply for both methods:

Run folder must be intact. Refer to for input requirements.

If the analysis output folder path is different from the default, provide the analysis output folder path. Refer to .

Create a Project: Project can be specific for the DRAGEN TruSight Oncology 500 pipeline or it can contain multiple Pipelines and/or Tools). For information on creating Projects, refer to the Projects section in .

ICA standard storage is used by default as soon as the Project is saved. To connect a different storage source, set it up before creating your Project. For details and options, refer to the Storage section in .

Edit Project and Add Bundle: Edit the Project and add the bundle titled, "DRAGEN TSO 500 v2.5.2 (XX)." XX is a 2-letter code designating the region from which you are launching the analysis. Adding the Bundle automatically adds the pipeline and associated resource files and datasets to the Project. For information on Bundles, refer to the Bundles section in .

 Upload the sequencing data: For information on viewing and uploading data, refer to the Data section in .

Start Analysis: In the Project, navigate to Pipelines, desired TSO 500  Pipeline, and then select  "Start New Analysis". Set up the new analysis by configuring the parameters listed in the . When the required files are completed, start analysis.

For information about using pipelines, refer to .

Refer to for more information.

Filter
Note
Column
Description

Import the sample sheet to the instrument and start the sequencing run. Refer to for sample sheet guidance.

Refer to the for information on setting up a BaseSpace Sequence Hub project.

Refer to the for more information on requeuing an analysis.

Sequencing System
Minimum Disk Space (Gb)

Refer to the for information on how to manage accounts and subscriptions.

Analysis Details
Sequencing Run Details

Refer to for more information.

Filter Name
Description
Filter
Filter Type
Description
Fusion Object Field
Source

—Stream run data directly from the instrument to ICA via a specially configured sample sheet and automatically begin DRAGEN TSO 500 analysis.

—Initiate DRAGEN TSO 500 analysis on ICA using the run files and sample sheet files in the project.

Metric
Description
Troubleshooting
Metric
Description
Metric
Description
Section in Metrics Output

For troubleshooting information, refer to

customercare@illumina.com
Docker website
page 7 of the DRAGEN v4.3.6 software release notes
customercare@illumina.com
Troubleshooting
customercare@illumina.com
BaseSpace Run Planner
Sample Sheet Creation in BaseSpace Run Planner
Sample Sheet guidelines
Command-Line Options
Command-Line Options
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics support site page
Licenses
FASTQ File Organization.
Starting from BCL Files
table below

PASS

PASS variants.

base_quality

Site filtered because median base quality of alt reads at this locus does not meet threshold.

filtered_reads

Site filtered because the fraction of reads is too large.

fragment_length

Site filtered because absolute difference between the median fragment length of alt reads and median fragment length of ref reads at this locus exceeds threshold.

low_depth

Site filtered because the read depth is too low.

low_frac_info_reads

Site filtered because the fraction of informative reads is below threshold.

long_indel

Site filtered because the indel length is too long.

mapping_quality

Site filtered because median mapping quality of alt reads at this locus does not meet threshold.

multiallelic

Site filtered because more than two alt alleles pass tumor LOD.

no_reliable_supporting_read

Site filtered because no reliable supporting somatic read exists.

read_position

Site filtered because median of distances between start/end of read and this locus is below threshold.

str_contraction

Site filtered due to suspected PCR error where the alt allele is one repeat unit less than the reference.

too_few_supporting_reads

Site filtered because there are too few supporting reads in the tumor sample.

weak_evidence

Somatic variant score (SQ) does not meet threshold.

systematic_noise

Site filtered based on evidence of systematic noise in normal sample.

excluded_regions

Site overlaps with VC excluded regions bed.

Chromosome

Chromosome

Position

Position of variant

RefCall

Reference base

AltCall

Alternate base

VAF

Variant allele frequency

Depth

Coverage of position

CytoBand

Cytoband of variant

GeneName

Name of gene if applicable. A semicolon delimited list is used for multiple genes.

VariantType

Type of the variant: SNV, insertion, deletion, MNV

CosmicIDs

Cosmic IDs, if multiple concatenated by “;”

MaxCosmicCount

Maximum Cosmic study count

AlleleCountsGnomadExome

Variant allele count in gnomAD exome database

AlleleCountsGnomadGenome

Variant allele count in gnomAD genome database

AlleleCounts1000Genomes

Variant allele count in 1000 genomes database

MaxDatabaseAlleleCounts

Maximum variant allele count over the three databases

GermlineFilterDatabase

TRUE if variant was filtered by the database filter

GermlineFilterProxi

TRUE if variant was filtered by the proxi filter

CodingVariant

TRUE if variant is in the coding region

Nonsynonymous

TRUE if variant has any transcript annotations with nonsynonymous consequences

IncludedinTMBNumberator

TRUE if variant is used in the TMB calculation

n=[(200Y)−2(100−X)]/Xn=[(200Y)-2(100-X)]/Xn=[(200Y)−2(100−X)]/X

NextSeq 500/550/550Dx (RUO) HO flow cell

350

NovaSeq 6000/6000Dx (RUO) SP Flow Cell

500

NovaSeq 6000/6000Dx (RUO) S1 Flow Cell

1100

NovaSeq 6000/6000Dx (RUO) S2 Flow Cell

2500

NovaSeq 6000/6000Dx (RUO) S4 Flow Cell

4300

NovaSeq X 1.5B

2000

NovaSeq X 10B

4300

NovaSeq X 25B

8400

NextSeq 1000/2000

350

- Pair ID - DNA sample ID (if DNA is run) - RNA sample ID (if RNA is run) - Output date - Output time - Module version - Pipeline version (Docker image version #)

- Run name - Run date - DNA sample index ID (if DNA is run) - RNA sample index ID (if RNA is run) - [HRD] Sample feature - Instrument ID - Instrument control software version - Instrument type - RTA version - Reagent cartridge lot number

LowQ

Splice variant score < passing quality score threshold value of 1.

PASS

Splice variant score ≥ passing quality score threshold value of 1.

LowUniqueAlignments

All splice junction supporting reads map to a unique genomic interval near at least one of the two splice sites.

DOUBLE_BROKEN_EXON

Confidence filter

If both breakpoints are distant from annotated exon boundaries, the number of supporting reads do not satisfy a high threshold requirement (≥ 10 supporting reads).

LOW_MAPQ

Confidence filter

All fusion supporting read alignments at either of the breakpoints have MAPQ < 20.

LOW_UNIQUE_ALIGNMENTS

Confidence filter

All fusion supporting read alignments map to a unique genomic interval at either of the breakpoints.

LOW_SCORE

Confidence filter

The fusion candidate has probabilistic score as determined by the features of the candidate.

MIN_SUPPORT

Confidence filter

The fusion candidate has very few fusion supporting reads (< 5 supporting read pairs).

READ_THROUGH

Confidence filter

The breakpoints are cis neighbors (< 200 kbp) on the reference genome.

ANCHOR_SUPPORT

Information only

Read alignments of fusion supporting reads are not long enough (12 bp) at either of the two breakpoints.

HOMOLOGOUS

Information only

The candidate is likely a false candidate generated because the two genes involved have high gene homology.

LOW_ALT_TO_REF

Information only

The number of fusion supporting reads is < 1% of the number of reads supporting the reference transcript at either of the two breakpoints.

LOW_GENE_COVERAGE

Information only

Each breakpoint in an enriched gene has fewer than 125 bp with nonzero read coverage.

NO_COMPLETE_SPLIT_READS

Confidence filter

For every fusion-supporting split read, the total number of aligned bases across two breakpoints is less 60% of the read length.

UNENRICHED_GENE

Confidence filter

Neither of the two parent genes is in the enrichment panel.

Gene A

The gene associated with the A side of the fusion. A semicolon delimited list is used for multiple genes.

Gene B

The gene associated with the B side of the fusion. A semicolon delimited list is used for multiple genes.

Gene A Breakpoint

[Information only] The chromosome and offset of the Gene A side of the fusion.

Gene A Location

Location of the breakpoint within Gene A: - IntactExon—Matches exon boundary - BrokenExon—Inside an exon - Intronic—Within an intron - Intergenic—No gene overlap (currently excluded) If multiple genes are in Gene A, then semicolon separated list of locations. This column is used internally to identify genes to report when a breakpoint occurs in a region overlapping multiple genes. Occasionally, additional values are listed for genes that were excluded from the GeneA list.

Gene A Sense

Boolean indicating whether left/right breakpoint order suggests fusion transcript is in the same sense of Gene A. If multiple genes are in Gene A, then semicolon separated list of bools.

Gene A Strand

Strand of Gene A, + for forward, - for reverse.

Gene B Breakpoint

[Information only] The chromosome and offset of the Gene B side of the fusion.

Gene B Location

Location of the breakpoint within Gene B: - IntactExon—Matches exon boundary - BrokenExon—Inside an exon - Intronic—Within an intron - Intergenic—No gene overlap (currently excluded) If multiple genes in Gene B, then semicolon separated list of locations. This column is used internally to identify genes to report when a breakpoint occurs in a region overlapping multiple genes. Occasionally, additional values are listed for genes that were excluded from the GeneB list.

Gene B Sense

Boolean indicating whether left/right breakpoint order suggests fusion transcript is in the same sense of Gene B. If multiple genes are in Gene B, then semicolon separated list of bools.

Gene B Strand

Strand of Gene B, + for forward, - for reverse.

Score

The quality of fusion as determined by DRAGEN server.

Filter

The filter associated with the fusion as determined by the respective caller. Results from different callers are not equivalent.

Ref A Dedup

Gene A uniquely mapping reads paired across or split by the junction. Does not support fusion. Duplicate reads are not included.

Ref B Dedup

Gene B uniquely mapping reads paired across or split by the junction. Does not support fusion. Duplicate reads are not included.

Alt Split Dedup

Uniquely mapping reads split by the junction. Supports fusion. Duplicate reads are not included.

Alt Pair Dedup

Uniquely mapping reads paired across junction. Supports fusion. Duplicate reads are not included.

KeepFusion

The determination whether the fusion should be kept or dropped from the list of fusions.

Fusion Directionality Known

Whether fusion directionality is known and indicated by gene order.

TOTAL_PF_READS (count)

Total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence.

Primarily driven by data output of sequencer, quality of library and balancing of library in library pool. If TOTAL_PF_READS is in line with other samples, but coverage metrics are more may suggest non-specific enrichment.

Low values for all samples indicate a poor quality run with possible low cluster numbers or low numbers of Q30 and PF%.

A low value for an individual sample indicates poor pooling of this library into the final pool.

MEAN_FAMILY_SIZE (count)

A UMI Family is a group of reads that all have the same UMI barcode. The family size is the number of reads in family. MEAN_FAMILY_SIZE is the mean of the entire population of reads assembled into UMI families.

The mean UMI family size decreases with increased unique read numbers, and more input DNA leads to more unique reads. Conversely over sequencing of a fixed population of unique DNA molecules leads to increased family size.

As a guide, for a good run with optimal cluster density, passing specs, even sample pooling, and good quality DNA we usually observe values <10.

UMI family size = 1 is not ideal as it is harder to correct for errors.

UMI family size of 2 to 5 enables efficient error correction without wasting sequencing capacity on high percentages of duplicate reads.

MEDIAN_TARGET_COVERAGE (count)

Median depth across all the unique loci occurring in all regions of the manifest file.

Lower median target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output.

PCT_CHIMERIC_READS (%)

Chimeric reads occur when one sequencing read aligns to two distinct portions of the genome with little or no overlap. Metric is proportion of total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence.

While this can be indicative of large-scale structural rearrangement of the genome, values that are elevated above the usual baseline may indicate enrichment probe contamination during library preparation. A suggested metric USL is 8% (those that are higher might see decrease performance in small variant and tmb scores).

PCT_EXON_100X (%)

Percentage of exon bases with 100X fragment coverage. Calculated against all regions in manifest containing _exon in name.

Can be used in combination with other PCT_EXON metrics to understand under or over coverage of exons.

PCT_READ_ENRICHMENT (%)

Percentage of reads that have overlapping sequence with the target regions defined in the sample manifest.

Indicative of general enrichment performance. Reduced proportions of enriched reads may indicate issues with the enrichment proportion of the library preparation.

PCT_USABLE_UMI_READS (%)

Percentage of reads that have valid UMI sequences associated with them.

As UMI reads are sequenced at the start of each read, loss of valid UMI sequence may be cause by sequencing issues impacting the quality of base calling in this portion of the sequencing read.

MEAN_TARGET_COVERAGE (count)

Mean depth across all the unique loci defined in the manifest file.

Lower mean target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output. Large differences between the median and mean target coverage values may indicated a skewed distribution of target coverage.

PCT_ALIGNED_READS (%)

Proportion of aligned reads that are non-supplementary, non-secondary and pass QC versus aligned reads that are non-supplementary, non-secondary, mapped and pass QC.

PCT_CONTAMINATION_EST (%)

This metric should only be evaluated if the CONTAMINATION_SCORE metric exceed the USL. This metric estimates the amount of contamination in a sample. The contamination level is computed by taking 2.0* the average of the adjusted allele frequencies of all variants that were selected. The adjusted alllele frequency is either the actual allele frequency of the variant if it is less than 0.5, or 1 -allele frequency if it is greater than or equal to 0.5.

If the sample does not fail the CONTAMINATION_SCORE this metric has no intended meaning as it will be driven by statistical noise (e.g. the few variants that naturally fall outside an expected interval around 0.5 due to random chance)

High contamination estimates may be due to any of the following:

Inter-sample contamination caused by mixing of samples during extraction or library preparation.

Intra-sample contamination, due to mixing of clonally different cell populations during extraction. Large scale genomic rearrangements that cause unexpected VAFs for large numbers of variants.

PCT_TARGET_0.4X_MEAN (%)

Parentage of target (all locations in manifest) reads that have a coverage depth of greater the 0.4x the mean target coverage depth (see definition above).

Provides an indication of uniformity of coverage of the target regions in the manifest file. When trended over time reductions in this metric may indicate an issue with the enrichment process resulting in coverage bias.

PCT_TARGET_50X (%)

Percentage of target bases with 50X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_TARGET_100X (%)

Percentage of target bases with 100X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_TARGET_250X (%)

Percentage of target bases with 250X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

ALLELE DOSAGE_RATIO (with HRD add-on)

Proprietary Myriad Genetics estimate of b-allele dosage based on b-allele noise/signal ratio. B-Allele noise is correlated with coverage; lower coverage samples will have higher noise. B-allele signal is also correlated with tumor fraction; a higher tumor fraction produces a higher signal for b-allele sites. Samples with lower tumor fraction and higher amount of noise (or lower coverage) will have higher Allele Dosage Ratio. The upper limit of the score is 50, therefore any sample with 50 Allele Dosage Ratio can be assumed to have tumor fraction close to zero and typically has a GIS = 0.

MEDIAN TARGET HRD (with HRD add-on)

Median target fragment coverage across all target positions in the genome. Coverage is the total number of non-duplicate pair alignments that overlap.

GIS Score*

Proprietary Genomic Instability Score (GIS) indicating level of genomic instability in sample genome. Combination of Loss of Heterozygosity (LOH), Telomeric allelic imbalance and Large-scale State Transitions (LST) scores. The GIS scores provided by TruSight Oncology 500 HRD show good correlation (R2= 0.98) with Myriad Genetics GIS however they are not identical (Refer to TruSight Oncology 500 HRD Product Data Sheet Doc# M-GL-00748 for more details). GIS from alternative HRD assays should be not be considered equivalent to Illumina/Myriad GIS.

PCT_TARGET_HRD_50X

Percent of HRD probe SNP panel covered by at least 50X coverage

DNA Library QC Metrics for GIS

EXCESSIVE_TF

EXCESSIVE TF indicates if there is excessive tumor content in sample. Troubleshooting: Samples with pure tumor fraction >90% are outside the design for GIS estimation (this includes pure tumor cell lines)

DNA Library QC Metrics for GIS

ALLELE_DOSAGE_RATIO

Proprietary Myriad Genetics estimate of b-allele dosage based on b-allele noise/signal ratio. B-Allele noise is correlated with coverage; lower coverage samples will have higher noise. B-allele signal is also correlated with tumor fraction; a higher tumor fraction produces a higher signal for b-allele sites. Samples with lower tumor fraction and higher amount of noise (or lower coverage) will have higher Allele Dosage Ratio. The upper limit of the score is 50, therefore any sample with 50 Allele Dosage Ratio can be assumed to have tumor fraction close to zero and typically has a GIS = 0.

DNA Expanded Metrics

MEDIAN_TARGET_HRD_COVERAGE

Median target fragment coverage across all target positions in the genome. Coverage is the total number of non-duplicate pair alignments that overlap.

DNA Expanded Metrics

FASTQ Generation

Sequencing data stored in BCL format are demultiplexed through a process that uses the index sequences unique to each sample to assign clusters to the library from which they originated. Each cluster contains two indexes (i7 and i5 sequences, one at each end of the library fragment). The combination of those index sequences are used to demultiplex the pooled libraries.

After demultiplexing, this process generates FASTQ files, which contain the sequencing reads for each individual sample library and the associated quality scores for each base call, excluding reads from any clusters that did not pass filter.

Quality Control

The software calculates several quality control metrics for runs and samples.

Run QC

The Run Metrics section of the metrics output report provides sequencing run quality metrics along with suggested values to determine if they are within an acceptable range. The overall percentage of reads passing filter is compared to a minimum threshold. For Read 1 and Read 2, the average percentage of bases ≥ Q30, which gives a prediction of the probability of an incorrect base call (Q‑score), are also compared to a minimum threshold. The following tables show run metric and quality threshold information for different systems.

The values in the Run Metrics section are listed as NA in the following situations:

  • If the analysis was started from FASTQ files.

  • If the analysis was started from BCL files and the InterOp files are missing or corrupt.

NextSeq 500/550 or NextSeq 550Dx (RUO)

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_PF_READS (%)

Total percentage of reads passing filter.

≥80.0

All

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥80.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥80.0

All

NovaSeq 6000 or NovaSeq 6000Dx (RUO)

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_PF_READS (%)

Total percentage of reads passing filter.

≥55.0

All

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥80.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥80.0

All

NextSeq 1000/2000

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_PF_READS (%)

Total percentage of reads passing filter.

≥85.0

All

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥85.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥85.0

All

NovaSeq X

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

PCT_Q30_R1 (%)

Percentage of Read 1 reads with quality score ≥ 30.

≥85.0

All

PCT_Q30_R2 (%)

Percentage of Read 2 reads with quality score ≥ 30.

≥85.0

All

DNA Sample QC

DRAGEN TruSight Oncology 500 uses QC metrics to assess the validity of analysis for DNA libraries that pass contamination quality control. If the library fails one or more quality metrics, then the corresponding variant type or biomarker is not reported, and the associated QC category in the report header displays FAIL. Additionally, a companion diagnostic result may not be available if it relies on QC passing for one or more of the following QC categories.

DNA library QC results are available in the MetricsOutput.tsv file. Refer to Metrics Output for details.

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

CONTAMINATION_SCORE

The contamination score is based on VAF distribution of SNPs.

≤ 1457

All

MEDIAN_EXON_COVERAGE

Median exon fragment coverage across all exon bases.

≥ 150

Small variant TMB

PCT_EXON_50X

Percent exon bases with 50x fragment coverage.

≥ 90.0

Small variant TMB

MEDIAN_INSERT_SIZE

The median fragment length in the sample.

≥ 70

Small variant TMB

USABLE_MSI_SITES

The number of MSI sites usable for MSI calling.

≥ 40

MSI

MEDIAN_BIN_COUNT_CNV_TARGET

The median raw bin count per CNV target.

≥ 1.0

CNV

RNA Sample QC

The input for RNA Library QC is RNA alignment. Metrics and guideline thresholds can be found in the MetricsOutput.tsv file. Refer to Metrics Output for details.

Metric
Description
Recommended Guideline Quality Threshold
Variant Class

MEDIAN_CV_GENE_500X

The median CV for all genes with median coverage > 500x. Genes with median coverage > 500x are likely to be highly expressed. Higher CV median > 500x indicates an issue with library preparation (poor sample input and/or probes pulldown issue).

Fusion Splice

MEDIAN_INSERT_SIZE

The median fragment length in the sample.

≥ 80

Fusion Splice

TOTAL_ON_TARGET_READS

The total number of reads that map to the target regions.

≥ 9000000

Fusion Splice

GENE_MEDIAN_COVERAGE

The median deduped coverage across all genes in the RNA panel (55 genes).

N/A*

Fusion Splice

  • To avoid failing RNA samples unnecessarily, Illumina does not recommend a universal threshold to determine RNA sample quality. RNA expression varies significantly across tissue types and a small panel size (55 genes), which makes normalization challenging. Tissue-specific thresholds could be considered for normalization.

RNA Expanded Metrics

RNA expanded metrics are provided for information only. They can be informative for troubleshooting but are provided without explicit specification limits and are not directly used for sample quality control. For additional guidance, contact Illumina Technical Support.

Metric
Description
Units

PCT_CHIMERIC_READS

Percentage of reads that are aligned as two segments which map to nonconsecutive regions in the genome.

%

PCT_ON_TARGET_READS

Percentage of reads that cross any part of the target region versus total reads. A read that partially maps to a target region is counted as on target.

%

SCALED_MEDIAN_GENE_COVERAGE

Median of median base coverage of genes scaled by length. An indication of median coverage depth of genes in the panel.

Count

TOTAL_PF_READS

Total number of reads passing filter.

Count

GENE_MEDIAN_COVERAGE

The median coverage depth of all genes in the panel.

Count

GENE_ABOVE_MEDIAN_CUTOFF

Number of genes above the median coverage cutoff.

Count

PER_GENE_MEDIAN_COVERAGE

Median deduped coverage across each gene (available in Logs_Intermediates only)

Count

DNA Analysis Methods

DNA Alignment and Error Correction

DNA alignment and error correction involves aligning sequencing reads derived from DNA libraries to a reference genome and correcting errors in the sequencing reads prior to variant calling.

DRAGEN unique molecular identifier (UMI) error correction comprises three main steps:

  1. DRAGEN UMI uses its HW accelerated mapper (based on a hash table implementation) to align DNA sequences in FASTQ files to the hg19 reference genome. These alignments are not written to a BAM.

  2. The raw alignments are processed to remove errors, including errors introduced during FFPE preservation, PCR amplification, and sequencing. Reads from the same original DNA molecule are tagged with the same UMI during library preparation. The UMI allows DRAGEN to compare related reads, remove outlier signals, and collapse multiple reads into a single high-quality sequence. Read collapsing adds the following BAM tags:

    • RX/XU—UMI.

    • XV—Number of reads in the family.

    • XW—Number of reads in the duplex-family or 0 if not a duplex family.

  3. DRAGEN performs a final alignment step on the UMI-collapsed reads. These final alignments are then written to a BAM file and a corresponding BAM index file is created.

DRAGEN continues to use these final alignments as input for gene amplification (copy number) calling, small variant calling (SNV, indel, MNV, delin), microsatellite instability (MSI) status determination, and DNA library quality control.

Small Variant Calling and Filtering

DRAGEN supports calling SNVs, indels, MNVs, and delins in tumor-only samples by using mapped and aligned DNA reads from a tumor sample as input. Variants are detected via both column wise pileup analysis and local de novo assembly of haplotypes. The de novo haplotypes allow the detection of much larger insertions and deletions than possible through column wise pileup analysis only. DRAGEN insertions and deletions are validated with lengths of at least 0–25 bp and more than 25 bp can be supported. In addition, DRAGEN also uses the de novo assembly to detect SNVs, insertions, and deletions that are co-phased and part of the same haplotypes. Any such co-phased variants that are within a window of 15 bp can then be reassembled into complex variants (MNVs and delins). The tumor-only pipeline produces a VCF file containing both germline and somatic variants that can be further analyzed to identify tumor mutations. Variant calling extends ± 10 bp into introns; details of the regions covered can be found in the assay manifest file. The pipeline makes no ploidy assumptions, enabling detection of low-frequency alleles.

DRAGEN small variant calling includes the following steps:

  1. Detects regions with sufficient read coverage (callable regions).

  2. Detects regions where the reads deviate from the reference and there is a possibility of a germline or somatic call (active regions).

  3. Assembles de novograph haplotypes are assembled from reads (haplotype assembly).

  4. Extracts possible somatic or germline calls (events) from column wise pileup analysis.

  5. Calibrates read base qualities to account for FFPE noise.

  6. Computes read likelihoods for each read/haplotype pair.

  7. Performs mutation calling by summing the genotype probabilities across all reads/haplotype pairs.

  8. Performs additional filtering to improve variant calling accuracy, including using a systematic noise file. The systematic noise file indicates the statistical probability of noise at specific positions in the genome. This noise file is constructed using clean (normal) samples. Regions where noise is common (eg, difficult to map regions) have higher noise values. The small variant caller penalizes those regions to reduce the probability of making false positive calls.

Copy Number Variant Calling

The DRAGEN copy number variant caller performs amplification, reference, and deletion calling for CNV targets within the assay. It counts the coverage of each target interval on the panel, uses a preprocessed panel of normal samples to normalize target counts, corrects for GC coverage bias, and calculates scores of a CNV event from observed coverage and makes copy number calls.

Exon-Level Copy Number Variant Calling

The BRCA large rearrangement step generates segmentation of the BRCA1 and BRCA2 genes for exon-level CNV detection from the BAM file. Using the same method as CNV calling, the large rearrangement component counts coverage of each target interval of the panel, performs normalization, and calculates the fold change values for each probe across the BRCA genes. Normalization includes GC bias correction, sequencing depth, and probe efficiency using a collection of normal FFPE and genomic DNA samples. Initial segmentation is performed for each gene with circular binary segmentation. The merging of segments is then determined by amplitude, noise, and variance at adjacent segments using thresholds established with in silico data. A large rearrangement is reported for genes with more than one segment. Coordinates of the exon-level CNV and the log2 mean fold change for each of the BRCA gene segments are found in the *_DragenExonCNV.json file.

Annotation

The Illumina Annotation Engine performs annotation of small variants, CNVs, and exon-level CNVs. The inputs are gVCF files and the outputs are annotated JSON files.

The Illumina Annotation Engine processes each variant entry and annotates with available information from databases such as dbSNP, gnomAD genome and exome, 1000 genomes, ClinVar, COSMIC, RefSeq, and Ensembl. The header includes version information and general details. Each annotated variant is included as a nested dictionary structure in separate lines following the header.

The following table shows version information for each annotation database:

Database
Version

gnomeAD

2.1

COSMIC

v84

ClinVar

2019-02-04

dbSNP

v151

1000 Genomes Project

Phase 3 v5a

RefSeq

NCBI Homo sapiens Annotation Release 105.20201022

Tumor Mutational Burden

DRAGEN is used to compute tumor mutational burden (TMB) in coding regions where there is sufficient coverage.

The following variants are excluded from the TMB calculation:

  • Non-PASS variants.

  • Mitochondrial variants.

  • MNVs.

  • Variants that do not meet a minimum depth threshold.

  • Variants that do not meet the minimum variant allele threshold.

  • Variants that fall outside the eligible regions.

  • Tumor driver mutations. Variants with a population allele count ≥ 50 are treated as tumor driver mutations. Germline variants are not counted towards TMB. Variants are determined as germline based on a database and a proxy filter.

Variants with a population allele count ≥ 10 that are observed in either the 1000 Genomes or gnomAD databases are marked as germline. MNVs, which do not count towards TMB, may be marked as germline when all their component small variants are marked as germline. The proxy filter scans the variants surrounding a specific variant and identifies those variants with similar variant allele frequencies (VAF). If the majority of surrounding variants of similar VAF are germline, then the variant is also marked as germline.

The formula for TMB calculation is:

TMB=Filtered VariantsEligible Region Size(Mbp)TMB = {Filtered\ Variants \over Eligible\ Region\ Size (Mbp)}TMB=Eligible Region Size(Mbp)Filtered Variants​

NonsynonymousTMB=Filtered Nonsynonymous VariantsEligible Region Size(Mbp)Nonsynonymous TMB = {Filtered\ Nonsynonymous\ Variants \over Eligible\ Region\ Size (Mbp)}NonsynonymousTMB=Eligible Region Size(Mbp)Filtered Nonsynonymous Variants​

Outputs are captured in a _TMB_Trace.tsv file that contains information on variants used in the TMB calculation and a .tmb.json file that contains the TMB score calculation and configuration details.

Microsatellite Instability Status

DRAGEN can determine the MSI status of a sample. It uses a normal reference file, which was created from a set of normal samples. During sequencing, normal reference files are generated by tabulating read counts for each microsatellite site. The normal file contains the read count distribution for each microsatellite.

MSI calling for a tumor-only sample is performed by first tabulating tumor counts from the read alignments for each microsatellite site. Then, the Jensen-Shannon distance (JSD) is calculated between each pair of tumor and normal baseline samples. DRAGEN determines unstable sites by performing Chi-square testing of tumor JSD and normal JSD distributions. Unstable sites are called if the mean distance difference of the two JSD distributions is ≥to the distance threshold and Chi-square p-value is ≤ to the p-value threshold. Lastly, DRAGEN produces an MSI status given assessed site count, unstable site count, the percentage of unstable sites in all assessed sites, and the sum of the Jensen-Shannon distance of all the unstable sites.

Genomic Instability Score

Requires HRD add-on assay

Genomic instability score (GIS) is a whole genome signature for homologous recombination deficiency. The GIS is composed of the sum of three components: loss of heterozygosity, telomeric allele imbalance, and large-scale state transition. These components are estimated using the GIS algorithm contracted from Myriad Genetics, which uses an input of the b-allele frequency and coverage across a genome-wide single nucleotide panel. A panel of normal samples is used for both bias reduction and normalization prior to GIS estimation. Final GIS results can be found in the *.gis.json file.

Contamination Detection

The contamination analysis step detects foreign human DNA contamination using the SNP error file and pileup file that are generated during the small variant calling and the TMB trace file. The software determines whether a sample has foreign DNA using the contamination score. In contaminated samples, the variant allele frequencies in SNPs shift from the expected values of 0%, 50%, or 100%. The algorithm collects all positions that overlap with common SNPs that have variant allele frequencies of < 25% or > 75%. Then, the algorithm computes the likelihood that the positions are an error or a real mutation. The contamination score is the sum of all the log likelihood scores across the predefined SNP positions with minor allele frequency < 25% in the sample and are not likely due to CNV events.

The larger the contamination score, the more likely there is foreign DNA contamination. A sample is considered to be contaminated if the contamination score is above predefined quality threshold. The contamination score was found to be high in samples with highly rearranged genomes or HRD samples. 1% of HRD samples found to be above the threshold with no evidence for actual contamination.

Tumor fraction (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Tumor fraction is calculated as described in the User Guide, section “HRD Metrics Report” and leverages the Myriad Genetics algorithm. Tumor fraction is output in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.gis.json and Combined Variant Output file.

Ploidy (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Ploidy is calculated as described in the User Guide, section “HRD Metrics Report” and leverages the Myriad Genetics algorithm. Ploidy is output in the in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.gis.json and Combined Variant Output file.

Absolute Copy Numbers (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Absolute copy numbers are calculated by leveraging the Myriad Genetics algorithm. The algorithm segments the entire genome using the HRD panel and provides an A and B allele estimate for each segment. After the TSO 500 pipeline determines CNV calls (using the TSO 500 panel), the segment covering the gene is identified, and the A and B allele numbers of the segment overlapping the gene are reported. If the gene is within 300 kbases from the segment boundary, the estimate is unreliable and “-1” is output. Absolute copy numbers are output in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.abcn_annotated.vc f, Logs_Intermediates/Gis/SAMPLE/SAMPLE.abcn_genes.tsv and Combined Variant Output file.

Gene-Level Loss of Heterozygosity (beta)

This is a beta feature. Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file. Requires HRD add-on assay.

Gene-level loss of heterozygosity is calculated based on the minor copy number reported in the abcn_annotated.vc f. If the minor copy number is 0 then the gene is assumed to have a loss of heterozygosity. Gene-level loss of heterozygosity is output in the Logs_Intermediates/Gis/SAMPLE/SAMPLE.abcn_genes.tsv and Combined Variant Output file.

DNA Analysis Methods
BaseSpace Sequence Hub support site page
BaseSpace Sequence Hub support site page
Software Registration page
NovaSeq 6000Dx: TSO 500 Auto-launch Analysis in Cloud
NextSeq 500/550Dx: TSO 500 and Connected Insights Auto-launch Analysis in Cloud
RNA Analysis Methods
Auto-launch
Manual launch
Illumina Connected Analytics support site page
BaseSpace Sequence Hub support site page
Troubleshooting
Storage Requirements
ICA Auto-launch Sample Sheet Requirements

RNA Analysis Methods

Downsampling

Each sample is downsampled to 30 million RNA reads. This number represents the total number of single reads (eg, R1 + R2, from all lanes). When using the recommended sequencing configurations or plexity, the samples can have fewer reads than the downsampling limit. In these cases, the FASTQ files are left as-is.

Read Trimming

Reads are trimmed to 76 base pairs for further processing.

RNA Alignment and Fusion Detection

RNA alignment and fusion detection uses trimmed reads in FASTQ format as input. The outputs include a BAM file that contains duplicate-marked read alignments, an SJ.out.tab file that contains unannotated splice junctions, and a CSV file that contains fusion candidates.

DRAGEN aligns RNA reads in a transcript-aware mode using the human hg19 genome containing unplaced contigs (ie, chrUn_gl regions) and uses GENCODEv19 transcript annotations to identify splice sites. DRAGEN identifies and marks duplicate read alignments using start and end coordinates of alignments, which are adjusted for soft clipped reads.

Fusion and splice variant calling only use deduped fragments to score variants. DRAGEN identifies fusion candidates using chimeric split read alignments (pairs of primary and supplementary alignments) against multiple genes. DRAGEN scores and filters reads based on the various features of each candidate such as the number of supporting reads, mapping quality of supporting reads, and sequence homology between parent genes.

The DRAGEN RNA Fusion caller identifies gene fusions by searching for chimeric reads spanning two distinct parent genes. Based on the chimeric reads, DRAGEN first creates a list of fusion candidates, then scores the candidates to report the list of high confidence fusion calls from the candidate pool.

DRAGEN RNA Fusion caller performs the following steps:

  1. Generates fusion candidate generation based on split read alignment.

  2. Recruits additional evidence from fusion supporting discordant read pairs and soft-clipped reads.

  3. Computes fusion candidate features such as gene coverage, read mapping quality, alternate allele frequency, gene homology, alignment anchor length, and breakpoint distance from exon boundary.

  4. Scores and ranks the fusion candidates using a logistic regression model.

  5. Selects a final list of fusion calls based on score and other filters including number of supporting reads, unique read alignment count, read through transcripts, and fusions matching the enriched regions.

Splice Variant Calling

RNA splice variant calling is performed for RNA sample libraries. Candidate splice variants (junctions) from RNA Alignment are compared against a database of known transcripts and a splice variant baseline of non-tumor junctions generated from a set of normal FFPE samples from different tissue types. Any splice variants that match the database or baseline are filtered out unless they are in a set of junctions with known oncological function. If there is sufficient read support, the candidate splice variant is kept. This process also identifies candidate RNA fusions.

RNA Fusion Merging

Fusions identified during RNA fusion calling are merged with fusions from proximal genes identified during RNA splice variant calling. These are then annotated with gene symbols or names with respect to a static database of transcripts (GENCODE Release 19). The result of this process is a set of fusion calls that are eligible for reporting

RNA Splice Variant Annotation

The Illumina Annotation Engine annotates detected RNA splice variant calls with transcript-level changes (eg, affected exons in the transcript of a gene) with respect to RefSeq. This RefSeq database is the same RefSeq database used by the small variant annotation process.

Performance Testing

The following sections describe performance testing methods.

Analytical Performance Testing

Illumina tests the analytical performance of variant calling using an approach that covers the entire workflow including library preparation, sequencing, and secondary analysis. This approach is used to test a diverse selection of variants. When the variant calling pipeline is expanded to call a new variant class, this approach is always used.

In Silico Testing Methods

Illumina uses in silico testing to the test the ability of the software to call an expanded scope of clinically relevant variants, including rare variants. In silico testing is used as a complementary method to analytical performance testing with wet lab step to expand the scope of testing. For example, while Illumina has analytically verified the performance of the software for calling complex variants in EGFR, the in silico testing approach characterizes the ability of the software to call complex variants in other genes.

For in silico testing, variants of interest are extracted from public databases like Cosmic and ClinVar. Each variant is simulated at different VAF levels by, depending on the variant class, spiking in mutant reads into a normal FFPE background (for sequence variants) or by increasing or decreasing the coverage of exons in the normal FFPE sample (for CNVs, for example, exon-level CNVs. The simulated reads match the expected quality of typical FFPE samples, such as fragment length, error rate, and family size. After the simulation, the software processes samples with spiked-in variants and determines the results. This approach does not include library prep and sequencing of tumor FFPE samples that include the rare variants of interest. The software reports these variants, but analytical verification was not performed.

DRAGEN TruSight Oncology 500 Analysis Software includes the following features that were tested i_n silico_ for both TruSight Oncology 500 and TruSight Oncology 500 HT:

  • Complex variants in genes beyond EGFR

  • Insertions and deletions > 25 bp

  • CNV amplifications

  • CNV deletions

  • Variants in intron-exon junctions (2 bp – 10 bp into introns)

In addition, the following features were tested in silico for TruSight Oncology 500 HT:

  • Exon-level CNVs in BRCA1 and BRCA2

Beta Features

This version of DRAGEN TruSight Oncology 500 Analysis Software includes beta features which have not been verified by Illumina due to limited access to samples or lack of an appropriate orthogonal method to perform testing, and, the use of in silico testing alone is not sufficient for verification purposes.

Customers are responsible for evaluating and demonstrating performance of any beta features they choose to implement. Beta features are indicated as such in the CombinedVariantOutput.tsv file. Illumina will continue to evaluate beta features with intent to fully release upon completion of verification for each feature.

This version includes the following beta features that may be used with the TruSight Oncology 500 HRD Assay:

  • Tumor fraction (beta)

  • Ploidy (beta)

  • Absolute copy numbers (ACN) (beta)

  • Gene-level loss of heterozygosity (LOH) events (beta)

Beta feature results are included in the Combined Variant Output file and other files. However, disclaimers that the results are generated by beta features are only provided in the Combined Variant Output file.

Installation of TSO 500 ctDNA v2.6.0 on Standalone DRAGEN Server

When installing DRAGEN TruSight Oncology 500 and DRAGEN TruSight Oncology 500 ctDNA software on the same DRAGEN server, install the software with the highest corresponding DRAGEN Software version last, as versions below v2.6.0 will overwrite with its corresponding DRAGEN Software version.

If a prior version of DRAGEN TruSight Oncology 500 ctDNA Analysis Software (eg. v2.5.0) is installed after v2.6.0, re-execute the installation script for v2.6.0 to install the compatible version of DRAGEN Software without impacting other installations.

Installation Instructions

As a root user, perform the following steps to install DRAGEN TruSight Oncology 500 ctDNA v2.6.0 Analysis Software:

  1. Download the installation package provided in the email from Illumina. The link expires after 7 days.

It is recommended to use a command line tool like wget or curl to download the file rather than pasting the link into the web browser bar. For example:

curl -o {filename} "{link}"

wget -O {filename} "{link}"

Where the file name is the installation script file name, and the link is provided by Illumina Customer Care.

  1. Make sure no other analysis is being performed. Installing the software while performing other analyses prevent the installer process from proceeding

  2. Copy the install script to the /staging directory to store the script in the directory.

Installation Script: install_DRAGEN_TSO500_ctDNA-2.6.0.run

MD5sum values v2.6.0: sha256:1ce33d63c1a499768af7e238d41f81c29dea3e5fe5cc2b08abf12b8afec5bd89 v2.6.1: sha256:bbd07dc63a9da517414460218dd9c12fde88a7378178c214ed2b8beba0f9483f

  1. Use the following command to update the run script permission: chmod +x /staging/install_DRAGEN_TSO500_CTDNA-2.6.0.run

  2. Use the following command to run the installation script, which runs for approximately 20 minutes:

    1. For Docker, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500_CTDNA-2.6.0.run . The script installs compatible DRAGEN software and removes any previously installed versions.

    2. For Apptainer, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500_CTDNA-2.6.0.run -- --noDockerInstall This will not install Apptainer, but will install the analysis software in the Singularity Image File (SIF) format and modify the software to launch analyses using Apptainer.

  3. During the installation process, you might be instructed to reboot or power cycle the system to complete the installation of the DRAGEN software. A power cycle of the system requires the server be shut down and restarted.

  4. Log out of the server and then log back in.

  5. Install your DRAGEN server licenses if needed:

    1. To run DRAGEN TruSight Oncology 500 ctDNA v2.6.0 Analysis Software, you need TSOCombined license. This license is pre-installed on DRAGEN servers purchased after August 2022. To check if the license is already installed, run /opt/edico/bin/dragen_liccommand.

    2. For servers connected to the Internet, install your software licenses as follows:

      1. First, test and confirm that the server is connected to the Internet. Example: ping www.illumina.com

      2. To install the license, enter: /opt/edico/bin/dragen_lic -i auto

  6. After installing DRAGEN server licenses, generate a list of installed DRAGEN server licenses by running the following command: /opt/edico/bin/dragen_lic

    If license installation is successful, the list should include TSOCombined.

    If the expected licenses are not installed, contact Illumina Customer Care.

Running the System Check

After installation is complete, make sure the system functions properly by running the following command: /usr/local/bin/check_DRAGEN_TSO500_CTDNA-2.6.0.sh

The script checks that:

  • All required services are running

  • Proper Docker image is installed

  • DRAGEN TruSight Oncology 500 ctDNA Analysis Software can successfully process a test data set

The system check script runs for approximately 25 minutes. If the script prints a failure message, contact Illumina Technical Support and provide the /staging/check_DRAGEN_TSO500_CTDNA_<timestamp>.tgz output file.

If using MacOS to connect to a server, an error can occur if the local settings are not in English. To resolve the error, disable the ability to set environment variables automatically in Terminal settings.

Uninstall Software

The DRAGEN TruSight Oncology 500 Analysis Software installation includes an uninstall script called uninstall_DRAGEN_TSO500_CTDNA-2.6.0.sh, which is located in /usr/local/bin.

Executing the uninstall script removes the following assets:

  • All DRAGEN TruSight Oncology 500 ctDNA Analysis Software related scripts located in /usr/local/bin

  • Resources found in /staging/illumina/DRAGEN_TSO500_CTDNA

  • The dragen_tso500_ctdna Docker image

To uninstall the DRAGEN TruSight Oncology 500 ctDNA Analysis Software, run the following command as a root user:

uninstall_DRAGEN_TSO500_CTDNA-2.6.0.sh

You are not required to uninstall Docker or DRAGEN software. To remove Docker, review the install instructions for your operating system in the Docker documentation.

Troubleshooting

General Troubleshooting on Standalone DRAGEN Server

Sample Sheet Validation Failures

In DRAGEN TruSight Oncology 500 Analysis Software, the analysis fails if a sample sheet is invalid. If an invalid sample sheet in suspected, log files can help troubleshoot a failed analysis. Use the following steps to find the log file for the sample sheet:

  1. Navigate to the following location /<analysis_output>/Logs_Intermediates/SamplesheetValidation.

  2. Open the SamplesheetValidation-.log file

  3. Find a line with the following: SampleSheetValidationTask:NA:1 exited with return code 1 which has not been declared as a valid return code.

General troubleshooting for a failed sample sheet:

Valid indexes for assay and instrument combinations:

Troubleshooting BCL issues:

Troubleshooting FASTQs issues:

Other Troubleshooting Tips

Troubleshooting on ICA

In addition to TSO 500 managed sample sheet validations, ICA managed TSO 500 errors include the following:

Installation of TSO 500 ctDNA v2.6.2 on Standalone DRAGEN Server

*DRAGEN TSO 500 Analysis Software v2.6.1 can run on a single server with another multi-version compatible DRAGEN TSO 500 Analysis Software, e.g. DRAGEN TSO 500 v2.5.4, and should be installed after v2.5.4. DRAGEN TSO 500 Analysis Software v2.6.1 can be co-installed with multi-version compatible DRAGEN TSO 500 ctDNA Analysis Software or other DRAGEN pipelines with any order of installation.

**For example, DRAGEN Enrichment, DRAGEN Germline, and others. Order of installation does not matter.

Installation Instructions

Pre- and Post- Installation Steps

  1. Uninstall all existing single-version compatible software on the server (see table above)

  2. Install DRAGEN TSO 500 ctDNA Analysis Software v2.6.2

    • This step will disrupt previously installed single-version compatible software but will not impact multi-version compatible ones (see table above)

  3. Install other multi-version DRAGEN software/pipelines if needed

    • Multi-version software can be installed in any order

Steps to Install DRAGEN TSO 500 ctDNA Analysis Software v2.6.2

As a root user, perform the following steps to install DRAGEN TSO 500 ctDNA Analysis Software v2.6.2:

  1. Download the installation package provided in the email from Illumina. The link expires after 7 days.

It is recommended to use a command line tool like wget or curl to download the file rather than pasting the link into the web browser bar. For example:

curl -o {filename} "{link}"

wget -O {filename} "{link}"

Where the file name is the installation script file name, and the link is provided by Illumina Customer Care.

  1. Make sure no other analysis is being performed. Installing the software while performing other analyses prevents the installer process from proceeding.

  2. Copy the install script to the /staging directory to store the script in the directory.

Installation Script: install_DRAGEN_TSO500_CTDNA-2.6.2.run

MD5sum value: sha256:fb88b215f5f1077e9bff2cc153c739060a0f29c359523f4f9cbe36eb3258a85c

  1. Use the following command to update the run script permission: chmod +x /staging/install_DRAGEN_TSO500_CTDNA-2.6.2.run

  2. Use the following command to run the installation script (run time ~ 20 minutes):

    1. For Docker, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500_CTDNA-2.6.2.run . The script installs compatible DRAGEN software and removes any previously installed versions.

    2. For Apptainer, use the following command: sudo TMPDIR=/staging /staging/install_DRAGEN_TSO500_CTDNA-2.6.2.run -- --noDockerInstall This will not install Apptainer, but will install the analysis software in the Singularity Image File (SIF) format and modify the software to launch analyses using Apptainer.

  3. During the installation process, you may be instructed to reboot or power cycle the system to complete the installation of the DRAGEN software. A power cycle of the system requires the server be shut down and restarted.

  4. Log out of the server and log back in.

  5. Install your DRAGEN server licenses if needed:

    1. To run DRAGEN TSO 500 ctDNA Analysis Software v2.6.2 , you need TSOCombined license. This license is pre-installed on DRAGEN servers purchased after August 2022. To check if the license is already installed, run /opt/dragen/3.10.19/bin/dragen_lic command.

    2. For servers connected to the Internet, install your software licenses as follows:

      1. First, test and confirm that the server is connected to the Internet. Example: ping www.illumina.com

      2. To install the license, enter: /opt/dragen/3.10.19/bin/dragen_lic -i auto

  6. After installing DRAGEN server licenses, generate a list of installed licenses by running the following command: /opt/edico/bin/dragen_lic

    If license installation is successful, the list should include TSOCombined.

    If the expected licenses are not installed, contact Illumina Customer Care.

Running the System Check

After installation is complete, make sure the system functions properly by running the following command: /usr/local/bin/check_DRAGEN_TSO500_CTDNA-2.6.2.sh

The script checks that:

  • All required services are running

  • Proper Docker image is installed

  • DRAGEN TSO 500 ctDNA Analysis Software can successfully process a test data set

The system check script runs for approximately 25 minutes. If the script prints a failure message, contact Illumina Technical Support and provide the /staging/check_DRAGEN_TSO500_CTDNA_<timestamp>.tgz output file.

If using MacOS to connect to a server, an error can occur if the local settings are not in English. To resolve the error, disable the ability to set environment variables automatically in Terminal settings.

Uninstall Software

The DRAGEN TSO 500 ctDNA Analysis Software installation includes an uninstall script called uninstall_DRAGEN_TSO500_CTDNA-2.6.2.sh, which is located in /usr/local/bin.

Executing the uninstall script removes the following assets:

  • All DRAGEN TSO 500 Analysis Software related scripts located in /usr/local/bin

  • Resources found in /staging/illumina/DRAGEN_TSO500_CTDNA-2.6.2

  • The dragen_tso500_ctdna Docker image

To uninstall the DRAGEN TSO 500 ctDNA Analysis Software, run the following command as a root user:

uninstall_DRAGEN_TSO500_CTDNA-2.6.2.sh

You are not required to uninstall Docker or DRAGEN software. To remove Docker, review the install instructions for your operating system in the Docker documentation.

Refer to for more information.

This version of the software includes results generated by features tested in silico and by beta features. Beta features have not been fully evaluated for performance, see .

Contact Illumina Customer Care at to obtain the DRAGEN TruSight Oncology 500 Analysis Software installer package.

For servers not connected to the internet, contact Illumina Customer Care at for license information.

Failure Type
Actions

Search for errors in the sample sheet validation log and compare with the guidelines and warnings in and the following tables.

Failure Type
Action
Assay
Index Set ID
Failure Type
Action
Failure Type
Action
Failure Type
Action
Error
Description

Contact Illumina Customer Care at to obtain the DRAGEN TSO 500 ctDNA Analysis Software installer package.

For servers not connected to the internet, contact Illumina Customer Care at for license information.

RNA Output
Beta Features

Software

- Open the log file ./<AnalysisFolder>/Logs_Intermediates/pipeline_trace.txt. This log file displays each pipeline step run by the Nextflow workflow manager software. If a step fails, it is marked as FAILED. Each step generates log files that are stored in step-specific subfolders in the Logs_Intermediates folder. Review the log files in the relevant Logs_Intermediates folder for the step to identify potential sources of error. - Open the errors folder ./<AnalysisFolder>/errors. The workflow creates an error file, error_<NameOfFailedStep>.json, for each step that failed during analysis. For steps that fail per sample, there is a separately labeled file for each sample that failed each step error_<NameOfFailedStep>_<SampleIDIfRelevant>.json. These files contain the command and stdout and stderr from the step.

Samples

Open the combined metrics output results file ./<AnalysisFolder>/Results/<PairId>/MetricsOutput.tsv. If a sample fails an analysis step, the Pair ID that contains the sample shows the failure under FAILED_STEPS in the Analysis Status section, and COMPLETED_ALL_STEPS shows as False. If available, review the individual log files for the failed steps under ./<AnalysisFolder>/Logs_Intermediates to identify potential sources of error.

Multinode Gather

If the following error appears, check if the sample or pair ID was included multiple times during separate node analysis runs, before being gathered together. If the error exists, rerun one of the analyses without the duplicate and reattempt gathering. ERROR:Gather:Destination file ... already exists - check if the same sample ID is in multiple input folders

Sample Sheet not found

Verify that SampleSheet.csv is present at the top level of the run folder with the name "SampleSheet.csv". If the sample sheet is in a different location, supply the sample sheet using the --sampleSheet option

Indexes are not valid for the sequencer and/or assay

See Valid indexes for assay and instrument combinations for correct indexes for the sequencer and assay.

Pair_ID is not unique

Pair_ID column is required in the TSO500S_Data section of the sample sheet, which pairs at most one RNA and one DNA sample together for analysis. If the sample does not have a pair, use a unique pair ID for single samples.

Sample Sheet is not in v2 format

Verify that the format of the sample sheet is v2. v1 sample sheet is not compatible with DRAGEN TruSight Oncology 500 Analysis Software.

Analysis does not run

Verify the analysis starts from the run folder, and BCLs or FASTQs are in the correct locations as outlined in Starting From BCL Files and Starting From FASTQ Files respectively.

TSO 500

  • UP1-UP16

  • CP1-CP16 (DNA Only)

TSO 500 HT

  • UDP0001–UDP0192

The output file directory contains information from previous analyses

If this issue is seen: specify a new target output folder and repeat analysis To prevent this issue: specify an empty directory before starting analysis

Single exon (single probe) genes are still reported in the CNV VCF file, but not the CNV TSV file

No action needed; software is working as expected.

Currently all single probe genes are not emitted to the Copy Number Variants section of our CombinedVariantOutput.tsv. However, you can still find these events in the cnv.vcf.gz.

Due to the single probe nature, accurate CNV calling has not been validated and as such they are emitted as REF

customercare@illumina.com
customercare@illumina.com
Sample Sheet Requirements
customercare@illumina.com
customercare@illumina.com

Installation of NovaSeq 6000Dx TSO 500 ctDNA Analysis Application

Instructions to install DRAGEN TSO 500 ctDNA Analysis Application on NovaSeq 6000Dx (RUO mode)

Prerequisites

The following requirements must be met to install and run DRAGEN TruSight Oncology 500 ctDNA Analysis Application on NovaSeq 6000Dx:

  • A NovaSeq 6000Dx sequencing instrument with Illumina Run Manager v1.6.2 or later installed and paired with DRAGEN Server v4.

  • Administrator privileges on Illumina Run Manager.

Installation Instructions

Perform the following steps to download and install the DRAGEN TruSight Oncology 500 ctDNA Analysis Application on NovaSeq 6000Dx installation package:

  1. Contact Illumina Customer Care for the link to the installation package. The link expires after 7 days.

  2. Download the installation package with the link provided in the email from Illumina Customer Care. The installation package contains the following files:

    1. DRAGEN IRES file: drageninstaller_<DRAGEN_VERSION>.el8.x86_64_prod.ires

    2. Illumina Run Manager: TSO500L_v<APP_VERSION>.iapp

  3. Install the DRAGEN version using the IRES file.

    1. Log into Illumina Run Manager as an administrator.

    2. From the Settings menu, select DRAGEN.

    3. Select Add DRAGEN Installer.

    4. Select the DRAGEN IRES file to begin installation.

    5. Check the install versions list to verify that the appropriate DRAGEN version is listed

  4. Install the Illumina Run Manager-compatible application.

    1. Log into Illumina Run Manager as an administrator.

    2. From the left-hand menu, select Applications.

    3. Select the RUO tab, select Install Application, and then select the IAPP file to install the application.

    4. Check the RUO Applications list to verify that the appropriate application version is listed. You can configure application settings such as required users from this list.

External Storage Configuration

Perform the following steps to specify a location to store analysis results:

  1. Log into Illumina Run Manager as an administrator.

  2. From the Settings menu, select External Storage for Analysis Results.

  3. Configure the following parameters:

    1. Server Location: The path of the preferred output file location. A change to the server location may prompt you to enter the domain, user name, and password to the new server location.

      1. In addition to storing the analysis results from any sequencing runs, this location contains the input data created when setting up runs. Make sure that the server location has sufficient storage space for your run.

    2. Encryption—Select Require encryption during file transfer.

  4. Select Save to preserve the changes to the external storage output parameters.

Lane Column without Values

Ensure that the column is completed. If lane is not applicable to the run, delete the column.

Format of v2 sample sheet is incorrect

HRD analysis is missing

Sample_ID and/or Sample_Type is not present

Verify that the sample sheet has columns and values for Sample_ID and Sample_Type.

Unique sample IDs

Verify that the Sample_IDs are unique in the sample sheet.

Format of v2 Sample Sheet is incorrect

Verify that the following sections and fields are present in the sample sheet and follow the individual rules in Sample Sheet Requirements. [TS0500S_Data] - Sample_ID - Index_ID - Sample_Type - Pair_ID

Incorrect folder structure

Invalid FASTQ input files

If the FASTQs are invalid, start TSO 500 analysis from BCL files.

HRD analysis missing

Make sure that HRD is in the correct column in the sample sheet.

Failure type: ValueError: Could not find pipeline ID for app BCLConvert in sample sheet SampleSheet.csv

Introduction to DRAGEN TSO 500 ctDNA Analysis Software v2.6

The Illumina DRAGEN TruSight Oncology 500 (TSO 500) ctDNA Analysis Software supports analysis for DNA libraries that are isolated from plasma and prepared using TruSight Oncology 500 ctDNA v2 and v1 assays. The software, which can be run on the DRAGEN Server, on Illumina Connected Analytics, and as an analysis application on NovaSeq 6000Dx, produces a variant call file (VCF) for small variants. Other outputs include tumor mutational burden (TMB), a Jensen-Shannon divergence (sum JSD) score that can be used for evaluating microsatellite instability (MSI) status, files with copy number variants (CNV), fusions as well as coverage reports.

The secondary analysis software starts from a sequencing run folder containing base call files (BCL) or from FASTQ files staged in a FASTQ folder.

This document provides information on installation, configuration, running, troubleshooting as well as analysis algorithms of DRAGEN TruSight Oncology 500 ctDNA analysis software on Illumina Connected Analytics, standalone DRAGEN server, and the NovaSeq 6000Dx analysis application.

Scope

This resource provides information on installation, configuration, running, troubleshooting and analysis algorithms for the following software:

Software
Versions

DRAGEN TruSight Oncology 500 ctDNA Analysis Software on Illumina Connected Analytics (ICA)

  • v2.6.0

  • v2.6.1

DRAGEN TruSight Oncology 500 ctDNA Analysis Software (for standalone DRAGEN server)

  • v2.6.0

  • v2.6.1

  • v2.6.2

DRAGEN TruSight Oncology 500 ctDNA Analysis Application on NovaSeq 6000Dx (uses a paired DRAGEN server)

  • v2.6.0

The content is applicable to all 3 software versions unless otherwise specified. The content related to setting up and running the analysis on ICA is only relevant to software versions v2.6.0 and v2.6.1. The content related to the analysis application on NovaSeq 6000Dx is only relevant to v2.6.1.

Local and Cloud Deployments

Local analysis is available using a standalone DRAGEN server or an application with a user interface on NovaSeq 6000Dx. The software on the standalone DRAGEN server allows for analysis on a single DRAGEN server or splitting across multiple servers.

Cloud analysis is available on Illumina Connected Analytics with auto-launch or manual launch. Both methods are available from BCLs and FASTQs.

Instrument Compatibility

DRAGEN TruSight Oncology 500 ctDNA analysis software is compatible with data generated on the Illumina instruments as summarized in the table below.

Instrument
Illumina Connected Analytics
Standalone DRAGEN Server
Paired DRAGEN server
On-board DRAGEN

NovaSeq 6000

Yes (v2.6.0 and v2.6.1)

Yes

N/A

N/A

NovaSeq 6000Dx (RUO mode)

Yes (v2.6.0 and v2.6.1)

Yes

Yes (v2.6.1)

N/A

NovaSeq X

Yes (v2.6.0 and v2.6.1)

Yes

N/A

No

Navigation of Guide

This resource provides information on installation, configuration, running, troubleshooting as well as analysis algorithms of DRAGEN TruSight Oncology 500 ctDNA analysis software on Illumina Connected Analytics, standalone DRAGEN server, and the NovaSeq 6000Dx analysis application.

Workflow Diagram

Getting Started on Illumina Connected Analytics

Prerequisites

Illumina Connected Analytics (ICA) subscription includes access to DRAGEN TruSight Oncology 500 ctDNA Analysis Software. To get started, you need:

  • An ICA account with a valid subscription

  • A positive balance of iCredits for data storage

Sample Sheet Requirements

DRAGEN TSO 500 ctDNA Analysis Software has optional and required fields that are required in addition to general sample sheet requirements. Follow the steps below to create a valid samplesheet.

TSO 500L Data Section header changes depending on the deployment:

  • Standalone DRAGEN Server and ICA with Manual Launch: TSO500L_Data

  • ICA with Auto-launch: Cloud_TSO500L_Data

[TSO500L_Data] Section

Parameter
Required
Details

Sample_ID

Required

The unique ID to identify a sample. The sample ID is included in the output file names. Sample IDs are not case sensitive. Sample IDs must have the following characteristics: - Unique for the run. - 1–40 characters. - No spaces. - Alphanumeric characters with underscores and dashes. If you use an underscore or dash, enter an alphanumeric character before and after the underscore or dash. eg, Sample1-T5B1_022515. - Cannot be called all, default, none, unknown, undetermined, stats, or reports. - Must match a Sample_ID listed in the BCL Convert Data section. - Each sample must have a unique combination of Lane (if applicable), sample ID, and index ID or the analysis will fail.

Sample_Type

Required

Enter DNA

Sample_Description

Not Required

Sample description must meet the following requirements: - 1–50 characters. - Alphanumeric characters with underscores, dashes and spaces. If you enter a underscore, dash, or space, enter an alphanumeric character before and after. eg, Liquid-Sample_213.

To ensure a successful analysis, follow these guidelines:

  1. Avoid any blank lines at the end of the sample sheet; these can cause the analysis to fail.

  2. When running local analysis using the command line save the sample sheet in the sequencing run folder with the default name SampleSheet.csv, or choose a different name and specify the path in the command-line options.

ICA with Auto-launch: Sample Sheet Requirements

To auto-launch analysis from the sequencer run folder, ensure the StartsFromFastq and SampleSheetRequested fields are set to FALSE. To auto-launch analysis from FASTQs after BCL Convert auto-launch, StartsFromFastq and SampleSheet Requested fields must be set to TRUE

[Cloud_TSO500L_Data] Section

[Cloud_TSO500L_Settings] Section

Parameters
Required
Details

SoftwareVersion

Not Required

The TSO500S software version

StartsFromFastq

Required

Set the value to TRUE or FALSE. To auto-launch from BCL files, set to FALSE. To auto-launch from FASTQ files after auto-launch of BCL Convert, set to TRUE.

SampleSheetRequested

Required

Set the value to TRUE or FALSE.

To auto-launch from BCL files, set to FALSE. To auto-launch from FASTQ files after auto-launch of BCL Convert, set to TRUE.

[Cloud_Data] Section

Parameter
Required
Details

Sample_ID

Not Required

The same sample ID used in the Cloud_TSO500L_Data section.

ProjectName

Not Required

The BaseSpace project name.

LibraryName

Not Required

Combination of sample ID and index values in the following format: sampleID_Index_Index2

LibraryPrepKitName

Required

The Library Prep Kit used.

IndexAdapterKitName

Not Required

The Index Adapter Kit used.

[Cloud_Settings] Section

Parameter
Required
Details

GeneratedVersion

Not Required

The cloud GSS version used to create the sample sheet. Optional if manually updating a sample sheet.

CloudWorkflow

Not Required

Ica_workflow_1

Cloud_TSO500L_Pipeline

Required

This value is a universal record number (URN) . The valid value is: urn:ilmn:ica:pipeline:850b4fe6-2a2f-4a85-ae57-cf082dfbefd4#DRAGEN_TruSight_Oncology_500_ctDNA_v2_6_1_4

BCLConvert_Pipeline

Required

The value is a URN in the following format: urn:ilmn:ica:pipeline: <pipeline-ID>#<pipeline-name>

NovaSeq 6000Dx Analysis Application: Sample Sheet Requirements

This section describes fields specific for sample sheets for NovaSeq 6000Dx Analysis Application. For more information on DRAGEN TSO 500 ctDNA Analysis Software sample sheet requirements, refer to the sections above.

Mismatches between the samples and index primers can cause incorrect results due to loss of positive sample identification. Enter sample IDs and assign indexes in the sample sheet before beginning library preparation. Record sample IDs, indexes, and plate well orientation for reference during library preparation.

[BCLConvert_Settings] Section

Parameter Name
Required

SoftwareVersion

Required

Enter the IRM iapp software version 2.6.1-4v2

Sample Sheet Creation in BaseSpace Run Planning tool

How to Create TSO 500 ctDNA Sample Sheets in BaseSpace Run Planning tool

The sections below represent each step in the BaseSpace Run Planning tool.

Note that NovaSeq X Series has a different run set up configuration screen than other instrument platforms. TSO 500 ctDNA does not support multi analysis, and in order to run TSO 500 ctDNA on NovaSeq X Series, enter the appropriate Read 1, Read 2, Index 1 and Index 2 described in the instructions below.

Step 1: Run Settings

Parameter Name
Required
Description

Run Name

Required

Run Name can contain 255 alphanumeric characters, dashes, underscores, periods, and spaces; and must start with an alphanumeric, a dash or an underscore.

Run Description

Optional

Run Description can contain 255 characters except square brackets, asterisks, and commas.

Instrument Platform

Required

Choose from TSO 500 ctDNA supported instruments:

  • NovaSeq 6000/6000Dx

  • NovaSeq X Series

Secondary Analysis

Required

  • BaseSpace/Illumina Connected Analytics (to generate sample sheet for cloud analysis)

  • Local

Read 1

Required on Instrument Platform NovaSeq X Series

  • Fill with value 101 for TSO 500 ctDNA analysis

Index 1

Required on Instrument Platform NovaSeq X Series

  • Fill with value 10 for TSO 500 ctDNA analysis

Index 2

Required on Instrument Platform NovaSeq X Series

  • Fill with value 10 for TSO 500 ctDNA analysis

Read 2

Required on Instrument Platform NovaSeq X Series

  • Fill with value 101 for TSO 500 ctDNA analysis

Sample Container ID

Optional

  • Unique Identifier for the container that holds the sample

Step 2: Configuration

Note: On NovaSeq X Series, this page is called "Configuration 1". The right hand corner of the UI displays the Read 1, Read 2, Index 1 and Index 2 entered on the previous run settings screen.

Parameter Name
Required
Description

Application

Required

  • DRAGEN TruSight Oncology 500 ctDNA Analysis Software - 2.6.1

Description

Optional

Optional text field

Library Prep Kit

Required

  • TruSight Oncology 500 ctDNA (only for NovaSeq 6000/6000Dx instruments)

  • TruSight Oncology 500 ctDNA v2

Index Adapter Kit

Required

TSO 500 ctDNA:

  • TruSight Oncology 500 ctDNA (NovaSeq6000Dx)

  • TruSight Oncology 500 ctDNA (NovaSeq6000)

TSO 500 ctDNA v2:

  • TruSight Oncology 500 ctDNA Index Set A and B (UDP 1-192) (NovaSeq6000Dx, NovaSeqX Series)

  • TruSight Oncology 500 ctDNA Index Set A and B (UDP 1-192) (NovaSeq6000)

Step 3: Sample Settings

Users can manually enter sample information, or download a template file to bulk upload sample information. Users can import the completed template or a compatible sample sheet.

Parameter Name
Required
Description

Read Lengths: Read 1 and Read 2

Required Not applicable on NovaSeq X Series

Auto filled with the standard values, but can be optionally overwritten.

Override Cycles

Required on NovaSeq X Series

Entered based on Run Settings read lengths & index 1 / index 2

Lane Usage

Not applicable on NovaSeq X Series

Checkbox allows users to apply the same lane across samples.

Lane

Required if Lane Usage is unchecked

Specify lanes for each sample. The unmarked checkbox at the top of the dropdown selects all lanes.

Index ID

Required

Index set ID options are based on selected Index Adapter Kit

Project

Optional

Optional field to describe the associated project

Starts from Fastq

Required

True or False

If auto-launching TSO 500 ctDNA from BCL files, set the value to False. If auto-launching TSO 500 ctDNA from FASTQ after auto-launching BCL Convert, set the value to True.

DNA Barcode Mismatches Index 1

DNA Barcode Mismatches Index 2

Required on NovaSeq X

Default value is set to 1.

These fields are required by NovaSeq X and represent BCL Convert settings for index diversity checks when demultiplexing. These values are not used in TSO 500 ctDNA analysis.

Step 4: Run Review

Once all details are captured and pass validation, the user can review the details on the Run Review screen. From here they can choose to edit details in previous screens or export the sample sheet. Once completed, press the Cancel button to finish run planning.

Note: once leaving this screen, the run and sample sheet will not be accessible.

For NovaSeqX Plus users, the run can be saved as a draft or as a planned run (via “Save as Draft” and “Save as Planned” buttons respectively). Either selection will save the run to the Planned Runs screen on BaseSpace. There is no option to export the sample sheet on this screen.

Planned Runs Screen (NovaSeq X Series only)

The Planned Runs screen lists all planned or drafted runs. Users can set drafted runs to planned, export the sample sheet, and edit or delete a run on this screen.

Once the run is saved as Planned, it will appear on the NovaSeq X Series instrument where it can be selected for sequencing.

Guided Examples

Please review these guided examples of analysis workflows that include a step of setting up a run in BaseSpace Run Planning tool:

NovaSeq 6000Dx Run Set Up

The following instructions describe steps to set up a run on NovaSeq 6000Dx Analysis Application.

Use the following steps to configure a TruSight™ Oncology 500 ctDNA run in Illumina Run Manager.

  1. Go to the "Runs" section of Illumina Run Manager by selecting "Runs" on the left-hand side.

  2. Enter sample data manually or by importing a sample sheet

  3. To enter sample data run manually, select “Create Run”.

  4. Choose "DRAGEN TruSight™ Oncology 500 ctDNA Analysis Application" from the "Create Run" screen to set-up and analyze runs for TruSight Oncology 500 ctDNA assay.

Run Settings

  1. On the "Run Settings" screen, enter a run name with the following criteria:

    1. 1 - 40 characters.

    2. Alphanumeric characters, underscores, or dashes only.

    3. Unique across all runs on the instrument.

    The run name identifies the run from sequencing through analysis.

  2. [Optional] Enter a run description. The run description must have the following criteria:

    1. 1 - 50 characters.

    2. Alphanumeric characters or spaces only.

    3. Spaces must be preceded and followed by an alphanumeric character.

  3. Select kit used during library preparation:

    1. TruSight Oncology 500 ctDNA

    2. TruSight Oncology 500 ctDNA v2

  4. Index adapter kit will be automatically selected based on the library prep kit selection

  5. [Optional] Enter a library tube ID.

Depending on the library prep kit selected, additional fields will be populated for run settings and are not editable. Read and index lengths will differ between library prep kit type.

Sample Data

Use the table on the "Sample Data" screen to enter sample information manually.

  1. Select lane information. Options include one to four, or all lanes.

  2. Enter a unique sample ID in the sample ID field with the following criteria:

    1. Controls should be added first.

    2. 1 - 40 characters.

    3. Alphanumeric characters, underscores, or dashes only.

    4. Underscores and dashes must be preceded and followed by an alphanumeric character.

  3. Select an index set ID for the DNA library prepared from the sample.

  4. [Optional] Enter a library name.

Depending on the options selected for index set ID, additional fields will be auto-populated for sample data and are not editable.

Sample Settings

Use the table on the "Sample Settings" screen to enter additional sample information.

  1. [Optional] Enter a sample name with the following criteria:

    1. 1 - 50 characters.

    2. Alphanumeric characters, dashes, underscores, or spaces.

    3. Spaces, underscores, and dashes must be preceded and followed by an alphanumeric character.

  2. [Optional] Enter a sample description with the following criteria:

    1. 1 - 50 characters.

    2. Alphanumeric characters, dashes, underscores, or spaces.

    3. Spaces, underscores, and dashes must be preceded and followed by an alphanumeric character.

Additional fields will be auto-populated based on selections made in the Sample Data screen, which are not editable.

Before starting your run, review that the information entered is correct in the “Run Review” page before saving.

Analysis Launch on Standalone DRAGEN Server

Start the DRAGEN TruSight Oncology 500 ctDNA Analysis Software with the DRAGEN_TSO500_CTDNA-2.6.0.sh Bash script. The script is installed in the /usr/local/bin directory. The Bash script is executed on the command line and runs the software with Docker (or Apptainer if specified).

  • Path to the sequencing run or FASTQ folder. Copy the run or FASTQ folder to the DRAGEN server into the staging folder with the following recommended organization: /staging/runs/{RunID}. You can copy the run folder onto the DRAGEN server using Linux commands such as rsync. The sample sheet within the run folder is used unless otherwise specified through the command line.

Before running the analysis, confirm that the output directory for the software to write to is empty and does not include results of previous analyses.

Storage Requirements

For optimal performance, run analysis on data stored locally on the DRAGEN server. Analysis of data stored on NAS can take longer and performance can be less reliable.

The DRAGEN server provides an NVMe SSD in the /staging directory to use as the software output directory. Network-attached storage is required for long-term storage.

When running the DRAGEN TruSight Oncology 500 ctDNA Analysis Software, use the default settings or set the -analysisFolder command line option to a directory in /staging to make sure the DRAGEN server processes read and write data on the NVMe SSD.

Before beginning analysis, develop a strategy to copy data from the DRAGEN server to a network‑attached storage. Delete output data on the DRAGEN server as soon as possible.

The following are the run and analysis output sizes for each sequencing system per 101 bp:

When launching the analysis, the software checks that the minimum disk space required is available. If the minimum disk space is not available, the software shows an error message and prevents analysis from starting. If disk space is exhausted during a run, the run shows an error and stops analyzing.

Moving or modifying files during an analysis may cause the analysis to fail or provide incorrect results.

Verify that the following sections and fields are present in the sample sheet and follow the individual rules in [BCLConvert_Settings] - SoftwareVersion - AdapterRead1 - AdapterRead2 - AdapterBehavior - MinimumTrimmedReadLength - MaskShortReads [BCLConvert_Data] - Sample_ID - index - index2 [TS0500S_Data] - Sample_ID - Index_ID - Sample_Type - Pair_ID - Sample Feature (Optional)

Verify that HRD is in the Sample Feature column in the sample sheet. Refer to for more information.

- Sample Feature (Optional) Verify when FASTQs were generated using the HRD add-kit (Not available in Japan), Sample Feature is added to those DNA Samples. Refer to for more information.

Verify that the FASTQ files are in the correct structure. Refer to for more information.

Action: Ensure StartsFromFastq field is in the [TSO500S_Settings] section, and it is not present in the [BCLConvert_Settings] Section. Refer to for more information.

The software is optimized for analyzing sequencing outputs generated by the TruSight Oncology 500 ctDNA v2 and v1 assays. Modification might lead to inaccurate data and is a violation of the .

Refer to the for information on how to register ICA subscription and iCredits.

Refer to the following requirements to create sample sheets for running the analysis on ICA with Auto-launch. For sample sheet requirements common between deployments see . Samples sheets can be created using BaseSpace Run Planning Tool or manually by downloading and editing a sample sheet template

Refer to for this section's requirements.

The BaseSpace Sequence Hub Run Planning tool is available, and is used to generate a valid sample sheet in v2 format for use on a TSO 500 ctDNA supported sequencer for both ICA and Standalone DRAGEN Server analysis options. Filling out the form on the user interface will produce a exportable sample sheet with the required fields filled in. Refer to for descriptions of fields that appear in ICA sample sheets.

BaseSpace Run Planning tool cannot generate a valid sample sheet for the NovaSeq 6000Dx TSO 500 ctDNA Analysis Application on Illumina Run Manager. Refer to to create a valid sample sheet.

For more information on run planning, refer to the .

Alternately, select Import Samples to upload sample information. Refer to for sample sheet requirements.

For arguments, refer to . You can start from BCL files or from the FASTQ folder produced by BCL Convert. The following requirements apply for both methods:

Run folder must be intact. Refer to for input requirements.

If the analysis output folder path is different from the default, provide the analysis output folder path. Refer to .

Sequencing System
Run Folder Output (Gb)
Analysis Output (Gb)
Minimum Disk Space (Gb)
Sample Sheet Requirements
Sample Sheet Requirements
Sample Sheet Requirements
Sample Sheet Requirements
Illumina Software Subscription Agreement
Software Registration page
Sample Sheet Requirements page
BaseSpace Sequence Hub support site page
NovaSeq 6000Dx: TSO 500 Auto-launch Analysis in Cloud
Starting From FASTQ Files
Standard Sample Sheet Requirements
[TSO500L_Data] Section
ICA Auto-launch Sample Sheet Requirements

NovaSeq 6000/6000Dx (RUO) SP Flow Cell

85-100

250-374

300

NovaSeq 6000/6000Dx (RUO) S1 Flow Cell

164-200

360-665

800

NovaSeq 6000/6000Dx (RUO) S2 Flow Cell

290-460

890-1600

1500

NovaSeq 6000/6000Dx (RUO) S4 Flow Cell

800-1200

2700-4100

3000

NovaSeq X 1.5B

213

352

800

NovaSeq X 10B

1100

1800

3000

Combined Variant Output

File name: {SampleID}_CombinedVariantOutput.tsv

The combined variant file contains the variants and biomarkers in a single file. The output contains the following variant types and biomarkers:

  • Small variants (including EGFR complex variants)

  • Copy number variants

  • Tumor Mutational Burden (TMB)

  • MSI

  • DNA Fusions

The combined variant output file also contains Analysis Details and Sequencing Run Details sections. The details of each are listed in the following table:

Analysis Details
Sequencing Run Details

- Sample ID - Output date - Output time - Pipeline version (Docker image version number)

- Run name - Run date - Sample index ID - Instrument ID - Instrument control software version - Instrument type - RTA version - SBS reagent cartridge lot number - Cluster reagent cartridge lot number

Variant Filtering Rules

Combined variant output produces small variants with blank fields in the following situations:

  • The variant has been matched to a canonical RefSeq transcript on an overlapping gene not targeted by TruSight Oncology 500 ctDNA.

  • The variant is located in a region designated iSNP, indel, or Flanking in the TST500_Manifest.bed file located in the Resources folder.

  • Small Variants - All variants with the FILTER field marked as PASS and which have a canonical RefSeq transcript are present in the combined variant output.

    • Gene and transcript information is only present for variants belonging to canonical transcripts that are within the Gene list–Small Variants.

  • Copy Number Variants - Copy number variants must meet the following conditions:

    • FILTER field marked as PASS.

    • ALT field is <DUP or <DEL> .

    • Gene is part of the copy number variant gene list

  • Fusion Variants - Fusion variants must meet the following conditions:

    • Passing variant call (KeepFusion field is true).

    • Contains at least one gene on the fusion allow list.

    • Genes separated by a dash (-) indicate that the fusion directionality could be determined. Genes separated by a slash (/) indicate that the fusion directionality could not be determined.

Analysis Output

When the analysis run completes, the DRAGEN TruSight Oncology 500 ctDNA Analysis Software generates an analysis output folder in a specified location.

To view analysis output, navigate to the analysis output folder and select the files that you want to view.

Analysis Output Folder Structure

Single output folder structure is as follows.

  • Logs_Intermediates

    • AdditionalSarjMetrics

    • Annotation—Contains outputs for small variant annotation.

      • Subfolders per sample ID—Contains the aligned small variants JSON.

    • CombinedVariantOutput

      • Subfolders per sample ID—Contains the combined variant output TSV files.

      • A combined output log file.

    • Contamination

      • Subfolders per sample ID—Contains the contamination metrics JSON file and output logs.

    • CoverageReports

    • DnaFusionFiltering

    • DragenCaller

      • Subfolders per sample ID—Contains the aligned BAM and index files, small variant VCF and gVCF, copy number variant VCF, MSI JSON, exon coverage report bed, and QC outputs in CSV format.

    • FastqValidation—Contains the FASTQ validation output log for the samples.

    • FastqGeneration

    • MetricsOutput

      • Subfolders per sample ID—Contains the metrics output TSV files.

      • A combined output log file.

    • ResourceVerification—Contains the resource file checksum verification logs.

    • Run QC—Contains the Run QC metrics JSON, Intermediate Run QC metrics JSON, and log file.

    • SampleAnalysisResults

      • Subfolders per sample ID—Contains the Sample Analysis Results JSON and detailed log file.

    • SampleSheetValidation—Contains the Intermediate sample sheet and validation log.

    • Passing Sample Steps - JSON file that contains the steps passed for each Sample ID

    • Tmb

      • Subfolders per sample ID—Contains the TMB metrics CSV, TMB trace TSV, and related files and logs. pipeline_trace.txt—Contains a summary and troubleshooting file that lists each Nextflow task executed and the status (for example, COMPLETED or FAILED). run.log—Contains a complete trace-level log file describing the Nextflow pipeline execution. run_report.html—Contains high-level run statistics (performance, usage, etc.) run_timeline.html —Contains timeline-related information about the analysis run.

  • Results

    • Metrics Output TSV (all Sample IDs)

    • Sample ID—The following outputs are produced for each sample:

      • Combined Variant Output TSV

      • Metrics Output TSV

      • TMB Trace TSV

      • Small Variant Genome VCF

      • Small Variant VCF

      • Small Variant Annotated JSON

      • Copy Number Variant VCF

      • MSI JSON

      • Fusions CSV

      • Exon Coverage Report TSV

      • Gene Coverage Report TSV

ICA Output Folder Structure

This section describes each output folder generated during analysis and where to find metric and analytic files when the pipeline is executed. The same output folder structure and content exist in ICA and BaseSpace Sequence Hub.

High-Level Folder Structure

  • Run ID

    • TSO500_Nextflow_logs

    • _manifest.json

    • Results

      • _tags.json

    • Logs_intermediates

    • Errors—This folder is only present when analysis fails

TSO500_Nextflow_logs Folder Structure

The TSO_500_Nextflow_Logs provides information related to the execution of the pipeline on ICA as a whole and for specific nodes (when an analysis is split across multiple nodes). It contains files used to execute parts of the workflow on different nodes as well as records of the nextflow execution on those nodes.

  • TSO_500_Nextflow_Logs

    • _manifest.json

Results Folder Structure

Contains the aggregated MetricsOutput.tsv file at the root level. Additionally, the Results folder contains a subfolder for each sample ID.

  • Results

    • MetricsOutput.tsv

    • Sample_1

    • Sample_2

    • Sample_<#>

The Results subfolder contains the following files:

  • Results

    • MetricsOutput.tsv

    • <Sample_id>

      • CombinedVariantOutput.tsv

      • Fusions.csv

      • tmb.trace.tsv

      • hard-filtered.gvcf

      • hard-filtered.vcf

      • SmallVariants_Annotated.json.gz

      • cnv.vcf

      • exon_cov_report.tsv

      • gene_cov_report.tsv

      • MetricsOutput.tsv

      • microsat_output.json

Logs_intermediates Folder Structure

Contains folders for each submodule in the DRAGEN TSO 500 ctDNA on ICA pipeline. The folders contain a copy of all the relevant files required to create the metric output files and report files, as well as the combined log files at the root level and subfolders for each sample.

  • Logs_intermediates

    • AdditionalSarjMetrics

    • Annotation

    • CombinedVariantOutput

    • Contamination

    • CoverageReports

    • DnaFusionFiltering

    • DragenCaller

    • FastqValidation

    • FastqGeneration

    • MetricsOutput

    • PassingSampleSteps

    • ResourceVerification

    • Run QC

    • SampleAnalysisResults

    • SampleSheetValidation

    • Tmb

All logs in Logs_Intermediates are generated from the running analysis software. Inputs to the running Docker container (for example, the run folder, sample sheet, and FASTQ folder) are mapped from native locations on the server to the following locations in the container:

Input
Running Docker Container Location

Run folder

/opt/illumina/run-folder

Sample sheet

/opt/illumina/SampleSheet.csv

FASTQ folder

/opt/illumina/fastq-folder

Resources

/opt/illumina/resources

Analysis output folder

/opt/illumina/analysis-folder

The paths in the log messages refer to paths within the running docker container, not paths on the server.

Errors Folder Structure

Contains Errors.tsv. This file contains the summary of all the errors encountered during pipeline execution.

  • Errors

    • Errors.tsv

NovaSeq 6000Dx Analysis Application Output Folder Structure

The following files and folders are created during analysis by NovaSeq 6000Dx Analysis Application:

  • analysisResults.json

  • CopyComplete.txt

  • edgeos.nextflow.config

  • inputs/

    • sampleMapping.json

    • SampleSheet.csv

    • SampleSheet.json

  • Logs_Intermediates

  • Manifest.tsv

  • params.json

  • Results/

  • workflowLogs/

    • nf-main-***.log

When the analysis run completes, the analysis application generates an analysis output in a specified location. To view analysis output, follow the steps below:

  1. On the “Completed” runs tab, select the run

  2. Review the run details page, and this will give the information to access the output folder

  3. External Location: is the input for the run

  4. Analysis Output Folder: is where the output is stored. To navigate to this page, follow the “server location” and the gds analysis output folder

  5. Navigate to the directory that contains the analysis output folder

  6. Open the folder, and then select the files that you want to view

Command-Line Options
Command-Line Options
NovaSeq 6000Dx Analysis Application: Sample Sheet Requirements
Starting from BCL Files

Coverage Reports

The gene and exon coverage report files are tab-separated value (TSV) files with coverage values matching respectively the exons and genes specified in the manifest file.

Sample Sheet Introduction

Overview

The sample sheet includes a list of samples and their index sequences, along with additional information required to run DRAGEN TruSight Oncology 500 ctDNA Analysis Software. For example, the library prep kit used for analysis will need to be listed in the sample sheet. Appropriate index adapter sequences are determined by the assay used to perform analysis.

When running analysis on a standalone DRAGEN server or on ICA, a valid sample sheet can be created by:

When running analysis using a NovaSeq 6000Dx Analysis Application, a valid sample sheet can be created by:

The run set up section of this guide includes specific instructions to plan a run and set up a valid sample sheet for each deployment of DRAGEN TruSight Oncology 500 ctDNA Analysis Software.

Analysis Launch on ICA

Methods for Launching Analysis

Illumina Connected Analytics (ICA) supports the following methods for launching DRAGEN TruSight Oncology 500 ctDNA Analysis Software.

For more information about using ICA or BaseSpace Sequence Hub, refer to the following support pages on the Illumina support site.

Manual Launch of DRAGEN TSO 500 ctDNA Analysis on ICA

How to Launch Analysis

After adding the Bundle to the Project, an example dataset becomes available in the Demo_Data folder for the Project. 

  1. Download Results: After analysis is complete, navigate to results in the configured output location.

Please see the Illumina Support Shorts for guidance on how to set up and run DRAGEN TSO 500 RUO analysis on ICA. This is the same process as DRAGEN TSO 500 ctDNA, but with different inputs specific to DRAGEN TSO 500 solid:

Analysis Parameters on ICA

To launch an analysis via the ICA user interface, configure a DRAGEN TSO 500 ctDNA pipeline analysis with the following parameters.

Parameter Name
Description

User Reference

The analysis run name.

User Tags

Text labels to help index the analysis.

Notify me when task is completed

Option to receive an email notification when analysis is complete.

Output Folder

The path to the analysis output folder. The default path is the project output folder.

Entitlement Bundle

Automatically populated from the project details.

Sample Sheet

Select a sample sheet in CSV format for the analysis.

To note: Sample Sheet selection is optional if starting from a run folder, and required when submitting a FASTQ folder.

Input Folder

The run folder or FASTQ folder that contains files to analyze.

FASTQ List CSV

Do not use, this only applies to auto-launch TSO 500 ctDNA analysis from FASTQs after BCL auto-launch.

Starts from FASTQ

True for analysis performed on files in the FASTQ folder. False for analysis performed on files in the run folder.

Sample or Pair IDs

Optional subset of Sample IDs or Pair IDs to analyze.

Sample List

Do not use, this only applies to auto-launch TSO 500 ctDNA analysis from FASTQs after BCL auto-launch.

Storage Size

The storage size to allocate for the analysis. The default and recommended value is Large.

DNA Output

Small Variant Genome VCF

File name: {SAMPLE_ID}_hard-filtered.gvcf.gz

The small variant genome VCF file includes the variant call status for all targeted intervals, left-padded by 25 bp.

The Epidermal Growth Factor Receptor (EGFR) complex variant VCF includes phased EGFR variants. The FILTER column in the genome VCF determines the variant status. Refer to the following table for more information.

ALT
FILTER
Note

.

PASS

WT.

., A, C, G, etc1

low_depth²

Reference or filtered variant candidate with depth < 1000.

A, C, G, etc1

PASS

PASS variants.

A, C, G, etc1

weak_evidence

Filtered variant candidate with the following conditions:

Filtered variant candidate with low SQ score (< 2).

A, C, G, etc1

excluded_regions³

Position with high background noise. Not available for variant detection.

A, C, G, etc1

systematic_noise

Filtered variant candidate with low AQ score (< 20 for hotspots, < 60 for nonhotspots).

A, C, G, etc1

mapping_quality

Filtered variant candidate with low median mapping quality (< 30).

A, C, G, etc1

read_position

Filtered variant candidate showed bias clustered at fragment ends.

A, C, G, etc1

multiallelic

Filtered if there are two or more ALT alleles at this location.

A, C, G, etc1

low_frac_info_reads

Filtered if the fraction of informative reads is low (< 0.5).

¹ Etc refers to other variant types not mentioned in the table.

² Reference positions and nonpassing variants with coverage below 1000X directly translate into low_depth. For variant calls, low_depth is not applied when a position has a PASS filter.

³ This is a static list of regions compiled by Illumina. Email Illumina Technical Support for more information.

Small Variant Annotated JSON

File name: {SAMPLE_ID}_DNAVariants_Annotated.json.gz

The small variants annotated file provides variant annotation information for all non-reference positions in the VCF, which includes non-pass variants. The variant consequence definition is available on the Sequence Ontology website.

All pass variant calls are annotated using the Illumina Annotation Engine (IAE), also known as Nirvana, with the following information (using the RefSeq transcript):

  • HGNC Gene

    • Transcript

    • Exon

    • Consequence

    • c.HGVS

    • p.HGVS

  • COSMIC

TMB Trace

File name: {Sample_ID}.tmb.metrics.csv

The TMB metrics file contains the tumor mutational burden metrics for each DNA sample. The file format uses the following CSV column convention, similar to other metric CSV files.

Column
Description

Filtered Variant Count

Remaining variants after variant and germline filters.

Eligible Region (MB)

The specified custom regions, in megabases, that meet the minimum coverage threshold.

TMB

Filtered variants normalized by the eligible regions.

Sample Analysis Results JSON

File name: {SampleID}_SampleAnalysisResults.json

The sample analysis results file (SARJ) is an aggregated results file created for each sample. The SARJ file is used for the generation of downstream outputs. The file contains passing variants and passing variant annotations.

Copy Number Variants VCF

File name: {Sample_ID}_cnv.vcf

The copy number variants (CNV) file contains calls for DNA libraries of the CNV genes targeted by TruSight Oncology 500 ctDNA v2 and v1 assays. The CNV call indicates fold change results for each gene classified as reference, deletion, or amplification.

The value in the QUAL column of the copy number variants VCF is a Phred transformation of the p-value represented by the following equation:\

The p-value is derived from the t-test between the fold change (FC) of the gene against the rest of the genome. Higher Q-scores indicate higher confidence in the CNV call.

In the VCF notation, <DUP> indicates the detected FC is greater than a predefined amplification cutoff. <DEL> indicates the FC is less than a predefined deletion cutoff for that gene. This cutoff can vary from gene to gene.

Each copy number variant is reported as the fold change on normalized read depth in a testing sample relative to the normalized read depth in diploid genomes. Given tumor purity, the ploidy of a gene in the sample can be inferred from the reported fold change.\

Given tumor purity X%, for a reported fold change Y, the copy number n can be calculated by using the following equation:

For example, in a testing sample of tumor purity at 30%, MET with a fold change of 2.2x indicates that 10 copies of MET DNA are observed.

Copy Number Variants Metrics CSV

Copy number variant metrics are reported on a per sample level.

  • Sex Genotyper—The predicted sex of the sample.

  • Number of alignment records—The number of alignment records in the sample.

  • Bases in reference genome—The number of bases in the reference genome.

  • Average alignment coverage over genome—The average alignment coverage across the reference genome.

  • Number of filtered records (total)—The number of total filtered records.

  • Number of filtered records (duplicates)—The number of duplicated filtered records.

  • Number of filtered records (MAPQ)—The number of MAPQ filtered records.

  • Number of filtered records (unmapped)—The number of unmapped filtered records.

  • Number of target intervals—The number of target intervals in the sample.

  • Number of segments—The number of segments in the sample. Applicable only to CNV SLM mode.

  • Number of amplifications—The number of amplifications in the sample. Applicable only to CNV SLM mode.

  • Number of deletions—The number of deletions in the sample. Applicable only to CNV SLM mode.

  • Number of passing amplifications—The number of passing amplifications in the sample. Applicable only to CNV SLM mode.

  • Number of passing deletions—The number of passing deletions in the sample. Applicable only to CNV SLM mode.

Small Variant VCF

File name: {SampleID}_hard-filtered.vcf

The small variant file contains both phased variants and all other small variants. The header sections from both the phased variant (complex) VCF and the small variant VCF are included in this merged VCF. Variants that are found for both phased variants and small variants are only displayed as phased variants.

Fusions CSV

File name: {Sample_ID}_Fusions.csv

The fusions file contains all candidate fusions identified by the analysis pipeline.

The fusion columns are described in the following table. If you use Microsoft Excel to view this file, genes that are convertible to dates (for example, MARCH1) automatically convert to dd‑mm format (1-Mar).

Fusion Object Field
Description

Sample

Input sample ID.

Name

Fusion name as reported by the DRAGEN fusion caller.

Chr1

The chromosome of the first breakend.

Pos1

The position of the first breakend.

Chr2

The chromosome of the second breakend.

Pos2

The position of the second breakend.

Direction

The direction of how the breakends are joined.

Alt_Depth

The number of read-pairs supporting the fusion call.

Total_Depth

Max number of read-pairs aligned to a fusion breakend.

BP1_Depth

Number of read-pairs aligned to the first breakend.

BP2_Depth

Number of read-pairs aligned to the second breakend.

VAF

Variant allele frequency.

Gene1

Genes that overlap the first breakend.

Gene2

Genes that overlap the second breakend.

Contig

The fusion contig.

Filter

Indicates whether the fusion has passed all of the fusion filters.

Is_Cosmic_GenePair

Indicates whether the gene pair has been reported by COSMIC (True/False).

Fusion Directionality Known

Indicates whether the fusion direction is known, and indicated by the order of the genes (True/False).

The following table lists the meaning of the values in the direction column. The values are in the format used by Samtools.

Direction
VCF Format
Description

L1R2

t[p[

The left of breakend1 is joined with the right of breakend2.

L1rL2

t]p]

The left of breakend1 is joined with the reverse complement of the left of breakend2.

L2R1

]p]t

The left of breakend2 is joined with the right of breakend1.

rR2R1

[p[t

The reverse complement of the right of breakend2 is joined with the right of breakend1.

Command-Line Options

Option
Required
Description

--help

No

Displays a help screen with available command line options.

--analysisFolder

No

--resourcesFolder

No

Path to the resource folder location. The default location is /staging/illumina/DRAGEN_TSO500_CTDNA_2.6.0/resources. If not using the default location, enter the full path to the resource folder.

--runFolder

Yes

Required when --fastqFolder is not specified. Provide the full path to the local run folder.

--fastqFolder

Yes

Required when --runFolder is not specified. Provide the full path to the local FASTQ folder. Analysis starts at this location.

--user

No

Optional for Docker. Specify the user ID to be used within the Docker container.

--version

No

Displays the version of the software.

--sampleSheet

No

Provide the full path, including file name, if not provided as SampleSheet.csv in the run folder

--sampleOrPairIDs

No

Provide the comma-delimited sample IDs that should be processed on this node with no spaces. For example, Sample_1,Sample_2.

--demultiplexOnly

No

Demultiplex to generate FASTQ only without additional analysis.

Note:

  • Use full paths when specifying the file paths in the command line.

  • Avoid special characters such as &, *, #, and spaces.

  • When starting from BCL files, only the run folder needs to be specified. The immediate parent directory containing the BCL files does not need to be specified.

When running the analysis software using SSH, Illumina recommends using additional software to prevent unexpected termination of analysis. Illumina recommends screen and tmux.

  1. Wait for any running DRAGEN TruSight Oncology 500 ctDNA Analysis Software containers to complete before launching a new analysis. Run the following command to generate a list of running containers:docker ps

  2. Select from one of the following options:

  • Start from BCL files in the run folder with the sample sheet included in the run folder. DRAGEN_TSO500_CTDNA-2.6.0.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName}

  • Start from BCL files in the run folder with the sample sheet located in a folder other than the run folder. DRAGEN_TSO500_CTDNA.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleSheet /staging/{SampleSheetName}.csv

  • Start from BCL files in the run folder with a different sample sheet and demultiplexing only. DRAGEN_TSO500_CTDNA-2.6.0.sh \ --runFolder /staging/{RunFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleSheet /staging/{SampleSheetName}.csv \ --demultiplexOnly

  • Start from FASTQ folder with sample sheet included in the FASTQ folder and subset of samples. DRAGEN_TSO500_CTDNA-2.6.0.sh \ --fastqFolder /staging/{FastqFolderName} \ --analysisFolder /staging/{AnalysisFolderName} \ --sampleOrPairIDs "Sample_1,Sample_2"

Starting from BCL Files

If starting from BCL (*.bcl) files, DRAGEN TruSight Oncology 500 ctDNA Analysis Software requires the run folder to contain certain files and folders. These inputs are required for Docker.

The run folder contains data from the sequencing run, make sure that the folder contains the following files:

Folder/File
Description

Config folder

Configuration files

Data folder

*.bcl files

Images folder

[Optional] Raw sequencing image files.

Interop folder

Interop metric files.

Logs folder

[Optional] Sequencing system log files.

RTALogs folder

Real-Time Analysis (RTA) log files.

RunInfo.xml file

Run information.

RunParameters.xml file

Run parameters.

SampleSheet.csv file

Sample information. If you want to use a sample sheet that is not in the run folder or a sample sheet named something other than SampleSheet.csv, provide the full path.

Starting from FASTQ Files

The following inputs are required for running the DRAGEN TruSight Oncology 500 ctDNA Analysis Software using FASTQ (*.fastq) files. The requirements apply to Docker.

  • Full path to an existing FASTQ folder.

  • The sample sheet is in the FASTQ folder path, or you can set the path to the sample sheet with the --sampleSheet override command line option.

Make sure there is sufficient disk space for the analysis to complete. Refer to the --help command line argument details for disk space requirements.

Use BCL Convert to produce FASTQ files for DRAGEN TruSight Oncology 500 ctDNA Analysis Software. Using bcl2fastq does not produce the same results and is discouraged.

Make sure that BCL Convert is set to write UMI sequences to the read headers in the FASTQ files.

FASTQ File Organization

Store FASTQ files in individual subfolders that correspond to a specific Sample_ID. Keep file pairs together in the same folder. Alternatively, store the FASTQ files in one flat folder structure where the FASTQ files are stored in one folder.

The DRAGEN TruSight Oncology 500 ctDNA Analysis Software requires separate FASTQ files per sample. Do not merge FASTQ files.

The instrument generates two FASTQ files per flow cell lane, so that there are eight FASTQ files per sample.

Sample1_S1_L001_R1_001.fastq.gz

  • Sample1 represents the Sample ID.

  • The S in S1 means sample, and the 1 in S1 is based on the order of samples in the sample sheet, so S1 is the first sample.

  • L001 represents the flow cell lane number.

  • The R in R1 means Read, so R1 refers to Read 1.

Auto-Launch of DRAGEN TSO 500 ctDNA Analysis on ICA

Auto-launch Prerequisites and Workflow

*The BaseSpace Sequence Hub setting for run monitoring and storage must be selected on the instrument to use DRAGEN TSO 500 ctDNA analysis auto-launch. For information on preparing your instrument for DRAGEN TSO 500 ctDNA Auto-launch, refer to the documentation for your instrument.

  1. Use BaseSpace Sequence Hub Run Planning tool or the sample sheet templates provided on the support page to create and export a sample sheet.

    1. If BaseSpace Run Planning tool is not available in your region, use the sample sheet template.

    1. Data is uploaded to BaseSpace Sequence Hub and then pushed to ICA. You can monitor the run in BaseSpace Sequence Hub.

    2. Analysis auto launches in ICA when sequencing and the upload completes. You can monitor the status of the analysis in BaseSpace Sequence Hub or ICA

    3. If necessary, you can requeue the analysis via BaseSpace Sequence Hub.

  2. View the analysis output results in either BaseSpace Sequence Hub or ICA.

To avoid invalid sample sheet configurations, Illumina recommends using BaseSpace Run Planning tool to generate sample sheets. Using an invalid sample sheet can result in failed runs and analyses.

BaseSpace Sequence Hub Requirements for ICA Auto-Launch

BaseSpace Run Planning tool is a multi-step workflow that generates a manual launch or auto-launch capable sample sheet for export and requires the following additional settings:

  • Access to BaseSpace Sequence Hub.

  • ICA Run Storage is enabled under BaseSpace Sequence Hub settings.

Requeue Analysis

You can requeue analysis of a run via the run's Summary page in BaseSpace Sequence Hub.

Minimum Storage Requirements on ICA

Sequencing System
Minimum Disk Space (Gb)

NovaSeq 6000/6000Dx (RUO) SP Flow Cell

500

NovaSeq 6000/6000Dx (RUO) S1 Flow Cell

1100

NovaSeq 6000/6000Dx (RUO) S2 Flow Cell

2500

NovaSeq 6000/6000Dx (RUO) S4 Flow Cell

4300

NovaSeq X 1.5B

2000

NovaSeq X 10B

4300

Guided Examples

Please review these guided examples of using DRAGEN TSO 500 Analysis Software with auto-launch on ICA:

DNA Expanded Metrics

DNA expanded metrics are provided for information only. They can be informative for troubleshooting but are provided without explicit specification limits and are not directly used for sample quality control. For additional guidance, contact Illumina Technical Support.

Metrics Output

File Name: MetricsOutput.tsv

The metrics output file is a final combined metrics report that provides sample status, key analysis metrics, and metadata in a tab-separated values (TSV) file. Sample metrics within the report indicate guideline‑suggested lower specification limits (LSL) and upper specification limits (USL) for each sample in the run.

One metrics output file is generated for the entire run. An additional file is generated for each sample

Run Metrics

Run metrics from the analysis module indicate the quality of the sequencing run.

Review the following metrics to assess run data quality:

The values in the Run Metrics section are listed as NA in the following situations:

  • The analysis was started from FASTQ files.

  • The analysis was started from BCL files and the InterOp files are missing or corrupt.

  • [NovaSeqX Plus only] There is no PCT_PF_READS value in NovaSeqX Plus runs, so the PCT_PF_READS value will always be NA.

Sample QC Metrics

Review the following metrics to assess sample data quality:

*The recommended threshold of 0.059 for GENE_SCALED_MAD only applies to real cell‑free DNA.

A sample sheet is required for each analysis with DRAGEN TruSight Oncology 500 ctDNA Analysis Software. A sample sheet is a comma-separated value (*.csv) file format used by Illumina instruments, platforms, and analysis pipelines to store settings and data for sequencing and analysis. The DRAGEN TruSight Oncology 500 ctDNA Analysis Software is compatible with the sample sheet v2. For general information on the sample sheet v2, refer to .

BaseSpace Run Planner (preferred), see for details

Downloading and modifying a sample sheet template following the requirements, see for details

Using the user interface of the DRAGEN TruSight Oncology 500 ctDNA Analysis Application, see for details

Downloading and modifying a sample sheet template following the requirements (see for details), then importing it to Illumina Run Manager.

—Stream run data directly from the instrument to ICA via a specially configured sample sheet and automatically begin DRAGEN TSO 500 ctDNA analysis.

—Initiate DRAGEN TSO 500 ctDNA analysis on ICA using the run files and sample sheet files in the project.

Create a Project: Project can be specific for the DRAGEN TruSight Oncology 500 ctDNA pipeline or it can contain multiple Pipelines and/or Tools). For information on creating Projects, refer to the Projects section in .

ICA standard storage is used by default as soon as the Project is saved. To connect a different storage source, set it up before creating your Project. For details and options, refer to the Storage section in .

Edit Project and Add Bundle: Edit the Project and add the bundle titled, "DRAGEN TSO 500 ctDNA v2.6.0 (XX)." XX is a 2-letter code designating the region from which you are launching the analysis. Adding the Bundle automatically adds the pipeline and associated resource files and datasets to the Project. For information on Bundles, refer to the Bundles section in .

 Upload the sequencing data: For information on viewing and uploading data, refer to the Data section in .

Start Analysis: In the Project, navigate to Pipelines, select the TSO 500 ctDNA v2.6.0 Pipeline, and then select  "Start New Analysis". Set up the new analysis by configuring the parameters listed in the . When the required files are completed, start analysis.

For information about using pipelines, refer to .

Refer to for more information.

Path to the local analysis folder. The default location is /staging/DRAGEN_TSO500_CTDNA_2.6.0_Analysis_{timestamp}. If not using the default location, provide the full path to the local analysis folder. Folder must have sufficient space and must be on an NVMe SSD drive. For example, the /staging directory on the DRAGEN server. Refer to table in for minimum disk space requirements.

The FASTQ folder structure conforms to the folder structure in

Import the sample sheet to the instrument and start the sequencing run. Refer to for sample sheet guidance.

Refer to the for information on setting up a BaseSpace Sequence Hub project.

Refer to the for more information on requeuing an analysis.

Refer to the for information on how to manage accounts and subscriptions.

Metric
Description
Troubleshooting
Metric
Description
Recommended Threshold
Metric
Description
Recommended Threshold
Variant Class

For troubleshooting information, refer to

Illumina Connected Software - Sample Sheet
Sample Sheet Creation in BaseSpace Run Planner page
Sample Sheet Requirements page
Run Planning on Illumina Run Manager
Sample Sheet Requirements page
Auto-launch
Manual launch
Illumina Connected Analytics support site page
BaseSpace Sequence Hub support site page
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics help
Illumina Connected Analytics support site page
DNA Analysis Methods
BaseSpace Sequence Hub support site page
BaseSpace Sequence Hub support site page
Software Registration page
NovaSeq 6000Dx: TSO 500 Auto-launch Analysis in Cloud
table below
FASTQ File Organization.
ICA Auto-launch Sample Sheet Requirements

TOTAL_PF_READS (count)

Total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence.

Primarily driven by data output of sequencer, quality of library and balancing of library in library pool. If TOTAL_PF_READS is in line with other samples, but coverage metrics are more may suggest non-specific enrichment.

Low values for all samples indicate a poor quality run with possible low cluster numbers or low numbers of Q30 and PF%.

A low value for an individual sample indicates poor pooling of this library into the final pool.

MEAN_FAMILY_SIZE (count)

A UMI Family is a group of reads that all have the same UMI barcode. The family size is the number of reads in family. MEAN_FAMILY_SIZE is the mean of the entire population of reads assembled into UMI families.

The mean UMI family size decreases with increased unique read numbers, and more input DNA leads to more unique reads. Conversely over sequencing of a fixed population of unique DNA molecules leads to increased family size.

As a guide, for a good run with optimal cluster density, passing specs, even sample pooling, and good quality DNA we usually observe values <10.

UMI family size = 1 is not ideal as it is harder to correct for errors.

UMI family size of 2 to 5 enables efficient error correction without wasting sequencing capacity on high percentages of duplicate reads.

MEDIAN_TARGET_COVERAGE (count)

Median depth across all the unique loci occurring in all regions of the manifest file.

Lower median target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output.

PCT_CHIMERIC_READS (%)

Chimeric reads occur when one sequencing read aligns to two distinct portions of the genome with little or no overlap. Metric is proportion of total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence.

While this can be indicative of large-scale structural rearrangement of the genome, values that are elevated above the usual baseline may indicate enrichment probe contamination during library preparation. A suggested metric USL is 8% (those that are higher might see decrease performance in small variant and tmb scores).

PCT_EXON_500X (%)

Percentage of exon bases with 500X fragment coverage. Calculated against all regions in manifest containing _exon in name.

Can be used in combination with other PCT_EXON metrics to understand under or over coverage of exons.

PCT_EXON_1500X (%)

Percentage of exon bases with 1500X fragment coverage. Calculated against all regions in manifest containing _exon in name.

Can be used in combination with other PCT_EXON metrics to understand under or over coverage of exons

PCT_READ_ENRICHMENT (%)

Percentage of reads that have overlapping sequence with the target regions defined in the sample manifest.

Indicative of general enrichment performance. Reduced proportions of enriched reads may indicate issues with the enrichment proportion of the library preparation.

PCT_USABLE_UMI_READS (%)

Percentage of reads that have valid UMI sequences associated with them.

As UMI reads are sequenced at the start of each read, loss of valid UMI sequence may be cause by sequencing issues impacting the quality of base calling in this portion of the sequencing read.

MEAN_TARGET_COVERAGE (count)

Mean depth across all the unique loci defined in the manifest file.

Lower mean target coverage may be due to poor sample input/quality, library preparation issues or low sequencing output. Large differences between the median and mean target coverage values may indicated a skewed distribution of target coverage.

PCT_ALIGNED_READS (%)

Proportion of aligned reads that are non-supplementary, non-secondary and pass QC versus aligned reads that are non-supplementary, non-secondary, mapped and pass QC.

PCT_CONTAMINATION_EST (%)

This metric should only be evaluated if the CONTAMINATION_SCORE metric exceed the USL. This metric estimates the amount of contamination in a sample. The contamination level is computed by taking 2.0* the average of the adjusted allele frequencies of all variants that were selected. The adjusted alllele frequency is either the actual allele frequency of the variant if it is less than 0.5, or 1 -allele frequency if it is greater than or equal to 0.5.

If the sample does not fail the CONTAMINATION_SCORE this metric has no intended meaning as it will be driven by statistical noise (e.g. the few variants that naturally fall outside an expected interval around 0.5 due to random chance)

High contamination estimates may be due to any of the following:

Inter-sample contamination caused by mixing of samples during extraction or library preparation.

Intra-sample contamination, due to mixing of clonally different cell populations during extraction. Large scale genomic rearrangements that cause unexpected VAFs for large numbers of variants.

PCT_TARGET_0.4X_MEAN (%)

Parentage of target (all locations in manifest) reads that have a coverage depth of greater the 0.4x the mean target coverage depth (see definition above).

Provides an indication of uniformity of coverage of the target regions in the manifest file. When trended over time reductions in this metric may indicate an issue with the enrichment process resulting in coverage bias.

PCT_TARGET_500X (%)

Percentage of target bases with 500X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_TARGET_1000X (%)

Percentage of target bases with 1000X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_TARGET_1500X (%)

Percentage of target bases with 1500X fragment coverage. Calculated against all regions in manifest file.

Can be used in combination with other PCT_TARGET metrics to understand under or over coverage of targets.

PCT_DUPLEXFAMILIES (%)

Percent of collapsed reads that are duplex (e.g. composed or original forward strand and original reverse strand reads). Number of families that are merged as duplex over total number of families.

Higher is more desirable, lower family depth leads to lower precent duplex families. If low check for under clustering or chemistry concerns.

MEDIAN_INSERT_SIZE (bp)

Median fragment size for sample.

A low median insert size could be a sign of low sample quality or degradation

MAX_SOMATIC_AF

Max somatic allele frequency of a variant; a proxy for tumor fraction. The TMB step flags the variants by potential somatic status using database, VAF and clonal hematopoiesis information. The remaining variants are ranked by variant allele frequency in descending order. The variant allele frequency of first COSMIC hotspot (count >50) or confident somatic variant (having significantly shorter fragment size) is reported as the MaxSomaticVaf for each sample. If no such variant exists, the 4th variant is reported.

This metric is driven by sample tumor fraction

PCT_SOFT_CLIPPED_BASES (%)

Percentage of based that were not used for alignment but retained as part of the alignment file

Soft clipped reads are used as a part of the downstream analysis for small variants calling. A higher-than-expected number could indicate a low-quality enrichment step.

PCT_Q30_BASES (%)

Average percentage of bases ≥ Q30. A prediction of the probability of an incorrect base call (Q‑score).

An indicator of sequencing run quality, low Q30 across all samples on a run could be the result of run overclustering.

PCT_PF_READS

Percentage of reads on the sequencing flow cell that pass the filter.

≥ 55.0

(No lower specification limit for NovaSeq X Plus)

PCT_Q30_R1

Percentage of bases with a quality score ≥ 30 from Read 1.

≥ 80.0

(≥ 85.0 for NovaSeq X Plus)

PCT_Q30_R2

Percentage of bases with a quality score ≥ 30 from Read 2.

≥ 80.0

(≥ 85.0 for NovaSeq X Plus)

CONTAMINATION_SCORE

The contamination score based on VAF distribution of SNPs.

≤ 1227

All

MEDIAN_EXON_COVERAGE

Median exon fragment coverage across all exon bases.

≥ 1300

Small variant, TMB, fusion, MSI

PCT_EXON_1000X

Percent exon bases with 1000X fragment coverage.

≥ 80.0

Small variant, TMB

GENE_SCALED_MAD

The median of absolute deviations normalized by gene fold change.

≤ 0.059*

CNV

MEDIAN_BIN_COUNT_CNV_TARGET

The median raw bin count per CNV target.

≥ 6.0

CNV

Block List

The following table lists the genes that have associated block listed sites. For the exact location of the block listed site, contact Illumina Technical Support.

Gene
Block List Sites
Gene
Block List Sites
Gene
Block List Sites

ABL1

5

FGFR2

144

PAX7

5

AKT2

5

FGFR3

1

PAX8

275

AKT3

20

FGFR4

36

PBRM1

3

ALK

90

FLCN

2

PDCD1

2

ANKRD11

6

FLI1

36

PDGFRA

5

ANKRD26

9

FLT1

91

PDGFRB

2

AR

81

FLT4

3

PDK1

1

ARID1A

40

FOXA1

48

PDPK1

6

ARID1B

87

FOXL2

4

PGR

5

ARID2

1

FOXO1

2

PHF6

2

ASXL1

3

FOXP1

3

PHOX2B

15

ASXL2

5

FUBP1

1

PIK3C2G

2

ATM

2

GATA4

6

PIK3CA

18

ATR

3

GATA6

12

PIK3CB

42

ATRX

17

GEN1

1

PIK3R1

6

AURKA

1

GID4

3

PIK3R2

2

AXIN2

4

GNAQ

4

PLCG2

3

AXL

74

GNAS

11

PLK2

2

BBC3

2

GPR124

3

PMAIP1

7

BCL10

2

GRM3

1

PMS2

1

BCL2L11

16

H3F3A

1

POLE

3

BCOR

2

H3F3C

2

PPARG

446

BCORL1

1

HGF

1

PRDM1

1

BCR

64

HIST1H1C

2

PRKCI

2

BIRC3

1

HLA-A

72

PRKDC

5

BLM

4

HNF1A

2

PTCH1

13

BMPR1A

4

HNRNPK

9

PTEN

41

BRAF

283

HOXB13

1

PTPRS

14

BRCA1

49

HSP90AA1

4

PTPRT

2

BRCA2

21

ICOSLG

6

QKI

2

BRD4

16

IFNGR1

2

RAD21

1

CARD11

4

iIndel

91

RAD50

5

CASP8

2

INHBA

4

RAD51

18

CBL

8

INPP4A

1

RAD51B

8

CCND1

25

INPP4B

1

RAF1

98

CCND3

49

IRS1

9

RANBP2

12

CCNE1

72

IRS2

19

RARA

2

CD74

50

iSNP

4

RASA1

1

CDH1

4

JAK2

4

RB1

5

CDK12

3

JUN

7

RBM10

13

CDK4

46

KAT6A

5

RECQL4

3

CDK6

13

KDM5A

7

REL

3

CDK8

4

KDM5C

2

RET

3

CDKN2B

2

KDM6A

2

RFWD2

22

CEBPA

12

KDR

1

RICTOR

1

CHD2

5

KIF5B

7

ROS1

287

CHD4

12

KIT

5

RPS6KA4

3

CHEK1

75

KMT2B

51

RPS6KB1

109

CHEK2

64

KMT2C

118

RUNX1

3

chrY

93

KMT2D

108

SDHA

18

CIC

2

KRAS

44

SDHB

3

CREBBP

4

LAMP1

64

SDHD

17

CSNK1A1

4

LATS1

1

SETBP1

7

CTNNB1

1

LATS2

4

SETD2

26

CUL3

1

LoH

85

SF3B1

1

CUX1

9

LRP1B

3

SH2B3

4

DAXX

5

LZTR1

1

SH2D1A

2

DDR2

1

MAGI2

2

SLIT2

1

DDX41

1

MALT1

4

SLX4

2

DIS3

2

MAP2K2

1

SMARCA4

4

DNAJB1

6

MAP2K4

5

SMC1A

1

DNMT1

1

MAP3K1

8

SMC3

8

DNMT3A

4

MAP3K14

2

SMO

2

DOT1L

2

MAP3K4

10

SOX10

7

E2F3

70

MAPK1

6

SOX17

1

EGFR

304

MAPK3

6

SOX9

14

EIF4E

12

MCL1

1

SPEN

4

EML4

9

MDC1

23

STAG1

5

EP300

1

MDM2

53

STAG2

2

ERBB2

14

MDM4

67

STAT4

1

ERBB3

62

MED12

28

STAT5A

1

ERCC1

53

MGA

6

STAT5B

4

ERCC2

57

MLL

9

SUFU

5

ERCC3

4

MLLT3

18

SUZ12

9

ERCC5

4

MRE11A

5

TAF1

9

ERG

2

MSH3

10

TBX3

1

ESR1

32

MSH6

2

TCEB1

1

ETS1

45

MSI

148

TCF3

2

ETV1

862

MST1

18

TCF7L2

6

ETV4

502

MYB

402

TERT

2

ETV5

11

MYC

78

TET1

1

ETV6

187

MYCL1

28

TET2

23

EWSR1

364

MYCN

69

TFE3

299

EZH2

2

MYOD1

3

TFRC

33

FANCA

1

NAB2

10

TGFBR1

6

FANCD2

11

NCOA3

28

TGFBR2

2

FANCG

10

NCOR1

9

TMEM127

5

FANCI

1

NF1

3

TMPRSS2

236

FANCL

1

NKX2-1

4

TOP2A

1

FAT1

2

NOTCH1

4

TP53

22

FBXW7

4

NOTCH3

7

TRAF7

4

FGF1

25

NOTCH4

9

TSC1

4

FGF10

17

NPM1

5

TSC2

1

FGF14

15

NRAS

29

U2AF1

1

FGF19

102

NRG1

47

VEGFA

7

FGF2

26

NTRK1

134

WISP3

2

FGF23

38

NTRK2

145

WT1

10

FGF3

60

NTRK3

13

XIAP

1

FGF4

25

NUTM1

134

XPO1

2

FGF5

14

PAK1

68

XRCC2

1

FGF6

9

PAK3

8

YAP1

1

FGF7

9

PALB2

1

ZBTB7A

11

FGF8

30

PARK2

23

ZFHX3

56

FGF9

21

PARP1

2

ZNF703

7

FGFR1

26

PAX3

156

ZRSR2

2

\

Storage Requirements
Troubleshooting

Sample Sheet Templates

Sample Sheet templates for TSO 500 ctDNA standalone DRAGEN server and ICA manual launch analysis can be found in the table below. For auto-launch compatible sample sheets, use BaseSpace Run Planner.

DRAGEN TSO 500 ctDNA analysis software is compatible with several instruments and assay workflows, each of which have implications for the sample sheet.

Sample sheet templates contain all required fields, including index sequences in the proper orientation for all indexes from a given library prep kit. The templates are provided as a starting point for creating a sample sheet manually when launching analysis on a standalone DRAGEN server or on ICA using manual launch.

For interactive run planning or to create a sample sheet for ICA Autolaunch, use to create valid sample sheets for either local or cloud analysis. To set up a run in BaseSpace run planner, refer to .

Users can visit the section to learn additional details on required fields and values as they fill-in their sample information. Use the lookup table below to select and download the sample sheet template that matches your instrument, assay, and workflow configuration:

Assay
Instrument
Assay Workflow
File

TSO 500 ctDNA v1

NovaSeq 6000

Standard

TSO 500 ctDNA v1

NovaSeq 6000Dx (in RUO mode)

Standard

TSO 500 ctDNA v2

NovaSeq 6000

Standard

TSO 500 ctDNA v2

NovaSeq 6000Dx (in RUO mode)

Standard

TSO 500 ctDNA v2

NovaSeq X

Standard

BaseSpace Run Planner
Sample Sheet Creation in BaseSpace Run Planner
Sample Sheet guidelines
SampleSheet-TSO500_ctDNA-v2_6_1-v1_assay-Novaseq6000-Reverse_Complement-Local.csv
SampleSheet-TSO500_ctDNA-v2_6_1-v1_assay-Novaseq6000Dx-Local.csv
SampleSheet-TSO500_ctDNA-v2_6_1-v2_assay-Novaseq6000-Local.csv
SampleSheet-TSO500_ctDNA-v2_6_1-v2_assay-Novaseq6000Dx-Local.csv
SampleSheet-TSO500_ctDNA-v2_6_1-v2_assay-NovaseqX-local.csv
TSO 500 NextSeq550_v2 Sample Sheet Template.csv
TSO 500 HRD_NextSeq550_v2 Sample Sheet Template.csv
TSO 500 HRD_NovaSeq6000_v2 Sample Sheet Template.csv
TSO 500 HRD_NovaSeq6000Dx_fwd_v2 Sample Sheet Template.csv
TSO 500 HT_NovaSeq6000_SP_S1_S2_S4_v2 Sample Sheet Template.csv
TSO 500 HT_NovaSeq6000_SP_S1_S2_S4_XP_v2 Sample Sheet Template.csv
TSO 500 HT_NovaSeq6000DxRUO_S1_S2_S4_v2 Sample Sheet Template_200037181.csv
TSO 500 HT_NovaSeq6000DxRUO_S1_S2_S4_XP_v2 Sample Sheet Template_200037182.csv
Workflow Overview