This report uses TargetQC to evaluate the quality of a clinical Whole Genome Sequencing (WGS) sample. The evaluation covers three main modules: Target Exon QC, Target Gene QC, and Variant Capture QC.
This section checks how well the exons of target genes are covered by the sequencing data.
First, we look at how much of the target region is covered across different coverage thresholds. In this report, the target region is the exons of protein-coding genes with known phenotype associations in OMIM. Coverage thresholds used: 10X, 20X, 30X, and 40X.
Table: Shows coverage threshold, target region size, average coverage, region size meeting the threshold, and its proportion.
| thresholds | size | avg | count | percent |
|---|---|---|---|---|
| 10X | 21939429 | 54.37 | 13138567 | 59.89 |
| 20X | 21939429 | 54.37 | 11843446 | 53.98 |
| 30X | 21939429 | 54.37 | 10625457 | 48.43 |
| 40X | 21939429 | 54.37 | 9486638 | 43.24 |
Figure: Visualization of Table (x-axis = coverage thresholds, y-axis = proportion of region meeting threshold).
Each exon is assigned a classification label. For WGS, only coverage depth is used (ES uses both capture efficiency and coverage depth).
Table: Lists exons with chromosome position (0-base), average coverage (“coverage” column), coverage-based label (“type” column), and capture-based label.For WGS, all exons are treated as “full capture.” For ES, extra columns would show capture percentage.
Counts of exons in each category are
summarized.
Table: Matrix of exon
counts per category.
| capture | well | middle | poor | capture_total |
|---|---|---|---|---|
| full | 44664 | 8232 | 18521 | 71417 |
| partial | 0 | 0 | 0 | 0 |
| near | 0 | 0 | 0 | 0 |
| no | 0 | 0 | 0 | 0 |
| coverage_total | 44664 | 8232 | 18521 | 71417 |
Figure:Visualizes proportion of exons in 3 coverage types: well-covered, middle-covered, poor-covered.
For ES, it would show coverage conditions under each capture level and capture conditions under each coverage level. Such as:
Figure a: coverage conditions under each capture level
Figure b: capture conditions under each coverage level
This module evaluates gene coverage from an overall perspective, focusing on protein-coding genes with OMIM phenotype associations.
Proportion of bases in coverage ranges
(≥30X, 20–30X, ≤20X) is calculated for:
- All target gene exons.
- Coding regions of protein-coding genes.
Table: Proportions of
bases in these three ranges for all target gene exons.
(columns: gene ID, name, type, target transcript ID, 3 coverage range
proportions).
Table: The same proportions but for coding regions of protein-coding genes (columns: gene ID, name, type, target transcript ID, 3 coverage range proportions).
Figure:Cumulative curve (x-axis = coverage range percentage, y-axis = number of genes reaching that percentage). You can choose to plot curves for different gene types (e.g., all target genes or protein-coding genes). For protein-coding genes, coding regions are used.