Popis: |
The global variation of cell-free DNA fragmentation patterns is a promising biomarker for cancer diagnosis. However, the characterization of its hotspots and aberrations in early-stage cancer at the fine-scale is still poorly understood. Here, we developed an approach to de novo characterize genome-wide cell-free DNA fragmentation hotspots by integrating both fragment coverage and size from whole-genome sequencing. These hotspots are highly enriched in regulatory elements, such as promoters, and hematopoietic-specific enhancers. Surprisingly, half of the high-confident hotspots are still largely protected by the nucleosome and located near repeats, named inaccessible hotspots, which suggests the unknown origin of cell-free DNA fragmentation. In early-stage cancer, we observed the increases of fragmentation level at these inaccessible hotspots from microsatellite repeats and the decreases of fragmentation level at accessible hotspots near promoter regions, mostly with the silenced biological processes from peripheral immune cells and enriched in CTCF insulators. We identified the fragmentation hotspots from 298 cancer samples across 8 different cancer types (92% in stage I to III), 103 benign samples, and 247 healthy samples. The fine-scale fragmentation level at most variable hotspots showed cancer-specific fragmentation patterns across multiple cancer types and non-cancer controls. Therefore, with the fine-scale fragmentation signals alone in a machine learning model, we achieved 42% to 93% sensitivity at 100% specificity in different early-stage cancer. In cancer positive cases, we further localized cancer to a small number of anatomic sites with a median of 85% accuracy. The results here highlight the significance to characterize the fine-scale cell-free DNA fragmentation hotspot as a novel molecular marker for the screening of early-stage cancer that requires both high sensitivity and ultra-high specificity. |