Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
It is universally acknowledged that nucleic acids extracted from FFPE tissues are prone to degradation and cross-linking. However, the prevailing model assumes this degradation is a random, stochastic process resulting in a Gaussian distribution of fragment sizes—a model that underpins the widely used DNA Integrity Number (DIN). This paper argues that FFPE degradation is fundamentally non-random, driven by stereochemical accessibility and the “Methylene Sieve” effect. Consequently, relying on DIN as a quality metric is fundamentally flawed for FFPE samples. We propose the adoption of a Functional Integrity Score (FIS) based on qPCR amplification ratios, which accurately reflects the utility of the sample for downstream spatial and sequencing applications.
1. The Mechanism of the Methylene Sieve
Formaldehyde does not crosslink DNA uniformly. The formation of methylene bridges (-CH2-) between amino groups is highly dependent on the stereochemical accessibility of the DNA. The minor groove of the DNA double helix is particularly susceptible to formaldehyde adduct formation. Furthermore, crosslinking occurs preferentially at sites where nuclear proteins (histones) are intimately bound to the DNA.
This creates the “Methylene Sieve” effect. As fixation time increases, the crosslinks accumulate in a non-random, periodic pattern that mirrors nucleosome binding. When the tissue is subsequently extracted, the reversal of these crosslinks is incomplete. The phosphodiester backbone fractures precisely at these points of heavy cross-linking, not randomly. Therefore, FFPE DNA fragmentation is inherently periodic, heavily skewed toward lengths of approximately 150-200 base pairs (the length of DNA wrapped around a single nucleosome).
2. The Biochemical Cascade of Degradation
The non-random fragmentation is compounded by a secondary biochemical cascade. Formaldehyde in aqueous solution naturally oxidizes to formic acid. In a sealed tissue block over years of storage, this formic acid creates a localized, acidic micro-environment around the DNA. This drives acid-catalyzed depurination. When a purine base (adenine or guanine) is lost, the resulting apurinic (AP) site destabilizes the sugar-phosphate backbone, causing a beta-elimination reaction that cleanly snaps the DNA strand. Thus, degradation is not just mechanical fragmentation; it is a targeted chemical elimination at purine sites.
3. The Fallacy of the DIN
The DNA Integrity Number (DIN), calculated via capillary electrophoresis (e.g., Agilent TapeStation), measures the distribution of fragment sizes, assuming random degradation. A pristine genomic DNA sample yields a Gaussian peak at high molecular weight (DIN 10). FFPE samples, due to the Methylene Sieve effect, yield a heavily skewed, non-Gaussian distribution with massive peaks in the 150-300 bp range.
The DIN algorithm interprets this non-Gaussian distribution as catastrophic degradation, often assigning FFPE samples a DIN of 1 to 3. The fallacy here is structural: a DIN of 2 implies the sample is useless for sequencing. Yet, next-generation sequencing (NGS) libraries are routinely constructed from 150-300 bp fragments. The DIN fundamentally misrepresents the *functional* quality of the DNA. A sample with a DIN of 2 might actually have excellent sequence integrity between the crosslink-induced breakpoints, making it perfectly viable for targeted NGS panels.
4. Toward a Functional Integrity Score (FIS)
For the tissue array industry, where spatial context is vital, we must abandon the structural DIN metric in favor of a Functional Integrity Score (FIS). The FIS does not care how long the fragments are; it cares if the sequence *between* the breaks is readable.
The FIS is calculated using a dual-amplicon qPCR assay. We measure the Cq difference (ΔCq) between a short target (e.g., 100 bp) and a long target (e.g., 300 bp) within a single-copy gene.
5. Conclusion
Nucleic acid degradation in FFPE blocks is a non-random, stereochemically driven process, not stochastic wear. The DIN metric, built on the assumption of random fragmentation, systematically undervalues FFPE-derived nucleic acids. By transitioning to a Functional Integrity Score based on amplification ratios, we can accurately triage archival TMA blocks, unlocking vast collections of “low DIN” samples that are, in fact, highly functional for modern genomic applications.