5  Proofreading enzymes

Figure 5.1: Sebastian Sprudelwasser, on his way to a Berlin Rave. By Anne Kathrine Wiborg Runge
Figure 5.2: Example of a smiley plot of a double stranded library amplified after adapter ligation with a proofreading NEBNext Ultra II Q5 Master polymerase. Data taken from library HH06D of (Cai et al. 2022). Damage data generated using DamageProfiler and plotted using R and tidyverse packages (Wickham et al. 2019).

Sometimes the type of polymerase you use during library construction will influence the type of damage pattern you will receive.

In the example above, Cai et al. (2022) found a funny smiley plot, where while the 3’ G to A patterns look like a classic non-UDG (i.e. non-USER treated, thus retaining damage) plot, the frequency of C to T on the 5’ end was extremely reduced.

They identified the problem as the choice of polymerase. The Q5 polymerase is a ‘high fidelity’ enzyme, which corresponds to being a ‘proofreading’ enzyme. This means that when the enzyme hits an incorrect nucleotide (such as a deaminated cytosine), it will instead remove the nucleotide on it’s 3’ to 5’ exonuclease activity 1.

While having a low error rate is great for modern genomics, this can be less optimal for preserving ancient DNA damage for profiling later on.

In the case of this particular enzyme, it is maybe not so much of a problem as you retain the damage signal on the 3’ for proving authenticity. However this may be problematic for downstream aDNA validation tools that may have an expected ‘model’ of ancient DNA damage.

Tip

The choice of enzyme only matters during the first amplification after adapter ligation. At subsequent amplifications, the (misincorporated) thymines have already been integrated into the template molecules, so it doesn’t matter which enzyme you use.


  1. More information can be found on this blog post on the NEB website↩︎