We have come a long way since the discovery of the double-helix structure of DNA. We have explored nature’s blueprint that is the complete genetic code of a human being. We are now investigating the variations in the genetic code to understand how different types of mutations and modifications within the human genome can influence diversities in phenotype, disease progression, and treatment.
The DNA sequencing methods have evolved significantly since the complete amino acid sequencing of insulin by Frederick Sanger. The Sanger method was inspired by the initial efforts of R. Padmanabhan, Wu and colleagues. Their methods paved the way for primer-based extension and sequencing of DNA by chain-termination method under the vigilance of Sanger at the MRC Center, Cambridge in 1977. Irrespective of which DNA sequencing method researchers have adopted over the past decades, at least one working principle has remained constant – the process aims to determine the order of the nucleotides in the DNA of a given sample.
Here’s a brief look at the different DNA sequencing methods that have defined the frontiers in genomics and metagenomics in the last four decades
The classical sequencing methods
1977 was a defining year for studies in genomics. Allan Maxam and Walter Gilbert are attributed for the publication of the first standardized DNA sequencing method that leveraged the chemical modification of DNA. The chemical modification of the bases resulted in cleavage at specific sites (nucleotides). It uses radioactive labeling at the 5′-end of the DNA molecule. By varying the concentration of the modifying agent, the teams can control the sites of cleavage, which generates a family of differently-sized DNA fragments. Visualization is easy – the gel is placed on an x-ray film that produces dark bands corresponding to radiolabeled DNA. The analysis is relatively simple, as long as the fragments are small and non-repetitive.
The chain termination method was also introduced in 1977 by Frederick Sanger et. al. It had been the most popular DNA sequencing technique until the standardization of other advanced methods. Sanger sequencing uses single-stranded DNA as the template, DNA primer, deoxynucleoside triphosphates (dNTPs), DNA polymerase, and di-deoxynucleoside triphosphates (ddNTPs) as chain terminators. The lack of radioactivity made the Sanger method more favorable than the Maxam-Gilbert method. After controlled DNA extension, there is a round of heat denaturation and separation of the strands using gel electrophoresis.
The Sanger method of DNA sequencing and analysis is straightforward and fast for short DNA sequences. Laboratories combine the standard DNA dye-terminator sequencing with high-throughput automated DNA sequence analyzers for the quick determination of DNA sequence.
Next-generation sequencing or NGS does not refer to a single method of DNA sequencing. There are several standardized methods of DNA sequencing and analysis that fall under the NGS category. Almost all of them share a few features. They are
All NGS methods are high-throughput techniques.
The 454 DNA sequencing is a large-scale pyrosequencing technique that can efficiently sequence around 400-600 megabases within a 10-hour run period. The effectiveness of this DNA sequencing and analysis process is limited due to the limitation in the sizes of the individual reads of DNA sequences. It makes genome assembly quite the challenge. Due to the high expenses and the lack of demand, Roche had declared to discontinue 454 Pyrosequencing of DNA in 2013.
The Illumina sequencing technique leverages reversible terminators for the sequencing-by-synthesis process. Illumina currently offers several proprietary sequencer systems and machines for DNA sequencing via high-throughput methods. Now, HiSeq is one of the most widely used sequencing platforms from Illumina. It offers more than 3 billion reads per flow cell within 72-hours. The MiSeq is the cost-effective alternative that delivers 25 million reads with 300 base pair read length from each end.
The Ion Torrent Systems Inc. developed a DNA sequencing and analysis system that depends on sequencing chemistry along with a semiconductor-based detection system. The method relies on the detection of hydrogen ions released during the DNA polymerization reaction. The hypersensitive ion sensor detects the hydrogen ion, and the intensity of the electronic signal is proportional to the number of homopolymer repeats.
The nanopore method is relatively new. This process depends on the electric signals detected as each nucleotide passes through the cyclodextrin-bound alpha-hemolysin pores. There are currently two main types – protein and solid-state nanopore sequencing. Each pore has a detection region that can recognize the charge difference between the four nucleotide bases.
The analysis of data recorded from DNA sequencing helps the user transform raw data into useful information. DNA seq data analysis typically involves four steps
The analysis of NGS data is particularly complicated due to the bulk of raw data generated after each complete cycle. The results depend on the adaptor addition process as well as the variation in DNA library construction. Simple quality control checks run by commercially available DNA sequencing and analysis tools are imperative to reduce the errors in the final results. Always opt for NGS data analysis software that generates detailed QC reports after each step.
The optimal analysis of the data obtained from NGS should help in getting information on the target DNA sequence or gene, and aide in the discovery of new genes. They can help in the prediction of advanced structures. DNA sequence data analytics now plays a vital role in the development of personalized medicine and targeted therapy based on single nucleotide polymorphism data and other forms of gene polymorphism.
Walking pneumonia, also known as atypical pneumonia, is a milder form of pneumonia that, while…
Internal bleeding is a medical condition where blood escapes from damaged blood vessels and accumulates…
Serotonin, commonly referred to as the "feel-good" neurotransmitter, plays a critical role in regulating mood,…
Autism Spectrum Disorder (ASD) and Attention-Deficit/Hyperactivity Disorder (ADHD) are two neurodevelopmental conditions that, while distinct,…
The year 2024 has seen a significant Listeria outbreak, raising alarms across several countries. Listeria…
Metabolic conditioning (MetCon) has gained significant attention in the fitness world, often hailed as a…