Whole genome sequencing

Last updated on: 17.10.2025

Dieser Artikel auf Deutsch

Requires free registration (medical professionals only)

Please login to access all articles, images, and functions.

Our content is available exclusively to medical professionals. If you have already registered, please login. If you haven't, you can register for free (medical professionals only).


Requires free registration (medical professionals only)

Please complete your registration to access all articles and images.

To gain access, you must complete your registration. You either haven't confirmed your e-mail address or we still need proof that you are a member of the medical profession.

Finish your registration now

DefinitionThis section has been translated automatically.

Whole genome sequencing (WGS) is a method for comprehensively analyzing the entire genome of an organism using next-generation sequencing. WGS is useful in various fields of research, including agricultural genomics, environmental sciences, cancer research, microbiology, developmental biology, etc. WGS thus represents a very comprehensive genetic analysis, which is mainly used in the research of rare diseases, diseases of unknown origin and tumors, as well as in the decoding of the entire genome (approx. 3 billion base pairs in humans). The gene sample is compared with a reference genome in order to identify genomic variants. Although it is estimated that 85% of disease-causing mutations are localized in the coding DNA segments, mutations and copy number variations (CNVs) outside these gene segments can also be detected by WGS.

General informationThis section has been translated automatically.

The introduction of NGS sequencing (Next Generation Sequencing) has made it possible to analyze the entire human genome (Whole Genome Sequencing, WGS) within a reasonable time frame. NGS is a technological advancement of classic Sanger sequencing. Whereas it used to be possible to read one DNA molecule at a time, NGS allows millions of different DNA segments to be read in parallel in a single analysis.

The entire nucleotide sequences (approx. 3 billion DNA building blocks) of all 23 chromosome pairs are sequenced (cut into small pieces), adapters are added to the ends of the fragments so that they can be attached to the sequencing platform. The fragments are amplified. The sequencing machine reads the base sequence (ATCG) of each fragment. A computer reassembles the many fragments into a virtual overall picture. They can now be analyzed bioinformatically. Only a small fraction of the entire genome codes for proteins (the so-called exons) - 98% of the genome consists of non-coding DNA and includes introns, regulatory sequences and various repetitive sequences (see figure).

In principle, linking genomic data with phenotypic and clinical data offers "great opportunities for improving our understanding of complex diseases in particular and also for developing new therapeutic approaches". Genome databases are an important building block for identifying disease-causing mutations. According to recent studies, WGS can be used to find causal mutations in known disease-associated genes in around half of all patients with suspected genetic developmental disorders. To date, mutations have been identified in more than 4,000 of the 20,000 protein-coding human genes. In particular, genome, transcriptome and epigenome analyses are possible.

Orphan Diseases: Whole genome sequencing (WGS) is becoming the preferred method for the molecular genetic diagnosis of rare and unknown diseases and for the identification of relevant causes of cancer. Compared to other molecular genetic methods, WGS captures most genomic variation and eliminates the need for sequential genetic testing (Bagger FO et al. 2024).

Bacteriology: The complete genome sequence of a bacterial pathogen was first successfully determined in 1995. The complete nucleotide sequence of Haemophilus influenzae Rd was determined using the chain termination method developed by Sanger et al. in 1977. However, this technology is labor-intensive, expensive and time-consuming. Since 2004, high-throughput next-generation sequencing technologies have been developed that enable highly efficient, time-saving and cost-effective whole genome sequencing (WGS) of all organisms, including bacterial pathogens. WGS technologies are used to identify bacterial species, strains and genotypes from cultured organisms and directly from clinical samples (Mustafa AS 2024. WGS has also contributed to the determination of antibiotic resistance through the detection of antimicrobial resistance genes and point mutations.

  • AMR: NGS is now an integral part of antimicrobial resistance surveillance (AMR) and the most widely used platform for whole genome sequencing (WGS). The high throughput, relatively low cost, high discriminatory power and fast turnaround time of WGS compared to classical biochemical methods mean that the technology is likely to remain a fundamental tool in AMR surveillance and for public health (Sørensen LH et al. 2021).

Note(s)This section has been translated automatically.

Some laboratories integrate internal NGS automation platforms e.g. Falcon. This ensures seamless service workflows and remarkably high throughput. With automated and streamlined workflows and high-performance computing clusters, it is now guaranteed that data is delivered in a timely manner and without compromising quality, regardless of project size. The most modern sequencing platforms are Illumina , PacBio and Oxford Nanopore . The spatial transcriptomics service is supported by Visium CytAssist from 10X Genomics . All labs benefit from the latest platforms. Researchers can be offered the opportunity to carry out their projects using different sequencing strategies depending on the research application. The cost of Whole Genome Sequencing (WGS) in 2025 is expected to be around 200 to 300 US dollars per genome for the process itself, with advances in sequencing technology (e.g. from Illumina and Ultima Genomics) further reducing this cost. Prices can vary depending on the provider and additional services such as reporting or long-term access, with direct providers such as tellmeGen or Nebula Genomics offering similar prices. Low-pass whole genome sequencing (also known as lpWGS or Shallow Whole Genome Sequencing ) is a cost-effective method for large cohorts to detect genetic variation within the genomes of different species by imputation.

The latest generation of NGS technology is RNA sequencing (RNA-Seq). In its simplest form, RNA-Seq enables the determination of RNA molecules in a sample at the time of sampling.

LiteratureThis section has been translated automatically.

  1. Bagger FO et al. (2024) Whole genome sequencing in clinical practice. BMC Med Genomics 17:39.
  2. Marinova-Bulgaranova TV et al. (2025) Current Methods for Reliable Identification of Species in the Acinetobacter calcoaceticus-Acinetobacter baumannii Complex. Microorganisms 13:1819.
  3. Mustafa AS (2024) Whole Genome Sequencing: Applications in Clinical Bacteriology. Med Princ Pract. 33:185-197.
  4. Sørensen LH et al. (2021) Whole-genome sequencing for antimicrobial surveillance: species-specific quality thresholds and data evaluation from the network of the European Union Reference Laboratory for Antimicrobial Resistance genomic proficiency tests of 2021 and 2022. mSystems 9:e0016024.

Last updated on: 17.10.2025