Subscribe to Newsletter
Diagnostics Genetics and epigenetics, Precision medicine, Guidelines and recommendations

Reference Genome Refresh

The use of whole-genome sequencing to diagnose rare disease is on the rise. In such cases, an individual’s genetic map is compared with the reference genome – a “standard” version, if you will. Although many of the differences between the two are not significant, some divergent sequences can be the drivers of rare disease. However, new evidence uncovered by a team from the Karolinska Institutet in Sweden indicates that the human reference genome might be outdated – at least in regard to its diversity (1). We spoke to Jesper Eisfeldt, a doctoral student in the Rare Diseases research group and first author of the paper, to find out more.

Tell us about your discovery…

We reanalyzed 1,000 previously sequenced Swedish genomes (2) to isolate the regions that are not present in the reference genome. We discovered over 61,000 novel sequences – a volume equivalent to approximately one whole chromosome – affecting over 80 genes, 12 of which are linked to various diseases. After comparing these novel sequences to the genome library for chimpanzees, Icelanders, and the African population, we discovered that they are highly conserved and widely distributed in the human population. This showed us that the human genome is more heterogeneous than previously thought and, as a result, we need to update our reference genome.

What is the potential impact of this lack of diversity in the reference genome?

Sequences missing from the reference genome could be of clinical relevance. For example, they could contain regulatory elements. In some cases, we are unable to fully understand large variants partly located in these missing or wrongly positioned sequences. When comparing our reads to the reference genome, we are at risk of positioning the missing sequences in the wrong place, producing noise and detecting false-positive variation. A more complete reference genome will give us cleaner and more reliable results.

How can the reference genome be changed?

The current reference genome, GRCh38, was made available in 2013. Although continuously improved (the latest patch was released in March 2019), it is based on previous versions of the human genome and largely produced though hierarchical shotgun sequencing. We believe that a graph-based reference genome is the way forward, because graphs can be used to represent multiple variants of the same sequence. In the current reference genome, which is linear and haploid, it is tricky to represent complex and diverse variation.

Receive content, products, events as well as relevant industry updates from The Pathologist and its sponsors.
Stay up to date with our other newsletters and sponsors information, tailored specifically to the fields you are interested in

When you click “Subscribe” we will email you a link, which you must click to verify the email address above and activate your subscription. If you do not receive this email, please contact us at [email protected].
If you wish to unsubscribe, you can update your preferences at any point.

  1. J Eisfeldt et al., “Discovery of novel sequences in 1,000 Swedish genomes”, Mol Biol Evol, [Epub ahead of print] (2019). PMID: 31560401.
  2. A Ameur et al., “SweGen: a whole-genome data resource of genetic variability in a cross-section of the Swedish population”, Eur J Hum Genet, 25, 1253 (2017). PMID: 28832569.
About the Author
Luke Turner

While completing my undergraduate degree in Biology, I soon discovered that my passion and strength was for writing about science rather than working in the lab. My master’s degree in Science Communication allowed me to develop my science writing skills and I was lucky enough to come to Texere Publishing straight from University. Here I am given the opportunity to write about cutting edge research and engage with leading scientists, while also being part of a fantastic team!

Related Application Notes
Evaluation of cell-free fetal DNA to determine fetal RhD status

| Contributed by Revvity

Preventing Bias in scRNAseq Performed on Solid Tumors

| Contributed by Revvity

Enabling Efficient, Cost-effective Sequencing of the Human Whole Exome

| Contributed by Revvity

Related Product Profile
Diagnostics Genetics and epigenetics
QIAseq® Pan Cancer Multimodal cuts user interventions by 50%

| Contributed by QIAGEN

Register to The Pathologist

Register to access our FREE online portfolio, request the magazine in print and manage your preferences.

You will benefit from:
  • Unlimited access to ALL articles
  • News, interviews & opinions from leading industry experts
  • Receive print (and PDF) copies of The Pathologist magazine

Register