The Physcomitrella patens chromosome-scale assembly reveals moss genome structure and evolution
Daniel Lang
(1)
,
Kristian Ullrich
,
Florent Murat
(2)
,
Jörg Fuchs
,
Jerry Jenkins
(3)
,
Fabian Haas
,
Carl Li
,
Guillaume Blanc
(4)
,
Heidrun H. Gundlach
(5)
,
Michiel van Bel
,
Rabea Meyberg
,
Cristina Vives
(6)
,
Jordi Morata
,
Aikaterini Symeonidi
(7)
,
Manuel Hiss
,
Wellington Muchero
,
Lee Kamisugi
,
Omar A. Saleh
(8)
,
Eva Decker
,
Nico van Gessel
,
Jane Grimwood
(9)
,
Richard Hayes
(10)
,
Sean Graham
,
Lee Gunter
,
Daniel Mcdaniel
,
Sebastian N.W. Hoernstein
,
Anders Larsson
(11)
,
Fay-Wei Li
,
Pierre-François Perroud
,
Jeremy Phillips
,
Priya Ranjan
(12)
,
Daniel Rokshar
,
Carl Rothfels
,
Lucas Schneider
,
Shengqiang Shu
,
Dennis Stevenson
,
Fritz Thümmler
,
Michael Tillich
,
Juan Villarreal Aguilar
,
Thomas Widiez
(13)
,
Gane Ka-Shu Wong
(14)
,
Ann Wymore
,
Yong Zhang
(15)
,
Andreas Zimmer
(16)
,
Ralph Quatrano
,
Klaus F.X. Mayer
,
David Goodstein
(9)
,
Josep Casacuberta
,
Klaas Vandepoele
(17)
,
Ralf Reski
(18)
,
Andrew Cuming
,
Gerald Tuskan
(12)
,
Florian Maumus
(19)
,
Jérôme Salse
(2)
,
Jeremy Schmutz
(3)
,
Stefan Rensing
1
Fakultät für Biologie = Faculty of Biology [Freiburg]
2 GDEC - Génétique Diversité et Ecophysiologie des Céréales
3 United States Department of Energy
4 MIO - Institut méditerranéen d'océanologie
5 Inst Bioinformat & Syst Biol, Munich Informat Ctr Prot Sequences
6 Center for Research in Agricultural Genomics
7 Freiburg Initiative in Systems Biology
8 LPS - Laboratoire de Physique Statistique de l'ENS
9 DOE - Department of Energy / Joint Genome Institute
10 LSHTM - London School of Hygiene and Tropical Medicine
11 LUT - Luleå University of Technology
12 BioSciences Division [Oak Ridge]
13 RDP - Reproduction et développement des plantes
14 Department of Biological Sciences [Edmonton]
15 WPI - Wolfgang Pauli Institute
16 Department of Molecular Psychiatry
17 PSB Center - Center for Plant Systems Biology
18 Plant Biotechnology, Faculty of Biology
19 URGI - Unité de Recherche Génomique Info
2 GDEC - Génétique Diversité et Ecophysiologie des Céréales
3 United States Department of Energy
4 MIO - Institut méditerranéen d'océanologie
5 Inst Bioinformat & Syst Biol, Munich Informat Ctr Prot Sequences
6 Center for Research in Agricultural Genomics
7 Freiburg Initiative in Systems Biology
8 LPS - Laboratoire de Physique Statistique de l'ENS
9 DOE - Department of Energy / Joint Genome Institute
10 LSHTM - London School of Hygiene and Tropical Medicine
11 LUT - Luleå University of Technology
12 BioSciences Division [Oak Ridge]
13 RDP - Reproduction et développement des plantes
14 Department of Biological Sciences [Edmonton]
15 WPI - Wolfgang Pauli Institute
16 Department of Molecular Psychiatry
17 PSB Center - Center for Plant Systems Biology
18 Plant Biotechnology, Faculty of Biology
19 URGI - Unité de Recherche Génomique Info
Kristian Ullrich
- Function : Author
Florent Murat
- Function : Author
- PersonId : 751486
- IdHAL : florent-murat
- ORCID : 0000-0003-2116-2511
Jörg Fuchs
- Function : Author
Fabian Haas
- Function : Author
- PersonId : 791049
- ORCID : 0000-0002-7711-5282
Carl Li
- Function : Author
Guillaume Blanc
- Function : Author
- PersonId : 18754
- IdHAL : guillaume-blanc
- ORCID : 0000-0001-5728-1104
- IdRef : 197897657
Michiel van Bel
- Function : Author
- PersonId : 791050
- ORCID : 0000-0002-1873-2563
Rabea Meyberg
- Function : Author
- PersonId : 791051
- ORCID : 0000-0002-9977-4000
Jordi Morata
- Function : Author
Manuel Hiss
- Function : Author
Wellington Muchero
- Function : Author
Lee Kamisugi
- Function : Author
Eva Decker
- Function : Author
Nico van Gessel
- Function : Author
Richard Hayes
- Function : Author
- PersonId : 763647
- ORCID : 0000-0002-5236-7918
Sean Graham
- Function : Author
Lee Gunter
- Function : Author
Daniel Mcdaniel
- Function : Author
Sebastian N.W. Hoernstein
- Function : Author
Anders Larsson
- Function : Author
- PersonId : 767245
- ORCID : 0000-0003-3161-0402
Fay-Wei Li
- Function : Author
- PersonId : 791052
- ORCID : 0000-0002-0076-0152
Pierre-François Perroud
- Function : Author
- PersonId : 753801
- IdHAL : pierre-francois-perroud
- ORCID : 0000-0001-7607-3618
- IdRef : 135385547
Jeremy Phillips
- Function : Author
Daniel Rokshar
- Function : Author
Carl Rothfels
- Function : Author
- PersonId : 791053
- ORCID : 0000-0002-6605-1770
Lucas Schneider
- Function : Author
Shengqiang Shu
- Function : Author
Dennis Stevenson
- Function : Author
Fritz Thümmler
- Function : Author
Michael Tillich
- Function : Author
Juan Villarreal Aguilar
- Function : Author
Thomas Widiez
- Function : Author
- PersonId : 736521
- IdHAL : thomas-widiez
- ORCID : 0000-0001-6002-2306
- IdRef : 113381700
Gane Ka-Shu Wong
- Function : Author
- PersonId : 770693
- ORCID : 0000-0001-6108-5560
Ann Wymore
- Function : Author
Ralph Quatrano
- Function : Author
Klaus F.X. Mayer
- Function : Author
Josep Casacuberta
- Function : Author
- PersonId : 775097
- ORCID : 0000-0002-5609-4152
Klaas Vandepoele
- Function : Author
- PersonId : 769004
- ORCID : 0000-0003-4790-2725
Andrew Cuming
- Function : Author
Florian Maumus
- Function : Author
- PersonId : 747511
- IdHAL : florian-maumus
- ORCID : 0000-0001-7325-0527
- IdRef : 187116962
Jérôme Salse
- Function : Author
- PersonId : 1147737
- IdHAL : jerome-salse
- ORCID : 0000-0003-2942-1098
Stefan Rensing
- Function : Author
- PersonId : 791054
- ORCID : 0000-0002-0225-873X
Abstract
The draft genome of the moss model, Physcomitrella patens, comprised approximately 2000 unordered scaffolds. In order to enable analyses of genome structure and evolution we generated a chromosome-scale genome assembly using genetic linkage as well as (end) sequencing of long DNA fragments. We find that 57% of the genome comprises transposable elements (TEs), some of which may be actively transposing during the life cycle. Unlike in flowering plant genomes, gene- and TE-rich regions show an overall even distribution along the chromosomes. However, the chromosomes are mono-centric with peaks of a class of Copia elements potentially coinciding with centromeres. Gene body methylation is evident in 5.7% of the protein-coding genes, typically coinciding with low GC and low expression. Some giant virus insertions are transcriptionally active and might protect gametes from viral infection via siRNA mediated silencing. Structure-based detection methods show that the genome evolved via two rounds of whole genome duplications (WGDs), apparently common in mosses but not in liverworts and hornworts. Several hundred genes are present in colinear regions conserved since the last common ancestor of plants. These syntenic regions are enriched for functions related to plant-specific cell growth and tissue organization. The P. patens genome lacks the TE-rich pericentromeric and gene-rich distal regions typical for most flowering plant genomes. More non-seed plant genomes are needed to unravel how plant genomes evolve, and to understand whether the P. patens genome structure is typical for mosses or bryophytes.