Data underlying the publication: Lactuca super-pangenome reduces bias towards reference genes in lettuce research
doi: 10.4121/c7935d6a-d6ae-42e7-af7e-0ae8cddf70d7
Supplementary data belonging to "Lactuca super-pangenome reduces bias towards reference genes in lettuce research". In order to get an overview of the gene content of the genus Lactuca, we used WGS data of 474 accessions beloning to L. sativa, L. serriola, L. saligna and L. virosa for the construction of a linear pangenome per species. This linear pangenome was built using the assemble-and-iteratively-add approach. Once constructed, presence-absence variation (PAV) and copy-number variation (CNV) were calculated from the WGS data on the linear pangenomes. The PAV data was integrated across species into a Lactuca wide table that contains the variation for each of the 474 accessions for all genes in the super-pangenome. This super-pangenome resource was then used for functional characterisation of the core and variable genes, and a phylogeny of all accessions. Finally, we used the L. sativa PAV data to show its complementary and benefits in GWAS over SNPs. All data underlying these analyses is bundled together in one tarball including README.
- 2024-06-06 first online, published, posted
- TTW Perspective programme "LettuceKnow" (grant code P19-17) [more info...] The Dutch Research Council (NWO) and the breeding companies BASF, Bejo Zaden B.V., Limagrain, Enza Zaden Research & Development B.V., Rijk Zwaan Breeding B.V., Syngenta Seeds B.V., and Takii and Company Ltd.
Theoretical Biology & Bioinformatics, Utrecht University
Translational Plant Biology, Utrecht University
Biosystematics Group, Wageningen University & Research
Department of Genetics, University Medical Center Utrecht
DATA
- 2,097 bytesMD5:
c1091719192c30e39879e331ba9b4002
README.txt - 7,869,281,610 bytesMD5:
b593a076a9817c3d0dde625811752ae3
20240605_supplementary_data.tar.gz -
download all files (zip)
7,869,283,707 bytes unzipped