Data underlying chapter 4 of PhD thesis: Building the genome of a minimal synthetic cell
DOI: 10.4121/ad21c652-ad75-4a99-a09a-46c7d8f383d6
Datacite citation style
Dataset
Licence CC BY-NC 4.0
This dataset belongs to the PhD thesis of Céline Cleij titled "Building the genome of a minimal synthetic cell".
Specifically, the dataset belongs to Chapter 4 titled "De novo design and assembly of minimal genomes for the synthetic cell".
Authors: Céline Cleij, Pascale Daran-Lapujade, Christophe Danelon
Corresponding authors: Pascale Daran-Lapujade and Christophe Danelon
Contact information: [email protected] and [email protected]
This dataset contains data collected during experiments as part of Céline Cleij's PhD project. The data was collected from 2023-2025.
All data processing and analysis steps are described in detail in the Methods section of thesis chapter 4.
Designed SynMG sequences (GenBank) were prepared with the SnapGene software, using the plasmid maps of the sequenced template plasmids and the designed primer sequences.
Raw Nanopore sequencing reads (fastq) were obtained by Plasmidsaurus (Eugene, OR, USA) using Nanopore sequencing technology.
Consensus SynMG sequences (GenBank) were obtained by Plasmidsaurus after processing of the raw reads, and were manually annotated in SnapGene.
The overview of relevant mutations ("Relevant mutations in SynMG variants") was prepared in Excel, based on mutations in consensus sequences and raw reads obtained from sequencing.
LC-MS data was obtained after processing in the Mascot software.
The data is grouped into seven files:
i) Designed SynMG sequences. Files are named after the SynMG version (SynMG1 or SynMG2).
ii) S. cerevisiae - Raw Nanopore sequencing reads. Files are named after the yeast strain from which total DNA was extracted, and after the SynMG variant which was assembled in this strain.
iii) S. cerevisiae - Consensus SynMG sequences. Files are named after the yeast strain from which total DNA was extracted, and after the SynMG variant which was assembled in this strain.
iv) E. coli - Raw Nanopore sequencing reads. Files are named after the E. coli strain from which SynChr DNA was extracted, and after the SynMG variant which was amplified in this strain.
v) E. coli - Consensus SynMG sequences. Files are named after the E. coli strain from which SynChr DNA was extracted, and after the SynMG variant which was amplified in this strain.
vi) Relevant mutations in SynMG variants. This Excel file contains an overview of all relevant mutations SynMG1.1, SynMG1.2, SynMG1.3, SynMG2.1 and SynMG2.2 compared to the designed maps.
vii) LC-MS data. This Excel file contains LC-MS data used for making Figure 4A, B & D.
History
- 2025-03-27 first online, published, posted
Publisher
4TU.ResearchDataFormat
Raw Nanopore sequencing reads/fastq; Consensus SynMG sequences/GenBank; Designed SynMG sequences/GenBank; Relevant mutations in SynMG variants/Excel; LC-MS data/ExcelFunding
- ANR grant (grant code ANR-22-CPJ2-0091-01) Agence Nationale de la Recherche
- BaSyC – Building a Synthetic Cell Gravitation grant (grant code 024.003.019) NWO
Organizations
Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology;Faculty of Applied Sciences, Department of Biotechnology, Delft University of Technology;
Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA
DATA
Files (8)
- 3,044 bytesMD5:
9f3efd3ea2278aa3d974adf244f67d95
README.txt - 808,861 bytesMD5:
ef1311e8fa34c3c71af4a44e1a74bc1f
Designed SynMG sequences.zip - 109,937 bytesMD5:
317f674a5eaa70f2921920d0c635535c
E. coli - Consensus SynMG sequences.zip - 21,251,665 bytesMD5:
e83e2b73c2f63fa68dcde68975e01242
E. coli - Raw Nanopore sequencing reads.zip - 29,107 bytesMD5:
fab7050d8210b6555ddb921b7656fc2e
LC-MS data.xlsx - 27,063 bytesMD5:
0068a087fb79e55ab0b050c17797e74f
Relevant mutations in SynMG variants.xlsx - 151,274 bytesMD5:
17aefa70cbba0087c2e22cb121d9857b
S. cerevisiae - Consensus SynMG sequences.zip - 3,569,351,037 bytesMD5:
06ceff09aaa30df99fb75d2803849942
S. cerevisiae - Raw Nanopore sequencing reads.zip -
download all files (zip)
3,591,731,988 bytes unzipped