%0 Generic %A Lieftinck, Martijn %A Verlaan, Timo %A Reinders, Marcel %D 2025 %T Data underlying the publication: "GONNECT: Coupling Biological Systems to Neural Networks for Improved Model Interpretability" %U %R 10.4121/0d78788b-6bd7-4941-a942-245309107b6d.v1 %K GONNECT %K TCGA %K GO %K Gene Ontology %K Gene Expression %K Machine learning %K Deep learning %K BINNs %K Biologically-informed Neural Networks %K Autoencoders %X
This dataset contains all processed data required to reproduce the results of the GONNECT paper. In this work, we couple the structure of a neural network model to biological prior information, to gain interpretable activations in the neural network's hidden layers (see preprint/publication for more information). The data presented here includes processed gene expression data from The Cancer Genome Atlas (TCGA) data that is used as input data for the model, and both the raw and processed Gene Ontology (GO, https://geneontology.org/) knowledge base, from which the structure of the neural networks in this study is derived. There are also some miscellaneous files used to link genes to proteins. All data used in this study is publically available.
Code corresponding to the paper can be found here: https://github.com/DelftBioinformaticsLab/GONNECT
%I 4TU.ResearchData