Epistemic Bellman Operators; code underlying the dissertation ‘Bayesian Model-Free Deep Reinforcement Learning’
DOI:10.4121/5ce26dac-4323-4c5d-9fe3-157dc17bed12.v1
The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
DOI: 10.4121/5ce26dac-4323-4c5d-9fe3-157dc17bed12
DOI: 10.4121/5ce26dac-4323-4c5d-9fe3-157dc17bed12
Datacite citation style
van der Vaart, Pascal (2025): Epistemic Bellman Operators; code underlying the dissertation ‘Bayesian Model-Free Deep Reinforcement Learning’. Version 1. 4TU.ResearchData. software. https://doi.org/10.4121/5ce26dac-4323-4c5d-9fe3-157dc17bed12.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Software
Licence MIT
Interoperability
Code underlying dissertation work "Bayesian Model-Free Deep Reinforcement Learning". This data set contains code for reproducing the experiments conducted in the chapter "Epistemic Bellman Operators", with the goal of validating the mathematical theory in the chapter. The code has no applications other than supporting the mathematical theory.
History
- 2025-08-11 first online, published, posted
Publisher
4TU.ResearchDataFormat
.py, .ipynbAssociated peer-reviewed publication
Epistemic Bellman OperatorsCode hosting project url
https://github.com/Pascal314/epistemic-bellman-operatorsOrganizations
TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Science, Department of Intelligent SystemsTo access the source code, use the following command:
git clone https://data.4tu.nl/v3/datasets/253d2933-5712-48d8-ae79-6835dbdd8bd0.git "epistemic-bellman-operators"