Process Discovery Contest 2023
doi:10.4121/afd6f608-469e-48f9-977d-875b45840d39.v1
The doi above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
doi: 10.4121/afd6f608-469e-48f9-977d-875b45840d39
doi: 10.4121/afd6f608-469e-48f9-977d-875b45840d39
Datacite citation style:
Eric Verbeek; Verbeek, H.M.W. (Eric) (2023): Process Discovery Contest 2023. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/afd6f608-469e-48f9-977d-875b45840d39.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
usage stats
1981
views
1395
downloads
licence
CC0
This data set contains the data set as was used for the Process Discovery Contest of 2023 (PDC 2023).
The data set contains 384 training logs, 96 corresponding test logs and base logs, 96 corresponding
ground truth logs, and 96 models. The logs are all stored using the IEEE XES file format (see either
https://www.xes-standard.org/ or https://ieeexplore.ieee.org/document/7740858), while the models are
workflow nets (a subclass of Petri nets) stored in the PNML fileformat (see
https://www.iso.org/obp/ui/#iso:std:iso-iec:15909:-2:ed-1:v1:en).
history
- 2023-10-04 first online, published, posted
publisher
4TU.ResearchData
format
IEEE XES, ISO PNML
organizations
Task Force on Process Mining (https://tf-pm.org)
DATA
files (6)
- 3,217 bytesMD5:
a3f4c604bd3767f9ded0c1624de8c487
readme.txt - 4,372,016 bytesMD5:
e50ca7e37445347320bc82b4fa2064bd
Base Logs.zip - 4,532,918 bytesMD5:
7998c9286c93892b953dfdd4b94f7ddf
Ground Truth Logs.zip - 576,945 bytesMD5:
2b281aceff385fe8635ab86b3dc61a15
Models.zip - 4,371,117 bytesMD5:
8f7012e7ded6f620344373af392d14cf
Test Logs.zip - 17,149,317 bytesMD5:
70cf660efcdd6d4ba32dc6878b93be2b
Training Logs.zip -
download all files (zip)
31,005,530 bytes unzipped