Benchmarking logs to test scalability of process discovery algorithms

DataCite citation style:
van der Aalst, Wil (2017): Benchmarking logs to test scalability of process discovery algorithms. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/uuid:1cc41f8a-3557-499a-8b34-880c1251bd6e

Dataset

The included event logs are intended to support the evaluation of the performance of process discovery algorithms. The largest event logs in this data set contain millions of events. If you need even larger datasets, you can generate them yourself using the included CPN Tools source files (*.cpn). Each file has two parameters: nofcases (the number of process instances) and nofdupl (the number of times a process is replicated with unique new names).

History

  • 2017-10-12 first online, published, posted

Publisher

Eindhoven University of Technology

Format

media types: application/pdf, application/vnd.openxmlformats-officedocument.wordprocessingml.document, application/zip, text/csv, text/plain, text/xml

Organizations

Eindhoven University of Technology, Department of Mathematics and Computer Science, Data Science Centre Eindhoven

DATA

Files (1)

  • data.zip (155,001,566 bytes, MD5: a0ceda17a699af1732b8bdaa40e80827)
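
Because the archive is large, it may be worth checking a download against the size and MD5 checksum listed above before unpacking it. The following is a minimal sketch in Python, not part of the dataset itself. The function names md5_of_file and verify_download are hypothetical, and the sketch assumes the archive was saved locally as data.zip.

```python
import hashlib
import os

# Values copied from the file listing above.
EXPECTED_MD5 = "a0ceda17a699af1732b8bdaa40e80827"
EXPECTED_SIZE = 155_001_566  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read 1 MiB at a time so the whole archive is never held in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True if the file matches both the expected size and MD5."""
    if os.path.getsize(path) != expected_size:
        return False
    return md5_of_file(path) == expected_md5


if __name__ == "__main__":
    # Assumed local file name; adjust to wherever the archive was saved.
    archive = "data.zip"
    if os.path.exists(archive):
        print("checksum OK" if verify_download(archive) else "checksum MISMATCH")
```

The size check runs first because it is cheap and catches truncated downloads without hashing the full file.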