SpiderDec, the decomposed version of the Spider dev data set

DOI:10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future. For a link that will always point to the latest version, please use
DOI: 10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8
Datacite citation style:
Salimzadeh, Sara; Gadiraju, Ujwal; Hauff, Claudia; Arie Van Deursen (2024): SpiderDec, the decomposed version of the Spider dev data set. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite

Dataset

SpiderDec is an extension of the Spider Dataset. The original Spider dataset split the data into training, development, and a hidden test set. For this new dataset, we manually decomposed the questions and corresponding queries within the development set of the Spider dataset, focusing on those with hard and extra hard SQL queries. The result of this effort is the creation of SpiderDec.

History

  • 2024-07-01 first online, published, posted

Publisher

4TU.ResearchData

Format

SQL file

Funding

Organizations

TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Sciences, Department of Software Technology
AI for Fintech Research Lab at ING Group

DATA

Files (1)