SpiderDec, the decomposed version of the Spider dev data set
DOI:10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future.
For a link that will always point to the latest version, please use
DOI: 10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8
DOI: 10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8
Datacite citation style:
Salimzadeh, Sara; Gadiraju, Ujwal; Hauff, Claudia; Arie Van Deursen (2024): SpiderDec, the decomposed version of the Spider dev data set. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/66ab9ab1-a08a-4c63-bd92-d11e2c3c06f8.v1
Other citation styles (APA, Harvard, MLA, Vancouver, Chicago, IEEE) available at Datacite
Dataset
Usage statistics
49
views
13
downloads
Licence Apache-2.0
SpiderDec is an extension of the Spider Dataset. The original Spider dataset split the data into training, development, and a hidden test set. For this new dataset, we manually decomposed the questions and corresponding queries within the development set of the Spider dataset, focusing on those with hard and extra hard SQL queries. The result of this effort is the creation of SpiderDec.
History
- 2024-07-01 first online, published, posted
Publisher
4TU.ResearchDataFormat
SQL fileAssociated peer-reviewed publication
Exploring the Feasibility of Crowd-Powered Decomposition of Complex User Questions in Text-to-SQL TasksFunding
Organizations
TU Delft, Faculty of Electrical Engineering, Mathematics and Computer Sciences, Department of Software TechnologyAI for Fintech Research Lab at ING Group
DATA
Files (1)
- 39,986 bytesMD5:
9f4474cdd23cdb956fa0e0392cc43e27
SpiderDec.zip