TY - DATA T1 - Survey data underlying the MSc thesis: "Generative AI: Investigating Consistency and Neutrality in Multilingual Outputs" PY - 2025/05/20 AU - Ahmed Ibrahim UR - DO - 10.4121/e058cc9d-7ca8-408f-9233-79ea0bd3953f.v1 KW - GenAI KW - Consistency KW - Arabic KW - English N2 -
This dataset contains responses from an online survey designed to evaluate how consistently and neutrally ChatGPT’s English and Arabic answers align across ten prompts (seven politically sensitive, three non-sensitive). Each row captures one participant’s ratings of sentiment and factual consistency between the two language outputs, neutrality scores for each response and the prompt itself, and optional comments. The data were collected via Qualtrics from English- and Arabic-fluent respondents who compared side-by-side model answers, providing quantitative Likert-scale ratings to assess multilingual consistency and neutrality of Generative AI output in a human evaluation study.
ER -