AI_responses_rating

dataset

posted on 2025-04-08, 10:08, authored by Domenico Trezza

This dataset contains the results of an empirical evaluation of 1,200 AI-generated responses to 400 prompt questions related to public service contexts. The responses were assessed by 33 professionals from five institutional categories (adult education, regional policy, local welfare, union representatives, and program management). Responses were generated at three temperature settings, and each was rated for understandability and accuracy. The dataset includes both question-level metadata and user-assigned evaluation scores, and it supports the analysis presented in the paper "Intelligence at Different Temperatures: Experimenting with AI Response Quality in Public Services."
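As a rough illustration of how the evaluation scores might be summarized, the sketch below averages understandability and accuracy ratings per temperature setting. The row layout, the temperature values, the rating scale, and the function name `mean_scores_by_temperature` are all assumptions made for this example. They are not taken from the dataset files, and the rows are placeholders rather than real ratings.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical rows: (question_id, temperature, understandability, accuracy).
# All values are placeholders for illustration, not taken from the dataset.
ratings = [
    ("q1", 0.0, 4, 5),
    ("q1", 0.5, 4, 4),
    ("q1", 1.0, 3, 2),
    ("q2", 0.0, 5, 4),
    ("q2", 0.5, 3, 4),
    ("q2", 1.0, 2, 3),
]


def mean_scores_by_temperature(rows):
    """Return {temperature: (mean understandability, mean accuracy)}."""
    grouped = defaultdict(list)
    for _question_id, temperature, understandability, accuracy in rows:
        grouped[temperature].append((understandability, accuracy))
    # Sort by temperature so the summary reads from lowest to highest setting.
    return {
        temperature: (
            mean(u for u, _ in scores),
            mean(a for _, a in scores),
        )
        for temperature, scores in sorted(grouped.items())
    }


summary = mean_scores_by_temperature(ratings)
for temperature, (understandability, accuracy) in summary.items():
    print(temperature, understandability, accuracy)
```

With the actual files, the placeholder list would be replaced by rows read from the dataset, using whatever column names the files define.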
