Detecting neglectful respondents in a Computer-Aided Web Interview (CAWI)

15th INSEE Statistics Methodology Days (JMS 2025)

 

By Diane MAILLOT-TCHOFO, Fabienne LE SAGER and Louis MAREC, from Médiamétrie's Data & Methods Department, and Tom DEVYNCK, Médiamétrie and Toulouse School of Economics

 

The most familiar observational error in surveys stems from respondents’ inability or unwillingness to provide the correct answer.

In this context, the need to estimate digital device ownership (e.g. TV, smartphone) accurately led Médiamétrie to develop a hybrid method for detecting neglectful respondents in a Computer-Aided Web Interview (CAWI).

Drawing on the literature (Laura Gamble, 2023; Anvita Mahajan, 2023), we derived a hybrid method combining two approaches. The first approach uses the questionnaire’s completion times. The second approach is a two-step clustering algorithm focused on the ownership of digital equipment. In its first step, K-Means clustering was applied to the socio-demographic characteristics of respondents’ households.

Machine learning models were then applied within each cluster to limit the models’ shortcomings. The final list of neglectful respondents was obtained by combining the results of the two approaches.
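
To make the overall logic concrete, here is a minimal Python sketch of how two such flags could be computed and combined. It is an illustration under stated assumptions, not the method described in the full paper: the half-median speed cutoff, the toy data, the 75% majority rule used in place of the per-cluster machine learning models, and the union used to combine the two lists are all hypothetical choices, as are the function names.

```python
from statistics import median

def flag_speeders(times, ratio=0.5):
    """Flag respondents whose completion time is below ratio * median (assumed rule)."""
    cutoff = ratio * median(times.values())
    return {rid for rid, t in times.items() if t < cutoff}

def kmeans(points, k, iters=20):
    """Plain Lloyd's algorithm; the first k points serve as initial centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest centroid (squared Euclidean distance).
        labels = [
            min(range(k), key=lambda c: sum((x - y) ** 2 for x, y in zip(p, centroids[c])))
            for p in points
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def flag_atypical_owners(ids, labels, owns, min_share=0.75):
    """Stand-in for the per-cluster models: flag answers that contradict
    a strong within-cluster majority (hypothetical rule)."""
    flagged = set()
    for c in set(labels):
        members = [i for i, lab in zip(ids, labels) if lab == c]
        share = sum(owns[i] for i in members) / len(members)
        if share >= min_share:
            flagged |= {i for i in members if not owns[i]}
        elif share <= 1 - min_share:
            flagged |= {i for i in members if owns[i]}
    return flagged

# Toy data (invented): household size and age of reference person / 10.
ids = ["r1", "r2", "r3", "r4", "r5", "r6", "r7", "r8", "r9", "r10"]
features = [(1, 7.0), (4, 3.5), (1, 6.5), (2, 7.2), (5, 3.0),
            (4, 4.0), (1, 6.8), (5, 3.8), (2, 6.0), (4, 3.2)]
seconds = [600, 540, 150, 660, 580, 620, 700, 170, 590, 610]
smartphone = [False, True, False, False, True, True, False, False, True, True]

times = dict(zip(ids, seconds))
owns = dict(zip(ids, smartphone))

speeders = flag_speeders(times)                     # approach 1: completion times
labels = kmeans(features, k=2)                      # approach 2, step 1: clustering
atypical = flag_atypical_owners(ids, labels, owns)  # approach 2, step 2: per cluster

# Combination rule is an assumption: a union keeps anyone flagged by either approach.
neglectful = speeders | atypical
print(sorted(neglectful))
```

Replacing the union with an intersection (`speeders & atypical`) would give a stricter list that keeps only respondents flagged by both approaches; the abstract does not say which rule was used.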

 

