Evaluating Search System Explainability with Psychometrics and Crowdsourcing

Chen, Catherine; Eickhoff, Carsten

arXiv:2210.09430 (cs)

[Submitted on 17 Oct 2022 (v1), last revised 3 May 2024 (this version, v3)]

Abstract: As information retrieval (IR) systems, such as search engines and conversational agents, become ubiquitous in various domains, the need for transparent and explainable systems grows to ensure accountability, fairness, and unbiased results. Despite recent advances in explainable AI and IR techniques, there is no consensus on the definition of explainability. Existing approaches often treat it as a singular notion, disregarding the multidimensional definition postulated in the literature. In this paper, we use psychometrics and crowdsourcing to identify human-centered factors of explainability in Web search systems and introduce SSE (Search System Explainability), an evaluation metric for explainable IR (XIR) search systems. In a crowdsourced user study, we demonstrate SSE's ability to distinguish between explainable and non-explainable systems, showing that systems with higher scores indeed indicate greater interpretability. We hope that aside from these concrete contributions to XIR, this line of work will serve as a blueprint for similar explainability evaluation efforts in other domains of machine learning and natural language processing.

Comments:	11 pages, 4 figures, accepted at SIGIR 2024 as full paper
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2210.09430 [cs.IR]
	(or arXiv:2210.09430v3 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2210.09430

Submission history

From: Catherine Chen
[v1] Mon, 17 Oct 2022 20:52:41 UTC (2,454 KB)
[v2] Fri, 16 Jun 2023 15:56:30 UTC (3,136 KB)
[v3] Fri, 3 May 2024 22:43:12 UTC (2,687 KB)
