Revealing the Role of User Moods in Struggling Search Tasks

Luyan Xu; Ujwal Gadiraju; Xuan Zhou

arxiv: 1907.07717 · v1 · pith:52SVX2INnew · submitted 2019-07-17 · 💻 cs.HC

Revealing the Role of User Moods in Struggling Search Tasks

Luyan Xu , Xuan Zhou , Ujwal Gadiraju This is my paper

Pith reviewed 2026-05-24 20:04 UTC · model grok-4.3

classification 💻 cs.HC

keywords user moodstruggling searchsearch behaviorquery issuanceperceived difficultyuser experienceinformation retrieval

0 comments

The pith

User mood systematically biases search behavior and perceived difficulty during struggling tasks.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that a user's mood influences both their actions and their sense of how hard a search task feels. Users in activated pleasant or unpleasant moods issue more queries than those in deactivated or neutral moods. Users in unpleasant moods also rate the difficulty higher. A sympathetic reader would care because these mood effects add a measurable layer to why some searches feel more frustrating or effortful than others, with direct consequences for how systems should respond.

Core claim

This work shows that a user's own mood can systematically bias the user's perception and experience while interacting with a search system and trying to satisfy an information need. People who are in activated-pleasant or activated-unpleasant moods tend to issue more queries than people in deactivated or neutral moods. Those in an unpleasant mood perceive a higher level of difficulty. These insights extend the current understanding of struggling search tasks and have important implications on the design and evaluation of search systems supporting such tasks.

What carries the argument

Mood states (activated-pleasant, activated-unpleasant, deactivated, neutral) that alter query volume and difficulty ratings.

If this is right

Search systems supporting struggling tasks need to consider mood as an input to interaction design.
Evaluation metrics for struggling search must incorporate mood as a variable that affects user reports.
Insights from mood effects can be used to refine how systems detect and respond to user struggle.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Interfaces could test mood-adaptive query suggestions or result rankings as a practical extension.
The same mood categories might be examined in other interactive systems beyond search to check consistency.

Load-bearing premise

The mood categories and the mix of lab studies, in-situ feedback, and crowdsourcing experiments isolate mood effects without large confounding influences from task difficulty or individual differences.

What would settle it

A replication experiment that controls for mood and finds no reliable differences in query counts or difficulty ratings across the four mood conditions would falsify the central claim.

Figures

Figures reproduced from arXiv: 1907.07717 by Luyan Xu, Ujwal Gadiraju, Xuan Zhou.

**Figure 1.** Figure 1: Pick-A-Mood scale to measure the self-reported mood of users before they enter the TaskGenie framework. 2 Figure8 – http://figure-eight.com/ 3http://www.mturk.com/ 3.2 Tasks To analyze how mood effects users’ search behavior in struggling search tasks (SSTs), we formulated 10 struggling text retrieval tasks from Wikipedia using a method from previous study [18]. We made sure that the first interaction of t… view at source ↗

**Figure 2.** Figure 2: Workflow of participants in the experimental [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

read the original abstract

User-centered approaches have been extensively studied and used in the area of struggling search. Related research has targeted key aspects of users such as user satisfaction or frustration, and search success or failure, using a variety of experimental methods including laboratory user studies, in-situ explicit feedback from searchers and by using crowdsourcing. Such studies are valuable in advancing the understanding of search difficulty from a user's perspective, and yield insights that can directly improve search systems and their evaluation. However, little is known about how user moods influence their interactions with a search system or their perception of struggling. In this work, we show that a user's own mood can systematically bias the user's perception, and experience while interacting with a search system and trying to satisfy an information need. People who are in activated-pleasant / activated-unpleasant moods tend to issue more queries than people in deactivated or neutral moods. Those in an unpleasant mood perceive a higher level of difficulty. Our insights extend the current understanding of struggling search tasks and have important implications on the design and evaluation of search systems supporting such tasks.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The abstract flags a plausible mood effect on query volume and difficulty ratings in struggling searches, but supplies zero methods, sample sizes, or stats, so the claim can't be checked.

read the letter

The paper's core observation is that activated moods (pleasant or unpleasant) correlate with issuing more queries, while unpleasant moods link to higher perceived difficulty during struggling searches. This builds on existing work about frustration and satisfaction by adding mood categories as a factor that might bias user experience and behavior. If the data hold, it points to a practical angle for search systems that try to detect or adapt to user state. The abstract does a clean job of stating the directional findings and their potential design implications without overclaiming a new framework. That part is straightforward and useful as a starting point for thinking about user modeling in IR and HCI. The main problem is that nothing in the abstract lets us evaluate whether the mood effects are isolated. No sample sizes, no measurement details for mood, no mention of statistical tests, effect sizes, or controls for task difficulty, prior experience, or individual traits. The stress-test note is accurate here: without those, differences could easily trace to confounders rather than mood itself. The study is observational and empirical, so the absence of any equations or derivations is expected, but the lack of basic reporting on how the data were collected and analyzed is a real gap. This kind of work is aimed at researchers who build or evaluate search interfaces that account for user affect. A reader already working on struggling search or user frustration might skim it for the mood angle, but only if the full paper includes the missing methods and reproducible numbers. Right now the abstract alone does not give enough to justify sending it out for serious refereeing; the methods and results sections would need to be filled in first.

Referee Report

2 major / 0 minor

Summary. The manuscript examines the influence of user moods on search interactions and perceptions during struggling search tasks. Drawing on laboratory user studies, in-situ explicit feedback, and crowdsourcing, it claims that users in activated-pleasant or activated-unpleasant moods issue more queries than those in deactivated or neutral moods, while users in unpleasant moods perceive higher difficulty levels. These findings are positioned as extending understanding of struggling search and informing search system design and evaluation.

Significance. If supported by detailed statistical evidence with proper controls, the work would contribute to user-centered information retrieval by identifying mood as a systematic factor in search behavior and difficulty perception. The multi-method approach (laboratory, in-situ, crowdsourcing) is a positive element that could support broader applicability if the mood isolation is convincingly demonstrated.

major comments (2)

[Abstract] Abstract: the directional findings on query counts and perceived difficulty are stated without sample sizes, statistical tests, effect sizes, or any mention of controls for task type or individual differences, preventing verification that the data support the central claims about mood effects.
[Methods] Methods (implied by description of laboratory, in-situ, and crowdsourcing studies): insufficient detail on mood measurement instruments, induction procedures, task difficulty balancing, participant screening, exclusion criteria, or statistical models (e.g., inclusion of covariates for prior experience or task type) to confirm that observed differences isolate mood rather than confounders.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We agree that additional quantitative details in the abstract and expanded methodological transparency will strengthen the presentation. We respond point-by-point below and will incorporate the suggested changes in the revision.

read point-by-point responses

Referee: [Abstract] Abstract: the directional findings on query counts and perceived difficulty are stated without sample sizes, statistical tests, effect sizes, or any mention of controls for task type or individual differences, preventing verification that the data support the central claims about mood effects.

Authors: We agree the abstract should be more informative. In the revision we will add the total sample sizes across the three studies, report the key statistical tests and p-values supporting the query-count and difficulty-perception differences, include effect-size information where available, and explicitly note that the reported mood effects were obtained after controlling for task type and individual differences (e.g., prior search experience). These numbers and controls already appear in the results sections; we will summarize them concisely in the abstract. revision: yes
Referee: [Methods] Methods (implied by description of laboratory, in-situ, and crowdsourcing studies): insufficient detail on mood measurement instruments, induction procedures, task difficulty balancing, participant screening, exclusion criteria, or statistical models (e.g., inclusion of covariates for prior experience or task type) to confirm that observed differences isolate mood rather than confounders.

Authors: The manuscript already describes the mood scales, induction methods, task sets, and basic statistical approach in each study subsection. To address the concern directly, we will consolidate and expand these descriptions into a dedicated methods overview that lists: (1) the exact mood instruments (e.g., PANAS or equivalent), (2) induction protocols, (3) how tasks were pre-balanced for difficulty, (4) screening and exclusion rules, and (5) the full regression/ANOVA models with covariates for prior experience and task type. This will make explicit that mood effects are estimated after accounting for the listed confounders. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical observational study with no derivations or fitted predictions

full rationale

The paper is an empirical study reporting observations from laboratory, in-situ, and crowdsourcing experiments on mood effects in search tasks. It contains no equations, mathematical derivations, parameter fitting, or predictions that reduce to inputs by construction. Claims rest on data collection and statistical analysis rather than self-referential definitions or self-citation chains that bear the central load. This matches the default case of a self-contained empirical paper with no circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claim rests on the validity of mood measurement instruments and the assumption that the chosen experimental paradigms isolate mood effects; these are standard domain assumptions in HCI and psychology rather than paper-specific inventions.

axioms (2)

domain assumption Mood can be reliably categorized into activated-pleasant, activated-unpleasant, deactivated, and neutral states using established psychological instruments.
Invoked implicitly when reporting differential effects across mood groups.
domain assumption Laboratory and crowdsourced search tasks can be designed to represent real-world struggling search without introducing systematic bias.
Required for generalizing the observed mood effects to search system design.

pith-pipeline@v0.9.0 · 5715 in / 1341 out tokens · 26670 ms · 2026-05-24T20:04:53.776646+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

19 extracted references · 19 canonical work pages

[1]

Christopher Beedie, Peter Terry, and Andrew Lane. 2005. Distinctions between emotion and mood. Cognition & Emotion 19, 6 (2005), 847–878

work page 2005
[2]

Richard J Davidson. 1994. On emotion, mood, and related affective constructs. The nature of emotion: Fundamental questions (1994), 51–55

work page 1994
[3]

Gianluca Demartini, Djellel Eddine Difallah, Ujwal Gadiraju, Michele Catasta, et al. 2017. An introduction to hybrid human-machine information systems. Foundations and Trends® in Web Science 7, 1 (2017), 1–87

work page 2017
[4]

Pieter MA Desmet, Martijn H Vastenburg, and Natalia Romero. 2016. Mood measurement with Pick-A-Mood: review of current methods and design of a pictorial self-report scale. Journal of Design Research 14, 3 (2016), 241–279

work page 2016
[5]

Nico H Frijda et al. 1994. Varieties of affect: Emotions and episodes, moods, and sentiments. (1994)

work page 1994
[6]

Ujwal Gadiraju, Sebastian Möller, Martin Nöllenburg, Dietmar Saupe, Sebastian Egger-Lampl, Daniel Archambault, and Brian Fisher. 2017. Crowdsourcing ver- sus the laboratory: towards human-centered experiments using the crowd. In Crowdsourcing and Human-Centered Experiments . Springer, 6–26

work page 2017
[7]

Ujwal Gadiraju, Ran Yu, Stefan Dietze, and Peter Holtz. 2018. Analyzing Knowl- edge Gain of Users in Informational Search Sessions on the Web. In CHIIR 2018. ACM, 2–11

work page 2018
[8]

Sture Holm. 1979. A simple sequentially rejective multiple test procedure. Scan- dinavian journal of statistics (1979), 65–70

work page 1979
[9]

Bill Kules and Robert Capra. 2008. Creating exploratory tasks for a faceted search interface. Proc. of HCIR 2008 (2008), 18–21

work page 2008
[10]

Bill Kules and Robert Capra. 2009. Designing exploratory search tasks for user studies of information seeking support systems. In Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries . ACM, 419–420

work page 2009
[11]

William N Morris. 2012. Mood: The frame of mind . Springer Sci. & Business Media

work page 2012
[12]

Jesse J Prinz. 2004. Gut reactions: A perceptual theory of emotion . Oxford UP

work page 2004
[13]

Cesar Vandevelde, Francis Wyffels, Maria-Cristina Ciocci, Bram Vanderborght, and Jelle Saldien. 2016. Design and evaluation of a DIY construction system for educational robot kits. International Journal of Technology and Design Education 26, 4 (2016), 521–540

work page 2016
[14]

Philippe Verduyn, Iven Van Mechelen, and Francis Tuerlinckx. 2011. The relation between event processing and the duration of emotional experience. Emotion 11, 1 (2011), 20

work page 2011
[15]

Bjørn J Villa, Katrien De Moor, Poul E Heegaard, and Anders Instefjord. 2013. Investigating Quality of Experience in the context of adaptive video streaming: findings from an experimental user study. Akademika forlag Stavanger, Norway (2013)

work page 2013
[16]

David Watson and Auke Tellegen. 1985. Toward a consensual structure of mood. Psychological bulletin 98, 2 (1985), 219

work page 1985
[17]

Barbara Wildemuth, Luanne Freund, and Elaine G. Toms. 2014. Untangling search task complexity and difficulty in the context of interactive information retrieval studies. Journal of Documentation 70, 6 (2014), 1118–1140

work page 2014
[18]

Luyan Xu and Xuan Zhou. 2019. Generating Tasks for Study of Struggling Search. In Proceedings of the 2019 CHIIR . ACM, 267–270

work page 2019
[19]

Ran Yu and Ujwal Gadiraju et al. 2018. Predicting User Knowledge Gain in Informational Search Sessions. In the 41st International SIGIR . 75–84

work page 2018

[1] [1]

Christopher Beedie, Peter Terry, and Andrew Lane. 2005. Distinctions between emotion and mood. Cognition & Emotion 19, 6 (2005), 847–878

work page 2005

[2] [2]

Richard J Davidson. 1994. On emotion, mood, and related affective constructs. The nature of emotion: Fundamental questions (1994), 51–55

work page 1994

[3] [3]

Gianluca Demartini, Djellel Eddine Difallah, Ujwal Gadiraju, Michele Catasta, et al. 2017. An introduction to hybrid human-machine information systems. Foundations and Trends® in Web Science 7, 1 (2017), 1–87

work page 2017

[4] [4]

Pieter MA Desmet, Martijn H Vastenburg, and Natalia Romero. 2016. Mood measurement with Pick-A-Mood: review of current methods and design of a pictorial self-report scale. Journal of Design Research 14, 3 (2016), 241–279

work page 2016

[5] [5]

Nico H Frijda et al. 1994. Varieties of affect: Emotions and episodes, moods, and sentiments. (1994)

work page 1994

[6] [6]

Ujwal Gadiraju, Sebastian Möller, Martin Nöllenburg, Dietmar Saupe, Sebastian Egger-Lampl, Daniel Archambault, and Brian Fisher. 2017. Crowdsourcing ver- sus the laboratory: towards human-centered experiments using the crowd. In Crowdsourcing and Human-Centered Experiments . Springer, 6–26

work page 2017

[7] [7]

Ujwal Gadiraju, Ran Yu, Stefan Dietze, and Peter Holtz. 2018. Analyzing Knowl- edge Gain of Users in Informational Search Sessions on the Web. In CHIIR 2018. ACM, 2–11

work page 2018

[8] [8]

Sture Holm. 1979. A simple sequentially rejective multiple test procedure. Scan- dinavian journal of statistics (1979), 65–70

work page 1979

[9] [9]

Bill Kules and Robert Capra. 2008. Creating exploratory tasks for a faceted search interface. Proc. of HCIR 2008 (2008), 18–21

work page 2008

[10] [10]

Bill Kules and Robert Capra. 2009. Designing exploratory search tasks for user studies of information seeking support systems. In Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries . ACM, 419–420

work page 2009

[11] [11]

William N Morris. 2012. Mood: The frame of mind . Springer Sci. & Business Media

work page 2012

[12] [12]

Jesse J Prinz. 2004. Gut reactions: A perceptual theory of emotion . Oxford UP

work page 2004

[13] [13]

Cesar Vandevelde, Francis Wyffels, Maria-Cristina Ciocci, Bram Vanderborght, and Jelle Saldien. 2016. Design and evaluation of a DIY construction system for educational robot kits. International Journal of Technology and Design Education 26, 4 (2016), 521–540

work page 2016

[14] [14]

Philippe Verduyn, Iven Van Mechelen, and Francis Tuerlinckx. 2011. The relation between event processing and the duration of emotional experience. Emotion 11, 1 (2011), 20

work page 2011

[15] [15]

Bjørn J Villa, Katrien De Moor, Poul E Heegaard, and Anders Instefjord. 2013. Investigating Quality of Experience in the context of adaptive video streaming: findings from an experimental user study. Akademika forlag Stavanger, Norway (2013)

work page 2013

[16] [16]

David Watson and Auke Tellegen. 1985. Toward a consensual structure of mood. Psychological bulletin 98, 2 (1985), 219

work page 1985

[17] [17]

Barbara Wildemuth, Luanne Freund, and Elaine G. Toms. 2014. Untangling search task complexity and difficulty in the context of interactive information retrieval studies. Journal of Documentation 70, 6 (2014), 1118–1140

work page 2014

[18] [18]

Luyan Xu and Xuan Zhou. 2019. Generating Tasks for Study of Struggling Search. In Proceedings of the 2019 CHIIR . ACM, 267–270

work page 2019

[19] [19]

Ran Yu and Ujwal Gadiraju et al. 2018. Predicting User Knowledge Gain in Informational Search Sessions. In the 41st International SIGIR . 75–84

work page 2018