pith. sign in
Pith Number

pith:QP225GXE

pith:2026:QP225GXENRVBVBJGUHQFXJNEWS
not attested not anchored not stored refs pending

Beyond Offline A/B Testing: Context-Aware Agent Simulation for Recommender System Evaluation

Gian Maria Marconi, Narimasa Watanabe, Nicolas Bougie, Xiaotong Ye

ContextSim anchors LLM agents in daily life scenarios to simulate contextual user interactions for more reliable recommender evaluation.

arxiv:2604.09549 v2 · 2026-01-26 · cs.IR · cs.AI

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{QP225GXENRVBVBJGUHQFXJNEWS}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

Experiments across domains show our method generates interactions more closely aligned with human behavior than prior work. We further validate our approach through offline A/B testing correlation and show that RS parameters optimized using ContextSim yield improved real-world engagement.

C2weakest assumption

That LLM agents with generated life scenarios and enforced consistency at action and trajectory levels accurately capture the contextual factors shaping genuine human decision-making.

C3one line summary

ContextSim generates more human-aligned user interactions for recommender systems via context-aware life simulation and consistency enforcement, yielding parameters that improve real-world engagement.

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-06-02T02:04:52.938344Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

83f5ae9ae46c6a1a8526a1e05ba5a4b49fd4e123a6186430b70a105de0e0083c

Aliases

arxiv: 2604.09549 · arxiv_version: 2604.09549v2 · doi: 10.48550/arxiv.2604.09549 · pith_short_12: QP225GXENRVB · pith_short_16: QP225GXENRVBVBJG · pith_short_8: QP225GXE
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/QP225GXENRVBVBJGUHQFXJNEWS \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 83f5ae9ae46c6a1a8526a1e05ba5a4b49fd4e123a6186430b70a105de0e0083c
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "558882d3d367efbbb21b9371e6dddd3cee0bbeae45f9a6d87bec22b4d0531da8",
    "cross_cats_sorted": [
      "cs.AI"
    ],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.IR",
    "submitted_at": "2026-01-26T05:01:00Z",
    "title_canon_sha256": "d616394f1d1f770c0259af61f2f80dc5dcf925c841022a772aa911cd95c5eb32"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2604.09549",
    "kind": "arxiv",
    "version": 2
  }
}