Teece, Gary Pisano, and Amy Shuen
2 Pith papers cite this work. Polarity classification is still indexing.
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
Representative citing papers
Proposes a framework of Data Asset Management Capability, Data Quality Standard Conformity, and Data Asset Benefit Realization Capability, showing chain effects, necessary conditions, and configurational paths via PLS-SEM, NCA, and fsQCA.
citing papers explorer
- CEO-Bench: Can Agents Play the Long Game?
  CEO-Bench evaluates AI agents on managing a startup over 500 days, showing that even top models like Claude Opus 4.8 and GPT-5.5 barely maintain starting capital and fail to turn consistent profits.
- Enterprise Data Asset Quality: A Management-Standard Conformity-Benefit Realization Framework and Formation Mechanisms
  Proposes a framework of Data Asset Management Capability, Data Quality Standard Conformity, and Data Asset Benefit Realization Capability, showing chain effects, necessary conditions, and configurational paths via PLS-SEM, NCA, and fsQCA.