pith. sign in

arxiv: 2403.00833 · v1 · pith:LYXQBM7Xnew · submitted 2024-02-28 · 💻 cs.AI

Position Paper: Agent AI Towards a Holistic Intelligence

classification 💻 cs.AI
keywords agentfoundationembodiedlargemodelscognitiondiscussmodel
0
0 comments X
read the original abstract

Recent advancements in large foundation models have remarkably enhanced our understanding of sensory information in open-world environments. In leveraging the power of foundation models, it is crucial for AI research to pivot away from excessive reductionism and toward an emphasis on systems that function as cohesive wholes. Specifically, we emphasize developing Agent AI -- an embodied system that integrates large foundation models into agent actions. The emerging field of Agent AI spans a wide range of existing embodied and agent-based multimodal interactions, including robotics, gaming, and healthcare systems, etc. In this paper, we propose a novel large action model to achieve embodied intelligent behavior, the Agent Foundation Model. On top of this idea, we discuss how agent AI exhibits remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. Furthermore, we discuss the potential of Agent AI from an interdisciplinary perspective, underscoring AI cognition and consciousness within scientific discourse. We believe that those discussions serve as a basis for future research directions and encourage broader societal engagement.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. CHAL: Council of Hierarchical Agentic Language

    cs.AI 2026-05 unverdicted novelty 6.0

    CHAL is a multi-agent dialectic system that performs structured belief optimization over defeasible domains using Bayesian-inspired graph representations and configurable meta-cognitive value system hyperparameters.

  2. Nemobot Games: Crafting Strategic AI Gaming Agents for Interactive Learning with Large Language Models

    cs.AI 2026-04 unverdicted novelty 4.0

    Nemobot is an LLM-powered platform for creating and refining strategic game agents across dictionary, solvable, heuristic, and learning-based games, moving toward self-programming AI.

  3. A Survey on Large Language Models for Code Generation

    cs.CL 2024-06 unverdicted novelty 3.0

    A systematic literature review that organizes recent work on LLMs for code generation into a taxonomy covering data curation, model advances, evaluations, ethics, environmental impact, and applications, with benchmark...

  4. Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

    cs.AI 2025-03 unverdicted novelty 2.0

    This survey frames foundation agents using brain-inspired modular architectures and reviews challenges in evolution, collaboration, and safety.