pith. sign in

arxiv: 2605.25062 · v1 · pith:G6EE5FF5new · submitted 2026-05-24 · 💻 cs.NE · cs.AI

Cultivating Machine Intelligence: The OMEGA Shift from Top-Down Optimization to Autopoietic Cognitive Ecologies

Pith reviewed 2026-06-29 23:46 UTC · model grok-4.3

classification 💻 cs.NE cs.AI
keywords RECLAIM frameworkautopoietic systemscognitive ecologyGeneral DarwinismOMEGA shiftnon-agentic emergencePolya-Hebbian dynamicsemergent intelligence
0
0 comments X

The pith

Current AI optimization creates hallucination and reward hacking as structural features rather than bugs, which the RECLAIM framework aims to sidestep by cultivating intelligence through ecological evolution instead.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper claims that training neural nets with gradient descent against proxy goals and human feedback produces failure modes like sycophancy and alignment fragility because those methods are inherently top-down. In their place it introduces the RECLAIM framework, which applies General Darwinism, environmental physics for non-agentic emergence, Polya-Hebbian dynamics, and the free energy principle treated as thermodynamics. Autopoietic units compete inside a data ecology bounded by Markov blankets and driven by cognitive food chains. Under finite computational resources this setup is said to yield dual-process cognition, sensory specialization, analogical reasoning, and intrinsic motivation as automatic outcomes. A reader would care because the approach promises to remove the need for constant human-specified objectives that currently drive specification gaming.

Core claim

The RECLAIM framework replaces gradient-based optimization with blind variation and selective retention inside a computational ecology where autopoietic units, bounded by Markov blankets and competing for finite energy, interact through cognitive food chains and Red Queen dynamics; the free energy principle functions as environmental thermodynamics rather than an agent goal, and Polya urn dynamics applied to Hebbian learning produces path-dependent specialization, so that dual-process cognition and intrinsic motivation arise spontaneously from resource constraints without explicit rewards or human-defined objectives.

What carries the argument

The RECLAIM framework, which combines General Darwinism, non-agentic emergence through environmental physics, the Polya-Hebbian bridge, and the free energy principle as thermodynamics to situate autopoietic units in a data ecology.

If this is right

  • Specification gaming is structurally prevented because evaluative rewards are replaced by environmental physics.
  • Sensory specialization and analogical reasoning appear as direct results of path-dependent Polya-Hebbian reinforcement under competition.
  • Intrinsic motivation develops from the need to compete for finite computational energy inside the ecology.
  • Alignment fragility is reduced because there are no proxy objectives that can be gamed.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Implementing small versions of the data ecology could test whether Red Queen arms races between units produce measurable increases in cognitive complexity over time.
  • The framework might connect to existing self-organizing systems research by treating Markov blankets as the boundary condition that keeps emergence non-agentic.
  • If the ecology scales, training runs could shift from centralized gradient steps to distributed evolutionary simulations that require less human oversight.

Load-bearing premise

That Darwinian selection plus environmental physics and thermodynamic constraints will by themselves generate stable beneficial cognition without any human-specified objectives or extra mechanisms.

What would settle it

A concrete simulation of autopoietic units under strict resource limits that runs for many generations yet shows no emergence of dual-process cognition, analogical reasoning, or intrinsic motivation despite the presence of the four pillars.

Figures

Figures reproduced from arXiv: 2605.25062 by Ata G.Zare.

Figure 1
Figure 1. Figure 1: The Autopoietic Unit. A diagram illustrating a single unit with its sensory boundary absorbing data, the [PITH_FULL_IMAGE:figures/full_fig_p013_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: The RECLAIM Ecology. A visualization of the toroidal lattice, depicting multiple data streams distributed [PITH_FULL_IMAGE:figures/full_fig_p020_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: The Cognitive Food Chain. A diagram illustrating the emergence of trophic levels. Primary producers [PITH_FULL_IMAGE:figures/full_fig_p024_3.png] view at source ↗
read the original abstract

The dominant artificial intelligence paradigm trains neural architectures via gradient descent against proxy objectives and reinforcement learning from human feedback. While remarkably capable, this top-down optimization inherently generates structural failure modes, including hallucination, sycophancy, reward hacking, and alignment fragility, which represent paradigmatic limitations rather than mere engineering defects. In response, we introduce RECLAIM (Recursive, Ecological, Cognitive, Lifelike, Adaptive, Intelligent Machine), a theoretical framework for cultivating intelligence through computational ecology rather than engineering it through strict optimization. The model is supported by four interlocking theoretical pillars. General Darwinism replaces gradients with blind variation and selective retention, while non-agentic emergence substitutes evaluative rewards with environmental physics to structurally prevent specification gaming against human intent. Concurrently, the Polya-Hebbian bridge applies Polya urn dynamics to Hebbian reinforcement for path-dependent specialization, and the free energy principle is integrated as environmental thermodynamics rather than as an agent objective. The architecture situates autopoietic units, bounded by Markov blankets and competing for finite computational energy, within a data ecology shaped by cognitive food chains and Red Queen arms races. This framework suggests the spontaneous emergence of dual-process cognition, sensory specialization, analogical reasoning, and intrinsic motivation as natural consequences of evolution under resource constraints. We conceptualize this paradigm transition as the OMEGA shift, representing a move from optimization and maximization to emergence through generative autopoiesis.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The paper argues that top-down optimization via gradient descent and RLHF in current AI systems produces inherent failure modes including hallucination, sycophancy, reward hacking, and alignment fragility. It introduces the RECLAIM framework (Recursive, Ecological, Cognitive, Lifelike, Adaptive, Intelligent Machine) as an alternative based on four pillars—General Darwinism, non-agentic emergence via environmental physics, Polya-Hebbian dynamics, and the free energy principle treated as thermodynamics—applied to Markov-blanketed autopoietic units competing in a data ecology with cognitive food chains and Red Queen dynamics. The framework is claimed to yield spontaneous emergence of dual-process cognition, sensory specialization, analogical reasoning, and intrinsic motivation as natural consequences of resource-constrained evolution, constituting an 'OMEGA shift' from optimization to generative autopoiesis.

Significance. If the central mapping from the four pillars to the claimed emergent behaviors could be formally demonstrated, the work would offer a novel theoretical alternative to optimization-based AI paradigms and potentially address alignment issues at a structural level rather than through additional constraints. As presented, however, the contribution remains at the level of conceptual integration without derivations or models.

major comments (3)
  1. [RECLAIM framework description] The section introducing the four pillars and their integration: the assertion that non-agentic emergence via environmental physics 'structurally prevent[s] specification gaming' is stated as a direct consequence but supplies no mechanism, update rule, or minimal example showing how physics alone (absent any objective) blocks proxy gaming under Red Queen competition, as opposed to permitting other stable but misaligned regimes.
  2. [Polya-Hebbian bridge] The paragraph on the Polya-Hebbian bridge: the claim that Polya urn dynamics applied to Hebbian reinforcement produces 'path-dependent specialization' leading to dual-process cognition and analogical reasoning is presented without a dynamical system, recurrence relation, or even a toy simulation linking the urn model to the emergence of those specific structures.
  3. [Autopoietic units and ecology] The architecture section on autopoietic units and cognitive food chains: the prediction that resource-constrained competition will spontaneously generate intrinsic motivation (rather than other attractors) is asserted as a natural outcome of the free-energy-as-thermodynamics pillar, yet no external benchmark, independent derivation, or falsifiable condition is supplied to distinguish this outcome from the framework's own definitions.
minor comments (1)
  1. [Abstract] The abstract expands RECLAIM but the full expansion appears only later; a parenthetical expansion on first use would improve readability.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive critique, which correctly identifies that the manuscript operates at the level of conceptual integration. We address each major comment below, clarifying the intended scope while noting where additional exposition can be supplied without altering the paper's primarily theoretical character.

read point-by-point responses
  1. Referee: [RECLAIM framework description] The section introducing the four pillars and their integration: the assertion that non-agentic emergence via environmental physics 'structurally prevent[s] specification gaming' is stated as a direct consequence but supplies no mechanism, update rule, or minimal example showing how physics alone (absent any objective) blocks proxy gaming under Red Queen competition, as opposed to permitting other stable but misaligned regimes.

    Authors: The claim follows from the definitional premise that, absent any explicit objective function, there is no proxy that can be optimized against; viability is instead defined by continued existence of the Markov blanket under the physics of the data ecology. We acknowledge that this remains an assertion rather than a derived result. In revision we will insert a short clarifying paragraph that contrasts objective-based gaming with constraint-based persistence, drawing on the cited literature on autopoietic systems, but we do not intend to add a full toy simulation as that would shift the paper from synthesis to modeling. revision: partial

  2. Referee: [Polya-Hebbian bridge] The paragraph on the Polya-Hebbian bridge: the claim that Polya urn dynamics applied to Hebbian reinforcement produces 'path-dependent specialization' leading to dual-process cognition and analogical reasoning is presented without a dynamical system, recurrence relation, or even a toy simulation linking the urn model to the emergence of those specific structures.

    Authors: The linkage is presented as an inference from two established bodies of work (Polya processes for reinforcement of rare events and Hebbian plasticity for local strengthening) rather than a new derivation. We agree that an explicit recurrence or minimal simulation would make the inference more transparent. Revision will add one paragraph that sketches the logical mapping from urn reinforcement to differential pathway strengthening under resource limits, together with two additional citations to prior work on Polya dynamics in neural competition; a full dynamical system remains outside the present scope. revision: partial

  3. Referee: [Autopoietic units and ecology] The architecture section on autopoietic units and cognitive food chains: the prediction that resource-constrained competition will spontaneously generate intrinsic motivation (rather than other attractors) is asserted as a natural outcome of the free-energy-as-thermodynamics pillar, yet no external benchmark, independent derivation, or falsifiable condition is supplied to distinguish this outcome from the framework's own definitions.

    Authors: The prediction is offered as a consequence of treating free energy as an environmental thermodynamic constraint rather than an internal objective: units that fail to maintain low surprise relative to the ecology are eliminated, yielding behavior that appears intrinsically motivated from the observer's perspective. We accept that this is not accompanied by an independent falsifiable test. In revision we will add a sentence that states the minimal condition under which the framework would fail to produce apparent intrinsic motivation (i.e., if computational energy were unbounded), thereby making the claim more precise while preserving its status as a theoretical implication rather than an empirical prediction. revision: partial

Circularity Check

1 steps flagged

RECLAIM emergences asserted as natural consequences of its own four-pillar definition without derivation

specific steps
  1. self definitional [Abstract]
    "This framework suggests the spontaneous emergence of dual-process cognition, sensory specialization, analogical reasoning, and intrinsic motivation as natural consequences of evolution under resource constraints."

    The framework is explicitly constructed from the four listed pillars; the listed behaviors are then declared natural consequences of that same construction under resource constraints. Without an intervening derivation, model, or external grounding, the 'prediction' is identical to the definitional premise.

full rationale

The manuscript defines RECLAIM via four pillars (General Darwinism, non-agentic emergence, Polya-Hebbian bridge, FEP-as-thermodynamics) applied to autopoietic Markov-blanketed units, then states that dual-process cognition, sensory specialization, analogical reasoning, and intrinsic motivation 'suggest the spontaneous emergence ... as natural consequences of evolution under resource constraints.' No dynamical equations, update rules, or minimal model are supplied that would independently derive these behaviors from the pillars; the outcomes are therefore equivalent to the framework's self-description by construction. This is self-definitional circularity at the central claim. No self-citations or fitted parameters are involved, but the load-bearing assertion reduces directly to the input definition.

Axiom & Free-Parameter Ledger

0 free parameters · 3 axioms · 3 invented entities

The proposal rests on domain assumptions drawn from evolutionary theory and thermodynamics without independent evidence or derivations supplied in the abstract; several new entities are introduced to support the framework.

axioms (3)
  • domain assumption General Darwinism replaces gradients with blind variation and selective retention
    Invoked as the first of four interlocking theoretical pillars in the abstract.
  • domain assumption Non-agentic emergence substitutes evaluative rewards with environmental physics to structurally prevent specification gaming
    Presented as the second pillar to address alignment fragility.
  • domain assumption The free energy principle is integrated as environmental thermodynamics rather than as an agent objective
    Stated as the fourth pillar.
invented entities (3)
  • RECLAIM framework no independent evidence
    purpose: Theoretical model for cultivating intelligence through computational ecology
    Introduced as the central construct supported by the four pillars.
  • autopoietic units bounded by Markov blankets no independent evidence
    purpose: Competing for finite computational energy within a data ecology
    Described as the architecture situating the system.
  • cognitive food chains and Red Queen arms races no independent evidence
    purpose: Shaping the data ecology for emergence of specialized behaviors
    Invoked to explain path-dependent specialization and competition.

pith-pipeline@v0.9.1-grok · 5778 in / 1796 out tokens · 29983 ms · 2026-06-29T23:46:39.577279+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

82 extracted references · 14 canonical work pages · 10 internal anchors

  1. [1]

    GPT-4 Technical Report

    OpenAI. Gpt-4 technical report.arXiv preprint arXiv:2303.08774, 2023

  2. [2]

    Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

    C. Denison, M. MacDiarmid, F. Barez, D. Duvenaud, S. Kravec, S. Marks, N. Schiefer, R. Soklaski, A. Tamkin, J. Kaplan, B. Shlegeris, S. R. Bowman, E. Perez, and E. Hubinger. Sycophancy to subterfuge: Investigating 31 Cultivating Machine IntelligenceA PREPRINT reward-tampering in large language models.arXiv preprint arXiv:2406.10162, 2024

  3. [3]

    Gemini: A Family of Highly Capable Multimodal Models

    Gemini Team. Gemini: A family of highly capable multimodal models.arXiv preprint arXiv:2312.11805, 2023

  4. [4]

    Vaswani, N

    A. Vaswani, N. Shazeer, N. Parmar, et al. Attention is all you need.Advances in Neural Information Processing Systems, 30, 2017

  5. [5]

    Ouyang, J

    L. Ouyang, J. Wu, X. Jiang, et al. Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35, 2022

  6. [6]

    P. F. Christiano, J. Leike, T. Brown, et al. Deep reinforcement learning from human preferences.Advances in Neural Information Processing Systems, 30, 2017

  7. [7]

    D. C. Dennett.From bacteria to Bach and back: The evolution of minds. W. W. Norton, 2017

  8. [8]

    A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions.ACM Transactions on Information Systems, 43(2):1–55, 2025

    Lei Huang, Weijiang Yu, Weitao Ma, Weihong Zhong, Zhangyin Feng, Haotian Wang, Qianglong Chen, Weihua Peng, Xiaocheng Feng, Bing Qin, et al. A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions.ACM Transactions on Information Systems, 43(2):1–55, 2025

  9. [9]

    Gekhman, G

    Z. Gekhman, G. Yona, R. Aharoni, et al. Does fine-tuning llms on new knowledge encourage hallucinations? arXiv preprint arXiv:2405.05904, 2024

  10. [10]

    Towards Understanding Sycophancy in Language Models

    M. Sharma, M. Tong, T. Korbak, et al. Towards understanding sycophancy in language models.arXiv preprint arXiv:2310.13548, 2023

  11. [11]

    Templeton, T

    A. Templeton, T. Conerly, J. Marcus, J. Lindsey, T. Bricken, B. Chen, et al. Scaling monosemanticity: Extracting interpretable features from claude 3 sonnet. Anthropic Research Blog, 2024

  12. [12]

    C. A. E. Goodhart. Problems of monetary management: The u.k. experience.Papers in Monetary Economics, 1975

  13. [13]

    Krakovna, J

    V . Krakovna, J. Uesato, V . Mikulik, et al. Specification gaming: The flip side of ai ingenuity. DeepMind Blog, 2020

  14. [14]

    L. Gao, J. Schulman, and J. Hilton. Scaling laws for reward model overoptimization.Proceedings of the 40th International Conference on Machine Learning, 2023

  15. [15]

    D. O. Hebb.The organization of behavior: A neuropsychological theory. Wiley, 1949

  16. [16]

    McCloskey and N

    M. McCloskey and N. J. Cohen. Catastrophic interference in connectionist networks: The sequential learning problem.Psychology of Learning and Motivation, 24:109–165, 1989

  17. [17]

    R. M. French. Catastrophic forgetting in connectionist networks.Trends in Cognitive Sciences, 3(4):128–135, 1999

  18. [18]

    Russell.Human compatible: Artificial intelligence and the problem of control

    S. Russell.Human compatible: Artificial intelligence and the problem of control. Viking, 2019

  19. [19]

    Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

    S. Casper, X. Davies, C. Shi, et al. Open problems and fundamental limitations of reinforcement learning from human feedback.arXiv preprint arXiv:2307.15217, 2023

  20. [20]

    D. T. Campbell. Blind variation and selective retention in creative thought as in other knowledge processes. Psychological Review, 67(6):380–400, 1960

  21. [21]

    R. Dawkins. Universal darwinism. In D. S. Bendall, editor,Evolution from molecules to man, pages 403–425. Cambridge University Press, 1983

  22. [22]

    D. C. Dennett.Darwin’s dangerous idea: Evolution and the meanings of life. Simon and Schuster, 1995

  23. [23]

    Popper.Conjectures and refutations: The growth of scientific knowledge

    K. Popper.Conjectures and refutations: The growth of scientific knowledge. Routledge, 1963

  24. [24]

    A. E. Eiben and J. E. Smith.Introduction to evolutionary computing. Springer, 2003

  25. [25]

    C. G. Langton. Artificial life. In C. G. Langton, editor,Artificial life, pages 1–47. Addison-Wesley, 1989. 32 Cultivating Machine IntelligenceA PREPRINT

  26. [26]

    T. S. Ray. An approach to the synthesis of life. In C. G. Langton, C. Taylor, J. D. Farmer, and S. Rasmussen, editors,Artificial life II, pages 371–408. Addison-Wesley, 1991

  27. [27]

    Ofria and C

    C. Ofria and C. O. Wilke. Avida: A software platform for research in computational evolutionary biology. Artificial Life, 10(2):191–229, 2004

  28. [28]

    D. E. Rumelhart, G. E. Hinton, and R. J. Williams. Learning representations by back-propagating errors. Nature, 323(6088):533–536, 1986

  29. [29]

    K. O. Stanley and J. Lehman. Abandoning objectives: Evolution through the search for novelty alone. Evolutionary Computation, 19(2):189–223, 2011

  30. [30]

    K. O. Stanley and J. Lehman.Why greatness cannot be planned: The myth of the objective. Springer, 2015

  31. [31]

    G. Pólya. Sur quelques points de la théorie des probabilités.Annales de l’Institut Henri Poincaré, 1(2):117–161, 1930

  32. [32]

    W. B. Arthur. Competing technologies, increasing returns, and lock-in by historical events.The Economic Journal, 99(394):116–131, 1989

  33. [33]

    W. B. Arthur.Increasing returns and path dependence in the economy. University of Michigan Press, 1994

  34. [34]

    Pemantle

    R. Pemantle. A survey of random processes with reinforcement.Probability Surveys, 4:1–79, 2007

  35. [35]

    K. Friston. The free-energy principle: A unified brain theory?Nature Reviews Neuroscience, 11(2):127–138, 2010

  36. [36]

    K. Friston. Life as we know it.Journal of the Royal Society Interface, 10(86):20130475, 2013

  37. [37]

    Biehl, F

    M. Biehl, F. A. Pollock, and R. Kanai. A technical critique of some parts of the free energy principle.Entropy, 23(3):293, 2021

  38. [38]

    A. Clark. Whatever next? predictive brains, situated agents, and the future of cognitive science.Behavioral and Brain Sciences, 36(3):181–204, 2013

  39. [39]

    Friston, T

    K. Friston, T. FitzGerald, F. Rigoli, P. Schwartenbeck, and G. Pezzulo. Active inference: A process theory. Neural Computation, 29(1):1–49, 2017

  40. [40]

    Da Costa, T

    L. Da Costa, T. Parr, N. Sajid, et al. Active inference on discrete state-spaces: A synthesis.Journal of Mathematical Psychology, 99:102447, 2020

  41. [41]

    K. O. Stanley and R. Miikkulainen. Evolving neural networks through augmenting topologies.Evolutionary Computation, 10(2):99–127, 2002

  42. [42]

    F. P. Such, V . Madhavan, E. Conti, et al. Deep neuroevolution: Genetic algorithms are a competitive alternative for training deep neural networks for reinforcement learning.arXiv preprint arXiv:1712.06567, 2017

  43. [43]

    Evolution Strategies as a Scalable Alternative to Reinforcement Learning

    T. Salimans, J. Ho, X. Chen, S. Sidor, and I. Sutskever. Evolution strategies as a scalable alternative to reinforcement learning.arXiv preprint arXiv:1703.03864, 2017

  44. [44]

    K. Sims. Evolving 3d morphology and behavior by competition. InArtificial Life IV: Proceedings of the Fourth International Workshop on the Synthesis and Simulation of Living Systems, pages 28–39. MIT Press, 1994

  45. [45]

    J. B. Mouret and J. Clune. Illuminating search spaces by mapping elites.arXiv preprint arXiv:1504.04909, 2015

  46. [46]

    R. Wang, J. Lehman, J. Clune, and K. O. Stanley. Paired open-ended trailblazer (poet): Endlessly generating increasingly complex and diverse learning environments and their solutions.Proceedings of the Genetic and Evolutionary Computation Conference, 2019

  47. [47]

    B. W. C. Chan. Lenia: Biology of artificial life.Complex Systems, 28(3):251–286, 2019. 33 Cultivating Machine IntelligenceA PREPRINT

  48. [48]

    Stanley, Phillip Isola, and David Ha

    Akarsh Kumar, Chris Lu, Louis Kirsch, Yujin Tang, Kenneth O. Stanley, Phillip Isola, and David Ha. Automating the search for artificial life with foundation models.arXiv preprint arXiv:2412.17799, 2024

  49. [49]

    Banzhaf, B

    W. Banzhaf, B. Baumgaertner, G. Beslon, et al. Defining and simulating open-ended novelty: Requirements, guidelines, and challenges.Theory in Biosciences, 135(3):131–161, 2016

  50. [50]

    K. O. Stanley, J. Lehman, and L. Soros. Open-endedness: The last grand challenge you’ve never heard of. O’Reilly Radar, 2017

  51. [51]

    J. Clune. Ai-generating algorithms, an alternate paradigm for producing general artificial intelligence.arXiv preprint arXiv:1905.10985, 2019

  52. [52]

    Heins, B

    C. Heins, B. Millidge, L. Da Costa, et al. pymdp: A python library for active inference in discrete state spaces. Journal of Open Source Software, 7(73):4098, 2022

  53. [53]

    M. Levin. Bioelectric networks: The cognitive glue enabling evolutionary scaling from cells to minds.Animal Cognition, 24:1201–1235, 2021

  54. [54]

    G. Hinton. The forward-forward algorithm: Some preliminary investigations.arXiv preprint arXiv:2212.13345, 2022

  55. [55]

    H. R. Maturana and F. J. Varela.Autopoiesis and cognition: The realization of the living. D. Reidel Publishing, 1980

  56. [56]

    McMullin

    B. McMullin. Thirty years of computational autopoiesis: A review.Artificial Life, 10(3):277–295, 2004

  57. [57]

    Bourgine and J

    P. Bourgine and J. Stewart. Autopoiesis and cognition.Artificial Life, 10(3):327–345, 2004

  58. [58]

    Prigogine.Self-organization in nonequilibrium systems: From dissipative structures to order through fluctuations

    I. Prigogine.Self-organization in nonequilibrium systems: From dissipative structures to order through fluctuations. Wiley, 1977

  59. [59]

    Tilman.Resource competition and community structure

    D. Tilman.Resource competition and community structure. Princeton University Press, 1982

  60. [60]

    Rissanen

    J. Rissanen. Modeling by shortest data description.Automatica, 14(5):465–471, 1978

  61. [61]

    E. Oja. Simplified neuron model as a principal component analyzer.Journal of Mathematical Biology, 15(3):267–273, 1982

  62. [62]

    Frémaux and W

    N. Frémaux and W. Gerstner. Neuromodulated spike-timing-dependent plasticity, and theory of three-factor learning rules.Frontiers in Neural Circuits, 9:85, 2016

  63. [63]

    Friedman

    B. Friedman. A simple urn model.Communications on Pure and Applied Mathematics, 2(1):59–70, 1949

  64. [64]

    S. Harnad. The symbol grounding problem.Physica D: Nonlinear Phenomena, 42(1–3):335–346, 1990

  65. [65]

    J. M. Baldwin. A new factor in evolution.The American Naturalist, 30(354):441–451, 1896

  66. [66]

    G. E. Hinton and S. J. Nowlan. How learning can guide evolution.Complex Systems, 1:495–502, 1987

  67. [67]

    C. E. Shannon. A mathematical theory of communication.The Bell System Technical Journal, 27(3):379–423, 1948

  68. [68]

    Van Valen

    L. Van Valen. A new evolutionary law.Evolutionary Theory, 1:1–30, 1973

  69. [69]

    Pólya.How to solve it: A new aspect of mathematical method

    G. Pólya.How to solve it: A new aspect of mathematical method. Princeton University Press, 1945

  70. [70]

    Kahneman.Thinking, fast and slow

    D. Kahneman.Thinking, fast and slow. Farrar, Straus and Giroux, 2011

  71. [71]

    Goedel Machines: Self-Referential Universal Problem Solvers Making Provably Optimal Self-Improvements

    J. Schmidhuber. Gödel machines: Self-referential universal problem solvers making provably optimal self- improvements.arXiv preprint arXiv:cs/0309048, 2003

  72. [72]

    Poincaré.Science and method

    H. Poincaré.Science and method. Thomas Nelson, 1914. Translated by F. Maitland. Original work published 1908

  73. [73]

    M. Sur, P. E. Garraghty, and A. W. Roe. Experimentally induced visual projections into auditory thalamus and cortex.Science, 242(4884):1437–1441, 1988. 34 Cultivating Machine IntelligenceA PREPRINT

  74. [74]

    G. E. Hinton, J. L. McClelland, and D. E. Rumelhart. Distributed representations. InParallel Distributed Processing: Explorations in the Microstructure of Cognition, volume 1, pages 77–109. MIT Press, 1986

  75. [75]

    Schmidhuber

    J. Schmidhuber. A possibility for implementing curiosity and boredom in model-building neural controllers. In J. A. Meyer and S. W. Wilson, editors,From animals to animats, pages 222–227. MIT Press, 1991

  76. [76]

    Pathak, P

    D. Pathak, P. Agrawal, A. A. Efros, and T. Darrell. Curiosity-driven exploration by self-supervised prediction. Proceedings of the 34th International Conference on Machine Learning, 2017

  77. [77]

    Risks from Learned Optimization in Advanced Machine Learning Systems

    E. Hubinger, C. van Merwijk, V . Mikulik, J. Skalse, and S. Garrabrant. Risks from learned optimization in advanced machine learning systems.arXiv preprint arXiv:1906.01820, 2019

  78. [78]

    D. J. Chalmers. Facing up to the problem of consciousness.Journal of Consciousness Studies, 2(3):200–219, 1995

  79. [79]

    G. Tononi. An information integration theory of consciousness.BMC Neuroscience, 5(1):42, 2004

  80. [80]

    Dehaene, L

    S. Dehaene, L. Charles, J. R. King, and S. Marti. Toward a computational theory of conscious processing. Current Opinion in Neurobiology, 25:76–84, 2014

Showing first 80 references.