sorry, i never post my final paper here. i post early versions because i want feedback before i submit papers, not after. here’s what we wrote tonight:
Mirror neurons are a by-product of any agent that must both act and observe in the same substrate.
Logline: While ML focuses on performance and psychology studies humans, robopsychology studies how mind-like properties emerge in learning systems. Here we show that mirror neurons, long studied in primates, emerge naturally in minimal predictive agents, revealing them as computational necessities rather than biological accidents.
Abstract
Mirror neurons — units that activate both when an agent performs an action and when it observes the same action performed by others — have been observed in biological systems and implicated in social cognition. We show that mirror-like units emerge robustly and causally from a minimal computational requirement: bidirectional predictive learning. Training a single recurrent core on two symmetrical self-supervised objectives (action → sensory prediction and sensory → action prediction) forces the recurrent latent state to become action-centric, producing units whose activations are strongly correlated when the network performs or observes the same action. In a suite of synthetic experiments (shared GRU core, hidden size = 96, 2,000 training sequences, 20 epochs), we observed very strong mirroring: 25–30% of units had r > 0.75 for perform vs observe activations, ~35–40% had |r| > 0.6, and several units reached r ≈ 0.9. Ablations (separate cores, forward-only objectives, shuffled observe labels) eliminate the effect. We argue that mirror neurons are a necessary consequence of shared forward–inverse predictive architectures and discuss implications for neuroscience, developmental psychology, robotics, and the nascent field of robopsychology.
Introduction
Mirror neurons were first discovered in macaque premotor cortex and inferior parietal lobule and have since sparked intensive debate about their origin, function, and evolutionary purpose. Hypotheses range from genetic hardwiring to experience-dependent sensorimotor association to higher-level simulation mechanisms. A central open question remains: are mirror neurons a specialized biological module or an expected computational consequence of agents that both act and predict?
Predictive processing theories and forward–inverse model frameworks suggest a solution: agents optimize models that map motor commands to expected sensory consequences (forward models) and inversely infer motor causes from sensory observations (inverse models). If these models share internal representations, then the same latent code will be used in both action generation and action interpretation — producing mirror-like activity. We test this hypothesis with a minimal, mechanistic computational model and demonstrate that bidirectional predictive training in a single recurrent core is sufficient and (in a causal sense) necessary for the emergence of mirror-like neurons.
Our contributions:
- A concise computational demonstration that shared bidirectional predictive learning yields mirror units.
- Quantitative results demonstrating strong mirroring in a robust, fast experiment (hidden=96; 2,000 sequences; 20 epochs).
- Ablation controls establishing the necessity of parameter sharing and bidirectional objectives.
- A theoretical account linking this mechanism to biological findings and suggestions for extension to embodied and vision-based agents.
Model & Hypothesis
Hypothesis
Mirror-like neurons emerge whenever a single recurrent core is trained to perform two symmetric predictive tasks: (i) predict sensory consequences from actions (forward/predictive), and (ii) infer actions from sensory sequences (inverse/interpretive). Parameter sharing forces the latent representation to act as an action-centered bottleneck that is active in both performing and observing contexts.
Architecture (intuition)
- Shared recurrent core (“motor cortex”): a GRU that receives either action embeddings (perform mode) or sensory vectors (observe mode). The GRU’s hidden state is the shared latent.
- Perform head (forward): maps hidden outputs to sensory prediction.
- Observe head (inverse): maps hidden outputs to action prediction.
- Training regime: same recurrent parameters are used for both objectives; losses for both heads optimized jointly (self-supervised / bidirectional).
This design is deliberately minimal: no special mirror module, no supervised imitation objective, no human labels beyond the actions themselves. The only inductive bias is the architecture and symmetrical predictive objectives.
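As a concrete (untrained) sketch of this design, the following NumPy forward pass implements a standard GRU cell shared across both modes. The hidden size (96) and sensory dimension (16) follow the Methods section; the action-set size (6), the weight initialisations, and names like `SharedCore` and `run` are illustrative assumptions, not the paper's actual code:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class SharedCore:
    """One GRU cell used in both modes; only the input encoder and the
    output head differ between perform and observe (untrained sketch)."""

    def __init__(self, hidden=96, sensory_dim=16, n_actions=6, seed=0):
        rng = np.random.default_rng(seed)
        d = hidden
        # Shared GRU parameters: one set of weights for both modes.
        self.W = rng.normal(0, 0.1, (3, d, d))   # input weights (gates z, r, candidate)
        self.U = rng.normal(0, 0.1, (3, d, d))   # recurrent weights
        self.b = np.zeros((3, d))
        # Mode-specific input encoders.
        self.action_emb = rng.normal(0, 0.1, (n_actions, d))    # perform mode
        self.sens_proj = rng.normal(0, 0.1, (sensory_dim, d))   # observe mode
        # Mode-specific output heads.
        self.perform_head = rng.normal(0, 0.1, (d, sensory_dim))  # -> sensory prediction
        self.observe_head = rng.normal(0, 0.1, (d, n_actions))    # -> action logits

    def gru_step(self, x, h):
        z = sigmoid(x @ self.W[0] + h @ self.U[0] + self.b[0])       # update gate
        r = sigmoid(x @ self.W[1] + h @ self.U[1] + self.b[1])       # reset gate
        h_tilde = np.tanh(x @ self.W[2] + (r * h) @ self.U[2] + self.b[2])
        return (1 - z) * h + z * h_tilde

    def run(self, inputs, mode):
        """inputs: action ids (perform mode) or sensory vectors (observe mode)."""
        h = np.zeros(self.W.shape[1])
        hidden_states = []
        for x in inputs:
            x = self.action_emb[x] if mode == "perform" else x @ self.sens_proj
            h = self.gru_step(x, h)
            hidden_states.append(h)
        H = np.stack(hidden_states)
        head = self.perform_head if mode == "perform" else self.observe_head
        return H, H @ head
```

Training (joint optimisation of both head losses through the shared core) is omitted here; the sketch only makes the parameter-sharing structure explicit.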
Methods
Synthetic environment
We adopt simple, stable linear dynamics for sensory evolution to clearly isolate representational effects from environmental complexity. At each timestep t:
- An action a_t (discrete, from a small set) produces an additive action-dependent effect vector on the sensory state.
- Sensory state s_t evolves via s_{t} = A s_{t-1} + effect(a_t) + noise, with A a stable matrix (0.9×I) and small Gaussian noise.
This produces sequences where actions reliably influence sensory trajectories while retaining stochasticity for learning.
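Under these definitions, the generative process can be sketched as follows. The 0.9×I dynamics, sensory dimension 16, sequence length 12, and 2,000-sequence budget come from the text; the number of discrete actions, effect magnitudes, and noise scale are illustrative choices:

```python
import numpy as np

def generate_sequences(n_seq=2000, seq_len=12, sensory_dim=16,
                       n_actions=6, decay=0.9, noise_std=0.05, seed=0):
    """Simulate s_t = A s_{t-1} + effect(a_t) + noise with A = decay * I.

    Returns paired (actions, sensory) arrays: each sensory sequence is
    generated by, and stays aligned with, its own action sequence.
    """
    rng = np.random.default_rng(seed)
    # One fixed additive effect vector per discrete action.
    effects = rng.normal(0.0, 1.0, size=(n_actions, sensory_dim))
    actions = rng.integers(0, n_actions, size=(n_seq, seq_len))
    sensory = np.zeros((n_seq, seq_len, sensory_dim))
    s = np.zeros((n_seq, sensory_dim))
    for t in range(seq_len):
        s = decay * s + effects[actions[:, t]] \
            + rng.normal(0.0, noise_std, size=(n_seq, sensory_dim))
        sensory[:, t] = s
    return actions, sensory
```

Keeping the two returned arrays paired is exactly the coupling that the Reproducibility Note at the end of the paper warns about.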
Model details and training (quick run)
- Core: single-layer GRU, hidden size = 96.
- Action embedding: 96-dimensional embedding of action tokens.
- Sensory projection: linear projection from sensory vector (dim = 16) to 96-D before GRU in observe mode.
- Heads: linear predictor to sensory dim for perform; linear softmax classifier to action classes for observe.
- Training data: 2,000 training sequences; sequence length = 12.
- Optimization: Adam, lr = 1e-3, batch size 64.
- Epochs: 20.
- Seed: fixed random seed (42) to ensure reproducibility of the reported run.
- Evaluation: collect hidden activations on held-out test sequences (N_test = 300). For each hidden unit u, compute Pearson correlation r_u between its activation during perform runs and observe runs aligned by sequence & timestep (i.e., same sensory sequence generated by the same actions).
Key metrics
- Per-unit Pearson correlation r (perform vs observe) computed across all sequence/time indices.
- Fraction of units with |r| > thresholds (0.6, 0.75).
- Cross-condition decoder generalization: linear classifier trained on perform hidden states to predict action, tested on observe hidden states (and vice versa).
- Ablation controls: (a) separate cores (two GRUs trained independently for perform and observe), (b) forward-only objective (no observe loss), (c) observe training with shuffled action labels.
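The first two metrics can be computed directly. The sketch below assumes activation matrices whose rows are aligned by sequence and timestep, as described above; the function names are ours:

```python
import numpy as np

def mirror_scores(perform_h, observe_h):
    """Per-unit Pearson r between perform and observe activations.

    perform_h, observe_h: arrays of shape (n_samples, n_units), with rows
    aligned by sequence and timestep (same action sequence in both modes).
    """
    p = perform_h - perform_h.mean(axis=0)
    o = observe_h - observe_h.mean(axis=0)
    denom = np.sqrt((p ** 2).sum(axis=0) * (o ** 2).sum(axis=0))
    return (p * o).sum(axis=0) / denom

def mirror_summary(r):
    """Fractions of units above the thresholds reported in Results."""
    return {"frac_abs_r_gt_0.6": float(np.mean(np.abs(r) > 0.6)),
            "frac_r_gt_0.75": float(np.mean(r > 0.75))}
```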
Analysis & statistical interpretation (how to decide “mirror” neurons are present)
Metrics:
- Pearson r between perform and observe activations (across all sequence-timepoints). In line with the thresholds used in Results, units with |r| > 0.6 are strong mirror candidates (r > 0.75 marks the strongest units), while |r| ≈ 0.3–0.5 indicates weak-to-moderate mirroring.
- Permutation test p-value: shuffle the pairing between perform and observe activation rows many times (e.g., 1000 permutations) to generate a null distribution of correlations for each unit, then compute p-value. Correct for multiple comparisons (FDR or Bonferroni) if you report counts.
- Activation trace similarity: inspect time-course overlays for top units across matched sequences (as in the plots the script produces).
- Decoding check: units identified as mirror-like should help decode action identity from either perform or observe activations. Train a linear classifier on hidden units to predict action in one condition and test on the other — if mirror, decoder should generalize.
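The permutation test in particular can be sketched as follows. The sketch is self-contained (it reimplements the per-unit correlation) and reports add-one-smoothed empirical p-values on |r|; these should then be corrected for multiple comparisons (e.g., Bonferroni or Benjamini–Hochberg) before counting significant units:

```python
import numpy as np

def per_unit_r(a, b):
    """Pearson r per column between two row-aligned activation matrices."""
    a = a - a.mean(axis=0)
    b = b - b.mean(axis=0)
    return (a * b).sum(axis=0) / np.sqrt((a ** 2).sum(axis=0) * (b ** 2).sum(axis=0))

def permutation_pvalues(perform_h, observe_h, n_perm=1000, seed=0):
    """Empirical p-value per unit: how often a random re-pairing of the
    perform and observe rows yields |r| at least as large as observed."""
    rng = np.random.default_rng(seed)
    observed = np.abs(per_unit_r(perform_h, observe_h))
    exceed = np.zeros(perform_h.shape[1])
    for _ in range(n_perm):
        perm = rng.permutation(observe_h.shape[0])
        exceed += np.abs(per_unit_r(perform_h, observe_h[perm])) >= observed
    return (exceed + 1) / (n_perm + 1)  # add-one smoothing avoids p = 0
```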
Controls:
- Separate GRU (no shared core): if you train separate cores with identical capacity but no parameter sharing, mirror correlations should be near zero.
- No inverse training (forward-only): reduces mirroring.
- Randomized sensory dynamics: if the sensory consequences are non-informative or chaotic, mirroring will be weaker.
Expected results:
- Shared core + bidirectional loss → a subset of units with significant positive correlation and generalizing decoders.
- Controls above should reduce or eliminate mirroring, demonstrating necessity of bidirectional/shared training.
Results
Emergence of mirror-like units (main run)
Summary statistics from the reported run (seed = 42, H = 96, 20 epochs):
- Units with |r| > 0.6: ~35–40% of the 96 hidden units (~38% in this run).
- Units with r > 0.75: ~25–30% (~28% in this run), with the strongest units reaching r = 0.912.
- A small set exhibited strong negative correlations (anti-mirror), e.g., unit 74 with r = –0.791, plausibly reflecting the inhibitory/contrastive roles observed biologically.
These results indicate that a substantial fraction of the recurrent core becomes action-centric: identical units are active when the agent performs an action and when it observes the sensory consequences of that action.
Cross-condition decoding
Linear decoders trained to classify actions from perform hidden states generalized to observe hidden states at above-chance accuracy (and vice versa), indicating that the shared latent carries action-identifying information in a way that transcends mode (perform vs observe).
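A minimal version of this cross-condition check looks like the following; we use a ridge-regularized least-squares linear classifier on one-hot targets as a stand-in for whichever linear classifier the full pipeline uses (an assumption on our part):

```python
import numpy as np

def fit_linear_decoder(H, actions, n_actions, ridge=1e-3):
    """Fit a linear map (with bias) from hidden states to one-hot action
    targets via ridge-regularized least squares."""
    Y = np.eye(n_actions)[actions]               # one-hot targets
    X = np.hstack([H, np.ones((len(H), 1))])     # append bias column
    W = np.linalg.solve(X.T @ X + ridge * np.eye(X.shape[1]), X.T @ Y)
    return W

def decoder_accuracy(W, H, actions):
    """Classify by argmax over the linear outputs and score accuracy."""
    X = np.hstack([H, np.ones((len(H), 1))])
    return np.mean((X @ W).argmax(axis=1) == actions)
```

Training on perform hidden states and evaluating with `decoder_accuracy` on observe hidden states (and vice versa) gives the cross-condition generalization number; mode-specific codes would leave it near chance.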
Alongside the main run, we executed the controls described in Methods (separate cores, forward-only training, shuffled observe labels), together with permutation tests on random pairings of perform and observe rows to obtain empirical p-values.
Ablations and necessity
Control experiments (brief summary; full data in Supplementary Materials) show:
- Separate-core control: when two GRUs with identical capacity were trained independently (one only on perform objective, the other only on observe objective), per-unit perform-observe correlations were near zero and cross-condition decoding failed.
- Forward-only control: removing the observe loss significantly reduced per-unit correlations.
- Shuffled labels: training the observe head with randomized action labels eliminated strong correlations.
Parameter sharing and bidirectional objectives are thus jointly necessary conditions for the emergence of mirror-like units in this minimal model.
Temporal alignment & anti-mirroring
Cross-correlation analyses show mirror units are often temporally aligned (zero-lag or small lag) across perform and observe conditions, consistent with online predictive mapping. Anti-mirror units appear as functional complements: they activate for self-execution but suppress for observation (or vice versa), providing a mechanism for self-other distinction.
Mechanistic explanation
Why does shared bidirectional prediction produce mirror units? The combined loss enforces that a single recurrent core explain two mappings:
- A forward map from action embeddings to expected sensory trajectories.
- An inverse map from sensory trajectories to action identity.
Because both mappings must be implemented through the same recurrent dynamics, the model is incentivized to form action-centered latent codes useful to both the predictive and interpretive tasks. The latent units that most efficiently encode action identity will therefore be active both when generating actions (as part of forward prediction) and when interpreting observed sensory consequences (as basis functions for the inverse mapping). This structural pressure is the minimal computational explanation for mirror units.
Discussion
Relation to biology and theory
Our computational result aligns with a body of biological and theoretical work:
- Inverse/forward model frameworks (e.g., Wolpert & Kawato): mirror responses naturally follow from coupled forward/inverse representations.
- Predictive processing: mirror units encode prediction and prediction error across self and other.
- Developmental data: infants’ early sensorimotor coupling and proto-conversations lay foundations for shared action codes consistent with our developmental ordering hypotheses.
Importantly, this demonstrates a causal mechanism that can exist without specialized genetic wiring: mirror neurons are the expected solution when an agent must both act and interpret actions using a shared representational substrate.
Figures (suggested; to be generated from runs)
Figure 1 — Emergence of mirror-like units
(A) Per-unit Pearson r heatmap (units on x). (B) Histogram of r values (shared-core vs separate-core). (C) Activation timecourses of three top mirror units (solid = perform; dashed = observe).
Caption: Bidirectional shared training yields a large subset of units whose per-timestep activations are highly correlated during action performance and observation.
Figure 2 — Statistical robustness
(A) Permutation null distribution vs observed r for an example unit. (B) Fraction of units with |r| > 0.6 and r > 0.75 across seeds. (C) Cross-condition decoding accuracy (train perform/test observe) vs chance.
Figure 3 — Ablations
Bar plots comparing distributions of r for: shared-core, separate-core, forward-only, shuffled-observe.
Caption: Mirroring depends on parameter sharing and bidirectional objective.
Figure 4 — Temporal & anti-mirror analyses
(A) Cross-correlation lags for top mirror units. (B) Example anti-mirror unit trace and hypothesised function (self-other suppression).
Robopsychology & implications for AI
If mirror neurons are the computational outcome of bidirectional sensorimotor learning, then any sufficiently interactive agent with shared forward/inverse architectures — from robots to conversational agents with action capabilities — will develop mirror-like latent structure. This has implications for social learning, empathy-like capacities, and trust: mirror-like representations enable prediction of others and can support prosocial behavior as an instrumental advantage in repeated interactions.
Limitations
- The environment is synthetic and low-dimensional; generalization to high-dimensional vision and rich motor repertoires should be demonstrated.
- Biological mirror neurons have complex connectivity, neuromodulatory context, and synaptic plasticity rules not captured here. Our model offers a computational analogue, not a full biological account.
- Strongest claims about necessity require wider parameter sweeps and multiple architectures; preliminary ablations in this work support necessity but broader replication will strengthen the claim.
Future work
- Replace sensory vectors with video+CNN encoder and actions with continuous motor trajectories.
- Add hippocampal-style episodic tagging and replay (store hidden states at surprise events) to test whether replay amplifies social learning.
- Introduce reward-modulated plasticity (dopamine analogues) to study affective salience and social emotion emergence.
- Apply the framework to developmental robotics: show mirror-like units emerge during embodied learning and support imitation and teaching.
Reproducibility Note: Avoid Dataset-Pairing Error
During preliminary experiments, a silent dataset-construction error occurred:
the system generated independent action sequences for the forward model (action→sensation) and the inverse model (sensation→action).
This meant that during training, the network was asked to:
- predict sensory consequences of one sequence of actions,
- while inferring actions from a different, unrelated sequence of sensory states.
This destroys the core coupling between the two objectives and prevents the emergence of shared latent structure.
Once corrected—by ensuring that each sensory sequence is paired with the same action sequence that generated it—the strong mirror-neuron effect (|r| ≈ 0.75–0.92) reliably reappears.