The Core Prediction
The Core Prediction
The claim is not merely that affect structure, language, and behavior should “correlate.” Correlation is weak—marginal correlations can arise from confounds. The claim is geometric: the distance structure in the information-theoretic affect space should be isomorphic to the distance structure in the embedding-predicted affect space. Not just “these two things covary,” but “these two spaces have the same shape.”
To test this, let be the information-theoretic affect vector for agent-state , computed from internal dynamics (viability gradient, belief update rate, partition loss, eigenvalue distribution, simulation fraction, self-model MI). Let be the affect embedding predicted from the VLM-translated situation description, projected into a standardized affect concept space.
For agent-states sampled across diverse situations, compute pairwise distance matrices:
The prediction: Representational Similarity Analysis (RSA) correlation between the upper triangles of these matrices exceeds the null:
where is established by permutation (Mantel test).
This is strictly stronger than marginal correlation. Two spaces can have correlated means but completely different geometries. RSA tests whether states that are nearby in one space are nearby in the other—whether the topology is preserved.
The specific predictions that fall out: when the affect vector shows the suffering motif—negative valence, collapsed effective rank, high integration, high self-model salience—the embedding-predicted vector should land in the same region of affect concept space. States with the joy motif—positive valence, expanded rank, low self-salience—should cluster together in both spaces. And crucially, the distances between suffering and joy, between fear and curiosity, between boredom and rage, should be preserved across the two measurement modalities.
Not because we trained them to match. Because the structure is the experience is the expression.