Author: cxdig

Quantifying Human-AI Synergy

Christoph Riedl, Ben Weidmann

We introduce a novel Bayesian Item Response Theory framework to quantify human–AI synergy, separating individual and collaborative ability while controlling for task difficulty in interactive settings. Unlike standard static benchmarks, our approach models human–AI performance as a joint process, capturing both user-specific factors and moment-to-moment fluctuations. We validate the framework by applying it to human–AI benchmark data (n=667) and find significant synergy. We demonstrate that collaboration ability is distinct from individual problem-solving ability. Users better able to infer and adapt to others’ perspectives achieve superior collaborative performance with AI–but not when working alone. Moreover, moment-to-moment fluctuations in perspective taking influence AI response quality, highlighting the role of dynamic user factors in collaboration. By introducing a principled framework to analyze data from human-AI collaboration, interactive benchmarks can better complement current single-task benchmarks and crowd-assessment methods. This work informs the design and training of language models that transcend static prompt benchmarks to achieve adaptive, socially aware collaboration with diverse and dynamic human partners.

https://osf.io/preprints/psyarxiv/vbkmt_v1 

Self-Reinforcing Cascades: A Spreading Model for Beliefs or Products of Varying Intensity or Quality

Laurent Hébert-Dufresne, Juniper Lovato, Giulio Burgio, James P. Gleeson, S. Redner, and P. L. Krapivsky

Phys. Rev. Lett. 135, 087401

Models of how things spread often assume that transmission mechanisms are fixed over time. However, social contagions—the spread of ideas, beliefs, innovations—can lose or gain in momentum as they spread: ideas can get reinforced, beliefs strengthened, products refined. We study the impacts of such self-reinforcement mechanisms in cascade dynamics. We use different mathematical modeling techniques to capture the recursive, yet changing nature of the process. We find a critical regime with a range of power-law cascade size distributions with nonuniversal scaling exponents. This regime clashes with classic models, where criticality requires fine-tuning at a precise critical point. Self-reinforced cascades produce critical-like behavior over a wide range of parameters, which may help explain the ubiquity of power-law distributions in empirical social data.

Read the full article at: link.aps.org

A stochastic theory of urban metabolism

Martin Hendrick, Andrea Rinaldo, and Gabriele Manoli

PNAS 122 (33) e2501224122

Cities can be viewed as living organisms and their metabolism as the set of processes controlling their evolving structure and function. Urban population, transport networks, and all anthropogenic activities have been proposed to mimic body mass, vascular systems, and metabolic rates of living organisms. This analogy is supported by the emergence of seemingly universal scaling laws linking city-scale quantities to population size. However, such scaling relations critically depend on the choices of city boundaries and neglect intraurban variations of urban properties. By capitalizing on today’s availability of high-resolution data, findings emerge on the generality of small-scale covariations in city characteristics and their link to city-wide averages, thus opening broad avenues to understand and design future urban environments.

Read the full article at: www.pnas.org

Automating the Search for Artificial Life With Foundation Models

Akarsh Kumar, Chris Lu, Louis Kirsch, Yujin Tang, Kenneth O. Stanley, Phillip Isola, David Ha

Artificial Life (2025) 31 (3): 368–396.

With the recent Nobel Prize awarded for radical advances in protein discovery, foundation models (FMs) for exploring large combinatorial spaces promise to revolutionize many scientific fields. Artificial Life (ALife) has not yet integrated FMs, thus presenting a major opportunity for the field to alleviate the historical burden of relying chiefly on manual design and trial and error to discover the configurations of lifelike simulations. This article presents, for the first time, a successful realization of this opportunity using vision-language FMs. The proposed approach, called automated search for Artificial Life (ASAL), (a) finds simulations that produce target phenomena, (b) discovers simulations that generate temporally open-ended novelty, and (c) illuminates an entire space of interestingly diverse simulations. Because of the generality of FMs, ASAL works effectively across a diverse range of ALife substrates, including Boids, Particle Life, the Game of Life, Lenia, and neural cellular automata. A major result highlighting the potential of this technique is the discovery of previously unseen Lenia and Boids life-forms, as well as cellular automata that are open-ended like Conway’s Game of Life. Additionally, the use of FMs allows for the quantification of previously qualitative phenomena in a human-aligned way. This new paradigm promises to accelerate ALife research beyond what is possible through human ingenuity alone.

Read the full article at: direct.mit.edu