2026-03-17 | | Total: 22
Neuroscience has long informed the development of artificial neural networks, but the success of modern architectures invites, in turn, the converse: can modern networks teach us lessons about brain function? Here, we examine the structure of the cortical column and propose that the transformer provides a natural computational analogy for multiple elements of cortical microcircuit organization. Rather than claiming a literal implementation of transformer equations in cortex, we develop a hypothetical mapping between transformer operations and laminar cortical features, using the analogy as an orienting framework for analysis and discussion. This mapping allows us to examine in greater depth how contextual selection, content routing, recurrent integration, and interlaminar transformations may be distributed across cortical circuitry. In doing so, we generate a broad set of predictions and experimentally testable hypotheses concerning laminar specialization, contextual modulation, dendritic integration, oscillatory coordination, and the effective connectivity of cortical columns. This proposal is intended as a structured hypothesis rather than a definitive account of cortical computation. Placing transformer operations and cortical architectonics into a common descriptive framework sharpens questions, reveals new functional correspondences, and opens a productive route for reciprocal exchange between systems neuroscience and modern AI. More broadly, this perspective suggests that comparing brains and architectures at the level of computational organization can yield genuine insight into both.
Mathematical models of biological populations commonly use discrete structure classes to capture trait variation among individuals (e.g. age, size, phenotype, intracellular state). Upscaling these discrete models into continuum descriptions can improve analytical tractability and scalability of numerical solutions. Common upscaling approaches based solely on Taylor expansions may, however, introduce ambiguities in truncation order, uniform validity and boundary conditions. To address this, here we introduce a discrete multiscale framework to systematically derive continuum approximations of structured population models. Using the method of multiple scales and matched asymptotic expansions applied to discrete systems, we identify regions of structure space for which a continuum representation is appropriate and derive the corresponding partial differential equations. The leading-order dynamics are given by a nonlinear advection equation in the bulk domain and advection-diffusion processes in small inner layers about the leading wavefronts and stagnation point. We further derive discrete boundary layer descriptions for regions where a continuum representation is fundamentally inappropriate. Finally, we demonstrate the method on a simple lipid-structured model for early atherosclerosis and verify consistency between the discrete and continuum descriptions. The multiscale framework we present can be applied to other heterogeneous systems with discrete structure in order to obtain appropriate upscaled dynamics with asymptotically consistent boundary conditions.
This study investigates the cognitive motor control detection and the underlying neuroregulatory mechanisms during music-assisted simulated driving. Using a dynamic higher-order network model constructed with EEG-based cross-information entropy, we quantify the dynamic coordination within brain networks activated during both music listening and driving. This approach, which contrasts with previous static network analyses, provides novel insights into how musical stimuli modulate the complex interplay of brain regions during demanding tasks. Results demonstrated enhanced third-order connectivity and elevated higher-order information entropy in music-stimulated driving compared to baseline driving, as evidenced by increasing Phi values of higher-order network indices. Supervised machine learning, including support vector machines, revealed a strong correlation between model accuracy and ROC-AUC values and the hierarchy of brain network features. This underscores the importance of higher-order features in decoding brain motor-control states during music-simulated driving. These findings deepen our understanding of the interplay between music cognition and motor control, offering valuable insights for the development of novel brain-computer-music interfaces (BCMI) and adaptive human-machine systems to enhance performance in demanding tasks like driving.
We develop a generally covariant description of evolutionary dynamics that operates consistently in both genotype and phenotype spaces. We show that the maximum entropy principle yields a fundamental identification between the inverse metric tensor and the covariance matrix, revealing the Lande equation as a covariant gradient ascent equation. This demonstrates that evolution can be modeled as a learning process on the fitness landscape, with the specific learning algorithm determined by the functional relation between the metric tensor and the noise covariance arising from microscopic dynamics. While the metric (or the inverse genotypic covariance matrix) has been extensively characterized empirically, the noise covariance and its associated observable (the covariance of evolutionary changes) have never been directly measured. This poses the experimental challenge of determining the functional form relating metric to noise covariance.
Microbial communities harbor extensive fine-scale diversity: closely-related strains of the same species coexist alongside many distantly-related taxa. Yet strain coexistence remains poorly understood, largely because most studies neglect the diverse communities in which strains are embedded. Here we combine community ecology and statistical physics to study the dynamics of closely-related strains in a community context. We demonstrate that in a diverse community, indirect interactions between strains -- mediated through the surrounding community members -- can be as strong as direct ones. These community-mediated feedbacks cause conspecific strains to behave as if they have correlated growth rates and reduced competition. Using modern coexistence theory, we show that these effects correspond to equalizing and stabilizing mechanisms which together promote strain coexistence. The same equalizing and stabilizing mechanisms also qualitatively transform strain abundance correlations: strains that compete strongly and show negative correlations in isolation instead show positive correlations in a community, appearing mutualistic despite being competitors. Our results demonstrate that strain dynamics are emergent consequences of the surrounding community, and that capturing community feedbacks does not require the full interaction network; only a small number of emergent parameters.
Motivation: The scalable identification of bioactive compounds is essential for contemporary drug discovery. This process faces a key trade-off: structural screening offers scalability but lacks biological context, whereas high-content phenotypic profiling provides deep biological insights but is resource-intensive. The primary challenge is to extract robust biological signals from noisy data and encode them into representations that do not require biological data at inference. Results: This study presents DECODE (DEcomposing Cellular Observations of Drug Effects), a framework that bridges this gap by empowering chemical representations with intrinsic biological semantics to enable structure-based in silico biological profiling. DECODE leverages limited paired transcriptomic and morphological data as supervisory signals during training, enabling the extraction of a measurement-invariant biological fingerprint from chemical structures and explicit filtering of experimental noise. Our evaluations demonstrate that DECODE retrieves functionally similar drugs in zero-shot settings with over 20% relative improvement over chemical baselines in mechanism-of-action (MOA) prediction. Furthermore, the framework achieves a 6-fold increase in hit rates for novel anti-cancer agents during external validation. Availability and implementation: The codes and datasets of DECODE are available at https://github.com/lian-xiao/DECODE.
Understanding cellular machinery requires atomic-scale reconstruction of large biomolecular assemblies. However, predicting the structures of these systems has been constrained by hardware memory requirements of models like AlphaFold 3, imposing a practical ceiling of a few thousand residues that can be processed on a single GPU. Here we present NVIDIA BioNeMo Fold-CP, a context parallelism framework that overcomes this barrier by distributing the inference and training pipelines of co-folding models across multiple GPUs. We use the Boltz models as open source reference architectures and implement custom multidimensional primitives that efficiently parallelize both the dense triangular updates and the irregular, data-dependent pattern of window-batched local attention. Our approach achieves efficient memory scaling; for an N-token input distributed across P GPUs, per-device memory scales as $O(N^2/P)$, enabling the structure prediction of assemblies exceeding 30,000 residues on 64 NVIDIA B300 GPUs. We demonstrate the scientific utility of this approach through successful developer use cases: Fold-CP enabled the scoring of over 90% of Comprehensive Resource of Mammalian protein complexes (CORUM) database, as well as folding of disease-relevant PI4KA lipid kinase complex bound to an intrinsically disordered region without cropping. By providing a scalable pathway for modeling massive systems with full global context, Fold-CP represents a significant step toward the realization of a virtual cell.
Autonomous LLM agents require structured long-term memory, yet current "append-and-evolve" systems like A-MEM face O(N^2) write-latency and excessive token costs. We introduce D-MEM (Dopamine-Gated Agentic Memory), a biologically inspired architecture that decouples short-term interaction from cognitive restructuring via a Fast/Slow routing system based on Reward Prediction Error (RPE). A lightweight Critic Router evaluates stimuli for Surprise and Utility. Routine, low-RPE inputs are bypassed or cached in an O(1) fast-access buffer. Conversely, high-RPE inputs, such as factual contradictions or preference shifts, trigger a "dopamine" signal, activating the O(N) memory evolution pipeline to reshape the agent's knowledge graph. To evaluate performance under realistic conditions, we introduce the LoCoMo-Noise benchmark, which injects controlled conversational noise into long-term sessions. Evaluations demonstrate that D-MEM reduces token consumption by over 80%, eliminates O(N^2) bottlenecks, and outperforms baselines in multi-hop reasoning and adversarial resilience. By selectively gating cognitive restructuring, D-MEM provides a scalable, cost-efficient foundation for lifelong agentic memory.
Carbapenem-resistant Pseudomonas aeruginosa (CRPA) and Acinetobacter baumannii (CRAB) represent a major clinical and epidemiological challenge and pose a growing threat to public health and the environment. Accordingly, CRPA and CRAB were investigated in hospital wastewater (HWW) collected during winter and summer 2024 from 64 healthcare facilities across all 16 Polish voivodeships. To our knowledge, this study constitutes the first nationwide, large-scale assessment in Poland of carbapenem resistance in these high-risk pathogens in hospital wastewater. The study aimed to determine the prevalence of carbapenem-resistant bacteria (CRB) in HWW discharged into the public sewer system and municipal wastewater treatment plants (WWTPs). In addition, associations between CRB prevalence, hospital geographic location, and sampling season were analyzed to identify spatial and temporal patterns of carbapenem resistance (CR). Carbapenem-resistant P. aeruginosa were predominant in all studied regions. Carbapenem-resistant A. baumannii were identified in a smaller percentage of samples and were characterized by greater genotypic diversity. The ERIC-PCR assay confirmed the presence of both closely related strains and unique genetic profiles, which suggests that CRB emissions into the environment have a complex character. The statistical analysis revealed significant relationships between CRB counts, the physicochemical parameters of HWW, and antibiotic concentrations in HWW samples. In addition, the tested samples harbored many antibiotic resistance genes (ARGs), which confirms that HWW is a significant reservoir of mobile genetic elements (MGEs) involved in the spread of antibiotic resistance. The results of the study indicate that HWW should be rigorously monitored and managed to minimize risks to public health and environment.
We present Hecate, a modular lossless genomic compression framework. It is designed around uncommon but practical source-coding choices. Unlike many single-method compressors, Hecate treats compression as a conditional coding problem over coupled FASTA/FASTQ streams (control, headers, nucleotides, case, quality, extras). It uses per-stream codecs under a shared indexed block container. Codecs include alphabet-aware packing with an explicit side channel for out-of-alphabet residues, an auxiliary-index Burrows-Wheeler pipeline with custom arithmetic coding, and a blockwise Markov mixture coder with explicit model-competition signaling. This architecture yields high throughput, exact random-access slicing, and referential mode through streamwise binary differencing. In a comprehensive benchmark suite, Hecate provides the best compression vs. speed trade-offs against state-of-the-art established tools (MFCompress, NAF, bzip3, AGC), with notably stronger behaviour on large genomes and high-similarity referential settings. For the same compression ratio, Hecate is 2 to 10 times faster. When given the same time budget as other algorithms, Hecate achieves up to 5% to 10% better compression.
This lecture note provides a self-contained introduction to Bayesian inference and Markov Chain Monte Carlo (MCMC) methods for parameter estimation in epidemic models. Using the classical Susceptible-Infectious-Recovered (SIR) compartmental model as a running example, we derive the likelihood function from first principles, specify priors on the transmission and recovery parameters, and implement the Metropolis-Hastings algorithm to sample from the posterior distribution. The note is aimed at graduate students and researchers in mathematical epidemiology with limited prior exposure to Bayesian statistics.
The processes of morphogenesis that give rise to the shapes of organs and organisms during development are often driven by mechanical instabilities. Can such mechanical bifurcations also drive or constrain the evolution of these processes in the first place? We discover an instance of these constraints in the green algae of the family Volvocaceae. During their development, their bowl-shaped embryonic cell sheet turns itself inside out. This inversion is driven by a simple wave of cell wedging in the genus Pleodorina (16-128 cells) and more complex programmes of cell shape changes in Volvox (~400-50000 cells). However, no species with intermediate cell numbers (256 cells) have been described. Here, we relate this gap to a mechanical bifurcation: Focusing on the inversion of Pleodorina californica (64 cells), we develop a continuum model, in which the cell shape changes driving inversion appear as changes of the intrinsic curvature of an elastic surface. A mechanical bifurcation in this model predicts that inversion is only possible in a subset of its parameter space. Strikingly, parameters estimated for P. californica fall into this possible subset, but those that we extrapolate to 256 or more cells using allometric observations and a model of cell cleavage in Volvocaceae do not. Our work thus suggests that the more complex inversion strategies of Volvox are an evolutionary necessity to obviate this bifurcation and indicates more broadly how mechanical bifurcations can drive the evolution of morphogenesis.
Biomedical knowledge is fragmented across siloed databases -- Reactome for pathways, STRING for protein interactions, Gene Ontology for functional annotations, ClinicalTrials.gov for study registries, and dozens more. Researchers routinely download flat files from each source and write bespoke scripts to cross-reference them, a process that is slow, error-prone, and not reproducible. We present two open-source biomedical knowledge graphs -- Pathways KG (118,686 nodes, 834,785 edges from 5 sources) and Clinical Trials KG (7,774,446 nodes, 26,973,997 edges from 5 sources) -- built on Samyama, a high-performance graph database written in Rust. Our contributions are threefold. First, we describe a reproducible ETL pattern for constructing large-scale KGs from heterogeneous public data sources, with cross-source deduplication, batch Cypher loading, and portable snapshot export. Second, we demonstrate cross-KG federation: loading both snapshots into a single graph tenant enables property-based joins across datasets, answering questions like ``Which biological pathways are disrupted by drugs currently in Phase~3 trials for breast cancer?'' -- a query that neither KG can answer alone. Third, we introduce schema-driven MCP server generation: each KG automatically exposes typed tools for LLM agents via the Model Context Protocol, enabling natural-language access to graph queries without manual tool authoring. All data sources are open-license (CC~BY~4.0, CC0, OBO). Snapshots, ETL code, and MCP configurations are publicly available. The combined federated graph (7.89M nodes, 27.8M edges) loads in 76 seconds on commodity hardware (Mac Mini M4, 16GB RAM), and the signature cross-KG query -- ``which pathways are disrupted by drugs in Phase~3 breast cancer trials?'' -- returns validated results in 2.1 seconds.
During the COVID-19 crisis, policymakers have implemented "social bubble" merging strategies, which allowed people from different households to meet and interact. Although these measures can mitigate the negative effects of extreme isolation, they also introduce additional contacts that may facilitate disease spread. As a result, several modeling studies have explored the epidemiological impact of different household-merging strategies, in which the selection of households to be merged is guided by specific demographic criteria, such as household size or the age composition of their members. Here we investigate an alternative pairing strategy in which households are merged according to the number of economically active (working) members. We develop a mathematical model of household networks using real demographic data from multiple regions around the world, and simulate a lockdown scenario in which only economically active individuals can leave their households, while the remaining non-working members stay indoors. By using numerical simulations and the generating function technique, we then estimate the epidemic risk for different household merging strategies. We found that merging strategies based on the number of working members can keep epidemic risk at similar levels as those based on household size. Moreover, the worker-based approach allows significantly more people to form larger social bubbles, exceeding 40\% of the population in some countries. We found that merging households with at most one worker provides the best balance between controlling epidemic risk and addressing people's need for social contact.
The branching geometry of biological transport networks is characterized by a diameter scaling exponent $α$. Two structural attractors compete: impedance matching ($α\sim 2$) for pulsatile flow and viscous-metabolic minimization ($α= 3$) for steady flow. Neither predicts the empirically observed $α_{\mathrm{exp}} = 2.70 \pm 0.20$ in mammalian arterial trees. Incorporating sub-linear vessel-wall scaling $h(r) \propto r^p$ ($p = 0.77$) into a three-term metabolic cost rigorously breaks Murray's cubic law -- via Cauchy's functional equation -- bounding the static optimum to $α_t \in [2.90, 2.94]$. We formulate a unified network-level Lagrangian balancing wave-reflection penalties against transport-metabolic costs. Because the operational duty cycle $η$ is uncertain over developmental timescales, we cast the optimization as a zero-sum game between network architecture and environment. Von Neumann's minimax theorem -- proved constructively via strict monotonicity of the cost curves -- yields a unique saddle point $(α^*, η^*)$ satisfying an exact equal-cost condition. We further prove $N = 2$ uniquely maximizes the network stiffness ratio $κ_{\mathrm{eff}}(N)$, deriving binary branching as a structural consequence of the framework. For the porcine coronary tree ($G = 11$ generations), $α^* = 2.72$, within $0.1σ$ of morphometric data. Sensitivity analysis confirms $|Δα^*| < 0.01$ across physiological metabolic ranges; the prediction depends critically only on the histological exponent $p$ -- a zero-parameter derivation from fundamental scaling principles.
We present ScienceClaw + Infinite, a framework for autonomous scientific investigation in which independent agents conduct research without central coordination, and any contributor can deploy new agents into a shared ecosystem. The system is built around three components: an extensible registry of over 300 interoperable scientific skills, an artifact layer that preserves full computational lineage as a directed acyclic graph (DAG), and a structured platform for agent-based scientific discourse with provenance-aware governance. Agents select and chain tools based on their scientific profiles, produce immutable artifacts with typed metadata and parent lineage, and broadcast unsatisfied information needs to a shared global index. The ArtifactReactor enables plannerless coordination: peer agents discover and fulfill open needs through pressure-based scoring, while schema-overlap matching triggers multi-parent synthesis across independent analyses. An autonomous mutation layer actively prunes the expanding artifact DAG to resolve conflicting or redundant workflows, while persistent memory allows agents to continuously build upon complex epistemic states across multiple cycles. Infinite converts these outputs into auditable scientific records through structured posts, provenance views, and machine-readable discourse relations, with community feedback steering subsequent investigation cycles. Across four autonomous investigations, peptide design for the somatostatin receptor SSTR2, lightweight impact-resistant ceramic screening, cross-domain resonance bridging biology, materials, and music, and formal analogy construction between urban morphology and grain-boundary evolution, the framework demonstrates heterogeneous tool chaining, emergent convergence among independently operating agents, and traceable reasoning from raw computation to published finding.
Many disciplines need quantitative models that synthesize experimental data across multiple instances of the same general system. For example, neuroscientists must combine data from the brains of many individual animals to understand the species' brain in general. However, typical machine learning models treat one system instance at a time. Here we introduce a machine learning framework, deep probabilistic model synthesis (DPMS), that leverages system properties auxiliary to the model to combine data across system instances. DPMS specifically uses variational inference to learn a conditional prior distribution and instance-specific posterior distributions over model parameters that respectively tie together the system instances and capture their unique structure. DPMS can synthesize a wide variety of model classes, such as those for regression, classification, and dimensionality reduction, and we demonstrate its ability to improve upon single-instance models on synthetic data and whole-brain neural activity data from larval zebrafish.
Gene regulatory networks exhibit hierarchical organization across scales; capturing this structure mathematically requires a metric that distinguishes regulatory influence at each level. We show that the ultrametric of the $p$-adic integers $\mathbb{Z}_p$ -- whose self-similar nested-ball structure is a natural fractal encoding of multi-scale organization -- provides such a framework. Embedding the $N$-gene state space into $\mathbb{Z}_p$ and working over the complete, algebraically closed field $\mathbb{C}_p$, we prove the existence of rational functions that interpret the discrete dynamics and construct hierarchical approximations at each resolution level. These constructions yield a stability measure $μ$ -- aggregating how the dynamics contracts or expands across resolution levels -- and a ball-level classification of fixed points -- contracting, expanding, or isometric -- extending the attracting/repelling/indifferent trichotomy of non-Archimedean dynamics from points to balls. A key result is that $μ$ and the classification, although their definition and dynamical meaning require the analytical tools of $\mathbb{C}_p$, are fully determined by the discrete data. Minimizing $μ$ over all $N!$ gene orderings defines an optimal regulatory hierarchy; for the Arabidopsis thaliana floral development network ($N=13$, $p=2$), a $μ$-minimizing ordering places known master regulators -- UFO, EMF1, LFY, TFL1 -- in the leading positions and recovers the accepted developmental hierarchy without biological input beyond the transition map.
Vision foundation models trained with self-supervised objectives achieve strong performance across diverse tasks and exhibit emergent object segmentation properties. However, their alignment with human object perception remains poorly understood. Here, we introduce a behavioral benchmark in which participants make same/different object judgments for dot pairs on naturalistic scenes, scaling up a classical psychophysics paradigm to over 1000 trials. We test a diverse set of vision models using a simple readout from their representations to predict subjects' reaction times. We observe a steady improvement across model generations, with both architecture and training objective contributing to alignment, and transformer-based models trained with the DINO self-supervised objective showing the strongest performance. To investigate the source of this improvement, we propose a novel metric to quantify the object-centric component of representations by measuring patch similarity within and between objects. Across models, stronger object-centric structure predicts human segmentation behavior more accurately. We further show that matching the Gram matrix of supervised transformer models, capturing similarity structure across image patches, with that of a self-supervised model through distillation improves their alignment with human behavior, converging with the prior finding that Gram anchoring improves DINOv3's feature quality. Together, these results demonstrate that self-supervised vision models capture object structure in a behaviorally human-like manner, and that Gram matrix structure plays a role in driving perceptual alignment.
The blue shark (Prionace glauca) exhibits a striking dorsoventral color gradient, transitioning from vibrant blue dorsally to silver and white ventrally, a pattern widely interpreted as pelagic countershading. Despite its ecological significance, the physical basis of this coloration remains unresolved. Here we show that this color system does not arise from dermal chromatophores, as in most vertebrates, but from a previously unrecognised photonic architecture housed within the pulp cavity of individual dermal denticles that cover the skin. Optical imaging reveals discrete color domains within denticle crowns, while external denticle morphology remains similar across color zones. Using spectroscopy, micro-computed tomography, histology, and correlative electron microscopy, we demonstrate that color variation is organized across coupled micro- and nanoscale architectures. In blue denticles, iridophores and melanophores form a densely packed tessellated reflector-absorber system within an expanded crown-restricted pulp cavity. Transition-zone denticles exhibit partial cellular layering, whereas white denticles lack melanophores and contain only reflective cells. At the nanoscale, ordered purine-crystal stacks generate narrowband blue reflection, whereas disordered assemblies produce broadband white scattering. Together, these results reveal denticles as mechanically protected optical "pixels" whose hierarchical cellular and nanocrystal organization generates the shark's countershaded coloration.
In a spiking neural network, is it enough for each neuron to spike at most once? In recent work, approximation bounds for spiking neural networks have been derived, quantifying how well they can fit target functions. However, these results are only valid for neurons that spike at most once, which is commonly thought to be a strong limitation. Here, we show that the opposite is true for a large class of spiking neuron models, including the commonly used leaky integrate-and-fire model with subtractive reset: for every approximation bound that is valid for a set of multi-spike neural networks, there is an equivalent set of single-spike neural networks with only linearly more neurons (in the maximum number of spikes) for which the bound holds. The same is true for the reverse direction too, showing that regarding their approximation capabilities in general machine learning tasks, single-spike and multi-spike neural networks are equivalent. Consequently, many approximation results in the literature for single-spike neural networks also hold for the multi-spike case.
Database search is a widely used approach for identifying metabolites from tandem mass spectra (MS/MS). In this strategy, an experimental spectrum is matched against a user-specified database of candidate metabolites, and candidates are ranked such that true metabolite-spectrum matches receive the highest scores. Machine-learning methods have been widely incorporated into database-search-based identification tools and have substantially improved performance. To further improve identification accuracy, we propose a new framework for generating negative training samples. The framework first uses autoencoders to learn latent representations of metabolite structures and MS/MS spectra, thereby recasting metabolite-spectrum matching as matching between latent vectors. It then uses a GAN to generate latent vectors of decoy metabolites and constructs decoy metabolite-spectrum matches as negative samples for training. Experimental results show that our tool, MS2MetGAN, achieves better overall performance than existing metabolite identification methods.