Quantitative Biology
- [1] arXiv:2405.08391 [pdf, ps, other]
-
Title: Cerebralization of mathematical quantities and physical features in neural science: a critical evaluationLaurent Goffart (CGGG)Subjects: Neurons and Cognition (q-bio.NC)
At the turn of the 20th century, Henri Poincar{é} explained that geometry is a convention and that the properties of space and time are the properties of our measuring instruments. Intriguingly, numerous contemporary authors argue that space, time and even number are ''encoded'' within the brain, as a consequence of evolution, adaptation and natural selection. In the neuroscientific study of movement generation, the activity of neurons would ''encode'' kinematic parameters: when they emit action potentials, neurons would ''speak'' a language carrying notions of classical mechanics. In this article, we shall explain that the movement of a body segment is the ultimate product of a measurement, a filtered numerical outcome of multiple processes taking place in parallel in the central nervous system and converging on the groups of neurons responsible for muscle contractions. The fact that notions of classical mechanics efficiently describe movements does not imply their implementation in the inner workings of the brain. Their relevance to the question how the brain activity enables one to produce accurate movements is questioned within the framework of the neurophysiology of orienting gaze movements toward a visual target.
- [2] arXiv:2405.08397 [pdf, ps, other]
-
Title: Self-supervised contrastive learning unveils cortical folding pattern linked to prematurityJulien Laval (BAOBAB), Aymeric Gaudin (BAOBAB), Vincent Frouin (BAOBAB), Jessica Dubois (UNIACT), Andrea Gondova (UNIACT), Jean-François Mangin (BAOBAB), Joël Chavas (BAOBAB), Denis Rivière (BAOBAB)Journal-ref: MIDL 2024, Jul 2024, Paris, FranceSubjects: Neurons and Cognition (q-bio.NC)
Brain folding patterns have been reported to carry clinically relevant information. The brain folds mainly during the last trimester of pregnancy, and the process might be durably disturbed by preterm birth. Yet little is known about preterm-specific patterns. In this work, we train a self-supervised model (SimCLR) on the UKBioBank cohort (21070 adults) to represent the right superior temporal sulcus (STS) region and apply it to sulci images of 374 babies from the dHCP database, containing preterms and full-terms, and acquired at 40 weeks post-menstrual age. We find a lower variability in the preterm embeddings, supported by the identification of a knob pattern, missing in the extremely preterm population.
- [3] arXiv:2405.08523 [pdf, ps, html, other]
-
Title: How forest insect outbreaks depend on forest size and tree distribution: an individual-based model resultsSubjects: Populations and Evolution (q-bio.PE); Probability (math.PR)
In this work, an individual-based model of forest insect outbreaks is presented. The results obtained show that the outbreak is an emerging feature of the system. It is a common product of the characteristics of insects, the environment in which the insects live, and the way insects behave in it. The outbreak dynamics is an effect of scale. In a sufficiently large forest regardless of the density of trees and their spatial distribution, provided that the range of insect dispersion is large enough, it develops in the form of an outbreak. In very small forests, the dynamics becomes more chaotic. It loses the outbreak character and, especially in the forest with random tree distribution, there is a possibility that the insect population goes extinct. The local dynamics of the number of insects on one tree in a forest, where the dynamics of all insects has the character of outbreak, is characterized by a rapid increase in number and then a rapid decrease until the extinction of the local population. It is the result of the influx of immigrants from neighboring trees. The type of tree distribution in the forest becomes visible when the density of trees becomes low and/or the range of insect dispersion is small. When trees are uniformly distributed and the range of insect dispersion is small, the system persists as a set of more or less isolated local populations. In the forest with randomly distributed trees, the insect population becomes more susceptible to extinction when the tree density and/or range of insect dispersion are small.
- [4] arXiv:2405.08601 [pdf, ps, other]
-
Title: The Requirement for Cognition, in an EquationComments: 12 pagesSubjects: Neurons and Cognition (q-bio.NC); Populations and Evolution (q-bio.PE)
A model of the evolution of cognition is used to derive a Requirement Equation (RE), which defines what computations the fittest possible brain must make, or must choose actions as if it had made those computations. The terms in the RE depend on factors outside an animals brain, which can be modelled without making assumptions about how the brain works, from knowledge of the animals habitat and biology. In simple domains where the choices of actions have small information content, it may not be necessary to build internal models of reality; short cut computations may be just as good at choosing actions. In complex domains such as 3D spatial cognition, which underpins many complex choices of action, the RE implies that brains build Bayesian internal models of the animals surroundings; and that the models are constrained to be true to external reality.
- [5] arXiv:2405.08735 [pdf, ps, other]
-
Title: Competition in the nutrient-driven self-cycling fermentation processComments: 17 pages, 2 figuresSubjects: Populations and Evolution (q-bio.PE); Dynamical Systems (math.DS)
Self-cycling fermentation is an automated process used for culturing microorganisms. We consider a model of $n$ distinct species competing for a single non-reproducing nutrient in a self-cycling fermentor in which the nutrient level is used as the decanting condition. The model is formulated in terms of impulsive ordinary differential equations. We prove that two species are able to coexist in the fermentor under certain conditions. We also provide numerical simulations that suggest coexistence of three species is possible and that competitor-mediated coexistence can occur in this case. These results are in contrast to the chemostat, the continuous analogue, where multiple species cannot coexist on a single nonreproducing nutrient.
New submissions for Wednesday, 15 May 2024 (showing 5 of 5 entries )
- [6] arXiv:2405.08031 (cross-list from cs.LG) [pdf, ps, other]
-
Title: HGTDR: Advancing Drug Repurposing with Heterogeneous Graph TransformersComments: Accepted for Publication in Bioinformatics (11-Feb-2024)Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
Motivation: Drug repurposing is a viable solution for reducing the time and cost associated with drug development. However, thus far, the proposed drug repurposing approaches still need to meet expectations. Therefore, it is crucial to offer a systematic approach for drug repurposing to achieve cost savings and enhance human lives. In recent years, using biological network-based methods for drug repurposing has generated promising results. Nevertheless, these methods have limitations. Primarily, the scope of these methods is generally limited concerning the size and variety of data they can effectively handle. Another issue arises from the treatment of heterogeneous data, which needs to be addressed or converted into homogeneous data, leading to a loss of information. A significant drawback is that most of these approaches lack end-to-end functionality, necessitating manual implementation and expert knowledge in certain stages. Results: We propose a new solution, HGTDR (Heterogeneous Graph Transformer for Drug Repurposing), to address the challenges associated with drug repurposing. HGTDR is a three-step approach for knowledge graph-based drug re-purposing: 1) constructing a heterogeneous knowledge graph, 2) utilizing a heterogeneous graph transformer network, and 3) computing relationship scores using a fully connected network. By leveraging HGTDR, users gain the ability to manipulate input graphs, extract information from diverse entities, and obtain their desired output. In the evaluation step, we demonstrate that HGTDR performs comparably to previous methods. Furthermore, we review medical studies to validate our method's top ten drug repurposing suggestions, which have exhibited promising results. We also demon-strated HGTDR's capability to predict other types of relations through numerical and experimental validation, such as drug-protein and disease-protein inter-relations.
- [7] arXiv:2405.08040 (cross-list from physics.soc-ph) [pdf, ps, html, other]
-
Title: No evidence of systematic proximity ascertainment bias in early COVID-19 cases in Wuhan Reply to Weissman (2024)Comments: Reply to Weissman (2024) arXiv:2401.08680Subjects: Physics and Society (physics.soc-ph); Populations and Evolution (q-bio.PE)
In a short text published as Letter to the Editor of the Journal of the Royal Statistical Society Series A, Weissman (2024) argues that the finding that early COVID-19 cases without an ascertained link to Wuhan's Huanan Seafood Wholesale market resided on average closer to the market than cases epidemiologically linked to it, reveals "major proximity ascertainment bias". Here we show that Weissman's conclusion is based on a flawed premise, and that there is no such "internal evidence" of major bias. The pattern can indeed be explained by places of infection not being limited to residential neighbourhoods, and by stochasticity -- i.e., without requiring any ascertainment bias.
- [8] arXiv:2405.08217 (cross-list from cs.LG) [pdf, ps, html, other]
-
Title: Data Valuation with Gradient SimilaritySubjects: Machine Learning (cs.LG); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
High-quality data is crucial for accurate machine learning and actionable analytics, however, mislabeled or noisy data is a common problem in many domains. Distinguishing low- from high-quality data can be challenging, often requiring expert knowledge and considerable manual intervention. Data Valuation algorithms are a class of methods that seek to quantify the value of each sample in a dataset based on its contribution or importance to a given predictive task. These data values have shown an impressive ability to identify mislabeled observations, and filtering low-value data can boost machine learning performance. In this work, we present a simple alternative to existing methods, termed Data Valuation with Gradient Similarity (DVGS). This approach can be easily applied to any gradient descent learning algorithm, scales well to large datasets, and performs comparably or better than baseline valuation methods for tasks such as corrupted label discovery and noise quantification. We evaluate the DVGS method on tabular, image and RNA expression datasets to show the effectiveness of the method across domains. Our approach has the ability to rapidly and accurately identify low-quality data, which can reduce the need for expert knowledge and manual intervention in data cleaning tasks.
- [9] arXiv:2405.08304 (cross-list from cs.CL) [pdf, ps, other]
-
Title: Computational Thought Experiments for a More Rigorous Philosophy and Science of the MindComments: 6 pages, 4 figures, to appear at CogSci 2024Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Neurons and Cognition (q-bio.NC)
We offer philosophical motivations for a method we call Virtual World Cognitive Science (VW CogSci), in which researchers use virtual embodied agents that are embedded in virtual worlds to explore questions in the field of Cognitive Science. We focus on questions about mental and linguistic representation and the ways that such computational modeling can add rigor to philosophical thought experiments, as well as the terminology used in the scientific study of such representations. We find that this method forces researchers to take a god's-eye view when describing dynamical relationships between entities in minds and entities in an environment in a way that eliminates the need for problematic talk of belief and concept types, such as the belief that cats are silly, and the concept CAT, while preserving belief and concept tokens in individual cognizers' minds. We conclude with some further key advantages of VW CogSci for the scientific study of mental and linguistic representation and for Cognitive Science more broadly.
- [10] arXiv:2405.08384 (cross-list from math.AP) [pdf, ps, other]
-
Title: Group Dispersal Modelling revisitedSubjects: Analysis of PDEs (math.AP); Probability (math.PR); Quantitative Methods (q-bio.QM)
In this paper we revisit the notion of grouped dispersal that have been introduced by Soubeyrand and co-authors \cite{soubeyrand2011patchy} to model the simultaneous (and hence dependent) dispersal of several propagules from a single source in a homogeneous environment. We built a time continuous measure valued process that takes into account the main feature of a grouped dispersal and derive its infinitesimal generator. To cope with the mutligeneration aspect associated to the demography we introduce two types of propagules in the description of the population which is one of the main innovations here. We also provide a rigorous description of the process and its generator. We derive as well, some large population asymptotics of the process unveilling the degenerate ultra parabolic system of PDE satisfied by the density of population. Finally, we also show that such a PDE system has a non-trivial solution which is unique in a certain functional space.
- [11] arXiv:2405.08404 (cross-list from math.PR) [pdf, ps, other]
-
Title: Genetic contribution of an advantaged mutant in the biparental Moran model -- finite selectionCamille Coron (INRAE), Yves Le JanSubjects: Probability (math.PR); Populations and Evolution (q-bio.PE)
We consider a population of N individuals, whose dynamics through time is represented by a biparental Moran model with two types: an advantaged type and a disadvantaged type. The advantage is due to a mutation, transmitted in a Mendelian way from parent to child that reduces the death probability of individuals carrying it. We assume that initially this mutation is carried by a proportion a of individuals in the population. Once the mutation is fixed, a gene is sampled uniformly in the population, at a locus independent of the locus under selection. We then give the probability that this gene initially comes from an advantaged individual, i.e. the genetic contribution of these individuals, as a function of a and when the population size is large.
- [12] arXiv:2405.08409 (cross-list from nlin.AO) [pdf, ps, other]
-
Title: Bifurcation analysis of a two-neuron central pattern generator model for both oscillatory and convergent neuronal activitiesSubjects: Adaptation and Self-Organizing Systems (nlin.AO); Neurons and Cognition (q-bio.NC)
The neural oscillator model proposed by Matsuoka is a piecewise affine system, which exhibits distinctive periodic solutions. Although such typical oscillation patterns have been widely studied, little is understood about the dynamics of convergence to certain fixed points and bifurcations between the periodic orbits and fixed points in this model. We performed fixed point analysis on a two-neuron version of the Matsuoka oscillator model, the result of which explains the mechanism of oscillation and the discontinuity-induced bifurcations such as subcritical/supercritical Hopf-like, homoclinic-like, and grazing bifurcations. Furthermore, it provided theoretical predictions concerning a logarithmic oscillation-period scaling law and noise-induced oscillations, which are both observed around those bifurcations. These results are expected to underpin further investigations into both oscillatory and transient neuronal activities with respect to central pattern generators.
Cross submissions for Wednesday, 15 May 2024 (showing 7 of 7 entries )
- [13] arXiv:2302.01924 (replaced) [pdf, ps, html, other]
-
Title: CLASH: Contrastive learning through alignment shifting to extract stimulus information from EEGSubjects: Neurons and Cognition (q-bio.NC); Signal Processing (eess.SP)
Stimulus-evoked EEG data has a notoriously low signal-to-noise ratio and high inter-subject variability. We propose a novel paradigm for the self-supervised extraction of stimulus-related brain response data: a model is trained to extract similar information between two time-aligned segments of EEG in response to the same stimulus. The extracted information can subsequently be used to obtain better results in downstream tasks that utilize the response to the stimulus. We show the efficacy of our method for a downstream task of decoding the speech envelope from auditory EEG. Our method outperforms other state-of-the-art denoising techniques, improving reconstruction scores by 45\%. Additionally, we show that in contrast to the baseline denoising techniques, our method can be used with data of unseen subjects and stimuli without retraining, improving decoding performance by 19\% and 34\% over raw EEG for two holdout datasets. Finally, the last experiment reveals that the accuracies obtained in the CLASH paradigm are significantly correlated with the percentile of obtained reconstruction correlation on the null distribution. In general, we showed that the proposed paradigm is suitable to train deep learning models to extract stimulus information from EEG while being stimulus feature agnostic.
- [14] arXiv:2404.05553 (replaced) [pdf, ps, other]
-
Title: Alljoined1 -- A dataset for EEG-to-Image decodingJonathan Xu, Bruno Aristimunha, Max Emanuel Feucht, Emma Qian, Charles Liu, Tazik Shahjahan, Martyna Spyra, Steven Zifan Zhang, Nicholas Short, Jioh Kim, Paula Perdomo, Ricky Renfeng Mao, Yashvir Sabharwal, Michael Ahedor Moaz Shoura, Adrian NestorComments: 8 Pages, 6 FiguresSubjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI)
We present Alljoined1, a dataset built specifically for EEG-to-Image decoding. Recognizing that an extensive and unbiased sampling of neural responses to visual stimuli is crucial for image reconstruction efforts, we collected data from 8 participants looking at 10,000 natural images each. We have currently gathered 46,080 epochs of brain responses recorded with a 64-channel EEG headset. The dataset combines response-based stimulus timing, repetition between blocks and sessions, and diverse image classes with the goal of improving signal quality. For transparency, we also provide data quality scores. We publicly release the dataset and all code at this https URL.
- [15] arXiv:2405.06718 (replaced) [pdf, ps, other]
-
Title: Vector-borne threats: Sustainable approaches to their diagnosis and treatmentAreesha Naveed, Ayesha Haidar, Rameen Atique, Arshi Saeed, Bushra Anwar, Ambreen Talib, Uzma Bilal, Javeria Sharif, Ayesha Nadeem, Sania Tariq, Ayesha Muazzam, Abdul SamadComments: 4 Figure, 1 tableSubjects: Other Quantitative Biology (q-bio.OT)
Arbovirus is a vital, life-threatening disease worldwide and continues to be a significant problem while the world is dealing with the major coronavirus (COVID-19) pandemic. Vectors, mostly mosquitoes and ticks, transmit this disease. Dengue fever, chikungunya, and Zika viruses are the major threats because of their high incidence, public health burden, and clinically significant disease spectrum. These vector-borne disease causes one-fourth of annual deaths, leading to various infectious diseases. The arbovirus represents eight different families and 14 genera; most viruses belong to the family Bunyaviridae, and some also belong to Togaviridae, Reoviridae, and Flaviviridae. The arbovirus disease was isolated first in tropical and subtropical regions of South America and Africa and has high significance because of suitable environmental conditions for virus transmission and vector expansion. Its transmission cycle ranges from simple to highly complex. DENV is the most prevalent, results in febrile illness, and has transmission in 128 different countries. CHIKV causes infection in asymptomatic people, and the problems include nephritis, arthritis, myelitis, and acute encephalopathy. ZIKV-infected 80% of people are asymptomatic and may cause rashes, myalgia, fever, headache, and conjunctivitis. Vaccines for DENV are not clinically available; it is a primary arboviral infection in the world nowadays. The exposure of arbovirus diseases continues to be a global health problem regardless of continuing efforts. This review article will overview major arbovirus diseases and their diagnosis, treatment, and prevention strategies.
- [16] arXiv:2405.06725 (replaced) [pdf, ps, other]
-
Title: On the Shape of Brainscores for Large Language Models (LLMs)Comments: The Figure 10 from arXiv:1710.04019, Figure 6.28 from arXiv:2403.13825, and captions are both from this https URL, where the case in my paper is Figure 3, and has already cited its original source. I believe both arXiv:1710.04019 and arXiv:2403.13825 should cite the original source, rather than force me to cite themSubjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
With the rise of Large Language Models (LLMs), the novel metric "Brainscore" emerged as a means to evaluate the functional similarity between LLMs and human brain/neural systems. Our efforts were dedicated to mining the meaning of the novel score by constructing topological features derived from both human fMRI data involving 190 subjects, and 39 LLMs plus their untrained counterparts. Subsequently, we trained 36 Linear Regression Models and conducted thorough statistical analyses to discern reliable and valid features from our constructed ones. Our findings reveal distinctive feature combinations conducive to interpreting existing brainscores across various brain regions of interest (ROIs) and hemispheres, thereby significantly contributing to advancing interpretable machine learning (iML) studies. The study is enriched by our further discussions and analyses concerning existing brainscores. To our knowledge, this study represents the first attempt to comprehend the novel metric brainscore within this interdisciplinary domain.
- [17] arXiv:2307.14804 (replaced) [pdf, ps, html, other]
-
Title: Collective behavior from surprise minimizationComments: 29 pages (main text), 29 pages (supplemental appendices), 4 figures, 1 supplemental figure, 5 moviesJournal-ref: Proceedings of the National Academy of Sciences, 121(17), e2320239121 (2024)Subjects: Adaptation and Self-Organizing Systems (nlin.AO); Multiagent Systems (cs.MA); Neurons and Cognition (q-bio.NC)
Collective motion is ubiquitous in nature; groups of animals, such as fish, birds, and ungulates appear to move as a whole, exhibiting a rich behavioral repertoire that ranges from directed movement to milling to disordered swarming. Typically, such macroscopic patterns arise from decentralized, local interactions among constituent components (e.g., individual fish in a school). Preeminent models of this process describe individuals as self-propelled particles, subject to self-generated motion and 'social forces' such as short-range repulsion and long-range attraction or alignment. However, organisms are not particles; they are probabilistic decision-makers. Here, we introduce an approach to modelling collective behavior based on active inference. This cognitive framework casts behavior as the consequence of a single imperative: to minimize surprise. We demonstrate that many empirically-observed collective phenomena, including cohesion, milling and directed motion, emerge naturally when considering behavior as driven by active Bayesian inference -- without explicitly building behavioral rules or goals into individual agents. Furthermore, we show that active inference can recover and generalize the classical notion of social forces as agents attempt to suppress prediction errors that conflict with their expectations. By exploring the parameter space of the belief-based model, we reveal non-trivial relationships between the individual beliefs and group properties like polarization and the tendency to visit different collective states. We also explore how individual beliefs about uncertainty determine collective decision-making accuracy. Finally, we show how agents can update their generative model over time, resulting in groups that are collectively more sensitive to external fluctuations and encode information more robustly.
- [18] arXiv:2402.01130 (replaced) [pdf, ps, other]
-
Title: convSeq: Fast and Scalable Method for Detecting Patterns in Spike DataComments: This paper has been accepted to ICML 2024Subjects: Signal Processing (eess.SP); Neurons and Cognition (q-bio.NC)
Spontaneous neural activity, crucial in memory, learning, and spatial navigation, often manifests itself as repetitive spatiotemporal patterns. Despite their importance, analyzing these patterns in large neural recordings remains challenging due to a lack of efficient and scalable detection methods. Addressing this gap, we introduce convSeq, an unsupervised method that employs backpropagation for optimizing spatiotemporal filters that effectively identify these neural patterns. Our method's performance is validated on various synthetic data and real neural recordings, revealing spike sequences with unprecedented scalability and efficiency. Significantly surpassing existing methods in speed, convSeq sets a new standard for analyzing spontaneous neural activity, potentially advancing our understanding of information processing in neural circuits.