Quantitative Biology
- [1] arXiv:2406.05170 [pdf, ps, other]
Title: Research on Tumors Segmentation based on Image Enhancement Method
Subjects: Other Quantitative Biology (q-bio.OT); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
One of the most effective ways to treat liver cancer is to perform precise liver resection surgery, a key step of which is precise digital image segmentation of the liver and its tumor. However, traditional liver parenchymal segmentation techniques often face several challenges: lack of precision, slow processing speed, and computational burden. These shortcomings limit the efficiency of surgical planning and execution. This work first describes in detail a new image enhancement algorithm that enhances the key features of an image by adaptively adjusting its contrast and brightness. Then, a deep learning-based segmentation network is introduced, which is trained specifically on the enhanced images to optimize the detection accuracy of tumor regions. In addition, multi-scale analysis techniques are incorporated, allowing the model to analyze images at different resolutions to capture more nuanced tumor features. The study used the 3Dircadb dataset to test the effectiveness of the proposed method. The experimental results show that, compared with the traditional image segmentation method, the new method using image enhancement technology significantly improves the accuracy and recall rate of tumor identification.
- [2] arXiv:2406.05185 [pdf, ps, html, other]
Title: Tree balance in phylogenetic models
Subjects: Populations and Evolution (q-bio.PE); Applications (stat.AP)
Tree shape statistics, particularly measures of tree (im)balance, play an important role in the analysis of the shape of phylogenetic trees. With applications ranging from testing evolutionary models to studying the impact of fertility inheritance and selection, or tumor development and language evolution, the assessment of tree balance is crucial. Currently, a multitude of at least 30 (im)balance indices can be found in the literature, alongside numerous other tree shape statistics.
This diversity prompts essential questions: How can we minimize the selection of indices to mitigate the challenges of multiple testing? Is there a preeminent balance index tailored to specific tasks? Previous studies comparing the statistical power of indices in detecting trees deviating from the Yule model have been limited in scope, utilizing only a subset of indices and alternative tree models.
This research expands upon the examination of index power, encompassing all established indices and a broader array of alternative models. Our investigation reveals distinct groups of balance indices better suited for different tree models, suggesting that decisions on balance index selection can be enhanced with prior knowledge. Furthermore, we present the R software package poweRbal which allows the inclusion of new indices and models, thus facilitating future research.
- [3] arXiv:2406.05248 [pdf, ps, other]
Title: Processing, evaluating and understanding FMRI data with afni_proc.py
Comments: 52 pages, 10 figures, 6 tables
Subjects: Neurons and Cognition (q-bio.NC)
FMRI data are noisy, complicated to acquire, and typically go through many steps of processing before they are used in a study or clinical practice. Being able to visualize and understand the data from the start through the completion of processing, while being confident that each intermediate step was successful, is challenging. AFNI's afni_proc.py is a tool to create and run a processing pipeline for FMRI data. With its flexible features, afni_proc.py allows users to both control and evaluate their processing at a detailed level. It has been designed to keep users informed about all processing steps: it does not just process the data, but first outputs a fully commented processing script that the users can read, query, interpret and refer back to. Having this full provenance is important for being able to understand each step of processing; it also promotes transparency and reproducibility by keeping the record of individual-level processing and modeling specifics in a single, shareable place. Additionally, afni_proc.py creates pipelines that contain several automatic self-checks for potential problems during runtime. The output directory contains a dictionary of relevant quantities that can be programmatically queried for potential issues and a systematic, interactive quality control (QC) HTML. All of these features help users evaluate and understand their data and processing in detail. We describe these and other aspects of afni_proc.py here using a set of task-based and resting state FMRI example commands.
- [4] arXiv:2406.05258 [pdf, ps, html, other]
Title: Advances in Machine Learning, Statistical Methods, and AI for Single-Cell RNA Annotation Using Raw Count Matrices in scRNA-seq Data
Comments: A survey of best practices for using machine learning, statistical methods, and AI for Single-Cell RNA annotation using raw count matrices in scRNA-seq data
Subjects: Other Quantitative Biology (q-bio.OT); Other Statistics (stat.OT)
Single-cell RNA sequencing (scRNA-seq) has revolutionized our ability to analyze gene expression at the resolution of individual cells, providing unprecedented insights into cellular heterogeneity and complex biological systems. This paper reviews various advanced computational and machine learning techniques tailored for the analysis of scRNA-seq data, emphasizing their roles in different stages of the data processing pipeline.
- [5] arXiv:2406.05347 [pdf, ps, html, other]
Title: MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Multiple Sequence Alignment (MSA) plays a pivotal role in unveiling the evolutionary trajectories of protein families. The accuracy of protein structure predictions is often compromised for protein sequences that lack sufficient homologous information to construct high quality MSA. Although various methods have been proposed to generate virtual MSA under these conditions, they fall short in comprehensively capturing the intricate coevolutionary patterns within MSA or require guidance from external oracle models. Here we introduce MSAGPT, a novel approach to prompt protein structure predictions via MSA generative pretraining in the low MSA regime. MSAGPT employs a simple yet effective 2D evolutionary positional encoding scheme to model complex evolutionary patterns. Endowed by this, its flexible 1D MSA decoding framework facilitates zero or few shot learning. Moreover, we demonstrate that leveraging the feedback from AlphaFold2 can further enhance the model capacity via Rejective Fine tuning (RFT) and Reinforcement Learning from AF2 Feedback (RLAF). Extensive experiments confirm the efficacy of MSAGPT in generating faithful virtual MSA to enhance the structure prediction accuracy. The transfer learning capabilities also highlight its great potential for facilitating other protein tasks.
- [6] arXiv:2406.05540 [pdf, ps, html, other]
Title: A Fine-tuning Dataset and Benchmark for Large Language Models for Protein Understanding
Authors: Yiqing Shen, Zan Chen, Michail Mamalakis, Luhan He, Haiyang Xia, Tianbin Li, Yanzhou Su, Junjun He, Yu Guang Wang
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
The parallels between protein sequences and natural language in their sequential structures have inspired the application of large language models (LLMs) to protein understanding. Despite the success of LLMs in NLP, their effectiveness in comprehending protein sequences remains an open question, largely due to the absence of datasets linking protein sequences to descriptive text. Researchers have then attempted to adapt LLMs for protein understanding by integrating a protein sequence encoder with a pre-trained LLM. However, this adaptation raises a fundamental question: "Can LLMs, originally designed for NLP, effectively comprehend protein sequences as a form of language?" Current datasets fall short in addressing this question due to the lack of a direct correlation between protein sequences and corresponding text descriptions, limiting the ability to train and evaluate LLMs for protein understanding effectively. To bridge this gap, we introduce ProteinLMDataset, a dataset specifically designed for further self-supervised pretraining and supervised fine-tuning (SFT) of LLMs to enhance their capability for protein sequence comprehension. Specifically, ProteinLMDataset includes 17.46 billion tokens for pretraining and 893,000 instructions for SFT. Additionally, we present ProteinLMBench, the first benchmark dataset consisting of 944 manually verified multiple-choice questions for assessing the protein understanding capabilities of LLMs. ProteinLMBench incorporates protein-related details and sequences in multiple languages, establishing a new standard for evaluating LLMs' abilities in protein comprehension. The large language model InternLM2-7B, pretrained and fine-tuned on the ProteinLMDataset, outperforms GPT-4 on ProteinLMBench, achieving the highest accuracy score. The dataset and the benchmark are available at this https URL.
- [7] arXiv:2406.05738 [pdf, ps, html, other]
Title: Smiles2Dock: an open large-scale multi-task dataset for ML-based molecular docking
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG); Applications (stat.AP); Computation (stat.CO)
Docking is a crucial component in drug discovery aimed at predicting the binding conformation and affinity between small molecules and target proteins. ML-based docking has recently emerged as a prominent approach, outpacing traditional methods like DOCK and AutoDock Vina in handling the growing scale and complexity of molecular libraries. However, the availability of comprehensive and user-friendly datasets for training and benchmarking ML-based docking algorithms remains limited. We introduce Smiles2Dock, an open large-scale multi-task dataset for molecular docking. We created a framework combining P2Rank and AutoDock Vina to dock 1.7 million ligands from the ChEMBL database against 15 AlphaFold proteins, giving us more than 25 million protein-ligand binding scores. The dataset leverages a wide range of high-accuracy AlphaFold protein models, encompasses a diverse set of biologically relevant compounds and enables researchers to benchmark all major approaches for ML-based docking such as Graph, Transformer and CNN-based methods. We also introduce a novel Transformer-based architecture for docking scores prediction and set it as an initial benchmark for our dataset. Our dataset and code are publicly available to support the development of novel ML-based methods for molecular docking to advance scientific research in this field.
- [8] arXiv:2406.05797 [pdf, ps, html, other]
Title: 3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization
Comments: 18 pages
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Machine Learning (cs.LG)
The integration of molecule and language has garnered increasing attention in molecular science. Recent advancements in Language Models (LMs) have demonstrated potential for the comprehensive modeling of molecule and language. However, existing works exhibit notable limitations. Most existing works overlook the modeling of 3D information, which is crucial for understanding molecular structures and functions. While some attempts have been made to leverage external structure encoding modules to inject the 3D molecular information into LMs, there exist obvious difficulties that hinder the integration of molecular structure and language text, such as modality alignment and separate tuning. To bridge this gap, we propose 3D-MolT5, a unified framework designed to model both 1D molecular sequence and 3D molecular structure. The key innovation lies in our methodology for mapping fine-grained 3D substructure representations (based on 3D molecular fingerprints) to a specialized 3D token vocabulary for 3D-MolT5. This 3D structure token vocabulary enables the seamless combination of 1D sequence and 3D structure representations in a tokenized format, allowing 3D-MolT5 to encode molecular sequence (SELFIES), molecular structure, and text sequences within a unified architecture. In addition, we introduce 1D and 3D joint pre-training to enhance the model's comprehension of these diverse modalities in a joint representation space and to help our foundation model generalize better to various tasks. Through instruction tuning on multiple downstream datasets, our proposed 3D-MolT5 outperforms existing methods in molecular property prediction, molecule captioning, and text-based molecule generation tasks. Our code will be available on GitHub soon.
- [9] arXiv:2406.05832 [pdf, ps, html, other]
Title: Improving Antibody Design with Force-Guided Sampling in Diffusion Models
Authors: Paulina Kulytė, Francisco Vargas, Simon Valentin Mathis, Yu Guang Wang, José Miguel Hernández-Lobato, Pietro Liò
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Biomolecules (q-bio.BM)
Antibodies, crucial for immune defense, primarily rely on complementarity-determining regions (CDRs) to bind and neutralize antigens, such as viruses. The design of these CDRs determines the antibody's affinity and specificity towards its target. Generative models, particularly denoising diffusion probabilistic models (DDPMs), have shown potential to advance the structure-based design of CDR regions. However, only a limited dataset of bound antibody-antigen structures is available, and generalization to out-of-distribution interfaces remains a challenge. Physics based force-fields, which approximate atomic interactions, offer a coarse but universal source of information to better mold designs to target interfaces. Integrating this foundational information into diffusion models is, therefore, highly desirable. Here, we propose a novel approach to enhance the sampling process of diffusion models by integrating force field energy-based feedback. Our model, DiffForce, employs forces to guide the diffusion sampling process, effectively blending the two distributions. Through extensive experiments, we demonstrate that our method guides the model to sample CDRs with lower energy, enhancing both the structure and sequence of the generated antibodies.
- [10] arXiv:2406.05859 [pdf, ps, html, other]
Title: From First-order to Higher-order Interactions: Enhanced Representation of Homotopic Functional Connectivity through Control of Intervening Variables
Subjects: Neurons and Cognition (q-bio.NC)
The brain's complex functionality emerges from network interactions that go beyond dyadic connections, with higher-order interactions significantly contributing to this complexity. One method of capturing higher-order interactions is through traversing the brain network using random walks. The efficacy of these random walks depends on the defined mutual interactions between two brain entities. More precise capture of higher-order interactions enables a better reflection of the brain's intrinsic neurophysiological characteristics. One well-established neurophysiological concept is Homotopic Functional Connectivity (HoFC), which illustrates the synchronized spontaneous activity between corresponding regions in the brain's left and right hemispheres. We employ node2vec, a random walk node embedding approach, alongside resting-state fMRI from the Human Connectome Project (HCP) to obtain higher-order feature vectors. We assess the efficacy of different functional connectivity parameterizations using HoFC. The results indicate that the quality of capturing higher-order interactions largely depends on the statistical dependency measure between brain regions. Higher-order interactions defined by partial correlation better reflect HoFC compared to other statistical associations. In the case of first-order interactions, tangent space embedding more effectively demonstrates HoFC. The findings validate HoFC and underscore the importance of the functional connectivity construction method in capturing intrinsic characteristics of the human brain.
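As a rough illustration of why controlling for an intervening variable changes a connectivity estimate, the sketch below compares plain Pearson correlation with first-order partial correlation on three toy signals. The signal values and the function names `pearson` and `partial_corr` are hypothetical and are not taken from the paper, which works with resting-state fMRI and node2vec embeddings.

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)

def partial_corr(x, y, z):
    """Correlation of x and y after removing the linear effect of z
    (standard first-order partial correlation formula)."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Toy signals (assumed for illustration): x and y share a common driver z
# plus independent components e1 and e2.
z = [1, 1, -1, -1]
e1 = [1, -1, 1, -1]
e2 = [1, -1, -1, 1]
x = [zi + ei for zi, ei in zip(z, e1)]
y = [zi + ei for zi, ei in zip(z, e2)]

raw = pearson(x, y)              # inflated by the shared driver z
partial = partial_corr(x, y, z)  # dependence left once z is controlled for
print(raw, partial)
```

In this toy case the raw correlation between x and y comes entirely from the shared driver, so it vanishes (up to floating-point error) once z is controlled for.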
- [11] arXiv:2406.06143 [pdf, ps, other]
Title: The Integrated Information Theory needs Attention
Comments: 23 pages (including references), 6 figures
Subjects: Neurons and Cognition (q-bio.NC)
The Integrated Information Theory (IIT) might be our current best bet at a scientific explanation of phenomenal consciousness. IIT focuses on the distinctively subjective and phenomenological aspects of conscious experience. Currently, it offers the fundaments of a formal account, but future developments shall explain the qualitative structures of every possible conscious experience. But this ambitious project is hindered by one fundamental limitation. IIT fails to acknowledge the crucial roles of attention in generating phenomenally conscious experience and shaping its contents. Here, we argue that IIT urgently needs an account of attention. Without this account, IIT cannot explain important informational differences between different kinds of experiences. Furthermore, though some IIT proponents celebratedly endorse a double dissociation between consciousness and attention, close analysis reveals that such a dissociation is in fact incompatible with IIT. Notably, the issues we raise for IIT will likely arise for many internalist theories of conscious contents in philosophy, especially theories with primitivist inclinations. Our arguments also extend to the recently popularized structuralist approaches. Overall, our discussion highlights how considerations about attention are indispensable for scientific as well as philosophical theorizing about conscious experience.
- [12] arXiv:2406.06327 [pdf, ps, other]
Title: Leveraging Hyperscanning EEG and VR Omnidirectional Treadmill to Explore Inter-Brain Synchrony in Collaborative Spatial Navigation
Subjects: Neurons and Cognition (q-bio.NC)
Navigating through a physical environment to reach a desired location involves a complex interplay of cognitive, sensory, and motor functions. When navigating with others, experiencing a degree of behavioral and cognitive synchronization is both natural and ubiquitous. This synchronization facilitates a harmonious effort toward achieving a common goal, reflecting how individuals instinctively align their actions and thoughts in collaborative settings. Collaborative spatial tasks, which are crucial in daily and professional settings, require coordinated navigation and problem-solving skills. This study explores the neural mechanisms underlying such tasks by using hyperscanning electroencephalography (EEG) technology to examine brain dynamics in dyadic route planning within a virtual reality setting. By analyzing intra- and inter-brain couplings across delta, theta, alpha, beta, and gamma EEG bands using both functional and effective connectivity measures, we identified significant neural synchronization patterns associated with collaborative task performance in both leaders and followers. Functional intra-brain connectivity analyses revealed distinct neural engagement across EEG frequency bands, with increased delta couplings observed in both leaders and followers. Theta connectivity was particularly enhanced in followers, whereas the alpha band exhibited divergent patterns that indicate role-specific neural strategies. Inter-brain analysis revealed increased delta causality between interacting members but decreased theta and gamma couplings from followers to leaders. Additionally, inter-brain analysis indicated decreased couplings in faster-performing dyads, especially in theta bands. These insights enhance our understanding of the neural mechanisms driving collaborative spatial navigation and demonstrate the effectiveness of hyperscanning in studying complex brain-to-brain interactions.
- [13] arXiv:2406.06397 [pdf, ps, html, other]
Title: Contrastive learning of T cell receptor representations
Authors: Yuta Nagano, Andrew Pyo, Martina Milighetti, James Henderson, John Shawe-Taylor, Benny Chain, Andreas Tiffeau-Mayer
Comments: 19 pages, 17 figures
Subjects: Biomolecules (q-bio.BM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Computational prediction of the interaction of T cell receptors (TCRs) and their ligands is a grand challenge in immunology. Despite advances in high-throughput assays, specificity-labelled TCR data remains sparse. In other domains, the pre-training of language models on unlabelled data has been successfully used to address data bottlenecks. However, it is unclear how to best pre-train protein language models for TCR specificity prediction. Here we introduce a TCR language model called SCEPTR (Simple Contrastive Embedding of the Primary sequence of T cell Receptors), capable of data-efficient transfer learning. Through our model, we introduce a novel pre-training strategy combining autocontrastive learning and masked-language modelling, which enables SCEPTR to achieve its state-of-the-art performance. In contrast, existing protein language models and a variant of SCEPTR pre-trained without autocontrastive learning are outperformed by sequence alignment-based methods. We anticipate that contrastive learning will be a useful paradigm to decode the rules of TCR specificity.
New submissions for Tuesday, 11 June 2024 (showing 13 of 13 entries)
- [14] arXiv:2406.05173 (cross-list from physics.med-ph) [pdf, ps, other]
Title: Cross-sectional shape analysis for risk assessment and prognosis of patients with true lumen narrowing after type-A aortic dissection surgery
Authors: J V Ramana Reddy (1), Toshitaka Watanabe (2), Taro Hayashi (3), Hiroshi Suito (1) ((1) Advanced Institute for Materials Research, Tohoku University, 2-1-1 Katahira, Sendai, Miyagi, Japan, (2) Medical Corporation Shimada Clinic Clover Clinic, Osaka, Japan, (3) Department of Cardiovascular Surgery, Akashi Medical Center, Hyogo, Japan)
Subjects: Medical Physics (physics.med-ph); Geometric Topology (math.GT); Quantitative Methods (q-bio.QM); Applications (stat.AP)
Background: For acute type-A aortic dissection (ATAAD) surgery, early post-surgery assessment is crucially important for effective treatment plans, underscoring the need for a framework to identify the risk level of aortic dissection cases. We examined true-lumen narrowing during follow-up examinations, collected morphological data 14 days (early stages) after surgery, and assessed patient risk levels over 2.8 years.
Purpose: To establish an implementable framework supported by mathematical techniques to predict the risk of aortic dissection patients experiencing true-lumen narrowing after ATAAD surgery.
Materials and Methods: This retrospective study analyzed CT data from 21 ATAAD patients. Forty uniformly distributed cross-sectional shapes (CSSs) are derived from each lumen to account for gradual changes in shape. We introduced the form factor (FF) to assess CSS morphology. Linear discriminant analysis (LDA) is used for the risk classification of aortic dissection patients. Leave-one-patient-out cross-validation (LOPO-CV) is used for risk prediction.
Results: For this investigation, we examined data of 21 ATAAD patients categorized into high-risk, medium-risk, and low-risk cases based on clinical observations of the range of true-lumen narrowing. Our risk classification machine-learning (ML) model preserving the model's generalizability. The model's predictions reliably identified low-risk patients, thereby potentially reducing hospital visits. It also demonstrated proficiency in accurately predicting the risk for all high-risk patients.
Conclusion: The suggested method anticipates the risk linked to aortic enlargement in patients with a narrowing true lumen in the early stage following ATAAD surgery, thereby aiding follow-up doctors in enhancing patient care.
- [15] arXiv:2406.05787 (cross-list from physics.optics) [pdf, ps, other]
Title: Optical signal recording of cellular activity in optogenetic stimulation of human pulp dental cells using a twin-core fiber-based Mach-Zehnder interferometer biosensor
Authors: Faezeh Akbari, Mohammad Ismail Zibaii, Sara Chavoshi Nezhad, Azam Layeghi, Leila Dargahi, Orlando Frazao
Subjects: Optics (physics.optics); Neurons and Cognition (q-bio.NC)
This paper introduces an innovative two-core fiber (TCF) optic sensor employing a Mach-Zehnder interferometer (MZI) to monitor the optogenetic response of light-sensitive human dental pulp stem cells (hDPSCs). The in-fiber MZI, formed using a segment of TCF optic, detects refractive index (RI) changes in the surrounding medium. The sensor utilizes the evanescent wave of one core as the sensing arm, necessitating a thin cladding achieved through one-sided chemical etching. This design allows the sensor to detect subtle alterations in the RI of the environment by observing displacements in the interference spectrum. The optogenetic stimulation of light-sensitive cells induces variations in ion concentrations, leading to a corresponding change in refractive index. The fabricated sensor, with a peak sensitivity of 675.74 nm/RIU within the RI range of 1.39-1.43, can detect these changes. A computer simulation validated the sensitivity and optimized fabrication parameters, exhibiting satisfactory agreement with experimental results. Spectrum displacements were recorded for both light-sensitive hDPSCs and regular hDPSCs (as a control test). Results from the experiment, analyzed and compared using data analysis software, revealed that 473 nm blue light effectively stimulated light-sensitive hDPSCs. Notably, the proposed sensor, a novel structure, demonstrated its capability to detect RI changes in the cell medium during optogenetic applications.
- [16] arXiv:2406.05884 (cross-list from physics.soc-ph) [pdf, ps, html, other]
Title: Revisiting institutional punishment in the $N$-person prisoner's dilemma
Subjects: Physics and Society (physics.soc-ph); Adaptation and Self-Organizing Systems (nlin.AO); Populations and Evolution (q-bio.PE)
The conflict between individual and collective interests makes fostering cooperation in human societies a challenging task, requiring drastic measures such as the establishment of sanctioning institutions. These institutions are costly because they have to be maintained regardless of the presence or absence of offenders. Here, we propose realistic improvements to the standard $N$-person prisoner's dilemma formulation with institutional punishment by eliminating overpunishment, requiring a minimum number of contributors to establish the sanctioning institution, and sharing the cost among them once this minimum number is reached. In addition, we focus on large groups or communities for which sanctioning institutions are ubiquitous. Using the replicator equation framework for an infinite population, we find that by sufficiently fining players who fail to contribute either to the public good or to the sanctioning institution, a population of contributors immune to invasion by these free riders can be established, provided that the contributors are sufficiently numerous. In a finite population, we use finite-size scaling to show that, for some parameter settings, demographic noise helps to fixate the strategy that contributes to the public good but not to the sanctioning institution even for infinitely large populations when, somewhat counterintuitively, its proportion in the initial population vanishes with a small power of the population size.
- [17] arXiv:2406.06135 (cross-list from physics.bio-ph) [pdf, ps, other]
Title: A story of cooperation: Centrosome-cytoskeleton interactions and implications
Comments: 14 pages, 1 Figure
Subjects: Biological Physics (physics.bio-ph); Cell Behavior (q-bio.CB); Subcellular Processes (q-bio.SC)
A structural link between the cell's centrosome and cytoskeleton was proposed years ago. Centrosomes are usually located in proximity to the nuclei and maintain the nucleus-centrosome axis. This positioning aids in determining the polarity of interphase cells and ensures spindle assembly in mitotic cells. The centrosome also maintains physical interaction with different forms of cytoskeleton to trade off between internal architecture and cell polarity in a tissue-specific as well as development-specific manner. Several crosslinkers are also available to support this interaction and consequently promote cytoskeleton nucleation as well as centrosome nucleation. We present an overview of the coordinated action of cytoskeletal elements on centrosomes and vice versa to modulate complex cellular functions as diverse as cell migration, cell adhesion and cell division.
- [18] arXiv:2406.06147 (cross-list from cs.ET) [pdf, ps, html, other]
Title: Nanoscale Transmitters Employing Cooperative Transmembrane Transport Proteins for Molecular Communication
Authors: Teena tom Dieck, Lukas Brand, Sebastian Lotter, Kathrin Castiglione, Robert Schober, Maximilian Schäfer
Comments: 7 pages double-column, 5 figures, 1 table. This work has been submitted to the 11th ACM International Conference on Nanoscale Computing and Communication, Milan, Italy
Subjects: Emerging Technologies (cs.ET); Subcellular Processes (q-bio.SC)
This paper introduces a novel optically controllable molecular communication (MC) transmitter (TX) design, which is based on a vesicular nanodevice (ND) functionalized for the release of signaling molecules via transmembrane proteins. Due to its optical-to-chemical conversion capability, the ND can be used as an externally controllable TX for several MC applications such as bit transmission and targeted drug delivery. The proposed TX design comprises two cooperating modules, an energizing module and a release module, and depending on the specific choices for the modules allows for the release of different types of signaling molecules. After setting up a general system model for the proposed TX design, we conduct a detailed mathematical analysis of a specific realization. In particular, we derive an exact analytical and an approximate closed-form solution for the concentration of the released signaling molecules and validate our results by comparison with a numerical solution. Moreover, we consider the impact of a buffering medium, which is typically present in experimental and application environments, in both our analytical and numerical analyses to evaluate the feasibility of our proposed TX design for practical chemical implementation. The proposed analytical and closed-form models facilitate system parameter optimization, which can accelerate the experimental development cycle of the proposed ND architecture in the future.
- [19] arXiv:2406.06393 (cross-list from cs.CV) [pdf, ps, html, other]
Title: STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Genomics (q-bio.GN)
Recent advances in multi-modal algorithms have driven and been driven by the increasing availability of large image-text datasets, leading to significant strides in various fields, including computational pathology. However, in most existing medical image-text datasets, the text typically provides high-level summaries that may not sufficiently describe sub-tile regions within a large pathology image. For example, an image might cover an extensive tissue area containing cancerous and healthy regions, but the accompanying text might only specify that this image is a cancer slide, lacking the nuanced details needed for in-depth analysis. In this study, we introduce STimage-1K4M, a novel dataset designed to bridge this gap by providing genomic features for sub-tile images. STimage-1K4M contains 1,149 images derived from spatial transcriptomics data, which captures gene expression information at the level of individual spatial spots within a pathology image. Specifically, each image in the dataset is broken down into smaller sub-image tiles, with each tile paired with 15,000-30,000 dimensional gene expressions. With 4,293,195 pairs of sub-tile images and gene expressions, STimage-1K4M offers unprecedented granularity, paving the way for a wide range of advanced research in multi-modal data analysis and innovative applications in computational pathology and beyond.
- [20] arXiv:2406.06479 (cross-list from cs.LG) [pdf, ps, html, other]
-
Title: Graph-Based Bidirectional Transformer Decision Threshold Adjustment Algorithm for Class-Imbalanced Molecular Data
Subjects: Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Data sets with imbalanced class sizes, often where one class size is much smaller than that of others, occur extremely often in various applications, including those with biological foundations, such as drug discovery and disease diagnosis. Thus, it is extremely important to be able to identify data elements of classes of various sizes, as a failure to detect can result in heavy costs. However, many data classification algorithms do not perform well on imbalanced data sets as they often fail to detect elements belonging to underrepresented classes. In this paper, we propose the BTDT-MBO algorithm, incorporating Merriman-Bence-Osher (MBO) techniques and a bidirectional transformer, as well as distance correlation and decision threshold adjustments, for data classification problems on highly imbalanced molecular data sets, where the sizes of the classes vary greatly. The proposed method not only integrates adjustments in the classification threshold for the MBO algorithm in order to help deal with the class imbalance, but also uses a bidirectional transformer model based on an attention mechanism for self-supervised learning. Additionally, the method implements distance correlation as a weight function for the similarity graph-based framework on which the adjusted MBO algorithm operates. The proposed model is validated using six molecular data sets, and we also provide a thorough comparison to other competing algorithms. The computational experiments show that the proposed method performs better than competing techniques even when the class imbalance ratio is very high.
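The idea of a decision threshold adjustment can be illustrated with a minimal sketch. This is not the BTDT-MBO algorithm: the scores, labels, threshold values, and the helper names `classify` and `recall` are all hypothetical and chosen only to show how lowering the threshold can recover minority-class elements that a default cutoff of 0.5 misses.

```python
import numpy as np

def classify(scores, threshold=0.5):
    """Label an element 1 (minority class) when its score reaches the threshold."""
    return (np.asarray(scores) >= threshold).astype(int)

def recall(y_true, y_pred):
    """Fraction of true minority-class elements that were detected."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    positives = y_true == 1
    return float((y_pred[positives] == 1).mean())

# Hypothetical imbalanced toy data: 8 majority elements, 2 minority elements.
y_true = np.array([0, 0, 0, 0, 0, 0, 0, 0, 1, 1])
scores = np.array([0.05, 0.10, 0.15, 0.20, 0.10, 0.30, 0.25, 0.35, 0.40, 0.60])

# Default cutoff misses the minority element scored 0.40.
default_recall = recall(y_true, classify(scores, threshold=0.5))

# A lowered (hypothetical) cutoff detects both minority elements.
adjusted_recall = recall(y_true, classify(scores, threshold=0.38))

print(default_recall, adjusted_recall)
```

In this toy data the lowered threshold introduces no false positives; on real data the threshold would have to trade recall against precision.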
- [21] arXiv:2406.06491 (cross-list from nlin.CD) [pdf, ps, html, other]
-
Title: Input Driven Synchronization of Chaotic Neural Networks with Analyticaly Determined Conditional Lyapunov Exponents
Comments: 3 Figures, 12 pages
Subjects: Chaotic Dynamics (nlin.CD); Dynamical Systems (math.DS); Neurons and Cognition (q-bio.NC)
Recurrent neural networks (RNNs) with random, but sufficiently strong and balanced coupling display well-known high-dimensional chaotic dynamics. Here, we investigate whether externally applied inputs to these RNNs can stabilize globally synchronous, input-dependent solutions, in spite of the strong chaos-inducing coupling. We find that when the balance between excitation and inhibition is exact, that is, when the row-sum of the weights is constant and 0, a globally applied input can readily synchronize all neurons onto a synchronous solution. The stability of the synchronous solution is analytically explored in this work with a master stability function. For any synchronous solution to the network dynamics, the conditional Lyapunov spectrum can be readily determined, with the stability of the synchronous solution critically dependent on the largest real eigenvalue component of the RNN weight matrix. We find that the smaller the maximum real component of the weight matrix eigenvalues, the more readily the network synchronizes. Further, the conditional Lyapunov exponents are easily computed numerically for any synchronization signal without simulating the RNN. Finally, for certain oscillatory synchronization signals, the conditional Lyapunov exponents can be determined analytically.
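The two quantities named in this abstract, a weight matrix with exactly zero row sums and the largest real part of its eigenvalues, can be computed with a short sketch. The Gaussian weight distribution, the coupling strength `g`, the network size, and the helper names `balanced_weights` and `max_real_eigenvalue` are assumptions made for illustration; the abstract does not state the stability condition itself, so none is reproduced here.

```python
import numpy as np

def balanced_weights(n, g, seed=0):
    """Random Gaussian weights (assumed scale g / sqrt(n)) with zero row sums."""
    rng = np.random.default_rng(seed)
    w = rng.normal(0.0, g / np.sqrt(n), size=(n, n))
    # Subtracting each row's mean enforces exact excitation-inhibition balance.
    return w - w.mean(axis=1, keepdims=True)

def max_real_eigenvalue(w):
    """Largest real part among the eigenvalues of the weight matrix."""
    return float(np.max(np.linalg.eigvals(w).real))

w = balanced_weights(n=200, g=1.5)
row_sums = w.sum(axis=1)
lam = max_real_eigenvalue(w)

print(np.abs(row_sums).max(), lam)
```

Because every row sums to zero, the all-ones vector is mapped to zero by `w`, so a synchronous state receives no net recurrent input under this construction.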
Cross submissions for Tuesday, 11 June 2024 (showing 8 of 8 entries)
- [22] arXiv:2302.06842 (replaced) [pdf, ps, html, other]
-
Title: Random boundaries: quantifying segmentation uncertainty in solutions to boundary-value problems
Subjects: Tissues and Organs (q-bio.TO)
Engineering simulations using boundary-value partial differential equations often implicitly assume that the uncertainty in the location of the boundary has a negligible impact on the output of the simulation. In this work, we develop a novel method for describing the geometric uncertainty in image-derived models and use a naive method for subsequently quantifying a simulation's sensitivity to that uncertainty. A Gaussian random field is constructed to represent the space of possible geometries, based on image-derived quantities such as pixel size, which can then be used to probe the simulation's output space. The algorithm is demonstrated with examples from biomechanics where patient-specific geometries are often segmented from low-resolution, three-dimensional images. These examples show the method's wide applicability with examples using linear elasticity and fluid dynamics. We show that important biomechanical outputs of these example simulations, namely maximum principal stress and wall shear stress, can be highly sensitive to realistic uncertainties in geometry.
- [23] arXiv:2302.12177 (replaced) [pdf, ps, html, other]
-
Title: EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site Prediction
Comments: Accepted to ICML 2024 (Oral)
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
Predicting the binding sites of target proteins plays a fundamental role in drug discovery. Most existing deep-learning methods consider a protein as a 3D image by spatially clustering its atoms into voxels and then feed the voxelized protein into a 3D CNN for prediction. However, the CNN-based methods encounter several critical issues: 1) defective in representing irregular protein structures; 2) sensitive to rotations; 3) insufficient to characterize the protein surface; 4) unaware of protein size shift. To address the above issues, this work proposes EquiPocket, an E(3)-equivariant Graph Neural Network (GNN) for binding site prediction, which comprises three modules: the first one to extract local geometric information for each surface atom, the second one to model both the chemical and spatial structure of protein and the last one to capture the geometry of the surface via equivariant message passing over the surface atoms. We further propose a dense attention output layer to alleviate the effect incurred by variable protein size. Extensive experiments on several representative benchmarks demonstrate the superiority of our framework to the state-of-the-art methods.
- [24] arXiv:2312.02203 (replaced) [pdf, ps, html, other]
-
Title: Learning High-Order Relationships of Brain Regions
Comments: Accepted at ICML 2024, Camera Ready Version
Subjects: Neurons and Cognition (q-bio.NC); Machine Learning (cs.LG)
Discovering reliable and informative relationships among brain regions from functional magnetic resonance imaging (fMRI) signals is essential in phenotypic predictions. Most of the current methods fail to accurately characterize those interactions because they only focus on pairwise connections and overlook the high-order relationships of brain regions. We propose that these high-order relationships should be maximally informative and minimally redundant (MIMR). However, identifying such high-order relationships is challenging and under-explored due to the exponential search space and the absence of a tractable objective. In response to this gap, we propose a novel method named HYBRID which aims to extract MIMR high-order relationships from fMRI data. HYBRID employs a CONSTRUCTOR to identify hyperedge structures, and a WEIGHTER to compute a weight for each hyperedge, which avoids searching in exponential space. HYBRID achieves the MIMR objective through an innovative information bottleneck framework named multi-head drop-bottleneck with theoretical guarantees. Our comprehensive experiments demonstrate the effectiveness of our model. Our model outperforms the state-of-the-art predictive model by an average of 11.2%, regarding the quality of hyperedges measured by CPM, a standard protocol for studying brain connections.
- [25] arXiv:2401.00381 (replaced) [pdf, ps, html, other]
-
Title: Modeling of Memory Mechanisms in Cerebral Cortex and Simulation of Storage Performance
Subjects: Neurons and Cognition (q-bio.NC); Distributed, Parallel, and Cluster Computing (cs.DC)
At the intersection of computation and cognitive science, graph theory is utilized as a formalized description of complex relationships and structures. Traditional graph models are often static, lacking dynamic and autonomous behavioral patterns. They rely on algorithms with a global view, significantly differing from biological neural networks, in which, to simulate information storage and retrieval processes, the limitations of centralized algorithms must be overcome. This study introduces a directed graph model that equips each node with adaptive learning and decision-making capabilities, thereby facilitating decentralized dynamic information storage and modeling and simulation of the brain's memory process. We abstract different storage instances as directed graph paths, transforming the storage of information into the assignment, discrimination, and extraction of different paths. To address writing and reading challenges, each node has a personalized adaptive learning ability. A storage algorithm without a God's eye view is developed, where each node uses its limited neighborhood information to facilitate the extension, formation, solidification, and awakening of directed graph paths, achieving competitive, reciprocal, and sustainable utilization of limited resources. Storage behavior occurs in each node, with adaptive learning behaviors of nodes concretized in a microcircuit centered around a variable resistor, simulating the electrophysiological behavior of neurons. Under the constraints of neurobiology on the anatomy and electrophysiology of biological neural networks, this model offers a plausible explanation for the mechanism of memory realization, providing a comprehensive, system-level experimental validation of the memory trace theory.
- [26] arXiv:2401.14442 (replaced) [pdf, ps, html, other]
-
Title: Improving Antibody Humanness Prediction using Patent Data
Comments: ICML 2024, 14 pages, 6 figures, Code: this https URL
Subjects: Quantitative Methods (q-bio.QM); Machine Learning (cs.LG); Machine Learning (stat.ML)
We investigate the potential of patent data for improving the antibody humanness prediction using a multi-stage, multi-loss training process. Humanness serves as a proxy for the immunogenic response to antibody therapeutics, one of the major causes of attrition in drug discovery and a challenging obstacle for their use in clinical settings. We pose the initial learning stage as a weakly-supervised contrastive-learning problem, where each antibody sequence is associated with possibly multiple identifiers of function and the objective is to learn an encoder that groups them according to their patented properties. We then freeze a part of the contrastive encoder and continue training it on the patent data using the cross-entropy loss to predict the humanness score of a given antibody sequence. We illustrate the utility of the patent data and our approach by performing inference on three different immunogenicity datasets, unseen during training. Our empirical results demonstrate that the learned model consistently outperforms the alternative baselines and establishes new state-of-the-art on five out of six inference tasks, irrespective of the used metric.
- [27] arXiv:2404.17128 (replaced) [pdf, ps, html, other]
-
Title: Simple Network Mechanism Leads to Quasi-Real Brain Activation Patterns with Drosophila Connectome
Subjects: Neurons and Cognition (q-bio.NC); Social and Information Networks (cs.SI)
Given the high computational demands of most methods, network communication models offer a more economical way to simulate the brain. However, there is still insufficient evidence that they can effectively replicate the brain's real activation patterns. Moreover, it remains unclear whether actual network structures are crucial in simulating intelligence. To address these issues, we propose a large-scale network communication model based on simple rules and design criteria to assess the differences between network models and real situations. To enhance the connection with the real world, we also incorporate an improved neuron dynamic model. We conduct research on the largest adult Drosophila connectome data set. Experimental results show significant activation in neurons that should respond to a stimulus and slight activation in irrelevant ones, which we call a quasi-real activation pattern. Moreover, when the network structure is changed, the quasi-real activation patterns disappear. Interestingly, activation regions have shorter network distances to their input neurons, implying that network structure (not spatial distance) is central to forming brain functionality. In addition, when input neurons are given a unilateral stimulus, we observe a bilateral response, which is consistent with reality. We then find that both hemispheres have extremely similar statistical indicators. We also develop real-time 3D visualization software for large spatial networks to observe experimental phenomena, filling the software gap. This research reveals the power of network models: they can reach the quasi-real activation pattern with simple rules. It also proves that network structure matters in brain activity pattern generation. Future research could fully simulate brain behavior through network models, paving the way for artificial intelligence by developing new propagation rules and optimizing link weights.
- [28] arXiv:2405.18768 (replaced) [pdf, ps, html, other]
-
Title: RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching
Comments: Accepted to ICML 2024
Subjects: Biomolecules (q-bio.BM); Machine Learning (cs.LG)
The growing significance of RNA engineering in diverse biological applications has spurred interest in developing AI methods for structure-based RNA design. While diffusion models have excelled in protein design, adapting them for RNA presents new challenges due to RNA's conformational flexibility and the computational cost of fine-tuning large structure prediction models. To this end, we propose RNAFlow, a flow matching model for protein-conditioned RNA sequence-structure design. Its denoising network integrates an RNA inverse folding model and a pre-trained RosettaFold2NA network for generation of RNA sequences and structures. The integration of inverse folding in the structure denoising process allows us to simplify training by fixing the structure prediction network. We further enhance the inverse folding model by conditioning it on inferred conformational ensembles to model dynamic RNA conformations. Evaluation on protein-conditioned RNA structure and sequence generation tasks demonstrates RNAFlow's advantage over existing RNA design methods.
- [29] arXiv:2406.02522 (replaced) [pdf, ps, other]
-
Title: Lichen-Mediated Self-Growing Construction Materials for Habitat Outfitting on Mars
Subjects: Cell Behavior (q-bio.CB); Earth and Planetary Astrophysics (astro-ph.EP); Instrumentation and Methods for Astrophysics (astro-ph.IM); Popular Physics (physics.pop-ph)
As its next step in space exploration, the National Aeronautics and Space Administration (NASA) revealed plans to establish a permanent human presence on Mars. To build the centrally located, monolithic habitat, NASA has a history of experimenting with lightweight inflatable habitats to reduce mass and volume. However, the physical structures used to outfit the inflatable must generally be launched by a second spacecraft. This study proposes that, rather than shipping prefabricated outfitting elements to Mars, habitat outfitting can be realized by in-situ construction using cyanobacteria and fungi as building agents. A synthetic lichen system, composed of diazotrophic cyanobacteria and filamentous fungi, can be created to produce abundant biominerals and biopolymers, which will glue Martian regolith into consolidated building blocks. These self-growing building blocks can be assembled into various structures, such as floors, walls, partitions, and furniture.
- [30] arXiv:2406.03115 (replaced) [pdf, ps, other]
-
Title: GET: A Generative EEG Transformer for Continuous Context-Based Neural Signals
Authors: Omair Ali, Muhammad Saif-ur-Rehman, Marita Metzler, Tobias Glasmachers, Ioannis Iossifidis, Christian Klaes
Subjects: Neurons and Cognition (q-bio.NC)
Generating continuous electroencephalography (EEG) signals through advanced artificial neural networks presents a novel opportunity to enhance brain-computer interface (BCI) technology. This capability has the potential to significantly enhance applications ranging from simulating dynamic brain activity and data augmentation to improving real-time epilepsy detection and BCI inference. By harnessing generative transformer neural networks, specifically designed for EEG signal generation, we can revolutionize the interpretation and interaction with neural data. Generative AI has demonstrated significant success across various domains, from natural language processing (NLP) and computer vision to content creation in visual arts and music. It distinguishes itself by using large-scale datasets to construct context windows during pre-training, a technique that has proven particularly effective in NLP, where models are fine-tuned for specific downstream tasks after extensive foundational training. However, the application of generative AI in the field of BCIs, particularly through the development of continuous, context-rich neural signal generators, has been limited. To address this, we introduce the Generative EEG Transformer (GET), a model leveraging transformer architecture tailored for EEG data. The GET model is pre-trained on diverse EEG datasets, including motor imagery and alpha wave datasets, enabling it to produce high-fidelity neural signals that maintain contextual integrity. Our empirical findings indicate that GET not only faithfully reproduces the frequency spectrum of the training data and input prompts but also robustly generates continuous neural signals. By adopting the successful training strategies of the NLP domain for BCIs, the GET sets a new standard for the development and application of neural signal generation technologies.
- [31] arXiv:2112.11084 (replaced) [pdf, ps, html, other]
-
Title: Curvature-driven transport of thin Bingham fluid layers in airway bifurcations
Subjects: Fluid Dynamics (physics.flu-dyn); Biological Physics (physics.bio-ph); Computational Physics (physics.comp-ph); Tissues and Organs (q-bio.TO)
The mucus on the bronchial wall forms a thin layer of non-Newtonian fluid. One of the roles of mucus is to protect the lungs by capturing inhaled pollutants. It is transported by mucociliary clearance toward the tracheo-pharyngeal bifurcation, where it is eliminated. Due to the corrugation of its interface with air, the mucus layer is subject to surface tension forces that interact with its rheology. It is still not clear whether these forces can affect mucus displacement and, if they can, under what conditions and how this displacement can occur. In this work, we model the mucus as a thin Bingham fluid layer located on the wall of idealized, multi-scaled airway bifurcations. We analyze the resulting physical system using lubrication theory and 3D simulations. The theoretical analysis allows us to characterize the nonlinear behavior of the system and determine the geometric conditions under which the Bingham fluid can be moved by surface tension. 3D simulations are then used to quantify the effects in idealized airway bifurcations on a range of scales corresponding to those of bronchial bifurcations. Our results suggest that surface tension effects can displace overly thick mucus layers in airway bifurcations, a typical situation in obstructive lung pathologies (asthma, COPD, cystic fibrosis, etc.). Moreover, our results indicate that this movement can disrupt mucociliary clearance and the homogeneity of the layer thickness, thus increasing the risk of lung infection.
- [32] arXiv:2303.05390 (replaced) [pdf, ps, html, other]
-
Title: Unbiased likelihood estimation of Wright-Fisher diffusion processes
Comments: 16 pages. Expanded Numerical results
Subjects: Statistics Theory (math.ST); Populations and Evolution (q-bio.PE)
In this paper we propose a Monte Carlo maximum likelihood estimation strategy for discretely observed Wright-Fisher diffusions. Our approach provides an unbiased estimator of the likelihood function and is based on exact simulation techniques that are of special interest for diffusion processes defined on a bounded domain, where numerical methods typically fail to remain within the required boundaries. We start by building unbiased likelihood estimators for scalar diffusions and later present an extension to the multidimensional case. Consistency results of our proposed estimator are also presented and the performance of our method is illustrated through numerical examples.
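The remark that numerical methods typically fail to remain within the required boundaries can be seen in a small experiment. The sketch below is not the exact simulation technique of the paper. It assumes a neutral Wright-Fisher diffusion dX = sqrt(X(1 - X)) dW with no drift, applies a naive Euler-Maruyama discretization, and counts how many simulated paths step outside [0, 1]. The function name `euler_exit_fraction` and all parameter values are hypothetical.

```python
import numpy as np

def euler_exit_fraction(x0=0.02, dt=0.01, n_steps=200, n_paths=1000, seed=0):
    """Fraction of naive Euler-Maruyama paths that leave the interval [0, 1]."""
    rng = np.random.default_rng(seed)
    x = np.full(n_paths, x0)
    exited = np.zeros(n_paths, dtype=bool)
    for _ in range(n_steps):
        active = ~exited  # paths still inside [0, 1]
        noise = rng.normal(0.0, np.sqrt(dt), size=n_paths)
        # Diffusion coefficient sqrt(x (1 - x)) is real only inside [0, 1],
        # so it is evaluated for active paths only.
        diffusion = np.sqrt(x[active] * (1.0 - x[active]))
        x[active] = x[active] + diffusion * noise[active]
        # A path that steps outside the domain is frozen and marked as exited.
        exited |= (x < 0.0) | (x > 1.0)
    return float(exited.mean())

exit_fraction = euler_exit_fraction()
print(exit_fraction)
```

Starting close to the boundary makes the effect visible quickly: a single Gaussian increment can overshoot 0, which the true diffusion never does.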
- [33] arXiv:2402.03961 (replaced) [pdf, ps, html, other]
-
Title: Self-Reproduction and Evolution in Cellular Automata: 25 Years after Evoloops
Comments: 21 pages, 2 figures
Subjects: Cellular Automata and Lattice Gases (nlin.CG); Neural and Evolutionary Computing (cs.NE); Pattern Formation and Solitons (nlin.PS); Populations and Evolution (q-bio.PE)
The year 2024 marks the 25th anniversary of the publication of evoloops, an evolutionary variant of Chris Langton's self-reproducing loops which proved constructively that Darwinian evolution of self-reproducing organisms by variation and natural selection is possible within deterministic cellular automata. Since then, this line of Artificial Life research has undergone several important developments. Although it experienced a period of relative dormancy, the recent rise of interest in open-ended evolution and the success of continuous cellular automata models have brought researchers' attention back to how to make spatio-temporal patterns self-reproduce and evolve within spatially distributed computational media. This article provides a review of the relevant literature on this topic over the past 25 years and highlights the major accomplishments made so far, the challenges being faced, and promising future research directions.
- [34] arXiv:2403.20097 (replaced) [pdf, ps, html, other]
-
Title: ITCMA: A Generative Agent Based on a Computational Consciousness Structure
Comments: 20 pages, 11 figures
Subjects: Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Neurons and Cognition (q-bio.NC)
Large Language Models (LLMs) still face challenges in tasks requiring understanding implicit instructions and applying common-sense knowledge. In such scenarios, LLMs may require multiple attempts to achieve human-level performance, potentially leading to inaccurate responses or inferences in practical environments, affecting their long-term consistency and behavior. This paper introduces the Internal Time-Consciousness Machine (ITCM), a computational consciousness structure to simulate the process of human consciousness. We further propose the ITCM-based Agent (ITCMA), which supports action generation and reasoning in open-world settings, and can independently complete tasks. ITCMA enhances LLMs' ability to understand implicit instructions and apply common-sense knowledge by considering agents' interaction and reasoning with the environment. Evaluations in the Alfworld environment show that trained ITCMA outperforms the state-of-the-art (SOTA) by 9% on the seen set. Even untrained ITCMA achieves a 96% task completion rate on the seen set, 5% higher than SOTA, indicating its superiority over traditional intelligent agents in utility and generalization. In real-world tasks with quadruped robots, the untrained ITCMA achieves an 85% task completion rate, which is close to its performance in the unseen set, demonstrating its comparable utility and universality in real-world settings.