Aging is the dominant risk factor for neurodegenerative and systemic diseases, yet its molecular signatures remain obscured within high-dimensional, noisy, and strongly correlated proteomes. To address this challenge, we introduce the Protein Risk Score (ProtRS) framework—a systematic evaluation framework for ProtRS modeling that assesses how different multivariate approaches extract...
Polygenic risk scores can be used to model the individual genetic liability for human traits. Current methods primarily focus on modeling the mean of a phenotype neglecting the variance. However, genetic variants associated with phenotypic variance can provide important insights to gene-environment interaction studies. To overcome this, we propose snpboostlss, a cyclical gradient boosting...
Regularized Multi-Omics Regression Modeling for Transcriptomic–Proteomic Integration in Mice with induced liver Damage.
Toxicological compounds exert complex effects on tissues and organisms, which can be investigated using genomic, transcriptomic, and proteomic data. A central challenge lies in understanding the relationship between RNA and protein levels. While these are expected to be...
Single-cell RNA sequencing has given researchers unparalleled insight into biological systems. It enables the identification of distinct cellular subpopulations, the characterization of differences between them, and the assessment of overall tissue heterogeneity. Conventional analysis pipelines first cluster individual cells into similar groups and then test for differentially expressed genes...
Penalized regression models such as Lasso, Elastic-Net and their adaptive extensions are widely used for simultaneous variable selection and prediction in high-dimensional data analysis. However, conventional implementations of adaptive Elastic-Net (AdaENet) regression often estimate the adaptive hyper-parameter for the Elastic-Net penalty term using the entire dataset before dividing it into...