-
2026
-
Estimating Consensus Ideal Points Using Multi-Source Data
arXiv preprint arXiv:2601.05213 (2026)
pdf cite doi -
2025
-
Top-k Feature Importance Ranking
Transactions on Machine Learning Research (2025)
pdf cite code -
Consensus dimension reduction via multi-view learning
arXiv preprint arXiv:2512.15802 (2025)
pdf cite code data -
Using machine learning algorithms to optimize treatment with high-cost biologics in a national cohort of patients with inflammatory bowel disease
JAMIA Open (2025)
pdf cite doi code -
Designing a Data Science simulation with MERITS: A Primer
Journal of Computational and Graphical Statistics (2025)
pdf cite doi -
A comparative transcriptomics analysis of mammalian and non-mammalian acute kidney injury (AKI) models
Frontiers in Cell and Developmental Biology (2025)
pdf cite doi -
Interpretable Network-assisted Random Forest+
arXiv preprint arXiv:2509.15611 (2025)
pdf cite code -
Estimating Fe and Mg Abundances in the Milky Way Dwarf Galaxies Using Subaru/HSC and DEIMOS
The Astrophysical Journal (2025)
pdf cite doi -
Local MDI+: Local Feature Importances for Tree-Based Models
arXiv preprint arXiv:2506.08928 (2025)
pdf cite code -
Epistasis regulates genetic control of cardiac hypertrophy
Nature Cardiovascular Research (2025)
pdf cite doi code slides -
Unsupervised Machine Learning for Scientific Discovery: Workflow and Best Practices
arXiv preprint arXiv:2506.04553 (2025)
pdf cite code data -
Integrating Random Forests and Generalized Linear Models for Improved Accuracy and Interpretability
arXiv preprint arXiv:2307.01932 (2025)
pdf cite code data slides -
A simplified MyProstateScore2.0 for high-grade prostate cancer
Cancer Biomarkers (2025)
pdf cite doi code slides -
Distilling heterogeneous treatment effects: Stable subgroup estimation in causal inference
arXiv preprint arXiv:2502.07275 (2025)
pdf cite doi code -
2024
-
simChef: High-quality data science simulations in R
Journal of Open Source Software (2024)
pdf cite doi code slides -
Model Generalizability Considerations in Development and Evaluation of SaMD
doi:10.21203/rs.3.rs-3915862/v1 (2024)
pdf cite doi -
2023
-
A blood-based metabolomic signature predictive of risk for pancreatic cancer
Cell Reports Medicine (2023)
pdf cite -
2021
-
The Future will be Different than Today: Model Evaluation Considerations when Developing Translational Clinical Biomarker
KDD Health Day - DSHealth Workshop (2021)
pdf cite -
Integrated Principal Components Analysis
Journal of Machine Learning Research (2021)
pdf cite code slides -
imodels: a python package for fitting interpretable models
Journal of Open Source Software (2021)
pdf cite code -
2020
-
Feature selection for data integration with mixed multiview data
Annals of Applied Statistics (2020)
pdf cite doi -
A stability-driven protocol for drug response interpretable prediction (staDRIP)
ML4H: Machine Learning for Health - Extended Abstract (NeurIPS Workshop) (2020)
pdf cite code poster -
Curating a COVID-19 data repository and forecasting county-level death counts in the United States
Harvard Data Science Review (2020)
pdf cite doi code