Weekly BioML Digest [February 23, 2026]
Machine Learning Γ Computational Biology paper compilation
Hey! It's your weekly digest of machine learning papers in CompBio and Drug Discovery.
Feedback? Email me at biomldigest@gmail.com.
π Section 1: Peer-Reviewed Journals (Top 20)
584 papers matched filters β 20 selected after LLM relevance + novelty ranking.
-
Rapid directed evolution guided by protein language models and epistatic interactions.
Vincent Q Tran, Matthew Nemeth, Liam J. Bartie, Sita S. Chandrasekaran, Alison Fanton, Hyungseok C. Moon, Brian L. Hie, Silvana Konermann, Patrick D. Hsu β Science, 2026-02-19
abs -
FATE-MAP predicts teratogenicity and human gastrulation failure modes by integrating deep learning and mechanistic modeling
Rufo, Joseph, Qiu, Chongxu, Han, Dasol, Baxter, Naomi, Daley, Gabrielle, Dhillon, Jasmine, Wong, Felix, Collins, James J., Wilson, Maxwell Z. β Nature Communications, 2026-02-19
abs -
Accurate predictions of disordered protein ensembles with STARLING
Novak, Borna, Lotthammer, Jeffrey M., Emenecker, Ryan J., Holehouse, Alex S. β Nature, 2026-02-18
abs -
DRfold2 is a deep learning-based tool that enables efficient and accurate RNA structure prediction.
Yang Li, Chenjie Feng, Xi Zhang, Sho Tsukiyama, Duanyu Feng, Yang Zhang β PLoS biology, 2026-02-17
abs -
Learning the committor without collective variables
Contreras Arredondo, Sergio, Tang, Chenyu, Talmazan, Radu A., MegΓas, Alberto, Chen, Cheng Giuseppe, Chipot, Christophe β Nature Computational Science, 2026-02-17
abs -
Extending the range of graph neural networks with global encodings
Caruso, Alessandro, Venturin, Jacopo, Giambagli, Lorenzo, Rolando, Edoardo, El-Machachi, Zakariya, NoΓ©, Frank, Clementi, Cecilia β Nature Communications, 2026-02-18
abs -
AI-guided competitive docking for virtual screening and compound efficacy prediction
Mirgaux, Manon, Barcelli, Valeria, Chua, Adeline C. Y., Bifani, Pablo, Wintjens, RenΓ© β npj Drug Discovery, 2026-02-16
abs -
From single-sequences to evolutionary trajectories: protein language models capture the evolutionary potential of SARS-CoV-2
Lamb, Kieran D., Hughes, Joseph, Lytras, Spyros, Young, Francesca, Koci, Orges, Herzig, James C., Lovell, Simon C., Grove, Joe, Yuan, Ke, Robertson, David L. β Nature Communications, 2026-02-19
abs -
Evolutionary-guided advanced deep-learning architecture powers mammalian GPCRome agonist predictions.
Aayushi Mittal, Mudit Gupta, Sanjay Kumar Mohanty, A. Gaur, Saveena Solanki, Syed Yasser Ali, A. Raza, Pranjal Sharma, Suvendu Kumar, Vishakha Gautam, Shiva Satija, Nikhil Katyayan, Kunal Tiwari, Anmol Kumar Sharma, D. Kaur, Debarka Sengupta, Shashi Kumar Gupta, Gaurav Ahuja β Cell reports, 2026-02-18
abs -
De novo covalent drug generation with enhanced drug-likeness and safety
Zhang, Wenbo, Liu, Tianxiao, Dong, Xiaoying, Sun, Saisai, Yao, Xiaojun, Li, Pengyong, Gao, Lin β Communications Biology, 2026-02-17
abs -
Enabling multi-target drug discovery through latent evolutionary optimization and synthesis-aware prioritization (EVOSYNTH)
Nguyen, Viet Thanh Duy, Pham, Phuc, Hy, Truong-Son β Communications Chemistry, 2026-02-16
abs -
Exploring chemistry and catalysis by biasing skewed distributions via deep learning
Zhang, Zhikun, Piccini, GiovanniMaria β Nature Communications, 2026-02-21
abs -
Synthesis of covalent organic frameworks for photocatalytic hydrogen peroxide production guided by large language models
Shu, Chang, Wang, Ledu, Yang, Xiaoju, Xie, Wenao, Xie, Peixuan, Wang, Xiao, Yang, Xuan, Rao, Jingyi, Wang, Kewei, Chen, Linjiang, Tan, Bien, Wang, Xiaoyan β Nature Communications, 2026-02-21
abs -
Machine learning guided discovery of water stable metal-organic frameworks for photocatalytic hydrogen production.
Xiao Niu, Zhiming Zhang, Xiaoyu Wu, Yan Liu, Yong Cui, Jianwen Jiang β Chemical science, 2026-02-18
abs -
DeepFit: Physically and Chemically Informed XAS-Structure Fitting Made Simple.
Kirill Kulaev, B. Protsenko, Weiren Cheng, Qinghua Liu, D. Gorbunov, Mikhail Lifar, Valery G. Vlasenko, Anatolii Burlov, S. Guda, Alexander Guda, M. Soldatov, Alexander Soldatov β The journal of physical chemistry letters, 2026-02-19
abs -
Hybrid Computational Strategy for Predicting Complex Ligand-Metal Architectures.
Galymzhan Moldagulov, Kisung Lee, Sanzhar Nurgaliyev, Assanali Salem, A. Kuznietsov, B. Grzybowski β Angewandte Chemie, 2026-02-21
abs -
PROTAC-Splitter: a machine learning framework for automated identification of PROTAC substructures
Ribes, Stefano, Zhang, Ranxuan, Cropsal, TΓ©lio, KΓ€llberg, Anders, Tyrchan, Christian, Nittinger, Eva, Mercado, RocΓo β Journal of Cheminformatics, 2026-02-20
abs -
Polypharmacology Browser PPB3: A Web-Based Deep Learning Tool for Target Prediction Using ChEMBL Data.
M. Darsaraee, S. Javor, J. Reymond β Journal of chemical information and modeling, 2026-02-21
abs -
RGTFormer: Predicting mutation-associated multi-drug resistance in Mycobacterium tuberculosis using a categorical gated transformer and relational graph convolutional network.
R. Joshi, Hitesh Reddy Dereddy, Sandip Mukhopadhyay, Radim Burget, M. Dutta β Computational biology and chemistry, 2026-02-18
abs -
TaLiRAGen: target-aware ligand generation via retrieval-augmented large language models
Nan, Xiaofei, You, Xing, Liu, Xuezhen, Liu, Hongde, Ji, Chengxiang, Du, Yongsheng, Song, Jinshuai β Molecular Diversity, 2026-02-22
abs
𧬠Section 2: Preprints (arXiv + bioRxiv)
143 papers matched filters β 20 selected after LLM relevance + novelty ranking.
-
𧬠Accurate Macromolecular Complex Modeling for Cryo-EM with CryoZeta
Zhang, Z.; Li, S.; Farheen, F.; Kagaya, Y.; Liu, B.; Ibtehaz, N.; Terashi, G.; Nakamura, T.; Zhu, H.; Khan, K.; Zhang, Y.; Kihara, D. β bioRxiv, 2026-02-16
abs -
𧬠ProteomeLM: A proteome-scale language model enables accurate and rapid prediction of protein-protein interactions and gene essentiality across taxa
Malbranke, C.; Zalaffi, G. P.; Bitbol, A.-F. β bioRxiv, 2026-02-17
abs -
𧬠Deep models of protein evolution in time generate realistic evolutionary trajectories and functional proteins
Koehl, A.; Prillo, S.; Liu, M.; Xiong, J.; Weng, L.; Savage, D. F.; Song, Y. S. β bioRxiv, 2026-02-20
abs -
𧬠Fine-tuning protein language models on human spatial constraint improves variant effect prediction by reducing wild-type sequence bias
Bajracharya, G.; Capra, J. A. β bioRxiv, 2026-02-19
abs -
𧬠Pan-cell-type prediction of splicing patterns from sequence and splicing factor expression
Vetsigian, K.; Lancaster, J.; Ieremie, I.; Radens, C. M.; Smyth, P.; Young, S. β bioRxiv, 2026-02-19
abs -
𧬠Generative AI Guided Design of High-Affinity T cell Receptors
Min, M. R.; Li, T.; Onoguchi, K.; Mori, D.; Demachi-Okamura, A.; Warrell, J.; Machart, P.; Moesch, A.; Meiser, A.; Pait, I. G.; Muraoka, D.; Matsushita, H.; Paiardi, G.; Ferraz, M.; Bendjama, K. β bioRxiv, 2026-02-18
abs -
𧬠ProtFlow: Flow Matching-based Protein Sequence Design with Comprehensive Protein Semantic Distribution Learning and High-quality Generation
Kong, Z.; Zhu, Y.; Xu, Y.; Yin, M.; Hou, T.; Wu, J.; Xu, H.; Hsieh, C.-Y. β bioRxiv, 2026-02-17
abs -
π BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening
Anjie Qiao, Zhen Wang, Yaliang Li, Jiahua Rao, Yuedong Yang β arXiv, 2026-02-16
abs -
𧬠SpecLig: Energy-Guided Hierarchical Model for Target-Specific 3D Ligand Design
Zhang, P.; Han, R.; Kong, X.; Chen, T.; Ma, J. β bioRxiv, 2026-02-19
abs -
π MolCrystalFlow: Molecular Crystal Structure Prediction via Flow Matching
Cheng Zeng, Harry W. Sullivan, Thomas Egg, Maya M. Martirossyan, Philipp HΓΆllmer, Jirui Jin, Richard G. Hennig, Adrian Roitberg, Stefano Martiniani, Ellad B. Tadmor, Mingjie Liu β arXiv, 2026-02-17
abs -
π Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models
Yu Xie, Ludwig Winkler, Lixin Sun, Sarah Lewis, Adam E. Foster, JosΓ© JimΓ©nez Luna, Tim Hempel, Michael Gastegger, Yaoyi Chen, Iryna Zaporozhets, Cecilia Clementi, Christopher M. Bishop, Frank NoΓ© β arXiv, 2026-02-18
abs -
𧬠Towards inferring atomic scale conformation landscape of biomolecules from cryo-electron tomography data
Feyzi, F. S.; Jonic, S. β bioRxiv, 2026-02-17
abs -
𧬠BOND-PEP: topology-conditioned bipartite alignment for evidence-grounded peptide binder generation
Ding, W. β bioRxiv, 2026-02-18
abs -
𧬠Distilling Protein Language Models with Complementary Regularizers
Wijaya, E. β bioRxiv, 2026-02-18
abs -
𧬠MolDeBERTa: Foundational Model for Physicochemical and Structural-Informed Molecular Representation Learning
de Oliveira, G. B.; Saeed, F. β bioRxiv, 2026-02-17
abs -
𧬠Multi-modal tissue-aware graph neural network for in silico genetic discovery
Aggarwal, A.; Sokolova, K.; Troyanskaya, O. G. β bioRxiv, 2026-02-18
abs -
𧬠OncoBERT: Context-Aware Modeling of Somatic Mutations for Precision Oncology
Patkar, S.; Auslander, N.; Harmon, S.; Choyke, P.; Turkbey, B. β bioRxiv, 2026-02-19
abs -
𧬠Fast structural search for classification of gut bacterial mucin O-glycan degrading enzymes
Erden, M.; Schult, T.; Yanagi, K.; Sahoo, J. K.; Kaplan, D. L.; Cowen, L. J.; Lee, K. β bioRxiv, 2026-02-18
abs -
𧬠Harnessing DNA Foundation Models for Cross-Species Transcription Factor Binding Site Prediction in Plant Genomes
Haghani, M.; Dhulipalla, K. V.; Li, S. β bioRxiv, 2026-02-19
abs -
𧬠Bacterial protein function prediction via multimodal deep learning
Muzio, G.; Adamer, M.; Fernandez, L.; Miklautz, L.; Borgwardt, K.; Avican, K. β bioRxiv, 2026-02-22
abs