Weekly BioML Digest [February 23, 2026]

Weekly BioML Digest [February 23, 2026]

Machine Learning Γ— Computational Biology paper compilation

Hey! It's your weekly digest of machine learning papers in CompBio and Drug Discovery.

Feedback? Email me at biomldigest@gmail.com.

πŸ“š Section 1: Peer-Reviewed Journals (Top 20)

584 papers matched filters β†’ 20 selected after LLM relevance + novelty ranking.

  • Rapid directed evolution guided by protein language models and epistatic interactions.
    Vincent Q Tran, Matthew Nemeth, Liam J. Bartie, Sita S. Chandrasekaran, Alison Fanton, Hyungseok C. Moon, Brian L. Hie, Silvana Konermann, Patrick D. Hsu β€” Science, 2026-02-19
    abs

  • FATE-MAP predicts teratogenicity and human gastrulation failure modes by integrating deep learning and mechanistic modeling
    Rufo, Joseph, Qiu, Chongxu, Han, Dasol, Baxter, Naomi, Daley, Gabrielle, Dhillon, Jasmine, Wong, Felix, Collins, James J., Wilson, Maxwell Z. β€” Nature Communications, 2026-02-19
    abs

  • Accurate predictions of disordered protein ensembles with STARLING
    Novak, Borna, Lotthammer, Jeffrey M., Emenecker, Ryan J., Holehouse, Alex S. β€” Nature, 2026-02-18
    abs

  • DRfold2 is a deep learning-based tool that enables efficient and accurate RNA structure prediction.
    Yang Li, Chenjie Feng, Xi Zhang, Sho Tsukiyama, Duanyu Feng, Yang Zhang β€” PLoS biology, 2026-02-17
    abs

  • Learning the committor without collective variables
    Contreras Arredondo, Sergio, Tang, Chenyu, Talmazan, Radu A., MegΓ­as, Alberto, Chen, Cheng Giuseppe, Chipot, Christophe β€” Nature Computational Science, 2026-02-17
    abs

  • Extending the range of graph neural networks with global encodings
    Caruso, Alessandro, Venturin, Jacopo, Giambagli, Lorenzo, Rolando, Edoardo, El-Machachi, Zakariya, NoΓ©, Frank, Clementi, Cecilia β€” Nature Communications, 2026-02-18
    abs

  • AI-guided competitive docking for virtual screening and compound efficacy prediction
    Mirgaux, Manon, Barcelli, Valeria, Chua, Adeline C. Y., Bifani, Pablo, Wintjens, RenΓ© β€” npj Drug Discovery, 2026-02-16
    abs

  • From single-sequences to evolutionary trajectories: protein language models capture the evolutionary potential of SARS-CoV-2
    Lamb, Kieran D., Hughes, Joseph, Lytras, Spyros, Young, Francesca, Koci, Orges, Herzig, James C., Lovell, Simon C., Grove, Joe, Yuan, Ke, Robertson, David L. β€” Nature Communications, 2026-02-19
    abs

  • Evolutionary-guided advanced deep-learning architecture powers mammalian GPCRome agonist predictions.
    Aayushi Mittal, Mudit Gupta, Sanjay Kumar Mohanty, A. Gaur, Saveena Solanki, Syed Yasser Ali, A. Raza, Pranjal Sharma, Suvendu Kumar, Vishakha Gautam, Shiva Satija, Nikhil Katyayan, Kunal Tiwari, Anmol Kumar Sharma, D. Kaur, Debarka Sengupta, Shashi Kumar Gupta, Gaurav Ahuja β€” Cell reports, 2026-02-18
    abs

  • De novo covalent drug generation with enhanced drug-likeness and safety
    Zhang, Wenbo, Liu, Tianxiao, Dong, Xiaoying, Sun, Saisai, Yao, Xiaojun, Li, Pengyong, Gao, Lin β€” Communications Biology, 2026-02-17
    abs

  • Enabling multi-target drug discovery through latent evolutionary optimization and synthesis-aware prioritization (EVOSYNTH)
    Nguyen, Viet Thanh Duy, Pham, Phuc, Hy, Truong-Son β€” Communications Chemistry, 2026-02-16
    abs

  • Exploring chemistry and catalysis by biasing skewed distributions via deep learning
    Zhang, Zhikun, Piccini, GiovanniMaria β€” Nature Communications, 2026-02-21
    abs

  • Synthesis of covalent organic frameworks for photocatalytic hydrogen peroxide production guided by large language models
    Shu, Chang, Wang, Ledu, Yang, Xiaoju, Xie, Wenao, Xie, Peixuan, Wang, Xiao, Yang, Xuan, Rao, Jingyi, Wang, Kewei, Chen, Linjiang, Tan, Bien, Wang, Xiaoyan β€” Nature Communications, 2026-02-21
    abs

  • Machine learning guided discovery of water stable metal-organic frameworks for photocatalytic hydrogen production.
    Xiao Niu, Zhiming Zhang, Xiaoyu Wu, Yan Liu, Yong Cui, Jianwen Jiang β€” Chemical science, 2026-02-18
    abs

  • DeepFit: Physically and Chemically Informed XAS-Structure Fitting Made Simple.
    Kirill Kulaev, B. Protsenko, Weiren Cheng, Qinghua Liu, D. Gorbunov, Mikhail Lifar, Valery G. Vlasenko, Anatolii Burlov, S. Guda, Alexander Guda, M. Soldatov, Alexander Soldatov β€” The journal of physical chemistry letters, 2026-02-19
    abs

  • Hybrid Computational Strategy for Predicting Complex Ligand-Metal Architectures.
    Galymzhan Moldagulov, Kisung Lee, Sanzhar Nurgaliyev, Assanali Salem, A. Kuznietsov, B. Grzybowski β€” Angewandte Chemie, 2026-02-21
    abs

  • PROTAC-Splitter: a machine learning framework for automated identification of PROTAC substructures
    Ribes, Stefano, Zhang, Ranxuan, Cropsal, TΓ©lio, KΓ€llberg, Anders, Tyrchan, Christian, Nittinger, Eva, Mercado, RocΓ­o β€” Journal of Cheminformatics, 2026-02-20
    abs

  • Polypharmacology Browser PPB3: A Web-Based Deep Learning Tool for Target Prediction Using ChEMBL Data.
    M. Darsaraee, S. Javor, J. Reymond β€” Journal of chemical information and modeling, 2026-02-21
    abs

  • RGTFormer: Predicting mutation-associated multi-drug resistance in Mycobacterium tuberculosis using a categorical gated transformer and relational graph convolutional network.
    R. Joshi, Hitesh Reddy Dereddy, Sandip Mukhopadhyay, Radim Burget, M. Dutta β€” Computational biology and chemistry, 2026-02-18
    abs

  • TaLiRAGen: target-aware ligand generation via retrieval-augmented large language models
    Nan, Xiaofei, You, Xing, Liu, Xuezhen, Liu, Hongde, Ji, Chengxiang, Du, Yongsheng, Song, Jinshuai β€” Molecular Diversity, 2026-02-22
    abs

🧬 Section 2: Preprints (arXiv + bioRxiv)

143 papers matched filters β†’ 20 selected after LLM relevance + novelty ranking.

  • 🧬 Accurate Macromolecular Complex Modeling for Cryo-EM with CryoZeta
    Zhang, Z.; Li, S.; Farheen, F.; Kagaya, Y.; Liu, B.; Ibtehaz, N.; Terashi, G.; Nakamura, T.; Zhu, H.; Khan, K.; Zhang, Y.; Kihara, D. β€” bioRxiv, 2026-02-16
    abs

  • 🧬 ProteomeLM: A proteome-scale language model enables accurate and rapid prediction of protein-protein interactions and gene essentiality across taxa
    Malbranke, C.; Zalaffi, G. P.; Bitbol, A.-F. β€” bioRxiv, 2026-02-17
    abs

  • 🧬 Deep models of protein evolution in time generate realistic evolutionary trajectories and functional proteins
    Koehl, A.; Prillo, S.; Liu, M.; Xiong, J.; Weng, L.; Savage, D. F.; Song, Y. S. β€” bioRxiv, 2026-02-20
    abs

  • 🧬 Fine-tuning protein language models on human spatial constraint improves variant effect prediction by reducing wild-type sequence bias
    Bajracharya, G.; Capra, J. A. β€” bioRxiv, 2026-02-19
    abs

  • 🧬 Pan-cell-type prediction of splicing patterns from sequence and splicing factor expression
    Vetsigian, K.; Lancaster, J.; Ieremie, I.; Radens, C. M.; Smyth, P.; Young, S. β€” bioRxiv, 2026-02-19
    abs

  • 🧬 Generative AI Guided Design of High-Affinity T cell Receptors
    Min, M. R.; Li, T.; Onoguchi, K.; Mori, D.; Demachi-Okamura, A.; Warrell, J.; Machart, P.; Moesch, A.; Meiser, A.; Pait, I. G.; Muraoka, D.; Matsushita, H.; Paiardi, G.; Ferraz, M.; Bendjama, K. β€” bioRxiv, 2026-02-18
    abs

  • 🧬 ProtFlow: Flow Matching-based Protein Sequence Design with Comprehensive Protein Semantic Distribution Learning and High-quality Generation
    Kong, Z.; Zhu, Y.; Xu, Y.; Yin, M.; Hou, T.; Wu, J.; Xu, H.; Hsieh, C.-Y. β€” bioRxiv, 2026-02-17
    abs

  • πŸ“„ BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening
    Anjie Qiao, Zhen Wang, Yaliang Li, Jiahua Rao, Yuedong Yang β€” arXiv, 2026-02-16
    abs

  • 🧬 SpecLig: Energy-Guided Hierarchical Model for Target-Specific 3D Ligand Design
    Zhang, P.; Han, R.; Kong, X.; Chen, T.; Ma, J. β€” bioRxiv, 2026-02-19
    abs

  • πŸ“„ MolCrystalFlow: Molecular Crystal Structure Prediction via Flow Matching
    Cheng Zeng, Harry W. Sullivan, Thomas Egg, Maya M. Martirossyan, Philipp HΓΆllmer, Jirui Jin, Richard G. Hennig, Adrian Roitberg, Stefano Martiniani, Ellad B. Tadmor, Mingjie Liu β€” arXiv, 2026-02-17
    abs

  • πŸ“„ Enhanced Diffusion Sampling: Efficient Rare Event Sampling and Free Energy Calculation with Diffusion Models
    Yu Xie, Ludwig Winkler, Lixin Sun, Sarah Lewis, Adam E. Foster, JosΓ© JimΓ©nez Luna, Tim Hempel, Michael Gastegger, Yaoyi Chen, Iryna Zaporozhets, Cecilia Clementi, Christopher M. Bishop, Frank NoΓ© β€” arXiv, 2026-02-18
    abs

  • 🧬 Towards inferring atomic scale conformation landscape of biomolecules from cryo-electron tomography data
    Feyzi, F. S.; Jonic, S. β€” bioRxiv, 2026-02-17
    abs

  • 🧬 BOND-PEP: topology-conditioned bipartite alignment for evidence-grounded peptide binder generation
    Ding, W. β€” bioRxiv, 2026-02-18
    abs

  • 🧬 Distilling Protein Language Models with Complementary Regularizers
    Wijaya, E. β€” bioRxiv, 2026-02-18
    abs

  • 🧬 MolDeBERTa: Foundational Model for Physicochemical and Structural-Informed Molecular Representation Learning
    de Oliveira, G. B.; Saeed, F. β€” bioRxiv, 2026-02-17
    abs

  • 🧬 Multi-modal tissue-aware graph neural network for in silico genetic discovery
    Aggarwal, A.; Sokolova, K.; Troyanskaya, O. G. β€” bioRxiv, 2026-02-18
    abs

  • 🧬 OncoBERT: Context-Aware Modeling of Somatic Mutations for Precision Oncology
    Patkar, S.; Auslander, N.; Harmon, S.; Choyke, P.; Turkbey, B. β€” bioRxiv, 2026-02-19
    abs

  • 🧬 Fast structural search for classification of gut bacterial mucin O-glycan degrading enzymes
    Erden, M.; Schult, T.; Yanagi, K.; Sahoo, J. K.; Kaplan, D. L.; Cowen, L. J.; Lee, K. β€” bioRxiv, 2026-02-18
    abs

  • 🧬 Harnessing DNA Foundation Models for Cross-Species Transcription Factor Binding Site Prediction in Plant Genomes
    Haghani, M.; Dhulipalla, K. V.; Li, S. β€” bioRxiv, 2026-02-19
    abs

  • 🧬 Bacterial protein function prediction via multimodal deep learning
    Muzio, G.; Adamer, M.; Fernandez, L.; Miklautz, L.; Borgwardt, K.; Avican, K. β€” bioRxiv, 2026-02-22
    abs

Read more