Weekly BioML Digest [May 04, 2026]

Share
Weekly BioML Digest [May 04, 2026]

Machine Learning × Computational Biology paper compilation

Hey! It's your weekly digest of machine learning papers in CompBio and Drug Discovery.

Feedback? Email me at biomldigest@gmail.com.

📚 Peer-Reviewed Journals (Top 20)

534 matched filters -> 20 selected after LLM relevance + novelty ranking.

  • Genome modelling and design across all domains of life with Evo 2
    Brixi, Garyk, Durrant, Matthew G., Ku, Jerome, Naghipourfar, Mohsen, Poli, Michael, Sun, Gwanggyu, Brockman, Greg, Chang, Daniel, Fanton, Alison, Gonzalez, Gabriel A., King, Samuel H., Li, David B., Merchant, Aditi T., Nguyen, Eric, Ricci-Tam, Chiara, Romero, David W., Schmok, Jonathan C., Taghibakhshi, Ali, Vorontsov, Anton, Yang, Brandon, Deng, Myra, Gorton, Liv, Nguyen, Nam, Wang, Nicholas K., Pearce, Michael T., Simon, Elana, Adams, Etowah, Amador, Zachary J., Ashley, Euan A., Baccus, Stephen A., Dai, Haoyu, Dillmann, Steven, Ermon, Stefano, Guo, Daniel, Herschl, Michael H., Ilango, Rajesh, Janik, Ken, Lu, Amy X., Mehta, Reshma, Mofrad, Mohammad R. K., Ng, Madelena Y., Pannu, Jaspreet, Ré, Christopher, John, John, Sullivan, Jeremy, Tey, Joseph, Viggiano, Ben, Zhu, Kevin, Zynda, Greg, Balsam, Daniel, Collison, Patrick, Costa, Anthony B., Hernandez-Boussard, Tina, Ho, Eric, Liu, Ming-Yu, McGrath, Thomas, Powell, Kimberly, Pinglay, Sudarshan, Burke, Dave P., Goodarzi, Hani, Hsu, Patrick D., Hie, Brian L. — Nature, 2026-04-30
    abs

  • eSIG-Net: an interaction language model that decodes the protein code of single mutations
    Pan, Xingxin, Shrawat, Aditya, Raghavan, Sidharth, Dong, Chuanpeng, Yang, Yuntao, Li, Zhao, Zheng, W. Jim, Eckhardt, S. Gail, Wu, Erxi, Fuxman Bass, Juan I., Jarosz, Daniel F., Chen, Sidi, McGrail, Daniel J., Sheynkman, Gloria M., Huang, Jason H., Sahni, Nidhi, Yi, S. Stephen — Nature Methods, 2026-04-29
    abs

  • DeepSeMS: revealing the hidden biosynthetic potential of the global ocean microbiome with a large language model
    Xu, Tingjun, Yang, Yuwei, Zhu, Ruixin, Lin, Weili, Li, Jixuan, Zheng, Yan, Zhang, Peng, Zhang, Guoqing, Zhao, Guoping, Jiao, Na — Nature Computational Science, 2026-04-30
    abs

  • Improving protein-ligand complex generation with force field guidance
    Lai, Helen, Wang, Tingyu, Sirelkhatim, Hassan, Eaton, Joe, Huang, Howard, Rees, Brad, Engkvist, Ola, Janet, Jon Paul, Wang, Xiaoyun, Tibo, Alessandro — Journal of Cheminformatics, 2026-05-02
    abs

  • Benchmarking co-folding methods to predict the structures of covalent protein–ligand complexes
    Zhang, Tong-han, Zhu, Jin-tao, Huang, Zhi-xian, Xie, Juan, Pei, Jian-feng, Lai, Lu-hua — Acta Pharmacologica Sinica, 2026-05-01
    abs

  • Augmented prediction of multi-species protein–RNA interactions using evolutionary conservation of RNA-binding proteins
    He, Jiale, Zhou, Tong, Hu, Lu-Feng, Jiao, Yuhua, Wang, Junhao, Yan, Shengwen, Jia, Siyao, Chen, Qiuzhen, Zhu, Wentao, Zhang, Jilin, Jia, Mutian, Li, Yuanning, Wang, Xianwei, Wang, Yangming, Yang, Yucheng T., Sun, Lei — Nature Communications, 2026-04-27
    abs

  • Pharmacology knowledge graphs enable drug repurposing without chemical structure information
    Abo-Dahab, Youssef, Hernandez, Ruby, Duran, Ismael Caleb Arechiga — Discover Artificial Intelligence, 2026-04-28
    abs

  • Deep Learning-Based Protein Half-Life Prediction for Identifying Rate-Limiting Enzymes in Metabolic Pathways to Alleviate Bottleneck Reactions
    Yunhyeok Lee, Jun Ren, Jingyu Lee, Minh Thi-Hong Tran, Yu-Kyung Kim, Youngseop Chang, So-Hee Oh, Hyang-Mi Lee, D. Na — Journal of Microbiology and Biotechnology, 2026-04-28
    abs

  • Physics-informed deep learning for molecular solubility prediction: integrating thermodynamic constraints with neural network architectures
    Amiri, Masoud — Scientific Reports, 2026-04-27
    abs

  • A Comprehensive Dataset of Chemical Reactions Covering Second and Third Row Elements with Million-Scale Quantum Chemical Calculations
    Lee, Kyunghoon, Kang, Sungwoo, Kang, Beomgyu, Son, Won-Joon, Lee, Seungmin, Kim, Dae Sin, Kim, Woo Youn — Scientific Data, 2026-04-29
    abs

  • Experimentally validated deep learning control of protein aggregation
    Cima, Vojtech, Kunka, Antonin, Planas-Iglesias, Joan, Grakova, Ekaterina, Havlasek, Martin, Subramanian, Madhumalar, Beloch, Michal, Marek, Martin, Slaninova, Katerina, Damborsky, Jiri, Prokop, Zbynek, Bednar, David, Martinovic, Jan — Communications Chemistry, 2026-04-28
    abs

  • Hypervariable loop profiling decodes sequence determinants of antibody stability
    Wan, Yue, Liang, Jiahao, Dai, Yile, Srinivasan, Karthik, Billesbølle, Christian, Zhu, Ju-Fen, Shin, Jung-Eun, Paul, Steffanie, Marks, Debora, Song, Yun S., Myers, Benjamin R., Koehl, Antoine, Manglik, Aashish — Nature Structural & Molecular Biology, 2026-04-30
    abs

  • Accelerating natural product discovery with linked MS-genomics and language/transformer-based models
    Tay, Dillon W. P., Koh, Winston, Ang, Shi Jun, Wong, Zicong Marvin, Lim, Yi Wee, Heng, Elena, Ng, Yu Hung, Yeo, Naythan Z. X., Adaikkappan, Krishnan, Wong, Fong Tian, Lim, Yee Hwee — npj Antimicrobials and Resistance, 2026-04-30
    abs

  • Integration of in vitro and in silico approaches enables prediction of drug-induced liver injury
    Geci, René, Sayin, Ahenk Zeynep, Schaller, Stephan, Kuepfer, Lars — Archives of Toxicology, 2026-05-01
    abs

  • plasmaCHORD: A Machine Learning Approach to Distinguish Clonal Hematopoiesis–Derived Variants in Liquid Biopsies from Patients with Solid Tumors
    J. Canzoniero, D. Rabizadeh, I. Ziakas, Jaime Wehr, A. Balan, Amna Jamali, Blair V. Landon, L. Sivapalan, S. Scott, Gavin Pereira, V. Lam, C. Hann, C. Lovly, Jessica Tao, P. Forde, Joseph C. Murray, M. Sausen, G. Meijer, G. Vink, R. Fijneman, E.J.T. M.A. I.H.J.T. A.I. J.W.T. A.W. M.P. R. J. J.J.B. F Belt Davidis de Hingh de Vos Dekker Haringhuizen H, E. Belt, M. A. Davidis, I. D. de Hingh, A. De Vos, J. Dekker, A. Haringhuizen, M. Hendriks, R. Hoekstra, J. Jansen, Jacob Janssen, F. Jeurissen, F. Kloppenberg, J. Konsten, M. Koopman, M. Los, P. Nieboer, Nandette Peters, J. Pruijt, R. Rietbroek, A. Schiphorst, R. Schrauwen, J. Schreinemakers, L. Simkens, D. W. Sommeijer, H. Stockmann, Koen Talsma, F. Terheggen, M. Tjin‐A‐Ton, Liselot Valkenburg, H. van Cruijsen, J. van der Bilt, A. Van Der Velden, Joyce M. van Dodewaard, Teo van Voorthuizen, W. Vles, D. Zimmerman, V. Velculescu, J. Phallen, R. Scharpf, Valsamo Anagnostou — Clinical Cancer Research, 2026-05-01
    abs

  • EpitopeGNN: A Graph Neural Network for Influenza A Virus Hemagglutinin Subtype Classification Based on 3D Structure
    A. Timofeev, Alexander Anufriev, O. Ergashev, I. Isakova-Sivak — BioMedInformatics, 2026-04-28
    abs

  • MS-ConTab: Multi-Scale Contrastive Learning of Mutation Signatures for Pan-Cancer Representation and Stratification
    Yifan Dou, Adam Khadre, Ruben C. Petreaca, Mirzaei Golrokh — Bioinformatics, 2026-04-28
    abs

  • Machine Learning Model for Predicting Molecular Surface Area and Volume Defined by the Kohn-Sham One-electron Potential
    Meng, Peiran, Feng, Jiayi, Song, Jiaqing, Kong, Xianghe, Zhao, Jian, Yu, Chunyang, Gong, Lidong — Chemical Research in Chinese Universities, 2026-04-27
    abs

  • Hybrid deep learning model (E2BNet) combining ESM2 and BiLSTM for 8-state protein secondary structure prediction
    Droba, Diganta Das, Banu, Ayesha, Sami, Miftiaz Inti, Hossain, Riad, Chowdhury, Mahfuzulhoq — Discover Applied Sciences, 2026-04-27
    abs

  • PRGNet: a Parallel Residual Graph Network for enhanced drug-target binding affinity prediction
    Liu, Jing, Mehmood, Aamir, Liu, Haiqi, Liu, Yishu, Wei, Dongqing, Li, Daixi — BMC Genomics, 2026-04-28
    abs

🧬 Preprints (arXiv + bioRxiv)

62 matched filters -> 20 selected after LLM relevance + novelty ranking.

  • 🧬 Programming Biomolecular Interactions with All-Atom Generative Model
    Kong, X.; Chen, J.; Zhang, Z.; Li, G.; Zhu, Q.; Wei, L.; Li, M.; Shi, Y.; Dai, W.; Zhang, Z.; Tan, W.; Jiao, R.; Wang, X.; Zheng, J.; Yu, Z.; Wu, Q.; Guo, Z.; Zhang, L.; Li, W.; Huang, Q.; Zhu, T.; Wang, X.; Huang, W.; She, Y.; Zhang, J.; Liu, Y.; Liu, K.; Ma, J. — bioRxiv, 2026-04-30
    abs

  • 📄 MIMIC: A Generative Multimodal Foundation Model for Biomolecules
    Siavash Golkar, Jake Kovalic, Irina Espejo Morales, Samuel Sledzieski, Minhuan Li, Ksenia Sokolova, Geraud Krawezik, Alberto Bietti, Claudia Skok Gibbs, Roman Klypa, Shengwei Xiong, Francois Lanusse, Liam Parker, Kyunghyun Cho, Miles Cranmer, Tom Hehir, Michael McCabe, Lucas Meyer, Rudy Morel, Payel Mukhopadhyay, Mariel Pettee, Helen Qu, Jeff Shen, David Fouhey, Hadi Sotoudeh, Vikram Mulligan, Pilar Cossio, Sonya M. Hanson, Alisha N. Jones, Olga G. Troyanskaya, Shirley Ho — arXiv, 2026-04-27
    abs

  • 🧬 Learning the All-Atom Equilibrium Distribution of Biomolecular Interactions at Scale
    Wang, Y.; Xu, Y.; Li, W.; Yu, H.; Tan, W.; Li, S.; Huang, Q.; Chen, N.; Wu, X.; Wu, Q.; Liu, K. — bioRxiv, 2026-04-29
    abs

  • 🧬 Generative design of sequence specific DNA binding proteins
    Sehgal, E.; Politanska, Y.; Mitra, R.; Kim, P. T.; Gonzalez Rodriguez, N.; Warrier, T.; Kubaney, A.; Morishita, A.; Quijano, R.; Butcher, J.; Krishna, R.; Pecoraro, R.; Belmont, B.; Roullier, N.; Goreshnik, I.; Vafeados, D. K.; Kwon, P.; Ramarao, R.; Taipale, J.; Glasscock, C. J.; Baker, D. — bioRxiv, 2026-04-27
    abs

  • 🧬 Pan-cancer virtual spatial transcriptomics from routine histology with Phoenix
    Tran, M.; Gindra, R. H.; Putze, P.; Senbai, K.; Palla, G.; Kos, T.; Falcomata, C.; Wang, C.; Guo, R.; Boxberg, M.; Berclaz, L. M.; Lindner, L. H.; Bergmayr, L.; Knoesel, T.; Jurmeister, P.; Klauschen, F.; Homicsko, K.; Gottardo, R.; Eckstein, M.; Matek, C.; Mock, A.; Theis, F. J.; Saur, D.; Peng, T. — bioRxiv, 2026-04-29
    abs

  • 🧬 Aiki-GeNano: Multi-Stage Preference Optimization for Generative Design of Developable Nanobodies
    Meda, R. S.; Doshi, J.; Iyer, E.; Shastry, S.; Mysore, V. — bioRxiv, 2026-05-01
    abs

  • 🧬 Advancing ab initio genome annotation with OrionGeno
    Liu, L.; Cai, X.; Wang, S.; Deng, Y.; Wu, Y.; Pan, Y.; Wang, J.; Zhang, C.; Xia, H.; Tan, N.; Su, K.; Liu, Y.; Zhou, X.; Liu, L.; Wei, T.; Zhang, Y.; Li, Q.; Li, Y.; Yin, P.; Xu, X. — bioRxiv, 2026-04-29
    abs

  • 🧬 Reconstructing True 3D Spatial Omics at Single-Cell Resolution
    Yang, Y.; Luo, Y.; Zhang, K.; Bu, Y.; Xia, Z.; Peng, H.; Yan, R.; Liu, Q.; Chen, Y.; Shen, L.; Chen, E. — bioRxiv, 2026-05-01
    abs

  • 🧬 Scalable machine learning improves resistance prediction and identifies novel determinants in Mycobacterium tuberculosis
    Serajian, M.; Lotfollahi, M.; Green, O.; Smith, K.; Marini, S.; Prosperi, M.; Boucher, C. — bioRxiv, 2026-04-29
    abs

  • 📄 Learning biophysical models of gene regulation with probability flow matching
    Suryanarayana Maddu, Victor Chardès, Michael J. Shelley — arXiv, 2026-04-27
    abs

  • 📄 Computational Design and Experimental Validation of Photoactive PARP1 Inhibitors
    Simon Axelrod, Miroslav Kašpar, Kristýna Jelínková, Markéta Šmídková, Erika Bartůňková, Sille Štěpánová, Eugene Shakhnovich, Václav Kašička, Martin Dračínský, Zlatko Janeba, Rafael Gómez-Bombarelli — arXiv, 2026-04-27
    abs

  • 🧬 Reference-free compound identification using computational prediction of molecular properties and multi-dimensional spectrometric measurements: a fentanyl case study
    Harrilal, C. P.; Hollerbach, A. L.; Ciesielski, D.; Schultz, K. J.; Overstreet, R.; Rice, P. S.; King, E.; Nguyen, J.; Ross, D. H.; Lin, V. S.; Deng, G. Y.; Brayfindley, E.; Webb-Robertson, B.-J.; Raugei, S.; Ibrahim, Y. M.; Ewing, R. G.; Metz, T. — bioRxiv, 2026-04-27
    abs

  • 🧬 MissAlignment Teaches Itself Better Cryo-ET Tilt-Series Alignment by Making It Worse
    Chaillet, M. L.; van Loenhout, J.; Leung, M. R.; Burt, A.; Tegunov, D. — bioRxiv, 2026-05-02
    abs

  • 🧬 Designing biochemical circuits with tree search
    Bhamidipati, P. S.; Thomson, M. — bioRxiv, 2026-05-03
    abs

  • 📄 Towards A Generative Protein Evolution Machine with DPLM-Evo
    Xinyou Wang, Liang Hong, Jiasheng Ye, Zaixiang Zheng, Yu Li, Shujian Huang, Quanquan Gu — arXiv, 2026-04-30
    abs

  • 📄 Hyper-Dimensional Fingerprints as Molecular Representations
    Jonas Teufel, Luca Torresi, André Eberhard, Pascal Friederich — arXiv, 2026-04-30
    abs

  • 📄 VQ-SAD: Vector Quantized Structure Aware Diffusion For Molecule Generation
    Farshad Noravesh, Reza Haffari, Layki Soon, Arghya Pal — arXiv, 2026-05-01
    abs

  • 🧬 Overcoming systematic data biases enables accurate prediction of enzyme kcat fold-changes for computational protein design
    Rousset, Y.; Kroll, A.; Lercher, M. — bioRxiv, 2026-04-30
    abs

  • 🧬 Knowledge Inclusive Machine Learning for Disease Gene Prioritisation
    Gamage, C. J.; Xia, Y.; Rupasinghe, R.; Senevirathne, S.; Senanayake, D.; Malepathirana, T.; Hevapathige, A.; Corbett, M.; O'Brien, T. J.; Petrou, S.; Berkovic, S. F.; Scheffer, I. E.; Gecz, J.; Bahlo, M.; Bennett, M. F.; Halgamuge, S. K. — bioRxiv, 2026-05-02
    abs

  • 🧬 UshEffect-3D: Structure-informed Classification of USH2A Missense Variants for Inherited Retinal Disease
    Choudhary, D.; Portelli, S.; Ascher, D. B. — bioRxiv, 2026-04-27
    abs

Read more