RNA prediction gets machine‑learning boost

- Wenkai Wang, Zhenling Peng, and Jianyi Yang unveiled trRosettaRNA2, a deep-learning system for RNA 3D structure prediction that also models alternative conformations. - The model beat other RNA predictors in benchmarks, used fewer parameters and compute, and helped Yang-Server top CASP16’s automated RNA prediction track. - Better RNA structure guesses could speed work on ribozymes, switches, and RNA drugs, where shape controls function.

RNA structure prediction is one of those problems that sounds niche until you remember how much biology runs on RNA shape. RNA is not just a messenger copied from DNA. It folds, bends, pairs with itself, and changes form — and those shapes help decide what the molecule actually does. The news here is that a team led by Wenkai Wang, Zhenling Peng, and Jianyi Yang has built a machine-learning system called trRosettaRNA2 that pushes RNA 3D prediction forward, including the harder job of modeling multiple conformations instead of one frozen structure. ### What is the hard part here? Proteins got the big AI structure breakthrough first because researchers had huge training sets. RNA is tougher. There are far fewer experimentally solved RNA 3D structures, and RNA molecules are often floppy rather than rigid. That means the model has to learn from sparse data and still handle molecules that can adopt more than one biologically relevant shape. (nature.com) ### What did this team actually build? Basically, they split the problem in two. First, they trained a module on the much larger pool of RNA secondary-structure data — the base-pairing map that says which letters in the sequence tend to bind each other. Then they fed that information into an end-to-end 3D predictor using what they call structure-aware, or secondary-structure-aware, attention. The result is trRosettaRNA2, plus a secondary-structure predictor called trRNA2-SS. (nature.com) ### Why does secondary structure matter so much? Because RNA folding is hierarchical. The rough pairing pattern usually comes before the full 3D geometry. If you can get that scaffold right, the search space for the final shape shrinks a lot. Turns out this is a smart way around the data shortage — borrow signal from abundant 2D data to improve the scarce 3D task. That is the real trick here. (nature.com) ### Did it actually beat older methods? Yes — that is the headline result. In the paper’s benchmarks, trRosettaRNA2 outperformed other RNA 3D structure prediction methods while using fewer parameters and less computational power. The team also says Yang-Server, built on the method, ranked as the top automated server for RNA structure prediction in the CASP16 blind test and beat AlphaFold 3 there. That matters because blind tests are much harder to game than retrospective benchmarks. (nature.com) ### What is new beyond a single best structure? The conformer piece. Many RNAs do not live in one neat pose. They shift among related shapes, and those shifts can control binding, catalysis, and regulation. The paper highlights ribonuclease P RNA as a case where trRosettaRNA2 captured structural heterogeneity without experimental restraints. In plain English — it is trying to predict motion and alternatives, not just a static snapshot. (nature.com) ### Why should drug people care? Because RNA therapeutics and RNA-targeting drugs depend on structure. If you want to design a small molecule, an antisense oligo, or even a synthetic RNA with a specific behavior, you need a decent guess of what shape the target adopts. Faster modeling also helps basic science — riboswitches, ribozymes, viral RNAs, all of that. A practical server already exists, and the older trRosettaRNA setup could model a roughly 200-nucleotide RNA in about 1 hour on modest CPU resources. (nature.com) ### What is still not solved? RNA dynamics are still messy. Better prediction is not the same as full understanding, and experimental validation still matters. The broader Cell Press overview on RNA prediction makes the same point — deep learning is moving the field fast, but data limits and structural flexibility are still the core bottlenecks. So this is not “RNA solved.” It is more like the field finally has a stronger map. (nature.com) ### Bottom line The important shift is not just that one model got better. It is that RNA prediction is starting to borrow the playbook that transformed protein modeling — use machine learning, lean on richer intermediate data, and turn a slow structural problem into something more routine. For RNA biology and therapeutics, that is a real upgrade. (nature.com) (cell.com)

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Shared from Scout - Be the smartest in the room.