Stanford Das Lab Enhances RNA Folding Research with NVIDIA DGX Cloud

Felix Pinkston   Apr 11, 2025 23:42  UTC 15:42

0 Min Read

The Das Lab at Stanford University is making significant strides in RNA folding research, utilizing the advanced computational capabilities of NVIDIA DGX Cloud. This initiative, supported by the NAIRR Pilot program, provides the lab with access to 32 NVIDIA A100 DGX Cloud nodes, each equipped with eight GPUs, over a three-month period. This substantial computational power has enabled the lab to transition from small-scale experiments to large-scale distributed training, facilitating the training of large models and datasets, according to NVIDIA.

Community-Driven Research

Under the leadership of Dr. Rhiju Das, the Das Lab has been at the forefront of RNA research. In 2020, the lab hosted the OpenVaccine Kaggle competition in response to the Covid-19 pandemic and continued its efforts with the Ribonanza competition in 2024. These initiatives aim to accelerate the understanding of RNA structures and their biological functions.

One of the major hurdles in RNA folding research is the scarcity of experimental RNA structure data. To overcome this, the Das Lab developed Eterna, a crowdsourcing game that allows users to design RNA sequences. These sequences are synthesized in the lab, and chemical mapping experiments are conducted to infer RNA structures.

Innovative Strategies

The lab's strategy incorporates several innovative approaches:

  • Crowdsourced Data Collection: Eterna is used to gather novel RNA sequences from the public, complemented by expert-curated databases.
  • Data Approximation: Chemical mapping experiments produce reactivity profiles that help approximate RNA structures.
  • Model Design through Crowdsourcing: Using Kaggle competitions, the lab tests various model architectures and training pipelines with community involvement.

Additionally, the lab has developed a reinforcement learning model trained to play Eterna, accelerating the generation of novel sequences. This model utilized 4,000 A100 GPU hours on the NVIDIA DGX Cloud and was trained using the Q-learning algorithm.

Remarkable Results

The Das Lab has successfully curated the largest database for RNA structure training. The foundation models, trained on 256 A100 GPUs, have led to the development of RibonanzaNet2, which currently achieves state-of-the-art performance in RNA folding tasks. This model is now available for community use and fine-tuning.

On February 26, 2025, the lab launched the Stanford RNA 3D Folding Kaggle competition, offering a $75,000 prize pool to encourage further refinement of RibonanzaNet2 for RNA structure prediction. This competition invites participants to leverage experimental RNA structures collected during the contest period.

Future Prospects

The research conducted by the Das Lab holds significant potential for advancing biological sciences, with implications for medicine, agriculture, and biotechnology. By developing more accurate RNA models, researchers can better understand disease mechanisms and create more effective treatments.

Looking ahead, the Das Lab plans to expand its datasets and models, utilizing even more powerful computational resources provided by NVIDIA DGX Cloud. Their work exemplifies the power of crowdsourcing and cutting-edge technology in advancing scientific research.



Read More