The Tools Bioinformaticians Are Building Now

Published by The Daily Scout

What happened

Professionals are sharing the custom tools they're building for data analysis. Bioinformatician James Ferguson released "kuva," a Rust-based plotting library with 25 plot types specifically for bioinformatics. Meanwhile, PhD student Yara posted a tutorial for building protein interaction networks with STRING and Cytoscape. These projects show how computational biologists often create their own software to solve specific research problems.

Why it matters

A typical day for a bioinformatician involves much more than just coding. Their time is often split between writing scripts for data analysis, debugging programs, managing large datasets, and attending meetings with research teams to discuss project goals and results. This role is highly collaborative, requiring strong communication skills to work effectively with biologists, statisticians, and other scientists. The educational path for a bioinformatician is distinct from that of a medical doctor, even though both are rooted in life sciences. A bioinformatics track, often pursued through a Master's or PhD, emphasizes computer science, statistics, and biology. This path is geared towards research and data analysis, while a pre-med track focuses on foundational science courses required for medical school, with a primary goal of patient care. A PhD in bioinformatics is tailored for individuals who want to develop new computational methods and analyze large-scale biological data, often leading to roles in academic research or as lead scientists in the biotech industry. In contrast, an M.D. is trained for clinical decision-making and patient interaction. While some pursue an M.D./PhD to bridge both worlds, it's a much longer and more demanding path. The choice between programming languages like Rust and Python often comes down to the specific task. Python is popular in the scientific community due to its extensive libraries for data analysis and visualization. Rust, on the other hand, is chosen for its high performance and memory safety, which is crucial when building complex, high-throughput analysis tools from the ground up. Custom plotting libraries, like "kuva," are essential for visualizing complex genomic data. For example, researchers might use such a library to create Circos plots, which can show relationships between different parts of the genome, like chromosomal rearrangements in cancer cells. These visualizations help scientists identify patterns that would be impossible to see in raw data. Tools like STRING and Cytoscape are fundamental for understanding the intricate web of life at a molecular level. STRING is a massive database of known and predicted protein-protein interactions. Researchers use Cytoscape to visualize these interactions as networks, helping them to understand cellular processes and how they are affected in disease. This visualization can reveal key proteins in a disease pathway, which could become targets for new drugs.

Key numbers

  • Bioinformatician James Ferguson released "kuva," a Rust-based plotting library with 25 plot types specifically for bioinformatics.

What happens next

  • This visualization can reveal key proteins in a disease pathway, which could become targets for new drugs.

Quick answers

What happened in The Tools Bioinformaticians Are Building Now?

Professionals are sharing the custom tools they're building for data analysis. Bioinformatician James Ferguson released "kuva," a Rust-based plotting library with 25 plot types specifically for bioinformatics. Meanwhile, PhD student Yara posted a tutorial for building protein interaction networks with STRING and Cytoscape. These projects show how computational biologists often create their own software to solve specific research problems.

Why does The Tools Bioinformaticians Are Building Now matter?

A typical day for a bioinformatician involves much more than just coding. Their time is often split between writing scripts for data analysis, debugging programs, managing large datasets, and attending meetings with research teams to discuss project goals and results. This role is highly collaborative, requiring strong communication skills to work effectively with biologists, statisticians, and other scientists. The educational path for a bioinformatician is distinct from that of a medical doctor, even though both are rooted in life sciences. A bioinformatics track, often pursued through a Master's or PhD, emphasizes computer science, statistics, and biology. This path is geared towards research and data analysis, while a pre-med track focuses on foundational science courses required for medical school, with a primary goal of patient care. A PhD in bioinformatics is tailored for individuals who want to develop new computational methods and analyze large-scale biological data, often leading to roles in academic research or as lead scientists in the biotech industry. In contrast, an M.D. is trained for clinical decision-making and patient interaction. While some pursue an M.D./PhD to bridge both worlds, it's a much longer and more demanding path. The choice between programming languages like Rust and Python often comes down to the specific task. Python is popular in the scientific community due to its extensive libraries for data analysis and visualization. Rust, on the other hand, is chosen for its high performance and memory safety, which is crucial when building complex, high-throughput analysis tools from the ground up. Custom plotting libraries, like "kuva," are essential for visualizing complex genomic data. For example, researchers might use such a library to create Circos plots, which can show relationships between different parts of the genome, like chromosomal rearrangements in cancer cells. These visualizations help scientists identify patterns that would be impossible to see in raw data. Tools like STRING and Cytoscape are fundamental for understanding the intricate web of life at a molecular level. STRING is a massive database of known and predicted protein-protein interactions. Researchers use Cytoscape to visualize these interactions as networks, helping them to understand cellular processes and how they are affected in disease. This visualization can reveal key proteins in a disease pathway, which could become targets for new drugs.

Get your own daily briefing

Scout delivers personalized news, insights, and conversations tailored to your role and industry.

Download on the App Store

Published by The Daily Scout - Be the smartest in the room.