🍟 CRISPRware: Guide RNA Library Design🧬, 🧪hDNApipe: streamlining human genome analysis👤, 🦠MiCoDe: Microbiome Community Detection🔍

Stay Updated with the Latest in Bioinformatics!

Issue: 93 | Date: 04 July 2025

👋 Welcome to the Bioinformer Weekly Roundup!

In this newsletter, we curate and bring you the most captivating stories, developments, and breakthroughs from the world of bioinformatics. Whether you are a seasoned researcher, a student, or simply curious about the intersection of biology and data science, we have got you covered. Subscribe now to stay ahead in the exciting realm of Bioinformatics!

🔬 Featured Research

69.9-kb long inverted repeat increases genome instability in a strain of Lactobacillus crispatus | Oxford Academic

This study likely investigates a large inverted repeat sequence in the genome of L. crispatus and its role in promoting genomic instability. The repeat may facilitate recombination events or structural rearrangements, impacting genome integrity and possibly influencing strain-specific traits.

Comprehensive profiling of integrative conjugative elements (ICEs) in Mollicutes: distinct catalysts of gene flow and genome shaping | Oxford Academic

This research probably characterizes ICEs across Mollicutes, a group of wall-less bacteria. It may detail how ICEs contribute to horizontal gene transfer, genome plasticity, and adaptation, highlighting their structural diversity and evolutionary significance in shaping microbial genomes.

Inferring metabolite states from spatial transcriptomes using multiple graph neural network | bioRxiv

This study introduces MGFEA, a graph-based algorithm that infers metabolite levels from spatial and single-cell transcriptomic data. It integrates gene interaction and spatial graphs guided by genome-scale metabolic models to estimate metabolic fluxes. MGFEA improves inference accuracy by incorporating metabolome data and addresses limitations of prior models like scFEA and Compass.

A systematic assessment of phylogenomic approaches for microbial species tree reconstruction | bioRxiv

The authors evaluate various phylogenomic methods for reconstructing microbial species trees, focusing on gene-level evolutionary histories and their impact on genome-wide phylogenies. They propose a visualization framework using low-dimensional tree space to identify outlier gene histories and improve species tree estimation. The approach aids in selecting gene sets for robust phylogenomic inference.

Machine learning differentiates between bulk and pseudo-bulk RNA-seq datasets | bioRxiv

This research presents bulk2sc, a variational autoencoder model that generates synthetic single-cell RNA-seq data from bulk RNA-seq. It deconvolves pseudo-bulk datasets by learning cell-type distributions, enabling single-cell level insights from bulk data. The model is validated against real scRNA-seq data and offers a cost-effective alternative for disease studies.

Novel binning-based methods for model fitting and data splitting improved machine learning imbalanced data | bioRxiv

The study benchmarks deep learning-based metagenomic binning tools, highlighting COMEBin and GenomeFace for their accuracy and speed. It emphasizes the effectiveness of multi-sample binning and embedding space partitioning for low-coverage datasets. The work provides standardized workflows for evaluating binning performance and improving MAG recovery.

Extensive data mining uncovers novel diversity among members of the rare biosphere within the Thermoplasmatota | BMC Microbiome

Researchers identified three novel orders within the class Ca. Penumbrarchaeia of Thermoplasmatota using metagenomic mining and enrichments. These rare biosphere members exhibit unique gene content and potential roles in organic matter degradation in anoxic environments. The study highlights their functional novelty and habitat specificity.

Lineage-specific expansions of polinton-like viruses in photosynthetic cryptophytes | BMC Microbiome

Using long-read sequencing, the study uncovers over a thousand polinton-like viruses (PLVs) in cryptophyte genomes, particularly Rhodomonas lacustris. These PLVs show lineage-specific expansions and diverse replication strategies. The findings link PLVs to host-virus interactions and suggest their role as endogenous viral elements in freshwater protists.

Comparative transcriptomic analysis reveals the important role of hepatic fatty acid metabolism in the acute heat stress response in chickens | BMC Genomics

This study analyses transcriptomic changes across multiple chicken tissues under acute heat stress. The liver shows significant differential gene expression, with fatty acid metabolism pathways playing a central role. Functional validation of FASN in hepatocytes confirms its involvement in mitigating heat-induced metabolic disruptions.

Complete chloroplast genomes of 25 mulberry plants: insight into genome characteristics, comparative analysis and phylogenetic relationships | BMC Genomics

The authors sequenced and analysed chloroplast genomes of 25 Morus species, identifying conserved structures and SSR polymorphisms. Phylogenetic analyses grouped the species into three clades based on usage (leaf, fruit, wild). The study provides SSR markers for classification and insights into mulberry phylogeny.

🛠️ Latest Tools

2dSpAn-Auto: an automated tool for analysis of two-dimensional dendritic spine images | BMC Bioinformatics

2dSpAn-Auto provides two workflows—binary skeletonization (2dSpAn-Auto.b) and fuzzy skeletonization (2dSpAn-Auto.f)—to segment and quantify dendritic spines in 2D maximum intensity projection images. It extracts spine density and morphometry metrics (area, length, head width, neck widths) along with total dendrite length via automated batch processing with optional expert parameter tuning through a GUI. Validation across in vitro, ex vivo, and in vivo imaging demonstrates high accuracy and reproducibility under varying protocols. The open-source tool, released under GPL v3, addresses the need for fast, modality-agnostic spine analysis in neurological research and clinical studies.