Interactive version of Figure 1:

Chromosomes and ERs distribution by GC content and replicon size

We compiled all replicons from complete bacterial genomes and classified them into two categories:

The rationale behind this classification is described in the Methods, Replicon analysis section.

For every taxonomic group at the levels of phylum, class, order, family, and genus that contained ≥100 ERs in our dataset, we generated two-dimensional scatter plots and kernel density (KDE) plots represented as violin plots using GC content (%) and replicon size (bp).

For each eligible taxonomic group, two versions of every plot were produced:

KDE violin plots were scaled identically within each taxon to enable direct comparison of the shape of chromosome and ER distributions, rather than their absolute counts. KDE is a non-parametric technique that estimates smooth probability-density curves without assuming normality or unimodality (Kitchenham, 2015; Węglarczyk, 2018).

References

Phyla with ≥100 ERs

Click to show / hide plots

Classes with ≥100 ERs

Click to show / hide plots

Orders with ≥100 ERs

Click to show / hide plots

Families with ≥100 ERs

Click to show / hide plots

Genera with ≥100 ERs

Click to show / hide plots