We compiled all replicons from complete bacterial genomes and classified them into two categories:
The rationale behind this classification is described in the Methods, Replicon analysis section.
For every taxonomic group at the levels of phylum, class, order, family, and genus that contained ≥100 ERs in our dataset, we generated two-dimensional scatter plots and kernel density (KDE) plots represented as violin plots using GC content (%) and replicon size (bp).
For each eligible taxonomic group, two versions of every plot were produced:
KDE violin plots were scaled identically within each taxon to enable direct comparison of the shape of chromosome and ER distributions, rather than their absolute counts. KDE is a non-parametric technique that estimates smooth probability-density curves without assuming normality or unimodality (Kitchenham, 2015; Węglarczyk, 2018).