
from QC to gene prediction and phylogenomics
BUSCO v5.3.2 is the current stable version!
Gitlab, a Conda package and Docker container are also available.
Based on evolutionarily-informed expectations of gene content of near-universal single-copy orthologs, BUSCO metric is complementary to technical metrics like N50.
BUSCO has been selected as one of the SIB Remarkable Outputs of 2021!
For more info and to discover the other Remarkable Outputs see here.
Availability
- Git source code
- Docker container
- Conda package
New in v4/v5
- Bacteria & archaea revised
- Auto-lineage selection
- Faster assessments of euk genomes
vs CheckM
- Scores eukaryotes and prokaryotes
- Can run on a laptop
- Better resolution, less overestimates
Cite us
The novelties introduced in BUSCO v4 and v5 and the new BUSCO datasets (*_odb10) are described here.
If you've used these versions the correct citation would be:
Mosè Manni, Matthew R Berkeley, Mathieu Seppey, Felipe A Simão, Evgeny M Zdobnov, BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes. Molecular Biology and Evolution, Volume 38, Issue 10, October 2021, Pages 4647–4654
Additional protocols and applications are described in:
Manni, M., Berkeley, M. R., Seppey, M., & Zdobnov, E. M. (2021). BUSCO: Assessing genomic data quality and beyond. Current Protocols, 1, e323. doi: 10.1002/cpz1.323
Obtain BUSCO
Full installation instructions are provided in the user guide.
BUSCO is available as a conda package and as a Docker image. Both of these versions are ready to run out of the box. Alternatively, it is also possible to manually install BUSCO.
The BUSCO software now directly downloads the necessary datasets, specified by the user or automatically selected.
To display all available datasets
busco --list-datasets
You can also download them manually and see a list with the number of marker genes.
Earlier versions: v4, v3, v2, v1