John Lees about 13 hours ago
We've been looking at how to compare and cluster very large numbers of genomes, like those in isolate databases such as AllTheBacteria and in metagenome assembly collections (e.g. SPIRE, MGnify).
On a combined dataset of 5.6 million assemblies, we can now cluster/dereplicate everything in under a day!