Antonio Camargo
@apcamargo.bsky.social
📤 762
📥 122
📝 54
🚨New preprint out! We present a foundational genomic resource of human gut microbiome viruses. It delivers high-quality, deeply curated data spanning taxonomy, predicted hosts, structures, and functions, providing a reference for gut virome research. (1/8)
www.biorxiv.org/content/10.1...
4 days ago
4
90
49
reposted by
Antonio Camargo
Yunha Hwang
13 days ago
We're thrilled to announce SeqHub, an AI-enabled platform for biological sequence analysis. SeqHub brings together sequence search, genome annotation, and data sharing in one place.
3
50
22
reposted by
Antonio Camargo
ace-gtdb.bsky.social
19 days ago
Our
@narjournal.bsky.social
manuscript is out! It explores the growth of the GTDB (
gtdb.ecogenomic.org
) since its inception, as well as updates to the website, methodology, policies, and major taxonomic and nomenclatural changes over the past three years.
academic.oup.com/nar/advance-...
GTDB release 10: a complete and systematic taxonomy for 715 230 bacterial and 17 245 archaeal genomes
Abstract. The Genome Taxonomy Database (GTDB; https://gtdb.ecogenomic.org) provides a phylogenetically consistent and rank normalized genome-based taxonomy
https://academic.oup.com/nar/advance-article/doi/10.1093/nar/gkaf1040/8296754
0
68
49
A BLAST update adding support for compressed files and csv output with headers is a Good Friday night surprise!
blast.ncbi.nlm.nih.gov/doc/blast-ne...
2025 BLAST NEWS — BlastNews 0.1.1 documentation
https://blast.ncbi.nlm.nih.gov/doc/blast-news/2025-BLAST-News.html#download-blast-2-17-0-now
2 months ago
0
35
17
reposted by
Antonio Camargo
Rayan Chikhi
2 months ago
🌎👩‍🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of 🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.
doi.org/10.1101/2024...
3
217
134
reposted by
Antonio Camargo
Ben J Woodcroft
4 months ago
Out in
@natbiotech.nature.com
: Metagenome taxonomy profilers usually ignore unknown species. SingleM is an accurate profiler which doesn't, even detecting phyla with no MAGs. Profiles of 700,000 metagenomes at
sandpiper.qut.edu.au
. A 🧵
7
130
80
reposted by
Antonio Camargo
Bonsai Sequence Bioinformatics
4 months ago
Preprint alert from the group 🚨 super fast grep-like sequence selection
0
6
5
reposted by
Antonio Camargo
Cameron Thrash
4 months ago
Scaling laws of bacterial and archaeal plasmids
www.nature.com/articles/s41...
#jcampubs
Scaling laws of bacterial and archaeal plasmids - Nature Communications
The capacity of a plasmid to express genes is constrained by parameters such as its length and copy number. Here, Maddamsetti et al. present a computational method that enables rapid and accurate dete...
https://www.nature.com/articles/s41467-025-61205-2
1
37
13
reposted by
Antonio Camargo
Robert Aboukhalil
5 months ago
Excited to announce our first interactive article on
sandbox.bio
, about genomic ranges:
sandbox.bio/concepts/gen...
Move & resize the ranges to see how that affects bedtools operations like merge and intersect in real time!
1
47
18
reposted by
Antonio Camargo
Simon Roux
5 months ago
New pre-print out \o/ All about CRISPR, metagenomes, and what you learn when you collect (a lot of) spacers from natural communities, with
@apcamargo.bsky.social
@urineri.bsky.social
@lhug.bsky.social
but also Uri Gophna, Nikhil George (not on Bsky I think) & others at JGI
doi.org/10.1101/2025...
https://doi.org/10.1101/2025.06.12.659409
3
92
45
reposted by
Antonio Camargo
Sebastian Schmidt
5 months ago
The Koonin Law of Computational Biology: Whenever you think you have a great idea in computational or evolutionary biology, it will already have been published by Eugene Koonin in the mid 90ies.
3
69
10
reposted by
Antonio Camargo
Yunha Hwang
5 months ago
At Tatta Bio, we have been thinking deeply about the sequence-to-function problem. We believe that before AI can power functional prediction, we first need to rethink how we curate, manage, and share sequence data. Here, we share our initial ideas on what we are building next:
Today's sequence data infrastructure is set up for failure in the age of AI.
Building an open and collaborative sequence platform for both Human and AI scientists.
https://tattabio.substack.com/p/todays-sequence-data-infrastructure
1
8
4
reposted by
Antonio Camargo
Jim Shaw
6 months ago
Announcing myloasm, a new long-read (ONT R10/PacBio) metagenome assembler that I've been working on during my postdoc in the Heng Li lab (
@lh3lh3.bsky.social
).
myloasm-docs.github.io
myloasm - metagenomic assembly with (noisy) long reads
https://myloasm-docs.github.io/
5
132
81
reposted by
Antonio Camargo
Tábita Hünemeier
6 months ago
Our new paper is out in
@science.org
! By exploring the rich genetic diversity of Brazil, we show how fine-scale genomic analyses reveal that this diversity, rooted in Indigenous ancestry and centuries of complex demographic history, plays a key role in population health.
3
23
8
reposted by
Antonio Camargo
Eduardo Amorim
6 months ago
Very proud of my colleagues and friends for their amazing publication out in Science today! Nunes et al. "Admixture’s impact on Brazilian population evolution and health"
www.science.org/doi/10.1126/...
@hunemeier.bsky.social
@macscastro.bsky.social
👏
Admixture’s impact on Brazilian population evolution and health
Brazil, the largest Latin American country, is underrepresented in genomic research despite boasting the world’s largest recently admixed population. In this study, we generated 2723 high-coverage who...
https://www.science.org/doi/10.1126/science.adl3564
0
20
10
reposted by
Antonio Camargo
STCmicrobeblog
7 months ago
schaechter.asmblog.org/schaechter/2...
#MicroSky
#Archaea
#ArchaeaSky
#SymbioSky
1
20
12
reposted by
Antonio Camargo
Oliver Schwengers
7 months ago
We happily present: “Bakta Web – rapid and standardized genome annotation on scalable infrastructures” @OxUniPress NAR’s Web Server issue
doi.org/10.1093/nar/...
Easy to use, no registration, fast, scalable, various visualizations, in sync with Bakta CLI:
bakta.computational.bio
(1/5)
Bakta Web – rapid and standardized genome annotation on scalable infrastructures
Abstract. The Bakta command line application is widely used and one of the most established tools for bacterial genome annotation. It balances comprehensiv
https://doi.org/10.1093/nar/gkaf335
1
41
28
reposted by
Antonio Camargo
Alex Crits-Christoph
7 months ago
Unique investigation of some errors in long read assemblers. In particular these remarkably chimeric contigs are 😱, if rare.... Improving long read assemblers is definitely the space to be in when it comes to the future of metagenomics, as short reads won't be part of it 😉
3
65
39
reposted by
Antonio Camargo
Ben J Woodcroft
7 months ago
A 1.0 release for Sandpiper. 700,000 microbial community profiles (3x the last version, 4.7 Pbp metaG), searchable via the
@ace-gtdb.bsky.social
R226 taxonomy that just dropped. MetaGs are going exponential, but we are still nowhere near a MAG for all species.
sandpiper.qut.edu.au
#microsky
🧬🖥️ 1/2
3
49
35
reposted by
Antonio Camargo
ace-gtdb.bsky.social
7 months ago
GTDB release 10 based on RefSeq 226 (R10-RS226) is live at
gtdb.ecogenomic.org
. This release covers 732,475 genomes (22% increase) and has 143,614 species clusters (37% increase). Release notes at:
forum.gtdb.ecogenomic.org/t/announcing...
. Release statistics at:
gtdb.ecogenomic.org/stats/r226
.
GTDB - Genome Taxonomy Database
The Genome Taxonomy Database (GTDB) is an initiative to establish a standardised microbial taxonomy based on genome phylogeny.
https://gtdb.ecogenomic.org
0
27
19
reposted by
Antonio Camargo
Itai Yanai
7 months ago
Is the genome just a bag of genes? A new paper in Science now reports that for two thirds of an organism's genes the position along the chromosome is actually very tightly constrained! Amazing work from my favorite night scientist Martin Lercher and his team!
www.science.org/doi/10.1126/...
1
102
30
reposted by
Antonio Camargo
Cameron Thrash
7 months ago
CoverM is published! CoverM: Read alignment statistics for metagenomics
academic.oup.com/bioinformati...
#jcampubs
CoverM: Read alignment statistics for metagenomics
Abstract. Summary. Genome-centric analysis of metagenomic samples is a powerful method for understanding the function of microbial communities. Calculating r
https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btaf147/8107763?rss=1&login=false
1
72
32
reposted by
Antonio Camargo
Kaçar Lab at UW-Madison
8 months ago
🚨New paper!🚨 A comprehensive take on the origins and evolution of translation factors & how these essential players evolved across the tree of life. 🌍🧬 Led by Evrim Fer
@uwmadisonmdtp.bsky.social
grad student! 👏 Free access:
www.sciencedirect.com/science/arti...
@cp-trendsgenetics.bsky.social
3
94
50
reposted by
Antonio Camargo
RdRp Summit
9 months ago
The abstract submission & registration for
#RdRpSummit2025
is NOW OPEN!!! 🎉🙌 Do you work on RNA virus discovery using RdRps? Are you joining the
#ViBioM2025
in Portugal? Consider submitting your research in one of our sessions! More info:
RdRp.io
0
14
15
Publishing a paper in Nature while based in Brazil is an incredible achievement. Huge congratulations to the authors! I'm excited to give this a proper read :)
9 months ago
0
3
0
One of the great things about bioinformatics is how open it can be. Contributing to a project you admire can actually get you involved in it :)
10 months ago
0
8
1
reposted by
Antonio Camargo
A. Murat Eren (Meren)
10 months ago
I'm very happy to report that Matt Schechter's PhD work is now online as a pre-print: "Ribosomal protein phylogeography offers quantitative insights into the efficacy of genome-resolved surveys of microbial communities",
www.biorxiv.org/content/10.1...
Here's a little 🧵 about it.
1
35
20
reposted by
Antonio Camargo
Eduardo Rocha
10 months ago
This preprint makes a point that is valid for a lot of machine learning approaches. Organisms or genes are linked by evolutionary history; they are not independent. This results in correlation between learning and test sets, and often in over-optimistic evaluations of the methods' outcome.
3
59
27
reposted by
Antonio Camargo
Joint Genome Institute
10 months ago
In Science Advances: "We took a deep dive into over 1.8 million bacterial and archaeal genomes to see how much of their diversity we’ve actually captured. Turns out that despite all the genomes we’ve sequenced, we’ve only scratched the surface." -Dongying Wu 🖥️🧬🦠
biosciences.lbl.gov/2025/01/17/t...
Taking Stock of the Known and Unknown Microbial Space - Biosciences Area
Using publicly available genome sequence data generated over the past three decades, JGI researchers assess the known fraction of microbial diversity.
https://biosciences.lbl.gov/2025/01/17/taking-stock-of-the-known-and-unknown-microbial-space/
2
43
17
reposted by
Antonio Camargo
Simon Roux
10 months ago
🚨Postdoc position(s) alert🚨 If you like to develop new bioinformatic tools for *human* microbiome analysis, are interested in viruses/phages, we (JGI Viral Genomics and Microbiome Data groups) may have a great position for you ! No official job posting yet, but email or DM if you are interested !
0
75
77
reposted by
Antonio Camargo
Knut Drescher
10 months ago
We found that many bacterial species use exogenous peptidoglycan fragments - released by lysis of neighboring cells - as a general danger signal, triggering a danger response that protects bacteria against many dangers: biofilm formation. Details here 👇
www.nature.com/articles/s41...
Bacteria use exogenous peptidoglycan as a danger signal to trigger biofilm formation - Nature Microbiology
Peptidoglycan released by neighbouring kin or non-kin cell lysis induces physiological changes that protect from a range of stresses, including phage predation.
https://www.nature.com/articles/s41564-024-01886-5
0
143
58
reposted by
Antonio Camargo
Ryan Wick
10 months ago
New year, new assemblies! I'm excited to announce Autocycler, my new tool for consensus assembly of long-read bacterial genomes! It's the successor to Trycycler, designed to be faster and less reliant on user intervention. Check it out:
github.com/rrwick/Autoc...
(1/5)
Home
A tool for generating consensus long-read assemblies for bacterial genomes - rrwick/Autocycler
https://github.com/rrwick/Autocycler/wiki
2
156
100
reposted by
Antonio Camargo
Mart Krupovic
10 months ago
RNA virologists, check out "The protein structurome of Orthornavirae and its dark matter" by Pascal Mutz, Valerian Dolja, Eugene Koonin et al (including
@simrouxvirus.bsky.social
,
@apcamargo.bsky.social
,
@urineri.bsky.social
,
@anamarijabutkovic.bsky.social
)
journals.asm.org/doi/10.1128/...
The protein structurome of Orthornavirae and its dark matter | mBio
Advanced methods for protein structure prediction, such as AlphaFold2, greatly expand our capability to identify protein domains and infer their likely functions and evolutionary relationships. This i...
https://journals.asm.org/doi/10.1128/mbio.03200-24
0
22
10
reposted by
Antonio Camargo
Gav Armstrong
11 months ago
Don't use red and green data lines/surfaces in the same panel please
#chemsky
. It can be difficult for some colorblind readers to differentiate them. I've accepted (in principle) 2 papers today, and both sets of authors were asked to remove red/green colour contrasts
www.nature.com/articles/d41...
Colour me better: fixing figures for colour blindness
Images can be made more accessible by choosing hues, shapes and textures carefully.
https://www.nature.com/articles/d41586-021-02696-z
4
113
50
reposted by
Antonio Camargo
Mart Krupovic
11 months ago
An interesting paper for the EV (extracellular vesicle) fans by Patel et al.
@ahnaskop.bsky.social
lab: "Extracellular vesicles, including large translating vesicles called midbody remnants, are released during the cell cycle"
www.molbiolcell.org/doi/10.1091/...
0
11
3
reposted by
Antonio Camargo
Yunha Hwang
11 months ago
Can LLM agents discover novel protein functions? Introducing Gaia Agent 🌎 🤖: an AI biologist capable of reasoning across genomic contexts to predict functions of proteins! Gaia Agent is now integrated with Gaia Search at
gaia.tatta.bio
2
38
14
reposted by
Antonio Camargo
Yunha Hwang
11 months ago
If you are at
#NeurIPS2024
don't miss
@ancornman1.bsky.social
's talk on OMG/gLM2 at 9AM!
@workshopmlsb.bsky.social
East meeting room 11,12
0
12
3
reposted by
Antonio Camargo
Yunha Hwang
11 months ago
Excited to be at
#NeurIPS
this week.
@ancornman1.bsky.social
will give a spotlight talk at the
@workshopmlsb.bsky.social
on gLM2/OMG! Please reach out if you want to chat about gLM2/OMG/Gaia and our latest projects😇
www.biorxiv.org/content/10.1...
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling
Biological language model performance depends heavily on pretraining data quality, diversity, and size. While metagenomic datasets feature enormous biological diversity, their utilization as pretraini...
https://www.biorxiv.org/content/10.1101/2024.08.14.607850v2
0
9
3
reposted by
Antonio Camargo
Rob Patro
11 months ago
Very cool work from Yang Lu et al. demonstrating miscalibration of BLASTP’s E-values and generating well-calibrated values via a knockoff-based approach (cc
@mikelove.bsky.social
) -
academic.oup.com/bioinformati...
! More analyses could benefit from knockoff-based approaches.
A BLAST from the past: revisiting blastp’s E-value
Abstract. Motivation. The Basic Local Alignment Search Tool, BLAST, is an indispensable tool for genomic research. BLAST established itself as the canonical
https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btae729/7916501
2
37
22
reposted by
Antonio Camargo
Pierre Peterlongo
12 months ago
🧬🔍There are 50 petabases of freely-available DNA sequencing data. We are introducing Logan Search, which allows you to search for any DNA sequence in minutes, bringing Earth’s largest genomic resource to your fingertips. 🏔️
logan-search.org
🏔️
#Genomics
#Bioinformatics
#OpenScience
2
108
60
reposted by
Antonio Camargo
Yunha Hwang
12 months ago
Hello 🦋
#protein
/
#microbio
/
#BioML
community! We are excited to release Gaia🌎, a context-aware protein search tool, extending protein search and discovery capabilities beyond sequence and structure, to include *genomic context*. Search your favorite protein sequences on
gaia.tatta.bio
10
237
83
reposted by
Antonio Camargo
Martin Steinegger 🇺🇦
12 months ago
New GPU-based MMseqs2: 20x faster searches on a single L40S (approx. as fast as a RTX 4090) vs. a 128-core CPU. This work makes it possible to set up a very cost-efficient ColabFold MSA GPU server. 🧵🧵 📄
www.biorxiv.org/content/10.1...
💾
mmseqs.com
🗞️
developer.nvidia.com/blog/boost-a...
2
174
61
reposted by
Antonio Camargo
Alex Crits-Christoph
about 1 year ago
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling From friends at Tatta Bio GitHub:
github.com/TattaBio/OMG
www.biorxiv.org/content/10.1...
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling
Biological language model performance depends heavily on pretraining data quality, diversity, and size. While metagenomic datasets feature enormous biological diversity, their utilization as pretraini...
https://www.biorxiv.org/content/10.1101/2024.08.14.607850v2.abstract
0
5
2
reposted by
Antonio Camargo
Ákos T Kovács
about 1 year ago
Terrabacteria: redefining bacterial envelope diversity, biogenesis and evolution
#NatureRevMicro
from Simonetta Gribaldo
www.nature.com/articles/s41...
0
4
3
reposted by
Antonio Camargo
Alex Crits-Christoph
about 1 year ago
AntiDefenseFinder! And it is available also as an option with DefenseFinder:
defensefinder.mdmlab.fr
Exploring the diversity of anti-defense systems across prokaryotes, phages, and mobile genetic elements
www.biorxiv.org/content/10.1...
Exploring the diversity of anti-defense systems across prokaryotes, phages, and mobile genetic elements
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
https://www.biorxiv.org/content/10.1101/2024.08.21.608784v1
1
16
16
reposted by
Antonio Camargo
Roland Hatzenpichler
over 1 year ago
Methanogenesis outside the Euryarchaeota experimentally demonstrated by three cultivation-driven studies (two from my lab)! A long🧵.🐻with me
tinyurl.com/4v4fkda6
tinyurl.com/yr4p7js6
tinyurl.com/mtsrj6b9
24
20
11
reposted by
Antonio Camargo
Karthik Anantharaman
over 1 year ago
If you are interested in prophages, we have a new database: Prophage-DB. Check it out and all feedback is welcome
biorxiv.org/cgi/content/...
Prophage-DB: A comprehensive database to explore diversity, distribution, and ecology of prophages
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
https://biorxiv.org/cgi/content/short/2024.07.11.603044v1
0
13
18
reposted by
Antonio Camargo
Ákos T Kovács
over 1 year ago
Phylogenetic reconciliation: making the most of genomes to understand microbial ecology and evolution
#ISMEJournal
from Gergely Szöllősi and colleagues (Phil Hugenholtz, Anja Spang, Cecile Gubry-Rangin, Paul O Sheridan, Laura Eme, Rochelle M Soo and more)
academic.oup.com/ismej/advanc...
0
11
6
reposted by
Antonio Camargo
Alex Crits-Christoph
over 1 year ago
A global atlas of soil viruses reveals unexplored biodiversity and potential biogeochemical impacts - Nature Microbiology
www.nature.com/articles/s41...
@apcamargo.bsky.social
Are these new genomes already in the current version of IMG/VR or would I want to download separately and combine for now?
A global atlas of soil viruses reveals unexplored biodiversity and potential biogeochemical impacts - Nature Microbiology
This study presents an extensive global compendium of metagenomically derived sequences that will serve as a foundation for understanding the role of viruses in soil ecosystems.
https://www.nature.com/articles/s41564-024-01686-x
1
6
5
reposted by
Antonio Camargo
Alex Crits-Christoph
over 1 year ago
github.com/rcedgar/usea...
USEARCH is... open source now (!)
GitHub - rcedgar/usearch12: Open-source usearch
Open-source usearch. Contribute to rcedgar/usearch12 development by creating an account on GitHub.
https://github.com/rcedgar/usearch12/tree/master
1
5
6