Antonio Camargo
@apcamargo.bsky.social
📤 744
📥 117
📝 41
A BLAST update adding support for compressed files and csv output with headers is a Good Friday night surprise!
blast.ncbi.nlm.nih.gov/doc/blast-ne...
2025 BLAST NEWS — BlastNews 0.1.1 documentation
https://blast.ncbi.nlm.nih.gov/doc/blast-news/2025-BLAST-News.html#download-blast-2-17-0-now
18 days ago
0
35
17
reposted by
Antonio Camargo
Rayan Chikhi
21 days ago
🌎👩🔬 For 15+ years biology has accumulated petabytes (million gigabytes) of🧬DNA sequencing data🧬 from the far reaches of our planet.🦠🍄🌵 Logan now democratizes efficient access to the world’s most comprehensive genetics dataset. Free and open.
doi.org/10.1101/2024...
3
213
134
reposted by
Antonio Camargo
Ben J Woodcroft
2 months ago
Out in
@natbiotech.nature.com
: Metagenome taxonomy profilers usually ignore unknown species. SingleM is an accurate profiler which doesn't, even detecting phyla with no MAGs. Profiles of 700,000 metagenomes at
sandpiper.qut.edu.au
. A 🧵
7
128
80
reposted by
Antonio Camargo
Bonsai Sequence Bioinformatics
3 months ago
Preprint alert from the group 🚨 super fast grep-like sequence selection
0
6
5
reposted by
Antonio Camargo
Cameron Thrash
3 months ago
Scaling laws of bacterial and archaeal plasmids
www.nature.com/articles/s41...
#jcampubs
Scaling laws of bacterial and archaeal plasmids - Nature Communications
The capacity of a plasmid to express genes is constrained by parameters such as its length and copy number. Here, Maddamsetti et al. present a computational method that enables rapid and accurate dete...
https://www.nature.com/articles/s41467-025-61205-2
1
37
13
reposted by
Antonio Camargo
Robert Aboukhalil
4 months ago
Excited to announce our first interactive article on
sandbox.bio
, about genomic ranges:
sandbox.bio/concepts/gen...
Move & resize the ranges to see how that affects bedtools operations like merge and intersect in real time!
1
47
18
reposted by
Antonio Camargo
Simon Roux
3 months ago
New pre-print out \o/ All about CRISPR, metagenomes, and what you learn when you collect (a lot of) spacers from natural communities, with
@apcamargo.bsky.social
@urineri.bsky.social
@lhug.bsky.social
but also Uri Gophna, Nikhil George (not on Bsky I think) & others at JGI
doi.org/10.1101/2025...
https://doi.org/10.1101/2025.06.12.659409
3
92
45
reposted by
Antonio Camargo
Sebastian Schmidt
4 months ago
The Koonin Law of Computational Biology: Whenever you think you have a great idea in computational or evolutionary biology, it will already have been published by Eugene Koonin in the mid-90s.
3
69
10
reposted by
Antonio Camargo
Yunha Hwang
4 months ago
At Tatta Bio, we have been thinking deeply about the sequence-to-function problem. We believe that before AI can power functional prediction, we first need to rethink how we curate, manage, and share sequence data. Here, we share our initial ideas on what we are building next:
Today's sequence data infrastructure is set up for failure in the age of AI.
Building an open and collaborative sequence platform for both Human and AI scientists.
https://tattabio.substack.com/p/todays-sequence-data-infrastructure
1
8
4
reposted by
Antonio Camargo
Jim Shaw
4 months ago
Announcing myloasm, a new long-read (ONT R10/PacBio) metagenome assembler that I've been working on during my postdoc in the Heng Li lab (
@lh3lh3.bsky.social
).
myloasm-docs.github.io
myloasm - metagenomic assembly with (noisy) long reads
https://myloasm-docs.github.io/
5
132
81
reposted by
Antonio Camargo
Tábita Hünemeier
4 months ago
Our new paper is out in
@science.org
! By exploring the rich genetic diversity of Brazil, we show how fine-scale genomic analyses reveal that this diversity, rooted in Indigenous ancestry and centuries of complex demographic history, plays a key role in population health.
3
23
8
reposted by
Antonio Camargo
Eduardo Amorim
4 months ago
Very proud of my colleagues and friends for their amazing publication out in Science today! Nunes et al. "Admixture’s impact on Brazilian population evolution and health"
www.science.org/doi/10.1126/...
@hunemeier.bsky.social
@macscastro.bsky.social
👏
Admixture’s impact on Brazilian population evolution and health
Brazil, the largest Latin American country, is underrepresented in genomic research despite boasting the world’s largest recently admixed population. In this study, we generated 2723 high-coverage who...
https://www.science.org/doi/10.1126/science.adl3564
0
20
10
reposted by
Antonio Camargo
STCmicrobeblog
5 months ago
schaechter.asmblog.org/schaechter/2...
#MicroSky
#Archaea
#ArchaeaSky
#SymbioSky
1
20
12
reposted by
Antonio Camargo
Oliver Schwengers
5 months ago
We happily present: “Bakta Web – rapid and standardized genome annotation on scalable infrastructures” @OxUniPress NAR’s Web Server issue
doi.org/10.1093/nar/...
Easy to use, no registration, fast, scalable, various visualizations, in sync with Bakta CLI:
bakta.computational.bio
(1/5)
Bakta Web – rapid and standardized genome annotation on scalable infrastructures
Abstract. The Bakta command line application is widely used and one of the most established tools for bacterial genome annotation. It balances comprehensiv
https://doi.org/10.1093/nar/gkaf335
1
40
28
reposted by
Antonio Camargo
Alex Crits-Christoph
5 months ago
Unique investigation of some errors in long read assemblers. In particular these remarkably chimeric contigs are 😱, if rare.... Improving long read assemblers is definitely the space to be in when it comes to the future of metagenomics, as short reads won't be part of it 😉
3
65
39
reposted by
Antonio Camargo
Ben J Woodcroft
5 months ago
A 1.0 release for Sandpiper. 700,000 microbial community profiles (3x the last version, 4.7 Pbp metaG), searchable via the
@ace-gtdb.bsky.social
R226 taxonomy that just dropped. MetaGs are going exponential, but we are still nowhere near a MAG for all species.
sandpiper.qut.edu.au
#microsky
🧬🖥️ 1/2
3
50
35
reposted by
Antonio Camargo
ace-gtdb.bsky.social
5 months ago
GTDB release 10 based on RefSeq 226 (R10-RS226) is live at
gtdb.ecogenomic.org
. This release covers 732,475 genomes (22% increase) and has 143,614 species clusters (37% increase). Release notes at:
forum.gtdb.ecogenomic.org/t/announcing...
. Release statistics at:
gtdb.ecogenomic.org/stats/r226
.
GTDB - Genome Taxonomy Database
The Genome Taxonomy Database (GTDB) is an initiative to establish a standardised microbial taxonomy based on genome phylogeny.
https://gtdb.ecogenomic.org
0
23
18
reposted by
Antonio Camargo
Itai Yanai
6 months ago
Is the genome just a bag of genes? A new paper in Science now reports that for two thirds of an organism's genes the position along the chromosome is actually very tightly constrained! Amazing work from my favorite night scientist Martin Lercher and his team!
www.science.org/doi/10.1126/...
1
102
30
reposted by
Antonio Camargo
Cameron Thrash
6 months ago
CoverM is published! CoverM: Read alignment statistics for metagenomics
academic.oup.com/bioinformati...
#jcampubs
CoverM: Read alignment statistics for metagenomics
AbstractSummary. Genome-centric analysis of metagenomic samples is a powerful method for understanding the function of microbial communities. Calculating r
https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btaf147/8107763?rss=1&login=false
1
72
32
reposted by
Antonio Camargo
Kaçar Lab at UW-Madison
6 months ago
🚨New paper!🚨 A comprehensive take on the origins and evolution of translation factors & how these essential players evolved across the tree of life. 🌍🧬 Led by Evrim Fer
@uwmadisonmdtp.bsky.social
grad student! 👏 Free access:
www.sciencedirect.com/science/arti...
@cp-trendsgenetics.bsky.social
3
94
50
reposted by
Antonio Camargo
RdRp Summit
7 months ago
The abstract submission & registration for
#RdRpSummit2025
is NOW OPEN!!! 🎉🙌 Do you work on RNA virus discovery using RdRps? Are you joining the
#ViBioM2025
in Portugal? Consider submitting your research in one of our sessions! More info:
RdRp.io
0
14
15
Publishing a paper in Nature while based in Brazil is an incredible achievement. Huge congratulations to the authors! I'm excited to give this a proper read :)
7 months ago
0
2
0
One of the great things about bioinformatics is how open it can be. Contributing to a project you admire can actually get you involved in it :)
8 months ago
0
7
1
reposted by
Antonio Camargo
A. Murat Eren (Meren)
8 months ago
I'm very happy to report that Matt Schechter's PhD work is now online as a pre-print: "Ribosomal protein phylogeography offers quantitative insights into the efficacy of genome-resolved surveys of microbial communities",
www.biorxiv.org/content/10.1...
Here's a little 🧵 about it.
1
35
20
reposted by
Antonio Camargo
Eduardo Rocha
8 months ago
This preprint makes a point that is valid for a lot of machine learning approaches. Organisms or genes are linked by evolutionary history; they are not independent. This results in correlation between learning and test sets, and often in over-optimistic evaluations of the methods' outcome.
3
59
27
reposted by
Antonio Camargo
Joint Genome Institute
8 months ago
In Science Advances: "We took a deep dive into over 1.8 million bacterial and archaeal genomes to see how much of their diversity we’ve actually captured. Turns out that despite all the genomes we’ve sequenced, we’ve only scratched the surface." -Dongying Wu 🖥️🧬🦠
biosciences.lbl.gov/2025/01/17/t...
Taking Stock of the Known and Unknown Microbial Space - Biosciences Area
Using publicly available genome sequence data generated over the past three decades, JGI researchers assess the known fraction of microbial diversity.
https://biosciences.lbl.gov/2025/01/17/taking-stock-of-the-known-and-unknown-microbial-space/
2
43
17
reposted by
Antonio Camargo
Simon Roux
9 months ago
🚨Postdoc position(s) alert🚨 If you like to develop new bioinformatic tools for *human* microbiome analysis and are interested in viruses/phages, we (JGI Viral Genomics and Microbiome Data groups) may have a great position for you! No official job posting yet, but email or DM if you are interested!
0
75
77
reposted by
Antonio Camargo
Knut Drescher
9 months ago
We found that many bacterial species use exogenous peptidoglycan fragments - released by lysis of neighboring cells - as a general danger signal, triggering a danger response that protects bacteria against many dangers: biofilm formation. Details here 👇
www.nature.com/articles/s41...
Bacteria use exogenous peptidoglycan as a danger signal to trigger biofilm formation - Nature Microbiology
Peptidoglycan released by neighbouring kin or non-kin cell lysis induces physiological changes that protect from a range of stresses, including phage predation.
https://www.nature.com/articles/s41564-024-01886-5
0
143
58
reposted by
Antonio Camargo
Ryan Wick
9 months ago
New year, new assemblies! I'm excited to announce Autocycler, my new tool for consensus assembly of long-read bacterial genomes! It's the successor to Trycycler, designed to be faster and less reliant on user intervention. Check it out:
github.com/rrwick/Autoc...
(1/5)
Home
A tool for generating consensus long-read assemblies for bacterial genomes - rrwick/Autocycler
https://github.com/rrwick/Autocycler/wiki
2
156
100
reposted by
Antonio Camargo
Mart Krupovic
9 months ago
RNA virologists, check out "The protein structurome of Orthornavirae and its dark matter" by Pascal Mutz, Valerian Dolja, Eugene Koonin et al (including
@simrouxvirus.bsky.social
,
@apcamargo.bsky.social
,
@urineri.bsky.social
,
@anamarijabutkovic.bsky.social
)
journals.asm.org/doi/10.1128/...
The protein structurome of Orthornavirae and its dark matter | mBio
Advanced methods for protein structure prediction, such as AlphaFold2, greatly expand our capability to identify protein domains and infer their likely functions and evolutionary relationships. This i...
https://journals.asm.org/doi/10.1128/mbio.03200-24
0
22
10
reposted by
Antonio Camargo
Gav Armstrong
10 months ago
Don't use red and green data lines/surfaces in the same panel please
#chemsky
. It can be difficult for some colorblind readers to differentiate them. I've accepted (in principle) 2 papers today, and both sets of authors were asked to remove red/green colour contrasts
www.nature.com/articles/d41...
Colour me better: fixing figures for colour blindness
Images can be made more accessible by choosing hues, shapes and textures carefully.
https://www.nature.com/articles/d41586-021-02696-z
4
113
50
reposted by
Antonio Camargo
Mart Krupovic
9 months ago
An interesting paper for the EV (extracellular vesicle) fans by Patel et al.
@ahnaskop.bsky.social
lab: "Extracellular vesicles, including large translating vesicles called midbody remnants, are released during the cell cycle"
www.molbiolcell.org/doi/10.1091/...
0
11
3
reposted by
Antonio Camargo
Yunha Hwang
9 months ago
Can LLM agents discover novel protein functions? Introducing Gaia Agent 🌎 🤖: an AI biologist capable of reasoning across genomic contexts to predict functions of proteins! Gaia Agent is now integrated with Gaia Search at
gaia.tatta.bio
2
38
14
reposted by
Antonio Camargo
Yunha Hwang
9 months ago
If you are at
#NeurIPS2024
don't miss
@ancornman1.bsky.social
's talk on OMG/gLM2 at 9AM!
@workshopmlsb.bsky.social
East meeting room 11,12
0
12
3
reposted by
Antonio Camargo
Yunha Hwang
10 months ago
Excited to be at
#NeurIPS
this week.
@ancornman1.bsky.social
will give a spotlight talk at the
@workshopmlsb.bsky.social
on gLM2/OMG! Please reach out if you want to chat about gLM2/OMG/Gaia and our latest projects😇
www.biorxiv.org/content/10.1...
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling
Biological language model performance depends heavily on pretraining data quality, diversity, and size. While metagenomic datasets feature enormous biological diversity, their utilization as pretraini...
https://www.biorxiv.org/content/10.1101/2024.08.14.607850v2
0
9
3
reposted by
Antonio Camargo
Rob Patro
10 months ago
Very cool work from Yang Lu et al. demonstrating miscalibration of BLASTP’s E-values and generating well-calibrated values via a knockoff-based approach (cc
@mikelove.bsky.social
) -
academic.oup.com/bioinformati...
! More analyses could benefit from knockoff-based approaches.
A BLAST from the past: revisiting blastp’s E-value
AbstractMotivation. The Basic Local Alignment Search Tool, BLAST, is an indispensable tool for genomic research. BLAST established itself as the canonical
https://academic.oup.com/bioinformatics/advance-article/doi/10.1093/bioinformatics/btae729/7916501
2
37
22
reposted by
Antonio Camargo
Pierre Peterlongo
11 months ago
🧬🔍There are 50 petabases of freely-available DNA sequencing data. We are introducing Logan Search, which allows you to search for any DNA sequence in minutes, bringing Earth’s largest genomic resource to your fingertips. 🏔️
logan-search.org
🏔️
#Genomics
#Bioinformatics
#OpenScience
2
108
60
reposted by
Antonio Camargo
Yunha Hwang
10 months ago
Hello 🦋
#protein
/
#microbio
/
#BioML
community! We are excited to release Gaia🌎, a context-aware protein search tool, extending protein search and discovery capabilities beyond sequence and structure, to include *genomic context*. Search your favorite protein sequences on
gaia.tatta.bio
10
237
83
reposted by
Antonio Camargo
Martin Steinegger 🇺🇦
10 months ago
New GPU-based MMseqs2: 20x faster searches on a single L40S (approx. as fast as an RTX 4090) vs. a 128-core CPU. This work makes it possible to set up a very cost-efficient ColabFold MSA GPU server. 🧵🧵 📄
www.biorxiv.org/content/10.1...
💾
mmseqs.com
🗞️
developer.nvidia.com/blog/boost-a...
2
174
61
reposted by
Antonio Camargo
Alex Crits-Christoph
12 months ago
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling. From friends at Tatta Bio. GitHub:
github.com/TattaBio/OMG
www.biorxiv.org/content/10.1...
The OMG dataset: An Open MetaGenomic corpus for mixed-modality genomic language modeling
Biological language model performance depends heavily on pretraining data quality, diversity, and size. While metagenomic datasets feature enormous biological diversity, their utilization as pretraini...
https://www.biorxiv.org/content/10.1101/2024.08.14.607850v2.abstract
0
5
2
reposted by
Antonio Camargo
Ákos T Kovács
about 1 year ago
Terrabacteria: redefining bacterial envelope diversity, biogenesis and evolution
#NatureRevMicro
from Simonetta Gribaldo
www.nature.com/articles/s41...
0
4
3
reposted by
Antonio Camargo
Alex Crits-Christoph
about 1 year ago
AntiDefenseFinder! And it is available also as an option with DefenseFinder:
defensefinder.mdmlab.fr
Exploring the diversity of anti-defense systems across prokaryotes, phages, and mobile genetic elements
www.biorxiv.org/content/10.1...
Exploring the diversity of anti-defense systems across prokaryotes, phages, and mobile genetic elements
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
https://www.biorxiv.org/content/10.1101/2024.08.21.608784v1
1
16
16
reposted by
Antonio Camargo
Roland Hatzenpichler
about 1 year ago
Methanogenesis outside the Euryarchaeota experimentally demonstrated by three cultivation-driven studies (two from my lab)! A long🧵.🐻with me
tinyurl.com/4v4fkda6
tinyurl.com/yr4p7js6
tinyurl.com/mtsrj6b9
24
20
11
reposted by
Antonio Camargo
Karthik Anantharaman
about 1 year ago
If you are interested in prophages, we have a new database: Prophage-DB. Check it out and all feedback is welcome
biorxiv.org/cgi/content/...
Prophage-DB: A comprehensive database to explore diversity, distribution, and ecology of prophages
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
https://biorxiv.org/cgi/content/short/2024.07.11.603044v1
0
13
18
reposted by
Antonio Camargo
Ákos T Kovács
about 1 year ago
Phylogenetic reconciliation: making the most of genomes to understand microbial ecology and evolution
#ISMEJournal
from Gergely Szöllősi and colleagues (Phil Hugenholtz, Anja Spang, Cecile Gubry-Rangin, Paul O Sheridan, Laura Eme, Rochelle M Soo and more)
academic.oup.com/ismej/advanc...
0
11
6
reposted by
Antonio Camargo
Alex Crits-Christoph
over 1 year ago
A global atlas of soil viruses reveals unexplored biodiversity and potential biogeochemical impacts - Nature Microbiology
www.nature.com/articles/s41...
@apcamargo.bsky.social
Are these new genomes already in the current version of IMG/VR or would I want to download separately and combine for now?
A global atlas of soil viruses reveals unexplored biodiversity and potential biogeochemical impacts - Nature Microbiology
This study presents an extensive global compendium of metagenomically derived sequences that will serve as a foundation for understanding the role of viruses in soil ecosystems.
https://www.nature.com/articles/s41564-024-01686-x
1
6
5
reposted by
Antonio Camargo
Alex Crits-Christoph
over 1 year ago
github.com/rcedgar/usea...
USEARCH is... open source now (!)
GitHub - rcedgar/usearch12: Open-source usearch
Open-source usearch. Contribute to rcedgar/usearch12 development by creating an account on GitHub.
https://github.com/rcedgar/usearch12/tree/master
1
5
6
reposted by
Antonio Camargo
Alex Crits-Christoph
over 1 year ago
It's the most wonderful time of the year: the time when we all learn about the latest nanopore updates via screenshots of tweets of photos of slides. Looks not disappointing, no sign of the accuracy wall! And dorado 0.7 is now out with the new models:
github.com/nanoporetech...
1
11
3
reposted by
Antonio Camargo
Roland Dunbrack 🏳️🌈
over 1 year ago
I reviewed the AlphaFold3 paper from DeepMind for the journal Nature. I tried really hard to get the editors to demand that DeepMind release the code (even an executable) so people could do the many high-throughput studies we saw for AF2 (see image from my review). I failed. So just a server 4 now.
3
59
30
reposted by
Antonio Camargo
Wolfgang Huber
over 1 year ago
If you write a bioinformatics software package intended for others to use: Don't spit out diagnostic messages on the console. Some of that info is better kept in the documentation, the rest in the results object, for programmatic access. If your tool is good, it'll soon be one piece in a tool chain.
2
7
5