Zhigui Bao 鲍志贵
@zbao.bsky.social
📤 172
📥 172
📝 49
🌱 PhD student at Weigelworld
@plantevolution.bsky.social
| Graph pangenome | Population genetics
reposted by
Zhigui Bao 鲍志贵
Yun S. Song
1 day ago
We are excited to share GPN-Star, a cost-effective, biologically grounded genomic language modeling framework that achieves state-of-the-art performance across a wide range of variant effect prediction tasks relevant to human genetics.
www.biorxiv.org/content/10.1...
(1/n)
3
157
91
reposted by
Zhigui Bao 鲍志贵
bioRxiv Plant Bio
4 days ago
A deep-time landscape of plant cis-regulatory sequence evolution
https://www.biorxiv.org/content/10.1101/2025.09.17.676453v1
0
4
3
reposted by
Zhigui Bao 鲍志贵
Pierre Baduel
5 days ago
Happy to share the results of a long-haul post-doc project, now online
@science.org
, aiming at understanding the rules of transgeneration epigenetic inheritance over TEs in plants and its extent and impact in nature. More below!
doi.org/10.1126/scie...
Transposable elements are vectors of recurrent transgenerational epigenetic inheritance
DNA methylation loss at transposable elements (TEs) can affect neighboring genes and be epigenetically inherited in plants, yet the determinants and significance of this additional system of inheritan...
https://doi.org/10.1126/science.ady3475
7
55
30
reposted by
Zhigui Bao 鲍志贵
PLOS Biology
8 days ago
The
#Drosophila
Dscam1 gene generates 10000s of isoforms, but only a small fraction supports neuronal functions. This study shows that
#fitness
&
#immunity
are the likely primary evolutionary drivers of Dscam1 isoform diversity in
#arthropods
@plosbiology.org
🧪
plos.io/4gp8cWd
0
18
10
reposted by
Zhigui Bao 鲍志贵
Camille Roux
11 days ago
Hybridization and introgression are major evolutionary processes. Since the 1940s, the prevailing view has been that they shape plants far more than animals. In our new study (
www.science.org/doi/10.1126/...
), we find the opposite: animals exchange genes more, and for longer, than plants
3
183
116
reposted by
Zhigui Bao 鲍志贵
Zamin Iqbal
about 1 month ago
"We show that unrelated proteins have a universal tendency towards convergent evolution of secondary and tertiary motifs, causing an excess of high-scoring FP alignment... previous methods routinely overestimate significance by up to six orders of magnitude."
www.biorxiv.org/content/10.1...
Protein structure alignment significance is often exaggerated
Machine learning has generated millions of high-quality predicted protein structures, creating a need for computationally efficient structure search algorithms and robust estimates of statistical sign...
https://www.biorxiv.org/content/10.1101/2025.07.17.665375v1
0
52
25
reposted by
Zhigui Bao 鲍志贵
bioRxiv Bioinfo
about 1 month ago
Evolutionary and methodological considerations when interpreting gene presence-absence variation in pangenomes
https://www.biorxiv.org/content/10.1101/2025.08.14.670405v1
0
0
3
reposted by
Zhigui Bao 鲍志贵
PlantEvolution 🌱🌾
about 1 month ago
2/2 With @derekseveri.bsky.social, Joy Bergelson, Fabrice Roux and Talia Karasov. Illustration: Genetically diverse Arabidopsis grown in the lab | Closely related wild Arabidopsis during the natural growing season | Diversity of Arabidopsis habitats.
2
16
6
reposted by
Zhigui Bao 鲍志贵
K.D. Murray
about 1 month ago
Happy to be able to finally share our NLR pangenome paper, out now in CHM. "Pangenomic context reveals the extent of intraspecific plant NLR evolution"
www.cell.com/cell-host-mi...
#plantscience
#plantimmunity
#pangenomes
#science
#nlr
Pangenomic context reveals the extent of intraspecific plant NLR evolution
Individual- and population-level diversity is required for pathogen defense by nucleotide-binding site leucine-rich repeat (NLR) proteins. Teasdale et al. leverage annotated, divergent A. thaliana gen...
https://www.cell.com/cell-host-microbe/fulltext/S1931-3128(25)00283-5
2
41
23
So cool!
about 1 month ago
0
0
0
reposted by
Zhigui Bao 鲍志贵
Ryan Gutenkunst
about 2 months ago
If you're new to demographic history inference from population genomics, try this webapp I created to illustrate how dadi fits bottleneck models to site frequency spectra:
ryangutenkunst-dadi-two-epoch.hf.space
. It even outputs files for submitting to the GHIST competition!
ghi.st
1
38
23
reposted by
Zhigui Bao 鲍志贵
bioRxiv Plant Bio
2 months ago
Somatic mobility of transposons is explosive and shaped by distinct integration biases in Arabidopsis thaliana
https://www.biorxiv.org/content/10.1101/2025.07.14.664700v1
0
1
1
reposted by
Zhigui Bao 鲍志贵
Laurie Belcher
2 months ago
OrthoFinder just dropped a major update. It’s faster, more accurate, and ready for thousands of genomes. Let’s break it down (1/10)
github.com/OrthoFinder/...
www.biorxiv.org/content/10.1...
1
127
73
reposted by
Zhigui Bao 鲍志贵
Erik Garrison
2 months ago
Postdoc position opening in my group! Research projects: pangenomes for diverse organisms, genome evolution, biocomputing, language models. Please reach out if interested!
1
28
31
reposted by
Zhigui Bao 鲍志贵
Adam Phillippy
2 months ago
Accurate diagram 😂
1
17
4
reposted by
Zhigui Bao 鲍志贵
Heng Li
3 months ago
Preprint on "Finding easy regions for short-read variant calling from pangenome data":
arxiv.org/abs/2507.03718
0
31
14
reposted by
Zhigui Bao 鲍志贵
Ulrich Lutz
3 months ago
New paper out! 🎉 With an innovative approach of CRISPRing a gene mutation on many backgrounds, we provide a proof of concept for how “pan-genetic” analysis can reveal the true extent of genetic networks in a species. Thanks @plantevolution.bsky.social-lab! Open-access:
tinyurl.com/3aw7t6ff
0
11
3
reposted by
Zhigui Bao 鲍志贵
David A Knowles
3 months ago
New work from the lab trying to wrap our heads around the massive complexity of the human transcriptome revealed by long-read RNA-seq! Fun collab with Gloria Sheynkman.
www.biorxiv.org/content/10.1...
Perplexity as a Metric for Isoform Diversity in the Human Transcriptome
Long-read sequencing (LRS) has revealed a far greater diversity of RNA isoforms than earlier technologies, increasing the critical need to determine which, and how many, isoforms per gene are biologic...
https://www.biorxiv.org/content/10.1101/2025.07.02.662769v1
2
55
22
reposted by
Zhigui Bao 鲍志贵
North American Arabidopsis Steering Committee
3 months ago
Detlef Weigel- Max Planck Inst- Tubingen- honored for his lifetime of excellence in research, mentorship, and support of the Arabidopsis community
#ICAR2025
0
50
15
reposted by
Zhigui Bao 鲍志贵
Heng Li
3 months ago
Preprint on "Improving spliced alignment by modeling splice sites with deep learning". It describes minisplice for modeling splice signals. Minimap2 and miniprot now optionally use the predicted scores to improve spliced alignment.
arxiv.org/abs/2506.12986
0
109
55
reposted by
Zhigui Bao 鲍志贵
Andrew Hipp
3 months ago
Long-term flowering-time data on Japanese mountain cherry (recorded since the 9th century!) shows a shift in full-flowering date beginning in the late 19th century. Fascinating new
@newphyt.bsky.social
paper by
@jgpausas.bsky.social
onlinelibrary.wiley.com/doi/abs/10.1...
2
102
50
reposted by
Zhigui Bao 鲍志贵
Science Magazine
3 months ago
As the world warms, plants in natural ecosystems and agricultural settings find ways to respond to the heat. In a new special issue of Science, researchers examine how heat affects plants at multiple scales, from the molecular level to the biosphere.
scim.ag/44cSw3Z
2
118
64
reposted by
Zhigui Bao 鲍志贵
Vaughn Cooper
4 months ago
Sharing the most significant work from my group, led by the
@evolvingstem.bsky.social
team. Come for the discoveries of how Pseudomonas adapts in biofilms, stay for the story of how they were discovered by thousands of young scientists in grades 9-12. 🧪🧫🧬🧵
www.biorxiv.org/content/10.1...
Student-led experimental evolution reveals novel biofilm regulatory networks underlying adaptations to multiple niches
We established a research-education partnership known as EvolvingSTEM that provides secondary school students the opportunity to conduct authentic research experiments centered on microbial evolution....
https://www.biorxiv.org/content/10.1101/2025.06.06.658356v1
4
163
79
reposted by
Zhigui Bao 鲍志贵
Lisa Smith
4 months ago
With the publication of a new preprint (
www.biorxiv.org/content/10.1...
) with collaborators
@labschneeberger.bsky.social
@plantevolution.bsky.social
and
@jurriaanton.bsky.social
, I feel it is timely to talk about what will probably be the most successful 'failed' experiment of my career.
The mutational dynamics of the Arabidopsis centromeres
Centromeres are specialized chromosome regions essential for sister chromatid cohesion and spindle attachment during mitosis. Many centromeres comprise highly variable, megabase-scale satellite DNA ar...
https://www.biorxiv.org/content/10.1101/2025.06.02.657473v1
3
19
8
reposted by
Zhigui Bao 鲍志贵
bioRxiv Evolutionary Biology
4 months ago
Rapid adaptation and extinction across climates in synchronized outdoor evolution experiments of Arabidopsis thaliana
https://www.biorxiv.org/content/10.1101/2025.05.28.654549v1
0
4
2
reposted by
Zhigui Bao 鲍志贵
Zeqian Li
4 months ago
I started tgv to learn Rust and building it has been unbelievably fun! If you wanna pick up a super fast and fun programming language, tgv is open for contribution! There aren't many bioinformatics tools built entirely by the community. If tgv can become one, I'll be so psyched
1
67
20
reposted by
Zhigui Bao 鲍志贵
Charlie Pugh
4 months ago
New preprint in collaboration with
@paulinanunezv.bsky.social
supervised by
@jonnyfrazer.bsky.social
and Mafalda Dias – we propose a simple approach to improving zero-shot variant effect prediction in pre-existing protein and genome language models: 🧶 1/n
www.biorxiv.org/content/10.1...
From Likelihood to Fitness: Improving Variant Effect Prediction in Protein and Genome Language Models
Generative models trained on natural sequences are increasingly used to predict the effects of genetic variation, enabling progress in therapeutic design, disease risk prediction, and synthetic biolog...
https://www.biorxiv.org/content/10.1101/2025.05.20.655154v1
1
74
27
reposted by
Zhigui Bao 鲍志贵
Roeder Lab
4 months ago
Check out the beautiful cover to our focus issue on Translational research from Arabidopsis to crop plants and beyond. More articles coming shortly. Congrats Patrice Salome and
@jotlovell.bsky.social
.
@theplantcell.bsky.social
academic.oup.com/plcell/issue...
0
8
2
reposted by
Zhigui Bao 鲍志贵
Molecular Biology and Evolution
5 months ago
Ancestral sequence reconstruction often assumes unrealistic homogeneous substitution models; however
@rmuniztrejo.bsky.social
et al. find that reconstruction accuracy is determined by phylogenetic signal, not model choice. 🔗
doi.org/10.1093/molb...
#evobio
#molbio
Robustness of Ancestral Sequence Reconstruction to Among-site and Among-lineage Evolutionary Heterogeneity
Abstract. Ancestral sequence reconstruction is typically performed using homogeneous evolutionary models, which assume that the same substitution propensit
https://doi.org/10.1093/molbev/msaf084
0
38
19
reposted by
Zhigui Bao 鲍志贵
Dmitri Petrov
5 months ago
A ridiculous amount of very careful work by
@jahemker.bsky.social
and coauthors gave a clear answer - one needs ultra-long and not just long reads to call SVs correctly in Drosophila. Now we are ready to quantify evolutionary impact of Drosophila SVs. Let us know what you think!
0
20
7
reposted by
Zhigui Bao 鲍志贵
Francis M. Martin
5 months ago
Conventional and organic farms with more intensive management have lower soil functionality | Science
www.science.org/doi/10.1126/...
Conventional and organic farms with more intensive management have lower soil functionality
Organic farming is often considered to be more sustainable than conventional farming. However, both farming systems comprise highly variable management practices. In this study, we show that in organi...
https://www.science.org/doi/10.1126/science.adr0211?utm_source=sfmc&utm_medium=email&utm_content=alert&utm_campaign=SCIeToc&et_rid=495898616&et_cid=5597924
0
9
7
reposted by
Zhigui Bao 鲍志贵
Dan Kliebenstein
5 months ago
Have you ever wondered how a specialized metabolite enzyme/gene under fluctuating selection goes through speciation? Well the answer is it gets lost a lot. And by a lot we mean a lot. >25 gene copies were empirically tested using a phylo-functional approach.
www.biorxiv.org/content/10.1...
Convergence and constraint in glucosinolate evolution across the Brassicaceae
Diversity in plant specialized metabolites plays critical roles in plant-environment interactions. In longer evolutionary scales, e.g. between families or orders, this diversity arises from whole-geno...
https://www.biorxiv.org/content/10.1101/2025.04.23.650103v1
1
19
12
reposted by
Zhigui Bao 鲍志贵
Heng Li
5 months ago
Preprint on hifiasm Nanopore-only assembly. Led by Haoyu Cheng:
www.biorxiv.org/content/10.1...
Efficient near telomere-to-telomere assembly of Nanopore Simplex reads
Telomere-to-telomere (T2T) assembly is the ultimate goal for de novo genome assembly. Existing algorithms capable of near T2T assembly all require Oxford Nanopore Technologies (ONT) ultra-long reads w...
https://www.biorxiv.org/content/10.1101/2025.04.14.648685v1
5
139
83
reposted by
Zhigui Bao 鲍志贵
Karel Břinda
6 months ago
A decade ago, we had thousands of bacterial genomes. Now, we have millions. How to scale computational methods? Our paper in
@naturemethods.bsky.social
answers this: use evolutionary history to guide compression and search.
rdcu.be/eg4OA
w/
@baym.lol
,
@zaminiqbal.bsky.social
et al. 🧵1/
3
160
38
reposted by
Zhigui Bao 鲍志贵
Shujun Ou
6 months ago
Hello bluesky world! Newbee here! I have a postdoc position immediately available in my lab. It will focus on identifying high-quality transposons in many genomes and finding their impacts in evolution and traits. Most works, including EDTA2 development and annotation of 400+ genomes, are done! 1/n
3
26
24
reposted by
Zhigui Bao 鲍志贵
Molly Schumer
6 months ago
With
@hybridzones.bsky.social
&
@jenncoughlan.bsky.social
, we have been working on an update to Daven Presgraves' influential 2010 review on hybrid incompatibilities (
shorturl.at/cJndf
). The preprint is available here (
shorturl.at/DTC48
) with an updated table of almost 100 incompatibilities!
The molecular evolutionary basis of species formation - Nature Reviews Genetics
Recently, several new speciation genes have been identified that have contributed to our understanding of the molecular details of the evolution of hybrid dysfunction. This Progress article describes ...
https://shorturl.at/cJndf
1
143
72
When you go to wrong hotel, but right plant
6 months ago
1
13
2
reposted by
Zhigui Bao 鲍志贵
Heng Li
6 months ago
longcallD is a new variant caller for genomic long reads. It jointly calls phased small and structural variants. Single binary, one command line for the whole process. Comparable accuracy to mainstream callers. Great work by Yan Gao.
github.com/yangao07/lon...
GitHub - yangao07/longcallD: A local-haplotagging-based small and structural variant caller
A local-haplotagging-based small and structural variant caller - yangao07/longcallD
https://github.com/yangao07/longcallD
3
105
52
reposted by
Zhigui Bao 鲍志贵
Hajk-Georg Drost
6 months ago
Deadline was just extended to 30th March! :)
0
3
8
reposted by
Zhigui Bao 鲍志贵
bioRxiv Bioinfo
6 months ago
Investigating the topological motifs of inversions in pangenome graphs
https://www.biorxiv.org/content/10.1101/2025.03.14.643331v1
0
1
3
reposted by
Zhigui Bao 鲍志贵
Maya Voichek
6 months ago
1/ Transposable elements are often called "jumping genes" because they mobilize within genomes. 🧬 But did you know they can also jump 𝘣𝘦𝘵𝘸𝘦𝘦𝘯 cells? 🤯 Our new study reveals how retrotransposons invade the germline directly from somatic cells.
www.biorxiv.org/content/10.1...
A short thread 🧵👇
11
544
292
reposted by
Zhigui Bao 鲍志贵
Joe Peters Lab
7 months ago
Telomeric transposons are pervasive in linear bacterial genomes | Science
www.science.org/doi/10.1126/...
Telomeric transposons are pervasive in linear bacterial genomes
Eukaryotes have linear DNA and their telomeres are hotspots for transposons, which in some cases took over telomere maintenance. Here we identify several families of independently evolved telomeric tr...
https://www.science.org/doi/10.1126/science.adp1973
6
143
80
reposted by
Zhigui Bao 鲍志贵
Will Ratcliff
7 months ago
1/46 Hey folks, we have a new paper out on the MuLTEE. Strap in and I’ll tell you the story of how this “little paper on polyploidy” turned into the most data rich paper our lab has produced, largely thanks to the leadership and work ethic of
@kaitong25.bsky.social
.
www.nature.com/articles/s41...
Genome duplication in a long-term multicellularity evolution experiment - Nature
In the Multicellularity Long Term Evolution Experiment, diploid yeast evolve to be tetraploid under selection for larger multicellular size, revealing how whole-genome duplication can arise due to its...
https://www.nature.com/articles/s41586-025-08689-6
16
358
185
reposted by
Zhigui Bao 鲍志贵
bioRxiv Bioinfo
7 months ago
GrAnnoT, a tool for effecient and reliable annotation transfer through pangenome graph
https://www.biorxiv.org/content/10.1101/2025.02.26.640337v1
0
8
6
reposted by
Zhigui Bao 鲍志贵
Sophia Zebell
7 months ago
Thrilled to share our latest preprint, an interdisciplinary look at epistasis between variants in the coding and noncoding genome!
#plantscienceresearch
www.biorxiv.org/content/10.1...
Cryptic variation fuels plant phenotypic change through hierarchical epistasis
Cryptic genetic variants exert minimal or no phenotypic effects alone but have long been hypothesized to form a vast, hidden reservoir of genetic diversity that drives trait evolvability through epist...
https://www.biorxiv.org/content/10.1101/2025.02.23.639722v1
1
15
7
reposted by
Zhigui Bao 鲍志贵
Andrea Guarracino
7 months ago
Want to level up in
#pangenomics
? Join our workshop, conference & biohackathon in
#Memphis
, May 11-15, 2025. Connect with leading scientists, embrace genomic diversity, and contribute to cutting-edge software. Register now!
pangenome.github.io/MemPanG25/
#Bioinformatics
#MemPanG25
2
18
8
reposted by
Zhigui Bao 鲍志贵
bioRxiv Plant Bio
7 months ago
Disruption of the mRNA m6A writer complex triggers autoimmunity in Arabidopsis
https://www.biorxiv.org/content/10.1101/2025.02.18.638636v1
0
0
3
reposted by
Zhigui Bao 鲍志贵
Zoe Joly-Lopez
7 months ago
In
@elife.bsky.social
: Systems genomics of salinity stress response in rice
doi.org/10.7554/eLif...
!
Systems genomics of salinity stress response in rice
Insights into the molecular and genetic landscape underlying adaptive salinity stress responses in rice.
https://doi.org/10.7554/eLife.99352
0
10
9
reposted by
Zhigui Bao 鲍志贵
Alexandros Bousios
7 months ago
Another piece to add in the emerging golden era of centromere research. Our review of centrophilic transposons in plants
@annualreviews.bsky.social
with
@hendersi.bsky.social
and Tetsuji Kakutani.
www.annualreviews.org/content/jour...
1
13
5