Ben Trent
@benwtrent.bsky.social
📤 135
📥 65
📝 38
Doer of things | Builder of things | software engineer @elastic
Apache Lucene 10.3.0 is released! 40% faster lexical search is absolutely crazy for a project that has been doing lexical search for a quarter of a century
lucene.apache.org/core/corenew...
loading . . .
Lucene™ Core News
Apache Lucene is a high-performance, full-featured search engine library written entirely in Java. It is a technology suitable for...
https://lucene.apache.org/core/corenews.html#apache-lucenetm-1030-available
10 days ago
0
4
0
Storing floating point values as a big 'ole JSON blob is silly, so we stopped doing that. Great stuff from Jim on making vector search in Elasticsearch substantially cheaper!
www.elastic.co/search-labs/...
loading . . .
Elasticsearch vector search: Excluding vectors from source - Elasticsearch Labs
Elasticsearch now excludes vectors from source by default, saving space and improving performance while keeping vectors accessible when needed.
https://www.elastic.co/search-labs/blog/elasticsearch-exclude-vectors-from-source
about 1 month ago
0
3
0
reposted by
Ben Trent
Doug Turnbull
4 months ago
Next in the series of building a search engine from scratch - we focus on hybrid retrieval with @benwtrent.bsky.socialof Elastic. How do you add filtering to a vector search index? I'll code. He'll yell at me.
maven.com/p/430592/hyb...
loading . . .
Hybrid search live coded from scratch
RAG systems all use vector databases. HNSW (Hierarchical Navigable Small Worlds) is the most common algorithm. If you want to build RAG, you should appreciate how this algorithm works (Missed previous...
https://maven.com/p/430592/hybrid-search-live-coded-from-scratch?utm_medium=ll_share_link&utm_source=instructor
0
3
1
reposted by
Ben Trent
Adrien Grand
6 months ago
It's time to redo benchmarks!
#Lucene
10.2 was just released, with - huge speedups to non-scoring boolean queries, range queries and filtered vector search, - better merging defaults for faster search, - much faster merging of vectors And more...
lucene.apache.org/core/corenew...
loading . . .
Lucene™ Core News
Apache Lucene is a high-performance, full-featured search engine library written entirely in Java. It is a technology suitable for...
https://lucene.apache.org/core/corenews.html#apache-lucenetm-1020-available
1
6
1
Indexing and merging times are getting better for
#Apache
#Lucene
vector search. Lucene has a read-only segment architecture. One of the drawbacks of this approach is throwing away previously completed work when merging HNSW graphs. Well, this got better :)
6 months ago
1
2
1
Filtered vector search is crazy important. So we made HNSW filtered search in Apache Lucene better. At similar recall, it can be 3-5x faster!
7 months ago
1
5
1
reposted by
Ben Trent
Philipp Krenn
8 months ago
"elasticsearch: 15 years of indexing it all, finding what matters":
www.elastic.co/search-labs/...
we turned it into a proper blog post with shay :)
loading . . .
Elasticsearch history: 15 years of indexing and searching - Elasticsearch Labs
Elasticsearch just turned 15-years-old! Take a look back at the last 15 years of indexing and searching, and turn to the next 15 years of relevance.
https://www.elastic.co/search-labs/blog/elasticsearch-history-15-years
0
3
1
I really enjoyed this talk by
@elasticmark.bsky.social
. He is back at finding crazy & interesting ways to explore data (I guess he never stopped). Clustering with binary vectors & vector search with Elasticsearch
www.youtube.com/watch?v=sJU_...
loading . . .
Binary Vectors & Fuzzy Facets: Clustering Results in a Browser Using Binary Vectors
YouTube video by Official Elastic Community
https://www.youtube.com/watch?v=sJU_8mtzH7Y
8 months ago
0
2
1
It's wonderful to see practical & important programming work. Debugging concurrent programs is incredibly difficult, here is a bug found in Apache Lucene by the CMU Pasta Lab using their new Fray testing framework
www.elastic.co/search-labs/...
loading . . .
Concurrency bugs in Lucene: How to fix optimistic concurrency failures - Elasticsearch Labs
Thanks to Fray, a deterministic concurrency testing framework from CMU’s PASTA Lab, we tracked down a tricky Lucene bug and squashed it
https://www.elastic.co/search-labs/blog/optimistic-concurrency-lucene-debugging
8 months ago
1
2
1
The number of improvements in Lucene here are crazy. Pretty much every count and boolean query gets a nice boost and some of the count improvements are hilarious 🚀🚀🚀.
add a skeleton here at some point
9 months ago
1
5
0
It's so cool to see
#Apache
#Lucene
going strong after about a quarter of a century 🤯. 2025 is gonna be a fun year for Lucene.
www.elastic.co/search-labs/...
loading . . .
Lucene Wrapped 2024 - Elasticsearch Labs
2024 has been another major year for Apache Lucene. In this blog, we’ll explore the key highlights.
https://www.elastic.co/search-labs/blog/apache-lucene-wrapped-2024
9 months ago
0
3
0
Early termination for vector search can be more than just "gathering K candidates" my colleague Tommaso gives a small overview of basic early termination strategies for vector index search.
www.elastic.co/search-labs/...
loading . . .
Early termination in HNSW for faster approximate KNN search - Elasticsearch Labs
Learn how HNSW can be made faster for KNN search, using smart early termination strategies.
https://www.elastic.co/search-labs/blog/hnsw-knn-search-early-termination
9 months ago
0
3
0
My team wrote a new backing algorithm for our BBQ indices, called Optimized Scalar Quantization. Here is a high level overview of its implementation in Elasticsearch (and soon Apache Lucene).
www.elastic.co/search-labs/...
for the math nerds, skip to Tom's blog:
www.elastic.co/search-labs/...
loading . . .
Optimized Scalar Quantization: Even Better Binary Quantization - Elasticsearch Labs
Here we explain optimized scalar quantization in Elasticsearch and how we used it to improve Better Binary Quantization (BBQ).
https://www.elastic.co/search-labs/blog/optimized-scalar-quantization-elasticsearch
9 months ago
0
2
1
reposted by
Ben Trent
Adrien Grand
9 months ago
Lucene has been evaluating disjunctive queries by loading (windows of) postings into a bit set and or-ing these bit sets for 20+ years. It started using the same approach for conjunctive queries a few days ago.
benchmarks.mikemccandless.com/CountAndHigh...
(annotation HS)
loading . . .
Lucene CountAndHighHigh queries/sec
https://benchmarks.mikemccandless.com/CountAndHighHigh.html
1
2
1
Something a little different from my typical blogs. This line of code in Apache Lucene took me 3 days to write. For fixing bugs, it's about the journey, not necessarily the destination.
www.elastic.co/search-labs/...
(the cover art was provided by one of my kids :))
loading . . .
Lucene bug adventures: Fixing a corrupted index exception - Elasticsearch Labs
Sometimes, a single line of code takes days to write. Here, we get a glimpse of an engineer's pain and debugging over multiple days to fix a potential Apache Lucene index corruption.
https://www.elastic.co/search-labs/blog/lucene-corrupted-index-exception
9 months ago
1
4
1
Our Better Binary Quantization (BBQ) index in Elasticsearch has a new backing algorithm. Better(er) recall & query speed for vector search. Its a natural evolution of our scalar quantization. Shipping soon. It's pretty neat
www.elastic.co/search-labs/...
loading . . .
Understanding optimized scalar quantization - Elasticsearch Labs
In this post we explain a new form of scalar quantization we've developed at Elastic that achieves state-of-the-art accuracy for binary quantization
https://www.elastic.co/search-labs/blog/scalar-quantization-optimization
9 months ago
0
3
1
Elasticsearch just got more powerful. Now, semantic, hybrid, and vector retrieval with custom rules for pinning and bubbling results to the top! Now you have multi-phased, hybrid retrieval in combination with business rules :D
www.elastic.co/search-labs/...
loading . . .
Ensuring business rules work seamlessly with semantic search - Elasticsearch Labs
Harness the power of query rules combined with semantic search and rerankers.
https://www.elastic.co/search-labs/blog/semantic-search-query-rules
9 months ago
0
1
0
It was so much fun talking
#Elasticsearch
with Steve Mayzak on “You Know, For Search”. I could nerd out for hours, but we kept it down to just 1 hour (maybe even that is too long....). Give it a listen, if nothing else, for Steve's dulcet tones:
open.spotify.com/episode/7HLH...
loading . . .
Quantization: The Important Bits
You know, for search, an Elastic podcast · Episode
https://open.spotify.com/episode/7HLH43iYe5hgCKvTsr0CjX?si=YXoDvX7PQAeXsJtlp495Iw
10 months ago
0
2
3
Be prepared to learn more about semantic rerankers than you ever thought you needed to know. Another awesome analysis from my colleagues at Elasticsearch
www.elastic.co/search-labs/...
loading . . .
Exploring depth in a 'retrieve-and-rerank' pipeline - Elasticsearch Labs
Select an optimal re-ranking depth for your model and dataset.
https://www.elastic.co/search-labs/blog/elastic-semantic-reranker-part-3
10 months ago
0
7
0
More magic from chef Chris Hegarty. How better binary quantization vector ops are accelerated with Java SIMD in Elasticsearch vector search
www.elastic.co/search-labs/...
loading . . .
Smokin' fast BBQ with hardware accelerated SIMD instructions - Search Labs
How we optimized vector comparisons in BBQ with hardware accelerated SIMD (Single Instruction Multiple Data) instructions.
https://www.elastic.co/search-labs/blog/bbq-vector-comparison-simd-instructions
10 months ago
0
4
0
I cannot adequately express how proud I am of the
#Elasticsearch
team for delivering this. It is a humungous engineering achievement and the results of (metaphorical) blood, sweat, and (maybe real ;) ) tears.
go.es.io/3CVo82X
loading . . .
Do less with serverless: Elastic Cloud Serverless — Now GA
Elastic Cloud Serverless is the easiest way to start and scale your capabilities in search, observability and security. Built on a reimagined Elasticsearch architecture, it ensures low-latency queryin...
https://go.es.io/3CVo82X
10 months ago
0
2
0
We have seen this idea played out nicely with Tantivy and Apache Lucene. Benchmarking between each other and lovingly borrowing ideas between the projects.
add a skeleton here at some point
10 months ago
0
3
2
I work with some pretty smart folks. We are shipping a new cross-encoder reranking models. Some pretty great numbers at a moderately sized 183M params.
www.elastic.co/search-labs/...
loading . . .
Introducing Elastic Rerank: Elastic's new semantic re-ranker model - Search Labs
Learn about how Elastic's new re-ranker model was trained and how it performs.
https://www.elastic.co/search-labs/blog/elastic-semantic-reranker-part-2
10 months ago
0
3
0
Honestly, Grant Sanderson (3blue1brown) is one of (if not the best) mathematics communicator of my generation. If you want an overview of how LLMs work at a deep & intuitive level, I whole heartedly recommend this lecture:
youtu.be/KJtZARuO3JY?...
loading . . .
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
YouTube video by Grant Sanderson
https://youtu.be/KJtZARuO3JY?si=AXRwMJ1DFXifZbD2
10 months ago
0
5
0
Here is a talk I gave on Better Binary Quantization (BBQ) for vector search in Elasticsearch. For those that don't like reading blogs, or just want to know how bad the puns can really get, this talk is for you:
www.youtube.com/watch?v=04Nz...
loading . . .
Better Binary Quantization (BBQ): From Bytes to BBQ, The Secret to Better Vector Search by Ben Trent
YouTube video by Official Elastic Community
https://www.youtube.com/watch?v=04NzMt2Nigc
10 months ago
0
5
1
Better Binary Quantization (BBQ) is now in 8.16. When it comes to index time, query time, and recall, we found it way better than the old faithful Product Quantization (PQ) in Elasticsearch vector search. Here is how we compared them when developing BBQ:
www.elastic.co/search-labs/...
loading . . .
Better Binary Quantization vs. Product Quantization - Search Labs
Why we chose to spend time working on better binary quantization instead of production quantization in Lucene and Elasticsearch.
https://www.elastic.co/search-labs/blog/bit-vectors-elasticsearch-bbq-vs-pq
11 months ago
0
9
2
Did you know Elasticsearch added the "retrievers" framework back in 8.12? As a further extension of _search, it gives extra flexibility that our traditional "query" API does not. Like RRF, global reranking, and multi-phased retrieval. It's now GA:
elastic.co/search-labs/...
[đź§µ begin]
loading . . .
Elasticsearch retrievers are generally available with Elasticsearch 8.16.0! - Search Labs
Elasticsearch retrievers have gone through a significant revamp and are now generally available for all to use. Learn all about their architecture and use-cases.
https://elastic.co/search-labs/blog/elasticsearch-retrievers-ga-8.16.0
11 months ago
1
6
0
We have shipped Better Binary Quantization for vectorsearch in Elasticsearch. Product Quantization is sooo slow. BBQ is 30x faster to quantize, >2x faster queries, and you still get the 95+% reduction in memory requirements. Here is an overview of how it works:
www.elastic.co/search-labs/...
loading . . .
Better Binary Quantization (BBQ) in Lucene and Elasticsearch - Search Labs
How Better Binary Quantization works in Lucene and Elasticsearch.
https://www.elastic.co/search-labs/blog/better-binary-quantization-lucene-elasticsearch
11 months ago
0
3
0
Fast HNSW and 1/4 the memory cost? Yes please! New int8 quantization in Apache Lucene!
loading . . .
Introducing Scalar Quantization in Lucene — Elastic Search Labs
How did we introduce scalar quantization into Lucene
https://www.elastic.co/search-labs/scalar-quantization-in-lucene
almost 2 years ago
0
3
0
you reached the end!!
feeds!
log in