How AI Is Built
@howaiisbuilt.fm
๐ค 45
๐ฅ 1
๐ 3
The podcast for people building AI.
https://open.spotify.com/show/3hhSTyHSgKPVC4sw3H0NUc
reposted by
How AI Is Built
Daniel Davis
8 months ago
Thanks to
@nicolay.fyi
for giving me the opportunity to talk about
@trustgraph.bsky.social
on How AI Is Built! Labels such as "facts", "observations", and "assertions" take on new meanings when we begin to consider time. Click ๐ to watch the full episode! ๐๏ธ
youtu.be/VpFVAE3L1nk
?
loading . . .
Temporal RAG: Embracing Time for Smarter, Reliable Knowledge Graphs
YouTube video by How AI Is Built
https://youtu.be/VpFVAE3L1nk
0
6
3
reposted by
How AI Is Built
Nicolay Gerold
8 months ago
Dropping some new episodes on
@howaiisbuilt.fm
. Links below.
1
1
1
reposted by
How AI Is Built
Nicolay Gerold
9 months ago
Trey and I talk about the different techniques for AI-powered search and how we can combine them to build modern search systems. Spotify:
open.spotify.com/episode/1udV...
Apple:
podcasts.apple.com/us/podcast/a...
loading . . .
AI-Powered Search: Context Is King, But Your RAG System Ignores Two-Thirds of It | S2 E21
How AI Is Built ยท Episode
https://open.spotify.com/episode/1udVhGicv73zdrIa0XtodN?si=LJKIZXTRSxWK40WAaPwDWg
0
2
1
reposted by
How AI Is Built
Nicolay Gerold
9 months ago
You want the exact opposite. You want layers of tools aligned in a graph that you can tune, debug, and update in isolation. Today on How AI Is Built, we are talking to one of the OGs of search: Trey Grainger, the author of AI Powered Search.
www.youtube.com/watch?v=6IQq...
loading . . .
AI-Powered Search: Context Is King, But Your RAG System Ignores Two-Thirds of It | S2 E21
YouTube video by How AI Is Built
https://www.youtube.com/watch?v=6IQqTIMJdBo
1
1
1
New episode is out. The three contexts of search, layered architectures and much more!
add a skeleton here at some point
9 months ago
0
0
0
reposted by
How AI Is Built
Nicolay Gerold
9 months ago
Youtube:
www.youtube.com/watch?v=trG5...
Spotify:
open.spotify.com/episode/6eyT...
Apple:
podcasts.apple.com/us/podcast/c...
loading . . .
Chunking for RAG: Stop Breaking Your Documents Into Meaningless Pieces | S2 E20
YouTube video by How AI Is Built
https://www.youtube.com/watch?v=trG5pjChAkM
0
1
2
reposted by
How AI Is Built
Nicolay Gerold
9 months ago
The biggest lie in RAG is that semantic search is simple. The reality is that it's easy to build, it's easy to get up and running, but it's really hard to get right. And if you don't have a good setup, it's near impossible to debug. One of the reasons it's really hard is chunking.
1
2
1
reposted by
How AI Is Built
Nicolay Gerold
10 months ago
Most companies can't afford huge teams labeling AI data. So, use an AI model to train an AI model. The big labs like Cohere and OpenAI already use โsynthetic dataโ - AI-generated data that mimics real-world patterns. The LLMs you use are already trained with it.
youtu.be/thqgKG5lZ8Q
loading . . .
How AI Can Start Teaching Itself
YouTube video by How AI Is Built
https://youtu.be/thqgKG5lZ8Q
1
1
1
reposted by
How AI Is Built
Nicolay Gerold
10 months ago
Youtube:
youtu.be/Z9Z820HadIA
Spotify:
open.spotify.com/episode/3LAJ...
Apple:
podcasts.apple.com/us/podcast/a...
loading . . .
A Search System That Learns As You Use It (Agentic RAG)
YouTube video by How AI Is Built
https://youtu.be/Z9Z820HadIA
0
2
1
reposted by
How AI Is Built
Nicolay Gerold
10 months ago
Want to learn more? Today on
@howaiisbuilt.fm
, we are talking to Stephen Batifol from Zilliz. Stephen and I discuss agentic RAG and the future of search - where systems decide their own path to find answers. What's your take on agentic RAG?
youtu.be/Z9Z820HadIA
loading . . .
A Search System That Learns As You Use It (Agentic RAG)
YouTube video by How AI Is Built
https://youtu.be/Z9Z820HadIA
1
2
1
reposted by
How AI Is Built
Nicolay Gerold
10 months ago
"Instead of being a one-way pipeline, agentic RAG allows you to check, 'Am I actually answering the user's question?'" Different questions need different approaches. โก๏ธ ๐ค๐๐ฒ๐ฟ๐-๐๐ฎ๐๐ฒ๐ฑ ๐๐น๐ฒ๐ ๐ถ๐ฏ๐ถ๐น๐ถ๐๐: - Structured data? Use SQL - Context-rich query? Use vector search - Date-specific? Apply filters first
2
5
1
reposted by
How AI Is Built
Nicolay Gerold
10 months ago
We talk about how they enable BM25 on PostgreSQL, how they integrate into the Postgres Query engines, and how you can build search in your database.
open.spotify.com/episode/4CXX...
loading . . .
Rethinking Search Inside Postgres, From Lexemes to BM25
How AI Is Built ยท Episode
https://open.spotify.com/episode/4CXX0XKlicoPnGuefQS2ZJ?si=2-EkbOSdTsGbIxFKNO9Fxg
0
1
1
reposted by
How AI Is Built
Nicolay Gerold
10 months ago
Not anymore. ParadeDB is building an open-source PostgreSQL extension to enable search within your database. Today on How AI Is Built, I am talking to
@philippemnoel.bsky.social
, the founder and CEO of
@paradedb.bsky.social
.
youtu.be/RPjGuOcrTsQ
loading . . .
Rethinking Search Inside Postgres, From Lexemes to BM25
YouTube video by How AI Is Built
https://youtu.be/RPjGuOcrTsQ
1
3
2
reposted by
How AI Is Built
Nicolay Gerold
10 months ago
Many companies use ElasticSearch or OpenSearch and use 10% of the capacity. On top, they have to build ETL pipelines. Get data normalized. Worry about race conditions. All in all, when you want to do search on top of your existing database, you are forced to build distributed systems.
#ai
1
2
2
Documentation quality is the silent killer of RAG systems. A single ambiguous sentence might corrupt an entire set of responses. But the hardest part isn't fixing errors - it's finding them. Check out the episode with Max. Links to Spotify, Apple in the thread.
add a skeleton here at some point
11 months ago
0
1
1
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
LLMs hallucinate. We want to put the blame on them. But often itโs our fault. Many knowledge bases have: โ Temporal Inconsistencies - Multiple versions from different time periods - Historical information without timeline context >>
1
1
1
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
With RAG these issues are amplified. We do not look at full documents anymore, but at bits and pieces. So we have to be extra careful. Today on
@howaiisbuilt.fm
we talk to Max Buckley. Max works at Google and has built a lot of interesting stuff with LLMs to improve knowledge bases for RAG. >>
1
0
1
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
Some query types might not work at all. It is very costly in terms of storage and compute. We have to keep our indexes in memory to achieve a low enough latency for search. What we are talking about today works for everything, works out of domain, and is one of the most efficient. >>
1
2
1
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
People implementing RAG jump straight into vector search. But vector search has a lot of downsides. Vector search is not robust out of domain. Different types of queries need different embedding models with different vector indices. >>
1
4
2
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
You probably guessed it, we are talking about the OG ranking function in search: BM25. Today we are back continuing our series on search on
@howaiisbuilt.fm
with
@taidesu.bsky.social
. We talk about BM25, how it works, what makes it great and how you can tailor it to your use-case.
1
2
2
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
"Sadly, it's a bit off a snake oil. These long context embedding models have tested basically all of them, not really working well. So it's [best length of chunks] something between like 500 and 1,000 tokens." Text embeddings are far from perfect. They struggle with long documents. >>
1
6
3
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
Vector Databases come with their own set of challenges. The data is too large to be stored on a single node. We often need to handle 10k to 50k QPS. Indexes are very slow to build, but we still want to search the fresh data. >>
4
3
2
Catch the episode on: - Youtube:
youtu.be/3PEARAf7HEc
(now in 4K :D) - Spotify:
open.spotify.com/episode/5lCl...
- Apple:
podcasts.apple.com/us/podcast/v...
add a skeleton here at some point
11 months ago
0
0
0
reposted by
How AI Is Built
Nicolay Gerold
11 months ago
โThere is no free lunch.โ Every performance optimization comes with tradeoffs in either functionality, flexibility, or cost. When building search systems, there's a seductive idea that we can optimize everything: fast results, high relevancy, and low costs. But thatโs not the reality.
5
1
1
you reached the end!!
feeds!
log in