Dan Petrovic
@dejanseo.bsky.social
📤 439
📥 161
📝 189
I do machine learning and SEO. Director of DEJAN |
https://dejan.ai/
The paper "Why Language Models Hallucinate" (Kalai, Nachum, Vempala, Zhang, 2025) shows that LLMs inevitably hallucinate due to statistical limits and evaluation incentives. Without grounding in real retrieval systems, they cannot provide reliable search.
dejan.ai/blog/llm-is-...
LLM is a Presentation Layer in AI Search
Classic IR: crawl, index, retrieve, rank remain with search engines. There is a persistent myth that large language models (LLMs) have fundamentally replaced search. In truth, LLMs do not crawl the we...
https://dejan.ai/blog/llm-is-a-presentation-layer-in-ai-search/
1 day ago
0
4
1
Citation Mining for AI SEO 👉
cm.dejan.ai
At DEJAN AI this represents a single step in a complex citation mining pipeline which involves GSC API, query intent classification, synthetic query and fanout generation, prompt generation, citation target text retrieval, NLP and semantic matching.
2 days ago
0
1
0
What a great day for the Gen AI Search industry! Google's DeepMind team has just released EmbeddingGemma, the most powerful embedding model with direct lineage to the Gemini technology that powers Google's AI Search:
dejan.ai/blog/embeddi...
EmbeddingGemma: The Game-Changing Model Every SEO Professional Needs to Know
Why Google’s Latest Embedding Model Could Reshape Search Understanding In the business of Gen AI search optimization, staying ahead means understanding the underlying technologies that power modern se...
https://dejan.ai/blog/embeddinggemma/
18 days ago
0
2
0
Internally we use terms like SR (Selection Rate) and PB (Primary Bias) in the context of AI Search optimization. What do they mean? Take a look because these things really are a big deal:
dejan.ai/blog/sr/
Primary Bias on Selection Rate in AI Search
What is Selection Rate? Selection Rate (SR) is a key performance metric for AI systems that measures the frequency with which an AI selects and incorporates a specific item from a total set of groundi...
https://dejan.ai/blog/sr/
18 days ago
0
1
0
Update: Brand FAQs added to our AI sentiment tool:
reviews.dejan.ai
18 days ago
0
0
0
Want this? We're looking for AI Search Optimization pioneers to provide feedback. Early Access Request:
dejan.ai/engage/?feat...
20 days ago
0
1
0
The Latent History of AI Boom
dejan.ai/blog/ai-boom/
The Latent History of AI Boom
This is the story of how AI transitioned from niche to mainstream and the pieces that fell into place to make that happen. Picture this. It’s 2017, we’re in the era dominated by Recurrent Neural Netwo...
https://dejan.ai/blog/ai-boom/
21 days ago
0
1
0
Joshua Squires discovered something about AI Overviews that didn't get enough attention. Here's my theory:
dejan.ai/blog/ai-over...
AI Overviews = Dialogflow Agent?
Joshua Squires shared one of the most interesting AI Overview leaks and for some reason it was mostly ignored by the SEO industry. I’d like to draw your attention to it today because it provides two k...
https://dejan.ai/blog/ai-overviews-dialogflow-agent/
22 days ago
0
0
0
dejan.ai/blog/fan-out...
Fan-Out Query Search Volume Prediction Using Deep Learning
While traditional keyword research tools provide valuable data, they often fall short in discovering truly novel or long-tail search query variations that a business might not yet rank for, or even be...
https://dejan.ai/blog/fan-out-query-search-volume-prediction-using-deep-learning/
23 days ago
0
2
0
Look at me predicting monthly search volumes for ~700,000 fan-out queries for one of my clients. No SaaS, no APIs. A custom model I trained for them on their own GSC data. This is absolutely delightful stuff! ✨
24 days ago
0
2
0
Are you engaging with comment bots without realising? Here's a helpful AI detection guide:
dejan.ai/blog/comment...
Comprehensive Guide to Identifying AI Comment Bots
Some people use AI to speed up the process of getting their ideas and message out. Others use it to polish up their language which I think is really cool use of AI, especially if they’re not a native ...
https://dejan.ai/blog/comment-bots/
25 days ago
0
2
0
Chrome is now *loaded* with AI features and models. Here's my reference list:
dejan.ai/blog/chrome-...
Chrome AI Frameworks & Models
Chrome’s AI-driven segmentation platform enhances user experiences by predicting behaviours and tailoring features accordingly. Explore the different models that power these optimizations and how they...
https://dejan.ai/blog/chrome-ai-models/
26 days ago
0
0
1
dejan.ai/blog/does-sc...
Tested and nothing. Sorry guys :( Thanks
@lilyray.nyc
Does Schema Help With “AI”?
This test is designed to show whether Open AI’s browsing tool does a better job at supplying their model GPT-5 with grounding context from a page with schema. We took the exact HTML from the original ...
https://dejan.ai/blog/does-schema-help-with-ai/
about 1 month ago
1
4
2
dejan.ai/blog/your-we...
Your website is about to start talking. Are you ready for this?
Chrome is about to give all websites a voice through a built-in version of Gemini. Your visitors will have completely private chats with it. No external API calls to Google’s servers and once loaded y...
https://dejan.ai/blog/your-website-is-about-to-start-talking-are-you-ready-for-this/
about 1 month ago
0
2
2
Everything SEO people love these days, chunking, vectors and more:
dejan.ai/blog/inside-...
Inside Chrome’s Semantic Engine: A Technical Analysis of History Embeddings
I decoded Chrome’s internal semantic search, found the exact chunking mechanism, embedding logic and am now able to browse, search and cluster my own search history through decoded vector embeddings. ...
https://dejan.ai/blog/inside-chromes-semantic-engine-a-technical-analysis-of-history-embeddings/
about 1 month ago
0
0
0
What a fascinating way to get an idea of what Google "thinks" you do based on what it has on you. Based on the "About" snippet, I think I should stop releasing random tools and start promoting the fact that I run a commercial AI Search agency with paying clients.
about 1 month ago
1
2
0
about 1 month ago
1
14
5
This tool allows you to see if a person, brand, product or service is a known entity in Google's knowledge graph.
entities.dejan.ai
about 1 month ago
0
0
0
The two pillars of AI optimization are model understanding and model control, which have well-established analogues in the machine learning industry: mechanistic interpretability and model steering. This is new. The rest is SEO. Read:
dejan.ai/blog/underst...
about 1 month ago
0
5
0
Poll Results: Most people call ChatGPT etc. just "AI". AI – 71.1% (621 votes) Chatbots – 11.9% (103 votes) Something else – 9.5% (82 votes) AI Assistants – 7.6% (66 votes)
dejan.ai/blog/people-...
People call them AI. That’s it.
Poll Results on Social Media: What Do We Call ChatGPT, Claude, Gemini, Perplexity? Across 864 total votes collected on social media polls, respondents gave a fragmented view on how to label tools like...
https://dejan.ai/blog/people-call-them-ai-thats-it/
about 1 month ago
0
1
0
How to find and fix risky links? Video:
www.youtube.com/watch?v=AjkF...
Tool:
penguin.dejan.ai
How to find and fix risky links?
YouTube video by DEJAN
https://www.youtube.com/watch?v=AjkFmqYbmrw
about 1 month ago
0
0
0
The Ultimate Guide to Natural Link Integration Watch this:
www.youtube.com/watch?v=Td_k...
Natural Link Integration:
linkbert.com
Unnatural Link Detection:
penguin.dejan.ai
The Ultimate Guide to Natural Link Integration
YouTube video by DEJAN
https://www.youtube.com/watch?v=Td_k7QTGkH8
about 1 month ago
0
1
0
We scraped OpenAI and Google and trained this model on their content. LinkBERT is a link building expert and can identify the optimal place for links in web content. Demo:
linkbert.com
www.youtube.com/watch?v=iNKZ...
LinkBERT V2
YouTube video by DEJAN
https://www.youtube.com/watch?v=iNKZHfrbmnY
about 1 month ago
2
4
1
My claim that the future of SEO is secured with AI relying on search engines rather than internal memory has been both praised and challenged by the community. I've updated the article with key citations and links to reliable sources:
dejan.ai/blog/gpt-5-m...
GPT-5 Made SEO Irreplaceable
OpenAI’s latest model is trained to be intelligent, not knowledgeable. Wait, what? Yup. You read that right. Here’s an example: Now, you may think this is some pretty esoteric knowledge not broadly re...
https://dejan.ai/blog/gpt-5-made-seo-irreplaceable/
about 1 month ago
0
4
1
Are you an SEO? Rejoice! With GPT-5 OpenAI secured your job for a very, very long time:
dejan.ai/blog/gpt-5-m...
about 1 month ago
0
5
3
✅ New model trained and deployed. ✅ Added "High Effort" mode for deep query fan-out. ✨ Free Access:
dejan.ai/tools/fanout/
about 1 month ago
0
1
0
www.chris-green.net/post/substan...
How "Substantial" is Ranking Content?
Content Substance and Rankings - TL;DR • The data suggests that more than 50% of ranking results in the top 15 organic positions are “fluffy” according to the substance model. • There’s no clear diffe...
https://www.chris-green.net/post/substantial-content-ranking-content
about 2 months ago
0
0
0
Are your backlinks safe? Our link spam detection algorithm can accurately predict obvious money links on any page. Test yours and see if we can spot them:
penguin.dejan.ai
about 2 months ago
0
1
0
dejan.ai/blog/journal...
Journalism Is Dead. Say Hello to Gournalism.
John Botman For nearly two centuries, journalism operated under the assumption that truth mattered, stories should be original, and humans should write things for other humans to read. Quaint, right? ...
https://dejan.ai/blog/journalism-is-dead-say-hello-to-gournalism/
about 2 months ago
0
3
0
Which queries will be grounded with search? You can now paste your queries in bulk, one per line:
grounding.dejan.ai
and get classification results for each.
about 2 months ago
1
5
2
OpenAI Grounding Classifier
grounding.dejan.ai
about 2 months ago
0
1
0
A friendly guide to getting started with machine learning in SEO.
youtu.be/9DnmDbzp5lA
[~30min.]
Machine Learning for SEO by DEJAN AI
YouTube video by DEJAN
https://youtu.be/9DnmDbzp5lA
about 2 months ago
0
2
0
For my text chunk loving friends:
chunk.dejan.ai
2 months ago
0
2
1
What do humans and AI have in common? We don’t read. Instead we rely on attention mechanisms to process text information. When optimising content for AI and humans you must get to the point early and optimise content to reduce cognitive load.
dejan.ai/blog/human-f...
Human Friendly Content is AI Friendly Content
What do humans and AI have in common? We don’t read. Instead we rely on attention mechanisms to process text information. When optimising content for AI and humans you must get to the point early and ...
https://dejan.ai/blog/human-friendly-content-is-ai-friendly-content/
2 months ago
0
2
0
My team is going to love this:
youtu.be/2zly04mUjxY
AI SEO - Competitive Intelligence for Google AI Search
YouTube video by DEJAN
https://youtu.be/2zly04mUjxY
2 months ago
0
1
0
dejan.ai/blog/analysi...
Analysis of Gemini Embed Task-Based Dimensionality Deltas
When generating vector embeddings for your text using Gemini Embed there are several embedding optimisation modes: For each one you get slightly different embeddings, each optimised for the task at ha...
https://dejan.ai/blog/analysis-of-gemini-embed-task-based-dimensionality-deltas/
2 months ago
0
0
0
www.youtube.com/live/G7isIBu...
Campfire Chat with Dan Petrovic
YouTube video by The SEO Community
https://www.youtube.com/live/G7isIBuAdII
2 months ago
0
0
0
I applied a super-cool thresholding algorithm by Nobuyuki Otsu from 1979 to help me with arbitrary label search query classification using GLiNER. A very unlikely application in NLP but it works like a charm!
dejan.ai/blog/otsu/
3 months ago
0
0
0
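To make the Otsu idea concrete, here is a minimal sketch of how the threshold could be picked over a list of per-label confidence scores. The label names and scores are hypothetical, and the code does not call GLiNER itself; it only assumes that some classifier has already produced one score per label. Otsu's method chooses the cut that maximizes the between-class variance of the two resulting groups.

```python
def otsu_threshold(scores):
    """Return the cut that maximizes between-class variance (Otsu, 1979)."""
    values = sorted(scores)
    n = len(values)
    if n < 2:
        raise ValueError("need at least two scores")
    total = sum(values)
    best_threshold, best_variance = values[0], -1.0
    low_sum = 0.0
    for i in range(1, n):
        low_sum += values[i - 1]
        if values[i] == values[i - 1]:
            continue  # no gap between equal scores, so no cut here
        w_low = i / n                      # share of scores below the cut
        w_high = 1.0 - w_low               # share of scores above the cut
        mean_low = low_sum / i
        mean_high = (total - low_sum) / (n - i)
        variance = w_low * w_high * (mean_low - mean_high) ** 2
        if variance > best_variance:
            best_variance = variance
            best_threshold = (values[i - 1] + values[i]) / 2
    return best_threshold


# Hypothetical label scores for a single search query.
label_scores = {
    "pricing": 0.91,
    "comparison": 0.84,
    "support": 0.12,
    "careers": 0.08,
    "login": 0.05,
}
threshold = otsu_threshold(label_scores.values())
kept = [label for label, score in label_scores.items() if score > threshold]
print(threshold, kept)
```

The appeal of this approach is that no fixed cut-off has to be hand-tuned per label set: the threshold adapts to however the scores happen to separate for each query.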
I analysed 18,000,000 real human search queries. They're short.
3 months ago
0
2
0
Optimising for AI visibility.
dejan.ai/media/html/r...
Here's a glimpse into one part of our AI model interpretability pipeline where we probe the model and extract citations, named entities and sentiment for the main brand and its top competitors.
3 months ago
0
1
0
Got that? Good.
techcrunch.com/2025/06/30/c...
3 months ago
0
2
0
Watching this space.
3 months ago
0
0
0
Today I've finalised training and testing of the world's first Gemma 3 embedding model, specially engineered for reversible/decodable high-quality embeddings. These will be used to create training data for the embedding decoder model that trains our query fan-out model:
dejan.ai/blog/gemma-e...
Training Gemma‑3‑1B Embedding Model with LoRA
In our previous post, Training a Query Fan-Out Model, we demonstrated how to generate millions of high-quality query reformulations without human labelling, by navigating the embedding space between a...
https://dejan.ai/blog/gemma-embed/
3 months ago
0
1
0
Grab some coffee.
dejan.ai/blog/trainin...
Training a Query Fan-Out Model
Google discovered how to generate millions of high-quality query reformulations without human input by literally traversing the mathematical space between queries and their target documents. Here’s Ho...
https://dejan.ai/blog/training-a-query-fan-out-model/
3 months ago
0
0
0
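One simple reading of "traversing the space between a query and its target document" is linear interpolation between the two embedding vectors. The sketch below shows only that reading; the linked article may describe a different traversal, and the function name, vectors and step count are all hypothetical. In a full pipeline each intermediate point would presumably be passed to an embedding decoder to produce a candidate query, which is not shown here.

```python
def interpolate(query_vec, doc_vec, steps):
    """Return `steps` evenly spaced points from query_vec to doc_vec."""
    if len(query_vec) != len(doc_vec):
        raise ValueError("vectors must have the same dimensionality")
    if steps < 2:
        raise ValueError("need at least two steps (the two endpoints)")
    points = []
    for i in range(steps):
        t = i / (steps - 1)  # 0.0 at the query, 1.0 at the document
        points.append([(1 - t) * q + t * d for q, d in zip(query_vec, doc_vec)])
    return points


# Toy 2-dimensional embeddings, chosen only for illustration.
path = interpolate([0.0, 1.0], [1.0, 0.0], steps=3)
print(path)
```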
Why didn't I think of this before!
3 months ago
0
1
0
dejan.ai/blog/cosine-...
Cosine Similarity or Dot Product?
Google’s embedder uses dot product between normalized vectors which is computationally more efficient but mathematically equivalent to cosine similarity. How Googler’s work and think internally typica...
https://dejan.ai/blog/cosine-similarity-or-dot-product/
3 months ago
0
1
0
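The equivalence mentioned in the card above can be checked numerically. This is a small sketch with made-up vectors: once both vectors are scaled to unit length, their plain dot product equals the cosine similarity of the originals, so the division by the norms can be skipped at query time.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def l2_normalize(v):
    norm = math.sqrt(dot(v, v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def cosine_similarity(a, b):
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


# Toy vectors, not real embeddings.
query = [3.0, 4.0, 0.0]
doc = [1.0, 2.0, 2.0]

cos = cosine_similarity(query, doc)
dot_normalized = dot(l2_normalize(query), l2_normalize(doc))
print(math.isclose(cos, dot_normalized))
```

The efficiency gain comes from normalizing each vector once at indexing time, after which every comparison is a single dot product.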
Today we're adding a powerful new model to our AI SEO tech stack to help us classify client queries on whatever labels we want and whatever industries we want. Goodbye "informational, navigational...etc".
dejan.ai/blog/univers...
Universal Query Classifier
Generalist, Open‑Set Classification for Any Label Taxonomy We’ve developed a search query classifier that takes any list of labels you hand it at inference time and tells you which ones match each sea...
https://dejan.ai/blog/universal-query-classifier/
3 months ago
0
2
0
😂
3 months ago
0
11
1
I've developed something we call the "Tree Walker": an algorithm that helps us get into a model's head. It analyzes all possible paths a model can take once it starts a sentence, exploring up to 5 top tokens for each next token using a dynamic probability threshold.
3 months ago
1
0
0
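The post does not show how the Tree Walker is implemented, so the following is only a sketch of the general idea under stated assumptions. A hard-coded toy table stands in for a real language model, and the "dynamic probability threshold" is assumed here to be a fraction of the top token's probability at each step. All names (`TOY_MODEL`, `next_token_probs`, `walk_tree`, `rel_threshold`) are hypothetical.

```python
# Toy next-token table standing in for a real language model (assumption).
TOY_MODEL = {
    (): {"The": 0.6, "A": 0.35, "An": 0.05},
    ("The",): {"best": 0.5, "top": 0.4, "worst": 0.1},
    ("A",): {"good": 0.7, "bad": 0.3},
}


def next_token_probs(prefix):
    """Return {token: probability} for the given prefix (empty if unknown)."""
    return TOY_MODEL.get(tuple(prefix), {})


def walk_tree(prefix=(), prob=1.0, max_depth=3, top_k=5, rel_threshold=0.5):
    """Enumerate likely continuations as (token_path, cumulative_probability)."""
    dist = next_token_probs(prefix)
    if not dist or len(prefix) >= max_depth:
        return [(prefix, prob)]
    # Keep at most top_k candidates, most probable first.
    ranked = sorted(dist.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    # Assumed dynamic threshold: a fraction of the best token's probability.
    cutoff = ranked[0][1] * rel_threshold
    paths = []
    for token, p in ranked:
        if p < cutoff:
            break  # remaining tokens are even less likely
        paths.extend(
            walk_tree(prefix + (token,), prob * p, max_depth, top_k, rel_threshold)
        )
    return paths


paths = walk_tree(max_depth=2)
for path, p in paths:
    print(" ".join(path), round(p, 3))
```

A relative cutoff like this keeps the tree narrow where the model is confident and lets it branch where several tokens are close in probability.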
Do not bend for the algorithms. Do what's right. Hold your ground. Wait. Watch the algorithms bend to you.
4 months ago
2
6
0