Peter Sergeant
@sgnt.ai
š¤ 134
š„ 225
š 222
Full-time building LLM and ML powered NPCs/agents for a popular online game.
[email protected]
Forget the complexity: AI all boils down to drawing the right lines:
sgnt.ai/p/finding-li...
loading . . .
Forget the complexity: AI all boils down to drawing the right lines | sgnt.ai
Over and over again, despite the best efforts of humans, the most effective AI systems come down to one simple idea: finding the right shaped line that fits some data points.
https://sgnt.ai/p/finding-lines/
3 months ago
0
1
0
In RAG, your chunks solve three distinct problems; trying to use one algorithm to solve all three probably isn't optimal:
sgnt.ai/p/rag-trinity/
4 months ago
1
0
0
One day in 2012 I wrote up from a nightmare in which I was being chased by angle brackets, and sweating profusely, and found this code in my editor
github.com/pjlsergeant/...
loading . . .
GitHub - pjlsergeant/xslt-fever-dream: XSLT is a great programming language
XSLT is a great programming language. Contribute to pjlsergeant/xslt-fever-dream development by creating an account on GitHub.
https://github.com/pjlsergeant/xslt-fever-dream
4 months ago
0
5
1
I will fix your vibe-coded MVP. Not joking.
sgnt.ai/p/vibe-coded/
loading . . .
I will fix your vibe-coded MVP | sgnt.ai
You delivered your vision quickly using one of the many excellent LLM-based coding tools, users are happy. I will dig you out of the tech-debt hole you have created.
https://sgnt.ai/p/vibe-coded/
5 months ago
1
9
2
China sends its most promising civil servants to a US university for 1-3 years, and rather seeing this as a total and complete Cultural Victory, recruitment opportunity, and chance to indoctrinate them, the Trump administration wants to shut this down. Baffling.
www.wsj.com/world/china/...
loading . . .
Harvard Has Trained So Many Chinese Communist Officials, They Call It Their āParty Schoolā
The universityās Kennedy School of Government has long been favored by party cadres seeking career boosts.
https://www.wsj.com/world/china/china-communist-party-harvard-f855112b
5 months ago
1
2
0
Three dumb ChatGPT tricks I use all the time... 1. If I need work critiqued, I tell the LLM that I got someone else to write it for me, and I'm going to pay them based on how good it is. Less time spent softening feedback, and more time spent on pros and cons. Repeat we get to a solid 9 or 10/10
6 months ago
1
2
0
I finally completed an article I've been trying to write for a year; the non-programmer's guide to embeddings! No maths, lots of dogs, and a step by step intuitive guide:
sgnt.ai/p/embeddings...
loading . . .
Understanding Modern AI is Understanding Embeddings: A Guide for Non-Programmers (with lots of dogs!) | sgnt.ai
Embeddings are a core AI concept that underpin a great deal of what we today think of as being AI. This article is going to give you an accurate and intuitive understanding of what an āembeddingā is i...
https://sgnt.ai/p/embeddings-explainer/
6 months ago
0
7
1
reposted by
Peter Sergeant
Joseph Fasano
6 months ago
It's final paper season.
11
1595
470
Letās say we were to discover tomorrow that dolphins are in fact fish, not mammals. Given the huge amount of data in LLM training sets asserting that dolphins are mammals, how does that fact end up getting updated? Just that we generate more new text as time goes on and we eventually have more ā¦
6 months ago
2
1
0
Our LLM game NPC couldn't block chat input & got interrupted constantly ... we engineered "killable" responses using cancel tokens + input debouncing so it could pivot instantly like a human. Made interactions way more natural for impatient players.
sgnt.ai/p/interrupti...
loading . . .
When Users Wonāt Wait: Engineering Killable LLM Responses | sgnt.ai
In our application, the chatbot canāt hide behind a loading spinner; users keep talking and expect it to pivot instantly. This constraint forced us to develop some lightweight techniques you can graft...
https://sgnt.ai/p/interruptible-llm-responses/
7 months ago
1
4
2
In-memory free-text search is a super-power for LLMs; spend less on inference by sprinkling in some free lexical search into the promptā¦
sgnt.ai/p/free-text-...
loading . . .
In-memory free-text search is a super-power for LLMs | sgnt.ai
While working on LLM-driven NPCs, I observed significant improvements in several areas by adding a simple component: in-memory free-text search
https://sgnt.ai/p/free-text-search/
7 months ago
0
2
1
From OpenAIās agent building advice. The first two of these are terrible, terrible ideas for things to give to a LLM to make a decision on. More:
sgnt.ai/p/hell-out-o...
7 months ago
0
0
0
I would love to see more research about whatever weāre calling Sapir-Whorf for LLMs. Do you get substantially different opinions from an LLM depending on which language you query it in? Does its opinion on the greatest empire of all time change depending on if your query is Mongolian or Italian?
7 months ago
0
1
0
Don't let an LLM make decisions or implement business logic, they suck at that:
sgnt.ai/p/hell-out-o...
7 months ago
0
0
0
Four bad definitions of agentic AI:
sgnt.ai/p/agentic-ai...
loading . . .
Four bad definitions of "Agentic AI" | sgnt.ai
If your team promises to deliver (or buy!) 'Agentic AI', then everyone needs to have a shared understanding of what that means; you don't want to be the one left trying to explain the mismatch to stak...
https://sgnt.ai/p/agentic-ai-bad-definitions/
8 months ago
0
0
0
Is anyone else building behavioural plugins for agents / chat-bots? Certain modes can be enabled for brief periods that add new functionality depending on triggers? I keep having to invent novel stuff for this project, and it would be amazing to chat to people doing anything vaguely similar
8 months ago
1
0
0
Remake of Eternal Sunshine of the Spotless Mind, only this time the guyās seducing her based on her ChatGPT history
8 months ago
0
0
0
Please can we not radicalize Mark Cuban by being mean to him online
add a skeleton here at some point
9 months ago
1
1
0
hold up
9 months ago
0
0
0
If they've RLHF'd out the ability for ChatGPT to check its own working I'm gonna be pissed
9 months ago
0
0
0
Iām not going to link to the article because we donāt reward clickbait in this house, but claiming that āgenerative AI is a conā because OpenAI and Anthropic havenāt proven that prop-model plus selling inference is a good business model is dumb
9 months ago
1
1
0
Full marks for self-confidence but _nil points_ for execution there buddy
9 months ago
0
2
0
Iām aware of two reasonably good definitions of Agentic AI, but I can definitively tell you that in practice it just means āstuff that AI might be able to do for us some dayā. I havenāt heard anyone use it as anything other as a stand-in for āawesome stuff coming soonā in 3 days of this expo
9 months ago
0
2
0
Iāve done a few decades of software engineering and never had to deal with a field before where keeping the ability to rip out components is such an advantage as AI
9 months ago
2
2
0
At an AI expo in Dubai. Nothing destroys an exhibitorās credibility faster than having the word blockchain anywhere near their product
9 months ago
0
3
1
reposted by
Peter Sergeant
Ethan Mollick
9 months ago
One thing academics should take away from Deep Research is that a substantial number of your readers in the future will likely be AI agents. Is your paper available in an open repository? Are any charts and graphs described well in the text? Is it well-titled? Probably worth considering theseā¦
2
85
12
Blog article to come when I'm not insanely busy, but had good results today with adding a "read the room" instruction to a the CoT prompt. The problem I'm trying to solve is the user telling the bot "my favourite sword is the Katana!" and the bot just providing info the user doesn't need about it
9 months ago
1
0
0
I mean, I'm not saying anything particularly novel here, but DeepSeek is really very very good, and I'm not sure how ChatGPT stays as my daily driver at this rate
10 months ago
1
0
0
Writing the perfect non-technical article about embeddings is haunting me. I have rewritten this damn thing four or five times, each time getting stuck on a given topic transition being too stark. Itās so close I can feel it, but Iām sure Iāll still be saying that in a year
10 months ago
0
1
0
"Street-fighting RAG: chain-of-thought prompting", in which I start talking about the challenges of developing a RAG application in what I am utterly convinced is the most challenging possible environment...
www.sgnt.ai/p/street-fig...
10 months ago
1
1
1
Many thousands of messages to analyse in the New Year spewed forth from our intentionally very abusive and mean NPC. While he's exceptionally crotchety, angry, violent, and sweary, it's also important he's not racist, sexist, or ableist. If this would be useful synthetic data for anyone, do lmk
11 months ago
0
0
0
Iām at a Christmas Eve party and the players found the first NPC. Thousands of messages in the last hours, and here I am watching it happen on Langfuse from my iPhone š Send hug-ops
11 months ago
0
1
0
Langfuse's possibilities and web interface are sparking joy, but Langfuse's JS/TS documentation is absolutely _not_
11 months ago
0
0
0
Not my strongest effort, but a pretty coherent Scrooge NPC character in production after a couple of days of effort (building on a much larger body of work I'm putting together). He's, uh, a little angry. Bah humbug!
11 months ago
0
0
0
I created a Character AI account to see if you can interrupt characters while talking (you can't). I sent maybe four messages to a random bot. It's been emailing me every day since. Perhaps I am meant to cure _it's_ loneliness?
11 months ago
0
1
0
What's SOTA for contextual chunking of PDFs these days? I have a reasonably clear picture of how I could whip something together that I think would be pretty effective, but is there something recent, excellent, and obvious?
11 months ago
1
1
0
I feel like the technique described could be naĆÆvely implemented as āgenerate the output a few times with some reflection and then get the bot to summarize what it thinks are the best answersā
add a skeleton here at some point
11 months ago
1
1
0
AI has _not_ been able to convincingly recommend other songs to go on my playlist that leads with Wayfaring Pilgrim by Roy Buchanan and Al Kooper playing Season of the Witch. Can Bluesky do any better?
11 months ago
0
0
0
One of the advantages of developing for a game is we can actually deploy AI agents -- obviously there is endless testing, evals, etc, but if our LLM-backed negotiation bot accidentally under-prices an asset it's not the end of the world. The next year is going to be incredibly exciting
11 months ago
1
1
0
Iāve completely stopped Twitter use for ⦠a month now? Turns out I just wanted something to scroll and thereās now enough here to scratch that itch
11 months ago
1
2
0
Other than the PleIAs models, are there any other truly open-source and not-copyright-encumbered models?
11 months ago
2
1
0
It's _almost_ too obvious to need saying, but your life will be 50x better if you can very, very quickly test prompts (without running the whole eval suite) and then have all the logs you might possibly need at your fingerprints
11 months ago
1
0
0
One of my favourite things to do recently with ChatGPT is voice-mode to ask it to interview me about something I need to write. Specify I only want one question as a time, and then dialogue with it. At the end, it can summarise and organise what I said
11 months ago
1
3
0
Am I hallucinating this, or are none of the "fast inference" companies actually ready to sell yet? Groq has pricing, but you can't sign up for billing. Cerebras has no pricing (and a tiny context window). SambaNova, no pricing.
11 months ago
2
0
0
Gemini 2.0 flash: speed looks good vis-a-vis 4o-mini, and quality looks good so far against my eval set. If it's cheaper than 4o-mini too (which, it probably will be?) then OpenAI have a real problem, because switching between them is a value in a config file.
11 months ago
1
3
0
All I want for Xmas is legislation that bans apps and sites from disabling paste in password fields
add a skeleton here at some point
11 months ago
0
0
0
Have any of you actually made generating SPARQL queries against a knowledge graph work in practice on a real deployed project? Iām pretty skeptical of how feasible this is in the real world
11 months ago
0
0
0
Is having a truly open model create lots of synthetic tokens, and then using a non-free model decide which of them are suitable for further training, copyright laundering?
11 months ago
0
0
0
When, if ever, do QAs / traditional test developers take over writing the LLM evals? I have a strong interest in testing, and hearing almost nothing from that part of the wider dev community
11 months ago
1
1
0
reposted by
Peter Sergeant
Ethan Mollick
11 months ago
A test of how seriously your firm is taking AI: when o-1 (& the new Gemini model) came out this week, were there assigned folks who immediately ran the model through your internal, validated, firm-specific benchmarks to see how useful it as? Did you update any plans or goals as a result?
11
195
37
Load more
feeds!
log in