Juan Peredo
@juanperedotech.bsky.social
📤 43
📥 59
📝 24
I have always thought teachers have one of the most important jobs in society. I am building Bolbeck Imagine to help them create books that kids may actually want to read, and thus make teachers' lives a bit easier. Just upload the manuscript and work with the assistant to create your PDF book.
1 day ago
1
0
0
Have you ever had an idea for a great story? Bring it to life with Bolbeck Imagine.
13 days ago
0
0
0
My goal of making it easy for anyone to make a children’s book continues…
14 days ago
0
0
0
I had a great time presenting my talk about the good, the bad and the just plain cool of creating applications that use AI agents at The AI Collective + Chicago Java Users Group meetup today. We had great questions and insightful conversations.
3 months ago
0
1
0
reposted by
Juan Peredo
The New Stack
6 months ago
Agentic workflows are not a replacement for microservices; they serve as a new coordination layer built upon existing service invocation.
What Agentic Workflows Mean to Microservices Developers
https://bit.ly/3GtQaEk
0
1
1
reposted by
Juan Peredo
Luca Beurer-Kellner
7 months ago
👿 MCP is all fun, until you add this one malicious MCP server and forget about it. We have discovered a critical flaw in the widely-used Model Context Protocol (MCP) that enables a new form of LLM attack we term 'Tool Poisoning'. Leaks SSH key, API keys, etc. Details below 👇
1
14
9
I had a great time at the Arc of AI Conference. Lots of great people with great perspectives about AI. Plus I delivered my talk on lessons learned from building agentic apps! Thanks to the organizers who created such a well-organized event!
@arcofai.bsky.social
www.bolbeck.com/blogs/post/l...
Navigating the new frontier - Bolbeck LLC
Lessons from building a multi-modal agentic AI application
https://www.bolbeck.com/blogs/post/lessonsFromAgenticApps
6 months ago
0
2
2
Flying to Austin today for the Arc of AI conference! Excited to learn new concepts and have insightful conversations! Plus, on Thursday I give my talk and share what I have learnt while building AI apps.
@arcofai.bsky.social
7 months ago
0
1
2
reposted by
Juan Peredo
Arc of AI Conference
7 months ago
Attend
@juanperedotech.bsky.social
's session at Arc of AI Conference, to discuss lessons learned from the journey in creating agentic AI applications 🤖
www.arcofai.com/speaker/f631...
Don't miss out and get your tickets now! Use JOIN-JUAN-50OFF for discount 📅Mar 31-Apr 3. Austin TX 🎟️
arcofai.com
0
0
2
Great summary of the fantastic talks at AI Engineer Online Summit. Check it out & see which ones pique your interest. You could, for example, watch my talk: Lessons from building GenAI based applications 😇 Links to all the talks are in the summary.
www.linkedin.com/pulse/agent-...
Agent-Based Systems Have Arrived: AI Engineer Summit Online 2025
TL;DR: The AI Engineer Online Summit 2025 shows that AI agents are rapidly maturing. The talks had a strong sense of realism and value.
https://www.linkedin.com/pulse/agent-based-systems-have-arrived-ai-engineer-summit-online-pingali-fdj2c?utm_source=share&utm_medium=member_ios&utm_campaign=share_via
8 months ago
0
1
0
reposted by
Juan Peredo
TechCrunch
8 months ago
Google launches a free AI coding assistant with very high usage caps
On Tuesday, Google introduced a new, free consumer version of its AI code completion and assistance tool, Gemini Code Assist, and which the company calls Gemini Code Assist for Individuals. The company also rolled out Gemini Code Assist for GitHub, a code…
https://tcrn.ch/3EUKSkE
3
43
6
reposted by
Juan Peredo
luokai
8 months ago
In the face of competition from Claude 3.7 Sonnet in coding, Google is launching a free version of Gemini Code Assist globally. It comes with: - 180,000 code completions per month - Support for all programming languages in the public domain - 128,000 token context window
1
4
3
Are you intrigued by creating GenAI-based applications? I invite you to watch my video on the AI Engineer channel, where I share insights from my experience in developing GenAI apps. The video will prepare you for a successful start in your AI journey.
m.youtube.com/watch?v=YYcN...
Lessons from building GenAI based applications — Juan Peredo
YouTube video by AI Engineer
https://m.youtube.com/watch?v=YYcNm2RexnY
8 months ago
0
0
0
reposted by
Juan Peredo
Sung Kim
9 months ago
1/5 So, how did DeepSeek develop DeepSeek R1? They used both DeepSeek-V3-Base and a simple prompt: 1. They asked the same question multiple times to DeepSeek-V3-Base as a group. 2. They then graded the answers, assigning an accuracy score and a format score (e.g., <think></think>).
2
81
15
Happy to share that I will be presenting at the Arc of AI conference in April! Join me and a distinguished list of speakers in Austin. Let’s share ideas and make some friends!
@arcofai.bsky.social
www.arcofai.com
Arc of AI Conference
The conference to learn, apply, and improve our craft.
https://www.arcofai.com
9 months ago
0
1
1
reposted by
Juan Peredo
InfoQ
9 months ago
🚀 Discover the Best of
#SoftwareArchitecture
from 2024! Explore this must-read collection of exceptional articles published on
#InfoQ
last year! 💡 Stay informed. 🔥 Stay inspired. 🏆 And always
#StayAhead
of the curve! 👉 Knowledge is power! 💪 Thread 👇
1
8
3
reposted by
Juan Peredo
Corey “🎃Managed NAT Gateway👻” Quinn
10 months ago
Someone on Reddit searched for (the non-existent) John Wick 5, and Google spared no expense to lie to them.
111
4930
1748
reposted by
Juan Peredo
Kelsey Hightower
10 months ago
DeepSeek, a LLM trained for a fraction of the cost of GPT-Xx models, in 2 months for 6 million, on limited GPUs due to export restrictions, and competing head to head. This is crazy. It's not the AI part I'm excited about, it's the level of efficiency.
github.com/deepseek-ai/...
GitHub - deepseek-ai/DeepSeek-V3
Contribute to deepseek-ai/DeepSeek-V3 development by creating an account on GitHub.
https://github.com/deepseek-ai/DeepSeek-V3
10
269
47
reposted by
Juan Peredo
Daniel - Js Craft
10 months ago
Maybe one of the best explanations of what it means to be a true senior software dev and leader:
0
0
2
reposted by
Juan Peredo
Sung Kim
10 months ago
Migrating to uv by
@ivanleomk.bsky.social
He wrote a guide on how they migrated instructor from poetry to uv and documented the gotchas that they encountered. They saw around a 3x speedup and a ~67% reduction in CI pipeline execution time without any code changes.
python.useinstructor.com/blog/2024/12...
Migrating to uv - Instructor
How we migrated from poetry to uv
https://python.useinstructor.com/blog/2024/12/26/migrating-to-uv/
2
27
3
reposted by
Juan Peredo
Sung Kim
10 months ago
Confirmed. DeepSeek-V3 is coming soon and it is a bit better than Claude 3.5 Sonnet per the Aider LLM Leaderboard.
aider.chat/docs/leaderb...
1
25
6
According to the Pluralsight 2025 Tech Forecast, 20% of companies have deployed AI projects & 55% have plans to do so in 2025. However, 80% of all AI projects fail, which is twice the failure rate of other types of projects. Read the full report here:
www.pluralsight.com/content/dam/...
10 months ago
0
1
0
LangChain
@langchain.bsky.social
just released their State of AI report which is full of interesting tidbits like the top LLM providers and the top LLM-as-judge evaluation metrics. You can read the report here:
blog.langchain.dev/langchain-st...
LangChain State of AI 2024 Report
Dive into LangSmith product usage patterns that show how the AI ecosystem and the way people are building LLM apps is evolving.
https://blog.langchain.dev/langchain-state-of-ai-2024/
10 months ago
0
1
0
reposted by
Juan Peredo
LangChain
10 months ago
Introducing diff view in LangSmith's Prompt Hub 🔍 Easily compare changes between any two commits, review updates, or revert to earlier versions. Go to the Commits tab, toggle 'diff,' and track your prompt evolution — so you can iterate quickly and with control.
1
4
1
Here are the slides from my “Tips for building applications that use GenAI” talk at AICamp earlier this week. It was fun sharing some AI dev tidbits with the audience!
#GenAI
#AI
www.bolbeck.com/files/TipsFo...
https://www.bolbeck.com/files/TipsForGenDev.pdf
10 months ago
0
1
0
Had a great time presenting “Tips and tricks for building applications that use GenAI” at AICamp Chicago today! We had great questions and discussions throughout the talk!
#AI
10 months ago
0
0
0
reposted by
Juan Peredo
The New Stack
10 months ago
At AWS re:Invent, CTO Werner Vogels shared some fundamental lessons Amazon learned about system design.
Werner Vogels' 6 Lessons for Keeping Systems Simple
https://thenewstack.io/werner-vogels-6-lessons-for-keeping-systems-simple
1
2
1
Meta released Llama 3.3 70B, which rivals the performance of the existing Llama 3.1 405B even though it is less than 1/5 the size. Just as exciting, you can now run it on Groq.
@groqinc.bsky.social
groq.com/a-new-scalin...
A New Scaling Paradigm: Meta's Llama 3.3 70B Challenges "Death of Scaling Law" - Groq is Fast AI Inference
Model Offers Comparable Quality to 3.1 405B at Less Than 1/5th the Size
https://groq.com/a-new-scaling-paradigm-metas-llama-3-3-70b-challenges-death-of-scaling-law/
10 months ago
0
2
2
reposted by
Juan Peredo
Daniel Dominguez
10 months ago
Open-source
#LLMs
are essential for making
#AI
more accessible! Discover more about OLMo 2 in my latest
@infoq.bsky.social
check it out!
Ai2 Launches OLMo 2, a Fully Open-Source Foundation Model
The Allen Institute for AI research team has introduced OLMo 2, a new family of open-source language models available in 7 billion (7B) and 13 billion (13B) parameter configurations. Trained on up to ...
https://www.infoq.com/news/2024/12/olmo-2-ai2/
0
3
2
If anyone is curious about the new Amazon Nova models (and many other models), they are available on OpenRouter
openrouter.ai/amazon
Amazon - Models
Browse models from Amazon
https://openrouter.ai/amazon
10 months ago
0
0
0
Really enjoyed the keynote by
@werner.social
at re:Invent! It was filled with great insights like: - Make evolvability a requirement - Break complexity into pieces - Align organization to architecture - Organize into cells - Design predictable systems - Automate complexity. Best keynote at re:Invent.
10 months ago
0
0
0
AWS announced EKS Auto Mode to simplify the management of EKS Kubernetes clusters. Looks interesting and potentially very useful.
docs.aws.amazon.com/eks/latest/u...
Automate cluster infrastructure with EKS Auto Mode - Amazon EKS
Automate cluster infrastructure with EKS Auto Mode
https://docs.aws.amazon.com/eks/latest/userguide/automode.html
11 months ago
0
1
0
reposted by
Juan Peredo
Python Hub
11 months ago
smollm Everything about the SmolLM & SmolLM2 family of models.
https://github.com/huggingface/smollm
0
3
1
AWS just announced a new family of LLMs at re:Invent today called Nova. They seem to be competitively priced and their benchmarks seem pretty good, especially for a V1 product. We will soon find out if reality matches their excellent marketing :)
aws.amazon.com/ai/generativ...
Generative Foundation Model - Amazon Nova - AWS
Amazon Nova is a generation of state-of-the-art (SOTA) foundation model that delivers frontier intelligence and industry leading price-performance.
https://aws.amazon.com/ai/generative-ai/nova/
11 months ago
0
1
0
reposted by
Juan Peredo
The New Stack
11 months ago
This tutorial shows how to use asyncio to work around the limitations of Python’s global interpreter lock (GIL) to achieve efficient concurrent programming.
Circumventing Python's GIL With Asyncio
https://thenewstack.io/circumventing-pythons-gil-with-asyncio
0
0
1
reposted by
Juan Peredo
The New Stack
11 months ago
Vendor beefs up support for re-ranking models, sparse vector retrieval, and security features like audit logs, RBAC, and more.
Pinecone Revamps Retrieval Capabilities for Its Vector Database Platform
https://thenewstack.io/pinecone-revamps-retrieval-capabilities-for-its-vector-database-platform
0
1
1
reposted by
Juan Peredo
Svelte
11 months ago
🎅🎄🎁 ADVENT OF SVELTE 🎅🎄🎁 We're going to ship one thing a day every day from now until Christmas. Wish us luck!
svelte.dev/blog/advent-...
First up...
Advent of Svelte
Twenty-four days, twenty-four features
https://svelte.dev/blog/advent-of-svelte
19
554
96
Small is beautiful and better for the environment :) Choosing the correct LLM/SLLM for the correct task can help save money while still achieving the appropriate business goals.
11 months ago
0
1
1
Predictive task scaling in AWS ECS. Assuming this works as announced, it could help right-size your container tasks to more closely match demand. This in turn could potentially reduce cost and/or improve application response to traffic spikes.
aws.amazon.com/blogs/contai...
Optimize compute resources on Amazon ECS with Predictive Scaling | Amazon Web Services
This blog is co-authored by Jooyoung Kim, Senior Containers Specialist Solutions Architect, Abhishek Nautiyal, Senior Product Manager, Amazon ECS and Ankur Sethi, Senior Product Manager, Amazon EC2. I...
https://aws.amazon.com/blogs/containers/optimize-compute-resources-on-amazon-ecs-with-predictive-scaling/
11 months ago
0
1
0
To all of those celebrating, Happy Thanksgiving!
11 months ago
0
0
0
Menlo Ventures’ State of AI report shows, among other things, LLM market share by provider. Interestingly, OpenAI is still in the lead, but it has dropped by 16 percentage points year over year, while Anthropic has gained 12 points. Here is the report, which is full of interesting info about AI in 2024:
menlovc.com/2024-the-sta...
2024: The State of Generative AI in the Enterprise - Menlo Ventures
The enterprise AI landscape is being rewritten in real time. We surveyed 600 U.S. enterprise IT decision-makers to reveal the emerging winners and losers.
https://menlovc.com/2024-the-state-of-generative-ai-in-the-enterprise/
11 months ago
0
0
0