Recce - Trust, Verify, Ship
@datarecce.bsky.social
📤 50
📥 153
📝 258
Helping data teams preview, validate, and ship data changes with confidence.
https://datarecce.io
pinned post!
Recce 1.0 is now live on Product Hunt!
www.producthunt.com/posts/recce-4
Upvote and leave a comment to help us grow the Recce community and bring better data review processed to more data teams Thanks for your support!
#OpenSource
#Data
#DataEngineering
#Analytics
#DeveloperTools
#dbt
loading . . .
Recce - Explore, validate, and share data impact before merging | Product Hunt
Recce helps data teams discover actual data impact and turn insight into actionable checklists for dbt pull request reviews. It’s a practical way to implement data best practices - know what’s changin...
https://www.producthunt.com/posts/recce-4
about 1 year ago
0
4
0
Data review flagged 99.999% row-count variance. PR was two lines. Base: 5 years of prod history. Current: 1-hour CI build. False alarms train reviewers to scroll past. That's the damage.
blog.reccehq.com/session-base...
#dbt
#DataEngineering
loading . . .
Session Base per PR: Why Data Reviews Lie
Data PR review breaks when the base and current environments are built differently. Here is why, and how session base per PR fixes the false alarms.
https://blog.reccehq.com/session-base-per-pr-why-data-reviews-lie
3 days ago
0
0
0
90% of enterprise programmers spend their time on maintenance, not greenfield development. Michael Stonebraker's take on where AI actually earns its keep inverts the marketing narrative completely.
16 days ago
1
0
0
Michael Stonebraker was right about CODASYL. Right about NoSQL. Now he's run text-to-SQL on a real enterprise warehouse and got 10% accuracy against an 80% benchmark. The pattern is hard to ignore.
blog.reccehq.com/benchmarks-l...
#DataEngineering
#TextToSQL
#AI
loading . . .
Benchmarks Lie: What a Turing Award Winner Found When He Tested Text-to-SQL on Real Data
Text-to-SQL benchmarks show 80% accuracy. A Turing Award winner tested the same models on a real 1,400-table warehouse and got 10%. Here is why.
https://blog.reccehq.com/benchmarks-lie-what-a-turing-award-winner-found-when-he-tested-text-to-sql-on-real-data
17 days ago
0
0
0
Before you let agents touch your codebase, build these gates. Not because you don't trust the agent - but because you wouldn't trust anyone without them. Including yourself.
blog.reccehq.com/before-you-l...
#ClaudeCode
#AIAgents
#DevWorkflow
loading . . .
Before You Let Agents Touch Your Codebase, Build These Gates
Quality gates are what make Recce's agent-driven development actually work. Here's the pre-commit hooks, linting config, and review process keeping AI-written code production-ready.
https://blog.reccehq.com/before-you-let-agents-touch-your-codebase-build-these-gates
20 days ago
0
0
0
A wiki is something you look at. A shared AI system is something you work through.
#ClaudeCode
#DataEngineering
#BIP
blog.reccehq.com/we-didnt-set-out-to-build-a-team-ai-plugin
loading . . .
We Didn't Set Out to Build a Team AI Plugin
How Recce built a Claude Code plugin to share team knowledge, voice, product context, and workflows, across every AI session.
https://blog.reccehq.com/we-didnt-set-out-to-build-a-team-ai-plugin
about 1 month ago
0
0
0
"Pandas in 2011 was essentially book-driven development, quite literally."
@wesmckinney.com
wrote features because he needed them for the book chapter.
#pandasPython
#DataRenegades
loading . . .
about 1 month ago
0
0
0
AI coding tools generate plausible but wrong SQL constantly. The fix isn't waiting for a smarter model. AI skills are markdown files that encode domain knowledge into coding tools. No framework, just structured text in a repo.
loading . . .
A Practical Guide to AI Skills for Analytics Engineering
I built a self-improving AI skill system for analytics engineering at Recce. Here's the framework, a real bug it caught, and how we scaled it.
https://blog.reccehq.com/ai-skills-for-analytics-eng
about 1 month ago
1
0
0
Our own Kent Chen wrote up the multi-agent architecture the team built for Recce's AI Data Review. Single agent kept forgetting findings as PRs got complex. Fix: orchestrator + two specialists, each with its own 200k context window.
loading . . .
Designing Reliable AI Agents for dbt Data Reviews
Code changes have AI review tools. Data changes don' - until now. Here's how we went from a single prompt to an AI agent that performs the first pass on data validation in every PR.
https://blog.reccehq.com/designing-reliable-ai-agents-for-dbt-data-reviews
about 2 months ago
1
2
0
"If I had taken three years longer to do things the right way, it would have been too late." --
@wesmckinney.com
on why pandas shipped imperfect and won.
#pandasPython
#DataRenegades
loading . . .
about 2 months ago
0
0
0
"Arrow has the intricacy of a fine Swiss watch." The co-creator of Apache Arrow (
@wesmckinney.com
) on why AI agents cannot replicate decade-long infrastructure design.
#ApacheArrow
#DataRenegades
loading . . .
about 2 months ago
1
2
0
Starting soon: Bauplan x Recce live session on safe AI automation from branch to production.
luma.com/mm3gsalo?tk=...
#DataEngineering
#AI
loading . . .
Trusting AI with Your Data: Safe Automation from Branch to Production · Zoom · Luma
AI coding assistants already help you ship software faster. But when it comes to data, the stakes are different. One bad pipeline change can corrupt production…
https://luma.com/mm3gsalo?tk=S6sKK7
about 2 months ago
0
0
0
"This is bad. Why haven't you fixed this yet? I would have already fixed this today with Claude code." --
@wesmckinney.com
on radical accountability for software vendors.
#AIcodingagent
#DataRenegades
loading . . .
about 2 months ago
0
4
1
Tomorrow 9 AM PT | Bauplan and Recce walk through a branch-to-production workflow where AI-generated pipeline changes run on isolated branches and get reviewed automatically before hitting production.
luma.com/mm3gsalo?tk=...
loading . . .
Trusting AI with Your Data: Safe Automation from Branch to Production · Zoom · Luma
AI coding assistants already help you ship software faster. But when it comes to data, the stakes are different. One bad pipeline change can corrupt production…
https://luma.com/mm3gsalo?tk=S6sKK7
2 months ago
0
0
1
Why did no big retailers attend the fall JavaScript conference? Every employee was locked down for Black Friday prep. Roger Magoulas sat with someone from Target to find out. No dataset would have surfaced that. 🎧 Full episode:
youtu.be/z3E0Pi4XaLg
#analytics
#datascience
loading . . .
2 months ago
0
1
0
"I would just wake up and write Python code."
@wesmckinney.com
on the founder hours that created pandas.
#DataRenegades
#pandasPython
loading . . .
2 months ago
0
0
0
"If you have a successful career built on glazing people, I kind of hate you." Bryan Bischof on why truth-seeking is non-negotiable for data teams.
#DataScience
#DataCulture
loading . . .
2 months ago
0
0
0
The most predictive feature in the world's first coffee recommender? Favorite salad dressing. The baristas knew. Bryan Bischof on Data Renegades
#DataScience
#MachineLearning
loading . . .
2 months ago
0
0
0
"When an escalator breaks, it just becomes stairs. When your data workload fails, it's often just stale data." Scott Breitenother on why most data failures aren't as bad as they feel.
#DataEngineering
#DataReliability
loading . . .
2 months ago
0
0
0
CL Kao takes the stage at the DataTune Conf tomorrow: "A Practical Playbook for Building Data Agents (That Don't Break Your Pipeline)" 288 benchmark trials. Three context problems every data agent hits. CL and
@doriwilson.com
will be at the Recce booth all day. Come say hey.
#DataTune
2 months ago
0
0
0
Building Recce's AI Data Review meant working around three hard limits in Claude: 200k context window, ~90k single prompt, 25k per MCP tool response. 🧵
loading . . .
Designing Reliable AI Agents for dbt Data Reviews
Code changes have AI review tools. Data changes don' - until now. Here's how we went from a single prompt to an AI agent that performs the first pass on data validation in every PR.
https://blog.reccehq.com/designing-reliable-ai-agents-for-dbt-data-reviews
2 months ago
1
0
0
Wes McKinney built pandas in a mouse-infested NYC apartment on founder hours. Now he runs parallel Claude Code sessions and says AI is forcing "radical accountability" on every software vendor shipping mediocre products. Full conversation:
youtu.be/Uso8-yaERkE
#DataRenegades
#pandas
#ApacheArrow
2 months ago
0
1
0
"The data team can't afford to be two years behind. You need to get that exoskeleton going now." Scott Breitenother on why full-stack data people using AI agents are the new operating model.
#DataEngineering
#AIAgents
loading . . .
2 months ago
0
2
0
New engineers at Kilo ship code on day one. MVP feature by end of week. In production the next week. "We ruthlessly hunt down bureaucracy." Scott Breitenother on Data Renegades
#AIAgents
#Engineering
loading . . .
3 months ago
0
0
0
"These technologies, they're not robots replacing us. They're exoskeletons that make us better, faster, stronger." Scott Breitenother on Data Renegades
#DataEngineering
#AI
loading . . .
3 months ago
0
0
0
Scott Breitenother jumped into every Slack thread at Brooklyn Data. It made the company fast and him the bottleneck. His fix: subscribe to replies, don't comment, check back in 3 hours.
#Leadership
#DataTeams
loading . . .
3 months ago
0
0
0
Code changes have AI review tools. Data changes don't... until now. Our own Kent Chen wrote about how the team built a multi-agent system with Claude Agent SDK and MCP that reviews data changes in every dbt PR. Orchestrator + two specialist agents using 6 Recce MCP tools.
loading . . .
Designing Reliable AI Agents for dbt Data Reviews
Code changes have AI review tools. Data changes don' - until now. Here's how we went from a single prompt to an AI agent that performs the first pass on data validation in every PR.
https://blog.reccehq.com/designing-reliable-ai-agents-for-dbt-data-reviews
3 months ago
2
0
0
"The team still exists, but it's not five humans, it's one human and four agents." Scott Breitenother on why the unit of productivity has changed.
#DataEngineering
#AIAgents
loading . . .
3 months ago
1
0
0
Bryan Bischof's hot take: self-serve data analytics is the hype trend most likely to derail. "People are selling third base. I'm still on first base."
#DataEngineering
#Analytics
loading . . .
3 months ago
0
0
0
New episode of Data Renegades is live Scott Breitenother built a 100-person data consultancy, watched himself become the bottleneck, and rebuilt everything at Kilo Code Their data team is one person plus four AI agents.
loading . . .
https://youtu.be/qKFBaDWMxkk
3 months ago
1
0
0
"I shit you not. I found one where it gave three backpacks." Bryan Bischof on the production bug where out-of-distribution items sat near the null vector and showed up in every recommendation.
#MachineLearning
#DataScience
loading . . .
3 months ago
0
0
0
"Saying 'you are wrong' is not curious. Saying 'why are your priors different than what the data is showing' is curious." Bryan Bischof on Data Renegades
#DataScience
#DataCulture
loading . . .
3 months ago
0
0
0
Happy Valentine's Day to everyone who spent this week falling back in love with their data stack. Five days. Five companies. No slides. No safety nets. Thank you Greybeam, dltHub, Database Tycoon, & bauplan for doing this with us and shipping in front of an audience.
loading . . .
The Data Valentine Challenge | Recce
Join the Data Valentine Challenge! 5 days of quick, actionable challenges led by experts from Recce, Greybeam, dltHub, Database Tycoon, and Bauplan.
https://reccehq.com/data-valentine-week-challenge
3 months ago
1
2
0
"The best analytics asks more questions than it answers." Roger Magoulas on why dashboards shipped is the wrong success metric. The real measure is whether the work generates questions worth investigating. 🎧
www.heavybit.com/library/podc...
#dataengineering
#analytics
loading . . .
Data Renegades | Ep. #6, From Big Data to Curiosity-Driven Insight with Roger Magoulas | Heavybit
On episode 6 of Data Renegades, CL Kao and Dori Wilson of Recce speak with Roger Magoulas about the real bottlenecks holding data organizations back.
https://www.heavybit.com/library/podcasts/data-renegades/ep-6-from-big-data-to-curiosity-driven-insight-with-roger-magoulas
3 months ago
0
0
0
An AI agent tried to write to production. The lakehouse said no. Day 5 of Data Valentine Challenge: @BauplanLabs brought the finale. Aldrin let Claude Code build his entire pipeline from scratch — on transactional branches that caught every mistake 🧵
loading . . .
An AI Agent Built My Entire Data Pipeline. Here's How I Kept It From Breaking Production
Aldrin from Bauplan closed the Data Valentine Challenge with a demo where he didn't write a single line of pipeline code. Claude Code did all of it — importing satellite telemetry into a lakehouse,…
https://youtu.be/yzX05Z8FlYw
3 months ago
1
0
0
"The best code is the code you don't write. Or in this case, the code you delete." Day 4 of Data Valentine Challenge: Database Tycoon ran a live dbt makeover. Stephen volunteered his NYC transit project. Chloe walked the lineage and cut everything dead.
loading . . .
The Data Valentine Challenge | Recce
Join the Data Valentine Challenge! 5 days of quick, actionable challenges led by experts from Recce, Greybeam, dltHub, Database Tycoon, and Bauplan.
https://reccehq.com/data-valentine-week-challenge
3 months ago
1
1
0
"We didn't write a single line of Python." Day 3 of Data Valentine Challenge: Ashish from dltHub walked through the workspace workflow — GitHub API to DuckDB to reports, no code. One command. One prompt. Pipeline runs 🧵
loading . . .
The Data Valentine Challenge | Recce
Join the Data Valentine Challenge! 5 days of quick, actionable challenges led by experts from Recce, Greybeam, dltHub, Database Tycoon, and Bauplan.
https://reccehq.com/data-valentine-week-challenge
3 months ago
1
0
0
"Why don't these numbers match?" Kyle from @greybeam_ai opened Day 2 of Data Valentine Challenge with a scenario every data person knows too well. The fix? Query Snowflake, Google Sheets, and raw APIs, all in one SQL statement. 🧵
3 months ago
1
0
0
A feed broke. Roger Magoulas didn't notice. Tim O'Reilly called it out. His fix: a weekly "Synced" email confirming data freshness. Then he added insights. Soon it went company-wide. 🎧 Full episode:
youtu.be/z3E0Pi4XaLg
#dataengineering
#datateams
loading . . .
3 months ago
0
0
0
Tell your agents you love them. Seriously. 💕 CL opened Data Valentine Challenge with 300 benchmark trials testing emotional framing on coding agents. The question: what happens when you add "you totally got this, take your time, love you" to every prompt? 🧵
loading . . .
- YouTube
Enjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.
https://youtu.be/r7QXlb-p3-8
3 months ago
1
0
0
Phone numbers are seven digits because humans retain about four things plus or minus three. Roger Magoulas on why 47-tab dashboards don't work. Stakeholders will remember none of it. 🎧
www.heavybit.com/library/podc...
#analytics
#dataviz
3 months ago
0
0
0
What percentage of your data team's time goes to pipeline maintenance versus exploratory analysis? Roger Magoulas argues the ratio determines whether you generate insight or just keep the lights on. 🎧
youtu.be/z3E0Pi4XaLg
#dataengineering
#datateams
loading . . .
"Data Engineers Don't Know WHY They're Building" | Roger Magoulas
Roger Magoulas helped popularize "big data." He co-chaired the O'Reilly Strata Conference and worked with DJ Patil to frame what data science could become. He's been building data teams since the…
https://youtu.be/z3E0Pi4XaLg
3 months ago
0
0
0
Want a better data stack + love fun data-pun stickers? Join the Data Valentine Challenge Feb 9–13 🦾 Attend all 5 webinars + 1 social post = get a sticker. Signup here:
reccehq.com/data-valenti...
#DataEngineering
#DataQuality
3 months ago
0
0
0
Next week: Fall in love with your data 💕 The Data Valentine Challenge 💘 5 days, 5 companies, 5 real fixes Data teams are drowning. You need quick wins. 30-60 -min daily challenges that actually move the needle. Recce • Greybeam • dltHub • Database Tycoon • Bauplan Details soon 👀
#DataEngineering
3 months ago
0
1
0
LLMs are non-deterministic. But the code they generate is deterministic. Roger Magoulas: prompt for SQL that monitors your pipelines. Run the code. Sidestep the hallucination problem. 🎧 Full episode:
www.heavybit.com/library/podc...
#llm
#dataengineering
loading . . .
3 months ago
0
0
0
"I had casually said, I think I could build a data warehouse in 90 days. My boss said, OK, go to it." Listen to Roger Magoulas on how curiosity and chance encounters beat formal training. 🎧
youtu.be/z3E0Pi4XaLg
#dataengineering
#analytics
#bigdata
loading . . .
"Data Engineers Don't Know WHY They're Building" | Roger Magoulas
Roger Magoulas helped popularize "big data." He co-chaired the O'Reilly Strata Conference and worked with DJ Patil to frame what data science could become. He's been building data teams since the…
https://youtu.be/z3E0Pi4XaLg
3 months ago
0
0
0
Data engineers are so buried in pipeline maintenance, feed monitoring, and DAG management that they cannot shift into insight mode. All reaction. No anticipation. Roger Magoulas on how the value gets crowded out.
loading . . .
3 months ago
0
0
0
Mode raised 80 million over 10 years and resisted being called a BI tool until the market forced consolidation. Lesson: technical tools drift toward BI because wall-to-wall deployments require value for non-technical users.
#BITools
#DataEngineering
4 months ago
0
0
0
New episode of Data Renegades podcast is live with guest Roger Magoulas. He coined the term "big data" and has 30 experience years of building data teams. Check it out on your favorite podcast app, youtube, or directly here:
www.heavybit.com/library/podc...
loading . . .
Data Renegades | Heavybit
Exploring data, code, culture, and everything in between.
https://www.heavybit.com/library/podcasts/data-renegades
4 months ago
0
0
0
Benn predicts BI tools get displaced by LLMs analyzing support tickets, not by better dashboards. Execs already decide based on customer stories they heard, not numbers. AI just makes qualitative analysis scale.
#AI
#DataEngineering
Watch the full clip:
youtube.com/shorts/vZjaL...
loading . . .
"CEOs Will Choose Support Ticket Summaries Over Your Dashboard" (clip)
This is a clip from the Data Renegades podcast. See the full video at https://youtu.be/azQPAb1V-1E and subscribe to the Data Renegades podcast wherever you get your podcasts. Data Renegades is…
https://youtube.com/shorts/vZjaLv7JaMY?feature=share
4 months ago
1
0
0
Mode's product strategy: Don't average 50 customer opinions into a Frankenstein feature set. Find the single most opinionated user who gets it and build exactly what they want. Character beats consensus.
#ProductStrategy
#DataEngineering
loading . . .
"Don't Average Customer Feedback. Find the One Who's Right." (clip)
This is a clip from the Data Renegades podcast. See the full video at https://youtu.be/azQPAb1V-1E and subscribe to the Data Renegades podcast wherever you get your podcasts. Data Renegades is…
https://youtube.com/shorts/zEroDPWKZx0
4 months ago
0
0
0
Load more
feeds!
log in