Recce - Trust, Verify, Ship
@datarecce.bsky.social
👤 46
👥 153
227
Helping data teams preview, validate, and ship data changes with confidence.
https://datarecce.io
pinned post!
Recce 1.0 is now live on Product Hunt!
www.producthunt.com/posts/recce-4
Upvote and leave a comment to help us grow the Recce community and bring better data review processes to more data teams. Thanks for your support!
#OpenSource
#Data
#DataEngineering
#Analytics
#DeveloperTools
#dbt
Recce - Explore, validate, and share data impact before merging | Product Hunt
Recce helps data teams discover actual data impact and turn insight into actionable checklists for dbt pull request reviews. It's a practical way to implement data best practices - know what's changin...
https://www.producthunt.com/posts/recce-4
10 months ago
0
4
0
New engineers at Kilo ship code on day one. MVP feature by end of week. In production the next week. "We ruthlessly hunt down bureaucracy." Scott Breitenother on Data Renegades
#AIAgents
#Engineering
about 15 hours ago
0
0
0
"These technologies, they're not robots replacing us. They're exoskeletons that make us better, faster, stronger." Scott Breitenother on Data Renegades
#DataEngineering
#AI
1 day ago
0
0
0
Scott Breitenother jumped into every Slack thread at Brooklyn Data. It made the company fast and him the bottleneck. His fix: subscribe to replies, don't comment, check back in 3 hours.
#Leadership
#DataTeams
6 days ago
0
0
0
Code changes have AI review tools. Data changes don't... until now. Our own Kent Chen wrote about how the team built a multi-agent system with Claude Agent SDK and MCP that reviews data changes in every dbt PR. Orchestrator + two specialist agents using 6 Recce MCP tools.
Designing Reliable AI Agents for dbt Data Reviews
Code changes have AI review tools. Data changes don't - until now. Here's how we went from a single prompt to an AI agent that performs the first pass on data validation in every PR.
https://blog.reccehq.com/designing-reliable-ai-agents-for-dbt-data-reviews
6 days ago
2
0
0
"The team still exists, but it's not five humans, it's one human and four agents." Scott Breitenother on why the unit of productivity has changed.
#DataEngineering
#AIAgents
8 days ago
1
0
0
Bryan Bischof's hot take: self-serve data analytics is the hype trend most likely to derail. "People are selling third base. I'm still on first base."
#DataEngineering
#Analytics
9 days ago
0
0
0
New episode of Data Renegades is live. Scott Breitenother built a 100-person data consultancy, watched himself become the bottleneck, and rebuilt everything at Kilo Code. Their data team is one person plus four AI agents.
https://youtu.be/qKFBaDWMxkk
13 days ago
1
0
0
"I shit you not. I found one where it gave three backpacks." Bryan Bischof on the production bug where out-of-distribution items sat near the null vector and showed up in every recommendation.
#MachineLearning
#DataScience
13 days ago
0
0
0
"Saying 'you are wrong' is not curious. Saying 'why are your priors different than what the data is showing' is curious." Bryan Bischof on Data Renegades
#DataScience
#DataCulture
15 days ago
0
0
0
Happy Valentine's Day to everyone who spent this week falling back in love with their data stack. Five days. Five companies. No slides. No safety nets. Thank you Greybeam, dltHub, Database Tycoon, & bauplan for doing this with us and shipping in front of an audience.
The Data Valentine Challenge | Recce
Join the Data Valentine Challenge! 5 days of quick, actionable challenges led by experts from Recce, Greybeam, dltHub, Database Tycoon, and Bauplan.
https://reccehq.com/data-valentine-week-challenge
16 days ago
1
2
0
"The best analytics asks more questions than it answers." Roger Magoulas on why dashboards shipped is the wrong success metric. The real measure is whether the work generates questions worth investigating. π§
www.heavybit.com/library/podc...
#dataengineering
#analytics
Data Renegades | Ep. #6, From Big Data to Curiosity-Driven Insight with Roger Magoulas | Heavybit
On episode 6 of Data Renegades, CL Kao and Dori Wilson of Recce speak with Roger Magoulas about the real bottlenecks holding data organizations back.
https://www.heavybit.com/library/podcasts/data-renegades/ep-6-from-big-data-to-curiosity-driven-insight-with-roger-magoulas
16 days ago
0
0
0
An AI agent tried to write to production. The lakehouse said no. Day 5 of Data Valentine Challenge: @BauplanLabs brought the finale. Aldrin let Claude Code build his entire pipeline from scratch - on transactional branches that caught every mistake 🧵
An AI Agent Built My Entire Data Pipeline. Here's How I Kept It From Breaking Production
Aldrin from Bauplan closed the Data Valentine Challenge with a demo where he didn't write a single line of pipeline code. Claude Code did all of it - importing satellite telemetry into a lakehouse,…
https://youtu.be/yzX05Z8FlYw
17 days ago
1
0
0
"The best code is the code you don't write. Or in this case, the code you delete." Day 4 of Data Valentine Challenge: Database Tycoon ran a live dbt makeover. Stephen volunteered his NYC transit project. Chloe walked the lineage and cut everything dead.
The Data Valentine Challenge | Recce
Join the Data Valentine Challenge! 5 days of quick, actionable challenges led by experts from Recce, Greybeam, dltHub, Database Tycoon, and Bauplan.
https://reccehq.com/data-valentine-week-challenge
18 days ago
1
1
0
"We didn't write a single line of Python." Day 3 of Data Valentine Challenge: Ashish from dltHub walked through the workspace workflow β GitHub API to DuckDB to reports, no code. One command. One prompt. Pipeline runs π§΅
The Data Valentine Challenge | Recce
Join the Data Valentine Challenge! 5 days of quick, actionable challenges led by experts from Recce, Greybeam, dltHub, Database Tycoon, and Bauplan.
https://reccehq.com/data-valentine-week-challenge
19 days ago
1
0
0
"Why don't these numbers match?" Kyle from @greybeam_ai opened Day 2 of Data Valentine Challenge with a scenario every data person knows too well. The fix? Query Snowflake, Google Sheets, and raw APIs, all in one SQL statement. π§΅
20 days ago
1
0
0
A feed broke. Roger Magoulas didn't notice. Tim O'Reilly called it out. His fix: a weekly "Synced" email confirming data freshness. Then he added insights. Soon it went company-wide. 🎧 Full episode:
youtu.be/z3E0Pi4XaLg
#dataengineering
#datateams
20 days ago
0
0
0
Tell your agents you love them. Seriously. CL opened Data Valentine Challenge with 300 benchmark trials testing emotional framing on coding agents. The question: what happens when you add "you totally got this, take your time, love you" to every prompt? 🧵
https://youtu.be/r7QXlb-p3-8
20 days ago
1
0
0
Phone numbers are seven digits because humans retain about four things plus or minus three. Roger Magoulas on why 47-tab dashboards don't work. Stakeholders will remember none of it. 🎧
www.heavybit.com/library/podc...
#analytics
#dataviz
22 days ago
0
0
0
What percentage of your data team's time goes to pipeline maintenance versus exploratory analysis? Roger Magoulas argues the ratio determines whether you generate insight or just keep the lights on. 🎧
youtu.be/z3E0Pi4XaLg
#dataengineering
#datateams
"Data Engineers Don't Know WHY They're Building" | Roger Magoulas
Roger Magoulas helped popularize "big data." He co-chaired the O'Reilly Strata Conference and worked with DJ Patil to frame what data science could become. He's been building data teams since the…
https://youtu.be/z3E0Pi4XaLg
23 days ago
0
0
0
Want a better data stack + love fun data-pun stickers? Join the Data Valentine Challenge Feb 9-13 🦾 Attend all 5 webinars + 1 social post = get a sticker. Sign up here:
reccehq.com/data-valenti...
#DataEngineering
#DataQuality
25 days ago
0
0
0
Next week: Fall in love with your data. The Data Valentine Challenge: 5 days, 5 companies, 5 real fixes. Data teams are drowning. You need quick wins. 30-60-min daily challenges that actually move the needle. Recce • Greybeam • dltHub • Database Tycoon • Bauplan. Details soon.
#DataEngineering
26 days ago
0
1
0
LLMs are non-deterministic. But the code they generate is deterministic. Roger Magoulas: prompt for SQL that monitors your pipelines. Run the code. Sidestep the hallucination problem. 🎧 Full episode:
www.heavybit.com/library/podc...
#llm
#dataengineering
27 days ago
0
0
0
"I had casually said, I think I could build a data warehouse in 90 days. My boss said, OK, go to it." Listen to Roger Magoulas on how curiosity and chance encounters beat formal training. π§
youtu.be/z3E0Pi4XaLg
#dataengineering
#analytics
#bigdata
"Data Engineers Don't Know WHY They're Building" | Roger Magoulas
Roger Magoulas helped popularize "big data." He co-chaired the O'Reilly Strata Conference and worked with DJ Patil to frame what data science could become. He's been building data teams since the…
https://youtu.be/z3E0Pi4XaLg
29 days ago
0
0
0
Data engineers are so buried in pipeline maintenance, feed monitoring, and DAG management that they cannot shift into insight mode. All reaction. No anticipation. Roger Magoulas on how the value gets crowded out.
30 days ago
0
0
0
Mode raised 80 million over 10 years and resisted being called a BI tool until the market forced consolidation. Lesson: technical tools drift toward BI because wall-to-wall deployments require value for non-technical users.
#BITools
#DataEngineering
about 1 month ago
0
0
0
New episode of the Data Renegades podcast is live with guest Roger Magoulas. He coined the term "big data" and has 30 years of experience building data teams. Check it out on your favorite podcast app, YouTube, or directly here:
www.heavybit.com/library/podc...
Data Renegades | Heavybit
Exploring data, code, culture, and everything in between.
https://www.heavybit.com/library/podcasts/data-renegades
about 1 month ago
0
0
0
Benn predicts BI tools get displaced by LLMs analyzing support tickets, not by better dashboards. Execs already decide based on customer stories they heard, not numbers. AI just makes qualitative analysis scale.
#AI
#DataEngineering
Watch the full clip:
youtube.com/shorts/vZjaL...
"CEOs Will Choose Support Ticket Summaries Over Your Dashboard" (clip)
This is a clip from the Data Renegades podcast. See the full video at https://youtu.be/azQPAb1V-1E and subscribe to the Data Renegades podcast wherever you get your podcasts. Data Renegades is…
https://youtube.com/shorts/vZjaLv7JaMY?feature=share
about 1 month ago
1
0
0
Mode's product strategy: Don't average 50 customer opinions into a Frankenstein feature set. Find the single most opinionated user who gets it and build exactly what they want. Character beats consensus.
#ProductStrategy
#DataEngineering
"Don't Average Customer Feedback. Find the One Who's Right." (clip)
This is a clip from the Data Renegades podcast. See the full video at https://youtu.be/azQPAb1V-1E and subscribe to the Data Renegades podcast wherever you get your podcasts. Data Renegades is…
https://youtube.com/shorts/zEroDPWKZx0
about 1 month ago
0
0
0
"You have to write it over and over until you can stand it. Your bar isn't too high. Getting through the point where it stops being fun is how you make it good." Benn Stancil shared this quote about writing, but it applies to building products.
about 1 month ago
1
0
0
We sat down with Benn Stancil (Co-founder of Mode). Here are the takeaways 🧵:
about 2 months ago
1
0
0
If you've ever nuked a production database, wondered why data teams can't share code like frontend teams, or want to know why Max thinks AI will hit data roles harder than software roles, listen to this. Full episode:
youtu.be/6dQntoiQBY8
#DataEngineering
#AIEngineering
The Creator of Airflow on Why Data Engineering Can't Share Code
Max built the airflow reset DB command. Someone ran it in production and nuked the database. This is what happened next and why code reuse in data engineering is fundamentally broken. Maxime…
https://youtu.be/6dQntoiQBY8
about 2 months ago
0
0
0
Max evangelized Airflow to over 50 companies before it gained real traction. One interaction at a time on GitHub plus being present in the community. Hear how he did it:
youtube.com/shorts/O2J0H...
#OpenSource
#CommunityBuilding
The Real Work Behind Building Open Source (clip)
How to build successful open source. Good repo, strong docs, and being present one interaction at a time. This is a clip from the Data Renegades podcast. See the full video at…
https://youtube.com/shorts/O2J0HJyTkkk
about 2 months ago
0
2
0
If you use Airflow or Superset in production, what's one feature you wish existed that would 10x your workflow? Drop your answer below.
#DataEngineering
#OpenSource
2 months ago
0
1
0
Hearing Max explain how he told Airbnb he would not work without building Airflow is a masterclass in knowing your leverage as an engineer. Watch:
youtube.com/shorts/-n833...
#CareerAdvice
#Airflow
How Max Negotiated Building Airflow at Airbnb (clip)
The negotiation that created Airflow. Max told Airbnb he would not work without the tools he needed. This is a clip from the Data Renegades podcast. See the full video at…
https://youtube.com/shorts/-n8335QlDRI
2 months ago
0
0
0
Data engineering has no npm. Every company rebuilds sessionization, DAU calculations, and experimentation frameworks because SQL dialects fragment everything and we lack unified data models for parametric pipelines. Watch Max explain:
youtube.com/shorts/zS5L_...
#DataEngineering
#CodeReuse
The Code Reuse Crisis Nobody Talks About (clip)
Why we can't share code in data engineering. Every company rebuilds the same pipelines because there's no npm for data. This is a clip from the Data Renegades podcast. See the full video at…
https://youtube.com/shorts/zS5L_2KcIKA
2 months ago
0
0
0
New Data Renegades episode is live!
@clkao.bsky.social
&
@doriwilson.com
sat down with
@bennstancil.bsky.social
founder of Mode. Listen on your fave podcast app to hear:
- Why the modern data stack was a distraction
- The identity crisis every BI tool faces
Or stream directly here:
Data Renegades | Ep. #5, The Identity Crisis of BI with Benn Stancil | Heavybit
On episode 5 of Data Renegades, CL Kao and Dori Wilson sit down with Benn Stancil to explore how data tools evolve, and sometimes lose their identity.
https://www.heavybit.com/library/podcasts/data-renegades/ep-5-the-identity-crisis-of-bi-with-benn-stancil
2 months ago
0
1
1
"I wanted to build something that matters. Open source gave me scope of impact beyond one company." Max on why he evangelized Airflow to 50 companies before momentum hit.
#OpenSource
#TechImpact
2 months ago
1
0
0
We sat down with Max Beauchemin, creator of Airflow and Superset. Here are the takeaways: 🧵
#DataEngineering
#Airflow
2 months ago
1
0
0
reposted by
Recce - Trust, Verify, Ship
Simon Willison
3 months ago
I'm in an episode of the new Data Renegades podcast with CL Kao and Dori Wilson talking about data journalism and Datasette and related topics - it's a fun conversation, here are some extracted highlights
simonwillison.net/2025/Nov/26/...
Highlights from my appearance on the Data Renegades podcast with CL Kao and Dori Wilson
I talked with CL Kao and Dori Wilson for an episode of their new Data Renegades podcast titled Data Journalism Unleashed with Simon Willison. I fed the transcript into Claude …
https://simonwillison.net/2025/Nov/26/data-renegades-podcast/
1
27
5
The data industry is noisy. We wanted to help create some quiet. 🤫 We are proud to launch the Data Renegades podcast today, the brainchild of our founder
@clkao.bsky.social
and Head of Data
@doriwilson.com
They set out to build the show they actually wanted to listen to: 🧵
3 months ago
1
0
0
Proud and excited to be hosting so many community events today & tomorrow in San Francisco for
@techweekbya16z.bsky.social
!
5 months ago
1
1
0
reposted by
Recce - Trust, Verify, Ship
Dori
5 months ago
hosting 3 events in sf this week 💪 tues 2-5:30pm: data + code unconference (you set the agenda) tues 4pm: happy hour w/ posthog + deskree (ice cream + pizza) wed 5pm: devex demos at convex (vercel, grafana, arthur ai, deskree, convex, recce) pure community vibes. links in thread
1
3
1
#SFTechWeek
calendar just dropped & we're doing something different at Recce. Instead of the slide decks + swag bags that most of these become, we're hosting events focused on real community. Thread on our lineup
6 months ago
1
0
0
Ad-hoc validation scripts accumulate from past incidents but don't transfer to new contexts. Under time constraints, data practitioners can only rely on validation scripts. Impact Radius addresses this challenge through metadata analysis alone.
cloud.reccehq.com
Recce Cloud
Generated by Recce Cloud
https://cloud.reccehq.com
6 months ago
0
0
0
Marketing reports conversion issues. Investigation approach matters: random data exploration vs. metadata-guided investigation. Click problematic column → column lineage shows derived or passthrough → trace upstream → identify real issue.
Building Impact Radius #3: Three Essential Workflows for Data Teams
After building Impact Radius, we realized showing the tool isn't enough. You need to see HOW it fits into your daily workflow.
https://blog.reccehq.com/building-impact-radius-3-three-essential-workflows-for-data-teams
6 months ago
0
1
0
Metadata analysis eliminates unnecessary validation queries. Data practitioners commonly validate dbt changes by checking row counts across all downstream models: 47 models generating significant warehouse costs to identify the 3 that actually changed. Try using metadata only.
6 months ago
0
1
0
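The metadata-only idea in the post above can be sketched in Python. This is a minimal illustration, not Recce's implementation: it assumes two parsed dbt `manifest.json` documents (a base and a current build) and relies on the `nodes`, `checksum`, and `child_map` fields of the manifest. The function names `modified_models` and `downstream_models` are hypothetical. A checksum difference only shows that a model's definition changed, so the result narrows which models are worth querying rather than proving that their data changed.

```python
from collections import deque


def modified_models(base_manifest, current_manifest):
    """Return unique_ids of models that are new or whose file checksum differs."""
    base_nodes = base_manifest.get("nodes", {})
    changed = set()
    for uid, node in current_manifest.get("nodes", {}).items():
        if node.get("resource_type") != "model":
            continue
        base = base_nodes.get(uid)
        new_sum = node.get("checksum", {}).get("checksum")
        old_sum = base.get("checksum", {}).get("checksum") if base else None
        if base is None or new_sum != old_sum:
            changed.add(uid)
    return changed


def downstream_models(manifest, roots):
    """Breadth-first walk of child_map, collecting models downstream of roots."""
    child_map = manifest.get("child_map", {})
    seen = set()
    queue = deque(roots)
    while queue:
        uid = queue.popleft()
        for child in child_map.get(uid, []):
            # Only follow model nodes; tests and exposures are skipped here.
            if child.startswith("model.") and child not in seen:
                seen.add(child)
                queue.append(child)
    return seen
```

Only the models returned by these two functions would then need a row-count or value check in the warehouse.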
💡 For viadukt, data accuracy isn't a nice-to-have. It's core to their product. "Now, with Recce Cloud, we've dramatically improved our ability to deliver reliable data and address issues before they impact our customers." - Pascal Biesenbach, CEO & Co-founder, viadukt
reccehq.com/case-study-v...
https://reccehq.com/case-study-viadukt
6 months ago
0
1
0
Column-level lineage emerges from standard dbt artifacts. Running `dbt run` and `dbt docs generate` produces artifacts that enable column-level lineage visualization and impact analysis.
cloud.reccehq.com
accepts dbt artifacts to demonstrate metadata analysis
6 months ago
0
0
0
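As a rough illustration of what the artifacts from the post above contain, the sketch below joins a parsed `manifest.json` (written by `dbt run`) with a parsed `catalog.json` (written by `dbt docs generate`). It assumes the standard `nodes`, `depends_on.nodes`, and `columns` fields. The function name `model_columns` is hypothetical. The output is only the raw input for column-level lineage: real lineage also requires parsing each model's SQL, which is not shown here.

```python
def model_columns(manifest, catalog):
    """Map each model's unique_id to its parent nodes and column types."""
    catalog_nodes = catalog.get("nodes", {})
    summary = {}
    for uid, node in manifest.get("nodes", {}).items():
        if node.get("resource_type") != "model":
            continue
        # Column names and types come from the catalog, parents from the manifest.
        columns = catalog_nodes.get(uid, {}).get("columns", {})
        summary[uid] = {
            "parents": list(node.get("depends_on", {}).get("nodes", [])),
            "columns": {name: col.get("type") for name, col in columns.items()},
        }
    return summary
```

In a default dbt project both files are written to the `target/` directory and can be loaded with `json.load` before being passed in.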
The validation need is universal. The setup capability varies significantly. Teams with robust infrastructure can implement comprehensive validation processes. Teams without DevOps support struggle. The gap creates an adoption barrier. Read more on closing this gap in our blog.
6 months ago
0
0
0
"The PRs created by John are always high quality. I can review them easily." Users love having data validation included in their PR process. But how easy a tool is to set up determines actual usage. Read more in our blog.
6 months ago
0
0
0