Erfan Hesami
@hopefanhe.bsky.social
📤 29
📥 36
📝 10
Data Engineer Substack:
https://pipeline2insights.substack.com
reposted by
Erfan Hesami
Pipeline To Insights
7 months ago
Data Compression in SQL In this post, we’ll explore: - What is Data Compression? - Benefits of Data Compression in SQL. - Types of Compression in SQL. - Comparison of SQL Compression Techniques. - When to Use Compression and When to Avoid.
#databs
#datasky
loading . . .
Data Compression in SQL
How to Store More and Query Faster in SQL
https://pipeline2insights.substack.com/p/data-compression-in-sql-olap-oltp
1
3
2
Explore key data modelling concepts for databases and data warehouses, including: - Normalisation vs. Denormalisation - 3NF - Dimensional Modeling - Star vs. Snowflake Schema comparisons.
#databs
#datasky
loading . . .
Data Modelling Fundamentals: Normalisation, 3NF and Dimensional Modelling
Normalisation, 3NF, and dimensional modelling, with insights into Star and Snowflake schemas for efficient database and warehouse design
https://open.substack.com/pub/pipeline2insights/p/key-concepts-in-data-modeling-for-data-engineers?utm_source=app-post-stats-page
9 months ago
0
0
0
reposted by
Erfan Hesami
Pipeline To Insights
9 months ago
11 Storage Formats for Data Engineers Efficient data starts with the right storage format. Explore 11 formats every data engineer should know to match workloads and scale seamlessly. Highlights: Row & Columnar, Key-Value, Document, Graph, Time-Series, Hybrid.
#databs
#datasky
loading . . .
11 Storage Formats for Data Engineers
How to leverage storage formats for efficient and scalable data systems
https://pipeline2insights.substack.com/p/11-storage-formats-for-data-engineers
0
5
3
reposted by
Erfan Hesami
Pipeline To Insights
10 months ago
Week 6 of '100 Days of SQL Optimisation': Focused on DuckDB, leveraging columnar storage, sorted data, temp tables, Parquet, and optimal data types to boost efficiency. See how in-memory execution and smart structures enhance query performance!
@duckdb.org
#dataBS
#datasky
#duckdb
loading . . .
Week #6: 100 Days of SQL Optimisation
Exploring DuckDB and Its Capabilities
https://open.substack.com/pub/pipeline2insights/p/week-6-100-days-of-sql-optimisation?utm_source=app-post-stats-page&r=p5bpr&utm_medium=ios
1
4
2
reposted by
Erfan Hesami
Pipeline To Insights
10 months ago
We are starting a 32-week Data Engineering Interview Guide program, covering everything from fundamentals to advanced topics, with sessions every Saturday. Do you think we're missing any critical topics? We're curious about your opinions😊
#dataBS
#datasky
loading . . .
Week 0/32 - A Comprehensive Data Engineering Interview Preparation Guide
Join us every Saturday on This New Journey
https://open.substack.com/pub/pipeline2insights/p/week-032-a-comprehensive-data-engineering?utm_source=app-post-stats-page&r=p5bpr&utm_medium=ios
0
4
3
reposted by
Erfan Hesami
Pipeline To Insights
10 months ago
As a Data Engineer, understanding the data storage lifecycle and data retention policies is critical for designing efficient, cost-effective, and compliant data systems.
@joereis.bsky.social
#dataBS
#datasky
substack.com/@pipeline2in...
loading . . .
0
7
2
reposted by
Erfan Hesami
Pipeline To Insights
10 months ago
In our new post, we've covered 10 of the most popular data pipeline design patterns. We’d love to hear your thoughts. For more details, please check out the full post created by (
@hgeren.bsky.social
and
@hopefanhe.bsky.social
):
open.substack.com/pub/pipeline...
#dataBS
#datasky
loading . . .
10 Pipeline Design Patterns for Data Engineers
How to leverage Design Patterns for scalable and efficient data pipelines
https://open.substack.com/pub/pipeline2insights/p/10-pipeline-design-patterns-for-data?r=p5bpr&utm_campaign=post&utm_medium=web
0
3
2
reposted by
Erfan Hesami
Pipeline To Insights
10 months ago
Discover how dlt simplifies data ingestion. Learn its origins and real-world use cases. Follow a step-by-step guide to build your first pipeline and join the growing dlt community!
@matthausk.bsky.social
@datateam.bsky.social
@hgeren.bsky.social
@hopefanhe.bsky.social
#dataBS
#datasky
loading . . .
Introduction to data load tool (dlt): A Python Library for Simple Data Ingestion
Discover the basics of dlt and its role in modern data engineering workflows
https://open.substack.com/pub/pipeline2insights/p/introduction-to-data-load-tool-dlt?utm_source=app-post-stats-page&r=p5bpr&utm_medium=ios
2
9
3
reposted by
Erfan Hesami
Pipeline To Insights
11 months ago
Storage is at the heart of Data Engineering. In this post, we explore the hierarchy of data storage from the ground up, drawing inspiration from Fundamentals of Data Engineering by
@joereis.bsky.social
and Matt Housley, as well as insights from the DE Professionals on Coursera.
#dataBS
#datasky
loading . . .
Storage Fundamentals For Data Engineers
Why organised and durable storage is the cornerstone of Data Engineering?
https://open.substack.com/pub/pipeline2insights/p/storage-fundamentals-every-data-engineer?utm_source=app-post-stats-page&r=p5bpr&utm_medium=ios
3
16
2
My curiosity about the data life cycle led me to explore Data Engineering.Reading Fundamentals of DE book built my foundation, while pioneers like Zach Wilson and Joe Reis inspired me. Community engagement was key to my success, enabling learning, sharing,and growth in this journey.
#databs
#datasky
loading . . .
From Analytics to Data Engineering: A Career Journey
How curiosity and continuous learning drive the transition from Analytics to Data Engineering
https://open.substack.com/pub/pipeline2insights/p/from-analytics-to-data-engineering?r=p5bpr&utm_campaign=post&utm_medium=web
11 months ago
0
6
0
reposted by
Erfan Hesami
Hasan Geren
11 months ago
Hey
#dataBS
and
#datasky
folks, Our new post about "how understanding Big O Notation & Execution Plans can optimize SQL queries" has just been posted. Check it out if you're interested, and we'd love to hear your thoughts!
@hopefanhe.bsky.social
open.substack.com/pub/pipeline...
loading . . .
SQL Behind the Curtain: How Are Queries Executed?
Explore the journey of your SQL query guided by execution plans
https://open.substack.com/pub/pipeline2insights/p/sql-behind-the-curtain-how-are-queries?r=p5bpr&utm_campaign=post&utm_medium=web
1
8
2
The last step in the data engineering lifecycle is serving data, making it accessible for analytics, machine learning, and Reverse ETL. In our latest post, we break down how data engineers play a crucial role in: - Analytics - ML - Reverse ETL
#databs
#datasky
open.substack.com/pub/pipeline...
loading . . .
From Data to Decisions: A guide to serving insights for maximum impact
Master the final step in data engineering, serving data, to empower stakeholders, drive smarter decisions, and transform insights into action.
https://open.substack.com/pub/pipeline2insights/p/from-data-to-decisions-a-guide-to?utm_source=app-post-stats-page&r=p5bpr&utm_medium=ios
11 months ago
0
4
0
Free 6-week Bootcamp with new video every day from November 15th to December 31st 2024 by
@eczachly.bsky.social
Watch the launch video here:https://www.youtube.com/watch?v=myhe0LXpCeo Don’t miss it Thank you Zach 🙏
#dataBS
#datasky
11 months ago
0
2
0
Week 1 of "100 Days of SQL Optimisation" covered key techniques like column selection, multicolumn indexes, filtering, window functions, Rank, CTE and composite indexes with IMDb data. Check out the full post for more!
@hgeren.bsky.social
#dataBS
#datasky
loading . . .
Week #1: 100 Days of SQL Optimisation
How Small Tweaks Transformed Our Queries, Saving Time and Resources
https://open.substack.com/pub/pipeline2insights/p/week-1-100-days-of-sql-optimisation?utm_source=app-post-stats-page&r=p5bpr&utm_medium=ios
11 months ago
0
6
1
you reached the end!!
feeds!
log in