Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
dataengineering
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
From Script to Spreadsheet: Building a Self-Serve Etsy Competitor Tracker
Jerry A. Henley
Jerry A. Henley
Jerry A. Henley
Follow
Feb 17
From Script to Spreadsheet: Building a Self-Serve Etsy Competitor Tracker
#
webscraping
#
devops
#
python
#
dataengineering
2
reactions
Comments
Add Comment
5 min read
Building a 'Data-on-Demand' Microservice: Wrapping Alibaba Scrapers for Internal Tools
Robert N. Gutierrez
Robert N. Gutierrez
Robert N. Gutierrez
Follow
Feb 17
Building a 'Data-on-Demand' Microservice: Wrapping Alibaba Scrapers for Internal Tools
#
webscraping
#
scraper
#
python
#
dataengineering
2
reactions
Comments
Add Comment
5 min read
Part 2: dbt Project Structure & Building Models 📁
Abdelrahman Adnan
Abdelrahman Adnan
Abdelrahman Adnan
Follow
Feb 16
Part 2: dbt Project Structure & Building Models 📁
#
data
#
dataengineering
#
sql
#
tutorial
Comments
Add Comment
4 min read
# Module 4 Summary - Analytics Engineering with dbt
Abdelrahman Adnan
Abdelrahman Adnan
Abdelrahman Adnan
Follow
Feb 16
# Module 4 Summary - Analytics Engineering with dbt
#
analytics
#
dataengineering
#
sql
#
tutorial
Comments
Add Comment
2 min read
Part 3: Testing, Documentation & Deployment 🚀
Abdelrahman Adnan
Abdelrahman Adnan
Abdelrahman Adnan
Follow
Feb 16
Part 3: Testing, Documentation & Deployment 🚀
#
analytics
#
dataengineering
#
sql
#
tutorial
Comments
Add Comment
5 min read
Machine Learning Starts With a WHERE Clause
Brittany
Brittany
Brittany
Follow
Feb 16
Machine Learning Starts With a WHERE Clause
#
dataengineering
#
datascience
#
machinelearning
#
sql
1
reaction
Comments
Add Comment
2 min read
Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance
Bato
Bato
Bato
Follow
Feb 16
Pandas 3.0's PyArrow String Revolution: A Deep Dive into Memory and Performance
#
ai
#
datascience
#
dataengineering
#
machinelearning
2
reactions
Comments
Add Comment
6 min read
How We Built a Deterministic File Import Pipeline in TypeScript (CSV, XLSX, ZIP)
Rakesh
Rakesh
Rakesh
Follow
Feb 16
How We Built a Deterministic File Import Pipeline in TypeScript (CSV, XLSX, ZIP)
#
backend
#
dataengineering
#
saas
#
typescript
Comments
Add Comment
2 min read
AWS Data Engineer Associate (DEA-C01): What Each Domain Actually Tests (From Someone Who Just Passed)
ExamCert.App
ExamCert.App
ExamCert.App
Follow
Feb 15
AWS Data Engineer Associate (DEA-C01): What Each Domain Actually Tests (From Someone Who Just Passed)
#
aws
#
cloud
#
certification
#
dataengineering
Comments
Add Comment
2 min read
Why Most Data Governance Tools Miss the Real Relationships — and What to Do About It
Hello Arisyn
Hello Arisyn
Hello Arisyn
Follow
Feb 16
Why Most Data Governance Tools Miss the Real Relationships — and What to Do About It
#
datagovernance
#
dataengineering
#
dataarchitecture
#
datainfrastructure
Comments
Add Comment
2 min read
11 Compaction Optimizations for Iceberg Data Lakes
David
David
David
Follow
Feb 16
11 Compaction Optimizations for Iceberg Data Lakes
#
dataengineering
#
iceberg
#
snowflake
1
reaction
Comments
Add Comment
25 min read
Under the Hood of Arisyn: How Statistical Field Fingerprinting Enables Deterministic Data Linking
Hello Arisyn
Hello Arisyn
Hello Arisyn
Follow
Feb 15
Under the Hood of Arisyn: How Statistical Field Fingerprinting Enables Deterministic Data Linking
#
dataengineering
#
dataarchitecture
#
databasesystems
#
ai
Comments
Add Comment
2 min read
ELI25: Apache Kafka Quick Notes for Interviews
Hayden Cordeiro
Hayden Cordeiro
Hayden Cordeiro
Follow
Feb 15
ELI25: Apache Kafka Quick Notes for Interviews
#
architecture
#
dataengineering
#
distributedsystems
#
interview
Comments
Add Comment
4 min read
Postmortem: Eliminating OOM Failures in Spark on Kubernetes (Azure) After Cloud Migration
Pranav Bhasker
Pranav Bhasker
Pranav Bhasker
Follow
Feb 15
Postmortem: Eliminating OOM Failures in Spark on Kubernetes (Azure) After Cloud Migration
#
dataengineering
#
kubernetes
#
cloud
#
spark
Comments
Add Comment
5 min read
We All Accepted the "Python Tax.", Pandas 3.0 Just Reduced It.
Bato
Bato
Bato
Follow
Feb 15
We All Accepted the "Python Tax.", Pandas 3.0 Just Reduced It.
#
python
#
pandas
#
performance
#
dataengineering
2
reactions
Comments
Add Comment
2 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account