Endigest

© 2026 Endigest. All rights reserved.


Data Eng Articles

Explore real-world engineering experiences from top tech companies.


Trending This Week

#1
GitHub

Agent-driven development in Copilot Applied Science

11 views•2026-03-31
#2
The Hacker News

Vertex AI Vulnerability Exposes Google Cloud Data and Private Artifacts

9 views•2026-03-31
#3
Google Cloud

Spanner's multi-model advantage for the era of agentic AI

8 views•2026-03-31
#4
The Hacker News

TrueConf Zero-Day Exploited in Attacks on Southeast Asian Government Networks

8 views•2026-03-31
#5
Google Cloud

How AI-powered tools are driving the next wave of sustainable infrastructure and reporting

8 views•2026-03-31
#6
Databricks

What is a Cloud-Based Database Management System?

8 views•2026-03-25

Databricks
31 min read
Data Engineering•2026-02-18

Predictive Optimization at Scale: A Year of Innovation and What’s Next

This post covers Databricks' Predictive Optimization (PO) in Unity Catalog, which became the default platform behavior in 2025 for autonomous lakehouse table maintenance.

Platform
Product
Figma
241 min read
Data Engineering•2026-02-18

Redefining impact as a data scientist

This post explores how data science work at Figma's Billing infrastructure differs from traditional product analytics, offering five lessons on expanding impact in complex, correctness-driven domains.

Pinterest
281 min read
Data Engineering•2026-02-17

Drastically Reducing Out-of-Memory Errors in Apache Spark at Pinterest

This post describes Pinterest's Auto Memory Retries feature for Apache Spark, which automatically retries OOM-failed tasks on larger executors to reduce failures and resource waste.

engineering
data
pinterest
apache-spark
open-source
Supabase
01 min read
Data Engineering•2026-02-10

Hydra joins Supabase

The Hydra team, maintainers of the pg_duckdb extension, is joining Supabase to advance Postgres-native analytics capabilities.

Pinterest
21 min read
Data Engineering•2026-02-05

Next Generation DB Ingestion at Pinterest

Pinterest describes its next-generation database ingestion framework built on CDC, Kafka, Flink, Spark, and Iceberg to replace legacy batch-based pipelines.

pinterest
icebergs
change-data-capture
spark
engineering
Lyft
01 min read
Data Engineering•2026-01-06

Lyft’s Feature Store: Architecture, Optimization, and Evolution

This article describes the architecture, optimization, and evolution of Lyft's Feature Store, a core ML infrastructure platform serving 60+ use cases across the rideshare stack.

feature-engineering
machine-learning
features
feature-store
data-science
Cloudflare
21 min read
Data Engineering•2025-12-18

Announcing support for GROUP BY, SUM, and other aggregation queries in R2 SQL

Cloudflare announces support for GROUP BY, SUM, and other aggregation queries in R2 SQL, its serverless analytics query engine over R2 Data Catalog.

R2
Data
Edge Computing
Rust
Serverless
SQL
Grab
08 min read
Data Engineering•2025-12-18

How Grab is accelerating growth with real-time personalization using Customer Data Platform scenarios

Grab built 'Scenarios' in their CDP to enable real-time personalization beyond daily batch updates.

Database
FlinkSQL
Engineering
Supabase
01 min read
Data Engineering•2025-12-08

Introducing iceberg-js: A JavaScript Client for Apache Iceberg

This post introduces iceberg-js, a minimal JavaScript client for the Apache Iceberg REST Catalog API targeting JavaScript and TypeScript developers.

AWS
21 min read
Data Engineering•2025-12-02

Announcing replication support and Intelligent-Tiering for Amazon S3 Tables

Amazon S3 Tables announces Intelligent-Tiering storage and cross-region replication support for Apache Iceberg tables.

Amazon S3 Tables
Analytics
Announcements
Launch
News
Grab
09 min read
Data Engineering•2025-11-26

Real-time data quality monitoring: Kafka stream contracts with syntactic and semantic test

This post introduces Coban, Grab's platform for real-time Kafka stream data quality monitoring using user-defined data contracts with syntactic and semantic test rules.

Engineering
Kafka
Performance
Data science
Data processing
Real-time streaming
Data
Squareup
01 min read
Data Engineering•2024-09-30

Enhancing Data Quality Using Better Designed ETLs

This article presents an ETL design document template used at Square to improve data quality, team consistency, and documentation practices.

Data Science