Explore real-world engineering experiences from top tech companies.
Amazon S3 Tables adds Intelligent-Tiering storage and cross-region replication support for Apache Iceberg tables.
This post introduces Coban, Grab's platform for monitoring the data quality of Kafka streams in real time, using user-defined data contracts with syntactic and semantic test rules.
This article presents an ETL design document template used at Square to improve data quality, team consistency, and documentation practices.
PayPal shares how they reduced Apache Spark job cloud costs by up to 70% by migrating from CPU-based Spark 2 to GPU-accelerated Spark 3 using NVIDIA's Spark RAPIDS.
This post describes the peer review process for data science work developed at Square, drawing from both software code reviews and academic peer review traditions.
This post explains how to fetch and manipulate UK Bank Holidays JSON data using pandas in a Jupyter Notebook to produce a queryable DataFrame.