Explore real-world engineering experiences from top tech companies.
Receive daily AI-curated summaries of engineering articles from top tech companies worldwide.
This post introduces continuous checkpointing in Orbax and MaxText, a feature designed to maximize training reliability and I/O utilization with minimal performance overhead.
This post describes how Lyft built a Bayesian hierarchical tree model to predict rider conversion in real-time under sparse data conditions.
Tevogen Bio partnered with Microsoft and Databricks to accelerate drug discovery using AI and large-scale data engineering.
Airbnb explains how COVID broke their booking-to-trip forecasting models and the architectural changes they built to handle structural data shifts.
This guide covers the most widely adopted MLOps frameworks and how to evaluate them for production machine learning deployments.
This post presents a pipeline to fine-tune a domain-specific embedding model in under a day on a single GPU without manual labeling.
This article details the technical architecture of Facebook's Friend Bubbles feature on Reels, which surfaces content that friends have liked or reacted to.
This article provides a comprehensive overview of data science, covering its core skills, career roles, and education pathways.
Airbnb describes how they built a destination recommendation model to help exploratory users in the trip planning stage discover and narrow down travel destinations.
This post surveys 16 open-source reinforcement learning libraries to understand how they implement asynchronous training architectures that decouple inference from training.
Hugging Face introduces Storage Buckets, a mutable S3-like object storage system on the Hub designed for intermediate ML artifacts such as checkpoints, optimizer states, and processed datasets.
This post explains Ulysses Sequence Parallelism (SP), a technique from Snowflake AI Research for training LLMs on sequences up to millions of tokens by distributing attention computation across GPUs.