Explore real-world engineering experiences from top tech companies.
Receive daily AI-curated summaries of engineering articles from top tech companies worldwide.
Transformers.js v4 preview is now available on NPM, bringing a new WebGPU runtime, build system overhaul, and expanded model support.
This post demonstrates finetuning FunctionGemma with Tunix, a JAX-based LLM post-training library, on Google TPUs.
Spotify's ads team describes how they re-architected their serving stack to replace the Two-Tower model with more expressive neural networks capable of deep feature interactions.
Pinterest's Ads team developed transformer-based behavioral sequence models to improve ad candidate generation using users' offsite activity history.
This post identifies a late-phase instability mechanism in production-scale reinforcement learning for tool-using agents, caused by tool-conditioned variance amplification.
Meta introduces the User True Interest Survey (UTIS) model to improve Facebook Reels recommendations by incorporating direct user feedback beyond traditional engagement signals.
Pinterest introduces PinLanding, a production pipeline that uses multimodal AI to automatically generate shopping collections from billions of catalog items.
This post provides a practical guide to debugging JAX workloads on Cloud TPUs, covering essential tools and their relationships in distributed environments.
Dropbox Dash built a custom hybrid feature store to power real-time AI ranking across tens of thousands of work documents.
Pinterest Search presents a methodology for scaling search relevance assessment using fine-tuned LLMs to replace costly human annotation.
Pinterest describes how Pinner (user) surveys are used to train a machine learning model that improves content quality recommendations across Homefeed, Related Pins, and Search.
Pinterest shares its strategic shift toward fine-tuned open-source AI models, achieving comparable performance at less than 10% the cost of proprietary models.