Share on

Age checks hit the web, CPU‑only exascale, and the new AI engineering playbook

NEWSLETTER

Beyond the Build • June 29, 2026

XX minutes of reading

NEWSLETTER | Amplifi Labs

KIDS Act Fast-Tracks Age Checks, Pressures Encryption and Moderation

Around the web • June 28, 2026

Congress is set to vote on the KIDS Act—a package combining a revised KOSA with the SAFE BOTS and SCREEN Acts—that would effectively push platforms to implement age verification under a negligence-style “knows or should have known” standard. The bill broadens moderation obligations (including around otherwise lawful discussions) and sets rules for direct, encrypted, and ephemeral messaging, pressuring services to collect IDs or use AI-based age estimation (with accuracy and bias risks) and potentially limit end-to-end encryption features. Developers should anticipate new compliance work across signup flows, parental controls, content policies, and chatbot capabilities, with outsized impact on startups.

Read Full Article →

AI engineering & tools

Proxy-KD distills black-box LLMs, beating classic white-box KD

Around the web •June 28, 2026

Researchers introduce Proxy-KD, a knowledge distillation approach that uses a proxy model to transfer capabilities from proprietary, black-box LLMs (e.g., GPT-4) to smaller student models. Experiments show Proxy-KD improves black-box distillation and surpasses traditional white-box KD, offering a practical path to train compact models without teacher internals—potentially reducing inference costs and enabling on-prem deployments.

Read Full Article →

Aleph Alpha codifies LLM training with Savanna, enabling one-click runs

Around the web •June 25, 2026

Aleph Alpha details Savanna, an internal “Model Training as Code” platform that encodes pretraining and post-training as typed, version-controlled functions, yielding hermetic, reproducible, one-click pipelines with provenance and caching. Integrated with GitHub CI (5‑minute PR-scale end-to-end tests, nightly larger validations) and orchestrated via Flyte on Kubernetes, it immutably versions artifacts in object storage with Weights & Biases lineage, automates evals, and deduplicates sweeps. For developers, the pattern reduces integration debt, accelerates iteration and cross-team ownership, and sets the stage for agent-driven auto-research.

Read Full Article →

Compute & performance shifts

China’s CPU-Only LineShine Tops TOP500 with 2.2 Exaflops, Leads HPCG

Around the web •June 28, 2026

China’s LineShine debuts at #1 on the TOP500 with a CPU-only Armv9 LX2 system delivering 2.198 FP64 exaflops (Rmax) across ~22k nodes/13M cores, and takes the HPCG crown at 22.004 PF/s—an exceptional result without GPUs. The machine draws 42.22 MW for 52.07 GF/W, underscoring rising CPU-side efficiency, while Italy’s MI300A-based HPC7 enters at #6 and Fugaku remains #3 on HPCG. Green500’s top 10 is unchanged, and TOP500 governance shifts to ACM SIGHPC with DOI-backed lists, signaling procurement and architecture shifts in exascale HPC.

Read Full Article →

Edera brings end-to-end NUMA awareness to Xen virtualization

Around the web •June 24, 2026

This piece explains how NUMA topology can impose 1.5x to 3x remote-memory latency (and worse under contention), why CPU and memory affinity must align, and how interleaving trades peak performance for predictability. Edera says it has shipped an end-to-end NUMA-aware stack for Xen—exposing real topology across guest, paravirt I/O, dom0, and hypervisor layers—to remove long-standing blind spots where dom0 and VMs lacked actionable vNUMA insight. For operators and performance-minded developers on modern EPYC/Xeon systems, this promises steadier tail latency and higher throughput, with next parts detailing automatic memory and vCPU placement.

Read Full Article →

Practical ops & tooling

Run and track AI dev agents in your terminal with Herdr

Around the web •June 29, 2026

Herdr is a Rust-based, tmux-like, agent-aware terminal multiplexer that organizes work into workspaces, tabs, and panes with mouse support and persistent detach/reattach—locally or over SSH. It auto-detects agent state (blocked/working/done/idle) for popular CLIs like Claude Code, GitHub Copilot CLI, Devin, Cursor, and more, and offers official integrations plus a socket API for session restore and programmatic control. Open-source under AGPL-3.0 with optional commercial licensing, it installs via curl/Homebrew/mise; Linux/macOS are stable and Windows builds are in preview.

Read Full Article →

Interactive Stanford tracker charts DRAM/NAND/HBM prices and AI accelerator costs

Around the web •June 28, 2026

Stanford’s DAM project publishes an interactive “Memory Prices” dataset tracking historical and current $/GB for DRAM, NAND, and HBM, with downloadable CSV and drill-downs by DRAM and HBM generations (HBM4 projected). It also models AI accelerator cost breakdowns (HBM, logic die, packaging/CoWoS, auxiliary) across Nvidia, AMD, Google, and Amazon; DRAM/NAND refresh monthly and HBM quarterly (last updated 2026-06-26). Useful for infra and AI teams budgeting and capacity planning, with caveats that figures reflect cheapest nominal retail listings and modeled estimates, not contract prices.

Read Full Article →

‍