Episode #357: 01 April 2026

Daily Vibe Casting

Claude Code leak fallout, smaller edge AI models, and big bets as tools move closer to the device

Apr 01, 2026

Overview

Today was a tug of war between “make it smaller and run it anywhere” and “make it bigger and fund it forever”. We saw serious progress on compact models and edge hardware, while the Claude Code leak kept sparking fresh angles on IP, open-source momentum, and how fast software can be re-made once it’s out in the wild. On top of that, a new Gmail quality-of-life tweak landed, and the usual April 1 pranks reminded everyone not to take release notes at face value.

The big picture

The centre of gravity keeps moving towards two extremes at once. On the ground, more capability is being squeezed into smaller models and cheaper setups, the sort you can run on a phone or a modest local box. At the top, the capital and compliance machinery around frontier AI is accelerating too, with safety agreements, mega-rounds, and agent products aimed straight at regulated work. The gap between “I can run this offline” and “this needs an army of lawyers” is starting to feel like the story.

Liquid AI bets on small agents that can actually use tools

Liquid AI’s LFM2.5-350M pitch is simple: stop treating sub-1B models as toys. The focus is reliable extraction and function calling at a size where models normally fall apart. Once quantised, it can also sit under 500MB, which suits tighter devices and offline workflows.

If the benchmarks hold up in real deployments, this is part of a broader pattern: agent-style behaviour is no longer reserved for the largest models, which changes what “local-first” can mean for document processing and lightweight automation.
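For a sense of what "reliable function calling" has to mean in practice, here is a minimal sketch of the check an agent loop typically runs on a model's output before executing anything. The tool name, its argument schema, and the JSON shape are all hypothetical and chosen for illustration; nothing here reflects Liquid AI's actual output format.

```python
import json

# Hypothetical tool registry: the tool name and argument types are illustrative only.
TOOLS = {
    "extract_invoice_total": {"currency": str, "amount": float},
}


def parse_tool_call(raw: str):
    """Return (name, args) if raw is a well-formed call to a known tool, else None."""
    try:
        call = json.loads(raw)
    except json.JSONDecodeError:
        return None
    if not isinstance(call, dict):
        return None
    name = call.get("name")
    args = call.get("arguments")
    if not isinstance(name, str) or not isinstance(args, dict):
        return None
    schema = TOOLS.get(name)
    if schema is None:
        return None
    # Reject calls with missing or unexpected arguments.
    if set(args) != set(schema):
        return None
    # Reject calls whose argument values have the wrong type.
    for key, expected_type in schema.items():
        if not isinstance(args[key], expected_type):
            return None
    return name, args
```

A small model that passes this kind of gate consistently is usable in a loop; one that emits malformed JSON or invents tool names one time in ten is not, whatever its headline score.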

Liquid AI@liquidai

Today, we release LFM2.5-350M. Agentic loops at 350M parameters. A 350M model trained for reliable data extraction and tool use, where models at this scale typically struggle. <500MB when quantized, built for environments where compute, memory, and latency are constrained. 🧵

5:17 PM · Mar 31, 2026 · 239K Views

68 Replies · 244 Reposts · 1.96K Likes

PrismML comes out of stealth with a “1-bit” 8B model

PrismML’s launch is another reminder that compression is now a first-class research lane, not an afterthought. An 8B model squeezed down to around a gigabyte, with claims of big speed and energy wins, is the sort of thing that makes on-device inference feel less like a party trick and more like a product plan.

The scepticism in the replies is healthy, because “true 1-bit” claims are easy to market and harder to prove. Still, the direction is clear: people want strong models that fit where the work happens.
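The "around a gigabyte" figure is at least consistent with back-of-envelope arithmetic. The sketch below only counts raw weight storage at a given bit width; it ignores scaling factors, any layers kept at higher precision, activations, and the KV cache, so treat it as a lower bound rather than a claim about PrismML's actual file size. The function name is made up for this example.

```python
def weight_footprint_gb(params: float, bits_per_weight: float) -> float:
    """Approximate raw weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


# An 8B-parameter model at different bit widths.
for bits in (16, 4, 1):
    print(f"{bits:>2}-bit: {weight_footprint_gb(8e9, bits):.1f} GB")
```

At 16 bits the same weights need roughly 16 GB, which is the gap that makes the 1-bit claim interesting if quality survives.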

Andrew Curran@AndrewCurran_

PrismML is leaving stealth today.

PrismML @PrismML

Today, we are emerging from stealth and launching PrismML, an AI lab with Caltech origins that is centered on building the most concentrated form of intelligence. At PrismML, we believe that the next major leaps in AI will be driven by order-of-magnitude improvements in

6:59 PM · Mar 31, 2026 · 129K Views

14 Replies · 64 Reposts · 1.41K Likes

Apple-approved eGPU driver opens up Mac AI setups

Tiny Corp says Apple has approved its eGPU driver for Thunderbolt and USB4 Macs, covering both AMD and NVIDIA. That matters because it lowers the friction for a practical “Mac plus external GPU” workflow, without the usual system tweaks that put off anyone who is not keen on tinkering.

It is also a nice counterpoint to the edge-model story: sometimes the quickest win is not a smaller model but simply making more compute usable.

the tiny corp@__tinygrad__

If you have a Thunderbolt or USB4 eGPU and a Mac, today is the day you've been waiting for! Apple finally approved our driver for both AMD and NVIDIA. It's so easy to install now a Qwen could do it, then it can run that Qwen...

5:30 AM · Apr 1, 2026 · 440K Views

127 Replies · 587 Reposts · 4.17K Likes

Receipt photos to structured data, the unglamorous win that counts

Hasan Toor’s demo with Perceptron AI is about a painfully real problem: messy receipts that defeat standard OCR. The appeal is straightforward: take photos of crumpled paper and get clean line items and totals out the other end, ready for CSV and categorisation.

This is the kind of “boring” automation that can save hours, because it targets the friction points where humans still retype and double-check.
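To make the "ready for CSV" step concrete, here is a minimal sketch of what happens after extraction: a sanity check that the line items add up to the stated total, then a flat CSV export. The receipt dictionary, its field names, and the sample values are invented for illustration and are not Perceptron's schema or output.

```python
import csv
import io
from decimal import Decimal

# Hypothetical extraction result; field names and values are made up for this sketch.
receipt = {
    "vendor": "Corner Cafe",
    "date": "2026-03-31",
    "items": [
        {"description": "Flat white", "amount": "3.40"},
        {"description": "Toastie", "amount": "6.10"},
    ],
    "total": "9.50",
}


def totals_match(r: dict) -> bool:
    """Check that the line items sum to the stated total, using exact decimals."""
    return sum(Decimal(i["amount"]) for i in r["items"]) == Decimal(r["total"])


def to_csv(r: dict) -> str:
    """Flatten one receipt into CSV text, one row per line item."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["vendor", "date", "description", "amount"])
    for item in r["items"]:
        writer.writerow([r["vendor"], r["date"], item["description"], item["amount"]])
    return buf.getvalue()


if totals_match(receipt):
    print(to_csv(receipt))
```

The totals check is the part that replaces the human double-check: a receipt whose items do not sum to the total is the one worth a second look.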

Hasan Toor@hasantoxr

Perceptron AI @perceptroninc walked me through something that solves a problem most tools still get wrong. Physical receipts → structured data. Automatically. They ran a demo: photos of crumpled, faded, oddly-formatted receipts. Line items, totals, dates, and vendor extracted

Perceptron AI @perceptroninc

Despite having vision, most AI agents still struggle to see. General-purpose multimodal models are powerful, but they’re expensive for every visual task. We built something better: Perceptron's MCP gives any agent stronger vision capabilities through Isaac with far lower cost.

5:45 PM · Mar 31, 2026 · 72.7K Views

22 Replies · 5 Reposts · 154 Likes

The Claude Code leak, and how fast the internet re-builds software

Yuchen Jin framed the leak’s aftershock in the starkest terms: once code is public, AI tools can help re-express it into a new codebase at speed, including wholesale language ports. That is not the same as training weights leaking, but it does change the practical enforceability of “closed” tooling when the scaffolding can be replicated overnight.

It is also a cultural moment, because the community response is not just mirroring. It is iterating, translating, and shipping variants as if that is the default.

Yuchen Jin@Yuchenj_UW

> Anthropic leaked Claude Code source code
> someone forked it
> 32.6k stars, 44.3k forks
> got scared of getting sued
> convert the whole codebase from TypeScript to Python with Codex

AI is quietly erasing copyright.

3:08 PM · Mar 31, 2026 · 395K Views

227 Replies · 437 Reposts · 5.96K Likes

What the leak reveals about Claude Code’s design

Aakash Gupta took the more useful angle: treat the leak as a window into how a modern agentic CLI is put together. The interesting bits are the coordinator patterns, permissioning, command surface, and the practical UI decisions that make these tools usable day to day.

Even if Anthropic patches the packaging mistake, these architectural lessons are now common knowledge, and they will show up in competitors and forks in weeks, not years.

Aakash Gupta@aakashgupta

Claude Code’s source code was leaked. Here’s what you can learn from it.

3:52 PM · Mar 31, 2026 · 552K Views

30 Replies · 150 Reposts · 1.11K Likes

DMCA scattergun meets confused bystanders

Daniel San’s post captures the messy end of enforcement: takedowns that appear to hit forks of Anthropic’s public repos, not just mirrors of leaked source. When legal responses go broad, it is easy to spook the wrong people and create extra noise.

In a world where code spreads instantly, the clean-up matters almost as much as the original mistake, because it shapes developer trust.

Daniel San@dani_avila7

Got this email from GitHub, possibly related to the Claude Code codebase leak but the repo they're referencing is a fork of Anthropic's own public repo It's not the codebase one, it's a repo with Skills, examples, docs, etc. If you want more details, here's all the info:

11:59 PM · Mar 31, 2026 · 152K Views

13 Replies · 12 Reposts · 200 Likes

OpenClaw gets a practical field guide, costs and all

Lenny Rachitsky flagged Claire Vo’s guide to OpenClaw, pitching it as a start-to-finish manual that includes the bits people normally hide, such as real API spend and the security foot-guns that show up once agents have file access and tool permissions.

The mood across agent tooling is maturing: fewer “look what it can do”, more “here is what it costs, here is what can go wrong, here is how to run it without panicking”.

Lenny Rachitsky@lennysan

OpenClaw: The complete guide @clairevo has just put together the definitive guide to getting started with and mastering OpenClaw. Building on our podcast episode, this post covers everything you need to know, from first install to multi-agent setups, plus the real costs and

3:32 PM · Mar 31, 2026 · 238K Views

69 Replies · 160 Reposts · 1.16K Likes

AI for fraud and compliance moves from pitch decks to production

Y Combinator’s Founder Firesides episode spotlights Variance coming out of stealth with a $21M Series A, aimed at risk work like fraud detection and identity checks. This is a reminder that “agents” are not just for coding: they are being pointed straight at messy, high-stakes queues inside large firms.

It is also a quiet theme of the day: lean teams using coding agents to scale output, while the product itself is another layer of automation watching for scams and abuse.

Y Combinator@ycombinator

In this episode of Founder Firesides, YC Managing Partner Jared Friedman talks to Karine Mellata (@karine_exe), co-founder of Variance (@trustvariance), who is coming out of stealth and announcing their $21 million Series A. Variance builds purpose-built AI agents for risk and

4:03 PM · Mar 31, 2026 · 83.8K Views

23 Replies · 21 Reposts · 127 Likes

Gmail finally lets you change your address without burning the old one

Sundar Pichai announced a long-requested tweak: you can update your @gmail.com username while keeping the old address as an alias. It is a small feature with an outsized emotional payoff for anyone still stuck with a teenage inbox name on job applications.

The gradual rollout and regional limits will annoy some people, but it is the sort of account-level change Google usually avoids, which makes it notable.