Hey makers 👋

Another week, another batch of mind-bending releases:

Baidu open-sources ERNIE 4.5 and declares price war on token bills.
ChatGPT (o3) with web search quietly becomes the best “research intern” you own.
Codex, Lovable, Base44 & Replit square off in our first Vibe Builder showdown.

Take a break, grab a slice

🚀 Ship Faster (and cheaper!)

Credit: Getty Images

Baidu just open-sourced ERNIE 4.5 - 10 Models, 424 B Params

Just yesterday, Baidu took its flagship language-and-vision model family, ERNIE 4.5, and released all ten versions on Hugging Face with a 129K context window. Here’s what’s up:

Pay $0 per prompt. Because the full “weights” are public, you can run ERNIE on your own laptop or a cheap cloud box instead of feeding coins into someone else’s API.
It’s no slouch. The biggest ERNIE beats DeepSeek V3 on 22 of 28 common tests for coding, reasoning, and maths, even though it’s half the size.
Sees text and pictures. A new training trick - heterogeneous multimodal Mixture-of-Experts (MoE) - lets the model learn language and images together, so it can caption screenshots or explain charts without extra plugins.

Dive Deeper | Try it out on Hugging Face

🧠 Level Up Your Game

Which chatbot should you trust for web research?

A fresh benchmark from FutureSearch⁠/⁠LessWrong put 12 big models and 11 “Deep Research” products through 89 tough, multi-step web-research tasks. Winner: the plain ChatGPT (model o3) + built-in web search beat every dedicated research tool—and did it faster and cheaper.

Key pizza take-aways

Default weapon: open a normal ChatGPT tab, toggle Search the web, and go. On FutureSearch’s Deep Research Bench it scored higher than OpenAI’s own “Deep Research” mode and every rival service.
Iterate, don’t over-engineer. Regular chat + search outperformed most “Deep Research” modes because you can ask quick follow-ups, sanity-check answers, and steer the search yourself.
PDFs still trip up Claude. If your workflow involves white papers or SaaS T&Cs, skip Claude’s web modes for now—its agents can’t reliably read embedded PDFs.
Building your own agent? FutureSearch’s API tests show Claude 4 Sonnet/Opus once you wrap them in a ReAct pipeline. That’s good to know especially if you’re baking research into a product.
DeepSeek R1 is the budget pick. Self-hosting DeepSeek gives respectable accuracy for pennies when you need hundreds of quick queries

📌 ReAct pipeline (Reason + Act)

An LLM running inside a little loop where it:

Thinks out loud – “I need the latest revenue for Company X.”
Chooses an action – Search(“Company X 2024 revenue”)
Receives the observation – Search result: $48 M reported by TechCrunch, April 2025
Thinks again – “Great, now compare to Company Y…”
Repeats until it has enough facts to answer you concisely.

How I do it

I Ask → but Verify. After a result, prompt: “Double-check each claim and give me the source URLs only.”
Link-check. Click the links yourself—LLMs still hallucinate ~20-30 % of the time on number questions…

🛠 Tools

Vibe-coding Platforms Showdown 2025 (so far)

Here’s my comparison of the four hottest “vibe” platforms: OpenAI Codex, Lovable, Base44, and Replit.

Yes, the back-and-forth rabbit holes are real (I’ve been down a few myself), but remember: we’re living through a once-in-a-generation shift in how software gets made—might as well enjoy the ride!

Lovable → design-first speed run
- One prompt = landing page + DB + auth with nice defaults.
- Best when UI polish matters and the whole app fits in <30 screens.
Base44 → all-in-one peace of mind
- Built-in database, auth, file storage, email, analytics—no extra APIs to wire up.
Replit + AI Agent → code-curious flexibility
- I use the app wizard scaffolds or the remixed project, then drop into an IDE for fine-grained control of the code.
OpenAI Codex → autonomous teammate for dev-founders
- It feels more like a junior engineer: edits your repo, runs tests, pushes PRs, etc ..

The Snapshot

Axis	OpenAI Codex	Lovable	Base44	Replit AI Agent
How you build	Assign coding tasks to an agent; it edits your repo & runs tests	Chat-prompt + live preview UI; Supabase/Stripe 1-click	Chat-prompt builder; everything (DB, auth, email) baked-in	Wizard plans app, then full IDE + chat; manual code always possible
Free tier	Included with ChatGPT Plus for now	5 msgs/day, 30 / mo	25 msgs/mo, 500 integration credits	10 AI “check-points”, 10 dev apps
Entry paid plan	$20 / mo ChatGPT Plus + token usage	$25 / mo • 100 msgs	$20 / mo • 100 msgs + 2 k integrations	$20 / mo (Core) full Agent + $25 credits
Sweet spot apps	Complex SaaS, heavy backend, agentic refactors	Landings, small SaaS, consumer tools, blogs	B2B dashboards, internal tools, secure portals	Anything code-heavy: multi-stack SaaS, games, bots
Not great for…	Pure no-coders; fancy UI	Deep custom logic; vendor lock-in shy	Custom design pixel-perfection; migrating off-platform	Founders allergic to code

🌶️ The Funny Slice

Il pomodoro 🥫… *italian gesturing*

🎧 Heard on X…

Keep shipping! 🚀

P.S. - Got an AI tool that's moving the needle for your business? Building an AI tool that helps fellow hackers? Hit reply and tell me about it. Always looking for real-world wins from our community.

🍕Pay $0 per prompt & Know your Research tools