• A Slice of AI
  • Posts
  • 🍕Pay $0 per prompt & Know your Research tools

🍕Pay $0 per prompt & Know your Research tools

PLUS: Vibe Coding Platforms You Need To Try

Hey makers 👋

Another week, another batch of mind-bending releases:

  • Baidu open-sources ERNIE 4.5 and declares price war on token bills.

  • ChatGPT (o3) with web search quietly becomes the best “research intern” you own.

  • Codex, Lovable, Base44 & Replit square off in our first Vibe Builder showdown.

Take a break, grab a slice

🚀  Ship Faster (and cheaper!)

Credit: Getty Images

Baidu just open-sourced ERNIE 4.5 - 10 Models, 424 B Params

Just yesterday, Baidu took its flagship language-and-vision model family, ERNIE 4.5, and released all ten versions on Hugging Face with a 129K context window. Here’s what’s up:

  • Pay $0 per prompt. Because the full “weights” are public, you can run ERNIE on your own laptop or a cheap cloud box instead of feeding coins into someone else’s API.

  • It’s no slouch. The biggest ERNIE beats DeepSeek V3 on 22 of 28 common tests for coding, reasoning, and maths, even though it’s half the size.

  • Sees text and pictures. A new training trick - heterogeneous multimodal Mixture-of-Experts (MoE) - lets the model learn language and images together, so it can caption screenshots or explain charts without extra plugins.

Dive Deeper | Try it out on Hugging Face

🧠 Level Up Your Game

Which chatbot should you trust for web research?

A fresh benchmark from FutureSearch⁠/⁠LessWrong put 12 big models and 11 “Deep Research” products through 89 tough, multi-step web-research tasks. Winner: the plain ChatGPT (model o3) + built-in web search beat every dedicated research tool—and did it faster and cheaper.

Key pizza take-aways

  • Default weapon: open a normal ChatGPT tab, toggle Search the web, and go. On FutureSearch’s Deep Research Bench it scored higher than OpenAI’s own “Deep Research” mode and every rival service.

  • Iterate, don’t over-engineer. Regular chat + search outperformed most “Deep Research” modes because you can ask quick follow-ups, sanity-check answers, and steer the search yourself.

  • PDFs still trip up Claude. If your workflow involves white papers or SaaS T&Cs, skip Claude’s web modes for now—its agents can’t reliably read embedded PDFs.

  • Building your own agent? FutureSearch’s API tests show Claude 4 Sonnet/Opus once you wrap them in a ReAct pipeline. That’s good to know especially if you’re baking research into a product.

  • DeepSeek R1 is the budget pick. Self-hosting DeepSeek gives respectable accuracy for pennies when you need hundreds of quick queries

📌 ReAct pipeline (Reason + Act)

An LLM running inside a little loop where it:

  1. Thinks out loud“I need the latest revenue for Company X.”

  2. Chooses an actionSearch(“Company X 2024 revenue”)

  3. Receives the observationSearch result: $48 M reported by TechCrunch, April 2025

  4. Thinks again“Great, now compare to Company Y…”

  5. Repeats until it has enough facts to answer you concisely.

How I do it

  1. I Ask → but Verify. After a result, prompt: “Double-check each claim and give me the source URLs only.”

  2. Link-check. Click the links yourself—LLMs still hallucinate ~20-30 % of the time on number questions…

🛠️ Tools

Vibe-coding Platforms Showdown 2025 (so far)


Here’s my comparison of the four hottest “vibe” platforms: OpenAI Codex, Lovable, Base44, and Replit.

Yes, the back-and-forth rabbit holes are real (I’ve been down a few myself), but remember: we’re living through a once-in-a-generation shift in how software gets made—might as well enjoy the ride!

  • Lovable → design-first speed run

    • One prompt = landing page + DB + auth with nice defaults.

    • Best when UI polish matters and the whole app fits in <30 screens.

  • Base44 → all-in-one peace of mind

    • Built-in database, auth, file storage, email, analytics—no extra APIs to wire up.

  • Replit + AI Agent → code-curious flexibility

    • I use the app wizard scaffolds or the remixed project, then drop into an IDE for fine-grained control of the code.

  • OpenAI Codex → autonomous teammate for dev-founders

    • It feels more like a junior engineer: edits your repo, runs tests, pushes PRs, etc ..

The Snapshot

Axis

OpenAI Codex

Lovable

Base44

Replit AI Agent

How you build

Assign coding tasks to an agent; it edits your repo & runs tests

Chat-prompt + live preview UI; Supabase/Stripe 1-click

Chat-prompt builder; everything (DB, auth, email) baked-in

Wizard plans app, then full IDE + chat; manual code always possible

Free tier

Included with ChatGPT Plus for now

5 msgs/day, 30 / mo

25 msgs/mo, 500 integration credits

10 AI “check-points”, 10 dev apps

Entry paid plan

$20 / mo ChatGPT Plus + token usage

$25 / mo • 100 msgs

$20 / mo • 100 msgs + 2 k integrations

$20 / mo (Core) full Agent + $25 credits

Sweet spot apps

Complex SaaS, heavy backend, agentic refactors

Landings, small SaaS, consumer tools, blogs

B2B dashboards, internal tools, secure portals

Anything code-heavy: multi-stack SaaS, games, bots

Not great for…

Pure no-coders; fancy UI

Deep custom logic; vendor lock-in shy

Custom design pixel-perfection; migrating off-platform

Founders allergic to code

🌶️ The Funny Slice

Il pomodoro 🥫… *italian gesturing*

🎧️ Heard on X…

Keep shipping! 🚀

P.S. - Got an AI tool that's moving the needle for your business? Building an AI tool that helps fellow hackers? Hit reply and tell me about it. Always looking for real-world wins from our community.