Rohit Raj

Engineering Notes & Open-Source Work

Documenting the engineering decisions, architecture, and lessons learned from building production AI systems.

Blog Articles


Best Local LLM for Coding in 2026: When It Actually Replaces Claude and GPT

12 min read

Jun 17, 2026

Two Hacker News front-page threads this week — one at 1,245 points — are asking the same thing: can a local model finally replace Claude or GPT for daily coding? The honest 2026 answer is "for ~80% of your sessions, yes." Here is the builder read: which local coding models actually crossed the SWE-bench line, how to set one up with Ollama in ten minutes, exactly how much VRAM you need, and the hybrid routing pattern that keeps the hard 20% on the cloud.

best local llm for coding 2026, replace claude code with local model, local llm vs claude for coding

This Week in AI Dev: A Frontier Model Gets Pulled While Open Weights Keep Shipping (Week 25 of 2026)

7 min read

Jun 16, 2026

Week 25 of 2026 in AI dev tools: the US government forces Anthropic to suspend Claude Fable 5 and Mythos 5 three days after launch, Moonshot ships Kimi K2.7-Code open weights, Google open-sources the text-diffusion model DiffusionGemma, the Claude Agent SDK flips to metered billing on June 15, OpenCode crosses 160K GitHub stars, and agentic payments via x402 + AP2 quietly become real.

ai dev tools this week, claude fable 5 suspended, kimi k2.7 code

AI Agent Memory vs Context Window: Why a Bigger Window Isn't Memory (2026)

13 min read

Jun 15, 2026

A 1M-token context window is not memory — it is RAM that gets wiped when the session ends. On the LoCoMo benchmark a two-layer memory setup hit 91.6% accuracy at ~6,956 tokens vs 72.9% at ~26,000 tokens for full context. Here is the builder read: why context windows behave like RAM, the two-layer architecture every production agent needs, a minimal pgvector memory layer in code, and when a bigger window is still the right call.

ai agent memory vs context window, context window is not memory, ai agent memory architecture 2026

Kimi K2.7-Code vs Claude Opus 4.8 and GPT-5.5: Is the 1T Open Coding Model Worth It? (2026)

12 min read

Jun 14, 2026

Moonshot AI dropped Kimi K2.7-Code on June 12, 2026 — a 1T-parameter open-weight coding model that costs $0.95/$4.00 per million tokens, roughly 5-7x cheaper than Claude Opus 4.8 and GPT-5.5. Here is the developer read: the real benchmark numbers (and why they are all first-party), a verified cost-per-task comparison the hype guides skip, how to run it via API or locally, and when you should still reach for Claude or GPT.

kimi k2.7 code, kimi k2.7 code vs claude opus, kimi k2.7 code review

AI Agent Payments in 2026: x402 vs AP2 — How to Let Your Agent Actually Pay

12 min read

Jun 13, 2026

x402 crossed 161M cumulative payments and got picked up by AWS Bedrock AgentCore in May 2026, while Google’s AP2 defines the trust layer above it. Here is the developer read: how x402 and AP2 actually work, working code to monetize an MCP server or API per request, the per-request settlement trap, and when to skip crypto rails entirely.

ai agent payments, x402 vs ap2, x402 protocol

OpenCode vs Claude Code vs Cursor: The Best AI Coding Agent in 2026?

12 min read

Jun 12, 2026

OpenCode just became the most-starred AI coding agent on GitHub — 172,198 stars under MIT, with v1.17.4 shipping June 12, 2026. Here is the developer read: how the free, model-agnostic OpenCode compares to Claude Code and Cursor, the Terminal-Bench numbers, the BYOK cost math, and when each one is the right call.

opencode vs claude code, opencode ai coding agent, opencode vs cursor

DiffusionGemma: Text Diffusion LLMs Explained, and When to Actually Use One (2026)

11 min read

Jun 11, 2026

Google open-sourced DiffusionGemma on June 10, 2026 — a 26B MoE that writes a 256-token block in parallel instead of one token at a time, hitting 700+ tokens/sec on an RTX 5090 and up to 4x faster than Gemma 4. The catch: quality sits below standard Gemma 4. Here is the developer read — how text diffusion works, how to run it locally, the speed-vs-quality decision, and when to skip it.

diffusiongemma, diffusiongemma vs gemma 4, text diffusion model

How to Run DiffusionGemma Locally: A vLLM Serving Guide for RTX 5090 and H100 (2026)

11 min read

Jun 11, 2026

A build-focused guide to self-hosting Google's DiffusionGemma: the exact vLLM serve command, what each diffusion flag does, how to call it like an OpenAI endpoint, and how to tune the speed-vs-quality trade-off on an RTX 5090 or H100.

run diffusiongemma locally, diffusiongemma vllm, serve diffusiongemma rtx 5090

Claude Fable 5: Pricing, the API, and When to Use It vs Opus 4.8 (2026)

11 min read

Jun 10, 2026

Anthropic shipped Claude Fable 5 on June 9, 2026 — a Mythos-class model at $10/$50 per million tokens, double the Opus 4.8 rate. Here is the developer read: the claude-fable-5 API, the Opus-4.8 safeguard fallback you must design around, the new 30-day retention rule, Fable vs Mythos, and when to wait.

claude fable 5, claude fable 5 pricing, claude fable 5 vs opus 4.8

This Week in AI Dev: Codex Builds Apps, the Open-Weight Frontier Explodes, and Anthropic Meters the Agent SDK (Week 24 of 2026)

6 min read

Jun 9, 2026

Week 24 of 2026 in AI dev tools: OpenAI's Codex graduates from coding agent to app builder with Sites and role plugins, three open-weight models drop in 72 hours (MiniMax M3, Gemma 4 12B, NVIDIA Nemotron 3 Ultra), Anthropic moves the Agent SDK to metered billing on June 15, Microsoft Build hardens agent security, and the Gemini CLI consumer sunset hits June 18.

ai dev tools this week, openai codex sites, minimax m3 open weight model

What Is Harness Engineering? OpenAI’s Agent-First Codex Playbook (2026)

12 min read

Jun 8, 2026

Harness engineering is the discipline of building the scaffolding — docs, golden rules, custom linters, and agent-to-agent review loops — that lets AI coding agents ship reliable software at scale. OpenAI coined the term after building a ~1M-line beta product in 5 months with zero hand-written code using Codex. Here is what a harness actually contains, the architecture that makes it work, when it pays off, when to skip it, and how I run a smaller version of it today.

harness engineering, what is harness engineering, harness engineering codex

Open Notebook vs Khoj vs SurfSense: Best Self-Hosted NotebookLM Alternative (2026)

12 min read

Jun 7, 2026

Open Notebook just hit #1 on GitHub Trending — but is it the best self-hosted NotebookLM alternative? Here's how Open Notebook (MIT), Khoj (AGPL-3.0), and SurfSense (Apache-2.0) actually compare on Docker setup, RAG architecture, integrations, and the open-source license trap that can bite a commercial build.

open source notebooklm alternative, self-hosted notebooklm, open notebook vs khoj

NVIDIA RTX Spark + Windows: What Microsoft’s Local-AI Superchip Means for Developers (2026)

12 min read

Jun 7, 2026

NVIDIA and Microsoft unveiled the RTX Spark superchip at Computex 2026 — a 20-core Grace Arm CPU plus a 6,144-core Blackwell RTX GPU and up to 128GB unified memory that runs 120B-parameter LLMs locally with up to 1M tokens of context. Here is the developer-only read: the confirmed specs, RTX Spark vs DGX Spark, how it ties into Satya Nadella’s agentic-AI push at Build 2026, what you can actually build on it this fall, and when to wait.

nvidia rtx spark, nvidia rtx spark for developers, rtx spark vs dgx spark

AI Agent Memory in 2026: Mem0 vs Zep vs Letta vs MemPalace (Open-Source, Benchmarked)

13 min read

Jun 6, 2026

Agent memory — not the model — is the 2026 bottleneck. MemPalace just hit 54.1k GitHub stars and shipped v3.4.0 with a 96.6% LongMemEval score and zero API calls. Here's how the four open-source AI agent memory layers (Mem0, Zep, Letta, MemPalace) actually compare on architecture, real benchmarks, and honest licensing — plus a code snippet to add memory in minutes and how I'd wire it into a production agent.

open source ai agent memory, ai agent memory, mem0 vs zep vs letta

Claude AI Vulnerability Scanner: Anthropic's Open-Source Code-Security Harness (2026)

12 min read

Jun 5, 2026

Anthropic open-sourced defending-code-reference-harness — a Claude-powered pipeline that finds and patches security bugs in your code — and it hit the GitHub Trending front page this week. Here's what actually shipped, how to run /vuln-scan on your own repo, how it compares to the claude-code-security-review Action, managed Claude Security, and Snyk/Semgrep/CodeQL, where it quietly breaks, and how I'd wire it into a production CI without burning your token budget.

claude ai vulnerability scanner, ai code vulnerability scanner, anthropic defending code reference harness

Cut LLM Token Costs Up to 90% with Context Compression (2026)

11 min read

Jun 4, 2026

Headroom hit #1 on GitHub Trending on June 4, 2026 with a tool that compresses tool outputs, logs, and RAG chunks before they reach the model — cutting input tokens up to 92%. Here's how LLM context compression actually works, how Headroom stacks up against LLMLingua, prompt caching, and RAG reranking, when it quietly breaks, and how I'd wire it into a production MVP without losing accuracy.

llm context compression, cut llm token costs, reduce llm api costs

This Week in AI Dev: Claude Opus 4.8, Copilot Goes Token-Metered, MCP's Stateless Next Spec (Week 23 of 2026)

6 min read

Jun 2, 2026

Six ships from Week 23 of 2026 that change how you build with AI: Claude Opus 4.8 lands, GitHub Copilot moves to token-metered AI Credits on June 1, the MCP next-spec RC locks a stateless protocol core, OpenAI Codex becomes an autonomous Goal-Mode runtime, Windsurf bundles Devin and raises prices, and the open-weight coding race tightens with Kimi K2.6 and GLM-5.1.

ai dev tools this week, github copilot ai credits billing, claude opus 4.8

Claude Code Dynamic Workflows: A Hands-On Guide for Developers (2026)

12 min read

Jun 1, 2026

Anthropic shipped dynamic workflows in Claude Code on May 28, 2026 — a JavaScript script Claude writes to orchestrate up to 1,000 subagents on one task, in the background, while your session stays free. Here's what they actually are, how they differ from subagents and skills, the three ways to trigger one, where they earn their (heavy) token cost, when to skip them, and how I wire them into real client builds.

claude code dynamic workflows, claude code dynamic workflows guide, claude code ultracode

OpenRouter vs LiteLLM vs Portkey: Which LLM Gateway for Your AI MVP? (2026)

13 min read

May 31, 2026

OpenRouter raised a $113M Series B on May 28, 2026 (led by CapitalG) — proof the LLM-gateway layer is now core infrastructure. But which one belongs in your AI MVP: OpenRouter's hosted marketplace, LiteLLM's self-hosted proxy, or Portkey's observability gateway? Here's the real cost math in ₹ and $, the minimal config for each, and the decision rule I use when wiring a gateway into a 6-week build.

openrouter vs litellm vs portkey, best llm gateway 2026, llm gateway for ai mvp

AI-Generated Code Anti-Patterns: 9 Production Bugs Hiding in Vibe-Coded Apps (2026)

12 min read

May 29, 2026

AI coding agents produce roughly 1.7x more issues than human-written code (CodeRabbit, Dec 2025), and AI-generated code drove 35 new CVEs in March 2026 alone. Here are the 9 anti-patterns I catch reviewing vibe-coded MVPs — phantom validation, optimistic auth, IDOR, race conditions, retry storms, God components — with the before/after fix for each and a checklist to catch them before they ship.

ai generated code anti-patterns, common bugs in ai generated code, vibe coding tech debt

Claude Opus 4.8 vs 4.7 for Developers: What Changed and Should You Upgrade (2026)

11 min read

May 28, 2026

Anthropic shipped Claude Opus 4.8 on May 28, 2026: agentic coding jumps from 64.3% to 69.2%, it is around 4x less likely to let a code flaw pass unremarked, and pricing is unchanged at $5/$25 per million tokens. Here is the developer-only breakdown: the confirmed benchmark deltas, the code changes to make, where it actually beats 4.7, and the one reason you might wait.

claude opus 4.8 vs 4.7, claude opus 4.8 for developers, should i upgrade to claude opus 4.8

Build a Secure MCP Server in TypeScript: The Post-Copilot Defense Playbook (2026)

13 min read

May 27, 2026

Microsoft Copilot Cowork was exfiltrating SharePoint and OneDrive files via a 5-line prompt injection hidden inside an 81-line skill file. The attack is model-agnostic, and it worked in 5 of 5 trials against Claude Opus 4.7. Most "build an MCP server" tutorials ship code that has the same hole. This is the secure TypeScript build, end to end, with the defense layer none of the top tutorials include.

secure mcp server typescript, mcp server typescript 2026, mcp server prompt injection defense

This Week in AI Dev: DeepSeek's Permanent 75% Cut, GitHub Auto-Routing, Gemini Managed Agents (Week 22 of 2026)

6 min read

May 26, 2026

Six ships from Week 22 of 2026 that change how you build with AI: DeepSeek V4 Pro locks in a 75% price cut, Copilot auto-routes models, Google ships Managed Agents, NuExtract3 opens up structured extraction, Cursor 3 goes multi-agent, and DeepSeek Reasonix lands as an open coding agent.

ai dev tools this week, deepseek v4 pro price cut, github copilot auto model selection

Gemini CLI Shuts Down June 18, 2026: Antigravity CLI Migration + 4 Alternatives Worth Switching To

14 min read

May 25, 2026

Google is killing Gemini CLI for free, Pro, and Ultra users on June 18, 2026 — 24 days from today. The replacement, Antigravity CLI (agy), is closed-source, Go-based, and ships with weekly quotas instead of daily ones. Here is the 10-minute migration if you stay, the 4 alternatives worth switching to instead (Claude Code, Codex CLI, Aider, OpenCode), and the decision tree I would actually use for an India MVP client this month.

gemini cli to antigravity migration, gemini cli shutdown june 18 2026, antigravity cli alternatives

DeepSeek V4 Pro vs Claude Sonnet 4.6 vs GPT-5.5: The Real MVP Cost in 2026

13 min read

May 24, 2026

DeepSeek made its 75% V4 Pro discount permanent on 2026-05-22 — output tokens now sit at $0.87/M, roughly 34× below GPT-5.5 and 28× below Claude Sonnet 4.6. Here is the line-item math for a real India MVP, the benchmarks the pricing page hides, and the four production failure modes that decide whether the cheaper model actually saves you anything.

deepseek v4 pro vs claude vs gpt, deepseek v4 pro pricing 2026, cheapest llm api for mvp

PostHog vs Mixpanel vs Amplitude — Best Product Analytics for India MVP (2026)

13 min read

May 23, 2026

PostHog wins for engineering-led India MVPs because it self-hosts on AWS Mumbai for DPDP compliance and bundles session replay + feature flags into one tier. Mixpanel wins for PM-heavy teams that want the lowest learning curve. Amplitude wins for funded teams with a dedicated analyst. The real cost gap at 10M events/month is 8×. Here is the math, the data-residency reality, and the exact stack I run on myFinancial.

posthog vs mixpanel vs amplitude, product analytics india mvp 2026, posthog india self host aws mumbai

Claude Code Plugins in 2026: The Context-Engineering Stack Indie Devs Are Actually Installing

12 min read

May 23, 2026

Four of today's top 15 trending GitHub repos are Claude Code plugins. CodeGraph hit 2,434 stars in 24 hours. Karpathy's skills file: 3,372. This isn't noise — it's the moment context engineering became the differentiator. Here's what to install, what to skip, and why every listicle you've read this month is already wrong.

claude code plugins, claude code skills, claude code knowledge graph

Hire iOS Developer India 2026: Founding Engineer vs Agency vs Swift Freelancer (Real Cost)

13 min read

May 22, 2026

A founding engineer in India ships a native iOS MVP in 6–8 weeks for ₹7.5–10.5L fixed — Swift, SwiftUI, TestFlight, both review submissions, all in. A Bangalore agency quotes ₹28–42L for the same scope, lands in 17 weeks, and hands you a codebase mixing UIKit and SwiftUI across screens. Here is the May 2026 rate card, the App Store review traps that bite native iOS specifically, and the decision tree I wish my last three FinTech and consumer-app founders had read.

hire ios developer india 2026, ios developer india cost, swift mvp india

Resend vs SendGrid vs AWS SES for India MVPs in 2026 — Real Cost on 10K Transactional Emails

14 min read

May 21, 2026

Resend looks the friendliest, SendGrid looks the safest, and AWS SES looks the cheapest. Run a real 10K-email/month India MVP through each and the math, the deliverability to Indian Gmail inboxes, and the DLT-style compliance burden separate them by an order of magnitude. Here is which one survives a 50-user-to-5K-user growth curve, which one bankrupts you at scale, and the exact ₹ numbers I have logged across four client launches.

resend vs sendgrid vs ses, transactional email india 2026, resend india mvp

Hire Flutter Developer India 2026: Founding Engineer vs Agency vs FlutterFlow (Real Cost)

13 min read

May 20, 2026

A founding engineer in India ships a Flutter MVP in 5–8 weeks for ₹6.5–9.5L fixed. A Bangalore agency quotes ₹24–38L for the same scope and lands in 16. FlutterFlow saves 3 weeks of UI work and then traps you in a no-code stack you cannot extend. Here is the real cost math for May 2026, which Flutter packages survive production, and the decision tree I wish my last two FinTech clients had read.

hire flutter developer india 2026, flutter developer india cost, flutter mvp india

LangGraph vs CrewAI vs AutoGen: Which Multi-Agent Framework Wins for India MVPs in 2026

13 min read

May 18, 2026

LangGraph, CrewAI, and AutoGen all promise the same thing — orchestrate three or four LLM calls into something that feels like an agent. Real cost, real latency, and real failure modes split them sharply once you ship. Here is which one survives a 200-user India MVP, which one bankrupts you, and which one I have spent ₹40K of OpenAI credits learning to avoid.

langgraph vs crewai vs autogen, multi-agent framework india 2026, langgraph india mvp

Hire React Native Developer India 2026: Founding Engineer vs Agency vs Freelance (Real Cost)

12 min read

May 18, 2026

A founding engineer in India ships your React Native MVP for ₹6–9L in 6 weeks. The same scope from a Bangalore agency quotes ₹22–35L and lands in 16. A Toptal freelancer is ₹4,500/hr and a 3-week hiring loop. Here is the real cost math for 2026, which model fails on which kind of app, and the decision tree I wish my last three clients had read.

hire react native developer india 2026, react native developer india cost, react native mvp india

Drizzle vs Prisma vs TypeORM — Real ORM Pick for India MVP 2026

14 min read

May 17, 2026

A Postgres-backed Next.js MVP shipping in 6 weeks: Drizzle cold-starts in ~40ms on Vercel Mumbai, Prisma at ~280ms (after the engine binary downloads), TypeORM at ~520ms with decorators bloating bundle size. Here is the real ORM math for an India MVP — cold start, type safety, migration story, edge runtime support, and which one to actually pick.

drizzle vs prisma vs typeorm, best orm india mvp 2026, typescript orm comparison

OpenAI vs Claude vs Gemini API — Real Cost for India MVP 2026

15 min read

May 16, 2026

A WhatsApp support bot doing 10K conversations a month costs roughly ₹1,250 on Gemini 2.5 Flash, ₹3,800 on GPT-5-mini, and ₹7,200 on Claude Sonnet 4 — before GST, FX margin, and the 2% TDS your CA will flag on foreign invoices. Here is the line-item cost math, the latency truth from a Mumbai-based MVP, and which API actually wins for which India use case.

openai vs claude vs gemini, llm api cost india, gpt-5 vs claude vs gemini pricing

Pinecone vs Qdrant vs pgvector — Which Vector DB for India RAG MVP (2026)

14 min read

May 15, 2026

At 500K embeddings for an Indian RAG MVP, Pinecone Standard costs ~₹6,200/month, self-hosted Qdrant runs ~₹1,400/month on Hetzner, and pgvector inside the Postgres you already pay for adds ₹0. Here is the real cost math, recall benchmark on a Hindi+English corpus, and the migration story when you outgrow each.

pinecone vs qdrant vs pgvector, vector database india 2026, rag mvp india

Vercel vs Railway vs Hetzner — India MVP Hosting Cost & Latency (2026)

13 min read

May 14, 2026

At 100K monthly requests for an Indian MVP, Vercel Pro lands at roughly ₹2,800/month, Railway hovers around ₹1,800/month, and a Hetzner CX22 + Cloudflare combo is ₹420/month. Here is the real cost math, the Mumbai latency truth, and the migration story when one of them stops fitting your scale.

vercel vs railway vs hetzner, india mvp hosting cost 2026, vercel pricing india

Clerk vs Supabase Auth vs Better-Auth — Which to Pick for India MVP (2026)

13 min read

May 13, 2026

At 10K monthly active users for an Indian MVP, Clerk Pro lands around ₹17,000/month, Supabase Auth is included in the same ₹2,100 Pro plan, and Better-Auth on your own Postgres costs ₹0 plus a weekend of engineering. Here is the real cost math, lock-in tradeoff, and the migration story when you outgrow the hosted option.

clerk vs supabase auth, better-auth india 2026, clerk pricing india mvp

Supabase vs Firebase for Indian MVPs in 2026 — Real Cost on 10K MAU

12 min read

May 12, 2026

Firebase tightened Spark-tier limits in Q1 2026 and Supabase opened the Mumbai ap-south-1 region in late 2025. At 10K MAU for an Indian MVP, Supabase Pro at $25/month now beats Firebase Blaze by 2-4x — here is the real cost math, RLS migration story, and when Firebase still wins.

supabase vs firebase india, supabase pricing india 2026, firebase alternative india

Self-Host n8n vs Zapier for Indian MVPs in 2026 — Real Cost on 50K Operations/Month

11 min read

May 11, 2026

Self-hosted n8n costs ₹600/month on a $7 Hetzner VPS for 50K operations; Zapier Professional charges $73/month (~₹6,100) for the same volume. Here is the real Docker setup, ops cost, and break-even math from my multi-platform social automation rig.

self host n8n, n8n vs zapier, n8n cost india

Razorpay vs Stripe for Indian MVPs in 2026 — Real Cost on a 100-User Month

9 min read

May 10, 2026

Razorpay charges 2% on cards and 0% on UPI for Indian MVPs; Stripe charges 4.3% plus GST plus cross-border 3% on the same volume. Here is the actual rupee math from MyFinancial production.

razorpay vs stripe, payment gateway india mvp, razorpay pricing 2026

Claude Code vs Hiring a Developer in 2026: $20 CLI or $80K Engineer?

11 min read

May 9, 2026

Claude Code is the strongest agentic coding CLI of 2026. But $20–200/month buys assistance, not a finished product. This is the line between Claude Code as a developer multiplier and Claude Code as a developer replacement, with the cost math that actually matters when you ship to real users.

claude code vs hire developer, claude code pricing 2026, anthropic claude code review

v0 by Vercel vs Hiring a Developer in 2026 — When Generative UI Hits the Day-30 Wall

13 min read

May 8, 2026

v0 ships a clickable Next.js + shadcn prototype in 20 minutes for $20 a month. Then on day 30 you need auth, a real database, RLS, Stripe webhooks that survive retries, and a deploy that does not break preview. Here is exactly when v0 is the right call, when a real engineer is, and the cost crossover most founders only see after the prototype is already in customer hands.

v0 by vercel vs hire developer, v0 vercel pricing 2026, v0 production limits

Replit Agent vs Hiring a Developer in 2026: Browser IDE Autonomy or Founding Engineer?

11 min read

May 6, 2026

Replit Agent v3 ships full-stack apps from a browser, runs its own tests, and deploys in one click. So why are founders still ending up with $400 monthly bills, broken auth, and rebuilds at month three? Honest pricing math, real failure modes, and when Replit Agent actually beats hiring a developer in 2026.

replit agent vs hire developer, replit agent pricing 2026, replit agent v3 review

Devin AI vs Hiring a Developer in 2026: $20/Month Agent or Founding Engineer?

10 min read

May 5, 2026

Devin AI dropped from $500/mo to $20/mo and now scores 51.5% on SWE-bench. Should you fire your developer and let Cognition's autonomous agent ship your MVP? The honest math, real failure modes, and when Devin actually saves money in 2026.

devin ai vs hire developer, devin ai pricing 2026, cognition devin review

Cursor AI vs Hire Developer 2026 — When $200 Plan Hurts More

13 min read

May 4, 2026

Cursor Pro is $20/month, Pro+ is $60, Ultra is $200. One Agent task burns 5–10 premium requests, and a single multi-step refactor on a real codebase eats your monthly credit pool by week two. Here is exactly when Cursor is the right call, when a developer is, and the cost crossover most founders only see after they have shipped a security bug to prod.

cursor ai vs hire developer, cursor ide cost 2026, cursor pro ultra pricing

Bolt.new vs Hire Developer 2026 — When AI Builder Hits Limits

12 min read

May 3, 2026

Bolt.new ships a working prototype in 30 minutes for $25/month. Then complexity arrives — a 31% success rate on real SaaS apps, 5–8 million tokens burned on one Supabase auth bug, and a $5K–$40K hardening bill. Here is exactly when Bolt is the right call, when a developer is, and the cost crossover most founders find too late.

bolt.new vs hire developer, bolt.new alternative 2026, bolt.new production limits

Hire AI Engineer India 2026 — Real Cost, Stack, Sprint Alternative

13 min read

May 2, 2026

AI engineers in the US cost $180K-$280K all-in. India full-time: $45K-$95K. A 6-week senior contract with a builder who has shipped RAG, MCP, and on-device AI in production: $15K-$25K flat. Here is the honest breakdown.

hire ai engineer india, hire ai engineer india 2026, ai engineer cost india

Hire Technical Co-Founder India 2026 — Cost, Equity, Alternative

11 min read

May 1, 2026

Technical co-founders in India cost 25-50% equity and 6-12 months to find. Here are the 2026 numbers, the 4-year dilution math, and the contrarian case that most pre-seed founders should hire a senior contractor first and a co-founder never.

hire technical co-founder india 2026, technical co-founder cost india, find technical cofounder for startup

Founding Engineer Equity 2026 — How Much to Negotiate (India)

11 min read

Apr 29, 2026

Carta says 1.5% for hire #1, dropping to 0.33% by hire #5. Pave median is 1%. AI talent pressure pushed the floor up since March 2026. Honest 2026 numbers — plus the $0-equity alternative most pre-seed founders should default to.

founding engineer equity percentage 2026, how much equity for founding engineer, first engineer equity startup
Abstract editorial cover illustrating Retool vs Custom Internal Tool in 2026

Retool vs Custom Internal Tool in 2026 — When to Buy, When to Build, When to Hire

9 min read

Apr 27, 2026

Retool wins for the first 18 months. Custom wins after $20K in seat licenses or when the workflow stops fitting tables and forms. Honest 2026 numbers from a senior engineer who builds both.

retool vs custom build · internal tool development cost · retool alternatives 2026

Your Supabase RLS Just Leaked Production Data — The 5 Bugs Every Vibe-Coded App Hits in 2026

10 min read

Apr 26, 2026

Supabase row-level security is opt-in, silent when wrong, and tested with a superuser token that bypasses everything. In January 2025, 170+ Lovable apps leaked their production databases because nobody enabled RLS. Here are the 5 RLS bugs every vibe-coded app ships, and the audit pattern a senior engineer runs to catch them before user 500.

supabase rls production bugs · supabase row level security · supabase rls leak

6-Week MVP Sprint — Week-by-Week Breakdown of What Actually Ships

11 min read

Apr 25, 2026

Most "6-week MVP" promises are 12-week projects with marketing copy. This is what a real 6-week sprint looks like — week one through week six, what gets built, what gets cut, and what production-ready means at day 42.

6 week mvp · 6 week mvp sprint · mvp sprint timeline

6-Week MVP Tech Stack in 2026 — The Boring Choices That Actually Ship

12 min read

Apr 25, 2026

Every "should I use X or Y" debate costs you a week of the sprint. Here is the boring, opinionated, time-tested stack that ships a production MVP in 6 weeks — Next.js, Postgres, Vercel, Stripe, and a handful of opinionated picks.

6 week mvp tech stack · mvp tech stack 2026 · best stack for mvp

What a $15K MVP Actually Includes — vs. the $50K Agency Quote You Just Got

11 min read

Apr 25, 2026

A $15K-$30K fixed-price MVP and a $50K-$100K agency quote ship the same thing 80% of the time. The difference is who absorbs the markup. Here is the line-item breakdown — what is in scope, what is overhead, and where the agency margin actually comes from.

mvp cost breakdown 2026 · 15k mvp vs 50k agency · mvp pricing

6-Week MVP Sprint vs 3-Month Agency Build — Which One Actually Ships First in 2026?

11 min read

Apr 25, 2026

Most "3-month MVP" agency builds take 5-6 months. Most "6-week sprints" take 6 weeks. The difference is not engineer talent — it is the contract structure and the scope discipline. Honest comparison from someone who has seen both fail and succeed.

6 week mvp vs agency · mvp sprint vs 3 month build · agency mvp timeline

Hire a Founding Engineer in India — Direct vs Toptal, Arc, Uplers in 2026

12 min read

Apr 25, 2026

Marketplaces like Toptal, Arc, and Uplers add 30-50% markup, 2-4 weeks of recruiter delay, and zero portfolio transparency. Here is how going direct compares — pricing, speed, quality, and what each model actually optimizes for.

hire founding engineer india · toptal vs direct hire · arc.dev review

Lovable App Production Bugs — 5 Fixes Need a Real Engineer (2026)

11 min read

Apr 25, 2026

Lovable, Bolt, and v0 ship 70% of an MVP in a weekend. The remaining 30% — auth edge cases, RLS, payment webhooks, performance, schema migrations — is where every vibe-coded app breaks. Here are the 5 production bugs that always need a real engineer to fix, and what the rescue work actually costs.

lovable production bugs · lovable rescue 2026 · vibe coding production

April 2026 AI News Decoded: 7 Stories That Actually Change What Founders Should Build

11 min read

Apr 24, 2026

Claude Mythos 5, GPT-5.4 Thinking, Gemini 3.1 Ultra, DeepSeek V4, Snap cutting 16% headcount — April 2026 reshaped the frontier. Most roundups list the news. This post tells founders, freelance devs, and founding engineers what to actually build, drop, or route next.

AI news April 2026 for founders · Claude Mythos 5 vs GPT 5.4 vs Gemini 3.1 · multi-model routing startup 2026

Founding Engineer vs Fractional CTO in 2026 — Which One Does Your Startup Actually Need?

12 min read

Apr 23, 2026

Fractional CTOs advise. Founding engineers ship. If you have an idea and no product, you need someone writing code — not slide decks. Here is the honest trade-off, with real costs, real timelines, and the signal that tells you which one to hire.

founding engineer vs fractional cto · do i need a fractional cto · hire founding engineer or cto

The Best Lovable Alternative in 2026 Is Not Another AI Builder — It Is a Developer Who Can Read the Code

11 min read

Apr 23, 2026

Every "best Lovable alternative" list recommends Bolt, Replit, v0, Emergent. The honest alternative for a founder whose Lovable app is breaking in production is a senior developer on a 2–4 week rescue contract. Here is why, what it costs, and what to look for.

lovable alternative · lovable alternative developer · hire developer fix lovable app

Founding Engineer vs Lovable in 2026 — When $180K Beats $25/mo

11 min read

Apr 22, 2026

Lovable ships your first prototype in a weekend. A founding engineer costs $180K + equity. Here is the honest trade-off, where Lovable breaks, and the in-between option nobody talks about.

founding engineer vs lovable · do i need a founding engineer · lovable vs hiring developer

India vs US MVP Developer Cost in 2026 — $18K or $120K for the Same App?

11 min read

Apr 22, 2026

The real 2026 breakdown of MVP cost in India vs the US — why the agency-vs-Silicon-Valley binary is a trap, and how independent senior engineers with AI tools now win on both axes.

india vs us mvp cost · offshore mvp cost 2026 · cost to build mvp india

Vibe Coding vs Hiring a Developer — When Lovable, Bolt, and Cursor Stop Being Enough

11 min read

Apr 22, 2026

Vibe coding ships prototypes in 2.4 days. Then the 500-user wall hits — auth bugs, payment edges, DB drift. The honest handoff playbook from AI tool to human engineer.

vibe coding vs traditional development · vibe coding mvp · when to hire developer after lovable

How I Built an Enterprise Deal Matching Platform with Spring Boot + Next.js + GPT-4o

10 min read

Apr 16, 2026

Architecture deep-dive into SynFlow — a full-stack intelligence platform that matches deals to profiles using rule-based scoring and AI-powered profile extraction from LinkedIn text.

enterprise deal matching platform · spring boot nextjs full stack · gpt-4o profile extraction

Cloud-First AI Is Dead. I Built a Fully Offline AI App to Prove It.

8 min read

Apr 16, 2026

Google just shipped an offline AI dictation app. Android 16 runs notification summaries on-device. The "cloud-first" mentality for AI apps is dying — here's how I built a production Android app with zero network permissions using Gemma 4, LiteRT, and regex fallback.

on device AI android 2026 · offline AI app development · gemma 4 on device LLM

₹805 Crore Lost to UPI Fraud This Year. I Built an Offline Scam Detector That Needs Zero Internet.

9 min read

Apr 16, 2026

1 in 5 Indian families have been hit by UPI fraud. 51% never report it. Cloud-based scam checkers need internet — exactly what victims in Tier 2/3 India don't have. Here's why I built ScamRakshak with zero network permissions.

UPI fraud India 2026 · scam detector app India · digital payment fraud protection

Building an Android Finance Tracker with SMS Auto-Import — Kotlin + Jetpack Compose

9 min read

Apr 15, 2026

How I built FinBaby — an offline Android app that reads bank SMS messages, auto-categorizes transactions, and provides 50/30/20 budgeting for Indian middle-class families.

android finance tracker app · kotlin jetpack compose tutorial · bank sms auto import android

Building a Multi-Tenant Retail SaaS with Spring Boot 3.4 + Java 21 — 12-Module Architecture

11 min read

Apr 14, 2026

How I architected RetailOS — an India-first multi-tenant retail platform with billing, inventory, GST invoicing, khata ledger, and offline sync in a 12-module Maven monorepo.

multi-tenant saas spring boot · java 21 spring boot 3.4 · retail saas india

I Built a 12-Module Multi-Tenant SaaS Platform Alone. Here's the Architecture That Made It Possible.

11 min read

Apr 14, 2026

The SaaS market hits $465B in 2026 and 70% of vendors use multi-tenancy. I built RetailOS — a 12-module Spring Boot monorepo with billing, inventory, GST invoicing, khata ledger, and offline sync — as a solo engineer. Here's every architecture decision.

build multi-tenant SaaS solo developer · spring boot multi-module monorepo · solo SaaS architecture 2026

Building an Offline-First Trip Planner with React Native + WatermelonDB + Offline Maps

10 min read

Apr 13, 2026

Architecture decisions behind TripHive — a collaborative trip planner that works without internet using WatermelonDB, PowerSync, and MapLibre with downloadable offline tiles.

offline first react native app · watermelondb react native · offline maps react native

Building an On-Device AI Scam Detector for Android — Gemma 4 + LiteRT + Regex Fallback

11 min read

Apr 12, 2026

How I built ScamRakshak — a fully offline Android app that detects scams using a 3-tier AI inference engine with Gemma 4 on-device LLM, LiteRT classification, and regex fallback. Zero internet, zero data collection.

on device ai android · gemma on device llm · scam detection app india

Building a Family Budget App with 8 Financial Modules — Kotlin + Jetpack Compose, Fully Offline

10 min read

Apr 11, 2026

Architecture deep-dive into PaisaGuard — a privacy-first Android app for middle-class families with expense tracking, grocery budget mode, bill calendar, debt snowball, and 4 more modules. Zero backend.

family budget app android · kotlin jetpack compose finance app · offline budget tracker

Building a Multi-Language React Native App with Expo SDK 52 — SanatanApp Architecture

9 min read

Apr 5, 2026

How I architected a 5-language devotional app with bundled JSON content, offline-first storage, and expo-av audio streaming — shipping to Play Store at ~15MB.

react native i18n · expo sdk 52 · react native offline app

Streaming Audio in React Native: expo-av with Public Domain Sources

7 min read

Apr 5, 2026

A practical guide to building a streaming audio player in React Native with expo-av — background playback, progress tracking, and global player state with zero backend cost.

expo-av audio streaming · react native audio player · expo audio background playback

From Idea to Play Store: Shipping SanatanApp in 4 Weeks

8 min read

Apr 5, 2026

The full story of building and shipping a React Native app to Google Play — from problem discovery to architecture decisions to the actual Play Store submission process.

react native play store · ship mobile app fast · expo eas build

How Much Does It Cost to Build a Mobile App in India? Real Numbers from a Developer (2026)

15 min read

Apr 5, 2026

Honest cost breakdown for building Android and iOS apps in India — from a freelance developer who has shipped apps to Play Store. No agency markup, no inflated estimates.

mobile app development cost India · cost to build app India 2026 · hire app developer India

How to Build an AI Chatbot for Your Business: Architecture, Cost & What Actually Works (2026)

16 min read

Apr 5, 2026

A developer's honest guide to building AI chatbots — WhatsApp bots, customer support agents, and LLM-powered assistants. What works, what doesn't, and what it actually costs.

build AI chatbot business · WhatsApp chatbot India · AI chatbot development cost

Freelance Developer vs Agency in India: An Honest Comparison from the Developer Side

15 min read

Apr 5, 2026

When should you hire a freelancer? When does an agency make sense? A working developer breaks down the real trade-offs — cost, quality, communication, and delivery.

freelance developer vs agency India · hire software developer India · freelance vs agency cost

Spring Boot vs Node.js for Your Startup Backend (2026)

7 min read

Apr 5, 2026

An honest comparison of Spring Boot and Node.js for startup backends — performance, hiring, ecosystem, and when each one actually makes sense.

spring boot vs nodejs · backend framework comparison · startup tech stack 2026

How to Build a SaaS MVP in 2026 — Complete Tech Stack Guide

9 min read

Apr 5, 2026

A practical guide to building your SaaS MVP — tech stack choices, cost breakdown, timeline, and the mistakes that kill most first-time founders.

build saas mvp · saas tech stack 2026 · mvp development cost

React Native vs Flutter in 2026: Which One for Your App?

7 min read

Apr 5, 2026

A practical comparison of React Native and Flutter in 2026 — performance, ecosystem, hiring, and which one I recommend based on your specific situation.

react native vs flutter 2026 · cross platform app framework · mobile app framework comparison

WhatsApp Business API Integration Guide for Indian Startups

8 min read

Apr 5, 2026

A practical guide to integrating WhatsApp Business API for Indian startups — providers, costs, message templates, and building automated bots that actually convert.

whatsapp business api india · whatsapp bot integration · twilio whatsapp api

PostgreSQL vs MongoDB: Which Database for Your Startup? (2026)

7 min read

Apr 5, 2026

A practical comparison of PostgreSQL and MongoDB for startups — when to use each, real performance numbers, and why most startups should just pick Postgres.

postgresql vs mongodb · database for startup · sql vs nosql 2026

How to Add AI to Your Existing Business App — Without Rebuilding Everything

8 min read

Apr 5, 2026

A practical guide to adding AI features to your existing application — where to start, what to avoid, and how to get real ROI without a complete rewrite.

integrate ai existing app · add ai to business · llm integration existing system

How to Hire a Software Developer: 10 Questions to Ask Before Signing

7 min read

Apr 5, 2026

The 10 questions you should ask before hiring a freelance developer — how to evaluate technical skills, communication, and reliability without being technical yourself.

hire software developer questions · evaluate developer · technical interview freelancer

How to Build an App Like Uber or Zomato — Architecture & Real Costs

8 min read

Apr 5, 2026

The real architecture and costs behind building an on-demand app like Uber or Zomato — what you actually need for an MVP vs what agencies will try to sell you.

build app like uber cost · build app like zomato · on demand app development

Why Your MVP Should Cost Under $10,000 — And How to Make It Happen

7 min read

Apr 5, 2026

Most MVPs are overbuilt and overpriced. Here is how to scope, build, and launch a real product for under $10K — with examples from projects I have shipped.

mvp cost · cheap mvp development · build mvp budget

Microservices vs Monolith for Startups: Stop Overengineering

7 min read

Apr 5, 2026

Why your startup should start with a monolith, when microservices actually make sense, and how to avoid the architecture astronaut trap.

microservices vs monolith startup · startup architecture · when to use microservices

AWS Bedrock vs OpenAI — Which One to Pick for Your Startup (With Real Costs)

9 min read

Apr 2, 2026

I built a financial advisor AI with AWS Bedrock (Nova Lite) after starting with OpenAI. Here's a real cost and latency comparison from production, not a marketing page.

aws bedrock vs openai · bedrock nova lite review · aws bedrock cost

Using RAG for SQL Generation — Why Embeddings Beat Prompt Stuffing

8 min read

Jan 28, 2026

How pgvector embeddings improve LLM-to-SQL accuracy by providing schema context instead of dumping entire schemas into prompts.

rag sql generation · pgvector embeddings sql · text to sql pgvector

Building an MCP Server with Spring Boot — A Practical Guide

10 min read

Jan 20, 2026

Implementing the Model Context Protocol for AI assistant tool integration using Spring Boot and Spring AI.

mcp server spring boot · model context protocol tutorial · spring ai mcp

Offline-First PWA Patterns — Service Workers, IndexedDB, and Background Sync

7 min read

Jan 15, 2026

Service workers, IndexedDB, and background sync patterns used in MicroItinerary for reliable offline-first travel planning.

pwa offline sync indexeddb · service worker background sync · offline first web app

Contribution Feed

rohitraj.tech Redesign

open source

backendscale·2026-01-31

Complete redesign from consultancy landing page to engineer-centric project directory with documentation-first aesthetic.

→ Live at rohitraj.tech

src/app · src/components · src/data
GitHub →

MicroItinerary — AI Travel Planner

ai systems

MicroItinerary·2026-01-27

AI-powered PWA for planning annual travel itineraries. Features intelligent destination suggestions via OpenAI GPT-4, cost estimation in INR, and Splitwise-style expense splitting for group trips.

→ Full-stack PWA with offline support, Google OAuth, Redis caching

backend · web · docker-compose.yml
GitHub →

StellarMIND — Chat-to-SQL with pgvector

ai systems

spring-ai-mcp-server·2026-01-25

Spring Boot MCP server that converts natural language questions into read-only SQL using LLM. Retrieves schema context via pgvector for RAG-based query generation.

→ Natural language → SQL with safe read-only execution

stellarmind-server · stellarmind-client
GitHub →

AWS SQS Integration Patterns

open source

sqs·2021-12-27

Reference implementation for AWS SQS integration with Spring Boot including producer/consumer patterns, retry logic, and dead-letter queue handling.

producer · consumer
GitHub →

Repository Summary

backendscale
TypeScript · active

Engineering work visibility — portfolios show results but not thinking

Full implementation: Next.js 16, React 19, documentation-first design system

src/app · src/components · src/data

MicroItinerary
Java · active

Travel apps lack intelligent suggestions and group expense management

Full-stack: Spring Boot 3.2.2 + Java 21 backend, React 18 + Vite PWA, OpenAI integration

backend · web · test

spring-ai-mcp-server
Java · active

Natural language database queries with retrieval-augmented context via pgvector

Full MCP server implementation, pgvector integration, Chain-of-Thought UI

stellarmind-server · stellarmind-client · postman

my-expense
Java · maintenance

Simple expense tracking without bloated features

Full-stack Java application with REST API

api · persistence · reports

sqs
Java · archived

Reference implementation for AWS SQS with Spring Boot

Producer/consumer patterns, retry logic, DLQ handling

producer · consumer · config

wealwa-api
Java · archived

Personal wealth tracking and portfolio management

REST API design, portfolio analytics

api · portfolio · analytics

View all repositories on GitHub →

Project Notes

backendscale (rohitraj.tech)

frontend

Why It Exists

Engineering work is often invisible. Traditional portfolios show polished results but not the thinking process.

Core Technical Challenge

Creating a documentation-first design system that feels like internal engineering docs while remaining visually professional.

Architecture Overview

  • Next.js 16 with App Router for static generation where possible
  • React 19 with server components for optimal performance
  • Tailwind v4 for utility-first styling
  • Data-driven content structure in src/data/ for easy updates

Trade-offs & Decisions

  • Chose vanilla CSS over component libraries for full control, at cost of development speed
  • Static generation means content updates require redeployment
  • Minimal dependencies reduce maintenance burden but limit feature velocity
Production: deployed on Vercel
View Repository →

MicroItinerary

ai systems

Why It Exists

Travel apps lack intelligent suggestions based on season/budget and don't handle group expense splitting well.

Core Technical Challenge

Integrating OpenAI GPT-4 for destination suggestions and cost estimation while maintaining offline-first PWA capabilities.

Architecture Overview

  • React 18 + Vite PWA frontend with IndexedDB for offline support
  • Spring Boot 3.2.2 + Java 21 backend with Flyway migrations
  • PostgreSQL 16 + Redis caching layer
  • Google OAuth 2.0 + JWT authentication
  • OpenAI GPT-4 API for AI features

Trade-offs & Decisions

  • PWA over native app — broader reach but limited device APIs
  • OpenAI API adds latency and cost vs simpler rule-based suggestions
  • Expense splitting algorithm prioritizes simplicity over Splitwise feature-parity
Development: backend and frontend functional, needs polish
View Repository →
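
The simplicity-first expense splitting mentioned in the trade-offs can be illustrated with a short sketch. This is not MicroItinerary's actual code (the project's backend is Java); it is a minimal Python illustration that assumes equal shares, integer amounts in paise, and a greedy settlement pass. The names `settle`, `asha`, `ben`, and `chitra` are hypothetical.

```python
def settle(expenses):
    """Return (debtor, creditor, amount) transfers that clear all balances.

    expenses: list of (payer, amount_in_paise, participants) tuples.
    Equal split only; leftover paise go to the first participants listed.
    """
    balance = {}
    for payer, amount, participants in expenses:
        share, remainder = divmod(amount, len(participants))
        balance[payer] = balance.get(payer, 0) + amount
        for i, person in enumerate(participants):
            # The first `remainder` people absorb one extra paisa each.
            owed = share + (1 if i < remainder else 0)
            balance[person] = balance.get(person, 0) - owed

    # Positive balance: is owed money. Negative balance: owes money.
    creditors = [[p, b] for p, b in sorted(balance.items()) if b > 0]
    debtors = [[p, -b] for p, b in sorted(balance.items()) if b < 0]

    transfers = []
    i = j = 0
    while i < len(debtors) and j < len(creditors):
        pay = min(debtors[i][1], creditors[j][1])
        transfers.append((debtors[i][0], creditors[j][0], pay))
        debtors[i][1] -= pay
        creditors[j][1] -= pay
        if debtors[i][1] == 0:
            i += 1
        if creditors[j][1] == 0:
            j += 1
    return transfers


trip = [
    ("asha", 3000, ["asha", "ben", "chitra"]),
    ("ben", 1500, ["asha", "ben", "chitra"]),
]
print(settle(trip))
```

Working in integer paise avoids floating-point rounding drift, and the greedy pass keeps the logic short at the cost of not always producing the fewest possible transfers.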

StellarMIND (spring-ai-mcp-server)

ai systems

Why It Exists

Business users need to query databases without knowing SQL. Existing tools lack context-aware query generation.

Core Technical Challenge

Retrieval-augmented SQL generation using pgvector for schema context, with strict read-only query enforcement for safety.
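
To make the retrieval step concrete, here is a minimal Python sketch of ranking schema chunks by similarity to a question and placing only the best matches in the prompt. It is an in-memory stand-in, not the project's code: in pgvector the ranking would typically be an `ORDER BY` on a distance operator such as `<=>` (cosine distance). The function names, the three-dimensional toy embeddings, and the prompt wording are all assumptions for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_schema_chunks(query_embedding, chunks, k=2):
    """Return the text of the k chunks most similar to the query embedding."""
    ranked = sorted(
        chunks,
        key=lambda c: cosine_similarity(query_embedding, c["embedding"]),
        reverse=True,
    )
    return [c["text"] for c in ranked[:k]]


def build_prompt(question, context_chunks):
    """Assemble a prompt that carries only the retrieved schema context."""
    context = "\n".join(context_chunks)
    return f"Schema:\n{context}\n\nQuestion: {question}\nWrite one read-only SQL query."


# Toy data: real embeddings would come from an embedding model.
schema_chunks = [
    {"text": "orders(id, customer_id, total)", "embedding": [1.0, 0.0, 0.0]},
    {"text": "customers(id, name)", "embedding": [0.8, 0.6, 0.0]},
    {"text": "audit_log(id, event)", "embedding": [0.0, 0.0, 1.0]},
]
question_embedding = [1.0, 0.1, 0.0]

selected = top_k_schema_chunks(question_embedding, schema_chunks, k=2)
print(build_prompt("What is the total order value per customer?", selected))
```

The point of the design is that the prompt grows with `k`, not with the size of the database schema.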

Architecture Overview

  • Spring Boot MCP server with Tool interface for query execution
  • pgvector for storing schema knowledge chunks and embeddings
  • Spring AI for LLM integration (provider-agnostic)
  • Chain-of-Thought (CoT) web interface for query debugging
  • Read-only SQL enforcement (only SELECT, WITH allowed)
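
The read-only rule in the last bullet could be approximated with a check like the one below. This is a minimal Python sketch under stated assumptions, not StellarMIND's implementation: the function names and the keyword deny-list are hypothetical, and a keyword filter of this kind is not exhaustive.

```python
import re

# Statements must start with one of these keywords (per the SELECT/WITH rule).
ALLOWED_START = re.compile(r"^(select|with)\b", re.IGNORECASE)

# Assumed deny-list: write or DDL keywords that may hide inside a WITH clause.
FORBIDDEN = re.compile(
    r"\b(insert|update|delete|merge|drop|alter|create|truncate|grant|revoke|copy|call)\b",
    re.IGNORECASE,
)


def strip_comments_and_literals(sql):
    """Blank out comments and single-quoted strings so their contents are ignored.

    Simplification: comments are stripped first, so a comment marker inside a
    string literal is not handled correctly. A real parser would avoid this.
    """
    sql = re.sub(r"/\*.*?\*/", " ", sql, flags=re.DOTALL)  # block comments
    sql = re.sub(r"--[^\n]*", " ", sql)                    # line comments
    sql = re.sub(r"'(?:[^']|'')*'", "''", sql)             # string literals
    return sql


def is_read_only(sql):
    """Return True only for a single statement that starts with SELECT or WITH."""
    cleaned = strip_comments_and_literals(sql).strip()
    if cleaned.endswith(";"):
        cleaned = cleaned[:-1].rstrip()  # allow one trailing semicolon
    if not cleaned or ";" in cleaned:
        return False  # empty input or multiple statements
    if not ALLOWED_START.match(cleaned):
        return False
    return FORBIDDEN.search(cleaned) is None


for candidate in [
    "SELECT id FROM orders",
    "SELECT 1; DROP TABLE orders",
    "WITH d AS (DELETE FROM orders RETURNING *) SELECT * FROM d",
]:
    print(is_read_only(candidate), "<-", candidate)
```

A text-level check like this is best treated as a first line of defence. Running generated queries through a database role or transaction that only has read privileges gives a stronger guarantee.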

Trade-offs & Decisions

  • Read-only restriction limits use cases but ensures safety
  • pgvector requires PostgreSQL — not database-agnostic
  • MCP transport (stdio) limits deployment patterns vs HTTP
Development: core query flow working, needs UI polish
View Repository →

← Projects
Repositories →

Rohit Raj, Backend & AI Systems Engineer

Services

AI Agent Host · Founding Engineer for Hire in India · Mobile App Development · AI Chatbot Development · Full-Stack Development

Get Updates