Building an On-Device AI Scam Detector for Android — Gemma 4 + LiteRT + Regex Fallback

TL;DR

How I built ScamRakshak — a fully offline Android app that detects scams using a 3-tier AI inference engine with Gemma 4 on-device LLM, LiteRT classification, and regex fallback. Zero internet, zero data collection. I built ScamRakshak, an on-device AI scam detector for Android using a 3-tier inference engine — Gemma 4 LLM via ML Kit GenAI for contextual analysis, LiteRT binary classifier for broad device support, and a regex fallback engine for universal coverage — with zero network permissions, Hindi-first bilingual output, and four input modes (text, screenshot OCR, URL analysis, UPI ID verification)

India's ₹1,750 Crore Scam Problem

By Rohit Raj — Founding Engineer · 10+ yrs MVP shipping · LinkedIn

I built ScamRakshak, an on-device AI scam detector for Android using a 3-tier inference engine — Gemma 4 LLM via ML Kit GenAI for contextual analysis, LiteRT binary classifier for broad device support, and a regex fallback engine for universal coverage — with zero network permissions, Hindi-first bilingual output, and four input modes (text, screenshot OCR, URL analysis, UPI ID verification) to protect India's 500 million Hindi-speaking smartphone users from digital fraud.

India loses over ₹1,750 crore annually to digital scams. WhatsApp forwards promising "KYC update or account blocked." SMS messages with "you've won ₹5,00,000, click here." Fake UPI payment requests from "Amazon refund department."

The victims aren't just elderly or non-technical users. Scammers are sophisticated — they use correct bank names, realistic formatting, and urgency tactics that catch even tech-savvy people off guard.

Existing solutions fail for three reasons:

They need internet — Cloud-based scam checkers don't work when the user is offline or has limited data
They collect data — Uploading suspicious messages to a server means sharing personal financial information
They're English-only — 500M+ Indians communicate primarily in Hindi and Hinglish

ScamRakshak flips all three: fully offline, zero data collection, Hindi-first. The app physically cannot send data anywhere — it has zero network permissions in the Android manifest.

The 3-Tier AI Inference Architecture

The biggest design challenge: how do you run scam detection on a ₹8,000 Android phone with no internet?

You can't assume the user has a flagship phone. You can't assume they have a recent Android version. You can't assume Google Play Services are up to date. You need a system that degrades gracefully.

Tier 1: ML Kit GenAI with Gemma 4 (Best quality)

Google's ML Kit GenAI API runs Gemma 4 — a small but capable LLM — entirely on-device via Google AICore. It understands context, can analyze message intent, and generates bilingual explanations.

When available, the prompt is: "Analyze this message for scam indicators. Return a risk score (0-100), the scam type if detected, and explain why in both Hindi and English."

Tier 2: LiteRT Classification (Fast fallback)

LiteRT (the successor to TensorFlow Lite) runs a lightweight scam classification model. No LLM, no text generation — just a binary classifier with confidence score. Faster than Tier 1, works on older devices, but can't explain WHY something is a scam.

Tier 3: Rule-Based Regex Engine (Universal fallback)

Pattern matching against known scam signatures:

UPI ID format validation
Known scam phone number patterns
Urgency keyword detection ("immediate", "account blocked", "last chance")
URL shortener detection (bit.ly, tinyurl in financial messages)
Lottery/prize claim patterns

This tier runs on literally any Android device. No ML runtime needed. No model download. Just regex patterns against a local database of scam signatures.

Fallback chain: The app checks if Tier 1 is available (requires AICore). If not, falls to Tier 2 (requires LiteRT runtime). If not, uses Tier 3 (always available). The user sees a risk score regardless of which tier processes the message.

Why Is Hindi-First Bilingual Analysis Essential?

Most scam messages in India arrive in Hindi, Hinglish (Hindi written in English script), or a mix of Hindi and English. A scam detector that only understands English misses the majority of threats.

Tier 1 (Gemma 4) handles this natively — the model understands Hindi, Hinglish, and code-switched text. The output is bilingual:

Example analysis of "Aapka SBI account block ho jayega. Turant ye link click karein":

Risk Score: 92/100 — High Risk

Hindi: "Yeh message scam hai. Koi bhi bank link click karne ko nahi kahegi. Apne bank branch mein call karein."

English: "This message is a scam. No bank asks you to click a link. Call your bank branch directly."

Tier 3 regex also supports Hindi patterns — keyword lists include both Devanagari script and transliterated Hindi. "तुरंत" (immediately) and "turant" both trigger urgency detection.

This bilingual approach is critical for the target audience. A Hindi-speaking user from Tier 2/3 India needs the explanation in their language. An English-speaking user needs it in English. Both get both.

What Input Modes Does ScamRakshak Support?

Scams arrive through multiple channels. ScamRakshak handles all of them:

1. Text Messages (Primary) User pastes a suspicious SMS or WhatsApp message. The AI analyzes the text, detects scam patterns, and returns a risk score with explanation.

2. Screenshots (CameraX + OCR) User takes a screenshot of a suspicious WhatsApp forward or social media post. CameraX captures the image, ML Kit Text Recognition extracts the text, and the scam analysis runs on the extracted text.

Why screenshots? Many Indian users receive scams as forwarded images — text embedded in images to bypass spam filters. OCR handles this.

3. URL Analysis User pastes a suspicious URL. The app checks: - Is it a URL shortener masking the real destination? - Does the domain mimic a legitimate brand? (e.g., "sbi-secure-update.com") - Does it match known phishing domain patterns?

All checks run locally against a pattern database. No HTTP request is made to the URL — the app never connects to potentially malicious sites.

4. UPI ID Verification User enters a UPI ID (e.g., "refund.amazon@paytm"). The app checks: - Format validity - Known scam UPI ID patterns - Suspicious naming patterns ("refund", "cashback", "lottery" in UPI handles)

UPI scams are the fastest-growing fraud vector in India. A dedicated input mode for UPI IDs catches scams that text analysis might miss.

Privacy Architecture: Zero Network Permissions

This isn't just a feature — it's an architectural constraint enforced at the Android manifest level.

xml

<!-- AndroidManifest.xml -->
<!-- No INTERNET permission. The app cannot make any network request. -->
<!-- No READ_CONTACTS. No ACCESS_FINE_LOCATION. No READ_SMS (user pastes manually). -->
<uses-permission android:name="android.permission.CAMERA" />  <!-- For screenshot OCR only -->

The app requests exactly one permission: CAMERA (for screenshot capture). No internet. No contacts. No SMS reading. No location.

Why this matters: - Users can verify by checking App Info → Permissions. Seeing "No permissions" builds immediate trust. - Even if a malicious update tried to exfiltrate data, it physically can't — there's no network permission. - This passes any security audit instantly. No data flows to audit because there are none.

Room database stores: - Scam pattern database (bundled in APK, updated via Play Store releases) - Scan history (for the user's reference only) - No personal data is stored. Messages are analyzed and discarded unless the user explicitly saves the scan result.

Tech Stack & What This Proves

Layer	Technology	Why
UI	Jetpack Compose + Material 3	Modern declarative UI, dark mode, accessibility
AI Tier 1	ML Kit GenAI (Gemma 4)	On-device LLM, bilingual understanding
AI Tier 2	LiteRT	Lightweight classification, broad device support
AI Tier 3	Regex engine	Universal fallback, zero dependencies
OCR	ML Kit Text Recognition	Extract text from screenshot images
Camera	CameraX	Modern camera API for screenshot capture
Database	Room	Local scan history, scam pattern storage
DI	Hilt	Standard Android dependency injection

What ScamRakshak demonstrates:

On-device AI is production-ready. Gemma 4 via ML Kit GenAI runs well on mid-range phones. You don't need cloud APIs for NLP tasks.

Graceful degradation is non-negotiable for India. Your app must work on a ₹8,000 phone running Android 8. The 3-tier fallback guarantees universal coverage.

Privacy-by-architecture beats privacy-by-policy. "We don't collect data" is a promise. "We have no network permission" is a guarantee.

Hindi-first isn't a translation — it's a design philosophy. Building for 500M Hindi speakers means the primary UX must be Hindi, not English with a translation toggle.

Single-purpose apps can be technically deep. ScamRakshak does one thing — detect scams — but the engineering behind 3-tier inference, bilingual output, multi-modal input, and zero-network architecture is substantial.

Model size and APK impact: The LiteRT binary classifier adds approximately 15MB to the APK size. The Gemma 4 model is not bundled — it downloads through Google AICore separately (approximately 2GB). The regex pattern database adds under 1MB. Total APK size is approximately 12MB, which is smaller than most social media apps. For users on limited storage, Tier 2 and Tier 3 provide full protection without the Gemma 4 model download.

Testing the 3-tier fallback chain: Each tier is tested independently with a curated dataset of known scam messages and legitimate messages. The fallback logic is tested by mocking AICore availability — when AICore returns unavailable, the app must seamlessly fall to Tier 2 without any user-visible delay. Integration tests verify that the same message produces consistent risk scores across tiers (within a 15-point tolerance), ensuring that a scam message flagged by Tier 1 is also caught by Tier 2 and Tier 3.

The pattern I run for founders in this situation is either a mobile app development or a fractional CTO engagement — pick based on whether you need shipped code or shipped *and* maintained code.

Two posts that pick up where this one ends: How to Build an AI Chatbot for Your Business: Architecture, Cost & What… and Building an Android Finance Tracker with SMS Auto-Import.

Frequently Asked Questions

Q: How does ScamRakshak differ from Truecaller's spam detection?

Truecaller focuses on identifying spam callers using a cloud-based phone number database — it requires internet access to check its database. ScamRakshak analyzes message content, URLs, UPI IDs, and screenshots using on-device AI with zero internet. Truecaller cannot analyze WhatsApp forwards, SMS text content, or screenshot-based scams. The two tools are complementary — Truecaller identifies who is calling, ScamRakshak analyzes what they are saying.

Q: Can the Gemma 4 model detect new scam types it was not specifically trained on?

Yes, because Gemma 4 is a general-purpose LLM that understands language intent, not just pattern matching. When a novel scam type emerges — say, a fake income tax refund scheme — Gemma 4 recognizes the urgency tactics, financial bait, and suspicious link patterns even without specific training on that scam category. The regex tier cannot detect novel scams, but the LLM tier provides an adaptive layer that evolves with scam sophistication.

Q: Why does ScamRakshak only request CAMERA permission?

CAMERA is needed solely for the screenshot OCR feature — capturing images of suspicious WhatsApp forwards or social media posts. The app does not request READ_SMS (user pastes messages manually), INTERNET (zero network architecture), READ_CONTACTS, or ACCESS_LOCATION. Minimizing permissions maximizes user trust and simplifies security audits. Users can verify the app's permissions in Android Settings at any time.

Q: What is the accuracy difference between Tier 1 and Tier 3?

Tier 1 (Gemma 4) provides the highest accuracy with contextual reasoning — it understands why a message is suspicious and generates bilingual explanations. Tier 3 (regex) has near-100% precision for known scam patterns but cannot detect novel scams or explain reasoning. Tier 2 (LiteRT) falls between the two — it catches structural patterns that regex misses but cannot generate explanations. For known scam types, all three tiers produce similar risk scores.

Q: How often is the scam pattern database updated?

The Room database containing scam signatures ships with APK updates through the Google Play Store, typically on a weekly release cycle. New scam patterns are identified from public reporting databases and security research, then added to the regex pattern set and LiteRT training data. The Gemma 4 model itself is updated by Google through AICore independently of the app release cycle.

TL;DR

India's ₹1,750 Crore Scam Problem

By Rohit Raj — Founding Engineer · 10+ yrs MVP shipping · LinkedIn

Existing solutions fail for three reasons:

They need internet — Cloud-based scam checkers don't work when the user is offline or has limited data
They collect data — Uploading suspicious messages to a server means sharing personal financial information
They're English-only — 500M+ Indians communicate primarily in Hindi and Hinglish

ScamRakshak flips all three: fully offline, zero data collection, Hindi-first. The app physically cannot send data anywhere — it has zero network permissions in the Android manifest.

The 3-Tier AI Inference Architecture

The biggest design challenge: how do you run scam detection on a ₹8,000 Android phone with no internet?

You can't assume the user has a flagship phone. You can't assume they have a recent Android version. You can't assume Google Play Services are up to date. You need a system that degrades gracefully.

Tier 1: ML Kit GenAI with Gemma 4 (Best quality)

Google's ML Kit GenAI API runs Gemma 4 — a small but capable LLM — entirely on-device via Google AICore. It understands context, can analyze message intent, and generates bilingual explanations.

When available, the prompt is: "Analyze this message for scam indicators. Return a risk score (0-100), the scam type if detected, and explain why in both Hindi and English."

Tier 2: LiteRT Classification (Fast fallback)

Tier 3: Rule-Based Regex Engine (Universal fallback)

Pattern matching against known scam signatures:

UPI ID format validation
Known scam phone number patterns
Urgency keyword detection ("immediate", "account blocked", "last chance")
URL shortener detection (bit.ly, tinyurl in financial messages)
Lottery/prize claim patterns

This tier runs on literally any Android device. No ML runtime needed. No model download. Just regex patterns against a local database of scam signatures.

Why Is Hindi-First Bilingual Analysis Essential?

Most scam messages in India arrive in Hindi, Hinglish (Hindi written in English script), or a mix of Hindi and English. A scam detector that only understands English misses the majority of threats.

Tier 1 (Gemma 4) handles this natively — the model understands Hindi, Hinglish, and code-switched text. The output is bilingual:

Example analysis of "Aapka SBI account block ho jayega. Turant ye link click karein":

Risk Score: 92/100 — High Risk

Hindi: "Yeh message scam hai. Koi bhi bank link click karne ko nahi kahegi. Apne bank branch mein call karein."

English: "This message is a scam. No bank asks you to click a link. Call your bank branch directly."

Tier 3 regex also supports Hindi patterns — keyword lists include both Devanagari script and transliterated Hindi. "तुरंत" (immediately) and "turant" both trigger urgency detection.

What Input Modes Does ScamRakshak Support?

Scams arrive through multiple channels. ScamRakshak handles all of them:

1. Text Messages (Primary) User pastes a suspicious SMS or WhatsApp message. The AI analyzes the text, detects scam patterns, and returns a risk score with explanation.

Why screenshots? Many Indian users receive scams as forwarded images — text embedded in images to bypass spam filters. OCR handles this.

All checks run locally against a pattern database. No HTTP request is made to the URL — the app never connects to potentially malicious sites.

UPI scams are the fastest-growing fraud vector in India. A dedicated input mode for UPI IDs catches scams that text analysis might miss.

Privacy Architecture: Zero Network Permissions

This isn't just a feature — it's an architectural constraint enforced at the Android manifest level.

xml

<!-- AndroidManifest.xml -->
<!-- No INTERNET permission. The app cannot make any network request. -->
<!-- No READ_CONTACTS. No ACCESS_FINE_LOCATION. No READ_SMS (user pastes manually). -->
<uses-permission android:name="android.permission.CAMERA" />  <!-- For screenshot OCR only -->

The app requests exactly one permission: CAMERA (for screenshot capture). No internet. No contacts. No SMS reading. No location.

Tech Stack & What This Proves

Layer	Technology	Why
UI	Jetpack Compose + Material 3	Modern declarative UI, dark mode, accessibility
AI Tier 1	ML Kit GenAI (Gemma 4)	On-device LLM, bilingual understanding
AI Tier 2	LiteRT	Lightweight classification, broad device support
AI Tier 3	Regex engine	Universal fallback, zero dependencies
OCR	ML Kit Text Recognition	Extract text from screenshot images
Camera	CameraX	Modern camera API for screenshot capture
Database	Room	Local scan history, scam pattern storage
DI	Hilt	Standard Android dependency injection

What ScamRakshak demonstrates:

On-device AI is production-ready. Gemma 4 via ML Kit GenAI runs well on mid-range phones. You don't need cloud APIs for NLP tasks.

Graceful degradation is non-negotiable for India. Your app must work on a ₹8,000 phone running Android 8. The 3-tier fallback guarantees universal coverage.

Privacy-by-architecture beats privacy-by-policy. "We don't collect data" is a promise. "We have no network permission" is a guarantee.

Hindi-first isn't a translation — it's a design philosophy. Building for 500M Hindi speakers means the primary UX must be Hindi, not English with a translation toggle.

Single-purpose apps can be technically deep. ScamRakshak does one thing — detect scams — but the engineering behind 3-tier inference, bilingual output, multi-modal input, and zero-network architecture is substantial.

The pattern I run for founders in this situation is either a mobile app development or a fractional CTO engagement — pick based on whether you need shipped code or shipped *and* maintained code.

Two posts that pick up where this one ends: How to Build an AI Chatbot for Your Business: Architecture, Cost & What… and Building an Android Finance Tracker with SMS Auto-Import.

Frequently Asked Questions

Q: How does ScamRakshak differ from Truecaller's spam detection?

Q: Can the Gemma 4 model detect new scam types it was not specifically trained on?

Q: Why does ScamRakshak only request CAMERA permission?

Q: What is the accuracy difference between Tier 1 and Tier 3?

Q: How often is the scam pattern database updated?

Building an On-Device AI Scam Detector for Android — Gemma 4 + LiteRT + Regex Fallback

TL;DR

India's ₹1,750 Crore Scam Problem

The 3-Tier AI Inference Architecture

Why Is Hindi-First Bilingual Analysis Essential?

What Input Modes Does ScamRakshak Support?

Privacy Architecture: Zero Network Permissions

Tech Stack & What This Proves

Frequently Asked Questions

Read Next

Inkling 975B: The Open-Weights Model Almost Nobody Should Self-Host (2026)

Bonsai 27B: A 27B Model on Your Phone — and the One Benchmark That Collapses (2026)

Building an On-Device AI Scam Detector for Android — Gemma 4 + LiteRT + Regex Fallback

TL;DR

India's ₹1,750 Crore Scam Problem

The 3-Tier AI Inference Architecture

Why Is Hindi-First Bilingual Analysis Essential?

What Input Modes Does ScamRakshak Support?

Privacy Architecture: Zero Network Permissions

Tech Stack & What This Proves

Frequently Asked Questions

Read Next

Inkling 975B: The Open-Weights Model Almost Nobody Should Self-Host (2026)

Bonsai 27B: A 27B Model on Your Phone — and the One Benchmark That Collapses (2026)