April 9, 2024
What Fuzzify Solves
Laxmi, Lakshmi, Laxhmi, or Lackshmy, which means exact-match search quietly misses valid records.
Fuzzify was built to make those searches tolerant, pronunciation-aware, and useful in real field conditions where speed matters.
Core Idea
How The Pipeline Works
- A user enters the name they want to search.
- A fine-tuned
Llama 3.2 1Bmodel predicts likely pronunciations in IPA form. - A custom embedder turns those pronunciation forms into vectors.
- The vectors are stored and queried in
Chroma DB. - Cosine similarity surfaces the closest matches across spelling variants.
Why This Approach Matters
- It improves recall for transliterated and inconsistently spelled names.
- It keeps the search lightweight enough for practical deployment.
- It moves the matching logic closer to how names sound, not just how they are typed.
- It reduces the friction of searching messy public datasets under time pressure.
Product Shape
Flutterhandled the mobile-first search experience.FastAPIserved the retrieval pipeline.Chroma DBstored the phonetic vectors.Unslothwas used while fine-tuning theLlama 3.2 1Bmodel.
Outcome
SIH Grand Finale 2024, Fuzzify demonstrates how a compact LLM pipeline can improve name lookup in public-sector workflows where data quality is uneven but search accuracy is critical.