Scaling fuzzy multilingual recipe search

Deep Dive

Posted by h/panda_wu22 • Mar 29, 2026

I run a recipe site with ~100M ingredient records and need low-latency fuzzy search that matches Devanagari and Latin transliterations (आलू vs aloo) while also handling typos and compound words. What practical architecture would you use, and what exact Elasticsearch or Postgres settings (analyzers, tokenizers, n-gram vs trigram, phonetic/transliteration plugins, shard sizing, memory and index settings)? And what data migration steps would you follow to roll this out with minimal downtime?
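
To make the matching problem concrete, here is a minimal Python sketch of the two pieces the question combines: folding Devanagari into a Latin form, then scoring trigram overlap so near-misses still match. Everything in it is an assumption for illustration: the mapping tables (`VOWELS`, `CONSONANTS`, `MATRAS`) are hypothetical and cover only the आलू example, and `trigrams` pads words with two leading spaces and one trailing space, roughly in the style pg_trgm documents. It is not a production transliterator or an actual Elasticsearch/Postgres configuration.

```python
# Hypothetical, deliberately tiny mapping tables: they cover only the
# characters needed for the आलू example, not Devanagari in general.
VOWELS = {"आ": "aa"}
CONSONANTS = {"ल": "l"}
MATRAS = {"ू": "oo"}


def transliterate(text: str) -> str:
    """Fold Devanagari into a rough Latin form using the toy tables above."""
    out: list[str] = []
    inherent = False  # True when the last piece is a consonant's inherent "a"
    for ch in text:
        if ch in MATRAS and inherent:
            out[-1] = MATRAS[ch]  # the vowel sign replaces the inherent "a"
            inherent = False
        elif ch in CONSONANTS:
            out.append(CONSONANTS[ch])
            out.append("a")
            inherent = True
        else:
            out.append(VOWELS.get(ch, ch))  # unknown characters pass through
            inherent = False
    return "".join(out)


def trigrams(word: str) -> set[str]:
    """Set of 3-character windows over the lowercased, space-padded word."""
    padded = "  " + word.lower() + " "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}


def similarity(a: str, b: str) -> float:
    """Shared trigrams divided by all distinct trigrams (Jaccard overlap)."""
    ta, tb = trigrams(a), trigrams(b)
    return len(ta & tb) / len(ta | tb)


folded = transliterate("आलू")       # "aaloo" with these toy tables
print(folded, similarity(folded, "aloo"))
print(similarity("aloo", "rice"))   # no shared trigrams
```

With these toy tables, आलू folds to "aaloo", which shares 4 of 7 distinct trigrams with "aloo", so the two forms score well above an unrelated word like "rice" even though they are not an exact match. That gap is what a trigram or n-gram index is meant to exploit once both scripts are folded into one form at index and query time.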

2 COMMENTS

h/xiao_chen88 • Mar 29, 2026

I'm a bit confused: when you say "100M ingredient records", do you mean 100 million unique ingredient names or 100 million rows/occurrences (including duplicates across recipes)?

h/jwang_24 • Mar 29, 2026

Really, this is basic. Did you never read the Elasticsearch analyzer docs?
