
Trigrams are a very straightforward solution to this, but they have one major limitation: they break with misspellings and alternative spellings. For example, the words "vodka" and "votka" share no trigrams yet are obviously very similar.

I recently worked on this problem for a project and came up with a more robust solution using pre-computed Levenshtein indexes. You've probably heard of Levenshtein distance as a measure of word similarity. It counts the number of single-character edits that are required to transform one string into another. For example "vodka" and "votka" have a Levenshtein distance of 1.
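To make the metric concrete, here's a minimal sketch of the standard dynamic-programming Levenshtein distance (this is the textbook algorithm, not code from the post):

```python
def levenshtein(a: str, b: str) -> int:
    # prev[j] holds the distance between a[:i-1] and b[:j]
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # deletion
                cur[j - 1] + 1,            # insertion
                prev[j - 1] + (ca != cb),  # substitution
            ))
        prev = cur
    return prev[-1]

print(levenshtein("vodka", "votka"))  # 1
```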

Using a related approach (single character edits) it's possible to build a so-called Levenshtein index.

The major issue with a vanilla Levenshtein index is that it's impractically large. There is a space/time trade-off, however, that makes it much smaller. The basic idea is that instead of generating all Levenshtein variations for a word, you use only the set of variations created by deleting single characters from the word.

Since that doesn't match all possible variations, we have to compensate somehow. For example, with a full index, "votka" would be in the set of variations for "vodka", which means that if we had misspelled vodka when searching for it we'd still get a match. With our deletions-only index, however, we'd get "voka" as our closest match. To work around this, we generate the deletion-only variations of our search term as well. In this case our search term "vodka" would become "odka", "vdka", "voka", etc., and each of those terms would be matched against the index. That means our query is slower overall, as it contains a number of sub-queries, but in practice it's more than fast enough.
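The scheme above can be sketched in a few lines. This is my own illustrative Python (the helper names are mine, not from the post): single-deletion variants are indexed for every word, and a query expands the search term into its own variants so that one edit on either side still meets in the middle.

```python
from collections import defaultdict

def deletions(word: str) -> set[str]:
    # every variant produced by deleting exactly one character
    return {word[:i] + word[i + 1:] for i in range(len(word))}

def build_index(words):
    index = defaultdict(set)
    for w in words:
        index[w].add(w)          # the word itself
        for v in deletions(w):
            index[v].add(w)      # each single-deletion variant points back
    return index

def lookup(index, term):
    # query with the term plus its own deletion variants
    candidates = set()
    for key in {term} | deletions(term):
        candidates |= index.get(key, set())
    return candidates

idx = build_index(["vodka", "whiskey"])
print(lookup(idx, "votka"))  # {'vodka'}
```

Here "votka" matches because both it and "vodka" reduce to the shared key "voka".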

I wrote a meandering blog post about it here: http://lattejed.com/writing-an-embedded-full-text-search-eng...

There are some additional ideas in there, such as how to make a Levenshtein index handle prefix matches better (e.g., incremental search), that may be helpful if you need to implement something like this.

If you do read that I'd like to point out that I no longer recommend LevelDB. That doesn't change anything of importance in the post though.




vodka and votka do share trigrams: __v, _vo, and ka_, as the PG rules for generating them add two spaces at the beginning and one at the end.
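A quick sketch of that padding rule (my own approximation of pg_trgm's behavior for a single lower-cased word, not the actual PostgreSQL source):

```python
def trigrams(word: str) -> set[str]:
    # pg_trgm pads each word with two leading spaces and one
    # trailing space before taking every 3-character window
    padded = "  " + word.lower() + " "
    return {padded[i:i + 3] for i in range(len(padded) - 2)}

shared = trigrams("vodka") & trigrams("votka")
print(sorted(shared))  # ['  v', ' vo', 'ka ']
```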


Super interesting approach, thanks for contributing it.

From what I see in your article, you used this for searching through mail, so I guess it works on a pretty decent volume of data, but just to confirm: do you think it's viable on something like GitLab's multi-resource full text search? For reference, what is the size of this index for your mailbox? (Assuming it's a usual contractor dev mailbox, with mainly lots of notification mails and discussions with daily customers.)

PS: don't worry about other reactions; users have invaded our dev world nowadays, the sad side effect of attractive salaries


How about some in-database spelling auto-correction:

http://blog.databasepatterns.com/2014/08/postgresql-spelling...


What do you recommend instead of LevelDB?


Or, you know, use a search engine. There are countless little things like that, and out-of-the-box search in DBs will never catch up.


The resulting product is fast, is easy to tune (for my particular application), and handles every edge case and language I've thrown at it.

Do you have any specific concern you'd like to share or are you just generally dismissive of other people's hard work?


I'm dismissive of pointless hard work.



