Clever! A method that has worked well for me: divorced databases. The first data...

michaelmior · 2024-10-30T12:44:12 1730292252

It sounds like this is pretty similar to the approach that the post is advocating against although I can see your reasoning behind this.

avthar · 2024-10-30T20:59:17 1730321957

Post-co author here. This is actually something that we are considering implementing in future versions of pgai Vectorizer. You point the vectorizer at database A but tell it to create and store embeddings in database B. You can always do joins across the two databases with postgres FDWs and it would solve issues of load management if those are concerns. Neat idea and one on our radar!

therealdrag0 · 2024-11-01T01:42:24 1730425344

The limitation with that is no hybrid search, which is often needed. “Show me only results for this user or tenant or category etc.”