avatar
Tim Kellogg @timkellogg.me

you really need to capture the query and decompose it into multiple sub queries e.g. maybe get a 1B-3B LLM to rewrite the query into a DSL (e.g. a JSON breakdown of the various components and concepts in the query) and then push that logic into the database engine itself

aug 31, 2025, 11:06 am • 4 3

Replies

avatar
digiwallsstudio.bsky.social @digiwallsstudio.bsky.social

Are you talking about rewriting the query and producing something you could prefer metadata on?

aug 31, 2025, 11:25 am • 1 0 • view
avatar
Tim Kellogg @timkellogg.me

yeah, rewriting it into multiple queries with pareseable relationships between them

aug 31, 2025, 11:34 am • 1 0 • view
avatar
deen @sir-deenicus.bsky.social

With AND what we usually want is intersection. So either directly use a DSL or a small LLM parses it out. From these we can seek intersections: simple and easy is to matrix multiply on unit vectors and filter or, use SVD (more complex but much more flexible). Geometric mean of hadamard as fallback

aug 31, 2025, 2:52 pm • 1 0 • view
avatar
Tim Kellogg @timkellogg.me

alternatively, sparse approaches like SPLADE do this in latent space but use inverted indices (regular full text search, exact matches) arxiv.org/abs/2107.057...

aug 31, 2025, 11:06 am • 5 2 • view
avatar
Tim Kellogg @timkellogg.me

imo if search is done perfectly, you effectively drive your LLM context to infinity but it’s very much not a solved problem to illustrate how underdeveloped this space is — research from 5 years ago still seems like the best ideas (contrast that to LLMs)

aug 31, 2025, 11:06 am • 9 2 • view
avatar
Digital Diogenes @digitaldiogenes.bsky.social

I've wondered about this same thing - If the limitations of vector databases could be improved upon by using a very small tool-calling model for specifically handling various calls to a backend database for various identified subqueries.

aug 31, 2025, 2:02 pm • 1 0 • view