-
Notifications
You must be signed in to change notification settings - Fork 687
Description
Is there any documentation on how to set up the prerequisites sites for "nlweb_sites" for the WhoHandler?
The code in code/python/core/whoHandler.py in the function whoRetrieveInt tries to search using site='nlweb_sites'
but NLWeb returns "No answers were found for your query." when try to search over all sites
I have followed the guide to load data like this:
python -m data_loading.db_load https://feeds.libsyn.com/121695/rss Behind-the-Tech
The logs show:
=== NLWebHandler INIT === [MULTI-SITE-HANDLER] Initializing with query: what does kevin think of mcp? [MULTI-SITE-HANDLER] Params: {'score': 100, 'justification': 'Only available tool for this query type'} [Using who_endpoint: http://localhost:8000/who Calling sites_from_who_streaming with endpoint=http://localhost:8000/who, query=what does kevin think of mcp? [SSEStreamWrapper] broadcast_message called: conv_id=1763231581610, my_conv_id=1763231581610 [SSEStreamWrapper] Writing SSE data: data: {"message_type": "intermediate_message", "status": "Identifying relevant sites ...", "timestam... [SSEStreamWrapper] SSE data written successfully === NLWebHandler INIT === Query params: {'query': 'what does kevin think of mcp?', 'streaming': 'true'} [MULTI-SITE] WARNING: No sites returned from who endpoint!
Am I correct in assuming that there must be an entry in the vector db that contains a list of sites and this is called nlweb_sites? If so this isnt explained anywhere in the documentation. I am happy to contribute with some updates to clarify this once I understand whats going on.