-
-
Notifications
You must be signed in to change notification settings - Fork 196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Authority search very poor #6089
Comments
I want to stress the /select_authority search produces different results to the search box in the header and at /search the /select_authority search often produces no results when a public body name is entered into it. Currently for example on WhatDoTheyKnow.com /select_authority has no results for cabinet office whereas the main search returns many and the Cabinet Office is second. Many public body names result in no hits on the authority search eg Where there are results, putting bodies with many requests to the top (#40470 might help too. |
Email to WhatDoTheyKnow today:
They are correct, there are no results at: I'm going to reopen this, it's clearly a bug which is affecting many users who use a work-flow which takes them via the /select_authority page. |
Yeah okay something looks wrong here. Thanks for clarifying. |
Seems related notanumber/xapian-haystack#154 |
A WhatDoTheyKnow user has asked if we can make the results for searches on HS2 and High Speed 2 include the entry we have for High Speed Two (HS2) Limited, currently those terms don't result in a hit for the body. Also related - where the /select_authority search has no hits there is no message to say "no results" it just appears to do nothing. |
This isn't just an issue of case sensitivity, the search for Cabinet Office doesn't result in any hits irrespective of capitalisation. Interestingly, when typing cabinet office into the search box letter by letter the instant results shown while typing include the cabinet office while the search term is cabi, cabin, cabine, cabinet, cabinet o, and cabinet of but not when further letters are added. |
Yet
Yeah, notanumber/xapian-haystack#154 mentions issues to do with stemming which is related to this behaviour. |
Noting that I think we upgraded Xapian (or at least the underlying data format – can't remember) recently, and that I need to dig out the issue where this was discussed. (EDIT discussion: https://github.com/mysociety/sysadmin/issues/1305#issuecomment-630688979 / commit d54ba8e) |
@garethrees I don't think this is doing a Xapian search. It looks like this is all happening in SQL via the |
Actually, I'm wrong I was on the "View authorities" action at |
Yeah, |
Also seems to be affecting the Pro batch authority search at |
In production console:
Dev:
|
This fix has now been deployed |
Difficulty has been reported finding the body Transport for the North via the search at https://www.whatdotheyknow.com/select_authority When searching for Transport for the North the body with that name appears in 14th place in those authority search results, but it appears in 2nd place in the results of a research via the general site header. There is a related recent support inbox thread, subject: "Add authority - Transport for the North" |
The search box at:
https://www.whatdotheyknow.com/select_authority
is particularly poor, it produces worse results than the general site search.
A search for cabinet office on the select authority page on WhatDoTheyKnow.com currently gives no results.
I suspect a number of the cases of reported problems logged at #1179 are not actually issues with the general site search, but with the select authority search.
See also: #4426
The text was updated successfully, but these errors were encountered: