Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve search, results and filtering by adding taxon and clique identifier count information #143

Merged
merged 23 commits into from
Apr 25, 2024

Conversation

gaurav
Copy link
Contributor

@gaurav gaurav commented Feb 13, 2024

This PR combines several improvements to search, results and filtering:

  • It updates the search query to no longer duplicate the search query when doing an autocomplete query (see "Bone" works well when autocomplete=true but breaks when autocomplete=false #142).
    • This breaks hyphenated search terms in the autocomplete query, and I can't figure out why. For now, I've set it up so that we replace special characters with spaces in the autocomplete query (i.e. beta-secretase becomes (beta secretase*)) but we escape special characters in the non-autocomplete query (i.e. beta-secretase becomes (beta\-secretase*) since that still appears to work. I'll dig into this more deeply in Hyphen processing still a bit dodgy, possibly because of our choice of tokenizer #146.
  • It adds taxon and clique identifier count to values indexed during data loading.
  • It incorporates clique identifier count into both the returned results as well as the boosting and sorting of the returned results. It also tweaks the boosting values used in query fields and phrase fields.
  • It adds an only_taxa input field that allows filtering results to a list of NCBITaxon taxon identifiers (note that this will only work for terms that have taxon information, which at the moment is only cliques containing NCBIGene identifiers).

@gaurav gaurav changed the title Add support for filter by taxon and other new fields Improve search, results and filtering Apr 23, 2024
@gaurav gaurav changed the title Improve search, results and filtering Improve search, results and filtering by adding taxon and clique identifier count information Apr 23, 2024
@gaurav gaurav requested a review from cbizon April 23, 2024 16:34
@gaurav gaurav merged commit a913bdb into master Apr 25, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants