Support for suggest2 API and modernization #107

tfmorris · 2022-10-28T19:29:13Z

This is mainly an informational PR for some work in progress, to see whether there's any interest. I added support for the suggest2 API, which does keyword searching of all variants in addition to left-anchored searches, but it wasn't as big an improvement as I had hoped. The service could also be extended to support the many new information types that the LoC now includes (e.g. BIBFRAME works).
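As a rough illustration of the difference between the two search modes, here is a minimal sketch of how a suggest2 request URL might be built. The parameter names (`q`, `searchtype`, `count`) and the `authorities/names` path are assumptions based on my reading of the id.loc.gov service and should be checked against the LoC documentation; the function name `suggest2_url` is hypothetical and is not code from this PR.

```python
from urllib.parse import urlencode

# Base URL of the LoC linked data service.
BASE_URL = "https://id.loc.gov"


def suggest2_url(term, authority="authorities/names", keyword=False, limit=10):
    """Build a suggest2 query URL (hypothetical helper).

    Assumed parameters: "q" is the search term, "count" caps the number
    of hits, and "searchtype=keyword" switches from the default
    left-anchored match to a keyword match.
    """
    params = {"q": term, "count": limit}
    if keyword:
        params["searchtype"] = "keyword"
    return f"{BASE_URL}/{authority}/suggest2?{urlencode(params)}"


if __name__ == "__main__":
    print(suggest2_url("Twain, Mark", limit=3))
    print(suggest2_url("twain", keyword=True))
```

Passing the front end's limit straight through as the hit count keeps the response small, at the cost of giving the fuzzy matcher fewer candidates to rank.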

In addition to the new API, I made a number of other changes:

* Added support for the limit parameter specified by the front end, to control the number of choices returned
* Added an HTTP header identifying the service, as requested by LoC
* Added HTTP timeouts, retries, and performance metrics, and switched to HTTPS
* Switched from FuzzyWuzzy to RapidFuzz because of licensing chaos (plus it's faster)
* Switched to Conda for dependencies, updated all dependencies, and dropped unused ones
* Refactored code to reduce some of the redundancy
* Dropped Python 2 support
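The timeout, retry, and identifying-header items above could look roughly like the following. This is a standard-library sketch rather than the code in this PR: the header value, timeout, retry count, and backoff factor are all made-up values, and `fetch` and `backoff_delays` are hypothetical names.

```python
import time
import urllib.request

# Hypothetical settings, chosen only for illustration.
USER_AGENT = "lc-reconcile-example/0.1 (contact: you@example.org)"
TIMEOUT_SECONDS = 5
MAX_RETRIES = 3
BACKOFF_FACTOR = 0.5


def backoff_delays(retries=MAX_RETRIES, factor=BACKOFF_FACTOR):
    """Return the wait before each retry, doubling every time."""
    return [factor * (2 ** attempt) for attempt in range(retries)]


def fetch(url, opener=urllib.request.urlopen, sleep=time.sleep):
    """GET a URL with an identifying header, a timeout, and retries.

    The opener and sleep arguments are injectable so the retry logic
    can be exercised without touching the network.
    """
    request = urllib.request.Request(url, headers={"User-Agent": USER_AGENT})
    delays = backoff_delays()
    for attempt in range(len(delays) + 1):
        try:
            with opener(request, timeout=TIMEOUT_SECONDS) as response:
                return response.read()
        except OSError:
            # URLError, HTTPError, and socket timeouts all derive from OSError.
            if attempt == len(delays):
                raise  # out of retries, surface the last error
            sleep(delays[attempt])
```

Note that this simple version also retries on client errors such as 404; a production version would probably retry only on timeouts and 5xx responses.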

I couldn't find any documentation on the didyoumean API, and it doesn't seem to return many results, so I'm not sure how useful it is. More generally, it's unclear to me which API or APIs are best to use, and the Library of Congress doesn't really provide much guidance. It would probably take more time than I currently have available to do enough experimentation to figure it out, but I'd be happy to accept feedback from an expert.

* Add support for suggest2 API (disabled by default)
* Honor limit parameter for number of choices returned
* Add HTTP timeouts & retries with backoff
* Add HTTP metrics
* Add HTTP request header as required by LoC
* Refactor to minimize redundant code

Turn on caching by default

@ruebot

- Cherry-picked from yorkulibraries/lc-reconcile by @ruebot

tfmorris and others added 6 commits October 27, 2022 21:42

Switch to Conda and RapidFuzz. Remove unused dependencies

5bc0349

Remove Python 2 support. Organize imports

7982516

Add basic Dockerfile for container builds

876aa42

Add conda-forge channel & requests-cache

0cb884b

Turn on caching by default

Add LoC Genre/Forms reconcile functionality.

d7d8b97

- Cherry-picked from yorkulibraries/lc-reconcile by @ruebot

tfmorris mentioned this pull request Feb 10, 2023

Clarity on difference between versions reconciliation-api/specs#112

Closed
