Releases: NYPL/drb-etl-pipeline
v0.14.3
What's Changed
- NO-REF: Update logger to snake case by @kylevillegas93 in #445
- SFR-2304_Add Airtable API Authentication Flow by @mitri-slory in #446
- NO-REF: Refactoring statics by @kylevillegas93 in #448
- SFR-2344: Set New Relic Log Level to Warning by @kylevillegas93 in #452
- SFR-2316: Create a local backend for frontend CI by @kylevillegas93 in #450
- SFR-2342: Deploy to both CF and TF ECS Clusters in QA by @kylevillegas93 in #453
- Bump waitress from 2.1.2 to 3.0.1 by @dependabot in #454
- SFR-2349: Increasing CSV Field Size Limit by @kylevillegas93 in #456
- SFR-2347: Improve IA readability check exception logging by @kylevillegas93 in #455
- SFR-2346: Fix edition verion unpacking by @kylevillegas93 in #457
- SFR-2306: Integrating with Airtable to get backlist records by @kylevillegas93 in #449
- SFR-2352: Script to upload digital assets to Google Drive by @kylevillegas93 in #458
- SFR-2327: Removing S3Manager as an Ancestor of CoreProcess by @kylevillegas93 in #451
- NO-REF: Fix LOCProcess S3 Client by @kylevillegas93 in #461
- SFR-2308_PubBacklistMapping by @mitri-slory in #460
- SFR-2368_PubBacklistParityWithUofM by @mitri-slory in #463
- SFR-2288 Querying by OCLC Number uses metadata API MARCXML endpoint by @Apophenia in #464
- NO-REF: Migration script tweaks by @kylevillegas93 in #465
- NO-REF: Delete UofM & UofSC Processes by @mitri-slory in #466
- SFR-2102 Update declarative base import by @Apophenia in #467
- Add drive download utility functions and test by @Apophenia in #462
- SFR-2386 Remove empty related bib warning log by @kylevillegas93 in #468
- SFR-2381: Ready to Ingest Status for Airtable Records by @mitri-slory in #469
- SFR-2388: Adding ingest process functional tests by @kylevillegas93 in #470
- SFR-2384: Fulfill Manifest Pub Backlist S3 Bucket by @mitri-slory in #471
- SFR-2382: Incoporate Ready To Ingest Field from Airtable by @mitri-slory in #472
- SFR-2397: Deploy to the new terraform stacks in production by @kylevillegas93 in #474
- Sfr 2374 access parameter store by @Apophenia in #475
- SFR-2391: Delete Publisher Backlist Record Manifests by @mitri-slory in #473
Full Changelog: v0.14.2...v0.14.3
v0.14.2
d7b19f1 - NO-REF: Fix and simplify classify record updates (#447) (Kyle Villegas, 2024-11-14)
061d046 - NO-REF: Removing RabbitMQManager as a CoreProcess ancestor (#444) (Kyle Villegas, 2024-11-13)
a2f3991 - NO-REF: Remove RedisManager as an ancestor of CoreProcess (#443) (Kyle Villegas, 2024-11-13)
2561911 - NO-REF--Modify-fulfill-test-to-snake-case (#442) (Dmitri Slory, 2024-11-12)
2a88966 - NO-REF: Rename base mapping classes (#440) (Kyle Villegas, 2024-11-12)
e5b5214 - NO-REF: Fix local dev setup process (#439) (Kyle Villegas, 2024-11-12)
9f255f8 - NO-REF: Rename Oclc catalog manager dep (#438) (Kyle Villegas, 2024-11-12)
4088e06 - NO-REF: Removing redundant comments (#441) (Kyle Villegas, 2024-11-12)
6493b4a - NO-REF: add error handling error process init (#437) (Kyle Villegas, 2024-11-08)
af95c3d - NO-REF: Remove Elastic Search Manager Ancestor (#435) (Kyle Villegas, 2024-11-07)
c92de8f - SFR-2099: Edge Case API Tests for get/collection (#436) (Shejanul Ayan Islam, 2024-11-07)
06d63b2 - NO-REF: Remove kubernetes manifests (#429) (Kyle Villegas, 2024-11-06)
454b342 - NO-REF: Refactor NYPL Process (#433) (Kyle Villegas, 2024-11-06)
b5381ce - NO-REF: Fix CI (#434) (Kyle Villegas, 2024-11-06)
44ce645 - NO-REF: Updating release notes (#425) (Kyle Villegas, 2024-11-06)
4a5e3f3 - NO-REF: Refactoring GH Workflows (#430) (Kyle Villegas, 2024-11-01)
3dfa803 - NO-REF: Only run unit tests if requirements or source code changes (#428) (Kyle Villegas, 2024-10-31)
17ae067 - NO-REF: Pin werkzeug to 2.2.2 (#427) (Kyle Villegas, 2024-10-31)
68587d5 - NO-REF: Refactoring api integration tests (kyle, 2024-10-31)
v0.14.1
ee85835 - SFR-2280: Limit Hathi Records Ingested (#414) (Dmitri Slory, 2024-10-31)
e425444 - SFR-2294: Only cluster records that have a title (#424) (Kyle Villegas, 2024-10-30)
1a28eae - SFR-2284: Refactoring classify process (#421) (Kyle Villegas, 2024-10-30)
8f62634 - SFR-2291: Throw error if we fail to generate webpub (#422) (Kyle Villegas, 2024-10-30)
66faee2 - SFR-2292: Skip clustering records with no title (#423) (Kyle Villegas, 2024-10-30)
d1b9e2e - SFR-2265: Refactored Chicago ISAC mapping and processes (#411) (Dmitri Slory, 2024-10-29)
97a2c83 - SFR-2285: Fix has part item build (#420) (Kyle Villegas, 2024-10-28)
138411f - SFR-2283: Refactoring catalog process (#418) (Kyle Villegas, 2024-10-28)
3da2045 - SFR-2289: Reorganizing process file/folder structure (#417) (Kyle Villegas, 2024-10-28)
d2778ae - SFR-2285: Fix duplicate works (#416) (Kyle Villegas, 2024-10-25)
1e23df6 - SFR-2284: Improve Hathi Trust logging (#413) (Kyle Villegas, 2024-10-24)
3ca601b - SFR-2278: Improving cluster process logging (#412) (Kyle Villegas, 2024-10-23)
2ec6c38 - Classify logging improvements (#410) (Lyndsey M., 2024-10-23)
3d98887 - SFR-2262: Refactoring cluster process (#405) (Kyle Villegas, 2024-10-22)
5396107 - SFR-2277: Renaming process files and refiling ingest processes (#408) (Kyle Villegas, 2024-10-22)
2f72220 - SFR-2276: Deprecating ingest report (#407) (Kyle Villegas, 2024-10-22)
794ab52 - SFR-2267: Fixing duplicate work bug (#406) (Kyle Villegas, 2024-10-21)
2386334 - SFR-2256: Refactoring API Process (#401) (Kyle Villegas, 2024-10-21)
7bd2981 - SFR-2260: Refactoring DB Maintenance Process (#403) (Kyle Villegas, 2024-10-18)
e086f24 - SFR-2261: Refactoring db migration process (#404) (Kyle Villegas, 2024-10-18)
2e93c41 - NO-REF: Separating dev setup process and seed data process (#402) (Kyle Villegas, 2024-10-17)
2f35903 - SFR-2188: Removed Metrics_Type Column + Updated File Names (#400) (Fatima Rahman, 2024-10-16)
49c98e4 - SFR-2249: Refactor and clean up s3 file process (#399) (Kyle Villegas, 2024-10-16)
fc1ada9 - NO-REF: Improving report scalability (#366) (Kyle Villegas, 2024-10-15)
c081a15 - SFR-2220: Adding MET process ingestion count log (#398) (Kyle Villegas, 2024-10-15)
7dd7f35 - Changed deep copy to shallow copy of WorkIdentifiers array (#397) (Dmitri Slory, 2024-10-15)
eb3f816 - SFR-2240: Cleaning up NYPL ingestion process (#395) (Kyle Villegas, 2024-10-10)
8f54078 - SFR-2245: Refactor dev setup process and fix infinite cluster loop (#396) (Kyle Villegas, 2024-10-10)
2beac30 - SFR-2141: Delete Duplicate Work Identifiers (#356) (Dmitri Slory, 2024-10-09)
58dfe75 - SFR-2234 SFR-2235: Fix bugs in DOAB ingestion process (#394) (Kyle Villegas, 2024-10-07)
daf7bba - SFR-2216: Fixing NYPL ingest process locally (#393) (Kyle Villegas, 2024-10-07)
4d7a0aa - SFR-2216: Fixing LOC process ingestion (#392) (Kyle Villegas, 2024-10-07)
596efd4 - SFR-2216: Adding Gutenberg logging (#391) (Kyle Villegas, 2024-10-07)
a74a948 - SFR-2216: Improving DOAB logging and error handling (#390) (Kyle Villegas, 2024-10-07)
9d6508f - SFR-2216: Adding ingest limit and logging to HathiTrust ingest (#389) (Kyle Villegas, 2024-10-07)
89eef18 - SFR-2192: Adding ingest limit for MUSE process (#388) (Kyle Villegas, 2024-10-07)
7e8f279 - SFR-2214: get work api test (#387) (Shejanul Ayan Islam, 2024-10-03)
ae9a953 - SFR-2217 Fixing MUSE Mapping (#385) (Kyle Villegas, 2024-10-03)
edfb351 - SFR-2219: Improving cluster error handling and logging (#386) (Kyle Villegas, 2024-10-03)
52b34ba - NO-REF: Using exception for logging API errors (#381) (Kyle Villegas, 2024-10-02)
80b8b29 - SFR-2180: Get link for a single id (#383) (Shejanul Ayan Islam, 2024-10-01)
3466a2c - SFR-2181: get edition id (#384) (Shejanul Ayan Islam, 2024-10-01)
1192f48 - SFR-2052: Replace OCLC Worldcat API v1 calls with v2 calls (#382) (Lyndsey M., 2024-10-01)
b843107 - NO-REF: Ugrading Docker ElasticSearch Container (#380) (Kyle Villegas, 2024-10-01)
3de28bf - NO-REF: Changing print statements to New Relic logging (#376) (Kyle Villegas, 2024-09-27)
9014cab - NO-REF: Remove changelog (#378) (Kyle Villegas, 2024-09-27)
c94b083 - SFR-2187: Naming convention and column ordering updates to Counter 5 reporting (#377) (Fatima Rahman, 2024-09-27)
67b7924 - SFR-2105: Deprecate OCLC Classify Manager (#375) (Lyndsey M., 2024-09-27)
v0.14.0
Added
- Added auxiliary functions to build queries for OCLC search endpoints
- Removed aggregation result print statement
- Created local.yaml file to setup environment variables when running processes locally
- Implemented OCLC other editions call
- Added functionality for locally generating Counter 5 downloads reports to analytics folder
- Readded enter and exit functions to API DB client
- Updated DevelopmentSetUpProcess with database migration method
- Refactored info API
- Refactored links API and added error handling
- Refactored works and editions APIs and added error handling
- Generalized data aggregation within analytics folder
- Updated README release steps
- Added local S3 docker container via localstack
- Added error handling to citation API
- Implemented Counter 5 reporting for view counts
- Added error handling to GET collection endpoints
- Added error handling for utils API
- Added error handling to search API
- Refactored analytics report code
- Implemented country-level analytics report
- Implemented total-usage analytics report
- Added error handling to fulfill API
- Added uuid API validation
- Implemented OCLC Classify Process v2
- Added integration tests for UofM ingestion process
- Moving up db migration in dev setup
- Address Counter 5 report feedback from business analysts
- Updated fulfill process to check rights status before updating manifest
- Deleted fulfill script due to no longer being necessary
- Updating report book ID
- Adding OCLC query bibs call
- Finalizing OCLC implementation
- Implemented script to aggregate access logs
- Switching over to classify record by metadata v2
- Adding more specific logging and exception handling around OCLC manager errors
- Upgrading flask-cors
- implement restApi testing using pytest
- add search-a-collection test to rest_api tests
- Improving error handling and logging to OCLC classify process
Fixed
- Changed HATHI_DATAFILES outdated link in development, example, and local yaml files
- Resolved the errors when running the FulfillProcess on a daily and complete ingest time
- Changed HATHI_DATAFILES outdated link in development, example, and local yaml files
- Fixed edition API ID param
- Fixed usage type bug
- Fixed OCLC bib author mapping
- Fixed OCLC catalog query attempts bug
v0.13.1
2024-08-06 -- v0.13.1
Added
- New analytics folder for University Press project code. Contains methodology for generating Counter 5 reports
- New script to update current UofM manifests with fulfill endpoints to replace pdf/epub urls
- Updated README with appendix and additions to avaliable processes
- New process to add fulfill urls to Limited Access manifests and update fulfill_limited_access flags to True
- Updated README and added more information to installation steps
- Added Rights status to UofM mapping and Rights conditionals to UofM process
- Deprecated datetime.utcnow() method
- Added new field (publisher_project_source) to the records and items data models
- Ran database migration to add publisher_project_source field to records and items tables
- Filled out publisher_project_source field for UofM books
- Added editionID validation to editions API
- Updated README with steps on retrieving local-compose.yaml file and credentials
- Added more logging to proxy API
- Implement call to Worldcat to retrieve OCLC number
- Refactor OCLC Catalog Manager
- Added make integration command
- Upgrade RabbitMQ Docker image to 3.13
- Updated README with steps on running the processes locally
Fixed
- Resolved the format of fulfill endpoints in UofM manifests
- Added additional logging to the editions endpoint to debug
- Renamed Docker API container name to drb_local_api
- Renamed hosts for services in sample-compose file from docker bucket names to localhost