v0.6.0 Release #92

donaldcampbelljr · 2023-10-12T19:17:51Z

[0.6.0] - 2023-10-XX

Added

Add select_records, which allows for a single API for selecting attributes (result_identifiers) given filter_conditions and/or columns
Add retrieve_one, and retrieve_many which allows for selecting one or multiple records given record_identifier
Add pipestat reader submodule to read DB results via FastAPI endpoints: pipestat serve --config "config.yaml"
Add ability to create SamplePipestatManager and ProjectSamplePipestatManager which are sample/project specific PipestatManager objects.
Add PipestatBoss wrapper class which holds SamplePipestatManager and ProjectSamplePipestatManager classes.
Add to_dict methods to parsed schema object.
Add select_distinct function which retrieves unique results for a list of attributes.
Add pipestat link which creates a directory of symlinks for reported results
Add list_recent_results which allows for retrieving records filtered via a start and end time
Add reporting and retrieving results via item access,e.g. psm["sample1", "name_of_something"] = "name_of_something_string" or result = psm["sample1"]

Fixed

Added path expansion when creating database url.
added jinja2 requirement
pipeline_name column not populating in postgres db backend.

Changed

Removed retrieve, get_one_record, get_records function
Removed get_orm and replace with get_model
Removed get_table_name function
Refactor:
- sample_name -> record_identifier
- pipeline_type has been removed from most functions

Docs must be updated.
release Looper 1.6.0 at the same time.
Make sure pipestat 0.6.0 is working with Refgenie #124

Dev get records

* update version in prep for pipestat table * add table function * add initial stats function from looper, remove counter * fixing _create_stats_summary and get_file_for_project * fix sample_level stats reporting * add object reporting * remove redundancies * lint * add assertion for table generation to pre-existing test * use pipeline_type when retrieving samples * Fix stats table generation output to look better * fix return types * fix list vs List * fix typo for stats table * fix doc strings * update func names for disambiguation * fix key error issue in html_reports * clean up * adjust docstrings * update docstrings Co-authored-by: Nathan Sheffield <[email protected]> * adjust docstrings, rename html_reports_pipestat.py to reports.py * adjust LOGGER.info string * simplify objs and stats generation * remove old todos * remove sample_name, add pipeline_name to project field definitions * work towards using project_name instead of sample_name from project-level pipelines * fix check record bug, add better error message for pipeline type. * add checks for pipeline types, lint * fix get_samples with pipeline type * fix fetch_pipeline_results * add basic tests for project_level, fix associated bugs * allow table creation to use project_name if applicable * add passing project name to file_backend * refactor to use r_id as input * update doc strings * more refactoring of records vs samples to be more general * move table generation to reports.py * allow status file dir path to be associated with config path OR the file path to ensure looper compatibility. * add output_dir as a parameter for placing reports and stat files. Otherwise defaults to results_file dir or config_file dir * add obtaining schema_path from pipestat config for Looper compatibility * added select function to main pipestat class * added select_txt and select_distinct function to main pipestat class * added get_one_record function to main pipestat class #70 * Removing CLI req that one must supply config and schema because the user can supply schema within the config. For Looper compatibility. * Revert "added get_one_record function to main pipestat class #70" This reverts commit b477060. * Revert "added select_txt and select_distinct function to main pipestat class" This reverts commit 4588f4f. * Revert "added select function to main pipestat class" This reverts commit ee98145. * Add basic glossary page in the form of a table. Addresses: pepkit/looper#290 * remove redundant tag * add RecordNotFoundError across both backends #74 * Add PipelineTypeNotSuppliedError to hide dbbackend information #74 * Initial work towards a fastapi implementation of pipestat reader #22 * Initial work to separate classes #78 * working implementation of child classes for reporting using DBBACKEND. Manual testing confirmed. pytest Tests broken. #78 * implement report for filebackend * fix inheritance issue with pipeline_type * typo * major refactoring, many tests still broken #78 * Fix signatures for report and explicitly state input arguments. * remove pipeline type check * add projectpipestamanager and refactor project_name to record_identifier * fix get_status * fix tests * fix all tests #78 * add todo * code cleanup and remove getting_table_name * clean up * partial removal of pipeline_type logic from DB backend * remove pipeline_type input arguments from File Backend * clean up * add simple class to wrap SamplePipestatManager and ProjectPipestatManager * change pipestatmanager to mutable mapping #78 * lowercase attributes #78 * polish PipestatBoss #78 * env_vars for pipeline_type * update docs * working proof of concept to retrieve results #22 * update readme and add getting output schema * add else catch for outptu schema and fix typo * add ability to run with uvicorn if calling reader.py directly. * add __init__.py and main func * add more endpoints #22 * update endpoints, add Query * allow for more complex filtering using post and pydantic models #22 * Add image and file endpoints #22 * lint * Add basic arg parser to pass absolute path to config file * Add endpoint for get_records() * clean up * modify get_records to return new structure #75 * fix html report generation using get_records * attempt global creation of psm, does not work * fix creation of global psm * change fetching by filetype * add cli option for pipestat serve * simplify and fix pipestat serve #22 * add host and port arguments #22 * add config error * update readme --------- Co-authored-by: nsheff <[email protected]>

* first pass to allow for output_schema as a true JSON schema #85 * remove unused import * simplify logic, extend _safe_pop_one_mapping to handles multiple keys * disambiguation key vs keys in parsed schema * make samples and project objects instead of arrays, add more test assertions, clean up docstrings. * add status data to string representation of ParsedSchema

* begin work on file inking via pipestat * continue implementing link for an output_dir OR the results_file.yaml #89 * implemented symlinks for filebackend given output_dir #89 * implemented symlinks for filebackend using results.yaml, polish tests #89 * fix typo, clean up * more lcean up * begin rework to be backend agnostic * refactor to place in folders specific to a result_identifier * add more complex types, add temp directory for better testing * fix recursive finding of paths * clean up, move to abstract class, confirm works for both backends * remove unused test files * add jinja2 to requirements-all.txt #91 * complex example with collision * add warning * add cli option * remove unused functions

codecov · 2023-10-12T19:25:58Z

Codecov Report

Attention: 1458 lines in your changes are missing coverage. Please review.

Comparison is base (705c881) 88.32% compared to head (b53976f) 49.64%.

Files	Patch %	Lines
tests/test_pipestat.py	22.61%	503 Missing ⚠️
pipestat/backends/db_backend/dbbackend.py	13.18%	191 Missing ⚠️
pipestat/backends/file_backend/filebackend.py	52.17%	154 Missing ⚠️
pipestat/reports.py	76.36%	134 Missing ⚠️
pipestat/pipestat.py	53.33%	126 Missing ⚠️
pipestat/backends/db_backend/db_parsed_schema.py	38.33%	111 Missing ⚠️
pipestat/backends/db_backend/db_helpers.py	14.28%	54 Missing ⚠️
pipestat/pipestatreader/reader.py	43.90%	46 Missing ⚠️
pipestat/backends/abstract.py	27.08%	35 Missing ⚠️
tests/test_status.py	38.88%	33 Missing ⚠️
... and 6 more

Additional details and impacted files

@@             Coverage Diff             @@
##           master      #92       +/-   ##
===========================================
- Coverage   88.32%   49.64%   -38.68%     
===========================================
  Files          17       21        +4     
  Lines        2252     3501     +1249     
===========================================
- Hits         1989     1738      -251     
- Misses        263     1763     +1500

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…are not parsing correctly, tests broken #93

Fix reporting in pipestat summarize

…e", add try except

donaldcampbelljr · 2023-12-22T16:10:14Z

Closes milestone: https://github.com/pepkit/pipestat/milestone/3

donaldcampbelljr and others added 23 commits September 20, 2023 11:11

Add get_records to pipestat_manager and add related test #75

5b6f59c

clean up signatures #72

1544fda

clean up docstrings

4523ddc

remove superfluous get_table_name function

8d838cc

Reduce get_orm and get_model into one function. #71

175dc57

Fix CLI to use record-identifier instead of sample-name

5220129

Fix CLI to use pipeline_type #37

9b6aae1

modify get_records to return new structure #75

1722b0f

fix html report generation using get_records

f02678f

Merge pull request #83 from pepkit/dev_get_records

7de7b5e

Dev get records

add simple CLI test for reporting and retrieving #81

0f90ce7

Merge remote-tracking branch 'origin/dev' into dev

eb64b59

list to List

fd96c79

fixed typing in python3.8

99b1380

update changelog

c581afd

fix table_name_bug

b4a316c

"0.5.4a1" for alpha release

7e63d90

"0.6.0a1" for alpha release

239d110

add retrieve_distinct function for #70 and databio/bedhost#75

89119fb

update changelog

2d89b32

donaldcampbelljr added 6 commits October 12, 2023 16:03

clean up docstrings, remove unused variables, refactor link_dir

2040c72

clean up more docstrings

9e43274

remove unnecessary print statement to reduce verbosity

a2af669

Begin work on adding created and modified datetime, DateTime objects …

56941ef

…are not parsing correctly, tests broken #93

fix datetime objects #93

040dccd

fix pipeline_name field beginning with underscore #94

72523d8

donaldcampbelljr added 17 commits December 11, 2023 10:30

fix stats.tsv and objects.yaml generation to incorporate pipeline types

3daba6f

add project_objects page #130

f015074

minor polishing

9e57162

remove temp test

43ef21d

lint

a31f7f1

add passing looper's samples to pipestat summarize to populate table

ffc9bf5

remove unused code

3db1040

Merge pull request #131 from pepkit/dev_fix_reporting

7a37128

Fix reporting in pipestat summarize

version bump for pre-release v0.6.0a9

53a1c37

use "object_type" for image and file types or fall back on using "typ…

c784a95

…e", add try except

ensure project level objects are accurate on summary/index page

2bbe665

remove project name from stats and object file names

9f3ee8e

update changelog

6cf6f58

0.6.0a10 bump for pre-release

23808cd

allow project-level figures to display as columns instead of rows

8241748

0.6.0a11 pre-release

7dc0711

v0.6.0 release prep

ede50e3

donaldcampbelljr added 2 commits December 22, 2023 11:12

Merge branch 'master' into dev

cd2a5ee

lint

b53976f

donaldcampbelljr merged commit f5074ae into master Dec 22, 2023
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.6.0 Release #92

v0.6.0 Release #92

donaldcampbelljr commented Oct 12, 2023 •

edited

Loading

codecov bot commented Oct 12, 2023 •

edited

Loading

donaldcampbelljr commented Dec 22, 2023

v0.6.0 Release #92

v0.6.0 Release #92

Conversation

donaldcampbelljr commented Oct 12, 2023 • edited Loading

[0.6.0] - 2023-10-XX

Added

Fixed

Changed

codecov bot commented Oct 12, 2023 • edited Loading

Codecov Report

donaldcampbelljr commented Dec 22, 2023

donaldcampbelljr commented Oct 12, 2023 •

edited

Loading

codecov bot commented Oct 12, 2023 •

edited

Loading