Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merging Development into Main (v0.11.0 Release) #221

Merged
merged 31 commits into from
Dec 16, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
d8370b7
Bump version
apetkau Aug 24, 2023
329f68a
Coerce PMID column into string not integer
sgsutcliffe Jul 8, 2024
6af296a
Deprecation fixes.
emarinier Nov 26, 2024
480eab3
Updating resfinder DB.
emarinier Nov 27, 2024
5f36252
Updating resfinder db commits.
emarinier Nov 27, 2024
cd6f2fe
Pointfinder first pass db-update.
emarinier Nov 28, 2024
931bad0
Whitespaces, gene names.
emarinier Dec 4, 2024
06b09b8
Promoter names have changed from '_' to '-'.
emarinier Dec 4, 2024
e62c8fd
Promoter fixes.
emarinier Dec 5, 2024
e9badeb
Updating promoter-related tests.
emarinier Dec 5, 2024
74f9c6b
Fixing tests
emarinier Dec 6, 2024
44571ad
Updating test CI
emarinier Dec 9, 2024
9d3874a
Updating test CI
emarinier Dec 9, 2024
b583fb4
Updating drug keys.
emarinier Dec 9, 2024
5485a96
Reverting changes to ampC_promoter.
emarinier Dec 9, 2024
3507992
Merge pull request #217 from phac-nml/update_dbs
emarinier Dec 9, 2024
fc0520e
e.coli -> escherichia_coli
emarinier Dec 9, 2024
ae7e7d8
Removing Nones/blanks.
emarinier Dec 9, 2024
9f82e31
Updating changelog.
emarinier Dec 9, 2024
bb930f5
Updating ARG database versions.
emarinier Dec 9, 2024
22197c0
Adding Jupyter notebook changes.
emarinier Dec 9, 2024
1e15db7
Merge branch 'development' into fix-pointfinderdatabase-PMID-column
emarinier Dec 10, 2024
26770da
Merge pull request #212 from phac-nml/fix-pointfinderdatabase-PMID-co…
apetkau Dec 10, 2024
b6bad2f
Merge pull request #218 from phac-nml/update_drug_key
emarinier Dec 10, 2024
439a66e
Updating README
emarinier Dec 10, 2024
d53dcf8
Updating changelog.
emarinier Dec 10, 2024
d5c2d83
Update README.md
emarinier Dec 10, 2024
81b983e
Merge pull request #219 from phac-nml/update_docs
emarinier Dec 13, 2024
de6c1b9
Updating versions.
emarinier Dec 16, 2024
2dd0085
acrB 171 -> acrB 717
emarinier Dec 16, 2024
8fe219a
Merge pull request #220 from phac-nml/release-prep
emarinier Dec 16, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 4 additions & 2 deletions .github/workflows/ci-test.yml
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ jobs:
runs-on: ubuntu-latest

env:
DATABASE_COMMITS: '--resfinder-commit fa32d9a3cf0c12ec70ca4e90c45c0d590ee810bd --pointfinder-commit 8c694b9f336153e6d618b897b3b4930961521eb8 --plasmidfinder-commit c18e08c17a5988d4f075fc1171636e47546a323d'
DATABASE_COMMITS: '--resfinder-commit d1e607b8989260c7b6a3fbce8fa3204ecfc09022 --pointfinder-commit 694919f59a38980204009e7ade76bf319cb7df0b --plasmidfinder-commit c18e08c17a5988d4f075fc1171636e47546a323d'

strategy:
fail-fast: False
Expand Down Expand Up @@ -61,4 +61,6 @@ jobs:

- name: Run Tests
shell: bash -l {0}
run: python setup.py test
run: |
pip install pytest
pytest
8 changes: 8 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,11 @@
# Version 0.11.0

* Updated the Resfinder database to use the 2024-08-06 release.
* Updated the Pointfinder database to use the 2024-08-08 release.
* The Resfinder and Pointfinder databases now use FASTA record IDs with accession numbers (ex: `pmrA` -> `pmrA_1_CP055130.1`). StarAMR has been updated to support this, but database backwards compatibility is unlikely.
* The `genes_to_exclude` is affected by the above change and must now use gene IDs that exactly match the new FASTA record IDs with accession IDs.
* Removed ARG drug key entries with "None" or missing resistance.

# Version 0.10.0

* Updated the Plasmidfinder database to use the January 18th 2023 release.
Expand Down
10 changes: 5 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -145,7 +145,7 @@ If you wish to update to the latest ResFinder, PointFinder, and PlasmidFinder da
staramr db update --update-default
```

If you wish to switch to specific git commits of either ResFinder, PointFinder, or PlasmidFinder databases you may also pass `--resfinder-commit [COMMIT]`, `--pointfinder-commit [COMMIT]`, and `--plasmidfinder-commit [COMMIT]`.
If you wish to switch to specific git commits of either ResFinder, PointFinder, or PlasmidFinder databases you may also pass `--resfinder-commit [COMMIT]`, `--pointfinder-commit [COMMIT]`, and `--plasmidfinder-commit [COMMIT]`. However, please note that because of compatibility issues arising from changes in the source databases, this functionality is largely unsupported and is unlikely to work for versions of the databases that StarAMR wasn't released with.

## Restore Database

Expand Down Expand Up @@ -245,11 +245,11 @@ By default, the ResFinder/PointFinder/PlasmidFinder genes listed in [genes_to_ex

```
gene_id
aac(6')-Iaa_1_NC_003197
ColpVC_1__JX133088
gyrA_1_CP073768.1
pmrB_1_CP051284.1
```

Please make sure to include `gene_id` in the first line. The default exclusion list can also be disabled with `--no-exclude-genes`.
Please make sure to include `gene_id` in the first line. The default exclusion list can also be disabled with `--no-exclude-genes`. Gene IDs must exactly match the FASTA record IDs provided in the source databases.

## Complex Mutations

Expand Down Expand Up @@ -504,7 +504,7 @@ Restores the default database for `staramr`.

# Caveats

This software is still a work-in-progress. In particular, not all organisms stored in the PointFinder database are supported (only *salmonella*, *campylobacter* are currently supported). Additionally, the predicted phenotypes are for microbiological resistance and *not* clinical resistance. Phenotype/drug resistance predictions are an experimental feature which is continually being improved.
This software is still a work-in-progress. In particular, not all organisms stored in the PointFinder database are supported (only *enterococcus_faecalis*, *helicobacter_pylori*, *enterococcus_faecium*, *campylobacter*, *escherichia_coli*, *salmonella* are currently supported). Additionally, the predicted phenotypes are for microbiological resistance and *not* clinical resistance. Phenotype/drug resistance predictions are an experimental feature which is continually being improved.

`staramr` only works on assembled genomes and not directly on reads. A quick genome assembler you could use is [Shovill][shovill]. Or, you may also wish to try out the [ResFinder webservice][resfinder-web], or the command-line tools [rgi][] or [ariba][] which will work on sequence reads as well as genome assemblies. You may also wish to check out the [CARD webservice][card-web].

Expand Down
14 changes: 11 additions & 3 deletions scripts/data-conversion/pointfinder-drug-resistance.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,7 @@
{
"data": {
"text/plain": [
"dict_keys(['Salmonella', 'Shigella E. coli', 'Campylobacter'])"
"dict_keys(['Salmonella', 'Shigella E. coli', 'Campylobacter', 'NCBI AMRfinder'])"
]
},
"execution_count": 1,
Expand All @@ -28,7 +28,7 @@
"source": [
"import pandas as pd\n",
"\n",
"pointfinder_file = '../../drug-key-update/pointfinder 072621.xlsx'\n",
"pointfinder_file = '../../pointfinder.xlsx'\n",
"\n",
"pointfinder_excel = pd.ExcelFile(pointfinder_file)\n",
"sheets_df_map_orig = {n: pd.read_excel(pointfinder_excel, sheet_name=n, header=None) for n in pointfinder_excel.sheet_names}\n",
Expand Down Expand Up @@ -407,6 +407,14 @@
"source": [
"pointfinder_df_reduced.to_csv('../../staramr/databases/resistance/data/ARG_drug_key_pointfinder.tsv', sep='\\t', index=False)"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "938df9d4-d16a-4855-aee0-5208fadfad68",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
Expand All @@ -425,7 +433,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.4"
"version": "3.12.3"
}
},
"nbformat": 4,
Expand Down
Loading
Loading