Major changes:
- Bumping version due to downstream changes in Acid Genomics dependencies.
- Now enforcing strict camel case for all function names.
- Renamed
bcbioBaseTestsURL
tobcbioBaseTestsUrl
. - Renamed
getGTFFileFromYAML
togetGtfFileFromYaml
. - Renamed
getMetricsFromYAML
togetMetricsFromYaml
. - Renamed
getSampleDataFromYAML
togetSampleDataFromYaml
. - Renamed
getUMITypeFromCommands
togetUmiTypeFromCommands
. - Removed stringi dependency in favor of AcidBase variants.
Minor changes:
- Removed utils dependency, due to change in test URL location.
Minor changes:
- Now requiring R 4.3.
- Updated lintr checks and enabled testthat to run in parallel.
- Removed
.
from global variables.
Minor changes:
- Updated lintr checks and testthat unit tests.
Major changes:
- New release series that requires R 4.2 / Bioconductor 3.15.
- Relaxed version pinning of Acid Genomics dependencies.
Minor changes:
- Bumped the minimum R dependency version to 4.2.
Major changes:
- Removed
copyToDropbox
function, which is not commonly used and is difficult to properly unit test.
Minor changes:
- Updated
importDataVersions
andimportProgramVersions
to now always use base R engine, as this avoids some parsing issues that can pop up when using readr engine instead. Primarily this applies to inconsistent date formatting in data versions return, which can cause a POSIX date error to return when using readr engine. sampleDirs
function now excludes nested pipeline directories such asbcbioRNASeq
, which was added in 2021.- Now reexporting
import
(from pipette), which is used in working examples.
Minor changes:
- Updated NAMESPACE to import some functions from methods rather than using
basejump reexports:
as
,is
,new
.
Minor changes:
- Now importing
str_match
from basejump rather than stringr, reducing the number of required packages in imports.
Minor changes:
- Renamed "blacklist" to "denylist" internally.
- Removed reexported functions that are no longer used in other packages.
Minor changes:
- Updated internal YAML parser to use improved
rbindToDataFrame
function (see AcidPlyr and/or basejump for detail) instead of the now deprecatedunlistToDataFrame
approach. The newrbindToDataFrame
function always returns 1:1 from nested list elements to rows, which is ideal for sample metadata and quality control metrics. Alternatively,rbindlist
from data.table (withfill = TRUE
) is also work a look, but it can introduce unwanted row expansion and there is no way currently to enforce 1:1 mapping. So we wrote our own function inside of basejump.
Minor changes:
getSampleDataFromYAML
: Fix for call toAcidPlyr::unlistToDataFrame
, which now returns "name" column for nested YAML processing. See internal.sampleYAML
generator for details.
Minor changes:
- NAMESPACE updates, following basejump v0.14 release.
- Hardened the YAML parser against nested metadata, and updated to use
unlistToDataFrame
function defined in AcidPlyr, rather than deprecated plyrldply
approach.
Minor changes:
projectDir
: Improved error message on match failure.
Minor changes:
- Updated package dependency version requirements.
Minor changes:
- Maintenance release, increasing minimum R dependency to 4.0.
Minor changes:
- Changed license from MIT to GPL-3.
Minor changes:
- Now using cli package to improve console messages.
Minor changes:
- Updated basejump dependencies.
- Rebuilt documentation and checked for Bioconductor 3.10 support.
Minor changes:
- Removed deprecated and defunct functions declared prior to v0.6.
- Deprecated
read*
functions in favor ofimport*
. This naming format is more consistent with other functions used in basejump.
Major changes:
- Moved
readSampleData
andreadTxGene
functions to basejump. These are shared methods applicable outside of bcbio. They are still reexported here inside the package, maintaining full backward compatibility.
Minor changes:
- Reduced the number of imports and dependencies required.
- Improved handling of characters and numerics in YAML parsing.
- Lightened the package by removing dplyr and magrittr dependencies.
copyToDropbox
now looks for the optional rdrop2 package, rather than importing as a dependency. This keeps the package lighter.
Minor changes:
- Miscellaneous documentation improvements. Now using AcidRoxygen as the shared parameter source, following the new conventions used in basejump.
- Updated dependency versions.
Minor changes:
- Updated basejump dependency versions.
Minor changes:
- Documentation improvements to pass BiocCheck. Need to have working examples in 80% of documentation files.
Minor changes:
- Updated basejump dependency.
- Improved Travis CI configuration.
Major changes:
- Merging this release from Acid Genomics fork back to canonical HBC repo. The v0.6 release series will be used as the base for new updates to bcbioRNASeq and bcbioSingleCell packages.
Minor changes:
- Hardened user metadata input checks, particularly in
readSampleData
. Updated internal.isSampleData
assert check function to return more informative error messages to the user on failure.
Major changes:
- Now pinned to R >= 3.5.
Minor changes:
- Improved Travis and AppVeyor CI configuration.
Minor changes:
- Improved testing to ensure compatibility with R 3.4.
Minor changes:
- Switched Travis CI configuration to use
rnaseq
Docker image. - Updated basejump dependencies.
- Updated testthat cache to use
tests.acidgenomics.com
URL.
Updated to work with basejump v0.10 release series.
Deprecations:
- Made the deprecated bcbio ggplot2 geom functions defunct. Applies to:
bcbio_geom_abline
,bcbio_geom_label
,bcbio_geom_label_average
,bcbio_geom_label_repel
. - Additional deprecations made defunct:
readLog
,readTx2gene
,readYAMLSampleData
,readYAMLSampleMetrics
.
Minor changes:
- Consolidated unit test data into Acid Genomics S3 bucket (tests.acidgenomics.com).
- Example data used for unit testing, which gets cached into
tests/testthat/
are now consistently formatted using kebab case instead of snake case.
Minor changes:
- Updated GitHub remotes to use acidgenomics instead of steinbaugh.
Minor changes:
- Migrated
metadataBlacklist
to basejump package, for use inmakeSummarizedExperiment
andmakeSingleCellExperiment
functions. Global variable is still reexported here. - Reduced the number of reexported functions, including pipe (
%>%
) andimport
.
Minor changes:
runDate()
: Updated assert check to be compatible with R 3.4 / BioC 3.6.
Minor changes:
- Documentation improvements.
Minor changes:
sampleDirs
: Added an informative message for user regarding sample name sanitization of cellular barcodes.
Minor changes:
- Added return value for
runDate
to documentation. - Updated Travis CI and AppVeyor CI configuration.
- Added comment regarding top-level sample metadata parsing in internal YAML code.
Minor changes:
- Miscellaneous improvements to documentation and goalie assert checks.
Minor changes:
- Updated documentation to reflect new conventions used in basejump. Switch
to using
logical(1)
instead ofstring
, for example. This approach was inspired by the conventions used in the checkmate package. - Switched documentation titles to use sentence case instead of title case.
Major changes:
- First release that has switched to using goalie internally for assert checks, in place of assertive package.
- Reworked internal YAML parsing functions and improved the assert checks.
Minor changes:
- Reorganized and removed some deprecated functions.
- Updated unit tests to reflect switch to goalie.
Minor changes:
- Miscellaneous documentation improvements.
Minor changes:
- Improve GTF file assert checks and unit testing.
Minor changes:
getBarcodeCutoffFromCommands
: Simplified flow incutoff
internal variable assignment.- Reorganized deprecated functions. See
deprecated.R
file. - Split out
projectDir
andrunDate
functions into separate R files, fromdetect.R
. - Split out YAML parsing functions into separate files from
yaml.R
. - Tweaked
metadataBlacklist
global and added improved comments. - Reworked
.assertIsSampleData
and.makeSampleData
internal code.
Minor changes:
Minor changes:
- Switched imports to simply basejump, instead of attempting to reference any basejump subpackages.
- Migrating to goalie package for internal assert checks.
Working towards a release candidate for Bioconductor submission.
New functions:
projectDir
: Returns the latest dated directory path inside a bcbio run. If bcbio has been run multiple times to the same upload directory, the function will return the latest project directory and warn the user.
Major changes:
readTx2gene
now returns atx2gene
class object.- Pinning to basejump Bioconductor release candidate.
- Migrated ggplot2 geoms to basejump package.
- YAML functions now return
DataFrame
instead ofdata.frame
. sampleDirs
is now stricter about loading sample directories with names that are not valid in [R][]. This now checks for non-alphanumeric characters, including spaces, dashes, and samples that begin with a number. The bcbio pipeline will be updated to enforce these rules, to avoid unexpected downstream behavior in [R][] due to invalid names.- Improved camel case sanitization of metrics columns in the YAML parser. Now
readYAMLSampleMetrics
should returnpercentGC
instead ofxGC
. 5'->3' bias should return asx5x3Bias
. Theplyr::ldply
call used to coerce from alist
todata.frame
will sanitize names, so we need to apply our rules before this step.
Minor changes:
- Simplified NAMESPACE imports.
- Updated deprecations.
- Linking out to recommended guidelines for development.
- Moved global
lanePattern
variable to basejump package.
Minor changes:
prepareTemplate
: Migrated function to basejump package, and simplifed to copy all files fromrmarkdown/shared
directory inside a package. Currently in use by bcbioRNASeq and bcbioSingleCell.
This is a maintenance release designed to simplify the package for long-term
stability. Here we are moving all of the current S4 method support to
basejump for consistency, since the methods apply to SummarizedExperiment
and are not bcbio-workflow specific. We are keeping the package simple by
exporting the read*
family of functions here, which are designed to integrate
with the bcbio output directories.
Major changes:
- Moved S4 method support to basejump:
flatFiles
,metrics
,plotCorrelationHeatmap
,plotDot
,plotGene
,plotHeatmap
,plotQuantileHeatmap
,plotViolin
. prepareSummarizedExperiment
has been moved to basejump and renamed tomakeSummarizedExperiment
, for consistency. The underlying code remains the same.- Moved the
SummarizedExperiment
tolist
coercion method to basejump. - Reduced the number of re-exported ggplot2 and viridis functions, for simplicity.
Minor changes:
- Offloaded
separatorBar
andupdateMsg
to basejump. - Now using roxygen2 v6.1 for documentation.
- Miscellaneous documentation improvements.
- Simplified installation instructions, referring the user to bcbioRNASeq and bcbioSingleCell packages instead.
- lintr check fixes.
Minor changes:
- Added hexadecimal color support to heatmap functions.
- Improved documentation, specifying the supported types for each argument more clearly.
- Improved messages in
readYAML
family of functions.
Major changes:
- Moved some generics that provide
SummarizedExperiment
method suport to basejump package:gene2symbol
,interestingGroups
,sampleData
,sampleNames
,sanitizeSampleData
,selectSamples
,uniteInterestingGroups
. - Moved method support for
SummarizedExperiment
to basejump:convertGenesToSymbols
,counts
,gene2symbol
,interestingGroups
,sampleData
, etc. - Heatmap functions now only provide
SummarizedExperiment
method support. The documentation for these heatmap functions has been improved (and simplified), showing the supported arguments more clearly.
Minor changes:
- Now using
transcriptID
instead oftxID
in tx2gene and GRanges metadata defined inrowRanges
slot of object. - Now using
aes
instead ofaes_string
for all internal ggplot2 code, which uses tidyeval (v3.0 update). - Moved
rse_bcb
andrse_bcb
example data to basejump, for unit testing. These datasets are still available in bcbioBase. - Made
assertFormalAnnotationCol
function defunct. No longer using to checkannotationCol
argument integration in heatmap functions, since we're now providing onlySummarizedExperiment
method support, instead of matrix method support (which is too complicated and error prone for users). - Moved
assertFormalInterestingGroups
function to basejump, where the other assert check functions are defined. This helps improve package consistency. - Updated unit tests to reflect changes in the number of exported generics.
Minor changes:
- Improved aggregate column handling in
bcbio_geom_abline
. - Improved code coverage back to 100%.
- Reorganized R methods files to use
-methods.R
as a suffix, as recommended by Bioconductor, in preparation for package submission.
New functions:
minimalSampleData
enables easy creation of a sample metadatadata.frame
by simplify specifying the sample names (e.g. "description" in bcbio YAML). This function was added for easy metadata handling for 10X Cell Ranger in the bcbioSingleCell package, but is generally applicable to other bcbio datasets and may be incorporated into bcbioRNASeq in a future update.
Minor changes:
- Improved internal
interestingGroups
handling insideplotHeatmap
family of functions. - Removed message about transgeneNames and spikeNames in
prepareSummarizedExperiment
if missing transcripts are present. Now the function simply lists the genes that don't have metadata inrowRanges
. - Reorganized assert checks imports in
bcbioBase-package.R
file.
Minor changes:
- Improved formatting of package NEWS.
- Improved code coverage by adding ggplot2 unit testing.
- Added
sampleData<-
method support for standarddata.frame
.
Minor changes:
- Added
overwrite
parameter toprepareTemplate
, disabled by default. - Removed validity check for
sampleData
accessor. - Improved vector return in
sampleNames
.
Minor changes:
- Migrated
sanitizeSampleData
from basejump to bcbioBase. - Don't return
interestingGroups
column insampleData
return whenclean = TRUE
. - Removed internal R Markdown shared files. These are already saved per bcbio R package.
New methods:
- Added
SummarizedExperiment
method support forsampleNames
generic.
New functions:
- Exporting ggplot2 convenience functions for easier plotting in
bcbioRNASeq and bcbioSingleCell packages:
bcbio_geom_abline
,bcbio_geom_label
,bcbio_geom_label_average
, andbcbio_geom_label_repel
.
Minor changes:
- Improved assert check messages for interesting groups.
- Switched back to internally using
make.unique
instead ofmake.names
forconvertGenesToSymbols
coercion on SummarizedExperiment objects. We're usingmake.names
only when coercing seurat objects (see bcbioSingleCell code). - Updated
RangedSummarizedExperiment
working example, based onbcbioRNASeq
example dataset.
Minor changes:
- Split out assertive imports so we can pin release to bioconda.
Minor changes:
- Updated recommended Bioconductor installation method for 3.7 release.
- Attempt to add
description
tometadataBlacklist
. readSampleData
: Usedescription
metadata column internally instead ofsampleID
.
Minor changes:
- Using
make.names
instead ofmake.unique
inconvertGenesToSymbols
method forSummarizedExperiment
class.
Major changes:
sampleData
now supportsclean = TRUE
argument, which will hide columns that contain metrics or other calculations from the user. This is useful when preparing an R Markdown report, where we only want to show relevant metadata (e.g. factor columns). This is enabled by default.- Now using
metadataBlacklist
to hide specific columns in sample metadata. sampleMetadata
is now deprecated in favor ofsampleData
. This function still works but will now warn the user, and should be removed from R Markdown template code.- Added
SummarizedExperiment
tolist
coercion support, which uses the same code asflatFiles
. readSampleData
no longer requires or recommendedfileName
column. Only thedescription
column is required for demultiplexed data. In the case of multiplexed samples (e.g. inDrops single-cell RNA-seq), thensampleName
,index
andsequence
names are also required. Multiplexed cell ranger data only requiressampleName
andindex
, since the index barcode isn't present in the counts matrix.
Minor changes:
sampleID
is no longer included in metadata priority columns. This is only to be used internally and should be hidden from the user where possible.prepareSampleData
is no longer needed and is now defunct.
Minor changes:
- Reworked internal code for
readYAML
family of functions. - Reorganized and improved default arguments for heatmap functions.
readSampleData
: switched to usingmerge
instead ofleft_join
internally.- Reexporting viridis family of color functions, including
inferno
, and the British variantscale_colour_viridis
.
plotHeatmap
,plotQuantileHeatmap
: Always attempt to convert genes to symbols for heatmaps.
Minor changes:
- Improved unit tests for
convertGenesToSymbols
andgene2symbol
. - Miscellaneous documentation fixes.
Major changes:
convertGenesToSymbols
now has method support forSummarizedExperiment
.- Updated example datasets to use
rse_bcb
andrse_dds
. - Simplified internal S4 method code for heatmap functions.
prepareSummarizedExperiment
now supports transgenes with thetransgeneNames
argument (e.g. "EGFP"), and FASTA spike-ins with thespikeNames
argument (e.g. "ERCCs"). Additionally,rowRanges
andcolData
are no longer required and can be leftNULL
, although this isn't generally recommended.
Minor changes:
- Consolidated
sampleYAML
function code. - Consolidated assert check imports into
bcbioBase-package.R
file. - Consolidated globals into
globals.R
file. - Improved internal pheatmap color support.
Major changes:
- Added
sampleData
generic. Providing method support forSummarizedExperiment
class. This generic provides consistent sample metadata support in both the bcbioRNASeq and bcbioSingleCell packages. For bcbioSingleCell,colData
returns information about the cells, not the samples. For bcbioRNASeq,colData
andsampleData
provide similar information on the samples. - Added
uniteInterestingGroups
as a generic. Provides support for adding aninterestingGroups
column to a data frame. - Added
SummarizedExperiment
method support forgene2symbol
,interestingGroups
generics. - Converted
prepareSummarizedExperiment
from a generic to a standard function. prepareSummarizedExperiment
now requires rowRanges and supports FASTA spike-ins withisSpike
argument. This helps set up aSingleCellExperiment
object in the bcbioSingleCell package.prepareTemplate
has been converted from a generic to a standard function.readDataVersions
andreadProgramVersions
now return an empty tibble if the file is missing. bcbio doesn't always output these files, so we have changed the behavior from stopping on a missing file to simply warning and returning empty.- Added
sampleDirs
function, that informs the user about the names of the sample directories in the bcbio run.
Minor changes:
- Improved internal factor sanitization of sample metadata for
readSampleMetadataFile
andsampleYAMLMetadata
. - Exporting
projectDirPattern
to match bcbio project directories (e.g.2018-01-01_rnaseq
) across packages. - Changed default of
rdsToken
incopyToDropbox
to beNULL
instead ofNA
. - Reduced the number of imports from basejump package.
- Now using
sessioninfo::session_info
instead ofdevtools::session_info
. - Added AppVeyor CI support for code testing on Windows.
- Made Travis CI checking stricter and added
BiocCheck
. - Simplified package dependencies in DESCRIPTION.
Deprecations:
- Deprecated
sampleMetadata
in favor ofsampleData
. - Deprecated
flatFiles
in favor of usingas(object, "list")
coercion method instead. - Deprecated
prepareSampleMetadata
in favor ofprepareSampleData
. - Made
annotable
method onSummarizedExperiment
objects defunct.
- Don't include sample metadata in
summaryYAMLMetrics
return.
- Now exporting
assertFormalInterestingGroups
in camel case.
- Added back
checkInterestingGroups
since this code is still present on bcbioRNASeq master branch.
- Added internal assert checks.
- Now exporting
sampleMetadata<-
assignment generic. - Updated encrypted token for rdrop2 working example.
- Added
assert_formal_interesting_groups
assert checks. DeprecatedcheckInterestingGroups
.
- Preparing version pinning for bioconda. Relaxed rlang dependency from v0.1.6 to v0.1.2 and tidyr dependency from v0.7.2 to v0.7.1.
- Added
copyToDropbox
function, which enables input of a list of local files and returns Dropbox paths using rdrop2. - Updated basejump dependency to v0.2.1.
- Added assertive checks for all functions.
- NAMESPACE export fixes.
- Minor code cleanup to pass lintr checks.
- Switched to rlang methods for errors, messages, and warnings:
abort
,inform
, andwarn
. - Messages now consistently use backticks instead of apostrophes, as recommended by the tidyverse style guide.
- Removed tibble rownames support from
prepareSampleMetadata
. - Improved code coverage to 100%.
- Re-export
assignAndSaveData
,loadData
,loadRemoteData
,saveData
from basejump package. - Update
prepareTemplate
function to use internally stored data.
- Initial release.