DCASE Utilities

A collection of utilities for Detection and Classification of Acoustic Scenes and Events

This is a collection of utilities for Detection and Classification of Acoustic Scenes and Events (DCASE). The utilities were originally created for the DCASE challenge baseline systems (2016 and 2017) and are bundled into a standalone library so that they can be reused in other research projects.

The main goal of the utilities is to streamline research code, making it more readable and easier to maintain. Most of the implemented utilities relate to audio datasets: they handle metadata and various other forms of structured data, and provide a standardized API for audio datasets from various sources.

Documentation

See https://dcase-repo.github.io/dcase_util/ for detailed instructions, manuals, and tutorials.

Installation instructions

The latest stable release is available on PyPI and can be installed with pip:

    pip install dcase_util
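To confirm that the installation succeeded, a quick check can be run from Python. This is a minimal sketch that uses only the standard library; the helper name `is_installed` is hypothetical and is not part of dcase_util.

```python
import importlib.util


def is_installed(package_name):
    """Return True if the import system can locate the named top-level package.

    Hypothetical helper for illustration; it does not import the package,
    it only asks Python whether the package can be found.
    """
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    name = "dcase_util"
    if is_installed(name):
        print(name + " is installed")
    else:
        print(name + " is not installed; try: pip install " + name)
```

Because `find_spec` only looks the package up without importing it, the check stays fast and has no side effects.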

Changelog

master

Update TUTRareSoundEvents_2017_DevelopmentSet and TUTRareSoundEvents_2017_EvaluationSet datasets
Add extract method to SpectralFeatureExtractor
Add automatic conversion of numeric fields when loading CSV data to ListDictContainer
Add filter and get_field_unique methods to ListDictContainer
Add MP4 to valid audio formats for AudioContainer
Add more appropriate error messages

0.1.8 / 2018-01-29

Add new formats for MetaDataContainer (cpickle, csv)
Fix unicode string support when printing container information
Fix data contamination through data references while manipulating data
Add forced file formats when reading and saving containers
Add Keras setup function
Add frame splitting method into AudioContainer
Some minor bug fixes.

0.1.7 / 2017-11-22

Fix data sequencing when overlapping sequencing is used
Dataset class API modifications (e.g. return a copy of metadata to prevent accidental manipulation, uniform method names)
Fix datasets CHiMEHome_DomesticAudioTag_DevelopmentSet, TUTAcousticScenes_2017_EvaluationSet, and TUTSoundEvents_2017_EvaluationSet
Add intersection method for MetaDataContainer

0.1.6 / 2017-11-14

Fix audio reading when target sampling rate is not set
Some minor tweaks to UI utilities.
Example audio is now a 16-bit WAV file instead of the FLAC file used earlier.
ProbabilityContainer API is more compatible with MetaDataContainer now.
MetaDataItem is now compatible with field naming used previously in DCASE baselines
Add CHiMEHome_DomesticAudioTag_EvaluationSet dataset.
Some minor bug fixes.

0.1.5 / 2017-11-10

Fix PyPI package

0.1.4 / 2017-11-10

Release PyPI package

0.1.0 / 2017-11-09

Initial public release

License

Code released under the MIT license.
