Acoustic classification of Australian frogs based on enhanced features and machine learning algorithms

Xie, Jie, Towsey, Michael, Zhang, Jinglan, & Roe, Paul (2016) Acoustic classification of Australian frogs based on enhanced features and machine learning algorithms. Applied Acoustics, 113, pp. 193-201.

[img] PDF (670kB)
Administrators only until 1 December 2018 | Request a copy from author
Available under License Creative Commons Attribution Non-commercial No Derivatives 4.0.

View at publisher


Frogs are often considered as excellent indicators of the overall state of the natural environment, but a steady decrease in the frog population has been noticed worldwide. To monitor this change of frog population and optimise the protection policy, frog call classification has become an important bioacoustic research topic. However, automatic acoustic classification of frog calls has not been adequately addressed in the literature. In this paper, an enhanced feature representation for frog call classification using the temporal, perceptual and cepstral features is presented. With the enhanced feature representation, the time-frequency information of frog calls can be effectively represented, which gives a good classification performance. To be specific, each continuous frog recording is first segmented into individual syllables using the Harmar’s method. Then, temporal, perceptual, and cepstral features are calculated from each syllable: syllable duration, Shannon entropy, Renyi entropy, zero-crossing rate, averaged energy, oscillation rate, spectral centroid, spectral flatness, spectral roll-off, signal bandwidth, spectral flux, fundamental frequency, linear predictive coding, and Mel-frequency cepstral coefficients. Next, different feature vectors are fused to obtain different enhanced feature representations. Finally, different enhanced feature representations are compared using five machine learning algorithms: linear discriminant analysis, K-nearest neighbour, support vector machines, random forest, and artificial neural network. Experiment results show that our proposed feature representation could achieve better classification performance comparing to other methods with twenty-four frog species, which are geographically well distributed throughout Queensland, Australia.

Impact and interest:

0 citations in Scopus
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 96975
Item Type: Journal Article
Refereed: Yes
Keywords: Frog call classification, Soundscape ecology, Bioacoustics, Acoustic feature extraction
DOI: 10.1016/j.apacoust.2016.06.029
ISSN: 0003-682X
Divisions: Current > QUT Faculties and Divisions > Science & Engineering Faculty
Copyright Owner: Copyright 2016 Elsevier Ltd.
Deposited On: 17 Jul 2016 22:48
Last Modified: 04 Jul 2017 16:55

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page