Call and Component Evaluation for Improved Performance of Recognition of Killer Whale Individuals

Conference Paper

Title: Call and Component Evaluation for Improved Performance of Recognition of Killer Whale Individuals

Author:

Nichols, N.; Atlas, L.; Bowles, A.; Roch, M.

Publication Date:

September 20, 2010

Event Name:

OCEANS 2010 Seattle

Event Location:

Seattle, WA (US)

Pages:

4

Affiliation

University of Washington

Receptor:

Marine Mammals, Cetaceans

Language:

English

Document Access

Website:

https://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5664444

Citation

Nichols, N.; Atlas, L.; Bowles, A.; Roch, M. (2010). Call and Component Evaluation for Improved Performance of Recognition of Killer Whale Individuals. Paper presented at OCEANS 2010 Seattle, Seattle, WA (US).

@conference{Nichols-2010-5566,
author = {Nichols, N and Atlas, L and Bowles, A and Roch, M},
title = {Call and Component Evaluation for Improved Performance of Recognition of Killer Whale Individuals},
year = {2010},
month = {sep},
booktitle = {OCEANS 2010 Seattle},
pages = {4},
address = {Seattle, WA (US)},
url = {https://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5664444},
keywords = {Marine Mammals, Cetaceans},
}

TY - CONF
TI - Call and Component Evaluation for Improved Performance of Recognition of Killer Whale Individuals
AU - Nichols, N
AU - Atlas, L
AU - Bowles, A
AU - Roch, M
T2 - OCEANS 2010 Seattle
C1 - Seattle, WA (US)
AB - The objective of this experiment was to determine the contribution of the initial broadband component of the SD1(1a/b) vocalization to the recognition of individual killer whales (Orcinus orca). Prior research showed that classification using the SD1(1a/b) vocalization performed 23% better than classification using the SD3(1) vocalization. One possible explanation for this observation was the presence of a broadband buzz at the initiation of the SD1 call. It was theorized that the broadband buzz sampled the frequency response of the vocal production mechanism (classically described as the filter in the source-filter model of speech) more continuously and potentially contributed to the observed increase in recognition. Experiments used vocalizations provided by Hubbs-SeaWorld Research Institute: 20 SD1(1a/b) vocalizations for each of four whales (2 male, 2 female). The broadband component was hand-segmented from each vocalization. Classification was performed on the full and segmented vocalizations with a Gaussian mixture model, using mel-frequency cepstral coefficient feature vectors. Using the full vocalization, overall accuracy was 75 +/- 2% (95% confidence interval). Using only the segmented broadband component, overall accuracy was 56 +/- 2% (95% confidence interval). Chance performance was 25%. These results cannot definitively support or reject a source-filter model, but they do point to the need for focused research to develop appropriate feature vectors for individual identification using acoustic cues.
DA - 2010/09//
PY - 2010
SP - 4
UR - https://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5664444
LA - English
KW - Marine Mammals
KW - Cetaceans
ER -

Abstract

The objective of this experiment was to determine the contribution of the initial broadband component of the SD1(1a/b) vocalization to the recognition of individual killer whales (Orcinus orca). Prior research showed that classification using the SD1(1a/b) vocalization performed 23% better than classification using the SD3(1) vocalization. One possible explanation for this observation was the presence of a broadband buzz at the initiation of the SD1 call. It was theorized that the broadband buzz sampled the frequency response of the vocal production mechanism (classically described as the filter in the source-filter model of speech) more continuously and potentially contributed to the observed increase in recognition. Experiments used vocalizations provided by Hubbs-SeaWorld Research Institute: 20 SD1(1a/b) vocalizations for each of four whales (2 male, 2 female). The broadband component was hand-segmented from each vocalization. Classification was performed on the full and segmented vocalizations with a Gaussian mixture model, using mel-frequency cepstral coefficient feature vectors. Using the full vocalization, overall accuracy was 75 +/- 2% (95% confidence interval). Using only the segmented broadband component, overall accuracy was 56 +/- 2% (95% confidence interval). Chance performance was 25%. These results cannot definitively support or reject a source-filter model, but they do point to the need for focused research to develop appropriate feature vectors for individual identification using acoustic cues.
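The classification scheme described in the abstract (one Gaussian mixture model per whale, with a call assigned to the whale whose model gives it the highest likelihood) can be illustrated with a minimal sketch. This is not the authors' implementation. Everything in it is an assumption made for illustration: the function names, the use of two diagonal-covariance components, the 15/5 train/test split, and above all the data, which are random synthetic frames standing in for mel-frequency cepstral coefficient vectors rather than features computed from real vocalizations. Only the counts (four whales, 20 calls each) follow the abstract.

```python
import math
import random


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at frame x."""
    return -0.5 * sum(
        math.log(2 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def logsumexp(values):
    """Numerically stable log(sum(exp(v)))."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


def component_log_probs(x, gmm):
    """log(weight * density) of frame x under each mixture component."""
    return [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]


def fit_gmm(frames, k=2, iters=20, seed=0, var_floor=1e-3):
    """Fit a k-component diagonal GMM with plain EM (k and iters are assumed)."""
    rng = random.Random(seed)
    dim = len(frames[0])
    # Initialise: means from random frames, unit variances, equal weights.
    gmm = [(1.0 / k, list(x), [1.0] * dim) for x in rng.sample(frames, k)]
    for _ in range(iters):
        # E-step: responsibility of each component for each frame.
        resp = []
        for x in frames:
            lp = component_log_probs(x, gmm)
            norm = logsumexp(lp)
            resp.append([math.exp(v - norm) for v in lp])
        # M-step: re-estimate weight, mean, and variance of each component.
        updated = []
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-10
            mean = [sum(r[j] * x[d] for r, x in zip(resp, frames)) / nj
                    for d in range(dim)]
            var = [max(var_floor,
                       sum(r[j] * (x[d] - mean[d]) ** 2
                           for r, x in zip(resp, frames)) / nj)
                   for d in range(dim)]
            updated.append((nj / len(frames), mean, var))
        gmm = updated
    return gmm


def score(frames, gmm):
    """Total log-likelihood of all frames of one call under one model."""
    return sum(logsumexp(component_log_probs(x, gmm)) for x in frames)


def classify(frames, models):
    """Return the whale whose model assigns the call the highest likelihood."""
    return max(models, key=lambda whale: score(frames, models[whale]))


def synthetic_call(rng, centre, n_frames=30):
    """Hypothetical stand-in for the MFCC frames of one call (not real data)."""
    return [[rng.gauss(c, 1.0) for c in centre] for _ in range(n_frames)]


rng = random.Random(1)
# Four hypothetical whales, each with an arbitrary feature-space centre.
centres = {
    "whale_A": [0.0, 0.0, 0.0, 0.0],
    "whale_B": [4.0, 0.0, 0.0, 0.0],
    "whale_C": [0.0, 4.0, 0.0, 0.0],
    "whale_D": [4.0, 4.0, 0.0, 0.0],
}
# 20 calls per whale, as in the abstract; the 15/5 split below is assumed.
calls = {w: [synthetic_call(rng, c) for _ in range(20)]
         for w, c in centres.items()}

models = {}
for whale, whale_calls in calls.items():
    train_frames = [f for call in whale_calls[:15] for f in call]
    models[whale] = fit_gmm(train_frames)

held_out = [(w, call) for w, cs in calls.items() for call in cs[15:]]
correct = sum(classify(call, models) == w for w, call in held_out)
accuracy = correct / len(held_out)
print(f"accuracy on synthetic data: {accuracy:.2f} "
      f"(chance: {1 / len(models):.2f})")
```

With four equally represented whales, chance performance is 1/4, which matches the 25% figure quoted in the abstract. The accuracy this sketch prints reflects only how well separated the synthetic centres are and says nothing about the reported 75% and 56% results.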