TY - JOUR
T1 - An audio fingerprinting extraction algorithm based on lifting wavelet packet and improved optimal-basis selection
AU - Jiang, Yuantao
AU - Wu, Chunxue
AU - Deng, Kaifa
AU - Wu, Yan
N1 - Publisher Copyright:
© 2018, Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2019/11/1
Y1 - 2019/11/1
N2 - Audio fingerprinting technology is widely applied to the analysis and processing of digital signal, especially in the application of speech recognition which is one of the most popular fields of the intelligent multimedia and artificial intelligence. Traditional audio fingerprinting extraction algorithm is based on the decomposition and reconstruction of the wavelet packet. But the requirement of computational capacity and memory is so large. So this paper proposed an algorithm which is based on the lifting wavelet packet and the improved optimal-basis selection to find the coefficient of optimal wavelet packet. Then the average of the logarithmic energy entropy is adopted as the characteristic parameter. And the capacity of computing and memory is better than the traditional algorithm because of the lifting wavelet packet which is more suitable for processing of speech online and the design of intelligent multimedia. And the experiment results indicate that this algorithm is not only robust for the audio which is handled by some kinds of methods and can reflect the overall characteristics of the audio very well, but also has good distinguishability between different audio.
AB - Audio fingerprinting technology is widely applied to the analysis and processing of digital signal, especially in the application of speech recognition which is one of the most popular fields of the intelligent multimedia and artificial intelligence. Traditional audio fingerprinting extraction algorithm is based on the decomposition and reconstruction of the wavelet packet. But the requirement of computational capacity and memory is so large. So this paper proposed an algorithm which is based on the lifting wavelet packet and the improved optimal-basis selection to find the coefficient of optimal wavelet packet. Then the average of the logarithmic energy entropy is adopted as the characteristic parameter. And the capacity of computing and memory is better than the traditional algorithm because of the lifting wavelet packet which is more suitable for processing of speech online and the design of intelligent multimedia. And the experiment results indicate that this algorithm is not only robust for the audio which is handled by some kinds of methods and can reflect the overall characteristics of the audio very well, but also has good distinguishability between different audio.
KW - Artificial intelligence
KW - Audio fingerprinting
KW - Improved optimum-basis selection algorithm
KW - Intelligent multimedia
KW - Lifting wavelet packet
KW - Speech recognition
UR - https://www.scopus.com/pages/publications/85057103175
U2 - 10.1007/s11042-018-6802-y
DO - 10.1007/s11042-018-6802-y
M3 - 文章
AN - SCOPUS:85057103175
SN - 1380-7501
VL - 78
SP - 30011
EP - 30025
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 21
ER -