Pitch estimation is still an open issue in contemporary signal processing research. The growing momentum of machine learning applications in a data-driven society makes it possible to tackle this problem from a new perspective. This work leverages that opportunity to propose a refined Instantaneous Frequency and power based pitch Estimator method called IFE. It combines deep neural network based pitch estimation with an audio front end used to extract the instantaneous frequency and power of signal components. A thorough analysis of the results is performed, and the major advantages and shortcomings of the method are identified, leading to a wide array of suggestions for future improvement. While IFE exhibits an instantaneous temporal resolution, it is compared against state-of-the-art pitch estimators operating on time windows, showing a comparable degree of prediction accuracy (up to 6% accuracy improvement) while maintaining the advantage of higher temporal resolution.
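The abstract does not say how the front end extracts instantaneous frequency and power, so the following is only a minimal illustrative sketch of one textbook approach (the analytic signal obtained through a Hilbert-style spectral mask), not the IFE front end itself. The function names `analytic_signal` and `instantaneous_frequency_and_power` are hypothetical, and the naive O(N^2) DFT is chosen purely to keep the example free of third-party dependencies.

```python
import cmath
import math


def analytic_signal(x):
    """Analytic signal of a real sequence x, using a naive O(N^2) DFT.

    Illustrative assumption: this is a generic textbook construction,
    not the front end described in the publication.
    """
    n = len(x)
    # Forward DFT of the real input.
    spectrum = [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    # Keep DC (and Nyquist when n is even), double the positive
    # frequencies, and zero the negative ones.
    for k in range(n):
        if k == 0 or (n % 2 == 0 and k == n // 2):
            gain = 1.0
        elif k < (n + 1) // 2:
            gain = 2.0
        else:
            gain = 0.0
        spectrum[k] *= gain
    # Inverse DFT gives the complex analytic signal.
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def instantaneous_frequency_and_power(x, sample_rate):
    """Per-sample frequency (Hz) and power estimates of a single component.

    Frequency is the phase advance between consecutive analytic samples,
    so it has one element fewer than the input; power is the squared
    envelope magnitude.
    """
    z = analytic_signal(x)
    freq = [
        cmath.phase(z[t + 1] * z[t].conjugate()) * sample_rate / (2 * math.pi)
        for t in range(len(z) - 1)
    ]
    power = [abs(v) ** 2 for v in z]
    return freq, power


if __name__ == "__main__":
    fs = 8000  # sample rate in Hz (assumed for the demo)
    # 500 Hz tone with amplitude 0.5; 256 samples hold exactly 16 periods.
    tone = [0.5 * math.cos(2 * math.pi * 500 * t / fs) for t in range(256)]
    freq, power = instantaneous_frequency_and_power(tone, fs)
    print(round(freq[0], 3), round(power[0], 3))
```

For a clean tone that fits a whole number of periods into the buffer, every frequency estimate should sit at the tone frequency and every power estimate at the squared amplitude. Real audio contains several components at once, which is presumably why the method described above works on the instantaneous frequency and power of individual signal components rather than on the raw waveform.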
Authors
Additional information
- DOI: 10.1109/hsi52170.2021.9538713
- Category: Conference activity
- Type: publication in a peer-reviewed collective volume (including conference proceedings)
- Language: English
- Year of publication: 2021
Data source: MOSTWiedzy.pl - publication "IFE: NN-aided Instantaneous Pitch Estimation"