24-27 October 2017
Faculty of Radio Physics, Electronics and Computer Systems

Voice embedding method for speaker recognition task

Not scheduled
15m
Faculty of Radio Physics, Electronics and Computer Systems of Taras Shevchenko National University of Kyiv, acad. Glushkov ave., 4g, Kyiv, Ukraine
Poster: Computer Engineering

Speaker

Oleksandr Korniienko (Ph.D. student, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute")

Description

Most cognitive services deal with understanding voice: recognizing emotions, speech, and speakers. A topical problem is therefore to create a general approach to speech embedding that can serve tasks such as speaker recognition. State-of-the-art speaker recognition methods are significantly restricted in use because they are sensitive to the duration of the speech signal. In this paper, we propose a new approach to embedding speech signals using a recurrent neural network, which can be used for speaker, speech, and emotion recognition. Experiments show that the proposed approach reduces the speaker recognition equal error rate (EER) by 7.5% compared with the state-of-the-art i-vector approach, with voice-model vector dimensions of 16 and 100, respectively, for 2-second speech signals.
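
The equal error rate quoted above is the operating point at which the false acceptance rate and the false rejection rate of a verification system coincide. The following is a minimal sketch of how it could be estimated from trial scores; it is not the authors' evaluation code. The function name `equal_error_rate` and the example scores are hypothetical and chosen only for illustration, and the sketch assumes that a higher score means "same speaker".

```python
def equal_error_rate(genuine_scores, impostor_scores):
    """Estimate the equal error rate from similarity scores.

    Assumption: higher scores indicate the same speaker.
    Returns (eer, threshold) at the candidate threshold where the false
    acceptance and false rejection rates are closest to each other.
    """
    # Candidate thresholds: every distinct observed score.
    thresholds = sorted(set(genuine_scores) | set(impostor_scores))
    best = None
    for t in thresholds:
        # False rejection: a genuine trial scored below the threshold.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        # False acceptance: an impostor trial scored at or above the threshold.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Made-up scores for illustration only (not results from the paper).
genuine = [0.9, 0.8, 0.7, 0.4]    # same-speaker trials
impostor = [0.6, 0.3, 0.2, 0.1]   # different-speaker trials
eer, threshold = equal_error_rate(genuine, impostor)
```

With these made-up scores, one genuine trial falls below the threshold and one impostor trial reaches it, so both error rates equal one in four. Real evaluations use many more trials and often interpolate between thresholds.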

Primary author

Oleksandr Korniienko (Ph.D. student, National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute")
