VGTU talpykla > Doktorantūros skyrius / Department for Doctoral Studies > Technologijos mokslų daktaro disertacijos ir jų santraukos >

Lietuvių   English
Please use this identifier to cite or link to this item: http://dspace.vgtu.lt/handle/1/4309

Title: Improvement of learning-based methods for localization of multiple sound sources
Other Titles: Mokymu gristų metodų keliems garso šaltiniams lokalizuoti tobulinimas
Authors: Sakavičius, Saulius
Issue Date: 8-Nov-2021
Publisher: Vilniaus Gedimino technikos universitetas
Citation: Sakavičius, S. 2021. Improvement of learning-based methods for localization of multiple sound sources: doctoral dissertation. Vilnius: Vilniaus Gedimino technikos universitetas. 162 p.
Abstract: Sound source localization is an important topic in humanmachine interacting, teleconferencing, security systems, as well as autonomous driving and robotics. While current state-of-the-art sound source localization methods allow localization of a single or a small number of sound sources in moderately reverberant environments, it is known that their performance deteriorates when the reverberation time is increased. Moreover, the localization of multiple sound sources is an even more difficult task. Learning-based sound source localization methods recently gained interest as they tend to outperform the state-of-the-art methods in multiple source localization cases in reverberant environments. Nevertheless, this branch of sound source localization methods is not yet sufficiently investigated. Therefore, this thesis is aimed to the research of such methods. Both regression-based and classification-based methods for single and multiple sound source localization in two-dimensional and three-dimensional space are investigated. Supervised and semi-supervised training strategies are researched. A dataset of tetrahedral microphone array signals is collected for the evaluation of the performance of sound source localization methods. The dissertation consist of an introduction, three chapters and general conclusions. In the introduction, the dissertation problem is formulated, the object of the research is defined and the aim of the thesis is presented. Next, the objectives of the thesis are formulated. A brief presentation of the research methodology is provided, followed by the outline of the scientific novelty of the thesis and the practical value of the research findings. Finally, the defended statements are formulated. The first chapter reveals the state of the art of sound source localization using microphone arrays and networks. In the section, most important sound source localization methods are outlined, with an emphasis on learning-based source localization methods. In the second chapter presented are the learning-based sound source localization methods suggested by the author. Specifically, the multi-layer perceptron-based method for single sound source localization in two dimensions, the convolutional neural network-based methods for multiple sound source localization in two and three dimensions and the Graph-Regularized Neural Network-based single sound source localization method. In the third chapter, the experimental setups for evaluation of the performance of the original methods, presented in the second chapter, and the results of the experimentation are presented. In the final chapter, the discussion on the experimental results is presented and the conclusions are drawn. The results of the thesis were published in six scientific publications: three papers in the reviewed scientific journals and three papers in other journals. Additionally, the results of the research were presented in five conferences.
Description: Doctoral dissertation
URI: http://dspace.vgtu.lt/handle/1/4309
Appears in Collections:Technologijos mokslų daktaro disertacijos ir jų santraukos

Files in This Item:

File Description SizeFormat
Saulius Sakavicius Thesis Erata 2021-12-09.pdf414.44 kBAdobe PDFView/Open
S_Sakaviciaus disertacija.pdf15.38 MBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback