Deep Learning for Noise Robust Distant Speech Recognition (PhD Thesis) (Record no. 815189)

MARC details
000 -LEADER
fixed length control field 03586nam a22002177a 4500
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 240524s2023 PK ||||| m||| 00| 0 eng d
022 ## - INTERNATIONAL STANDARD SERIAL NUMBER
ISSN-L phd
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title English
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 006.454378242
Item number KHA
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name Khan, Danish ur Rehman
9 (RLIN) 882062
245 ## - TITLE STATEMENT
Title Deep Learning for Noise Robust Distant Speech Recognition (PhD Thesis)
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Place of publication, distribution, etc. Karachi :
Name of publisher, distributor, etc. NED University of Engineering and Technology Department of Electronic Engineering,
Date of publication, distribution, etc. 2023
300 ## - PHYSICAL DESCRIPTION
Extent xxi, 22-159 p. :
Other physical details ill
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc. note Includes Bibliographical References
520 ## - SUMMARY, ETC.
Summary, etc. Abstract <br/>This thesis covers an important knowledge gap concerning Distant and noisy Robust Speech recognition. In addition, the study aims to research and develop a procedure to improve the speech recognition with distant and noisy scenarios. The information can be extracted from both clear and inarticulate speech signals by taking help of speech signal analysis processing and for signal exploration machine learning algorithms give vigorous analytical tools. <br/>This dissertation comprises of three parts, the first explores the best feature to do the analysis. The features are extracted then threshold is applied for every feature to develop an algorithm. The second part comprises of implementing an algorithm which serves as a classifier for Distant and noisy speech. To analyze the developed algorithm efficiency, third part comprises of comparing it with conventional algorithms. <br/>In this research we extracted, analyzed 14 signal features of TensorFlow speech commands dataset without noise, and mean and elevated one which includes 14 features MFCCs, RMS, CENS, "Mel-scaled spectrogram", "Spectral centroid", "Tonal centroid"(tonnetz), Spectral contrast, poly features, STFT, Chroma STFT, ZPR, LPCC, roll-off frequency, Rasta-PLP and Pitch of speech-by-speech and analysis processing, selecting the highly associated aspect of distant and noisy speech then we transformed feature dataset for machine learning model implementation we applied Deep and Machine learning (Convolutional neural network and LSTM, "Random Forest", KNN, SVM, Voting Model) with comparison of all features on Simple , Noisy, very noisy dataset as well as ensemble model and comparison of all models with ensemble. <br/>We applied machine and deep ensemble model, compared all models with ensemble and did recognition also compared the features and result for speech both with and without noise and distance respectively. <br/>The major findings described in the thesis indicate that: <br/>1. MFCCs, "Mel-scaled spectrogram", "Poly feature" and ZCR: 4 features introduced in this study. These features have significant effects on the classification accuracy of the algorithm. <br/>2. The developed Robust Ensemble algorithm serves as the classifier for distant and noisy speech recognition. <br/>3. The implemented algorithm shows 97% performance in distant and noisy speech recognition. <br/>4. Machine and deep ensemble model applied, compared with all models with ensemble and did recognition also compared .. It is shown that improved classification accuracy of almost 97% in most of the distant and noisy speech recognition with minor tradeoff. <br/>5. Correlation method provided the reduced feature set which shows the improved performance. <br/>6. Further improvement can be achieved by using Robust Ensemble deep learning algorithms on large data set <br/>
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN) 883062
Topical term or geographic name entry element Deep Learning Thesis
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN) 883063
Topical term or geographic name entry element Distant Speech Recognition Thesis
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN) 882384
Topical term or geographic name entry element LSTM Thesis
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier <a href="https://eaklibrary.neduet.edu.pk:8443/catalog/bk/books/toc/98658.pdf">https://eaklibrary.neduet.edu.pk:8443/catalog/bk/books/toc/98658.pdf</a>
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme Dewey Decimal Classification
Koha item type PHD Thesis
Suppress in OPAC No
Holdings
Withdrawn status Lost status Source of classification or shelving scheme Physical Form Damaged status Not for loan Home library Current library Shelving location Date acquired Stock Type Total Checkouts Full call number Barcode Date last seen Accession Date Koha item type
    Dewey Decimal Classification Text, Hardcover     Reference Section Reference Section Reference Section 24/05/2024 Donation   006.454378242 KHA 98658 24/05/2024 24/05/2024 Reference Collection
    Dewey Decimal Classification Text, Hardcover     Reference Section Reference Section Reference Section 24/05/2024 Donation   006.454378242 KHA 98659 24/05/2024 24/05/2024 Reference Collection