Normal view MARC view ISBD view

Deep Learning for Noise Robust Distant Speech Recognition (PhD Thesis) (Record no. 815189)

MARC details
000 -LEADER
fixed length control field	03586nam a22002177a 4500
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	240524s2023 PK \|\|\|\|\| m\|\|\| 00\| 0 eng d
022 ## - INTERNATIONAL STANDARD SERIAL NUMBER
ISSN-L	phd
041 ## - LANGUAGE CODE
Language code of text/sound track or separate title	English
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	006.454378242
Item number	KHA
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name	Khan, Danish ur Rehman
9 (RLIN)	882062
245 ## - TITLE STATEMENT
Title	Deep Learning for Noise Robust Distant Speech Recognition (PhD Thesis)
260 ## - PUBLICATION, DISTRIBUTION, ETC.
Place of publication, distribution, etc.	Karachi :
Name of publisher, distributor, etc.	NED University of Engineering and Technology Department of Electronic Engineering,
Date of publication, distribution, etc.	2023
300 ## - PHYSICAL DESCRIPTION
Extent	xxi, 22-159 p. :
Other physical details	ill
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc. note	Includes Bibliographical References
520 ## - SUMMARY, ETC.
Summary, etc.	Abstract <br/>This thesis covers an important knowledge gap concerning Distant and noisy Robust Speech recognition. In addition, the study aims to research and develop a procedure to improve the speech recognition with distant and noisy scenarios. The information can be extracted from both clear and inarticulate speech signals by taking help of speech signal analysis processing and for signal exploration machine learning algorithms give vigorous analytical tools. <br/>This dissertation comprises of three parts, the first explores the best feature to do the analysis. The features are extracted then threshold is applied for every feature to develop an algorithm. The second part comprises of implementing an algorithm which serves as a classifier for Distant and noisy speech. To analyze the developed algorithm efficiency, third part comprises of comparing it with conventional algorithms. <br/>In this research we extracted, analyzed 14 signal features of TensorFlow speech commands dataset without noise, and mean and elevated one which includes 14 features MFCCs, RMS, CENS, "Mel-scaled spectrogram", "Spectral centroid", "Tonal centroid"(tonnetz), Spectral contrast, poly features, STFT, Chroma STFT, ZPR, LPCC, roll-off frequency, Rasta-PLP and Pitch of speech-by-speech and analysis processing, selecting the highly associated aspect of distant and noisy speech then we transformed feature dataset for machine learning model implementation we applied Deep and Machine learning (Convolutional neural network and LSTM, "Random Forest", KNN, SVM, Voting Model) with comparison of all features on Simple , Noisy, very noisy dataset as well as ensemble model and comparison of all models with ensemble. <br/>We applied machine and deep ensemble model, compared all models with ensemble and did recognition also compared the features and result for speech both with and without noise and distance respectively. <br/>The major findings described in the thesis indicate that: <br/>1. MFCCs, "Mel-scaled spectrogram", "Poly feature" and ZCR: 4 features introduced in this study. These features have significant effects on the classification accuracy of the algorithm. <br/>2. The developed Robust Ensemble algorithm serves as the classifier for distant and noisy speech recognition. <br/>3. The implemented algorithm shows 97% performance in distant and noisy speech recognition. <br/>4. Machine and deep ensemble model applied, compared with all models with ensemble and did recognition also compared .. It is shown that improved classification accuracy of almost 97% in most of the distant and noisy speech recognition with minor tradeoff. <br/>5. Correlation method provided the reduced feature set which shows the improved performance. <br/>6. Further improvement can be achieved by using Robust Ensemble deep learning algorithms on large data set <br/>
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN)	883062
Topical term or geographic name entry element	Deep Learning Thesis
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN)	883063
Topical term or geographic name entry element	Distant Speech Recognition Thesis
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN)	882384
Topical term or geographic name entry element	LSTM Thesis
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier	<a href="https://eaklibrary.neduet.edu.pk:8443/catalog/bk/books/toc/98658.pdf">https://eaklibrary.neduet.edu.pk:8443/catalog/bk/books/toc/98658.pdf</a>
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme	Dewey Decimal Classification
Koha item type	PHD Thesis
Suppress in OPAC	No

Holdings
Withdrawn status	Lost status	Source of classification or shelving scheme	Physical Form	Damaged status	Not for loan	Home library	Current library	Shelving location	Date acquired	Stock Type	Total Checkouts	Full call number	Barcode	Date last seen	Accession Date	Koha item type
		Dewey Decimal Classification	Text, Hardcover			Reference Section	Reference Section	Reference Section	24/05/2024	Donation		006.454378242 KHA	98658	24/05/2024	24/05/2024	Reference Collection
		Dewey Decimal Classification	Text, Hardcover			Reference Section	Reference Section	Reference Section	24/05/2024	Donation		006.454378242 KHA	98659	24/05/2024	24/05/2024	Reference Collection

Print
Suggest for purchase
Send to device
Save record
BIBTEX Dublin Core ISBD MARCXML RIS
More searches

Search for this title in:
Other Databases (Google Scholar) Open Library (openlibrary.org) Pakistan Union Catalogue (pastic.gov.pk) Other Libraries (WorldCat)