Please use this identifier to cite or link to this item: http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7840
Title: Accessing Videos and Implementing Speaker Recognition System Using Speech Processing
Authors: Malhotra, Sachin
Chutani, Anurag
Goel, Tarun
Sharma, Neeru [Guided by]
Keywords: Accessing videos
Speech processing
Issue Date: 2014
Publisher: Jaypee University of Information Technology, Solan, H.P.
Abstract: Modern speech understanding systems merge interdisciplinary technologies from signal processing, pattern recognition, natural language, and linguistics into a unified statistical framework. These systems, which have applications in a wide range of signal processing problems, represent a revolution in Digital Signal Processing (DSP). Once a field dominated by vector-oriented processors and linear algebra-based mathematics, the current generation of DSP-based systems relies on sophisticated statistical models implemented using a complex software paradigm. Such systems are now capable of understanding continuous speech input for vocabularies of several thousand words in operational environments. We explored the core components of modern statistically-based speech recognition systems. The objective of this project is to implement a speech recognition engine and develop a system for speaker recognition using Mel Frequency Cepstrums and Vector Quantization. This would involve the design of efficient MATLAB code on a PC. Throughout the development, measures will be taken to keep the memory requirement and the processing time of the software as small as possible. Every speech recognition system must be judged on two basic factors which govern its usability: accuracy and speed. Unfortunately, one of them almost invariably comes at the cost of the other. A higher accuracy rate implies a wider training sequence and a higher number of iterations in the learning algorithm. On the other hand, accuracy remains an important objective of our project. The precision of the two algorithms mentioned above depends almost entirely on the model parameters for every isolated word, which need to be calculated at the very outset. To improve accuracy, we calculate these parameters in a MATLAB environment, deriving our results on a large number of test sequences recorded in a typical noisy environment.
URI: http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7840
Appears in Collections: B.Tech. Project Reports
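
The abstract names Vector Quantization as the matching technique for speaker recognition. The following is a minimal sketch of that idea, not the report's MATLAB implementation: each enrolled speaker gets a small codebook trained with Lloyd (k-means) iterations, and a test utterance is assigned to the speaker whose codebook gives the lowest average distortion. The function names, the toy 2-D vectors standing in for Mel-frequency cepstral features, and the speaker labels "alice" and "bob" are all assumptions made for illustration.

```python
import math

def nearest(codebook, v):
    """Return (index, squared distance) of the codeword closest to v."""
    best_i, best_d = 0, math.inf
    for i, c in enumerate(codebook):
        d = sum((a - b) ** 2 for a, b in zip(v, c))
        if d < best_d:
            best_i, best_d = i, d
    return best_i, best_d

def train_codebook(vectors, k, iters=20):
    """Train k codewords with Lloyd iterations (assumes len(vectors) >= k)."""
    # Deterministic initialisation: take evenly spaced training vectors.
    step = max(1, len(vectors) // k)
    codebook = [list(vectors[i * step]) for i in range(k)]
    for _ in range(iters):
        cells = [[] for _ in range(k)]
        for v in vectors:
            cells[nearest(codebook, v)[0]].append(v)
        for i, cell in enumerate(cells):
            if cell:  # an empty cell keeps its previous codeword
                codebook[i] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook

def distortion(codebook, vectors):
    """Average squared distance from each vector to its nearest codeword."""
    return sum(nearest(codebook, v)[1] for v in vectors) / len(vectors)

def identify(codebooks, vectors):
    """Pick the enrolled speaker whose codebook fits the vectors best."""
    return min(codebooks, key=lambda name: distortion(codebooks[name], vectors))

# Hypothetical feature vectors; real input would be per-frame cepstral features.
alice = [(0.0, 0.0), (0.2, 0.1), (1.0, 1.0), (1.1, 0.9)]
bob = [(5.0, 5.0), (5.1, 4.9), (6.0, 6.0), (6.2, 6.1)]
codebooks = {"alice": train_codebook(alice, 2), "bob": train_codebook(bob, 2)}

test_utterance = [(0.1, 0.0), (1.0, 1.1)]
speaker = identify(codebooks, test_utterance)
print(speaker)
```

A larger codebook or more iterations usually lowers distortion but costs more training time, which is the accuracy-versus-speed trade-off the abstract describes.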

Files in This Item:
File: Accessing Videos and Implementing Speaker Recognition System Using Speech Processing.pdf
Size: 1.13 MB
Format: Adobe PDF