Accessing Videos and Implementing Speaker Recognition System Using Speech Processing

Malhotra, Sachin; Chutani, Anurag; Goel, Tarun; Sharma, Neeru [Guided by]

Please use this identifier to cite or link to this item: http://www.ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7840

Title:	Accessing Videos and Implementing Speaker Recognition System Using Speech Processing
Authors:	Malhotra, Sachin Chutani, Anurag Goel, Tarun Sharma, Neeru [Guided by]
Keywords:	Accessing videos Speech processing
Issue Date:	2014
Publisher:	Jaypee University of Information Technology, Solan, H.P.
Abstract:	Modern speechunderstandingsystemsmergeinterdisciplinarytechnologiesfromsignalpro- cessing, patternrecognition,naturallanguage,andlinguisticsintoaunifiedstatisticalframework. These systems,whichhaveapplicationsinawiderangeofsignalprocessingproblems,representa revolutioninDigitalSignalProcessing(DSP).Onceafielddominatedbyvector-orientedproces- sors andlinearalgebra-basedmathematics,thecurrentgenerationofDSP-basedsystemsrelyon sophisticated statisticalmodelsimplementedusingacomplexsoftwareparadigm.Suchsystems are nowcapableofunderstandingcontinuousspeechinputforvocabulariesofseveralthousand wordsinoperationalenvironments.Weexploredthecorecomponentsofmodernstatistically- based speechrecognitionsystems.Theobjectiveofthisprojectistoimplementaspeechrecogni- tion engineanddevelopasystemforspeakerrecognitionusingMelFrequencyCepstrumsand VectorQuantization.ThiswouldinvolvethedesignofanefficientMATLABcodeonaPC. Throughout thedevelopment,measureswillbetakentokeepthememoryrequirementandthe processing timeofthesoftwareassmallaspossible.EverySpeechRecognitionsystemmustbe judged ontwobasicfactorswhichgovernitsusability-accuracyandspeed.Unfortunately,one of themalmostinvariablycomesatthecostoftheother.Ahigheraccuracyrateimpliesawider training sequenceandahighernumberofiterationsinthelearningalgorithm.Ontheotherhand, accuracyremainsanimportantobjectiveofourproject.Theprecisionoftheabovetwomentioned algorithms thathavebeenuseddependalmostentirelyonthemodelparametersforeveryisolated wordwhichneedstobecalculatedattheveryoutset.Toimproveaccuracy,wecalculatethese parameters inaMATLABenvironmentderivingourresultsonalargenumberoftestsequences recorded inatypicalnoisyenvironment.
URI:	http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7840
Appears in Collections:	B.Tech. Project Reports

Files in This Item:

File	Description	Size	Format
Accessing Videos and Implementing Speaker Recognition System Using Speech Processing.pdf		1.13 MB	Adobe PDF	View/Open

Show full item record