Please use this identifier to cite or link to this item:
http://www.ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7840
| Title: | Accessing Videos and Implementing Speaker Recognition System Using Speech Processing |
| Authors: | Malhotra, Sachin; Chutani, Anurag; Goel, Tarun; Sharma, Neeru [Guided by] |
| Keywords: | Accessing videos; Speech processing |
| Issue Date: | 2014 |
| Publisher: | Jaypee University of Information Technology, Solan, H.P. |
| Abstract: | Modern speech understanding systems merge interdisciplinary technologies from signal processing, pattern recognition, natural language, and linguistics into a unified statistical framework. These systems, which have applications in a wide range of signal processing problems, represent a revolution in Digital Signal Processing (DSP). Once a field dominated by vector-oriented processors and linear algebra-based mathematics, the current generation of DSP-based systems relies on sophisticated statistical models implemented using a complex software paradigm. Such systems are now capable of understanding continuous speech input for vocabularies of several thousand words in operational environments. We explored the core components of modern statistically-based speech recognition systems. The objective of this project is to implement a speech recognition engine and develop a system for speaker recognition using Mel Frequency Cepstrums and Vector Quantization. This would involve the design of efficient MATLAB code on a PC. Throughout the development, measures will be taken to keep the memory requirement and the processing time of the software as small as possible. Every speech recognition system must be judged on two basic factors which govern its usability: accuracy and speed. Unfortunately, one of them almost invariably comes at the cost of the other. A higher accuracy rate implies a wider training sequence and a higher number of iterations in the learning algorithm. On the other hand, accuracy remains an important objective of our project. The precision of the two algorithms mentioned above depends almost entirely on the model parameters for every isolated word, which need to be calculated at the very outset. To improve accuracy, we calculate these parameters in a MATLAB environment, deriving our results on a large number of test sequences recorded in a typical noisy environment. |
| URI: | http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7840 |
| Appears in Collections: | B.Tech. Project Reports |
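
The abstract names vector quantization over Mel-frequency cepstral features as the speaker recognition approach. As an illustration only, the following is a minimal sketch of the vector quantization step in Python rather than the MATLAB used in the report. It assumes a plain k-means codebook per speaker and a minimum-average-distortion decision rule; the report's actual training procedure is not described in this record. All function names, the codebook size, and the synthetic vectors standing in for cepstral features are hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(vec, codebook):
    """Index of the codeword closest to vec."""
    return min(range(len(codebook)), key=lambda i: sq_dist(vec, codebook[i]))


def train_codebook(vectors, size, iters=20, seed=0):
    """Build a codebook with plain k-means (an assumed training method)."""
    rng = random.Random(seed)
    # Start from randomly chosen training vectors.
    codebook = [list(v) for v in rng.sample(vectors, size)]
    for _ in range(iters):
        clusters = [[] for _ in range(size)]
        for v in vectors:
            clusters[nearest(v, codebook)].append(v)
        for i, members in enumerate(clusters):
            if members:  # keep the old codeword if its cluster is empty
                codebook[i] = [sum(col) / len(members) for col in zip(*members)]
    return codebook


def distortion(vectors, codebook):
    """Average squared distance from each vector to its nearest codeword."""
    return sum(min(sq_dist(v, c) for c in codebook) for v in vectors) / len(vectors)


def identify(vectors, codebooks):
    """Name of the speaker whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda name: distortion(vectors, codebooks[name]))


def fake_features(center, n, rng):
    """Synthetic stand-in for cepstral feature vectors (not real speech data)."""
    return [[c + rng.gauss(0, 0.5) for c in center] for _ in range(n)]


# Demo on synthetic data: two hypothetical speakers with well-separated features.
demo_rng = random.Random(42)
codebooks = {
    "speaker_a": train_codebook(fake_features([0.0] * 4, 60, demo_rng), size=4),
    "speaker_b": train_codebook(fake_features([5.0] * 4, 60, demo_rng), size=4),
}
unknown = fake_features([5.0] * 4, 20, demo_rng)
print(identify(unknown, codebooks))
```

In a real system the synthetic vectors would be replaced by cepstral features extracted frame by frame from recorded speech. A larger codebook or more training iterations would typically trade processing time for accuracy, which is the tension the abstract describes.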
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Accessing Videos and Implementing Speaker Recognition System Using Speech Processing.pdf | | 1.13 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.