Speech Recognition Systems

Speech Recognition Systems
2 months
Certification paid
Course by EdX
Speech Recognition Systems
What will you learn?
Fundamentals of Speech Recognition
Basic Signal Processing for Speech Recogntion
Acoustic Modeling and Labeling
Common Algorithms for Language Modeling
Decoding Acoustic Features into Speech
About the course

Developing and understanding Automatic Speech Recognition (ASR) systems is an inter-disciplinary activity, taking expertise in linguistics, computer science, mathematics, and electrical engineering.

When a human speaks a word,they causetheir voice to make a time-varying pattern of sounds. These sounds are waves of pressure that propagate through the air. The sounds are captured by a sensor, such as a microphone or microphone array, and turned into a sequence of numbers representing the pressure change over time. The automatic speech recognition system converts this time-pressure signal into a time-frequency-energy signal. It has been trained on a curated set of labeled speech sounds, and labels the sounds it is presented with. These acoustic labels are combined with a model of word pronunciation and a model of word sequences, to create a textual representation of what was said.

Instead of exploring one part of this process deeply, this course is designed to give an overview of the components of a modern ASR system. In each lecture, we describe a component's purpose and general structure. In each lab, the student creates a functioning block of the system. At the end of the course, we will have built a speech recognition system almost entirely out of Python code.

edX offers financial assistance for learners who want to earn Verified Certificates but who may not be able to pay the fee. To apply for financial assistance, enroll in the course, then follow this link to complete an application for assistance.

Note: These courses will retire in June. Please enroll only if you are able to finish your coursework in time.

Speech Recognition Systems
Learn about the pieces of a modern automatic speech recognition (ASR) system as we cover fundamental acoustic and linguistic theory, data preparation, language modeling, acoustic modeling, and decoding.
Adrian Leven
Adrian Leven
Content Developer Microsoft Corporation
All the courses on this platform are free of charge. The authors are top universities and corporations that seek to maintain high quality standards. If you do not meet a deadline for assignments, you lose points. Like on other platforms, the videos in which the theory is explained are followed by practical assignments. Courses are available in English, Chinese, Spanish, French and Hindi.
Like any other website, konevy uses «cookies». These cookies are used to store information including visitor's preferences, and the pages on the website that the visitor accessed or visited. The information is used to optimize the users' experience by customizing our web page content based on visitors' browser type and/or other information. For more general information on cookies, please read the «What Are Cookies» article on Cookie Consent website.