Audio Feature Extraction Techniques

The CMU Pronouncing Dictionary is an open-source dictionary developed by Carnegie Mellon University that maps English words to their North American pronunciations. It is commonly used in speech recognition and speech synthesis applications. Key features extracted from signals in the time domain include the mean, variance, standard deviation, kurtosis, and waveform length. Frequency domain features, extracted from a power spectral density estimate, include the mean frequency, median frequency, maximum to minimum drop in power density ratio, and signal to noise ratio. These features are used as inputs for machine learning models in speech and audio applications.

Uploaded by

Rowa salman


University of Technology – Department of Computer Science

Final exam report for the semester (second course)

Academic year 2019-2020

Report title
(Speech recognition)

Q/ What is the CMU Pronouncing Dictionary? Where can it be used in our subject? Which organization developed it?
Solution:
The CMU Pronouncing Dictionary (also known as CMUdict) is an open-source pronouncing
dictionary originally created by the Speech Group at Carnegie Mellon University (CMU) for use in
speech recognition research.

CMUdict provides an orthographic-to-phonetic mapping for English words in their North American
pronunciations. It is commonly used to generate representations for automatic speech recognition (ASR), e.g.
the CMU Sphinx system, and text-to-speech synthesis (TTS), e.g. the Festival system. CMUdict can be used
as a training corpus for building statistical grapheme-to-phoneme (g2p) models that will generate
pronunciations for words not yet included in the dictionary. The most recent release is 0.7b; it
contains over 134,000 entries. An interactive lookup version is available.
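
As a rough illustration of the word-to-phoneme mapping described above, the sketch below parses a few lines laid out like the plain-text CMUdict file. The sample entries, the `parse_cmudict` and `stress_pattern` helpers, and the assumed layout (a word, whitespace, ARPAbet phonemes, `;;;` for comments, `(1)` for an alternative pronunciation) are illustrative assumptions, not part of the report.

```python
import re

# Hypothetical excerpt in the assumed CMUdict plain-text layout.
SAMPLE = """\
;;; sample entries for illustration only
SPEECH  S P IY1 CH
TOMATO  T AH0 M EY1 T OW2
TOMATO(1)  T AH0 M AA1 T OW2
"""

def parse_cmudict(text):
    """Return {word: [phoneme list, ...]} from CMUdict-style text."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(";;;"):
            continue  # skip blanks and comment lines
        head, *phones = line.split()
        # "TOMATO(1)" is a variant pronunciation of "TOMATO"
        word = re.sub(r"\(\d+\)$", "", head).lower()
        entries.setdefault(word, []).append(phones)
    return entries

def stress_pattern(phones):
    """Collect the stress digits carried by vowel phonemes (0, 1 or 2)."""
    return "".join(p[-1] for p in phones if p[-1].isdigit())

lexicon = parse_cmudict(SAMPLE)
for word, variants in lexicon.items():
    print(word, variants)
```

A lookup table of this kind is what a recognizer or synthesizer would consult for known words; words missing from it are the ones a g2p model would have to handle.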

Applications
• The Unifon converter is based on the CMU Pronouncing Dictionary.
• The Natural Language Toolkit contains an interface to the CMU Pronouncing Dictionary.
• The Carnegie Mellon Logios tool incorporates the CMU Pronouncing Dictionary.
• PronunDict, a pronunciation dictionary of American English, uses the CMU Pronouncing
Dictionary as its data source. Pronunciation is transcribed in IPA symbols. This dictionary also
supports searching by pronunciation.

Are there any other dictionaries that have been used for the same purpose?
Yes, there are; for example, the LOGIOS Lexicon Tool.

Q/ What are the feature extraction methods in the time domain and in the frequency domain?
Solution:
Domain-specific feature extraction:-

• Failure mode: depending on the failure type, certain ratios, differences, DFEs, etc. are extracted for tracking over time.
• Operating mode: specific sensors can be more or less critical in different operating conditions of machines…
- raw sensors to be used for feature extraction…
- variances under different conditions can themselves form the basis for further feature extraction
• Component function: features are extracted on the basis of knowledge about the specific components for which PHM is desired…
• Known relations: certain relation types can be assumed between variables of interest… this can affect the features calculated for those relations…

Time domain features:-

Time domain features are extracted directly from the signal, so they are easy to implement. Ease of
implementation is their main advantage. Their major disadvantage comes from the non-stationary
nature of the signal: its statistical properties change over time, whereas time domain features
assume the data is a stationary signal. In addition, time domain features are calculated from
signal amplitude values, so interference acquired during recording is a further disadvantage of
these features.
a. Mean
The mean is the most common and most easily implemented time domain feature. It is simply the average
of the EMG amplitude values over the sample length N of the signal.

\[ \mu = \frac{1}{N} \sum_{n=1}^{N} x_n \]

b. Variance
Variance is also one of the most common statistical measures used for time domain feature extraction.

\[ \mathrm{var} = \frac{1}{N-1} \sum_{n=1}^{N} (x_n - \mu)^2 \]
c. Standard Deviation

\[ \sigma = \sqrt{\frac{1}{N-1} \sum_{n=1}^{N} (x_n - \mu)^2} \]
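
The mean, variance and standard deviation defined above can be computed directly from one frame of samples. The following is a minimal sketch in plain Python; the function names and the sample `frame` values are made up for illustration, and the N - 1 divisor follows the formulas in this report.

```python
import math

def mean(x):
    """Arithmetic mean of the amplitude samples."""
    return sum(x) / len(x)

def variance(x):
    """Sample variance with the N - 1 divisor used in the text."""
    mu = mean(x)
    return sum((v - mu) ** 2 for v in x) / (len(x) - 1)

def std(x):
    """Standard deviation: square root of the sample variance."""
    return math.sqrt(variance(x))

# Hypothetical frame of amplitude samples
frame = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(mean(frame), variance(frame), std(frame))
```

In practice these values would be computed per analysis window, so each window contributes one number per feature to the feature vector.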

e. Kurtosis

Kurtosis is a measure of the peakedness of a probability distribution; it is based on the fourth-order cumulant.

\[ \mathrm{kurt} = \frac{\frac{1}{N} \sum_{n=1}^{N} (x_n - \mu)^4}{\sigma^4} \]
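
A small sketch of the kurtosis feature follows. It assumes that σ is the standard deviation with the N - 1 divisor, as in the standard deviation formula of this report, and that the fourth-power sum is divided by N; other conventions exist and give slightly different values. The `kurtosis` name and the sample frame are illustrative.

```python
def kurtosis(x):
    """Fourth central moment divided by sigma**4 (sigma uses N - 1)."""
    n = len(x)
    mu = sum(x) / n
    fourth_moment = sum((v - mu) ** 4 for v in x) / n
    sample_var = sum((v - mu) ** 2 for v in x) / (n - 1)
    return fourth_moment / sample_var ** 2  # sigma**4 == variance**2

# Hypothetical frame of amplitude samples
frame = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(kurtosis(frame))
```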

f. Mean Absolute Deviation

The mean absolute deviation is the average of the absolute deviations of the data points from their mean μ.

\[ \mathrm{MAD} = \frac{1}{N} \sum_{n=1}^{N} |x_n - \mu| \]
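
The mean absolute deviation can be sketched in a few lines. The function name and sample values below are illustrative only.

```python
def mean_absolute_deviation(x):
    """Average absolute distance of each sample from the mean."""
    mu = sum(x) / len(x)
    return sum(abs(v - mu) for v in x) / len(x)

# Hypothetical frame of amplitude samples
frame = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(mean_absolute_deviation(frame))
```

Because it uses absolute values instead of squares, this feature reacts less strongly to isolated large samples than the variance does.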
g. AR Coefficients

AR (autoregressive) coefficients are a popular feature extraction method for biological signals. AR modeling
finds an equation that fits the signal by predicting each sample from the previous P data points of the signal,
where a_k are the AR coefficients and e[n] is the residual error.

\[ x[n] = -\sum_{k=1}^{P} a_k \, x[n-k] + e[n] \]
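
The report does not say how the AR coefficients are estimated. One common option, used here purely as an assumption, is to solve the Yule-Walker equations with the Levinson-Durbin recursion. The sketch below follows the sign convention of the equation above (each sample equals minus the weighted sum of past samples plus an error term); the function names and the synthetic test signal are hypothetical.

```python
import random

def autocorr(x, max_lag):
    """Biased autocorrelation estimates r[0..max_lag] of the mean-removed signal."""
    n = len(x)
    mu = sum(x) / n
    c = [v - mu for v in x]
    return [sum(c[i] * c[i + lag] for i in range(n - lag)) / n
            for lag in range(max_lag + 1)]

def ar_coefficients(x, order):
    """Estimate a_1..a_P with the Levinson-Durbin recursion."""
    r = autocorr(x, order)
    a = [1.0]          # a_0 is fixed to 1
    err = r[0]         # prediction error power
    for m in range(1, order + 1):
        acc = r[m] + sum(a[k] * r[m - k] for k in range(1, m))
        k_m = -acc / err                      # reflection coefficient
        prev = a + [0.0]
        a = [prev[i] + k_m * prev[m - i] for i in range(m + 1)]
        err *= 1.0 - k_m * k_m
    return a[1:]

# Synthetic AR(1) signal x[n] = 0.5 x[n-1] + e[n], so a_1 should land near -0.5
rng = random.Random(0)
x = [0.0]
for _ in range(5000):
    x.append(0.5 * x[-1] + rng.gauss(0.0, 1.0))
coeffs = ar_coefficients(x, order=2)
print(coeffs)
```

The estimated coefficients of each window, typically for a low model order, are then used together as one feature vector.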

h. Waveform Length

Waveform length is a measure of the complexity of the EMG signal. It is defined as the cumulative length of the
EMG waveform over the time segment.

\[ \mathrm{WL} = \sum_{n=1}^{N-1} |x_{n+1} - x_n| \]
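
Waveform length is the sum of the absolute differences between consecutive samples. A minimal sketch, with an illustrative function name and sample frame:

```python
def waveform_length(x):
    """Cumulative absolute difference between neighbouring samples."""
    return sum(abs(b - a) for a, b in zip(x, x[1:]))

# Hypothetical frame of amplitude samples
frame = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(waveform_length(frame))
```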

Frequency Domain Features :-

Frequency domain features are widely extracted using the power spectral density (PSD). In this work, the
periodogram is used to estimate the PSD. Six frequency domain features are
extracted from the PSD; their definitions are given below.
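
To make the periodogram step concrete, the sketch below estimates a one-sided PSD with a direct discrete Fourier transform in plain Python. It is a slow O(N²) illustration under the usual scaling |X[k]|² / (fs · N); a real implementation would use an FFT library. The `periodogram` function, the sampling rate and the test sine wave are assumptions for illustration.

```python
import cmath
import math

def periodogram(x, fs):
    """One-sided periodogram PSD estimate; returns (freqs, psd)."""
    n = len(x)
    half = n // 2
    freqs = [k * fs / n for k in range(half + 1)]
    psd = []
    for k in range(half + 1):
        X = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        p = abs(X) ** 2 / (fs * n)
        # Fold negative frequencies in: double every bin except DC and Nyquist
        if k != 0 and not (n % 2 == 0 and k == half):
            p *= 2
        psd.append(p)
    return freqs, psd

# Hypothetical test signal: an 8 Hz sine sampled at 64 Hz for one second
fs = 64
x = [math.sin(2 * math.pi * 8 * t / fs) for t in range(64)]
freqs, psd = periodogram(x, fs)
peak = freqs[max(range(len(psd)), key=psd.__getitem__)]
print(peak)
```

The `freqs` and `psd` lists produced this way are the inputs that the frequency domain features are computed from.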

a. Mean Frequency
The mean frequency is an average frequency, calculated as the sum of the products of the EMG power
spectrum and the frequency, divided by the total sum of the spectrum intensity.
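
The mean frequency is a power-weighted average of the frequency bins. The sketch below assumes the PSD is given as two equal-length lists of frequencies and power values; the function name and the toy spectrum are illustrative.

```python
def mean_frequency(freqs, psd):
    """Sum of f_i * P_i divided by the sum of P_i."""
    total = sum(psd)
    if total == 0:
        raise ValueError("spectrum has no power")
    return sum(f * p for f, p in zip(freqs, psd)) / total

# Hypothetical toy spectrum (Hz, power)
freqs = [0.0, 10.0, 20.0, 30.0, 40.0]
psd = [0.0, 1.0, 2.0, 1.0, 0.0]
print(mean_frequency(freqs, psd))
```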

b. Median Frequency

The median frequency is the frequency at which the spectrum is divided into two regions with equal
total power.
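
A possible way to compute the median frequency is to accumulate the spectrum bin by bin and return the first frequency at which half of the total is reached. The sketch assumes the two regions are compared by their summed spectral power and that bin resolution is accurate enough (no interpolation between bins); names and data are illustrative.

```python
def median_frequency(freqs, psd):
    """First frequency where cumulative power reaches half the total."""
    half_total = sum(psd) / 2.0
    running = 0.0
    for f, p in zip(freqs, psd):
        running += p
        if running >= half_total:
            return f
    raise ValueError("empty spectrum")

# Hypothetical toy spectrum (Hz, power)
freqs = [0.0, 10.0, 20.0, 30.0, 40.0]
psd = [0.0, 1.0, 2.0, 1.0, 0.0]
print(median_frequency(freqs, psd))
```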

c. Maximum to Minimum Drop in Power Density Ratio

The maximum to minimum drop in power density ratio is the ratio of the highest mean power density
value to the lowest mean power density value within a user-defined frequency band.
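
The report does not specify how the mean power density values are formed. The sketch below assumes they are moving averages of the PSD over a few neighbouring bins inside the user-defined band; the `window` parameter, the band limits and the toy spectrum are hypothetical choices for illustration.

```python
def max_min_drop_ratio(freqs, psd, f_lo, f_hi, window=3):
    """Highest / lowest moving-average power density inside [f_lo, f_hi]."""
    band = [p for f, p in zip(freqs, psd) if f_lo <= f <= f_hi]
    if len(band) < window:
        raise ValueError("band holds fewer bins than the averaging window")
    means = [sum(band[i:i + window]) / window
             for i in range(len(band) - window + 1)]
    lowest = min(means)
    if lowest == 0:
        raise ValueError("lowest mean power density is zero")
    return max(means) / lowest

# Hypothetical toy spectrum (Hz, power); bins outside 10-60 Hz are ignored
freqs = [0.0, 10.0, 20.0, 30.0, 40.0, 50.0, 60.0, 70.0]
psd = [50.0, 8.0, 6.0, 4.0, 3.0, 2.0, 1.0, 9.0]
print(max_min_drop_ratio(freqs, psd, f_lo=10.0, f_hi=60.0))
```

Averaging over several bins before taking the ratio keeps a single noisy bin from dominating the result.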

d. Signal to Noise Ratio

The signal to noise ratio is the ratio of the signal power to the noise power [10]. The signal power and noise
power are estimated separately.

e. Power Spectrum Deformation

The power spectrum deformation ratio is sensitive to changes in spectral symmetry and provides an
indication of spectral deformation.

f. Signal to Motion Artifact Ratio

Motion artifacts are low-frequency artifacts of EMG signals; they lie below 20 Hz. The
signal to motion artifact ratio was computed as the ratio of the sum of all power densities for frequencies
below 600 Hz to the sum of all power densities that exceed a straight line between the axis origin and
the highest mean power density value with a frequency above 35 Hz.

References

- Cemil Altın, Orhan Er. [Link] Bozok University, Electrical-Electronics Engineering, 66200, Yozgat, Turkey
- [Link]
usphinx/trunk/logios/
- [Link]
- [Link]
cirrusUserTesting=glent_m0&search=feature+extraction+andtime+domain+&title=Special
%3ASearch&go=Go&ns0=1
