Audio Processing

Report

Name

Institution

Course

Instructor

Date

1. Use the MATLAB function xcorr() to compute the autocorrelation function, rather than lpc’s FFT-based approach. Note that you will need to use the lags array from the output of xcorr() to determine the index for the autocorrelation function array at zero lag.
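A minimal sketch of this approach is shown below; x stands for the speech vector and p for the prediction order, both placeholders rather than values from the assignment.

% Sketch: autocorrelation via xcorr(), using the lags output to locate zero lag
[r, lags] = xcorr(x, x);         % full autocorrelation sequence of x
zeroIdx = find(lags == 0);       % index of the zero-lag value R(0)
R = r(zeroIdx : zeroIdx + p);    % R(0) ... R(p), the values needed for the LPC normal equations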

2. Solve directly for the linear predictor coefficients from Eq. (5.16) using direct matrix inversion, rather than the function levinson() that is used within lpc().

LPC coefficients: 1.0000 -0.9071 0.0110 0.0114 0.0555

Prediction error variance: 10.0980
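One way to arrive at these values, sketched below under the assumption that R holds R(0)...R(p) as in the xcorr() sketch above, is to build the Toeplitz autocorrelation matrix of Eq. (5.16) and solve it with MATLAB's backslash operator instead of levinson().

% Sketch: solve the normal equations of Eq. (5.16) by direct matrix inversion
Rmat = toeplitz(R(1:p));     % p-by-p autocorrelation matrix with R(0) on the diagonal
rvec = R(2:p+1);             % right-hand side: R(1) ... R(p)
a    = Rmat \ rvec(:);       % predictor coefficients a_1 ... a_p
A    = [1; -a];              % prediction-error filter, matching the sign convention of lpc()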

3. Compute the error variance directly, rather than using the function levinson() that is used within lpc().

Prediction error variance: 66.9020
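A sketch of the direct computation, reusing the placeholder names R, a, rvec and A from the sketches above and the speech vector x:

% Sketch: prediction error variance without levinson()
E = R(1) - a(:).' * rvec(:);   % E = R(0) - sum_k a_k R(k)
% Equivalent time-domain check: variance of the residual e(n)
e = filter(A, 1, x);
Evar = mean(e.^2);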

Part B – LPC analysis and synthesis of a vowel

1. Apply a 25-ms rectangular window to the speech signal, centered around the peak of the vowel segment (/u/) of the signal.

2. Plot the autocorrelation function of this windowed segment.
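A sketch covering steps 1 and 2 is given below. The 16 kHz sampling rate matches the 400-sample window reported later in this part; the use of max(abs(s)) to locate the vowel peak and the variable names are assumptions for illustration only.

% Sketch: 25-ms rectangular window around the vowel peak, then its autocorrelation
fs     = 16000;                       % assumed sampling rate (gives a 400-sample window)
wlen   = round(0.025 * fs);           % 25 ms
[~, c] = max(abs(s));                 % crude stand-in for the vowel-peak location in signal s
seg    = s(c - wlen/2 : c + wlen/2);  % rectangular window = plain extraction
[r, lags] = xcorr(seg, seg);
plot(lags, r); xlabel('Lag (samples)'); ylabel('Autocorrelation');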

3. Compute the linear prediction coefficients using your function mylpc() with 16 poles.

Linear Prediction Coefficients:

1.4130

-0.3409

-0.1293

0.0998

0.0897

-0.3642

0.1928

-0.0713

0.1013

-0.0670

-0.0819

0.2247

-0.1392

0.0810

-0.1577

0.0464

4. Plot the log-magnitude of the resulting frequency response:
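A sketch of this plot, assuming A holds the 16-pole coefficients returned by mylpc() and seg is the windowed segment from B.1:

% Sketch: log-magnitude of the all-pole model 1/A(z) against the FFT of the segment
nfft   = 512;
[H, f] = freqz(1, A, nfft, fs);            % frequency response of the LPC model
S      = fft(seg, 2*nfft);                 % FFT of the windowed segment on the same grid
plot(f, 20*log10(abs(S(1:nfft)) + eps), f, 20*log10(abs(H)));
xlabel('Frequency (Hz)'); ylabel('Magnitude (dB)');
legend('Windowed segment', 'LPC model');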



The FFT of the windowed signal shows the actual spectral content. The LPC model approximates the spectral envelope, capturing the overall shape. Peaks in the LPC spectrum should correspond to formants in the original signal.

5. Using your estimates of the predictor coefficients from above, compute the prediction error signal associated with this vowel segment and plot it. Also plot the original windowed segment and the estimated signal segment. From the prediction error signal, what conclusions can you draw about the model (i.e., all-pole/impulse-train-driven) and estimation accuracy?

Prediction Error Variance: NaN

Audio signal loaded.

Length of the audio signal: 8139 samples.

Window length in samples: 400

Start index: 3870, End index: 4270

Windowed signal length: 401 samples.

Plots generated for original, estimated signal and prediction error signal.
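A sketch of how the prediction error and estimated segment might be formed, assuming A is the coefficient vector from mylpc() and seg the windowed segment from B.1:

% Sketch: prediction error e(n) and the one-step prediction of the windowed segment
e      = filter(A, 1, seg);       % inverse (prediction-error) filter A(z)
segHat = seg(:) - e(:);           % estimated signal = original minus residual
Evar   = var(e);                  % prediction error variance
subplot(3,1,1); plot(seg);    title('Original windowed segment');
subplot(3,1,2); plot(segHat); title('Estimated segment');
subplot(3,1,3); plot(e);      title('Prediction error');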

Conclusions.

If the prediction error signal has low amplitude, it indicates that the all-pole model is accurately capturing the characteristics of the signal. High amplitude in the prediction error signal may suggest that the model does not adequately represent the signal dynamics. The estimated signal should closely follow the shape of the original windowed signal if the LPC model is effective. Any discrepancies in the prediction error signal may indicate modeling limitations.

6. Using the prediction error signal that you computed above, estimate the average pitch period of the windowed vowel segment.

Estimated pitch period: 1 samples.

Estimated pitch frequency: 16000 Hz.

Conclusions about the estimated pitch period:

1. The estimated pitch period corresponds to the fundamental frequency of the vowel segment.

2. Variations in the pitch period can indicate changes in voice characteristics or vowel quality.
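One common way to estimate the average pitch period, sketched below, is to pick the strongest peak in the autocorrelation of the prediction error e within a plausible pitch range; the 60-400 Hz search band is an assumption, chosen so that the estimate is not drawn to the trivial peak at very small lags.

% Sketch: average pitch period from the autocorrelation of the prediction error
[re, lagsE] = xcorr(e, e);
pos   = re(lagsE > 0);                                   % positive lags only
lagp  = lagsE(lagsE > 0);
band  = lagp >= round(fs/400) & lagp <= round(fs/60);    % assumed 60-400 Hz pitch range
[~, k] = max(pos(band));
cand  = lagp(band);
T0    = cand(k);                                         % pitch period in samples
f0    = fs / T0;                                         % pitch frequency in Hz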

7. Using the linear prediction coefficients obtained when using the Hamming window, synthesize and plot a 200-ms estimate of the vowel /uː/, assuming that the excitation source is a perfectly periodic train of ideal impulses with the period estimated from part B.6 above (when using the rectangular window). How does your synthesized waveform differ from the original? Using the MATLAB function soundsc(), listen to your synthesized vowel and the original recording of the vowel and describe how they compare.

>> synthesis

Playing synthesized vowel...

Playing original vowel..
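A sketch of the synthesis, where A_hamm is assumed to hold the coefficients obtained with the Hamming window and T0 the period estimated in B.6:

% Sketch: 200-ms vowel synthesis driven by an ideal impulse train
durSamp = round(0.200 * fs);              % 200 ms of samples
excite  = zeros(durSamp, 1);
excite(1:T0:end) = 1;                     % perfectly periodic train of unit impulses
synthVowel = filter(1, A_hamm, excite);   % all-pole synthesis filter 1/A(z)
plot(synthVowel); title('Synthesized /u:/');
soundsc(synthVowel, fs);                  % listen and compare with the original vowel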

Part C – LPC analysis and synthesis of a fricative

1. Extract a segment of the speech signal from assgn1.wav that extends from the approximate start of the consonant “s” (/s/) to the end of the speech signal and apply a Hamming window to that segment. Using the MATLAB function soundsc(), listen to your windowed signal, and check that it does not include any residual vowel energy from the preceding /uː/.
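A sketch of the extraction and windowing; the start index below is only a placeholder to be chosen by inspecting the waveform, not a value taken from the assignment.

% Sketch: Hamming-windowed segment from the start of /s/ to the end of the signal
sStart  = 6000;                                  % placeholder start index of /s/, found by inspection
fricSeg = s(sStart:end);
fricWin = fricSeg(:) .* hamming(length(fricSeg));
soundsc(fricWin, fs);                            % check by ear for residual vowel energy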

2. Compute the linear prediction coefficients for this windowed fricative using your function mylpc() with 8 poles.

Linear Prediction Coefficients:

1.0000

-0.0000

0.0000

-0.0000

-0.0000

-0.0000

0.0000

-0.0000

0.0000

3. Plot: i) the autocorrelation function, ii) the predicted signal segment together with the windowed signal segment, iii) the prediction error signal, and iv) the log-spectrum of the impulse response of the forward filter H(ω) = A / A(ω) together with the log-spectrum of the windowed fricative.

4. Plot the log-spectrum of the prediction error signal, and comment on how “white” (i.e., flat) the error signal spectrum is relative to the spectrum of the windowed fricative.
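A sketch of the comparison, assuming AFric holds the 8-pole coefficients from C.2 and fricWin the Hamming-windowed fricative from C.1:

% Sketch: log-spectra of the prediction error and of the windowed fricative
eFric = filter(AFric, 1, fricWin);            % prediction error of the fricative
nfft  = 1024;
Ef = 20*log10(abs(fft(eFric,   nfft)) + eps);
Sf = 20*log10(abs(fft(fricWin, nfft)) + eps);
f  = (0:nfft/2) * fs / nfft;
plot(f, Sf(1:nfft/2+1), f, Ef(1:nfft/2+1));
xlabel('Frequency (Hz)'); ylabel('Magnitude (dB)');
legend('Windowed fricative', 'Prediction error');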

5. Repeat part C.4, increasing the number of poles in your prediction filter until you judge that the error signal spectrum is sufficiently white. Discuss why that minimum number of poles might be required to sufficiently whiten the error signal.

Model Complexity

The residual of the linear model forms the prediction error signal, which quantifies the portion of the signal that the model cannot predict. A greater number of poles means the model can follow finer spectral structure in the signal, leading to lower correlation in the prediction error signal.

A white error spectrum indicates that all frequencies carry nearly equal power and show no systematic pattern, and hence that the model has captured the inherent spectral features of the underlying signal.

Noise vs. Signal Components

Fewer poles may result in a prediction filter that cannot adequately describe the spectral shape of the speech signal; the residual is then large and correlated, and hence not white.

As the number of poles is increased, more of the structure and variation in the signal can be taken into account, thereby arriving at a stronger model of the signal.

Trade-offs

Adding further poles means the model can be fitted more closely, but each increase in complexity carries a risk. The main danger of having too many poles is overfitting: by modeling the noise, the filter ends up capturing not the character of the signal but artifacts that make the synthesis worse.

6. Using the MATLAB random number generator function randn() to create a random (and spectrally white) source signal with a duration of 225 ms, synthesize and plot estimates of the fricative /s/ using the linear prediction coefficients from Part C.2 above (i.e., with 8 poles) and using the coefficients from Part C.5 above (i.e., with the minimum number of poles that achieves sufficient whitening). Using the MATLAB function soundsc(), listen to your two estimates and describe how they compare to each other and to the original recording of the fricative /s/. Discuss whether the synthesized vowel or the synthesized fricative sounds closer to its respective original signal segment, and why.
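A sketch of the noise-driven synthesis, where A8 and Amin are assumed names for the 8-pole coefficients from C.2 and the higher-order coefficients from C.5:

% Sketch: 225-ms fricative synthesis driven by a spectrally white source
noise     = randn(round(0.225 * fs), 1);     % white-noise excitation
synthS8   = filter(1, A8,   noise);          % 8-pole model (Part C.2)
synthSmin = filter(1, Amin, noise);          % whitening-order model (Part C.5)
subplot(2,1,1); plot(synthS8);   title('Synthesized /s/ (8 poles)');
subplot(2,1,2); plot(synthSmin); title('Synthesized /s/ (whitening order)');
soundsc(synthS8,   fs);   % playback is asynchronous; listen to the two
soundsc(synthSmin, fs);   % estimates one at a time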

Comparison of synthesized sounds

With 8 poles, the synthesized sound may reproduce the details of the fricative less clearly and sharply, so the result may match the intended fricative sound less closely.

The sound synthesized with the minimum whitening order from Part C.5 is therefore expected to be closer to the original fricative, as the additional poles capture more of the spectral shape of the sound.

When comparing these synthesized sounds to the original recording, the relevant aspects include clarity, frequency content, and the general quality of the sound. A fricative synthesized with more poles than necessary might sound noisier or less clear, whereas the version with the optimum number of poles should retain a better similarity to the actual spectral properties of the original sound.



Discussion

Vowels and fricatives may produce different synthesis outcomes because of the difference in their spectral profiles. Vowels possess more harmonic content in their spectra than consonants, whereas fricatives are dominated by noise-like spectral components. As a result, the synthesized fricative seems less natural than the synthesized vowel, because the vowel may still bear a close resemblance to the actual frequency envelope.
