Welcome to P K Kelkar Library, Online Public Access Catalogue (OPAC)

Normal view MARC view ISBD view

Bandwidth extension of speech using perceptual criteria /

By: Berisha, Visar [author.].
Contributor(s): Sandoval, Steven 1984-, [author.] | Liss, Julie Marie [author.].
Material type: materialTypeLabelBookSeries: Synthesis digital library of engineering and computer science: ; Synthesis lectures on algorithms and software in engineering: # 13.Publisher: San Rafael, California (1537 Fourth Street, San Rafael, CA 94901 USA) : Morgan & Claypool, 2013.Description: 1 PDF (xi, 71 pages) : illustrations.Content type: text Media type: electronic Carrier type: online resourceISBN: 9781627053143.Subject(s): Compression (Audiology) | Compressed speech | Speech perception | Psychoacoustics | Speech compression | bandwidth extension | loudness | psychoacousticsDDC classification: 006.454 Online resources: Abstract with links to resource | Abstract with links to full text Also available in print.
Contents:
1. Introduction -- 1.1 Motivation -- 1.2 Existing bandwidth extension algorithms -- 1.3 Existing perceptual models --
2. Principles of bandwidth extension -- 2.1 Explicit high-band generation -- 2.2 High-band generation based on the source/filter model -- 2.2.1 Envelope estimation -- 2.2.2 Excitation signal generation --
3. Psychoacoustics -- 3.1 General overview -- 3.1.1 Human auditory system -- 3.1.2 Masking -- 3.2 Existing psychoacoustic models -- 3.2.1 Auditory excitation patterns -- 3.2.2 Perceptual loudness --
4. Bandwidth extension using spline fitting -- 4.1 Overview of existing work -- 4.1.1 Bandwidth extension -- 4.1.2 Perceptual models --
5. Summary -- Notation -- Bibliography -- Authors' biographies.
Abstract: Bandwidth extension of speech is used in the International Telecommunication Union G.729.1 standard in which the narrowband bitstream is combined with quantized high-band parameters. Although this system produces high-quality wideband speech, the additional bits used to represent the high band can be further reduced. In addition to the algorithm used in the G.729.1 standard, bandwidth extension methods based on spectrum prediction have also been proposed. Although these algorithms do not require additional bits, they perform poorly when the correlation between the low and the high band is weak. In this book, two wideband speech coding algorithms that rely on bandwidth extension are developed. The algorithms operate as wrappers around existing narrowband compression schemes. More specifically, in these algorithms, the low band is encoded using an existing toll-quality narrowband system, whereas the high band is generated using the proposed extension techniques. The first method relies only on transmitted high-band information to generate the wideband speech. The second algorithm uses a constrained minimum mean square error estimator that combines transmitted high-band envelope information with a predictive scheme driven by narrowband features. Both algorithms make use of novel perceptual models based on loudness that determine optimum quantization strategies for wideband recovery and synthesis. Objective and subjective evaluations reveal that the proposed system performs at a lower average bit rate while improving speech quality when compared to other similar algorithms.
    average rating: 0.0 (0 votes)
Item type Current location Call number Status Date due Barcode Item holds
E books E books PK Kelkar Library, IIT Kanpur
Available EBKE537
Total holds: 0

Mode of access: World Wide Web.

System requirements: Adobe Acrobat Reader.

Part of: Synthesis digital library of engineering and computer science.

Series from website.

Includes bibliographical references (pages 61-69).

1. Introduction -- 1.1 Motivation -- 1.2 Existing bandwidth extension algorithms -- 1.3 Existing perceptual models --

2. Principles of bandwidth extension -- 2.1 Explicit high-band generation -- 2.2 High-band generation based on the source/filter model -- 2.2.1 Envelope estimation -- 2.2.2 Excitation signal generation --

3. Psychoacoustics -- 3.1 General overview -- 3.1.1 Human auditory system -- 3.1.2 Masking -- 3.2 Existing psychoacoustic models -- 3.2.1 Auditory excitation patterns -- 3.2.2 Perceptual loudness --

4. Bandwidth extension using spline fitting -- 4.1 Overview of existing work -- 4.1.1 Bandwidth extension -- 4.1.2 Perceptual models --

5. Summary -- Notation -- Bibliography -- Authors' biographies.

Abstract freely available; full-text restricted to subscribers or individual document purchasers.

Compendex

INSPEC

Google scholar

Google book search

Bandwidth extension of speech is used in the International Telecommunication Union G.729.1 standard in which the narrowband bitstream is combined with quantized high-band parameters. Although this system produces high-quality wideband speech, the additional bits used to represent the high band can be further reduced. In addition to the algorithm used in the G.729.1 standard, bandwidth extension methods based on spectrum prediction have also been proposed. Although these algorithms do not require additional bits, they perform poorly when the correlation between the low and the high band is weak. In this book, two wideband speech coding algorithms that rely on bandwidth extension are developed. The algorithms operate as wrappers around existing narrowband compression schemes. More specifically, in these algorithms, the low band is encoded using an existing toll-quality narrowband system, whereas the high band is generated using the proposed extension techniques. The first method relies only on transmitted high-band information to generate the wideband speech. The second algorithm uses a constrained minimum mean square error estimator that combines transmitted high-band envelope information with a predictive scheme driven by narrowband features. Both algorithms make use of novel perceptual models based on loudness that determine optimum quantization strategies for wideband recovery and synthesis. Objective and subjective evaluations reveal that the proposed system performs at a lower average bit rate while improving speech quality when compared to other similar algorithms.

Also available in print.

Title from PDF title page (viewed on December 20, 2013).

There are no comments for this item.

Log in to your account to post a comment.

Powered by Koha