Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
2006
The MPEG-4 Scalable to Lossless (SLS) audio coding is recently being developed to provide a unified solution for highcompression perceptual audio coding and high-quality lossless audio coding. SLS provides efficient Fine Granular Scalable (FGS) coding from AAC core layer to lossless, and achieves reasonable perceptual quality at its scalable coding range using a sequential bit-plane scanning method, which minimizes the audio distortion according to the spectral shape of the core layer quantization errors. In this paper, it is shown that the perceptual quality performance of SLS at intermediate rates can be further improved by incorporating psychoacoustic model into the bit-plane coding process. In addition, it is also found that such an improvement can be achieved by slightly tweaking the original bit-plane coding process of SLS and hence preserving its nice features such as compatibility to lossless coding and low complexity.
Electronics Letters, 2007
A simple and flexible bit-plane coding method is developed for scalable audio coding. It is different from the traditional bit-plane coding in that the optimal bit-plane scanning order is adapted to the scale between the energy of the residual signal and the audio mask. Both the perceptual quality and the lossless compression ratio at common lossy bitrates are greatly improved compared with the performance of state-of-the-art scalable audio.
2003
This paper presents a scalable lossless enhancement of MPEG-4 Advanced Audio Coding (AAC). Scalability is achieved in the frequency domain using the Integer Modified Discrete Cosine Transform (IntMDCT), which is an integer approximation of the MDCT providing perfect reconstruction. With this transform, and only minor extension of the bitstream syntax, the MPEG-4 AAC Scalable codec can be extended to a lossless operation. The system provides bit-exact reconstruction of the input signal independent of the implementation accuracy of the AAC core coder. Furthermore, scalability in sampling rate and reconstruction word length is supported.
2004
As the latest extension of MPEG-4 Audio coding, MPEG-4 Lossless Audio Coding includes a scalable audio coding solution (SLS) that integrates the functionalities of lossless audio coding, perceptual audio coding, and fine granular scalable audio coding into a single coder framework while providing backward compatibility to MPEG Advanced Audio Coding (AAC) at the bit-stream level. Despite its abundant functionalities, SLS still achieves a compression performance that is comparable to state-of-the-art non-scalable lossless audio coding algorithms. As a result, SLS provides a universal digital audio format for a variety of application domains including professional audio, Internet music, consumer electronics, broadcasting and others. This paper presents the structure of SLS and its latest developments during the MPEG standardization process.
Journal of the Audio …, 2007
Recently the MPEG Audio standardization group has successfully concluded the standardization process on technology for lossless coding of audio signals. A summary of the scalable lossless coding (SLS) technology as one of the results of this standardization work is given. MPEG-4 scalable lossless coding provides a fine-grain scalable lossless extension of the well-known MPEG-4 AAC perceptual audio coder up to fully lossless reconstruction at word lengths and sampling rates typically used for high-resolution audio. The underlying innovative technology is described in detail and its performance is characterized for lossless and near lossless representation, both in conjunction with an AAC coder and as a stand-alone compression engine. A number of application scenarios for the new technology are discussed.
2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008
A fully scalable audio coding structure based on a novel combination of the non-core MPEG-4 scalable lossless audio coding (SLS), the state-of-the-art psychoacoustic model, joint stereo coding and the perceptually prioritized bit-plane coding is presented in this paper. The psychoacoustic information is implicitly embedded in the scalable bitstream with negligible amount of side information and trivial modi cation to the standardized SLS decoder. Results of extensive evaluation show that the subjective quality of scalable audio is improved signi cantly.
2003
This papers presents an embedded fine grain scalable perceptual and lossless audio coding scheme. The enabling technology for this combined perceptual and lossless audio coding approach is the Integer Modified Discrete Cosine Transform (IntMDCT), which is an integer approximation of the MDCT based on the lifting scheme. It maintains the perfect reconstruction property and therefore enables efficient lossless coding in the frequency domain. The close approximation of the MDCT also allows to build a perceptual coding scheme based on the IntMDCT. In this paper a bitsliced arithmetic coding technique is applied to the IntMDCT values. Together with the encoded shape of the masking threshold a perceptually hierarchical bitstream is obtained, containing several stages of perceptual quality and extending to lossless operation when transmitted completely. A concept of encoding subslices is presented in order to obtain a fine adaptation to the masking threshold especially in the range of perceptually transparent quality.
2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004
Lossless coding will become the latest extension of the MPEG-4 audio standard. In response to a call for proposals, many companies have submitted lossless audio codecs for evaluation. The codec of the Technical University of Berlin was chosen as reference model for MPEG-4 Audio Lossless Coding (ALS), attaining working draft status in July 2003. The encoder is based on linear prediction, which enables high compression even with moderate complexity, while the corresponding decoder is straightforward. The paper describes the basic elements of the codec, points out envisaged applications, and gives an outline of the standardization process.
2007
In this paper we present Fast Perceptual Quantization (FPQ), a novel procedure to quantize and code audio signals. It employs the same psychoacoustics principles used in the popular MPEG/Audio coders, but substantially simplifies the complexity and computational needs of the encoding process. FPQ is based on defining a hierarchy of privileged quantization values so that the masking threshold calculated through a psychoacoustic model is leveraged to quantize the real values to the privileged ones when possible. The computational cost of this process is very low compared to MP3’s or AAC’s quantization/coding loops. Experimental results show that it is possible to achieve nearly transparent coding using as few as approximately 100 quantization values. This leads to very efficient bit compaction using Huffman or arithmetic coding so that nearly state-of-the-art performance can be achieved in terms of quality/bit-rate trade-off. Since quantization and codification (bit compaction) proced...
2004
Lossless coding will become the latest extension of the MPEG-4 audio standard. The lossless audio codec of the Technical University of Berlin was chosen as reference model for MPEG-4 Audio Lossless Coding (ALS). The MPEG-4 ALS encoder is based on linear prediction, which enables high compression even with moderate complexity, while the corresponding decoder is straightforward. The paper describes the
2004
Embedded lossless audio coding attempts to combine the higher compression ratios of perceptual coding with the perfect reconstruction of the original signal provided by lossless coding. This paper examines the residual signal of a perceptual audio coding base layer and considers its usage as an embedded bitstream for an embedded lossless coder. It is shown that the residual signal of a lossy perceptual audio coder retains correlation that can be exploited in lossless compression. This allows an embedded stream to be provided in a lossless coder with approximately 6% overhead over pure lossless audio coding. A 6% overhead appears to be a minimal cost for the backward compatibility and scalability afforded by embedded streams.
2016
Scalable to lossless audio compression based on perceptual set partitioning in hierarchical trees (PSPIHT)
IEEE Transactions on Speech and Audio Processing, 1995
ABSTRACT A new audio transform coding technique is proposed that reduces the bitrate requirements of the perceptual transform audio coders by utilizing the stationarity characteristics of the audio signals. The method detects the frames that have significant audible content and codes them in a way similar to conventional perceptual transform coders. However, when successive data frames are found to be similar to those sections, then their audible differences only are coded. An error analysis for the proposed method is presented and results from tests on different types of audio material are listed, indicating that an average of 30% in compression gain (over the conventional perceptual audio coders bitrate) can be achieved, with a small deterioration in the audio quality of the coded signal. The proposed method has the advantage of easy adaptation within the perceptual transform coders architecture and adds only a small computational overhead to these systems
2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)., 2000
For the purpose of efficient audio coding at low rates, a new bit allocation strategy is proposed in this paper. The basic idea behind this approach is "Give bits to the band with the maximum NMR-Gain/bit" or "Retrieve bits from the band with the maximum bits/NMR-Loss". The notion of "bit-use efficiency" is suggested and it can be employed to construct a bit assignment algorithm operated at band-level as compared to the traditional framelevel bit assignment methods. Based on this strategy a new bit assignment scheme, called Max-BNLR, is designed for the MPEG-4 AAC. Simulation results show that the performance of the Max-BNLR scheme is significantly better than that of the MPEG-4 AAC Verification Model (VM) and is close to that of TB-ANMR [3], which is the (nearly) optimal solution. Moreover, the Max-BNLR scheme has the advantages of low computational complexity comparing to TB-ANMR.
Speech and Audio …, 2002
AbstractThis paper proposes a versatile perceptual audio coding method that achieves high compression ratios and is capable of low encoding/decoding delay. It accommodates a variety of source signals (including both music and speech) with different sampling rates. It is ...
[1991] Proceedings, Advanced Computer Technology, Reliable Systems and Applications, 1991
Based on frequency-domain techniques, coding of high-quality audio with bit rates down to 64 kbit/s is possible. This performance is achieved using perceptual coding. Transform coding can be used to get the best performance at very low bit rates. Real-time implementations of several types of low bit rate codecs have been developed. Standardization of low bit rate audio coding systems
2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009
This paper proposes a new bit plane coding method for signed integer sequences. This method consists in mapping successive bit planes onto quinary symbols (+, -, 0, 1, EoP ), where the symbol "EoP " stands for "End of Plane", and applying arithmetic coding. Sign bits are efficiently coded in combination with the corresponding most significant bit of non-zero integers. Moreover, bit planes are scanned and coded in a non-sequential manner to exploit the correlation between successive planes. Results for conversational transform coding of wideband speech and audio signals -sampled at 16 kHz -show that the performance/complexity of the proposed bitplane coder is near equivalent to non-embedded coding (stack-run coding), while offering additional flexibility (bitstream scalability).
2003
This paper proposes a technique for scalable and also provides for lossless compression. It reduces smooth objective scalability, in terms of SegSNR, from lossy to lossless compression. The proposal is built around the perceptual SPIHT algorithm, which is a modification of the SPIHT algorithm and is introduced in this paper. Both objective and subjective results are reported and demonstrate that the proposed method performs comparably with the MPEG-4 AAC coder at 16, 32 and 64 kbps, yet also achieves a scalable-to-lossless architecture.
Proceedings of the 5th WSEAS international …, 2005
Abstract: -- In this paper we present an efficient high quality perceptual audio coding scheme with a novel dynamically switched filter bank design and an associated bit allocation strategy. An additional processing block is added prior to the decomposition subband ...
2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)
This paper describes experimental evaluations of encoding parameters that are appropriate for MPEG-4 Audio Lossless Coding (ALS) to compress high-resolution audio. MPEG-4 ALS Simple Profile defines the values of encoding parameters, such as the maximum sampling frequency and quantization bit depth, for making it easier to implement in the receiving applications. However, ALS Simple Profile does not define values for 96-kHz high-resolution audio. Therefore, we propose a range of values for 96-kHz high-resolution audio by extending the ALS Simple profile and evaluating its performance.
Digital Signal Processing, 2003
This paper presents a very low-complexity audio codec that provides audio playback quality similar to the MPEG-I/audio level 3 codec at 64 Kbps for a monophonic-channel signal. This welldesigned low-sophistication scheme uses a simple noise-masking model, a specialized nonuniform quantizer, an effectively recursive refinement module, and an adaptive arithmetic coder with multiplication-free adaptation. For bit allocation, we propose an appropriate nonuniform quantizer incorporating noise-masking effects and designed for fast implementation as well as efficient acceleration of the proposed refinement process. This recursive refinement algorithm effectively improves recovered perceptual audio quality after quantization. The adaptive arithmetic coder uses two fast adaptation algorithms that do not require multiplication to quickly obtain efficient bit allocation. A Taylor series expansion is used to simplify the frequently executed functions in the masking threshold formulas. Thus, the proposed high-quality audio codec appears to be a very valuable consumer electronic approach or a software solution at low cost.
Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.