• Home
  • Line#
  • Scopes#
  • Navigate#
  • Raw
  • Download
1 /* -----------------------------------------------------------------------------
2 Software License for The Fraunhofer FDK AAC Codec Library for Android
3 
4 © Copyright  1995 - 2019 Fraunhofer-Gesellschaft zur Förderung der angewandten
5 Forschung e.V. All rights reserved.
6 
7  1.    INTRODUCTION
8 The Fraunhofer FDK AAC Codec Library for Android ("FDK AAC Codec") is software
9 that implements the MPEG Advanced Audio Coding ("AAC") encoding and decoding
10 scheme for digital audio. This FDK AAC Codec software is intended to be used on
11 a wide variety of Android devices.
12 
13 AAC's HE-AAC and HE-AAC v2 versions are regarded as today's most efficient
14 general perceptual audio codecs. AAC-ELD is considered the best-performing
15 full-bandwidth communications codec by independent studies and is widely
16 deployed. AAC has been standardized by ISO and IEC as part of the MPEG
17 specifications.
18 
19 Patent licenses for necessary patent claims for the FDK AAC Codec (including
20 those of Fraunhofer) may be obtained through Via Licensing
21 (www.vialicensing.com) or through the respective patent owners individually for
22 the purpose of encoding or decoding bit streams in products that are compliant
23 with the ISO/IEC MPEG audio standards. Please note that most manufacturers of
24 Android devices already license these patent claims through Via Licensing or
25 directly from the patent owners, and therefore FDK AAC Codec software may
26 already be covered under those patent licenses when it is used for those
27 licensed purposes only.
28 
29 Commercially-licensed AAC software libraries, including floating-point versions
30 with enhanced sound quality, are also available from Fraunhofer. Users are
31 encouraged to check the Fraunhofer website for additional applications
32 information and documentation.
33 
34 2.    COPYRIGHT LICENSE
35 
36 Redistribution and use in source and binary forms, with or without modification,
37 are permitted without payment of copyright license fees provided that you
38 satisfy the following conditions:
39 
40 You must retain the complete text of this software license in redistributions of
41 the FDK AAC Codec or your modifications thereto in source code form.
42 
43 You must retain the complete text of this software license in the documentation
44 and/or other materials provided with redistributions of the FDK AAC Codec or
45 your modifications thereto in binary form. You must make available free of
46 charge copies of the complete source code of the FDK AAC Codec and your
47 modifications thereto to recipients of copies in binary form.
48 
49 The name of Fraunhofer may not be used to endorse or promote products derived
50 from this library without prior written permission.
51 
52 You may not charge copyright license fees for anyone to use, copy or distribute
53 the FDK AAC Codec software or your modifications thereto.
54 
55 Your modified versions of the FDK AAC Codec must carry prominent notices stating
56 that you changed the software and the date of any change. For modified versions
57 of the FDK AAC Codec, the term "Fraunhofer FDK AAC Codec Library for Android"
58 must be replaced by the term "Third-Party Modified Version of the Fraunhofer FDK
59 AAC Codec Library for Android."
60 
61 3.    NO PATENT LICENSE
62 
63 NO EXPRESS OR IMPLIED LICENSES TO ANY PATENT CLAIMS, including without
64 limitation the patents of Fraunhofer, ARE GRANTED BY THIS SOFTWARE LICENSE.
65 Fraunhofer provides no warranty of patent non-infringement with respect to this
66 software.
67 
68 You may use this FDK AAC Codec software or modifications thereto only for
69 purposes that are authorized by appropriate patent licenses.
70 
71 4.    DISCLAIMER
72 
73 This FDK AAC Codec software is provided by Fraunhofer on behalf of the copyright
74 holders and contributors "AS IS" and WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES,
75 including but not limited to the implied warranties of merchantability and
76 fitness for a particular purpose. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR
77 CONTRIBUTORS BE LIABLE for any direct, indirect, incidental, special, exemplary,
78 or consequential damages, including but not limited to procurement of substitute
79 goods or services; loss of use, data, or profits, or business interruption,
80 however caused and on any theory of liability, whether in contract, strict
81 liability, or tort (including negligence), arising in any way out of the use of
82 this software, even if advised of the possibility of such damage.
83 
84 5.    CONTACT INFORMATION
85 
86 Fraunhofer Institute for Integrated Circuits IIS
87 Attention: Audio and Multimedia Departments - FDK AAC LL
88 Am Wolfsmantel 33
89 91058 Erlangen, Germany
90 
91 www.iis.fraunhofer.de/amm
92 amm-info@iis.fraunhofer.de
93 ----------------------------------------------------------------------------- */
94 
95 /**************************** AAC decoder library ******************************
96 
97    Author(s):   Manuel Jander
98 
99    Description:
100 
101 *******************************************************************************/
102 
103 #ifndef AACDECODER_LIB_H
104 #define AACDECODER_LIB_H
105 
106 /**
107  * \file   aacdecoder_lib.h
108  * \brief  FDK AAC decoder library interface header file.
109  *
110 
111 \page INTRO Introduction
112 
113 
114 \section SCOPE Scope
115 
116 This document describes the high-level application interface and usage of the
117 ISO/MPEG-2/4 AAC Decoder library developed by the Fraunhofer Institute for
118 Integrated Circuits (IIS). Depending on the library configuration, decoding of
119 AAC-LC (Low-Complexity), HE-AAC (High-Efficiency AAC v1 and v2), AAC-LD
120 (Low-Delay) and AAC-ELD (Enhanced Low-Delay) is implemented.
121 
122 All references to SBR (Spectral Band Replication) are only applicable to HE-AAC
123 and AAC-ELD configurations of the FDK library. All references to PS (Parametric
124 Stereo) are only applicable to HE-AAC v2 decoder configuration of the library.
125 
126 \section DecoderBasics Decoder Basics
127 
128 This document can only give a rough overview about the ISO/MPEG-2, ISO/MPEG-4
129 AAC audio and MPEG-D USAC coding standards. To understand all details referenced
130 in this document, you are encouraged to read the following documents.
131 
132 - ISO/IEC 13818-7 (MPEG-2 AAC) Standard, defines the syntax of MPEG-2 AAC audio
133 bitstreams.
134 - ISO/IEC 14496-3 (MPEG-4 AAC, subpart 1 and 4) Standard, defines the syntax of
135 MPEG-4 AAC audio bitstreams.
136 - ISO/IEC 23003-3 (MPEG-D USAC), defines MPEG-D USAC unified speech and audio
137 codec.
138 - Lutzky, Schuller, Gayer, Krämer, Wabnik, "A guideline to audio codec
139 delay", 116th AES Convention, May 8, 2004
140 
141 In short, MPEG Advanced Audio Coding is based on a time-to-frequency mapping of
142 the signal. The signal is partitioned into overlapping time portions and
143 transformed into frequency domain. The spectral components are then quantized
144 and coded using a highly efficient coding scheme.\n Encoded MPEG-2 and MPEG-4
145 AAC audio bitstreams are composed of frames. Contrary to MPEG-1/2 Layer-3 (mp3),
146 the length of individual frames is not restricted to a fixed number of bytes,
147 but can take any length between 1 and 768 bytes.
148 
149 In addition to the above mentioned frequency domain coding mode, MPEG-D USAC
150 also employs a time domain Algebraic Code-Excited Linear Prediction (ACELP)
151 speech coder core. This operating mode is selected by the encoder in order to
152 achieve the optimum audio quality for different content type. Several
153 enhancements allow achieving higher quality at lower bit rates compared to
154 MPEG-4 HE-AAC.
155 
156 
157 \page LIBUSE Library Usage
158 
159 
160 \section InterfaceDescritpion API Description
161 
162 All API header files are located in the folder /include of the release package.
163 The contents of each file is described in detail in this document. All header
164 files are provided for usage in specific C/C++ programs. The main AAC decoder
165 library API functions are located in aacdecoder_lib.h header file.
166 
167 In binary releases the decoder core resides in statically linkable libraries,
168 for example libAACdec.a.
169 
170 
171 \section Calling_Sequence Calling Sequence
172 
173 The following sequence is necessary for proper decoding of ISO/MPEG-2/4 AAC,
174 HE-AAC v2, or MPEG-D USAC bitstreams. In the following description, input stream
175 read and output write function details are left out, since they may be
176 implemented in a variety of configurations depending on the user's specific
177 requirements. The example implementation uses file-based input/output, and in
178 such case one may call mpegFileRead_Open() to open an input file and to allocate
179 memory for the required structures, and the corresponding mpegFileRead_Close()
180 to close opened files and to de-allocate associated structures.
181 mpegFileRead_Open() will attempt to detect the bitstream format and in case of
182 MPEG-4 file format or Raw Packets file format (a proprietary Fraunhofer IIS file
183 format suitable only for testing) it will read the Audio Specific Config data
184 (ASC). An unsuccessful attempt to recognize the bitstream format requires the
185 user to provide this information manually. For any other bitstream formats that
186 are usually applicable in streaming applications, the decoder itself will try to
187 synchronize and parse the given bitstream fragment using the FDK transport
188 library. Hence, for streaming applications (without file access) this step is
189 not necessary.
190 
191 
192 -# Call aacDecoder_Open() to open and retrieve a handle to a new AAC decoder
193 instance. \code aacDecoderInfo = aacDecoder_Open(transportType, nrOfLayers);
194 \endcode
195 -# If out-of-band config data (Audio Specific Config (ASC) or Stream Mux Config
196 (SMC)) is available, call aacDecoder_ConfigRaw() to pass this data to the
197 decoder before beginning the decoding process. If this data is not available in
198 advance, the decoder will configure itself while decoding, during the
199 aacDecoder_DecodeFrame() function call.
200 -# Begin decoding loop.
201 \code
202 do {
203 \endcode
204 -# Read data from bitstream file or stream buffer in to the driver program
205 working memory (a client-supplied input buffer "inBuffer" in framework). This
206 buffer will be used to load AAC bitstream data to the decoder.  Only when all
207 data in this buffer has been processed will the decoder signal an empty buffer.
208 For file-based input, you may invoke mpegFileRead_Read() to acquire new
209 bitstream data.
210 -# Call aacDecoder_Fill() to fill the decoder's internal bitstream input buffer
211 with the client-supplied bitstream input buffer. Note, if the data loaded in to
212 the internal buffer is not sufficient to decode a frame,
213 aacDecoder_DecodeFrame() will return ::AAC_DEC_NOT_ENOUGH_BITS until a
214 sufficient amount of data is loaded in to the internal buffer. For streaming
215 formats (ADTS, LOAS), it is acceptable to load more than one frame to the
216 decoder. However, for RAW file format (Fraunhofer IIS proprietary format), only
217 one frame may be loaded to the decoder per aacDecoder_DecodeFrame() call. For
218 least amount of communication delay, fill and decode should be performed on a
219 frame by frame basis. \code ErrorStatus = aacDecoder_Fill(aacDecoderInfo,
220 inBuffer, bytesRead, bytesValid); \endcode
221 -# Call aacDecoder_DecodeFrame(). This function decodes one frame and writes
222 decoded PCM audio data to a client-supplied buffer. It is the client's
223 responsibility to allocate a buffer which is large enough to hold the decoded
224 output data. \code ErrorStatus = aacDecoder_DecodeFrame(aacDecoderInfo,
225 TimeData, OUT_BUF_SIZE, flags); \endcode If the bitstream configuration (number
226 of channels, sample rate, frame size) is not known a priori, you may call
227 aacDecoder_GetStreamInfo() to retrieve a structure that contains this
228 information. You may use this data to initialize an audio output device. In the
229 example program, if the number of channels or the sample rate has changed since
230 program start or the previously decoded frame, the audio output device is then
231 re-initialized. If WAVE file output is chosen, a new WAVE file for each new
232 stream configuration is be created. \code p_si =
233 aacDecoder_GetStreamInfo(aacDecoderInfo); \endcode
234 -# Repeat steps 5 to 7 until no data is available to decode any more, or in case
235 of error. \code } while (bytesRead[0] > 0 || doFlush || doBsFlush ||
236 forceContinue); \endcode
237 -# Call aacDecoder_Close() to de-allocate all AAC decoder and transport layer
238 structures. \code aacDecoder_Close(aacDecoderInfo); \endcode
239 
240 \image latex decode.png "Decode calling sequence" width=11cm
241 
242 \image latex change_source.png "Change data source sequence" width 5cm
243 
244 \image latex conceal.png "Error concealment sequence" width=14cm
245 
246 \subsection Error_Concealment_Sequence Error Concealment Sequence
247 
248 There are different strategies to handle bit stream errors. Depending on the
249 system properties the product designer might choose to take different actions in
250 case a bit error occurs. In many cases the decoder might be able to do
251 reasonable error concealment without the need of any additional actions from the
252 system. But in some cases its not even possible to know how many decoded PCM
253 output samples are required to fill the gap due to the data error, then the
254 software surrounding the decoder must deal with the situation. The most simple
255 way would be to just stop audio playback and resume once enough bit stream data
256 and/or buffered output samples are available. More sophisticated designs might
257 also be able to deal with sender/receiver clock drifts or data drop outs by
258 using a closed loop control of FIFO fulness levels. The chosen strategy depends
259 on the final product requirements.
260 
261 The error concealment sequence diagram illustrates the general execution paths
262 for error handling.
263 
264 The macro IS_OUTPUT_VALID(err) can be used to identify if the audio output
265 buffer contains valid audio either from error free bit stream data or successful
266 error concealment. In case the result is false, the decoder output buffer does
267 not contain meaningful audio samples and should not be passed to any output as
268 it is. Most likely in case that a continuous audio output PCM stream is
269 required, the output buffer must be filled with audio data from the calling
270 framework. This might be e.g. an appropriate number of samples all zero.
271 
272 If error code ::AAC_DEC_TRANSPORT_SYNC_ERROR is returned by the decoder, under
273 some particular conditions it is possible to estimate lost frames due to the bit
274 stream error. In that case the bit stream is required to have a constant
275 bitrate, and compatible transport type. Audio samples for the lost frames can be
276 obtained by calling aacDecoder_DecodeFrame() with flag ::AACDEC_CONCEAL set
277 n-times where n is the count of lost frames. Please note that the decoder has to
278 have encountered valid configuration data at least once to be able to generate
279 concealed data, because at the minimum the sampling rate, frame size and amount
280 of audio channels needs to be known.
281 
282 If it is not possible to get an estimation of lost frames then a constant
283 fullness of the audio output buffer can be achieved by implementing different
284 FIFO control techniques e.g. just stop taking of samples from the buffer to
285 avoid underflow or stop filling new data to the buffer to avoid overflow. But
286 this techniques are out of scope of this document.
287 
288 For a detailed description of a specific error code please refer also to
289 ::AAC_DECODER_ERROR.
290 
291 \section BufferSystem Buffer System
292 
293 There are three main buffers in an AAC decoder application. One external input
294 buffer to hold bitstream data from file I/O or elsewhere, one decoder-internal
295 input buffer, and one to hold the decoded output PCM sample data. In resource
296 limited applications, the output buffer may be reused as an external input
297 buffer prior to the subsequence aacDecoder_Fill() function call.
298 
299 The external input buffer is set in the example program and its size is defined
300 by ::IN_BUF_SIZE. You may freely choose different buffer sizes. To feed the data
301 to the decoder-internal input buffer, use the function aacDecoder_Fill(). This
302 function returns important information regarding the number of bytes in the
303 external input buffer that have not yet been copied into the internal input
304 buffer (variable bytesValid). Once the external buffer has been fully copied, it
305 can be completely re-filled again. In case you wish to refill the buffer while
306 there are unprocessed bytes (bytesValid is unequal 0), you should preserve the
307 unconsumed data. However, we recommend to refill the buffer only when bytesValid
308 returns 0.
309 
310 The bytesValid parameter is an input and output parameter to the FDK decoder. As
311 an input, it signals how many valid bytes are available in the external buffer.
312 After consumption of the external buffer using aacDecoder_Fill() function, the
313 bytesValid parameter indicates if any of the bytes in the external buffer were
314 not consumed.
315 
316 \image latex dec_buffer.png "Life cycle of the external input buffer" width=9cm
317 
318 \page OutputFormat Decoder audio output
319 
320 \section OutputFormatObtaining Obtaining channel mapping information
321 
322 The decoded audio output format is indicated by a set of variables of the
323 CStreamInfo structure. While the struct members sampleRate, frameSize and
324 numChannels might be self explanatory, pChannelType and pChannelIndices require
325 some further explanation.
326 
327 These two arrays indicate the configuration of channel data within the output
328 buffer. Both arrays have CStreamInfo::numChannels number of cells. Each cell of
329 pChannelType indicates the channel type, which is described in the enum
330 ::AUDIO_CHANNEL_TYPE (defined in FDK_audio.h). The cells of pChannelIndices
331 indicate the sub index among the channels starting with 0 among channels of the
332 same audio channel type.
333 
334 The indexing scheme is structured as defined in MPEG-2/4 Standards. Indices
335 start from the front direction (a center channel if available, will always be
336 index 0) and increment, starting with the left side, pairwise (e.g. L, R) and
337 from front to back (Front L, Front R, Surround L, Surround R). For detailed
338 explanation, please refer to ISO/IEC 13818-7:2005(E), chapter 8.5.3.2.
339 
340 In case a Program Config is included in the audio configuration, the channel
341 mapping described within it will be adopted.
342 
343 In case of MPEG-D Surround the channel mapping will follow the same criteria
344 described in ISO/IEC 13818-7:2005(E), but adding corresponding top channels (if
345 available) to the channel types in order to avoid ambiguity. The examples below
346 explain these aspects in detail.
347 
348 \section OutputFormatChange Changing the audio output format
349 
350 For MPEG-4 audio the channel order can be changed at runtime through the
351 parameter
352 ::AAC_PCM_OUTPUT_CHANNEL_MAPPING. See the description of those
353 parameters and the decoder library function aacDecoder_SetParam() for more
354 detail.
355 
356 \section OutputFormatExample Channel mapping examples
357 
358 The following examples illustrate the location of individual audio samples in
359 the audio buffer that is passed to aacDecoder_DecodeFrame() and the expected
360 data in the CStreamInfo structure which can be obtained by calling
361 aacDecoder_GetStreamInfo().
362 
363 \subsection ExamplesStereo Stereo
364 
365 In case of ::AAC_PCM_OUTPUT_CHANNEL_MAPPING set to 1,
366 a AAC-LC bit stream which has channelConfiguration = 2 in its audio specific
367 config would lead to the following values in CStreamInfo:
368 
369 CStreamInfo::numChannels = 2
370 
371 CStreamInfo::pChannelType = { ::ACT_FRONT, ::ACT_FRONT }
372 
373 CStreamInfo::pChannelIndices = { 0, 1 }
374 
375 The output buffer will be formatted as follows:
376 
377 \verbatim
378   <left sample 0>  <left sample 1>  <left sample 2>  ... <left sample N>
379   <right sample 0> <right sample 1> <right sample 2> ... <right sample N>
380 \endverbatim
381 
382 Where N equals to CStreamInfo::frameSize .
383 
384 \subsection ExamplesSurround Surround 5.1
385 
386 In case of ::AAC_PCM_OUTPUT_CHANNEL_MAPPING set to 1,
387 a AAC-LC bit stream which has channelConfiguration = 6 in its audio specific
388 config, would lead to the following values in CStreamInfo:
389 
390 CStreamInfo::numChannels = 6
391 
392 CStreamInfo::pChannelType = { ::ACT_FRONT, ::ACT_FRONT, ::ACT_FRONT, ::ACT_LFE,
393 ::ACT_BACK, ::ACT_BACK }
394 
395 CStreamInfo::pChannelIndices = { 1, 2, 0, 0, 0, 1 }
396 
397 Since ::AAC_PCM_OUTPUT_CHANNEL_MAPPING is 1, WAV file channel ordering will be
398 used. For a 5.1 channel scheme, thus the channels would be: front left, front
399 right, center, LFE, surround left, surround right. Thus the third channel is the
400 center channel, receiving the index 0. The other front channels are front left,
401 front right being placed as first and second channels with indices 1 and 2
402 correspondingly. There is only one LFE, placed as the fourth channel and index
403 0. Finally both surround channels get the type definition ACT_BACK, and the
404 indices 0 and 1.
405 
406 The output buffer will be formatted as follows:
407 
408 \verbatim
409 <front left sample 0> <front right sample 0>
410 <center sample 0> <LFE sample 0>
411 <surround left sample 0> <surround right sample 0>
412 
413 <front left sample 1> <front right sample 1>
414 <center sample 1> <LFE sample 1>
415 <surround left sample 1> <surround right sample 1>
416 
417 ...
418 
419 <front left sample N> <front right sample N>
420 <center sample N> <LFE sample N>
421 <surround left sample N> <surround right sample N>
422 \endverbatim
423 
424 Where N equals to CStreamInfo::frameSize .
425 
426 \subsection ExamplesArib ARIB coding mode 2/1
427 
428 In case of ::AAC_PCM_OUTPUT_CHANNEL_MAPPING set to 1,
429 in case of a ARIB bit stream using coding mode 2/1 as described in ARIB STD-B32
430 Part 2 Version 2.1-E1, page 61, would lead to the following values in
431 CStreamInfo:
432 
433 CStreamInfo::numChannels = 3
434 
435 CStreamInfo::pChannelType = { ::ACT_FRONT, ::ACT_FRONT, ::ACT_BACK }
436 
437 CStreamInfo::pChannelIndices = { 0, 1, 0 }
438 
439 The audio channels will be placed as follows in the audio output buffer:
440 
441 \verbatim
442 <front left sample 0> <front right sample 0>  <mid surround sample 0>
443 
444 <front left sample 1> <front right sample 1> <mid surround sample 1>
445 
446 ...
447 
448 <front left sample N> <front right sample N> <mid surround sample N>
449 
450 Where N equals to CStreamInfo::frameSize .
451 
452 \endverbatim
453 
454 */
455 
456 #include "machine_type.h"
457 #include "FDK_audio.h"
458 
459 #include "genericStds.h"
460 /**
461  * \brief  AAC decoder error codes.
462  */
463 typedef enum {
464   AAC_DEC_OK =
465       0x0000, /*!< No error occurred. Output buffer is valid and error free. */
466   AAC_DEC_OUT_OF_MEMORY =
467       0x0002, /*!< Heap returned NULL pointer. Output buffer is invalid. */
468   AAC_DEC_UNKNOWN =
469       0x0005, /*!< Error condition is of unknown reason, or from a another
470                  module. Output buffer is invalid. */
471 
472   /* Synchronization errors. Output buffer is invalid. */
473   aac_dec_sync_error_start = 0x1000,
474   AAC_DEC_TRANSPORT_SYNC_ERROR = 0x1001, /*!< The transport decoder had
475                                             synchronization problems. Do not
476                                             exit decoding. Just feed new
477                                               bitstream data. */
478   AAC_DEC_NOT_ENOUGH_BITS = 0x1002, /*!< The input buffer ran out of bits. */
479   aac_dec_sync_error_end = 0x1FFF,
480 
481   /* Initialization errors. Output buffer is invalid. */
482   aac_dec_init_error_start = 0x2000,
483   AAC_DEC_INVALID_HANDLE =
484       0x2001, /*!< The handle passed to the function call was invalid (NULL). */
485   AAC_DEC_UNSUPPORTED_AOT =
486       0x2002, /*!< The AOT found in the configuration is not supported. */
487   AAC_DEC_UNSUPPORTED_FORMAT =
488       0x2003, /*!< The bitstream format is not supported.  */
489   AAC_DEC_UNSUPPORTED_ER_FORMAT =
490       0x2004, /*!< The error resilience tool format is not supported. */
491   AAC_DEC_UNSUPPORTED_EPCONFIG =
492       0x2005, /*!< The error protection format is not supported. */
493   AAC_DEC_UNSUPPORTED_MULTILAYER =
494       0x2006, /*!< More than one layer for AAC scalable is not supported. */
495   AAC_DEC_UNSUPPORTED_CHANNELCONFIG =
496       0x2007, /*!< The channel configuration (either number or arrangement) is
497                  not supported. */
498   AAC_DEC_UNSUPPORTED_SAMPLINGRATE = 0x2008, /*!< The sample rate specified in
499                                                 the configuration is not
500                                                 supported. */
501   AAC_DEC_INVALID_SBR_CONFIG =
502       0x2009, /*!< The SBR configuration is not supported. */
503   AAC_DEC_SET_PARAM_FAIL = 0x200A,  /*!< The parameter could not be set. Either
504                                        the value was out of range or the
505                                        parameter does  not exist. */
506   AAC_DEC_NEED_TO_RESTART = 0x200B, /*!< The decoder needs to be restarted,
507                                        since the required configuration change
508                                        cannot be performed. */
509   AAC_DEC_OUTPUT_BUFFER_TOO_SMALL =
510       0x200C, /*!< The provided output buffer is too small. */
511   aac_dec_init_error_end = 0x2FFF,
512 
513   /* Decode errors. Output buffer is valid but concealed. */
514   aac_dec_decode_error_start = 0x4000,
515   AAC_DEC_TRANSPORT_ERROR =
516       0x4001, /*!< The transport decoder encountered an unexpected error. */
517   AAC_DEC_PARSE_ERROR = 0x4002, /*!< Error while parsing the bitstream. Most
518                                    probably it is corrupted, or the system
519                                    crashed. */
520   AAC_DEC_UNSUPPORTED_EXTENSION_PAYLOAD =
521       0x4003, /*!< Error while parsing the extension payload of the bitstream.
522                  The extension payload type found is not supported. */
523   AAC_DEC_DECODE_FRAME_ERROR = 0x4004, /*!< The parsed bitstream value is out of
524                                           range. Most probably the bitstream is
525                                           corrupt, or the system crashed. */
526   AAC_DEC_CRC_ERROR = 0x4005,          /*!< The embedded CRC did not match. */
527   AAC_DEC_INVALID_CODE_BOOK = 0x4006,  /*!< An invalid codebook was signaled.
528                                           Most probably the bitstream is corrupt,
529                                           or the system  crashed. */
530   AAC_DEC_UNSUPPORTED_PREDICTION =
531       0x4007, /*!< Predictor found, but not supported in the AAC Low Complexity
532                  profile. Most probably the bitstream is corrupt, or has a wrong
533                  format. */
534   AAC_DEC_UNSUPPORTED_CCE = 0x4008, /*!< A CCE element was found which is not
535                                        supported. Most probably the bitstream is
536                                        corrupt, or has a wrong format. */
537   AAC_DEC_UNSUPPORTED_LFE = 0x4009, /*!< A LFE element was found which is not
538                                        supported. Most probably the bitstream is
539                                        corrupt, or has a wrong format. */
540   AAC_DEC_UNSUPPORTED_GAIN_CONTROL_DATA =
541       0x400A, /*!< Gain control data found but not supported. Most probably the
542                  bitstream is corrupt, or has a wrong format. */
543   AAC_DEC_UNSUPPORTED_SBA =
544       0x400B, /*!< SBA found, but currently not supported in the BSAC profile.
545                */
546   AAC_DEC_TNS_READ_ERROR = 0x400C, /*!< Error while reading TNS data. Most
547                                       probably the bitstream is corrupt or the
548                                       system crashed. */
549   AAC_DEC_RVLC_ERROR =
550       0x400D, /*!< Error while decoding error resilient data. */
551   aac_dec_decode_error_end = 0x4FFF,
552   /* Ancillary data errors. Output buffer is valid. */
553   aac_dec_anc_data_error_start = 0x8000,
554   AAC_DEC_ANC_DATA_ERROR =
555       0x8001, /*!< Non severe error concerning the ancillary data handling. */
556   AAC_DEC_TOO_SMALL_ANC_BUFFER = 0x8002,  /*!< The registered ancillary data
557                                              buffer is too small to receive the
558                                              parsed data. */
559   AAC_DEC_TOO_MANY_ANC_ELEMENTS = 0x8003, /*!< More than the allowed number of
560                                              ancillary data elements should be
561                                              written to buffer. */
562   aac_dec_anc_data_error_end = 0x8FFF
563 
564 } AAC_DECODER_ERROR;
565 
566 /** Macro to identify initialization errors. Output buffer is invalid. */
567 #define IS_INIT_ERROR(err)                                                    \
568   ((((err) >= aac_dec_init_error_start) && ((err) <= aac_dec_init_error_end)) \
569        ? 1                                                                    \
570        : 0)
571 /** Macro to identify decode errors. Output buffer is valid but concealed. */
572 #define IS_DECODE_ERROR(err)                 \
573   ((((err) >= aac_dec_decode_error_start) && \
574     ((err) <= aac_dec_decode_error_end))     \
575        ? 1                                   \
576        : 0)
577 /**
578  * Macro to identify if the audio output buffer contains valid samples after
579  * calling aacDecoder_DecodeFrame(). Output buffer is valid but can be
580  * concealed.
581  */
582 #define IS_OUTPUT_VALID(err) (((err) == AAC_DEC_OK) || IS_DECODE_ERROR(err))
583 
584 /*! \enum  AAC_MD_PROFILE
585  *  \brief The available metadata profiles which are mostly related to downmixing. The values define the arguments
586  *         for the use with parameter ::AAC_METADATA_PROFILE.
587  */
588 typedef enum {
589   AAC_MD_PROFILE_MPEG_STANDARD =
590       0, /*!< The standard profile creates a mixdown signal based on the
591             advanced downmix metadata (from a DSE). The equations and default
592             values are defined in ISO/IEC 14496:3 Ammendment 4. Any other
593             (legacy) downmix metadata will be ignored. No other parameter will
594             be modified.         */
595   AAC_MD_PROFILE_MPEG_LEGACY =
596       1, /*!< This profile behaves identical to the standard profile if advanced
597               downmix metadata (from a DSE) is available. If not, the
598             matrix_mixdown information embedded in the program configuration
599             element (PCE) will be applied. If neither is the case, the module
600             creates a mixdown using the default coefficients as defined in
601             ISO/IEC 14496:3 AMD 4. The profile can be used to support legacy
602             digital TV (e.g. DVB) streams.           */
603   AAC_MD_PROFILE_MPEG_LEGACY_PRIO =
604       2, /*!< Similar to the ::AAC_MD_PROFILE_MPEG_LEGACY profile but if both
605             the advanced (ISO/IEC 14496:3 AMD 4) and the legacy (PCE) MPEG
606             downmix metadata are available the latter will be applied.
607           */
608   AAC_MD_PROFILE_ARIB_JAPAN =
609       3 /*!< Downmix creation as described in ABNT NBR 15602-2. But if advanced
610              downmix metadata (ISO/IEC 14496:3 AMD 4) is available it will be
611              preferred because of the higher resolutions. In addition the
612            metadata expiry time will be set to the value defined in the ARIB
613            standard (see ::AAC_METADATA_EXPIRY_TIME).
614          */
615 } AAC_MD_PROFILE;
616 
617 /*! \enum  AAC_DRC_DEFAULT_PRESENTATION_MODE_OPTIONS
618  *  \brief Options for handling of DRC parameters, if presentation mode is not indicated in bitstream
619  */
620 typedef enum {
621   AAC_DRC_PARAMETER_HANDLING_DISABLED = -1, /*!< DRC parameter handling
622                                                disabled, all parameters are
623                                                applied as requested. */
624   AAC_DRC_PARAMETER_HANDLING_ENABLED =
625       0, /*!< Apply changes to requested DRC parameters to prevent clipping. */
626   AAC_DRC_PRESENTATION_MODE_1_DEFAULT =
627       1, /*!< Use DRC presentation mode 1 as default (e.g. for Nordig) */
628   AAC_DRC_PRESENTATION_MODE_2_DEFAULT =
629       2 /*!< Use DRC presentation mode 2 as default (e.g. for DTG DBook) */
630 } AAC_DRC_DEFAULT_PRESENTATION_MODE_OPTIONS;
631 
632 /**
633  * \brief AAC decoder setting parameters
634  */
635 typedef enum {
636   AAC_PCM_DUAL_CHANNEL_OUTPUT_MODE =
637       0x0002, /*!< Defines how the decoder processes two channel signals: \n
638                    0: Leave both signals as they are (default). \n
639                    1: Create a dual mono output signal from channel 1. \n
640                    2: Create a dual mono output signal from channel 2. \n
641                    3: Create a dual mono output signal by mixing both channels
642                  (L' = R' = 0.5*Ch1 + 0.5*Ch2). */
643   AAC_PCM_OUTPUT_CHANNEL_MAPPING =
644       0x0003, /*!< Output buffer channel ordering. 0: MPEG PCE style order, 1:
645                  WAV file channel order (default). */
646   AAC_PCM_LIMITER_ENABLE =
647       0x0004,                           /*!< Enable signal level limiting. \n
648                                              -1: Auto-config. Enable limiter for all
649                                            non-lowdelay configurations by default. \n
650                                               0: Disable limiter in general. \n
651                                               1: Enable limiter always.
652                                              It is recommended to call the decoder
653                                            with a AACDEC_CLRHIST flag to reset all
654                                            states when      the limiter switch is changed
655                                            explicitly. */
656   AAC_PCM_LIMITER_ATTACK_TIME = 0x0005, /*!< Signal level limiting attack time
657                                            in ms. Default configuration is 15
658                                            ms. Adjustable range from 1 ms to 15
659                                            ms. */
660   AAC_PCM_LIMITER_RELEAS_TIME = 0x0006, /*!< Signal level limiting release time
661                                            in ms. Default configuration is 50
662                                            ms. Adjustable time must be larger
663                                            than 0 ms. */
664   AAC_PCM_MIN_OUTPUT_CHANNELS =
665       0x0011, /*!< Minimum number of PCM output channels. If higher than the
666                  number of encoded audio channels, a simple channel extension is
667                  applied (see note 4 for exceptions). \n -1, 0: Disable channel
668                  extension feature. The decoder output contains the same number
669                  of channels as the encoded bitstream. \n 1:    This value is
670                  currently needed only together with the mix-down feature. See
671                           ::AAC_PCM_MAX_OUTPUT_CHANNELS and note 2 below. \n
672                     2:    Encoded mono signals will be duplicated to achieve a
673                  2/0/0.0 channel output configuration. \n 6:    The decoder
674                  tries to reorder encoded signals with less than six channels to
675                  achieve a 3/0/2.1 channel output signal. Missing channels will
676                  be filled with a zero signal. If reordering is not possible the
677                  empty channels will simply be appended. Only available if
678                  instance is configured to support multichannel output. \n 8:
679                  The decoder tries to reorder encoded signals with less than
680                  eight channels to achieve a 3/0/4.1 channel output signal.
681                  Missing channels will be filled with a zero signal. If
682                  reordering is not possible the empty channels will simply be
683                           appended. Only available if instance is configured to
684                  support multichannel output.\n NOTE: \n
685                      1. The channel signaling (CStreamInfo::pChannelType and
686                  CStreamInfo::pChannelIndices) will not be modified. Added empty
687                  channels will be signaled with channel type
688                         AUDIO_CHANNEL_TYPE::ACT_NONE. \n
689                      2. If the parameter value is greater than that of
690                  ::AAC_PCM_MAX_OUTPUT_CHANNELS both will be set to the same
691                  value. \n
692                      3. This parameter does not affect MPEG Surround processing.
693                  \n
694                      4. This parameter will be ignored if the number of encoded
695                  audio channels is greater than 8. */
696   AAC_PCM_MAX_OUTPUT_CHANNELS =
697       0x0012, /*!< Maximum number of PCM output channels. If lower than the
698                  number of encoded audio channels, downmixing is applied
699                  accordingly (see note 5 for exceptions). If dedicated metadata
700                  is available in the stream it will be used to achieve better
701                  mixing results. \n -1, 0: Disable downmixing feature. The
702                  decoder output contains the same number of channels as the
703                  encoded bitstream. \n 1:    All encoded audio configurations
704                  with more than one channel will be mixed down to one mono
705                  output signal. \n 2:    The decoder performs a stereo mix-down
706                  if the number encoded audio channels is greater than two. \n 6:
707                  If the number of encoded audio channels is greater than six the
708                  decoder performs a mix-down to meet the target output
709                  configuration of 3/0/2.1 channels. Only available if instance
710                  is configured to support multichannel output. \n 8:    This
711                  value is currently needed only together with the channel
712                  extension feature. See ::AAC_PCM_MIN_OUTPUT_CHANNELS and note 2
713                  below. Only available if instance is configured to support
714                  multichannel output. \n NOTE: \n
715                      1. Down-mixing of any seven or eight channel configuration
716                  not defined in ISO/IEC 14496-3 PDAM 4 is not supported by this
717                  software version. \n
718                      2. If the parameter value is greater than zero but smaller
719                  than ::AAC_PCM_MIN_OUTPUT_CHANNELS both will be set to same
720                  value. \n
721                      3. The operating mode of the MPEG Surround module will be
722                  set accordingly. \n
723                      4. Setting this parameter with any value will disable the
724                  binaural processing of the MPEG Surround module
725                      5. This parameter will be ignored if the number of encoded
726                  audio channels is greater than 8. */
727   AAC_METADATA_PROFILE =
728       0x0020, /*!< See ::AAC_MD_PROFILE for all available values. */
729   AAC_METADATA_EXPIRY_TIME = 0x0021, /*!< Defines the time in ms after which all
730                                         the bitstream associated meta-data (DRC,
731                                         downmix coefficients, ...) will be reset
732                                         to default if no update has been
733                                         received. Negative values disable the
734                                         feature. */
735 
736   AAC_CONCEAL_METHOD = 0x0100, /*!< Error concealment: Processing method. \n
737                                     0: Spectral muting. \n
738                                     1: Noise substitution (see ::CONCEAL_NOISE).
739                                   \n 2: Energy interpolation (adds additional
740                                   signal delay of one frame, see
741                                   ::CONCEAL_INTER. only some AOTs are
742                                   supported). \n */
743   AAC_DRC_BOOST_FACTOR =
744       0x0200, /*!< MPEG-4 / MPEG-D Dynamic Range Control (DRC): Scaling factor
745                  for boosting gain values. Defines how the boosting DRC factors
746                  (conveyed in the bitstream) will be applied to the decoded
747                  signal. The valid values range from 0 (don't apply boost
748                  factors) to 127 (fully apply boost factors). Default value is 0
749                  for MPEG-4 DRC and 127 for MPEG-D DRC. */
750   AAC_DRC_ATTENUATION_FACTOR = 0x0201, /*!< MPEG-4 / MPEG-D DRC: Scaling factor
751                                           for attenuating gain values. Same as
752                                             ::AAC_DRC_BOOST_FACTOR but for
753                                           attenuating DRC factors. */
754   AAC_DRC_REFERENCE_LEVEL =
755       0x0202, /*!< MPEG-4 / MPEG-D DRC: Target reference level / decoder target
756                  loudness.\n Defines the level below full-scale (quantized in
757                  steps of 0.25dB) to which the output audio signal will be
758                  normalized to by the DRC module.\n The parameter controls
759                  loudness normalization for both MPEG-4 DRC and MPEG-D DRC. The
760                  valid values range from 40 (-10 dBFS) to 127 (-31.75 dBFS).\n
761                    Example values:\n
762                    124 (-31 dBFS) for audio/video receivers (AVR) or other
763                  devices allowing audio playback with high dynamic range,\n 96
764                  (-24 dBFS) for TV sets or equivalent devices (default),\n 64
765                  (-16 dBFS) for mobile devices where the dynamic range of audio
766                  playback is restricted.\n Any value smaller than 0 switches off
767                  loudness normalization and MPEG-4 DRC. */
768   AAC_DRC_HEAVY_COMPRESSION =
769       0x0203, /*!< MPEG-4 DRC: En-/Disable DVB specific heavy compression (aka
770                  RF mode). If set to 1, the decoder will apply the compression
771                  values from the DVB specific ancillary data field. At the same
772                  time the MPEG-4 Dynamic Range Control tool will be disabled. By
773                    default, heavy compression is disabled. */
774   AAC_DRC_DEFAULT_PRESENTATION_MODE =
775       0x0204, /*!< MPEG-4 DRC: Default presentation mode (DRC parameter
776                  handling). \n Defines the handling of the DRC parameters boost
777                  factor, attenuation factor and heavy compression, if no
778                  presentation mode is indicated in the bitstream.\n For options,
779                  see ::AAC_DRC_DEFAULT_PRESENTATION_MODE_OPTIONS.\n Default:
780                  ::AAC_DRC_PARAMETER_HANDLING_DISABLED */
781   AAC_DRC_ENC_TARGET_LEVEL =
782       0x0205, /*!< MPEG-4 DRC: Encoder target level for light (i.e. not heavy)
783                  compression.\n If known, this declares the target reference
784                  level that was assumed at the encoder for calculation of
785                  limiting gains. The valid values range from 0 (full-scale) to
786                  127 (31.75 dB below full-scale). This parameter is used only
787                  with ::AAC_DRC_PARAMETER_HANDLING_ENABLED and ignored
788                  otherwise.\n Default: 127 (worst-case assumption).\n */
789   AAC_UNIDRC_SET_EFFECT = 0x0206, /*!< MPEG-D DRC: Request a DRC effect type for
790                                      selection of a DRC set.\n Supported indices
791                                      are:\n -1: DRC off. Completely disables
792                                      MPEG-D DRC.\n 0: None (default). Disables
793                                      MPEG-D DRC, but automatically enables DRC
794                                      if necessary to prevent clipping.\n 1: Late
795                                      night\n 2: Noisy environment\n 3: Limited
796                                      playback range\n 4: Low playback level\n 5:
797                                      Dialog enhancement\n 6: General
798                                      compression. Used for generally enabling
799                                      MPEG-D DRC without particular request.\n */
800   AAC_UNIDRC_ALBUM_MODE =
801       0x0207, /*!<  MPEG-D DRC: Enable album mode. 0: Disabled (default), 1:
802                  Enabled.\n Disabled album mode leads to application of gain
803                  sequences for fading in and out, if provided in the
804                  bitstream.\n Enabled album mode makes use of dedicated album
805                  loudness information, if provided in the bitstream.\n */
806   AAC_QMF_LOWPOWER = 0x0300, /*!< Quadrature Mirror Filter (QMF) Bank processing
807                                 mode. \n -1: Use internal default. Implies MPEG
808                                 Surround partially complex accordingly. \n 0:
809                                 Use complex QMF data mode. \n 1: Use real (low
810                                 power) QMF data mode. \n */
811   AAC_TPDEC_CLEAR_BUFFER =
812       0x0603 /*!< Clear internal bit stream buffer of transport layers. The
813                 decoder will start decoding at new data passed after this event
814                 and any previous data is discarded. */
815 
816 } AACDEC_PARAM;
817 
818 /**
819  * \brief This structure gives information about the currently decoded audio
820  * data. All fields are read-only.
821  */
822 typedef struct {
823   /* These five members are the only really relevant ones for the user. */
824   INT sampleRate; /*!< The sample rate in Hz of the decoded PCM audio signal. */
825   INT frameSize;  /*!< The frame size of the decoded PCM audio signal. \n
826                        Typically this is: \n
827                        1024 or 960 for AAC-LC \n
828                        2048 or 1920 for HE-AAC (v2) \n
829                        512 or 480 for AAC-LD and AAC-ELD \n
830                        768, 1024, 2048 or 4096 for USAC  */
831   INT numChannels; /*!< The number of output audio channels before the rendering
832                       module, i.e. the original channel configuration. */
833   AUDIO_CHANNEL_TYPE
834   *pChannelType; /*!< Audio channel type of each output audio channel. */
835   UCHAR *pChannelIndices; /*!< Audio channel index for each output audio
836                              channel. See ISO/IEC 13818-7:2005(E), 8.5.3.2
837                              Explicit channel mapping using a
838                              program_config_element() */
839   /* Decoder internal members. */
840   INT aacSampleRate; /*!< Sampling rate in Hz without SBR (from configuration
841                         info) divided by a (ELD) downscale factor if present. */
842   INT profile; /*!< MPEG-2 profile (from file header) (-1: not applicable (e. g.
843                   MPEG-4)).               */
844   AUDIO_OBJECT_TYPE
845   aot; /*!< Audio Object Type (from ASC): is set to the appropriate value
846           for MPEG-2 bitstreams (e. g. 2 for AAC-LC). */
847   INT channelConfig; /*!< Channel configuration (0: PCE defined, 1: mono, 2:
848                         stereo, ...                       */
849   INT bitRate;       /*!< Instantaneous bit rate.                   */
850   INT aacSamplesPerFrame;   /*!< Samples per frame for the AAC core (from ASC)
851                                divided by a (ELD) downscale factor if present. \n
852                                  Typically this is (with a downscale factor of 1):
853                                \n   1024 or 960 for AAC-LC \n   512 or 480 for
854                                AAC-LD   and AAC-ELD         */
855   INT aacNumChannels;       /*!< The number of audio channels after AAC core
856                                processing (before PS or MPS processing).       CAUTION: This
857                                are not the final number of output channels! */
858   AUDIO_OBJECT_TYPE extAot; /*!< Extension Audio Object Type (from ASC)   */
859   INT extSamplingRate; /*!< Extension sampling rate in Hz (from ASC) divided by
860                           a (ELD) downscale factor if present. */
861 
862   UINT outputDelay; /*!< The number of samples the output is additionally
863                        delayed by.the decoder. */
864   UINT flags; /*!< Copy of internal flags. Only to be written by the decoder,
865                  and only to be read externally. */
866 
867   SCHAR epConfig; /*!< epConfig level (from ASC): only level 0 supported, -1
868                      means no ER (e. g. AOT=2, MPEG-2 AAC, etc.)  */
869   /* Statistics */
870   INT numLostAccessUnits; /*!< This integer will reflect the estimated amount of
871                              lost access units in case aacDecoder_DecodeFrame()
872                                returns AAC_DEC_TRANSPORT_SYNC_ERROR. It will be
873                              < 0 if the estimation failed. */
874 
875   INT64 numTotalBytes; /*!< This is the number of total bytes that have passed
876                           through the decoder. */
877   INT64
878   numBadBytes; /*!< This is the number of total bytes that were considered
879                   with errors from numTotalBytes. */
880   INT64
881   numTotalAccessUnits;     /*!< This is the number of total access units that
882                               have passed through the decoder. */
883   INT64 numBadAccessUnits; /*!< This is the number of total access units that
884                               were considered with errors from numTotalBytes. */
885 
886   /* Metadata */
887   SCHAR drcProgRefLev; /*!< DRC program reference level. Defines the reference
888                           level below full-scale. It is quantized in steps of
889                           0.25dB. The valid values range from 0 (0 dBFS) to 127
890                           (-31.75 dBFS). It is used to reflect the average
891                           loudness of the audio in LKFS according to ITU-R BS
892                           1770. If no level has been found in the bitstream the
893                           value is -1. */
894   SCHAR
895   drcPresMode; /*!< DRC presentation mode. According to ETSI TS 101 154,
896                   this field indicates whether   light (MPEG-4 Dynamic Range
897                   Control tool) or heavy compression (DVB heavy
898                   compression)   dynamic range control shall take priority
899                   on the outputs.   For details, see ETSI TS 101 154, table
900                   C.33. Possible values are: \n   -1: No corresponding
901                   metadata found in the bitstream \n   0: DRC presentation
902                   mode not indicated \n   1: DRC presentation mode 1 \n   2:
903                   DRC presentation mode 2 \n   3: Reserved */
904 
905 } CStreamInfo;
906 
907 typedef struct AAC_DECODER_INSTANCE
908     *HANDLE_AACDECODER; /*!< Pointer to a AAC decoder instance. */
909 
910 #ifdef __cplusplus
911 extern "C" {
912 #endif
913 
914 /**
915  * \brief Initialize ancillary data buffer.
916  *
917  * \param self    AAC decoder handle.
918  * \param buffer  Pointer to (external) ancillary data buffer.
919  * \param size    Size of the buffer pointed to by buffer.
920  * \return        Error code.
921  */
922 LINKSPEC_H AAC_DECODER_ERROR aacDecoder_AncDataInit(HANDLE_AACDECODER self,
923                                                     UCHAR *buffer, int size);
924 
925 /**
926  * \brief Get one ancillary data element.
927  *
928  * \param self   AAC decoder handle.
929  * \param index  Index of the ancillary data element to get.
930  * \param ptr    Pointer to a buffer receiving a pointer to the requested
931  * ancillary data element.
932  * \param size   Pointer to a buffer receiving the length of the requested
933  * ancillary data element.
934  * \return       Error code.
935  */
936 LINKSPEC_H AAC_DECODER_ERROR aacDecoder_AncDataGet(HANDLE_AACDECODER self,
937                                                    int index, UCHAR **ptr,
938                                                    int *size);
939 
940 /**
941  * \brief Set one single decoder parameter.
942  *
943  * \param self   AAC decoder handle.
944  * \param param  Parameter to be set.
945  * \param value  Parameter value.
946  * \return       Error code.
947  */
948 LINKSPEC_H AAC_DECODER_ERROR aacDecoder_SetParam(const HANDLE_AACDECODER self,
949                                                  const AACDEC_PARAM param,
950                                                  const INT value);
951 
952 /**
953  * \brief              Get free bytes inside decoder internal buffer.
954  * \param self         Handle of AAC decoder instance.
955  * \param pFreeBytes   Pointer to variable receiving amount of free bytes inside
956  * decoder internal buffer.
957  * \return             Error code.
958  */
959 LINKSPEC_H AAC_DECODER_ERROR
960 aacDecoder_GetFreeBytes(const HANDLE_AACDECODER self, UINT *pFreeBytes);
961 
962 /**
963  * \brief               Open an AAC decoder instance.
964  * \param transportFmt  The transport type to be used.
965  * \param nrOfLayers    Number of transport layers.
966  * \return              AAC decoder handle.
967  */
968 LINKSPEC_H HANDLE_AACDECODER aacDecoder_Open(TRANSPORT_TYPE transportFmt,
969                                              UINT nrOfLayers);
970 
971 /**
972  * \brief Explicitly configure the decoder by passing a raw AudioSpecificConfig
973  * (ASC) or a StreamMuxConfig (SMC), contained in a binary buffer. This is
974  * required for MPEG-4 and Raw Packets file format bitstreams as well as for
975  * LATM bitstreams with no in-band SMC. If the transport format is LATM with or
976  * without LOAS, configuration is assumed to be an SMC, for all other file
977  * formats an ASC.
978  *
979  * \param self    AAC decoder handle.
980  * \param conf    Pointer to an unsigned char buffer containing the binary
981  * configuration buffer (either ASC or SMC).
982  * \param length  Length of the configuration buffer in bytes.
983  * \return        Error code.
984  */
985 LINKSPEC_H AAC_DECODER_ERROR aacDecoder_ConfigRaw(HANDLE_AACDECODER self,
986                                                   UCHAR *conf[],
987                                                   const UINT length[]);
988 
989 /**
990  * \brief Submit raw ISO base media file format boxes to decoder for parsing
991  * (only some box types are recognized).
992  *
993  * \param self    AAC decoder handle.
994  * \param buffer  Pointer to an unsigned char buffer containing the binary box
995  * data (including size and type, can be a sequence of multiple boxes).
996  * \param length  Length of the data in bytes.
997  * \return        Error code.
998  */
999 LINKSPEC_H AAC_DECODER_ERROR aacDecoder_RawISOBMFFData(HANDLE_AACDECODER self,
1000                                                        UCHAR *buffer,
1001                                                        UINT length);
1002 
1003 /**
1004  * \brief Fill AAC decoder's internal input buffer with bitstream data from the
1005  * external input buffer. The function only copies such data as long as the
1006  * decoder-internal input buffer is not full. So it grabs whatever it can from
1007  * pBuffer and returns information (bytesValid) so that at a subsequent call of
1008  * %aacDecoder_Fill(), the right position in pBuffer can be determined to grab
1009  * the next data.
1010  *
1011  * \param self        AAC decoder handle.
1012  * \param pBuffer     Pointer to external input buffer.
1013  * \param bufferSize  Size of external input buffer. This argument is required
1014  * because decoder-internally we need the information to calculate the offset to
1015  * pBuffer, where the next available data is, which is then
1016  * fed into the decoder-internal buffer (as much as
1017  * possible). Our example framework implementation fills the
1018  * buffer at pBuffer again, once it contains no available valid bytes anymore
1019  * (meaning bytesValid equal 0).
1020  * \param bytesValid  Number of bitstream bytes in the external bitstream buffer
1021  * that have not yet been copied into the decoder's internal bitstream buffer by
1022  * calling this function. The value is updated according to
1023  * the amount of newly copied bytes.
1024  * \return            Error code.
1025  */
1026 LINKSPEC_H AAC_DECODER_ERROR aacDecoder_Fill(HANDLE_AACDECODER self,
1027                                              UCHAR *pBuffer[],
1028                                              const UINT bufferSize[],
1029                                              UINT *bytesValid);
1030 
1031 #define AACDEC_CONCEAL                                                        \
1032   1 /*!< Flag for aacDecoder_DecodeFrame(): Trigger the built-in error        \
1033        concealment module to generate a substitute signal for one lost frame. \
1034        New input data will not be considered. */
1035 #define AACDEC_FLUSH                                                         \
1036   2 /*!< Flag for aacDecoder_DecodeFrame(): Flush all filterbanks to get all \
1037        delayed audio without having new input data. Thus new input data will \
1038        not be considered.*/
1039 #define AACDEC_INTR                                                         \
1040   4 /*!< Flag for aacDecoder_DecodeFrame(): Signal an input bit stream data \
1041        discontinuity. Resync any internals as necessary. */
1042 #define AACDEC_CLRHIST                                                        \
1043   8 /*!< Flag for aacDecoder_DecodeFrame(): Clear all signal delay lines and  \
1044        history buffers. CAUTION: This can cause discontinuities in the output \
1045        signal. */
1046 
1047 /**
1048  * \brief               Decode one audio frame
1049  *
1050  * \param self          AAC decoder handle.
1051  * \param pTimeData     Pointer to external output buffer where the decoded PCM
1052  * samples will be stored into.
1053  * \param timeDataSize  Size of external output buffer.
1054  * \param flags         Bit field with flags for the decoder: \n
1055  *                      (flags & AACDEC_CONCEAL) == 1: Do concealment. \n
1056  *                      (flags & AACDEC_FLUSH) == 2: Discard input data. Flush
1057  * filter banks (output delayed audio). \n (flags & AACDEC_INTR) == 4: Input
1058  * data is discontinuous. Resynchronize any internals as
1059  * necessary. \n (flags & AACDEC_CLRHIST) == 8: Clear all signal delay lines and
1060  * history buffers.
1061  * \return              Error code.
1062  */
1063 LINKSPEC_H AAC_DECODER_ERROR aacDecoder_DecodeFrame(HANDLE_AACDECODER self,
1064                                                     INT_PCM *pTimeData,
1065                                                     const INT timeDataSize,
1066                                                     const UINT flags);
1067 
1068 /**
1069  * \brief       De-allocate all resources of an AAC decoder instance.
1070  *
1071  * \param self  AAC decoder handle.
1072  * \return      void.
1073  */
1074 LINKSPEC_H void aacDecoder_Close(HANDLE_AACDECODER self);
1075 
1076 /**
1077  * \brief       Get CStreamInfo handle from decoder.
1078  *
1079  * \param self  AAC decoder handle.
1080  * \return      Reference to requested CStreamInfo.
1081  */
1082 LINKSPEC_H CStreamInfo *aacDecoder_GetStreamInfo(HANDLE_AACDECODER self);
1083 
1084 /**
1085  * \brief       Get decoder library info.
1086  *
1087  * \param info  Pointer to an allocated LIB_INFO structure.
1088  * \return      0 on success.
1089  */
1090 LINKSPEC_H INT aacDecoder_GetLibInfo(LIB_INFO *info);
1091 
1092 #ifdef __cplusplus
1093 }
1094 #endif
1095 
1096 #endif /* AACDECODER_LIB_H */
1097