-
-
Notifications
You must be signed in to change notification settings - Fork 28
/
readme.txt
141 lines (114 loc) · 6.25 KB
/
readme.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
Fixed-point MP3 decoder
Developed by RealNetworks, 2003
===============================
Overview
--------
This module contains a high-performance MPEG layer 3 audio decoder for 32-bit fixed-point
processors. The following is a quick summary of what is and is not supported:
Supported
- layer 3
- MPEG1, MPEG2, and MPEG2.5 (low sampling frequency extensions)
- constant bitrate, variable bitrate, and free bitrate modes
- mono and all stereo modes (normal stereo, joint stereo, dual-mono)
Not currently supported
- layers 1 and 2
Highlights
----------
- highly optimized for ARM processors (details in docs/ subdirectory)
- reference x86 implementation
- C and assembly code only (C++ not required)
- reentrant, statically linkable
- low memory (details in docs/ subdirectory)
- option to use Intel Integrated Performance Primitives (details below)
Supported platforms and toolchains
----------------------------------
This codec should run on any 32-bit fixed-point processor which can perform a full 32x32-bit
multiply (providing a 64-bit result). The following processors and toolchains are supported:
- x86, Microsoft Visual C++
- ARM, ARM Developer Suite (ADS)
- ARM, Microsoft Embedded Visual C++
- ARM, GNU toolchain (gcc)
- RISC-V, GNU toolchain (gcc)
ARM refers to any processor supporting ARM architecture v.4 or above. Thumb is not required.
Typically this means an ARM7TDMI or better (including ARM9, StrongARM, XScale, etc.)
Generally ADS produces the fastest code. EVC 3.0 does not support inline assembly code for
ARM targets, so calls to MULSHIFT32 (smull on ARM) are left as function calls. For the
fastest code on targets which do not normally use ADS consider compiling with ADS,
using the -S option to output assembly code, and feeding this assembly code to the assembler
of your choice. This might require some syntax changes in the .S file.
Adding support for a new processor is fairly simple. Simply add a new block to the file
real/assembly.h which implements the required inline assembly functions for your processor.
Something like
...
#elif defined NEW_PROCESSOR
/* you implement MULSHIFT32() and so forth */
#else
#error Unsupported platform in assembly.h
#endif
Optionally you can rewrite or add assembly language files optimized for your platform. Note
that many of the algorithms are designed for an ARM-type processor, so performance of the
unmodified C code might be noticeably worse on other architectures.
Adding support for a new toolchain should be straightforward. Use the sample projects or the
Umakefil as a template for which source files to include.
Directory structure
-------------------
fixpt/ platform-independent code and tables, public API
fixpt/docs algorithm notes, memory and CPU usage figures, optimization suggestions
fixpt/ipp source code which uses IPP for decoding (see the "IPP" section below)
fixpt/pub public header files
fixpt/real source code for RealNetworks' MP3 decoder
fixpt/testwrap sample code to build a command-line test application
Code organization
-----------------
fixpt/
mpadecobj.cpp optional shim which exports API used by Helix clients (see mp3/renderer)
mp3dec.c main decode functions, exports C-only API
mp3tabs.c common tables used by all implementations (bitrates, frame sizes, etc.)
fixpt/pub/
mp3common.h defines low-level codec API which mp3dec.c calls
mp3dec.h defines high-level codec API which applications call
mpadecobjfixpt.h optional C++ shim API (only necessary if mpadecobj.cpp is used)
statname.h symbols which get name-mangled by C preprocessor to allow static linking
fixpt/ipp source code for wrapper files which link in IPP libraries
fixpt/real full source code for RealNetworks MP3 decoder
To build an MP3 decoder library, you'll need to compile the top-level files and EITHER
real/*.c OR ipp/*.c and the correct IPP library.
Decoder using Real code: mp3dec.c + mp3tabs.c + real/*.c + real/arm/[files].s (if ARM)
Decoder using IPP code: mp3dec.c + mp3tabs.c + ipp/*.c + ippac*.lib
Although the real/ and ipp/ source code follow the same top-level API (for Dequantize(),
Subband(), etc.) mixing and matching is not guaranteed to work. The outputs might
be ordered differently for optimization purposes, scaled differently, etc.
IPP
---
For certain platforms Intel® has created highly-optimized object code libraries of DSP
routines. These are called the Intel® Integrated Performance Primitives (IPP). If IPP
libraries are available for a platform, this MP3 decoder can link them in and use them
instead of the RealNetworks source code. To use IPP, you still need to build the top-level
files (mp3dec.c, mp3tabs.c). You also build the files in ipp/*.c. These are just thin
wrappers which provide the glue logic between the top-level decode functions in
mp3dec.c and the optimized DSP primitives in the IPP libraries. IPP libraries are not
included in this module. You must obtain them WITH A LICENSE directly from Intel.
Further info on the latest versions of IPP (as of the date of this readme) is available
from the URLs below
Intel® Integrated Performance Primitives for the
Intel® PXA25x and PXA26x family of Processors, Version 3.0
* http://www.intel.com/design/pca/applicationsprocessors/swsup/IPPv30.htm
Intel® Integrated Performance Primitives for
Intel® Pentium® Processors and Intel® Itanium® Architectures
* http://www.intel.com/software/products/ipp/ipp30/
These sites explain how to obtain IPP and the terms under which IPP libraries may be used.
The code in this module is merely wrapper code which calls IPP functions. You are fully
responsible for adhering to the license agreement under which you obtain the IPP
libraries from Intel.
To-Do list and status
---------------------
faster/lower-memory dequantizer (in progress)
tighter fixed-point scaling (in progress)
remove all run-time math calls (/, %) (in progress)
ARM assembly code for DCT32, IMDCT (if necessary)
add test scripts and compliance docs
write 32-bit C polyphase filter
(where 64-bit MACC not available)
As these optimizations are completed, they will be added to the Helix codebase. Please
continue to check there for updates. Changes will be noted in this readme.
readme.txt last updated 07/23/03