06-floor0.xml 10 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257
  1. <?xml version="1.0" standalone="no"?>
  2. <!DOCTYPE section PUBLIC "-//OASIS//DTD DocBook XML V4.2//EN"
  3. "http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd" [
  4. ]>
  5. <section id="vorbis-spec-floor0">
  6. <sectioninfo>
  7. <releaseinfo>
  8. $Id$
  9. </releaseinfo>
  10. </sectioninfo>
  11. <title>Floor type 0 setup and decode</title>
  12. <section>
  13. <title>Overview</title>
  14. <para>
  15. Vorbis floor type zero uses Line Spectral Pair (LSP, also alternately
  16. known as Line Spectral Frequency or LSF) representation to encode a
  17. smooth spectral envelope curve as the frequency response of the LSP
  18. filter. This representation is equivalent to a traditional all-pole
  19. infinite impulse response filter as would be used in linear predictive
  20. coding; LSP representation may be converted to LPC representation and
  21. vice-versa.</para>
  22. </section>
  23. <section>
  24. <title>Floor 0 format</title>
  25. <para>
  26. Floor zero configuration consists of six integer fields and a list of
  27. VQ codebooks for use in coding/decoding the LSP filter coefficient
  28. values used by each frame. </para>
  29. <section><title>header decode</title>
  30. <para>
  31. Configuration information for instances of floor zero decodes from the
  32. codec setup header (third packet). configuration decode proceeds as
  33. follows:</para>
  34. <screen>
  35. 1) [floor0_order] = read an unsigned integer of 8 bits
  36. 2) [floor0_rate] = read an unsigned integer of 16 bits
  37. 3) [floor0_bark_map_size] = read an unsigned integer of 16 bits
  38. 4) [floor0_amplitude_bits] = read an unsigned integer of six bits
  39. 5) [floor0_amplitude_offset] = read an unsigned integer of eight bits
  40. 6) [floor0_number_of_books] = read an unsigned integer of four bits and add 1
  41. 7) array [floor0_book_list] = read a list of [floor0_number_of_books] unsigned integers of eight bits each;
  42. </screen>
  43. <para>
  44. An end-of-packet condition during any of these bitstream reads renders
  45. this stream undecodable. In addition, any element of the array
  46. <varname>[floor0_book_list]</varname> that is greater than the maximum codebook
  47. number for this bitstream is an error condition that also renders the
  48. stream undecodable.</para>
  49. </section>
  50. <section id="vorbis-spec-floor0-decode">
  51. <title>packet decode</title>
  52. <para>
  53. Extracting a floor0 curve from an audio packet consists of first
  54. decoding the curve amplitude and <varname>[floor0_order]</varname> LSP
  55. coefficient values from the bitstream, and then computing the floor
  56. curve, which is defined as the frequency response of the decoded LSP
  57. filter.</para>
  58. <para>
  59. Packet decode proceeds as follows:</para>
  60. <screen>
  61. 1) [amplitude] = read an unsigned integer of [floor0_amplitude_bits] bits
  62. 2) if ( [amplitude] is greater than zero ) {
  63. 3) [coefficients] is an empty, zero length vector
  64. 4) [booknumber] = read an unsigned integer of <link linkend="vorbis-spec-ilog">ilog</link>( [floor0_number_of_books] ) bits
  65. 5) if ( [booknumber] is greater than the highest number decode codebook ) then packet is undecodable
  66. 6) [last] = zero;
  67. 7) vector [temp_vector] = read vector from bitstream using codebook number [floor0_book_list] element [booknumber] in VQ context.
  68. 8) add the scalar value [last] to each scalar in vector [temp_vector]
  69. 9) [last] = the value of the last scalar in vector [temp_vector]
  70. 10) concatenate [temp_vector] onto the end of the [coefficients] vector
  71. 11) if (length of vector [coefficients] is less than [floor0_order], continue at step 6
  72. }
  73. 12) done.
  74. </screen>
  75. <para>
  76. Take note of the following properties of decode:
  77. <itemizedlist>
  78. <listitem><simpara>An <varname>[amplitude]</varname> value of zero must result in a return code that indicates this channel is unused in this frame (the output of the channel will be all-zeroes in synthesis). Several later stages of decode don't occur for an unused channel.</simpara></listitem>
  79. <listitem><simpara>An end-of-packet condition during decode should be considered a
  80. nominal occruence; if end-of-packet is reached during any read
  81. operation above, floor decode is to return 'unused' status as if the
  82. <varname>[amplitude]</varname> value had read zero at the beginning of decode.</simpara></listitem>
  83. <listitem><simpara>The book number used for decode
  84. can, in fact, be stored in the bitstream in <link linkend="vorbis-spec-ilog">ilog</link>( <varname>[floor0_number_of_books]</varname> -
  85. 1 ) bits. Nevertheless, the above specification is correct and values
  86. greater than the maximum possible book value are reserved.</simpara></listitem>
  87. <listitem><simpara>The number of scalars read into the vector <varname>[coefficients]</varname>
  88. may be greater than <varname>[floor0_order]</varname>, the number actually
  89. required for curve computation. For example, if the VQ codebook used
  90. for the floor currently being decoded has a
  91. <varname>[codebook_dimensions]</varname> value of three and
  92. <varname>[floor0_order]</varname> is ten, the only way to fill all the needed
  93. scalars in <varname>[coefficients]</varname> is to to read a total of twelve
  94. scalars as four vectors of three scalars each. This is not an error
  95. condition, and care must be taken not to allow a buffer overflow in
  96. decode. The extra values are not used and may be ignored or discarded.</simpara></listitem>
  97. </itemizedlist>
  98. </para>
  99. </section>
  100. <section id="vorbis-spec-floor0-synth">
  101. <title>curve computation</title>
  102. <para>
  103. Given an <varname>[amplitude]</varname> integer and <varname>[coefficients]</varname>
  104. vector from packet decode as well as the [floor0_order],
  105. [floor0_rate], [floor0_bark_map_size], [floor0_amplitude_bits] and
  106. [floor0_amplitude_offset] values from floor setup, and an output
  107. vector size <varname>[n]</varname> specified by the decode process, we compute a
  108. floor output vector.</para>
  109. <para>
  110. If the value <varname>[amplitude]</varname> is zero, the return value is a
  111. length <varname>[n]</varname> vector with all-zero scalars. Otherwise, begin by
  112. assuming the following definitions for the given vector to be
  113. synthesized:</para>
  114. <informalequation>
  115. <mediaobject>
  116. <textobject><phrase>[lsp map equation]</phrase></textobject>
  117. <textobject role="tex"><phrase>
  118. <![CDATA[
  119. \begin{math}
  120. \mathrm{map}_i = \left\{
  121. \begin{array}{ll}
  122. \min (
  123. \mathtt{floor0\_bark\_map\_size} - 1,
  124. foobar
  125. ) & \textrm{for } i \in [0,n-1] \\
  126. -1 & \textrm{for } i = n
  127. \end{array}
  128. \right.
  129. \end {math}
  130. where
  131. \begin{math}
  132. foobar =
  133. \left\lfloor
  134. \mathrm{bark}\left(\frac{\mathtt{floor0\_rate} \cdot i}{2n}\right) \cdot \frac{\mathtt{floor0\_bark\_map\_size}} {\mathrm{bark}(.5 \cdot \mathtt{floor0\_rate})}
  135. \right\rfloor
  136. \end{math}
  137. and
  138. \begin{math}
  139. \mathrm{bark}(x) = 13.1 \arctan (.00074x) + 2.24 \arctan (.0000000185x^2 + .0001x)
  140. \end{math}
  141. ]]>
  142. </phrase></textobject>
  143. <imageobject><imagedata fileref="lspmap.png"/></imageobject>
  144. </mediaobject>
  145. </informalequation>
  146. <para>
  147. The above is used to synthesize the LSP curve on a Bark-scale frequency
  148. axis, then map the result to a linear-scale frequency axis.
  149. Similarly, the below calculation synthesizes the output LSP curve <varname>[output]</varname> on a log
  150. (dB) amplitude scale, mapping it to linear amplitude in the last step:</para>
  151. <orderedlist>
  152. <listitem><simpara> <varname>[i]</varname> = 0 </simpara></listitem>
  153. <listitem><simpara> <varname>[&#969;]</varname> = &#960; * map element <varname>[i]</varname> / <varname>[floor0_bark_map_size]</varname></simpara></listitem>
  154. <listitem><para>if ( <varname>[floor0_order]</varname> is odd ) {
  155. <orderedlist>
  156. <listitem><para>calculate <varname>[p]</varname> and <varname>[q]</varname> according to:
  157. <informalequation>
  158. <mediaobject>
  159. <textobject><phrase>[equation for odd lsp]</phrase></textobject>
  160. <textobject role="tex"><phrase>
  161. <![CDATA[
  162. \begin{eqnarray*}
  163. p & = & (1 - \cos^2\omega)\prod_{j=0}^{(\mathtt{order}-3)/2} 4 (\cos c_{2j+1} - \cos \omega)^2 \\
  164. q & = & \frac{1}{4} \prod_{j=0}^{(\mathtt{order}-1)/2} 4 (\cos c_{2j} - \cos \omega)^2
  165. \end{eqnarray*}
  166. ]]>
  167. </phrase></textobject>
  168. <imageobject><imagedata fileref="oddlsp.png"/></imageobject>
  169. </mediaobject>
  170. </informalequation>
  171. </para></listitem>
  172. </orderedlist>
  173. } else <varname>[floor0_order]</varname> is even {
  174. <orderedlist>
  175. <listitem><para>calculate <varname>[p]</varname> and <varname>[q]</varname> according to:
  176. <informalequation>
  177. <mediaobject>
  178. <textobject><phrase>[equation for even lsp]</phrase></textobject>
  179. <textobject role="tex"><phrase>
  180. <![CDATA[
  181. \begin{eqnarray*}
  182. p & = & \frac{(1 - \cos^2\omega)}{2} \prod_{j=0}^{(\mathtt{order}-2)/2} 4 (\cos c_{2j+1} - \cos \omega)^2 \\
  183. q & = & \frac{(1 + \cos^2\omega)}{2} \prod_{j=0}^{(\mathtt{order}-2)/2} 4 (\cos c_{2j} - \cos \omega)^2
  184. \end{eqnarray*}
  185. ]]>
  186. </phrase></textobject>
  187. <imageobject><imagedata fileref="evenlsp.png"/></imageobject>
  188. </mediaobject>
  189. </informalequation>
  190. </para></listitem>
  191. </orderedlist>
  192. }
  193. </para></listitem>
  194. <listitem><para>calculate <varname>[linear_floor_value]</varname> according to:
  195. <informalequation>
  196. <mediaobject>
  197. <textobject><phrase>[expression for floorval]</phrase></textobject>
  198. <textobject role="tex"><phrase>
  199. <![CDATA[
  200. \begin{math}
  201. \exp \left( .11512925 \left(\frac{\mathtt{amplitude} \cdot \mathtt{floor0\_amplitute\_offset}}{(2^{\mathtt{floor0\_amplitude\_bits}}-1)\sqrt{p+q}}
  202. - \mathtt{floor0\_amplitude\_offset} \right) \right)
  203. \end{math}
  204. ]]>
  205. </phrase></textobject>
  206. <imageobject><imagedata fileref="floorval.png"/></imageobject>
  207. </mediaobject>
  208. </informalequation>
  209. </para></listitem>
  210. <listitem><simpara><varname>[iteration_condition]</varname> = map element <varname>[i]</varname></simpara></listitem>
  211. <listitem><simpara><varname>[output]</varname> element <varname>[i]</varname> = <varname>[linear_floor_value]</varname></simpara></listitem>
  212. <listitem><simpara>increment <varname>[i]</varname></simpara></listitem>
  213. <listitem><simpara>if ( map element <varname>[i]</varname> is equal to <varname>[iteration_condition]</varname> ) continue at step 5</simpara></listitem>
  214. <listitem><simpara>if ( <varname>[i]</varname> is less than <varname>[n]</varname> ) continue at step 2</simpara></listitem>
  215. <listitem><simpara>done</simpara></listitem>
  216. </orderedlist>
  217. </section>
  218. </section>
  219. </section>