vorbis.html 8.2 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197
  1. <HTML><HEAD><TITLE>xiph.org: Ogg Vorbis documentation</TITLE>
  2. <BODY bgcolor="#ffffff" text="#202020" link="#006666" vlink="#000000">
  3. <nobr><img src="white-ogg.png"><img src="vorbisword2.png"></nobr><p>
  4. <h1><font color=#000070>
  5. Ogg Vorbis encoding format documentation
  6. </font></h1>
  7. <em>Last update to this document: July 15, 1999</em><br>
  8. <em>Last update to Vorbis documentation: July 21, 1999</em><p>
  9. <table><tr><td>
  10. <img src=wait.png>
  11. </td><td valign=center>
  12. As of writing, not all the below document
  13. links are live. They will be populated as we complete the
  14. documents.
  15. </td></tr></table>
  16. <p>
  17. <h2>Documents</h2>
  18. <ul>
  19. <li><a href="packet.html">Vorbis packet structure</a>
  20. <li><a href="envelope.html">Temporal envelope shaping and blocksize</a>
  21. <li><a href="mdct.html">Time domain segmentation and MDCT transform</a>
  22. <li><a href="resolution.html">The resolution floor</a>
  23. <li><a href="residuals.html">MDCT-domain fine structure</a><p>
  24. <li><a href="probmodel.html">The Vorbis probability model</a>
  25. <li><a href="bitpack.html">The Vorbis bitpacker</a><p>
  26. <li><a href="oggstream.html">Ogg bitstream overview</a>
  27. <li><a href="framing.html">Ogg logical bitstream and framing spec</a>
  28. <li><a href="vorbis-stream.html">Vorbis packet->Ogg bitstream
  29. mapping</a><p>
  30. <li><a href="programming.html">Programming with libvorbis</a><p>
  31. </ul>
  32. <h2>Description</h2>
  33. Ogg Vorbis is a general purpose compressed audio format
  34. for high quality (44.1-48.0kHz, 16+ bit, polyphonic) audio and music
  35. at moderate fixed and variable bitrates (40-80 kb/s/channel). This
  36. places Vorbis in the same class as audio representations including
  37. MPEG-1 audio layer 3, MPEG-4 audio (AAC and TwinVQ), and PAC.<p>
  38. Vorbis is the first of a planned family of Ogg multimedia coding
  39. formats being developed as part of Xiphophorus's Ogg multimedia
  40. project. See <a href="http://www.xiph.org/">http://www.xiph.org/</a>
  41. for more information.
  42. <h2>Vorbis technical documents</h2>
  43. A Vorbis encoder takes in overlapping (but contiguous) short-time
  44. segments of audio data. The encoder analyzes the content of the audio
  45. to determine an optimal compact representation; this phase of encoding
  46. is known as <em>analysis</em>. For each short-time block of sound,
  47. the encoder then packs an efficient representation of the signal, as
  48. determined by analysis, into a raw packet much smaller than the size
  49. required by the original signal; this phase is <em>coding</em>.
  50. Lastly, in a streaming environment, the raw packets are then
  51. structured into a continuous stream of octets; this last phase is
  52. <em>streaming</em>. Note that the stream of octets is referred to both
  53. as a 'byte-' and 'bit-'stream; the latter usage is acceptible as the
  54. stream of octets is a physical representation of a true logical
  55. bit-by-bit stream.<p>
  56. A Vorbis decoder performs a mirror image process of extracting the
  57. original sequence of raw packets from an Ogg stream (<em>stream
  58. decomposition</em>), reconstructing the signal representation from the
  59. raw data in the packet (<em>decoding</em>) and them reconstituting an
  60. audio signal from the decoded representation (<em>synthesis</em>).<p>
  61. The <a href="programming.html">Programming with libvorbis</a>
  62. documents discuss use of the reference Vorbis codec library
  63. (libvorbis) produced by Xiphophorus.<p>
  64. The data representations and algorithms necessary at each step to
  65. encode and decode Ogg Vorbis bitstreams are described by the below
  66. documents in sufficient detail to construct a complete Vorbis codec.
  67. Note that at the time of writing, Vorbis is still in a 'Request For
  68. Comments' stage of development; despite being in advanced stages of
  69. development, input from the multimedia community is welcome.<p>
  70. <h3>Vorbis analysis and synthesis</h3>
  71. Analysis begins by seperating an input audio stream into individual,
  72. overlapping short-time segments of audio data. These segments are
  73. then transformed into an alternate representation, seeking to
  74. represent the original signal in a more efficient form that codes into
  75. a smaller number of bytes. The analysis and transformation stage is
  76. the most complex element of producing a Vorbis bitstream.<p>
  77. The corresponding synthesis step in the decoder is simpler; there is
  78. no analysis to perform, merely a mechanical, deterministic
  79. reconstruction of the original audio data from the transform-domain
  80. representation.<p>
  81. <ul>
  82. <li><a href="packet.html">Vorbis packet structure</a>: Describes the basic analysis components necessary to produce Vorbis packets and the structure of the packet itself.
  83. <li><a href="envelope.html">Temporal envelope shaping and blocksize</a>: Use of temporal envelope shaping and variable blocksize to minimize time-domain energy leakage during wide dynamic range and spectral energy swings. Also discusses time-related principles of psychoacoustics.
  84. <li><a href="mdct.html">Time domain segmentation and MDCT transform</a>: Division of time domain data into individual overlapped, windowed short-time vectors and transformation using the MDCT
  85. <li><a href="resolution.html">The resolution floor</a>: Use of frequency doamin psychoacoustics, and the MDCT-domain noise, masking and resolution floors
  86. <li><a href="residuals.html">MDCT-domain fine structure</a>: Production, quantization and massaging of MDCT-spectrum fine structure
  87. </ul>
  88. <h3>Vorbis coding and decoding</h3>
  89. Coding and decoding converts the transform-domain representation of
  90. the original audio produced by analysis to and from a bitwise packed
  91. raw data packet. Coding and decoding consist of two logically
  92. orthogonal concepts, <em>back-end coding</em> and <em>bitpacking</em>.<p>
  93. <em>Back-end coding</em> uses a probability model to represent the raw numbers
  94. of the audio representation in as few physical bits as possible;
  95. familiar examples of back-end coding include Huffman coding and Vector
  96. Quantization.<p>
  97. <em>Bitpacking</em> arranges the variable sized words of the back-end
  98. coding into a vector of octets without wasting space. The octets
  99. produced by coding a single short-time audio segment is one raw Vorbis
  100. packet.<p>
  101. <ul>
  102. <li><a href="probmodel.html">The Vorbis probability model</a>
  103. <li><a href="bitpack.html">The Vorbis bitpacker</a>: Arrangement of
  104. variable bit-length words into an octet-aligned packet.
  105. </ul>
  106. <h3>Vorbis streaming and stream decomposition</h3>
  107. Vorbis packets contain the raw, bitwise-compressed representation of a
  108. snippet of audio. These packets contain no structure and cannot be
  109. strung together directly into a stream; for streamed transmission and
  110. storage, Vorbis packets are encoded into an Ogg bitstream.<p>
  111. <ul>
  112. <li><a href="oggstream.html">Ogg bitstream overview</a>: High-level
  113. description of Ogg logical bitstreams, how logical bitstreams
  114. (of mixed media types) can be combined into physical bitstreams, and
  115. restrictions on logical-to-physical mapping. Note that this document is
  116. not specific only to Ogg Vorbis.
  117. <li><a href="framing.html">Ogg logical bitstream and framing
  118. spec</a>: Low level, complete specification of Ogg logical
  119. bitstream pages. Note that this document is not specific only to Ogg
  120. Vorbis.
  121. <li><a href="vorbis-stream.html">Vorbis bitstream mapping</a>:
  122. Specifically describes mapping Vorbis data into an
  123. Ogg physical bitstream.
  124. </ul>
  125. <hr>
  126. <a href="http://www.xiph.org/">
  127. <img src="white-xifish.png" align=left border=0>
  128. </a>
  129. <font size=-2 color=#505050>
  130. Ogg is a <a href="http://www.xiph.org">Xiphophorus</a> effort to
  131. protect essential tenets of Internet multimedia from corporate
  132. hostage-taking; Open Source is the net's greatest tool to keep
  133. everyone honest. See <a href="http://www.xiph.org/about.html">About
  134. Xiphophorus</a> for details.
  135. <p>
  136. Ogg Vorbis is the first Ogg audio CODEC. Anyone may
  137. freely use and distribute the Ogg and Vorbis specification,
  138. whether in a private, public or corporate capacity. However,
  139. Xiphophorus and the Ogg project (xiph.org) reserve the right to set
  140. the Ogg/Vorbis specification and certify specification compliance.<p>
  141. Xiphophorus's Vorbis software CODEC implementation is distributed
  142. under the Lesser/Library GNU Public License. This does not restrict
  143. third parties from distributing independent implementations of Vorbis
  144. software under other licenses.<p>
  145. OggSquish, Vorbis, Xiphophorus and their logos are trademarks (tm) of
  146. <a href="http://www.xiph.org/">Xiphophorus</a>. These pages are
  147. copyright (C) 1994-2000 Xiphophorus. All rights reserved.<p>
  148. </body>