[code.view]

[top] / python / PyMOTW / docs / base64 / index.html


<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
  "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    
    <title>base64 – Encode binary data into ASCII characters &mdash; Python Module of the Week</title>
    <link rel="stylesheet" href="../_static/sphinxdoc.css" type="text/css" />
    <link rel="stylesheet" href="../_static/pygments.css" type="text/css" />
    <script type="text/javascript">
      var DOCUMENTATION_OPTIONS = {
        URL_ROOT:    '../',
        VERSION:     '1.132',
        COLLAPSE_INDEX: false,
        FILE_SUFFIX: '.html',
        HAS_SOURCE:  true
      };
    </script>
    <script type="text/javascript" src="../_static/jquery.js"></script>
    <script type="text/javascript" src="../_static/underscore.js"></script>
    <script type="text/javascript" src="../_static/doctools.js"></script>
    <link rel="author" title="About these documents" href="../about.html" />
    <link rel="top" title="Python Module of the Week" href="../index.html" />
    <link rel="up" title="Internet Data Handling" href="../internet_data.html" />
    <link rel="next" title="json – JavaScript Object Notation Serializer" href="../json/index.html" />
    <link rel="prev" title="Internet Data Handling" href="../internet_data.html" /> 
  </head>
  <body>
    <div class="related">
      <h3>Navigation</h3>
      <ul>
        <li class="right" style="margin-right: 10px">
          <a href="../genindex.html" title="General Index"
             accesskey="I">index</a></li>
        <li class="right" >
          <a href="../py-modindex.html" title="Python Module Index"
             >modules</a> |</li>
        <li class="right" >
          <a href="../json/index.html" title="json – JavaScript Object Notation Serializer"
             accesskey="N">next</a> |</li>
        <li class="right" >
          <a href="../internet_data.html" title="Internet Data Handling"
             accesskey="P">previous</a> |</li>
        <li><a href="../contents.html">PyMOTW</a> &raquo;</li>
          <li><a href="../internet_data.html" accesskey="U">Internet Data Handling</a> &raquo;</li> 
      </ul>
    </div>
      <div class="sphinxsidebar">
        <div class="sphinxsidebarwrapper">
  <h3><a href="../contents.html">Table Of Contents</a></h3>
  <ul>
<li><a class="reference internal" href="#">base64 &#8211; Encode binary data into ASCII characters</a><ul>
<li><a class="reference internal" href="#base-64-encoding">Base 64 Encoding</a></li>
<li><a class="reference internal" href="#base-64-decoding">Base 64 Decoding</a></li>
<li><a class="reference internal" href="#url-safe-variations">URL-safe Variations</a></li>
<li><a class="reference internal" href="#other-encodings">Other Encodings</a></li>
</ul>
</li>
</ul>

  <h4>Previous topic</h4>
  <p class="topless"><a href="../internet_data.html"
                        title="previous chapter">Internet Data Handling</a></p>
  <h4>Next topic</h4>
  <p class="topless"><a href="../json/index.html"
                        title="next chapter">json &#8211; JavaScript Object Notation Serializer</a></p>
  <h3>This Page</h3>
  <ul class="this-page-menu">
    <li><a href="../_sources/base64/index.txt"
           rel="nofollow">Show Source</a></li>
  </ul>
<div id="searchbox" style="display: none">
  <h3>Quick search</h3>
    <form class="search" action="../search.html" method="get">
      <input type="text" name="q" size="18" />
      <input type="submit" value="Go" />
      <input type="hidden" name="check_keywords" value="yes" />
      <input type="hidden" name="area" value="default" />
    </form>
    <p class="searchtip" style="font-size: 90%">
    Enter search terms or a module, class or function name.
    </p>
</div>
<script type="text/javascript">$('#searchbox').show(0);</script>
        </div>
      </div>

    <div class="document">
      <div class="documentwrapper">
        <div class="bodywrapper">
          <div class="body">
            
  <div class="section" id="module-base64">
<span id="base64-encode-binary-data-into-ascii-characters"></span><h1>base64 &#8211; Encode binary data into ASCII characters<a class="headerlink" href="#module-base64" title="Permalink to this headline">¶</a></h1>
<table class="docutils field-list" frame="void" rules="none">
<col class="field-name" />
<col class="field-body" />
<tbody valign="top">
<tr class="field"><th class="field-name">Purpose:</th><td class="field-body">The base64 module contains functions for translating binary data into a subset of ASCII suitable for transmission using plaintext protocols.</td>
</tr>
<tr class="field"><th class="field-name">Python Version:</th><td class="field-body">1.4 and later</td>
</tr>
</tbody>
</table>
<p>The base64, base32, and base16 encodings convert 8 bit bytes to values with 6, 5, or 4 bits of useful data per byte, allowing non-ASCII bytes to be encoded as ASCII characters for transmission over protocols that require plain ASCII, such as SMTP.  The <em>base</em> values correspond to the length of the alphabet used in each encoding.  There are also URL-safe variations of the original encodings that use slightly different results.</p>
<div class="section" id="base-64-encoding">
<h2>Base 64 Encoding<a class="headerlink" href="#base-64-encoding" title="Permalink to this headline">¶</a></h2>
<p>A basic example of encoding some text looks like this:</p>
<div class="highlight-python"><div class="highlight"><pre><span class="kn">import</span> <span class="nn">base64</span>

<span class="c"># Load this source file and strip the header.</span>
<span class="n">initial_data</span> <span class="o">=</span> <span class="nb">open</span><span class="p">(</span><span class="n">__file__</span><span class="p">,</span> <span class="s">&#39;rt&#39;</span><span class="p">)</span><span class="o">.</span><span class="n">read</span><span class="p">()</span><span class="o">.</span><span class="n">split</span><span class="p">(</span><span class="s">&#39;#end_pymotw_header&#39;</span><span class="p">)[</span><span class="mi">1</span><span class="p">]</span>

<span class="n">encoded_data</span> <span class="o">=</span> <span class="n">base64</span><span class="o">.</span><span class="n">b64encode</span><span class="p">(</span><span class="n">initial_data</span><span class="p">)</span>

<span class="n">num_initial</span> <span class="o">=</span> <span class="nb">len</span><span class="p">(</span><span class="n">initial_data</span><span class="p">)</span>
<span class="n">padding</span> <span class="o">=</span> <span class="p">{</span> <span class="mi">0</span><span class="p">:</span><span class="mi">0</span><span class="p">,</span> <span class="mi">1</span><span class="p">:</span><span class="mi">2</span><span class="p">,</span> <span class="mi">2</span><span class="p">:</span><span class="mi">1</span> <span class="p">}[</span><span class="n">num_initial</span> <span class="o">%</span> <span class="mi">3</span><span class="p">]</span>

<span class="k">print</span> <span class="s">&#39;</span><span class="si">%d</span><span class="s"> bytes before encoding&#39;</span> <span class="o">%</span> <span class="n">num_initial</span>
<span class="k">print</span> <span class="s">&#39;Expect </span><span class="si">%d</span><span class="s"> padding bytes&#39;</span> <span class="o">%</span> <span class="n">padding</span>
<span class="k">print</span> <span class="s">&#39;</span><span class="si">%d</span><span class="s"> bytes after encoding&#39;</span> <span class="o">%</span> <span class="nb">len</span><span class="p">(</span><span class="n">encoded_data</span><span class="p">)</span>
<span class="k">print</span>
<span class="c">#print encoded_data</span>
<span class="k">for</span> <span class="n">i</span> <span class="ow">in</span> <span class="nb">xrange</span><span class="p">((</span><span class="nb">len</span><span class="p">(</span><span class="n">encoded_data</span><span class="p">)</span><span class="o">/</span><span class="mi">40</span><span class="p">)</span><span class="o">+</span><span class="mi">1</span><span class="p">):</span>
    <span class="k">print</span> <span class="n">encoded_data</span><span class="p">[</span><span class="n">i</span><span class="o">*</span><span class="mi">40</span><span class="p">:(</span><span class="n">i</span><span class="o">+</span><span class="mi">1</span><span class="p">)</span><span class="o">*</span><span class="mi">40</span><span class="p">]</span>
</pre></div>
</div>
<p>The output shows the 558 bytes of the original source expand to 744 bytes after being encoded.</p>
<div class="admonition note">
<p class="first admonition-title">Note</p>
<p class="last">There are no carriage returns in the output produced by the library, so I have broken the encoded data up artificially to make it fit better on the page.</p>
</div>
<div class="highlight-python"><pre>$ python base64_b64encode.py

113 bytes before encoding
Expect 1 padding bytes
152 bytes after encoding

CgppbXBvcnQgYmFzZTY0CgojIExvYWQgdGhpcyBz
b3VyY2UgZmlsZSBhbmQgc3RyaXAgdGhlIGhlYWRl
ci4KaW5pdGlhbF9kYXRhID0gb3BlbihfX2ZpbGVf
XywgJ3J0JykucmVhZCgpLnNwbGl0KCc=</pre>
</div>
</div>
<div class="section" id="base-64-decoding">
<h2>Base 64 Decoding<a class="headerlink" href="#base-64-decoding" title="Permalink to this headline">¶</a></h2>
<p>The encoded string can be converted back to the original form by taking 4 bytes and converting them to the original 3, using a reverse lookup.  The <tt class="docutils literal"><span class="pre">b64decode()</span></tt> function does that for you.</p>
<div class="highlight-python"><div class="highlight"><pre><span class="kn">import</span> <span class="nn">base64</span>

<span class="n">original_string</span> <span class="o">=</span> <span class="s">&#39;This is the data, in the clear.&#39;</span>
<span class="k">print</span> <span class="s">&#39;Original:&#39;</span><span class="p">,</span> <span class="n">original_string</span>

<span class="n">encoded_string</span> <span class="o">=</span> <span class="n">base64</span><span class="o">.</span><span class="n">b64encode</span><span class="p">(</span><span class="n">original_string</span><span class="p">)</span>
<span class="k">print</span> <span class="s">&#39;Encoded :&#39;</span><span class="p">,</span> <span class="n">encoded_string</span>

<span class="n">decoded_string</span> <span class="o">=</span> <span class="n">base64</span><span class="o">.</span><span class="n">b64decode</span><span class="p">(</span><span class="n">encoded_string</span><span class="p">)</span>
<span class="k">print</span> <span class="s">&#39;Decoded :&#39;</span><span class="p">,</span> <span class="n">decoded_string</span>
</pre></div>
</div>
<p>The encoding process looks at each sequence of 24 bits in the input (3 bytes) and encodes those same 24 bits spread over 4 bytes in the output.  The last two characters, the <tt class="docutils literal"><span class="pre">==</span></tt>, are padding because the number of bits in the original string was not evenly divisible by 24 in this example.</p>
<div class="highlight-python"><pre>$ python base64_b64decode.py

Original: This is the data, in the clear.
Encoded : VGhpcyBpcyB0aGUgZGF0YSwgaW4gdGhlIGNsZWFyLg==
Decoded : This is the data, in the clear.</pre>
</div>
</div>
<div class="section" id="url-safe-variations">
<h2>URL-safe Variations<a class="headerlink" href="#url-safe-variations" title="Permalink to this headline">¶</a></h2>
<p>Because the default base64 alphabet may use <tt class="docutils literal"><span class="pre">+</span></tt> and <tt class="docutils literal"><span class="pre">/</span></tt>, and those two characters are used in URLs, it became necessary to specify an alternate encoding with substitutes for those characters.  The <tt class="docutils literal"><span class="pre">+</span></tt> is replaced with a <tt class="docutils literal"><span class="pre">-</span></tt>, and <tt class="docutils literal"><span class="pre">/</span></tt> is replaced with underscore (<tt class="docutils literal"><span class="pre">_</span></tt>).  Otherwise, the alphabet is the same.</p>
<div class="highlight-python"><div class="highlight"><pre><span class="kn">import</span> <span class="nn">base64</span>

<span class="k">for</span> <span class="n">original</span> <span class="ow">in</span> <span class="p">[</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">251</span><span class="p">)</span> <span class="o">+</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">239</span><span class="p">),</span> <span class="nb">chr</span><span class="p">(</span><span class="mi">255</span><span class="p">)</span> <span class="o">*</span> <span class="mi">2</span> <span class="p">]:</span>
    <span class="k">print</span> <span class="s">&#39;Original         :&#39;</span><span class="p">,</span> <span class="nb">repr</span><span class="p">(</span><span class="n">original</span><span class="p">)</span>
    <span class="k">print</span> <span class="s">&#39;Standard encoding:&#39;</span><span class="p">,</span> <span class="n">base64</span><span class="o">.</span><span class="n">standard_b64encode</span><span class="p">(</span><span class="n">original</span><span class="p">)</span>
    <span class="k">print</span> <span class="s">&#39;URL-safe encoding:&#39;</span><span class="p">,</span> <span class="n">base64</span><span class="o">.</span><span class="n">urlsafe_b64encode</span><span class="p">(</span><span class="n">original</span><span class="p">)</span>
    <span class="k">print</span>
</pre></div>
</div>
<div class="highlight-python"><pre>$ python base64_urlsafe.py

Original         : '\xfb\xef'
Standard encoding: ++8=
URL-safe encoding: --8=

Original         : '\xff\xff'
Standard encoding: //8=
URL-safe encoding: __8=</pre>
</div>
</div>
<div class="section" id="other-encodings">
<h2>Other Encodings<a class="headerlink" href="#other-encodings" title="Permalink to this headline">¶</a></h2>
<p>Besides base 64, the module provides functions for working with base 32 and base 16 (hex) encoded data.</p>
<div class="highlight-python"><div class="highlight"><pre><span class="kn">import</span> <span class="nn">base64</span>

<span class="n">original_string</span> <span class="o">=</span> <span class="s">&#39;This is the data, in the clear.&#39;</span>
<span class="k">print</span> <span class="s">&#39;Original:&#39;</span><span class="p">,</span> <span class="n">original_string</span>

<span class="n">encoded_string</span> <span class="o">=</span> <span class="n">base64</span><span class="o">.</span><span class="n">b32encode</span><span class="p">(</span><span class="n">original_string</span><span class="p">)</span>
<span class="k">print</span> <span class="s">&#39;Encoded :&#39;</span><span class="p">,</span> <span class="n">encoded_string</span>

<span class="n">decoded_string</span> <span class="o">=</span> <span class="n">base64</span><span class="o">.</span><span class="n">b32decode</span><span class="p">(</span><span class="n">encoded_string</span><span class="p">)</span>
<span class="k">print</span> <span class="s">&#39;Decoded :&#39;</span><span class="p">,</span> <span class="n">decoded_string</span>
</pre></div>
</div>
<div class="highlight-python"><pre>$ python base64_base32.py

Original: This is the data, in the clear.
Encoded : KRUGS4ZANFZSA5DIMUQGIYLUMEWCA2LOEB2GQZJAMNWGKYLSFY======
Decoded : This is the data, in the clear.</pre>
</div>
<p>The base 16 functions work with the hexadecimal alphabet.</p>
<div class="highlight-python"><div class="highlight"><pre><span class="kn">import</span> <span class="nn">base64</span>

<span class="n">original_string</span> <span class="o">=</span> <span class="s">&#39;This is the data, in the clear.&#39;</span>
<span class="k">print</span> <span class="s">&#39;Original:&#39;</span><span class="p">,</span> <span class="n">original_string</span>

<span class="n">encoded_string</span> <span class="o">=</span> <span class="n">base64</span><span class="o">.</span><span class="n">b16encode</span><span class="p">(</span><span class="n">original_string</span><span class="p">)</span>
<span class="k">print</span> <span class="s">&#39;Encoded :&#39;</span><span class="p">,</span> <span class="n">encoded_string</span>

<span class="n">decoded_string</span> <span class="o">=</span> <span class="n">base64</span><span class="o">.</span><span class="n">b16decode</span><span class="p">(</span><span class="n">encoded_string</span><span class="p">)</span>
<span class="k">print</span> <span class="s">&#39;Decoded :&#39;</span><span class="p">,</span> <span class="n">decoded_string</span>
</pre></div>
</div>
<div class="highlight-python"><pre>$ python base64_base16.py

Original: This is the data, in the clear.
Encoded : 546869732069732074686520646174612C20696E2074686520636C6561722E
Decoded : This is the data, in the clear.</pre>
</div>
<div class="admonition-see-also admonition seealso">
<p class="first admonition-title">See also</p>
<dl class="last docutils">
<dt><a class="reference external" href="http://docs.python.org/library/base64.html">base64</a></dt>
<dd>The standard library documentation for this module.</dd>
<dt><span class="target" id="index-0"></span><a class="rfc reference external" href="http://tools.ietf.org/html/rfc3548.html"><strong>RFC 3548</strong></a></dt>
<dd>The Base16, Base32, and Base64 Data Encodings</dd>
</dl>
</div>
</div>
</div>


          </div>
        </div>
      </div>
      <div class="clearer"></div>
    </div>
    <div class="related">
      <h3>Navigation</h3>
      <ul>
        <li class="right" style="margin-right: 10px">
          <a href="../genindex.html" title="General Index"
             >index</a></li>
        <li class="right" >
          <a href="../py-modindex.html" title="Python Module Index"
             >modules</a> |</li>
        <li class="right" >
          <a href="../json/index.html" title="json – JavaScript Object Notation Serializer"
             >next</a> |</li>
        <li class="right" >
          <a href="../internet_data.html" title="Internet Data Handling"
             >previous</a> |</li>
        <li><a href="../contents.html">PyMOTW</a> &raquo;</li>
          <li><a href="../internet_data.html" >Internet Data Handling</a> &raquo;</li> 
      </ul>
    </div>
    <div class="footer">
      &copy; Copyright Doug Hellmann.
      Last updated on Oct 24, 2010.
      Created using <a href="http://sphinx.pocoo.org/">Sphinx</a>.

    <br/><a href="http://creativecommons.org/licenses/by-nc-sa/3.0/us/" rel="license"><img alt="Creative Commons License" style="border-width:0" src="http://i.creativecommons.org/l/by-nc-sa/3.0/us/88x31.png"/></a>
    
    </div>
  </body>
</html>

[top] / python / PyMOTW / docs / base64 / index.html

contact | logmethods.com