<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Hacking the it crowd</title>
	<atom:link href="http://theitcrowd.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://theitcrowd.wordpress.com</link>
	<description>Extracting and decoding the it crowd season 2 easter eggs</description>
	<lastBuildDate>Thu, 20 Dec 2007 23:51:58 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='theitcrowd.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/8473c94d4305fd5fb99086f69d8002b9?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Hacking the it crowd</title>
		<link>http://theitcrowd.wordpress.com</link>
	</image>
			<item>
		<title>Linked on German Wikipedia</title>
		<link>http://theitcrowd.wordpress.com/2007/12/21/linked-on-german-wikipedia/</link>
		<comments>http://theitcrowd.wordpress.com/2007/12/21/linked-on-german-wikipedia/#comments</comments>
		<pubDate>Thu, 20 Dec 2007 23:51:58 +0000</pubDate>
		<dc:creator>Sven</dc:creator>
				<category><![CDATA[The IT Crowd]]></category>
		<category><![CDATA[base64]]></category>
		<category><![CDATA[DVD]]></category>
		<category><![CDATA[Easter Egg]]></category>
		<category><![CDATA[Easteregg]]></category>
		<category><![CDATA[Geeks]]></category>
		<category><![CDATA[Infocom]]></category>
		<category><![CDATA[Interactive Fiction]]></category>
		<category><![CDATA[Wikipedia]]></category>

		<guid isPermaLink="false">http://theitcrowd.wordpress.com/2007/12/21/linked-on-german-wikipedia/</guid>
		<description><![CDATA[Hi, I just saw that this blog has been linked on the German Wikipedia. Now I will continue writing about the rest of the files. So, stay tuned, I will be back after Christmas.
Happy Holidays to all of you.
       <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=18&subd=theitcrowd&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Hi, I just saw that this blog has been linked on the German Wikipedia. Now I will continue writing about the rest of the files. So, stay tuned, I will be back after Christmas.</p>
<p>Happy Holidays to all of you.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/theitcrowd.wordpress.com/18/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/theitcrowd.wordpress.com/18/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/theitcrowd.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/theitcrowd.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/theitcrowd.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/theitcrowd.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/theitcrowd.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/theitcrowd.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/theitcrowd.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/theitcrowd.wordpress.com/18/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/theitcrowd.wordpress.com/18/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/theitcrowd.wordpress.com/18/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=18&subd=theitcrowd&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://theitcrowd.wordpress.com/2007/12/21/linked-on-german-wikipedia/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/56dae0385d8c38c718a6311147f01afe?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Sven</media:title>
		</media:content>
	</item>
		<item>
		<title>Episode 2 aka Barber.z5</title>
		<link>http://theitcrowd.wordpress.com/2007/11/12/episode-2-aka-barberz5/</link>
		<comments>http://theitcrowd.wordpress.com/2007/11/12/episode-2-aka-barberz5/#comments</comments>
		<pubDate>Mon, 12 Nov 2007 10:40:38 +0000</pubDate>
		<dc:creator>Sven</dc:creator>
				<category><![CDATA[The IT Crowd]]></category>
		<category><![CDATA[base64]]></category>
		<category><![CDATA[DVD]]></category>
		<category><![CDATA[Easter Egg]]></category>
		<category><![CDATA[Easteregg]]></category>
		<category><![CDATA[Geeks]]></category>
		<category><![CDATA[Infocom]]></category>
		<category><![CDATA[Interactive Fiction]]></category>

		<guid isPermaLink="false">http://theitcrowd.wordpress.com/2007/11/12/episode-2-aka-barberz5/</guid>
		<description><![CDATA[Let&#8217;s take a look at the 2nd Episode. As pointed out here I was unable to decode the file with the correct md5sum. Nevertheless I was able to use it.
Barber.z5 is a story file for a z-machine. See wikipedia

Just download one of the many interpreters and load up the file. It&#8217;s a text adventure, set [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=17&subd=theitcrowd&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Let&#8217;s take a look at the 2nd Episode. As pointed out <a href="http://theitcrowd.wordpress.com/2007/11/08/getting-the-files/#more-15">here</a> I was unable to decode the file with the correct md5sum. Nevertheless I was able to use it.</p>
<p>Barber.z5 is a story file for a z-machine. See <a href="http://en.wikipedia.org/wiki/Z-machine">wikipedia</a><br />
<span id="more-17"></span><br />
Just download one of the many interpreters and load up the file. It&#8217;s a text adventure, set in a well known environment. For instructions on how to play, take a look here:</p>
<ul>
<li><a href="http://www.microheaven.com/IFGuide/" target="_blank">A Beginner&#8217;s Guide  to Playing Interactive Fiction</a></li>
<li><a href="http://adamcadre.ac/content/if.txt" target="_blank">Interactive Fiction &#8211; Getting started</a></li>
<li><a href="http://www.malinche.net/playerguide.pdf" target="_blank">Interactive Fiction Player&#8217;s Guide</a></li>
</ul>
<p>If you want a walkthrough, try using &#8220;strings&#8221; at the linux prompt. Otherwise, here&#8217;s one tip, without it you won&#8217;t even be able to get into the office: <a href="http://myweb.tiscali.co.uk/themicrouser/reviews/book/assemblyprogrammingmadeeasy.htm" target="_blank">Take a look here</a></p>
<p>I haven&#8217;t solved the whole game myself yet, but when I do, I&#8217;ll inform you <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /><br />
Now, have a nice game-</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/theitcrowd.wordpress.com/17/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/theitcrowd.wordpress.com/17/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/theitcrowd.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/theitcrowd.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/theitcrowd.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/theitcrowd.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/theitcrowd.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/theitcrowd.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/theitcrowd.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/theitcrowd.wordpress.com/17/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/theitcrowd.wordpress.com/17/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/theitcrowd.wordpress.com/17/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=17&subd=theitcrowd&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://theitcrowd.wordpress.com/2007/11/12/episode-2-aka-barberz5/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/56dae0385d8c38c718a6311147f01afe?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Sven</media:title>
		</media:content>
	</item>
		<item>
		<title>We&#8217;ve got the files, but what now?</title>
		<link>http://theitcrowd.wordpress.com/2007/11/09/weve-got-the-files-but-what-now/</link>
		<comments>http://theitcrowd.wordpress.com/2007/11/09/weve-got-the-files-but-what-now/#comments</comments>
		<pubDate>Fri, 09 Nov 2007 19:49:38 +0000</pubDate>
		<dc:creator>Sven</dc:creator>
				<category><![CDATA[The IT Crowd]]></category>
		<category><![CDATA[base64]]></category>
		<category><![CDATA[DVD]]></category>
		<category><![CDATA[Easter Egg]]></category>
		<category><![CDATA[Easteregg]]></category>
		<category><![CDATA[Exif]]></category>
		<category><![CDATA[Geeks]]></category>
		<category><![CDATA[Geolocation]]></category>
		<category><![CDATA[ocr]]></category>

		<guid isPermaLink="false">http://theitcrowd.wordpress.com/2007/11/09/weve-got-the-files-but-what-now/</guid>
		<description><![CDATA[After we extracted and decoded the files, we now want to use them. So, what are they, how to use them?
At this stage, we take a look at  http://www.ilovewillies.com (Only awaylable in the Google-cache)
Edit: I fixed the links


EasterEgg1.jpg
Barber.z5
Eggdump
Sophie&#8217;s Choice
EasterEgg3.part.2
Dr. Mike Thorne&#8217;s flashing dot at the side of your screen

These are all the informations we [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=16&subd=theitcrowd&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>After we extracted and decoded the files, we now want to use them. So, what are they, how to use them?</p>
<p>At this stage, we take a look at  <a href="http://www.ilovewillies.com" target="_blank">http://www.ilovewillies.com</a> (Only awaylable in the <a href="http://209.85.129.104/search?q=cache:uvbapyF30X4J:www.ilovewillies.com/+ilovewillies&amp;ct=clnk&amp;cd=1">Google-cache</a>)</p>
<p><strong>Edit: I fixed the links</strong></p>
<p><span id="more-16"></span></p>
<ol>
<li><a href="http://en.wikipedia.org/wiki/Exif#Geolocation">EasterEgg1.jpg</a></li>
<li><a href="http://en.wikipedia.org/wiki/Z-machine">Barber.z5</a></li>
<li><a href="http://en.wikipedia.org/wiki/BBC_Micro#Display_modes">Eggdump</a></li>
<li><a href="http://en.wikipedia.org/wiki/BBC_BASIC">Sophie&#8217;s Choice</a></li>
<li><a href="http://en.wikipedia.org/wiki/JPEG">EasterEgg3.part.2</a></li>
<li><a href="http://tv.cream.org/a-z/f/f2.htm#4computerbuffs">Dr. Mike Thorne</a>&#8217;s <a href="http://en.wikipedia.org/wiki/Tagged_Image_File_Format">flashing dot</a> <a href="http://en.wikipedia.org/wiki/Overscan">at the side of your screen</a></li>
</ol>
<p>These are all the informations we need.</p>
<p>Episode 1 gives us an image of Moss, the interesting part is the exif-geolocation. See: <a href="http://bbcentral.wordpress.com/2007/10/29/hacking-the-it-crowd-series-2-dvd-part-2/">BBCentral&#8217;s Blog</a></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/theitcrowd.wordpress.com/16/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/theitcrowd.wordpress.com/16/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/theitcrowd.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/theitcrowd.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/theitcrowd.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/theitcrowd.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/theitcrowd.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/theitcrowd.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/theitcrowd.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/theitcrowd.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/theitcrowd.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/theitcrowd.wordpress.com/16/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=16&subd=theitcrowd&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://theitcrowd.wordpress.com/2007/11/09/weve-got-the-files-but-what-now/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/56dae0385d8c38c718a6311147f01afe?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Sven</media:title>
		</media:content>
	</item>
		<item>
		<title>Getting the files</title>
		<link>http://theitcrowd.wordpress.com/2007/11/08/getting-the-files/</link>
		<comments>http://theitcrowd.wordpress.com/2007/11/08/getting-the-files/#comments</comments>
		<pubDate>Thu, 08 Nov 2007 08:45:53 +0000</pubDate>
		<dc:creator>Sven</dc:creator>
				<category><![CDATA[The IT Crowd]]></category>
		<category><![CDATA[base64]]></category>
		<category><![CDATA[DVD]]></category>
		<category><![CDATA[Easter Egg]]></category>
		<category><![CDATA[Easteregg]]></category>
		<category><![CDATA[Geeks]]></category>
		<category><![CDATA[ocr]]></category>

		<guid isPermaLink="false">http://theitcrowd.wordpress.com/2007/11/08/getting-the-files/</guid>
		<description><![CDATA[After my last post, you will end up with a lot of text files. We will now convert these text files to the actual binary files.
To do that we will first have to remove duplicated files. Transcode will sometimes extract one subtitle twice. Use your favourite image viewer (best with automatic search for duplicated files) [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=15&subd=theitcrowd&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>After my last post, you will end up with a lot of text files. We will now convert these text files to the actual binary files.</p>
<p>To do that we will first have to remove duplicated files. Transcode will sometimes extract one subtitle twice. Use your favourite image viewer (best with automatic search for duplicated files) to identify those. Do not delete them, as it could of course be possible (even if not very probable) that it wasn&#8217;t transcode&#8217;s fault but they are actually two identical subtitles.</p>
<p>The next step is to join all txt-files and to remove accidentally inserted spaces. A simple<br />
<span style="font-family:courier new;">sed -e &#8220;s/ //g&#8221; subtitles-*txt &gt; base64.txt</span><br />
entered on the console will do.</p>
<p>Now we have one file containing the base64-encoded source, <span id="more-15"></span>the md5sum for that file and the original file name. As I didn&#8217;t find a program which could handle these files the way I wanted I wrote the following python script to extract the file and check the md5sum:</p>
<p><code><br />
#!/usr/bin/python<br />
import sys, base64, md5</code></p>
<p>def hexToString(md5bin):<br />
result = &#8220;&#8221;<br />
for ch in md5bin:<br />
realCh = hex(ord(ch)).replace(&#8220;0x&#8221;,&#8221;")<br />
result += realCh<br />
return result</p>
<p>if len(sys.argv) &lt; 2:<br />
print &#8220;&#8221;"Usage: %s in_b64_enc_file</p>
<p>in_b64_enc_file  &#8211; The Base64 encoded file to be converted<br />
&#8220;&#8221;"%sys.argv[0]<br />
sys.exit(0)</p>
<p>f = file(sys.argv[1], &#8216;rb&#8217;)<br />
s = f.readline()<br />
md5string = &#8220;&#8221;<br />
while (s.strip()):<br />
sarray = s.strip().split(&#8216;:&#8217;)<br />
if (sarray[0] == &#8220;Content-MD5&#8243;):<br />
md5base64 = sarray[1]<br />
md5bin = base64.decodestring(md5base64)<br />
md5string = hexToString(md5bin)<br />
print &#8220;MD5 should be: &#8221; + md5string<br />
if (len(sarray)&lt;2):<br />
sarray = s.strip().split(&#8220;=&#8221;)<br />
filename = sarray[1].strip(&#8216;&#8221;&#8216;)<br />
print filename<br />
s = f.readline()</p>
<p>fout = file(filename, &#8216;wb&#8217;)<br />
s = f.read()<br />
decoded = base64.decodestring(s)<br />
fout.write(decoded)<br />
f.close()<br />
fout.close()<br />
m = md5.new(decoded)<br />
print &#8220;MD5 is: &#8221; + hexToString(m.digest())<br />
print &#8220;MD5 matches: &#8221; + str(hexToString(m.digest()) == md5string)</p>
<p>If you were not careful enough (like me) and entered a wrong letter, or enter the right letter with the wrong case, when pgm2txt asked you, you will notice that now. Well, it&#8217;s only a question of hours, the convert the image to text again <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> .  If any one of you gets the right md5sum for Episode 2 (Barber.z5), please let me know. I always get it wrong even though I am able to use the file.</p>
<p>The next post will talk about the extracted file, and what to do with them.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/theitcrowd.wordpress.com/15/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/theitcrowd.wordpress.com/15/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/theitcrowd.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/theitcrowd.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/theitcrowd.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/theitcrowd.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/theitcrowd.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/theitcrowd.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/theitcrowd.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/theitcrowd.wordpress.com/15/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/theitcrowd.wordpress.com/15/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/theitcrowd.wordpress.com/15/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=15&subd=theitcrowd&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://theitcrowd.wordpress.com/2007/11/08/getting-the-files/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/56dae0385d8c38c718a6311147f01afe?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Sven</media:title>
		</media:content>
	</item>
		<item>
		<title>Converting images to text</title>
		<link>http://theitcrowd.wordpress.com/2007/11/05/converting-images-to-text/</link>
		<comments>http://theitcrowd.wordpress.com/2007/11/05/converting-images-to-text/#comments</comments>
		<pubDate>Mon, 05 Nov 2007 12:43:09 +0000</pubDate>
		<dc:creator>Sven</dc:creator>
				<category><![CDATA[The IT Crowd]]></category>
		<category><![CDATA[base64]]></category>
		<category><![CDATA[DVD]]></category>
		<category><![CDATA[Easter Egg]]></category>
		<category><![CDATA[Easteregg]]></category>
		<category><![CDATA[Geeks]]></category>
		<category><![CDATA[ocr]]></category>

		<guid isPermaLink="false">http://theitcrowd.wordpress.com/2007/11/05/converting-images-to-text/</guid>
		<description><![CDATA[Now we get to the interesting part:
Converting the images to text.
I&#8217;m using pgm2txt for this purpose. Using database only mode (&#8220;-d&#8221;) ensures that we have full control. Using gocr intern heuristics gave many errors, e.g. &#8220;l&#8221; was read as &#8220;1&#8243;.
pgm2txt will ask you to enter the display text, whenever it cannot identify the characters. This [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=12&subd=theitcrowd&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Now we get to the interesting part:<br />
Converting the images to text.<br />
I&#8217;m using pgm2txt for this purpose. Using database only mode (&#8220;-d&#8221;) ensures that we have full control. Using gocr intern heuristics gave many errors, e.g. &#8220;l&#8221; was read as &#8220;1&#8243;.</p>
<p>pgm2txt will ask you to enter the display text, whenever it cannot identify the characters. This means in database only mode, you will have to enter every character at least once. That would be a problem, but as the letters aren&#8217;t always clearly separated, you will often end up having to enter the text for combinations like &#8220;KWY&#8221;. Nevertheless, after having trained the db for some time, all you need is patience and cpu power. After some time (1000 pics =&gt; several hours) you will  end up with a lot of text files, each containing one part of base64 encoded file.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/theitcrowd.wordpress.com/12/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/theitcrowd.wordpress.com/12/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/theitcrowd.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/theitcrowd.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/theitcrowd.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/theitcrowd.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/theitcrowd.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/theitcrowd.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/theitcrowd.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/theitcrowd.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/theitcrowd.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/theitcrowd.wordpress.com/12/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=12&subd=theitcrowd&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://theitcrowd.wordpress.com/2007/11/05/converting-images-to-text/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/56dae0385d8c38c718a6311147f01afe?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Sven</media:title>
		</media:content>
	</item>
		<item>
		<title>Extracting subtites</title>
		<link>http://theitcrowd.wordpress.com/2007/11/04/extracting-subtites/</link>
		<comments>http://theitcrowd.wordpress.com/2007/11/04/extracting-subtites/#comments</comments>
		<pubDate>Sun, 04 Nov 2007 14:15:00 +0000</pubDate>
		<dc:creator>Sven</dc:creator>
				<category><![CDATA[The IT Crowd]]></category>
		<category><![CDATA[base64]]></category>
		<category><![CDATA[DVD]]></category>
		<category><![CDATA[Easter Egg]]></category>
		<category><![CDATA[Easteregg]]></category>
		<category><![CDATA[Geeks]]></category>
		<category><![CDATA[hacking]]></category>

		<guid isPermaLink="false">http://theitcrowd.wordpress.com/2007/11/04/extracting-subtites/</guid>
		<description><![CDATA[Step one is easy.
You will need the following programs:

transcode
mplayer
subtitleripper

Using with Linux just use the following script (originally taken from the Gentoo Wiki):

#!/bin/bash
lsdvd
echo &#8220;Please type in the stream number&#8221;
read DVDSTREAM;
mplayer -dvd-device /dev/dvd dvd://$DVDSTREAM -vo null -ao null -frames 0 -v 2&#62;&#38;1 &#124; grep sid
# ask the user for sid&#8230;# the correct number is 0&#215;20 + sid
echo [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=4&subd=theitcrowd&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Step one is easy.<br />
You will need the following programs:</p>
<ul>
<li>transcode</li>
<li>mplayer</li>
<li>subtitleripper</li>
</ul>
<p>Using with Linux just use the following script (originally taken from the Gentoo Wiki):<br />
<span style="font-family:courier new;"><br />
#!/bin/bash<br />
lsdvd<br />
echo &#8220;Please type in the stream number&#8221;<br />
read DVDSTREAM;<br />
mplayer -dvd-device /dev/dvd dvd://$DVDSTREAM -vo null -ao null -frames 0 -v 2&gt;&amp;1 | grep sid<br />
# ask the user for sid&#8230;# the correct number is 0&#215;20 + sid<br />
echo &#8220;Please type in the subtitle SID, type in hexadecimal and add 0&#215;20. Example: for sid 0, type 0&#215;20&#8243;<br />
read SID;</span></p>
<p>tccat -i /dev/dvd -T $DVDSTREAM -L | tcextract -x ps1 -t vob -a $SID &gt; subs<br />
subtitles2pgm -o subtitles-$DVDSTREAM -c 0,255,255,255<br />
We will end up with a lot of pictures, each containing one piece of the final base64 encoded file. The next step will be to convert these pictures into text using an ocr-programm</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/theitcrowd.wordpress.com/4/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/theitcrowd.wordpress.com/4/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/theitcrowd.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/theitcrowd.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/theitcrowd.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/theitcrowd.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/theitcrowd.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/theitcrowd.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/theitcrowd.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/theitcrowd.wordpress.com/4/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/theitcrowd.wordpress.com/4/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/theitcrowd.wordpress.com/4/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=4&subd=theitcrowd&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://theitcrowd.wordpress.com/2007/11/04/extracting-subtites/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/56dae0385d8c38c718a6311147f01afe?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Sven</media:title>
		</media:content>
	</item>
		<item>
		<title>What&#8217;s this all about?</title>
		<link>http://theitcrowd.wordpress.com/2007/11/04/whats-this-all-about/</link>
		<comments>http://theitcrowd.wordpress.com/2007/11/04/whats-this-all-about/#comments</comments>
		<pubDate>Sun, 04 Nov 2007 13:16:00 +0000</pubDate>
		<dc:creator>Sven</dc:creator>
				<category><![CDATA[The IT Crowd]]></category>
		<category><![CDATA[base64]]></category>
		<category><![CDATA[DVD]]></category>
		<category><![CDATA[Easter Egg]]></category>
		<category><![CDATA[Easteregg]]></category>
		<category><![CDATA[Geeks]]></category>
		<category><![CDATA[hacking]]></category>

		<guid isPermaLink="false">http://theitcrowd.wordpress.com/2007/11/04/whats-this-all-about/</guid>
		<description><![CDATA[It recently ordered the &#8220;The IT Crowd&#8221; DVD set from Amazon.uk. Season 1 contains some normal easter eggs, but the Season 2 DVD goes far beyond. The leet subtitles are actually base64 encoded files.
So, what to do now?

Extract subtitles from DVD
Convert subtitles to text
Decode text to files

       <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=3&subd=theitcrowd&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>It recently ordered the &#8220;The IT Crowd&#8221; DVD set from Amazon.uk. Season 1 contains some normal easter eggs, but the Season 2 DVD goes far beyond. The leet subtitles are actually base64 encoded files.<br />
So, what to do now?</p>
<ol>
<li>Extract subtitles from DVD</li>
<li>Convert subtitles to text</li>
<li>Decode text to files</li>
</ol>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/theitcrowd.wordpress.com/3/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/theitcrowd.wordpress.com/3/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/theitcrowd.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/theitcrowd.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/theitcrowd.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/theitcrowd.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/theitcrowd.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/theitcrowd.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/theitcrowd.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/theitcrowd.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/theitcrowd.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/theitcrowd.wordpress.com/3/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=theitcrowd.wordpress.com&blog=2058396&post=3&subd=theitcrowd&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://theitcrowd.wordpress.com/2007/11/04/whats-this-all-about/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/56dae0385d8c38c718a6311147f01afe?s=96&#38;d=identicon" medium="image">
			<media:title type="html">Sven</media:title>
		</media:content>
	</item>
	</channel>
</rss>