<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Yahoo! Coder's Cookbook Blog</title>
	<atom:link href="http://ycoderscookbook.com/blog/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://ycoderscookbook.com/blog</link>
	<description>YCC - All things concerning Yahoo! Messenger</description>
	<lastBuildDate>Wed, 20 Jan 2010 20:09:04 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How Long Would It Take To Copy The Internet?</title>
		<link>http://ycoderscookbook.com/blog/?p=391</link>
		<comments>http://ycoderscookbook.com/blog/?p=391#comments</comments>
		<pubDate>Wed, 20 Jan 2010 20:09:04 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=391</guid>
		<description><![CDATA[Here is a non-Yahoo! related post that I responded to on Astahost.  Considering it is an interesting answer and it took some thought and time to create, I am also posting it here.  An Astahost member, FireFoxRules, asks the following question (http://www.astahost.com/Downloading-Internet-t21292.html):
I&#8217;m wondering if it is possible to save a copy of everything [...]]]></description>
			<content:encoded><![CDATA[<p>Here is a non-Yahoo! related post that I responded to on Astahost.  Considering it is an interesting answer and it took some thought and time to create, I am also posting it here.  An Astahost member, FireFoxRules, asks the following question (<a href="http://www.astahost.com/Downloading-Internet-t21292.html">http://www.astahost.com/Downloading-Internet-t21292.html</a>):</p>
<blockquote><p>I&#8217;m wondering if it is possible to save a copy of everything on the Internet. Ignoring ISP data transfer limitations (max GB per month), I have a download speed of approximately 4 Mbps.</p>
<p>The Internet isn&#8217;t limited to web pages though, it includes everything that is public accessible (not password-protected) which includes all music, videos, pictures, software, etc. Furthermore, I am not limiting it to HTTP servers as torrents, files on FTP servers and anything on peer-to-peer networks (Gnutella/LimeWire) will count as well.</p>
<p>Saving everything at its current state (ignoring changes to the live version after it is saved), how long will this take? What if I upgrade my Internet connection, or theoretically use all the bandwidth of (for example) educational institutions (universities), ISPs (Shaw, Comcast, etc) and large corporations (Microsoft, Google, etc).</p>
<p>I am not talking about indexing content, I mean saving the actual file. Every web page would be considered one file, and pictures, JavaScript, CSS, etc would be their own files.</p></blockquote>
<p>My response:</p>
<p>Interesting question.  I am actually surprised that that you, FireFoxRules, asked it as it sounds like a crazy idea that I would expect from a newb.  At any rate it did get me to think so I will propose an answer.</p>
<p><em>Assumptions</em>
<ul>
•	You have an insane Internet backbone connection will guaranteed reliability and speed.  I will assume that you have a 100 Mb/sec connection which is usually only available to ISP level organizations.<br />
•	You have an appropriately sized upstream connection to do all the requesting.<br />
•	You actually get the bandwidth you paid for.  I personally have a “10 Mb/sec down and 1 Mb/sec up” consumer cable connection.  I have never seen anything close to these numbers in real life.  The closest I have seen is 2 Mb/sec down (downloading ISOs from Microsoft MSDN) and there is a hard limit of around 115 kb/sec up that I constantly hit.  A more typical download speed is around 500 kb/sec for regular web browsing.<br />
•	We will ignore all network structure and latency issues and assume you have a direct connection to your target with no hops in between.</p>
<ul>
o	The nature of TCP/IP will limit you to around 80% of your bandwidth under ideal operation.  When you have only two computers on a network (the idea case) you will still never get 100% bandwidth because of TCP header overhead, IP header overhead, other traffic such as ARP requests, and IP timing issues.  A typical network usually sees only 45-50% bandwidth because of collisions.  A stressed out network may only get 10%.<br />
o	There is latency between your request and the data.</p>
<ul>
	Machine and router hardware delays.  Usually microseconds.<br />
	Every hop adds delay.  Usually milliseconds.<br />
	Server response time.  Usually small compared to everything else but could become an issue.  Ranges from milliseconds (typical) to minutes.</ul>
<p>o	In total you should expect to take at least 50% off your promised bandwidth in an idea case.  This brings out 100 Mb/sec connection to more like 50 Mb/sec; but as stated earlier, we are ignoring this.<br />
o	Internet speed is based on more than your connection speed.  The bandwidth of the server is also very important.  You may have sufficient bandwidth but if you request from a server that is slower than your connection, you are stuck with their speed.  I find that a typical website will only transfer up to 50 kb/sec so you will have to download from many different servers at the same time to fill your 100 Mb/sec pipe.</ul>
<p>•	You have enough computing power.  At 100 Mb/sec you are starting to get into the range of IDE hard drive data transfer range.  You will also want to have several threads going at the same time to maximize bandwidth utilization.  You want to download a different webpage while you are waiting on the request for a separate page.  Better yet, you want to keep your bandwidth pipe full even if you hit a slow server or a timeout which can be up to 2 minutes.  I would guess that you would need 150-300 threads or requests going at the same time to meet this demand.  A single computer likely will not be able to do this alone so you would end up with at least 5-10 servers on your end to pull this off.  This of course breaks the idea case of no network congestion or collisions as described earlier.<br />
•	You have enough storage space.  A quick search shows that YouTube alone has around 7.7 petabytes of content (http://beerpla.net/2008/08/14/how-to-find-out-the-number-of-videos-on-youtube/ ).  Newegg is showing 1TB hard drives for around $90.  With the needed hardware and controllers, you are looking at around $100/ TB.  At this rate you will need 7700 1 TB hard drives which would cost you around $770,000.  A related article on BackBlaze (http://blog.backblaze.com/2009/09/01/petabytes-on-a-budget-how-to-build-cheap-cloud-storage/) shows you how to build your own 67 TB 4U rack server for $7,867 including drives and rack hardware.  At the BackBlaze rate, 7.7 PB will cost $904,118 or almost 1 million dollars.</ul>
<p><em>Gottchas</em>
<ul>
•	Connection speeds are measured in BITS and not BYTES.  There are 8 bits to a byte so this means that you need to divide your connection speed by 8 right off the top.  This will make our 100 Mbit/sec connection a 12.5 Mbyte/sec connection.  With typical network delays, this would become 6.25 Mbyte/sec.</ul>
<p>Now let’s do some calculations (whips out trusty TI-89 calculator).</p>
<p>12.5 Mbyte/sec*60 seconds = 750 MB/min<br />
 750 MB/min* 60 mins = 45 GB/hour<br />
45 GB/hour *24 hours = 1080 GB/day or ~1 TB/day (1.08e12)</p>
<p>With the YouTube example above of 7.7 petabytes (10e15)…</p>
<p>7.7e15 Bytes/1.08e12 Bytes/day=7129.63 days<br />
7129.63 days/365 days/year = 19.5332 years</p>
<p><strong>Just downloading the YouTube database with an insane Internet connection will take you almost 20 years and almost 1 million dollars just in hard drive storage.</strong></p>
<p>Hope this answers your question <img src='http://ycoderscookbook.com/blog/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=391</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>YCC Yahoo! Bot Maker 2.4.1 Released</title>
		<link>http://ycoderscookbook.com/blog/?p=388</link>
		<comments>http://ycoderscookbook.com/blog/?p=388#comments</comments>
		<pubDate>Fri, 31 Jul 2009 02:50:27 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Site]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=388</guid>
		<description><![CDATA[I have published the latest version of YCC Yahoo! Bot Maker which is now at version 2.4.1.  This update was necessary because of a change to the Yahoo! registration page which resulted in a “URI Format Exception” error message being displayed in YCC Yahoo Bot Maker.  The only other change is the addition [...]]]></description>
			<content:encoded><![CDATA[<p>I have published the latest version of YCC Yahoo! Bot Maker which is now at version 2.4.1.  This update was necessary because of a change to the Yahoo! registration page which resulted in a “URI Format Exception” error message being displayed in YCC Yahoo Bot Maker.  The only other change is the addition of the version number in the mail title bar.  If you already have version 2.4.0 installed then simply run the program again and the auto updater will do the rest.  You can download the latest version from <a href="http://ycoderscookbook.com/files/YCC Yahoo Bot Maker 2.4.1.zip">http://ycoderscookbook.com/files/YCC Yahoo Bot Maker 2.4.1.zip</a></p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=388</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>YCC Yahoo! Bot Maker 2.4.0 Down</title>
		<link>http://ycoderscookbook.com/blog/?p=386</link>
		<comments>http://ycoderscookbook.com/blog/?p=386#comments</comments>
		<pubDate>Sun, 12 Jul 2009 06:41:52 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Site]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=386</guid>
		<description><![CDATA[The current version of YCC Yahoo! Bot Maker, 2.4.0, is once again not working.  The error message is “URI Format Exception” which is most likely due to another Yahoo! registration page update.  Since most of the bot makers are currently having a lot of problems making accounts I’m not going to rush a [...]]]></description>
			<content:encoded><![CDATA[<p>The current version of YCC Yahoo! Bot Maker, 2.4.0, is once again not working.  The error message is “URI Format Exception” which is most likely due to another Yahoo! registration page update.  Since most of the bot makers are currently having a lot of problems making accounts I’m not going to rush a fix unless someone lets me know they need it.</p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=386</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Forum and Blog Update</title>
		<link>http://ycoderscookbook.com/blog/?p=384</link>
		<comments>http://ycoderscookbook.com/blog/?p=384#comments</comments>
		<pubDate>Fri, 19 Jun 2009 20:57:15 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Site]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=384</guid>
		<description><![CDATA[I updated the forum and blog software to their latest versions so let me know if there are any problems.
]]></description>
			<content:encoded><![CDATA[<p>I updated the forum and blog software to their latest versions so let me know if there are any problems.</p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=384</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Yahoo! Releases Its Own Version of Apache Hadoop</title>
		<link>http://ycoderscookbook.com/blog/?p=382</link>
		<comments>http://ycoderscookbook.com/blog/?p=382#comments</comments>
		<pubDate>Thu, 11 Jun 2009 19:49:25 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Yahoo!]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=382</guid>
		<description><![CDATA[A Slashdot article (http://developers.slashdot.org/story/09/06/10/2142256/Yahoo-Releases-Open-Source-Hadoop-Distribution) points to the fact that Yahoo! will release its own version of the Apache web server.  Yahoo! has used the Hadoop (http://hadoop.apache.org/core/) variety of Apache for years in its search engine and now some of those special modifications can be used under an open source license.  What makes Hadoop [...]]]></description>
			<content:encoded><![CDATA[<p>A Slashdot article (<a href="http://developers.slashdot.org/story/09/06/10/2142256/Yahoo-Releases-Open-Source-Hadoop-Distribution">http://developers.slashdot.org/story/09/06/10/2142256/Yahoo-Releases-Open-Source-Hadoop-Distribution</a>) points to the fact that Yahoo! will release its own version of the Apache web server.  Yahoo! has used the Hadoop (<a href="http://hadoop.apache.org/core/">http://hadoop.apache.org/core/</a>) variety of Apache for years in its search engine and now some of those special modifications can be used under an open source license.  What makes Hadoop unique is its ability to process vast amounts of information while in a cluster.  The official Yahoo! announcement can be found at <a href="http://developer.yahoo.net/blogs/hadoop/2009/06/yahoo_distribution_of_hadoop.html">http://developer.yahoo.net/blogs/hadoop/2009/06/yahoo_distribution_of_hadoop.html</a>.</p>
<p>The facts as given are not directly related to Yahoo! Messenger but this may give an indication as to what server software Yahoo! is using for Messenger.  To my knowledge Yahoo! has never stated what is running Messenger on the server side of things but I have seen an Apache signature many times.</p>
<p><a href="http://developer.yahoo.com/hadoop">http://developer.yahoo.com/hadoop</a>/<br />
<a href="http://ostatic.com/blog/at-hadoop-summit-yahoo-announces-its-tested-distribution">http://ostatic.com/blog/at-hadoop-summit-yahoo-announces-its-tested-distribution</a></p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=382</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>YCC Yahoo! Bot Maker 2.4.0 Released</title>
		<link>http://ycoderscookbook.com/blog/?p=379</link>
		<comments>http://ycoderscookbook.com/blog/?p=379#comments</comments>
		<pubDate>Fri, 01 May 2009 04:31:29 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Site]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=379</guid>
		<description><![CDATA[Due to a change to the Yahoo! registration page, YCC Yahoo! Bot Maker is now at version 2.4.0.  This version fixes the “unknown” error message displayed after a CAPTCHA has been entered.  Another minor change is that random domains are now turned on my default.  All previous versions will have to be [...]]]></description>
			<content:encoded><![CDATA[<p>Due to a change to the Yahoo! registration page, YCC Yahoo! Bot Maker is now at version 2.4.0.  This version fixes the “unknown” error message displayed after a CAPTCHA has been entered.  Another minor change is that random domains are now turned on my default.  All previous versions will have to be updated in order to create accounts again.  The good news is if you have version 2.3.1, you can now take advantage of the Update Now feature.  You can download YCC Yahoo! Bot Maker 2.4.0 directly from <a href="http://ycoderscookbook.com/files/YCC Yahoo Bot Maker 2.4.0.zip">http://ycoderscookbook.com/files/YCC Yahoo Bot Maker 2.4.0.zip</a> or visit the YCC Yahoo! Bot Maker website at <a href="http://ycoderscookbook.com/files/YCC Yahoo Bot Maker 2.4.0.zip">http://ycoderscookbook.com/code/YCC_Yahoo_Bot_Maker.html</a> for more information.</p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=379</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>YCC Yahoo! Bot Maker 2.3.1 Down</title>
		<link>http://ycoderscookbook.com/blog/?p=377</link>
		<comments>http://ycoderscookbook.com/blog/?p=377#comments</comments>
		<pubDate>Thu, 30 Apr 2009 21:05:59 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Site]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=377</guid>
		<description><![CDATA[It arrears that YCC Yahoo! Bot Maker is once again suffering from downtime.  Preliminary analysis points to another change on the official Yahoo! registration page which is causing each name to be reported as made but showing an “unknown” error message.  I will take a look at the code tonight to see if [...]]]></description>
			<content:encoded><![CDATA[<p>It arrears that YCC Yahoo! Bot Maker is once again suffering from downtime.  Preliminary analysis points to another change on the official Yahoo! registration page which is causing each name to be reported as made but showing an “unknown” error message.  I will take a look at the code tonight to see if a quick fix can be issued.  If the fix is not issued within the next 12 hours then you will have to wait till after the weekend because I am going out of town.</p>
<p>P.S.  It looks like Yahoo! has been particularly active over the past few months.  This is the third fix I have had to issue in as much time.</p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=377</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>YCC Yahoo! Bot Maker 2.3.1 Released</title>
		<link>http://ycoderscookbook.com/blog/?p=373</link>
		<comments>http://ycoderscookbook.com/blog/?p=373#comments</comments>
		<pubDate>Sat, 25 Apr 2009 02:47:27 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Site]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=373</guid>
		<description><![CDATA[This is not a major release of YCC Yahoo! Bot Maker but you should still download and update your current version if you already have YCC Yahoo! Bot Maker.  The biggest change is to the automatic update system which should work now.  Since this is the fix, you will still have to manually [...]]]></description>
			<content:encoded><![CDATA[<p>This is not a major release of YCC Yahoo! Bot Maker but you should still download and update your current version if you already have YCC Yahoo! Bot Maker.  The biggest change is to the automatic update system which should work now.  Since this is the fix, you will still have to manually download and install this version.  The latest version can be found at <a href="http://ycoderscookbook.com/files/YCC Yahoo Bot Maker 2.3.1.zip">http://ycoderscookbook.com/files/YCC Yahoo Bot Maker 2.3.1.zip</a></p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=373</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Spamming Can Solve Some of The World’s Problems</title>
		<link>http://ycoderscookbook.com/blog/?p=369</link>
		<comments>http://ycoderscookbook.com/blog/?p=369#comments</comments>
		<pubDate>Mon, 20 Apr 2009 22:32:04 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[General]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=369</guid>
		<description><![CDATA[A story on Slashdot entitled “Looking To Spammers To Solve Hard AI Problems” (http://tech.slashdot.org/article.pl?sid=09/04/19/024213#)points to an article in the New Scientist (http://www.newscientist.com/article/dn16973-innovation-harnessing-spammers-to-advance-ai.html) detailing how solving CAPTCHAs (those impossible to type word images) to spread SPAM can be a good thing.  Although SPAM is a nuisance, the act of solving CAPTCHAs automatically is a tough [...]]]></description>
			<content:encoded><![CDATA[<p>A story on Slashdot entitled “Looking To Spammers To Solve Hard AI Problems” (<a href="http://tech.slashdot.org/article.pl?sid=09/04/19/024213#">http://tech.slashdot.org/article.pl?sid=09/04/19/024213#</a>)points to an article in the New Scientist (<a href="http://www.newscientist.com/article/dn16973-innovation-harnessing-spammers-to-advance-ai.html">http://www.newscientist.com/article/dn16973-innovation-harnessing-spammers-to-advance-ai.html</a>) detailing how solving CAPTCHAs (those impossible to type word images) to spread SPAM can be a good thing.  Although SPAM is a nuisance, the act of solving CAPTCHAs automatically is a tough artificial intelligence (AI) problem.  Since the first appearance of CAPTCHAs, spammers have developed new and novel approaches to solving these image puzzles which in turn has helped the larger AI community.  In fact, a current spammer bounty is for half a million dollars to break the reCAPTCHA (<a href="http://recaptcha.net/learnmore.html">http://recaptcha.net/learnmore.html</a>) system. This turns out to be all most five times as lucrative as the top AI academic prize called the Loebner Prize (<a href="http://www.loebner.net/Prizef/loebner-prize.html">http://www.loebner.net/Prizef/loebner-prize.html</a>).  </p>
<p>The next question that some CAPTCHA researchers are proposing is not how to beat the spammers but what technology do they need a boost in.  By making a CAPTCHAs out of currently unreachable problems,  spammers of the future will pave the way for better AI, digital cameras, and other beneficial technologies.</p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=369</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>YCC Yahoo! Bot Maker 2.3.0 Released</title>
		<link>http://ycoderscookbook.com/blog/?p=366</link>
		<comments>http://ycoderscookbook.com/blog/?p=366#comments</comments>
		<pubDate>Sat, 18 Apr 2009 05:05:41 +0000</pubDate>
		<dc:creator>tansqrx</dc:creator>
				<category><![CDATA[Site]]></category>

		<guid isPermaLink="false">http://ycoderscookbook.com/blog/?p=366</guid>
		<description><![CDATA[The latest version of YCC Yahoo! Bot Maker is now at version 2.3.0.  The latest version fixes a problem where accounts were not being created due to a second security question added by Yahoo!  Version 2.3.0 also updated the security questions, fixed a bug that prevented the alternate email from being sent, and [...]]]></description>
			<content:encoded><![CDATA[<p>The latest version of YCC Yahoo! Bot Maker is now at version 2.3.0.  The latest version fixes a problem where accounts were not being created due to a second security question added by Yahoo!  Version 2.3.0 also updated the security questions, fixed a bug that prevented the alternate email from being sent, and made several internal changes.  Apparently the improved update system introduced in version 2.2.2 isn’t quite working as it should (this is the first time it is being used) so you may have to manually download and install the latest version from <a href="http://ycoderscookbook.com/code/YCC_Yahoo_Bot_Maker.html">.</p>
]]></content:encoded>
			<wfw:commentRss>http://ycoderscookbook.com/blog/?feed=rss2&amp;p=366</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
