<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Jimey’s Life &#38;&#38; Jimey’s World &#187; Cloud Computing</title>
	<atom:link href="http://jimey.com/category/c/distributed-system/cc/feed" rel="self" type="application/rss+xml" />
	<link>http://jimey.com</link>
	<description>Learning something new everyday</description>
	<lastBuildDate>Thu, 05 Aug 2010 08:03:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Ubuntu 10.04: compiling hadoop-${version}-eclipse-plugin.jar and hadoop-0.20.3-dev-eclipse-plugin.jar</title>
		<link>http://jimey.com/2010/05/06/ubuntu10-04-compiled-hadoop-version-eclipse-plugin-jar-and-hadoop-0-20-3-dev-eclipse-plugin-jar-download.html</link>
		<comments>http://jimey.com/2010/05/06/ubuntu10-04-compiled-hadoop-version-eclipse-plugin-jar-and-hadoop-0-20-3-dev-eclipse-plugin-jar-download.html#comments</comments>
		<pubDate>Thu, 06 May 2010 07:30:20 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[ant]]></category>
		<category><![CDATA[build.xml]]></category>
		<category><![CDATA[eclipse-plugin]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop-0.20.2]]></category>
		<category><![CDATA[lucid]]></category>
		<category><![CDATA[Ubuntu 10.04]]></category>

		<guid isPermaLink="false">http://jimey.com/2010/05/06/ubuntu10-04-%e7%bc%96%e8%af%91hadoop-version-eclipse-plugin-jar-and-hadoop-0-20-3-dev-eclipse-plugin-jar-download.html</guid>
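		<description><![CDATA[First, the prerequisites: Java 5 and apache-forrest-0.8.  This alone caused a pile of problems: Ubuntu dropped java-5-sun starting with 9.10, so installing Sun's Java 5 JDK (the sun-java5-jdk package) took some extra work.  See this article: http://blog.csdn.net/sunrock/archive/2010/04/29/5542989.aspx  Switch the apt sources to the 9.04 ones, then install sun-java5-jdk.  Next, install apache-forrest-0.8  http://for..... ]]></description>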
			<content:encoded><![CDATA[<p>First, the prerequisites: Java 5 and apache-forrest-0.8.</p>
<p>This alone caused a pile of problems: Ubuntu dropped Java 5 starting with 9.10, so installing Sun's Java 5 JDK (the sun-java5-jdk package) took some extra work.</p>
<p>See this article: <a title="http://blog.csdn.net/sunrock/archive/2010/04/29/5542989.aspx" href="http://blog.csdn.net/sunrock/archive/2010/04/29/5542989.aspx">http://blog.csdn.net/sunrock/archive/2010/04/29/5542989.aspx</a></p>
<p>Switch the apt sources to the 9.04 ones, then install sun-java5-jdk.</p>
<p>Next, install apache-forrest-0.8:</p>
<p><a href="http://forrest.apache.org/mirrors.cgi">http://forrest.apache.org/mirrors.cgi</a> For UNIX operating systems: apache-forrest-0.8.tar.gz<br />
After unpacking, I put it in /home/cloud/apache-forrest-0.8</p>
<p>Download hadoop-0.20.2+228.tar.gz and simply unpack it.</p>
<p>Then change three places:</p>
<p>1. Edit $HADOOP_HOME/src/contrib/build-contrib.xml<br />
and add this line: &lt;property name=&quot;eclipse.home&quot; location=&quot;/usr/lib/eclipse&quot;/&gt;</p>
<p>2. Edit $HADOOP_HOME/src/contrib/eclipse-plugin/src/java/org/apache/hadoop/eclipse/launch/HadoopApplicationLaunchShortcut.java<br />
Comment out the original import: //import org.eclipse.jdt.internal.debug.ui.launcher.JavaApplicationLaunchShortcut;<br />
and replace it with: import org.eclipse.jdt.debug.ui.launchConfigurations.JavaApplicationLaunchShortcut;</p>
<p>3. Edit $HADOOP_HOME/build.xml and add these two lines:</p>
<p>&lt;property name=&quot;java5.home&quot; location=&quot;/usr/lib/jvm/java-1.5.0-sun-1.5.0.19/&quot;/&gt;<br />
&lt;property name=&quot;forrest.home&quot; location=&quot;/home/cloud/apache-forrest-0.8/&quot;/&gt;</p>
<p>Then run ant compile</p>
<p>followed by ant package; both should now go through without problems.</p>
<p>The generated Eclipse plugin ends up in $HADOOP_HOME/build/contrib/eclipse-plugin/</p>
<p>Or simply download it from http://hadoop-eclipse-plugin.googlecode.com/files/hadoop-0.20.3-dev-eclipse-plugin.jar</p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2010/05/06/ubuntu10-04-compiled-hadoop-version-eclipse-plugin-jar-and-hadoop-0-20-3-dev-eclipse-plugin-jar-download.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A few slide decks on Twitter, Hadoop, Cassandra, Pig and FlockDB</title>
		<link>http://jimey.com/2010/04/22/several-articles-on-twitter-hadoop-cassandra-pig-flockdbu002639s-slide.html</link>
		<comments>http://jimey.com/2010/04/22/several-articles-on-twitter-hadoop-cassandra-pig-flockdbu002639s-slide.html#comments</comments>
		<pubDate>Thu, 22 Apr 2010 07:02:02 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[distributed]]></category>
		<category><![CDATA[FlockDB]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[HBase]]></category>
		<category><![CDATA[Pig]]></category>
		<category><![CDATA[real-time search]]></category>
		<category><![CDATA[SlideShare]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://jimey.com/2010/04/22/%e5%87%a0%e7%af%87%e5%85%b3%e4%ba%8etwitterhadoopcassandrapigflockdb%e7%9a%84slide.html</guid>
		<description><![CDATA[http://www.slideshare.net/kevinweil/nosql-at-twitter-nosql-eu-2010  http://www.slideshare.net/ryansking/scaling-twitter-with-cassandra  http://www.slideshare.net/nkallen/q-con-3770885  Big Data in Real-Time at Twitter  http://www.slideshare.net/al3x/building-distributed-systems-in-scala  &#160;  They mainly..... ]]></description>
			<content:encoded><![CDATA[<p><a title="http://www.slideshare.net/kevinweil/nosql-at-twitter-nosql-eu-2010" href="http://www.slideshare.net/kevinweil/nosql-at-twitter-nosql-eu-2010">http://www.slideshare.net/kevinweil/nosql-at-twitter-nosql-eu-2010</a></p>
<p><a title="http://www.slideshare.net/ryansking/scaling-twitter-with-cassandra" href="http://www.slideshare.net/ryansking/scaling-twitter-with-cassandra">http://www.slideshare.net/ryansking/scaling-twitter-with-cassandra</a></p>
<p><a title="http://www.slideshare.net/nkallen/q-con-3770885" href="http://www.slideshare.net/nkallen/q-con-3770885">http://www.slideshare.net/nkallen/q-con-3770885</a></p>
<p>Big Data in Real-Time at Twitter</p>
<p><a title="http://www.slideshare.net/al3x/building-distributed-systems-in-scala" href="http://www.slideshare.net/al3x/building-distributed-systems-in-scala">http://www.slideshare.net/al3x/building-distributed-systems-in-scala</a></p>
<p>&#160;</p>
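<p>They mainly analyze how functional requirements can be sensibly matched with what a distributed system is able to deliver.</p>
<p>I think these are very good slide decks for getting a basic picture of Twitter, or rather of the architecture behind it: what each feature and service is built on, and what scale it reaches.</p>
<p>For the details, go and read the slides yourself.</p>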
<p>&#160;</p>
<p>^^^^^^^^^^^^</p>
<p>Also noting down a book title:</p>
<p>Programming Scala (O’Reilly 2009)</p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2010/04/22/several-articles-on-twitter-hadoop-cassandra-pig-flockdbu002639s-slide.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Half-baked ideas for a cloud-based medical information system</title>
		<link>http://jimey.com/2010/04/12/cloud-of-medical-information-systems-immature-idea.html</link>
		<comments>http://jimey.com/2010/04/12/cloud-of-medical-information-systems-immature-idea.html#comments</comments>
		<pubDate>Mon, 12 Apr 2010 13:10:41 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[ERP]]></category>
		<category><![CDATA[hdfs]]></category>
		<category><![CDATA[HIS]]></category>
		<category><![CDATA[IC medical record card]]></category>
		<category><![CDATA[key/value]]></category>
		<category><![CDATA[OA]]></category>
		<category><![CDATA[OpenSource]]></category>
		<category><![CDATA[Private Cloud]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[Information systems]]></category>
		<category><![CDATA[Inclination]]></category>
		<category><![CDATA[Doctors]]></category>
		<category><![CDATA[Healthcare]]></category>
		<category><![CDATA[Hospitals]]></category>
		<category><![CDATA[Seller's market]]></category>
		<category><![CDATA[Preferences]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Security]]></category>
		<category><![CDATA[Workstation]]></category>
		<category><![CDATA[Image archive]]></category>
		<category><![CDATA[Cost]]></category>
		<category><![CDATA[Technological revolution]]></category>
		<category><![CDATA[Investment]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Diskless]]></category>
		<category><![CDATA[Concept]]></category>
		<category><![CDATA[Desktop]]></category>
		<category><![CDATA[Licensed software]]></category>
		<category><![CDATA[Green IT]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[High efficiency]]></category>

		<guid isPermaLink="false">http://jimey.com/2010/04/12/%e4%ba%91%e7%ab%af%e5%8c%bb%e7%96%97%e4%bf%a1%e6%81%af%e7%b3%bb%e7%bb%9f%e7%9a%84%e4%b8%8d%e6%88%90%e7%86%9f%e6%9e%84%e6%80%9d.html</guid>
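		<description><![CDATA[I actually wanted to title this "Moving medical information systems to cloud computing" or "Cloud computing for medical informatization",  but neither quite says what I mean, so I am leaving all three titles here.  Last night, when I should have been sleeping, I went back over an earlier blog post of mine, "The process of seeing a doctor in a Korean hospital (may not be a typical case)",  and thought about moving medical systems to the cloud. I lay in bed thinking for about an hour and then got up again.   Cloud computing for a hospital information system differs from the existing..... ]]></description>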
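			<content:encoded><![CDATA[<p>I actually wanted to title this "Moving medical information systems to cloud computing" or "Cloud computing for medical informatization",</p>
<p>but neither quite says what I mean, so I am leaving all three titles here.</p>
<p>Last night, when I should have been sleeping, I went back over an earlier blog post of mine, <a href="http://jimey.com/2009/12/03/south-korean-hospital-processes-may-not-be-common-examples-of.html">The process of seeing a doctor in a Korean hospital (may not be a typical case)</a>,</p>
<p>and thought about moving medical systems to the cloud. I lay in bed thinking for about an hour and then got up again.</p>
<p>Cloud computing for a hospital information system differs from the existing OA, ERP and SQL-database model: it is purely web-based.</p>
<p>This afternoon I discussed concrete ways of implementing such a cloud with <a href="http://blog.it580.com/" target="_blank">davidpanchina</a>.</p>
<p>Only then did I understand: doctor workstations have no market in China, because the Ministry of Health does not recognize electronic medical records. What exists in China is HIS plus an image archive, and HIS is used mainly for registration, billing, the pharmacy and the finance system.</p>
<p>My earlier understanding was this: cloud-based OA abroad is already fairly mature. As I saw it, today's hospital information systems are all desktop software on the doctor's side plus a remote DB, with the desktop software mostly written in VB, VC, .NET and the like. With a cloud, both the application logic and the DB live entirely in the cloud, and the doctor's side only uses the web to talk to it.</p>
<p>Sure enough, things look different once you get to China. From what I have observed so far, though, Korea should be able to do it this way.</p>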
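<p>There are several issues:</p>
<p>1. Computer literacy: some doctors in China cannot use a computer, whereas Korean hospitals are already largely computerized.</p>
<p>2. Data security and data privacy.</p>
<p>3. Linking the IC medical record card to the whole system.</p>
<p>4. Diskless-workstation desktops for doctors (which can save some cost).</p>
<p>5. Investment versus return: hospitals in China are probably still a seller's market, so how willing are they to invest in IT, and what do they lean towards?</p>
<p>6. Unifying and coordinating data across hospitals. Even if the data is not uniform, this is actually easy to handle for a cloud database: it is all key-value storage anyway, so if one record is missing a few values it is no big deal; just match up whatever can be matched.</p>
<p>7. If a private cloud is used inside the hospital to solve the data privacy problem, how does that differ from the existing server cluster? Answer: with open-source software there may be a price advantage.</p>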
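<p>Point 6 above can be sketched roughly as follows. This is only a minimal illustration under stated assumptions, not a real hospital schema: the class name RecordMerge, the field names and the plain in-memory Map standing in for a cloud key-value store are all made up for the example. Each hospital's record is treated as a key-value map; keys present on either side are kept, and a field that one hospital does not have is simply absent rather than an error.</p>

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: each hospital's patient record is a flat key-value map.
class RecordMerge {

    // Merge two records for the same patient. Keys found in either record are
    // kept; when both records carry the same key, the primary record wins.
    static Map<String, String> merge(Map<String, String> primary,
                                     Map<String, String> secondary) {
        Map<String, String> merged = new LinkedHashMap<>(secondary);
        merged.putAll(primary); // primary values overwrite secondary ones
        return merged;
    }

    public static void main(String[] args) {
        Map<String, String> hospitalA = new LinkedHashMap<>();
        hospitalA.put("name", "patient-001");
        hospitalA.put("bloodType", "A");

        Map<String, String> hospitalB = new LinkedHashMap<>();
        hospitalB.put("name", "patient-001");
        hospitalB.put("allergy", "penicillin"); // hospital A has no such field

        // The merged record carries all three keys.
        System.out.println(merge(hospitalA, hospitalB));
    }
}
```

<p>Letting one side win on conflicts is just the simplest possible policy; a real system would need a rule for conflicting values, which the post does not discuss.</p>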
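<p>Now, on to the image archive:</p>
<p>1. It is really just a storage system. Put bluntly, to move it towards the cloud it could be built on HDFS, pitting distributed storage against the fault-tolerance mechanism of today's server clusters.</p>
<p>^^^^^^^^^^^^^^^^</p>
<p>In the end, the discussion did not reach any particular conclusion.</p>
<p>Personally, I mainly want to tie this to a few ideas: 1. cloud, 2. green IT, 3. technological revolution, 4. cost, 5. high efficiency.</p>
<p>We did not say much about which approach is better or worse. My point is only that the cloud can meet the future needs of medical information systems. China may not need this yet, but I believe that will change after a period of development: once doctors can generally operate a computer on their own, patients can operate IC card readers and self-service top-up machines on their own, hospital management wants to raise operational efficiency and leans more towards technology, and properly licensed software becomes the norm (today's products are of course much cheaper to run if they use pirated software).</p>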
<p>&#160;</p>
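<p>The hospital of the future as I picture it:</p>
<p>Outpatient department first:</p>
<p>1. When a patient enters the hospital, a few guides explain the process and help patients who cannot use the self-service registration machines.</p>
<p>2. The patient registers with the IC medical record card at a self-service machine at the outpatient front desk (similar to today's self-service top-up machines for subway IC cards). The aim is to solve the problem of too few registration windows and long queues.</p>
<p>3. On seeing the patient, the outpatient doctor inserts the patient's IC medical record card into a reader, logs in to the cloud from a diskless workstation, makes a diagnosis and stores it in the cloud database. If further lab tests are needed, the patient is sent for them.</p>
<p>4. The patient takes the IC card to the lab and inserts it. For a blood test, say, the patient can leave as soon as the blood is drawn; the lab tells the patient at what time to check the result at a dedicated terminal.</p>
<p>5. The lab stores all test results in the cloud database and notifies the outpatient doctor, who then makes a judgment based on the results: writing a prescription, asking for another visit, or ordering other tests.</p>
<p>6. At the given time, the patient brings the IC card to one of the hospital's self-service card terminals to look up the test results and the doctor's opinion or prescription, print the results, and so on.</p>
<p>7. If there is a prescription, the patient can print it and buy the medicine at an outside pharmacy, or skip the printing and simply hand over the card at the hospital's networked prescription window.</p>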
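<p>The data flow in steps 3 to 6 can be sketched as below. This is a toy model under stated assumptions, not a design from the discussion: CloudRecordStore, the card ID format and the note strings are all hypothetical, and an in-memory map stands in for the cloud database. The only point it illustrates is that every party (doctor, lab, patient terminal) reads and writes the same record, keyed by the ID on the IC card.</p>

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical in-memory stand-in for the cloud database described above.
// Every entry is filed under the ID stored on the patient's IC card.
class CloudRecordStore {
    private final Map<String, List<String>> entriesByCard = new HashMap<>();

    // Append one note (diagnosis, lab result, prescription) to a patient's record.
    void append(String cardId, String author, String note) {
        entriesByCard.computeIfAbsent(cardId, id -> new ArrayList<>())
                     .add(author + ": " + note);
    }

    // Return a copy of everything recorded for this card, oldest first.
    List<String> read(String cardId) {
        return new ArrayList<>(entriesByCard.getOrDefault(cardId, new ArrayList<>()));
    }

    public static void main(String[] args) {
        CloudRecordStore cloud = new CloudRecordStore();
        String cardId = "IC-0001"; // made-up card ID

        // Step 3: the outpatient doctor stores a diagnosis.
        cloud.append(cardId, "outpatient doctor", "initial diagnosis, blood test ordered");
        // Step 5: the lab stores the result, then the doctor responds to it.
        cloud.append(cardId, "lab", "blood test result stored");
        cloud.append(cardId, "outpatient doctor", "prescription issued");

        // Step 6: the patient inserts the card at a self-service terminal.
        for (String entry : cloud.read(cardId)) {
            System.out.println(entry);
        }
    }
}
```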
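<p>Inpatient department:</p>
<p>1. Doctors no longer go from bed to bed carrying paper records; a handheld device is enough, such as an iPad or any kind of tablet notebook.</p>
<p>2. Each patient has an RFID tag, a barcode wristband or some other quick means of identification; the ward doctor only needs a handheld device that can read it.</p>
<p>3. The ward doctor can write data into the cloud database from the handheld device according to the patient's condition.</p>
<p>…………………</p>
<p>Of course these are all half-baked ideas, and many of their preconditions may well be wrong in practice; I just wanted to brainstorm.</p>
<p>Finally, I have held back a little in this post. If you would like to continue the discussion, please leave your thoughts and views.</p>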
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2010/04/12/cloud-of-medical-information-systems-immature-idea.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Karmasphere Studio for Hadoop</title>
		<link>http://jimey.com/2009/12/02/karmasphere-studio-for-hadoop.html</link>
		<comments>http://jimey.com/2009/12/02/karmasphere-studio-for-hadoop.html#comments</comments>
		<pubDate>Wed, 02 Dec 2009 06:21:42 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Cluster]]></category>
		<category><![CDATA[graphical]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[IDE]]></category>
		<category><![CDATA[Karmasphere]]></category>
		<category><![CDATA[MapReduce]]></category>
		<category><![CDATA[NetBeans]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/12/02/karmasphere-studio-for-hadoop.html</guid>
		<description><![CDATA[Karmasphere Studio for Hadoop is a    MapReduce development environment (IDE) based on NetBeans. It enables you to rapidly prototype MapReduce jobs without the need for a cluster, and to deploy, monitor and debug jobs on Hadoop clusters or Amazon Elastic MapReduce clusters, all from within a graphic..... ]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.hadoopstudio.org/screenshots/composite-0.png"><img src="http://www.hadoopstudio.org/screenshots/composite-0-s.png" alt="Sample screenshot" width="350" height="241" /></a><a href="http://www.hadoopstudio.org/screenshots/composite-0.png"> </a></p>
<p>Karmasphere Studio for Hadoop is a <a href="http://en.wikipedia.org/wiki/MapReduce">MapReduce</a> development environment (IDE) based on <a href="http://jimey.com/tag/netbeans" class="st_tag internal_tag" rel="tag" title="Posts tagged with NetBeans">NetBeans</a>. It enables you to rapidly prototype MapReduce jobs without the need for a cluster, and to deploy, monitor and debug jobs on Hadoop clusters or Amazon Elastic MapReduce clusters, all from within a graphical desktop IDE.</p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/12/02/karmasphere-studio-for-hadoop.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Eclipse + Hadoop: developing in an all-Linux environment</title>
		<link>http://jimey.com/2009/10/09/eclipse-hadoop-full-linux-environment-for-development.html</link>
		<comments>http://jimey.com/2009/10/09/eclipse-hadoop-full-linux-environment-for-development.html#comments</comments>
		<pubDate>Fri, 09 Oct 2009 09:42:58 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[Eclipse]]></category>
		<category><![CDATA[eclipse-plugin]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop-0.20.1-eclipse-plugin.jar]]></category>
		<category><![CDATA[Ubuntu]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/10/09/eclipsehadoop-%e5%85%a8linux%e7%8e%af%e5%a2%83%e5%bc%80%e5%8f%91.html</guid>
		<description><![CDATA[http://trac.nchc.org.tw/cloud/wiki/waue/2009/0617  This article already covers the topic very well.  With Eclipse 3.3 there are no problems so far; running that WordCount via Run As Hadoop produces an error message:     09/10/09 17:17:25 WARN conf.Configuration: DEPRECATED: hadoop-site.xml found in the classpath. Usage of hadoop-site.xml is deprecated. I..... ]]></description>
			<content:encoded><![CDATA[<p><a title="http://trac.nchc.org.tw/cloud/wiki/waue/2009/0617" href="http://trac.nchc.org.tw/cloud/wiki/waue/2009/0617">http://trac.nchc.org.tw/cloud/wiki/waue/2009/0617</a></p>
<p>This article already covers the topic very well.</p>
<p>With Eclipse 3.3 there are no problems so far; running that WordCount via Run As Hadoop produces an error message:</p>
<p><a href="http://jimey.com/blog/wp-content/uploads/2009/10/hadoop.jpg"><img style="border-right-width: 0px; display: inline; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="hadoop" src="http://jimey.com/blog/wp-content/uploads/2009/10/hadoop_thumb.jpg" border="0" alt="hadoop" width="622" height="484" /></a></p>
<p>09/10/09 17:17:25 WARN conf.Configuration: DEPRECATED: hadoop-site.xml found in the classpath. Usage of hadoop-site.xml is deprecated. Instead use core-site.xml, mapred-site.xml and hdfs-site.xml to override properties of core-default.xml, mapred-default.xml and hdfs-default.xml respectively<br />
Usage: wordcount &lt;in&gt; &lt;out&gt;</p>
<p>This is a problem in the code and can be ignored for now.</p>
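<p>For context, "Usage: wordcount &lt;in&gt; &lt;out&gt;" is the message the stock Hadoop WordCount example prints when it is not given exactly two arguments. One plausible cause here (an assumption; the post does not confirm it) is that the Eclipse run configuration passed no input and output paths. The sketch below imitates only that argument check; WordCountArgs and its check method are hypothetical names and no Hadoop classes are involved.</p>

```java
// Hypothetical stand-in for the argument check at the top of a WordCount driver.
class WordCountArgs {

    // Returns the usage message when the argument count is wrong, else null.
    static String check(String[] args) {
        if (args.length != 2) {
            return "Usage: wordcount <in> <out>";
        }
        return null;
    }

    public static void main(String[] args) {
        String problem = check(args);
        if (problem != null) {
            System.err.println(problem);
            System.exit(2); // non-zero exit status signals the bad invocation
        }
        System.out.println("input=" + args[0] + " output=" + args[1]);
    }
}
```

<p>If that is indeed the cause, filling in the program arguments of the run configuration with an input and an output path should make the message go away.</p>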
<p>The article also provides this Makefile:</p>
<blockquote>
<pre># Jar to build, main class to run, and local directory for downloaded results.
JarFile="sample-0.1.jar"
MainFunc="Sample.WordCount"
LocalOutDir="/tmp/output"

# None of these targets produce a file of the same name.
.PHONY: all jar run clean output help

all: help

# Recipe lines below must start with a tab character, not spaces.
jar:
	jar -cvf ${JarFile} -C bin/ .

run:
	hadoop jar ${JarFile} ${MainFunc} input output

clean:
	hadoop fs -rmr output

output:
	rm -rf ${LocalOutDir}
	hadoop fs -get output ${LocalOutDir}
	gedit ${LocalOutDir}/part-r-00000 &amp;

help:
	@echo "Usage:"
	@echo " make jar     - Build Jar File."
	@echo " make clean   - Clean up Output directory on HDFS."
	@echo " make run     - Run your MapReduce code on Hadoop."
	@echo " make output  - Download and show output file"
	@echo " make help    - Show Makefile options."
	@echo " "
	@echo "Example:"
	@echo " make jar; make run; make output; make clean"</pre>
</blockquote>
<pre>This Makefile is very handy; when copying it, make sure the recipe lines are indented with tabs.</pre>
<pre>After that, building and running work fine in both Eclipse 3.3 and 3.4.</pre>
<pre>A few problems remain: with Eclipse 3.4, hadoop-0.20.0-eclipse-plugin.jar fails on my machine (the plugin is not picked up),
and with hadoop-0.19.2-eclipse-plugin.jar I cannot connect to the local Hadoop service.</pre>
<pre>Luckily the hadoop-0.20.1-eclipse-plugin.jar I just compiled works. The funny part is that with it,
after adding the MapReduce perspective the Map/Reduce Locations view does not show up and has to be added by hand.
I do not know whether this is just me or happens to everyone.</pre>
<pre><a href="http://jimey.com/blog/wp-content/uploads/2009/10/hadoop2.jpg"><img style="border-right-width: 0px; display: inline; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px" title="hadoop2" src="http://jimey.com/blog/wp-content/uploads/2009/10/hadoop2_thumb.jpg" border="0" alt="hadoop2" width="622" height="484" /></a></pre>
<pre>Both setups can now connect to my Hadoop service, and uploaded files can be browsed from within Eclipse.</pre>
<pre>With Eclipse 3.3 I use the 0.20.0 eclipse-plugin.</pre>
<pre>With Eclipse 3.4 I use the 0.20.1 plugin I compiled myself. Download: <a href="http://code.google.com/p/hadoop-eclipse-plugin/downloads/list">hadoop-0.20.1-eclipse-plugin.jar</a></pre>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/10/09/eclipse-hadoop-full-linux-environment-for-development.html/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Compiling hadoop-${version}-eclipse-plugin.jar, and a hadoop-0.20.1-eclipse-plugin.jar download</title>
		<link>http://jimey.com/2009/10/09/compiled-hadoop-version-eclipse-plugin-jar-and-hadoop-0-20-1-eclipse-plugin-jar-download.html</link>
		<comments>http://jimey.com/2009/10/09/compiled-hadoop-version-eclipse-plugin-jar-and-hadoop-0-20-1-eclipse-plugin-jar-download.html#comments</comments>
		<pubDate>Fri, 09 Oct 2009 07:00:23 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[ant]]></category>
		<category><![CDATA[build.xml]]></category>
		<category><![CDATA[eclipse-plugin]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google Code]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop-0.20.0]]></category>
		<category><![CDATA[hadoop-0.20.1]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/10/09/%e7%bc%96%e8%af%91hadoop-version-eclipse-plugin-jar.html</guid>
		<description><![CDATA[The latest Hadoop 0.20.1 package no longer ships the Eclipse plugin.  The 0.20.0 plugin I had been using occasionally misbehaved,  so I decided to compile it myself.  According to http://wiki.apache.org/hadoop/EclipsePlugIn:  To build the Eclipse plug-in, you need the Hadoop source files and a working Eclipse environment (version 3.3+). When compiling Hadoop, the Eclip..... ]]></description>
			<content:encoded><![CDATA[<p>The latest Hadoop 0.20.1 package no longer ships the Eclipse plugin.</p>
<p>The 0.20.0 plugin I had been using occasionally misbehaved,</p>
<p>so I decided to compile it myself.</p>
<p>According to <a title="http://wiki.apache.org/hadoop/EclipsePlugIn" href="http://wiki.apache.org/hadoop/EclipsePlugIn">http://wiki.apache.org/hadoop/EclipsePlugIn</a>:</p>
<p>To build the Eclipse plug-in, you need the Hadoop source files and a working Eclipse environment (version 3.3+). When compiling Hadoop, the Eclipse plug-in will be built if it finds the Eclipse environment path in the <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> property &quot;eclipse.home&quot;. The build framework looks for this property in ${hadoop-src-root}/src/contrib/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>/build.properties and in $HOME/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>.build.properties. </p>
<p>A typical $HOME/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>.build.properties file would contain the following entry: eclipse.home=/path/to/eclipse </p>
<p>Then the plug-in should be built when compiling Hadoop: <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> clean package (from the ${hadoop-src-root} directory), which will produce ${hadoop-src-root}/build/contrib/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>/hadoop-${version}-<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>.jar </p>
<p>To install the generated plug-in in your Eclipse environment, remove first all previous versions of the plug-in from your Eclipse environment and copy the hadoop-${version}-<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>.jar file generated as described above in your ${eclipse.home}/<a href="http://jimey.com/tag/plugins" class="st_tag internal_tag" rel="tag" title="Posts tagged with plugins">plugins</a>/ directory. When you restart Eclipse, the Map/Reduce perspective should be available. </p>
<p>So all that is needed is to add the Eclipse path to <a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>.build.properties. I could not find an <a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>.build.properties file, only build.properties, so I added the Eclipse install path there.</p>
<p>Then I ran <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> clean package in the Hadoop root directory to build it (this needs to load some classpath entries). Can the plugin really not be built with <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> on its own?? There is also a build.xml under the eclipse-plugin directory, but running <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> compile there fails on the classpath and produces more than 60 errors…</p>
<p>So for now this is rather puzzling..</p>
<p>&#160;</p>
<p>update:</p>
<p>I tried a different approach: instead of following the wiki page, I skipped the <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> clean package command.</p>
<p>First, run <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> compile in the Hadoop root directory.</p>
<p>Then go to src/contrib/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>/</p>
<p>and run <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> compile followed by <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> jar.</p>
<p>At this point the build reports:</p>
<p>BUILD FAILED    <br />/home/hadoop/hadoop-0.20.1/src/contrib/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>/build.xml:69: Warning: Could not find file /home/hadoop/hadoop-0.20.1/build/hadoop-${version}-core.jar to copy.</p>
<p>Next, edit build-contrib.xml in src/contrib/ and add: </p>
<p>&lt;property name=&quot;version&quot; value=&quot;0.20.1&quot;/&gt; </p>
<p>Save the file.</p>
<p>Before making this change I had, out of habit, renamed the hadoop-${version} directory to hadoop; I renamed it back to the original, i.e. from /home/hadoop/hadoop/ to /home/hadoop/hadoop-0.20.1/.</p>
<p>Copy hadoop-0.20.1-core.jar from the Hadoop root directory into build/.</p>
<p>Then go back to src/contrib/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>/ and run <a href="http://jimey.com/tag/ant" class="st_tag internal_tag" rel="tag" title="Posts tagged with ant">ant</a> jar; this generates /home/hadoop/hadoop-0.20.1/build/contrib/<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a>/<a href="http://jimey.com/tag/hadoop-0-20-1-eclipse-plugin-jar" class="st_tag internal_tag" rel="tag" title="Posts tagged with hadoop-0.20.1-eclipse-plugin.jar">hadoop-0.20.1-eclipse-plugin.jar</a>.</p>
<p>Download <a href="http://code.google.com/p/hadoop-eclipse-plugin/downloads/list" target="_blank">hadoop-0.20.1-eclipse-plugin.jar</a></p>
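<p>The steps from the update can be collected into a small dry-run script. This is only a sketch under assumptions: the function names plugin_build_plan and expected_plugin_jar are made up for illustration, the paths are the ones used in this post, and the version property still has to be added to build-contrib.xml by hand before the final step.</p>
<pre>
```python
import posixpath


def plugin_build_plan(hadoop_home, version):
    """Return (working_dir, command) pairs mirroring the manual steps in the post.

    Hypothetical helper: it only describes the commands, it does not run them.
    The version property in src/contrib/build-contrib.xml must be added by hand.
    """
    plugin_dir = posixpath.join(hadoop_home, "src", "contrib", "eclipse-plugin")
    core_jar = "hadoop-%s-core.jar" % version
    return [
        # 1. Compile Hadoop itself from the source root.
        (hadoop_home, ["ant", "compile"]),
        # 2. Compile the plugin from its own directory.
        (plugin_dir, ["ant", "compile"]),
        # 3. Put the core jar where the plugin build.xml looks for it.
        (hadoop_home, ["cp", core_jar, posixpath.join("build", core_jar)]),
        # 4. Package the plugin jar.
        (plugin_dir, ["ant", "jar"]),
    ]


def expected_plugin_jar(hadoop_home, version):
    """Path where the post reports the finished plugin jar."""
    return posixpath.join(hadoop_home, "build", "contrib", "eclipse-plugin",
                          "hadoop-%s-eclipse-plugin.jar" % version)


if __name__ == "__main__":
    home = "/home/hadoop/hadoop-0.20.1"
    for cwd, cmd in plugin_build_plan(home, "0.20.1"):
        print("(cd %s; %s)" % (cwd, " ".join(cmd)))
    print("expected output: " + expected_plugin_jar(home, "0.20.1"))
```
</pre>
<p>Running the script only prints the commands in order, so they can be checked against the steps above before being run by hand.</p>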
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/10/09/compiled-hadoop-version-eclipse-plugin-jar-and-hadoop-0-20-1-eclipse-plugin-jar-download.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>hadoop-0.20.0</title>
		<link>http://jimey.com/2009/05/01/hadoop-0-20-0.html</link>
		<comments>http://jimey.com/2009/05/01/hadoop-0-20-0.html#comments</comments>
		<pubDate>Fri, 01 May 2009 05:33:36 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop-0.19.1]]></category>
		<category><![CDATA[hadoop-0.20.0]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/05/01/hadoop-0200.html</guid>
		<description><![CDATA[Performance seems much better.  The same job took at least a minute on 0.19.1;  this one finishes in under 10 seconds…     bin/hadoop jar hadoop-0.20.0-examples.jar wordcount dir dir-out1     09/05/01 14:28:30 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=JobTracker, sessionId=      09/05/01 14:28:30 INFO input.FileInputFormat: To..... ]]></description>
			<content:encoded><![CDATA[<p>Performance seems much better.</p>
<p>The same job took at least a minute on 0.19.1;</p>
<p>this one finishes in under 10 seconds…</p>
<blockquote><p>bin/hadoop jar <a href="http://jimey.com/tag/hadoop-0200" class="st_tag internal_tag" rel="tag" title="Posts tagged with hadoop-0.20.0">hadoop-0.20.0</a>-examples.jar wordcount dir dir-out1     <br />09/05/01 14:28:30 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=JobTracker, sessionId=      <br />09/05/01 14:28:30 INFO input.FileInputFormat: Total input paths to process : 1      <br />09/05/01 14:28:31 INFO mapred.JobClient: Running job: job_local_0001      <br />09/05/01 14:28:31 INFO input.FileInputFormat: Total input paths to process : 1      <br />09/05/01 14:28:31 INFO mapred.MapTask: io.sort.mb = 100      <br />09/05/01 14:28:31 INFO mapred.MapTask: data buffer = 79691776/99614720      <br />09/05/01 14:28:31 INFO mapred.MapTask: record buffer = 262144/327680      <br />09/05/01 14:28:32 INFO mapred.MapTask: Starting flush of map output      <br />09/05/01 14:28:32 INFO mapred.JobClient:&#160; map 0% reduce 0%      <br />09/05/01 14:28:32 INFO mapred.MapTask: Finished spill 0      <br />09/05/01 14:28:32 INFO mapred.TaskRunner: Task:attempt_local_0001_m_000000_0 is done. And is in the process of commiting      <br />09/05/01 14:28:32 INFO mapred.LocalJobRunner:      <br />09/05/01 14:28:32 INFO mapred.TaskRunner: Task &#8216;attempt_local_0001_m_000000_0&#8242; done.      <br />09/05/01 14:28:32 INFO mapred.LocalJobRunner:      <br />09/05/01 14:28:32 INFO mapred.Merger: Merging 1 sorted segments      <br />09/05/01 14:28:32 INFO mapred.Merger: Down to the last merge-pass, with 1 segments left of total size: 267034 bytes      <br />09/05/01 14:28:32 INFO mapred.LocalJobRunner:      <br />09/05/01 14:28:33 INFO mapred.JobClient:&#160; map 100% reduce 0%      <br />09/05/01 14:28:33 INFO mapred.TaskRunner: Task:attempt_local_0001_r_000000_0 is done. 
And is in the process of commiting      <br />09/05/01 14:28:33 INFO mapred.LocalJobRunner:      <br />09/05/01 14:28:33 INFO mapred.TaskRunner: Task attempt_local_0001_r_000000_0 is allowed to commit now      <br />09/05/01 14:28:33 INFO output.FileOutputCommitter: Saved output of task &#8216;attempt_local_0001_r_000000_0&#8242; to dir-out1      <br />09/05/01 14:28:33 INFO mapred.LocalJobRunner: reduce &gt; reduce      <br />09/05/01 14:28:33 INFO mapred.TaskRunner: Task &#8216;attempt_local_0001_r_000000_0&#8242; done.      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160; map 100% reduce 100%      <br />09/05/01 14:28:34 INFO mapred.JobClient: Job complete: job_local_0001      <br />09/05/01 14:28:34 INFO mapred.JobClient: Counters: 14      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160; FileSystemCounters      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; FILE_BYTES_READ=551968      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; HDFS_BYTES_READ=1383924      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; FILE_BYTES_WRITTEN=569086      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; HDFS_BYTES_WRITTEN=515816      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160; Map-Reduce Framework      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Reduce input groups=0      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Combine output records=18042      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Map input records=12761      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Reduce shuffle bytes=0      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Reduce output records=0      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Spilled Records=36084      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; 
Map output bytes=1086784      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Combine input records=109855      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Map output records=109855      <br />09/05/01 14:28:34 INFO mapred.JobClient:&#160;&#160;&#160;&#160; Reduce input records=18042</p>
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/05/01/hadoop-0-20-0.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Combining Hadoop and MySQL</title>
		<link>http://jimey.com/2009/04/28/hadoop-mysql-combination-of.html</link>
		<comments>http://jimey.com/2009/04/28/hadoop-mysql-combination-of.html#comments</comments>
		<pubDate>Tue, 28 Apr 2009 12:09:17 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[DB]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[MySQL]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/04/28/hadoopmysql%e7%bb%93%e5%90%88.html</guid>
		<description><![CDATA[…… Received on the Hadoop mailing list; posted here for readers who are not subscribed. author : Ankur Goel [ankur.goel#corp.aol.com] ……………… hello hadoop users, Recently I had a chance to lead a team building a log-processing system that uses Hadoop and MySQL. The system&#8217;s goal was to process the incoming information as quickly as possible (real time or near real time), and make it [...... ]]></description>
			<content:encoded><![CDATA[<p>……</p>
<p>Received on the Hadoop mailing list;</p>
<p>posted here for readers who are not subscribed.</p>
<p>author : Ankur Goel [ankur.goel#corp.aol.com]</p>
<p>………………</p>
<p>hello hadoop users, </p>
<p>Recently I had a chance to lead a team building a log-processing system that uses Hadoop and MySQL. The system&#8217;s goal was to process the incoming information as quickly as possible (real time or near real time), and make it available for querying in MySQL. I thought it would be good to share the experience and the challenges with the community. Couldn&#8217;t think of a better place than these mailing lists as I am not much of a blogger <img src='http://jimey.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  </p>
<p>The information flow in the system looks something like </p>
<p>[Apache-Servers] -&gt; [Hadoop] -&gt; [MySQL-shards] -&gt; [Query-Tools] </p>
<p>Transferring from Apache-Servers to Hadoop was quite easy as we just had to organize the data in timely buckets (directories). Once that was running smoothly we had to make sure that map-reduce jobs are fired at regular intervals and they pick up the right data. The jobs would then process/aggregate the data and dump the info into MySQL shards from the reducers [we have our own DB partitioning set up]. This is where we hit major bottlenecks [any surprises? <img src='http://jimey.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> ] </p>
<p>The table engine used was InnoDB as there was a need for fast replication and writes but only moderate reads (should eventually support high read rates). The data would take quite a while to load completely, far from being near-real time. And so our optimization journey began. </p>
<p>1. We tried to optimize/tune InnoDB parameters like increasing the buffer pool size to 75% of available RAM. This helped, but only while the DBs were lightly loaded, i.e. InnoDB had sufficient buffer pool to host the data and indexes. </p>
<p>2. We also realized that InnoDB has considerable locking overhead because of which write concurrency is really bad when you have a large number of concurrent threads doing writes. The default thread concurrency for us was set to no_of_cpu * 2 = 8 which is what the official documentation advises as the optimal limit. So we limited the number of reduce tasks and consequently the number of concurrent writes and boy the performance improved 4x. We were almost there <img src='http://jimey.com/blog/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  </p>
<p>3. Next thing we tried is the standard DB optimization techniques like de-normalizing the schema and dropping constraints. This gave only a minor performance improvement, nothing earth shattering. Note that we were already caching connections in reducers to each MySQL shard and partitioning logic was embedded into reducers. </p>
<p>4. Falling still short of our performance objectives, we finally decided to get rid of JDBC writes from reducers and work on an alternative that uses MySQL's LOAD utility. </p>
<p>- The processing would partition the data into MySQL shard specific files resident in HDFS. </p>
<p>- A script would then spawn processes via ssh on different physical machines to download this data. </p>
<p>- Each spawned process just downloads the data for the shard it should upload to. </p>
<p>- All the processes then start uploading data in parallel into their respective MySQL shards using LOAD DATA INFILE. </p>
<p>This proved to be the fastest approach, even in the wake of increasing data loads. The entire processing/loading would complete in less than 6 min. The system has been holding up quite well so far, even though we&#8217;ve had to limit the number of days for which we keep the data or else the MySQLs get overwhelmed. </p>
<p>Hope this is helpful to people. </p>
<p>Regards </p>
<p>-Ankur </p>
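<p>The LOAD-based pipeline from step 4 of the mail could be sketched roughly as follows. The mail does not say how records are assigned to shards, so the CRC32 hash partitioning, the function names, the tab-separated file layout and the table name are all assumptions made for illustration; the only part taken from the mail is that each shard gets its own file, which is then bulk-loaded with MySQL's LOAD DATA statement.</p>
<pre>
```python
import zlib


def shard_for(key, num_shards):
    """Map a record key to a shard index (assumed hash partitioning)."""
    return zlib.crc32(key.encode("utf-8")) % num_shards


def partition_rows(rows, num_shards):
    """Group (key, value) rows into one tab-separated text blob per shard.

    In the setup described in the mail these blobs would be shard-specific
    files in HDFS; here they are plain strings to keep the sketch runnable.
    """
    buckets = [[] for _ in range(num_shards)]
    for key, value in rows:
        buckets[shard_for(key, num_shards)].append("%s\t%s" % (key, value))
    return ["\n".join(lines) for lines in buckets]


def load_statement(path, table):
    """Build the bulk-load statement a per-shard loader process would run."""
    return ("LOAD DATA LOCAL INFILE '%s' INTO TABLE %s "
            "FIELDS TERMINATED BY '\\t'" % (path, table))


if __name__ == "__main__":
    rows = [("host-a", 10), ("host-b", 7), ("host-a", 3)]
    for shard, blob in enumerate(partition_rows(rows, 4)):
        if blob:
            # Hypothetical local file name for the downloaded shard data.
            print(load_statement("/tmp/shard-%d.tsv" % shard, "log_summary"))
```
</pre>
<p>Each loader process would then run only the statement for its own shard, which keeps the number of concurrent writers per MySQL instance small, in line with point 2 of the mail.</p>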
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/04/28/hadoop-mysql-combination-of.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>[ZT]Writing your own real-time search engine and Adwords service</title>
		<link>http://jimey.com/2009/04/16/zt-writing-your-own-real-time-search-engine-and-adwords-service.html</link>
		<comments>http://jimey.com/2009/04/16/zt-writing-your-own-real-time-search-engine-and-adwords-service.html#comments</comments>
		<pubDate>Thu, 16 Apr 2009 13:54:07 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[real-time search]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/04/16/ztwriting-your-own-real-time-search-engine-and-adwords-service.html</guid>
		<description><![CDATA[In some cases this technique is not applicable. For example, imagine the case of eCommerce applications. When you submit a bid on an auction, you expect to see it immediately popping up in the search results. This calls for a rather different architecture than the one typically used for Internet appl..... ]]></description>
			<content:encoded><![CDATA[<p>In some cases this technique is not applicable. For example, imagine the case of eCommerce applications. When you submit a bid on an auction, you expect to see it immediately popping up in the search results. This calls for a rather different architecture than the one typically used for Internet applications. The key is how fast we can put new data into the index server. Sounds simple, right? Well, most index servers are highly optimized for fast read, however they tend to be quite heavy on write operations.</p>
<p>&#160;</p>
<p>[<a href="http://natishalom.typepad.com/nati_shaloms_blog/2009/02/writing-your-own-adwards-service.html" target="_blank">via</a>]</p>
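<p>I stumbled on this by accident, I think while searching for "push". The whole blog is exactly the kind of content I have been looking for, so I will have plenty to read over the next few days.</p>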
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/04/16/zt-writing-your-own-real-time-search-engine-and-adwords-service.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A few things to share and some ideas</title>
		<link>http://jimey.com/2009/04/16/several-shared-and-ideas.html</link>
		<comments>http://jimey.com/2009/04/16/several-shared-and-ideas.html#comments</comments>
		<pubDate>Thu, 16 Apr 2009 04:36:34 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Moods]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Google Reader]]></category>
		<category><![CDATA[mobile search]]></category>
		<category><![CDATA[one login]]></category>
		<category><![CDATA[push]]></category>
		<category><![CDATA[sensor]]></category>
		<category><![CDATA[ubiquitous computing]]></category>
		<category><![CDATA[sharing]]></category>
		<category><![CDATA[ideas]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[takeaways]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/04/16/%e5%87%a0%e4%b8%aa%e5%88%86%e4%ba%ab%e5%92%8c%e6%83%b3%e6%b3%95.html</guid>
		<description><![CDATA[1.push  2.mobile search  http://www.readwriteweb.com/archives/new_mishmash_of_a_mobile_search_engine_getfugu_pre.php  3 one login  http://www.readwriteweb.com/archives/myonelogin_single_sign-on_for_the_cloud.php  Idea 1 is still under investigation, so I can't release more detail yet; in any case there is only so much content built around this word  Idea 2 is quite..... ]]></description>
			<content:encoded><![CDATA[<p>1.push</p>
<p>2.mobile search</p>
<p><a title="http://www.readwriteweb.com/archives/new_mishmash_of_a_mobile_search_engine_getfugu_pre.php" href="http://www.readwriteweb.com/archives/new_mishmash_of_a_mobile_search_engine_getfugu_pre.php">http://www.readwriteweb.com/archives/new_mishmash_of_a_mobile_search_engine_getfugu_pre.php</a></p>
<p>3 one login</p>
<p><a title="http://www.readwriteweb.com/archives/myonelogin_single_sign-on_for_the_cloud.php" href="http://www.readwriteweb.com/archives/myonelogin_single_sign-on_for_the_cloud.php">http://www.readwriteweb.com/archives/myonelogin_single_sign-on_for_the_cloud.php</a></p>
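<p>Idea 1 is still under investigation, so I can't release more detail yet; in any case there is only so much content built around this word.</p>
<p>Idea 2 is a nice one: an LBS-based application combined with a search engine. Compared with earlier ideas it only adds search driven by taking a photo and recognizing the image, which makes things a bit more convenient.</p>
<p>Of course, you could add speech recognition and a whole series of other recognizers; the goal is always to get more done in limited time.</p>
<p>Idea 3 really is a lot like a bank: you deposit your website IDs and passwords in the bank, and whenever you want to visit a site you go through the bank first. But what if the bank goes under? After a long time of not typing my usernames and passwords, I would certainly have forgotten them if I were asked to enter them again. So the bank charges a service fee of 3 US dollars a month to keep itself from going under.</p>
<p>4 What would ubiquitous computing + cloud computing turn into?</p>
<p>Yesterday I kept thinking about that search idea: what would happen with ubiquitous computing + cloud computing?</p>
<p>What if sensors were scattered all over the streets, and phones, cars, people, traffic lights, trees, trash cans and everything else carried one?</p>
<p>Many people wonder: why should I carry a sensor that can only emit and relay messages and brings me no benefit? It is the same question as why I should post on Twitter every day when it brings me nothing useful in return.</p>
<p>Because if everyone contributes and shares a little for others, each person ends up being one of the beneficiaries.</p>
<p>It is like posting seemingly dull, meaningless things on Twitter every day: they may help someone, and even if they don't, they won't affect many people. They only reach your followers, and with search and filtering systems in place they won't affect anyone beyond that.</p>
<p>Back to ubiquitous computing + cloud computing: sensors collect and transmit information and interact with handheld devices. Add powerful cloud computing as the compute engine to analyze the information the sensors provide and to serve requests from all kinds of handheld devices (phones, PDAs, in-car systems, laptops and so on), and the goal of building a strong, harmonious society could be reached ahead of schedule.</p>
<p>Sensors could be used to analyze crowd density and traffic density, to manage trees and green spaces, and to schedule when trash cans get emptied.</p>
<p>Crowd density could also be put to "harmonious" commercial use, for example in marketing, when handing out flyers lol </p>
<p>Traffic density is more than just an indicator. Taken further, it could support driverless cars: sensors detect the distance, speed and intended direction of other vehicles, and GPS, roadside hot spots and the in-car computer work out the optimal route. That would take some seriously impressive AI.</p>
<p>In the end, it is all still about letting people do more within their short lives. Driverless cars and all those optimized designs exist to get things done faster and better with fewer people.</p>
<p>In the end, we are really building a human: the sensors are the eyes, nose, hands, speedometer and so on, the in-car computer or cloud computing is the brain, and the AI is simply a scientific way of arriving at a final choice.</p>
<p>People tend to break down under information overload. For example, when I open my <a href="http://jimey.com/tag/google" class="st_tag internal_tag" rel="tag" title="Posts tagged with Google">google</a> reader every day and see the hundreds of unread updates in the top-left corner, I really want to break down.</p>
<p>I keep imagining a machine that could filter for me: pick out what I will definitely learn from and understand and what satisfies my curiosity, translate the words I don't know, mark as read whatever I have no interest in at all, and temporarily hide what I don't want to read today but might read tomorrow. Then I would not have to spend half an hour or even an hour going through hundreds of updates; I could read everything worth reading in the least possible time, and life would be good.</p>
<p>It comes back to the same point: saving my time is really about letting me do more in the limited time I have.</p>
<p>Someone might say: aren't you wasting your own time by writing a blog? That is the same question as why I should run around the streets with a sensor, or why I should idly post on Twitter. The answer is also the same: when you share your ideas and what you have learned, and everyone ends up doing so, you are really sharing with yourself.</p>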
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/04/16/several-shared-and-ideas.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Debugging Hadoop on a remote Ubuntu machine from Eclipse on Windows</title>
		<link>http://jimey.com/2009/03/26/windows-environment-eclipse-debugging-environment-hadoop-remote-ubuntu.html</link>
		<comments>http://jimey.com/2009/03/26/windows-environment-eclipse-debugging-environment-hadoop-remote-ubuntu.html#comments</comments>
		<pubDate>Thu, 26 Mar 2009 12:32:21 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[Debug]]></category>
		<category><![CDATA[Eclipse]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop-0.19.1]]></category>
		<category><![CDATA[plugins]]></category>
		<category><![CDATA[Ubuntu 8.04]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/03/26/windows%e7%8e%af%e5%a2%83%e4%b8%8beclipse%e8%b0%83%e8%af%95%e8%bf%9c%e7%ab%afubuntu%e7%8e%af%e5%a2%83%e7%9a%84hadoop.html</guid>
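		<description><![CDATA[This link explains it pretty clearly  Highly recommended  I was a bit dumb about the step of setting the input arguments in the Run dialog  I assumed they were directories on the real file system, and only after an error did I realize they should be virtual directories in HDFS...  Add the following two lines to the run method    conf.set(&#34;hadoop.job.ugi&#34;, &#34;hadoop,hadoop&#34;); // user name and group used on the Hadoop server        conf.set(&#34;mapred.syste..... ]]></description>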
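			<content:encoded><![CDATA[<p>This <a href="http://www.beoop.com/?p=244" target="_blank">link</a> explains it pretty clearly.</p>
<p>Highly recommended.</p>
<p>I was a bit dumb about the step of setting the input arguments in the Run dialog.</p>
<p>I assumed they were directories on the real (local) file system; only after an error did I realize they should be virtual directories in HDFS.</p>
<p>Add the following two lines to the run method:</p>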
<div class="source"><pre>conf.set(&quot;hadoop.job.ugi&quot;, &quot;hadoop,hadoop&quot;); // user name and group used on the Hadoop server (the value is user,group, not a password)
conf.set(&quot;mapred.system.dir&quot;, &quot;/home/hadoop/HadoopInstall/tmp/mapred/system/&quot;); // MapReduce system directory
</pre></div>
<p>As described in the Yahoo <a href="http://public.yahoo.com/gogate/hadoop-tutorial/html/module3.html" target="_blank">tutorial</a> mentioned earlier:</p>
<p>Click on the &quot;Advanced&quot; tab. There are two settings here which must be changed.</p>
<p>Scroll down to <tt>hadoop.job.ugi</tt>. It contains your current Windows login credentials. Highlight the first comma-separated value in this list (your username) and replace it with <tt>hadoop-user</tt>.</p>
<p>Next, scroll further down to <tt>mapred.system.dir</tt>. Erase the current value and set it to <tt>/hadoop/mapred/system</tt>.</p>
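<p>I tried to change this parameter in the Eclipse plugin, but <tt>hadoop.job.ugi</tt> simply does not show up on my system, so the connection always fails.</p>
<p>Because it does not show up, uploading files directly from Eclipse does not work for now. That is no big deal; there are other ways, such as svn, put, ftp and so on.</p>
<p>The important thing is that I can finally debug in Eclipse on Windows, without having to put the code on the server before running it.</p>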
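<p>The <tt>hadoop.job.ugi</tt> step from the tutorial (replace the first comma-separated value, keep the rest) can be sketched in plain Java. This is only an illustration: <tt>UgiValue</tt> and <tt>replaceUser</tt> are hypothetical names, not part of Hadoop or the Eclipse plugin, and <tt>jimey,Users</tt> is a made-up stand-in for the Windows credentials.</p>

```java
// Hypothetical helper for illustration; not part of Hadoop or the Eclipse plugin.
final class UgiValue {

    // Replaces the first comma-separated field (the user name) and keeps the groups.
    static String replaceUser(String ugi, String newUser) {
        int comma = ugi.indexOf(',');
        if (comma == -1) {
            return newUser; // no groups listed, only a user name
        }
        return newUser + ugi.substring(comma);
    }

    public static void main(String[] args) {
        // "jimey,Users" is an assumed example of a pre-filled Windows value.
        String updated = replaceUser("jimey,Users", "hadoop-user");
        System.out.println(updated);
        // The result could then be passed to conf.set("hadoop.job.ugi", updated).
    }
}
```

<p>Doing the same thing in code with <tt>conf.set</tt>, as in the two lines above, is one way around a plugin dialog that does not show the setting.</p>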
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/03/26/windows-environment-eclipse-debugging-environment-hadoop-remote-ubuntu.html/feed</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Linux server clustered HDFS: access from Windows eclipse Java application</title>
		<link>http://jimey.com/2009/03/25/linux-server-clustered-hdfs-access-from-windows-eclipse-java-application.html</link>
		<comments>http://jimey.com/2009/03/25/linux-server-clustered-hdfs-access-from-windows-eclipse-java-application.html#comments</comments>
		<pubDate>Wed, 25 Mar 2009 13:17:53 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[Eclipse]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hdfs]]></category>
		<category><![CDATA[plugins]]></category>
		<category><![CDATA[Ubuntu]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/03/25/linux-server-clustered-hdfs-access-from-windows-eclipse-java-application.html</guid>
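		<description><![CDATA[It seems someone else has the same problem and cannot find hadoop.job.ugi  I cannot find it either.. after three hours of trying, which is really strange  I will just use svn to put the files up instead..  The normal configuration is  You will now be asked to fill in a number of parameters identifying the server. To connect to the VMware image, the values are:    Location name: (Any descriptive n..... ]]></description>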
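			<content:encoded><![CDATA[<p>It seems someone else has the same problem: <tt>hadoop.job.ugi</tt> cannot be found.</p>
<p>I cannot find it either. I spent three hours on it, which is really strange.</p>
<p>For now I will just put the files up with svn instead.</p>
<p>The normal configuration is:</p>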
<p>You will now be asked to fill in a number of parameters identifying the server. To connect to the VMware image, the values are:</p>
<pre>Location name: (Any descriptive name you want; e.g., &quot;VMware server&quot;)
Map/Reduce Master Host: (The IP address printed at startup)
Map/Reduce Master Port: 9001
DFS Master Port: 9000
User name: hadoop-user</pre>
<p>Next, click on the &quot;Advanced&quot; tab. There are two settings here which must be changed.</p>
<p>Scroll down to <tt>hadoop.job.ugi</tt>. It contains your current Windows login credentials. Highlight the first comma-separated value in this list (your username) and replace it with <tt>hadoop-user</tt>.</p>
<p>Next, scroll further down to <tt>mapred.system.dir</tt>. Erase the current value and set it to <tt>/hadoop/mapred/system</tt>.</p>
<p>When you are done, click &quot;Finish.&quot; Your server will now appear in the Map/Reduce Locations panel. If you look in the Project Explorer (upper-left corner of Eclipse), you will see that the MapReduce plugin has added the ability to browse HDFS. Click the [+] buttons to expand the directory tree to see any files already there. If you inserted files into HDFS yourself, they will be visible in this tree.</p>
<p>[<a href="http://public.yahoo.com/gogate/hadoop-tutorial/html/module3.html" target="_blank">via</a>]</p>
<p>Strangely, though, I cannot find <tt>hadoop.job.ugi</tt>; it is simply not displayed.</p>
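<p>For reference, the dialog values quoted above can be written down as plain key/value settings. This is a rough sketch under assumptions: it assumes the "DFS Master" fields correspond to <tt>fs.default.name</tt> and the "Map/Reduce Master" fields to <tt>mapred.job.tracker</tt> (the usual keys in Hadoop releases of this era), and <tt>LocationSettings</tt>, the host address and the group name are hypothetical placeholders.</p>

```java
import java.util.Properties;

// Hypothetical helper for illustration; not part of Hadoop or the Eclipse plugin.
final class LocationSettings {

    // Collects the settings that the dialog values are assumed to map to.
    static Properties forCluster(String host, int dfsPort, int jobTrackerPort,
                                 String user, String group) {
        Properties p = new Properties();
        p.setProperty("fs.default.name", "hdfs://" + host + ":" + dfsPort); // DFS Master (assumed mapping)
        p.setProperty("mapred.job.tracker", host + ":" + jobTrackerPort);   // Map/Reduce Master (assumed mapping)
        p.setProperty("hadoop.job.ugi", user + "," + group);                // user name first, then group
        p.setProperty("mapred.system.dir", "/hadoop/mapred/system");        // value given in the tutorial
        return p;
    }

    public static void main(String[] args) {
        // 192.168.0.10 and the group "hadoop-user" are placeholders.
        Properties p = forCluster("192.168.0.10", 9000, 9001, "hadoop-user", "hadoop-user");
        System.out.println("fs.default.name = " + p.getProperty("fs.default.name"));
        System.out.println("mapred.job.tracker = " + p.getProperty("mapred.job.tracker"));
        System.out.println("hadoop.job.ugi = " + p.getProperty("hadoop.job.ugi"));
        System.out.println("mapred.system.dir = " + p.getProperty("mapred.system.dir"));
    }
}
```

<p>Each entry could be applied to a job with <tt>conf.set(key, value)</tt> when the plugin dialog does not expose the setting.</p>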
<p><span id="more-1541"></span></p>
<p>by <a href="http://www.nabble.com/user/UserProfile.jtp?user=1307677">Alberto Forcén</a> Aug 05, 2008; 06:16pm</p>
<p>Hi all.</p>
<p>I&#8217;m running a clustered HDFS on Linux and I need to access files (I/O) from an Eclipse Java application running on Windows. It seems simple, but is it possible?</p>
<p>I have written code using the API but I have a problem: when the code invokes the DistributedFileSystem.initialize() method I receive an exception: java.net.SocketTimeoutException</p>
<pre>String ipStr = &quot;192.168.75.191&quot;;
String portStr = &quot;9000&quot;;
String uriStr = &quot;http://&quot; + ipStr + &quot;:&quot; + portStr;
Configuration conf = new Configuration();
conf.set(&quot;hadoop.job.ugi&quot;, &quot;user,group&quot;); // user and the groups it belongs to
DistributedFileSystem dfs = new DistributedFileSystem();
dfs.initialize(new URI(uriStr), conf);</pre>
<pre>Exception in thread &quot;main&quot; java.net.SocketTimeoutException: timed out waiting for rpc response
    at org.apache.hadoop.ipc.Client.call(Client.java:559)
    at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:212)
    at org.apache.hadoop.dfs.$Proxy0.getProtocolVersion(Unknown Source)
    at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:313)
    at org.apache.hadoop.dfs.DFSClient.createRPCNamenode(DFSClient.java:102)
    at org.apache.hadoop.dfs.DFSClient.&lt;init&gt;(DFSClient.java:178)
    at org.apache.hadoop.dfs.DistributedFileSystem.initialize(DistributedFileSystem.java:68)
    at examples.HadoopDFS.main(HadoopDFS.java:153)</pre>
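<p>When chasing a timeout like this, one cheap first check is whether the NameNode port accepts TCP connections from the Windows machine at all. The sketch below uses only the standard Java socket API; <tt>NameNodeProbe</tt> is a hypothetical name, and the check is an assumption about a useful first step, not something the thread itself proposes. A successful connect only shows that the port is reachable; it says nothing about whether the Hadoop RPC layer on both sides is compatible.</p>

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;

// Hypothetical diagnostic helper; not part of Hadoop.
final class NameNodeProbe {

    // Returns true if a TCP connection to host:port succeeds within the timeout.
    static boolean canConnect(String host, int port, int timeoutMillis) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMillis);
            return true;
        } catch (IOException e) {
            // Covers refused connections, unknown hosts and connect timeouts.
            return false;
        }
    }

    public static void main(String[] args) {
        // Address and port taken from the code quoted above.
        boolean reachable = canConnect("192.168.75.191", 9000, 3000);
        System.out.println("NameNode port reachable: " + reachable);
    }
}
```

<p>If the port is not reachable, a firewall or a NameNode bound to a different interface would be the first things to look at; if it is reachable, the problem is more likely at the RPC level.</p>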
<h4><a name="a18830884">Re: Linux server clustered HDFS: access from Windows eclipse Java application</a></h4>
<p>by <a href="http://www.nabble.com/user/UserProfile.jtp?user=1300241">Qin Gao-2</a> Aug 05, 2008; 10:13pm</p>
<p>I think IBM has a plugin that can access HDFS, I don&#8217;t know whether it contains source code, but maybe it helps.</p>
<p>www.alphaworks.ibm.com/tech/mapreducetools</p>
<h4><a name="a18837468">Confusing NameNodeFailover page in Hadoop Wiki</a></h4>
<p>by <a href="http://www.nabble.com/user/UserProfile.jtp?user=598721">Konstantin Shvachko</a> Aug 06, 2008; 03:42am</p>
<p>I was wandering around the Hadoop wiki and found this page dedicated to name-node failover.</p>
<p><a href="http://wiki.apache.org/hadoop/NameNodeFailover">http://wiki.apache.org/hadoop/NameNodeFailover</a></p>
<p>I think it is confusing, contradicts other documentation on the subject and contains incorrect facts. See</p>
<p><a href="http://hadoop.apache.org/core/docs/current/hdfs_user_guide.html#Secondary+Namenode">http://hadoop.apache.org/core/docs/current/hdfs_user_guide.html#Secondary+Namenode</a></p>
<p><a href="http://wiki.apache.org/hadoop/FAQ#7">http://wiki.apache.org/hadoop/FAQ#7</a></p>
<p>Besides, it contains some kind of discussion.</p>
<p>It is not that I am against discussions; let&#8217;s have them on this list.</p>
<p>But I was trying to understand where all the confusion about secondary-node issues has been coming from lately&#8230;</p>
<p>Imho we need to either correct it or remove it.</p>
<p>Thanks, </p>
<p>&#8211;Konstantin </p>
<h4><a name="a18837686">Re: Linux server clustered HDFS: access from Windows eclipse Java application</a></h4>
<p>by <a href="http://www.nabble.com/user/UserProfile.jtp?user=712397">Michael Bieniosek</a> Aug 06, 2008; 03:56am</p>
<p>Are you sure both ends are running the same version of Hadoop? </p>
<p>On 8/5/08 2:16 AM, &quot;Alberto Forcén&quot; &lt;<a href="http://www.nabble.com/user/SendEmail.jtp?type=post&amp;post=18837686&amp;i=0">otrebla81@&#8230;</a>&gt; wrote: </p>
<p>Hi all. </p>
<p>I&#8217;m running a clustered HDFS on Linux and I need to access files (I/O) from an Eclipse Java application running on Windows. It seems simple, but is it possible? </p>
<p>I have written code using the API, but I have a problem: when the code invokes the DistributedFileSystem.initialize() method, I receive an exception: java.net.SocketTimeoutException </p>
<p>[code] </p>
<p>String ipStr = &quot;192.168.75.191&quot;; </p>
<p>String portStr = &quot;9000&quot;; </p>
<p>String uriStr = &quot;http://&quot; + ipStr + &quot;:&quot; + portStr; </p>
<p>Configuration conf = new Configuration(); </p>
<p>conf.set(&quot;hadoop.job.ugi&quot;, &quot;user,group&quot;); // User and the groups it belongs to </p>
<p>DistributedFileSystem dfs = new DistributedFileSystem(); </p>
<p>dfs.initialize(new URI(uriStr), conf); </p>
<p>[/code] </p>
<p>[trace] </p>
<p>Exception in thread &quot;main&quot; java.net.SocketTimeoutException: timed out waiting for rpc response </p>
<p>at org.apache.hadoop.ipc.Client.call(Client.java:559) </p>
<p>at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:212) </p>
<p>at org.apache.hadoop.dfs.$Proxy0.getProtocolVersion(Unknown Source) </p>
<p>at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:313) </p>
<p>at org.apache.hadoop.dfs.DFSClient.createRPCNamenode(DFSClient.java:102) </p>
<p>at org.apache.hadoop.dfs.DFSClient.&lt;init&gt;(DFSClient.java:178) </p>
<p>at org.apache.hadoop.dfs.DistributedFileSystem.initialize(DistributedFileSystem.java:68) </p>
<p>at examples.HadoopDFS.main(HadoopDFS.java:153) </p>
<p>[/trace] </p>
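<p>A timeout like the one in the trace can come from the network as well as from the RPC layer, so one possible first step is to confirm that the NameNode port accepts TCP connections from the Windows machine at all. The sketch below uses only the JDK. The class <code>NameNodeCheck</code> and its method names are hypothetical and are not part of Hadoop. It also builds the address with the <code>hdfs://</code> scheme that Hadoop conventionally uses for HDFS locations; whether the <code>http://</code> scheme in the snippet above contributes to the timeout is not established by this thread.</p>

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.net.URI;

// Hypothetical helper, not part of Hadoop: plain-JDK checks that could be run
// before calling DistributedFileSystem.initialize().
final class NameNodeCheck {

    // Builds a NameNode address such as hdfs://192.168.75.191:9000.
    static URI nameNodeUri(String host, int port) {
        return URI.create("hdfs://" + host + ":" + port);
    }

    // Returns true if a TCP connection to host:port opens within timeoutMillis.
    // A successful connect says nothing about RPC version compatibility.
    static boolean canConnect(String host, int port, int timeoutMillis) {
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(host, port), timeoutMillis);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Host and port are the values from the question above.
        URI uri = nameNodeUri("192.168.75.191", 9000);
        boolean reachable = canConnect(uri.getHost(), uri.getPort(), 3000);
        System.out.println(uri + " reachable: " + reachable);
    }
}
```

<p>If the port is reachable and the RPC call still times out, a version mismatch between client and cluster, as suggested in the reply, remains a plausible explanation.</p>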
<p><a name="a18855373"></a></p>
<h4><a name="a18855373">Re: Confusing NameNodeFailover page in Hadoop Wiki</a></h4>
<p>by <a href="http://www.nabble.com/user/UserProfile.jtp?user=739">Doug Cutting-4</a> Aug 07, 2008; 01:39am</p>
<p>Konstantin Shvachko wrote:<br />
  <br />&gt; Imho we either need to correct it or remove. </p>
<p>+1 </p>
<p>Doug </p>
<p><a name="a18867760"></a></p>
<h4><a name="a18867760">Re: Confusing NameNodeFailover page in Hadoop Wiki</a></h4>
<p>by <a href="http://www.nabble.com/user/UserProfile.jtp?user=3094">Steve Loughran</a> Aug 07, 2008; 07:34pm</p>
<p>Doug Cutting wrote:<br />
  <br />&gt; Konstantin Shvachko wrote: </p>
<p>&gt;&gt; Imho we either need to correct it or remove. </p>
<p>&gt; </p>
<p>&gt; +1 </p>
<p>&gt; </p>
<p>&gt; Doug </p>
<p>I added some pages there on namenode/jobtracker, etc., linking to the failover doc, which I didn&#8217;t compare to the SVN docs to see what was correct. Perhaps the failover page could be set up to say &quot;you can do some things here&quot; and point to the full docs at SVN or the Hadoop site. </p>
<p>&#8211; </p>
<p>Steve Loughran&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160; <a href="http://www.1060.org/blogxter/publish/5">http://www.1060.org/blogxter/publish/5</a> </p>
<p>Author: Ant in Action&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160; <a href="http://antbook.org/">http://antbook.org/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/03/25/linux-server-clustered-hdfs-access-from-windows-eclipse-java-application.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Configuring an Eclipse-Based Hadoop Application Development Environment</title>
		<link>http://jimey.com/2009/03/25/eclipse-based-application-development-environment-configuration-of-hadoop.html</link>
		<comments>http://jimey.com/2009/03/25/eclipse-based-application-development-environment-configuration-of-hadoop.html#comments</comments>
		<pubDate>Wed, 25 Mar 2009 09:48:03 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Eclipse]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop-0.19.1]]></category>
		<category><![CDATA[plugins]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/03/25/%e5%9f%ba%e4%ba%8eeclipse%e7%9a%84hadoop%e5%ba%94%e7%94%a8%e5%bc%80%e5%8f%91%e7%8e%af%e5%a2%83%e7%9a%84%e9%85%8d%e7%bd%ae.html</guid>
		<description><![CDATA[In the latest Hadoop release, 0.19.1, the directory hadoop-0.19.1\contrib\eclipse-plugin contains a new version of the hadoop-0.19.1-eclipse-plugin plugin. Unfortunately, it does not yet support the latest version of Eclipse, so the only option is to download http://www.eclipse.org/downloads/download.php?file=/technology/epp/downloads/release/europa/winter/eclipse-java-europa-winter-win32.zip and choose ..... ]]></description>
			<content:encoded><![CDATA[<p>In the latest Hadoop release, 0.19.1, the directory</p>
<p>hadoop-0.19.1\contrib\<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a></p>
<p>&#160;</p>
<p>contains a new version of the hadoop-0.19.1-<a href="http://jimey.com/tag/eclipse-plugin" class="st_tag internal_tag" rel="tag" title="Posts tagged with eclipse-plugin">eclipse-plugin</a> plugin.</p>
<p>Unfortunately, it does not yet support the latest version of Eclipse,</p>
<p>so the only option is to download <a title="http://www.eclipse.org/downloads/download.php?file=/technology/epp/downloads/release/europa/winter/eclipse-java-europa-winter-win32.zip" href="http://www.eclipse.org/downloads/download.php?file=/technology/epp/downloads/release/europa/winter/eclipse-java-europa-winter-win32.zip">http://www.eclipse.org/downloads/download.php?file=/technology/epp/downloads/release/europa/winter/eclipse-java-europa-winter-win32.zip</a></p>
<p>and do the configuration with the Europa release of Eclipse.</p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/03/25/eclipse-based-application-development-environment-configuration-of-hadoop.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Big Data: Technologies and Techniques for Large-Scale Data</title>
		<link>http://jimey.com/2009/03/24/big-data-technologies-and-techniques-for-large-scale-data.html</link>
		<comments>http://jimey.com/2009/03/24/big-data-technologies-and-techniques-for-large-scale-data.html#comments</comments>
		<pubDate>Mon, 23 Mar 2009 15:39:01 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Large-Scale Data]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/03/24/big-data-technologies-and-techniques-for-large-scale-data.html</guid>
		<description><![CDATA[[via] ... ]]></description>
			<content:encoded><![CDATA[<p>[<a href="http://radar.oreilly.com/2009/03/big-data-technologies-report.html" target="_blank">via</a>]</p>
<div style="padding-bottom: 0px; margin: 0px; padding-left: 0px; padding-right: 0px; display: inline; float: none; padding-top: 0px" id="scid:53357c8b-5919-4e32-8c25-305d27c17a37:304f29b3-f6a1-4336-9e7f-1d7c0ca625f7" class="wlWriterEditableSmartContent"><embed src="http://www.youtube.com/v/acimvXoKwhc" type="application/x-shockwave-flash" wmode="transparent" width="425" height="350"></embed></div>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/03/24/big-data-technologies-and-techniques-for-large-scale-data.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Zotero 1.5 Beta Released: Join Us In The Clouds</title>
		<link>http://jimey.com/2009/02/25/zotero-1-5-beta-released-join-us-in-the-clouds.html</link>
		<comments>http://jimey.com/2009/02/25/zotero-1-5-beta-released-join-us-in-the-clouds.html#comments</comments>
		<pubDate>Wed, 25 Feb 2009 05:55:59 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Zotero]]></category>

		<guid isPermaLink="false">http://jimey.com/2009/02/25/zotero-15-beta-released-join-us-in-the-clouds.html</guid>
		<description><![CDATA[Zotero 1.5 Beta Released: Join Us In The Clouds   Posted February 23rd, 2009 by Trevor   We are excited to announce two major releases. This morning the Zotero team launched both Zotero 1.5 and the first release of Zotero’s web application. Watch the screencast below for details on 1.5 and see the ..... ]]></description>
			<content:encoded><![CDATA[<h4><a href="http://www.zotero.org/blog/zotero-15-beta-released-join-us-in-the-clouds/">Zotero 1.5 Beta Released: Join Us In The Clouds </a></h4>
<p>Posted February 23rd, 2009 by Trevor </p>
<p>We are excited to announce two major releases. This morning the Zotero team launched both Zotero 1.5 and the first release of Zotero’s web application. Watch the <a href="http://www.youtube.com/watch?v=hLi3ZhUQDhs">screencast</a> below for details on 1.5, and see the list that follows for a brief overview of Zotero’s new features.</p>
<p>&#160;</p>
<div style="padding-bottom: 0px; margin: 0px; padding-left: 0px; padding-right: 0px; display: inline; float: none; padding-top: 0px" id="scid:53357c8b-5919-4e32-8c25-305d27c17a37:c1294172-aadb-40ed-adbf-d1c4cd5c957c" class="wlWriterEditableSmartContent"><embed src="http://www.youtube.com/v/hLi3ZhUQDhs" type="application/x-shockwave-flash" wmode="transparent" width="425" height="350"></embed></div>
<ul>
<li><a href="http://www.zotero.org/support/preferences/sync">Automatic synchronization</a> of collections across multiple computers. For example, sync your PC at work with your Mac laptop and your Linux desktop at home.</li>
<li>Free automatic backup of your library data at Zotero.org.</li>
<li>Automatic synchronization of your attachment files to a server of your choice (e.g. iDisk, Jungle Disk, or university-provided web storage).</li>
<li>Recover recently deleted items with Zotero’s trash can.</li>
<li><a href="http://www.zotero.org/support/notes">Rich-text notes</a></li>
<li><a href="http://www.zotero.org/support/styles">New style manager</a> allowing you to add and delete CSLs and legacy style formats</li>
<li><a href="http://www.zotero.org/support/retrieve_pdf_metadata">Automatic detection of PDF metadata</a> (author, title, etc.)</li>
<li><a href="http://www.zotero.org/support/proxies">Automatic detection and support for proxy servers</a></li>
</ul>
<p>[<a href="http://www.zotero.org/blog/zotero-15-beta-released-join-us-in-the-clouds/" target="_blank">via</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/02/25/zotero-1-5-beta-released-join-us-in-the-clouds.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Cloud Computing defined by Berkeley RAD Labs</title>
		<link>http://jimey.com/2009/02/14/cloud-computing-defined-by-berkeley-rad-labs.html</link>
		<comments>http://jimey.com/2009/02/14/cloud-computing-defined-by-berkeley-rad-labs.html#comments</comments>
		<pubDate>Sat, 14 Feb 2009 07:11:19 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Berkeley]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[definition]]></category>

		<guid isPermaLink="false">http://jimey.cn/2009/02/14/cloud-computing-defined-by-berkeley-rad-labs.html</guid>
		<description><![CDATA[Executive summary:  Cloud Computing, the long-held dream of computing as a utility, has the potential to transform a large part of the IT industry, making software even more attractive as a service and shaping the way IT hardware is designed and purchased. Developers with innovative ideas for new In..... ]]></description>
			<content:encoded><![CDATA[<h4>Executive summary:</h4>
<p>Cloud Computing, the long-held dream of computing as a utility, has the potential to transform a large part of the IT industry, making software even more attractive as a service and shaping the way IT hardware is designed and purchased. Developers with innovative ideas for new Internet services no longer require the large capital outlays in hardware to deploy their service or the human expense to operate it. They need not be concerned about over-provisioning for a service whose popularity does not meet their predictions, thus wasting costly resources, or under-provisioning for one that becomes wildly popular, thus missing potential customers and revenue. Moreover, companies with large batch-oriented tasks can get results as quickly as their programs can scale, since using 1000 servers for one hour costs no more than using one server for 1000 hours. This elasticity of resources, without paying a premium for large scale, is unprecedented in the history of IT. Cloud Computing refers to both the applications delivered as services over the Internet and the hardware and systems software in the datacenters that provide those services. The services themselves have long been referred to as Software as a Service (SaaS). The datacenter hardware and software is what we will call a Cloud. When a Cloud is made available in a pay-as-you-go manner to the general public, we call it a Public Cloud; the service being sold is Utility Computing. We use the term Private Cloud to refer to internal datacenters of a business or other organization, not made available to the general public. Thus, Cloud Computing is the sum of SaaS and Utility Computing, but does not include Private Clouds. People can be users or providers of SaaS, or users or providers of Utility Computing. We focus on SaaS Providers (Cloud Users) and Cloud Providers, which have received less attention than SaaS Users. From a hardware point of view, three aspects are new in Cloud Computing: </p>
<ol>
<li>The illusion of infinite computing resources available on demand, thereby eliminating the need for Cloud Computing users to plan far ahead for provisioning. </li>
<li>The elimination of an up-front commitment by Cloud users, thereby allowing companies to start small and increase hardware resources only when there is an increase in their needs. </li>
<li>The ability to pay for use of computing resources on a short-term basis as needed (e.g., processors by the hour and storage by the day) and release them as needed, thereby rewarding conservation by letting machines and storage go when they are no longer useful. </li>
</ol>
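<p>The pay-per-use point above, together with the earlier remark that 1000 servers for one hour cost no more than one server for 1000 hours, can be sketched as simple arithmetic. This is only an illustration: the class <code>CostAssociativity</code>, the hourly rate, and the assumption that the job scales perfectly across servers are hypothetical and are not taken from the paper.</p>

```java
// Hypothetical illustration of per-server-hour billing; the rate is made up.
final class CostAssociativity {

    // Total charge for a batch job billed per server-hour.
    // It depends only on the amount of work, not on how many servers share it.
    static double cost(double workServerHours, double ratePerServerHour) {
        return workServerHours * ratePerServerHour;
    }

    // Elapsed time if the work spreads evenly over the servers
    // (assumes perfect scaling, which real programs may not reach).
    static double wallClockHours(double workServerHours, int servers) {
        if (servers < 1) {
            throw new IllegalArgumentException("servers must be at least 1");
        }
        return workServerHours / servers;
    }

    public static void main(String[] args) {
        double work = 1000.0;   // server-hours of computation
        double rate = 0.10;     // assumed price per server-hour
        int[] fleetSizes = {1, 1000};
        for (int servers : fleetSizes) {
            System.out.println(servers + " server(s): "
                    + wallClockHours(work, servers) + " h elapsed, cost "
                    + cost(work, rate));
        }
    }
}
```

<p>Under these assumptions the charge is identical for both fleet sizes, while the elapsed time shrinks in proportion to the number of servers.</p>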
<p>We argue that the construction and operation of extremely large-scale, commodity-computer datacenters at low-cost locations was the key necessary enabler of Cloud Computing, for they uncovered the factors of 5 to 7 decrease in cost of electricity, network bandwidth, operations, software, and hardware available at these very large economies of scale. These factors, combined with statistical multiplexing to increase utilization compared with a private cloud, meant that cloud computing could offer services below the costs of a medium-sized datacenter and yet still make a good profit. Any application needs a model of computation, a model of storage, and a model of communication. The statistical multiplexing necessary to achieve elasticity and the illusion of infinite capacity requires each of these resources to be virtualized to hide the implementation of how they are multiplexed and shared. Our view is that different utility computing offerings will be distinguished based on the level of abstraction presented to the programmer and the level of management of the resources.</p>
<p>Amazon EC2 is at one end of the spectrum. An EC2 instance looks much like physical hardware, and users can control nearly the entire software stack, from the kernel upwards. This low level makes it inherently difficult for Amazon to offer automatic scalability and failover, because the semantics associated with replication and other state management issues are highly application-dependent. At the other extreme of the spectrum are application domain-specific platforms such as <a href="http://jimey.com/tag/google" class="st_tag internal_tag" rel="tag" title="Posts tagged with Google">Google</a> AppEngine. AppEngine is targeted exclusively at traditional web applications, enforcing an application structure of clean separation between a stateless computation tier and a stateful storage tier. AppEngine&#8217;s impressive automatic scaling and high-availability mechanisms, and the proprietary MegaStore data storage available to AppEngine applications, all rely on these constraints. Applications for Microsoft&#8217;s Azure are written using the .NET libraries, and compiled to the Common Language Runtime, a language-independent managed environment. Thus, Azure is intermediate between application frameworks like AppEngine and hardware virtual machines like EC2. When is Utility Computing preferable to running a Private Cloud? A first case is when demand for a service varies with time. Provisioning a data center for the peak load it must sustain a few days per month leads to underutilization at other times, for example. Instead, Cloud Computing lets an organization pay by the hour for computing resources, potentially leading to cost savings even if the hourly rate to rent a machine from a cloud provider is higher than the rate to own one. A second case is when demand is unknown in advance. For example, a web startup will need to support a spike in demand when it becomes popular, followed potentially by a reduction once some of the visitors turn away. 
Finally, organizations that perform batch analytics can use the &quot;cost associativity&quot; of cloud computing to finish computations faster: using 1000 EC2 machines for 1 hour costs the same as using 1 machine for 1000 hours. For the first case of a web business with varying demand over time and revenue proportional to user hours, we have captured the tradeoff in the equation below. </p>
<p><a href="http://1.bp.blogspot.com/_6NPFeKwIbrY/SZPIwo9Re7I/AAAAAAAAAAc/pvQMNTPQ2g4/s1600-h/equation1.png"><img border="0" alt="" src="http://1.bp.blogspot.com/_6NPFeKwIbrY/SZPIwo9Re7I/AAAAAAAAAAc/pvQMNTPQ2g4/s400/equation1.png" /></a></p>
<p>The left-hand side multiplies the net revenue per user-hour by the number of user-hours, giving the expected profit from using Cloud Computing. The right-hand side performs the same calculation for a fixed-capacity datacenter by factoring in the average utilization, including nonpeak workloads, of the datacenter. Whichever side is greater represents the opportunity for higher profit. </p>
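<p>The comparison can be tried out numerically. What follows is a minimal Python sketch, assuming the pictured equation has the form the paragraph above describes: user-hours times (revenue minus cost) on each side, with the datacenter cost divided by its average utilization. The function names and every number are made up for illustration and do not come from the paper.</p>

```python
def cloud_profit(user_hours, revenue_per_user_hour, cloud_cost_per_user_hour):
    """Left-hand side: expected profit when capacity is rented by the hour."""
    return user_hours * (revenue_per_user_hour - cloud_cost_per_user_hour)


def datacenter_profit(user_hours, revenue_per_user_hour,
                      datacenter_cost_per_user_hour, utilization):
    """Right-hand side: the same calculation for a fixed-capacity datacenter.

    utilization is the average fraction of capacity in use (greater than 0,
    at most 1). Dividing the cost by it charges every useful user-hour for
    the idle capacity that had to be provisioned for the peak.
    """
    effective_cost = datacenter_cost_per_user_hour / utilization
    return user_hours * (revenue_per_user_hour - effective_cost)


if __name__ == "__main__":
    # Hypothetical figures, in cents per user-hour.
    hours = 10000
    revenue = 10
    print("cloud:     ", cloud_profit(hours, revenue, cloud_cost_per_user_hour=6))
    print("datacenter:", datacenter_profit(hours, revenue,
                                           datacenter_cost_per_user_hour=4,
                                           utilization=0.5))
```

<p>With these invented numbers the owned machine is cheaper per hour (4 cents against 6), yet at 50% average utilization its effective cost doubles to 8 cents, so the cloud side comes out ahead. At full utilization the datacenter side would win.</p>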
<p>The table below previews our ranked list of critical obstacles to growth of Cloud Computing; the full discussion is in Section 7 of our paper. The first three concern adoption, the next five affect growth, and the last two are policy and business obstacles. Each obstacle is paired with an opportunity, ranging from product development to research projects, which can overcome that obstacle. </p>
<p>We predict Cloud Computing will grow, so developers should take it into account. All levels should aim at horizontal scalability of virtual machines over the efficiency on a single VM. In addition: </p>
<ul>
<li>Applications Software needs to both scale down rapidly as well as scale up, which is a new requirement. Such software also needs a pay-for-use licensing model to match needs of Cloud Computing. </li>
<li>Infrastructure Software needs to be aware that it is no longer running on bare metal but on VMs. Moreover, it needs to have billing built in from the beginning. </li>
<li>Hardware Systems should be designed at the scale of a container (at least a dozen racks), which will be the minimum purchase size. Cost of operation will match performance and cost of purchase in importance, rewarding energy proportionality such as by putting idle portions of the memory, disk, and network into low power mode. Processors should work well with VMs, flash memory should be added to the memory hierarchy, and LAN switches and WAN routers must improve in bandwidth and cost. </li>
</ul>
<p><strong>Table:</strong> Quick Preview of Top 10 Obstacles to and Opportunities for Growth of Cloud Computing.</p>
<table>
<tr><th>#</th><th>Obstacle</th><th>Opportunity</th></tr>
<tr><td>1</td><td>Availability of Service</td><td>Use Multiple Cloud Providers; Use Elasticity to Prevent DDOS</td></tr>
<tr><td>2</td><td>Data Lock-In</td><td>Standardize APIs; Compatible SW to enable Surge Computing</td></tr>
<tr><td>3</td><td>Data Confidentiality and Auditability</td><td>Deploy Encryption, VLANs, Firewalls; Geographical Data Storage</td></tr>
<tr><td>4</td><td>Data Transfer Bottlenecks</td><td>FedExing Disks; Data Backup/Archival; Higher BW Switches</td></tr>
<tr><td>5</td><td>Performance Unpredictability</td><td>Improved VM Support; Flash Memory; Gang Schedule VMs</td></tr>
<tr><td>6</td><td>Scalable Storage</td><td>Invent Scalable Store</td></tr>
<tr><td>7</td><td>Bugs in Large <a href="http://jimey.com/tag/distributed" class="st_tag internal_tag" rel="tag" title="Posts tagged with distributed">Distributed</a> Systems</td><td>Invent Debugger that relies on <a href="http://jimey.com/tag/distributed" class="st_tag internal_tag" rel="tag" title="Posts tagged with distributed">Distributed</a> VMs</td></tr>
<tr><td>8</td><td>Scaling Quickly</td><td>Invent Auto-Scaler that relies on ML; Snapshots for Conservation</td></tr>
<tr><td>9</td><td>Reputation Fate Sharing</td><td>Offer reputation-guarding services like those for email</td></tr>
<tr><td>10</td><td>Software Licensing</td><td>Pay-for-use licenses; Bulk use sales</td></tr>
</table>
<p>[<a href="http://berkeleyclouds.blogspot.com/" target="_blank">via</a>]</p>
<ul>
<li><a href="http://d1smfj0g31qzek.cloudfront.net/abovetheclouds.pdf">Above the Clouds White Paper (pdf)</a></li>
<li><a href="http://d1smfj0g31qzek.cloudfront.net/above_the_clouds.ppt.pdf">Above the Clouds Presentation (pdf)</a></li>
<li><a href="http://d1smfj0g31qzek.cloudfront.net/above_the_clouds.ppt">Above the Clouds Presentation (ppt)</a></li>
<li><a href="http://www.youtube.com/watch?v=IJCxqoh5ep4">Above the Clouds Video</a></li>
<li><a href="http://radlab.cs.berkeley.edu/">RAD Lab Home Page</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/02/14/cloud-computing-defined-by-berkeley-rad-labs.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How Businesses Can Use P2P</title>
		<link>http://jimey.com/2009/02/07/how-businesses-can-use-p2p.html</link>
		<comments>http://jimey.com/2009/02/07/how-businesses-can-use-p2p.html#comments</comments>
		<pubDate>Sat, 07 Feb 2009 03:43:00 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[data sharing]]></category>
		<category><![CDATA[P2P]]></category>
		<category><![CDATA[social networking]]></category>
		<category><![CDATA[zero configuration]]></category>

		<guid isPermaLink="false">http://jimey.cn/2009/02/07/how-businesses-can-use-p2p.html</guid>
		<description><![CDATA[Almost every description of P2P in the context of business infrastructure starts something like this: &#34;P2P is notorious for...&#34; This comes from many years of people associating P2P with illegal downloading, to the point that the terms are now almost synonymous. Such an association is inher..... ]]></description>
			<content:encoded><![CDATA[<p>Almost every description of P2P in the context of business infrastructure starts something like this: &quot;P2P is notorious for&#8230;&quot; This comes from many years of people associating P2P with illegal downloading, to the point that the terms are now almost synonymous. Such an association is inherently unfair, however, because no one equates TCP/IP and crime, despite the fact that TCP/IP is the protocol of choice for many cyber-criminals.</p>
<p>Rather than resorting to outdated and inaccurate definitions, let&#8217;s start from scratch and consider the following: what is P2P, really? What is it good for? How can we use it to save and earn money?</p>
<h4>What&#8217;s in a Name?</h4>
<p>P2P stands for &quot;peer to peer.&quot; Put simply, it&#8217;s one method of establishing communication between parties. Uploading information to <a href="http://jimey.com/tag/google" class="st_tag internal_tag" rel="tag" title="Posts tagged with Google">Google</a> Docs to share it with colleagues is not P2P, but sending the same information as an attachment to email is, despite the fact that mail servers are involved. In this context, P2P doesn&#8217;t mean &quot;serverless communication&quot; so much as &quot;communication that is perceived to be serverless.&quot; Like email, instant messaging (IM) is considered P2P technology because even though servers are used quite extensively, there is no explicit act of uploading data to an intermediate location. With both email and IM, servers are used behind the scenes, so to speak.</p>
<p>Thus, &quot;P2P&quot; is as much a social term as a technical one. It connotes a grid or cloud of devices that are more or less equal, rather than a constellation of star-like servers with clusters of clients surrounding them. But from the purely technical point of view, there is a distinct difference between true P2P (in which data is not relayed through a server) and perceived P2P (in which data is relayed through a server, but we don&#8217;t see it happen).</p>
<h4>P2P and the Cloud</h4>
<p>As currently implemented, &quot;cloud computing&quot; is really just a new name for old-school client-server computing, except that the servers pretend to be redundant. Clients have little or no actual control over this redundancy and cannot even verify its existence. When a cloud-based service tells you that your data is stored securely, you have no choice but to trust it&#8230; or not.</p>
<p>Compare that with a P2P file-sharing network. On a P2P network, every peer can tell who has complete or partial copies of a given file, what percentage of the file is stored locally, as well as many other details. Doesn&#8217;t that seem like a better example of cloud storage? If not, then what is?</p>
<p>Even better, with this set-up you can easily control the level of redundancy: just add another client instance, have it share the same file, and you&#8217;ve increased your redundancy by one. However, you can&#8217;t reduce redundancy beyond the peers under your influence. Unless every peer agrees to remove a given file, no one can remove it. There have been many ideas about implementing a kind of &quot;delete button&quot; for the web, but the closer we move towards cloud computing, the less likely such a scenario becomes.</p>
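<p>To make the idea of visible, controllable redundancy concrete, here is a small Python sketch. It assumes a hypothetical swarm in which a file is split into numbered pieces and every peer announces which pieces it holds; the function name, the peer names and the data layout are illustrative and not taken from any particular P2P client.</p>

```python
def redundancy_report(total_pieces, peer_pieces):
    """Summarize how well a file is replicated across a swarm.

    total_pieces: number of pieces the file is split into (must be positive).
    peer_pieces:  mapping of peer name to the set of piece indexes it holds.
    """
    # Count how many peers hold each piece.
    copies_per_piece = [0] * total_pieces
    for pieces in peer_pieces.values():
        for index in pieces:
            copies_per_piece[index] += 1

    # Share of the file stored by each peer, as a percentage.
    percent_held = {
        peer: 100.0 * len(pieces) / total_pieces
        for peer, pieces in peer_pieces.items()
    }
    # Peers that hold every piece of the file.
    full_copies = sorted(
        peer for peer, pieces in peer_pieces.items()
        if len(pieces) == total_pieces
    )
    return {
        "full_copies": full_copies,
        "percent_held": percent_held,
        # The rarest piece limits how many whole copies the swarm can rebuild.
        "redundancy": min(copies_per_piece),
    }


if __name__ == "__main__":
    swarm = {"alice": {0, 1, 2, 3}, "bob": {0, 1}, "carol": {2, 3}}
    print(redundancy_report(4, swarm))
    swarm["dave"] = {0, 1, 2, 3}  # one more client sharing the whole file
    print(redundancy_report(4, swarm)["redundancy"])
```

<p>In this toy swarm the first report shows a redundancy of two (one full copy plus two halves), and adding one more peer with the whole file raises it to three, which is the "increase by one" described above.</p>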
<p>Cloud services are chosen for their convenience (being accessible from everywhere with simple tools) and reliability (with redundant storage in stable data centers). P2P technologies increase both factors: they increase convenience because there is no uploading or downloading to and from the cloud, and they provide directly controllable redundancy and, thus, cost control.</p>
<p>In most cases, more reliability means higher prices, and not all data deserves the same level of service. With P2P platforms behind cloud services, developers could implement applications that allow multiple storage and processing schemes without much hassle. This is not always good for service providers, because flexible cost control means that customers can scale up and down freely as business and economic conditions demand. But for the industry as a whole, it is definitely a good thing because it stiffens competition and enables customers to better survive.</p>
<p>What is perhaps more significant about introducing P2P technology into cloud computing is that the P2P cloud would truly be a cloud, not just a 15-year-old client-server technology with a new sticker. If you are inviting us into the cloud, then let us truly be a part of it, instead of remaining a client that we can neither see nor control.</p>
<h4>P2P as a Social Tool</h4>
<p>The traditional Internet (Web 1.0) was built mostly like a television network. Websites were controlled by a small group of content producers (editors, media teams, individual owners, etc.), and millions of users were consumers of that content. The barrier to entry was far lower than it was in the television industry, but it was still mainly a one-way road from the website to the user. The Internet&#8217;s infrastructure reflected this: data centers, thick pipes that connected them, and subscriber lines that could download much faster than upload.</p>
<p>Then <a href="http://jimey.com/tag/web-20" class="st_tag internal_tag" rel="tag" title="Posts tagged with Web 2.0">Web 2.0</a> came along, and everything started to change. Today, most popular sites were created by ordinary users who care less about owning content than about sharing it with others. Developers created ecosystems and gave users tools to track each other and exchange content, and sites became service providers instead of content sources.</p>
<p>What happened to the Internet&#8217;s infrastructure, then? Nothing. Imagine YouTube going out of business: the entire infrastructure for that video would disappear in a day, but the videos themselves would still exist, scattered among computers of individual users, stuck on cell phones, caught in caches, etc.</p>
<p>Now take that one step further: instead of YouTube, we have a P2P network, full of videos and convenient tools to watch and upload them. Nobody can close this network or put it out of business. Sufficiently large P2P networks are invincible; if you don&#8217;t believe that, ask the MPAA and RIAA.</p>
<p><a href="http://jimey.com/tag/web-20" class="st_tag internal_tag" rel="tag" title="Posts tagged with Web 2.0">Web 2.0</a> is P2P (in a social sense), done with Web 1.0 tools and old infrastructure. To unlock a box with a whole set of new services, we need to upgrade the infrastructure. Just as Gopher was replaced by WWW, and UUCP was replaced by SMTP, the current star-shaped web infrastructure will be replaced by a mesh-shaped cloud network. Data centers would still exist, but instead of providing bandwidth and servers, they would provide reliability and accessibility. (And a system of measurement would need to be established for both.)</p>
<p>We can call this a &quot;social Internet infrastructure&quot;: an infrastructure that reflects new social behavior, that allows anyone to connect and share content with anyone else, while still enjoying sufficient privacy and security. It&#8217;s not so much a revolution as an evolution: another step in a process that has been occurring for some time already. It happened to the telephone system &#8212; does anyone remember having to call a phone station to be put through to another person? I&#8217;ve only read about it in books. It will happen to the Internet eventually, too.</p>
<h4>So, Show Us the Money</h4>
<p>What is P2P good for, then? To answer that, let&#8217;s first look at the potential benefits of an &quot;ideal&quot; P2P implementation, benefits that a business could use for competitive advantage or to implement an entirely new service.</p>
<p>The most talked-about advantage of P2P is its ability to cut bandwidth bills. Imagine how much money companies like Dell, Logitech and Microsoft spend on downloads of products and services. Logitech&#8217;s generic mouse driver is over 30 MB in size, and the number of its mice being used around the world is countless. A Microsoft Windows service pack can be as big as 300 MB. Now think of how many computers run Windows. (Remember that Apple has less than 25% market share, if that helps.) There are also media distribution companies, web accelerators, <a href="http://jimey.com/tag/distributed" class="st_tag internal_tag" rel="tag" title="Posts tagged with distributed">distributed</a> back-ups; the list goes on. And every one of these companies could dramatically reduce its bandwidth bill by using P2P. Properly implementing P2P content distribution isn&#8217;t easy; but when our industry had (almost) more money than it could spend, nobody cared to try it. Today, money isn&#8217;t just <em>an</em> issue; it&#8217;s <em>the</em> issue.</p>
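<p>A back-of-the-envelope calculation shows the scale involved. The Python sketch below takes the 300 MB service-pack size mentioned above; the download count and the share of traffic served by peers are purely hypothetical assumptions, as is the function name.</p>

```python
def origin_traffic_gb(file_size_mb, downloads, peer_fraction=0.0):
    """Data the publisher's own servers must send, in GB (1 GB = 1000 MB).

    peer_fraction is the assumed share of bytes that downloaders fetch from
    each other instead of from the publisher (0.0 means no P2P at all).
    """
    return file_size_mb * downloads * (1.0 - peer_fraction) / 1000.0


if __name__ == "__main__":
    downloads = 1000000  # hypothetical number of downloads
    print("servers only:", origin_traffic_gb(300, downloads), "GB")
    print("80% from peers:", origin_traffic_gb(300, downloads, 0.8), "GB")
```

<p>Under these assumptions the publisher ships 300,000 GB on its own, and roughly 60,000 GB if peers supply four fifths of the bytes. The real saving depends entirely on how much upload capacity the peers actually contribute.</p>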
<p>Look at Skype. For it, implementing P2P was not optional but mandatory. Creating a free multimedia service for millions of people without spending tens of millions of dollars on &quot;free&quot; infrastructure wasn&#8217;t (and still isn&#8217;t) possible. With P2P, Skype was able to provide free phone access without actually subsidizing users (this is not 100% accurate, but accurate enough for this example). All the money it collects from paid users is profit, and yet its creators failed with Joost, not because no one wants free television, but because Joost had a different infrastructure. People were not into sharing TV as much as they were into simply talking to each other. And then came YouTube, which was funded first by venture capital and then by <a href="http://jimey.com/tag/google" class="st_tag internal_tag" rel="tag" title="Posts tagged with Google">Google</a>.</p>
<p>Another benefit of P2P is that it requires zero configuration. Skype is probably not the best IP phone around, nor was it the first; but you don&#8217;t have to be a telecommunications engineer to use it. You download the installer, run it, register yourself as a user, and off you go, from nothing to brilliant conversation in a few minutes.</p>
<p>As with content distribution, implementing a P2P network that requires no configuration isn&#8217;t an easy task, but it dramatically reduces the number of users who drop off from being intimidated by the technology or feeling they lack the necessary skill. For many services, this is the difference between 100,000 users and 10 million users, or between going out of business as soon as venture money dries up and being profitable within a year.</p>
<p>Zero-configuration P2P has to do with more than just P2P, though. It also implies being able to fully network with zero configuration: the ability to connect any device anywhere using any available connection. Unattended sensors, medical IT devices, military computers, none of these should require in-field configuration. The people who use them generally don&#8217;t have time to read instruction manuals. They should be able to open the box, insert batteries, and have a workable device within seconds. This is what zero configuration gives users: a choice, not just between high and low profitability, but between life and death.</p>
<p>Even in the case of lower-profile applications, zero-configuration P2P can cut deployment costs tremendously &#8212; and well-implemented P2P platforms could reduce those costs to almost nothing. On such a platform, for example, setting up a new message-processing server for a financial system would be as easy as opening the box, throwing the server on the rack, plugging in the ethernet and power cables, and nothing else.</p>
<p>Ubiquitous connectivity reduces development costs. Message-passing platforms wouldn&#8217;t need to account for different types of hosts, relays, connectivity fall-backs, and so on. You would simply confirm that the peer is up and then send it a message. Done. Think of how many networked hosts out there are actually &quot;gateways&quot; between email and text messages, between Internet and Intranets, between X and Y technologies. When one networked device can securely connect with any other, many problems simply evaporate. Before TCP/IP took the world by storm, gazillions of networking technologies existed. No one remembers their names, not because they were inadequate, but because TCP/IP was everywhere, and it&#8217;s much easier to speak the common language than to teach others a &quot;better&quot; one.</p>
<p>All of these factors are hopefully pushing cloud networking technologies towards commoditization, which would expand markets, decrease infrastructure costs, and allow companies to deliver better products more cheaply.</p>
<p>[<a href="http://www.readwriteweb.com/archives/how_businesses_can_use_p2p.php" target="_blank">via</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/02/07/how-businesses-can-use-p2p.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Cisco in The Cloud: Application Extension Platform (AXP)</title>
		<link>http://jimey.com/2009/01/21/cisco-in-the-cloud-application-extension-platform-axp.html</link>
		<comments>http://jimey.com/2009/01/21/cisco-in-the-cloud-application-extension-platform-axp.html#comments</comments>
		<pubDate>Wed, 21 Jan 2009 10:57:05 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Cisco]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://jimey.cn/2009/01/21/cisco-in-the-cloud-application-extension-platform-axp.html</guid>
		<description><![CDATA[One doesn&#8217;t tend to associate Cisco with Web 2.0, social media or cloud computing. Cisco equals routers, the plumbing of the web. In other words, vital and hugely profitable; but not the &#8216;new, new thing&#8217; of the Internet. It turns out we have written before about how Cisco is using social media to reach enterprise [...... ]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.readwriteweb.com/images/cisco-logo.jpg" width="150" height="91" />One doesn&#8217;t tend to associate <a href="http://jimey.com/tag/cisco" class="st_tag internal_tag" rel="tag" title="Posts tagged with Cisco">Cisco</a> with <a href="http://jimey.com/tag/web-20" class="st_tag internal_tag" rel="tag" title="Posts tagged with Web 2.0">Web 2.0</a>, social media or cloud computing. <a href="http://jimey.com/tag/cisco" class="st_tag internal_tag" rel="tag" title="Posts tagged with Cisco">Cisco</a> equals routers, the plumbing of the web. In other words, vital and hugely profitable; but not the &#8216;new, new thing&#8217; of the Internet. It turns out <a href="http://www.readwriteweb.com/archives/cisco_social_media.php">we have written before</a> about how <a href="http://jimey.com/tag/cisco" class="st_tag internal_tag" rel="tag" title="Posts tagged with Cisco">Cisco</a> is using social media to reach enterprise clients and playing in the <a href="http://www.readwriteweb.com/archives/ciscos_social_networking_strategy_eos.php">enterprise social networking</a> space. Now <a href="http://jimey.com/tag/cisco" class="st_tag internal_tag" rel="tag" title="Posts tagged with Cisco">Cisco</a> wants to position its <a href="http://www.cisco.com/en/US/products/ps9701/index.html">Application Extension Platform (AXP)</a> in the cloud platform space and is encouraging developers with $100,000 in prizes for the most innovative applications.</p>
<p>The product AXP is not new. <a href="http://jimey.com/tag/cisco" class="st_tag internal_tag" rel="tag" title="Posts tagged with Cisco">Cisco</a> launched it about 4 years ago. But it is now ramping up the marketing. Read on for a rundown of AXP&#8217;s positioning in the market.</p>
<h4>The Networking Problems In Branch Offices</h4>
<p>On the consumer Internet, we connect directly to big server farms running our favorite services. In the Enterprise world, most users connect via a branch network. In this more complicated world, there are three concerns that are not well addressed by the platforms that emerged in the <a href="http://jimey.com/tag/web-20" class="st_tag internal_tag" rel="tag" title="Posts tagged with Web 2.0">Web 2.0</a> era: </p>
<ol>
<li><strong>Security at the branch level.</strong> Security at the data center or server farm may be great, but the branch is often a back-door threat. </li>
<li><strong>Performance.</strong> If a hundred people are working in a branch, dealing with some network traffic locally would improve performance. </li>
<li><strong>Survivability.</strong> Losing connectivity to the cloud is still a significant issue. If it happens to a single consumer, you shrug it off and say &quot;Oh well.&quot; If it stops a bank branch from functioning, well, that&#8217;s another story.</li>
</ol>
<p><a href="http://jimey.com/tag/cisco" class="st_tag internal_tag" rel="tag" title="Posts tagged with Cisco">Cisco</a> AXP aims to address these challenges in branch networks. The $100,000 contest is part of this, as it asks developers to think of what apps can run at the branch level within a <a href="http://jimey.com/tag/cisco" class="st_tag internal_tag" rel="tag" title="Posts tagged with Cisco">Cisco</a> network router (aka &quot;the box&quot;) &#8211; which is really a Linux server with the network services exposed via an API. February 27th is the final deadline for submissions, if you&#8217;re a developer interested in this challenge. Finalists will be announced in May and the winners will be announced in August/September.</p>
<p>For more on the technology, current use cases, and competition rules, see <a href="http://www.slideshare.net/ricmac/cisco-media-briefing-developer-contest-v43-presentation/">Cisco&#8217;s presentation</a> on Slideshare.</p>
<p>[<a target="_blank" href="http://www.readwriteweb.com/archives/cisco_axp_challenge.php">via</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/01/21/cisco-in-the-cloud-application-extension-platform-axp.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Software algorithm visualizes large data sets</title>
		<link>http://jimey.com/2009/01/21/software-algorithms-visualization-of-large-data-sets.html</link>
		<comments>http://jimey.com/2009/01/21/software-algorithms-visualization-of-large-data-sets.html#comments</comments>
		<pubDate>Wed, 21 Jan 2009 06:48:23 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[data set]]></category>

		<guid isPermaLink="false">http://jimey.cn/2009/01/21/%e8%bd%af%e4%bb%b6%e7%ae%97%e6%b3%95%e5%8f%af%e8%a7%86%e5%8c%96%e5%a4%a7%e5%9e%8b%e6%95%b0%e6%8d%ae%e9%9b%86.html</guid>
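		<description><![CDATA[Scientists have more data available than ever before, so much that they cannot examine it properly. A new algorithm now lets them visualize large data sets more simply and more cheaply. Software based on the algorithm runs on a personal computer and needs only 2 GB of memory. The project is led by Attila Gyulassy, a computer science graduate student at UC Davis, as the subject of his doctoral dissertation. Supercomputers can currently simulate physical phenomena such as ocean currents and combustion..... ]]></description>
			<content:encoded><![CDATA[<p>Scientists have more data available than ever before, so much that they cannot examine it properly. A <a href="http://www.news.ucdavis.edu/search/news_detail.lasso?id=8929">new algorithm</a> now lets them <a href="http://www.computerworld.com.au/article/273477/software_algorithm_visualizes_large_data_sets?fp=16&amp;fpid=1">visualize large data sets</a> (<a href="http://en.wikipedia.org/wiki/Data_set">data set</a>) more simply and more cheaply. Software based on the algorithm runs on a personal computer and needs only 2 GB of memory. The project is led by <a href="http://graphics.cs.ucdavis.edu/%7Ejediati/">Attila Gyulassy</a>, a computer science graduate student at UC Davis, as the subject of his doctoral dissertation.<i> Supercomputers can currently simulate physical phenomena such as ocean currents and combustion, but the huge volume of data they produce is almost impossible to use directly. Gyulassy's professor, Bernd Hamann, says that without the help of visualization tools there is no way to understand what the data means. On a laptop, Gyulassy tested the algorithm on a simulation of two liquids mixing: a data set of more than one billion points on a three-dimensional grid. The software analyzed the data within 24 hours and sketched out the phenomenon within a few seconds.</i></p>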
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/01/21/software-algorithms-visualization-of-large-data-sets.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Seven security benefits of cloud computing</title>
		<link>http://jimey.com/2009/01/18/cloud-computing-for-the-security-of-enumerating-the-seven-benefits.html</link>
		<comments>http://jimey.com/2009/01/18/cloud-computing-for-the-security-of-enumerating-the-seven-benefits.html#comments</comments>
		<pubDate>Sun, 18 Jan 2009 06:13:59 +0000</pubDate>
		<dc:creator>jimey</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[hash]]></category>

		<guid isPermaLink="false">http://jimey.cn/2009/01/18/%e5%8e%86%e6%95%b0%e4%ba%91%e8%ae%a1%e7%ae%97%e4%b8%ba%e5%ae%89%e5%85%a8%e5%b8%a6%e6%9d%a5%e7%9a%84%e4%b8%83%e5%a4%a7%e7%9b%8a%e5%a4%84.html</guid>
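		<description><![CDATA[2. Rapid incident response     Forensic readiness: When necessary, I can use what an Infrastructure-as-a-Service (IaaS) provider offers to set up a dedicated forensic server for my company. When an incident occurs and forensics are needed, I only pay for the online storage that is used. I do not need extra staff to manage remote logins and their software; all I have to do is click in the cloud provider's web interface..... ]]></description>
			<content:encoded><![CDATA[<p>2. Rapid incident response    <br /> Forensic readiness: When necessary, I can use what an Infrastructure-as-a-Service (IaaS) provider offers to set up a dedicated forensic server for my company. When an incident occurs and forensics are needed, I only pay for the online storage that is used. I do not need extra staff to manage remote logins and their software; all I have to do is click a few buttons in the cloud provider's web interface. If several incident responses arise at once, I can first make a copy, hand the forensic work out to different departments or people, and then analyze quickly and reach a conclusion. To get the full benefit of this, however, forensic software vendors need to move from traditional software licensing to a new network licensing model.     <br /> Shorter forensic acquisition time: If a server in the cloud fails, I only need to click in the cloud client to clone that server and make the cloned server's disk available to the forensic server. I do not have to hunt for storage devices and wait for them to start up and become usable, which greatly shortens acquisition time.     <br /> Lower likelihood of server failure: As in the case just described, even if a server fails, it can be cloned within a very short time so that a brand-new server is available for use. In addition, in some cases replacing the failed hardware does not interfere with the forensic work.     <br /> More targeted forensics: Within the same cloud, obtaining a cloned server is much faster: cloned servers can be distributed more quickly over the file system the cloud provider has purpose-built. From a network traffic point of view, a server copy within the same cloud may not incur any extra cost. Without the cloud, achieving the same result would take a great deal of valuable time and expensive hardware. In a cloud environment, I only pay storage fees for the forensic evidence that is actually useful.     <br /> <font color="#ff0000">Hiding forensic traces: Some cloud storage can perform cryptographic checksums and hashes. For example, Amazon S3 automatically generates an MD5 hash when you store data. In theory, you do not need to spend time generating MD5 checksums with external tools, because the cloud already provides this capability.</font>     <br /> Shorter time to access protected data: CPUs are now very powerful. Testing the passwords that protect data used to take a long time; with the powerful CPUs available in a cloud environment, the strength of those passwords can be tested over a wider range in a short time. Batch processing of access to protected data therefore also becomes simple and fast.</p>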
<p><a target="_blank" href="http://www.enet.com.cn/article/2008/0825/A20080825349627.shtml">[via]</a></p>
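<p>About the part in red: an article I read a long time ago kept going on about hashes, and I never really understood what the point of them was. This article mentions one of the things a hash is used for, so I am posting it here for now as a note.</p>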
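<p>To make the hash point in the red paragraph concrete, here is a minimal sketch in Python of how an MD5 digest can confirm that a copy of some evidence is unchanged. It uses only the standard <code>hashlib</code> module. The helper names <code>md5_hex</code> and <code>matches_stored_digest</code> are made up for illustration, and the sample bytes stand in for a real disk image. How a particular storage service reports its digest is not shown; the sketch simply assumes you already have the digest as a hex string.</p>

```python
import hashlib


def md5_hex(data: bytes) -> str:
    """Return the MD5 digest of data as a lowercase hex string."""
    return hashlib.md5(data).hexdigest()


def matches_stored_digest(data: bytes, stored_digest: str) -> bool:
    """Compare a local copy against a digest recorded earlier.

    Assumption: stored_digest is a hex string, possibly wrapped in
    double quotes and possibly uppercase, so both are normalized here.
    """
    normalized = stored_digest.strip().strip('"').lower()
    return md5_hex(data) == normalized


if __name__ == "__main__":
    evidence = b"bytes of a cloned disk image"  # placeholder data
    recorded = md5_hex(evidence)                # digest taken at storage time

    # An untouched copy matches; a copy with one extra byte does not.
    print(matches_stored_digest(evidence, recorded))
    print(matches_stored_digest(evidence + b"!", recorded))
```

<p>If the digest recorded when the data was stored equals the digest of the copy in hand, the copy has not changed by accident. MD5 is not safe against deliberate collisions, so this is an integrity check, not protection against tampering.</p>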
]]></content:encoded>
			<wfw:commentRss>http://jimey.com/2009/01/18/cloud-computing-for-the-security-of-enumerating-the-seven-benefits.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
