I attend this Webinar yesterday and it was pretty good. I like how straight they interaction between MongoDB and Hadoop. Check it out if you get a chance. http://www.10gen.com/presentations/webinar-whats-new-mongodb-hadoop-integration
Wrangling Customer Usage Data with Hadoop
Here is our session from the Hadoop Summit 2013. Title: Wrangling Customer Usage Data with Hadoop Slides: http://www.slideshare.net/Hadoop_Summit/hall-johnson-june271100amroom211v2 Description: At Clearwire we have a big data challenge: Processing millions of unique usage records comprising terabytes of data for millions of customers every week. Historically, massive purpose-built database solutions were used to process data, but weren’t particularly…
Something new, writing Android App in HTML 5
I wanted to write an App for my Android Tablet, something easy so I could get a basic understanding of what it would take to write an app. I figured that I could knock something out fairly quickly, but I didn’t know it would be this easy. I’m a fairly big user of Eclipse, so…
Windows Azure HDInsight ( Hadoop on Windows )
Lately I have been asked by a lot of my co-workers, if Hadoop runs on Windows. After going to the Hadoop Summit last month, I have been able to tell them about Azure HDInsight. Which is basically Apache Hadoop running on Windows Azure. It appears that Microsoft has been working with Hortonworks to bring Apache…
Hadoop to Hadoop Copy
Here recently I need to copy the content of one hadoop cluster to another for geo redundancy. Thankfully instead of have to write something to do it, Hadoop supply a hand tool to do it “DistCp (distributed copy)”. DistCp is a tool used for large inter/intra-cluster copying. It uses Map/Reduce to effect its distribution,…
Hortonworks Road Show "Big Business Value from Big Data and Hadoop"
This morning I went to the Hortonworks Road Show. It’s wasn’t Bad. I have to say out of the Hadoop Vendor I have talked to, I like Hortonworks business model the best. The fact that they are a large committer to the Apache Hadoop Project, along with several other sub projects such as Apache Ambari…
Make way for Hadoop in the 'Big Data' craze
Interesting bit on Hadoop a little over hyped if you ask me. http://www.marketwatch.com/story/make-way-for-hadoop-in-the-big-data-craze-2012-06-26?link=MW_latest_news –Regards
Working with Hadoop Streaming
Hadoop streaming is a utility that comes with the Hadoop distribution. The utility allows you to create and run map/reduce jobs with any executable or script as the mapper and/or the reducer. For example: shell> $HADOOP_HOME/bin/hadoop jar $HADOOP_HOME/hadoop-streaming.jar -input myInputDirs -output myOutputDir -mapper /bin/cat -reducer /bin/wc If you using the tar package from Apache Hadoop. You can find the…
Amazon Relational Database Service (Amazon RDS)
It appears that Amazon is introducing a new service specifically targeted at Relational Databases helpful hints. You can choose from MySQL, Oracle, and Microsoft Sql Server. Amazon Relational Database Service (Amazon RDS) is a web service that makes it easy to set up, operate, and scale a relational database in the cloud. It provides cost-efficient and resizable capacity while managing time-consuming…
Looking at the Hadoop MapReduce Capacity, Fair, and Hod Schedulers.
Today | started looking at the different MapReduce Schedulers, because I would like to be able to start the processing on a new jobs when slots became available. So I started look at the other schedulers that come with Hadoop. The Capacity Scheduler: The Capacity Scheduler is designed to run Hadoop Map-Reduce as a shared,…