Blog

  • Connecting Tableau to Google Cloud SQL

    Before connecting your Tableau Application up to your Google Cloud SQL Instance, you will need to make sure that you have assigned an IP Address to the instance. You will also need to allow the network in which your Tableau application is located, access to the Google Cloud SQL Instance.

    First, I recommend that you use an external source to determine your IP address, such as freegeoip.net or hostip.info, this will help eliminate any network translation issues.

    Now that you have your IP Address it is time to configure your Google Cloud SQL Instance. To grant access to your Tableau application, you need to do the following.

    1. Go to the Google Developers Console and select a project by clicking on the project name.
    2. In the sidebar on the left, click Storage > Cloud SQL.
    3. Find the instance to which you want to grant access and click the instance name. Click Edit.
    4. In the IPv4 Address section, select Assign an IPv4 address to my Cloud SQL instance to assign one to the instance.

    Note: There are charges when you assign an IPv4 address. For more information, see the pricing page.

    assign-ipOnce you have assigned the IP Address to your instance, you will need to allow the IP Address from your Tableau Application access to the instance by doing the following.

    allowed-network

    In the Allowed Networks area click on the blue border button with the plus sign. In the text box title Network add the IP Address that you obtained earlier.

    Now if you haven’t already, I would recommend that you create a read-only user with access to the schema that you want access.

    To configure Tableau Desktop 9.0 to Google Cloud SQL Instance, you need to configure a MySQL connection.

    tableau-more-servers

    1.) Open Tableau Desktop.
    2.) On the Left-hand side under Connect click “More Servers.”
    3.) Click on “MySQL.”
    4.) Fill in the Server text box with the IP Address that was assigned to your Google Cloud SQL Instance, the port should be 3306, which is the default. Now simply fill in the username and password and click OK.

    tableau-mysql-server-connection

    Congratulations! You’re all connected and ready to start building reports.

  • A Day At Chihuly Garden and Glass

    A Day At Chihuly Garden and Glass

    Chihuly Garden and Glass provides a look at the inspiration and influences that inform the career of artist Dale Chihuly. Located at Seattle Center, Chihuly Garden and Glass includes an Exhibition Hall, the centerpiece Glasshouse and a lush Garden. The Exhibition Hall contains eight galleries and three Drawing Walls, offering visitors a comprehensive look at Chihuly’s significant series of work; the Glasshouse presents a suspended 1,400-piece, 100-foot-long sculpture; the Garden is a backdrop for four monumental sculptures and other installations.

     

     

  • Elasticsearch

    Elasticsearch is a distributed restful search and analytic tool that is built on the top of Apache Lucene for high performance.

    Elasticsearch features include:

    Real-Time Data Indexing
    Scalability
    High Availability
    Multi-Tenancy
    Full Text Search
    Document Orientation

    The flow of data never stops so the question is how quickly can that data become available. Elasticsearch indexes the data in real-time allowing the for data to be made available as quickly as it can for real-time analytics.

    Elasticsearch is horizontally scalable. An organization can simply add nodes to expand the cluster capacity. Elasticsearch is resilient in detecting and removing failed nodes and rebalancing itself to ensure that an organization’s data is safe and accessible.

    Elasticsearch can also host multiple indices that can be queried independently or as a group. Elasticsearch structures it’s data in JSON documents. All fields are indexed by default and all the indices can be used in a single query.

    Clients can connect to Elasticsearch by using a standard HTTP REST library. This gives any programming language the ability to connect to Elasticsearch.

    Elasticsearch has been used to query 24 billion records in 900ms. It’s currently being used by companies such as Github, Foursquare, and Xing.

    website: http://www.elasticsearch.org/

  • 2014 AT&T Developer Summit

    I will be attending the AT&T Developer Summit in Las Vegas. I will also be taking part in the Summit Hackathon.

    “The AT&T Summit Hackathon is the premier hackathon of the year for the AT&T Developer Program. This year will be focused on wearable technologies and participants will be able to choose between a Wearables Track and an AT&T API Track. Finalists from each track will be featured in live fast pitches on stage with our executives during the keynote at the AT&T Developer Summit on January 6th. In addition, competitors will also have the ability to complete in accelerator challenges, details to be announced, which will offer prizes of up to $10,000 for eligible teams”

    more >>

  • SIC:// AT&T Hackathon 2013

    Well, since my last blog entry I went to my first Hackathon, the Seattle Interactive Conference / SIC:// AT&T Hackathon. Ended up joining a team with two complete strangers, Joan Jasak and Arunabh Verma, and we end up presenting at the Conference and taking 3rd place. Check out the video http://vimeo.com/78582527

  • Work Blog: Managing Your Linux Deployments with Spacewalk

    I have been using Spacewalk for a while now and really like a lot of the built-in functionality. I have been using it to build out and manage a lot of my Red Hat, and CentOS installations.

    The latest thing I have been using it for it to manage is my Hadoop cluster build out and configuration updates. I think that it helps to be able to control as much of it as possible from one management system. I know there are applications like Ambari out there, but to be honest who wants to add another tool if they don’t have to go to my site.

    Here’s the link to my work blog about it.

    http://gotomojo.com/managing-your-linux-deployments-with-spacewalk/

  • What does Facebook consider an average day's worth of data?

    Well according to this article from gigaom.com. The average day looks something like this.

    • 2.5 billion content items shared per day (status updates + wall posts + photos + videos + comments)
    • 2.7 billion Likes per day
    • 300 million photos uploaded per day
    • 100+ petabytes of disk space in one of FB’s largest Hadoop (HDFS) clusters
    • 105 terabytes of data scanned via Hive, Facebook’s Hadoop query language, every 30 minutes
    • 70,000 queries executed on these databases per day
    • 500+terabytes of new data ingested into the databases every day

    I also love this quote from the VP of Infrastructure.

    “If you aren’t taking advantage of big data, then you don’t have big data, you have just a pile of data,” said Jay Parikh, VP of infrastructure at Facebook on Wednesday. “Everything is interesting to us.”

  • Big Data for Small Business

    I have said it before and will say it again you don’t have to be fortune 500 company to use Big Data. Big Data is more about understanding your data, then it is about how big it is and understanding all your different data sources and gathering them into one place, so that you can analyze and understand it better.

    http://www.pcworld.com/article/2047486/how-small-businesses-can-mine-big-data.html

  • CentOS 6.4 service virt-who won't start – work around

    Here is the problem.

    [root@bob ~]# service virt-who start

    Démarrage de virt-who : Traceback (most recent call last):

    File “/usr/share/virt-who/virt-who.py”, line 33, in <module>

    from subscriptionmanager import SubscriptionManager, SubscriptionManagerError

    File “/usr/share/virt-who/subscriptionmanager.py”, line 24, in <module>

    import rhsm.connection as rhsm_connection

    ImportError: No module named rhsm.connection

    [FAILED]

     

    There is a simple work around. Install the Scientific Linux 6 python-rhsm package.

     

    Name : python-rhsm

    Version : 1.1.8 Vendor : Scientific Linux

    Release : 1.el6 Date : 2013-02-22 01:54:26

    Group : Development/Libraries Source RPM : python-rhsm-1.1.8-1.el6.src.rpm

    Size : 0.27 MB

    Packager : Scientific Linux

    Summary : A Python library to communicate with a Red Hat Unified Entitlement Platform

    Description :

    A small library for communicating with the REST interface of a Red Hat Unified

    Entitlement Platform. This interface is used for the management of system

    entitlements, certificates, and access to content.

     

    First install python-simplejson

     

    [root@bob ~]# yum install python-simplejson

     

    Then pick a mirror from http://rpm.pbone.net/index.php3/stat/4/idpl/20813982/dir/scientific_linux_6/com/python-rhsm-1.1.8-1.el6.x86_64.rpm.html and download python-rhsm-1.1.8-1.el6.x86_64.rpm and install it

     

    [root@bob ~]# rpm –install python-rhsm-1.1.8-1.el6.x86_64.rpm

     

    Then start virt-who

     

    [root@bob ~]# service virt-who start

  • What's New with MongoDB Hadoop Integration.

    I attend this Webinar yesterday and it was pretty good. I like how straight they interaction between MongoDB and Hadoop. Check it out if you get a chance.

    http://www.10gen.com/presentations/webinar-whats-new-mongodb-hadoop-integration