Tableau connection

    Available in VPC

    Tableau supports fast, easy data visualization and is currently one of the most commonly used BI solutions.
    This guide explains how to connect NAVER Cloud Platform's Cloud Hadoop to Tableau.

    For more information about Tableau, refer to the official Tableau website.

    Preparations

    1. Create a Cloud Hadoop cluster.
    2. Create an Object Storage bucket.
    3. Create a Windows Server.
    Note

    We recommend creating Cloud Hadoop and Windows Server within the same VPC.

    4. Set up the ACG.
      • Add port 8286 to the Cloud Hadoop ACG.
        hadoop-vpc-use-ex14_pre1_vpc_en
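Once the ACG rule is in place, reachability of port 8286 can be checked from the Windows Server before installing anything else. The following is a minimal sketch using only the Python standard library; the function name `port_is_open` is hypothetical, and the host you pass in is whatever domain or IP you use to reach the cluster.

```python
import socket


def port_is_open(host: str, port: int = 8286, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

A `False` result usually points to a missing ACG rule or a different VPC or subnet, but it only tests TCP reachability, not whether Presto itself is healthy.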

    Create table in Hive

    1. Upload a sample data file to Object Storage.

      • Download and decompress the sample data, and then upload the AllstarFull.csv file in Object Storage > Bucket Management.
        chadoop-4-5-001-vpc_en.png
    2. Create a table in the HUE Hive editor.

    DROP TABLE IF EXISTS allstarfull;
    
    -- LOCATION is the Object Storage directory that contains AllstarFull.csv;
    -- replace the bucket name and path with your own.
    CREATE EXTERNAL TABLE IF NOT EXISTS `allstarfull` (
            `playerID` VARCHAR(20),
            `yearID` INT,
            `gameNum` INT,
            `gameID` VARCHAR(30),
            `teamID` VARCHAR(4),
            `lgID` VARCHAR(4),
            `GP` INT,
            `startingPos` INT
    )
    ROW FORMAT DELIMITED
    FIELDS TERMINATED BY ','
    LOCATION 's3a://deepdrive-hue/input/lahman2012/allstarfull';
    
    3. Run a simple query to check that the table was created successfully.
    SELECT * FROM allstarfull;
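If the query returns unexpected NULL values, one possible cause is a row whose field count does not match the eight columns of the table. The sketch below checks a CSV file for such rows before or after upload. The function name `find_bad_rows` and the sample rows are made up for illustration. Note that Python's `csv` module understands quoted fields, whereas Hive's delimited format splits on every comma, so treat the result as an approximation. Also, if the file starts with a header line, Hive reads it as a data row.

```python
import csv
import io

# Column names taken from the CREATE TABLE statement in this guide.
EXPECTED_COLUMNS = ["playerID", "yearID", "gameNum", "gameID",
                    "teamID", "lgID", "GP", "startingPos"]


def find_bad_rows(lines, expected=len(EXPECTED_COLUMNS)):
    """Return (row_number, field_count) for rows with the wrong field count."""
    bad = []
    for row_number, row in enumerate(csv.reader(lines), start=1):
        if len(row) != expected:
            bad.append((row_number, len(row)))
    return bad


# Made-up rows for demonstration only; the second row is missing three fields.
sample = io.StringIO(
    "player01,2000,0,GAME01,AAA,AL,1,1\n"
    "player02,2001,0,GAME02,BBB\n"
)
print(find_bad_rows(sample))  # → [(2, 5)]
```

To check the real file, pass an open file object instead, for example `open("AllstarFull.csv", newline="")`.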
    

    Add Presto connector

    1. Add a connector in Presto > [CONFIGS] > Advanced connectors.properties.
      • Since you need to add a Hive connector, enter the following in connectors.to.add:
    {"hive":["connector.name=hive-hadoop2",
            "hive.metastore.uri=thrift://<METASTORE-HOST-IP>:9083",
            "hive.config.resources=/etc/hadoop/conf/core-site.xml,/etc/hadoop/conf/hdfs-site.xml",
            "hive.s3.use-instance-credentials=false",
            "hive.s3.aws-access-key=<API-ACCESS-KEY>",
            "hive.s3.aws-secret-key=<API-SECRET-KEY>",
            "hive.s3.endpoint=https://kr.object.private.ncloudstorage.com"]
            }
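Because the connectors.to.add value is JSON, a missing quote or comma is an easy mistake to make when editing it by hand. One way to avoid this is to generate the value, as in the sketch below. The function name `build_connectors_to_add` is hypothetical; the property names and the default endpoint are copied from the configuration above, and the arguments stand in for the placeholders.

```python
import json


def build_connectors_to_add(metastore_ip, access_key, secret_key,
                            endpoint="https://kr.object.private.ncloudstorage.com"):
    """Return the connectors.to.add value as a single-line JSON string."""
    props = [
        "connector.name=hive-hadoop2",
        f"hive.metastore.uri=thrift://{metastore_ip}:9083",
        "hive.config.resources=/etc/hadoop/conf/core-site.xml,"
        "/etc/hadoop/conf/hdfs-site.xml",
        "hive.s3.use-instance-credentials=false",
        f"hive.s3.aws-access-key={access_key}",
        f"hive.s3.aws-secret-key={secret_key}",
        f"hive.s3.endpoint={endpoint}",
    ]
    # json.dumps takes care of quoting and commas.
    return json.dumps({"hive": props})


# Placeholder values; substitute your own before pasting the output.
print(build_connectors_to_add("<METASTORE-HOST-IP>",
                              "<API-ACCESS-KEY>", "<API-SECRET-KEY>"))
```

The printed string can be pasted into the connectors.to.add field. Avoid committing real access keys to scripts or version control.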
    
    Note

    <METASTORE-HOST-IP> is the private IP address of the m-001 node. You can find it in Ambari > Hosts.

    hadoop-vpc-use-ex14_connect1_vpc_en

    2. Restart the service to apply the configuration. Click [ACTIONS] > [Restart All] in the upper right corner, and then click the [CONFIRM RESTART ALL] button in the pop-up window.
    Note

    For more information about analyzing data stored in the Hive data warehouse with Presto, refer to the Analyzing Hive Warehouse data using Presto guide.

    Install Tableau

    Perform all the steps below on the Windows Server.

    1. Download Tableau Desktop from the Tableau website.
    2. Download the Presto JDBC driver (presto-jdbc-0.268.jar) from the Presto website.
    3. Move the Presto JDBC driver to the drivers directory of the Tableau installation.
      hadoop-vpc-use-ex14_install1_vpc_en
    Note

    In addition to Presto, Tableau can explore data from a wide range of sources, including spreadsheets and databases. For information on supported connectors, refer to Tableau Desktop and Web Authoring Help.

    Install nginx-ssl.crt certificate

    1. Copy the /etc/nginx/ssl/nginx-ssl.crt certificate from the Cloud Hadoop edge node to the Windows Server.

    2. Double-click the nginx-ssl.crt certificate, and then click the [Install certificate] button.
      hadoop-vpc-use-ex14_install3_vpc_en

    3. Select "Local Machine," and then click the [Next] button.
      hadoop-vpc-use-ex14_install4_vpc_en

    4. Select "Place all certificates in the following store," and then choose the certificate store.

      • Set the certificate store to "Trusted Root Certification Authorities."
        hadoop-vpc-use-ex14_install5_vpc_en
    5. Click the [Finish] button to complete installing the certificate.
      hadoop-vpc-use-ex14_install6_vpc_en

    Access Presto on Tableau Desktop

    1. Run Tableau Desktop and select Connect to server > View more > Presto.
      hadoop-vpc-use-ex14_visual1_vpc_en

    2. Enter the access information and click the [Login] button.

    Access information 
     - Server: Domain of the Presto cluster
     - Port: Port of the Presto coordinator (8286)
     - Catalog: Name of the catalog to use
     - Schema: Name of the schema to use
     - User name: Name of the user
     - SSL required: Check
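The same access information can be expressed as a JDBC URL, which may help when testing the connection with another JDBC client that uses the same driver jar. The sketch below assumes the Presto JDBC URL form `jdbc:presto://host:port/catalog/schema` with `user` and `SSL` as URL parameters. The function name `presto_jdbc_url` and the example server, catalog, schema, and user values are hypothetical. Tableau itself takes these values through the dialog fields, not as a URL.

```python
from urllib.parse import urlencode


def presto_jdbc_url(server, catalog, schema, user, port=8286, ssl=True):
    """Build a Presto JDBC URL from the fields used in the Tableau dialog."""
    # urlencode escapes characters in the user name that are unsafe in a URL.
    query = urlencode({"user": user, "SSL": "true" if ssl else "false"})
    return f"jdbc:presto://{server}:{port}/{catalog}/{schema}?{query}"


# Hypothetical values for illustration only.
print(presto_jdbc_url("presto.example.internal", "hive", "default", "tableau"))
# → jdbc:presto://presto.example.internal:8286/hive/default?user=tableau&SSL=true
```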
    

    hadoop-vpc-use-ex14_visual2_vpc_en

    Check table and record

    1. Click Include and search to check the tables in the schema.
      hadoop-vpc-use-ex14_visual3_vpc_en

    2. Click the [Update now] button to load the data in the table.
      hadoop-vpc-use-ex14_visual4_vpc_en

    3. Create a new sheet to visualize the data using only drag and drop.
      hadoop-vpc-use-ex14_visual5_vpc_en

    Note

    For more information about using Tableau, refer to the official Tableau website.

