External Hive Metastore integration with Data Catalog (optional)
    • PDF

    External Hive Metastore integration with Data Catalog (optional)

    • PDF

    Article Summary

    Available in VPC

    It describes how to integrate Cloud Hadoop's Hive Metastore storage with NAVER Cloud Platform Data Catalog.

    Preparations

    1. Subscribe to the Data Catalog.

    To use Hive Metastore with Data Catalog

    1. It can be integrated with Cloud Hadoop version 2.0 or higher.
    2. N number of Cloud Hadoop clusters can be freely integrated with Data Catalog storage.

    Integrate external Hive Metastores

    Cloud Hadoop is automatically integrated with the Data Catalog service when it is created by checking whether the catalog is being used.
    Hive, Presto, Trino, Impala, and Spark services provided by Cloud Hadoop can also be used by utilizing Data Catalog as a meta repository.

    chadoop-datacatalog-cluster_en

    You can check the status of the Data Catalog integrated with Cloud Hadoop through the cluster details.

    chadoop-datacatalog_en

    Considerations when using Data Catalog

    1. If you create a Hive table without specifying a LOCATION, it will be saved to the object storage bucket address used when creating the data catalog.
    2. Even if you delete the Cloud Hadoop cluster, the table information created in the Data Catalog remains intact.

    Was this article helpful?

    Changing your password will log you out immediately. Use the new password to log back in.
    First name must have atleast 2 characters. Numbers and special characters are not allowed.
    Last name must have atleast 1 characters. Numbers and special characters are not allowed.
    Enter a valid email
    Enter a valid password
    Your profile has been successfully updated.