External Hive Metastore integration with Data Catalog (optional)

The latest service changes have not yet been reflected in this content. We will update the content as soon as possible. Please refer to the Korean version for information on the latest updates.

Available in VPC

It describes how to integrate Cloud Hadoop's Hive Metastore storage with NAVER Cloud Platform Data Catalog.

Preparations

Subscribe to the Data Catalog.
- For more information on using Data Catalog, see the Getting started with Data Catalog guide.

To use Hive Metastore with Data Catalog

It can be integrated with Cloud Hadoop version 2.0 or higher.
N number of Cloud Hadoop clusters can be freely integrated with Data Catalog storage.

Integrate external Hive Metastores

Cloud Hadoop is automatically integrated with the Data Catalog service when it is created by checking whether the catalog is being used.
Hive, Presto, Trino, Impala, and Spark services provided by Cloud Hadoop can also be used by utilizing Data Catalog as a meta repository.

chadoop-datacatalog-cluster_en

You can check the status of the Data Catalog integrated with Cloud Hadoop through the cluster details.

chadoop-datacatalog_en

Considerations when using Data Catalog

If you create a Hive table without specifying a LOCATION, it will be saved to the object storage bucket address used when creating the data catalog.
Even if you delete the Cloud Hadoop cluster, the table information created in the Data Catalog remains intact.