Classifier


Available in VPC

A classifier helps determine the schema when unstructured data is scanned. Even without a classifier configured, the scanner automatically infers the schema of files such as CSV, JSON, and XML. With a classifier configured, however, you can have files without a defined format interpreted in a format you specify, or generate metadata from only specific data. You can create and manage classifiers in the Classifier menu.

Create a classifier

To create a classifier:

  1. In the VPC environment on the NAVER Cloud Platform console, click the menu icon, and then navigate to Services > Big Data & Analytics > Data Catalog.
  2. Navigate to Configuration > Classifier.
  3. Click [Create classifier].
  4. In the Create classifier popup, enter the classifier information.
    • Name: Classifier name
    • Type: Data file format
    • Parsing settings: Enter the parsing settings based on the data file format.
      • CSV: Select a delimiter to separate data columns.
      • JSON: Specify a path to generate metadata only from the specified data.
        • To use the store field, specify its path (e.g., $.store).
        • To use the author field under the book field within the store field, specify the full path (e.g., $.store.book[*].author).
      • XML: Specify a row tag to generate metadata only from the specified node data.
  5. Click [Create].
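
The effect of each parsing setting in step 4 can be pictured with a small local sketch. This is not how Data Catalog implements classifiers; it only illustrates, using the Python standard library, which data a delimiter, a JSON path, and a row tag would pick out. The function names (`csv_columns`, `select_path`, `xml_rows`) and the sample data are hypothetical, and `select_path` handles only a tiny subset of path syntax (`$`, `.name`, and `name[*]`), which is an assumption made for this example.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical sample data, invented for illustration only.
CSV_TEXT = "title|author\nDune|Herbert\nEmma|Austen\n"
JSON_TEXT = '{"store": {"book": [{"author": "Herbert"}, {"author": "Austen"}]}}'
XML_TEXT = (
    "<store><book><author>Herbert</author></book>"
    "<book><author>Austen</author></book></store>"
)


def csv_columns(text, delimiter):
    """Return the header columns obtained with the given delimiter."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    return next(reader)


def select_path(document, path):
    """Evaluate a tiny path subset: '$', '.name', and 'name[*]'."""
    nodes = [document]
    for step in path.lstrip("$").strip(".").split("."):
        if not step:
            continue
        wildcard = step.endswith("[*]")
        key = step[:-3] if wildcard else step
        # Descend into the named field of every object selected so far.
        nodes = [n[key] for n in nodes if isinstance(n, dict) and key in n]
        if wildcard:
            # Expand each selected array into its elements.
            nodes = [item for n in nodes if isinstance(n, list) for item in n]
    return nodes


def xml_rows(text, row_tag):
    """Return one dict per element named row_tag, built from its children."""
    root = ET.fromstring(text)
    return [{child.tag: child.text for child in row} for row in root.iter(row_tag)]


data = json.loads(JSON_TEXT)
print(csv_columns(CSV_TEXT, "|"))   # ['title', 'author']
print(csv_columns(CSV_TEXT, ","))   # ['title|author'] (wrong delimiter, one column)
print(select_path(data, "$.store.book[*].author"))  # ['Herbert', 'Austen']
print(xml_rows(XML_TEXT, "book"))   # [{'author': 'Herbert'}, {'author': 'Austen'}]
```

The second `csv_columns` call shows why the delimiter matters: with the wrong delimiter, the whole header collapses into a single column.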

Search classifiers and view information

To search classifiers you created and view information:

  1. In the VPC environment on the NAVER Cloud Platform console, click the menu icon, and then navigate to Services > Big Data & Analytics > Data Catalog.
  2. Navigate to Configuration > Classifier.
  3. In the search bar, enter the classifier name and click the search icon to find the classifier and view its information.
    • Type: Data file format
    • Name: Classifier name
    • Details: Parsing settings specified during classifier creation
    • Creation date and time: Classifier creation date and time
    • Update date and time: Date and time when the classifier was last modified

Edit classifiers

To edit the information of classifiers you created:

  1. In the VPC environment on the NAVER Cloud Platform console, click the menu icon, and then navigate to Services > Big Data & Analytics > Data Catalog.
  2. Navigate to Configuration > Classifier.
  3. Click the classifier you want to edit, and then click [Edit].
  4. In the Edit classifier popup, edit the classifier information.
  5. Click [Edit].

Delete a classifier

Caution
  • If you delete a classifier, scanners connected to it might not work correctly.
  • A deleted classifier cannot be recovered.

To delete a classifier you created:

  1. In the VPC environment on the NAVER Cloud Platform console, click the menu icon, and then navigate to Services > Big Data & Analytics > Data Catalog.
  2. Navigate to Configuration > Classifier.
  3. Click the classifier you want to delete, and then click [Delete].
  4. When the notification popup window appears, read the cautions and click [Delete].