Available in VPC
A classifier provides classification features to determine schema when scanning unstructured data. Even without a classifier configured, the scanner automatically infers schema for files such as CSV, JSON, and XML. However, with a classifier configured, you can infer data in a specified format for files without a defined format, or generate metadata using only specific data. In the Classifier menu, you can create and manage classifiers.
Create a classifier
To create a classifier:
- In the VPC environment on the NAVER Cloud Platform console, navigate to
> Services > Big Data & Analytics > Data Catalog. - Navigate to Configuration > Classifier.
- Click [Create classifier].
- In the Create classifier popup, enter the classifier information.
- Name: Classifier name
- Type: Data file format
- Parsing settings: Enter the parsing settings based on the data file format.
- CSV: Select a delimiter to separate data columns.
- JSON: Specify a path to generate metadata only from the specified data.
- Specify the store field (e.g.,
$.store.). - Specifies the author field under the book field within the store field (e.g.,
$.store.book[*].author).
- Specify the store field (e.g.,
- XML: Specify a row tag to generate metadata only from the specified node data.
- Click [Create].
Search classifiers and view information
To search classifiers you created and view information:
- In the VPC environment on the NAVER Cloud Platform console, navigate to
> Services > Big Data & Analytics > Data Catalog. - Navigate to Configuration > Classifier.
- In the search bar, enter the classifier name and click
to search for the classifier and view its information.
- Type: Data file format
- Name: Classifier name
- Details: Parsing settings specified during classifier creation
- Creation date and time: Classifier creation date and time
- Update date and time: Date and time when the classifier is last modified
Edit classifiers
To edit the information of classifiers you created:
- In the VPC environment on the NAVER Cloud Platform console, navigate to
> Services > Big Data & Analytics > Data Catalog. - Navigate to Configuration > Classifier.
- Click the classifier you want to edit, and then click [Edit].
- In the Edit classifier popup, edit the classifier information.
- For more information on each item, see Create classifier.
- Click [Edit].
Delete a classifier
To delete a classifier you created:
Caution
- If you delete a classifier, scanners connected to it might not work correctly.
- A deleted classifier cannot be recovered.
- In the VPC environment on the NAVER Cloud Platform console, navigate to
> Services > Big Data & Analytics > Data Catalog. - Navigate to Configuration > Classifier.
- Click the classifier you want to delete, and then click [Delete].
- When the notification popup window appears, read the cautions and click [Delete].