Using APIs


Available in Classic and VPC

This section describes the APIs provided in the Explorer menu.

Reranker API

The Reranker API evaluates the relevance between retrieved documents and a query, selects the most relevant documents, and summarizes and compresses them to generate a RAG answer. You can use it to refine search results, and because only the selected documents (rather than every retrieved document) are used to generate the answer, resources are used more efficiently.
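The selection step can be pictured with the sketch below. This is a minimal, hypothetical illustration: the scoring itself is done by the Reranker API, and the function name, score format, and threshold here are illustrative assumptions, not the service's actual interface.

```python
def select_relevant(docs, scores, threshold=0.5):
    """Keep only documents scored above a threshold, highest first,
    so that answer generation sees a smaller, more relevant set."""
    ranked = sorted(zip(docs, scores), key=lambda pair: pair[1], reverse=True)
    return [doc for doc, score in ranked if score >= threshold]

docs = ["refund policy", "shipping times", "company history"]
scores = [0.91, 0.42, 0.77]  # hypothetical relevance scores from a reranker
selected = select_relevant(docs, scores)  # only the two most relevant documents
```

Passing only `selected` (not all of `docs`) to answer generation is what makes the resource usage more efficient.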

RAG Reasoning API

RAG answers include elements that improve reliability, such as citation sources and indexed citation markers. By using the RAG Reasoning model, which is trained to produce these answer types, you can deliver evidence-based RAG answers to users. RAG Reasoning invokes its engine through a function-calling format. You can specify one or multiple RAG functions, and the LLM autonomously selects the optimal function for the situation to perform retrieval-augmented generation.

The RAG Reasoning API can be used alone for retrieval-augmented generation. However, when chained with the Reranker API, it provides a more stable RAG answering system that supports multi-turn and multi-query requests.
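A retrieval function definition might look like the sketch below. The field names follow the common function-calling (JSON Schema) convention and are assumptions for illustration; the actual RAG Reasoning request schema may differ.

```python
# Hypothetical tool definition the LLM can choose to call for retrieval.
rag_search_tool = {
    "name": "search_documents",
    "description": "Retrieve documents relevant to the user's question.",
    "parameters": {
        "type": "object",
        "properties": {
            "query": {"type": "string", "description": "Search query text"},
            "topK": {"type": "integer", "description": "Number of documents to return"},
        },
        "required": ["query"],
    },
}
# Several such functions can be registered; the model picks the best one per request.
```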

[Figure: chaining the Reranker and RAG Reasoning APIs]

Token Calculator API

The Token Calculator API counts the number of tokens in an input sentence. You can use it to determine the optimal token count or to build more efficient prompts.
Token Calculator (Chat) and Token Calculator (Chat v3) count the tokens in sentences submitted to the Chat Completions and Chat Completions v3 APIs, which support HyperCLOVA X models.
Token Calculator (Embedding v2) calculates token counts for sentences submitted to the bge-m3 model in the Embedding v2 API.

Sliding Window API

The Sliding Window API ensures that conversations continue smoothly by adjusting the prompt and output so that the total stays within the maximum number of tokens the HyperCLOVA X language model can process.

In chat mode, if the total number of tokens in the conversation between the user and the assistant exceeds the model's maximum token limit, new messages can no longer be generated. The Sliding Window API prevents this by deleting the oldest turns in the conversation history, removing them in order starting with the first turn entered after the system instruction.

Caution
  • Because turns are deleted in order from the beginning, newly generated messages may not fully reflect earlier conversation context.
  • If the maximum output token count (the API's maxTokens value) is set very high, more conversation history will be deleted in proportion, causing new messages to reflect even less of the previous dialogue.
Note
  • The Sliding Window API works only with models supported by the Chat Completions API. It does not work with models supported by the Chat Completions v3 API.
  • The output of the Sliding Window API must be passed directly to the Chat Completions API in the correct order.
  • Settings such as model name, maxTokens, and others must match the settings used in the Chat Completions API.

How the Sliding Window Works

If the sum of the tokens in the input conversation history (A) and the maximum number of tokens reserved for newly generated content (B, the maxTokens value) exceeds the model's maximum token limit (X), that is, if A + B > X, the Chat Completions API cannot generate a new message. To resolve this, the Sliding Window API deletes old conversation turns until at least the excess (A + B − X) tokens have been removed. Deletion always occurs in whole-turn units; a turn is never partially deleted, and the minimum number of turns needed to cover the excess is removed.
For example, if the excess is 200 tokens and the two oldest turns contain 100 and 200 tokens, both turns are deleted. If the excess were 100 tokens or less, only the oldest turn would be deleted. Deletion occurs per turn, not per user/assistant message pair.
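The deletion rule can be sketched in a few lines. This is an illustrative reimplementation, not the service code, and the whitespace token counter is a stand-in for the real tokenizer (the Token Calculator API gives exact counts).

```python
def apply_sliding_window(messages, max_tokens, model_limit, count_tokens):
    """Drop the oldest non-system turns, one whole turn at a time,
    until history tokens (A) + reserved output (B) fit the limit (X)."""
    system = [m for m in messages if m["role"] == "system"]
    turns = [m for m in messages if m["role"] != "system"]

    def history_tokens():
        return sum(count_tokens(m["content"]) for m in system + turns)

    while turns and history_tokens() + max_tokens > model_limit:
        turns.pop(0)  # delete the oldest turn after the system instruction
    return system + turns

count = lambda text: len(text.split())  # stand-in tokenizer
history = [
    {"role": "user", "content": "tok " * 100},       # oldest turn: 100 tokens
    {"role": "assistant", "content": "tok " * 200},  # 200 tokens
    {"role": "user", "content": "tok " * 600},       # 600 tokens
]
# A = 900, B = 300, X = 1000: the excess is 200, so the two oldest turns go.
trimmed = apply_sliding_window(history, max_tokens=300,
                               model_limit=1000, count_tokens=count)
```

This mirrors the example above: a 200-token excess removes both the 100-token and the 200-token turn, since turns are never split.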
[Figure: how the Sliding Window API deletes conversation turns]

Sliding Window Processing Flow

By applying the Sliding Window API, you can keep using the Chat Completions API without manually managing the total token count of the conversation.

The Sliding Window API workflow is as follows:

  1. Before calling the Chat Completions API, first call the Sliding Window API and include the prompt you want to input (conversation history; messages) in the request.
  2. Insert the messages value returned in the Sliding Window API response, as-is, into the Chat Completions API request.
  3. Make sure that the model name and the maximum token count are set identically in both the Chat Completions API and the Sliding Window API.
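Step 3 can be enforced by building both request bodies from one shared settings object, so the two calls cannot drift apart. The maxTokens parameter comes from the text above; the modelName value and the overall request shape are illustrative assumptions.

```python
# Shared settings used by both calls; the modelName value is a placeholder.
shared = {"modelName": "HCX-003", "maxTokens": 256}

def build_sliding_window_request(messages):
    return {"messages": messages, **shared}

def build_chat_request(trimmed_messages):
    # trimmed_messages is the messages array returned by the Sliding Window API.
    return {"messages": trimmed_messages, **shared}

sw_req = build_sliding_window_request([{"role": "user", "content": "Hello"}])
chat_req = build_chat_request(sw_req["messages"])  # pass the response messages as-is
```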

Summarization API

The Summarization API divides a given text into paragraphs and generates summaries for each paragraph.

It can split long documents into contextually meaningful sections and summarize each section. This reduces overall text length by removing unnecessary content while preserving key information. By using the paragraph-splitting options (segMaxSize and segMinSize), you can constrain the number of characters per paragraph and adjust the length of the summaries.
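The paragraph-splitting options might be supplied as in this sketch. The segMaxSize and segMinSize parameter names come from the text above; the surrounding request shape and values are assumptions for illustration.

```python
# Hypothetical Summarization API request body with paragraph-splitting options.
summarization_request = {
    "texts": ["<long document text>"],
    "segMaxSize": 1000,  # upper bound on characters per paragraph
    "segMinSize": 300,   # lower bound on characters per paragraph
}
```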

[Figure: Summarization API processing]

You can use the Summarization API in the following ways:

  • Summarize long meeting transcripts and extract key points.
  • Condense important information from long emails for quicker understanding.
  • Create concise summaries of reports or scripts. Because it divides content into contextually appropriate paragraphs, it can also help generate a table of contents.

Embedding API

The Embedding API converts input text into numerical vector values. You can choose from three models depending on your task and objectives. Each model returns a different similarity score for the same sentence pair. If you are new to embeddings, it is recommended that you use Embedding v2, which supports the Token Calculator API.

| Tool name | Model name | Max tokens | Vector dimension | Recommended distance metric | Notes |
|---|---|---|---|---|---|
| Embedding | clir-emb-dolphin | 500 tokens | 1024 | IP (inner product; also called dot or scalar product) | Token calculator not supported |
| Embedding | clir-sts-dolphin | 500 tokens | 1024 | Cosine similarity | Token calculator not supported |
| Embedding v2 | bge-m3 | 8,192 tokens | 1024 | Cosine similarity | Token calculator supported; open-source model |

The characteristics of each model supported by the Embedding API are as follows.
[Figure: characteristics of each embedding model]

Note

To obtain output identical to the open-source bge-m3 model when using Embedding v2, keep the following in mind:

  • Embedding v2 returns the dense output type among the three bge-m3 output formats (dense, sparse, and multi-vector/ColBERT).
  • Embedding v2 does not apply FP16 or normalization.
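Because Embedding v2 does not normalize its output, you can apply L2 normalization yourself when you want cosine similarity to reduce to a simple dot product. A minimal sketch:

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length; after this, cosine similarity
    between two normalized vectors equals their dot product."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)

unit = l2_normalize([3.0, 4.0])  # [0.6, 0.8]
```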

Embedding API Processing Flow

A typical embedding workflow consists of preparing data, generating embeddings, storing the resulting vectors, and serving retrieval results through an API. Through this process, you can store embedded vectors in a database or another retrieval system and query them later.

The embedding workflow is as follows:
[Figure: embedding workflow]

  1. Convert the file you want to embed into text.
  2. Depending on the type and purpose of the text, divide it appropriately using paragraph splitting or the Summarization API.
  3. Select an appropriate embedding model and generate vector embeddings for the text.
  4. Store both the embedded vectors and the original text together in a vector database.
  5. Convert the user's query into a vector, compare it with the vectors stored in the database to find the most similar ones, then retrieve the mapped original text and generate the final result.
  6. Insert the API result into a prompt and use the Chat Completions API to generate the final output in the desired format.
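Steps 4 and 5 can be sketched with a toy in-memory store. A real system would use a vector database, and the two-dimensional vectors here are hypothetical stand-ins for actual embeddings.

```python
import math

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Step 4: store each embedded vector together with its original text.
store = [
    ([0.9, 0.1], "Refunds are processed within 7 days."),
    ([0.1, 0.9], "Our office is closed on weekends."),
]

# Step 5: embed the query, then return the text mapped to the most similar vector.
def retrieve(query_vec, store, top_k=1):
    ranked = sorted(store, key=lambda item: cosine(query_vec, item[0]), reverse=True)
    return [text for _, text in ranked[:top_k]]

best = retrieve([1.0, 0.0], store)  # the refund document is the closest match
```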

Embedding Use Cases

You can use the Embedding API in the following ways:

  • Improve search performance by calculating vector similarity between sentences. For example, you can measure similarity between a user's query vector and document vectors to return the most relevant documents.
  • Calculate similarity between two sentences to determine document relatedness or compare semantic similarity.
  • Group documents with similar characteristics into clusters.
  • Classify documents. You can perform various classification tasks (such as topic or sentiment classification) by using vectorized text data with a trained model.

Processing Long Text

The maximum text length that the Embedding API can process in a single request is 500 tokens for clir-emb-dolphin and clir-sts-dolphin, or 8,192 tokens for Embedding v2 (bge-m3). Chunking, which divides text into smaller segments, is recommended when a text exceeds these token limits; it can be done through basic text splitting, paragraph segmentation, or summarization. When chunking, it is important to divide the text into meaningful units so that information can be extracted accurately.

The types of chunking and their advantages and disadvantages are as follows:

| Method | Description | Advantages | Disadvantages |
|---|---|---|---|
| Basic text splitting | Divides the text into fixed-length segments. | Allows fine-grained segmentation, making it easier to extract the exact answer to a query. | Because text is cut by length rather than meaning, segments may begin or end mid-thought, making the overall context harder to follow. |
| Paragraph splitting | Splits the text into meaningful paragraphs based on context. | Groups text into semantically coherent units, which can improve embedding performance. | If paragraphs are long, it may be difficult to pinpoint the exact passage containing the answer to a query. |
| Summarization | Produces concise summaries of long text by focusing on key content. | Can cover longer stretches of context than paragraph splitting, making it suitable for embedding entire documents. | Because the text is condensed, fine-grained details may be omitted. |
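Basic text splitting from the table above can be sketched as fixed-length chunking. Whitespace tokens are used here as a rough stand-in for real model tokens (the Token Calculator API gives exact counts), and the overlap parameter is an illustrative addition that softens the hard cut at each boundary.

```python
def chunk_text(text, max_tokens=500, overlap=50):
    """Split text into fixed-length chunks of at most max_tokens words,
    with a small overlap so context is not cut dead at each boundary."""
    tokens = text.split()
    chunks = []
    step = max_tokens - overlap
    for start in range(0, len(tokens), step):
        chunks.append(" ".join(tokens[start:start + max_tokens]))
        if start + max_tokens >= len(tokens):
            break
    return chunks

doc = "word " * 1200          # a 1,200-token document
pieces = chunk_text(doc)      # three chunks: 500, 500, and 300 tokens
```

The 500-token default matches the clir-emb-dolphin and clir-sts-dolphin limits noted above; for Embedding v2, max_tokens could be raised toward 8,192.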
Note

CLOVA Studio provides the Token Calculator (Embedding v2) API, the Paragraph Splitting API, and the Summarization API. For more details, see the Token Calculator API, Paragraph Splitting API, and Summarization API.

Citation information for the bge-m3 model is as follows.

@misc{bge-m3,
    title={BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation},
    author={Jianlv Chen and Shitao Xiao and Peitian Zhang and Kun Luo and Defu Lian and Zheng Liu},
    year={2024},
    eprint={2402.03216},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

Paragraph Splitting API

The Paragraph Splitting API divides text into topic-based segments by calculating similarity between sentences. You can specify the maximum number of tokens allowed per paragraph, and even if the original text contains no blank lines or has unclear boundaries, the API can determine the appropriate segmentation based on context. You can control the number of paragraphs using the segCnt parameter. To perform automatic paragraph splitting, set the segCnt parameter to -1. If the value is 0 or higher, the API splits the text into exactly the number of paragraphs specified.
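The segCnt behavior described above might look like this in a request body. The segCnt parameter name and its -1 convention come from the text; the rest of the request shape is an assumption for illustration.

```python
def build_split_request(text, seg_count=-1):
    """segCnt = -1 requests automatic paragraph splitting;
    a non-negative value requests exactly that many paragraphs."""
    return {"text": text, "segCnt": seg_count}

auto_req = build_split_request("A long article about embeddings.")                 # automatic splitting
fixed_req = build_split_request("A long article about embeddings.", seg_count=5)   # exactly 5 paragraphs
```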

The workflow for the Paragraph Splitting API is as follows:

[Figure: Paragraph Splitting API workflow]