Creating Ncloud TensorFlow Server (VPC, Private Subnet)

Available in VPC

1. About Ncloud TensorFlow Server

Ncloud TensorFlow Server is a server with common deep learning and machine learning packages preinstalled, including TensorFlow, one of the most popular among data analysts.

Packages for data preprocessing, visualization, natural language processing, machine learning, and deep learning are included, so most analysis jobs can be performed right away. If necessary, you can install additional packages or upgrade existing ones by using the GUI or console commands in Jupyter Notebook.

The web-based Jupyter Notebook allows you to perform jobs in the same analysis environment regardless of location, instantly check analysis results, and easily share them with others.

It also provides an administration script that enables you to easily manage Jupyter processes in a terminal environment.

For analysts who are just starting out, examples are provided for machine learning (regression, classification, and clustering using scikit-learn), deep learning regression and classification using Keras and TensorFlow, and data exploration and visualization.

2. FAQ before use

Q. What is Ncloud TensorFlow Server?

Ncloud TensorFlow Server facilitates quick implementation of deep learning and machine learning by providing common deep learning and machine learning packages, including TensorFlow, as well as the web-based Jupyter Notebook in which you can write code.
Ncloud TensorFlow Server uses TensorFlow, which is an open source machine learning software library developed by the Google Brain team.

Q. What kinds of products are there?

There are two products, ubuntu-16 and centOS-7, depending on the OS. (OS versions may differ.)
The only difference lies in the OS, and the packages provided are the same. If you don't have a particular preference, then we recommend that you select the Ubuntu product, which is more commonly used.

Q. What types of packages are available, and can I use only the provided packages?

The main packages include TensorFlow, Keras, and PyTorch.
Packages are managed with Anaconda, so you can use the conda command to install additional packages.
Please refer to the Manage package section of the Jupyter Notebook user guide for information about installing and managing packages.

Q. Can I use Java or other languages besides Python?

Although TensorFlow provides APIs for other languages such as Java and Go, their stability isn't guaranteed. We recommend that you use Python.

Q. What server types are available?

The Ncloud TensorFlow Server types provided by NAVER Cloud Platform are divided into Standard, High Memory, CPU Intensive, and GPU. The features of each server type are as follows.

Standard: The standard server of the NAVER Cloud Platform service that can be used for various IT businesses. It provides well-balanced server specifications with high availability and stability.
High Memory: A server with more than 64 GB of memory, suitable for applications that are sensitive to memory performance. Up to 10 High Memory servers can be created, and the limit can be adjusted by making a request to Support.
CPU Intensive: A server with Intel Xeon processors that comes with a TensorFlow build optimized for AVX-512 vector operations. It is suitable for high-performance deep learning workloads. This type currently supports only the Ubuntu 16.04 OS.
GPU: A server equipped with GPUs with V100 and T4 specifications. It is suitable for workloads that require fast data processing. Up to 5 servers can be created, and the limit can be adjusted by making a request to Support.

Q. Ncloud TensorFlow Server has been created. Where do I write the code?

A Ncloud TensorFlow Server created in a private subnet can't be assigned a public IP, so it can't be reached directly from outside. At least one additional public subnet is required to reach it.
① Create a Ncloud TensorFlow Server VM in the private subnet.
② Create a Windows VM in the public subnet.
③ Assign a public IP to the Windows VM and connect to the Windows VM.
④ (While connected to the Windows VM) Connect to Jupyter Notebook through the private IP of the Ncloud TensorFlow Server VM.
The TensorFlow code is written in Jupyter Notebook. After connecting to the Windows VM in the additionally created public subnet, connect to [private IP address:18888] through the web browser.
The "samples" folder contains data processing, visualization, and TensorFlow example files, so please refer to them if you are new to data analysis.

Q. How do I use TensorBoard?

Connect to [private IP address:18889] through the web browser after connecting to the Windows VM of the additionally created public subnet (refer to 4. Configure Jupyter connection environment after creating Ncloud TensorFlow Server).
The TensorBoard directory is located under the default Jupyter path (/home/ncp/workspace), so you can specify this directory as the log directory in your TensorFlow code.
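
As a rough illustration of specifying the log directory, the sketch below builds a separate subdirectory per training run so that runs show up individually in TensorBoard. The helper name run_log_dir is hypothetical, and the TensorBoard directory is passed in as a parameter because this guide does not state its exact name; the path used in the usage comment is a placeholder only.

```python
import os
from datetime import datetime


def run_log_dir(tensorboard_dir, run_name=None):
    """Return a per-run log directory path under the TensorBoard directory.

    tensorboard_dir: the TensorBoard directory under /home/ncp/workspace
                     (its exact name is not given in this guide).
    run_name: optional label; defaults to a timestamp such as 20240131-093000.
    """
    if run_name is None:
        run_name = datetime.now().strftime("%Y%m%d-%H%M%S")
    return os.path.join(tensorboard_dir, run_name)


# Usage inside training code (requires TensorFlow; placeholder path):
#   log_dir = run_log_dir("/home/ncp/workspace/<tensorboard directory>")
#   callback = tf.keras.callbacks.TensorBoard(log_dir=log_dir)
#   model.fit(x_train, y_train, callbacks=[callback])
```

The Keras TensorBoard callback creates the directory when it starts writing, so the helper only computes the path.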

Q. I can't connect to Jupyter Notebook.

The Jupyter process may have stopped in the Ncloud TensorFlow Server VM. Restart the process by executing the "jup restart" command in the terminal (Refer to "Manage Jupyter process").

Q. (GPU only) When executing the TensorFlow code, the "out of memory" message is displayed and execution fails.

Method 1: When multiple programs run simultaneously, the OOM issue may occur if the program started first doesn't release its GPU memory. Shut down the program started first and try again.
Method 2: Following the official TensorFlow documentation, add code that limits GPU usage as shown below.
By default, TensorFlow maps nearly all of the memory of every GPU visible to the process (subject to CUDA_VISIBLE_DEVICES). This is done to reduce memory fragmentation and to use the relatively precious GPU memory on the devices more efficiently. To restrict the GPUs that TensorFlow can access, use the tf.config.experimental.set_visible_devices method.

import tensorflow as tf

gpus = tf.config.experimental.list_physical_devices('GPU')
if gpus:
  # Limit so that TensorFlow can only use the first GPU
  try:
    tf.config.experimental.set_visible_devices(gpus[0], 'GPU')
  except RuntimeError as e:
    # Visible devices must be set when starting the program,
    # before the GPUs have been initialized
    print(e)
In some cases, it is desirable for the process to allocate only a part of the available memory, or to grow its memory usage only as needed. TensorFlow offers two methods to control this.

The first method is to turn on memory growth by calling tf.config.experimental.set_memory_growth, which allocates only as much GPU memory as is needed at runtime. Very little memory is allocated at first, and the GPU memory region assigned to the TensorFlow process is extended as the program runs and requires more GPU memory. Memory is not released, since releasing it can worsen memory fragmentation. To turn on memory growth for a specific GPU, add the following code before allocating any tensors or running any operations.

import tensorflow as tf

gpus = tf.config.experimental.list_physical_devices('GPU')
if gpus:
  try:
    tf.config.experimental.set_memory_growth(gpus[0], True)
  except RuntimeError as e:
    # Memory growth must be set when starting the program.
    print(e)
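
The second method is to set a hard limit on the GPU memory that TensorFlow may allocate by configuring a virtual GPU device. The following is a minimal sketch based on the official TensorFlow GPU guide; the function name limit_first_gpu_memory and the 1024 MB default are illustrative assumptions and are not part of Ncloud TensorFlow Server.

```python
def limit_first_gpu_memory(memory_limit_mb=1024):
    """Cap the memory TensorFlow may allocate on the first GPU.

    Returns True if the limit was applied, False if no GPU was found
    or the GPUs were already initialized.
    """
    # TensorFlow is imported here; call this function before creating
    # any tensors or models in the program.
    import tensorflow as tf

    gpus = tf.config.experimental.list_physical_devices('GPU')
    if not gpus:
        return False
    try:
        # Create one virtual device on the first GPU with a fixed memory cap (MB).
        tf.config.experimental.set_virtual_device_configuration(
            gpus[0],
            [tf.config.experimental.VirtualDeviceConfiguration(
                memory_limit=memory_limit_mb)])
    except RuntimeError as e:
        # Virtual devices must be set when starting the program.
        print(e)
        return False
    return True
```

A fixed limit is useful when several processes share one GPU, since each process then stays within its own predictable share of memory.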

3. Create Ncloud TensorFlow Server VM

Preliminary work 1. Create VPC

Create a VPC in Console > VPC. At least one VPC is required. You can create multiple Ncloud TensorFlow Servers in one VPC.

Preliminary work 2. Create subnet

You can restrict access to only inside a specific VPC environment by creating a private subnet in a VPC.

Create a subnet by setting the property value of Internet gateway exclusiveness to N (Private).

Step 1. Connect to console

Connect to Console, and then click the Server menu.

Click the [Create server] button to create a server.

Step 2. Configure the Ncloud TensorFlow Server VM

Select a VPC, subnet, storage type, server type, and pricing plan, and then enter a server name.

VPC: Select the VPC created earlier. (E.g., vpc-tensorflow)
Subnet: Select Private Subnet. (E.g., vpc-sub-pri-tensorflow)
Storage type: Select a storage type to use as the boot storage.
- Select SSD for services that require high-performance I/O and HDD for general services. Note that SSD can be used as additional storage only if the boot storage is also SSD.
Server type: Select the server type you want.
- Some server specifications may not be supported, depending on the boot disk type.
- Select one server type from among Standard, High Memory, or GPU, depending on your purpose.
Select pricing plan: You can select either the monthly or hourly pricing plan.
Server name: Enter a server name.
- This name is used to identify servers, so it must be unique.
Network Interface: Configure the network interface. You can create up to 3 per server.
Public IP: A public IP can't be assigned in a private subnet. Select Not Set.
- If you want to create a public IP when creating a server, then the subnet type must be public subnet and the number of servers must be 1.
Next: Click the [Next] button after completing the server configuration.

Step 3. Configure the authentication key

If you have an existing authentication key, select "Use an existing authentication key." If you don't have one, create a new authentication key according to the following procedure.

① Select Create a new authentication key.
② Enter an authentication key name.
③ Click the [Create and save authentication key] button to save the authentication key file to your local PC.
- A new authentication key is issued.
- After saving it, please keep the authentication key in a safe place on your PC.
- The authentication key is used to obtain an initial admin password.
④ Click the [Next] button.

Step 4. Set the firewall

You can set a firewall by using an existing ACG.

You must select an ACG. Choose from the existing ACGs, which include the "VPC default ACG" that is provided by default when a VPC is created.
- You can select up to 5 ACGs of your own.
- You must select an ACG to activate the [Next] button so that you can proceed with the setting.
Select an ACG you want to configure, and then click the [Next] button.

Step 5. Final confirmation

Review the settings one last time before creating the server.

Final confirmation: Make sure that the server image, server, authentication key, and ACG are set properly.
[Create server]: After final confirmation, click the button to create a server.
- This may take several minutes or longer to be completed.

Step 6. Check in the server list

Check the created server in the list.

Wait until the server is created, the package is installed, and the server status becomes Running.

4. Configure Jupyter connection environment after creating Ncloud TensorFlow Server

This section explains how to connect to the Ncloud TensorFlow Server VM created in the private subnet. Since the private subnet can't be accessed from outside, you need to create a public subnet.

You can create a Windows VM in the public subnet and access the VM in the private subnet.

Create a public subnet

Create a subnet by setting the property value of Internet gateway exclusiveness to Y (Public).

Create Windows VM

After remotely accessing the Windows VM created in the public subnet, connect to the Ncloud TensorFlow Server VM created in the private subnet.

Windows VM must be created after selecting Public Subnet.

Select Windows VM

Select the Windows VM specification

Select a VPC, subnet, storage type, server type, and pricing plan, and then enter a server name.

VPC: Select the VPC created earlier. (E.g., vpc-tensorflow)
Subnet: Select Public Subnet. (E.g., vpc-sub-pub-tensorflow)
Storage type: Select a storage type to use as the boot storage.
- Select SSD for services that require high-performance I/O and HDD for general services. Note that SSD can be used as additional storage only if the boot storage is also SSD.
Server type: Select the server type you want.
- Some server specifications may not be supported, depending on the boot disk type.
- Select one server type from among Standard, High Memory, or GPU, depending on your purpose.
Select pricing plan: You can select either the monthly or hourly pricing plan.
Server name: Enter a server name.
- This name is used to identify servers, so it must be unique.
Network Interface: Configure the network interface. You can create up to 3 per server.
Public IP: If you want to automatically allocate a public IP while creating a VM, select the [Assign a new public IP] button.
- If you want to create a public IP when creating a server, then the subnet type must be public subnet and the number of servers must be 1.
Next: Click the [Next] button after completing the server configuration.

Configure the authentication key of Windows VM

Configure the ACG of Windows VM

Final confirmation of Windows VM creation

Request the use of public IP address

Create a public IP address to access the Windows VM. (You will be charged additional costs for using public IP addresses.)

Go to VPC > Server > Public IP and create a public IP.

① From Select target server, select a Windows server to assign the public IP address.

ACG settings

Add an SSH "TCP/22" rule for terminal connections and a Jupyter Notebook (TCP/18888) rule to ACG, as well as TensorBoard (TCP/18889) if necessary.

If you have already added the rules when creating a server, then you do not need to set ACG in this step.

① Select ACG from the left menu.
② Select the ACG that you chose when creating the server, and then click [Set ACG] at the top.
③ In the ACG rule settings, enter a protocol, access source, and allowed port to create a rule.
- Protocol: Select from TCP, UDP, and ICMP.
- Access source: Enter an IP address or ACG name.
- Allowed port: Specify a single port or a range of ports.
- For a terminal connection, the rule for SSH "TCP/22" must be set in ACG of a server to be connected. The "TCP/18888" rule must be additionally set to use Jupyter Notebook, and "TCP/18889" rule must be set to use TensorBoard.
- To add the rules, list a protocol, access source, and allowed port, and then click [Add].
④ When you're finished with the settings, click [Apply].
⑤ You can see the detailed information in the Details tab and the rules you have set up in the View rules tab.
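
Once the rules are applied, one way to confirm that they take effect is to test the three ports from the Windows VM. The sketch below assumes Python is available on the Windows VM; the names REQUIRED_PORTS and check_ports are hypothetical helpers written for this illustration.

```python
import socket

# Ports named in the ACG rules above.
REQUIRED_PORTS = {22: "SSH", 18888: "Jupyter Notebook", 18889: "TensorBoard"}


def check_ports(host, ports=REQUIRED_PORTS, timeout=3.0):
    """Try a TCP connection to each port and return {port: reachable}."""
    results = {}
    for port in ports:
        try:
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            # Refused, timed out, or unreachable.
            results[port] = False
    return results


# Usage with the private IP of the Ncloud TensorFlow Server VM, for example:
#   for port, ok in check_ports("10.0.0.70").items():
#       print(REQUIRED_PORTS[port], port, "open" if ok else "blocked or not running")
```

A False result does not necessarily mean the ACG rule is missing. The corresponding process, such as TensorBoard, may simply not be running on the server.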

Remote access to Windows VM

Connect to the Windows VM using a remote access program (e.g., Remote Desktop Connection or Microsoft Remote Desktop). Enter the public IP and admin password of the Windows VM.

Please refer to the Windows Server Connection Guide.

Enter the Windows VM access account

① Enter the admin name of Windows VM.
② Enter the admin password of Windows VM.

Enter the public IP of Windows VM

① Enter the public IP of Windows VM.
② Enter the admin name and password of the Windows VM that was previously set.

Check the remote access of Windows VM

Confirm the operation of Jupyter Notebook

Connect to Windows VM remotely and connect to Jupyter Notebook. Open the Chrome browser, enter the private IP of the Ncloud TensorFlow Server VM, and then connect to Jupyter Notebook.

Connect to Jupyter Notebook from Windows VM

Internet Explorer's security settings make Jupyter Notebook difficult to use, so we recommend using the Chrome browser.
The address to access Jupyter Notebook is http://{private-IP}:18888.
- For example, when the created private IP is 10.0.0.70, the access URL is http://10.0.0.70:18888.
The initial password for Jupyter Notebook is the server name of the VM. For security, we recommend changing the password before you start using Jupyter Notebook.
- For instructions on how to change your Jupyter Notebook password, see Change Jupyter password.
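
The address format above can be sketched as a small helper that builds the URL from the private IP and rejects addresses that are not private IPv4 addresses. The function name service_url and the constant names are hypothetical; the port numbers are the ones given in this guide.

```python
import ipaddress

JUPYTER_PORT = 18888
TENSORBOARD_PORT = 18889


def service_url(private_ip, port=JUPYTER_PORT):
    """Build the access URL for a service on the Ncloud TensorFlow Server VM."""
    # Raises ValueError if the string is not a valid IPv4 address.
    addr = ipaddress.IPv4Address(private_ip)
    if not addr.is_private:
        raise ValueError(f"{private_ip} is not a private IP address")
    return f"http://{addr}:{port}"


print(service_url("10.0.0.70"))                    # Jupyter Notebook
print(service_url("10.0.0.70", TENSORBOARD_PORT))  # TensorBoard
```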

Internet Explorer security settings of Windows VM

You need to enable JavaScript in the security settings to use Jupyter Notebook smoothly through Internet Explorer on a Windows operating system.

Internet Explorer > Tools > Internet Options > Security Tab > Custom > Allow JavaScript
Alternatively, install the Chrome browser on the Windows VM and use Jupyter Notebook from Chrome.

5. Manage Jupyter Notebook

You need to connect to the Ncloud TensorFlow Server VM via a terminal program (such as PuTTY) and change the Jupyter password.

You can easily change your password and manage the Jupyter process through the provided administration script.

Get Admin Password

You need to know the admin password to connect to the Ncloud TensorFlow Server VM via a terminal program (such as PuTTY).

① Select Server from the left menu.
② Select the server, and then select Get admin password from Manage servers.
③ Click [Drag files here or click here] to attach the authentication key file (.pem) that you saved on your PC when creating the server.
④ After attaching it, click the [Confirm password] button to get a password to use when first logging in to the server via a terminal program.
- It is recommended to change the password to one that the admin can remember after connecting to the server.

How to connect to VM via a terminal

You can access the Ncloud TensorFlow Server VM with the admin password. After remotely accessing the Windows VM, use a terminal program to connect to the TensorFlow VM.

① After running the terminal program, enter the private IP of the Ncloud TensorFlow Server VM.
② Enter the connection information of Ncloud TensorFlow Server VM.
- login as: root
- Password: Enter the admin password confirmed in the previous stage.
③ It is recommended to change the password of the root account to one you can easily remember after logging in.
- Run the "passwd root" command and enter a new password.

Change Jupyter password

The initial password used for Jupyter Notebook is set to the server name, so please change the password through the script provided to prevent unauthorized access.

① In the terminal, enter "jup passwd" to change the password.
- Confirm the initial password: Same as the "server name" from Console > Server > Server List
- A password must contain English letters, numbers, and special characters, and be at least 8 characters.
- If you execute the script to change a password, then the Jupyter processes will be automatically restarted.
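
The password rule above can be checked before running the script with a sketch like the following. The function name is hypothetical, and treating "special characters" as ASCII punctuation is an assumption; the exact set accepted by the "jup passwd" script is not stated in this guide.

```python
import string


def is_valid_jupyter_password(password):
    """Check the stated rule: letters, numbers, special characters, 8+ characters."""
    has_letter = any(c in string.ascii_letters for c in password)
    has_digit = any(c in string.digits for c in password)
    # Assumption: "special characters" means ASCII punctuation.
    has_special = any(c in string.punctuation for c in password)
    return len(password) >= 8 and has_letter and has_digit and has_special


print(is_valid_jupyter_password("Tensor#2024"))  # letters, digits, special, 11 chars
print(is_valid_jupyter_password("abcdefgh1"))    # no special character
```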

Manage Jupyter process

If you have changed configuration files and are required to restart the Jupyter process, you can stop or restart the process via the provided script.

① Stop Jupyter process: $ jup stop
② Start Jupyter process: $ jup start
③ Stop and start Jupyter process: $ jup restart

Manage TensorBoard process

If you start the TensorBoard process for the first time after creating a server or are required to restart the process, then you can stop or restart the process via the provided script.

① Stop TensorBoard process: $ jup tb-stop
② Start TensorBoard process: $ jup tb-start
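
If you want to call the administration script from Python, for example from a maintenance script on the server, a thin wrapper such as the one below may be enough. This is only a sketch: the helper names are hypothetical, it assumes the jup script is on the PATH of the user running it, and it covers only the non-interactive actions listed in this guide.

```python
import subprocess

# Non-interactive actions of the jup script described in this guide.
JUP_ACTIONS = ("start", "stop", "restart", "tb-start", "tb-stop")


def jup_command(action):
    """Return the argument list for a jup action, rejecting unknown actions."""
    if action not in JUP_ACTIONS:
        raise ValueError(f"unsupported jup action: {action!r}")
    return ["jup", action]


def run_jup(action):
    """Run the jup script and raise CalledProcessError if it exits with an error."""
    return subprocess.run(jup_command(action), check=True)


# Usage on the Ncloud TensorFlow Server VM:
#   run_jup("restart")   # stop and start the Jupyter process
#   run_jup("tb-start")  # start the TensorBoard process
```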
