[ Teaching ]  Leadtek GDMS Function (1) Overview
  Comments:

Leadtek GDMS Function (1) Overview

  By : Leadtek AI Expert     19

Leadtek GPU Docker Management System (GDMS) is a Docker-based GPU resource allocation and management software. GDMS uses an intuitive and graphical user interface to centrally manage AI and big data projects and development resources for universities/schools, research institutions, and corporations.


Combined with Leadtek WinFast GPU workstations and servers, GDMS allows you to maximize resource usage, manage your tasks more efficiently, and reduce the total cost of ownership. All kinds of schools, institutions, and corporations will benefit from GDMS in their development environment deployment, no matter it’s an AI development project, an AI training course, or a GPU accelerated application project.


There are 7 functions in GDMS: 

(1) Overview

(2) Server

(3) Container

(4) Image

(5) Task

(6) Log

(7) Settings


In this article, we will talk about function (1) Overview. 


(1) Overview

The Overview includes System Status and Monitor functionalities: 

System Status: View GPU Server status, Container and Project information. 

Monitor: View real-time (per second) hardware usage status of each GPU Server. 

 


1.  System Status 

System Status Displays: 

  •  GPU Server: List of managed servers including IP address, name, status (Online/Offline), and last online time.
  • Container: Overview of containers, including name, belonging GPU Server IP, status (Running/Exited), and creation time.
  • GPU Server Real-time Performance Monitoring: Real-time performance data for each GPU Server including CPU, memory, GPU core, and GPU RAM usage.
  • Export Record: Records all managed GPU Server system status every 30 minutes (average value). Data can be exported in CSV format for user-defined start to end date and time, including: 

and the data exported including: 

  1.  Date/Time, Server Name, IP, CPU Usage, Total System Memory (GB), Usage (GB), Usage Rate
  2. GPU CUDA Core Usage Rate, Total Docker Space (GB), Usage (GB), Usage Rate
  3. Total GPU Memory (GB), Usage (GB), Usage Rate, Configured GPU Memory (GB), Memory Percentage 
  4. GPU Memory in Exclusive mode/Share mode (GB), Percentage, Total GPU Cards, Configured GPU Cards, Percentage
  5. Total System Containers, Unmanaged Containers, Running and Stopped Containers, Other Status Containers. 


The System Status page refreshes automatically every 120 seconds by default. You can adjust to manual refresh by clicking the [Auto Sync] button. 



2.  Monitor 

Newly added functionality in the Overview, providing real-time hardware usage information (updated every second), including CPU usage (%), system memory usage (GB), all GPU CUDA core usage (%), and all GPU memory usage (GB). Different GPU Servers can be viewed by switching between them. 


Comments as following