Please follow the instructions in https://software.intel.com/content/www/us/en/develop/tools/oneapi/base-toolkit.html to install Intel oneAPI Base Toolkit on the management node according to your own needs.
Please follow the instructions in https://software.intel.com/content/www/us/en/develop/tools/oneapi/hpc-toolkit.html to install Intel oneAPI HPC Toolkit on the management node according to your own needs.
Before install AI Analytics Toolkit,make sure intel oneAPI Base Toolkit installed. Please follow the instructions in https://software.intel.com/content/www/us/en/develop/tools/oneapi/ai-analytics-toolkit.html to install Intel oneAPI AI Analytics Toolkit on the management node according to your own needs.
Run the following commands to initialize modulefiles for LiCO on the management node:
x# where is the intel oneapi installed, /opt/intel/oneapi is the default pathONEAPI_PATH="/opt/intel/oneapi"
source $ONEAPI_PATH/setvars.shbash $ONEAPI_PATH/modulefiles-setup.shPlease create a shared directory based on the upper directory of the oneapi installation path (task /opt/intel as an example):
Step 1. Manage node sharing /opt/intel for Intel oneAPI:
xxxxxxxxxxecho "/opt/intel *(ro,no_subtree_check,fsid=12)">> /etc/exportsexportfs -aStep 2. Configure the shared directory for cluster nodes:
xxxxxxxxxx# IP address of management node in the cluster intranetMANAGER_NODE_IP="192.168.0.1"
nodeshell all "echo '${MANAGER_NODE_IP}:/opt/intel /opt/intel nfs nfsvers=4.0,nodev,noatime \0 0' >> /etc/fstab"Step 3. Mount shared directory
xxxxxxxxxxnodeshell all mkdir -p /opt/intelnodeshell all mount /opt/intelRun the following commands to add module path for LiCO on the management node and compute nodes:
xxxxxxxxxx# where is the intel oneapi installed, /opt/intel/oneapi is the default pathONEAPI_PATH="/opt/intel/oneapi"
sed -i s#/opt/ohpc/pub/modulefiles#/opt/ohpc/pub/modulefiles:$ONEAPI_PATH/modulefiles#g \/etc/profile.d/lmod.sh
sed -i s#/opt/ohpc/pub/modulefiles#/opt/ohpc/pub/modulefiles:$ONEAPI_PATH/modulefiles#g \/etc/profile.d/lmod.csh
source /etc/profile.d/lmod.shRun the following commands on the management node:
xxxxxxxxxx# where is the intel oneapi installed, /opt/intel/oneapi is the default pathONEAPI_PATH="/opt/intel/oneapi"
sed -i s#/opt/ohpc/pub/modulefiles#/opt/ohpc/pub/modulefiles:$ONEAPI_PATH/modulefiles#g \/etc/lico/lico.ini.d/template.ini
lico lmod_syncxxxxxxxxxxvi /etc/lico/lico.ini.d/oneapi.ini
INTEL_MODULE_PATH = "<oneAPI install dir>"ENABLE = truexxxxxxxxxxlico init_runtimexxxxxxxxxx# add two kernel parameters in /etc/sysctl.conf on all compute nodes
nodeshell compute "echo 'kernel.kptr_restrict=0' >> /etc/sysctl.conf"nodeshell compute "echo 'kernel.perf_event_paranoid=0' >> /etc/sysctl.conf"nodeshell compute sysctl -p /etc/sysctl.confxxxxxxxxxx
vim /opt/intel/oneapi/vtune/latest/backend/config.yml
# change the type from passphrase to reverse-proxy
#type:passphrase type: reverse-proxyheader: AuthorizationRun the following commands on the management node to check if the installation is successful:
xxxxxxxxxx# The output may be /opt/intel/oneapi/intelpython/latest/bin/mpirunwhich mpirun
# The output may be /opt/intel/oneapi/intelpython/latest/bin/mpitunewhich mpitune
# The output may be /opt/intel/oneapi/mpi/2021.1.1/bin/mpiiccwhich mpiiccRun the following commands on the management node to check whether the module is successfully configured:
xxxxxxxxxx# The output contains /opt/intel/oneapi/modulefiles informationmodule avaRun the following commands on the management node to check whether the intel python is installed.
xxxxxxxxxx[root@head oneapi]# source /opt/intel/oneapi/setvars.sh [root@head oneapi]# conda env list# conda environments:
base * /opt/intel/oneapi/intelpython/latest2021.2.0 /opt/intel/oneapi/intelpython/latest/envs/2021.2.0pytorch /opt/intel/oneapi/intelpython/latest/envs/pytorchpytorch-1.7.0 /opt/intel/oneapi/intelpython/latest/envs/pytorch-1.7.0tensorflow /opt/intel/oneapi/intelpython/latest/envs/tensorflowtensorflow-2.3.0 /opt/intel/oneapi/intelpython/latest/envs/tensorflow-2.3.0