Some nodes in Triolith are equipped with Xeon Phi co-processor SE10/7120 series cards. The "ntelĀ® Manycore Platform Software Stack (MPSS) is necessary to run the IntelĀ® Xeon Phi co-processor.
The basic steps for Xeon Phi installation are described below
cd installdir
wget http://registrationcenter.intel.com/irc_nas/7661/mpss-3.5.1-linux.tar
sudo yum -y install elfutils.x86_64
sudo yum -y install rpm-build.x86_64
sudo yum -y install "kernel-devel-uname-r == $(uname -r)"
tar -xf mpss-3.5.1-linux.tar
cd mpss-3.5.1
rpmbuild --define "_topdir $(pwd)/kmod_nsc" --rebuild src/mpss-modules*.src.rpm
cp kmod_nsc/RPMS/x86_64/mpss-modules-* modules/
cp modules/*`uname -r`*.rpm .
sudo yum -y install *.rpm
sudo modprobe mic
sudo micctrl --initdefaults --users=none
sudo micflash -update -device all
After mic flash update the node will require a reboot for the changes to take effect.
To start using Xeon Phi following steps need to be done:
sudo micctrl --useradd=user
The user's public ssh key will be added to the mic card. For password less login this key should be pass-phrase less. Other methods of password less login for Xeon Phi card may be explored later.
sudo service mpss start
Once the service has started the specified user can use Xeon Phi card both in offload mode as well as in native mode.
ssh 172.31.1.1
To stop using Xeon Phi following steps need to be done:
sudo service mpss stop
sudo micctrl --userdel=user
sudo micctrl --cleanconfig
The Xeon Phi card has no persistence storage attached to it. We can NFS mount the host node file systems on the Xeon phi card. From host node run:
chkconfig nfs on
service nfs start
exportfs -o rw 172.31.1.1:/scratch/local/exported_folder
micctrl --addnfs=$(hostname -i):/scratch/local/exported_folder --dir=/scratch/local/exported_folder
From host node run:
micctrl --remnfs=/scratch/local/exported_folder
exportfs -u 172.31.1.1:/scratch/local/exported_folder
service nfs stop
chkconfig nfs off
On Triolith compute node images are refreshed with default os image every time the node is rebooted. Hence the above installation procedure is done through a script. The script is invoked when the node is made online. The user account creation on Xeon Phi card is done when the user gets SLURM allocation for the Xeon Phi node. The user account is deleted from Xeon Phi card when the user SLURM allocation ends.