Call for testers: the next phase of Apocrita¶
For much of the year we have been working on a major project to upgrade Apocrita to a new operating system. As part of the project, we have deployed a new package building tool to help us recompile all of the research applications to work on the new system. We are now calling for Apocrita users to preview and test this new system, ahead of our full roll-out, to help bring about a smoother and quicker transition.
This is an opportunity to check that your applications work on the new system, and for us to address any issues before we fully roll it out.
Notice
This blog post will receive further updates during the testing phase.
What is the new system?¶
The existing Apocrita cluster runs the CentOS7 Operating System, which is based off Redhat Enterprise Linux (RHEL) 7. This has reached end-of-life and we are replacing it with Rocky9, based off RHEL9. Apart from security and compliance reasons, this will provide the benefit of much newer libraries and applications.
The existing end-of-life platform is no longer able to support newer software releases - various packages are no longer working, as our community of R users have discovered recently. Our new platform will be able to offer newer versions of research applications, so this will be of particular interest to anyone wishing to get latest versions of software.
Additionally, due to recent decommissioning of older nodes, we have been able to compile the new applications against a more modern CPU architecture, providing better compatibility and performance across the whole cluster.
We have currently provisioned a new login node, ondemand service and various
types of compute and GPU node available for test users. This includes our
brand new vhm
high memory servers which can accommodate jobs up to 3TB RAM.
How can I use the new system?¶
Currently we have set an access list to run jobs on the upgraded nodes. To get added to the list, please send a message in the #pilot-testing channel on our slack channel, or send us an email and we'll add you.
Although jobs can we submitted from the CentOS7 frontend nodes, you will not be
able to view the list of newly-built applications from a CentOS7 node. To
access the Rocky9 login node, SSH into login-test.hpc.qmul.ac.uk
using your
existing Apocrita username, password and SSH key. To check what applications
are available, you can run module avail
from this server.
We are providing a new ondemand server, ondemand-dev.hpc.qmul.ac.uk which brings new functionality, and newer versions of applications (for example a newer RStudio).
We have a range of compute nodes running Rocky9, which can be checked with
nodestatus -t rocky
from the login server.
To run on the new nodes, you must:
- request to be added to the Rocky9 testers list
- request
-l rocky
to ensure your job is allocated to a Rocky9 node - ensure your job script is loading a module provided under Rocky9
Who should try the new system?¶
Any user who is typically running jobs with applications loaded with module
load
, ondemand or conda/virtualenv users should give it a try. We have
provisioned a few Rocky9 serial, parallel and GPU nodes dedicated for
testing. Any issues encountered should be reported via the aforementioned slack
channel, or via support ticket.
Changes for conda users¶
Due to some recent anaconda licence changes, we now only provide the
miniforge
module for using conda environments. This works in the same way as
anaconda
and miniconda
but does not use certain problematic anaconda
repositories. Miniforge is the officially supported installation method for the
faster Mamba libsolver and comes with the popular conda-forge channel
pre-configured as its only channel, where most of the common conda packages are
found.
We strongly advise not to use any existing conda environments on Rocky9 that
you created previously on CentOS7. Instead, you should start with fresh
environments. If you want to "migrate" old ones,
export them as a YML file
on a CentOS7 node and then use that same YML file to
re-create the environment
again on a Rocky9 node.
Photo by Chris Liverani on Unsplash.