Simplification of parallel queues on Apocrita

Posted on Thu 27 June 2019 in news by Simon Butcher

We are simplifying the way that the multi-node parallel jobs are run on the cluster.

Currently, users wishing to run multi-node MPI jobs on the public queues must choose beforehand whether to run on the nxv parallel nodes or the sdv parallel nodes, and to configure the job accordingly for …


Continue reading

Cluster Update Summary

Posted on Wed 12 June 2019 in news by Simon Butcher

As part of our commitment to providing stable and manageable systems, here is a round-up of some recent updates we have been working on behind the scenes:

1) Upgrade of all HPC cluster nodes to CentOS 7.6

Over the last couple of weeks, you may have noticed a few …


Continue reading

Running Machine Learning workloads on Apocrita

Posted on Fri 22 March 2019 in tutorial by Simon Butcher

In this tutorial we'll be showing you how to run a TensorFlow job using the GPU nodes on the Apocrita HPC cluster. We will expand upon the essentials provided on the QMUL HPC docs site, and provide more explanation of the process. We'll start with installation, and run through some …


Continue reading

Sizing your Apocrita jobs for quicker results

Posted on Mon 28 January 2019 in tutorial by Simon Butcher

At any one time, a typical HPC cluster is usually full. This is not such a bad thing, since it means the substantial investment is working hard for the money, rather than sitting idle. A less ideal situation is having to wait too long to get your research results. However …


Continue reading

Cluster Hardware Upgrades and Additions

Posted on Tue 04 December 2018 in news by Simon Butcher

As part of our commitment to regular upgrades to the HPC service, and to keep up with ever-growing demand, we are pleased to announce the addition of new hardware to the Apocrita HPC Cluster for the benefit of all QMUL Researchers.

The additions form part of series of exciting infrastructure …


Continue reading

AI tools

Posted on Sat 15 September 2018 in news by Simon Butcher

For the increasing number of Researchers working with Deep Learning technologies, we also have the existing GPU nodes attached to Apocrita. Tensorflow is available, and we are working on adding more machine learning tools.

We also host a local copy of ImageNet, a database of 14 Million annotated images for …


Continue reading

new sdv nodes

Posted on Sat 15 September 2018 in news by Simon Butcher

A small number of new sdv nodes running Intel Skylake processors have been deployed. A larger batch of these nodes will be added to the cluster before the end of the year.


Continue reading

POWER servers

Posted on Sat 15 September 2018 in news by Simon Butcher

QMUL installed two IBM AC922 POWER 9 servers to support research into deep learning and artificial intelligence, the first of their kind in UK HE. These servers come with a suite of customised Machine learning tools such as TensorFlow and Caffe. We are currently running a pilot scheme with a …


Continue reading

Short Queue

Posted on Sat 15 September 2018 in news by Simon Butcher

Please note that frontend/login nodes are for preparing and submitting your job scripts and running computational tasks directly on the frontend nodes is forbidden, since it can impair the use of the node for the 100+ other users logged in.

In addition to the primary queue, there is a …


Continue reading

Tier2 HPC

Posted on Sat 15 September 2018 in news by Simon Butcher

QMUL have access to powerful Tier 2 (formerly known as Regional) HPC resources, predominantly for EPSRC researchers. If you run multi-node parallel code (using MPI, for example), you will likely benefit from using the Tier2 clusters.


Continue reading