Speeding Up Grep Searches

Sometimes you may find yourself needing to filter a large amount of output using the grep command. However, grep can sometimes struggle when you try to filter files with an incredibly large number of lines, as it loads each line into RAM line-by-line. This can...

Read More in tutorial, Oct 27, 2022

All Stories

Speeding Up Grep Searches

Sometimes you may find yourself needing to filter a large amount of output using the grep command. However, grep can sometimes struggle when you try to filter files with an incredibly large number of lines, as it loads each line into RAM line-by-line. This can mean you can quickly exhaust even large amounts of requested RAM. There are a few...

in tutorial, Oct 27, 2022

R Tutorial Part Two - R vs RStudio

Following up from part one of our R tutorial we’ll be taking a look at the differences between R - the command-line language which can be loaded as a module and used in your Apocrita batch jobs - and Rstudio - the graphical development environment, accessed via a web server and provided via the OnDemand service.

in tutorial, Aug 11, 2022

R Tutorial Part One - Top Tips

The ITSR support team often receive tickets from R users that cover similar ground. So we thought we would collate our most frequent responses into some “Top Tips”! The tips below apply equally to Rscript but this article only covers the interactive R program.

in tutorial, Jul 06, 2022

RSE team activities up to June 2022

The RSE team in ITS Research has had a busy few years since we started sharing our work in this blog. In this post we look at some highlights of recent activity and what we have to look forward to.

in rse, Jun 27, 2022

Migrating to a new research storage system

As the current 2 PetaByte Research Data Storage on Apocrita reaches end of life this summer, we have procured a new storage system, providing 5PB of capacity. This means faster, bigger and cheaper storage for you, the researcher! Read on to discover the benefits…but first, an important notice.

in news, Jun 27, 2022

Using job statistics to increase job performance and reduce queueing time

You may wonder why some jobs start immediately but some wait in the queue for hours or days, even if your job is quite simple. If you notice your job has been queueing for a while, you may want to consider adjusting the requested resources to reduce queueing time and reduce any potential resource wastage as the job runs. Below,...

in tutorial, Jun 15, 2022

Speeding up and Parallelising R packages (using Rcpp and C++)

Polytomous variables can be used to model data that has two or more possible outcomes. For example, a survey with multiple-choice questions is polytomous. The R package, poLCA, does statistical clustering of polytomous variables. For example, grouping together survey results that are similar to each other.

in rse, Jun 13, 2022

Assessing code quality with the NAG Fortran compiler

The NAG Fortran compiler, like other compilers, has diagnostic capabilities which can help us write correct and portable Fortran programs. In this post we’ll look at these, comparing with those of the GCC and Intel compilers, and see how the compiler can be a valuable tool when developing or maintaining Fortran code.

in rse, May 20, 2022

Using compiler options to help debug programs

Once we’ve written a program more advanced than our “Hello, world!” example, we’re going to make mistakes. In this post, we’ll look at how we can use the very compilers we’re using to compile our program to pick up on some of these mistakes.

in rse, May 11, 2022

Visualising HEALPix results with Jupyter Notebook

In this tutorial we’ll be showing you how to visualise HEALPix results using Jupyter Notebook in our OnDemand appliance on the Apocrita HPC cluster. We’ll start with installing the required Python packages before demonstrating how to run the Healpy tutorial

in tutorial, Mar 07, 2022