Git Product home page Git Product logo

sheffield_hpc's People

Contributors

annakrystalli avatar anthonybrookfield avatar beckywilks avatar cadair avatar carldkennedy avatar cs1ds avatar davidwilby avatar dependabot[bot] avatar desmfryan avatar dmrogers75 avatar drj11 avatar fredsonnenwald avatar gyengen avatar jezcope avatar jkwmoore avatar joe-heffer-shef avatar lizzydapsy avatar mathew-hall avatar me1bdh avatar mikecroucher avatar mikeg64 avatar mondus avatar nahceigna avatar nikishe avatar philmikejones avatar ptheywood avatar sinaa avatar susheel avatar twinkarma avatar willfurnass avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

sheffield_hpc's Issues

Making TotalView debugger available?

I wonder whether the University can provide the TotalView debugger. An efficient use of Iceberg is of course parallelization. However, support for developing parallel code is rather limited on Iceberg. It'd be extremely useful to make something like TotalView available to the cluster and to individual developers.

git URL is wrong

this page: http://rcg.group.shef.ac.uk/iceberg/software/compilers/git.html
references the install script https://github.com/rcgsheffield/iceberg_software/blob/master/software/install_scripts/compilers/git/install_git_5.2.sh but that install script doesn't exist there.

I found it in https://github.com/rcgsheffield/iceberg_software/blob/master/software/install_scripts/apps/git/install_git_2.5.sh (apps instead of compilers)

So we could fix the URL, sure, and I was about to submit a PR for that, but really the URL for the original page is wrong. Should be http://rcg.group.shef.ac.uk/iceberg/software/app/git.html and I'm not sure how to change that.

Fact check - scratch

In the Abaqus section, I suggested that /scratch was used as the scratch directory. Is this correct?

I'm wondering if /fastdata would be an option. Any reason to not use fastdata for Abaqus scratch? How reliable is it for example?

Freesurfer

Supportworks request from: Marco Mcsweeney

MATLAB install notes required

There are no installation notes for MATLAB available. I understand that a basic install is straightforward but integration with the SGE is rather more tricky. How was it done for the current installs?

How best to include examples?

I've added a HelloWorld example in .cpp

https://github.com/mikecroucher/iceberg_software/blob/master/software/compilers/examples/helloworld/hello.cpp

to allow us to refer to something explicit when writing how to use the various compilers.

How should we be referring to such files in the text? Options include

  • Link to the github url (but may lead to people struggling to download it)
  • Include the code explicitly in the text so users copy and paste
  • A separate iceberg-examples repo - we could encourage users to git-clone this to get all examples.

As I write this, I'm favouring the final option. Thoughts?

Abaqus forces use of latest Intel Compiler

While writing the new user documentation for Abaqus, I noted an anomaly in that even though I had selected Intel Compiler 12.1 in my job submission file, it compiled using Intel 14.0
Investigation determined that we are doing something non-standard in our use of the abaqus command. Namely ::

#!/bin/bash
## 
#Record the use of this script by Stuart Mumford.
# logger -p local1.debug "iceberg_software: USER=$USER TYPE=monitor SOFTWARE=/usr/local/bin/abaqus"
#DS straight link will not work 
module load compilers/intel

export XLIB_SKIP_ARGB_VISUALS=1
mesa=" "
if [ "$1" == "cae" ] ;then
    mesa="-mesa"
fi
if [ -z $ABAQUSCOMMAND ] ; then 
   echo " Please use 'module load apps/abaqus' command first "
   exit 1
fi

 $ABAQUSCOMMAND $* $mesa

My main issue with this is that the abaqus command will always load the latest version of the Intel compiler. The practical upshot of this is that if we add a new version of the Intel Compiler, we will be changing the abaqus environment for users without them necessarily knowing! We have needlessly coupled Intel Compiler to Abaqus! Such tight coupling is rarely a good idea

This could lead to issues with reproducibility.

In summary:

  • It is impossible for us to upgrade the Intel Compiler without running the risk of breaking this Abaqus command.
  • We could inadvertently ruin a user’s set of results. E.g. if we update the Intel Compiler during a parameter sweep, different versions of the compiler will be used for different runs. This a Bad Idea!
  • A user from a different site will expect to control which version of the Intel Compiler they use. Here, we always force the latest.
  • This wrapper to abaqus is non-standard and may lead to further problems. It forces the -mesa switch for example. This switch is often, but not always, useful.

I suggest a change where we move a more standard way of working. Failing that, we should fully document this non-standard modification and warn users of the implications.

For now, I have modified the documentation such that it explicitly references the abq133 command.

Parallel environment info

Some random facts that need including (taken from a recent support call)

It's possible to use the hybrid openmpi environments to get exclusive access to a node, (this PE is intended for mixed Openmpi-OpenMP jobs, but can also be used for normal MPI jobs which require specific placement)
e.g.

$ -pe openmpi-hybrid-16 [number of cores]

will work like the normal openmpi-ib parallel environment, but place your processes in groups of 16 per node, basically filling up each node. (note this will likely increase your job wait time). The 16-core hybrid environment is only available on Iceberg's newest nodes.

You can also use 12-core hybrid PE to run on the older nodes, but you'd also need to add the arch=intel-x5650 option to make sure that the job ran on a machine with 12 cores only.

$ -pe openmpi-hybrid-12 [number of cores]

$ -l arch=intel-x5650

Bowtie2 examples required

It would be nice to have some examples of using Bowtie2 in batch mode. The current documentation is a little sparse http://rcg.group.shef.ac.uk/iceberg/software/apps/bowtie2.html

Also, a benchmark that takes a while (no more than a day) which we could use to demonstrate multithreading mode and try out different compiler switches at build time.

Hopefully such examples can be provided that aren't too large?

Perhaps the bioinfomatics community can help us out?

Mathematica doc enhancements

We need:-

  • The modulefile
  • Instructions on batch usage. Last time I checked, you couldn't run a notebook in batch.

CASTEP 9.0 available

Hi,

Asking on behalf of the Materials Science team, if we can have CASTEP 9.0 (it's labelled as 16.1 now for some reason?) installed on Iceberg, and for it to run along side 8.0 until we are happy that it is a stable version.

Thanks,

Chris

Example OpenACC compile and run

It would be useful to add a small but complete example of how to compile and run an OpenACC CUDA job using the PGI Compilers.

Document Octave install

I have installed a basic version of Octave 4.0. Its usable and includes the GUI but is missing support for several libraries (all of which will need to be compiled separately). I need to document this.

Perl documentation

Perl 5.x - was requested. We have it installed as part of the system but there is no documentation.

Ensure automounting of /data/username is explained

We sometimes get calls like this:

Today I cannot see my Iceberg /data/username directory on Filezilla/WinSCP. I
can navigate to it via Exceed (so I know it is there) but it isn't showing
up on the FTP programs.

Need to

  • ensure that the automounting behaviour is documented (yes; see PR #449)
  • also mention this in the troubleshooting section (PR #553)

Fortran Compiler with NAG library

So I created a very basic fortran code that calls the NAG library just so I can figure how to compile it.

PROGRAM ExampleNAG1


 USE nag_library, ONLY : A00AAF

        CALL A00AAF

END PROGRAM

I followed the old doc and the new doc.

So after loading the intel compiler and using both,

ifort sourcefile.f90    $NAGLIB    -I$NAGINC

and

ifort your_code.f90 -lnag_mkl -o your_code.exe

it errored with

catastrophic error: Too many errors, exiting
compilation aborted

However the examples worked when ran (nag_example a00aaf and nagexample24 d03phf)

The compile line for the first example is,

ifort -I/usr/local/packages5/nag/fll6i24dcl/nag_interface_blocks a00aaf.f90 -lnag_mkl -o a00aaf

and if I do the same with my example code, it compiles.

So is the nag_interface_blocks needed to compile the code every time?

SGE accounting docs

Documentation concerning some of the SGE accounting mechanism would be useful. Here's a random collection of facts:

qacct -j [jobid]

...you may end up with multiple jobs matching because our JobIDs have wrapped around.

The accounting file is at /usr/local/sge/live/default/accounting.
Records look like this

interactive.q:testnode03.iceberg.shef.ac.uk:fe:fe1abc:INTERACTIVE:26833:sge:0:1433190433:1433190450:1433190450:0:1:0:0.003999:0.001999:1612.000000:0:0:0:0:710:0:0:16.000000:8:0:0:0:11:3:SHEFFIELD:defaultdepartment:NONE:1:0:0.005998:0.000000:0.000000:-U cstest -u fe1mpc -l h_vmem=16G -I y -P SHEFFIELD:0.000000:NONE:0.000000:0:0

See man accounting for the fields.

Apparently this info also ends up on our splunk server. What's that? Where is it? How does it end up there? How can it be used. So many questions.....

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.