Cluster Status

From ACEnet
Jump to: navigation, search

Please click on the name of the cluster below in the table to quickly get to the corresponding section of this page. The outage schedule section is a single place where data about all scheduled ACEnet outages are represented.

Cluster Status Planned Outage Notes
Brasdor Online No outages
Mahone Online No outages
Placentia Online No outages
Fundy Online No outages
Glooscap Online Upgrade Feb 14 Adding new hardware
Courtenay Online No outages
Legend:
Online cluster is up and running
Offline all users cannot login or submit jobs
Online some users can login and/or there are problems

Outage schedule

  • Glooscap is back into production with 48 nodes (192 cores) online. New nodes will be brought online in the following weeks as provisioning and testing are completed. The /globalscratch filesystem has been merged into the /home filesystem, so most users on Glooscap will now have a single quota rather than two, and /home/$USER/scratch is no longer a symbolic link to /globalscratch/$USER. If you have scripts which make explicit reference to /globalscratch/<your-user-id> as part of a path, please change them to refer to ~/scratch instead.
15:00, February 20, 2012 (AST)

Brasdor

  • The cluster is back in service.
09:49, February 6, 2012 (AST)
  • We have experienced a power outage at StFX. Our staff are working on restoring the services.
17:13, February 5, 2012 (AST)

Mahone

  • The head node is back online now. Users will need to update their cron jobs.
15:10, August 26, 2011 (AST)
  • The hard drive has failed on the head node. It has been replaced now and we are in the process of installing the OS. The rest of the cluster is not affected, jobs are not affected.
09:47, August 26, 2011 (AST)

Placentia

  • Placentia is back operational.
10:56, February 14, 2012 (AST)
  • We had to reboot the meta-data server to restore the service.
10:14, February 14, 2012 (AST)

Fundy

  • The tape library has been fixed.
19:08, February 22, 2012 (AST)
  • There is a problem with the tape library. Users may have problems accessing large files that have been released to tape due to a long period of inactivity.
13:21, February 17, 2012 (AST)

Glooscap

  • Hardware Delivery Delay has caused the estimated online date to be pushed back to Feb 6th for the new storage at glooscap.
08:04, January 23, 2012 (AST)
  • Glooscap Dec 21st Outage complete, nodes back online
16:09, December 21st, 2011 (AST)

Courtenay

  • Back online.
15:01, December 19, 2011 (AST)
  • There is a network problem at UNBSJ. Working on this.
14:56, December 19, 2011 (AST)
Resources
User Support
News and Events
Organization
About Us