Cluster Status
From ACEnet
Please click on the name of the cluster below in the table to quickly get to the corresponding section of this page. The outage schedule section is a single place where data about all scheduled ACEnet outages are represented.
| Cluster | Status | Planned Outage | Notes |
|---|---|---|---|
| Brasdor | Online | No outages | |
| Mahone | Online | No outages | |
| Placentia | Online | No outages | |
| Fundy | Online | No outages | |
| Glooscap | Online | Upgrade Feb 14 | Adding new hardware |
| Courtenay | Online | No outages |
- Legend:
| Online | cluster is up and running |
| Offline | all users cannot login or submit jobs |
| Online | some users can login and/or there are problems |
Outage schedule
- Glooscap is back into production with 48 nodes (192 cores) online. New nodes will be brought online in the following weeks as provisioning and testing are completed. The /globalscratch filesystem has been merged into the /home filesystem, so most users on Glooscap will now have a single quota rather than two, and /home/$USER/scratch is no longer a symbolic link to /globalscratch/$USER. If you have scripts which make explicit reference to /globalscratch/<your-user-id> as part of a path, please change them to refer to ~/scratch instead.
- 15:00, February 20, 2012 (AST)
Brasdor
- The cluster is back in service.
- 09:49, February 6, 2012 (AST)
- We have experienced a power outage at StFX. Our staff are working on restoring the services.
- 17:13, February 5, 2012 (AST)
Mahone
- The head node is back online now. Users will need to update their cron jobs.
- 15:10, August 26, 2011 (AST)
- The hard drive has failed on the head node. It has been replaced now and we are in the process of installing the OS. The rest of the cluster is not affected, jobs are not affected.
- 09:47, August 26, 2011 (AST)
Placentia
- Placentia is back operational.
- 10:56, February 14, 2012 (AST)
- We had to reboot the meta-data server to restore the service.
- 10:14, February 14, 2012 (AST)
Fundy
- The tape library has been fixed.
- 19:08, February 22, 2012 (AST)
- There is a problem with the tape library. Users may have problems accessing large files that have been released to tape due to a long period of inactivity.
- 13:21, February 17, 2012 (AST)
Glooscap
- Hardware Delivery Delay has caused the estimated online date to be pushed back to Feb 6th for the new storage at glooscap.
- 08:04, January 23, 2012 (AST)
- Glooscap Dec 21st Outage complete, nodes back online
- 16:09, December 21st, 2011 (AST)
Courtenay
- Back online.
- 15:01, December 19, 2011 (AST)
- There is a network problem at UNBSJ. Working on this.
- 14:56, December 19, 2011 (AST)