Operating Status

Percent Utilization over the past 24 hours (Red Cloud)

Status and Downtime Notices

• Downtime for regular Web server maintenance will occur the Wednesday following the second Tuesday of every month.
• Only one e-mail will be sent per downtime, to all project members.
• Information concerning completion of the work or extension of the downtime will be posted on this page.

WHY: Jetstream2 Cornell Region will be upgraded to run Openstack 2025.1 "Epoxy" release

WHO WILL BE AFFECTED? Jetstream2 Cornell Region users. Affected users will be notified via a separate email.

WHAT WILL BE AFFECTED?

  • Users are encouraged to shelve their instances prior to 9 am on January 6 if possible. Shelving instances will simplify and speed up the software upgrade process.
  • Openstack API will be unavailable to manage cloud resources during the upgrade. Calls from Horizon web interface or Openstack CLI client will fail.
  • We will try our best to preserve running instances during the upgrade. Short network disruptions should be expected.
  • Shelved instances and volumes will be preserved.

NOTE:

  • Jetstream2 Cornell Region is a separate cloud system from Red Cloud. Red Cloud will NOT be affected by this upgrade.

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

OUTAGE REASON(S): To apply December 2025 security patches (Windows and Linux)

WHO WILL BE AFFECTED? Users of CAC managed resources housed in Rhodes Hall

WHAT WILL BE UNAVAILABLE WHILE THEY ARE PATCHED AND REBOOTED?

  • Servers: Linuxlogin, Mysql2, RT, Winlogin
  • https://www.cac.cornell.edu/
  • https://portal.cac.cornell.edu/

WHAT WILL NOT BE AFFECTED THIS MONTH?

  • Linux Clusters: Aclab, AIDA, Altas, CapeCrystal, Cayuga, CommCloud, Cqmetrics, Hopper, Marvin, Marx1, Mbot, Pool, Tardis3, Yuegroup
  • Red Cloud
  • Archival Storage

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

NEXT SCHEDULED MAINTENANCE: Wednesday, January 14th, 2026

WHY: There was a hardware hiccup with the storage system last night. The issue has been resolved and the cluster is back online.

WHO WILL BE AFFECTED? All Hopper cluster users

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

WHY: The external drive enclosure attached to the head node appears to be unresponsive. Rebooting did not resolve the problem. CAC systems staff will investigate tomorrow morning. User logins have been disabled.

WHO WILL BE AFFECTED? All hopper cluster users

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

WHY: CIT Security needs to upgrade firewall firmware. Network access will be disrupted for about 10 minutes during the maintenance window.

WHO WILL BE AFFECTED? Users of Red Cloud, Jetstream2 Cornell Region, and Wulab Ceph Storage. Other CAC systems will not be affected.

Red Cloud

  • Red Cloud instances will lose connectivity to storage for about 10 minutes during the maintenance window.
  • Users should shelve their instances prior to the maintenance window if possible, or limit file I/O in their running instances.
  • After the maintenance window, users should check their running instances and reboot if any file system becomes read-only.

Jetstream2 Cornell Region

  • Access to instances running in Jetstream2 Cornell region will be disrupted for 10 minutes. Openstack API will not be responsive.
  • Running instances will be not disrupted by the network outage.

Wulab Ceph Storage

  • Access to Wulab Ceph Storage will be disrupted for 10 minutes during the maintenance window.

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

OUTAGE REASON(S): To apply November 2025 security patches (Windows and Linux)

WHO WILL BE AFFECTED? Users of CAC managed resources housed in Rhodes Hall

WHAT WILL BE UNAVAILABLE WHILE THEY ARE PATCHED AND REBOOTED?

  • Servers: Linuxlogin, Mysql2, RT, Winlogin
  • CapeCrystal cluster will be upgraded
  • https://www.cac.cornell.edu/
  • https://portal.cac.cornell.edu/

WHAT WILL NOT BE AFFECTED THIS MONTH?

  • Linux Clusters: Aclab, AIDA, Altas, Cayuga, CommCloud, Cqmetrics, Hopper, Marvin, Marx1, Mbot, Pool, Tardis3, Yuegroup
  • Red Cloud
  • Archival Storage

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

NEXT SCHEDULED MAINTENANCE: Wednesday, December 10th, 2025

WHY: Resolve a ZFS crashing problem on AIDA cluster head node and Upgrade AIDA cluster to the latest Rocky Linux 9.6 / OpenHPC 3

WHO WILL BE AFFECTED? All users of AIDA Cluster

TIMELINE:

  • Users should save all work and log off AIDA cluster prior to 9 am on October 22.
  • Starting at 9 am on October 22, user logins will be disabled and all running jobs will be cancelled.
  • The entire cluster will be upgraded to run the latest versions of Rocky Linux 9.6 / OpenHPC 3.
  • The upgrade should be completed before the end of the day. Users will get an email notification when the cluster is back online.

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

OUTAGE REASON(S): To apply October 2025 security patches (Windows and Linux)

WHO WILL BE AFFECTED? Users of CAC managed resources housed in Rhodes Hall

WHAT WILL BE UNAVAILABLE WHILE THEY ARE PATCHED AND REBOOTED?

  • Servers: Linuxlogin, Mysql2, RT, Winlogin
  • Commcloud: will be unavailable for regular security patching and backup
  • Storage: Home directories of the rad332_0001 (vega partition) and na346_0001(astra partition) projects in the pool cluster will not be available during the downtime. Users in these 2 projects are encouraged to cancel their jobs, save all the changes, and log off the pool cluster prior to the start of downtime. This downtime is required to connect the file server to the pool2 cluster.
  • https://www.cac.cornell.edu/
  • https://portal.cac.cornell.edu/

WHAT WILL NOT BE AFFECTED THIS MONTH?

  • Linux Clusters: Aclab, AIDA, Altas, CapeCrystal, Cayuga, Cqmetrics, Hopper, Marvin, Marx1, Mbot, Pool, Tardis3, Yuegroup
  • Red Cloud
  • Archival Storage

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

NEXT SCHEDULED MAINTENANCE: Wednesday, November 12th, 2025

OUTAGE REASON(S): To apply September 2025 security patches (Windows and Linux)

WHO WILL BE AFFECTED? Users of CAC managed resources housed in Rhodes Hall

WHAT WILL BE UNAVAILABLE WHILE THEY ARE PATCHED AND REBOOTED?

  • Servers: Linuxlogin, Mysql2, RT, Winlogin
  • Storage: storage03 will be migrated from old hardware to a Red Cloud 2 instance
  • During this monthly maintenance changes to accounts and projects will not be available
  • https://www.cac.cornell.edu/
  • https://portal.cac.cornell.edu/

WHAT WILL NOT BE AFFECTED THIS MONTH?

  • Linux Clusters: Aclab, AIDA, Altas, CapeCrystal, Cayuga, CommCloud, Cqmetrics, Hopper, Marvin, Marx1, Mbot, Pool, Tardis3, Yuegroup
  • Red Cloud
  • Archival Storage

CORNELL SCIENTIFIC COMPUTING TRAINING:

  • https://its.weill.cornell.edu/scientific-computing-training-series

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

NEXT SCHEDULED MAINTENANCE: Wednesday, October 15th, 2025

OUTAGE REASON(S): To apply August 2025 security patches (Windows and Linux)

WHO WILL BE AFFECTED? Users of CAC managed resources housed in Rhodes Hall

WHAT WILL BE UNAVAILABLE WHILE THEY ARE PATCHED AND REBOOTED?

  • Servers: Linuxlogin, Mysql2, RT, Winlogin
  • https://www.cac.cornell.edu/
  • https://portal.cac.cornell.edu/

WHAT WILL NOT BE AFFECTED THIS MONTH?

  • Linux Clusters: Aclab, AIDA, Altas, CapeCrystal, Cayuga, CommCloud, Cqmetrics, Hopper, Marvin, Marx1, Mbot, Pool, Tardis3, Yuegroup
  • Red Cloud
  • Archival Storage

STATUS: Downtime status will be posted on this page.

QUESTIONS: https://portal.cac.cornell.edu/help.

NEXT SCHEDULED MAINTENANCE: Wednesday, September 10th, 2025