Seneca Cluster

General Information

Seneca cluster is an HPC cluster featuring CPUs, Nvidia H100 and A100 GPUs, a BeeGFS parallel file system and 400 Gb NDR infiniband interconnect. The cluster runs on Rocky Linux 9 and uses slurm 25.05.3 for job scheduling.

Hardware and Networking

Servers

Name	CPU Threads	RAM	GPUs
Login Node
seneca-login1	64	1 TB	N/A
Compute Nodes
c000[1-4]	128	1 TB	4 x Nvidia H100 80 GB SXM
c0005	128	512 GB	2 x Nvidia H100 80 GB PCIe
c0006	64	128 GB	2 x Nvidia A100 40 GB PCIe
c01[01-08]	256	1 TB	N/A
c01[09-12]	256	512 GB	N/A

GPUs

To access the GPUs, submit a Slurm job to the cac_gpu partition. You must include the --gres option for the desired GPU type, as explained in the Submit a Job section.

GPU	Streaming Multiprocessors	Memory	CPU-to-GPU Interconnect	NVLink between GPUs
Nvidia H100 80 GB SXM	132	3.35 TB/s (HBM3)	128 GB/s PCIe Gen5	900 GB/s, for 4 on same node
Nvidia H100 80 GB PCIe	114	2.0 TB/s (HBM2e)	128 GB/s PCIe Gen5	none
Nvidia A100 40 GB PCIe	108	1.55 TB/s (HBM2)	64 GB/s PCIe Gen4	none

Note: As compared to the H100 PCIe, the H100 SXM offers slightly more compute power, significantly faster memory, and very fast NVLink connections between GPUs on the same node.

Storage

The following storage is available. Note: storage on Seneca cluster are not backed up.

User Home Directory: Each user has a home directory with a 100 GB default quota. It can be access by the path ~.
BeeGFS Parallel File System: Each project has a directory on the BeeGFS parallel file system: /mnt/beegfs/<institution>/<project name>. The directory is readable and writable to all users belonging to the project. <institution> in the path is one of:
- ithaca: for projects from Ithaca campus
- qatar: for projects from Weill Cornell Medicine Qatar campus
- weill: for projects from Weill Cornell Medicine NYC campus
Local Scratch: Local scratch is mounted on /tmp on all nodes.

In addition to scp/sftp, the storage on seneca cluster is available via the Seneca Cluster Globus collection. See this page for more information on Globus transfer.

Networking

The cluster nodes are connected by the following networks:

Public network: The cluster head node and login node are connected to the public Internet via 25 Gb ethernet.
Private network: All cluster nodes are connected to a private 128.84.16.0/24 network via 25 Gb ethernet.
IPoIB network: All cluster nodes are connected to a private 128.84.17.0/24 network via IPoIB network for BeeGFS file system.

In addition, login and GPU nodes are connected by a NDR Infiniband interconnect (400 Gbit NDR).

Request Access

To get access to the Seneca cluster:

Cornell faculty and staff can create a CAC project with Seneca cluster access.
A current CAC project PI or proxy can add Seneca cluster to their project.
Other users can request to join a project with Seneca cluster access.

Create a CAC Project with Seneca Cluster Access

See Create a New CAC Project for instructions on creating a new CAC project with Seneca cluster access on the CAC Portal.

Billing and cost information

See CAC Rates for detailed billing and cost information.

Join a Seneca project

See How to Join an Existing Project for steps to request access to an existing Seneca project.

Access the Cluster

SSH

You must use an ssh key to log into the seneca cluster. Password logins are disabled. See the Initial Login via SSH Key section on how to generate an ssh keypair on CAC Portal to log into the cluster for the first time.

After being added to a project with access to Seneca cluster, you can:

Log into the CAC Portal by clicking on the Login button in the upper right corner.
Choose Cornell University or Weill Cornell Medical College as your organizational login and complete the login using your Cornell NetID or Weill CWID and password (not your CAC account password).
On the Portal dashboard, click on the Generate SSH key pair link in the Manage your CAC login credentials section in the upper right corner.
Click on the Generate a new SSH key button to generate a new ssh key pair.
Click on the Download your SSH private key button to download the private key to your computer.
On your computer, make sure the private key file is readable and writable to you only: chmod 600 <private key file>
SSH to seneca cluster using the private key you just downloaded: ssh -i <path to the private key file> <NetID>@seneca-login1.cac.cornell.edu
On seneca-login1, you can add additional public ssh keys to your ~/.ssh/authorized_keys file, 1 key per line. The key generated by CAC Portal is marked by the CAC_SSO comment in the authorized_keys file. Keys that do not end in CAC_SSO will be left alone by CAC Portal.

OpenOnDemand

URL: https://seneca-ood.cac.cornell.edu

You can access seneca cluster using a web browser via the cluster's OpenOnDemand interface. Before using OpenOnDemand, you must first log into the login node via ssh seneca-login1.cac.cornell.edu at least once.

Scheduler

Users gain access to compute nodes using the slurm scheduler. The Slurm Quick Start Guide is a great place to start. For more detailed explanations, see CAC's Slurm page.

Partitions

Currently the seneca cluster has 3 partitions:

Partition	Nodes	Resources
qatar_gpu	c000[1-2]	Each node has 128 CPU threads, 1 TB RAM, and Nvidia GPUs
cac_gpu	c000[3-6]	Different number of CPU threads, RAM, and Nvidia GPUs, see Servers
cac_cpu	c0[101-112]	Each node has 256 CPU threads and 1 TB RAM

Submit a Job

Interactive Job: srun --pty <options> /bin/bash

After slurm allocates a node as specified by your options, you will be given a login prompt on the compute node to run jobs interactively.

Batch Job: sbatch <options> <job script>

Slurm will run the job script on the allocated node(s) as specified by your options.

The following options are relevant in seneca cluster. Required options are denoted by *.

-P, or --partition*: Partition. There are three partitions available currently: cac_gpu, cac_cpu, qatar_gpu.
-A, or --account*: Slurm account/CAC project against which this job will be billed.

If you do not specify --account, your Slurm DefaultAccount will be used automatically.
You can view your DefaultAccount with:

  sacctmgr show user $USER format=User,DefaultAccount

Qatar accounts are always allowed to submit to the qatar_gpu partition. All other accounts are allowed to submit to the cac_gpu and cac_cpu and must have a positive balance at job submission time or the request will be rejected.

--gres*: Requested GPU resource

For the cac_gpu or qatar_gpu partition, at least 1 GPU must be requested like this:

--gres=gpu:<type>:<count>

The following GPU types are available:

GPU	`--gres` Option
Nvidia H100 80 GB SXM	`--gres=gpu:h100:<number of GPUs>`
Nvidia H100 80 GB PCIe	`--gres=gpu:h100pcie:<number of GPUs>`
Nvidia A100 40 GB PCIe	`--gres=gpu:a100:<number of GPUs>`

For example, to request 2 NVIDIA H100 GPUs: --gres=gpu:h100:2

--time*: Time limit in the HH:MM:SS format

Maximum time limit for the cac_gpu, qatar_gpu and cac_cpu partition is 24 hours (1 day).

If you need to run more than 24 hours in the cac_gpu, qatar_gpu and cac_cpu partition, email your request to help@cac.cornell.edu.

--qos: longrun for long running (time limit >24 hours) jobs

If approved to run more than 24 hours, use the --qos=longrun option to request time limit longer than 24 hours. For example:

  # OK: 1 day (24h) or less
  --time=24:00:00

  # OK: >24h with longrun QOS if your project is approved to run longer than 24 hours
  --time=72:00:00 --qos=longrun

  # FAIL: >24h without longrun QOS
  --time=72:00:00

Here are some minimal examples:

An interactive job with 4 CPU threads/2 physical CPU cores and 1 Nvidia H100 GPU with time limit of 2 hours:

srun -p cac_gpu --account=abc123_0002 --gres=gpu:h100:1 --time=02:00:00 -c 4 /bin/bash

The following job script (gpu_job.sh) can be submitted using the sbatch gpu_job.sh command to run on 4 CPU threads/2 Physical CPU cores and 1 Nvidia H100 GPU with time limit of 2 hours. Loading the anaconda3 module brings Python 3.13 into the environment, along with a number of useful packages:

  #!/bin/bash
  #SBATCH --job-name=my_gpu_job
  #SBATCH --account=abc123_0002
  #SBATCH --partition=cac_gpu
  #SBATCH --gres=gpu:h100:1
  #SBATCH --time=02:00:00
  #SBATCH -c 4

  module load anaconda3/2025.06
  python my_gpu_script.py

Software

General access software is installed via the module system. Current software includes (but is not limited to):

R
alphafold3
apptainer
anaconda3 (python3 and assorted packages - matplotlib, numpy, scipy, torch, etc.)
cuda/12.9
nanoplot
rstudio
tensorrt

To view a list of installed modules:

module avail

To load a software module (anaconda3, for example):

module load anaconda3

Users are eligible to compile and install software within their home directories, but space will count against your home directory quota.
Please do not compile anaconda or conda in your home directory as it is very large. Python packages are available by loading the anaconda3 module. Installed python packages are show by doing:

module load anaconda3
pip list

If you require additional python packages, please send a request to help@cac.cornell.edu.

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search