Anvil CPU

Description

Purdue's Anvil cluster built in partnership with Dell and AMD consists of 1,000 nodes with two 64-core AMD EPYC "Milan" processors each and delivers over 1 billion CPU core hours each year, with a peak performance of 5.1 petaflops. Each of these nodes has 256GB of DDR4-3200 memory. A separate set of 32 large memory nodes has 1TB of DDR4-3200 memory each. Anvil's nodes are interconnected with 100 Gbps Mellanox HDR100 InfiniBand.

Resource ID
738
Global Resource ID
anvil.purdue.access-ci.org
Resource Type
Compute
Latest Status
production
Latest Status Begin
Latest Status End
Project Affiliation
ACCESS
Organization Name
Purdue University
RP Description

Anvil CPU resources provide general-purpose computing nodes for a wide range of research workloads. They are suitable for data processing, simulations, and other tasks that do not require GPUs.

Support Links
Top Software (cached)
[{"name":"python","job_count":50475,"description":"Python is a high-level, interpreted programming language known for its simplicity and readability. It supports multiple programming paradigms and has a vast ecosystem of libraries and frameworks.","research_field":"Computer & Information Sciences, Software Engineering, Systems & Development","web_page":"https:\/\/www.python.org\/","documentation":"https:\/\/docs.python.org"},{"name":"orca","job_count":10934,"description":"Orca is a flexible, efficient, and powerful quantum chemistry software written by Frank Neese and co-workers. It provides accurate and reliable computational simulations of molecular structures and properties based on quantum mechanical methods.","research_field":"Chemical Sciences","web_page":"https:\/\/www.kofo.mpg.de\/en\/research\/services\/orca","documentation":"https:\/\/www.kofo.mpg.de\/970316\/orca_manual_5_0_4.pdf"},{"name":"lammps","job_count":7613,"description":"LAMMPS (Large-scale Atomic\/Molecular Massively Parallel Simulator) is a classical molecular dynamics code designed for simulating large-scale atomistic systems. It is highly versatile and can be used to model a wide range of materials and complex molecular structures.","research_field":"Chemical Sciences","web_page":"https:\/\/www.lammps.org\/","documentation":"https:\/\/docs.lammps.org\/Manual.html"},{"name":"r","job_count":2749,"description":"R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows, and MacOS.","research_field":"Computer Science","web_page":"https:\/\/www.r-project.org\/","documentation":"https:\/\/cran.r-project.org\/manuals.html"},{"name":"openfoam","job_count":2452,"description":"OpenFOAM (Open Source Field Operation and Manipulation) is a free, open-source computational fluid dynamics (CFD) software package developed by the OpenFOAM Foundation. It is widely used for simulating complex fluid flows in various industries and academic research.","research_field":"Physical Sciences","web_page":"https:\/\/www.openfoam.com\/","documentation":"https:\/\/doc.openfoam.com\/2312\/"},{"name":"q-espresso","job_count":2294},{"name":"gromacs","job_count":2200,"description":"GROMACS (GROningen MAssive Parallel MD for Molecular Dynamics) is a versatile package for molecular dynamics simulations with a strong emphasis on high-performance computing capabilities.","research_field":"Biological Sciences","web_page":"https:\/\/www.gromacs.org\/","documentation":"https:\/\/manual.gromacs.org\/current\/index.html"},{"name":"spec","job_count":2144},{"name":"citcoms","job_count":1319},{"name":"ncbi-blast","job_count":1229,"description":"BLAST+ (Basic Local Alignment Search Tool) is a suite of tools for performing similarity searches of nucleotide and protein databases. It is widely used for comparing biological sequences to identify homologous sequences and infer functional and evolutionary relationships.","research_field":"Biological Sciences","web_page":"https:\/\/blast.ncbi.nlm.nih.gov\/doc\/blast-help\/downloadblastdata.html","documentation":"https:\/\/github.com\/ncbi\/blast_plus_docs"}]
SSH Login Nodes
MFA Required
On
RP Account Required
Off
Login Text
  • Users can use their ACCESS account to receive an allocation and login.
  • Logging into your ACCESS account will require Duo two-factor authentication.
External Storage

Files in scratch directories are not recoverable. Files in scratch directories are not backed up. If you accidentally delete a file, a disk crashes, or old files are purged, they cannot be restored.

$PROJECTspace. The project space will be created for each allocation. $PROJECT and $WORK variables refer to the same location and can be used interchangeably.

ANVIL CEPH

Anvil Ceph is intended to provide scalable, fault-tolerant, and high-throughput storage for large or persistent research data. It supports both object and block storage, making it suitable for hosting shared datasets, storing long-term research outputs, and enabling data access for containerized or cloud-integrated workflows. Ceph complements the Lustre-based storage tiers by offering durable and easily expandable storage for diverse data management needs.

Inspecting file system quotas

To check the quota of different file systems, type myquota at the command line.

File Transfer Methods
Transfer Method
SCP
Data Transfer Node
anvil.rcac.purdue.edu
Recommended
Off
Transfer Method
Rsync
Data Transfer Node
anvil.rcac.purdue.edu
Recommended
Off
Jobs Information

Notes:

  • Specify your desired partition to prevent it from being automatically assigned to the shared queue
    • e.g.: -p wholenode
  • For node-exclusive (wholenode and wide) queues, your job will automatically be allocated one full node.

See Anvil example job scripts - https://www.rcac.purdue.edu/knowledge/anvil/run/examples

Storage Filesystems
Directory
Anvil ZFS
File System Path
/home
Quota
25 GB
Purge Policy
Not Purged
Notes
Home directories: area for storing personal software, scripts, compiling, editing, etc.
Directory
Anvil ZFS
File System Path
/apps
Quota
N/A
Purge Policy
Not Purged
Notes
Applications
Directory
Anvil GPFS
File System Path
/anvil
Quota
N/A
Purge Policy
Not Purged
Directory
Anvil GPFS
File System Path
/anvil/scratch
Quota
100 TB
Purge Policy
Files older than 30-day will be purged
Notes
User scratch: area for job I/O activity, temporary storage
Directory
Anvil GPFS
File System Path
/anvil/projects
Quota
5 TB
Purge Policy
Removed 90 days after allocation expiration
Notes
Per allocation: area for shared data in a project, common datasets and software installation
Directory
Anvil GPFS
File System Path
/anvil/datasets
Notes
Common data sets (not allocated to users)
Queue Specifications
Queue Name
debug
CPUs
256 cores AMD EPYC™ 7763 CPUs
RAM
256GB
Queue Name
wholenode
CPUs
2,048 cores AMD EPYC™ 7763 CPUs
RAM
256GB
Queue Name
wide
CPUs
7,168 cores AMD EPYC™ 7763 CPUs
RAM
256 GB
Queue Name
shared
CPUs
128 cores AMD EPYC™ 7763 CPUs
RAM
256 GB
Queue Name
highmem
CPUs
128 cores AMD EPYC™ 7763 CPUs
RAM
256 GB
Datasets
Dataset Name
AI
Dataset Description

https://datasetdocs.readthedocs.io/en/latest/ai/index.html

Dataset Name
Covariates
Dataset Description

https://datasetdocs.readthedocs.io/en/latest/Covariates/index.html

Dataset Name
Geospatial
Dataset Description

https://datasetdocs.readthedocs.io/en/latest/geospatial/index.html

Dataset Name
Hydrological
Dataset Description

https://datasetdocs.readthedocs.io/en/latest/hydrological/index.html

Dataset Name
iGenomes
Dataset Description

https://datasetdocs.readthedocs.io/en/latest/igenomes/index.html

Dataset Name
Meteorological
Dataset Description

https://datasetdocs.readthedocs.io/en/latest/meteorological/index.html

Dataset Name
GeoAI
Dataset Description

https://datasetdocs.readthedocs.io/en/latest/geoai/index.html