Computing Resources

 

The ARM Data Center offers computing infrastructure to support next-generation atmospheric model simulations, petascale data storage, and big-data analytics and machine learning for atmospheric and climate science research.

Users can request access to a high-performance computing facility that integrates a range of computer resources and storage systems.

The ARM Data Center provides a co-located data and computing platform that enables users to work with large volumes of ARM data without the need to download them. ARM’s heterogeneous and flexible computing architecture provides resources for conventional physics-based model simulations for data analysis and machine learning.

Cumulus Cluster

Cumulus is a midrange Dell system that consists of 16,384 processing cores with a 4-petabyte General Parallel File System (GPFS). It will be used for Large-Eddy Simulation (LES) ARM Symbiotic Simulation and Observation (LASSO) development and operation, radar data processing, large-scale reprocessing, value-added product generation, data quality analysis, and a variety of ARM-approved science projects.

Expectations for High-Performance Computing Users

  1. Code of Conduct. Principal investigators (PIs) must review and agree to ARM’s Code of Conduct when requesting access to ARM’s high-performance computing resources.
  2. Project Description. PIs must provide a clear use of ARM observational data and the computational approach when submitting their access request.
  3. Status Reports. If their project goes on longer than 1 year, PIs must provide annual status reports to ARM.
  4. Data Submission. High-performance computing projects that create new data products will be given an option to submit their data to the ARM Data Center. It is not required to create a data product to use the system.
  5. Final Report. A final report for the project is required to be submitted to ARM within 6 months of project completion.
  6. Acknowledgment of ARM Support. Investigators who receive ARM support for their work should use the following acknowledgment:

This research was supported by the Atmospheric Radiation Measurement (ARM) user facility, a U.S. Department of Energy (DOE) Office of Science user facility managed by the Biological and Environmental Research program.

Requesting Access to High-Performance Computing Resources

The Cumulus cluster is available to ARM users for their high-performance computing needs. Users can request to access this resource.

Active ARM user accounts are required for all PIs and co-PIs requesting access to ARM’s high-performance computing resources. PIs and co-PIs can create their individual ARM accounts by registering here. Requests to use the cluster cannot be completed until accounts have been created.

Requests to use the ARM high-performance computing resources should focus on activities that:

  • involve large volumes of ARM data, which would be prohibitive to download to other computer systems
  • require parallel processing using computationally intensive code applied to ARM data sets
  • analyze voluminous LASSO outputs.

Requests for the use of Cumulus will be reviewed quarterly and may undergo scientific peer review. Computational requests must:

  • clearly indicate the relevance of the proposed computational activities to the ARM mission
  • describe the ARM data sets to be analyzed
  • explain why ARM computational cluster resources are the most appropriate computational resources for achieving the science goals.

To submit a high-performance computing proposal, PIs must submit a request form. Review of proposal and response to PIs will be communicated within 4 weeks and may include requests for clarifying information and/or information regarding the review timeline.

Once approved, the ARM Data Center High-Performance Computing team will communicate with PIs to enable access to Cumulus and provide any technical support needed.

Large-scale simulations and data analytic applications that do not require the use of large volumes of ARM data may be more suitable for the U.S. Department of Energy’s (DOE) National Energy Research Scientific Computing Center or other DOE leadership computing user facilities, such as the Oak Ridge National Laboratory Leadership Computing Facility and Argonne National Laboratory Leadership Computing Facility. Allocations for these computational facilities should be requested directly from the individual facilities.

Executing a Project

All project allocations will be for a duration of 1 year and can be extended through a request process. A status report is required to be submitted by the PI on an annual basis throughout the duration of the project.

Closing Out a Project

When closing out a project, the PI should be prepared to submit a brief, final report of the outcome to the ARM Field Campaign Administrator to complete the ARM documentation.