Revolutionize your HPC
with NVMesh high-performance computing (HPC) cloud storage solutions
- NVMesh software provides an extremely cost effective method to achieve unheard of burst buffer bandwidth to compute nodes and sharing the storage across the existing low latency network fabric.
- NVMesh provides redundancy without impacting target CPUs.
- Therefore, there is no need for additional dedicated hardware or proprietary file system integrations since NVMesh storage is provisioned as a simple block device.
- NVMesh Supports large-scale modeling, simulation, analysis and visualization
Problem/Solution
The problem
High-performance computing applications consist of complex sets of processes that sometimes run for weeks. When any of these processes is interrupted, this could destroy the results of the entire compute job. This problem becomes worse as supercomputers become more powerful. Therefore, parallel computing applications use the concept of checkpoint-restart. This technique allows compute jobs to be restarted from the most recently saved checkpoint in case of an interruption. Checkpoints are typically saved in a shared, parallel file system. But as clusters become larger and the amount of memory per node increases, each individual checkpoint becomes larger and either takes more time to complete or requires a higher-performance file system. When a system is checkpointing it’s not computing, which reduces the availability score of the system.
The solution
Excelero’s NVMesh big data storage solution drastically shortens those moments of unavailability and enables supercomputing centers to maximize their availability score, effectively enabling high-performance computing storage, si it could finish check-pointing faster and start running the job.
NVMesh Operator for RedHat OpenShift on Azure is available now
Experience all the benefits of accelerating applications development with the superior elastic storage performance in a containerized environment.
NVMesh with Spectrum Scale (GPFS)
NVMesh offers low-latency local access to remote NVMe devices. This feature plays well with various Spectrum Scale capabilities.