Moab® HPC Suite – Basic Edition is a multi-dimensional policy-based workload management system that accelerates and automates the scheduling, managing, monitoring, and reporting of HPC workloads on massive scale, multi-technology installations. The Moab HPC Suite – Basic Edition patented multi-dimensional decision engine accelerates both the decisions and orchestration of workload across the ideal combination of diverse resources, including specialized resources like GPGPUs. The speed and accuracy of the decisions and scheduling automation optimizes workload throughput and resource utilization so more work is accomplished in less time with existing resources to control costs and increase the value out of HPC investments.
Moab HPC Suite – Basic Edition enables you to address pressing HPC challenges including:
- Delays to workload start and end times slowing results
- Inconsistent delivery on service guarantees and SLA commitments
- Under-utilization of resources
- How to efficiently manage workload across heterogeneous and hybrid systems of GPGPUs, hardware, and middleware etc.
- How to simplify job submission & management for users and administrators to maximize productivity
Moab HPC Suite – Basic Edition Accelerates Workload Across Infrastructure and Organizational Complexity
Moab HPC Suite – Basic Edition acts as the “brain” of an HPC system to accelerate and automate complex decision making processes. The patented decision engine is capable of making the complex multi-dimensional policy-based decisions needed to schedule workload to optimize job speed, job success and resource utilization. Moab HPC Suite – Basic Edition integrates decision-making data from and automates actions through your system’s existing mix of resource managers. This enables all the dimensions of real-time granular resource attributes and state as well as the timing of current and future resource commitments to be factored into more efficient and accurate scheduling and allocation decisions. It also dramatically simplifies the management tasks and processes across these complex, heterogeneous environments. Moab works with many of the major resource management and industry standard resource monitoring tools covering mixed hardware, network, storage and licenses.
Moab HPC Suite – Basic Edition policies are also able to factor in organizational priorities and complexities when scheduling workload and allocating resources. Moab ensures workload is processed according to organizational priorities and commitments and that resources are shared fairly across users, groups and even multiple organizations. This enables organizations to automatically enforce service guarantees and effectively manage organizational complexities with simple policy-based settings.
Managing the World’s Top Systems, Ready to Manage Yours
Moab manages the world’s largest, most scale-intensive and complex HPC environments in the world including 40% of the top 10 supercomputing systems, nearly 40% of the top 25 and 36% of the compute cores in the top 100 systems based on rankings from the www.Top500.org. So you know it is battle-tested and ready to efficiently and intelligently manage the complexities of your environment.
Moab HPC Suite – Basic Edition drives more ROI and results from your HPC environment including:
-
Improved job response times and job throughput with a workload decision engine that accelerates complex workload scheduling decisions to enable faster job start times and high throughput computing
-
Optimized resource utilization to 90-99 percent with multi-dimensional and predictive workload scheduling to accomplish more with your existing resources
-
Automated enforcement of service guarantees, priorities, and resource sharing agreements across users, groups, and projects
-
Increased productivity by simplifying HPC use, access, and control for both users and administrators with job arrays, job templates, optional user portal, and GUI administrator management and monitoring tool
-
Streamline job turnaround and reduce administrative burden by unifying and automating workload tasks and resource processes across diverse resources and mixed-system environments including GPGPUs
-
Provides a scalable workload management architecture that can manage peta-scale and beyond, is grid-ready, compatible with existing infrastructure, and extensible to manage your environment as it grows and evolves
Moab HPC Suite – Basic Edition accelerates workload processing with a patented multi-dimensional decision engine that self-optimizes workload placement, resource utilization and results output while ensuring organizational priorities are met across the users and groups leveraging the HPC environment.
Key capabilities include:
 |
Policy-driven scheduling intelligently places workload on optimal set of diverse resources to maximize job throughput and success as well as utilization and the meeting of workload and group priorities
- Priority, SLA and resource sharing policies ensure the highest priority workloads are processed first and resources are shared fairly across users and groups such as quality of service, hierarchical priority weighting, and fairshare targets, limits and weights policies
- Allocation policies optimize resource utilization and prevent job failures with granular resource modeling and scheduling, affinity- and node topology-based placement
- Backfill job scheduling speeds job throughput and maximizes utilization by scheduling smaller or less demanding jobs as they can fit around priority jobs and reservations to use all available resources
- Security policies control which users and groups can access which resources
- Checkpointing
|
 |
Real-time and predictive scheduling ensure job priorities and guarantees are proactively met as conditions and workload levels change
- Advanced reservations guarantee that jobs run when required
- Maintenance reservations reserve resources for planned future maintenance to avoid disruption to business workloads
- Predictive scheduling enables the future workload schedule to be continually forecasted and adjusted along with resource allocations to adapt to changes in conditions and new job and reservation requests
|
 |
Advanced scheduling and management of GPGPUs for jobs to maximize their utilization
- Automatic detection and management of GPGPUs in environment to eliminate manual configuration and make them immediately available for scheduling
- Exclusively allocate and schedule GPGPUs on a per-job basis
- Policy-based management & scheduling using GPGPU metrics
- Quick access to statistics on GPGPU utilization and key metrics for optimal management and issue diagnosis such as error counts, temperature, fan speed, and memory
|
 |
Easier submission, management, and control of job arrays improve user productivity and job throughput efficiency
- Users can easily submit thousands of sub-jobs with a single job submission with an array index differentiating each array sub-job
- Job array usage limit policies enforce number of job maximums by credentials or class
- Simplified reporting and management of job arrays for end users filters jobs to summarize, track and manage at the master job level
|
 |
Scalable job performance to large-scale, extreme-scale, and high-throughput computing environments
- Efficiently manages the submission and scheduling of hundreds of thousands of queued job submissions to support high throughput computing
- Fast scheduler response to user commands while scheduling so users and administrators get the real-time job information they need
- Fast job throughput rate to get results started and delivered faster and keep utilization of resources up
|
 |
Open and flexible management abstraction layer easily integrates with and automates management across existing heterogeneous resources and middleware to improve management efficiency
- Rich data integration and aggregation enables you to set powerful, multi-dimensional policies based on the existing real-time resource data monitored without adding any new agents
- Heterogeneous resource allocation & management for workloads across mixed hardware, specialty resources such as GPGPUs, and the multiple resource managers used to manage the resources
- Supports integration with job resource managers such as TORQUE and SLURM as well as integrating with many other types of resource managers such as HP Cluster Management Utility, Nagios, Ganglia, FlexLM, and others
|

 |
Ease of use and management improves productivity for both users and administrators
- Graphical administrator cluster management tool and portal provides unified workload management and reporting on resource utilization and status across the mixed resource environment to make management, issue diagnoses and performance optimization easier
- Optional customizable end-user portal provides visual job submission and management from any location, such as job forms, templates and start-time estimates, to reduce training and administrator requirements
- Job templates enable rapid submission of common jobs by pre-specifying the variety of resources needed for each job to reduce duplicate work and simplifying job submissions for users
|
Moab HPC Suite – Basic Edition is architected to integrate on top of your existing job resource managers and other types of resource managers in your environment to provide the policy-based scheduling and management of workloads and resource allocation. It makes the complex decisions based on all of the data it integrates from the various resource managers and then orchestrates the job and management actions through those resource managers. This makes it the ideal choice to integrate with existing and new systems as well as to manage your HPC system as it grows and expands in the future.
Moab HPC Suite – Basic Edition is designed with a patented intelligence engine architecture that enables it to integrate with and automate workload management across existing heterogeneous environments to improve management and workload efficiency. This unique architecture includes:
-
Industry leading multi-dimensional policies that automate the complex real-time decisions and actions for scheduling workload and adapting resources. These multi-dimensional policies can model and consider the workload requirements, resource attributes and affinities, SLAs and priorities to enable more complex and efficient decisions to be automated.
-
Real-time and predictive future environment scheduling & analytics that drive more accurate and efficient decisions and service guarantees as it can proactively adjust scheduling and resource allocations as it projects the impact of workload and resource condition changes.
-
Open & flexible management abstraction layer lets you integrate the data and orchestrate workload management actions across the chaos of complex heterogeneous IT environments and management middleware to maximize workload control, automation, and optimization. (Diverse hardware and resource types, management domains and silos, management tools, etc.)
Moab HPC Suite – Basic Edition Components:
Moab HPC Suite – Basic Edition includes the following integrated products and technologies for a complete cluster workload management solution:
-
Moab Workload Manager®: Patented intelligence engine that automates the complex decisions and automation for policy-based workload placement, scheduling and resource allocation
-
Moab Cluster Manager®: Graphical desktop administrator application for managing, configuring, monitoring, and reporting for Moab managed clusters
-
Moab Viewpoint™: Web-based user self-service job submission and management portal and administrator dashboard portal
-
Moab Services™: Integration interfaces to resource managers and third-party tools
-
Moab HPC Suite – Basic Edition is also integrated with TORQUE which is available as a free download on AdaptiveComputing.com. TORQUE is an open-source job/resource manager that provides continually updated information regarding the state of nodes and workload status. Adaptive Computing is the custodian of the TORQUE project and is actively developing the code base in cooperation with the TORQUE community to provide state of the art resource management. Each Moab HPC Suite – Basic Edition product subscription includes support for the Moab HPC Suite – Basic Edition as well as TORQUE, if you choose to use TORQUE as the job/resource manager for your cluster.
System Compatibility
Moab works with a variety of platforms. Many commonly used resource managers, operating systems, and architectures are supported.
Operating system support: for Linux (Debian, Fedora, FreeBSD, RedHat, SUSE), Unix (AIX, Solaris)
Resource Manager support: job resource managers such as TORQUE and SLURM as well as integrating with many other types of resource managers such as HP Cluster Management Utility, Nagios, Ganglia, FlexLM, and others
Hardware support: AMD x86, AMD Opteron, HP, Intel x86, Intel IA-32, Intel IA-64, IBM i-Series, IBM p-Series, IBM x-Series
Maximize Job Throughput and Meeting Organizational Priorities
Improve the speed and accuracy of the scheduling decisions and allocation automation to optimize job throughput so more work is accomplished in less time with your existing resources. This controls costs and increases the value your organization receives from its HPC investments. Moab® HPC Suite – Basic Edition provides a patented intelligence engine that accelerates both the complex decisions and orchestration of workload across the ideal combination of diverse resources. But with many users, groups and projects sharing valuable HPC resources, the relative priorities of the jobs and their results as well as service levels and resource sharing agreements are also critical factors to be considered in scheduling workload. These priorities must be balanced with the speed of processing all and as many jobs as quickly as possible. The Moab decision engine is able to factor in these organizational priorities and complexities when scheduling workload and allocating resources. Moab ensures workload is processed according to service levels, results delivery commitments, fair resource usage across users, groups and even multiple organizations. This enables the automatic enforcement of service guarantees and effective management of organizational complexities with simple policy-based settings.
Optimized Resource Utilization
Optimize the work and value you get out of existing expensive HPC resources to control the costs and need to invest in additional resources to meet current workload demand. Moab HPC Suite – Basic Edition optimizes resource utilization by 10-30% to 90-99% on a consistent basis with multi-dimensional policies such as allocation, backfill, and affinity that ensure the optimal resources are used for each job and that resources are optimally utilized and continuously processing jobs, including scheduling specialized resources like GPGPUs for the right jobs. The combination of real-time and unique predictive future scheduling capabilities in Moab enables the future workload schedule to be continually forecasted and adjusted along with resource allocations to adapt to changes in conditions and new job and reservation requests to keep resources utilized at maximum levels while meeting current and known future advance and maintenance reservations.
Extreme-Scale, High Throughput Computing Solution
High job throughput especially benefits customers with large quantities of data sets, variables and the need to run high volumes of simulations quickly for development and design efforts such as life sciences, engineering and manufacturing organizations as well as extreme-scale HPC environments. Moab HPC Suite – Basic Edition supports the hundreds of thousands of queued job submissions demanded for high throughput computing with fast and efficient scheduling and throughput of jobs, and sub-second user command responsiveness.