05-4 Madden High Performance Distributed Computing
05-4 Madden High Performance Distributed Computing
HIGH PERFORMANCE
DISTRIBUTED COMPUTING
June 2002
Surface Warfare Center Division
HiPer-D PROGRAM
DARPA GOAL: HiPer-D Premise: AEGIS GOAL:
Transition Computing New Computer Program Eliminate Capacity &
Technology to Military & System Architecture Scalability Bottlenecks
Required to Fully Exploit
COTS Technology
Replication Application
QoS Enterprise
Services QoS Broker
Specs. Mgt.
Constraint: Fulfill all required functionality of the application within the context of
the existing system and with minimal change to the other applications.
June 2002
Surface Warfare Center Division
DSS ARCHITECTURE
Display State Server
register for state updates
Display Tactical
State Applications
Submode To
Display
Status
state changes Monitor
(cro, hooked track, balltab) Alert
post alert
submode actions
Control remove alert
Alert Alert
Display Handler Handler Display
Data Data
Handler Handler
June 2002
Surface Warfare Center Division
DSS/AQuA ARCHITECTURE
Warm-passive replicas using AQuA
Display
Display Display Display Display
State Server
Gateway Gateway Gateway Gateway Gateway
Gateway
Proteus Ensemble Gateway Gateway Gateway Gateway
Dependability Gossip Object Tactical Tactical Tactical
Manager Server Factory Application Application Application
Replication Manager –
replicas managed on a One object factory per
process not object basis node where replicated
process may be allocated
Application
* AQuA passive replication performs state transfer after each Application CORBA ORB
Interceptors transform
method call on/by a replicated object. IIOP
CORBA method calls
Gateway
** Ensemble GCS provides FT Group membership, heartbeats, CORBA ORB into Ensemble messages.
failure detection, ordered atomic multicast. Gateway
Ensemble
June 2002
HiPer-D DEMO BLOCK DIAGRAM
QOS SPECS
G
RESOURCE USAGE NETWORK QOS U
UAV
FIRE SIM 21 CFF Amaranth Truth N
BROKER Display VIDEO
CALL
ALLOCATION DECISION
TO RESOURCE QuO FOR S
O FIRE.
MANAGEMENT I
T TO RESOURCE
MISSION
MANAGEMENT M
H PRIORITIES
MSHN MMWS
June 2002
Surface Warfare Center Division
STATE CONSISTENCY
Requirement
• Maintain state data consistency among replicated
components of a real-time distributed system to the
degree required to effect replica fail-over within the Application
required time limits
Replica
Technical Issues
• Multiple objects within a single application process
may interact to create a composite state
Composite
• Multiple threads may be required with a single
process – possibly multiple threads per object State
• Event timers may be used to initiate periodic
processing or monitor for time-out conditions
• Out-of-band communications, sometimes via
unreliable channels (sensor interfaces, legacy
interfaces, interfaces to persistent media)
• Time-based computations (e.g. an algorithm that
extrapolates data forward in time)
• Sequences of actions resulting from single initiating
event
• State shared across objects in disparate, distributed
processes
• High throughput/low-latency interactions, with only
a small subset of communications affecting the state
that must be maintained across replicas
• State may include both application/object state and
infrastructure/ORB state June 2002
HiPer-D DYNAMIC
Surface Warfare Center Division
RESOURCE MANAGEMENT
June 2002
HiPer-D DEMO 01
Surface Warfare Center Division
DRM ARCHITECTURE
Specifications Adaptive Resource Management Theater-Level
Theater-Level
Mission
Resource Policy-Selected MissionPriority
Priority
Resource Allocation Control
Control
QoS Manager
QoSSpec
Spec Manager Algorithms
Modifications
Modifications
Globus
System Host
HostLoad Network
NetworkLoad
System/ /QoS
QoS QoS Managers Load
Analyzer
Load
Analyzer
Specifications
Specifications Analyzer Analyzer
Monitor Control
Collector
Collector/ / History Host
Host Hardware
HardwareFault
Fault Application
Application
Correlator
Correlator Servers Discovery
Discovery Detection
Detection Control
Control
Quasar
Application
Application UNIX and WinNT Network
Network QuasarOS-Level
OS-Level UNIX
UNIXand
andNT
NTApp
App
OS Monitors Monitoring Feedback Control Agent
Instrumentation Monitoring Feedback Control Agent
Instrumentation Adaptation
Adaptation
Applications
Applications
June 2002
DEFINITION OF
Surface Warfare Center Division
DRM OPEN INTERFACES
• Ongoing effort to define and implement open DRM interfaces
• Use standards-based middleware where applicable
• Allow for incremental enhancement of DRM capabilities by research community
• Allow alternate components and algorithms to be easily integrated and evaluated
June 2002
DEFINITION OF
Surface Warfare Center Division
DRM OPEN INTERFACES
• Approach:
• Define and document interface requirements:
• data requirements (needed information, data formats, etc...)
• control flow requirements
• QoS requirements (timing, scalability, survivability, security, etc...)
• Identify and evaluate relevant CORBA services
• determine services that support most or all interface requirements
• identify and document shortfalls
• Define detailed interfaces and API’s based on selected services
• Implement defined interfaces
• Evaluate performance of defined interfaces
• Document lessons-learned and iterate
June 2002
DRM OPEN INTERFACE
Surface Warfare Center Division
EXAMPLE
Example: Distribution and Updating of System Specification Information
• Spec Info Needed Specification
Throughout RM Files
Infrastructure
• Different RM
File
Components Require File
Read
Different Spec Info Write
Request
• 10’s to 100’s of Clients
Specification System
• Clients (and possibly Trusted Response Client
Modifications Specification
Clients Connections
Servers) Brought Up, and Updates Server Spec Update
Down, and Moved at
Notification
Run-Time
Interface Requirements:
• Client Specification Requests • Derived Requirements:
• Reliable Soft Real-Time (Bounded) Response Time Requirements • Fault Tolerance
• “Timely” Client Notification when Server(s) Go Down • State Consistency
• Specification Updates and Distribution of Updates: • Scalability
• Reliable Near Real-Time Pub/Sub Model Needed for Notification of Updates
• Soft RT requirements for Maintaining State Consistency Throughout System (low secs)
• Optional Security / Authentication needed for Spec Updates
June 2002
Surface Warfare Center Division
NETWORK QoS CONTROL
• Defining Network QoS Monitoring and Control Framework
• defining requirements, responsibilities, and interactions between:
• applications
• middleware
• resource management
• network QoS policy
• network QoS monitoring and control mechanisms
• Issues:
• At what level(s) should network QoS control mechanisms be accessible?
• Do Applications and Middleware require knowledge of and/or access to network
QoS control mechanisms?
• How should Network QoS Control Policy be incorporated into the framework?
• How to monitor Application and Middleware-level network information?
• How to control Application and Middleware-level network interfaces?
June 2002
RM NETWORK QoS
Surface Warfare Center Division
CONTROL EXPERIMENT
Local vs. Global View of Goal: RM Monitoring and Control of Network
Network QoS Control Actions: QoS Via Interception and Remapping of
Application / Middleware-Initiated QoS
• Application-level: Calls
• local perspective only
• knowledge of impact on application • Transparent RM Control of Network QoS
performance only • Ability to Monitor and Control Middleware
• Resource Management-level: Network QoS Activity
• system-wide perspective
• system-level view of impact across Dynamic
all applications Resource
Management
Application
Mission Priorities QoS Requests
Application Requirements Config Info
App Statuses Middleware
Requested
Network QoS Controls
Middleware
Resource QoS Validation
Ordered
Network QoS Controller Network QoS
QoS Controls
Control Mechanisms Control Mechanisms
Notional Layering for Application-Level Notional Layering and Interfaces with Resource
Control of Network QoS (via Middleware) Management for Network QoS Control
June 2002
RM NETWORK QoS
Surface Warfare Center Division
CONTROL EXPERIMENT
Scenario: Unmanned Aerial Vehicle (UAV) Video Pipeline
UAV Video Pipeline Video
VideoMode
Mode
• Surveillance: Low Priority - Disable Reservation on Link
Switch • Targeting: High Priority - Enable Reservation on Link
using TAO Switch
A/V Streaming Service CORBA
CORBA
Video Video Video
VideoDisplay
A/V Stream A/V Stream Pipe Video
Video Video Display Video
Source MPEG Distribution MPEG Broker Display
Source Distribution Broker Display
UAV Video Frames Frames
(MPEG File) • Video Pipeline developed as joint effort by
BBN, NSWCDD, OOMWorks, and WuSTL
RSVP-Enabled Network Link
• RAPI developed by USC/ISI