F286 University of Florida Lockheed-Martin, Inc. Electronics and Missiles Division $32,792 Lockheed-Martin Sanders, Inc. 30,000 Gerhard Ritter Department of Computer and Information Science and Engineering University of Florida Gainesville FL 32611 352-392-1212 352-392-1220 ritter@cise.ufl.edu Mark Schmalz Department of Computer and Information Science and Engineering University of Florida Gainesville FL 32611 352-392-3984 352-392-1220 mssz@cise.ufl.edu Joseph Wilson Department of Computer and Information Science and Engineering University of Florida Gainesville FL 32611 352-392-1360 352-392-1220 jnw@cise.ufl.edu Adaptive Image Manager for Reconfigurable Architectures http://www.cise.ufl.edu/research/projects/AIM/ We plan to develop an Adaptive Image Manager (AIM), which would implement debugging and near-optimal mapping of algorithms to reconfigurable hardware (FPGAs, parallel processors, and SPMD networks of processors) under constraint of hardware status parameters. We plan to test the AIM system on several platforms. First, we would construct an FPGA simulator with extensive hardware fault/failure simulation capability. After running the AIM software using this simulator, we plan to test AIM on a commercially-available FPGA hardware. Second, we plan to test AIM on reconfigurable SIMD-parallel processors, such as the Lockheed-Martin PAL processor. Third, if time permits, we plan to test AIM network optimization capabilities on SPMD networks of workstations using libraries expressed in a unifying language such as the Message Passing Interface (MPI). The functionality of the AIM Software Layer, which is the primary focus of this study, would be derived from our Portable Image Manager (PIM), which translates image algebra statements into a hierarchical computational structure (directed acyclic graph or DAG) called an expression tree, with associated source-level debugging. Since DAG vertices represent operands or operations and its edges represent dataflow paths, there can exist a natural basis for a homomorphism between the expression tree and the dataflow graph of a target architecture. This facilitates algorithm-to-hardware mapping. PIM carries additional information (e.g., implementational details of operand datatype, structure, etc.) in a symbol table, which enriches the processes and structures we plan to employ for status monitoring in AIM.

The existing PIM scheduler employs a heuristic optimizer to produce a sequence of PIM server calls that conform to a predetermined, fixed protocol. We plan to expand the PIM protocol to yield AIM server calls that would (a) admit a wider range of datatypes or image/signal representations and (b) support interactive debugging. We plan to drastically enhance the PIM scheduler by adding the following modules:

High-level (Symbolic) Debugger would perform consistency checking, dependency analysis, and rewriting (as required) on a given expression tree to reduce erroneous or costly dependencies and subtree redundancy. Optimizing the subtree segmentation problem for various type, size, and form of subtrees typically encountered in ISP/ATR algorithms would be a key research topic.

Status Manager (SM) would gather status information of hardware/software modules and hardware interconnects via planned add-ons to existing libraries. The SM would post status information to a data structure (similar in concept to a blackboard), that would be queried by the AIM debuggers (modules A and C of this list). A key technical issue is the development of a unifying protocol for status reporting that would simplify the augmentation of hardware libraries to include a status interface partition.

Low-level (Execution) Debugger would accept status information from the Status Man ager and refine Scheduler output to produce near-optimal sequences of AIM server calls, to account for:

(1) Hardware reconfigurability as well as software reconfiguration strategies such as out-of-order execution and heuristic selection of alternative processing methods (e.g., multi-SIMD mesh reconfiguration via microcode swapping), to nearly optimize the time-space bandwidth product for a given configuration;

(2) Functionality and accuracy of available hardware, for example: (a) supported operations and associated computational precision, (b) hardware interconnect status at multiple resolution levels (e.g., network, subnet, processor, PE, chip, or gate); and (c) hardware faulting or failing behaviors (monitored at an effective resolution level, i.e., where monitoring overhead does not significantly impact algorithm performance).

The AIM Hardware Layer would contain equipment and associated hardware interface libraries (each specific to a given machine type) that would have (a) control/reconfiguration partitions and (b) a partition that would interface with hardware to report machine status. The status partition would derive from existing test routines (written in microcode) provided by hardware manufactur ers or subcontractors, who would provide technical assistance for library development. Subcontractor interactions would directly support development of these capabilities. New Start

Unifying model of computation (MOC), partitioned into three levels: network, processor, and gate. Each level would have a specific set of attributes for status monitoring and performance characterization, from which instances of various devices are derived (e.g., processing elements and chips are derived from the Processor Level). The MOC would be graph-structured to facilitate algorithm-to-architecture mapping using transformations between the algorithm dataflow graph and that of target architectures.

Requirements and design for debugging for reconfigurable architectures using (a) a source-code to middleware translator to detect syntax errors, (b) a symbolic debugger to detect and correct dependen cies, extract repeated subtrees, etc., and (c) and execution debugger to detect and generate alternative mapping or scheduling strategies for faulting or failing hardware components. We envision that the following three products that would be transferable to DoD and selected private-sector organizations:

(A) Due to its architecture independence, the AIM software could be configured to function with a variety of parallel and networked computers, thus realizing portability of AIM to various military, government, medical, and business applications, as discussed previously.

(B) The Lockheed-Martin PAL processor, together with AIM, would permit high-bandwidth image and signal processing (2.5 GFLOPs per PAL-II board) using interfaces such as Khoros (currently supported). This would realize robust, near-optimal, high-bandwidth SIMD-parallel computing for a variety of military and medical applications, at a reason able cost (less than $4k per board projected for PAL-II).

(C) Advanced reconfigurable computers based on field-programmable gate arrays (FPGAs) would be more feasible in practice, since AIM would provide a means for integrating such processors. Examples of such machines are Virtual Machine Works emulator boards, and various reconfigurable signal processors developed by Lockheed-Martin Sanders using Xilinx XC3000 and XC4000 PCI-compatible FPGA development platforms.

As discussed in specific technology transitions 1-3, below, results of the proposed study would immediately benefit military applications that required dynamic reconfigurability, especially as low-level tools for reconfiguring such processors are further developed and fielded.

Given the long history of success at UF in product development and technology transfer (summarized below), enhancement of product quality, scope, and delivery could feasibly be achieved in collaboration with the sponsor, subcontractors, and affiliated organizations. This would further support DoD goals of increasing research and development capabilities of military, government, university, and industrial organizations.

For example, the following specific technology transitions are a feature of our research during academic year 1997-98:

1. Lockheed-Martin Electronics and Missiles, Inc., Orlando FL -- will receive direct technology transfer as a result of our subcontract with this organization. Knowledge to be transferred will include, but not be limited to (a) new design concepts for the PAL SIMD processor, with performance and reliability analyses derived from our simulations using the AIM SIMD model, (b) improved fault isolation and remediation procedures for SIMD-parallel processors, (c) results of mapping various image processing and ATR algorithms to PAL, and (d) improvements in the PIM control software for PAL that are suggested by our AIM research.

2. Lockheed-Martin Sanders, Inc., Nashua NH -- will receive direct technology transfer as a result of our subcontract with this organization. Knowledge to be transferred will include, but not be limited to (a) new design concepts for reconfigurable computing using FPGAs, with performance and reliability analyses derived from our simulations using the AIM FPGA model, (b) improved fault isolation and remediation procedures for various types of FPGAs, (c) results of mapping various image processing and ATR algorithms to FPGA simulators and hardware, and (d) an assessment of advanced technology requirements for future FPGA implementations.

3. USAF Wright Laboratory (WL), Eglin AFB, FL -- UF has a long, productive relationship with WL/MN and plans to directly transfer knowledge to various POCs managing applications in advanced ATR, guidance, targeting, and fire control systems, etc. This will be done through an existing USAF contract with WL/MN.

Planned future technology transition paths that have not yet been implemented include contacting FPGA manufacturers (e.g., Xilinx) and FPGA-based computer manufacturers (e.g., Virtual Machine Works). Our initial conversations with such organizations indicated that they could benefit from our research results. In late Autumn 1997, we plan to ask if they are still interested in receiving research reports, papers, etc. that describe our research in FPGA fault isolation, which was their primary interest. This technology transfer could potentially faciliate improved reliability of FPGAs as well as FPGA-based computing devices. 15 JUL 97 DABT 63-97-C-0023, AIM Larry Carter, Contracting Officer Representative Adaptive Image Manager for Reconfigurable High-Performance Computer Systems 29 JUN 97 Due to the recent start date, we have not yet been able to reconcile our proposed effort with the expected committed funds. Thus the current award profile by year is an estimate that is certain to be revised. 28 JUN 00 97151,370 98180,000 99312,000 00132,296 151,370 0 151,370 31 DEC 97 180,000 29 JUN 97