|
20 September | ||
|
0730 |
Check-In / Poster Setup / Continental Breakfast | |
|
AUDITORIUM | ||
|
0830 |
Welcome | |
|
0835 |
Keynote Address | |
|
0905 |
Opening
Remarks | |
|
0915 |
Session
1: Advanced Hardware | |
|
0925 |
Future of
Embedded Software from an Historical Perspective
(Invited) | |
|
0955 |
Cell Processor (Invited) | |
|
1025 |
Break | |
|
1040 |
Applications Kernels on Graphics Processing Units: An
Analysis of Hidden Markov Models, Support Vector Machines,
Hyperspectral Imaging, and Latent Semantic Indexing | |
|
1110 |
Embedding Applications within a Storage
Appliance | |
|
1140 |
Poster /
Demo A: FPGAs Everywhere | |
|
1150 |
Poster / Demo A PrÈcis | |
|
Poster A.1 |
Super-FPGA:
Overcoming Von Neumann to Save Moore | |
|
Poster
A.2 |
Mapping of a
2D SAR Backprojection Algorithm to an SRC Reconfigurable Computing
MAP Processor | |
|
Poster A.3 |
Enhancing
FPGA-Based Encryption on the Cray XD1 | |
|
Poster
A.4 |
A
Superpipelined CORDIC Unit | |
|
Poster
A.5 |
Real-Time
FPGA Implementation of Adaptive Beamforming Using QR
Decomposition | |
|
Poster
A.6 |
A
Data-Driven SoC System for Embedded Continuous Speech
Recognition | |
|
Poster
A.7 |
Iterative
Demodulation and Turbo Decoding for Distributed Radio
Receivers | |
|
Poster
A.8 |
Automatic
Mapping of the MATLAB Code to Parallel FPGA's on the SGI
Altix | |
|
Poster A.9 |
Interface
Techniques for Microprocessors Embedded Within
FPGA's | |
|
Poster A.10 |
Parallel FFT
and Parallel Cyclic Convolution Algorithms with Regular Structure and No Processor
Intercommunication | |
|
Poster
A.11 |
A
Methodology for Exploring Finite-Precision Effects when Solving
Linear Systems of Equations with Least-Squares Techniques in
Fixed-Point Hardware | |
|
Poster A.12
|
Rapid
Prototyping of a Real-Time Range Compression
Processor | |
|
1235 |
Lunch / View Posters | |
| PARALLEL SESSIONS | ||
|
AUDITORIUM |
Room S2-180 | |
|
1345 |
Session
2: Hardware Architecture and System Metrics |
Focus
1: Algorithms and FPGAs |
|
1355 |
A Next
Generation Ultra-High Performance Scalable Processing Architecture
for Embedded Defense Signal and Image Processing
Applications |
Unbounded
Transactional Memory (Invited) |
|
1425 |
A VLIW
Processor with Hardware Functions: Increasing Performance
While Reducing Power |
High-Performance FPGA-Based QR Decomposition |
|
1455 |
Implementations of Signal Processing Kernels Using Stream
Virtual Machine for Raw Processor |
Pipelined
Data Path for an IEEE-754 64-Bit Floating-Point Jacobi
Solver |
|
1525 |
Break / View Posters |
Break / View Posters |
|
1550 |
The HPEC
Challenge Benchmark Suite |
Sparse
Matrix-Vector Multiplication Kernel on a Reconfigurable
Computer |
|
1620
|
A Relative
Development Time Productivity Metric for HPC
Systems |
GPGP:
General Purpose Computation Using Graphics Processors |
|
1650 |
Mitigating the Risks of High Performance Computational Science and
Engineering (Invited) | |
|
1720 |
Adjourn | |
|
1800 |
Reception (Burlington Marriott) | |
|
1845 |
Banquet
Presentation | |
|
1930 |
Banquet | |
|
21 September |
| |
|
0730 |
Check-In / Continental Breakfast | |
|
0830 |
Announcements | |
|
0835 |
Session
3: Advanced Parallel Environments | |
|
0845 |
X10
Programming (Invited) | |
|
0915 |
MathWorks
Recent and Future Solutions for High Productivity (Invited) | |
|
0945 |
Break | |
|
1000 |
Advanced
Hardware and Software Technologies for Ultra-long FFT's | |
|
1030 |
An
Interactive Approach to Parallel Combinatorial Algorithms with
Star-P | |
|
1100 |
Poster /
Demo B: High Performance Software Technologies | |
|
1110 |
Poster/Demo B PrÈcis | |
|
Poster B.1 |
Process and
Shared Object Library Scheduling for High-Performance
Hybrid-Reconfigurable Embedded Systems |
|
|
Poster B.2 |
Improving
Rapid Application Development Environments Through
Coordination | |
|
Poster B.3 |
Performance
Estimates of a STAP Benchmark on the IBM Cell Processor | |
|
Poster B.4 |
Open HPEC Systems: Design and Profiling Tools for Multiprocessor Signal
Processing Applications using MPI | |
|
Poster B.5 |
Integration
of Interactive MATLABÆ and Linux Clusters | |
|
Poster B.6 |
Application
of Functional Coverage-Driven-Verification (CDV) Methodology to
Real-Time Embedded Systems-on-Chip (SoC) for HW/SW State-Space
Co-verification and Architectural Exploration | |
|
Poster B.7 |
Combining
Moore's Law and Amdahl's Law Showing How Software Can Save Moore's
Price/Performance Model | |
|
Poster B.8 |
What Makes
HPEC Applications Challenging? - Understanding
Application/Architecture Interactions | |
|
Poster B.9 |
InfiniPath: A New High Speed, Low Latency Cluster
Interconnect | |
|
Poster B.10 |
How Code
Generation Will Save Moore's Law | |
|
Poster B.11 |
Evaluation
of Graphical Programming and Automated Code Generation
Software Tools for Use in Missile Defense Applications | |
|
Poster B.12 |
FPGA
Implementation of MIMO Wireless Receiver in Interference | |
|
Poster B.13 |
Applying
Model Driven Architecture to Radar Systems | |
|
Poster B.14 |
Scalable and
Portable Supercomputing | |
|
Poster B.15 |
Accelerating
Blocked Matrix-Matrix Multiplication Using a Software-Managed Memory
Hierarchy with DMA | |
|
1205 |
Lunch / View Posters | |
|
1335 |
Session
4: Award Finalist Session - Advanced Software | |
|
1345 |
pMapper: Automatic Mapping of Parallel Matlab
Programs | |
|
1415 |
VSIPL++Pro -
A High-Performance VSIPL++ Implementation | |
|
1445 |
High-Productivity Stream Programming for High-Performance
Systems | |
|
1515 |
Break | |
|
1540 |
Panel:
"Will Software Save Moore's Law?" | |
|
1730 |
Adjourn | |
|
22 September |
|
|
|
0730 |
Check-In / Continental Breakfast | |
|
0830 |
Announcements | |
|
0835 |
Software
Producibility (Invited) | |
|
0905 |
Session
5: Standards Usage and Update |
|
|
| ||
|
0925 |
Using
the OCP Standard for FPGA Reuse |
|
|
0940 |
OMG
Data-Distribution Service (DDS): Architectural
Update |
|
|
0955 |
Eclipse
Tools |
|
|
1010 |
Break |
|
|
1025 |
UML2.0
Profiles for Modeling Real-Time and Quality of Service |
|
|
1040 |
Integrating
VSIPL Support in the Dataflow Interchange Format |
|
|
1055 |
Implementation of an Embedded DoD VSIPL Application on the
DARPA Polymorphous Computing Architectures (PCA) RAW
Processor |
|
|
1115 |
Poster /
Demo C: Software Standards |
|
|
1125 |
Poster/Demo C PrÈcis |
|
|
Poster C.1 |
VSIPL
++: A Single Processing Library Scaling with Moore's Law |
|
|
Poster C.2 |
A
Software Methodology for Real-Time Target Recognition |
|
|
Poster C.3 |
An FPGA
API for VSIPL ++ |
|
|
Poster C.4 |
Implementation of Floating-Point VSIPL Functions on FPGA-Based
Reconfigurable Computers Using High-Level Languages |
|
|
Poster C.5 |
Evaluation of an Embedded Signal Processing System for Generic Air
Traffic Processor |
|
|
Poster C.6 |
Return
of the Hypercube: 10 Gbps per Node over 802.3av, Scaling
LInearly to 12 Nodese |
|
|
Poster C.7 |
FGPA-Based
Signal Acquisition System |
|
|
Poster C.8 |
The Scalable
Software Interconnect for Distributed Radar Signal Processing |
|
|
Poster C.9 |
A CMOS
Digital Decoder ASIC for an Advanced Digital Receiver |
|
|
1200 |
Lunch |
|
| PARALLEL SESSIONS | ||
|
1300 |
Session
6: Advanced Systems |
Focus 2:
Hardware Tools and Network Technologies |
|
1330 |
Adapting
Parallel Backprojection to an FPGA Enhanced Distributed Computing
Environment |
C-Based
Hardware Design Platform for a Dynamically Reconfigurable Processor |
|
1400 |
An
Embedded DoD Discrimination and Classification Processing Challenge
for DARPA Architectures for Cognitive Information Processing (ACIP) |
Application Development for Hybrid Pipelined Systems |
|
1430 |
Break (View Posters) |
Break (View Posters) |
|
1450 |
RapidIO-Based
Space System Architectures for Synthetic Aperture Radar and Ground
Moving Target Indicator |
A
Streaming Virtual Machine for GUPs |
|
1520 |
Case
Study: Real-Time Demonstration of a Knowledge-Aided STAP
Algorithm Using PVL |
An
Integrated Photonic Network for Multi-Processor
Applications |
|
1550 |
Performance Analysis of Kernel Benchmarks on Tiled Architectures |
A High
Performance Programming Model for Large-Scale Molecular Dynamics
Calculations on Reconfigurable Supercomputers |
|
1620 |
High
Performance, Environmentally Adaptive Fault Tolerant Computing
(EAFTC) |
High
Performance Computing from a General Formalism: Conformal
Computing© Techniques Illustrated with a Quantum Computing Example |
|
Adjourn
/ Awards |
||
![]() |
|||||||||||||
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
||||||||