| Title | Date Added | Company | |
|---|---|---|---|
![]() |
Speculative Threading: Creating New Methods of Thread-Level Parallelization | 2005-12-01 | Intel |
| This paper discusses the factors that are driving the need for speculative threading. It will then explain a new speculative threading model developed within Intel's research labs. Preliminary performance results using this model indicate the potential for significant processing gains with only a small increase in power requirements. Tomorrow's computing workloads from RMS and other applications of the future will require exponential increases in computing power. Multi-core platforms and multithreading are paving the way to meeting these performance needs. To do it though, they require a companion effort in accelerating the development of the thread-level parallelism necessary for reaching their full performance potential. | |||
![]() |
Speculative Threading: Creating New Method of Thread-Level Parallelization | 2005-12-01 | Intel |
| A new era of computing has begun and the word "Multi" provides a key clue. This new era will be characterized by multi-core processors enabling single-die multiprocessing of multithreaded applications. Signaling the start of this new era is the emergence of new workloads and usage models into mainstream computing. Intel is conducting extensive research on some of these new workloads, particularly those associated with Recognition, Mining and Synthesis (RMS). Multi-core processors are essential in helping attain this computing power. Intel researchers have been investigating a number of compiler techniques and microarchitectures that can provide highly threaded code through the use of speculative threading. This paper discusses the factors that are driving the need for speculative threading.
Tags: Parallel Processing |
|||
![]() |
A Power-Aware Run-Time System for High-Performance Computing | 2005-11-18 | Association for Computing Machinery |
| The High-Performance Computing (HPC) community has focused on performance, where performance is defined as speed. To achieve better performance per compute node, microprocessor vendors have not only doubled the number of transistors (and speed) every 18-24 months, but they have also doubled the power densities. Consequently, keeping a large-scale HPC system functioning properly requires continual cooling in a large machine room, thus resulting in substantial operational costs. Furthermore, the increase in power densities has led to a decrease in system reliability, thus leading to lost productivity. To address these problems, this paper proposes a power-aware algorithm that automatically and transparently adapts its voltage and frequency settings to achieve significant power reduction and energy savings with minimal impact on performance.
Tags: Processors, High Performance Computing |
|||
![]() |
Heterogeneous Chip Multiprocessors | 2005-11-01 | Institute of Electrical and Electronics Engineers |
| Heterogeneous (or asymmetric) chip multiprocessors present unique opportunities for improving system throughput, reducing processor power, and mitigating Amdahl's law. On-chip heterogeneity allows the processor to better match execution resources to each application's needs and to address a much wider spectrum of system loads - from low to high thread parallelism - with high efficiency.
Tags: ASICs - Chip Sets |
|||
![]() |
The MIPS32 24KE Core Family: High-Performance RISC Cores With DSP Enhancements | 2005-10-24 | MIPS Technologies |
| The MIPS32 24KE core family is the latest high-performance synthesizable microprocessor cores from MIPS Technologies, Inc. and features DSP enhancements at a negligible cost. The 24KE core family fills an important gap in the convergence of RISC processors and Digital Signal Processors (DSPs). This convergence is enabled by emerging market trends, and comes with cost advantages and technical challenges. The 24KE design meets these challenges quite handily as shown in this paper. The 24KE cores were built by adding the new DSP enhancements to the existing pipeline in the 24K cores. The performance benefit for DSP applications from these enhancements can be as much as two times compared to a 24K design.
Tags: RISC-Based Servers, High Performance Computing |
|||
![]() |
Intel Embedded Compact Extended Form Factor Single Board Computer Interface | 2005-10-01 | Intel |
| The Single Board Computer (SBC) small form factor solution has become increasingly important for meeting the challenges of the embedded environment. This paper serves both as an open standard form factor specification reference, and a white paper discussion of a proof-of-concept board. The Intel Embedded Compact Extended Form Factor (Intel ECX Form Factor) is an embedded, non-standard body, open standard, small form factor (SFF) single board computer. It is one of the industry's smallest and most widely adopted SBC form factors. The dimensions of the Intel ECX Form Factor SBC are 105 mm x 146 mm.
Tags: Interfaces - Buses - I-Os, Embedded Microprocessors |
|||
![]() |
Improving Application Efficiency Through Chip Multi-Threading | 2005-10-01 | Sun Microsystems |
| Chip multi-threading (CMT) brings to hardware the concept of multi-threading, similar to software multi-threading. A CMT-enabled processor, similar to software multi-threading, executes many software threads simultaneously within a processor on cores. So in a system with CMT processors, software threads can be executed simultaneously within one processor or across many processors. Executing software threads simultaneously within a single processor increases a processor's efficiency as wait latencies are minimized. The paper introduces CMT technology, such as chip multi-processing (CMP), chip multi-processing with hardware threads, and simultaneous multi-threading, from a software developer's perspective.
Tags: Software Engineering |
|||
![]() |
The Evolution of Build-Up Package Technology and Its Design Challenges | 2006-04-04 02:54:05 | |
| This paper reviews Sequential Build-Up (SBU) laminate substrate development from its beginning in 1988. These laminated substrates are nonuniform structures composed of three elements: a core, build-up layers, and finishing layers. Each element has evolved to meet the demands of packaging applications. Thin-film processing has greatly enhanced the wiring capability of SBU laminate substrates and has made this technology very suitable for high-performance designs. This paper focuses on the challenges encountered by IBM during the design, manufacture, and reliability testing phases of development of SBU substrates as solutions for Application-Specific Integrated Circuit (ASIC) and microprocessor packaging applications. | |||
![]() |
Improving the Performance of Embedded Superscalar Microprocessors by Adding Partial Pipelines | 2005-08-20 | University of York |
| This paper introduces adding a "Partial" pipeline to a base embedded superscalar microprocessor implementation to achieve cost effective performance improvements. This method is exemplified by adding a "Partial" Integer Pipeline (IP) to the TriCore TM 2.0 MCU/DSP core. The "Partial" IP pipeline, designed based on TriCore 2.0 simulation results of the EEMBC benchmark suite, executes a subset of TriCore 2.0 IP instructions. They used the basic block sampling and simulation technique to simulate enhanced TriCore 2.0 models, and obtained results indicating that adding the partial IP pipeline can achieve similar performance improvements to duplicating the full IP pipeline. Their approach can be applied to the early design stages of microprocessor development in order to explore design spaces.
Tags: Embedded Microprocessors |
|||
![]() |
Building Cutting-Edge Server Applications: Intel Xeon Processor Family Features the Intel NetBurst Microarchitecture With Hyper-Threading Technology | 2005-08-04 | Intel |
| The Intel NetBurst microarchitecture significantly enhances P6 microarchitecture as well as introduces new innovative performance related features, such as Execution Trace Cache, Rapid Execution Engine, Hyper-Pipelined technology, and Streaming SIMD Extensions 2. It also triples the throughput on the system bus, allowing the Intel Xeon processor family to deliver industry-leading performance and over the next several years. But typically (even with highly optimized code), not all of the available execution resources are used during each clock cycle, due to dependencies, memory latencies and data bottlenecks. Hyper-Threading technology provides two logical processors on each physical processor and lets applications execute separate tasks on each logical processor. This increases the utilization of resources on the processor, thereby reducing bottlenecks and increasing throughput for multi-threaded applications.
Tags: Application Servers, High Performance Computing |
Oracle Live Webcast
Increase Your Bottom Line with Network Intelligence
HP StorageWorks 2000sa Modular Smart Array
Enabling easy transition from direct attached to centralized storage.
Six Priorities for Today’s Economic Climate
Learn how to reduce costs and achieve maximum value from IT.
Give Your Business the Competitive Edge
With the industry's most connected business intelligence solution.
Protect Your Business Critical Systems
With award-winning disaster recovery solutions by NEC.
Free IT Salary Report 2009
Register and be the first to download this invaluable resource
Find out the top concerns of CIOs / IT managers in Asia-Pacific