Article | REF: H1012 V1

VLIW processors

Author: Daniel ETIEMBLE

Publication date: August 10, 2015, Review date: March 8, 2022

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


Overview

Français

ABSTRACT

VLIW processors rely on the compiler to exploit the parallelism between the instructions of a sequential code to launch multiple independent instructions at each clock cycle. Three examples are detailed. Successive processors of the TMS320Cx family of Texas Instruments come close to the pure VLIW approach. The Analog Device Tiger Sharc processor has both VLIW and superscalar features. The EPIC architecture implemented in Itanium processors distinguishes parallel instructions defined by the compiler and VLIW instructions: the hardware decides which instructions are executed at each clock cycle. The VLIW processors are widely used in DSP or multi-core and manycore processors for low power consumption.

Read this article from a comprehensive knowledge base, updated and supplemented with articles reviewed by scientific committees.

Read the article

AUTHOR

  • Daniel ETIEMBLE: Engineer INSA Lyon - Professor Emeritus, Université Paris Sud

 INTRODUCTION

This article examines the main features of processors with very long instruction words, generally referred to as VLIW (Very Long Instruction Word). A VLIW processor uses the instruction parallelism existing in a sequential program to start the execution of several elementary instructions, which make up the very long instruction word, at each clock cycle. Unlike superscalar processors, where the hardware dynamically determines which instructions can be executed simultaneously, VLIW processors rely totally or mainly on the compiler to constitute the very long instruction word. This characteristic helps explain why these processors perform best when the program behavior is known at compile-time, which is often the case for signal and image processing (DSP).

After a brief history of VLIW processors, the article presents the principles of VLIW execution. A "pedagogical" processor, close to the characteristics of existing VLIWs, is used to illustrate the essential features facilitating the use of the software pipeline to execute elementary instructions belonging to different iterations of a computation loop at each clock cycle. Details are given of how to make the execution of elementary instructions conditional, and how to handle multi-cycle computation instructions, with and without "spinning registers".

Three examples of VLIW processors are then presented, from the pure VLIW approach where the compiler controls all dependencies to approaches mixing VLIW features with hardware dependency control. The first example is the TMS320C6x family of signal processors from Texas Instruments, with successively C62 and C67, C64, C64+ and C674, then C66x. These processors are widely used as DSP processors, offering excellent performance and low power consumption. The second example is TigerSharc, which combines VLIW and superscalar features. A competitor to the TMS320C6x in the early 2000s, this processor has had no successor since. The third example is the EPIC architecture implemented in Itanium processors. This architecture distinguishes between parallel instructions defined by the compiler and the 128-bit long instruction (3 elementary instructions). The hardware decides which parallel instructions can actually start each cycle. VLIW features (predicated instructions, rotating registers, etc.) and hardware features (branch prediction, early memory accesses) are described in detail. Designed by Intel and HP to supplant superscalar processors with unordered execution, EPIC and Itanium processors failed to achieve this goal and have now been abandoned by Intel.

The VLIW approach has been very successful with DSP processors. It is also widely used for signal processing coprocessors in systems-on-a-chip. It is also a viable solution for low-power multi-core...

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors
+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year
From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

KEYWORDS

instruction level parallelism   |   VLIW processor   |   software pipelining   |   Computer   |   signal processing


This article is included in

Software technologies and System architectures

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
VLIW processors