From the Editors

The Future of Computing Performance

By Douglass Post

“The times, they are a-changing.” —Bob Dylan

The National Academy of Sciences (NAS) recently released a National Research Council (NRC) study entitled The Future of Computing Performance. It confirmed what we're all observing—that is, that computing is undergoing a radical change.1 The study summarizes why the processing speed of individual computer chips will no longer increase dramatically each year, discusses the implications of this for computing, and outlines what is needed to continue to improve computing performance in the future.

Prior to 2004, the computer microprocessor's clock frequency (and the chip's peak processing power) was increasing by a factor of 100 every decade: the continual reduction in computer chips' feature size let chip manufacturers raise the clock frequency while maintaining a constant power density for single processors, so applications generally ran faster each year with little or no change to the programs themselves. In 2004, due to the electronic properties of CMOS silicon chips, it became impossible to increase the clock frequency further without increasing the power density to unacceptable levels, and the rate of increase slowed to about a factor of two per decade. The clock speed has now reached a plateau of approximately 3-4 GHz, and the interpretation of Moore's law that implies that clock frequency will continue to increase exponentially is no longer valid (see http://en.wikipedia.org/wiki/Multi-core_processor). The impact of this has been rumbling through the world of high-performance computing since 2004 and is now starting to affect general computing.

This is not good news for scientific or general computing. As the NRC study documents, progress in computing—and a significant portion of US international technology competitiveness—is due to the continued growth of computing performance. However, all is not lost. It will still be possible to continue reducing microprocessors' feature size for at least another five to 10 years, so Moore's law will continue to be valid as stated in his original paper.2
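A brief aside on why power density, specifically, produced the 2004 plateau just described (this is the standard textbook account, not part of the report's text): the dynamic power a CMOS chip dissipates scales roughly as

    P ≈ α · C · V² · f,

where α is the switching activity, C the switched capacitance, V the supply voltage, and f the clock frequency. For decades, each feature-size shrink also allowed V to drop, so f could climb while power per unit area held steady (so-called Dennard scaling). Around 2004, supply voltages could no longer be lowered without unacceptable leakage current, so any further increase in f raised power density directly. That is what pinned clock speeds near 3-4 GHz.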


Although microprocessors' raw single-processor speed will no longer improve, chip manufacturers can continue to increase the number of processors per chip without increasing the power density, so that chips can perform more operations per second even if single-processor performance doesn't increase. This has led to "many-core" chips, which hold an increasing number of processors. Companies have announced chip designs with several hundred cores (http://en.wikipedia.org/wiki/Multi-core_processor), so we can reasonably expect chips with thousands of cores, and even more, in the future.

Unfortunately, achieving better performance with this new, massively parallel computer architecture requires major changes in most software.3 The days of obtaining performance increases from higher processor speed are mostly over. Most computer programs are sequential—that is, they perform operations one after another rather than in parallel. That's a problem, because exploiting the new, massively parallel computers will require programs that can perform many calculations in parallel, something only a few highly advanced programs are now able to do.
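To make the distinction concrete, here is a minimal sketch, in C++, of the same reduction written both ways; the array size, the thread count, and the choice of a simple sum are all illustrative, not drawn from the report.

#include <algorithm>
#include <iostream>
#include <numeric>
#include <thread>
#include <vector>

int main() {
    std::vector<double> data(10000000, 1.0);

    // Sequential: one core walks the whole array, one element at a time.
    double seq_sum = std::accumulate(data.begin(), data.end(), 0.0);

    // Parallel: split the array into one chunk per hardware thread, reduce
    // each chunk independently, then combine the partial sums at the end.
    unsigned n = std::max(1u, std::thread::hardware_concurrency());
    std::vector<double> partial(n, 0.0);
    std::vector<std::thread> workers;
    std::size_t chunk = data.size() / n;
    for (unsigned t = 0; t < n; ++t) {
        auto first = data.begin() + t * chunk;
        auto last = (t + 1 == n) ? data.end() : first + chunk;
        // Each thread writes only its own slot, so no locking is needed.
        workers.emplace_back([first, last, &partial, t] {
            partial[t] = std::accumulate(first, last, 0.0);
        });
    }
    for (auto& w : workers) w.join();
    double par_sum = std::accumulate(partial.begin(), partial.end(), 0.0);

    std::cout << seq_sum << " == " << par_sum << "\n";
}

Even in this toy case, the parallel version is visibly more complex than the sequential one, and a sum is the easy case: every chunk is independent of every other.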

The NRC study describes the implications of this for computing:

• Massively parallel codes are more complex than sequential codes, and programming tools are rudimentary.
• Modifying existing applications to exploit these new computer architectures will be a substantial job, and might not even be feasible, as not all sequential algorithms can be made parallel.
• The architectures for massively parallel computers are still evolving rapidly, so programmers will be trying to hit a moving target.

To increase the challenge, it's likely that single chips will contain many different types of processor architectures. For instance, many chip vendors have achieved processor-speed increases of factors of 10 to 100 or more using data-streaming concepts employed by general-purpose GPUs (GPGPUs; see http://en.wikipedia.org/wiki/GPGPU). Exploiting these "heterogeneous" architectures will require building applications of even greater complexity.
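"Data streaming" here refers to applying the same operation independently to every element of a large array, which is the pattern a GPGPU can execute across thousands of hardware threads at once. Below is a minimal sketch of that pattern, written as ordinary C++ for illustration; a GPU programming model such as CUDA or OpenCL would instead launch the loop body as thousands of concurrent threads, one or more elements per thread.

#include <cstddef>
#include <vector>

// The classic "saxpy" data-parallel kernel: y[i] = a * x[i] + y[i].
// No iteration reads any other iteration's result, so all of them can
// run at the same time; that independence is what GPGPUs exploit.
void saxpy(float a, const std::vector<float>& x, std::vector<float>& y) {
    for (std::size_t i = 0; i < y.size(); ++i)
        y[i] = a * x[i] + y[i];
}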

The NRC study recommended six high-priority research and development programs1 that are needed to make the transition from sequential computing to massively parallel computing with heterogeneous processors:

• algorithms that can exploit parallel processing;
• new computing "stacks" (applications, programming languages, compilers, runtime/virtual machines, operating systems, and architectures) that execute parallel rather than sequential programs and effectively manage software parallelism, hardware parallelism, power, memory, and other resources;
• portable programming models that allow expert and typical programmers to express parallelism easily and allow software to be efficiently reused on multiple generations of evolving hardware (a later-standardized example of such a model is sketched after this list);
• parallel-computing architectures driven by applications, including enhancements of chip multiprocessors, conventional data-parallel architectures, application-specific architectures, and radically different architectures;
• open interface standards for parallel programming systems that promote cooperation and innovation to accelerate the transition to practical parallel computing systems; and
• engineering and computer science educational programs that incorporate an increased emphasis on parallelism and use a variety of methods and approaches to better prepare students for the types of computing resources that they'll encounter in their careers.

The report summarizes each topic's need and scope and argues convincingly that we must address these areas if we are to have applications that can exploit the new generation of computer architectures.
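As one concrete illustration of the portable programming models the report calls for: C++17 eventually standardized execution policies, which let the same algorithm call run sequentially or in parallel by changing a single tag, leaving the mapping onto available cores to the library. This postdates the report and is offered only as a sketch of the idea, not as something it recommends by name.

#include <algorithm>
#include <execution>
#include <vector>

int main() {
    std::vector<double> v(1000000, 2.0);

    // Identical algorithm and loop body; only the policy tag differs.
    std::for_each(std::execution::seq, v.begin(), v.end(),
                  [](double& x) { x = x * x; });  // sequential
    std::for_each(std::execution::par, v.begin(), v.end(),
                  [](double& x) { x = x * x; });  // parallel, library-managed
}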

At the 22 March 2011 meeting of the National Academies in Washington, DC, at which review panel members outlined the study, a member of the audience asked, "There have been many studies in the past that pointed out the need for more R&D on these topics, and nothing happened. Why will it be different this time?" Samuel Fuller, the study chair, said the difference is that both the chip and computer vendors and the federal agencies now recognize the problem and the consequences if application performance doesn't improve.3 Specifically, vendors recognize that no one will buy their new products if those products don't offer greater functionality and performance, while the government recognizes that US economic and military competitiveness are strongly tied to increased computer performance. The NSF and the US Department of Energy are beginning to address these issues.

But will these efforts be enough? Even if all of the research topics receive considerable support, it will be many years before the impact is felt. It won't be easy, and it will take time, for the computing community to grow from approximately 1,000 parallel programmers to 100,000-200,000 parallel programmers. New computer languages and interface standards take time to mature and achieve broad adoption, and so on. Although it's clear that computing has "suddenly" become a lot more challenging, the study also makes it clear that there are ways to continue improving computing performance if we take the right steps. As a start, I recommend that everyone in computing read the NRC study.

References
1. S. Fuller and L. Millett, eds., The Future of Computing Performance, Nat'l Academies Press, 2011; www.nap.edu/catalog.php?record_id=12980.
2. G. Moore, "Cramming More Components onto Integrated Circuits," Electronics, vol. 38, no. 8, 1965, pp. 114-117.
3. D. Patterson, "The Trouble with Multi-Core," IEEE Spectrum, vol. 47, no. 7, 2010, pp. 28-32.

Douglass Post is associate editor in chief of CiSE. Contact him at [email protected].

