COOL Chips XV
Yokohama Joho Bunka Center (Yokohama Media & Communications Center),
Yokohama, Japan
Call for Contribution
Call for Participation
Organizing Committee
Program Committee
Advisory Committee
Travel Information
Past COOLCHIPS

Sister Conferences HOT CHIPS HOT INTERCONNECTS



The 15th anniversary of IEEE annual symposium COOL15

CALL FOR PARTICIPATION



Keynote

Tofu Interconnect Controller for Fujitsu's Highly Scalable Supercomputer

Yuichiro Ajima (Fujitsu Ltd., Japan)

Abstract: The K computer, which is the current world's fastest supercomputer, combines 88,128 processor chips using an interconnection network called Tofu Interconnect. Fujitsu's new supercomputer system FX10 is also powered by the Tofu interconnect. We developed an interconnect controller (ICC) chip which integrates all active components of the Tofu interconnect. In this talk, we will present a technical overview of the ICC chip. The ICC chip provides a Tofu network router, four Tofu network interfaces, and a Tofu barrier interface. The Tofu network router provides four internal and ten external ports. Internal ports connect the Tofu network interface for each, and external ports are used to construct a six-dimensional mesh/torus network. The Tofu network interface supports Remote Direct Memory Access (RDMA) communication, and a Tofu barrier interface provides offload capability for synchronization and reduction communication.

Yuichiro Ajima is a system architect in the Next-Generation Technical Computing Unit at Fujitsu. His research focuses on high-performance computing system architecture. Ajima has a PhD in information engineering from the University of Tokyo. He is a member of the Information Processing Society of Japan and IEEE.




Nonvolatile Logic-in-Memory Architecture Using an MTJ/MOS-Hybrid Structure and Its Applications (tentative)

Takahiro Hanyu (Tohoku University, Japan)

Abstract: Communication bottleneck between memory and logic modules has increasingly become a serious problem, which causes large power dissipation in the recent nanometer-scaled VLSI chips. One method to solve such emerging VLSI-chip problems is to use "nonvolatile" logic-in-memory architecture. In this architecture, nonvolatile storage elements are distributed over a logic-circuit plane, so that it is expected to realize both ultra-low-power and reduced interconnection delay because of great reduction of global interconnection counts and volatile storage-element counts. In this presentation, I demonstrate concrete standby power-free logic circuits based on a nonvolatile logic-in-memory structure using magnetic tunnel junction (MTJ) devices in combination with MOS transistors. Since the MTJ device with a spin-injection write capability is only one device that has all the following superior features as large resistance ratio, virtually unlimited endurance, fast read/write accessibility, scalability, CMOS-process compatibility, and no volatility, it is very suited to implement the MOS/MTJ-hybrid logic circuit with logic-in-memory architecture. As typical examples of the proposed nonvolatile logic-in-memory circuitry, an MTJ-based nonvolatile Look-Up Table (LUT) circuit for an instant power-ON/OFF Field Programmable Gate Array and an MTJ-based nonvolatile Ternary Content-Addressable Memory are also demonstrated together with the fabricated test-chip results.

Takahiro Hanyu received the B.E., M.E. and D.E. degrees in Electronic Engineering from Tohoku University, Sendai, Japan, in 1984, 1986 and 1989, respectively. He is currently a Professor in the Research Institute of Electrical Communication, Tohoku University. His general research interests include nonvolatile logic circuits and their applications to ultra- low-power VLSI processors. He received the Sakai Memorial Award from the Information Processing Society of Japan in 2000, the Judge's Special Award at the 9th LSI Design of the Year from the Semiconductor Industry News of Japan in 2002, the APEX Paper Award of Japanese Society of Applied Physics in 2009, the Excellent Paper Award of IEICE, Japan in 2010, Ichikawa Academic Award in 2010, and the Best Paper Award at IEEE Computer Society International Symposium on VLSI 2010. Dr. Hanyu is a senior member of the IEEE.




The Expanding Universe of Embedded Imaging (tentative)

Masaki Hiraga (Morpho, Inc., Japan)

Abstract: TBD

Masaki Hiraga is President of Morpho,Inc. Masaki received his DSc degree from the University of Tokyo, Graduate School of Science, Department of Information Science. He founded Morpho, Inc. in 2004. Morpho, Inc. is a leading company of software imaging solutions for mobile devices. Customers utilizing Morpho's software technologies include carriers, processing platform providers and mobile device manufactures making the company a global leader in mobile imaging.




Application Scalability - Key to Low Power, Performance Growth, and Exascale

Wen-mei Hwu (Illinois Univ., USA)

Abstract: Parallelism has become the main venue of performance growth and power reduction. Once an application achieves good performance for a given hardware and data set, it must be able to scale effectively in terms of hardware parallelism and data size. Parallelism scalability allows the application to take advantage of a wide range of current and future generation hardware. Data scalability allows the application to handle the ever increasing data size in the real world while managing the ever limiting memory bandwidth. The rise of CPU-GPU heterogeneous computing has significantly boosted the pace of progress in this field. There has been rapid progress in numeric methods, algorithm design, programming techniques, compiler transformations and optimization tools for developing scalable applications. In preparation of petascale applications for deployment on Blue Waters, we have been further accelerating this revolution. In this talk, I will discuss these recent advances, their implications on the future course of computing and computer design.

Wen-mei W. Hwu is a Professor and holds the Sanders-AMD Endowed Chair in the Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign. He is also CTO of MulticoreWare Inc., chief scientist of UIUC Parallel Computing Institute and director of the IMPACT research group (www.crhc.uiuc.edu/Impact). He co-directs the UIUC CUDA Center of Excellence and serves as one of the principal investigators of the $208M NSF Blue Waters Petascale computer project. For his contributions, he received the ACM SigArch Maurice Wilkes Award, the ACM Grace Murray Hopper Award, the ISCA Influential Paper Award, and the Distinguished Alumni Award in Computer Science of the University of California, Berkeley. He is a fellow of IEEE and ACM. Dr. Hwu received his Ph.D. degree in Computer Science from the University of California, Berkeley.




Blue Gene/Q: Performance, Power, Space, and Efficiency

Shigenori Shimizu (IBM Japan Ltd, Japan)

Abstract: TBD

Dr. Shimizu is an IBM Distinguished Engineer working in systems and technology area. He received B.E., M.S., and Ph.D. degrees, all in instrumentation engineering, from Keio University, Japan, in 1977, 1979, and 1983, respectively. He joined IBM as a research staff of Japan Science Institute of IBM Japan (now the Tokyo Research Laboratory) in 1983, and now a technical staff of Japan STG laboratory. His areas of specialization have included high performance computing, parallel processing architecture and design, and various embedded technologies.






Panel Discussion

Topics: "Technology exchange: Supercomputing and Embedded computing"

Organizer & Moderator: Hideharu Amano (Keio Univ, Japan)


Special Sessions (invited lectures)

Advanced Virtual Prototyping of Multiprocessor SoCs

Frédéric Pétrot (TIMA Laboratory, France)

Abstract: Virtual prototyping is a technology whose goal is to simulate the behavior of an entire digital system, including the software running on the processors, and the digital hardware. It relies on specific modeling approaches, at different levels of abstraction, so that speed/accuracy trade-offs can be made. This talk will review the challenges of virtual prototyping techniques, and introduce the level of abstractions that have been agreed upon. We will then more specifically focus on the interpretation of software codes and detail two techniques, an interpretive one based on dynamic binary translation and a native one making use of hardware assisted virtualization.

Frédéric Pétrot received the DEA (master) and PhD degree in Computer Science from Université Pierre et Marie Curie (Paris VI), Paris, France, in respectively 1990 and 1994. From 1995 to 2004, he was assistant professor, and contributed actively to the Alliance VLSI CAD System and the Disydent ESL environment. F. Pétrot joined TIMA in September 2004, and holds a professor position at the Grenoble Institute of Technology, France, where, since 2007, he heads the System Level Synthesis group. His main research interests are in system level design of integrated systems, and include computer aided design of digital system, architecture and software for homogeneous and heterogeneous multiprocessor systems on chip.




The Challenges of Analyzing Embedded Processor Behavior In the Age of Complex SoCs

Markus Levy (EEMBC, USA)

Abstract: Drawing on the experience of the Embedded Microprocessor Benchmark Consortium (EEMBC), this presentation will detail the methodology used to develop benchmarks that target horizontal technologies such as floating-point and multicore and vertical technologies such as smartphones, automotive, and Android. In addition to performance-related aspects, I will also discuss battery-life measurement techniques for smartphones, a subject that is often fraught with misinterpretation and abuse. The advanced development effort of these benchmarks is faced with many challenges such as ensuring repeatability, portability, and the ability to defeat unwarranted optimizations. Furthermore, these diverse and popular topics present the design engineer with unique challenges in trying to understand how to analyze the embedded processor and system behavior. Therefore, this presentation will also explain how to apply these benchmark techniques to designing next-generation processors and systems, as well as for system designers making tradeoffs between performance and power.

Markus Levy is founder and president of EEMBC. He is also president of The Multicore Association and chairman of Multicore Developers Conference. Mr. Levy was previously a senior analyst at In-Stat/MDR and an editor at EDN magazine, focusing in both roles on processors for the embedded industry. Levy began his career in the semiconductor industry at Intel Corporation, where he served as both a senior applications engineer and customer training specialist for Intel's microprocessor and flash memory products. He is the co-author of Designing with Flash Memory, the only technical book on this subject, and received several patents while at Intel for his ideas related to flash memory architecture and usage as a disk drive alternative. He is also a volunteer firefighter.