Call for Participation

Keynote Presentations

 

Accelerating AI with Analog In-Memory-Computing

Stefano Ambrogio  (IBM Research)

Abstract:   The last decade has witnessed the pervasive spread of AI in a variety of domains, from image and video recognition and classification to speech and text transcription and generation. In general, we have observed a relentless run towards larger models with huge number of parameters. This has led to a dramatic increase in the computational workload, with the necessity of several CPUs and GPUs to train and inference neural networks. Therefore, improvements in the hardware have become more and more essential.  To accommodate for improved performance, in-memory computing provides a very interesting solution. While digital computing cores are limited by the data bandwidth between memory and processor, computation in the memory avoids the weight transfer, increasing power efficiency and speed. The talk will describe a general overview, highlighting our own 14-nm chip, based on 34 crossbar arrays of Phase-Change Memory technology, with a total of around 35 million devices. We demonstrate the efficiency of such architecture in a selection of MLPerf networks, demonstrating that Analog-AI can provide superior power performance with respect to digital cores, with comparable accuracy.  Then, we provide guidelines towards the next steps in the development of reliable and efficient Analog-AI chips, with specific focus on the architectural constraints and opportunities that are required to implement larger and improved Deep Neural Networks.

Stefano Ambrogio obtained his PhD in 2016 at Politecnico di Milano, Italy, studying the reliability of resistive memories and their application on neuromorphic networks. He is now a Senior Research Scientist at IBM-Research, Almaden, in the Analog AI team, working on hardware accelerators based on Non-Volatile Memories for neural network inference.

  

“Optical Neural Networks With In-Memory Computation and Tensor”

Stanley Cheung (Hewlett Packard Enterprise)

Abstract:  TBA.

Stanley Cheung is currently a Principal Research Scientist in the Large-Scale-Integrated-Photonics (LSIP) Laboratory within Hewlett Packard Enterprise. His interests lie within pushing the state-of-the-art in integrated photonics for communications and computing applications. Currently, he has shifted his attention towards the use of non-volatile, ultra-low-power, heterogeneous III-V/Si photonics for novel neuromorphic/brain inspired computing architectures as well as general purpose programmable optics.  He has authored and co-authored over 80+ journal and conference papers and was granted 11 patents with another 50+ pending.

“Processing-in-Memory: from Technology to Products”

Kyomin Sohn (Samsung Electronics)

Abstract:  The traditional computing architecture represented by Von Neuman maintains a simple memory hierarchy that is still in use to this day. However, the strong demand for computing power, which began with big data and AI applications, is evolving from a new memory hierarchy. In particular, the emergence of large language models in generative AI, requires higher bandwidth and higher capacity of memory. This talk provides an explanation of the key concepts of in-memory computing, which is referred as CIM (compute-in-memory) or PIM (processing-in-memory). CIM technology enables a memory array as a processing unit using inherent feature of multiplication between wordline and bitline. In contrast, PIM technology utilizes internal memory bandwidth by allocating processing units near memory array and activating them simultaneosly. The concept of PIM technology is already proven by HBM2-PIM and GDDR6-AiM from the major DRAM vendors. It is the time to apply this technology to the mass-produced DRAM products. In the system and application having low operational intensity of data, PIM technology looks very attractive to overcome the limitation. However, there are obstacles to apply PIM technology to the conventional DRAM directly. The challenges will be discussed and the several suggestions will be given. From the PIM technology to the DRAM products with PIM technology, this journey will go on.

Kyomin Sohn is a Samsung Master (VP of Technology) in Samsung Electronics and he is responsible for future architecture and circuit technology of DRAM. He received the B.S. and M.S. degrees in Electrical Engineering in 1994 and 1996, respectively, from Yonsei University, Seoul. From 1996 to 2003, he was with Samsung Electronics, Korea, involved in SRAM Design Team. He designed various kinds of high-speed SRAMs. He received the Ph.D. degree in EECS in 2007 from KAIST, Korea. He rejoined Samsung Electronics in 2007, where he has been involved in DRAM Design Team. His interests include 3D-DRAM, reliable memory design, and processing-in-memory. In addition, he has currently served as a Technical Program Committee member of Symposium on VLSI Circuits since 2012.

  

“Intel Foundry Advanced Packaging: enabling low-power client through AI and HPC”

Patrick Stover (Intel)

Abstract: TBA.

Patrick Stover is the currently the Director of Foveros Technology Development Programs in Intel’s Assembly Test Technology Development (ATTD) organization and is also director of Package/Assembly development for Intel IFS.  Pat joined Intel’s Technology Development organization and ATTD in 1996 after completing his bachelor’s degree in Chemical Engineering from the University of Illinois. He has been managing development programs in ATTD since 2007, driving the next generation of package, assembly, and test technologies to support Intel’s products. He has influenced the design, materials, processes, equipment optimization, advanced tools, and metrologies to realize Moore’s Law. He also works closely with high-volume manufacturing teams to ensure cost-effective manufacturing processes are developed and deployed to meet time-to-market and world-class manufacturability and quality standards.  Some of the programs and projects he has coordinated and managed include developing unique, reduced form factor CPU and Chipset packaging for the 1st generation Apple TV and MacBook Air products and for the 1st Intel Atom® and Intel client “ULX” and “Y” segment products. Pat has been driving the Foveros 3D Stacking technologies since 2017, delivering packaging solutions for the Intel® Core™ processor with Intel® hybrid technology (“Lakefield”), Intel® Core™ Ultra processors (“Meteor Lake”), and the Intel GPU Max (“Ponte Vecchio”), the most complex packaging solution that Intel has delivered.

“Hot AI by COOL SoCs”

Hoi-Jun Yoo   (KAIST)

Abstract: In the current landscape of computing, the prevalence of AI applications on mobile devices has emphasized the critical importance of designing energy-efficient System-on-Chip (SoC) systems to curtail power consumption. Recognizing this, we present a comprehensive suite of low-power design techniques tailored to address the intricacies of various AI applications. One of the key pillars of our approach lies in harnessing the inherent sparsity within Convolutional Neural Networks (CNN). By strategically leveraging sparsity, we can intelligently skip unnecessary operations during the inference process. Our innovative design includes specific strategies like the Single Zero Skipping Logic, Dual Zero Skipping Logic, and Triple Zero Skipping Logic. These mechanisms collectively contribute to achieving a state-of-the-art level of energy efficiency, setting a new standard in the field. Beyond CNN optimization, we introduce Spiking Neural Networks (SNN) to increase sparsity within the input feature map. This nuanced incorporation enhances our ability to tailor the SoC design to the specific characteristics of AI workloads, further contributing to gains in energy efficiency. Moreover, we explore the synergies between CNN and SNN, presenting an approach that capitalizes on the strengths of both architectures for high energy efficiency. The culmination of these advancements results in the development of a highly energy-efficient SoC, proficient in processing a myriad of AI applications with remarkable power efficiency. Our design techniques extend beyond conventional applications, achieving the state-of-the-art energy efficiency in specialized domains such as deep reinforcement learning, 3D rendering utilizing Neural Radiance Fields (Nerf), and natural language processing with Large Language Models (LLM). In summary, our multifaceted approach to SoC design not only addresses the pressing need for energy efficiency in the realm of neural network processing, but also pushes the boundaries of AI applications that can be processed, making significant progress toward sustainable high-performance computing.

Hoi-Jun Yoo (Fellow, IEEE) graduated from the Department of Electronics, Seoul National University, Seoul, South Korea, in 1983. He received the M.S. and Ph.D. degrees in electrical engineering from the Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea, in 1985 and 1988, respectively. Dr. Yoo has served as a member of the Executive Committee for the International Solid-State Circuits Conference (ISSCC), the Symposium on Very Large-Scale Integration (VLSI), and the Asian Solid-State Circuits Conference (A-SSCC), the TPC Chair of the A-SSCC 2008 and the International Symposium on Wearable Computer (ISWC) 2010, the IEEE Distinguished Lecturer from 2010 to 2011, the Far East Chair of the ISSCC from 2011 to 2012, the Technology Direction Sub-Committee Chair of the ISSCC in 2013, the TPC Vice-Chair of the ISSCC in 2014, and the TPC Chair of the ISSCC in 2015. More details are available at http://ssl.kaist.ac.kr

 

 

 

 

 

Panel Discussion

Topics “TBA”

Organizer and Moderator: TBA.

 

 

Special Sessions (invited lectures)

“Navigating Aging Realities: Integrating Reliability into Cutting-Edge Computing Systems”

Andy Yu-Guang Chen  (National Central Univ. Taiwan)

Abstract: As CMOS technology undergoes further scaling down, the emergence of the aging effect poses a significant threat to the lifetime reliability of computing systems, with the potential to induce performance degradation or timing failures. Effectively addressing these challenges necessitates a comprehensive understanding of how aging effects impact the outcomes of modern computing systems, prompting the development of methodologies dedicated to aging detection, mitigation, and tolerance. This presentation aims to provide a concise overview of major aging effects and their root causes. Subsequently, a deeper exploration will unfold, focusing on two pivotal aspects: (1) Aging-aware, energy-efficient task deployment for heterogeneous multicore systems, and (2) Aging-aware SRAM-based Computing-In-Memory architecture specifically tailored for multiply-accumulate operations. Throughout the talk, I will showcase innovative concepts devised by our research team to confront these challenges and elaborate on the encountered implementation difficulties. The overarching goal is to furnish the audience with a foundational background in the design of reliable computing systems and to inspire additional researchers to contribute to this dynamic and evolving field.

Andy Yu-Guang Chen received his B.S. and Ph.D. degrees in Computer Science from National Tsing Hua University, Hsinchu, Taiwan, in 2009 and 2016, respectively. He held the position of Lecturer at Missouri University of Science and Technology, MO, USA, in 2015, and later served as a research fellow at the University of Notre Dame, IN, USA, in 2016. Following this, Dr. Chen worked as a project assistant on the ICT project at St. Kitts and Nevis with ICDF Taiwan from 2016 to 2017. From 2017 to 2019, he was part of the Department of Computer Science and Engineering at Yuan Ze University. Currently, he is an Assistant Professor in the Department of Electrical Engineering at National Central University, Taoyuan, Taiwan, since 2019. Additionally, he has been an Adjunct Assistant Professor in the Department of Computer Science at National Tsing Hua University, Hsinchu, Taiwan, since 2018.  Dr. Chen’s research focuses on reliable circuit and system design, Computing-In-Memory (CIM) architecture design, AI for physical design, and hardware security. He has authored numerous technical papers and actively contributed as a committee member in major conferences such as DAC, ASP-DAC, A-SSCC, ISVLSI, GLSVLSI, SASIMI. He has also served as a reviewer for esteemed journals like TCAD, TVLSI, ACM JETC. Dr. Chen’s involvement extends to co-chairing The CAD Contest at ICCAD and CADathlon at ICCAD from 2019 to 2023. Notably, he has received several awards, including The Chinese Institute of Electrical Engineering Outstanding Young Electrical Engineer Award (2023), National Central University Outstanding Teaching Award (2023), National Central University Excellent Hostel Instructor (2023), and the Best Paper Award at the Workshop on Synthesis And System Integration of Mixed Information Technologies (SASIMI) in 2024.

 
 
 

“Title TBA”

SinNyoung Kim (IMEC)

Abstract:  TBA

Bio:  TBA