tinyML Talks on October 7, 2020 “Embedded ML research at TUM: Moving NN Inference to the Extreme Edge” by Rafael Stahl

Olga · October 8, 2020, 8:07pm

We held our next tinyML Talks webcast. Rafael Stahl from the Technical University of Munich has presented Embedded ML research at TUM: Moving NN Inference to the Extreme Edge on October 7, 2020 at 9:30 AM Pacific Time.

tinyML Talks logo

Moving neural network inference near to the data collecting or sensing nodes in IoT networks reduces communication to the cloud, enables always-on devices and improves privacy. The main challenge is the limited amount of computation and memory resources available at these extreme edge IoT devices.
In order to address this challenge we firstly present DeeperThings, a method to enable memory- and compute-constrained devices to run Convolutional Neural Networks (CNNs) in a distributed fashion. The devices join their resources to run CNNs with sizes that otherwise couldn’t fit in each single device’s memory. The inference task is partitioned between all participating devices by utilizing feature partitioning, weight partitioning and communication-aware layer fusion.

Secondly, we present a TinyML code generator tool that transforms a TensorFlow Lite model into static embedded source code. This eliminates the overhead of dynamic interpretation used in the standard TensorFlow Lite for Micro approach and grants increased flexibility for further model optimization such as flexible weight packing support.

Rafael Stahl is a doctoral candidate at the Technical University of Munich at the Chair of Electronic Design Automation in his third year. He received his Bachelor and Master in “Electrical Engineering and Information Technology” from TU Munich. He gathered work experience at the Fraunhofer Institute for Embedded Systems and Communication Technology, prototyping demos of connected car systems. Software reverse engineering is a long-time passion of his, that gained him in-depth programming and debugging experience. Currently he is looking to improve neural network inference through target-aware methods with the wider goal of reducing the memory footprint of embedded software. He received the Best Paper Award at SiPS 2019.

==========================

Watch on YouTube:
Rafael Stahl

Download presentation slide:
Rafael Stahl

Feel free to ask your questions on this thread and keep the conversation going!

Topic		Replies	Views
tinyML Talks on September 5, 2023 “Enabling on-device learning on STM32 microcontrollers” by Beatrice Rossi and Michele Craighero tinyML Talks	0	428	July 19, 2023
Two tinyML Talks on November 24, 2020 by Chao Xu from Eta Compute / Brian Turnquist and Rodney Dockter from Boon Logic tinyML Talks	1	1091	November 27, 2020
tinyML Talks on December 12, 2023 “Tools and Methodologies for Edge-AI Mixed-Signal Inference Accelerators” by Maen Mallah and Roland Müller from Fraunhofer Institute for Integrated Circuits IIS tinyML Talks	0	175	December 6, 2023
tinyML Talks on November 28, 2023 “tinyML: Designing Efficient Neural Architectures and Scaling Strategies for Edge Computing” by Francesco Paissan from Fondazione Bruno Kessler tinyML Talks	0	211	November 10, 2023
tinyML Talks on August 13, 2020 “Tutorial on micro-kernel based hardware acceleration" by Manu Rastogi tinyML Talks	0	714	August 15, 2020

tinyML Talks on October 7, 2020 “Embedded ML research at TUM: Moving NN Inference to the Extreme Edge” by Rafael Stahl

Related Topics