37C3 -  What is this? A machine learning model for ants?  @mediacccde
37C3 -  What is this? A machine learning model for ants?  @mediacccde
media.ccc.de | 37C3 - What is this? A machine learning model for ants? @mediacccde | Uploaded February 2024 | Updated October 2024, 1 week ago.
media.ccc.de/v/37c3-11844-what_is_this_a_machine_learning_model_for_ants

How to shrink deep learning models, and why you would want to.

This talk will give a brief introduction of deep learning models and the energy they consume for training and inference. We then discuss what methods currently exist for handling their complexity, and how neural network parameter counts could grow by orders of magnitude, despite the end of Moore's law.

Declared dead numerous times, the hype around deep learning is bigger than ever. With Large Language Models and Diffusion Models becoming a commodity, we ask the question of how bad their energy consumption *really* is, what we can do about it, and how it is possible to run cutting-edge language models on off-the-shelf GPUs.

We will look at the various ways that people have come up with to rein in the hunger for resources of deep learning models, and why we still struggle to keep up with the demands of modern neural network model architectures. From low-bitwidth integer representation, through pruning of redundant connections and using a large network to teach a small one, all the way to quickly adapting existing models using low-rank adaptation.

This talk aims to give the audience an estimation of the amount of energy modern machine learning models consume to allow for more informed decisions around their usage and regulations. In the second part, we discuss the most common techniques used for running modern architectures on commodity hardware, outside of data centers. Hopefully, deeper insights into these methods will help improve experimentation with and access to deep learning models.

etrommer

events.ccc.de/congress/2023/hub/event/what_is_this_a_machine_learning_model_for_ants

#37c3 #SustainabilityClimateJustice
37C3 -  What is this? A machine learning model for ants?MRMCD2024 No More Loopy Code: Data Science Goes Functional37C3 -  Rettet uns die KI?MRMCD2024 Energie aus der Tiefe: Das Meer der Energydrinks und ihre Geheimnisse.EH21 - La genèse de hadopiMRMCD2024 Kubernetes ohne InternetEH21 -  Eine kleine Geschichte über SicherheitslückenMRMCD2024 Dein ISMS, das unbekannte Wesen37C3 -  DevOps but for artworks in museums37C3 -  Was Digitale Gewalt mit Restaurantkritik zu tun hatEH21 -  Wir bauen unsere eigene Cloud mit OpenStack37C3 -  ANIMAL()CITY

37C3 - What is this? A machine learning model for ants? @mediacccde

SHARE TO X SHARE TO REDDIT SHARE TO FACEBOOK WALLPAPER