Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights

Konstantin Schürholt; Boris Knyazev; Xavier Giró-i-Nieto; Damian Borth

2022 NIPS NeurIPS 2022

Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights

Abstract

Learning representations of neural network weights given a model zoo is an emerg- ing and challenging area with many potential applications from model inspection, to neural architecture search or knowledge distillation. Recently, an autoencoder trained on a model zoo was able to learn a hyper-representation, which captures intrinsic and extrinsic properties of the models in the zoo. In this work, we ex- tend hyper-representations for generative use to sample new model weights. We propose layer-wise loss normalization which we demonstrate is key to generate high-performing models and several sampling methods based on the topology of hyper-representations. The models generated using our methods are diverse, per- formant and capable to outperform strong baselines as evaluated on several down- stream tasks: initialization, ensemble sampling and transfer learning. Our results indicate the potential of knowledge aggregation from model zoos to new models via hyper-representations thereby paving the avenue for novel research directions.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Konstantin Schürholt , Boris Knyazev , Xavier Giró-i-Nieto , Damian Borth

Topics

Machine Learning > Core Methods > Embedding Learning Machine Learning > Application Areas > Knowledge Distillation Deep Learning > Architectures > Autoencoders Deep Learning > Models > Generative Models Deep Learning > Learning Types > Representation Learning

Keywords

generative model model zoo knowledge aggregation neural network weight

Download PDF

Related papers

Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching 2022

A Theoretical View on Sparsely Activated Networks 2022

Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks 2022

Matryoshka Representation Learning 2022

Off-Policy Evaluation with Deficient Support Using Side Information 2022