Unifying and Merging Well-trained Deep Neural Networks for Inference Stage

Yi-Min Chou; Yi-Ming Chan; Jia-Hong Lee; Chih-Yi Chiu; Chu-Song Chen

IJCAI 2018

Abstract

We propose a novel method for merging convolutional neural networks at the inference stage. Given two well-trained networks, which may have different architectures and handle different tasks, our method aligns the layers of the original networks and merges them into a unified model by sharing representative codes of their weights. The shared weights are then re-trained to fine-tune the performance of the merged model. The result is a compact model that can run the original tasks simultaneously on resource-limited devices. Because the method preserves the general architectures and leverages the co-used weights of the well-trained networks, it can substantially reduce training overhead and shorten system development time. Experimental results demonstrate satisfactory performance and validate the effectiveness of the method.
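
The idea of sharing representative codes between two aligned layers can be illustrated with a minimal sketch. It assumes a plain scalar k-means over the pooled weights as a stand-in for the encoding step. The abstract does not specify the actual quantization scheme, so the function `shared_codebook`, its parameters, and the layer shapes below are hypothetical, and the re-training step is not shown.

```python
import numpy as np

def shared_codebook(w_a, w_b, num_codes=16, iters=20, seed=0):
    """Jointly quantize two weight arrays with one shared scalar codebook.

    Assumption: scalar k-means is used purely for illustration; it is not
    claimed to be the encoding used in the paper.
    """
    rng = np.random.default_rng(seed)
    # Pool the weights of both layers so that the codes are co-used.
    values = np.concatenate([w_a.ravel(), w_b.ravel()])
    # Initialize the codes from randomly chosen weight values.
    codes = rng.choice(values, size=num_codes, replace=False)
    for _ in range(iters):
        # Assign every weight to its nearest code.
        assign = np.abs(values[:, None] - codes[None, :]).argmin(axis=1)
        # Move each code to the mean of the weights assigned to it.
        for k in range(num_codes):
            members = values[assign == k]
            if members.size:
                codes[k] = members.mean()
    # Final assignment against the updated codes.
    assign = np.abs(values[:, None] - codes[None, :]).argmin(axis=1)
    idx_a = assign[: w_a.size].reshape(w_a.shape)
    idx_b = assign[w_a.size:].reshape(w_b.shape)
    return codes, idx_a, idx_b

rng = np.random.default_rng(1)
layer_a = rng.normal(size=(8, 3, 3, 3))  # hypothetical conv layer of network A
layer_b = rng.normal(size=(4, 3, 5, 5))  # hypothetical conv layer of network B

codes, idx_a, idx_b = shared_codebook(layer_a, layer_b)

# Each layer is rebuilt from the same small codebook plus its own index map.
merged_a = codes[idx_a]
merged_b = codes[idx_b]
```

In this sketch both layers keep their original shapes, in line with the claim that the general architectures are preserved, while the only stored floating-point values are the 16 shared codes. Fine-tuning would then update `codes` rather than the individual weights.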

Topics

Machine Learning > Application Areas > Model Merging

Deep Learning > Architectures > Neural Networks

Keywords

model merging; neural network merging; weight sharing; convolutional neural network; inference optimization

Related papers

Semi-Supervised Multi-Modal Learning with Incomplete Modalities 2018

High-dimensional Similarity Learning via Dual-sparse Random Projection 2018

FISH-MML: Fisher-HSIC Multi-View Metric Learning 2018

Generative Warfare Nets: Ensemble via Adversaries and Collaborators 2018

Semi-Supervised Optimal Margin Distribution Machines 2018