Rethinking the Setting of Semi-supervised Learning on Graphs

Ziang Li; Ming Ding; Weikai Li; Zihan Wang; Ziyu Zeng; Yukuo Cen; Jie Tang

IJCAI 2022

Abstract

We argue that the present setting of semi-supervised learning on graphs may result in unfair comparisons, because it carries a risk of over-tuning the hyper-parameters of models. In this paper, we highlight the significant influence of hyper-parameter tuning, which leverages the label information in the validation set to improve performance. To explore the limit of over-tuning hyper-parameters, we propose ValidUtil, an approach that fully utilizes the label information in the validation set through an extra group of hyper-parameters. With ValidUtil, even GCN can easily reach an accuracy of 85.8% on Cora. To avoid over-tuning, we merge the training set and the validation set and construct an i.i.d. graph benchmark (IGB) consisting of 4 datasets. Each dataset contains 100 i.i.d. graphs sampled from a large graph to reduce the evaluation variance. Our experiments suggest that IGB is a more stable benchmark than previous datasets for semi-supervised learning on graphs. Our code and data are released at https://github.com/THUDM/IGB/.
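The abstract does not spell out how the extra group of hyper-parameters works, so the following is only a toy sketch of the general idea, not the authors' ValidUtil implementation. It assumes the extra hyper-parameters are one pseudo-label per validation node, tuned using nothing but validation accuracy, and it swaps in a 1-nearest-neighbour classifier on 1-D points for GCN on a graph. All names (`predict_1nn`, `tune_pseudo_labels`, the data) are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, int]  # (feature, label); a stand-in for a labelled node


def predict_1nn(train: List[Point], x: float) -> int:
    """Return the label of the training point nearest to x."""
    return min(train, key=lambda p: abs(p[0] - x))[1]


def accuracy(train: List[Point], data: List[Point]) -> float:
    """Fraction of points in data that 1-NN on train classifies correctly."""
    return sum(predict_1nn(train, x) == y for x, y in data) / len(data)


def tune_pseudo_labels(train: List[Point], valid: List[Point],
                       num_classes: int) -> List[int]:
    """Tune one pseudo-label per validation point (assumed mechanism).

    The validation labels are never added to the training set directly.
    They are only consulted through the validation accuracy, exactly as
    in ordinary hyper-parameter search.
    """
    # Start from the untuned model's own predictions.
    pseudo = [predict_1nn(train, x) for x, _ in valid]
    for i in range(len(valid)):
        def val_acc(c: int) -> float:
            trial = pseudo[:i] + [c] + pseudo[i + 1:]
            augmented = train + [(x, p) for (x, _), p in zip(valid, trial)]
            return accuracy(augmented, valid)
        # Keep the candidate label with the best validation accuracy.
        pseudo[i] = max(range(num_classes), key=val_acc)
    return pseudo


# Toy data: the true class boundary is at 6.5, but the two training
# points alone put the 1-NN boundary at 5.0.
train = [(0.0, 0), (10.0, 1)]
valid = [(1.0, 0), (3.0, 0), (5.5, 0), (6.0, 0), (7.0, 1), (9.0, 1)]
test = [(2.0, 0), (5.8, 0), (6.2, 0), (7.5, 1), (8.5, 1)]

base_test_acc = accuracy(train, test)

pseudo = tune_pseudo_labels(train, valid, num_classes=2)
augmented = train + [(x, p) for (x, _), p in zip(valid, pseudo)]
tuned_test_acc = accuracy(augmented, test)

print("pseudo-labels:", pseudo)
print("test accuracy before/after:", base_test_acc, tuned_test_acc)
```

On this toy data the tuned pseudo-labels coincide with the true validation labels, so the model ends up effectively trained on the validation set even though it was only ever "tuned" on it. This is the kind of leakage that merging the training and validation sets makes explicit.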

Topics

Artificial Intelligence > Core AI > Foundation Models
Machine Learning > Learning Types > Semi-Supervised Learning
Machine Learning > Optimization & Theory > Theory
Deep Learning > Architectures > Graph Neural Networks
Machine Learning > Learning Paradigms > Semi-Supervised Learning

Keywords

semi-supervised learning; graph classification; hyperparameter tuning; node classification; graph benchmark; graph neural network
