COSNet: a cost sensitive neural network for semi-supervised learning in graphs

Authors:
Alberto Bertoni;Marco Frasca;Giorgio Valentini
Affiliations:
Dipartimento di Scienze dell' Informazione, Università degli Studi di Milano, Milano, Italia;Dipartimento di Scienze dell' Informazione, Università degli Studi di Milano, Milano, Italia;Dipartimento di Scienze dell' Informazione, Università degli Studi di Milano, Milano, Italia
Venue:
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Year:
2011

Citing 9
Cited 3

Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps

Bioinformatics
Fast protein classification with multiple networks

Bioinformatics
The rendezvous algorithm: multiclass semi-supervised learning with Markov random walks

Proceedings of the 24th international conference on Machine learning
A note on Platt's probabilistic outputs for support vector machines

Machine Learning
An efficient strategy for extensive integration of diverse biological data for protein function prediction

Bioinformatics
Nonlinear optimization using generalized hopfield networks

Neural Computation
Statistical properties of community structure in large social and information networks

Proceedings of the 17th international conference on World Wide Web
Letter: Applications of a neural network to watermarking capacity of digital image

Neurocomputing
Fast integration of heterogeneous data sources for predicting gene function with limited annotation

Bioinformatics

A Fast Ranking Algorithm for Predicting Gene Functions in Biomolecular Networks

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A neural network algorithm for semi-supervised node label learning from unbalanced data

Neural Networks
Pattern classification and clustering: A review of partially supervised learning approaches

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

The semi-supervised problem of learning node labels in graphs consists, given a partial graph labeling, in inferring the unknown labels of the unlabeled vertices. Several machine learning algorithms have been proposed for solving this problem, including Hopfield networks and label propagation methods; however, some issues have been only partially considered, e.g. the preservation of the prior knowledge and the unbalance between positive and negative labels. To address these items, we propose a Hopfield-based cost sensitive neural network algorithm (COSNet). The method factorizes the solution of the problem in two parts: 1) the subnetwork composed by the labelled vertices is considered, and the network parameters are estimated through a supervised algorithm; 2) the estimated parameters are extended to the subnetwork composed of the unlabeled vertices, and the attractor reached by the dynamics of this subnetwork allows to predict the labeling of the unlabeled vertices. The proposed method embeds in the neural algorithm the "a priori" knowledge coded in the labelled part of the graph, and separates node labels and neuron states, allowing to differentially weight positive and negative node labels. Moreover, COSNet introduces an efficient costsensitive strategy which allows to learn the near-optimal parameters of the network in order to take into account the unbalance between positive and negative node labels. Finally, the dynamics of the network is restricted to its unlabeled part, preserving the minimization of the overall objective function and significantly reducing the time complexity of the learning algorithm. COSNet has been applied to the genome-wide prediction of gene function in a model organism. The results, compared with those obtained by other semi-supervised label propagation algorithms and supervised machine learning methods, show the effectiveness of the proposed approach.