Laplacian co-hashing of terms and documents

  • Authors:
  • Dell Zhang;Jun Wang;Deng Cai;Jinsong Lu

  • Affiliations:
  • School of Business, Economics and Informatics Birkbeck, University of London, London, UK;Department of Computer Science, University College London, London, UK;State Key Lab of CADSCG, College of Computer Science, Zhejiang University, China;School of Business, Economics and Informatics Birkbeck, University of London, London, UK

  • Venue:
  • ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A promising way to accelerate similarity search is semantic hashing which designs compact binary codes for a large number of documents so that semantically similar documents are mapped to similar codes within a short Hamming distance. In this paper, we introduce the novel problem of co-hashing where both documents and terms are hashed simultaneously according to their semantic similarities. Furthermore, we propose a novel algorithm Laplacian Co-Hashing (LCH) to solve this problem which directly optimises the Hamming distance.