SEQOPTICS: A Protein Sequence Clustering Method

  • Authors:
  • Yonghui Chen;Kevin D. Reilly;Alan P. Sprague;Zhijie Guan

  • Affiliations:
  • University of Alabama at Birmingham, USA;University of Alabama at Birmingham, USA;University of Alabama at Birmingham, USA;University of California at San Diego, USA

  • Venue:
  • IMSCCS '06 Proceedings of the First International Multi-Symposiums on Computer and Computational Sciences - Volume 1 (IMSCCS'06) - Volume 01
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Protein sequence clustering has been widely used as a part of the analysis of protein structure and function. In most cases single link or graph-based clustering algorithms have been applied. In this paper, we demonstrate an approach of clustering proteins, SEQOPTICS (sequence clustering with OPTICS), which is based on OPTICS (Ordering Points To Identify the Clustering Structure), an attractive approach due to its emphasis on visualization of results and support for interactive work, e.g., in choosing parameters. OPTICS has not been used, as far as we know, for protein sequence clustering. We have implemented a system with OPTICS at its core to perform protein sequence clustering. In this paper, we test SEQOPTICS with four data sets from different data sources. Visualization of the sequence clustering structure is demonstrated. Our system was evaluated by comparison with other existing methods. Analysis of the results demonstrates that our system perform better by the Jaccard coefficient evaluation criterion.