SAM: SEQUENCE ALIGNMENT AND MODELING SOFTWARE SYSTEM

Authors:
Richard Hughey;Anders Krogh
Affiliations:
-;-
Venue:
SAM: SEQUENCE ALIGNMENT AND MODELING SOFTWARE SYSTEM
Year:
1995

Citing 0
Cited 7

A brief look at some machine learning problems in genomics

COLT '97 Proceedings of the tenth annual conference on Computational learning theory
A unified sequence-structure classification of protein sequences: combining sequence and structure in a map of the protein space

RECOMB '00 Proceedings of the fourth annual international conference on Computational molecular biology
Using a mixture of probabilistic decision trees for direct prediction of protein function

RECOMB '03 Proceedings of the seventh annual international conference on Research in computational molecular biology
Local prediction approach for protein classification using probabilistic suffix trees

APBC '04 Proceedings of the second conference on Asia-Pacific bioinformatics - Volume 29
Feel the information with VisPad: a large area vibrotactile device

Information Visualization
Improving protein secondary structure predictions by prediction fusion

Information Fusion
Experimental Evaluation of Protein Secondary Structure Predictors

ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Sequence Alignment and Modeling system (SAM) is a collection of flexible software tools for creating, refining, and using linear hidden Markov models for biological sequence analysis. The model states can be viewed as representing the sequence of columns in a multiple sequence alignment, with provisions for arbitrary position-dependent insertions and deletions in each sequence. The models are trained on a family of protein or nucleic acid sequences using an expectation-maximization algorithm and a variety of algorithmic heuristics. A trained model can then be used to both generate multiple alignments and search databases for new members of the family. SAM is written in the C programming language for Unix machines and MasPar parallel computers, and includes extensive documentation. The algorithms and methods used by SAM have been described in several pioneering papers from the University of California, Santa Cruz. These papers, as well as the SAM software suite, are available via anonymous ftp to ftp.cse.ucsc.edu in the pub/protein directory, or via the World-Wide Web to http://www.cse.ucsc.edu/research/compbio/.