A tool to find sequence pattern matches in the Protein Data Bank (PDB) database of protein structures.
Sequery is a tool to search the sequences of the protein structures in the Protein Data Bank (PDB) for a particular pattern of residues, which may include exact matches and acceptable substitutions based on a user-specified amino acid substitution matrix and/or a numerical threshold. Sequery was developed by Michael E. Pique, Michael A. Siani, Leslie A. Kuhn, Elizabeth D. Getzoff, and John A. Tainer, Department of Molecular Biology, The Scripps Research Institute and is distributed here with permission of the authors. Development of Sequery was supported in part by NSF grant BIR9631436.
A useful complement to Sequery is SSA (Superpositional Structure Assignment), which automates the assignment of secondary structure to tetrapeptides identified by a Sequery search. For more information and software availability, see SSA Information and Documentation.
For literature references related to Sequery, please see the section on Algorithmic Details.
Usage information for Sequery can be found in Using Sequery
Sequery will work on any UNIX system with a C-compiler and has been tested on Sun hardware running Solaris and SGI hardware running Irix. Additional scripts require awk, C-shell (csh) and Bourne-shells (sh) installed, as well as an accessible copy of the Protein Data Bank.
The latest version of Sequery, as well as the most recent version of this documentation, can be found at the home page for the Protein Structural Analysis and Design Laboratory, Department of Biochemistry, Michigan State University, and is distributed here with permission of the co-authors at The Scripps Research Institute, Michael Pique, Elizabeth Getzoff, and John Tainer. To install Sequery, perform the following steps:
zcat sequery.tar.Z | tar xvf -
Script Name | Variable to Set at the top of the script |
---|---|
addname | PDBHOME |
genpdbselectseq | PDBHOME SEQUERY_HOME |
minipdbextract | PDBHOME |
The PDBHOME variable should be set to the
directory of the Protein Data Bank files (e.g. /usr/pdb)
The SEQUERY_HOME variable should be set to
the directory in which Sequery was installed
(e.g. /usr/bin/sequery)
Located in the Sequery/share directory are several shell scripts, all of which assume the C-shell and Bourne-shell are installed in the standard locations, which are /bin/csh and /bin/sh. If you have difficulty running one or more of these scripts, you may edit the first line of each script to point to the correct shell location for your system. Sequery scripts which are dependent on shell locations include:
Script | Shell |
---|---|
addname | /bin/sh |
genpdbselectseq | /bin/sh |
genpdbseq | /bin/sh |
minipdbextract | /bin/csh |
Additionally, these scripts assume you have awk, nawk, or gawk (different implementations of a simple column- or field-oriented programming language) running on your system. (See The Free Software Foundation (GNU) web site for more information on obtaining and using gawk.)
Sequery was developed as a successor to Searchwild, which is described in
Collawn JF, Kuhn LA, Liu LF, Tainer JA, Trowbridge IS
Transplanted LDL and mannose-6-phosphate receptor
internalization signals promote high-efficiency endocytosis of
the transferrin receptor
EMBO J., 10(11) (Nov): 3247-3253 (1991)
Collawn JF, Stangel M, Kuhn LA, Esekogwu V, Jing SQ, Trowbridge
IS, Tainer JA
Transferrin receptor internalization sequence YXRF
implicates a tight turn as the structural recognition motif
for endocytosis
Cell, 63(5) (Nov 30): 1061-1072 (1990)
Other references related to the use of Sequery include the following:
Craig L, Sanschagrin PC, Rozek A, Lackie S, Kuhn LA, Scott JK
The Role of Structure in Antibody Cross-Reactivity Between
Peptides and Folded Proteins
J. Mol. Biol., 281(1) (Aug 7): 183-201 (1998)
Chang CP, Lazar CS, Walsh BJ, Komuro M, Collawn JF, Kuhn LA,
Tainer JA, Trowbridge IS, Farquhar MG, Rosenfeld MG
Ligand-induced internalization of the epidermal growth
factor receptor is mediated by multiple endocytic codes
analogous to the tyrosine motif found in constitutively
internalized receptors
J. Biol. Chem., 268(26) (Sep 15): 19312-19320
(1993)
Questions should be directed to Dr. Leslie Kuhn at:
kuhn@agua.bch.msu.eduor to Michael Pique at:
mp@scripps.edu