Automated Protein Structure Prediction Using Templates from the ...

Recommend Documents

protein secondary structure prediction using deep

(Deep Convolutional Neural Fields) for protein SS prediction. ... deep hierarchical architecture, but also interdependency between adjacent ..... L-BFGS78 to search for the optimal model parameters, which has also been .... Thompson, J. D., Higgins,

Protein Structure Prediction Using Stochastic Process ... - IJIRSET

[1] Vinicius Tragante do O, Renato Tinos, âDiversity Control in Genetic Algorithms for Protein Structure Predictionâ, 727-737, 2009. [2] Gabriel, P., Lima, T., ...

Protein Structure Prediction using Genetic Algorithm - International ...

application of genetic algorithm in protein structure prediction. Finally .... A structure alignment is essentially a list of residue pairs from two proteins that should.

Protein Structure Prediction Using Evolutionary Algorithms ... - LCC

The black cube represents the current location. (Right) Relative moves in a cubic lattice. The black cubes represent the current location and the previous one.

Prediction of protein structure from limited ...

three-dimensional structure. Assuming 100 possible conformations per residue. â 10200 possible structures for 100 residues. Cyrus Levinthal's paradox.

Protein structure prediction from sequence variation

Nov 8, 2012 - ABCG2 breast cancer resistance protein. Bacterial G-3-P transporter. In. Out. In. Out b c. Residue number. R esidue number. Residue number.

Protein structure prediction

of research are artificial intelligence, machine learning and bioinformatics. Prashant Singh Rana is an Assistant Professor at Computer Science and Engineering.

Automated structure prediction of weakly

structure prediction that consists of template identification by threading, followed by .... On-and-Off Lattice C-Alpha Side Chain Based (CAS) Model. A protein.

Automated antibody structure prediction using Accelrys tools: Results ...

May 16, 2014 - using Accelrys tools: Results and best practices. Marc Fasnacht,* Ken Butenhof, Anne Goupil-Lamy, Francisco Hernandez-Guzman,. Hongwei ...

Protein structure prediction - DiVA portal

Muggleton et al., 1992; Presnell et al., 1992). The breakthrough .... (Gregory et al., 1993; Nakata, 1995; Andreini et al., 2004; Sodhi et al., 2004;. Lin et al., 2005; ...

Protein Structure Prediction Supercomputer - CiteSeerX

first project of this type to utilize distributed computing for structure prediction. .... fined by the user) by using the list of targets in MFoldJob that are either new or ...

(PS) : protein structure prediction server

(the template) that is similar to the query (target sequence);. (ii) alignment of the target ... A number of servers have been developed for automated comparative ... Tel: +886 35712121-56942; Fax: +886 35729288; Email:[email protected].

prediction of protein secondary structure.

Hidden Markov Models (HMMs) have not been used much for this problem, as the complexity of the task makes ... of protein sequence and structure, with its own HMM grammar. ...... ture prediction for a single-sequence using hidden semi-.

Introduction to Protein Structure Prediction

Introduction to. Protein Structure Prediction. BMI/CS 776 www.biostat.wisc.edu/ bmi776/. Colin Dewey [email protected]. Spring 2013 ...

Evaluating protein structure-prediction schemes

IBM J. RES. & DEV. VOL. 45 NO. 3/4 MAY/JULY 2001. M. P. EASTWOOD ET AL. 475 .... over the three wells which are approximately square wells between rmin(k) and rmax(k). .... histogram technique [42, 43], which has found widespread application in the .

Protein Structure Prediction With Evolutionary

[email protected]. Abstract. Evolutionary algorithms have been success- fully applied to a variety of molecular structure prediction problems. In this paper ...

The PSIPRED protein structure prediction server

server allows users to submit a protein sequence, perform ... its name to the prediction server itself. .... runs very effectively on a cheap dual-processor Linux.

The PSIPRED protein structure prediction server

Abstract. Summary: The PSIPRED protein structure prediction server allows users to submit a protein sequence, perform a prediction of their choice and receive ...

Prediction of Protein secondary structure using Logical ... - CiteSeerX

Robson (GOR) [11] and Lim. Although ..... Zimmerman J.M., Eliezer N., Simha R. ..... J. M. Levin, B. Robson and J. Garnier, An algorithm for secondary structure ...

Protein Structure Prediction Using CABS - Laboratory of Theory of ...

best among all predictions submitted to CASP9 as the first models. ... number of publicly available web servers, which provide methods for protein structure.

Protein Secondary Structure Prediction using Feed-Forward ... - Uap-bd.

Thus they achieved 69.7% of the three states prediction accuracy. 3.2 Multiple Sequence Alignment. Rost et al. mentioned that with appropriate cutoffs applied.

Gene structure prediction using information on homologous protein ...

Guigo et al., 1992; Hutchinson and Hayden, 1992; Mural ...... of human metallothionein 1F with homologous sequences of metallothionein listed in Table II.

Exploring protein fold space by secondary structure prediction using ...

derived from the new scoring scheme for four different model proteomes was ... improvement for protein secondary structure prediction was .... Intel Pentium 4.

Protein Secondary Structure Prediction Using RT-RICO - Bentham Open

Jun 22, 2010 - and Ronald L. Frank. 2. 1Department of Computer ...... 2577â2637, 1983. [22]. P. Baldi, S. Brunak, Y. Chauvin, C. A. F. Andersen, and H. Niel-.

Automated Protein Structure Prediction Using Templates from the ...

Download PDF

7 downloads 94010 Views 126KB Size Report

Comment

Automated Protein Structure Prediction Using Templates from the CATH. Protein Family Database ... Correspondence email: [email protected].

Automated Protein Structure Prediction Using Templates from the CATH Protein Family Database Adrian Shepherd1,*, Christine Orengo1, Nigel Martin2, Roger Johnson2 1

Department of Biochemistry & Molecular Biology, University College London, Gower Street, London WC1E 6BT 2 Department of Computer Science, Birkbeck College, Malet Street, London WC1E 7HX * Correspondence email: [email protected] Recent developments in the CATH database include the generation of multiple structural alignments for each CATH homologous family with two or more non-identical structures (~400 families), using the program CORA (Orengo, 1999). This enables equivalent residues to be identified for each relative within the family so that structural and functional characteristics can be compared and consensus properties identified. To improve functional annotation within each family, a Dictionary of Homologous Superfamilies has also been set up. This includes any relevant functional information which can be electronically extracted from public databases (e.g. EC Classification numbers and functional keywords from the SWISS-PROT database). The DHS also contains information about conserved protein-ligand interactions, some of which correspond to consensus sequence motifs identified by PROSITE patterns. The CATH schema design reflects the need to model this multiple alignment data, equivalent residue positions and functional relationships for each protein family. Protocols have also been established for identifying sequence relatives for the CATH homologous superfamilies in the sequence databases. With more than 27 complete genomes, there are now more than 500,000 sequences in the translated Genbank (Genpept) database. Using a 1D-profile based approach PSI-BLAST (Altschul et al. 1997), we have developed a reliable method for identifying sequence relatives to all the non-identical domain structures in CATH and for integrating these sequences into the database. This has resulted in nearly 200,000 domain sequences being added to the database. This additional sequence data is essential for expanding the population of the families and thereby allowing us to develop highly sensitive search algorithms for recognising more distant relationships in the sequence databases and to partial sequences in the EST data. A number of techniques are currently being explored for doing this (e.g. Hidden Markov Models). A conceptual model has been developed in UML for the data and metadata in the CATH database and related data sources. Evaluation of both object (O2) and object-relational (Oracle8) DBMSs has taken place through the implementation of prototype databases, and as a result Oracle8i has been adopted for on-going schema and database development. The schema design supports future reclassification within CATH families, with the preservation of previous versions. Work is now under way on the population of the database from existing data sources and the development of database validation routines to maintain the integrity of related data. Once this has been completed, the next step is the investigation of database searching in the presence of uncertain information, such as the position of domain boundaries, which the schema is designed to capture. References: Orengo CA (1999). CORA - Topological fingerprints for protein structural families. Protein Science, 8, 699-715 Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research, 25, 3389-3402