Sunday, September 2, 2007

SheetsPair: a Database of Amino Acids Pairs in Protein Sheet Structures

Sheet is a basic secondary structural element of proteins. To provide a data resource for further analysis on "Sheet", a database which contains a large number of sheet structures was constructed.


The protein dataset utilized to populate the database was obtained from PDB, excluding those that have no sheet structure and those with modified residues, that is, the database is rigorous, precise and faithworthy, with no uncertain residues or modified residues or uncertain structures. The database now contains a total of 756,897 amino acids pairs in sheet structures of 10,704 proteins. There are more details in the publications.

The website interface is shown below:





Publication:
Ning Zhang, Jie Wu, Tao Zhang. SheetsPair: a database of amino acids pairs in protein sheetstructures. The 20th International CODATA Conference (CODATA2006 ABSTRACT,China,Beijing): 295
Ning Zhang, Jishou Ruan, Jie Wu, Tao Zhang. SheetsPair: a database of amino acids pairs inprotein sheet structures, Data science journal, 2007

1 comment:

Anonymous said...

I must digg your article so other folks can see it, really helpful, I had a tough time finding the results searching on the web, thanks.

- Thomas