Tag: protein sequence

Found 209 sources

Source	Match	ReputationScore*
UniProt Knowledgebase Universal Protein resource. A database of protein sequence and functional information, many entries being derived from genome sequencing projects. It contains a large amount of information about the biological function of proteins derived from the re ...		100%
Pfam The Pfam database is a large collection of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Pfam also generates higher-level groupings of related entries, known as clans. A clan is a collection of Pf ...		76%
Integrated resource of protein families, domains and functional sites InterPro is a resource that provides functional analysis of protein sequences by classifying them into families and predicting the presence of domains and important sites. To classify proteins in this way, InterPro uses predictive models, known as si ...		67%
Conserved Domain Database The Conserved Domain Database (CDD) brings together several collections of multiple sequence alignments representing conserved domains, including NCBI-curated domains, which use 3D-structure information to explicitly define domain boundaries and p ...		64%
Simple Modular Architecture Research Tool SMART (Simple Modular Architecture Research Tool) is a web resource providing simple identification and extensive annotation of protein domains and the exploration of protein domain architectures. It allows the identification and annotation of geneti ...		61%
Protein ANalysis THrough Evolutionary Relationships: Classification of Genes and Proteins The PANTHER (Protein ANalysis THrough Evolutionary Relationships) Classification System is a unique resource that classifies genes by their functions, using published scientific experimental evidence and evolutionary relationships to predict function ...		58%
PROSITE PROSITE is a database of protein families and domains. PROSITE consists of documentation entries describing protein domains, families and functional sites as well as associated patterns and profiles to identify them.		51%
Evolutionary Genealogy of Genes: Non-supervised Orthologous Groups eggNOG (evolutionary genealogy of genes: Non-supervised Orthologous Groups) is a database of orthologous groups of genes. The orthologous groups are annotated with functional description lines (derived by identifying a common denominator for the gene ...		50%
Transporter Classification Database This freely accessible database details a comprehensive IUBMB approved classification system for membrane transport proteins known as the Transporter Classification (TC) system. The TC system is analogous to the Enzyme Commission (EC) system for clas ...		50%
PhosphoSite Plus PhosphoSite Plus provides extensive information on mammalian post-translational modifications (PTMs). The resource supersedes PhosphoSite, a mammalian protein database that provides information about in vivo phosphorylation sites.		49%
MetaCyc MetaCyc is the largest curated collection of metabolic pathways currently available. It provides a comprehensive resource for metabolic pathways and enzymes from all domains of life. The pathways in MetaCyc are experimentally determined, small-molecu ...		47%
ConoServer ConoServer is a database specializing in sequences and structures of peptides expressed by marine cone snails. The database gives access to protein sequences, nucleic acid sequences and structural information on conopeptides. ConoServer's data are fi ...		46%
LIPID MAPS The LIPID MAPS Lipid Classification System is comprised of eight lipid categories, each with its own subclassification hierarchy. All lipids in the LIPID MAPS Structure Database (LMSD) have been classified using this system and have been assigned LIP ...		44%
Information system for G protein-coupled receptors The GPCRDB is a molecular-class information system that collects, combines, validates and stores large amounts of heterogeneous data on G protein-coupled receptors (GPCRs). The GPCRDB contains data on sequences, ligand binding constants and mutations. ...		44%
InParanoid The InParanoid database provides a user interface to orthologs inferred by the InParanoid algorithm. InParanoid release 8 is based on the 66 reference proteomes that the 'Quest for Orthologs' community has agreed on using, plus 207 additional proteom ...		44%
Stanford HIV Drug Resistance Database The Stanford HIV Drug Resistance Database (HIVDB) is an essential resource for public health officials monitoring ADR and TDR, for scientists developing new ARV drugs, and for HIV care providers managing patients with HIVDR.		43%
UniRef The UniProt Reference Clusters are three separate datasets that compress sequence space at different resolutions, achieved by merging sequences and sub-sequences that are 100% (UniRef100), >=90% (UniRef90), or >=50% (UniRef50) identical, regardless o ...		43%
ProDom ProDom is a comprehensive set of protein domain families automatically generated from the UniProt Knowledge Database.		43%
Restriction enzymes and methylases database A collection of information about restriction enzymes and related proteins. It contains published and unpublished references, recognition and cleavage sites, isoschizomers, commercial availability, methylation sensitivity, crystal, genome, and sequen ...		43%
The Protein Database The Entrez Protein search and retrieval system contains protein entries that have been compiled from a variety of sources, including SwissProt, PIR, PRF, PDB, and translations from annotated coding regions in GenBank and RefSeq.		42%
BindingDB database of measured binding affinities BindingDB enables research by making a growing collection of high-quality, quantitative, protein-ligand binding data findable and usable. Funded by NIGMS/NIH.		41%
Orthologous MAtrix The OMA (“Orthologous MAtrix”) project is a method and database for the inference of orthologs among complete genomes. The distinctive features of OMA are its broad scope and size, high quality of inferences, feature-rich web interface, availability ...		41%
Extracellular Matrix Interaction Database MatrixDB stores experimental data established by full-length proteins, matricryptins, glycosaminoglycans, lipids and cations. MatrixDB reports interactions with individual polypeptide chains or with multimers (e.g. collagens, laminins, thrombospondin ...		40%
MobiDB MobiDB is a database of intrinsically disordered regions (IDRs) and related features from various sources and prediction tools. Different levels of reliability and different features are reported as different and independent annotations. The database ...		40%
TIGRFAMs TIGRFAMs is a collection of manually curated protein families focusing primarily on prokaryotic sequences. It consists of hidden Markov models (HMMs), multiple sequence alignments, Gene Ontology (GO) terminology, Enzyme Commission (EC) numbers, gene s ...		40%
Database of Orthologous Groups OrthoDB presents a catalog of eukaryotic orthologous protein-coding genes. Orthology refers to the last common ancestor of the species under consideration, and thus OrthoDB explicitly delineates orthologs at each radiation along the species phylogeny ...		39%
PRINTS PRINTS is a collection of groups of conserved protein motifs, called fingerprints, used to define a protein family. A fingerprint is a group of conserved motifs used to characterize a protein family. Usually, the motifs do not overlap, though they ma ...		39%
Termini-Oriented Protein Function INferred Database The Termini-Oriented Protein Function INferred Database (TopFIND) is an integrated knowledgebase focused on protein termini, their formation by proteases and functional implications. It contains information about the processing and the processing sta ...		38%
OrtholugeDB OrtholugeDB contains Ortholuge-based orthology predictions for completely sequenced bacterial and archaeal genomes. It is also a resource for reciprocal best BLAST-based ortholog predictions, in-paralog predictions (recently duplicated genes) and ort ...		38%
Kinase-Ligand Interaction Fingerprints and Structures database Kinase-Ligand Interaction Fingerprints and Structures database (KLIFS) is a database that revolves around the protein structure of catalytic kinase domains and the way kinase inhibitors can interact with them. Based on the underlying systematic and c ...		37%
MoonProt MoonProt Database is a manually curated, searchable, internet-based resource with information about the over 200 proteins that have been experimentally verified to be moonlighting proteins. Moonlighting proteins comprise a class of multifunctional pr ...		36%
Bacterial protein tYrosine Kinase database The Bacterial protein tYrosine Kinase database (BYKdb) contains computer-annotated BY-kinase sequences. The database web interface allows static and dynamic queries and provides integrated analysis tools including sequence annotation.		36%
ARAMEMNON ARAMEMNON is a curated database for Arabidopsis thaliana transmembrane (TM) proteins and transporters. The database compiles topology and signal sequence predictions and displays the results in a directly comparable graphical output format for presen ...		35%
The human DEPhOsphorylation Database DEPOD - the human DEPhOsphorylation Database is a manually curated database collecting human active and inactive phosphatases, their experimentally verified protein and non-protein substrates, and dephosphorylation site information, and pathways in w ...		35%
BAliBASE BAliBASE; a benchmark alignment database, including enhancements for repeats, transmembrane sequences and circular permutations.		35%
ProtoNet This resource is a hierarchical clustering of UniProt protein sequences into hierarchical trees. This resource allows for the study of sub-family and super-family of a protein, using UniRef50 clusters.		35%
Bactibase: database dedicated to bacteriocins BACTIBASE contains calculated or predicted physicochemical properties of bacteriocins produced by both Gram-positive and Gram-negative bacteria. The information in this database is very easy to extract and allows rapid prediction of relationships str ...		35%
RBPDB RNA-binding proteins and their specificities		35%
Major Intrinsic Proteins Modification Database This is a database of comparative protein structure models of the MIP (Major Intrinsic Protein) family of proteins. The MIPs have been identified from the completed genome sequence of organisms available at NCBI.		35%
short Open Reading Frame database sORFs.org is a database for sORFs identified using ribosome profiling. Starting from ribosome profiling, sORFs.org identifies sORFs, incorporates state-of-the-art tools and metrics and stores results in a public database. Two query interfaces are pro ...		34%
PHOSIDA Phosphorylation sites in various species identified by mass spectrometry		34%
Human Histone Database HIstome (Human histone database) is a freely available, specialist, electronic database dedicated to displaying information about human histone variants, sites of their post-translational modifications and about various histone modifying enzymes.		34%
PIR SuperFamily The PIR SuperFamily concept is being used as a guiding principle to provide comprehensive and non-overlapping clustering of UniProtKB sequences into a hierarchical order to reflect their evolutionary relationships.		34%
Mammalian Protein Localization Database LOCATE is a curated database that houses data describing the membrane organization and subcellular localization of proteins from the RIKEN FANTOM4 mouse and human protein sequence set.		34%
Non-Ribosomal Peptides Database Norine is a platform that includes a database of nonribosomal peptides together with tools for their analysis. Norine currently contains more than 1000 peptides.		33%
VDJdb: a curated database of T-cell receptors with known antigen specificity The primary goal of VDJdb is to facilitate access to existing information on T-cell receptor antigen specificities, i.e. the ability to recognize certain epitopes in certain MHC contexts. Our mission is to both aggregate the scarce TCR specificity in ...		33%
MiCroKiTS This resource is a collection of all proteins identified to be localized on kinetochore, centrosome, midbody, telomere and spindle from two fungi (S. cerevisiae and S. pombe) and five animals, including C. elegans, D. melanogaster, X. laevis, M. musc ...		32%
Telomerase Database The Telomerase Database is a Web-based tool for the study of structure, function, and evolution of the telomerase ribonucleoprotein. The objective of this database is to serve the research community by providing a comprehensive compilation of informa ...		32%
PeroxiBase Peroxibase provides access to peroxidase sequences from all kingdoms of life, and provides a series of bioinformatics tools and facilities suitable for analysing these sequences.		32%
PSORTdb Protein subcellular localization (SCL) is important for understanding protein function, genome annotation, and aids identification of potential cell surface diagnostic markers, drug targets, or vaccine components. PSORTdb comprises ePSORTdb, a manual ...		32%
NucleaRDB Families of nuclear hormone receptors		32%
SuperCYP Cytochrome P450 alleles and drug interactions		32%
PeroxisomeDB The aim of PEROXISOME database (PeroxisomeDB) is to gather, organise and integrate curated information on peroxisomal genes, their encoded proteins, their molecular function and metabolic pathway they belong to, and their related disorders.		32%
CLIPZ Experimentally-determined binding sites of RNA-binding proteins		31%
cpnDB Chaperonins are a diverse family of molecular chaperones present in the plastids, mitochondria, and cytoplasm of eukaryotes, and in bacteria and archaea. The family is divided into group I (CPN60, also known as Hsp60 or GroEL, found in bacteria, some ...		31%
KnotProt: A database of proteins with knots and slipknots KnotProt collects information about proteins with knots or slipknots. The knotting complexity of proteins is presented in the form of a matrix diagram that shows users the knot type of the entire polypeptide chain and of each of its subchains. The da ...		31%
ADPriboDB ADPriboDB is a database of ADP-ribosylated proteins and their literature-identified ADP-ribosylated residues. The database includes a variety of information for each entry, including any drug treatments performed to obtain the identification of the m ...		30%
CentrosomeDB CentrosomeDB is a collection of human and Drosophila centrosomal genes that were reported in the literature and other sources. The database offers the possibility to study the evolution, function, and structure of the centrosome. They have compiled i ...		30%
Knottin database The KNOTTIN database provides standardized data on the knottin structural family (also referred to as the "Inhibitor Cystine Knot (ICK) motif/family/fold").		30%
Polbase Polbase is an open and searchable database providing information from published and unpublished sources on the biochemical, genetic, and structural information of DNA polymerases.		30%
PRIDB Protein-RNA Interface Database		30%
PREX PeroxiRedoxin classification indEX		30%
MimoDB Mimotope database, active site-mimicking peptides selected from phage-display libraries		30%
Olfactory Receptor Database ORDB began as a database of vertebrate OR genes and proteins and continues to support sequencing and analysis of these receptors by providing a comprehensive archive with search tools for this expanding family.		30%
FireDB fireDB is a database of Protein Data Bank structures, ligands and annotated functional site residues. The database can be accessed by PDB codes or UniProt accession numbers as well as keywords.		30%
Death Domain Database Death Domain Database is a manually curated database of protein-protein interactions for Death Domain Superfamily.		30%
HHMD Human Histone Modification Database		29%
COMBREX Computational Bridge to Experiments		29%
PLANT-PIs Plant protease inhibitors (PIs) can be counted among the defensive proteins that plants display to minimize the adverse effects deriving from the attack of phytophagous insects. They are usually present in seeds and storage tissues, but are also expr ...		29%
REFOLDdb REFOLDdb is a resource for the optimization of protein refolding, referring to published methods employed in the refolding of recombinant proteins. It stores a collection of published experimental approaches and records for refolding proteins. It con ...		29%
OMPdb Beta-barrel outer membrane proteins from Gram-negative bacteria		29%
gpDB GpDB is a publicly accessible, relational database of G-proteins and their interactions with GPCRs and effector molecules. The sequences are classified according to a hierarchy of different classes, families and sub-families, based on extensive liter ...		29%
RNA Binding Protein Variant Database RBP-Var is a database of functional variants involved in regulation mediated by RNA-binding proteins. Human genome variants can change the RNA structure and affect RNA-protein interactions.		29%
ThYme Thioester-active enzymes		28%
ProRepeat: An Integrated Repository for Studying Amino Acid Tandem Repeats in Proteins ProRepeat is an integrated curated repository and analysis platform for in-depth research on the biological characteristics of amino acid tandem repeats. ProRepeat collects repeats from all proteins included in the UniProt knowledgebase, together wit ...		28%
LocDB Experimental annotations of localization for Homo sapiens and Arabidopsis thaliana		28%
CharProtDB Experimentally Characterized Protein annotations		28%
FunShift Functional divergence between the subfamilies of a protein domain family		28%
PolyQ Polyglutamine Repeats in Proteins		28%
TRIP Protein-protein interactions for mammalian TRP channels		27%
TMPad The TransMembrane Protein Helix-Packing Database (TMPad) is an integrated repository of experimentally determined structural folds derived from helix-helix interactions in alpha-helical membrane proteins. TMPad includes geometric descriptors of helix ...		27%
SIMAP Protein sequences are of utmost importance for studying the function and evolution of genes and genomes. Therefore a rich collection of methods in computational biology relies on the analysis and comparison of protein sequences. Many of these intensi ...		27%
MeMotif Linear motifs in alpha-helical transmembrane proteins		27%
ProGlycProt Experimentally characterized Prokaryotic GlycoProteins		27%
VKCDB - Voltage-gated K+ Channel Database Voltage-gated potassium channel database		27%
mutLBSgeneDB Mutations in Ligand Binding Sites gene DataBase		27%
Lipase Engineering Database The Lipase Engineering Database (http://www.led.uni-stuttgart.de) integrates information on sequence, structure, and function of lipases, esterases, and related proteins. Sequence data on 806 protein entries are assigned to 38 homologous families, wh ...		27%
PPD The Protein pKa Database (PPD) v1.0 provides a compendium of protein residue-specific ionisation equilibria (pKa values), as collated from the primary literature, in the form of a web-accessible postgreSQL relational database. Ionizable residues play ...		27%
3DIV Three-dimensional (3D) chromatin structure is an emerging paradigm for understanding gene regulation mechanisms. Hi-C (high-throughput chromatin conformation capture), a method to detect long-range chromatin interactions, allows extensive genome-wide ...		26%
DescribePROT DescribePROT is a database containing annotations of 13 putative structural and functional properties at the amino acid level for ~1.4 million proteins from 83 popular/model organisms, to be extended to hundreds of additional organisms. Users can sear ...		26%
Transmembrane Helices in Genome Sequences A web based database of Transmembrane Helices in Genome Sequences.		26%
Functional Coverage of the Proteome FCP is a publicly accessible web tool dedicated to analysing the current state and trends on the population of available structures along the classification schemes of enzymes and nuclear receptors, offering both graphical and quantitative data on th ...		26%
SitEx Projections of protein functional Sites on Exons		26%
Laminin Database Laminins (LM) correspond to a large number of heterotrimeric glycoproteins, playing a major role in several cell functions, including differentiation, proliferation, adhesion, and migration [1-3]. In addition to binding to other extracellular mat ...		25%
dbPTM dbPTM is a database that accumulates the biological information related to protein post-translational modification (PTM), such as the catalytic sites, structural information, solvent accessibility of residues, protein secondary structures, protein ...		25%
UniParc The UniProt archive (UniParc), part of the UniProt databases, is an archival protein sequence collection from all major publicly accessible resources. New and revised protein sequences are added daily into UniParc while not deleting the previous vers ...		25%
PIR - Protein Information Resource The Protein Information Resource (PIR) is an integrated public bioinformatics resource that supports genomic and proteomic research and scientific studies. PIR has provided many protein databases and analysis tools to the scientific community, includ ...		23%
PANDIT PANDIT is a collection of multiple sequence alignments and phylogenetic trees covering many common protein domains. It contains the seed protein sequence alignments from the Pfam-A (curated families) database; nucleotide sequence alignments derived f ...		23%
Cyanolyase Sequences and motifs of the phycobilin lyase protein family		23%
Protein Classification Benchmark Collection The Protein Classification Benchmark Collection was created in order to create standard datasets on which the performance of machine learning methods can be compared.		23%
Phospho3D Phospho3D is a database of three-dimensional structures of phosphorylation sites which stores information retrieved from the phospho.ELM database and which is enriched with structural information and annotations at the residue level. The database als ...		23%
SISYPHUS The SISYPHUS database contains manually curated multiple structural alignments constructed for a set of proteins with known three-dimensional structures that have revealed non-trivial structural relationships and whose structural similarity is ambigu ...		23%
TopDB Topology Data Bank of transmembrane proteins		23%
SwissSidechain SwissSidechain is a structural and molecular mechanics database of hundreds of non-natural amino-acid sidechains that can be used to study in silico their insertion into natural peptides or proteins.		23%
UCSD-Nature Signaling Gateway Molecule Pages Expert-authored and peer-reviewed information on mammalian proteins involved in cellular signaling		23%
UniSave The UniProtKB Sequence/Annotation Version database (UniSave) is a comprehensive archive of UniProtKB/Swiss-Prot and UniProtKB/TrEMBL entry versions. All changed Swiss-Prot and TrEMBL entries are loaded into the UniSave as part of the public UniProtK ...		23%
SuperSite Dictionary of binding sites in proteins		22%
ProTeus Signature sequences at the protein N- and C-termini		22%
CoPS Comprehensive peptide signature database		22%
Protein Clusters Related protein sequences (clusters) of Reference Sequence proteins encoded by complete genomes		22%
InterDom Putative protein domain interactions		22%
PDBSite 3D structure of protein functional sites		22%
Uniclust Clustered protein sequences and multiple sequence alignments		22%
LenVarDB Database of length variation in protein domains		22%
Protein kinase resource The Protein Kinase Resource (PKR) is a curated information source which provides an integrated view of sequence and structure data combined with biochemical and genetic function data focused on a single family of proteins, the protein kinases. In add ...		22%
iPfam A database of Pfam domain interactions		22%
Minimotif Miner Search tools for short functional motifs involved in posttranslational modifications, binding to other proteins, nucleic acids, or small molecules		22%
MegaMotifbase Structural motifs in protein families and superfamilies		22%
TOPPR The Online Protein Processing Resource		22%
Secreted Protein Database Secreted proteins from human, mouse and rat		22%
eProS Energy profiles of protein structures		22%
Heme Protein Database Heme types, protein structures, axial ligands and Em values		22%
ValidNESs		22%
O-GLYCBASE O-GLYCBASE is a database of glycoproteins with O-linked and C-linked glycosylation sites. Entries with at least one experimentally verified glycosylation site have been compiled from protein sequence databases and literature. Each entry contains info ...		22%
PPT-DB Protein Property Prediction and Testing Database		22%
PRF Protein research foundation database of peptides: sequences, literature and unnatural amino acids		22%
RPG - Ribosomal Protein Gene database Ribosomal protein gene database		22%
Kinomer Classification of protein kinases encoded in various eukaryotic species		22%
MALISAM Manual alignments for structurally analogous motifs in proteins		22%
OPTIC Orthologous and Paralogous Transcripts in Clades		22%
Membranome A database of single-pass membrane proteins		22%
eF-site - Electrostatic surface of Functional site Electrostatic potentials and hydrophobic properties of the active sites		22%
WDSPdb WD40 domain structure predictions		22%
DoBISCUIT Database Of BIoSynthesis clusters CUrated and InTegrated		22%
SelenoDB A database of selenoprotein genes, proteins and SECIS elements		22%
DAnCER Disease-Annotated Chromatin Epigenetics Resource		22%
ChromDB Chromatin-associated proteins in a broad range of organisms		22%
NPD - Nuclear Protein Database The NPD is a curated database that contains information on more than 1200 vertebrate proteins that are thought, or are known, to localise to the cell nucleus. Each entry is annotated with information on predicted protein size and isoelectric point, a ...		22%
Hits High throughput genome (HTG) and expressed sequence tag (EST) sequences are currently the most abundant nucleotide sequence classes in the public database. The large volume, high degree of fragmentation and lack of gene structure annotations prevent ...		22%
PlantTribes Families of protein-coding genes from five sequenced plant species		22%
UUCD Ubiquitin and ubiquitin-like conjugation database		22%
PLPMDB Pyridoxal-5'-phosphate dependent enzymes mutations		22%
Ribonuclease P Database RNase P sequences, alignments, and structures		22%
iProLINK iProLINK (integrated Protein Literature, INformation and Knowledge) is a resource to facilitate text mining research in the area of literature-based database curation, named entity recognition, and protein ontology development. This collection of ann ...		22%
KIDFamMap Kinase-inhibitor-disease family map		22%
PHYTOPROT Clusters of predicted plant proteins		22%
NRichD Efficiency of protein remote homology detection methods depends on the dispersion of the protein sequence space and the availability of intermediate sequences between two related protein families. In the absence of any structural evidence and natural ...		22%
PA-GOSUB Protein sequences from model organisms, GO assignment and subcellular localization		22%
PyIgClassify Clusters of conformations of antibody CDRs		22%
ZiFDB Zinc Finger DataBase		22%
WERAM Writers, Erasers and Readers of Histone Acetylation and Methylation		22%
RaftProt Lipid raft associated proteins in mammals		22%
TransportDB Sequences and classification of predicted membrane transporters encoded in complete genomes		22%
Animal Toxin Database Database of animal toxins		22%
ADDA - A Domain Database ADDA is a global clustering of protein sequences into protein domains and protein domain families. The database currently contains domains for 1.5 million sequences from UniProt, ENSEMBL, and other sequence databases. The domains are grouped into 123,000 ...		22%
iProClass The iProClass database provides value-added information reports for UniProtKB and unique NCBI Entrez protein sequences in UniParc, with links to over 175 biological databases, including databases for protein families, functions and pathways, interact ...		22%
Degradome Database Proteases, protease inhibitors and protease mutations in human, chimpanzee, mouse, and rat		22%
PIDD PIDD is a dedicated database and structural bio-informatics system for distance based protein modeling. The database is developed to host and analyze the statistical data for protein inter-atomic distances based on their distributions in databases of ...		22%
EENdb Engineered endonucleases: zinc finger nucleases and transcription activator-like effector nucleases		22%
PINT The first release of Protein-protein Interactions Thermodynamic Database (PINT) contains more than 1500 data of several thermodynamic parameters along with sequence and structural information, experimental conditions and literature information. Each ...		22%
MP:PD Membrane Proteins: Packing Densities, packing defects and internal water molecules		22%
PALI The database of Phylogeny and ALIgnment of homologous protein structures (PALI) contains structure-based sequence alignments and dendrograms based on information primarily derived from the structural alignments at domain level [1,2]. Protein domain d ...		22%
PhyloFacts The PhyloFacts resource contains pre-calculated structural and phylogenomic analysis of over 15,000 protein family "books" across the Tree of Life. Each book includes a multiple sequence alignment, one or more phylogenetic trees, predicted subfamilie ...		22%
SEVENS Seven-transmembrane-helix receptors (7-TMR), known as G-protein-coupled receptors [1], are important genes that work as the gateway of signal transduction induced by ligand binding. Recent progress in determination of human draft sequences [2,3] acce ...		22%
NBDB NBDB database provides profiles of Elementary Functional Loops (EFLs) involved in binding of nucleotide-containing ligands. Each EFL in form of a PSSM (position-specific scoring matrix) profile is complemented with the information on SCOP entities, s ...		22%
MulPSSM Representation of multiple sequence alignments of protein families in terms of Position Specific Scoring Matrices (PSSMs) is commonly used in the detection of remote homologues. A PSSM is generated with respect to one of the sequences involved in the ...		22%
COMe - Co-Ordination of Metals etc. COMe (Co-Ordination of Metals etc.) represents the classification of bioinorganic proteins. COMe consists of three types of entries: "bioinorganic motif", "molecule", and "complex protein"; each entry is assigned a unique identifier. A bioinorganic m ...		22%
OKCAM - now available at RhesusBase Ontology-based Knowledgebase for Cell Adhesion Molecules		22%
Peptaibol The Peptaibol Database is a sequence and structure resource for the unusual class of peptides known as peptaibols. The database includes sequence, biological source, and bibliographical data for the naturally-occurring peptaibols. Information is also ...		22%
ASC - Active Sequence Collection ASC (Active Sequences Collection) is a database of short amino acid sequences with known biological activity. The current version is substantially improved as compared to the previous release; it now includes more than 1300 different active short pro ...		22%
ProRule The ProRule database is a new section of PROSITE, which contains additional information about profiles. ProRule provides position specific-information about functionally and structurally relevant residues found in PROSITE profiles, as well as specifi ...		22%
Cybase CyBase is a curated database and information source for backbone-cyclised proteins. The database incorporates naturally occurring cyclic proteins as well as synthetic derivatives, grafted analogues and acyclic permutants. The database provides a cent ...		22%
eSLDB - eukaryotic Subcellular Localization database eSLDB (eukaryotic Subcellular Localization DataBase) collects the annotations of subcellular localization of eukaryotic proteomes. For each sequence, the database lists localization obtained adopting three different approaches: 1) experimentally dete ...		22%
ProTherm ProThermDB is a database for proteins and mutants with data on protein stability, an increase of 84% from the previous version. It contains several thermodynamic parameters such as melting temperature, free energy obtained with thermal and denaturant ...		22%
PRTAD PRTAD is a dedicated database and structural bioinformatics system for protein analysis and modeling. The database is developed to host and analyze the statistical data for protein residue level "virtual" bond and torsion angles, based on their distr ...		22%
3DSwap: Database of Proteins involved in 3D domain Swapping Protein oligomerization is a key biochemical step to perform the designated function of proteins. 3D domain swapping is a unique protein oligomerization phenomenon observed in a wide array of proteins involved in diverse functional roles. Apart from ...		22%
NMPdb - Nuclear matrix associated proteins database Nuclear matrix associated proteins database		22%
eBLOCKS Classifying proteins into families and super-families allows identification of functionally important conserved domains. The motifs and scoring matrices derived from such conserved regions provide computational tools to recognize similar patterns in n ...		22%
CyMoBase CyMoBase is an online database for manually annotated protein sequences of cytoskeletal and motor proteins and associated information. It currently offers more than 3000 sequences from 26 proteins in more than 350 species. Meta information linked to ...		22%
PFD - Protein Folding Database The Protein Folding Database (PFD) is a searchable collection of all annotated structural, methodological, kinetic and thermodynamic data relating to experimental protein folding studies. The database structure allows visualization of folding data in ...		22%
SUPFAM During the course of evolution, protein sequences derived from a common ancestor diverge by mutations, insertions and deletions, gene duplication and recombination and give rise to diverse families with no easily detectable sequence similarity. These ...		22%
NURSA NURSA is a resource within which bioinformatic and bench research efforts in the field of nuclear receptors can be pursued in a synergistic and multidisciplinary approach, using a common technological platform. The primary directive of the NURSA prog ...		22%
HRaP - Database of occurrence of HomoRepeats and Patterns in proteomes With active studying of disordered regions and their function we focus our attention on manifold long repeats of one amino acid (homorepeats) (1). Our database includes 122 proteomes, 97 eukaryotic and 25 bacterial ones that can be divided into 9 kin ...		22%
Defensins Knowledgebase The defensins knowledgebase is a manually curated database and information source devoted to the defensin family of antimicrobial peptides. The current version of the database holds a comprehensive collection of over 350 defensin records each contain ...		22%
BIOZON Biozon is a platform that allows for the storage, management, and analysis of interrelated proteins, genes, interactions, protein families, cellular pathways and more. These heterogeneous data types and the relations between them are locally warehous ...		22%
SENTRA SENTRA (http://www.ncbi.nlm.nih.gov/Complete_Genomes/SignalCensus.html) is a database of proteins associated with microbial signal transduction. The database currently includes the classical two-component signal transduction pathway proteins and meth ...		22%
DomIns - Database of Domain Insertions Proteins can be formed by single or multiple domains. The process of recombination at the molecular level has generated a wide variety of multi-domain proteins with specific domain organization to cater to the functional requirements of an organism. ...		22%
SBASE SBASE (http://www.icgeb.trieste.it/sbase) is an on-line collection of protein domain sequences and related computational tools designed to facilitate detection of domain homologies based on simple database search. The tenth - "jubilee release" of the ...		22%
CREMOFAC CREMOFAC is a dedicated web-database for ATP and Non-ATP dependent chromatin-remodeling factors. The database harbors factors from 49 different organisms reported in literature and facilitates a comprehensive search for them. It provides in-depth inf ...		22%
LOX-DB Due to their involvement in several diseases like cancer, inflammation, fever or arthritis, a lot of research is done on lipoxygenases yielding information about sequence, structure and function of these proteins. The LipOXygenases-DataBase (LOX-DB) ...		22%
NOPdb: Nucleolar Proteome Database The Nucleolar Proteome Database (NOPdb) archives data on more than 700 proteins that were identified by multiple mass spectrometry (MS) analyses from highly purified preparations of human nucleoli, the most prominent nuclear organelle. Each protein e ...		22%
SUBA The Arabidopsis Subcellular Database (SUBA, http://suba.plantenergy.uwa.edu.au) is maintained by the ARC Centre of Excellence in Plant Energy Biology at The University of Western Australia. The database contains publicly available protein subcellular ...		22%
SDAP SDAP (Structural Database of Allergenic Proteins) is a Web server that provides rapid, cross-referenced access to the sequences, structures, and IgE epitopes of allergenic proteins. The SDAP core is a series of CGI scripts that process the user queri ...		22%
Wnt Database Wnt proteins form a family of highly conserved secreted signaling molecules that regulate cell-to-cell interactions during embryogenesis. Wnt genes and Wnt signaling are also implicated in cancer. Insights into the mechanisms of Wnt action have emerg ...		22%
EukProt EukProt is a database of published and publicly available predicted protein sets and unannotated genomes selected to represent eukaryotic diversity, including 742 species from all major supergroups as well as orphan taxa. The goal of the database is ...		22%
EVEREST - EVolutionary Ensembles of REcurrent SegmenTs EVEREST is an automatic computational process identifying protein domains and classifying them into families. The EVEREST database contains 20,029 families, each defined by one or more HMMER HMMs. EVEREST has been thoroughly tested and evaluated, and ha ...		22%
GPCR NaVa database The GPCR NaVa database describes sequence variants within the family of human G Protein-Coupled Receptors (GPCRs). GPCRs regulate many physiological functions and are the targets for most of today's medicines. The acronym NaVa stands for Natural Vari ...		22%
RNRdb RNRdb - the Ribonucleotide Reductase Database - is a tool developed for ribonucleotide reductase (RNR) research. RNR is an enzyme that uses radical chemistry to reduce ribonucleotides to deoxyribonucleotides. Since this is the only pathway for the de ...		22%
NESbase Protein export from the nucleus is often mediated by a Leucine-rich nuclear export signal (NES) consisting of 4-5 hydrophobic residues within a region of approximately 10 amino acids. Many Leucine-rich NESs have been identified and reported in litera ...		22%
SRPDB Signal recognition particle (SRP) is a ribonucleoprotein particle designed to recognize secretory signal sequences as they emerge from the ribosome. SRP associates with the SRP-receptor in the ER membrane, is released from the ribosome, and recycled ...		22%
iUUCD The ubiquitin and ubiquitin-like (Ub/Ubl) conjugation is one of the most important post-translational modifications (PTMs) in proteins, and regulates a large number of cellular processes, such as cell cycle, signal transduction, apoptosis and auto ...		22%
PlantsP/PlantsT As one database with two functionally different web interfaces, PlantsP and PlantsT are plant-specific curated databases that combine sequence derived information with experimental functional genomics data. PlantsP focuses on proteins involved in the ...		22%
DSD Dehydrogenase enzymes belong to the oxidoreductase class and utilise the coenzymes NAD and NADP. Stereo-selectivity is focused on the C4 hydrogen atoms of the nicotinamide ring of NAD(P). Depending upon which hydrogen is transferred at the C4 locatio ...		22%
CSDBase - Cold Shock Domain database CSDBase (http://www.chemie.uni-marburg.de/~csdbase/) is an interactive Internet-embedded research platform providing detailed information on cold shock domain-containing proteins and bacterial cold shock responses. In its second release, access to CS ...		22%
NLSdb NLSdb is a database of nuclear localization signals (NLSs) and of nuclear proteins. NLSs are short stretches of residues mediating transport of nuclear proteins into the nucleus. The database contains 114 experimentally determined NLSs that were obtaine ...		22%
EROP-Moscow Natural oligopeptides may regulate nearly all vital processes. To date, the chemical structures of nearly 6000 oligopeptides have been identified from more than 1000 organisms representing all the biological kingdoms. We have compiled the known physi ...		22%
AAindex AAindex is a database of amino acid indices and amino acid mutation matrices. An amino acid index is a set of 20 numerical values representing various physicochemical and biochemical properties of amino acids. An amino acid mutation matrix is general ...		22%
OGRe - Organellar Genome Retrieval OGRe is a relational database containing information on completely sequenced animal mitochondrial genomes. It currently contains 473 species. This is the full set of complete metazoan mitochondrial genomes available as of July 2004. The structure of ...		22%
InterFil The Human Intermediate Filament Database (http://www.interfil.org) was initiated by the Human Genetics Unit, University of Dundee in 2001 and was revised by the Centre for Molecular Medicine and the Bioinformatics Institute in Singapore in 2006, from ...		22%

*ReputationScore indicates how established a given datasource is.
