
Haemophilus ducreyi Search Results

Record: 1 of 1  
MiniMap (neighboring features): genes HD1004 (hupA), HD1005 (thyA), HD1006, HD1007 (bioB), HD1008 (cspC), HD1009, HD1010 (prc), HD1011 to HD1017; intergenic regions IGR723 to IGR735.
* Calculated from Protein Sequence

Gene ID: HD1010

DNA Molecule Name:
1  

Genbank ID:
0

Gene Name:
prc  

Definition:
tail specific protease precursor

Gene Start:
802102

Gene Stop:
804168

Gene Length:
2067
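
The Gene Length above follows from the Gene Start and Gene Stop values if the coordinates are read as 1-based and inclusive. The short sketch below checks that arithmetic; the inclusive convention is an assumption (the record does not state it), and the function name is only illustrative.

```python
# Coordinates copied from the record. They are treated here as 1-based and
# inclusive, an assumption that the reported Gene Length is consistent with.
GENE_START = 802102
GENE_STOP = 804168


def gene_length(start: int, stop: int) -> int:
    """Length in nucleotides of an inclusive coordinate range."""
    return abs(stop - start) + 1


length_nt = gene_length(GENE_START, GENE_STOP)
codons = length_nt // 3   # number of codons, terminal stop codon included
residues = codons - 1     # translated residues, stop codon excluded

print(length_nt, codons, residues)  # 2067 689 688
```

The 688 residues agree with the protein sequence listed further down, which ends with a stop marker.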

Molecular Weight*:
78561

pI*:
10.00

Net Charge*:
17.20
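
The starred values are derived from the protein sequence. The record does not say which method or parameter set was used, so the following is only a rough sketch of how such figures can be estimated: it sums standard average residue masses plus one water molecule, and counts basic against acidic residues as a crude stand-in for net charge. The function names are hypothetical, and the results are not expected to reproduce the record's numbers exactly.

```python
# Average residue masses in daltons (amino acid minus one water molecule).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini


def clean(seq: str) -> str:
    """Remove whitespace and a trailing stop marker ('$' or '*')."""
    return "".join(seq.split()).upper().rstrip("$*")


def molecular_weight(seq: str) -> float:
    """Approximate average molecular weight of an unmodified peptide chain."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in clean(seq)) + WATER


def charge_balance(seq: str) -> int:
    """Crude charge indicator: (K + R) minus (D + E). Ignores pH and pKa."""
    s = clean(seq)
    return s.count("K") + s.count("R") - s.count("D") - s.count("E")


print(round(molecular_weight("MLTKIRIFMK"), 1), charge_balance("MLTKIRIFMK"))
```

A positive balance of basic over acidic residues is at least consistent with the high pI and positive net charge reported for this protein, although a proper calculation would use residue pKa values at a stated pH.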

EC:
3.4.21.-  

Functional Class:
Translation; Degradation of proteins, peptides, and glycopeptides  

Secondary Evidence:
Beebe KD, Shin J, Peng J, Chaudhury C, Khera J, Pei D.
Substrate recognition through a PDZ domain in tail-specific protease.
Biochemistry. 2000 Mar 21;39(11):3149-55.
PMID: 10715137

Comment:
Tail-specific protease (Tsp) reportedly cleaves substrates whose C-termini bear nonpolar residues (Beebe et al., 2000).

Blast Summary:
Gapped BLAST returns numerous significant hits to tail-specific proteases. For example, residues 18-685 are 64% similar to the tail-specific protease of Haemophilus influenzae (U32840), residues 5-686 are 47% similar to the tail-specific protease of Escherichia coli (M75634), and residues 5-658 are 47% similar to the tail-specific protease precursor of Escherichia coli (D90827).

COGS Summary:
BeTs to 10 clades of COG0793
COG name: Periplasmic protease
Functional Class: M
The phylogenetic pattern of COG0793 is -----qvCeB-huj--olinx
Number of proteins in this genome belonging to this COG is 1

Blocks Summary:
***** IPB003581 (Tail specific protease) with a combined E-value of 5.4e-23.
    IPB003581B    398-407
    IPB003581C    467-496


ProDom Summary:
Residues 544-680 are 64% similar to a (PROTEASE TAIL-SPECIFIC PROTEOME COMPLETE) protein domain (PD038233) which is seen in Q9CP01_PASMU.

Residues 29-262 are 67% similar to a (PROTEASE TAIL-SPECIFIC PROTEOME COMPLETE) protein domain (PD039119) which is seen in PRC_HAEIN.

Residues 265-329 are 58% similar to a (PROTEASE PERIPLASMIC TAIL-SPECIFIC SERINE) protein domain (PD406648) which is seen in PRC_HAEIN.

Residues 359-536 are 67% similar to a (PROTEASE PROTEOME COMPLETE PRECURSOR) protein domain (PD004132) which is seen in PRC_HAEIN.

Residues 265-330 are 62% similar to a (DOMAIN PROTEASE TIGHT JUNCTION) protein domain (PD000073) which is seen in Q9CP01_PASMU.



Paralogs:
HD1010 has no significant similarity (blastp p-value < 1e-3) to any other gene in this genome.


Pfam Summary:
Residues 246 to 330 (E-value = 2.9e-12) place HD1010 in the PDZ family which is described as PDZ domain (Also known as DHR or GLGF) (PF00595)
Residues 363 to 548 (E-value = 2.5e-34) place HD1010 in the Peptidase_S41 family which is described as Peptidase family S41B (PF03572)
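
The two Pfam lines above follow a fixed sentence pattern, so they can be turned into structured records. The sketch below is one possible way to do that; the regular expression is written against the wording of these two lines only and is an assumption about the format, not a documented schema of the database.

```python
import re
from typing import NamedTuple, Optional


class PfamHit(NamedTuple):
    start: int
    end: int
    evalue: float
    gene: str
    family: str
    accession: str

    @property
    def length(self) -> int:
        """Number of residues covered, treating the range as inclusive."""
        return self.end - self.start + 1


# Pattern inferred from the two summary lines in this record (an assumption).
PFAM_LINE = re.compile(
    r"Residues (\d+) to (\d+) \(E-value = ([0-9.eE+-]+)\) "
    r"place (\S+) in the (\S+) family.*\((PF\d+)\)\s*$"
)


def parse_pfam_line(line: str) -> Optional[PfamHit]:
    """Return a PfamHit for a matching summary line, or None otherwise."""
    m = PFAM_LINE.search(line.strip())
    if m is None:
        return None
    start, end, evalue, gene, family, acc = m.groups()
    return PfamHit(int(start), int(end), float(evalue), gene, family, acc)


lines = [
    "Residues 246 to 330 (E-value = 2.9e-12) place HD1010 in the PDZ family "
    "which is described as PDZ domain (Also known as DHR or GLGF) (PF00595)",
    "Residues 363 to 548 (E-value = 2.5e-34) place HD1010 in the Peptidase_S41 "
    "family which is described as Peptidase family S41B (PF03572)",
]
hits = [parse_pfam_line(line) for line in lines]
for hit in hits:
    print(hit.family, hit.accession, hit.start, hit.end, hit.length)
```

Read this way, the PDZ domain spans 85 residues and the Peptidase_S41 domain spans 186 residues, with the PDZ domain lying N-terminal to the protease domain.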

PDB Hit:
pdb|1FC7|A Chain A, Photosystem II D1 C-Terminal Processing Pro... (score 89, E-value 2e-18)
pdb|1FC6|A Chain A, Photosystem II D1 C-Terminal Processing Pro... (score 87, E-value 7e-18)

Gene Protein Sequence:
MLTKIRIFMKVNKLSRLIAFCLGTMISTAFAIEPKIKANELVLPQPTELH
NLSTKRVTARLTQSHYRKFNLDDEFAGKIFNRYIDWLDVAHNTFLQSDID
ELRTKYASKLDEELYEGKLDSAFEMYDVMVKRRYERYKHVLSLLDKPLDL
KGRDQIENEREKAPFPVTIEEANKLWEQRVKNDVISLTLKNKKWPEIKKT
LIKRYNLAIKRLTQVKPDDVLQTYLNAFAREIDPHTSYLSPRAAKAFQES
MNLSLEGIGATLSMEDDITTIKSLIPGAPAARSKKLGVGDKIVGVGQEKG
EIEDVIGWRLDDVVDKIKGKKGSKVRLEIEPAKAHKTKIITLTRDTVRLE
DSAAKLTIDKVAGKNIAVIKIPAFYIGLANDMHKLLAKIKTKKIEGLIVD
LRDNGGGSLTEVVELTRLFIKSSPVVQVRDAFNRIKVYEDEPQIQVDKSV
VTTNKPLYDGKLMVMINRHSASASEIFAAAMQDYNRAIIIGQQTFGKGTV
QQSRSLNFVYDLDRDPLGFIQYTIQKFYRIDGGSTQLKGVQADIKFPEII
DAEKTGESFEDNALPWDKIPAANYQEVGNARDVVAVLTEKHQARIAKEPE
FITLSENIAIRKERDQRKFTSLNLEERKKEDQADDMKRLKDLNARFIREG
KKPLKNLDALPKDYEGPDFFLKEAKQMMFDWLQINKKA$

Gene Nucleotide Sequence:
ATGCTTACAAAAATTAGGATTTTTATGAAAGTAAATAAACTTAGTCGACT
TATTGCATTTTGTCTTGGAACAATGATAAGTACCGCTTTTGCAATTGAAC
CAAAAATTAAAGCAAATGAATTAGTTCTACCTCAACCAACAGAACTACAT
AATCTGTCAACTAAGCGTGTGACAGCACGTTTAACCCAATCACATTATCG
TAAATTTAATTTAGATGATGAATTTGCCGGTAAGATTTTTAATCGTTATA
TTGATTGGTTAGATGTCGCACACAATACTTTTTTACAATCTGATATTGAT
GAACTGCGGACTAAATATGCATCTAAATTAGATGAAGAACTATATGAAGG
TAAATTAGATTCCGCCTTTGAAATGTATGACGTAATGGTCAAGCGTCGCT
ATGAACGTTATAAACATGTACTTTCATTGCTAGATAAACCGCTTGATTTA
AAAGGCCGTGATCAAATTGAAAATGAACGTGAAAAAGCGCCTTTCCCGGT
TACGATAGAAGAAGCAAATAAGTTATGGGAACAGCGGGTTAAAAATGACG
TAATTAGCTTGACTCTAAAAAATAAAAAATGGCCAGAAATTAAGAAGACA
TTGATTAAGCGCTATAATTTAGCGATTAAACGTTTAACACAAGTTAAACC
TGATGATGTTTTACAAACTTATTTGAATGCATTTGCACGAGAAATAGATC
CGCATACGAGTTATCTTTCACCACGAGCGGCGAAAGCTTTCCAAGAAAGT
ATGAATTTGTCATTAGAAGGCATTGGTGCAACACTTTCCATGGAAGACGA
TATTACCACCATTAAATCATTGATTCCAGGCGCCCCAGCAGCTCGTAGCA
AAAAGCTTGGCGTAGGGGATAAGATTGTGGGTGTTGGCCAAGAGAAAGGT
GAAATTGAAGATGTTATTGGTTGGCGTTTAGATGATGTTGTTGACAAAAT
CAAGGGTAAAAAAGGCAGTAAGGTTCGGTTAGAGATTGAGCCAGCAAAAG
CGCATAAAACTAAAATCATTACGTTAACACGCGATACCGTTCGGTTAGAA
GACAGTGCCGCTAAATTAACAATTGATAAAGTAGCGGGTAAAAATATTGC
AGTGATTAAAATTCCTGCTTTTTATATCGGTTTAGCTAATGATATGCACA
AATTATTAGCAAAAATTAAGACTAAGAAAATAGAAGGATTAATTGTCGAT
TTACGTGATAACGGTGGTGGTTCTTTAACTGAAGTGGTTGAATTAACTAG
GTTATTTATTAAAAGCAGTCCTGTTGTACAGGTAAGAGATGCTTTTAATC
GTATCAAAGTGTACGAAGATGAGCCTCAAATACAAGTAGATAAGTCAGTA
GTAACAACAAATAAGCCATTATATGACGGAAAATTGATGGTAATGATAAA
CCGCCATAGCGCCTCAGCATCTGAAATTTTTGCGGCGGCAATGCAAGATT
ATAATCGTGCAATTATTATTGGCCAGCAAACTTTTGGTAAAGGCACTGTG
CAACAAAGTAGATCGCTTAATTTTGTTTATGATTTAGATCGAGATCCACT
AGGCTTTATTCAATATACGATCCAGAAATTCTATCGTATTGATGGTGGCA
GTACTCAATTAAAAGGTGTGCAAGCGGATATTAAATTCCCTGAAATTATT
GATGCGGAGAAAACCGGAGAAAGTTTTGAAGATAATGCATTACCATGGGA
TAAAATTCCAGCCGCTAATTATCAAGAAGTTGGAAATGCGCGTGATGTAG
TTGCTGTTTTAACAGAAAAACATCAAGCGCGCATTGCAAAAGAGCCTGAA
TTTATCACGTTAAGTGAGAATATTGCTATTCGTAAAGAACGCGATCAGCG
TAAATTTACCTCGCTGAATTTAGAAGAGCGTAAAAAAGAAGATCAAGCAG
ATGATATGAAACGCTTAAAAGATCTGAATGCCCGCTTTATACGAGAAGGT
AAAAAGCCCCTGAAAAATCTTGATGCATTACCAAAAGATTATGAAGGTCC
CGACTTTTTCTTAAAAGAAGCGAAACAAATGATGTTTGATTGGTTACAAA
TCAATAAAAAGGCATAA
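
The protein sequence listed earlier corresponds to a direct translation of this nucleotide sequence. As a minimal sketch, the code below translates DNA with the standard codon assignments (an assumption; the record does not name the translation table used) and marks the stop codon with "*", whereas the record marks it with "$".

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping after the first stop codon ('*')."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


# First line of the nucleotide sequence above (50 nt, so 16 full codons).
first_line = "ATGCTTACAAAAATTAGGATTTTTATGAAAGTAAATAAACTTAGTCGACT"
print(translate(first_line))            # MLTKIRIFMKVNKLSR

# Final six codons of the gene, ending in the TAA stop codon.
print(translate("ATCAATAAAAAGGCATAA"))  # INKKA*
```

Both outputs match the start (MLTKIRIFMKVNKLSR) and the end (INKKA followed by the stop marker) of the protein sequence in this record.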


Los Alamos National Laboratory
Operated by the University of California for the National Nuclear Security Administration,
of the US Department of Energy. Copyright © 2001 UC