
Haemophilus ducreyi Search Results

Record: 1 of 1  
MiniMap (neighboring features): pfkA (HD0465), lgtA (HD0466), HD0467, rluD (HD0469), HD0470, gcp (HD0471), lgtB (HD0472), cyaY (HD0473), recQ (HD0475), eno (HD0477); intergenic regions IGR330 to IGR339; one repeat (Type: tandem, Name: - 10)
* Calculated from Protein Sequence

Gene ID: HD0471

DNA Molecule Name:
1  

Genbank ID:
0

Gene Name:
gcp  

Definition:
sialylglycoprotease

Gene Start:
370972

Gene Stop:
372018

Gene Length:
1047
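
The Gene Length value is consistent with the Gene Start and Gene Stop coordinates if both ends are counted inclusively. The short check below is a sketch under that assumption; the function name `gene_length` is illustrative and not part of the database.

```python
def gene_length(start: int, stop: int) -> int:
    """Length of a feature whose start and stop coordinates are both inclusive."""
    return abs(stop - start) + 1


# Coordinates taken from the Gene Start and Gene Stop fields of this record.
length_nt = gene_length(370972, 372018)

# A coding sequence of this length holds length_nt / 3 codons,
# the last of which is the stop codon.
codons = length_nt // 3
residues = codons - 1

print(length_nt, codons, residues)  # 1047 349 348
```

The 348 residues agree with the residue range 1-348 reported in the Blast Summary.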

Molecular Weight*:
37681

pI*:
6.60

Net Charge*:
-2.01

EC:
3.4.24.57  

Functional Class:
Translation; Degradation of proteins, peptides, and glycopeptides  


Primary Evidence:
Sun,S., Schilling,B., Tarantino,L., Tullius,M.V., Gibson,B.W. and Munson,R.S. Jr.,
Cloning and characterization of the lipooligosaccharide galactosyltransferase II gene of Haemophilus ducreyi,
J. Bacteriol. 182 (8), 2292-2298 (2000).
Medline: 20200369

Secondary Evidence:
Abdullah,K.M., Lo,R.Y. and Mellors,A.,
Cloning, nucleotide sequence, and expression of the Pasteurella haemolytica A1 glycoprotease gene,
J. Bacteriol. 173 (18), 5597-5603 (1991).
Medline: 91358346


Comment:
From Genbank:[gi:544376]
This enzyme is a neutral metalloprotease that cleaves specifically
O-sialoglycoproteins such as glycophorin A.

Blast Summary:  PSI-Blast Search
A match in gapped BLAST to a previously submitted sequence in Genbank: residues 1-348 are 96% similar to sialylglycoprotease from Haemophilus ducreyi (6942294|).

Matches in gapped BLAST to sialylglycoproteases. Residues 1-323 are 83% similar to this enzyme from Pasteurella haemolytica (97190|).

COGS Summary:  COGS Search
BeTs to 17 clades of COG0533
COG name: Metal-dependent proteases with possible chaperone activity
Functional Class: O
The phylogenetic pattern of COG0533 is amtkYQvcebrhujgpolinx
Number of proteins in this genome belonging to this COG is 1

Blocks Summary:  Blocks Search
***** IPB000905 (Glycoprotease, (M22) metallo-protease family) with a combined E-value of 4.4e-97.
    IPB000905A    3-17
    IPB000905B    40-51
    IPB000905C    72-112
    IPB000905D    131-143
    IPB000905E    154-181
    IPB000905F    207-218
    IPB000905G    272-281
    IPB000905H    302-311


ProDom Summary:  Protein Domain Search
Residues 1-319 are 95% similar to a (ENDOPEPTIDASE O-SIALOGLYCOPROTEIN) protein domain (PD002367) which is seen in Q9L7A5_HAEDU.

Residues 320-348 are identical to a (PROTEASE SIALYLGLYCOPROTEASE) protein domain (PD223493) which is seen in Q9L7A5_HAEDU.



Paralogs:  Local Blast Search
HD0471 is paralogously related to HD1474 (4e-09).


Pfam Summary:  Pfam Search
Residues 2 to 283 (E-value = 2.9e-142) place HD0471 in the Peptidase_M22 family which is described as Glycoprotease family (PF00814)

PDB Hit:
No significant hits to the NCBI PDB database.

Gene Protein Sequence:
MRILGIETSCDETGVAIYDEQRGLIANQLYSQIEMHADYGGVVPELASRD
HIRKTLPLIQAALKEANLTASEIDGIAYTAGPGLVGALLVGATIARALAY
AWNVPALAVHHMEGHLMAPMLEENPPEFPFIALLISGGHTQLIKVAGVGE
YEILGESIDDAAGEAFDKTGKLLGLDYPAGVALSQLAEKGTPNRFVFPRP
MTDRPGLDFSFSGLKTFAANTINAQLDENGQLNEQTRCDIAHAFQQAVVD
TIIIKCKRALQQTGYSRLVMAGGVSANKQLRAELATMMQALKGQVYYPRP
QFCTDNGAMIAYTGFIRLKKGEKTDLSVSVKPRWPMTTLLKLSAHHKV$
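
The Molecular Weight field is marked as calculated from the protein sequence. The record does not say which mass table or method was used, so the following is only a minimal sketch: it assumes standard average residue masses and adds one water molecule for the free termini. The result may differ slightly from the listed value of 37681.

```python
# Average residue masses in daltons (amino acid minus one water).
# Assumption: the database used comparable average masses; this is not stated in the record.
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.0153

# Gene Protein Sequence from this record; "$" marks the stop codon.
PROTEIN = (
    "MRILGIETSCDETGVAIYDEQRGLIANQLYSQIEMHADYGGVVPELASRD"
    "HIRKTLPLIQAALKEANLTASEIDGIAYTAGPGLVGALLVGATIARALAY"
    "AWNVPALAVHHMEGHLMAPMLEENPPEFPFIALLISGGHTQLIKVAGVGE"
    "YEILGESIDDAAGEAFDKTGKLLGLDYPAGVALSQLAEKGTPNRFVFPRP"
    "MTDRPGLDFSFSGLKTFAANTINAQLDENGQLNEQTRCDIAHAFQQAVVD"
    "TIIIKCKRALQQTGYSRLVMAGGVSANKQLRAELATMMQALKGQVYYPRP"
    "QFCTDNGAMIAYTGFIRLKKGEKTDLSVSVKPRWPMTTLLKLSAHHKV$"
).rstrip("$")


def molecular_weight(sequence: str) -> float:
    """Sum the average residue masses and add one water for the chain termini."""
    return sum(RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


print(len(PROTEIN), round(molecular_weight(PROTEIN)))
```

The pI and Net Charge fields additionally depend on a set of side-chain pKa values, which the record does not specify, so they are not reproduced here.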

Gene Nucleotide Sequence:  Sequence Viewer
ATGCGAATTTTAGGAATTGAAACTTCATGTGATGAAACTGGTGTTGCTAT
TTATGATGAGCAACGAGGTCTAATTGCAAATCAACTTTATAGCCAAATTG
AGATGCACGCTGATTATGGTGGTGTGGTTCCCGAGTTAGCTTCACGAGAT
CATATCCGTAAAACATTACCTTTGATTCAAGCGGCATTAAAAGAAGCAAA
TTTAACCGCTTCTGAGATTGATGGCATTGCTTATACGGCTGGACCAGGGC
TAGTTGGAGCCTTGCTTGTTGGTGCCACTATTGCACGTGCATTAGCTTAT
GCTTGGAATGTTCCCGCATTAGCTGTACATCATATGGAAGGGCATTTAAT
GGCACCTATGCTAGAAGAAAATCCTCCTGAATTTCCCTTTATTGCGCTGT
TGATTTCAGGTGGGCATACACAATTAATTAAAGTAGCGGGTGTAGGTGAA
TATGAAATTTTAGGCGAATCAATAGATGATGCTGCGGGAGAAGCTTTTGA
TAAAACGGGTAAATTGCTTGGTTTGGATTACCCTGCTGGTGTTGCTTTAT
CGCAATTAGCGGAAAAAGGTACGCCGAATCGTTTTGTTTTTCCTCGGCCA
ATGACGGATAGACCAGGTTTAGACTTTAGTTTCTCTGGTTTGAAAACCTT
TGCAGCAAATACGATTAATGCTCAGTTAGATGAAAATGGTCAGTTAAATG
AACAAACGCGTTGTGATATTGCACACGCATTTCAGCAAGCTGTTGTGGAT
ACGATTATTATTAAATGTAAGCGAGCATTACAGCAAACAGGCTATAGCCG
TTTAGTTATGGCGGGCGGTGTAAGTGCTAATAAACAGTTACGTGCAGAAT
TAGCTACTATGATGCAAGCATTAAAAGGTCAAGTCTATTATCCTCGTCCT
CAATTTTGTACAGATAATGGGGCAATGATTGCCTATACCGGTTTTATTCG
CTTGAAAAAGGGGGAAAAAACGGATTTAAGTGTAAGTGTAAAGCCTCGTT
GGCCAATGACAACTTTATTAAAATTATCAGCGCACCATAAAGTTTAA
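
The protein sequence above can be recovered from this nucleotide sequence by reading it in frame from the first base. The sketch below assumes the standard genetic code (the sense codons of the bacterial table are the same) and uses "$" for the stop codon to match this record's notation. The names `CODON_TABLE` and `translate` are illustrative, and only the first 36 and last 15 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, stop_symbol: str = "$") -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            protein.append(stop_symbol)
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 bases and last 15 bases of the gene nucleotide sequence.
print(translate("ATGCGAATTTTAGGAATTGAAACTTCATGTGATGAA"))  # MRILGIETSCDE
print(translate("CACCATAAAGTTTAA"))  # HHKV$
```

Both fragments match the start (MRILGIETSCDE) and end (HHKV$) of the Gene Protein Sequence.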


Los Alamos National Laboratory     
Operated by the University of California for the National Nuclear Security Administration,
of the US Department of Energy. Copyright © 2001 UC