Basic Search | Intermediate Search | Advanced SQL Search | Gene Image Map |  Home

Treponema pallidum Search Results

Record: 1 of 1  
MiniMap IGR189 IGR191 IGR192 IGR190 rpsL, - TP0243 rpsG, - TP0244 TP0245 rpoB, - TP0241 rpoC, - TP0242 rpsL, - TP0243 rpsG, - TP0244 TP0245 rpoB, - TP0241 rpoC, - TP0242 Type: tandem, Name:  - 237 rpsL, - TP0243 rpsG, - TP0244 TP0245 rpoB, - TP0241 rpoC, - TP0242
* Calculated from Protein Sequence

Gene ID: TP0242

DNA Molecule Name:
1  

Genbank ID:
3322512

Gene Name:
rpoC  

Definition:
DNA-directed RNA polymerase beta' subunit

Gene Start:
249134

Gene Stop:
253381

Gene Length:
4248

Molecular Weight*:
159796

pI*:
7.75

Net Charge*:
7.66

EC:
2.7.7.6  

Functional Class:
Transcription; DNA-dependent RNA polymerase  

Pathway: pathway table
Nucleotide metabolism; Purine Metabolism
Nucleotide metabolism; Pyrimidine Metabolism

Comment:
DNA-dependent RNA polymerase catalyzes the transcription of DNA into
RNA using the four ribonucleoside triphosphate substrates.

Reaction:
N NUCLEOSIDE TRIPHOSPHATE = N PYROPHOSPHATE + RNA(N).

See TP0212, rpoA.
See TP0241, rpoB.

Blast Summary:  PSI-Blast Search
Gapped BLAST revealed several significant hits to DNA-directed RNA
polymerase, beta prime subunit (rpoC). For example residues 6-1366
are 49% similar to RPOC_ECOLI DNA-directed RNA polymerase beta'
chain of Escherichia coli, (V00339).

COGS Summary:  COGS Search
BeTs to 17 clades of COG0086
COG name: DNA-dependent RNA polymerase beta' subunit (split gene in Mjan, Mthe, Aful, and Synechocystis)
Functional Class:  K
The phylogenetic pattern of COG0086 is AMTKYqVCebrhujgpolinx
Number of proteins in this genome belonging to this COG is 1

Blocks Summary:  Blocks Search
Residues 231-242, 252-305, 412-465, 721-775, 865-913, 1232-1286,
1155-1209, and 1339-1378 represent significant (100%) hits to blocks
BL00115A,B,C,E,F,G,H, Eukaryotic RNA polymerase II heptapeptide
repeat proteins. Residues 341-358 represent a significant (99%) hit
to block PR00692F, T-CELL SURFACE GLYCOPROTEIN CD4 SIGNATURE.
Residues 61-95, 586-632, and 1030-1068 represent significant (96%)
hits to blocks BL00544B,C,D, Methylmalonyl-CoA mutase proteins.

ProDom Summary:  Protein Domain Search
Residues 571-798 are 48% similar to DNA-DIRECTED RNA POLYMERASE DELTA
CHAIN of RPOD_SYNY3. Residues 12-135 are 73% similar to DNA-directed
RNA polymerase, beta'/gamma subunit of RPOC_BROTH. Residues
116-201, 221-310, 311-358, 328-341, 411-502, 804-871, 878-920,
1020-1099, 1137-1153, 1224-1296, 1322-1359, and 1345-1364 are good
matches to RpoC proteins.

Paralogs:  Local Blast Search
There is no evidence of paralogs in T. pallidum.

Pfam Summary:  Pfam Search
Residues 4 to 331 (E-value = 1.8e-168) place TP0242 in the RNA_pol_Rpb1_1 family which is described as RNA polymerase Rpb1, domain 1 (PF04997)
Residues 333 to 475 (E-value = 1.9e-84) place TP0242 in the RNA_pol_Rpb1_2 family which is described as RNA polymerase Rpb1, domain 2 (PF00623)
Residues 478 to 622 (E-value = 2.5e-33) place TP0242 in the RNA_pol_Rpb1_3 family which is described as RNA polymerase Rpb1, domain 3 (PF04983)
Residues 647 to 731 (E-value = 1.5e-31) place TP0242 in the RNA_pol_Rpb1_4 family which is described as RNA polymerase Rpb1, domain 4 (PF05000)
Residues 733 to 1326 (E-value = 2.6e-209) place TP0242 in the RNA_pol_Rpb1_5 family which is described as RNA polymerase Rpb1, domain 5 (PF04998)

Structural Feature(s):
Feature Type  Start  Stop
non-globular  
342  
412
coil-coil  
1373  
1400

PDB Hit:
none

Gene Protein Sequence:
MKDIRDFDSLQIKLASPDTIRAWSYGEVKKPETINYRTLRPEREGLFCER
IFGTTKEWECFCGKFKSIRYRGVICDRCGVEVTHFKVRRERMGHIELATP
VSHIWYYRCVPSRMGLLLDLQVIALRSVLYYEKYIVIEPGDTDLKKNQLL
TETEYNDAQERYGGGFTAGMGAEAIRTLLQNLDLDALVAQLREKMMEKGA
KSDKRLLRRIEIVENFRVSGNKPEWMILSVIPVIPPDLRPMVQLDGGRFA
TSDLNDLYRRVIHRNSRLIRLMELKAPDIIIRNEKRMLQEAVDALFDNSK
RKPAIKGASNRPLKSISDMLKGKQGRFRQNLLGKRVDYSGRSVIVVGPEL
KLWQCGLPTKMALELFKPFIMKKLVEKEIVSNIKKAKMLVEQESPKVFSV
LDEVVKEHPVMLNRAPTLHRLGIQAFEPVLVEGKAIRLHPLVCKPFNADF
DGDQMAVHVPLTQAAQMECWTLMLSNRNLLDPANGRTIVYPSQDMVLGLY
YLTKERSLPEGARPRRFSSVEEVMMAAEKGVIGWQDQIQVRYHKCDGQLV
VTTAGRLVLNEEVPAEIPFVNETLDDKRIRKLIERVFKRQDSWLAVQMLD
ALKTIGYTYATFFGATLSMDDIIVPEQKVQMLEKANKEVLAIASQYRGGH
ITQEERYNRVVEVWSKTSEELTSLMMETLERDKDGFNTIYMMATSGARGS
RNQIRQLAGMRGLMAKPSGDIIELPIRSNFKEGLNVIEFFISTNGARKGL
ADTALKTADAGYLTRRLVDIAQDVVVNEEDCGTINGIEYRAVKSGDEIIE
SLAERIVGKYTLERVEHPITHELLLDVNEYIDDERAEKVEEAGVESVKLR
TVLTCESKRGVCVCCYGRNLARNKIVEIGEAVGIVAAQSIGQPGTQLTMR
TFHVGGTASSTTEENRITFKYPILVKSIEGVHVKMEDGSQLFTRRGTLFF
HKTLAEYQLQEGDSVQVRDRARVLKDEVLYHTTDGQTVYASVSGFARIID
RTVYLVGPEQKTEIRNGSNVVIKADEYVPPGKTVATFDPFTEPILAEQDG
FVRYEDIILGSTLIEEVNTETGMVERRITTLKTGIQLQPRVFISDESGNA
LGSYYLPEEARLMVEEGAQVKAGTVIVKLAKAIQKTSDITGGLPRVSELF
EARRPKNAAVLAQISGVVSFKGLFKGKRIVVVRDHYGKEYKHLVSMSRQL
LVRDGDTVEAGERLCDGCFDPHDILAILGENALQNYLMNEIRDVYRVQGV
SINDQHIGLVVRQMLRKTEVVSVGDTRFIYGQQVDKYRFHEENRRVEAEG
GQPAVARPMFQGITKAALNIDSFISAASFQETNKVLTNAAIAGSVDDLCG
LKENVIIGHLIPAGTGMRRYRQVKLFDKNKRDLDVQMEEVIRRRKLEEEA
LAQAVAGMEGEPEGEA

Gene Nucleotide Sequence:  Sequence Viewer
ATGAAGGATATCCGGGATTTTGACAGTTTACAGATAAAGCTTGCCTCCCC
TGATACCATTCGGGCATGGTCCTATGGAGAGGTGAAAAAGCCTGAGACAA
TTAATTACCGCACGTTGCGTCCTGAACGTGAAGGGCTTTTTTGTGAACGC
ATTTTTGGTACTACAAAGGAATGGGAATGCTTTTGTGGAAAGTTTAAGTC
AATTCGGTACCGGGGTGTTATCTGCGATCGGTGCGGGGTGGAGGTAACGC
ATTTCAAGGTTCGCAGGGAGCGCATGGGGCATATTGAGCTTGCAACGCCT
GTTTCTCATATTTGGTACTACCGTTGTGTACCAAGTAGAATGGGTTTGTT
ACTCGATCTACAGGTGATCGCACTGCGTTCTGTTTTGTACTATGAGAAGT
ACATAGTTATAGAGCCGGGCGACACCGATTTAAAAAAGAATCAGTTGCTC
ACTGAAACTGAGTACAATGACGCGCAGGAGCGCTACGGTGGCGGCTTTAC
GGCGGGAATGGGAGCGGAGGCTATCCGTACCCTTTTGCAAAACCTTGACC
TTGACGCGCTTGTTGCACAGTTGCGTGAGAAGATGATGGAGAAGGGTGCG
AAAAGCGACAAACGCTTGCTGCGTCGCATAGAGATCGTAGAAAACTTTCG
GGTGTCGGGAAATAAGCCGGAATGGATGATTTTGAGCGTTATCCCGGTGA
TCCCGCCTGATTTGCGTCCTATGGTGCAGCTCGACGGAGGGCGTTTTGCT
ACCTCAGATCTCAATGACCTGTATCGGCGTGTGATCCACCGCAATAGCCG
TTTGATTCGGCTCATGGAACTGAAGGCGCCGGATATCATCATTCGGAACG
AAAAGCGCATGTTGCAAGAGGCAGTGGACGCGCTTTTTGATAATTCTAAG
CGCAAGCCCGCGATTAAAGGTGCGTCAAACCGGCCGCTTAAGTCTATTTC
TGACATGCTCAAGGGGAAGCAAGGGCGTTTTCGCCAGAATCTTTTGGGCA
AGCGGGTCGACTATTCCGGGCGTTCGGTTATCGTAGTGGGGCCTGAACTT
AAGTTGTGGCAGTGCGGGTTGCCTACAAAAATGGCGCTTGAGCTGTTTAA
GCCCTTTATTATGAAAAAGCTGGTTGAGAAAGAAATTGTCTCGAACATCA
AAAAGGCAAAGATGCTCGTGGAACAAGAGTCGCCGAAGGTATTTTCGGTG
TTGGATGAAGTGGTAAAAGAGCATCCAGTTATGCTTAATCGGGCGCCGAC
ATTGCATCGATTGGGCATTCAGGCTTTTGAGCCGGTGTTGGTGGAGGGGA
AGGCGATTCGTCTTCATCCGCTTGTGTGTAAACCTTTTAATGCTGATTTT
GATGGGGATCAAATGGCGGTGCATGTGCCGCTGACGCAGGCGGCACAGAT
GGAGTGTTGGACGCTCATGTTGTCGAATCGCAATTTGCTTGACCCTGCAA
ATGGGCGCACGATTGTGTATCCATCTCAGGACATGGTTCTGGGTTTGTAT
TATCTGACAAAGGAACGCTCTCTGCCGGAGGGTGCTCGTCCTCGCCGTTT
TTCCTCGGTGGAGGAGGTAATGATGGCTGCGGAAAAGGGGGTAATCGGCT
GGCAGGATCAGATTCAAGTGCGATATCACAAATGTGATGGTCAGCTTGTG
GTCACTACCGCAGGAAGACTTGTGTTGAATGAGGAAGTTCCCGCAGAGAT
TCCTTTTGTCAACGAAACGCTTGATGACAAACGCATCAGGAAATTAATTG
AGCGGGTGTTCAAGCGTCAGGATTCTTGGCTTGCGGTGCAGATGCTCGAT
GCACTGAAAACTATCGGTTATACCTACGCGACCTTCTTTGGTGCAACGCT
CAGTATGGACGACATCATCGTGCCTGAGCAGAAGGTGCAGATGCTCGAAA
AGGCGAACAAGGAAGTGCTAGCGATTGCGAGTCAATACCGCGGGGGGCAC
ATCACGCAAGAGGAGCGTTATAATCGCGTCGTTGAGGTGTGGTCTAAAAC
AAGTGAGGAGCTCACTTCGCTCATGATGGAAACACTTGAGCGCGACAAGG
ATGGATTTAATACCATTTACATGATGGCTACCTCAGGTGCGCGCGGGAGT
CGCAATCAAATCCGCCAACTGGCGGGAATGCGTGGCTTAATGGCAAAGCC
GAGTGGGGATATCATCGAATTGCCTATTCGTTCTAATTTTAAAGAGGGAC
TCAATGTCATTGAGTTTTTTATTTCTACCAACGGTGCACGCAAAGGGCTC
GCAGACACTGCGCTAAAGACCGCTGATGCGGGGTATTTGACACGTCGTCT
GGTTGATATCGCGCAAGATGTGGTGGTGAACGAGGAGGACTGTGGTACCA
TCAATGGCATTGAATATCGCGCGGTGAAGTCCGGCGATGAGATTATTGAA
TCGCTTGCTGAGCGCATCGTAGGAAAGTATACACTTGAACGTGTAGAACA
CCCCATCACCCATGAACTGCTGCTCGATGTGAACGAATACATCGACGATG
AGCGTGCAGAAAAGGTGGAAGAAGCGGGCGTGGAGTCAGTGAAGTTGCGC
ACCGTGCTCACGTGCGAATCTAAGCGAGGAGTGTGTGTGTGCTGCTACGG
GCGGAATCTTGCACGCAACAAAATTGTAGAAATTGGGGAGGCGGTTGGGA
TTGTAGCCGCTCAGTCCATTGGTCAGCCGGGTACGCAGCTGACAATGCGC
ACGTTCCATGTTGGGGGTACGGCAAGCAGTACTACGGAAGAGAACCGCAT
CACGTTTAAGTATCCCATACTGGTAAAGAGTATTGAGGGGGTGCATGTGA
AAATGGAGGATGGCTCTCAGCTGTTCACGCGTCGGGGGACGCTCTTTTTT
CACAAAACTCTGGCAGAGTATCAGCTTCAAGAGGGTGACAGCGTGCAGGT
GCGTGACCGCGCGCGGGTGCTAAAGGATGAGGTTCTCTACCACACCACCG
ATGGGCAGACGGTGTACGCTTCGGTGAGTGGTTTTGCGCGTATAATCGAT
CGAACCGTGTACCTGGTAGGGCCTGAGCAAAAGACGGAAATTCGCAATGG
TTCTAATGTAGTAATCAAGGCAGACGAGTATGTGCCGCCCGGAAAGACCG
TGGCTACGTTTGATCCGTTCACTGAACCTATTTTGGCAGAGCAGGATGGC
TTTGTGCGGTACGAAGATATTATTTTGGGCTCTACGCTCATCGAAGAGGT
AAATACTGAAACGGGGATGGTGGAGCGCAGGATTACGACGTTGAAAACAG
GAATACAGCTTCAACCGCGGGTATTCATCTCTGATGAGTCGGGGAATGCG
CTGGGTTCGTACTACTTGCCAGAGGAAGCGCGCTTGATGGTTGAAGAAGG
CGCGCAGGTGAAGGCGGGTACGGTCATTGTAAAACTGGCAAAAGCAATTC
AAAAGACATCGGATATTACGGGGGGGCTGCCGCGTGTTTCTGAATTATTT
GAAGCGCGGCGCCCTAAGAATGCGGCTGTCTTGGCACAGATTTCTGGGGT
TGTGTCGTTCAAAGGACTGTTTAAGGGTAAGCGTATTGTCGTGGTGCGTG
ACCATTACGGGAAGGAATATAAGCACCTCGTGTCCATGTCGCGTCAGCTT
TTAGTACGTGATGGAGATACGGTTGAGGCAGGCGAACGCTTGTGTGATGG
TTGCTTTGATCCCCATGATATCCTGGCAATTCTGGGTGAAAATGCTTTGC
AAAACTATTTGATGAATGAGATCCGTGACGTGTATCGTGTGCAGGGTGTT
TCAATCAATGACCAGCACATTGGTTTAGTGGTGCGGCAAATGCTACGAAA
GACAGAGGTTGTCTCGGTTGGGGACACGCGTTTTATCTACGGGCAACAGG
TGGATAAGTACCGTTTTCACGAAGAGAACCGTCGGGTTGAAGCGGAAGGG
GGGCAGCCTGCGGTTGCGCGCCCAATGTTCCAGGGTATAACGAAGGCGGC
GTTGAACATAGACTCTTTCATATCTGCGGCATCTTTCCAAGAAACGAACA
AGGTGCTCACCAATGCGGCGATTGCAGGCTCTGTTGATGACTTGTGTGGG
TTGAAGGAGAACGTCATTATAGGGCACTTAATTCCCGCAGGTACGGGGAT
GCGGCGTTATCGTCAGGTGAAGCTGTTTGACAAGAACAAGCGGGATCTTG
ATGTGCAGATGGAGGAAGTTATCAGGCGTAGAAAACTTGAAGAGGAGGCG
CTTGCCCAGGCAGTTGCGGGTATGGAAGGGGAACCTGAAGGCGAAGCG


Los Alamos National Laboratory     
Operated by the University of California for the National Nuclear Security Administration,
of the US Department of Energy.     Copyright © 2001 UC | Disclaimer/Privacy