BLASTP 2.2.1 [Apr-13-2001]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= gi|15644635|ref|NP_206803.1| hypothetical protein
[Helicobacter pylori 26695]
(138 letters)
Database: /home/scwang/download_20020708_db/nr
1,026,957 sequences; 324,149,939 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_206803.1| (NC_000915) hypothetical protein [Helicoba... 271 7e-73
ref|NP_222723.1| (NC_000921) TRANSCRIPTION TERMINATION [Hel... 267 1e-71
ref|NP_281572.1| (NC_002163) transcription termination prot... 124 2e-28
ref|NP_660779.1| (NC_004061) N utilization substance protei... 79 1e-14
ref|NP_213090.1| (NC_000918) transcription termination NusB... 77 4e-14
ref|NP_240274.1| (NC_002528) N utilization substance protei... 74 3e-13
ref|NP_622912.1| (NC_003869) Transcription termination fact... 74 4e-13
ref|NP_470732.1| (NC_003212) similar to transcription termi... 72 1e-12
ref|NP_245667.1| (NC_002663) NusB [Pasteurella multocida] >... 72 1e-12
ref|NP_464884.1| (NC_003210) similar to transcription termi... 72 1e-12
ref|NP_406656.1| (NC_003143) N utilization substance protei... 71 2e-12
ref|NP_602432.1| (NC_003454) N utilization substance protei... 69 7e-12
ref|NP_562740.1| (NC_003366) probable N utilization substan... 69 1e-11
ref|NP_286157.1| (NC_002655) transcription termination; L f... 69 1e-11
ref|NP_219452.1| (NC_000919) N utilization substance protei... 69 1e-11
ref|NP_390312.1| (NC_000964) similar to transcription termi... 69 1e-11
ref|NP_459413.1| (NC_003197) transcription termination; L f... 68 2e-11
ref|NP_455012.1| (NC_003198) N utilization substance protei... 68 2e-11
ref|NP_229562.1| (NC_000853) N utilization substance protei... 67 3e-11
ref|NP_357984.1| (NC_003098) Transcription termination prot... 65 1e-10
ref|NP_344955.1| (NC_003028) N utilization substance protei... 65 1e-10
ref|NP_348703.1| (NC_003030) Transcription termination fact... 65 1e-10
ref|NP_439455.1| (NC_000907) N utilization substance protei... 65 1e-10
ref|NP_266850.1| (NC_002662) transcription termination prot... 65 2e-10
ref|NP_212241.1| (NC_001318) N-utilization substance protei... 63 6e-10
ref|NP_252741.1| (NC_002516) NusB protein [Pseudomonas aeru... 63 6e-10
ref|NP_607890.1| (NC_003485) putative transcriptional termi... 62 8e-10
ref|NP_636090.1| (NC_003902) transcription termination fact... 62 1e-09
ref|NP_372048.1| (NC_002758) hypothetical protein [Staphylo... 62 1e-09
ref|NP_269822.1| (NC_002737) putative transcriptional termi... 61 2e-09
ref|NP_625770.1| (NC_003888) putative NusB-family protein [... 61 2e-09
ref|NP_658218.1| (NC_003995) NusB, NusB family [Bacillus an... 60 3e-09
ref|NP_385322.1| (NC_003047) PUTATIVE N UTILIZATION SUBSTAN... 60 3e-09
ref|NP_662591.1| (NC_002932) N utilization substance protei... 60 3e-09
ref|NP_243651.1| (NC_002570) transcriptional terminator [Ba... 60 3e-09
ref|NP_301448.1| (NC_002677) putative transcription termina... 60 4e-09
ref|NP_641104.1| (NC_003919) transcription termination fact... 60 4e-09
ref|NP_298245.1| (NC_002488) transcription termination fact... 59 7e-09
ref|NP_295790.1| (NC_001263) N-utilization substance protei... 59 9e-09
ref|NP_441814.1| (NC_000911) N utilization substance protei... 58 2e-08
ref|NP_217049.1| (NC_000962) nusB [Mycobacterium tuberculos... 58 2e-08
ref|NP_337104.1| (NC_002755) N utilization substance protei... 58 2e-08
ref|NP_445404.1| (NC_002179) N utilization substance protei... 56 8e-08
ref|NP_225183.1| (NC_000922) CT832 hypothetical protein [Ch... 56 8e-08
ref|NP_531868.1| (NC_003304) N-utilization substance protei... 55 1e-07
ref|NP_354190.1| (NC_003062) AGR_C_2167p [Agrobacterium tum... 55 1e-07
ref|NP_231898.1| (NC_002505) N utilization substance protei... 55 1e-07
ref|NP_108511.1| (NC_002678) N-utilization substance protei... 55 1e-07
ref|NP_420173.1| (NC_002696) N utilization substance protei... 55 2e-07
gb|AAB95441.1| (AF002857) NUSB [Shigella flexneri] 53 7e-07
ref|NP_485800.1| (NC_003272) transcription termination fact... 53 7e-07
ref|NP_273725.1| (NC_003112) N utilization substance protei... 52 1e-06
ref|NP_283676.1| (NC_003116) putative RNA polymerase antite... 51 2e-06
ref|NP_600832.1| (NC_003450) COG0781:Transcription terminat... 51 2e-06
ref|NP_540103.1| (NC_003317) N UTILIZATION SUBSTANCE PROTEI... 50 3e-06
gb|AAF18280.1| (AF088897) N-utilization substance protein B... 48 2e-05
ref|NP_296598.1| (NC_002620) N utilization substance protei... 45 1e-04
pir||T05067 hypothetical protein M3E9.200 - Arabidopsis tha... 45 1e-04
ref|NP_567745.1| (NM_118770) putative protein [Arabidopsis ... 45 1e-04
ref|NP_078134.1| (NC_002162) transcription termination fact... 45 1e-04
ref|NP_359841.1| (NC_003103) N utilization substance protei... 44 3e-04
ref|NP_220353.1| (NC_000117) Transcription termination fact... 42 0.001
ref|NP_518832.1| (NC_003295) PROBABLE N UTILIZATION SUBSTAN... 41 0.002
ref|NP_600813.1| (NC_003450) COG0144:tRNA and rRNA cytosine... 40 0.004
ref|NP_220552.1| (NC_000963) N UTILIZATION SUBSTANCE PROTEI... 40 0.004
sp|Q9ZE01|NUSB_RICPR N utilization substance protein B homo... 40 0.004
dbj|BAB98992.1| (AP005279) tRNA and rRNA cytosine-C5-methyl... 40 0.004
ref|NP_661503.1| (NC_002932) Sun protein [Chlorobium tepidu... 39 0.010
>ref|NP_206803.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
sp|O24853|NUSB_HELPY N utilization substance protein B homolog (NusB protein)
pir||A64520 transcription termination factor NusB - Helicobacter pylori
(strain 26695)
gb|AAD07074.1| (AE000523) H. pylori predicted coding region HP0001 [Helicobacter
pylori 26695]
Length = 138
Score = 271 bits (694), Expect = 7e-73
Identities = 138/138 (100%), Positives = 138/138 (100%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI
Sbjct: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF
Sbjct: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
Query: 121 LNAILDSLSKKLTQKPLN 138
LNAILDSLSKKLTQKPLN
Sbjct: 121 LNAILDSLSKKLTQKPLN 138
>ref|NP_222723.1| (NC_000921) TRANSCRIPTION TERMINATION [Helicobacter pylori J99]
sp|Q9ZN57|NUSB_HELPJ N utilization substance protein B homolog (NusB protein)
pir||C71985 transcription termination factor nusB [similarity] - Helicobacter
pylori (strain J99)
gb|AAD05585.1| (AE001440) TRANSCRIPTION TERMINATION [Helicobacter pylori J99]
Length = 138
Score = 267 bits (683), Expect = 1e-71
Identities = 136/137 (99%), Positives = 136/137 (99%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI
Sbjct: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF
Sbjct: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
Query: 121 LNAILDSLSKKLTQKPL 137
LNAILDSLSKKL QKPL
Sbjct: 121 LNAILDSLSKKLAQKPL 137
>ref|NP_281572.1| (NC_002163) transcription termination protein [Campylobacter
jejuni]
pir||D81381 transcription termination factor nusB Cj0382c [similarity] -
Campylobacter jejuni (strain NCTC 11168)
emb|CAB74218.1| (AL139075) transcription termination protein [Campylobacter jejuni]
Length = 132
Score = 124 bits (311), Expect = 2e-28
Identities = 65/130 (50%), Positives = 86/130 (66%), Gaps = 1/130 (0%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
MATR Q R +V+ LLYAFE N + +L+EKKI+N Q F L+L+NG+L+ +N I
Sbjct: 1 MATRHQVRQSVISLLYAFEL-NSQNNVFVDEILDEKKIRNEQKNFTLNLYNGILDNLNNI 59
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
D + L D LG +E+AILRLGAYE+ FT T + I+INE IEL K A N+PKF
Sbjct: 60 DETLNSFLNDNQITALGHVERAILRLGAYELLFTDTPSAIVINEAIELAKELANDNSPKF 119
Query: 121 LNAILDSLSK 130
+N +LD+L K
Sbjct: 120 INGVLDALIK 129
>ref|NP_660779.1| (NC_004061) N utilization substance protein B [Buchnera aphidicola
str. Sg (Schizaphis graminum)]
gb|AAM67990.1| (AE014121) N utilization substance protein B [Buchnera aphidicola
str. Sg (Schizaphis graminum)]
Length = 138
Score = 78.6 bits (192), Expect = 1e-14
Identities = 47/131 (35%), Positives = 72/131 (54%), Gaps = 4/131 (3%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R +AR +++LY++E IK A L+EK KN + + L G+ ID L
Sbjct: 8 RRKARACALQMLYSWEISQNNIKDSAIEFLKEKNKKNIDIIYFYELIIGITYNCRSIDDL 67
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP--IIINECIELGKLYAEPNTPKFL 121
++P+L K LG +EKAILR+ YE+ + P + INE IEL KL+ ++ KF+
Sbjct: 68 MKPYLSR-SLKELGQIEKAILRISFYEL-YKRKDIPYKVSINEGIELAKLFGSEDSHKFI 125
Query: 122 NAILDSLSKKL 132
N +LD + K+
Sbjct: 126 NGVLDKAALKI 136
>ref|NP_213090.1| (NC_000918) transcription termination NusB [Aquifex aeolicus]
sp|O66530|NUSB_AQUAE N utilization substance protein B homolog (NusB protein)
pir||G70312 transcription termination factor nusB [similarity] - Aquifex
aeolicus
gb|AAC06491.1| (AE000675) transcription termination NusB [Aquifex aeolicus]
Length = 148
Score = 76.6 bits (187), Expect = 4e-14
Identities = 45/132 (34%), Positives = 71/132 (53%), Gaps = 2/132 (1%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINE 59
M R AR +LY ++ E ++ ++EEK IKN +A L + + I E
Sbjct: 1 MRYRKGARDTAFLVLYRWDLRGENPGELFKEVVEEKNIKNKDAYEYAKKLVDTAVRHIEE 60
Query: 60 IDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-IIINECIELGKLYAEPNTP 118
ID++IE HLK W RLG +E+ LRLG E+ F ++ P + + ++L K YA+
Sbjct: 61 IDSIIEKHLKGWSIDRLGYVERNALRLGVAELIFLKSKEPGRVFIDIVDLVKKYADEKAG 120
Query: 119 KFLNAILDSLSK 130
KF+N +L ++ K
Sbjct: 121 KFVNGVLSAIYK 132
>ref|NP_240274.1| (NC_002528) N utilization substance protein B [Buchnera sp. APS]
sp|P57535|NUSB_BUCAI N utilization substance protein B homolog (NusB protein)
dbj|BAB13160.1| (AP001119) N utilization substance protein B [Buchnera sp. APS]
Length = 143
Score = 73.9 bits (180), Expect = 3e-13
Identities = 45/130 (34%), Positives = 71/130 (54%), Gaps = 2/130 (1%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R +AR +++LY++E + IK+ A L+EK KN + + L G+ ID L
Sbjct: 6 RRKARACALQVLYSWEISHNNIKESAIYFLKEKNKKNIDIVYFYELIIGITYDCKNIDNL 65
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYAEPNTPKFLN 122
++P+L K LG +E+AILR+ YE+ + INE IEL KL+ ++ KF+N
Sbjct: 66 MKPYLFR-SLKELGHIERAILRISFYELHKRNDIPYKVSINEGIELAKLFGSEDSHKFIN 124
Query: 123 AILDSLSKKL 132
+LD K+
Sbjct: 125 GVLDKAVFKM 134
>ref|NP_622912.1| (NC_003869) Transcription termination factor [Thermoanaerobacter
tengcongensis]
gb|AAM24516.1| (AE013090) Transcription termination factor [Thermoanaerobacter
tengcongensis]
Length = 140
Score = 73.6 bits (179), Expect = 4e-13
Identities = 41/128 (32%), Positives = 68/128 (53%), Gaps = 1/128 (0%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
RT+AR VV++LY ++ ++KI + EE Q + G +E + EID
Sbjct: 3 RTEAREWVVKMLYQYDVSKLPLEKIFENFYEEHD-PGEQKEYIEGTVRGTVEHLEEIDRE 61
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKFLNA 123
IE + KDW R+ ++ AILR YE+ + I INE +E+ K Y+ ++P F+N
Sbjct: 62 IEKYSKDWPLYRMPRIDLAILRCSMYEMLYGNIPVSISINEAVEIAKKYSTDDSPSFING 121
Query: 124 ILDSLSKK 131
+L + ++
Sbjct: 122 LLGAFVRE 129
>ref|NP_470732.1| (NC_003212) similar to transcription termination protein (NusB)
[Listeria innocua]
emb|CAC96627.1| (AL596168) similar to transcription termination protein (NusB)
[Listeria innocua]
Length = 128
Score = 71.6 bits (174), Expect = 1e-12
Identities = 38/127 (29%), Positives = 71/127 (54%), Gaps = 6/127 (4%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R +AR ++ L+ E + + +++E++ Q + L GV+ EIDA+
Sbjct: 3 RREAREKALQALFQIELNEMSLDQAIKNIMEDE-----QDDYMEQLVEGVMANKAEIDAI 57
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
IEP+L +W RL ++ ++LRL YEI + N + +NE IE+ K+Y++ + KF+N
Sbjct: 58 IEPNLDNWRIDRLNKVDLSLLRLSVYEIKYLDDVPNRVSLNESIEIAKIYSDEKSSKFIN 117
Query: 123 AILDSLS 129
+L +++
Sbjct: 118 GVLANIA 124
>ref|NP_245667.1| (NC_002663) NusB [Pasteurella multocida]
sp|P57868|NUSB_PASMU N utilization substance protein B homolog (NusB protein)
gb|AAK02814.1| (AE006110) NusB [Pasteurella multocida]
Length = 144
Score = 71.6 bits (174), Expect = 1e-12
Identities = 40/136 (29%), Positives = 75/136 (54%), Gaps = 2/136 (1%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
++ R +AR V+ LY++ +I + + E+ +K A+ LF E ++ +
Sbjct: 10 ISPRRRARECAVQALYSWYVSQNSPAEIELNFMAEQDLKGVDTAYFRRLFRQTAENVDAV 69
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPK 119
D ++ P+L D + L +EKAILRL YE+ F ++INE IE+ K++ ++ K
Sbjct: 70 DNIMIPYL-DREVSELDPIEKAILRLAVYELKFELDVPYKVVINEAIEVAKVFGAEDSHK 128
Query: 120 FLNAILDSLSKKLTQK 135
++N +LD ++ L++K
Sbjct: 129 YVNGVLDKVAPVLSRK 144
>ref|NP_464884.1| (NC_003210) similar to transcription termination protein (NusB)
[Listeria monocytogenes EGD-e]
emb|CAC99437.1| (AL591978) similar to transcription termination protein (NusB)
[Listeria monocytogenes]
Length = 128
Score = 71.6 bits (174), Expect = 1e-12
Identities = 38/127 (29%), Positives = 71/127 (54%), Gaps = 6/127 (4%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R +AR ++ L+ E + + +++E++ Q + L GV+ EIDA+
Sbjct: 3 RREAREKALQALFQIELNEMSLDQAIKNIMEDE-----QDDYMEKLVEGVMANKAEIDAI 57
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
IEP+L +W RL ++ ++LRL YEI + N + +NE IE+ K+Y++ + KF+N
Sbjct: 58 IEPNLDNWRMDRLSKVDLSLLRLSVYEIKYLDDVPNRVSLNESIEIAKIYSDEKSSKFIN 117
Query: 123 AILDSLS 129
+L +++
Sbjct: 118 GVLANIA 124
>ref|NP_406656.1| (NC_003143) N utilization substance protein B [Yersinia pestis]
emb|CAC92416.1| (AJ414155) N utilization substance protein B [Yersinia pestis]
Length = 138
Score = 71.2 bits (173), Expect = 2e-12
Identities = 39/135 (28%), Positives = 72/135 (52%), Gaps = 2/135 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
A R +AR V+ LY+++ +I + L E+ +K+ +A+ L +GV +D
Sbjct: 4 AARRRARECAVQALYSWQLSKNDIADVELQFLSEQDVKDVDIAYFRELLSGVAVNAASLD 63
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEIG-FTPTQNPIIINECIELGKLYAEPNTPKF 120
AL+ P L + LG +E+A+LR+ +E+ + INE IEL K + ++ KF
Sbjct: 64 ALMAPFLSR-QLEELGQVERAVLRIALFELSKRDDVPYKVAINEAIELAKTFGAEDSHKF 122
Query: 121 LNAILDSLSKKLTQK 135
+N +LD ++ + ++
Sbjct: 123 VNGVLDKVAPTVRKR 137
>ref|NP_602432.1| (NC_003454) N utilization substance protein B [Fusobacterium
nucleatum subsp. nucleatum ATCC 25586]
gb|AAL93731.1| (AE010469) N utilization substance protein B [Fusobacterium
nucleatum subsp. nucleatum ATCC 25586]
Length = 153
Score = 69.3 bits (168), Expect = 7e-12
Identities = 41/132 (31%), Positives = 72/132 (54%), Gaps = 8/132 (6%)
Query: 7 ARGAVVELLY---AFESGNEEIKKIASSMLEEKK-----IKNNQLAFALSLFNGVLEKIN 58
AR V +L++ A ES +EE+K+ L+ + + NQL F S +G+ + +
Sbjct: 19 AREEVFKLVFGVEATESASEELKQAFDIYLQNSEELIGTLNENQLEFLKSSIDGIAKNYD 78
Query: 59 EIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTP 118
I +I+ + ++W ++R+G +E+A+L + YE F +I NE IEL K Y +
Sbjct: 79 NIKDIIKKNTQNWAYERIGVVERALLIVATYEFIFKNAPIEVIANEIIELAKEYGNEKSY 138
Query: 119 KFLNAILDSLSK 130
+F+N IL ++ K
Sbjct: 139 EFVNGILANIEK 150
>ref|NP_562740.1| (NC_003366) probable N utilization substance protein B [Clostridium
perfringens]
dbj|BAB81530.1| (AP003191) probable N utilization substance protein B [Clostridium
perfringens]
Length = 135
Score = 68.6 bits (166), Expect = 1e-11
Identities = 41/132 (31%), Positives = 69/132 (52%), Gaps = 3/132 (2%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL--AFALSLFNGVLEKINEID 61
R ++R +++L Y E +E + +S +E + I + L A+ S G+ E ++D
Sbjct: 3 RVKSREYLLQLAYQMEITSETALETFNSFMENEDISKDDLDLAYIKSGLLGIEENKEKLD 62
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKF 120
+LIE L W R+ + +ILR+ YEI F + INE IEL K Y++ + F
Sbjct: 63 SLIESQLVKWKLNRISKVNLSILRISTYEILFAEDVPGKVSINEAIELCKKYSDNKSVSF 122
Query: 121 LNAILDSLSKKL 132
+N +LD + K +
Sbjct: 123 INGVLDKVYKNM 134
>ref|NP_286157.1| (NC_002655) transcription termination; L factor [Escherichia coli
O157:H7 EDL933]
ref|NP_308496.1| (NC_002695) transcription termination factor NusB [Escherichia coli
O157:H7]
ref|NP_414950.1| (NC_000913) transcription termination; L factor [Escherichia coli
K12]
sp|P04381|NUSB_ECOLI N utilization substance protein B (NusB protein)
pir||I51822 nusB protein - Escherichia coli
pir||FJECB transcription termination factor nusB [validated] - Escherichia
coli
pdb|1BAQ| Antitermination Factor Nusb From Escherichia Coli, Nmr, 18
Structures
pdb|1EY1|A Chain A, Solution Structure Of Escherichia Coli Nusb
gb|AAA24228.1| (M26839) nusB [Escherichia coli]
emb|CAA45737.1| (X64395) nusB (ssyB) [Escherichia coli]
emb|CAA25289.1| (X00681) nusB protein [Escherichia coli]
gb|AAB40172.1| (U82664) N utilization substance protein B [Escherichia coli]
gb|AAC73519.1| (AE000148) transcription termination; L factor [Escherichia coli
K12]
gb|AAG54765.1|AE005221_2 (AE005221) transcription termination; L factor [Escherichia coli
O157:H7 EDL933]
dbj|BAB33892.1| (AP002551) transcription termination factor NusB [Escherichia coli
O157:H7]
emb|CAC44764.1| (AJ313516) N utilisation substance protein B [Expression vector
pNCO113-nusB/nusE]
prf||2111328A NusB protein [Escherichia coli]
Length = 139
Score = 68.6 bits (166), Expect = 1e-11
Identities = 39/126 (30%), Positives = 67/126 (52%), Gaps = 2/126 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
A R +AR V+ LY+++ +I + L E+ +K+ + + L GV +D
Sbjct: 4 AARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLAGVATNTAYLD 63
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKF 120
L++P+L + LG +EKA+LR+ YE+ + + INE IEL K + ++ KF
Sbjct: 64 GLMKPYLSRL-LEELGQVEKAVLRIALYELSKRSDVPYKVAINEAIELAKSFGAEDSHKF 122
Query: 121 LNAILD 126
+N +LD
Sbjct: 123 VNGVLD 128
>ref|NP_219452.1| (NC_000919) N utilization substance protein B (nusB) [Treponema
pallidum]
sp|O83979|NUSB_TREPA N utilization substance protein B homolog (NusB protein)
pir||C71253 probable transcription termination factor nusB - syphilis
spirochete
gb|AAC65965.1| (AE001269) N utilization substance protein B (nusB) [Treponema
pallidum]
Length = 141
Score = 68.6 bits (166), Expect = 1e-11
Identities = 33/89 (37%), Positives = 55/89 (61%), Gaps = 1/89 (1%)
Query: 43 LAFALSLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-II 101
L F+ LF G LE + EID + L+ WDF RL ++KAILRL AY + F P ++
Sbjct: 51 LGFSRLLFLGTLEHLREIDGCVSSRLEHWDFVRLNKVDKAILRLSAYSLLFQKDIPPVVV 110
Query: 102 INECIELGKLYAEPNTPKFLNAILDSLSK 130
I+E + + + + ++ +F+N +LD+++K
Sbjct: 111 IHEAVSIARDFGTDDSFRFVNGVLDNIAK 139
>ref|NP_390312.1| (NC_000964) similar to transcription termination [Bacillus
subtilis]
sp|P54520|NUSB_BACSU N utilization substance protein B homolog (NusB protein)
pir||F69960 transcription termination factor nusB homolog yqhZ [similarity] -
Bacillus subtilis
dbj|BAA12571.1| (D84432) YqhZ [Bacillus subtilis]
emb|CAB14363.1| (Z99116) similar to transcription termination [Bacillus subtilis]
Length = 131
Score = 68.6 bits (166), Expect = 1e-11
Identities = 38/132 (28%), Positives = 69/132 (51%), Gaps = 5/132 (3%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R AR ++ L+ + + + + L+E+K F L +GVLE +++D +
Sbjct: 3 RRTAREKALQALFQIDVSDIAVNEAIEHALDEEKTD----PFFEQLVHGVLEHQDQLDEM 58
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKFLN 122
I HL +W R+ ++++AILRL AYE+ + + +NE IEL K + + KF+N
Sbjct: 59 ISKHLVNWKLDRIANVDRAILRLAAYEMAYAEDIPVNVSMNEAIELAKRFGDDKATKFVN 118
Query: 123 AILDSLSKKLTQ 134
+L ++ + Q
Sbjct: 119 GVLSNIKSDIGQ 130
>ref|NP_459413.1| (NC_003197) transcription termination; L factor [Salmonella
typhimurium LT2]
gb|AAL19372.1| (AE008715) transcription termination; L factor [Salmonella
typhimurium LT2]
Length = 139
Score = 68.2 bits (165), Expect = 2e-11
Identities = 38/126 (30%), Positives = 68/126 (53%), Gaps = 2/126 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
A R +AR V+ LY+++ +I + L E+ +K+ + + L +GV +D
Sbjct: 4 AARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLSGVATNSAYLD 63
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEIG-FTPTQNPIIINECIELGKLYAEPNTPKF 120
L++P+L + LG +EKA+LR+ +E+ + + INE IEL K + ++ KF
Sbjct: 64 GLMKPYLSRL-LEELGQVEKAVLRIALFELSKRSDVPYKVAINEAIELAKTFGAEDSHKF 122
Query: 121 LNAILD 126
+N +LD
Sbjct: 123 VNGVLD 128
>ref|NP_455012.1| (NC_003198) N utilization substance protein B [Salmonella enterica
subsp. enterica serovar Typhi]
emb|CAD08874.1| (AL627266) N utilization substance protein B [Salmonella enterica
subsp. enterica serovar Typhi]
Length = 139
Score = 67.8 bits (164), Expect = 2e-11
Identities = 38/126 (30%), Positives = 68/126 (53%), Gaps = 2/126 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
A R +AR V+ LY+++ +I + L E+ +K+ + + L +GV +D
Sbjct: 4 AARHRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLSGVATNSAYLD 63
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEIG-FTPTQNPIIINECIELGKLYAEPNTPKF 120
L++P+L + LG +EKA+LR+ +E+ + + INE IEL K + ++ KF
Sbjct: 64 GLMKPYLSRL-LEELGQVEKAVLRIALFELSKRSDVPYKVAINEAIELAKTFGAEDSHKF 122
Query: 121 LNAILD 126
+N +LD
Sbjct: 123 VNGVLD 128
>ref|NP_229562.1| (NC_000853) N utilization substance protein B [Thermotoga maritima]
sp|Q9X286|NUSB_THEMA N utilization substance protein B homolog (NusB protein)
pir||D72212 transcription termination factor nusB TM1765 [similarity] -
Thermotoga maritima (strain MSB8)
gb|AAD36829.1|AE001815_3 (AE001815) N utilization substance protein B [Thermotoga maritima]
Length = 142
Score = 67.4 bits (163), Expect = 3e-11
Identities = 45/137 (32%), Positives = 74/137 (53%), Gaps = 9/137 (6%)
Query: 4 RTQARGAVVELLYAFE-SGNEEIKKIASSMLEE---KKIKNNQLAFALSLFNGVLEKINE 59
R + R AV + L+ E +E++++I +L+E KK K + A G+ E ++
Sbjct: 5 RRRMRLAVFKALFQHEFRRDEDLEQILEEILDETYDKKAKED----ARRYIRGIKENLSM 60
Query: 60 IDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTP 118
ID LI +L+ W RL +++ +LRL YE+ F + I+E IE+ K Y N+
Sbjct: 61 IDDLISRYLEKWSLNRLSVVDRNVLRLATYELLFEKDIPIEVTIDEAIEIAKRYGTENSG 120
Query: 119 KFLNAILDSLSKKLTQK 135
KF+N ILD ++K+ K
Sbjct: 121 KFVNGILDRIAKEHAPK 137
>ref|NP_357984.1| (NC_003098) Transcription termination protein [Streptococcus
pneumoniae R6]
gb|AAK99194.1| (AE008419) Transcription termination protein [Streptococcus
pneumoniae R6]
Length = 146
Score = 65.5 bits (158), Expect = 1e-10
Identities = 40/127 (31%), Positives = 67/127 (52%), Gaps = 2/127 (1%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINE 59
+ +R Q R + L + E G + + +++ + QL AF + L +GV K E
Sbjct: 12 LESRRQLRKCAFQALMSLEFGTDVETACRFAYTHDREDTDVQLPAFLIDLVSGVQAKKEE 71
Query: 60 IDALIEPHLK-DWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTP 118
+D I HLK W +RL +E+ +LRLG +EI T + +NE IEL K +++ +
Sbjct: 72 LDKQITQHLKAGWTIERLTLVERNLLRLGVFEITSFDTPQLVAVNEAIELAKDFSDQKSA 131
Query: 119 KFLNAIL 125
+F+N +L
Sbjct: 132 RFINGLL 138
>ref|NP_344955.1| (NC_003028) N utilization substance protein B [Streptococcus
pneumoniae TIGR4]
gb|AAK74595.1| (AE007354) N utilization substance protein B [Streptococcus
pneumoniae TIGR4]
Length = 140
Score = 65.5 bits (158), Expect = 1e-10
Identities = 40/127 (31%), Positives = 67/127 (52%), Gaps = 2/127 (1%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINE 59
+ +R Q R + L + E G + + +++ + QL AF + L +GV K E
Sbjct: 6 LESRRQLRKCAFQALMSLEFGTDVETACRFAYTHDREDTDVQLPAFLIDLVSGVQAKKEE 65
Query: 60 IDALIEPHLK-DWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTP 118
+D I HLK W +RL +E+ +LRLG +EI T + +NE IEL K +++ +
Sbjct: 66 LDKQITQHLKAGWTIERLTLVERNLLRLGVFEITSFDTPQLVAVNEAIELAKDFSDQKSA 125
Query: 119 KFLNAIL 125
+F+N +L
Sbjct: 126 RFINGLL 132
>ref|NP_348703.1| (NC_003030) Transcription termination factor NusB [Clostridium
acetobutylicum]
gb|AAK80043.1|AE007710_13 (AE007710) Transcription termination factor NusB [Clostridium
acetobutylicum]
Length = 135
Score = 65.1 bits (157), Expect = 1e-10
Identities = 38/133 (28%), Positives = 61/133 (45%), Gaps = 1/133 (0%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R ++R ++LL+ I E +I+N + + G+ E + ID+
Sbjct: 3 RKKSREVAMKLLFEISINKNSISDTIEHYKENNEIENLDFEYIERILRGIDENMEYIDSK 62
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
IE K W R+ + ILR+ AYEI F + NE +EL K YAE N+ F+N
Sbjct: 63 IEESSKKWKISRISKINITILRMAAYEIFFEKDIPCKVSANEAVELAKSYAEENSFSFVN 122
Query: 123 AILDSLSKKLTQK 135
++ +L +K
Sbjct: 123 GVIGNLINSSEEK 135
>ref|NP_439455.1| (NC_000907) N utilization substance protein B (nusB) [Haemophilus
influenzae Rd]
sp|P45150|NUSB_HAEIN N utilization substance protein B homolog (NusB protein)
pir||D64115 transcription termination factor nusB - Haemophilus influenzae
(strain Rd KW20)
gb|AAC22951.1| (U32810) N utilization substance protein B (nusB) [Haemophilus
influenzae Rd]
Length = 144
Score = 65.1 bits (157), Expect = 1e-10
Identities = 37/135 (27%), Positives = 69/135 (50%), Gaps = 2/135 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
+ R +AR V+ LY++ +++ + + ++ + + LF +E I +D
Sbjct: 11 SARRRARECTVQALYSWAVSGNTAEQVELAFVLDQDMDGVDKPYFRKLFRQTIENIETVD 70
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKF 120
I P++ D F L +E AILRL YE+ F ++INE IE+ K++ + K+
Sbjct: 71 FSISPYI-DRAFDELDPIETAILRLAVYELRFELDVPYKVVINEAIEVAKVFGADESHKY 129
Query: 121 LNAILDSLSKKLTQK 135
+N +LD ++ L +K
Sbjct: 130 INGVLDKIAPALGRK 144
>ref|NP_266850.1| (NC_002662) transcription termination protein NusB [Lactococcus
lactis subsp. lactis]
gb|AAK04792.1|AE006302_10 (AE006302) transcription termination protein NusB [Lactococcus
lactis subsp. lactis]
Length = 323
Score = 64.7 bits (156), Expect = 2e-10
Identities = 31/83 (37%), Positives = 55/83 (65%), Gaps = 1/83 (1%)
Query: 49 LFNGVLEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIE 107
L +GVL+K +++A I +L K W F RL +E+AIL++ +YEI +T T + + +NE +E
Sbjct: 241 LVDGVLDKKEDLEANISKYLTKTWSFSRLTLVEQAILQVSSYEILYTETPDVVAVNEAVE 300
Query: 108 LGKLYAEPNTPKFLNAILDSLSK 130
L K +++ + +F+N +L + K
Sbjct: 301 LSKDFSDEKSSRFINGVLTNFLK 323
>ref|NP_212241.1| (NC_001318) N-utilization substance protein B (nusB) [Borrelia
burgdorferi]
sp|O51134|NUSB_BORBU N utilization substance protein B homolog (NusB protein)
pir||C70113 probable transcription termination factor nusB - Lyme disease
spirochete
gb|AAC66498.1| (AE001123) N-utilization substance protein B (nusB) [Borrelia
burgdorferi]
Length = 145
Score = 62.8 bits (151), Expect = 6e-10
Identities = 38/117 (32%), Positives = 64/117 (54%), Gaps = 3/117 (2%)
Query: 19 ESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINEIDALIEPHLKDWDFKRLG 77
+S ++I I + ++ I+N + +F SL G + + ID+LI +W +R+
Sbjct: 22 QSAMDDIFDIFNIEDKDLDIENESIKSFYSSLVIGTFDNLEHIDSLIRDISLNWSLERMD 81
Query: 78 SMEKAILRLGAYEIGFTPTQNP--IIINECIELGKLYAEPNTPKFLNAILDSLSKKL 132
++ AILR+G Y + F +N II+E I + K Y N+ KF+N ILD+L K +
Sbjct: 82 KVDLAILRMGVYSLKFQNFENSKRAIIDEAILIAKKYGSKNSYKFINGILDALLKNM 138
>ref|NP_252741.1| (NC_002516) NusB protein [Pseudomonas aeruginosa]
pir||G83140 NusB protein PA4052 [imported] - Pseudomonas aeruginosa (strain
PAO1)
gb|AAG07439.1|AE004821_12 (AE004821) NusB protein [Pseudomonas aeruginosa]
Length = 159
Score = 62.8 bits (151), Expect = 6e-10
Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 2/132 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
A R +AR V+ LY+++ + + +I + + A+ + +GV + +E+D
Sbjct: 19 AARRKARSLAVQALYSWQIAGQPLHEIEAQFRTDNDFSEVDGAYFHEILHGVPRQKSELD 78
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYAEPNTPKF 120
+ EP L D + +E AILRL YE+ ++INE IEL K + + KF
Sbjct: 79 STFEPCL-DRPLAEIDPVELAILRLSTYELRNRIDVPYKVVINEGIELAKTFGATDGHKF 137
Query: 121 LNAILDSLSKKL 132
+N +LD L+ +L
Sbjct: 138 VNGVLDKLAPRL 149
>ref|NP_607890.1| (NC_003485) putative transcriptional terminator [Streptococcus
pyogenes MGAS8232]
gb|AAL98389.1| (AE010095) putative transcriptional terminator [Streptococcus
pyogenes MGAS8232]
Length = 150
Score = 62.4 bits (150), Expect = 8e-10
Identities = 35/83 (42%), Positives = 49/83 (58%), Gaps = 2/83 (2%)
Query: 45 FALSLFNGVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIG-FTPTQNPIII 102
F LSL GV E+D LI HLK W +RL +K +LRLG +EI F T + + +
Sbjct: 55 FLLSLVTGVNNHKEELDNLISTHLKKGWSLERLTLTDKTLLRLGLFEIKYFDETPDRVAL 114
Query: 103 NECIELGKLYAEPNTPKFLNAIL 125
NE IE+ K Y++ + KF+N +L
Sbjct: 115 NEIIEVAKKYSDETSAKFINGLL 137
>ref|NP_636090.1| (NC_003902) transcription termination factor NusB [Xanthomonas
campestris pv. campestris str. ATCC 33913]
gb|AAM40014.1| (AE012168) transcription termination factor NusB [Xanthomonas
campestris pv. campestris str. ATCC 33913]
Length = 159
Score = 62.0 bits (149), Expect = 1e-09
Identities = 36/124 (29%), Positives = 68/124 (54%), Gaps = 2/124 (1%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R++AR ++ +YA++ K++ + E+ + LA+ SL GVL +E+D
Sbjct: 21 RSRARRRALQAVYAWQIAGGFAKQVIAQFAHEQAHEVADLAYFESLVEGVLSNRSELDTA 80
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
+ P+L D + + ++E+A+LRL AYE+ + ++INE IE K + + ++N
Sbjct: 81 LTPYL-DRGVEEVDAIERAVLRLAAYELLYRQDVPYRVVINEAIETAKRFGSEHGHTYVN 139
Query: 123 AILD 126
+LD
Sbjct: 140 GVLD 143
>ref|NP_372048.1| (NC_002758) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
ref|NP_374638.1| (NC_002745) hypothetical protein, simialr to transcription
termination factor [Staphylococcus aureus subsp. aureus
N315]
ref|NP_646294.1| (NC_003923) ORFID:MW1477~hypothetical protein, similar to
transcription termination factor [Staphylococcus aureus
subsp. aureus MW2]
dbj|BAB42617.1| (AP003134) ORFID:SA1355~hypothetical protein, similar to
transcription termination factor [Staphylococcus aureus
subsp. aureus N315]
dbj|BAB57686.1| (AP003362) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
dbj|BAB95342.1| (AP004827) ORFID:MW1477~hypothetical protein, similar to
transcription termination factor [Staphylococcus aureus
subsp. aureus MW2]
Length = 129
Score = 62.0 bits (149), Expect = 1e-09
Identities = 29/82 (35%), Positives = 49/82 (59%)
Query: 49 LFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIEL 108
L +GV + +D I P+LKDW RL ++ ILR+ YEI + T +++NE +EL
Sbjct: 48 LVSGVKDHEPVLDETISPYLKDWTIARLLKTDRIILRMATYEILHSDTPAKVVMNEAVEL 107
Query: 109 GKLYAEPNTPKFLNAILDSLSK 130
K +++ + KF+N +L ++ K
Sbjct: 108 TKQFSDDDHYKFINGVLSNIKK 129
>ref|NP_269822.1| (NC_002737) putative transcriptional terminator [Streptococcus
pyogenes] [Streptococcus pyogenes M1 GAS]
gb|AAK34543.1| (AE006609) putative transcriptional terminator [Streptococcus
pyogenes M1 GAS]
Length = 150
Score = 61.2 bits (147), Expect = 2e-09
Identities = 35/83 (42%), Positives = 49/83 (58%), Gaps = 2/83 (2%)
Query: 45 FALSLFNGVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIG-FTPTQNPIII 102
F LSL GV E+D LI HLK W +RL +K +LRLG +EI F T + + +
Sbjct: 55 FLLSLVTGVNNHKEELDNLISTHLKKGWSLERLTLTDKTLLRLGLFEIKYFDKTPDRVAL 114
Query: 103 NECIELGKLYAEPNTPKFLNAIL 125
NE IE+ K Y++ + KF+N +L
Sbjct: 115 NEIIEVVKKYSDETSAKFINGLL 137
>ref|NP_625770.1| (NC_003888) putative NusB-family protein [Streptomyces coelicolor
A3(2)]
emb|CAB93370.1| (AL357523) putative NusB-family protein [Streptomyces coelicolor
A3(2)]
Length = 142
Score = 60.8 bits (146), Expect = 2e-09
Identities = 33/134 (24%), Positives = 66/134 (48%), Gaps = 4/134 (2%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQ---LAFALSLFNGVLEKI 57
MA R AR ++L+ + ++ + + + + Q + + L G +
Sbjct: 1 MAARNTARKRAFQILFEGDQRGADVLTVLADWVRHSRSDTRQPPVSEYTMELVEGYAGRA 60
Query: 58 NEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPN 116
ID LI + DW R+ +++ ILRLGAYE+ + T + ++++E ++L K ++
Sbjct: 61 ERIDELIAQYSVDWTLDRMPVVDRNILRLGAYELLWVDATPDAVVLDEMVQLAKEFSTDE 120
Query: 117 TPKFLNAILDSLSK 130
+P F+N +L L +
Sbjct: 121 SPAFINGLLGRLKE 134
>ref|NP_658218.1| (NC_003995) NusB, NusB family [Bacillus anthracis A2012] [Bacillus
anthracis str. A2012]
Length = 130
Score = 60.5 bits (145), Expect = 3e-09
Identities = 38/131 (29%), Positives = 67/131 (51%), Gaps = 5/131 (3%)
Query: 4 RTQARGAVVELLYAFE-SGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDA 62
R AR ++ LY + +G E K + L+E + N F SL G +E ID
Sbjct: 3 RRTARERAMQALYQMDITGELEPKVAVENTLDEGEETNE---FLESLVVGFVENKEVIDE 59
Query: 63 LIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFL 121
I +LK W +R+ ++++ILR+ YE+ + + + INE IE+ K + + + +F+
Sbjct: 60 AIRQNLKKWKLERISIVDRSILRVAVYEMKYMEEIPHNVTINEAIEIAKTFGDEESRRFI 119
Query: 122 NAILDSLSKKL 132
N +L ++ L
Sbjct: 120 NGVLSNIKDTL 130
>ref|NP_385322.1| (NC_003047) PUTATIVE N UTILIZATION SUBSTANCE PROTEIN B
[Sinorhizobium meliloti]
emb|CAC45795.1| (AL591786) PUTATIVE N UTILIZATION SUBSTANCE PROTEIN B
[Sinorhizobium meliloti]
Length = 160
Score = 60.5 bits (145), Expect = 3e-09
Identities = 42/139 (30%), Positives = 69/139 (49%), Gaps = 10/139 (7%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKN--------NQLAFALSLFNGVLE 55
R AR A V+ LY + G + +I + E + K ++ S+ GV+
Sbjct: 16 RGAARLAAVQALYQMDVGGTGVLEIVAEYEEHRLGKELDGDTYLRADASWFRSIVAGVVR 75
Query: 56 KINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
++D LI L+D W RL S +AILR G +EI P+I+ E +E+ K +
Sbjct: 76 DQRKLDPLIGSALQDDWALSRLDSTVRAILRAGTFEILERKDVPVPVIVTEYVEIAKAFF 135
Query: 114 EPNTPKFLNAILDSLSKKL 132
+ PK +NA+LD ++K++
Sbjct: 136 QDEEPKLVNAVLDRIAKQV 154
>ref|NP_662591.1| (NC_002932) N utilization substance protein B [Chlorobium tepidum
TLS]
gb|AAM72933.1| (AE012925) N utilization substance protein B [Chlorobium tepidum
TLS]
Length = 164
Score = 60.5 bits (145), Expect = 3e-09
Identities = 38/128 (29%), Positives = 67/128 (51%), Gaps = 3/128 (2%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKN-NQLAFALSLFNGVLEKINEIDA 62
R Q R +++ LY E + + A+ +L ++ + + N + F L ++ EID
Sbjct: 5 RRQLREKIIQALYTLELRDVDTDSAANWLLTKEIMDDPNAMKFFNHLMQSIVRNREEIDR 64
Query: 63 LIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-IIINECIELGKLYAEPN-TPKF 120
I H +WD R+ ++K ILR+ EI + P + INE IE+ K ++ + + KF
Sbjct: 65 YIAKHTFNWDMSRIAIIDKNILRMALAEILYCEDIPPKVSINEAIEIAKKFSSTDKSSKF 124
Query: 121 LNAILDSL 128
+N ILD++
Sbjct: 125 VNGILDAI 132
>ref|NP_243651.1| (NC_002570) transcriptional terminator [Bacillus halodurans]
sp|Q9K965|NUSB_BACHD N utilization substance protein B homolog (NusB protein)
dbj|BAB06504.1| (AP001516) transcriptional terminator [Bacillus halodurans]
Length = 134
Score = 60.5 bits (145), Expect = 3e-09
Identities = 35/123 (28%), Positives = 64/123 (51%), Gaps = 4/123 (3%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R +R V+ LY + + ++K S+L+E + ++ F L +G + E+D L
Sbjct: 3 RRLSRLRAVQALYQMDVIDTSMEKAIESVLDEGEEASS---FMSDLVSGTVTHQEELDRL 59
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKFLN 122
HL+ W R+G++++AILR+ YE+ + + NE IEL K + + +F+N
Sbjct: 60 YADHLQGWTVDRIGNVDRAILRMALYELYYVDDIPKNVSFNEAIELAKAFGGEDAGRFIN 119
Query: 123 AIL 125
+L
Sbjct: 120 GVL 122
>ref|NP_301448.1| (NC_002677) putative transcription termination protein
[Mycobacterium leprae]
sp|Q9CCR9|NUSB_MYCLE N utilization substance protein B homolog (NusB protein)
emb|CAC30031.1| (AL583918) putative transcription termination protein
[Mycobacterium leprae]
Length = 190
Score = 60.1 bits (144), Expect = 4e-09
Identities = 38/126 (30%), Positives = 68/126 (53%), Gaps = 4/126 (3%)
Query: 4 RTQARGAVVELLYAFESGNE---EIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
R QAR V+LL+ E+ + EI ++ S++ + K + + + GV E I
Sbjct: 8 RHQARKRAVDLLFEAEARDLSPLEIIEVRSALAKSKLDVAPLHPYTVVVAQGVSEHTARI 67
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPK 119
D LI HL+ W RL ++++AILR+ +E+ + P+ ++E +EL K + ++P
Sbjct: 68 DELIISHLQGWKLDRLPAVDRAILRVSIWELLYADDVPEPVAVDEAVELAKELSTDDSPG 127
Query: 120 FLNAIL 125
F+N +L
Sbjct: 128 FVNGLL 133
>ref|NP_641104.1| (NC_003919) transcription termination factor NusB [Xanthomonas
axonopodis pv. citri str. 306]
gb|AAM35640.1| (AE011705) transcription termination factor NusB [Xanthomonas
axonopodis pv. citri str. 306]
Length = 159
Score = 60.1 bits (144), Expect = 4e-09
Identities = 35/124 (28%), Positives = 67/124 (53%), Gaps = 2/124 (1%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R++AR ++ +YA++ K++ + E+ + LA+ +L GVL E+D
Sbjct: 21 RSRARRRALQAVYAWQISGGFAKQVIAQFAHEQAHEVADLAYFENLVEGVLSNRAELDTA 80
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
+ P+L D + + ++E+A+LRL AYE+ + ++INE IE K + + ++N
Sbjct: 81 LTPYL-DRSVEEVDAIERAVLRLAAYELLYRQDVPYRVVINEAIETAKRFGSEHGHTYVN 139
Query: 123 AILD 126
+LD
Sbjct: 140 GVLD 143
>ref|NP_298245.1| (NC_002488) transcription termination factor [Xylella fastidiosa
9a5c]
pir||D82741 transcription termination factor XF0955 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF83765.1|AE003934_7 (AE003934) transcription termination factor [Xylella fastidiosa
9a5c]
Length = 157
Score = 59.3 bits (142), Expect = 7e-09
Identities = 36/126 (28%), Positives = 66/126 (51%), Gaps = 2/126 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
A R++AR ++ +YA++ K++ + E+ + LA+ L GVL E+D
Sbjct: 20 ALRSRARRRALQAVYAWQISGGVAKQVIAHFAHEQAYEVADLAYFEDLVEGVLTHCAELD 79
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKF 120
+ P+L D + + ++E+A+LRLGAYE+ + ++INE I K + +
Sbjct: 80 EKLTPYL-DRTIEEVDAIERAVLRLGAYELLYRQDVPYRVVINEAIMTAKRFGSKYGHTY 138
Query: 121 LNAILD 126
+N +LD
Sbjct: 139 VNGVLD 144
>ref|NP_295790.1| (NC_001263) N-utilization substance protein B [Deinococcus
radiodurans]
pir||E75318 transcription termination factor nusB DR2067 [similarity] -
Deinococcus radiodurans (strain R1)
gb|AAF11617.1|AE002043_2 (AE002043) N-utilization substance protein B [Deinococcus
radiodurans]
Length = 192
Score = 58.9 bits (141), Expect = 9e-09
Identities = 33/143 (23%), Positives = 71/143 (49%), Gaps = 8/143 (5%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKI---ASSMLEE-----KKIKNNQLAFALSLFNG 52
+ TR AR V +L+ + G+ ++ + A ++ E ++ + L FA L G
Sbjct: 12 VGTRRAAREFVFRVLFEADRGDVPLQAVFTRAEGVMREGDDTFPQLGPDALHFAEELVTG 71
Query: 53 VLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLY 112
+ ID ++ ++ W F ++ + +LRL YE+ +TP +P +I + + + +
Sbjct: 72 LERHREAIDDVLHRTIRGWTFDQMAQTDLNVLRLATYELMYTPEPHPPVIESAVRIARKF 131
Query: 113 AEPNTPKFLNAILDSLSKKLTQK 135
++ +F+N +L LS+ L ++
Sbjct: 132 GGDDSGRFVNGVLAGLSRNLREE 154
>ref|NP_441814.1| (NC_000911) N utilization substance protein B [Synechocystis sp.
PCC 6803]
sp|P74395|NUSB_SYNY3 N utilization substance protein B homolog (NusB protein)
pir||S76233 transcription termination factor nusB sll0271 [similarity] -
Synechocystis sp. (strain PCC 6803)
dbj|BAA18492.1| (D90914) N utilization substance protein B [Synechocystis sp. PCC
6803]
Length = 275
Score = 58.2 bits (139), Expect = 2e-08
Identities = 28/89 (31%), Positives = 48/89 (53%)
Query: 45 FALSLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINE 104
FAL L V + +ID ++ + DW RL +++ ILRL E+ + + INE
Sbjct: 176 FALELIGTVCRRRQQIDEQLQEAMVDWQLSRLAKIDQDILRLAIAELDYLGVPQKVAINE 235
Query: 105 CIELGKLYAEPNTPKFLNAILDSLSKKLT 133
+EL K Y+ + +F+N +L +++K T
Sbjct: 236 AVELAKRYSGQDGHRFINGVLRRVTEKKT 264
>ref|NP_217049.1| (NC_000962) nusB [Mycobacterium tuberculosis H37Rv]
sp|P95020|NUSB_MYCTU N utilization substance protein B homolog (NusB protein)
pir||A70658 transcription termination factor nusB [similarity] - Mycobacterium
tuberculosis (strain H37RV)
pdb|1EYV|B Chain B, The Crystal Structure Of Nusb From Mycobacterium
Tuberculosis
pdb|1EYV|A Chain A, The Crystal Structure Of Nusb From Mycobacterium
Tuberculosis
emb|CAB06175.1| (Z83863) nusB [Mycobacterium tuberculosis H37Rv]
Length = 156
Score = 57.8 bits (138), Expect = 2e-08
Identities = 36/126 (28%), Positives = 63/126 (49%), Gaps = 4/126 (3%)
Query: 4 RTQARGAVVELLYAFESGN---EEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
R QAR V LL+ E E+ +++ E K + ++ GV E I
Sbjct: 10 RHQARKRAVALLFEAEVRGISAAEVVDTRAALAEAKPDIARLHPYTAAVARGVSEHAAHI 69
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPK 119
D LI HL+ W RL ++++AILR+ +E + P++++E ++L K + ++P
Sbjct: 70 DDLITAHLRGWTLDRLPAVDRAILRVSVWELLHAADVPEPVVVDEAVQLAKELSTDDSPG 129
Query: 120 FLNAIL 125
F+N +L
Sbjct: 130 FVNGVL 135
>ref|NP_337104.1| (NC_002755) N utilization substance protein B [Mycobacterium
tuberculosis CDC1551]
gb|AAK46918.1| (AE007097) N utilization substance protein B [Mycobacterium
tuberculosis CDC1551]
Length = 290
Score = 57.8 bits (138), Expect = 2e-08
Identities = 36/126 (28%), Positives = 63/126 (49%), Gaps = 4/126 (3%)
Query: 4 RTQARGAVVELLYAFESGN---EEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
R QAR V LL+ E E+ +++ E K + ++ GV E I
Sbjct: 10 RHQARKRAVALLFEAEVRGISAAEVVDTRAALAEAKPDIARLHPYTAAVARGVSEHAAHI 69
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPK 119
D LI HL+ W RL ++++AILR+ +E + P++++E ++L K + ++P
Sbjct: 70 DDLITAHLRGWTLDRLPAVDRAILRVSVWELLHAADVPEPVVVDEAVQLAKELSTDDSPG 129
Query: 120 FLNAIL 125
F+N +L
Sbjct: 130 FVNGVL 135
>ref|NP_445404.1| (NC_002179) N utilization substance protein B, putative
[Chlamydophila pneumoniae AR39]
pir||B81530 N utilization substance protein B, probable CP0866 [imported] -
Chlamydophila pneumoniae (strain AR39)
gb|AAF38655.1| (AE002245) N utilization substance protein B, putative
[Chlamydophila pneumoniae AR39]
Length = 163
Score = 55.8 bits (133), Expect = 8e-08
Identities = 34/122 (27%), Positives = 61/122 (49%), Gaps = 1/122 (0%)
Query: 8 RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEPH 67
R ++++LYA + + ++ + + + AL+ +LEK E+D +I
Sbjct: 29 REIILQMLYALDMAPSAEDSLVPLLMSQTAVSQKHVLVALNQTKSILEKSQELDLIIGNA 88
Query: 68 LKDWDFKRLGSMEKAILRLGAYEIGFTPTQN-PIIINECIELGKLYAEPNTPKFLNAILD 126
LK+ F L +EK +LRL +E ++P N I+I E I L K ++ F+ AIL+
Sbjct: 89 LKNKSFDSLDLVEKNVLRLTLFEHFYSPPINKAILIAEAIRLVKKFSYSEACPFIQAILN 148
Query: 127 SL 128
+
Sbjct: 149 DI 150
>ref|NP_225183.1| (NC_000922) CT832 hypothetical protein [Chlamydophila pneumoniae
CWL029]
ref|NP_301044.1| (NC_002491) CT832 hypothetical protein [Chlamydophila pneumoniae
J138]
sp|Q9Z6S0|NUSB_CHLPN N utilization substance protein B homolog (NusB protein)
pir||F72010 CT832 hypothetical protein - Chlamydophila pneumoniae (strain
CWL029)
gb|AAD19126.1| (AE001679) CT832 hypothetical protein [Chlamydophila pneumoniae
CWL029]
dbj|BAA99196.1| (AP002548) CT832 hypothetical protein [Chlamydophila pneumoniae
J138]
Length = 160
Score = 55.8 bits (133), Expect = 8e-08
Identities = 34/122 (27%), Positives = 61/122 (49%), Gaps = 1/122 (0%)
Query: 8 RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEPH 67
R ++++LYA + + ++ + + + AL+ +LEK E+D +I
Sbjct: 26 REIILQMLYALDMAPSAEDSLVPLLMSQTAVSQKHVLVALNQTKSILEKSQELDLIIGNA 85
Query: 68 LKDWDFKRLGSMEKAILRLGAYEIGFTPTQN-PIIINECIELGKLYAEPNTPKFLNAILD 126
LK+ F L +EK +LRL +E ++P N I+I E I L K ++ F+ AIL+
Sbjct: 86 LKNKSFDSLDLVEKNVLRLTLFEHFYSPPINKAILIAEAIRLVKKFSYSEACPFIQAILN 145
Query: 127 SL 128
+
Sbjct: 146 DI 147
>ref|NP_531868.1| (NC_003304) N-utilization substance protein B [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL42184.1| (AE009080) N-utilization substance protein B [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 165
Score = 55.5 bits (132), Expect = 1e-07
Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 10/139 (7%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSM--------LEEKKIKNNQLAFALSLFNGVLE 55
R AR A V+ LY + G + ++ + ++ ++ S+ +GV+
Sbjct: 21 RGAARLAAVQALYQMDVGGTGVMEVVAEYEAHRLGQEVDGDTYLKADPSWFRSIVSGVVR 80
Query: 56 KINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
+ID L+ L +DW RL + +AILR G +EI +I+ E +E+ + +
Sbjct: 81 DQTKIDPLVRSALLEDWPLSRLDATVRAILRAGTFEILERKDVPVAVIVTEYVEIARAFF 140
Query: 114 EPNTPKFLNAILDSLSKKL 132
E + PK +NA+LD ++K++
Sbjct: 141 EHDEPKLVNAVLDRIAKQV 159
>ref|NP_354190.1| (NC_003062) AGR_C_2167p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
gb|AAK86975.1| (AE008046) AGR_C_2167p [Agrobacterium tumefaciens str. C58
(Cereon)]
Length = 169
Score = 55.5 bits (132), Expect = 1e-07
Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 10/139 (7%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSM--------LEEKKIKNNQLAFALSLFNGVLE 55
R AR A V+ LY + G + ++ + ++ ++ S+ +GV+
Sbjct: 25 RGAARLAAVQALYQMDVGGTGVMEVVAEYEAHRLGQEVDGDTYLKADPSWFRSIVSGVVR 84
Query: 56 KINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
+ID L+ L +DW RL + +AILR G +EI +I+ E +E+ + +
Sbjct: 85 DQTKIDPLVRSALLEDWPLSRLDATVRAILRAGTFEILERKDVPVAVIVTEYVEIARAFF 144
Query: 114 EPNTPKFLNAILDSLSKKL 132
E + PK +NA+LD ++K++
Sbjct: 145 EHDEPKLVNAVLDRIAKQV 163
>ref|NP_231898.1| (NC_002505) N utilization substance protein B [Vibrio cholerae]
pir||B82098 N utilization substance protein B VC2267 [imported] - Vibrio
cholerae (group O1 strain N16961)
gb|AAF95411.1| (AE004298) N utilization substance protein B [Vibrio cholerae]
Length = 156
Score = 55.1 bits (131), Expect = 1e-07
Identities = 39/149 (26%), Positives = 69/149 (46%), Gaps = 16/149 (10%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQ--------------LAFAL 47
A R AR ++ +Y+++ E + I L K + +++
Sbjct: 8 AARRNARQFALQAIYSWQITKENVATIEEQFLTSGKYDEEEHRAAEPALAAPETDVSYFR 67
Query: 48 SLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTP-TQNPIIINECI 106
L GV+ NE+D+ + P + + L ME A+LRL YE+ ++INE I
Sbjct: 68 DLLAGVVLNHNELDSKLRPFVSR-PMQDLDMMELALLRLAMYEMTRREDVPYKVVINEAI 126
Query: 107 ELGKLYAEPNTPKFLNAILDSLSKKLTQK 135
EL K++A ++ KF+N +LD + + +K
Sbjct: 127 ELAKVFAAEDSHKFVNGVLDKAAPHVRKK 155
>ref|NP_108511.1| (NC_002678) N-utilization substance protein B [Mesorhizobium loti]
dbj|BAB54297.1| (AP003014) N-utilization substance protein B [Mesorhizobium loti]
Length = 155
Score = 55.1 bits (131), Expect = 1e-07
Identities = 38/139 (27%), Positives = 67/139 (47%), Gaps = 10/139 (7%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSM--------LEEKKIKNNQLAFALSLFNGVLE 55
R AR A V+ LY + + +I + ++ + + ++ GV+E
Sbjct: 7 RGAARLAAVQALYQMDVAGSGVFEITAEYEAFRLGKEVDGALYREADAQWFRAILTGVVE 66
Query: 56 KINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
ID +I L D W RL S +AILR G YE+ +I++E +++ K +
Sbjct: 67 DQKTIDPVIRQALTDDWPLSRLDSTLRAILRAGVYELMKREDVPVAVIVSEYVDIAKAFY 126
Query: 114 EPNTPKFLNAILDSLSKKL 132
E + PK +NA+LD +S+++
Sbjct: 127 EEDEPKLVNAVLDRVSRRV 145
>ref|NP_420173.1| (NC_002696) N utilization substance protein B [Caulobacter
crescentus CB15]
gb|AAK23341.1| (AE005811) N utilization substance protein B [Caulobacter
crescentus CB15]
Length = 149
Score = 54.7 bits (130), Expect = 2e-07
Identities = 41/139 (29%), Positives = 69/139 (49%), Gaps = 14/139 (10%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLE---EKKIKNNQLA-----FALSLFNGVLE 55
R+ AR A V+ LY E + + E ++ ++ QLA F L GV+
Sbjct: 9 RSVARLAAVQALYQMEVSGAGVDSVIREFGEHRFDRDVEGEQLAAADETFFADLARGVVT 68
Query: 56 KINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF---TPTQNPIIINECIELGKL 111
+ID I L W +RL + +A+LR GA+E+ + PT+ ++INE +E+ K
Sbjct: 69 NQAKIDQGIVKRLASGWRLERLDATARAVLRAGAFELMYRSDVPTE--VVINEYVEIAKS 126
Query: 112 YAEPNTPKFLNAILDSLSK 130
+ E F+N LD++++
Sbjct: 127 FFEGPESGFINGALDAIAR 145
>gb|AAB95441.1| (AF002857) NUSB [Shigella flexneri]
Length = 101
Score = 52.8 bits (125), Expect = 7e-07
Identities = 28/90 (31%), Positives = 49/90 (54%), Gaps = 1/90 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
A R +AR V+ LY+++ +I + L E+ +K+ + + L GV K +D
Sbjct: 4 AARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLAGVATKTAYLD 63
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEI 91
L++P+L + LG +EKA+LR+ YE+
Sbjct: 64 GLMKPYLSRL-LEELGQVEKAVLRIALYEL 92
>ref|NP_485800.1| (NC_003272) transcription termination factor [Nostoc sp. PCC 7120]
dbj|BAB73459.1| (AP003587) transcription termination factor [Nostoc sp. PCC 7120]
Length = 211
Score = 52.8 bits (125), Expect = 7e-07
Identities = 28/85 (32%), Positives = 45/85 (52%)
Query: 45 FALSLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINE 104
+A+ L + E+ + ID I L DW RL +++ ILR+ E+ F N + INE
Sbjct: 119 YAIKLVKIINEERSVIDEQITSALVDWQVTRLAQIDRDILRIAVAEMMFFNLPNSVAINE 178
Query: 105 CIELGKLYAEPNTPKFLNAILDSLS 129
+EL K Y+ +F+N +L +S
Sbjct: 179 AVELAKRYSGDEGHRFINGVLRRVS 203
>ref|NP_273725.1| (NC_003112) N utilization substance protein B [Neisseria
meningitidis MC58]
pir||A81172 transcription termination factor nusB NMB0683 [similarity] -
Neisseria meningitidis (group B strain MD58)
gb|AAF41101.1| (AE002422) N utilization substance protein B [Neisseria
meningitidis MC58]
Length = 141
Score = 51.6 bits (122), Expect = 1e-06
Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 2/130 (1%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R ++R V+ +Y +IA ++ E LF G E
Sbjct: 5 RRRSRELAVQAVYQSLINRTAAPEIAKNIREMSDFAKADEELFNKLFFGTQTNAAEYIRQ 64
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTP-TQNPIIINECIELGKLYAEPNTPKFLN 122
I P L D D K L +E+A+L +E+ P T P+IINE IE+ K + + KF+N
Sbjct: 65 IRP-LLDRDEKDLNPIERAVLLTACHELSAMPETPYPVIINEAIEVTKTFGGTDGHKFVN 123
Query: 123 AILDSLSKKL 132
ILD L+ ++
Sbjct: 124 GILDKLAAQI 133
>ref|NP_283676.1| (NC_003116) putative RNA polymerase antitermination factor
[Neisseria meningitidis Z2491]
pir||H81934 transcription termination factor nusB NMA0885 [similarity] -
Neisseria meningitidis (group A strain Z2491)
emb|CAB84165.1| (AL162754) putative RNA polymerase antitermination factor
[Neisseria meningitidis Z2491]
Length = 141
Score = 51.2 bits (121), Expect = 2e-06
Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 2/130 (1%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
R ++R V+ +Y +IA ++ E LF G E
Sbjct: 5 RRRSRELAVQAVYQSLINRTAAPEIAKNIREMPDFAKADEELFNKLFFGTQTNAAEYIRQ 64
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTP-TQNPIIINECIELGKLYAEPNTPKFLN 122
I P L D D K L +E+A+L +E+ P T P+IINE IE+ K + + KF+N
Sbjct: 65 IRP-LLDRDEKDLNPIERAVLLTACHELSAMPETPYPVIINEAIEVTKTFGGTDGHKFVN 123
Query: 123 AILDSLSKKL 132
ILD L+ ++
Sbjct: 124 GILDKLAAQI 133
>ref|NP_600832.1| (NC_003450) COG0781:Transcription termination factor
[Corynebacterium glutamicum]
dbj|BAB99011.1| (AP005279) Transcription termination factor [Corynebacterium
glutamicum ATCC 13032]
Length = 227
Score = 50.8 bits (120), Expect = 2e-06
Identities = 34/138 (24%), Positives = 73/138 (52%), Gaps = 10/138 (7%)
Query: 3 TRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLA----FALSLFNGVLEKIN 58
+R +AR V++L+ ES + + I + + N +A + ++ NGV +++
Sbjct: 13 SRYKARMRAVDILFEAESRDVDPVAIIDDRHKLARDTNPIVAPVAEYTETIINGVAVELD 72
Query: 59 EIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF---TPTQNPIIINECIELGKLYAE 114
+D + H+ + W RL S+++AILR+ ++E+ + P I+ E +E+ Y+
Sbjct: 73 TLDVFLAEHIAETWTLGRLPSVDRAILRVASWEMIYNADVPVTTAIV--EAVEIASEYSG 130
Query: 115 PNTPKFLNAILDSLSKKL 132
+ ++NA LD+++ K+
Sbjct: 131 DKSSAYINATLDAMASKV 148
>ref|NP_540103.1| (NC_003317) N UTILIZATION SUBSTANCE PROTEIN B [Brucella melitensis]
gb|AAL52367.1| (AE009558) N UTILIZATION SUBSTANCE PROTEIN B [Brucella melitensis]
Length = 171
Score = 50.4 bits (119), Expect = 3e-06
Identities = 28/80 (35%), Positives = 47/80 (58%), Gaps = 2/80 (2%)
Query: 52 GVLEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELG 109
GV+E ++D +I L +DW RL S +AILR GA+E+ +I++E +++
Sbjct: 76 GVVEDQLKLDPMIHQALTEDWPLSRLDSTLRAILRAGAWELKARKDVPTAVIVSEYVDIA 135
Query: 110 KLYAEPNTPKFLNAILDSLS 129
K + + PK +NA+LD L+
Sbjct: 136 KAFYTEDEPKLVNAVLDRLA 155
>gb|AAF18280.1| (AF088897) N-utilization substance protein B [Zymomonas mobilis]
Length = 129
Score = 47.8 bits (112), Expect = 2e-05
Identities = 29/102 (28%), Positives = 51/102 (49%), Gaps = 2/102 (1%)
Query: 33 LEEKKIKNNQLAFALSLFNGVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEI 91
+E+ + +F + GV + EID +I +L + W RL + ILR G YE+
Sbjct: 22 IEDATYTKAEPSFFDDIVRGVGTRCEEIDRVISENLSERWSLDRLDRPMRQILRAGTYEL 81
Query: 92 GFTP-TQNPIIINECIELGKLYAEPNTPKFLNAILDSLSKKL 132
P +I+E I++ + + F+N +LD+++KKL
Sbjct: 82 LARPDVPTATVISEYIDVANAFYDRQEKNFVNGLLDTVAKKL 123
>ref|NP_296598.1| (NC_002620) N utilization substance protein B, putative [Chlamydia
muridarum]
sp|Q9PL88|NUSB_CHLMU N utilization substance protein B homolog (NusB protein)
pir||A81727 transcription termination factor nusB TC0219 [similarity] -
Chlamydia muridarum (strain Nigg)
gb|AAF39091.1| (AE002289) N utilization substance protein B, putative [Chlamydia
muridarum]
Length = 164
Score = 45.4 bits (106), Expect = 1e-04
Identities = 31/137 (22%), Positives = 62/137 (44%), Gaps = 4/137 (2%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
+ + R V++ LYA E + + S ++ E + + +AL + +E+DAL
Sbjct: 23 KQKLRELVLQALYALEMAPKGEDSLVSLLMTEASVSKKNVLYALMFCKAIRANQSELDAL 82
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPI----IINECIELGKLYAEPNTPK 119
+ ++ L +E+ ILR+ +E +PI +I E L K ++
Sbjct: 83 LNATIRTTTLANLTIIERNILRMMLFEHQQNQESSPIPTAVLIAETTRLIKKFSYVEGSS 142
Query: 120 FLNAILDSLSKKLTQKP 136
+ A+L S+ ++ Q+P
Sbjct: 143 LILAVLGSIFDQVAQEP 159
>pir||T05067 hypothetical protein M3E9.200 - Arabidopsis thaliana
emb|CAA18233.1| (AL022223) putative protein [Arabidopsis thaliana]
emb|CAB79492.1| (AL161565) putative protein [Arabidopsis thaliana]
Length = 286
Score = 45.4 bits (106), Expect = 1e-04
Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 2/90 (2%)
Query: 43 LAFALSLFNGVLEKINEIDALIEP-HLKDWDFKRLGS-MEKAILRLGAYEIGFTPTQNPI 100
L FA L V++K + +IE DW G +E +IL L E+ T++PI
Sbjct: 178 LRFAKKLLAAVVDKWDSHVVIIEKISPPDWKSAPAGRILEFSILHLAMSEVAVLETRHPI 237
Query: 101 IINECIELGKLYAEPNTPKFLNAILDSLSK 130
+INE ++L K + + + P+ +N L + K
Sbjct: 238 VINEAVDLAKRFCDGSAPRIINGCLRTFVK 267
>ref|NP_567745.1| (NM_118770) putative protein [Arabidopsis thaliana]
gb|AAK96755.1| (AY054564) putative protein [Arabidopsis thaliana]
Length = 301
Score = 45.4 bits (106), Expect = 1e-04
Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 2/90 (2%)
Query: 43 LAFALSLFNGVLEKINEIDALIEP-HLKDWDFKRLGS-MEKAILRLGAYEIGFTPTQNPI 100
L FA L V++K + +IE DW G +E +IL L E+ T++PI
Sbjct: 193 LRFAKKLLAAVVDKWDSHVVIIEKISPPDWKSAPAGRILEFSILHLAMSEVAVLETRHPI 252
Query: 101 IINECIELGKLYAEPNTPKFLNAILDSLSK 130
+INE ++L K + + + P+ +N L + K
Sbjct: 253 VINEAVDLAKRFCDGSAPRIINGCLRTFVK 282
>ref|NP_078134.1| (NC_002162) transcription termination factor [Ureaplasma
urealyticum]
pir||A82909 transcription termination factor UU300 [imported] - Ureaplasma
urealyticum
gb|AAF30709.1|AE002127_7 (AE002127) transcription termination factor [Ureaplasma
urealyticum]
Length = 127
Score = 45.1 bits (105), Expect = 1e-04
Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
Query: 53 VLEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKL 111
+L+ ++ +I+P + KDW F+RL +E+A+L E T III++ +
Sbjct: 49 ILDNYEQLTKMIKPLISKDWTFERLSYVEQALLLSAYGEYLVLKTPKKIIIDQTLITTHN 108
Query: 112 YAEPNTPKFLNAILDSL 128
Y+ + KF+NAILD L
Sbjct: 109 YSNNESYKFINAILDQL 125
>ref|NP_359841.1| (NC_003103) N utilization substance protein B [Rickettsia conorii]
sp|Q92J65|NUSB_RICCN N utilization substance protein B homolog (NusB protein)
gb|AAL02742.1| (AE008588) N utilization substance protein B [Rickettsia conorii]
Length = 156
Score = 43.9 bits (102), Expect = 3e-04
Identities = 38/143 (26%), Positives = 70/143 (48%), Gaps = 15/143 (10%)
Query: 7 ARGAVVELLYA-FESGNEEIKKIASSMLEEKKIKN------NQLAFALS------LFNGV 53
AR A V+ +Y N+++ I ++L + N L +LS L V
Sbjct: 12 ARIAAVQAIYQNILQNNDDMDDIMQNVLSFYQNNNAITDLPENLKISLSISHFKMLVKSV 71
Query: 54 LEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-IIINECIELGKL 111
E I+++D +I+ HL D D + + +A+LR+ E+ F PT ++INE ++
Sbjct: 72 FENIHKLDEIIDNHLTNDKDPAHMPILLRALLRVSICELLFCPTTPAKVVINEYTDIAND 131
Query: 112 YAEPNTPKFLNAILDSLSKKLTQ 134
+ F+N++LD ++K+ T+
Sbjct: 132 MLNEHEIGFVNSVLDKIAKEHTR 154
>ref|NP_220353.1| (NC_000117) Transcription termination factor [Chlamydia
trachomatis]
sp|O84839|NUSB_CHLTR N utilization substance protein B homolog (NusB protein)
pir||H71464 transcription termination factor nusB CT832 [similarity] -
Chlamydia trachomatis (serotype D, strain UW3/Cx)
gb|AAC68429.1| (AE001356) Transcription termination factor [Chlamydia trachomatis]
Length = 168
Score = 42.0 bits (97), Expect = 0.001
Identities = 30/129 (23%), Positives = 57/129 (43%), Gaps = 4/129 (3%)
Query: 4 RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
+ + R V++ LYA E E + S ++ E + A+AL + ++DAL
Sbjct: 21 KQKLRELVLQALYALEIDPEGEDSLVSLLMTEASVSKKNAAYALMFCRAIRANQPDLDAL 80
Query: 64 IEPHLKDWDFKRLGSMEKAILRLGAYE----IGFTPTQNPIIINECIELGKLYAEPNTPK 119
++ ++ RL +E+ ILR+ +E P ++I E L K ++
Sbjct: 81 LDATIRTTTLARLTIIERNILRMMLFEHQQNQDCCPVPVAVLIAETTRLIKKFSYSEGSS 140
Query: 120 FLNAILDSL 128
+ A+L S+
Sbjct: 141 LILAVLGSI 149
>ref|NP_518832.1| (NC_003295) PROBABLE N UTILIZATION SUBSTANCE B (TRANSCRIPTIONAL
ANTITERMINATOR)(L FACTOR) TRANSCRIPTION REGULATOR
PROTEIN [Ralstonia solanacearum]
emb|CAD14241.1| (AL646060) PROBABLE N UTILIZATION SUBSTANCE B (TRANSCRIPTIONAL
ANTITERMINATOR)(L FACTOR) TRANSCRIPTION REGULATOR
PROTEIN [Ralstonia solanacearum]
Length = 161
Score = 41.2 bits (95), Expect = 0.002
Identities = 30/132 (22%), Positives = 60/132 (44%), Gaps = 2/132 (1%)
Query: 2 ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
+ R +AR ++ LY + + + + + + + A +L +G + + +
Sbjct: 22 SARRRARELALQGLYQWLLNRNDPGVVEAHLHDAQGFNKADRAHFDALLHGAIREEATLT 81
Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPKF 120
P L D L +E+A L +GAYE + ++INE +EL K + K+
Sbjct: 82 ESFTPFL-DRPVAELSPVERAALLVGAYELVHCVDIPYKVVINEAVELAKTFGGVEGYKY 140
Query: 121 LNAILDSLSKKL 132
+N +LD L+ ++
Sbjct: 141 VNGVLDKLAAQV 152
>ref|NP_600813.1| (NC_003450) COG0144:tRNA and rRNA cytosine-C5-methylases
[Corynebacterium glutamicum]
Length = 511
Score = 40.0 bits (92), Expect = 0.004
Identities = 30/128 (23%), Positives = 57/128 (44%), Gaps = 9/128 (7%)
Query: 8 RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEP- 66
R E+L +G + +L + + AFA + G L + +D +I+
Sbjct: 77 REIAFEVLDRVRTGEAYANLVLPRLLSKHNLSGRDAAFATEITYGTLRNVGLLDEVIKAA 136
Query: 67 ---HLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKFLNA 123
L D D + L +LRLGAY++ FT ++ ++ +++ + F NA
Sbjct: 137 SGRELSDIDPEVLD-----VLRLGAYQVMFTRVEDHAAVDTSVKMVGGLKKFQATGFANA 191
Query: 124 ILDSLSKK 131
IL ++++K
Sbjct: 192 ILRNITRK 199
>ref|NP_220552.1| (NC_000963) N UTILIZATION SUBSTANCE PROTEIN B (nusB) [Rickettsia
prowazekii]
pir||F71726 transcription termination factor nusB RP162 - Rickettsia prowazekii
emb|CAA14629.1| (AJ235270) N UTILIZATION SUBSTANCE PROTEIN B (nusB) [Rickettsia
prowazekii]
Length = 174
Score = 40.0 bits (92), Expect = 0.004
Identities = 27/99 (27%), Positives = 55/99 (55%), Gaps = 6/99 (6%)
Query: 39 KNNQLAFALSLFN----GVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF 93
KN +++ ++S F V E IN++D +I+ HL + D + + +A+LR+ E+ F
Sbjct: 71 KNFKISLSISHFKMLVKSVFENINKLDEIIDNHLTNAKDSVHMPILLRALLRVSICELLF 130
Query: 94 -TPTQNPIIINECIELGKLYAEPNTPKFLNAILDSLSKK 131
+ T ++INE ++ + F+N+ILD ++++
Sbjct: 131 CSTTPAKVVINEYTDIANDLLNEHEIGFVNSILDKIAQE 169
>sp|Q9ZE01|NUSB_RICPR N utilization substance protein B homolog (NusB protein)
Length = 155
Score = 40.0 bits (92), Expect = 0.004
Identities = 27/99 (27%), Positives = 55/99 (55%), Gaps = 6/99 (6%)
Query: 39 KNNQLAFALSLFN----GVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF 93
KN +++ ++S F V E IN++D +I+ HL + D + + +A+LR+ E+ F
Sbjct: 52 KNFKISLSISHFKMLVKSVFENINKLDEIIDNHLTNAKDSVHMPILLRALLRVSICELLF 111
Query: 94 -TPTQNPIIINECIELGKLYAEPNTPKFLNAILDSLSKK 131
+ T ++INE ++ + F+N+ILD ++++
Sbjct: 112 CSTTPAKVVINEYTDIANDLLNEHEIGFVNSILDKIAQE 150
>dbj|BAB98992.1| (AP005279) tRNA and rRNA cytosine-C5-methylases [Corynebacterium
glutamicum ATCC 13032]
Length = 444
Score = 40.0 bits (92), Expect = 0.004
Identities = 30/128 (23%), Positives = 57/128 (44%), Gaps = 9/128 (7%)
Query: 8 RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEP- 66
R E+L +G + +L + + AFA + G L + +D +I+
Sbjct: 10 REIAFEVLDRVRTGEAYANLVLPRLLSKHNLSGRDAAFATEITYGTLRNVGLLDEVIKAA 69
Query: 67 ---HLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKFLNA 123
L D D + L +LRLGAY++ FT ++ ++ +++ + F NA
Sbjct: 70 SGRELSDIDPEVLD-----VLRLGAYQVMFTRVEDHAAVDTSVKMVGGLKKFQATGFANA 124
Query: 124 ILDSLSKK 131
IL ++++K
Sbjct: 125 ILRNITRK 132
>ref|NP_661503.1| (NC_002932) Sun protein [Chlorobium tepidum TLS]
gb|AAM71845.1| (AE012834) Sun protein [Chlorobium tepidum TLS]
Length = 428
Score = 38.9 bits (89), Expect = 0.010
Identities = 31/131 (23%), Positives = 60/131 (45%), Gaps = 7/131 (5%)
Query: 1 MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
M R A ++EL G + +++ + M E + N A A L G L+ +
Sbjct: 1 MTARELALRVLLEL-----DGMRKSEELLNRMHEHAGLGKNDRALAKELVAGTLKYRLQC 55
Query: 61 DALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPK 119
D +I + D+ + ++ K ILRLG Y+ + +NE ++L + + + +
Sbjct: 56 DFIIARFYRH-DYAKAATVLKHILRLGVYQLLRLDRVPKSAAVNESVKLARKFKGDHLAR 114
Query: 120 FLNAILDSLSK 130
+N +L ++SK
Sbjct: 115 LVNGLLRNISK 125
Database: /home/scwang/download_20020708_db/nr
Posted date: Aug 7, 2002 12:55 PM
Number of letters in database: 324,149,939
Number of sequences in database: 1,026,957
Lambda K H
0.316 0.135 0.372
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,846,108
Number of Sequences: 1026957
Number of extensions: 3220351
Number of successful extensions: 8799
Number of sequences better than 1.0e-02: 68
Number of HSP's better than 0.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 39
Number of HSP's that attempted gapping in prelim test: 8713
Number of HSP's gapped (non-prelim): 68
length of query: 138
length of database: 324,149,939
effective HSP length: 114
effective length of query: 24
effective length of database: 207,076,841
effective search space: 4969844184
effective search space used: 4969844184
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 89 (38.9 bits)