BLASTP 2.2.1 [Apr-13-2001]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|15644635|ref|NP_206803.1| hypothetical protein
[Helicobacter pylori 26695]
         (138 letters)

Database: /home/scwang/download_20020708_db/nr
           1,026,957 sequences; 324,149,939 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_206803.1|  (NC_000915) hypothetical protein [Helicoba...   271  7e-73
ref|NP_222723.1|  (NC_000921) TRANSCRIPTION TERMINATION [Hel...   267  1e-71
ref|NP_281572.1|  (NC_002163) transcription termination prot...   124  2e-28
ref|NP_660779.1|  (NC_004061) N utilization substance protei...    79  1e-14
ref|NP_213090.1|  (NC_000918) transcription termination NusB...    77  4e-14
ref|NP_240274.1|  (NC_002528) N utilization substance protei...    74  3e-13
ref|NP_622912.1|  (NC_003869) Transcription termination fact...    74  4e-13
ref|NP_470732.1|  (NC_003212) similar to transcription termi...    72  1e-12
ref|NP_245667.1|  (NC_002663) NusB [Pasteurella multocida] >...    72  1e-12
ref|NP_464884.1|  (NC_003210) similar to transcription termi...    72  1e-12
ref|NP_406656.1|  (NC_003143) N utilization substance protei...    71  2e-12
ref|NP_602432.1|  (NC_003454) N utilization substance protei...    69  7e-12
ref|NP_562740.1|  (NC_003366) probable N utilization substan...    69  1e-11
ref|NP_286157.1|  (NC_002655) transcription termination; L f...    69  1e-11
ref|NP_219452.1|  (NC_000919) N utilization substance protei...    69  1e-11
ref|NP_390312.1|  (NC_000964) similar to transcription termi...    69  1e-11
ref|NP_459413.1|  (NC_003197) transcription termination; L f...    68  2e-11
ref|NP_455012.1|  (NC_003198) N utilization substance protei...    68  2e-11
ref|NP_229562.1|  (NC_000853) N utilization substance protei...    67  3e-11
ref|NP_357984.1|  (NC_003098) Transcription termination prot...    65  1e-10
ref|NP_344955.1|  (NC_003028) N utilization substance protei...    65  1e-10
ref|NP_348703.1|  (NC_003030) Transcription termination fact...    65  1e-10
ref|NP_439455.1|  (NC_000907) N utilization substance protei...    65  1e-10
ref|NP_266850.1|  (NC_002662) transcription termination prot...    65  2e-10
ref|NP_212241.1|  (NC_001318) N-utilization substance protei...    63  6e-10
ref|NP_252741.1|  (NC_002516) NusB protein [Pseudomonas aeru...    63  6e-10
ref|NP_607890.1|  (NC_003485) putative transcriptional termi...    62  8e-10
ref|NP_636090.1|  (NC_003902) transcription termination fact...    62  1e-09
ref|NP_372048.1|  (NC_002758) hypothetical protein [Staphylo...    62  1e-09
ref|NP_269822.1|  (NC_002737) putative transcriptional termi...    61  2e-09
ref|NP_625770.1|  (NC_003888) putative NusB-family protein [...    61  2e-09
ref|NP_658218.1|  (NC_003995) NusB, NusB family [Bacillus an...    60  3e-09
ref|NP_385322.1|  (NC_003047) PUTATIVE N UTILIZATION SUBSTAN...    60  3e-09
ref|NP_662591.1|  (NC_002932) N utilization substance protei...    60  3e-09
ref|NP_243651.1|  (NC_002570) transcriptional terminator [Ba...    60  3e-09
ref|NP_301448.1|  (NC_002677) putative transcription termina...    60  4e-09
ref|NP_641104.1|  (NC_003919) transcription termination fact...    60  4e-09
ref|NP_298245.1|  (NC_002488) transcription termination fact...    59  7e-09
ref|NP_295790.1|  (NC_001263) N-utilization substance protei...    59  9e-09
ref|NP_441814.1|  (NC_000911) N utilization substance protei...    58  2e-08
ref|NP_217049.1|  (NC_000962) nusB [Mycobacterium tuberculos...    58  2e-08
ref|NP_337104.1|  (NC_002755) N utilization substance protei...    58  2e-08
ref|NP_445404.1|  (NC_002179) N utilization substance protei...    56  8e-08
ref|NP_225183.1|  (NC_000922) CT832 hypothetical protein [Ch...    56  8e-08
ref|NP_531868.1|  (NC_003304) N-utilization substance protei...    55  1e-07
ref|NP_354190.1|  (NC_003062) AGR_C_2167p [Agrobacterium tum...    55  1e-07
ref|NP_231898.1|  (NC_002505) N utilization substance protei...    55  1e-07
ref|NP_108511.1|  (NC_002678) N-utilization substance protei...    55  1e-07
ref|NP_420173.1|  (NC_002696) N utilization substance protei...    55  2e-07
gb|AAB95441.1|  (AF002857) NUSB [Shigella flexneri]                53  7e-07
ref|NP_485800.1|  (NC_003272) transcription termination fact...    53  7e-07
ref|NP_273725.1|  (NC_003112) N utilization substance protei...    52  1e-06
ref|NP_283676.1|  (NC_003116) putative RNA polymerase antite...    51  2e-06
ref|NP_600832.1|  (NC_003450) COG0781:Transcription terminat...    51  2e-06
ref|NP_540103.1|  (NC_003317) N UTILIZATION SUBSTANCE PROTEI...    50  3e-06
gb|AAF18280.1|  (AF088897) N-utilization substance protein B...    48  2e-05
ref|NP_296598.1|  (NC_002620) N utilization substance protei...    45  1e-04
pir||T05067  hypothetical protein M3E9.200 - Arabidopsis tha...    45  1e-04
ref|NP_567745.1|  (NM_118770) putative protein [Arabidopsis ...    45  1e-04
ref|NP_078134.1|  (NC_002162) transcription termination fact...    45  1e-04
ref|NP_359841.1|  (NC_003103) N utilization substance protei...    44  3e-04
ref|NP_220353.1|  (NC_000117) Transcription termination fact...    42  0.001
ref|NP_518832.1|  (NC_003295) PROBABLE N UTILIZATION SUBSTAN...    41  0.002
ref|NP_600813.1|  (NC_003450) COG0144:tRNA and rRNA cytosine...    40  0.004
ref|NP_220552.1|  (NC_000963) N UTILIZATION SUBSTANCE PROTEI...    40  0.004
sp|Q9ZE01|NUSB_RICPR  N utilization substance protein B homo...    40  0.004
dbj|BAB98992.1|  (AP005279) tRNA and rRNA cytosine-C5-methyl...    40  0.004
ref|NP_661503.1|  (NC_002932) Sun protein [Chlorobium tepidu...    39  0.010
>ref|NP_206803.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
 sp|O24853|NUSB_HELPY N utilization substance protein B homolog (NusB protein)
 pir||A64520 transcription termination factor NusB - Helicobacter pylori
           (strain 26695)
 gb|AAD07074.1| (AE000523) H. pylori predicted coding region HP0001 [Helicobacter
           pylori 26695]
          Length = 138

 Score =  271 bits (694), Expect = 7e-73
 Identities = 138/138 (100%), Positives = 138/138 (100%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI
Sbjct: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
           DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF
Sbjct: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120

Query: 121 LNAILDSLSKKLTQKPLN 138
           LNAILDSLSKKLTQKPLN
Sbjct: 121 LNAILDSLSKKLTQKPLN 138
>ref|NP_222723.1| (NC_000921) TRANSCRIPTION TERMINATION [Helicobacter pylori J99]
 sp|Q9ZN57|NUSB_HELPJ N utilization substance protein B homolog (NusB protein)
 pir||C71985 transcription termination factor nusB [similarity] - Helicobacter
           pylori (strain J99)
 gb|AAD05585.1| (AE001440) TRANSCRIPTION TERMINATION [Helicobacter pylori J99]
          Length = 138

 Score =  267 bits (683), Expect = 1e-71
 Identities = 136/137 (99%), Positives = 136/137 (99%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI
Sbjct: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
           DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF
Sbjct: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120

Query: 121 LNAILDSLSKKLTQKPL 137
           LNAILDSLSKKL QKPL
Sbjct: 121 LNAILDSLSKKLAQKPL 137
>ref|NP_281572.1| (NC_002163) transcription termination protein [Campylobacter
           jejuni]
 pir||D81381 transcription termination factor nusB Cj0382c [similarity] -
           Campylobacter jejuni (strain NCTC 11168)
 emb|CAB74218.1| (AL139075) transcription termination protein [Campylobacter jejuni]
          Length = 132

 Score =  124 bits (311), Expect = 2e-28
 Identities = 65/130 (50%), Positives = 86/130 (66%), Gaps = 1/130 (0%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           MATR Q R +V+ LLYAFE  N +       +L+EKKI+N Q  F L+L+NG+L+ +N I
Sbjct: 1   MATRHQVRQSVISLLYAFEL-NSQNNVFVDEILDEKKIRNEQKNFTLNLYNGILDNLNNI 59

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKF 120
           D  +   L D     LG +E+AILRLGAYE+ FT T + I+INE IEL K  A  N+PKF
Sbjct: 60  DETLNSFLNDNQITALGHVERAILRLGAYELLFTDTPSAIVINEAIELAKELANDNSPKF 119

Query: 121 LNAILDSLSK 130
           +N +LD+L K
Sbjct: 120 INGVLDALIK 129
>ref|NP_660779.1| (NC_004061) N utilization substance protein B [Buchnera aphidicola
           str. Sg (Schizaphis graminum)]
 gb|AAM67990.1| (AE014121) N utilization substance protein B [Buchnera aphidicola
           str. Sg (Schizaphis graminum)]
          Length = 138

 Score = 78.6 bits (192), Expect = 1e-14
 Identities = 47/131 (35%), Positives = 72/131 (54%), Gaps = 4/131 (3%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R +AR   +++LY++E     IK  A   L+EK  KN  + +   L  G+      ID L
Sbjct: 8   RRKARACALQMLYSWEISQNNIKDSAIEFLKEKNKKNIDIIYFYELIIGITYNCRSIDDL 67

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP--IIINECIELGKLYAEPNTPKFL 121
           ++P+L     K LG +EKAILR+  YE+ +     P  + INE IEL KL+   ++ KF+
Sbjct: 68  MKPYLSR-SLKELGQIEKAILRISFYEL-YKRKDIPYKVSINEGIELAKLFGSEDSHKFI 125

Query: 122 NAILDSLSKKL 132
           N +LD  + K+
Sbjct: 126 NGVLDKAALKI 136
>ref|NP_213090.1| (NC_000918) transcription termination NusB [Aquifex aeolicus]
 sp|O66530|NUSB_AQUAE N utilization substance protein B homolog (NusB protein)
 pir||G70312 transcription termination factor nusB [similarity] - Aquifex
           aeolicus
 gb|AAC06491.1| (AE000675) transcription termination NusB [Aquifex aeolicus]
          Length = 148

 Score = 76.6 bits (187), Expect = 4e-14
 Identities = 45/132 (34%), Positives = 71/132 (53%), Gaps = 2/132 (1%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINE 59
           M  R  AR     +LY ++   E   ++   ++EEK IKN     +A  L +  +  I E
Sbjct: 1   MRYRKGARDTAFLVLYRWDLRGENPGELFKEVVEEKNIKNKDAYEYAKKLVDTAVRHIEE 60

Query: 60  IDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-IIINECIELGKLYAEPNTP 118
           ID++IE HLK W   RLG +E+  LRLG  E+ F  ++ P  +  + ++L K YA+    
Sbjct: 61  IDSIIEKHLKGWSIDRLGYVERNALRLGVAELIFLKSKEPGRVFIDIVDLVKKYADEKAG 120

Query: 119 KFLNAILDSLSK 130
           KF+N +L ++ K
Sbjct: 121 KFVNGVLSAIYK 132
>ref|NP_240274.1| (NC_002528) N utilization substance protein B [Buchnera sp. APS]
 sp|P57535|NUSB_BUCAI N utilization substance protein B homolog (NusB protein)
 dbj|BAB13160.1| (AP001119) N utilization substance protein B [Buchnera sp. APS]
          Length = 143

 Score = 73.9 bits (180), Expect = 3e-13
 Identities = 45/130 (34%), Positives = 71/130 (54%), Gaps = 2/130 (1%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R +AR   +++LY++E  +  IK+ A   L+EK  KN  + +   L  G+      ID L
Sbjct: 6   RRKARACALQVLYSWEISHNNIKESAIYFLKEKNKKNIDIVYFYELIIGITYDCKNIDNL 65

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYAEPNTPKFLN 122
           ++P+L     K LG +E+AILR+  YE+         + INE IEL KL+   ++ KF+N
Sbjct: 66  MKPYLFR-SLKELGHIERAILRISFYELHKRNDIPYKVSINEGIELAKLFGSEDSHKFIN 124

Query: 123 AILDSLSKKL 132
            +LD    K+
Sbjct: 125 GVLDKAVFKM 134
>ref|NP_622912.1| (NC_003869) Transcription termination factor [Thermoanaerobacter
           tengcongensis]
 gb|AAM24516.1| (AE013090) Transcription termination factor [Thermoanaerobacter
           tengcongensis]
          Length = 140

 Score = 73.6 bits (179), Expect = 4e-13
 Identities = 41/128 (32%), Positives = 68/128 (53%), Gaps = 1/128 (0%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           RT+AR  VV++LY ++     ++KI  +  EE      Q  +      G +E + EID  
Sbjct: 3   RTEAREWVVKMLYQYDVSKLPLEKIFENFYEEHD-PGEQKEYIEGTVRGTVEHLEEIDRE 61

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKFLNA 123
           IE + KDW   R+  ++ AILR   YE+ +      I INE +E+ K Y+  ++P F+N 
Sbjct: 62  IEKYSKDWPLYRMPRIDLAILRCSMYEMLYGNIPVSISINEAVEIAKKYSTDDSPSFING 121

Query: 124 ILDSLSKK 131
           +L +  ++
Sbjct: 122 LLGAFVRE 129
>ref|NP_470732.1| (NC_003212) similar to transcription termination protein (NusB)
           [Listeria innocua]
 emb|CAC96627.1| (AL596168) similar to transcription termination protein (NusB)
           [Listeria innocua]
          Length = 128

 Score = 71.6 bits (174), Expect = 1e-12
 Identities = 38/127 (29%), Positives = 71/127 (54%), Gaps = 6/127 (4%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R +AR   ++ L+  E     + +   +++E++     Q  +   L  GV+    EIDA+
Sbjct: 3   RREAREKALQALFQIELNEMSLDQAIKNIMEDE-----QDDYMEQLVEGVMANKAEIDAI 57

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
           IEP+L +W   RL  ++ ++LRL  YEI +     N + +NE IE+ K+Y++  + KF+N
Sbjct: 58  IEPNLDNWRIDRLNKVDLSLLRLSVYEIKYLDDVPNRVSLNESIEIAKIYSDEKSSKFIN 117

Query: 123 AILDSLS 129
            +L +++
Sbjct: 118 GVLANIA 124
>ref|NP_245667.1| (NC_002663) NusB [Pasteurella multocida]
 sp|P57868|NUSB_PASMU N utilization substance protein B homolog (NusB protein)
 gb|AAK02814.1| (AE006110) NusB [Pasteurella multocida]
          Length = 144

 Score = 71.6 bits (174), Expect = 1e-12
 Identities = 40/136 (29%), Positives = 75/136 (54%), Gaps = 2/136 (1%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           ++ R +AR   V+ LY++        +I  + + E+ +K    A+   LF    E ++ +
Sbjct: 10  ISPRRRARECAVQALYSWYVSQNSPAEIELNFMAEQDLKGVDTAYFRRLFRQTAENVDAV 69

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPK 119
           D ++ P+L D +   L  +EKAILRL  YE+ F       ++INE IE+ K++   ++ K
Sbjct: 70  DNIMIPYL-DREVSELDPIEKAILRLAVYELKFELDVPYKVVINEAIEVAKVFGAEDSHK 128

Query: 120 FLNAILDSLSKKLTQK 135
           ++N +LD ++  L++K
Sbjct: 129 YVNGVLDKVAPVLSRK 144
>ref|NP_464884.1| (NC_003210) similar to transcription termination protein (NusB)
           [Listeria monocytogenes EGD-e]
 emb|CAC99437.1| (AL591978) similar to transcription termination protein (NusB)
           [Listeria monocytogenes]
          Length = 128

 Score = 71.6 bits (174), Expect = 1e-12
 Identities = 38/127 (29%), Positives = 71/127 (54%), Gaps = 6/127 (4%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R +AR   ++ L+  E     + +   +++E++     Q  +   L  GV+    EIDA+
Sbjct: 3   RREAREKALQALFQIELNEMSLDQAIKNIMEDE-----QDDYMEKLVEGVMANKAEIDAI 57

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
           IEP+L +W   RL  ++ ++LRL  YEI +     N + +NE IE+ K+Y++  + KF+N
Sbjct: 58  IEPNLDNWRMDRLSKVDLSLLRLSVYEIKYLDDVPNRVSLNESIEIAKIYSDEKSSKFIN 117

Query: 123 AILDSLS 129
            +L +++
Sbjct: 118 GVLANIA 124
>ref|NP_406656.1| (NC_003143) N utilization substance protein B [Yersinia pestis]
 emb|CAC92416.1| (AJ414155) N utilization substance protein B [Yersinia pestis]
          Length = 138

 Score = 71.2 bits (173), Expect = 2e-12
 Identities = 39/135 (28%), Positives = 72/135 (52%), Gaps = 2/135 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           A R +AR   V+ LY+++    +I  +    L E+ +K+  +A+   L +GV      +D
Sbjct: 4   AARRRARECAVQALYSWQLSKNDIADVELQFLSEQDVKDVDIAYFRELLSGVAVNAASLD 63

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEIG-FTPTQNPIIINECIELGKLYAEPNTPKF 120
           AL+ P L     + LG +E+A+LR+  +E+         + INE IEL K +   ++ KF
Sbjct: 64  ALMAPFLSR-QLEELGQVERAVLRIALFELSKRDDVPYKVAINEAIELAKTFGAEDSHKF 122

Query: 121 LNAILDSLSKKLTQK 135
           +N +LD ++  + ++
Sbjct: 123 VNGVLDKVAPTVRKR 137
>ref|NP_602432.1| (NC_003454) N utilization substance protein B [Fusobacterium
           nucleatum subsp. nucleatum ATCC 25586]
 gb|AAL93731.1| (AE010469) N utilization substance protein B [Fusobacterium
           nucleatum subsp. nucleatum ATCC 25586]
          Length = 153

 Score = 69.3 bits (168), Expect = 7e-12
 Identities = 41/132 (31%), Positives = 72/132 (54%), Gaps = 8/132 (6%)

Query: 7   ARGAVVELLY---AFESGNEEIKKIASSMLEEKK-----IKNNQLAFALSLFNGVLEKIN 58
           AR  V +L++   A ES +EE+K+     L+  +     +  NQL F  S  +G+ +  +
Sbjct: 19  AREEVFKLVFGVEATESASEELKQAFDIYLQNSEELIGTLNENQLEFLKSSIDGIAKNYD 78

Query: 59  EIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTP 118
            I  +I+ + ++W ++R+G +E+A+L +  YE  F      +I NE IEL K Y    + 
Sbjct: 79  NIKDIIKKNTQNWAYERIGVVERALLIVATYEFIFKNAPIEVIANEIIELAKEYGNEKSY 138

Query: 119 KFLNAILDSLSK 130
           +F+N IL ++ K
Sbjct: 139 EFVNGILANIEK 150
>ref|NP_562740.1| (NC_003366) probable N utilization substance protein B [Clostridium
           perfringens]
 dbj|BAB81530.1| (AP003191) probable N utilization substance protein B [Clostridium
           perfringens]
          Length = 135

 Score = 68.6 bits (166), Expect = 1e-11
 Identities = 41/132 (31%), Positives = 69/132 (52%), Gaps = 3/132 (2%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL--AFALSLFNGVLEKINEID 61
           R ++R  +++L Y  E  +E   +  +S +E + I  + L  A+  S   G+ E   ++D
Sbjct: 3   RVKSREYLLQLAYQMEITSETALETFNSFMENEDISKDDLDLAYIKSGLLGIEENKEKLD 62

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKF 120
           +LIE  L  W   R+  +  +ILR+  YEI F       + INE IEL K Y++  +  F
Sbjct: 63  SLIESQLVKWKLNRISKVNLSILRISTYEILFAEDVPGKVSINEAIELCKKYSDNKSVSF 122

Query: 121 LNAILDSLSKKL 132
           +N +LD + K +
Sbjct: 123 INGVLDKVYKNM 134
>ref|NP_286157.1| (NC_002655) transcription termination; L factor [Escherichia coli
           O157:H7 EDL933]
 ref|NP_308496.1| (NC_002695) transcription termination factor NusB [Escherichia coli
           O157:H7]
 ref|NP_414950.1| (NC_000913) transcription termination; L factor [Escherichia coli
           K12]
 sp|P04381|NUSB_ECOLI N utilization substance protein B (NusB protein)
 pir||I51822 nusB protein - Escherichia coli
 pir||FJECB transcription termination factor nusB [validated] - Escherichia
           coli
 pdb|1BAQ|   Antitermination Factor Nusb From Escherichia Coli, Nmr, 18
           Structures
 pdb|1EY1|A Chain A, Solution Structure Of Escherichia Coli Nusb
 gb|AAA24228.1| (M26839) nusB [Escherichia coli]
 emb|CAA45737.1| (X64395) nusB (ssyB) [Escherichia coli]
 emb|CAA25289.1| (X00681) nusB protein [Escherichia coli]
 gb|AAB40172.1| (U82664) N utilization substance protein B [Escherichia coli]
 gb|AAC73519.1| (AE000148) transcription termination; L factor [Escherichia coli
           K12]
 gb|AAG54765.1|AE005221_2 (AE005221) transcription termination; L factor [Escherichia coli
           O157:H7 EDL933]
 dbj|BAB33892.1| (AP002551) transcription termination factor NusB [Escherichia coli
           O157:H7]
 emb|CAC44764.1| (AJ313516) N utilisation substance protein B [Expression vector
           pNCO113-nusB/nusE]
 prf||2111328A NusB protein [Escherichia coli]
          Length = 139

 Score = 68.6 bits (166), Expect = 1e-11
 Identities = 39/126 (30%), Positives = 67/126 (52%), Gaps = 2/126 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           A R +AR   V+ LY+++    +I  +    L E+ +K+  + +   L  GV      +D
Sbjct: 4   AARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLAGVATNTAYLD 63

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKF 120
            L++P+L     + LG +EKA+LR+  YE+   +     + INE IEL K +   ++ KF
Sbjct: 64  GLMKPYLSRL-LEELGQVEKAVLRIALYELSKRSDVPYKVAINEAIELAKSFGAEDSHKF 122

Query: 121 LNAILD 126
           +N +LD
Sbjct: 123 VNGVLD 128
>ref|NP_219452.1| (NC_000919) N utilization substance protein B (nusB) [Treponema
           pallidum]
 sp|O83979|NUSB_TREPA N utilization substance protein B homolog (NusB protein)
 pir||C71253 probable transcription termination factor nusB - syphilis
           spirochete
 gb|AAC65965.1| (AE001269) N utilization substance protein B (nusB) [Treponema
           pallidum]
          Length = 141

 Score = 68.6 bits (166), Expect = 1e-11
 Identities = 33/89 (37%), Positives = 55/89 (61%), Gaps = 1/89 (1%)

Query: 43  LAFALSLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-II 101
           L F+  LF G LE + EID  +   L+ WDF RL  ++KAILRL AY + F     P ++
Sbjct: 51  LGFSRLLFLGTLEHLREIDGCVSSRLEHWDFVRLNKVDKAILRLSAYSLLFQKDIPPVVV 110

Query: 102 INECIELGKLYAEPNTPKFLNAILDSLSK 130
           I+E + + + +   ++ +F+N +LD+++K
Sbjct: 111 IHEAVSIARDFGTDDSFRFVNGVLDNIAK 139
>ref|NP_390312.1| (NC_000964) similar to transcription termination [Bacillus
           subtilis]
 sp|P54520|NUSB_BACSU N utilization substance protein B homolog (NusB protein)
 pir||F69960 transcription termination factor nusB homolog yqhZ [similarity] -
           Bacillus subtilis
 dbj|BAA12571.1| (D84432) YqhZ [Bacillus subtilis]
 emb|CAB14363.1| (Z99116) similar to transcription termination [Bacillus subtilis]
          Length = 131

 Score = 68.6 bits (166), Expect = 1e-11
 Identities = 38/132 (28%), Positives = 69/132 (51%), Gaps = 5/132 (3%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R  AR   ++ L+  +  +  + +     L+E+K       F   L +GVLE  +++D +
Sbjct: 3   RRTAREKALQALFQIDVSDIAVNEAIEHALDEEKTD----PFFEQLVHGVLEHQDQLDEM 58

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKFLN 122
           I  HL +W   R+ ++++AILRL AYE+ +       + +NE IEL K + +    KF+N
Sbjct: 59  ISKHLVNWKLDRIANVDRAILRLAAYEMAYAEDIPVNVSMNEAIELAKRFGDDKATKFVN 118

Query: 123 AILDSLSKKLTQ 134
            +L ++   + Q
Sbjct: 119 GVLSNIKSDIGQ 130
>ref|NP_459413.1| (NC_003197) transcription termination; L factor [Salmonella
           typhimurium LT2]
 gb|AAL19372.1| (AE008715) transcription termination; L factor [Salmonella
           typhimurium LT2]
          Length = 139

 Score = 68.2 bits (165), Expect = 2e-11
 Identities = 38/126 (30%), Positives = 68/126 (53%), Gaps = 2/126 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           A R +AR   V+ LY+++    +I  +    L E+ +K+  + +   L +GV      +D
Sbjct: 4   AARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLSGVATNSAYLD 63

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEIG-FTPTQNPIIINECIELGKLYAEPNTPKF 120
            L++P+L     + LG +EKA+LR+  +E+   +     + INE IEL K +   ++ KF
Sbjct: 64  GLMKPYLSRL-LEELGQVEKAVLRIALFELSKRSDVPYKVAINEAIELAKTFGAEDSHKF 122

Query: 121 LNAILD 126
           +N +LD
Sbjct: 123 VNGVLD 128
>ref|NP_455012.1| (NC_003198) N utilization substance protein B [Salmonella enterica
           subsp. enterica serovar Typhi]
 emb|CAD08874.1| (AL627266) N utilization substance protein B [Salmonella enterica
           subsp. enterica serovar Typhi]
          Length = 139

 Score = 67.8 bits (164), Expect = 2e-11
 Identities = 38/126 (30%), Positives = 68/126 (53%), Gaps = 2/126 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           A R +AR   V+ LY+++    +I  +    L E+ +K+  + +   L +GV      +D
Sbjct: 4   AARHRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLSGVATNSAYLD 63

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEIG-FTPTQNPIIINECIELGKLYAEPNTPKF 120
            L++P+L     + LG +EKA+LR+  +E+   +     + INE IEL K +   ++ KF
Sbjct: 64  GLMKPYLSRL-LEELGQVEKAVLRIALFELSKRSDVPYKVAINEAIELAKTFGAEDSHKF 122

Query: 121 LNAILD 126
           +N +LD
Sbjct: 123 VNGVLD 128
>ref|NP_229562.1| (NC_000853) N utilization substance protein B [Thermotoga maritima]
 sp|Q9X286|NUSB_THEMA N utilization substance protein B homolog (NusB protein)
 pir||D72212 transcription termination factor nusB TM1765 [similarity] -
           Thermotoga maritima (strain MSB8)
 gb|AAD36829.1|AE001815_3 (AE001815) N utilization substance protein B [Thermotoga maritima]
          Length = 142

 Score = 67.4 bits (163), Expect = 3e-11
 Identities = 45/137 (32%), Positives = 74/137 (53%), Gaps = 9/137 (6%)

Query: 4   RTQARGAVVELLYAFE-SGNEEIKKIASSMLEE---KKIKNNQLAFALSLFNGVLEKINE 59
           R + R AV + L+  E   +E++++I   +L+E   KK K +    A     G+ E ++ 
Sbjct: 5   RRRMRLAVFKALFQHEFRRDEDLEQILEEILDETYDKKAKED----ARRYIRGIKENLSM 60

Query: 60  IDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTP 118
           ID LI  +L+ W   RL  +++ +LRL  YE+ F       + I+E IE+ K Y   N+ 
Sbjct: 61  IDDLISRYLEKWSLNRLSVVDRNVLRLATYELLFEKDIPIEVTIDEAIEIAKRYGTENSG 120

Query: 119 KFLNAILDSLSKKLTQK 135
           KF+N ILD ++K+   K
Sbjct: 121 KFVNGILDRIAKEHAPK 137
>ref|NP_357984.1| (NC_003098) Transcription termination protein [Streptococcus
           pneumoniae R6]
 gb|AAK99194.1| (AE008419) Transcription termination protein [Streptococcus
           pneumoniae R6]
          Length = 146

 Score = 65.5 bits (158), Expect = 1e-10
 Identities = 40/127 (31%), Positives = 67/127 (52%), Gaps = 2/127 (1%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINE 59
           + +R Q R    + L + E G +       +   +++  + QL AF + L +GV  K  E
Sbjct: 12  LESRRQLRKCAFQALMSLEFGTDVETACRFAYTHDREDTDVQLPAFLIDLVSGVQAKKEE 71

Query: 60  IDALIEPHLK-DWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTP 118
           +D  I  HLK  W  +RL  +E+ +LRLG +EI    T   + +NE IEL K +++  + 
Sbjct: 72  LDKQITQHLKAGWTIERLTLVERNLLRLGVFEITSFDTPQLVAVNEAIELAKDFSDQKSA 131

Query: 119 KFLNAIL 125
           +F+N +L
Sbjct: 132 RFINGLL 138
>ref|NP_344955.1| (NC_003028) N utilization substance protein B [Streptococcus
           pneumoniae TIGR4]
 gb|AAK74595.1| (AE007354) N utilization substance protein B [Streptococcus
           pneumoniae TIGR4]
          Length = 140

 Score = 65.5 bits (158), Expect = 1e-10
 Identities = 40/127 (31%), Positives = 67/127 (52%), Gaps = 2/127 (1%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINE 59
           + +R Q R    + L + E G +       +   +++  + QL AF + L +GV  K  E
Sbjct: 6   LESRRQLRKCAFQALMSLEFGTDVETACRFAYTHDREDTDVQLPAFLIDLVSGVQAKKEE 65

Query: 60  IDALIEPHLK-DWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTP 118
           +D  I  HLK  W  +RL  +E+ +LRLG +EI    T   + +NE IEL K +++  + 
Sbjct: 66  LDKQITQHLKAGWTIERLTLVERNLLRLGVFEITSFDTPQLVAVNEAIELAKDFSDQKSA 125

Query: 119 KFLNAIL 125
           +F+N +L
Sbjct: 126 RFINGLL 132
>ref|NP_348703.1| (NC_003030) Transcription termination factor NusB [Clostridium
           acetobutylicum]
 gb|AAK80043.1|AE007710_13 (AE007710) Transcription termination factor NusB [Clostridium
           acetobutylicum]
          Length = 135

 Score = 65.1 bits (157), Expect = 1e-10
 Identities = 38/133 (28%), Positives = 61/133 (45%), Gaps = 1/133 (0%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R ++R   ++LL+        I        E  +I+N    +   +  G+ E +  ID+ 
Sbjct: 3   RKKSREVAMKLLFEISINKNSISDTIEHYKENNEIENLDFEYIERILRGIDENMEYIDSK 62

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
           IE   K W   R+  +   ILR+ AYEI F       +  NE +EL K YAE N+  F+N
Sbjct: 63  IEESSKKWKISRISKINITILRMAAYEIFFEKDIPCKVSANEAVELAKSYAEENSFSFVN 122

Query: 123 AILDSLSKKLTQK 135
            ++ +L     +K
Sbjct: 123 GVIGNLINSSEEK 135
>ref|NP_439455.1| (NC_000907) N utilization substance protein B (nusB) [Haemophilus
           influenzae Rd]
 sp|P45150|NUSB_HAEIN N utilization substance protein B homolog (NusB protein)
 pir||D64115 transcription termination factor nusB - Haemophilus influenzae
           (strain Rd KW20)
 gb|AAC22951.1| (U32810) N utilization substance protein B (nusB) [Haemophilus
           influenzae Rd]
          Length = 144

 Score = 65.1 bits (157), Expect = 1e-10
 Identities = 37/135 (27%), Positives = 69/135 (50%), Gaps = 2/135 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           + R +AR   V+ LY++       +++  + + ++ +      +   LF   +E I  +D
Sbjct: 11  SARRRARECTVQALYSWAVSGNTAEQVELAFVLDQDMDGVDKPYFRKLFRQTIENIETVD 70

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKF 120
             I P++ D  F  L  +E AILRL  YE+ F       ++INE IE+ K++    + K+
Sbjct: 71  FSISPYI-DRAFDELDPIETAILRLAVYELRFELDVPYKVVINEAIEVAKVFGADESHKY 129

Query: 121 LNAILDSLSKKLTQK 135
           +N +LD ++  L +K
Sbjct: 130 INGVLDKIAPALGRK 144
>ref|NP_266850.1| (NC_002662) transcription termination protein NusB [Lactococcus
           lactis subsp. lactis]
 gb|AAK04792.1|AE006302_10 (AE006302) transcription termination protein NusB [Lactococcus
           lactis subsp. lactis]
          Length = 323

 Score = 64.7 bits (156), Expect = 2e-10
 Identities = 31/83 (37%), Positives = 55/83 (65%), Gaps = 1/83 (1%)

Query: 49  LFNGVLEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIE 107
           L +GVL+K  +++A I  +L K W F RL  +E+AIL++ +YEI +T T + + +NE +E
Sbjct: 241 LVDGVLDKKEDLEANISKYLTKTWSFSRLTLVEQAILQVSSYEILYTETPDVVAVNEAVE 300

Query: 108 LGKLYAEPNTPKFLNAILDSLSK 130
           L K +++  + +F+N +L +  K
Sbjct: 301 LSKDFSDEKSSRFINGVLTNFLK 323
>ref|NP_212241.1| (NC_001318) N-utilization substance protein B (nusB) [Borrelia
           burgdorferi]
 sp|O51134|NUSB_BORBU N utilization substance protein B homolog (NusB protein)
 pir||C70113 probable transcription termination factor nusB - Lyme disease
           spirochete
 gb|AAC66498.1| (AE001123) N-utilization substance protein B (nusB) [Borrelia
           burgdorferi]
          Length = 145

 Score = 62.8 bits (151), Expect = 6e-10
 Identities = 38/117 (32%), Positives = 64/117 (54%), Gaps = 3/117 (2%)

Query: 19  ESGNEEIKKIASSMLEEKKIKNNQL-AFALSLFNGVLEKINEIDALIEPHLKDWDFKRLG 77
           +S  ++I  I +   ++  I+N  + +F  SL  G  + +  ID+LI     +W  +R+ 
Sbjct: 22  QSAMDDIFDIFNIEDKDLDIENESIKSFYSSLVIGTFDNLEHIDSLIRDISLNWSLERMD 81

Query: 78  SMEKAILRLGAYEIGFTPTQNP--IIINECIELGKLYAEPNTPKFLNAILDSLSKKL 132
            ++ AILR+G Y + F   +N    II+E I + K Y   N+ KF+N ILD+L K +
Sbjct: 82  KVDLAILRMGVYSLKFQNFENSKRAIIDEAILIAKKYGSKNSYKFINGILDALLKNM 138
>ref|NP_252741.1| (NC_002516) NusB protein [Pseudomonas aeruginosa]
 pir||G83140 NusB protein PA4052 [imported] - Pseudomonas aeruginosa  (strain
           PAO1)
 gb|AAG07439.1|AE004821_12 (AE004821) NusB protein [Pseudomonas aeruginosa]
          Length = 159

 Score = 62.8 bits (151), Expect = 6e-10
 Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 2/132 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           A R +AR   V+ LY+++   + + +I +    +        A+   + +GV  + +E+D
Sbjct: 19  AARRKARSLAVQALYSWQIAGQPLHEIEAQFRTDNDFSEVDGAYFHEILHGVPRQKSELD 78

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYAEPNTPKF 120
           +  EP L D     +  +E AILRL  YE+         ++INE IEL K +   +  KF
Sbjct: 79  STFEPCL-DRPLAEIDPVELAILRLSTYELRNRIDVPYKVVINEGIELAKTFGATDGHKF 137

Query: 121 LNAILDSLSKKL 132
           +N +LD L+ +L
Sbjct: 138 VNGVLDKLAPRL 149
>ref|NP_607890.1| (NC_003485) putative transcriptional terminator [Streptococcus
           pyogenes MGAS8232]
 gb|AAL98389.1| (AE010095) putative transcriptional terminator [Streptococcus
           pyogenes MGAS8232]
          Length = 150

 Score = 62.4 bits (150), Expect = 8e-10
 Identities = 35/83 (42%), Positives = 49/83 (58%), Gaps = 2/83 (2%)

Query: 45  FALSLFNGVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIG-FTPTQNPIII 102
           F LSL  GV     E+D LI  HLK  W  +RL   +K +LRLG +EI  F  T + + +
Sbjct: 55  FLLSLVTGVNNHKEELDNLISTHLKKGWSLERLTLTDKTLLRLGLFEIKYFDETPDRVAL 114

Query: 103 NECIELGKLYAEPNTPKFLNAIL 125
           NE IE+ K Y++  + KF+N +L
Sbjct: 115 NEIIEVAKKYSDETSAKFINGLL 137
>ref|NP_636090.1| (NC_003902) transcription termination factor NusB [Xanthomonas
           campestris pv. campestris str. ATCC 33913]
 gb|AAM40014.1| (AE012168) transcription termination factor NusB [Xanthomonas
           campestris pv. campestris str. ATCC 33913]
          Length = 159

 Score = 62.0 bits (149), Expect = 1e-09
 Identities = 36/124 (29%), Positives = 68/124 (54%), Gaps = 2/124 (1%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R++AR   ++ +YA++      K++ +    E+  +   LA+  SL  GVL   +E+D  
Sbjct: 21  RSRARRRALQAVYAWQIAGGFAKQVIAQFAHEQAHEVADLAYFESLVEGVLSNRSELDTA 80

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
           + P+L D   + + ++E+A+LRL AYE+ +       ++INE IE  K +   +   ++N
Sbjct: 81  LTPYL-DRGVEEVDAIERAVLRLAAYELLYRQDVPYRVVINEAIETAKRFGSEHGHTYVN 139

Query: 123 AILD 126
            +LD
Sbjct: 140 GVLD 143
>ref|NP_372048.1| (NC_002758) hypothetical protein [Staphylococcus aureus subsp.
           aureus Mu50]
 ref|NP_374638.1| (NC_002745) hypothetical protein, simialr to transcription
           termination factor [Staphylococcus aureus subsp. aureus
           N315]
 ref|NP_646294.1| (NC_003923) ORFID:MW1477~hypothetical protein, similar to
           transcription termination factor [Staphylococcus aureus
           subsp. aureus MW2]
 dbj|BAB42617.1| (AP003134) ORFID:SA1355~hypothetical protein, similar to
           transcription termination factor [Staphylococcus aureus
           subsp. aureus N315]
 dbj|BAB57686.1| (AP003362) hypothetical protein [Staphylococcus aureus subsp.
           aureus Mu50]
 dbj|BAB95342.1| (AP004827) ORFID:MW1477~hypothetical protein, similar to
           transcription termination factor [Staphylococcus aureus
           subsp. aureus MW2]
          Length = 129

 Score = 62.0 bits (149), Expect = 1e-09
 Identities = 29/82 (35%), Positives = 49/82 (59%)

Query: 49  LFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIEL 108
           L +GV +    +D  I P+LKDW   RL   ++ ILR+  YEI  + T   +++NE +EL
Sbjct: 48  LVSGVKDHEPVLDETISPYLKDWTIARLLKTDRIILRMATYEILHSDTPAKVVMNEAVEL 107

Query: 109 GKLYAEPNTPKFLNAILDSLSK 130
            K +++ +  KF+N +L ++ K
Sbjct: 108 TKQFSDDDHYKFINGVLSNIKK 129
>ref|NP_269822.1| (NC_002737) putative transcriptional terminator [Streptococcus
           pyogenes] [Streptococcus pyogenes M1 GAS]
 gb|AAK34543.1| (AE006609) putative transcriptional terminator [Streptococcus
           pyogenes M1 GAS]
          Length = 150

 Score = 61.2 bits (147), Expect = 2e-09
 Identities = 35/83 (42%), Positives = 49/83 (58%), Gaps = 2/83 (2%)

Query: 45  FALSLFNGVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIG-FTPTQNPIII 102
           F LSL  GV     E+D LI  HLK  W  +RL   +K +LRLG +EI  F  T + + +
Sbjct: 55  FLLSLVTGVNNHKEELDNLISTHLKKGWSLERLTLTDKTLLRLGLFEIKYFDKTPDRVAL 114

Query: 103 NECIELGKLYAEPNTPKFLNAIL 125
           NE IE+ K Y++  + KF+N +L
Sbjct: 115 NEIIEVVKKYSDETSAKFINGLL 137
>ref|NP_625770.1| (NC_003888) putative NusB-family protein [Streptomyces coelicolor
           A3(2)]
 emb|CAB93370.1| (AL357523) putative NusB-family protein [Streptomyces coelicolor
           A3(2)]
          Length = 142

 Score = 60.8 bits (146), Expect = 2e-09
 Identities = 33/134 (24%), Positives = 66/134 (48%), Gaps = 4/134 (2%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQ---LAFALSLFNGVLEKI 57
           MA R  AR    ++L+  +    ++  + +  +   +    Q     + + L  G   + 
Sbjct: 1   MAARNTARKRAFQILFEGDQRGADVLTVLADWVRHSRSDTRQPPVSEYTMELVEGYAGRA 60

Query: 58  NEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPN 116
             ID LI  +  DW   R+  +++ ILRLGAYE+ +   T + ++++E ++L K ++   
Sbjct: 61  ERIDELIAQYSVDWTLDRMPVVDRNILRLGAYELLWVDATPDAVVLDEMVQLAKEFSTDE 120

Query: 117 TPKFLNAILDSLSK 130
           +P F+N +L  L +
Sbjct: 121 SPAFINGLLGRLKE 134
>ref|NP_658218.1| (NC_003995) NusB, NusB family [Bacillus anthracis A2012] [Bacillus
           anthracis str. A2012]
          Length = 130

 Score = 60.5 bits (145), Expect = 3e-09
 Identities = 38/131 (29%), Positives = 67/131 (51%), Gaps = 5/131 (3%)

Query: 4   RTQARGAVVELLYAFE-SGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDA 62
           R  AR   ++ LY  + +G  E K    + L+E +  N    F  SL  G +E    ID 
Sbjct: 3   RRTARERAMQALYQMDITGELEPKVAVENTLDEGEETNE---FLESLVVGFVENKEVIDE 59

Query: 63  LIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFL 121
            I  +LK W  +R+  ++++ILR+  YE+ +     + + INE IE+ K + +  + +F+
Sbjct: 60  AIRQNLKKWKLERISIVDRSILRVAVYEMKYMEEIPHNVTINEAIEIAKTFGDEESRRFI 119

Query: 122 NAILDSLSKKL 132
           N +L ++   L
Sbjct: 120 NGVLSNIKDTL 130
>ref|NP_385322.1| (NC_003047) PUTATIVE N UTILIZATION SUBSTANCE PROTEIN B
           [Sinorhizobium meliloti]
 emb|CAC45795.1| (AL591786) PUTATIVE N UTILIZATION SUBSTANCE PROTEIN B
           [Sinorhizobium meliloti]
          Length = 160

 Score = 60.5 bits (145), Expect = 3e-09
 Identities = 42/139 (30%), Positives = 69/139 (49%), Gaps = 10/139 (7%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKN--------NQLAFALSLFNGVLE 55
           R  AR A V+ LY  + G   + +I +   E +  K            ++  S+  GV+ 
Sbjct: 16  RGAARLAAVQALYQMDVGGTGVLEIVAEYEEHRLGKELDGDTYLRADASWFRSIVAGVVR 75

Query: 56  KINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
              ++D LI   L+D W   RL S  +AILR G +EI        P+I+ E +E+ K + 
Sbjct: 76  DQRKLDPLIGSALQDDWALSRLDSTVRAILRAGTFEILERKDVPVPVIVTEYVEIAKAFF 135

Query: 114 EPNTPKFLNAILDSLSKKL 132
           +   PK +NA+LD ++K++
Sbjct: 136 QDEEPKLVNAVLDRIAKQV 154
>ref|NP_662591.1| (NC_002932) N utilization substance protein B [Chlorobium tepidum
           TLS]
 gb|AAM72933.1| (AE012925) N utilization substance protein B [Chlorobium tepidum
           TLS]
          Length = 164

 Score = 60.5 bits (145), Expect = 3e-09
 Identities = 38/128 (29%), Positives = 67/128 (51%), Gaps = 3/128 (2%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKN-NQLAFALSLFNGVLEKINEIDA 62
           R Q R  +++ LY  E  + +    A+ +L ++ + + N + F   L   ++    EID 
Sbjct: 5   RRQLREKIIQALYTLELRDVDTDSAANWLLTKEIMDDPNAMKFFNHLMQSIVRNREEIDR 64

Query: 63  LIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-IIINECIELGKLYAEPN-TPKF 120
            I  H  +WD  R+  ++K ILR+   EI +     P + INE IE+ K ++  + + KF
Sbjct: 65  YIAKHTFNWDMSRIAIIDKNILRMALAEILYCEDIPPKVSINEAIEIAKKFSSTDKSSKF 124

Query: 121 LNAILDSL 128
           +N ILD++
Sbjct: 125 VNGILDAI 132
>ref|NP_243651.1| (NC_002570) transcriptional terminator [Bacillus halodurans]
 sp|Q9K965|NUSB_BACHD N utilization substance protein B homolog (NusB protein)
 dbj|BAB06504.1| (AP001516) transcriptional terminator [Bacillus halodurans]
          Length = 134

 Score = 60.5 bits (145), Expect = 3e-09
 Identities = 35/123 (28%), Positives = 64/123 (51%), Gaps = 4/123 (3%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R  +R   V+ LY  +  +  ++K   S+L+E +  ++   F   L +G +    E+D L
Sbjct: 3   RRLSRLRAVQALYQMDVIDTSMEKAIESVLDEGEEASS---FMSDLVSGTVTHQEELDRL 59

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPKFLN 122
              HL+ W   R+G++++AILR+  YE+ +       +  NE IEL K +   +  +F+N
Sbjct: 60  YADHLQGWTVDRIGNVDRAILRMALYELYYVDDIPKNVSFNEAIELAKAFGGEDAGRFIN 119

Query: 123 AIL 125
            +L
Sbjct: 120 GVL 122
>ref|NP_301448.1| (NC_002677) putative transcription termination protein
           [Mycobacterium leprae]
 sp|Q9CCR9|NUSB_MYCLE N utilization substance protein B homolog (NusB protein)
 emb|CAC30031.1| (AL583918) putative transcription termination protein
           [Mycobacterium leprae]
          Length = 190

 Score = 60.1 bits (144), Expect = 4e-09
 Identities = 38/126 (30%), Positives = 68/126 (53%), Gaps = 4/126 (3%)

Query: 4   RTQARGAVVELLYAFESGNE---EIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           R QAR   V+LL+  E+ +    EI ++ S++ + K        + + +  GV E    I
Sbjct: 8   RHQARKRAVDLLFEAEARDLSPLEIIEVRSALAKSKLDVAPLHPYTVVVAQGVSEHTARI 67

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFT-PTQNPIIINECIELGKLYAEPNTPK 119
           D LI  HL+ W   RL ++++AILR+  +E+ +      P+ ++E +EL K  +  ++P 
Sbjct: 68  DELIISHLQGWKLDRLPAVDRAILRVSIWELLYADDVPEPVAVDEAVELAKELSTDDSPG 127

Query: 120 FLNAIL 125
           F+N +L
Sbjct: 128 FVNGLL 133
>ref|NP_641104.1| (NC_003919) transcription termination factor NusB [Xanthomonas
           axonopodis pv. citri str. 306]
 gb|AAM35640.1| (AE011705) transcription termination factor NusB [Xanthomonas
           axonopodis pv. citri str. 306]
          Length = 159

 Score = 60.1 bits (144), Expect = 4e-09
 Identities = 35/124 (28%), Positives = 67/124 (53%), Gaps = 2/124 (1%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R++AR   ++ +YA++      K++ +    E+  +   LA+  +L  GVL    E+D  
Sbjct: 21  RSRARRRALQAVYAWQISGGFAKQVIAQFAHEQAHEVADLAYFENLVEGVLSNRAELDTA 80

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKFLN 122
           + P+L D   + + ++E+A+LRL AYE+ +       ++INE IE  K +   +   ++N
Sbjct: 81  LTPYL-DRSVEEVDAIERAVLRLAAYELLYRQDVPYRVVINEAIETAKRFGSEHGHTYVN 139

Query: 123 AILD 126
            +LD
Sbjct: 140 GVLD 143
>ref|NP_298245.1| (NC_002488) transcription termination factor [Xylella fastidiosa
           9a5c]
 pir||D82741 transcription termination factor XF0955 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF83765.1|AE003934_7 (AE003934) transcription termination factor [Xylella fastidiosa
           9a5c]
          Length = 157

 Score = 59.3 bits (142), Expect = 7e-09
 Identities = 36/126 (28%), Positives = 66/126 (51%), Gaps = 2/126 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           A R++AR   ++ +YA++      K++ +    E+  +   LA+   L  GVL    E+D
Sbjct: 20  ALRSRARRRALQAVYAWQISGGVAKQVIAHFAHEQAYEVADLAYFEDLVEGVLTHCAELD 79

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYEIGF-TPTQNPIIINECIELGKLYAEPNTPKF 120
             + P+L D   + + ++E+A+LRLGAYE+ +       ++INE I   K +       +
Sbjct: 80  EKLTPYL-DRTIEEVDAIERAVLRLGAYELLYRQDVPYRVVINEAIMTAKRFGSKYGHTY 138

Query: 121 LNAILD 126
           +N +LD
Sbjct: 139 VNGVLD 144
>ref|NP_295790.1| (NC_001263) N-utilization substance protein B [Deinococcus
           radiodurans]
 pir||E75318 transcription termination factor nusB DR2067 [similarity] -
           Deinococcus radiodurans (strain R1)
 gb|AAF11617.1|AE002043_2 (AE002043) N-utilization substance protein B [Deinococcus
           radiodurans]
          Length = 192

 Score = 58.9 bits (141), Expect = 9e-09
 Identities = 33/143 (23%), Positives = 71/143 (49%), Gaps = 8/143 (5%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKI---ASSMLEE-----KKIKNNQLAFALSLFNG 52
           + TR  AR  V  +L+  + G+  ++ +   A  ++ E      ++  + L FA  L  G
Sbjct: 12  VGTRRAAREFVFRVLFEADRGDVPLQAVFTRAEGVMREGDDTFPQLGPDALHFAEELVTG 71

Query: 53  VLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLY 112
           +      ID ++   ++ W F ++   +  +LRL  YE+ +TP  +P +I   + + + +
Sbjct: 72  LERHREAIDDVLHRTIRGWTFDQMAQTDLNVLRLATYELMYTPEPHPPVIESAVRIARKF 131

Query: 113 AEPNTPKFLNAILDSLSKKLTQK 135
              ++ +F+N +L  LS+ L ++
Sbjct: 132 GGDDSGRFVNGVLAGLSRNLREE 154
>ref|NP_441814.1| (NC_000911) N utilization substance protein B [Synechocystis sp.
           PCC 6803]
 sp|P74395|NUSB_SYNY3 N utilization substance protein B homolog (NusB protein)
 pir||S76233 transcription termination factor nusB sll0271 [similarity] -
           Synechocystis sp. (strain PCC 6803)
 dbj|BAA18492.1| (D90914) N utilization substance protein B [Synechocystis sp. PCC
           6803]
          Length = 275

 Score = 58.2 bits (139), Expect = 2e-08
 Identities = 28/89 (31%), Positives = 48/89 (53%)

Query: 45  FALSLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINE 104
           FAL L   V  +  +ID  ++  + DW   RL  +++ ILRL   E+ +      + INE
Sbjct: 176 FALELIGTVCRRRQQIDEQLQEAMVDWQLSRLAKIDQDILRLAIAELDYLGVPQKVAINE 235

Query: 105 CIELGKLYAEPNTPKFLNAILDSLSKKLT 133
            +EL K Y+  +  +F+N +L  +++K T
Sbjct: 236 AVELAKRYSGQDGHRFINGVLRRVTEKKT 264
>ref|NP_217049.1| (NC_000962) nusB [Mycobacterium tuberculosis H37Rv]
 sp|P95020|NUSB_MYCTU N utilization substance protein B homolog (NusB protein)
 pir||A70658 transcription termination factor nusB [similarity] - Mycobacterium
           tuberculosis (strain H37RV)
 pdb|1EYV|B Chain B, The Crystal Structure Of Nusb From Mycobacterium
           Tuberculosis
 pdb|1EYV|A Chain A, The Crystal Structure Of Nusb From Mycobacterium
           Tuberculosis
 emb|CAB06175.1| (Z83863) nusB [Mycobacterium tuberculosis H37Rv]
          Length = 156

 Score = 57.8 bits (138), Expect = 2e-08
 Identities = 36/126 (28%), Positives = 63/126 (49%), Gaps = 4/126 (3%)

Query: 4   RTQARGAVVELLYAFESGN---EEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           R QAR   V LL+  E       E+    +++ E K        +  ++  GV E    I
Sbjct: 10  RHQARKRAVALLFEAEVRGISAAEVVDTRAALAEAKPDIARLHPYTAAVARGVSEHAAHI 69

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPK 119
           D LI  HL+ W   RL ++++AILR+  +E +       P++++E ++L K  +  ++P 
Sbjct: 70  DDLITAHLRGWTLDRLPAVDRAILRVSVWELLHAADVPEPVVVDEAVQLAKELSTDDSPG 129

Query: 120 FLNAIL 125
           F+N +L
Sbjct: 130 FVNGVL 135
>ref|NP_337104.1| (NC_002755) N utilization substance protein B [Mycobacterium
           tuberculosis CDC1551]
 gb|AAK46918.1| (AE007097) N utilization substance protein B [Mycobacterium
           tuberculosis CDC1551]
          Length = 290

 Score = 57.8 bits (138), Expect = 2e-08
 Identities = 36/126 (28%), Positives = 63/126 (49%), Gaps = 4/126 (3%)

Query: 4   RTQARGAVVELLYAFESGN---EEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           R QAR   V LL+  E       E+    +++ E K        +  ++  GV E    I
Sbjct: 10  RHQARKRAVALLFEAEVRGISAAEVVDTRAALAEAKPDIARLHPYTAAVARGVSEHAAHI 69

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPK 119
           D LI  HL+ W   RL ++++AILR+  +E +       P++++E ++L K  +  ++P 
Sbjct: 70  DDLITAHLRGWTLDRLPAVDRAILRVSVWELLHAADVPEPVVVDEAVQLAKELSTDDSPG 129

Query: 120 FLNAIL 125
           F+N +L
Sbjct: 130 FVNGVL 135
>ref|NP_445404.1| (NC_002179) N utilization substance protein B, putative
           [Chlamydophila pneumoniae AR39]
 pir||B81530 N utilization substance protein B, probable CP0866 [imported] -
           Chlamydophila pneumoniae (strain AR39)
 gb|AAF38655.1| (AE002245) N utilization substance protein B, putative
           [Chlamydophila pneumoniae AR39]
          Length = 163

 Score = 55.8 bits (133), Expect = 8e-08
 Identities = 34/122 (27%), Positives = 61/122 (49%), Gaps = 1/122 (0%)

Query: 8   RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEPH 67
           R  ++++LYA +        +   ++ +  +    +  AL+    +LEK  E+D +I   
Sbjct: 29  REIILQMLYALDMAPSAEDSLVPLLMSQTAVSQKHVLVALNQTKSILEKSQELDLIIGNA 88

Query: 68  LKDWDFKRLGSMEKAILRLGAYEIGFTPTQN-PIIINECIELGKLYAEPNTPKFLNAILD 126
           LK+  F  L  +EK +LRL  +E  ++P  N  I+I E I L K ++      F+ AIL+
Sbjct: 89  LKNKSFDSLDLVEKNVLRLTLFEHFYSPPINKAILIAEAIRLVKKFSYSEACPFIQAILN 148

Query: 127 SL 128
            +
Sbjct: 149 DI 150
>ref|NP_225183.1| (NC_000922) CT832 hypothetical protein [Chlamydophila pneumoniae
           CWL029]
 ref|NP_301044.1| (NC_002491) CT832 hypothetical protein [Chlamydophila pneumoniae
           J138]
 sp|Q9Z6S0|NUSB_CHLPN N utilization substance protein B homolog (NusB protein)
 pir||F72010 CT832 hypothetical protein - Chlamydophila pneumoniae  (strain
           CWL029)
 gb|AAD19126.1| (AE001679) CT832 hypothetical protein [Chlamydophila pneumoniae
           CWL029]
 dbj|BAA99196.1| (AP002548) CT832 hypothetical protein [Chlamydophila pneumoniae
           J138]
          Length = 160

 Score = 55.8 bits (133), Expect = 8e-08
 Identities = 34/122 (27%), Positives = 61/122 (49%), Gaps = 1/122 (0%)

Query: 8   RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEPH 67
           R  ++++LYA +        +   ++ +  +    +  AL+    +LEK  E+D +I   
Sbjct: 26  REIILQMLYALDMAPSAEDSLVPLLMSQTAVSQKHVLVALNQTKSILEKSQELDLIIGNA 85

Query: 68  LKDWDFKRLGSMEKAILRLGAYEIGFTPTQN-PIIINECIELGKLYAEPNTPKFLNAILD 126
           LK+  F  L  +EK +LRL  +E  ++P  N  I+I E I L K ++      F+ AIL+
Sbjct: 86  LKNKSFDSLDLVEKNVLRLTLFEHFYSPPINKAILIAEAIRLVKKFSYSEACPFIQAILN 145

Query: 127 SL 128
            +
Sbjct: 146 DI 147
>ref|NP_531868.1| (NC_003304) N-utilization substance protein B [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
 gb|AAL42184.1| (AE009080) N-utilization substance protein B [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
          Length = 165

 Score = 55.5 bits (132), Expect = 1e-07
 Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 10/139 (7%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSM--------LEEKKIKNNQLAFALSLFNGVLE 55
           R  AR A V+ LY  + G   + ++ +          ++         ++  S+ +GV+ 
Sbjct: 21  RGAARLAAVQALYQMDVGGTGVMEVVAEYEAHRLGQEVDGDTYLKADPSWFRSIVSGVVR 80

Query: 56  KINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
              +ID L+   L +DW   RL +  +AILR G +EI         +I+ E +E+ + + 
Sbjct: 81  DQTKIDPLVRSALLEDWPLSRLDATVRAILRAGTFEILERKDVPVAVIVTEYVEIARAFF 140

Query: 114 EPNTPKFLNAILDSLSKKL 132
           E + PK +NA+LD ++K++
Sbjct: 141 EHDEPKLVNAVLDRIAKQV 159
>ref|NP_354190.1| (NC_003062) AGR_C_2167p [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 gb|AAK86975.1| (AE008046) AGR_C_2167p [Agrobacterium tumefaciens str. C58
           (Cereon)]
          Length = 169

 Score = 55.5 bits (132), Expect = 1e-07
 Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 10/139 (7%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSM--------LEEKKIKNNQLAFALSLFNGVLE 55
           R  AR A V+ LY  + G   + ++ +          ++         ++  S+ +GV+ 
Sbjct: 25  RGAARLAAVQALYQMDVGGTGVMEVVAEYEAHRLGQEVDGDTYLKADPSWFRSIVSGVVR 84

Query: 56  KINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
              +ID L+   L +DW   RL +  +AILR G +EI         +I+ E +E+ + + 
Sbjct: 85  DQTKIDPLVRSALLEDWPLSRLDATVRAILRAGTFEILERKDVPVAVIVTEYVEIARAFF 144

Query: 114 EPNTPKFLNAILDSLSKKL 132
           E + PK +NA+LD ++K++
Sbjct: 145 EHDEPKLVNAVLDRIAKQV 163
>ref|NP_231898.1| (NC_002505) N utilization substance protein B [Vibrio cholerae]
 pir||B82098 N utilization substance protein B VC2267 [imported] - Vibrio
           cholerae (group O1 strain N16961)
 gb|AAF95411.1| (AE004298) N utilization substance protein B [Vibrio cholerae]
          Length = 156

 Score = 55.1 bits (131), Expect = 1e-07
 Identities = 39/149 (26%), Positives = 69/149 (46%), Gaps = 16/149 (10%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQ--------------LAFAL 47
           A R  AR   ++ +Y+++   E +  I    L   K    +              +++  
Sbjct: 8   AARRNARQFALQAIYSWQITKENVATIEEQFLTSGKYDEEEHRAAEPALAAPETDVSYFR 67

Query: 48  SLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTP-TQNPIIINECI 106
            L  GV+   NE+D+ + P +     + L  ME A+LRL  YE+         ++INE I
Sbjct: 68  DLLAGVVLNHNELDSKLRPFVSR-PMQDLDMMELALLRLAMYEMTRREDVPYKVVINEAI 126

Query: 107 ELGKLYAEPNTPKFLNAILDSLSKKLTQK 135
           EL K++A  ++ KF+N +LD  +  + +K
Sbjct: 127 ELAKVFAAEDSHKFVNGVLDKAAPHVRKK 155
>ref|NP_108511.1| (NC_002678) N-utilization substance protein B [Mesorhizobium loti]
 dbj|BAB54297.1| (AP003014) N-utilization substance protein B [Mesorhizobium loti]
          Length = 155

 Score = 55.1 bits (131), Expect = 1e-07
 Identities = 38/139 (27%), Positives = 67/139 (47%), Gaps = 10/139 (7%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSM--------LEEKKIKNNQLAFALSLFNGVLE 55
           R  AR A V+ LY  +     + +I +          ++    +     +  ++  GV+E
Sbjct: 7   RGAARLAAVQALYQMDVAGSGVFEITAEYEAFRLGKEVDGALYREADAQWFRAILTGVVE 66

Query: 56  KINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELGKLYA 113
               ID +I   L D W   RL S  +AILR G YE+         +I++E +++ K + 
Sbjct: 67  DQKTIDPVIRQALTDDWPLSRLDSTLRAILRAGVYELMKREDVPVAVIVSEYVDIAKAFY 126

Query: 114 EPNTPKFLNAILDSLSKKL 132
           E + PK +NA+LD +S+++
Sbjct: 127 EEDEPKLVNAVLDRVSRRV 145
>ref|NP_420173.1| (NC_002696) N utilization substance protein B [Caulobacter
           crescentus CB15]
 gb|AAK23341.1| (AE005811) N utilization substance protein B [Caulobacter
           crescentus CB15]
          Length = 149

 Score = 54.7 bits (130), Expect = 2e-07
 Identities = 41/139 (29%), Positives = 69/139 (49%), Gaps = 14/139 (10%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLE---EKKIKNNQLA-----FALSLFNGVLE 55
           R+ AR A V+ LY  E     +  +     E   ++ ++  QLA     F   L  GV+ 
Sbjct: 9   RSVARLAAVQALYQMEVSGAGVDSVIREFGEHRFDRDVEGEQLAAADETFFADLARGVVT 68

Query: 56  KINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF---TPTQNPIIINECIELGKL 111
              +ID  I   L   W  +RL +  +A+LR GA+E+ +    PT+  ++INE +E+ K 
Sbjct: 69  NQAKIDQGIVKRLASGWRLERLDATARAVLRAGAFELMYRSDVPTE--VVINEYVEIAKS 126

Query: 112 YAEPNTPKFLNAILDSLSK 130
           + E     F+N  LD++++
Sbjct: 127 FFEGPESGFINGALDAIAR 145
>gb|AAB95441.1| (AF002857) NUSB [Shigella flexneri]
          Length = 101

 Score = 52.8 bits (125), Expect = 7e-07
 Identities = 28/90 (31%), Positives = 49/90 (54%), Gaps = 1/90 (1%)

Query: 2  ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
          A R +AR   V+ LY+++    +I  +    L E+ +K+  + +   L  GV  K   +D
Sbjct: 4  AARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLAGVATKTAYLD 63

Query: 62 ALIEPHLKDWDFKRLGSMEKAILRLGAYEI 91
           L++P+L     + LG +EKA+LR+  YE+
Sbjct: 64 GLMKPYLSRL-LEELGQVEKAVLRIALYEL 92
>ref|NP_485800.1| (NC_003272) transcription termination factor [Nostoc sp. PCC 7120]
 dbj|BAB73459.1| (AP003587) transcription termination factor [Nostoc sp. PCC 7120]
          Length = 211

 Score = 52.8 bits (125), Expect = 7e-07
 Identities = 28/85 (32%), Positives = 45/85 (52%)

Query: 45  FALSLFNGVLEKINEIDALIEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINE 104
           +A+ L   + E+ + ID  I   L DW   RL  +++ ILR+   E+ F    N + INE
Sbjct: 119 YAIKLVKIINEERSVIDEQITSALVDWQVTRLAQIDRDILRIAVAEMMFFNLPNSVAINE 178

Query: 105 CIELGKLYAEPNTPKFLNAILDSLS 129
            +EL K Y+     +F+N +L  +S
Sbjct: 179 AVELAKRYSGDEGHRFINGVLRRVS 203
>ref|NP_273725.1| (NC_003112) N utilization substance protein B [Neisseria
           meningitidis MC58]
 pir||A81172 transcription termination factor nusB NMB0683 [similarity] -
           Neisseria meningitidis (group B strain MD58)
 gb|AAF41101.1| (AE002422) N utilization substance protein B [Neisseria
           meningitidis MC58]
          Length = 141

 Score = 51.6 bits (122), Expect = 1e-06
 Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 2/130 (1%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R ++R   V+ +Y          +IA ++ E              LF G      E    
Sbjct: 5   RRRSRELAVQAVYQSLINRTAAPEIAKNIREMSDFAKADEELFNKLFFGTQTNAAEYIRQ 64

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTP-TQNPIIINECIELGKLYAEPNTPKFLN 122
           I P L D D K L  +E+A+L    +E+   P T  P+IINE IE+ K +   +  KF+N
Sbjct: 65  IRP-LLDRDEKDLNPIERAVLLTACHELSAMPETPYPVIINEAIEVTKTFGGTDGHKFVN 123

Query: 123 AILDSLSKKL 132
            ILD L+ ++
Sbjct: 124 GILDKLAAQI 133
>ref|NP_283676.1| (NC_003116) putative RNA polymerase antitermination factor
           [Neisseria meningitidis Z2491]
 pir||H81934 transcription termination factor nusB NMA0885 [similarity] -
           Neisseria meningitidis (group A strain Z2491)
 emb|CAB84165.1| (AL162754) putative RNA polymerase antitermination factor
           [Neisseria meningitidis Z2491]
          Length = 141

 Score = 51.2 bits (121), Expect = 2e-06
 Identities = 39/130 (30%), Positives = 59/130 (45%), Gaps = 2/130 (1%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           R ++R   V+ +Y          +IA ++ E              LF G      E    
Sbjct: 5   RRRSRELAVQAVYQSLINRTAAPEIAKNIREMPDFAKADEELFNKLFFGTQTNAAEYIRQ 64

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTP-TQNPIIINECIELGKLYAEPNTPKFLN 122
           I P L D D K L  +E+A+L    +E+   P T  P+IINE IE+ K +   +  KF+N
Sbjct: 65  IRP-LLDRDEKDLNPIERAVLLTACHELSAMPETPYPVIINEAIEVTKTFGGTDGHKFVN 123

Query: 123 AILDSLSKKL 132
            ILD L+ ++
Sbjct: 124 GILDKLAAQI 133
>ref|NP_600832.1| (NC_003450) COG0781:Transcription termination factor
           [Corynebacterium glutamicum]
 dbj|BAB99011.1| (AP005279) Transcription termination factor [Corynebacterium
           glutamicum ATCC 13032]
          Length = 227

 Score = 50.8 bits (120), Expect = 2e-06
 Identities = 34/138 (24%), Positives = 73/138 (52%), Gaps = 10/138 (7%)

Query: 3   TRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLA----FALSLFNGVLEKIN 58
           +R +AR   V++L+  ES + +   I     +  +  N  +A    +  ++ NGV  +++
Sbjct: 13  SRYKARMRAVDILFEAESRDVDPVAIIDDRHKLARDTNPIVAPVAEYTETIINGVAVELD 72

Query: 59  EIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF---TPTQNPIIINECIELGKLYAE 114
            +D  +  H+ + W   RL S+++AILR+ ++E+ +    P    I+  E +E+   Y+ 
Sbjct: 73  TLDVFLAEHIAETWTLGRLPSVDRAILRVASWEMIYNADVPVTTAIV--EAVEIASEYSG 130

Query: 115 PNTPKFLNAILDSLSKKL 132
             +  ++NA LD+++ K+
Sbjct: 131 DKSSAYINATLDAMASKV 148
>ref|NP_540103.1| (NC_003317) N UTILIZATION SUBSTANCE PROTEIN B [Brucella melitensis]
 gb|AAL52367.1| (AE009558) N UTILIZATION SUBSTANCE PROTEIN B [Brucella melitensis]
          Length = 171

 Score = 50.4 bits (119), Expect = 3e-06
 Identities = 28/80 (35%), Positives = 47/80 (58%), Gaps = 2/80 (2%)

Query: 52  GVLEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEI-GFTPTQNPIIINECIELG 109
           GV+E   ++D +I   L +DW   RL S  +AILR GA+E+         +I++E +++ 
Sbjct: 76  GVVEDQLKLDPMIHQALTEDWPLSRLDSTLRAILRAGAWELKARKDVPTAVIVSEYVDIA 135

Query: 110 KLYAEPNTPKFLNAILDSLS 129
           K +   + PK +NA+LD L+
Sbjct: 136 KAFYTEDEPKLVNAVLDRLA 155
>gb|AAF18280.1| (AF088897) N-utilization substance protein B [Zymomonas mobilis]
          Length = 129

 Score = 47.8 bits (112), Expect = 2e-05
 Identities = 29/102 (28%), Positives = 51/102 (49%), Gaps = 2/102 (1%)

Query: 33  LEEKKIKNNQLAFALSLFNGVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEI 91
           +E+      + +F   +  GV  +  EID +I  +L + W   RL    + ILR G YE+
Sbjct: 22  IEDATYTKAEPSFFDDIVRGVGTRCEEIDRVISENLSERWSLDRLDRPMRQILRAGTYEL 81

Query: 92  GFTP-TQNPIIINECIELGKLYAEPNTPKFLNAILDSLSKKL 132
              P      +I+E I++   + +     F+N +LD+++KKL
Sbjct: 82  LARPDVPTATVISEYIDVANAFYDRQEKNFVNGLLDTVAKKL 123
>ref|NP_296598.1| (NC_002620) N utilization substance protein B, putative [Chlamydia
           muridarum]
 sp|Q9PL88|NUSB_CHLMU N utilization substance protein B homolog (NusB protein)
 pir||A81727 transcription termination factor nusB TC0219 [similarity] -
           Chlamydia muridarum (strain Nigg)
 gb|AAF39091.1| (AE002289) N utilization substance protein B, putative [Chlamydia
           muridarum]
          Length = 164

 Score = 45.4 bits (106), Expect = 1e-04
 Identities = 31/137 (22%), Positives = 62/137 (44%), Gaps = 4/137 (2%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           + + R  V++ LYA E   +    + S ++ E  +    + +AL     +    +E+DAL
Sbjct: 23  KQKLRELVLQALYALEMAPKGEDSLVSLLMTEASVSKKNVLYALMFCKAIRANQSELDAL 82

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPI----IINECIELGKLYAEPNTPK 119
           +   ++      L  +E+ ILR+  +E       +PI    +I E   L K ++      
Sbjct: 83  LNATIRTTTLANLTIIERNILRMMLFEHQQNQESSPIPTAVLIAETTRLIKKFSYVEGSS 142

Query: 120 FLNAILDSLSKKLTQKP 136
            + A+L S+  ++ Q+P
Sbjct: 143 LILAVLGSIFDQVAQEP 159
>pir||T05067 hypothetical protein M3E9.200 - Arabidopsis thaliana
 emb|CAA18233.1| (AL022223) putative protein [Arabidopsis thaliana]
 emb|CAB79492.1| (AL161565) putative protein [Arabidopsis thaliana]
          Length = 286

 Score = 45.4 bits (106), Expect = 1e-04
 Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 2/90 (2%)

Query: 43  LAFALSLFNGVLEKINEIDALIEP-HLKDWDFKRLGS-MEKAILRLGAYEIGFTPTQNPI 100
           L FA  L   V++K +    +IE     DW     G  +E +IL L   E+    T++PI
Sbjct: 178 LRFAKKLLAAVVDKWDSHVVIIEKISPPDWKSAPAGRILEFSILHLAMSEVAVLETRHPI 237

Query: 101 IINECIELGKLYAEPNTPKFLNAILDSLSK 130
           +INE ++L K + + + P+ +N  L +  K
Sbjct: 238 VINEAVDLAKRFCDGSAPRIINGCLRTFVK 267
>ref|NP_567745.1| (NM_118770) putative protein [Arabidopsis thaliana]
 gb|AAK96755.1| (AY054564) putative protein [Arabidopsis thaliana]
          Length = 301

 Score = 45.4 bits (106), Expect = 1e-04
 Identities = 28/90 (31%), Positives = 46/90 (51%), Gaps = 2/90 (2%)

Query: 43  LAFALSLFNGVLEKINEIDALIEP-HLKDWDFKRLGS-MEKAILRLGAYEIGFTPTQNPI 100
           L FA  L   V++K +    +IE     DW     G  +E +IL L   E+    T++PI
Sbjct: 193 LRFAKKLLAAVVDKWDSHVVIIEKISPPDWKSAPAGRILEFSILHLAMSEVAVLETRHPI 252

Query: 101 IINECIELGKLYAEPNTPKFLNAILDSLSK 130
           +INE ++L K + + + P+ +N  L +  K
Sbjct: 253 VINEAVDLAKRFCDGSAPRIINGCLRTFVK 282
>ref|NP_078134.1| (NC_002162) transcription termination factor [Ureaplasma
           urealyticum]
 pir||A82909 transcription termination factor UU300 [imported] - Ureaplasma
           urealyticum
 gb|AAF30709.1|AE002127_7 (AE002127) transcription termination factor [Ureaplasma
           urealyticum]
          Length = 127

 Score = 45.1 bits (105), Expect = 1e-04
 Identities = 26/77 (33%), Positives = 43/77 (55%), Gaps = 1/77 (1%)

Query: 53  VLEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKL 111
           +L+   ++  +I+P + KDW F+RL  +E+A+L     E     T   III++ +     
Sbjct: 49  ILDNYEQLTKMIKPLISKDWTFERLSYVEQALLLSAYGEYLVLKTPKKIIIDQTLITTHN 108

Query: 112 YAEPNTPKFLNAILDSL 128
           Y+   + KF+NAILD L
Sbjct: 109 YSNNESYKFINAILDQL 125
>ref|NP_359841.1| (NC_003103) N utilization substance protein B [Rickettsia conorii]
 sp|Q92J65|NUSB_RICCN N utilization substance protein B homolog (NusB protein)
 gb|AAL02742.1| (AE008588) N utilization substance protein B [Rickettsia conorii]
          Length = 156

 Score = 43.9 bits (102), Expect = 3e-04
 Identities = 38/143 (26%), Positives = 70/143 (48%), Gaps = 15/143 (10%)

Query: 7   ARGAVVELLYA-FESGNEEIKKIASSMLEEKKIKN------NQLAFALS------LFNGV 53
           AR A V+ +Y      N+++  I  ++L   +  N        L  +LS      L   V
Sbjct: 12  ARIAAVQAIYQNILQNNDDMDDIMQNVLSFYQNNNAITDLPENLKISLSISHFKMLVKSV 71

Query: 54  LEKINEIDALIEPHL-KDWDFKRLGSMEKAILRLGAYEIGFTPTQNP-IIINECIELGKL 111
            E I+++D +I+ HL  D D   +  + +A+LR+   E+ F PT    ++INE  ++   
Sbjct: 72  FENIHKLDEIIDNHLTNDKDPAHMPILLRALLRVSICELLFCPTTPAKVVINEYTDIAND 131

Query: 112 YAEPNTPKFLNAILDSLSKKLTQ 134
               +   F+N++LD ++K+ T+
Sbjct: 132 MLNEHEIGFVNSVLDKIAKEHTR 154
>ref|NP_220353.1| (NC_000117) Transcription termination factor [Chlamydia
           trachomatis]
 sp|O84839|NUSB_CHLTR N utilization substance protein B homolog (NusB protein)
 pir||H71464 transcription termination factor nusB CT832 [similarity] -
           Chlamydia trachomatis (serotype D, strain UW3/Cx)
 gb|AAC68429.1| (AE001356) Transcription termination factor [Chlamydia trachomatis]
          Length = 168

 Score = 42.0 bits (97), Expect = 0.001
 Identities = 30/129 (23%), Positives = 57/129 (43%), Gaps = 4/129 (3%)

Query: 4   RTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDAL 63
           + + R  V++ LYA E   E    + S ++ E  +     A+AL     +     ++DAL
Sbjct: 21  KQKLRELVLQALYALEIDPEGEDSLVSLLMTEASVSKKNAAYALMFCRAIRANQPDLDAL 80

Query: 64  IEPHLKDWDFKRLGSMEKAILRLGAYE----IGFTPTQNPIIINECIELGKLYAEPNTPK 119
           ++  ++     RL  +E+ ILR+  +E        P    ++I E   L K ++      
Sbjct: 81  LDATIRTTTLARLTIIERNILRMMLFEHQQNQDCCPVPVAVLIAETTRLIKKFSYSEGSS 140

Query: 120 FLNAILDSL 128
            + A+L S+
Sbjct: 141 LILAVLGSI 149
>ref|NP_518832.1| (NC_003295) PROBABLE N UTILIZATION SUBSTANCE B (TRANSCRIPTIONAL
           ANTITERMINATOR)(L FACTOR) TRANSCRIPTION REGULATOR
           PROTEIN [Ralstonia solanacearum]
 emb|CAD14241.1| (AL646060) PROBABLE N UTILIZATION SUBSTANCE B (TRANSCRIPTIONAL
           ANTITERMINATOR)(L FACTOR) TRANSCRIPTION REGULATOR
           PROTEIN [Ralstonia solanacearum]
          Length = 161

 Score = 41.2 bits (95), Expect = 0.002
 Identities = 30/132 (22%), Positives = 60/132 (44%), Gaps = 2/132 (1%)

Query: 2   ATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEID 61
           + R +AR   ++ LY +     +   + + + + +       A   +L +G + +   + 
Sbjct: 22  SARRRARELALQGLYQWLLNRNDPGVVEAHLHDAQGFNKADRAHFDALLHGAIREEATLT 81

Query: 62  ALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPKF 120
               P L D     L  +E+A L +GAYE +        ++INE +EL K +      K+
Sbjct: 82  ESFTPFL-DRPVAELSPVERAALLVGAYELVHCVDIPYKVVINEAVELAKTFGGVEGYKY 140

Query: 121 LNAILDSLSKKL 132
           +N +LD L+ ++
Sbjct: 141 VNGVLDKLAAQV 152
>ref|NP_600813.1| (NC_003450) COG0144:tRNA and rRNA cytosine-C5-methylases
           [Corynebacterium glutamicum]
          Length = 511

 Score = 40.0 bits (92), Expect = 0.004
 Identities = 30/128 (23%), Positives = 57/128 (44%), Gaps = 9/128 (7%)

Query: 8   RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEP- 66
           R    E+L    +G      +   +L +  +     AFA  +  G L  +  +D +I+  
Sbjct: 77  REIAFEVLDRVRTGEAYANLVLPRLLSKHNLSGRDAAFATEITYGTLRNVGLLDEVIKAA 136

Query: 67  ---HLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKFLNA 123
               L D D + L      +LRLGAY++ FT  ++   ++  +++     +     F NA
Sbjct: 137 SGRELSDIDPEVLD-----VLRLGAYQVMFTRVEDHAAVDTSVKMVGGLKKFQATGFANA 191

Query: 124 ILDSLSKK 131
           IL ++++K
Sbjct: 192 ILRNITRK 199
>ref|NP_220552.1| (NC_000963) N UTILIZATION SUBSTANCE PROTEIN B (nusB) [Rickettsia
           prowazekii]
 pir||F71726 transcription termination factor nusB RP162 - Rickettsia prowazekii
 emb|CAA14629.1| (AJ235270) N UTILIZATION SUBSTANCE PROTEIN B (nusB) [Rickettsia
           prowazekii]
          Length = 174

 Score = 40.0 bits (92), Expect = 0.004
 Identities = 27/99 (27%), Positives = 55/99 (55%), Gaps = 6/99 (6%)

Query: 39  KNNQLAFALSLFN----GVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF 93
           KN +++ ++S F      V E IN++D +I+ HL +  D   +  + +A+LR+   E+ F
Sbjct: 71  KNFKISLSISHFKMLVKSVFENINKLDEIIDNHLTNAKDSVHMPILLRALLRVSICELLF 130

Query: 94  -TPTQNPIIINECIELGKLYAEPNTPKFLNAILDSLSKK 131
            + T   ++INE  ++       +   F+N+ILD ++++
Sbjct: 131 CSTTPAKVVINEYTDIANDLLNEHEIGFVNSILDKIAQE 169
>sp|Q9ZE01|NUSB_RICPR N utilization substance protein B homolog (NusB protein)
          Length = 155

 Score = 40.0 bits (92), Expect = 0.004
 Identities = 27/99 (27%), Positives = 55/99 (55%), Gaps = 6/99 (6%)

Query: 39  KNNQLAFALSLFN----GVLEKINEIDALIEPHLKD-WDFKRLGSMEKAILRLGAYEIGF 93
           KN +++ ++S F      V E IN++D +I+ HL +  D   +  + +A+LR+   E+ F
Sbjct: 52  KNFKISLSISHFKMLVKSVFENINKLDEIIDNHLTNAKDSVHMPILLRALLRVSICELLF 111

Query: 94  -TPTQNPIIINECIELGKLYAEPNTPKFLNAILDSLSKK 131
            + T   ++INE  ++       +   F+N+ILD ++++
Sbjct: 112 CSTTPAKVVINEYTDIANDLLNEHEIGFVNSILDKIAQE 150
>dbj|BAB98992.1| (AP005279) tRNA and rRNA cytosine-C5-methylases [Corynebacterium
           glutamicum ATCC 13032]
          Length = 444

 Score = 40.0 bits (92), Expect = 0.004
 Identities = 30/128 (23%), Positives = 57/128 (44%), Gaps = 9/128 (7%)

Query: 8   RGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEIDALIEP- 66
           R    E+L    +G      +   +L +  +     AFA  +  G L  +  +D +I+  
Sbjct: 10  REIAFEVLDRVRTGEAYANLVLPRLLSKHNLSGRDAAFATEITYGTLRNVGLLDEVIKAA 69

Query: 67  ---HLKDWDFKRLGSMEKAILRLGAYEIGFTPTQNPIIINECIELGKLYAEPNTPKFLNA 123
               L D D + L      +LRLGAY++ FT  ++   ++  +++     +     F NA
Sbjct: 70  SGRELSDIDPEVLD-----VLRLGAYQVMFTRVEDHAAVDTSVKMVGGLKKFQATGFANA 124

Query: 124 ILDSLSKK 131
           IL ++++K
Sbjct: 125 ILRNITRK 132
>ref|NP_661503.1| (NC_002932) Sun protein [Chlorobium tepidum TLS]
 gb|AAM71845.1| (AE012834) Sun protein [Chlorobium tepidum TLS]
          Length = 428

 Score = 38.9 bits (89), Expect = 0.010
 Identities = 31/131 (23%), Positives = 60/131 (45%), Gaps = 7/131 (5%)

Query: 1   MATRTQARGAVVELLYAFESGNEEIKKIASSMLEEKKIKNNQLAFALSLFNGVLEKINEI 60
           M  R  A   ++EL      G  + +++ + M E   +  N  A A  L  G L+   + 
Sbjct: 1   MTARELALRVLLEL-----DGMRKSEELLNRMHEHAGLGKNDRALAKELVAGTLKYRLQC 55

Query: 61  DALIEPHLKDWDFKRLGSMEKAILRLGAYE-IGFTPTQNPIIINECIELGKLYAEPNTPK 119
           D +I    +  D+ +  ++ K ILRLG Y+ +          +NE ++L + +   +  +
Sbjct: 56  DFIIARFYRH-DYAKAATVLKHILRLGVYQLLRLDRVPKSAAVNESVKLARKFKGDHLAR 114

Query: 120 FLNAILDSLSK 130
            +N +L ++SK
Sbjct: 115 LVNGLLRNISK 125
  Database: /home/scwang/download_20020708_db/nr
    Posted date:  Aug 7, 2002 12:55 PM
  Number of letters in database: 324,149,939
  Number of sequences in database:  1,026,957
  
Lambda     K      H
   0.316    0.135    0.372 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 81,846,108
Number of Sequences: 1026957
Number of extensions: 3220351
Number of successful extensions: 8799
Number of sequences better than 1.0e-02: 68
Number of HSP's better than  0.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 39
Number of HSP's that attempted gapping in prelim test: 8713
Number of HSP's gapped (non-prelim): 68
length of query: 138
length of database: 324,149,939
effective HSP length: 114
effective length of query: 24
effective length of database: 207,076,841
effective search space: 4969844184
effective search space used: 4969844184
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 89 (38.9 bits)