BLASTP 2.2.1 [Apr-13-2001]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|15645134|ref|NP_207304.1| conserved hypothetical
protein [Helicobacter pylori 26695]
         (212 letters)

Database: /home/scwang/download_20020708_db/nr
           1,026,957 sequences; 324,149,939 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_207304.1|  (NC_000915) conserved hypothetical protein...   429  e-120
ref|NP_223175.1|  (NC_000921) putative [Helicobacter pylori ...   412  e-114
gb|AAD15563.1|  (AF111170) unknown [Homo sapiens]                 142  2e-33
ref|NP_281634.1|  (NC_002163) hypothetical protein Cj0447 [C...   141  6e-33
dbj|BAB23110.1|  (AK003991) data source:SPTR, source key:O95...   132  3e-30
dbj|BAB28691.1|  (AK013174) data source:SPTR, source key:O95...   130  1e-29
gb|AAF56679.1|  (AE003759) CG6001 gene product [Drosophila m...   121  6e-27
gb|EAA08524.1|  (AAAB01008880) agCP2498 [Anopheles gambiae s...   111  5e-24
pir||T20117  hypothetical protein C50F4.11 - Caenorhabditis ...   102  2e-21
ref|NP_505461.1|  (NM_073060) C50F4.16.p [Caenorhabditis ele...   101  5e-21
ref|NP_457011.1|  (NC_003198) conserved hypothetical protein...    69  3e-11
ref|NP_289019.1|  (NC_002655) orf, hypothetical protein [Esc...    64  9e-10
ref|NP_416962.1|  (NC_000913) orf, hypothetical protein [Esc...    64  1e-09
ref|NP_348478.1|  (NC_003030) Nudix (MutT) family hydrolase ...    60  2e-08
ref|NP_406525.1|  (NC_003143) conserved hypothetical protein...    59  3e-08
ref|XP_138284.1|  (XM_138284) similar to RIKEN cDNA 1110030M...    59  4e-08
gb|AAB46945.1|  (L34011) ORF; putative [Escherichia coli]          58  8e-08
ref|NP_354155.1|  (NC_003062) AGR_C_2106p [Agrobacterium tum...    53  2e-06
ref|NP_531834.1|  (NC_003304) NTP pyrophosphohydrolase, MutT...    53  2e-06
ref|NP_637907.1|  (NC_003902) conserved hypothetical protein...    50  2e-05
ref|NP_562814.1|  (NC_003366) conserved hypothetical protein...    50  2e-05
ref|NP_385285.1|  (NC_003047) CONSERVED HYPOTHETICAL PROTEIN...    49  4e-05
ref|NP_643040.1|  (NC_003919) conserved hypothetical protein...    47  2e-04
ref|NP_558683.1|  (NC_003364) ADP ribose hydrolase, putative...    47  2e-04
ref|NP_253658.1|  (NC_002516) conserved hypothetical protein...    44  0.001
ref|NP_350184.1|  (NC_003030) Nudix (MutT) family hydrolase ...    42  0.006
ref|NP_343553.1|  (NC_002754) Conserved hypothetical protein...    42  0.006

>ref|NP_207304.1| (NC_000915) conserved hypothetical protein [Helicobacter pylori
           26695]
 pir||C64583 conserved hypothetical protein HP0507 - Helicobacter pylori
           (strain 26695)
 gb|AAD07572.1| (AE000565) conserved hypothetical protein [Helicobacter pylori
           26695]
          Length = 212

 Score =  429 bits (1104), Expect = e-120
 Identities = 212/212 (100%), Positives = 212/212 (100%)

Query: 1   MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
           MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY
Sbjct: 1   MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60

Query: 61  EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120
           EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL
Sbjct: 61  EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120

Query: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER 180
           EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER
Sbjct: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER 180

Query: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
           SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV
Sbjct: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
>ref|NP_223175.1| (NC_000921) putative [Helicobacter pylori J99]
 pir||F71928 hypothetical protein jhp0457 - Helicobacter pylori (strain J99)
 gb|AAD06038.1| (AE001480) putative [Helicobacter pylori J99]
          Length = 212

 Score =  412 bits (1059), Expect = e-114
 Identities = 204/212 (96%), Positives = 207/212 (97%)

Query: 1   MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
           MSYFKN FNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY
Sbjct: 1   MSYFKNIFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60

Query: 61  EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120
           EKESDCFVIVKQFRPAIYAR F+FK DQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL
Sbjct: 61  EKESDCFVIVKQFRPAIYARNFYFKRDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120

Query: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER 180
           EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAE H+ LKVSKGGGIDTE+IEVLFLER
Sbjct: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEAHEGLKVSKGGGIDTEKIEVLFLER 180

Query: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
           SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV
Sbjct: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
>gb|AAD15563.1| (AF111170) unknown [Homo sapiens]
          Length = 290

 Score =  142 bits (358), Expect = 2e-33
 Identities = 81/203 (39%), Positives = 114/203 (55%), Gaps = 18/203 (8%)

Query: 24  CSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYA---- 79
           C++S ++    +HY +   +K+WD +K+ DSV VLL+       V+VKQFRPA+YA    
Sbjct: 80  CAASPYLRPLTLHYRQNGAQKSWDFMKTHDSVTVLLFNSSRRSLVLVKQFRPAVYAGEVE 139

Query: 80  RRFHFKC---DQDQTID---------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
           RRF       DQD   +         G T ELCAGLVD+   SLEE+AC+EA EECGY +
Sbjct: 140 RRFPGSLAAVDQDGPRELQPALPGSAGVTVELCAGLVDQPGLSLEEVACKEAWEECGYHL 199

Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALD 185
           +P +L  +  ++S  GL+GS QT++Y EV    +   GGG+  + E IEV+ L    A  
Sbjct: 200 APSDLRRVATYWSGVGLTGSRQTMFYTEVTDAQRSGPGGGLVEEGELIEVVHLPLEGAQA 259

Query: 186 FIMDFQYAKTTGLSLAILWHLKK 208
           F  D    KT G+   + W L +
Sbjct: 260 FADDPDIPKTLGVIFGVSWFLSQ 282
>ref|NP_281634.1| (NC_002163) hypothetical protein Cj0447 [Campylobacter jejuni]
 pir||B81389 hypothetical protein Cj0447 [imported] - Campylobacter jejuni
           (strain NCTC 11168)
 emb|CAB75085.1| (AL139075) hypothetical protein Cj0447 [Campylobacter jejuni]
          Length = 198

 Score =  141 bits (355), Expect = 6e-33
 Identities = 84/190 (44%), Positives = 113/190 (59%), Gaps = 21/190 (11%)

Query: 27  SNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKC 86
           SN+I+ KR  Y       TWD I+S DSV+VLLY KE + F+ V+QFR  ++  + H   
Sbjct: 14  SNYIKPKRFAYESNGRLCTWDFIESKDSVSVLLYHKELESFIFVRQFRIPLWYHQMH--- 70

Query: 87  DQDQTID---GYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATG 143
           D+D   D   GYT ELC+GLVDK   SLEEIA EE +EE GY  +PKNLE IG FY+  G
Sbjct: 71  DKDYVKDDDMGYTIELCSGLVDK-KLSLEEIAKEECIEELGY--APKNLEKIGDFYTGFG 127

Query: 144 LSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFIMDFQ-----YAKTTGL 198
              S Q+ Y+ EV +  K+S GGG+D E IE ++++       + DF+       +T  L
Sbjct: 128 SGVSKQSFYFVEVDEKDKISSGGGVDDEEIEAVYVK-------VQDFEKKCKNMIRTPLL 180

Query: 199 SLAILWHLKK 208
             A +W LK+
Sbjct: 181 DFAYMWFLKE 190
>dbj|BAB23110.1| (AK003991) data source:SPTR, source key:O95848,
           evidence:ISS~homolog to HYPOTHETICAL 31.5 KDA
           PROTEIN~putative [Mus musculus]
 gb|AAH25444.1| (BC025444) RIKEN cDNA 1110030M18 gene [Mus musculus]
          Length = 222

 Score =  132 bits (332), Expect = 3e-30
 Identities = 76/199 (38%), Positives = 110/199 (55%), Gaps = 18/199 (9%)

Query: 24  CSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRF- 82
           C+ S ++    +HY ++  +K+WD +K+ DSV +L++       V+VKQFRPA+YA    
Sbjct: 12  CAHSPYLRPFTLHYRQDGVQKSWDFMKTHDSVTILMFNSSRRSLVLVKQFRPAVYAGEVE 71

Query: 83  -HFK-----CDQDQTID---------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
            HF       +QDQ  +         G   ELCAG+VD+   SLEE AC+EA EECGY++
Sbjct: 72  RHFPGSLTAVNQDQPQELQQALPGSAGVMVELCAGIVDQPGLSLEEAACKEAWEECGYRL 131

Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALD 185
            P +L  +  + S  GL+ S QT++YAEV    +   GGG+  + E IEV+ L    A  
Sbjct: 132 VPTDLRRVATYMSGVGLTSSRQTMFYAEVTDAQRGGPGGGLAEEGELIEVIHLNLDDAQA 191

Query: 186 FIMDFQYAKTTGLSLAILW 204
           F  +    KT G+  AI W
Sbjct: 192 FADNPDIPKTLGVIYAISW 210
>dbj|BAB28691.1| (AK013174) data source:SPTR, source key:O95848,
           evidence:ISS~homolog to HYPOTHETICAL 31.5 KDA
           PROTEIN~putative [Mus musculus]
          Length = 223

 Score =  130 bits (326), Expect = 1e-29
 Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 18/199 (9%)

Query: 24  CSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRF- 82
           C+ S ++    +HY ++  +K+WD +K+ DSV +L++       V+VKQFRPA+YA    
Sbjct: 13  CAHSPYLRPFTLHYRQDGVQKSWDFMKTHDSVTILMFNSSRRSLVLVKQFRPAVYAGEVE 72

Query: 83  -HFK-----CDQDQTID---------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
            HF       +QDQ  +         G   ELCAG+VD+   SLEE AC+EA EECGY++
Sbjct: 73  RHFPGSLTAVNQDQPQELQQALPGSAGVMVELCAGIVDQPGLSLEEAACKEAWEECGYRL 132

Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALD 185
            P +L  +  + S  GL+ S QT++YAEV    +   GGG+  + E IEV+ L    A  
Sbjct: 133 VPTDLRRVATYMSGVGLTSSRQTMFYAEVTDAQRGGPGGGLAEEGELIEVIHLNLDDAQA 192

Query: 186 FIMDFQYAKTTGLSLAILW 204
              +    KT G+  AI W
Sbjct: 193 IADNPDIPKTLGVIYAISW 211
>gb|AAF56679.1| (AE003759) CG6001 gene product [Drosophila melanogaster]
          Length = 1351

 Score =  121 bits (303), Expect = 6e-27
 Identities = 70/176 (39%), Positives = 100/176 (56%), Gaps = 10/176 (5%)

Query: 17   SSVYLEPC-SSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRP 75
            S ++L P    S +++  R++Y +   +K WD++K  DSVA++LY       V+V+QFRP
Sbjct: 1144 SKIWLGPLPQDSPYVKPFRLYYVQNGVEKNWDLLKVHDSVAIILYNTSRQKLVLVRQFRP 1203

Query: 76   AIYARRFHFKCDQDQTID--------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
            A+Y             +D        G T ELCAG+VDK NKS  EIA EE +EECGY +
Sbjct: 1204 AVYHGIISSAKGTFDEVDLKEFPPAIGVTLELCAGIVDK-NKSWVEIAREEVVEECGYDV 1262

Query: 128  SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKA 183
              + +E +  + S  G SG+ QT+YY EV    K + GGG+D E IEV+ L   +A
Sbjct: 1263 PVERIEEVMVYRSGVGSSGAKQTMYYCEVTDADKATGGGGVDDEIIEVVELSLEEA 1318

 Score =  120 bits (300), Expect = 1e-26
 Identities = 76/182 (41%), Positives = 104/182 (56%), Gaps = 16/182 (8%)

Query: 17   SSVYLEPC-SSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRP 75
            S ++  P    SN+I+  R+HY E + +K  DIIK++D V V+LY K  +  + V+QFR 
Sbjct: 941  SKIWFGPMPKDSNWIKPGRLHYIENDVEKQVDIIKTIDGVVVILYNKAREKLIFVRQFRG 1000

Query: 76   AIYARRFHFKCDQDQTID-----------GYTYELCAGLVDKANKSLEEIACEEALEECG 124
            A+Y +  H     D +             G T ELC G VDK +KSL EIA EE LEECG
Sbjct: 1001 AVY-QGIHSAGSPDMSKGEADLEQFPPEVGVTLELCGGAVDK-DKSLAEIAKEEVLEECG 1058

Query: 125  YQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVL--FLERSK 182
            Y++  ++L+ +  + S  G S S  +L+Y EV    KVS GGGI  ERI+VL   LE S+
Sbjct: 1059 YEVPTESLQHVYDYRSGIGTSSSAMSLFYCEVCDAQKVSAGGGIGEERIQVLEMSLEESR 1118

Query: 183  AL 184
             L
Sbjct: 1119 QL 1120
>gb|EAA08524.1| (AAAB01008880) agCP2498 [Anopheles gambiae str. PEST]
          Length = 238

 Score =  111 bits (278), Expect = 5e-24
 Identities = 65/168 (38%), Positives = 98/168 (57%), Gaps = 10/168 (5%)

Query: 25  SSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHF 84
           + S +++  R HY +   +K+WD++K  DSV+++++       V VKQFRPA+Y      
Sbjct: 41  ADSPYVKPFRFHYTQNGKQKSWDLLKVHDSVSIVIFNVTRKKLVFVKQFRPAVYHGIISG 100

Query: 85  KCDQDQTID--------GYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIG 136
              +  +ID          T ELCAG++DK   ++E IA EE LEECGY I  + +E I 
Sbjct: 101 DGVEPGSIDMKKYPPELAVTMELCAGIIDKPISTIE-IAREEVLEECGYDIPVERIEEII 159

Query: 137 QFYSATGLSGSLQTLYYAEVHKNLKV-SKGGGIDTERIEVLFLERSKA 183
           ++ S  G SG+ QTL+YAEV    K+ S GGG+D E I+V+  +  +A
Sbjct: 160 RYRSGVGTSGAEQTLFYAEVTDEDKIASAGGGVDDEIIDVVEYDLEEA 207
>pir||T20117 hypothetical protein C50F4.11 - Caenorhabditis elegans
          Length = 1092

 Score =  102 bits (255), Expect = 2e-21
 Identities = 63/179 (35%), Positives = 93/179 (51%), Gaps = 14/179 (7%)

Query: 1   MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
           M  F N     S I D S   +    S + +   M Y  +   +  +  + + SV++LL+
Sbjct: 635 MKKFVNFLMTASKISDVSFVTD--FVSKYQKGMEMSYTLDGNSRVSEFNQKMSSVSILLF 692

Query: 61  EKESDCFVIVKQFRPAIYARRFHFKCDQD----QTID--------GYTYELCAGLVDKAN 108
            ++ + F++V+QFRPAI+        +        ID        GYT ELCAGL+DK  
Sbjct: 693 HRDLEQFLLVRQFRPAIFTASISNSPENHGKEFDKIDWSSYDSETGYTIELCAGLIDKEG 752

Query: 109 KSLEEIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGG 167
            S  EIA EE  EECGY++ P +L  +  F      SGS Q LYYAE+ +++K+S+GGG
Sbjct: 753 LSPREIASEEVAEECGYRVDPDDLIHVITFVVGAHQSGSAQHLYYAEIDESMKISEGGG 811
>ref|NP_505461.1| (NM_073060) C50F4.16.p [Caenorhabditis elegans]
 emb|CAC42268.1| (Z70750) cDNA EST EMBL:AU111192 comes from this gene~cDNA EST
           EMBL:AU115025 comes from this gene [Caenorhabditis
           elegans]
          Length = 450

 Score =  101 bits (252), Expect = 5e-21
 Identities = 55/145 (37%), Positives = 82/145 (55%), Gaps = 12/145 (8%)

Query: 35  MHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQD----Q 90
           M Y  +   +  +  + + SV++LL+ ++ + F++V+QFRPAI+        +       
Sbjct: 25  MSYTLDGNSRVSEFNQKMSSVSILLFHRDLEQFLLVRQFRPAIFTASISNSPENHGKEFD 84

Query: 91  TID--------GYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSAT 142
            ID        GYT ELCAGL+DK   S  EIA EE  EECGY++ P +L  +  F    
Sbjct: 85  KIDWSSYDSETGYTIELCAGLIDKEGLSPREIASEEVAEECGYRVDPDDLIHVITFVVGA 144

Query: 143 GLSGSLQTLYYAEVHKNLKVSKGGG 167
             SGS Q LYYAE+ +++K+S+GGG
Sbjct: 145 HQSGSAQHLYYAEIDESMKISEGGG 169
>ref|NP_457011.1| (NC_003198) conserved hypothetical protein [Salmonella enterica
           subsp. enterica serovar Typhi]
 ref|NP_461412.1| (NC_003197) putative pyrophosphohydrolase [Salmonella typhimurium
           LT2]
 gb|AAL21371.1| (AE008811) putative pyrophosphohydrolase [Salmonella typhimurium
           LT2]
 emb|CAD07707.1| (AL627274) conserved hypothetical protein [Salmonella enterica
           subsp. enterica serovar Typhi]
          Length = 191

 Score = 69.3 bits (168), Expect = 3e-11
 Identities = 52/169 (30%), Positives = 81/169 (47%), Gaps = 22/169 (13%)

Query: 26  SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
           S N+  L+ + Y  + T++  ++I+    V        +LLY       V+V+QFR A +
Sbjct: 14  SDNYFTLRNITY--DLTRRNGEVIRHKREVYDRGNGATILLYNSTKKTVVLVRQFRVATW 71

Query: 79  ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
                     +   DG   E CAGL+D  N   E    +EA+EE GY +    +  I + 
Sbjct: 72  V---------NGNQDGMLIETCAGLLD--NDEPEVCIRKEAIEETGYDVG--EVRKIFEL 118

Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
           Y + G    L   + AE H + + S GGG++ E IEVL L  S+AL+ +
Sbjct: 119 YMSPGGVTELIHFFIAEYHDSERASIGGGVEDEEIEVLELPFSRALEMV 167
>ref|NP_289019.1| (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7
           EDL933]
 ref|NP_311356.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
 gb|AAG57576.1|AE005476_1 (AE005476) orf, hypothetical protein [Escherichia coli O157:H7
           EDL933]
 dbj|BAB36752.1| (AP002561) hypothetical protein [Escherichia coli O157:H7]
          Length = 191

 Score = 64.3 bits (155), Expect = 9e-10
 Identities = 50/169 (29%), Positives = 80/169 (46%), Gaps = 22/169 (13%)

Query: 26  SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
           S N+  L  + Y  + T+K  ++I+    V        +LLY  +    V+++QFR A +
Sbjct: 14  SDNYFTLHNITY--DLTRKDGEVIRHKREVYDRGNGATILLYNAKKKTVVLIRQFRVATW 71

Query: 79  ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
                     +    G   E CAGL+D  N   E    +EA+EE GY++    +  + + 
Sbjct: 72  V---------NGNESGQLIETCAGLLD--NDEPEVCIRKEAIEETGYEVG--EVRKLFEL 118

Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
           Y + G    L   + AE   N + + GGG++ E IEVL L  S+AL+ I
Sbjct: 119 YMSPGGVTELIHFFIAEYSDNQRANAGGGVEDEDIEVLELPFSQALEMI 167
>ref|NP_416962.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
 sp|P37128|YFFH_ECOLI Hypothetical protein yffH
 pir||B65022 yffH protein - Escherichia coli (strain K-12)
 gb|AAC75520.1| (AE000333) orf, hypothetical protein [Escherichia coli K12]
 dbj|BAA16341.1| (D90875) similar to [SwissProt Accession Number P37128]
           [Escherichia coli]
          Length = 191

 Score = 63.9 bits (154), Expect = 1e-09
 Identities = 50/169 (29%), Positives = 80/169 (46%), Gaps = 22/169 (13%)

Query: 26  SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
           S N+  L  + Y  + T+K  ++I+    V        +LLY  +    V+++QFR A +
Sbjct: 14  SDNYFTLHNITY--DLTRKDGEVIRHKREVYDRGNGATILLYNTKKKTVVLIRQFRVATW 71

Query: 79  ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
                     +    G   E CAGL+D  N   E    +EA+EE GY++    +  + + 
Sbjct: 72  V---------NGNESGQLIESCAGLLD--NDEPEVCIRKEAIEETGYEVG--EVRKLFEL 118

Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
           Y + G    L   + AE   N + + GGG++ E IEVL L  S+AL+ I
Sbjct: 119 YMSPGGVTELIHFFIAEYSDNQRANAGGGVEDEDIEVLELPFSQALEMI 167
>ref|NP_348478.1| (NC_003030) Nudix (MutT) family hydrolase [Clostridium
           acetobutylicum]
 gb|AAK79818.1|AE007694_4 (AE007694) Nudix (MutT) family hydrolase [Clostridium
           acetobutylicum]
          Length = 172

 Score = 59.7 bits (143), Expect = 2e-08
 Identities = 45/151 (29%), Positives = 79/151 (51%), Gaps = 17/151 (11%)

Query: 37  YNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYT 96
           Y  +   +T+DI+K+ D+V+  +   E D  ++VKQFR AI           D+T+    
Sbjct: 15  YKRDVNGRTYDILKNYDAVSAFI-TNEFDDVLMVKQFRAAI----------MDETL---- 59

Query: 97  YELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEV 156
            E+ AG +D A +S E+    E  EE   +I   NL+ I  +    G S S+  LY+A++
Sbjct: 60  -EIPAGCLDIAGESPEDCLIREIKEETNLKIDKINLKKIISYKPTLGFSTSVLHLYHAKI 118

Query: 157 HKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
            K+  +S     D +  EVL+++++  +++I
Sbjct: 119 KKSDLISNKVN-DEDVNEVLWVDKNTFIEYI 148
>ref|NP_406525.1| (NC_003143) conserved hypothetical protein [Yersinia pestis]
 emb|CAC92277.1| (AJ414155) conserved hypothetical protein [Yersinia pestis]
          Length = 198

 Score = 59.3 bits (142), Expect = 3e-08
 Identities = 41/137 (29%), Positives = 65/137 (46%), Gaps = 13/137 (9%)

Query: 53  DSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLE 112
           +   +LLY ++    V+++QFR   Y          +    G   E CAGL+D  N S E
Sbjct: 46  NGATILLYNRQQGTVVLIEQFRMPTYV---------NGNASGMLLEACAGLLD--NDSPE 94

Query: 113 EIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTER 172
                EA+EE GYQ+    ++ + + Y + G    L   + AE H + K++   G++ E 
Sbjct: 95  ACIRREAMEETGYQVD--KVQKLFEAYMSPGGVTELVYFFAAEYHPDQKITDEVGVEDEV 152

Query: 173 IEVLFLERSKALDFIMD 189
           IEV+ L    AL  + D
Sbjct: 153 IEVVELPFHDALAMVAD 169
>ref|XP_138284.1| (XM_138284) similar to RIKEN cDNA 1110030M18 gene [Mus musculus]
          Length = 237

 Score = 58.9 bits (141), Expect = 4e-08
 Identities = 53/176 (30%), Positives = 78/176 (44%), Gaps = 10/176 (5%)

Query: 38  NEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPA-----IYARRFHFKCDQDQTI 92
           +++  +K+WD +K+ DSV +L++       V+VKQFRPA       AR        D   
Sbjct: 51  SQDGVQKSWDFMKTHDSVTILMFNSSRRSLVLVKQFRPAGIPNMHEARPLQGAGSVDIPS 110

Query: 93  DGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF--YSATGLSGSLQT 150
                E   G V      +   A  EA ++  +Q     L  +      S  GL+ S QT
Sbjct: 111 PPLPREWYLGRVKLHRVLIIRKAEREAWKQGVWQ-GHTGLSAVPVLPDRSGVGLTSSRQT 169

Query: 151 LYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALDFIMDFQYAKTTGLSLAILW 204
           ++YAEV    +   GGG+  + E IEV+ L    A  F  +    KT G+  AI W
Sbjct: 170 MFYAEVTDAQRGGPGGGLAEEGELIEVIHLNLDDAQAFADNPDIPKTLGVIYAISW 225
>gb|AAB46945.1| (L34011) ORF; putative [Escherichia coli]
          Length = 157

 Score = 57.8 bits (138), Expect = 8e-08
 Identities = 45/158 (28%), Positives = 73/158 (45%), Gaps = 22/158 (13%)

Query: 26  SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
           S N+  L  + Y  + T+K  ++I+    V        +LLY  +    V+++QFR A +
Sbjct: 14  SDNYFTLHNITY--DLTRKDGEVIRHKREVYDRGNGATILLYNTKKKTVVLIRQFRVATW 71

Query: 79  ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
                     +    G   E CAGL+D  N   E    +EA+EE GY++    +  + + 
Sbjct: 72  V---------NGNESGQLIESCAGLLD--NDEPEVCIRKEAIEETGYEVG--EVRKLFEL 118

Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVL 176
           Y + G    L   + AE   N + + GGG++ E IEVL
Sbjct: 119 YMSPGGVTELIHFFIAEYSDNQRANAGGGVEDEDIEVL 156
>ref|NP_354155.1| (NC_003062) AGR_C_2106p [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 gb|AAK86940.1| (AE008043) AGR_C_2106p [Agrobacterium tumefaciens str. C58
           (Cereon)]
          Length = 254

 Score = 53.1 bits (126), Expect = 2e-06
 Identities = 37/148 (25%), Positives = 78/148 (52%), Gaps = 19/148 (12%)

Query: 29  FIELKRMHYNEENTK-KTWDIIKSL----DSVAVLLYEKESDCFVIVKQFRPAIYARRFH 83
           F+ ++++ +++     +T  I++ +     + A+LLY+ + D  V+V+QFRPA +     
Sbjct: 75  FVHMQKLIFDQRMPDGRTMRIVREVHDHGSAAAILLYDVKRDSVVMVRQFRPAAFV---- 130

Query: 84  FKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATG 143
              + DQ+   +  E+ AGL+D  +    +    EA+EE GY +  + +E +   Y++ G
Sbjct: 131 ---NGDQS---FMIEVPAGLLD--DDDAADAIRREAMEESGYAV--EKVEYLFDMYASPG 180

Query: 144 LSGSLQTLYYAEVHKNLKVSKGGGIDTE 171
                 +L+ A +  +++   GGG++ E
Sbjct: 181 TLTEKVSLFVARIDLDVQAGNGGGLEDE 208
>ref|NP_531834.1| (NC_003304) NTP pyrophosphohydrolase, MutT family [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
 gb|AAL42150.1| (AE009076) NTP pyrophosphohydrolase, MutT family [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
          Length = 209

 Score = 53.1 bits (126), Expect = 2e-06
 Identities = 37/148 (25%), Positives = 78/148 (52%), Gaps = 19/148 (12%)

Query: 29  FIELKRMHYNEENTK-KTWDIIKSL----DSVAVLLYEKESDCFVIVKQFRPAIYARRFH 83
           F+ ++++ +++     +T  I++ +     + A+LLY+ + D  V+V+QFRPA +     
Sbjct: 30  FVHMQKLIFDQRMPDGRTMRIVREVHDHGSAAAILLYDVKRDSVVMVRQFRPAAFV---- 85

Query: 84  FKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATG 143
              + DQ+   +  E+ AGL+D  +    +    EA+EE GY +  + +E +   Y++ G
Sbjct: 86  ---NGDQS---FMIEVPAGLLD--DDDAADAIRREAMEESGYAV--EKVEYLFDMYASPG 135

Query: 144 LSGSLQTLYYAEVHKNLKVSKGGGIDTE 171
                 +L+ A +  +++   GGG++ E
Sbjct: 136 TLTEKVSLFVARIDLDVQAGNGGGLEDE 163
>ref|NP_637907.1| (NC_003902) conserved hypothetical protein [Xanthomonas campestris
           pv. campestris str. ATCC 33913]
 gb|AAM41831.1| (AE012367) conserved hypothetical protein [Xanthomonas campestris
           pv. campestris str. ATCC 33913]
          Length = 199

 Score = 50.1 bits (118), Expect = 2e-05
 Identities = 51/192 (26%), Positives = 81/192 (41%), Gaps = 33/192 (17%)

Query: 26  SSNFIELKRMHYNEENTKKTWDIIKSL-----DSVAVLLYEKESDCFVIVKQFR-PAIYA 79
           S N+  L+++ ++ +     W  +        +   +LLY +     ++ +QFR P +  
Sbjct: 20  SDNWYVLRKVTFDFQRKDGRWQTLSREAYDRGNGATILLYSRARQTVMLTRQFRLPTLL- 78

Query: 80  RRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFY 139
                    +   DG   E CAGL+D+ +   E    +E  EE GY+I  +N+  + + +
Sbjct: 79  ---------NGNPDGMLIEACAGLLDQDDP--EACIRKETEEETGYRI--ENVRKVFEAF 125

Query: 140 SATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALDF----------- 186
            + G        +  E     KVS GGG+  D E IEVL L    AL             
Sbjct: 126 MSPGSVTERLYFFVGEYVDGDKVSAGGGVEEDGEEIEVLELSLDAALAMIATGGIADAKT 185

Query: 187 IMDFQYAKTTGL 198
           IM  QYAK  G+
Sbjct: 186 IMLLQYAKLHGV 197
>ref|NP_562814.1| (NC_003366) conserved hypothetical protein [Clostridium
           perfringens]
 dbj|BAB81604.1| (AP003192) conserved hypothetical protein [Clostridium perfringens]
          Length = 184

 Score = 50.1 bits (118), Expect = 2e-05
 Identities = 40/141 (28%), Positives = 69/141 (48%), Gaps = 19/141 (13%)

Query: 63  ESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEE 122
           E+D FV++KQFRPA               I+ Y YE  AGL+D    ++ + A  E  EE
Sbjct: 63  ENDEFVLLKQFRPA---------------INDYIYEFPAGLIDNGEDAI-KAATRELFEE 106

Query: 123 CGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSK 182
            G  ++ ++   I   Y++ G+S     +   +V+ N  +S     + E IEV+ + R +
Sbjct: 107 TGL-LASESEYLIKPSYTSVGMSDESVAVVKMKVYGN--ISTENLEENEEIEVIKVPRKE 163

Query: 183 ALDFIMDFQYAKTTGLSLAIL 203
           A +F+ +   +  T L L+ +
Sbjct: 164 AKNFVKENNVSIKTALVLSFM 184
>ref|NP_385285.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
 emb|CAC45758.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
          Length = 198

 Score = 48.9 bits (115), Expect = 4e-05
 Identities = 52/190 (27%), Positives = 91/190 (47%), Gaps = 33/190 (17%)

Query: 5   KNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTK-KTWDIIKSLD----SVAVLL 59
           KNA  +  +ID  +V+        FI L+++   +E +   T  +++ +     +  +LL
Sbjct: 3   KNADARFRIIDRKTVW------DGFINLEQITIEQEMSDGSTARLVREVHDHGRAATILL 56

Query: 60  YEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEA 119
           ++ E    V+V+Q R  ++         Q +T  GY  E  AGL+D   ++ E   C EA
Sbjct: 57  FDPERQVVVLVRQLRLPVFL--------QGET--GYLLEAPAGLLD--GEAPEVAICREA 104

Query: 120 LEECGYQISPKNLETIGQFYSATGLSGSL---QTLYYAEVHKNLKVSKGGGI--DTERIE 174
           +EE GY+I     ET    + A    GS+    + +   +  + KV+ GGG+  + E IE
Sbjct: 105 MEETGYRI-----ETAMHLFDAYMSPGSITERTSFFLGLIDISKKVAAGGGLAHEGEDIE 159

Query: 175 VLFLERSKAL 184
           VL +   +A+
Sbjct: 160 VLEISFDEAV 169
>ref|NP_643040.1| (NC_003919) conserved hypothetical protein [Xanthomonas axonopodis
           pv. citri str. 306]
 gb|AAM37576.1| (AE011912) conserved hypothetical protein [Xanthomonas axonopodis
           pv. citri str. 306]
          Length = 206

 Score = 46.6 bits (109), Expect = 2e-04
 Identities = 50/194 (25%), Positives = 80/194 (40%), Gaps = 37/194 (19%)

Query: 26  SSNFIELKRMHYNEENTKKTWDIIKSL-----DSVAVLLYEKESDCFVIVKQFR-PAIYA 79
           S N+  L+++ ++ +     W  +        +   +LLY +     ++ +QFR P +  
Sbjct: 20  SDNWYVLRKVTFDFQRKDGRWQSLSREAYDRGNGATILLYSRARQTVMLTRQFRLPTLL- 78

Query: 80  RRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIAC--EEALEECGYQISPKNLETIGQ 137
                    +   DG   E CAGL+D+     + + C  +E  EE GY+I   N+  + +
Sbjct: 79  ---------NGNPDGMLIEACAGLLDQD----DALTCIRKETEEETGYRID--NVRKVFE 123

Query: 138 FYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALDF--------- 186
            + + G        +  E     KV  GGG+  D E IEVL L    AL           
Sbjct: 124 AFMSPGSVTERLYFFVGEYFDADKVGDGGGLEEDGEEIEVLELSLDAALAMIGTGEIADA 183

Query: 187 --IMDFQYAKTTGL 198
             IM  QYAK  G+
Sbjct: 184 KTIMLLQYAKLHGV 197
>ref|NP_558683.1| (NC_003364) ADP ribose hydrolase, putative (mutT/nudix family
           protein) [Pyrobaculum aerophilum]
 gb|AAD00531.1| (U82369) MutT homolog [Pyrobaculum aerophilum]
 gb|AAL62865.1| (AE009773) ADP ribose hydrolase, putative (mutT/nudix family
           protein) [Pyrobaculum aerophilum]
          Length = 170

 Score = 46.6 bits (109), Expect = 2e-04
 Identities = 39/123 (31%), Positives = 62/123 (49%), Gaps = 23/123 (18%)

Query: 68  VIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
           V+V+QFRPA+ +               +T E+ AG +D   +S EE A  E +EE GY+ 
Sbjct: 47  VLVRQFRPALKS---------------WTIEIPAGTLD-GGESPEEAAVREMIEETGYK- 89

Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSK--GGGIDTERIEVLFLERSKALD 185
            P  L  +  FY + G+S  L  ++YA+  + + + +   G ID   +EVL  E  + L 
Sbjct: 90  -PLRLTPLLDFYPSPGISNELIRIFYADELEYVGIGRRDPGEID---MEVLLKEPGEVLR 145

Query: 186 FIM 188
            I+
Sbjct: 146 AII 148
>ref|NP_253658.1| (NC_002516) conserved hypothetical protein [Pseudomonas aeruginosa]
 pir||A83024 conserved hypothetical protein PA4971 [imported] - Pseudomonas
           aeruginosa (strain PAO1)
 gb|AAG08356.1|AE004910_3 (AE004910) conserved hypothetical protein [Pseudomonas aeruginosa]
          Length = 205

 Score = 44.3 bits (103), Expect = 0.001
 Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 11/72 (15%)

Query: 53  DSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLE 112
           D+V VL Y+ + DC V+++QFR               +  + +  EL AGL+DK ++  E
Sbjct: 54  DAVCVLPYDPQRDCVVLIEQFRVGA----------MQKLANPWLLELVAGLIDK-DEQPE 102

Query: 113 EIACEEALEECG 124
           E+A  EA+EE G
Sbjct: 103 EVAHREAMEEAG 114
>ref|NP_350184.1| (NC_003030) Nudix (MutT) family hydrolase [Clostridium
           acetobutylicum]
 gb|AAK81524.1|AE007856_8 (AE007856) Nudix (MutT) family hydrolase [Clostridium
           acetobutylicum]
          Length = 202

 Score = 41.6 bits (96), Expect = 0.006
 Identities = 42/188 (22%), Positives = 76/188 (40%), Gaps = 37/188 (19%)

Query: 21  LEPCSSSNFIELKRMHY-NEENTKKTWDII-----------------KSLDSVAVLLYEK 62
           L P + + F+ L  + Y N++   + W +                  +  D+  +  + +
Sbjct: 9   LTPLAETKFLSLYDIEYKNKKQDTRHWTVASRKDYKALSDQYLNGAAEKTDAAIIAAFHE 68

Query: 63  ESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEE 122
           ++   V +KQFR  +               + Y YEL AGL+D A +  E  A  E  EE
Sbjct: 69  DTHKIVCIKQFRVPL---------------NDYVYELPAGLID-AGEDFEAAARRELKEE 112

Query: 123 CGYQISPKNLE-TIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERS 181
            G  +   N E +  + Y++ G++     + +      L  SK    D E IEV+ L ++
Sbjct: 113 TGLTLLDINYEKSKKRVYASAGMTDESAAMIFCTCSGTL--SKDYLEDDEDIEVMLLSKN 170

Query: 182 KALDFIMD 189
           +    + D
Sbjct: 171 EVKKLLND 178
>ref|NP_343553.1| (NC_002754) Conserved hypothetical protein [Sulfolobus
           solfataricus]
 gb|AAK42343.1| (AE006823) Conserved hypothetical protein [Sulfolobus solfataricus]
          Length = 166

 Score = 41.6 bits (96), Expect = 0.006
 Identities = 28/100 (28%), Positives = 49/100 (49%), Gaps = 18/100 (18%)

Query: 56  AVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIA 115
           +V++  K ++  ++++QFRP                ID + YEL AG +++    L   A
Sbjct: 34  SVVIIPKINNEIILIRQFRP---------------VIDKWIYELPAGTIEEGEDPL-NTA 77

Query: 116 CEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAE 155
             E +EE GY+     ++ I  FY++ G++     LY AE
Sbjct: 78  NRELIEEIGYEAG--KMKEIISFYASPGITTEYMRLYLAE 115

  Database: /home/scwang/download_20020708_db/nr
    Posted date:  Aug 7, 2002 12:55 PM
  Number of letters in database: 324,149,939
  Number of sequences in database:  1,026,957
  
Lambda     K      H
   0.319    0.135    0.393 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 128,754,802
Number of Sequences: 1026957
Number of extensions: 4930273
Number of successful extensions: 10555
Number of sequences better than 1.0e-02: 27
Number of HSP's better than  0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 10508
Number of HSP's gapped (non-prelim): 31
length of query: 212
length of database: 324,149,939
effective HSP length: 116
effective length of query: 96
effective length of database: 205,022,927
effective search space: 19682200992
effective search space used: 19682200992
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 95 (41.2 bits)