BLASTP 2.2.1 [Apr-13-2001]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= gi|15645134|ref|NP_207304.1| conserved hypothetical
protein [Helicobacter pylori 26695]
(212 letters)
Database: /home/scwang/download_20020708_db/nr
1,026,957 sequences; 324,149,939 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
ref|NP_207304.1| (NC_000915) conserved hypothetical protein...   429  e-120
ref|NP_223175.1| (NC_000921) putative [Helicobacter pylori ...   412  e-114
gb|AAD15563.1| (AF111170) unknown [Homo sapiens]                 142  2e-33
ref|NP_281634.1| (NC_002163) hypothetical protein Cj0447 [C...   141  6e-33
dbj|BAB23110.1| (AK003991) data source:SPTR, source key:O95...   132  3e-30
dbj|BAB28691.1| (AK013174) data source:SPTR, source key:O95...   130  1e-29
gb|AAF56679.1| (AE003759) CG6001 gene product [Drosophila m...   121  6e-27
gb|EAA08524.1| (AAAB01008880) agCP2498 [Anopheles gambiae s...   111  5e-24
pir||T20117 hypothetical protein C50F4.11 - Caenorhabditis ...   102  2e-21
ref|NP_505461.1| (NM_073060) C50F4.16.p [Caenorhabditis ele...   101  5e-21
ref|NP_457011.1| (NC_003198) conserved hypothetical protein...    69  3e-11
ref|NP_289019.1| (NC_002655) orf, hypothetical protein [Esc...    64  9e-10
ref|NP_416962.1| (NC_000913) orf, hypothetical protein [Esc...    64  1e-09
ref|NP_348478.1| (NC_003030) Nudix (MutT) family hydrolase ...    60  2e-08
ref|NP_406525.1| (NC_003143) conserved hypothetical protein...    59  3e-08
ref|XP_138284.1| (XM_138284) similar to RIKEN cDNA 1110030M...    59  4e-08
gb|AAB46945.1| (L34011) ORF; putative [Escherichia coli]          58  8e-08
ref|NP_354155.1| (NC_003062) AGR_C_2106p [Agrobacterium tum...    53  2e-06
ref|NP_531834.1| (NC_003304) NTP pyrophosphohydrolase, MutT...    53  2e-06
ref|NP_637907.1| (NC_003902) conserved hypothetical protein...    50  2e-05
ref|NP_562814.1| (NC_003366) conserved hypothetical protein...    50  2e-05
ref|NP_385285.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN...    49  4e-05
ref|NP_643040.1| (NC_003919) conserved hypothetical protein...    47  2e-04
ref|NP_558683.1| (NC_003364) ADP ribose hydrolase, putative...    47  2e-04
ref|NP_253658.1| (NC_002516) conserved hypothetical protein...    44  0.001
ref|NP_350184.1| (NC_003030) Nudix (MutT) family hydrolase ...    42  0.006
ref|NP_343553.1| (NC_002754) Conserved hypothetical protein...    42  0.006
>ref|NP_207304.1| (NC_000915) conserved hypothetical protein [Helicobacter pylori
26695]
pir||C64583 conserved hypothetical protein HP0507 - Helicobacter pylori
(strain 26695)
gb|AAD07572.1| (AE000565) conserved hypothetical protein [Helicobacter pylori
26695]
Length = 212
Score = 429 bits (1104), Expect = e-120
Identities = 212/212 (100%), Positives = 212/212 (100%)
Query: 1 MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY
Sbjct: 1 MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
Query: 61 EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120
EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL
Sbjct: 61 EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120
Query: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER 180
EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER
Sbjct: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER 180
Query: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV
Sbjct: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
>ref|NP_223175.1| (NC_000921) putative [Helicobacter pylori J99]
pir||F71928 hypothetical protein jhp0457 - Helicobacter pylori (strain J99)
gb|AAD06038.1| (AE001480) putative [Helicobacter pylori J99]
Length = 212
Score = 412 bits (1059), Expect = e-114
Identities = 204/212 (96%), Positives = 207/212 (97%)
Query: 1 MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
MSYFKN FNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY
Sbjct: 1 MSYFKNIFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
Query: 61 EKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120
EKESDCFVIVKQFRPAIYAR F+FK DQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL
Sbjct: 61 EKESDCFVIVKQFRPAIYARNFYFKRDQDQTIDGYTYELCAGLVDKANKSLEEIACEEAL 120
Query: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLER 180
EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAE H+ LKVSKGGGIDTE+IEVLFLER
Sbjct: 121 EECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEAHEGLKVSKGGGIDTEKIEVLFLER 180
Query: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV
Sbjct: 181 SKALDFIMDFQYAKTTGLSLAILWHLKKFKNV 212
>gb|AAD15563.1| (AF111170) unknown [Homo sapiens]
Length = 290
Score = 142 bits (358), Expect = 2e-33
Identities = 81/203 (39%), Positives = 114/203 (55%), Gaps = 18/203 (8%)
Query: 24 CSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYA---- 79
C++S ++ +HY + +K+WD +K+ DSV VLL+ V+VKQFRPA+YA
Sbjct: 80 CAASPYLRPLTLHYRQNGAQKSWDFMKTHDSVTVLLFNSSRRSLVLVKQFRPAVYAGEVE 139
Query: 80 RRFHFKC---DQDQTID---------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
RRF DQD + G T ELCAGLVD+ SLEE+AC+EA EECGY +
Sbjct: 140 RRFPGSLAAVDQDGPRELQPALPGSAGVTVELCAGLVDQPGLSLEEVACKEAWEECGYHL 199
Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALD 185
+P +L + ++S GL+GS QT++Y EV + GGG+ + E IEV+ L A
Sbjct: 200 APSDLRRVATYWSGVGLTGSRQTMFYTEVTDAQRSGPGGGLVEEGELIEVVHLPLEGAQA 259
Query: 186 FIMDFQYAKTTGLSLAILWHLKK 208
F D KT G+ + W L +
Sbjct: 260 FADDPDIPKTLGVIFGVSWFLSQ 282
>ref|NP_281634.1| (NC_002163) hypothetical protein Cj0447 [Campylobacter jejuni]
pir||B81389 hypothetical protein Cj0447 [imported] - Campylobacter jejuni
(strain NCTC 11168)
emb|CAB75085.1| (AL139075) hypothetical protein Cj0447 [Campylobacter jejuni]
Length = 198
Score = 141 bits (355), Expect = 6e-33
Identities = 84/190 (44%), Positives = 113/190 (59%), Gaps = 21/190 (11%)
Query: 27 SNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKC 86
SN+I+ KR Y TWD I+S DSV+VLLY KE + F+ V+QFR ++ + H
Sbjct: 14 SNYIKPKRFAYESNGRLCTWDFIESKDSVSVLLYHKELESFIFVRQFRIPLWYHQMH--- 70
Query: 87 DQDQTID---GYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATG 143
D+D D GYT ELC+GLVDK SLEEIA EE +EE GY +PKNLE IG FY+ G
Sbjct: 71 DKDYVKDDDMGYTIELCSGLVDK-KLSLEEIAKEECIEELGY--APKNLEKIGDFYTGFG 127
Query: 144 LSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFIMDFQ-----YAKTTGL 198
S Q+ Y+ EV + K+S GGG+D E IE ++++ + DF+ +T L
Sbjct: 128 SGVSKQSFYFVEVDEKDKISSGGGVDDEEIEAVYVK-------VQDFEKKCKNMIRTPLL 180
Query: 199 SLAILWHLKK 208
A +W LK+
Sbjct: 181 DFAYMWFLKE 190
>dbj|BAB23110.1| (AK003991) data source:SPTR, source key:O95848,
evidence:ISS~homolog to HYPOTHETICAL 31.5 KDA
PROTEIN~putative [Mus musculus]
gb|AAH25444.1| (BC025444) RIKEN cDNA 1110030M18 gene [Mus musculus]
Length = 222
Score = 132 bits (332), Expect = 3e-30
Identities = 76/199 (38%), Positives = 110/199 (55%), Gaps = 18/199 (9%)
Query: 24 CSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRF- 82
C+ S ++ +HY ++ +K+WD +K+ DSV +L++ V+VKQFRPA+YA
Sbjct: 12 CAHSPYLRPFTLHYRQDGVQKSWDFMKTHDSVTILMFNSSRRSLVLVKQFRPAVYAGEVE 71
Query: 83 -HFK-----CDQDQTID---------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
HF +QDQ + G ELCAG+VD+ SLEE AC+EA EECGY++
Sbjct: 72 RHFPGSLTAVNQDQPQELQQALPGSAGVMVELCAGIVDQPGLSLEEAACKEAWEECGYRL 131
Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALD 185
P +L + + S GL+ S QT++YAEV + GGG+ + E IEV+ L A
Sbjct: 132 VPTDLRRVATYMSGVGLTSSRQTMFYAEVTDAQRGGPGGGLAEEGELIEVIHLNLDDAQA 191
Query: 186 FIMDFQYAKTTGLSLAILW 204
F + KT G+ AI W
Sbjct: 192 FADNPDIPKTLGVIYAISW 210
>dbj|BAB28691.1| (AK013174) data source:SPTR, source key:O95848,
evidence:ISS~homolog to HYPOTHETICAL 31.5 KDA
PROTEIN~putative [Mus musculus]
Length = 223
Score = 130 bits (326), Expect = 1e-29
Identities = 75/199 (37%), Positives = 109/199 (54%), Gaps = 18/199 (9%)
Query: 24 CSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRF- 82
C+ S ++ +HY ++ +K+WD +K+ DSV +L++ V+VKQFRPA+YA
Sbjct: 13 CAHSPYLRPFTLHYRQDGVQKSWDFMKTHDSVTILMFNSSRRSLVLVKQFRPAVYAGEVE 72
Query: 83 -HFK-----CDQDQTID---------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
HF +QDQ + G ELCAG+VD+ SLEE AC+EA EECGY++
Sbjct: 73 RHFPGSLTAVNQDQPQELQQALPGSAGVMVELCAGIVDQPGLSLEEAACKEAWEECGYRL 132
Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALD 185
P +L + + S GL+ S QT++YAEV + GGG+ + E IEV+ L A
Sbjct: 133 VPTDLRRVATYMSGVGLTSSRQTMFYAEVTDAQRGGPGGGLAEEGELIEVIHLNLDDAQA 192
Query: 186 FIMDFQYAKTTGLSLAILW 204
+ KT G+ AI W
Sbjct: 193 IADNPDIPKTLGVIYAISW 211
>gb|AAF56679.1| (AE003759) CG6001 gene product [Drosophila melanogaster]
Length = 1351
Score = 121 bits (303), Expect = 6e-27
Identities = 70/176 (39%), Positives = 100/176 (56%), Gaps = 10/176 (5%)
Query: 17 SSVYLEPC-SSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRP 75
S ++L P S +++ R++Y + +K WD++K DSVA++LY V+V+QFRP
Sbjct: 1144 SKIWLGPLPQDSPYVKPFRLYYVQNGVEKNWDLLKVHDSVAIILYNTSRQKLVLVRQFRP 1203
Query: 76 AIYARRFHFKCDQDQTID--------GYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
A+Y +D G T ELCAG+VDK NKS EIA EE +EECGY +
Sbjct: 1204 AVYHGIISSAKGTFDEVDLKEFPPAIGVTLELCAGIVDK-NKSWVEIAREEVVEECGYDV 1262
Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKA 183
+ +E + + S G SG+ QT+YY EV K + GGG+D E IEV+ L +A
Sbjct: 1263 PVERIEEVMVYRSGVGSSGAKQTMYYCEVTDADKATGGGGVDDEIIEVVELSLEEA 1318
Score = 120 bits (300), Expect = 1e-26
Identities = 76/182 (41%), Positives = 104/182 (56%), Gaps = 16/182 (8%)
Query: 17 SSVYLEPC-SSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRP 75
S ++ P SN+I+ R+HY E + +K DIIK++D V V+LY K + + V+QFR
Sbjct: 941 SKIWFGPMPKDSNWIKPGRLHYIENDVEKQVDIIKTIDGVVVILYNKAREKLIFVRQFRG 1000
Query: 76 AIYARRFHFKCDQDQTID-----------GYTYELCAGLVDKANKSLEEIACEEALEECG 124
A+Y + H D + G T ELC G VDK +KSL EIA EE LEECG
Sbjct: 1001 AVY-QGIHSAGSPDMSKGEADLEQFPPEVGVTLELCGGAVDK-DKSLAEIAKEEVLEECG 1058
Query: 125 YQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVL--FLERSK 182
Y++ ++L+ + + S G S S +L+Y EV KVS GGGI ERI+VL LE S+
Sbjct: 1059 YEVPTESLQHVYDYRSGIGTSSSAMSLFYCEVCDAQKVSAGGGIGEERIQVLEMSLEESR 1118
Query: 183 AL 184
L
Sbjct: 1119 QL 1120
>gb|EAA08524.1| (AAAB01008880) agCP2498 [Anopheles gambiae str. PEST]
Length = 238
Score = 111 bits (278), Expect = 5e-24
Identities = 65/168 (38%), Positives = 98/168 (57%), Gaps = 10/168 (5%)
Query: 25 SSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHF 84
+ S +++ R HY + +K+WD++K DSV+++++ V VKQFRPA+Y
Sbjct: 41 ADSPYVKPFRFHYTQNGKQKSWDLLKVHDSVSIVIFNVTRKKLVFVKQFRPAVYHGIISG 100
Query: 85 KCDQDQTID--------GYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIG 136
+ +ID T ELCAG++DK ++E IA EE LEECGY I + +E I
Sbjct: 101 DGVEPGSIDMKKYPPELAVTMELCAGIIDKPISTIE-IAREEVLEECGYDIPVERIEEII 159
Query: 137 QFYSATGLSGSLQTLYYAEVHKNLKV-SKGGGIDTERIEVLFLERSKA 183
++ S G SG+ QTL+YAEV K+ S GGG+D E I+V+ + +A
Sbjct: 160 RYRSGVGTSGAEQTLFYAEVTDEDKIASAGGGVDDEIIDVVEYDLEEA 207
>pir||T20117 hypothetical protein C50F4.11 - Caenorhabditis elegans
Length = 1092
Score = 102 bits (255), Expect = 2e-21
Identities = 63/179 (35%), Positives = 93/179 (51%), Gaps = 14/179 (7%)
Query: 1 MSYFKNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTKKTWDIIKSLDSVAVLLY 60
M F N S I D S + S + + M Y + + + + + SV++LL+
Sbjct: 635 MKKFVNFLMTASKISDVSFVTD--FVSKYQKGMEMSYTLDGNSRVSEFNQKMSSVSILLF 692
Query: 61 EKESDCFVIVKQFRPAIYARRFHFKCDQD----QTID--------GYTYELCAGLVDKAN 108
++ + F++V+QFRPAI+ + ID GYT ELCAGL+DK
Sbjct: 693 HRDLEQFLLVRQFRPAIFTASISNSPENHGKEFDKIDWSSYDSETGYTIELCAGLIDKEG 752
Query: 109 KSLEEIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGG 167
S EIA EE EECGY++ P +L + F SGS Q LYYAE+ +++K+S+GGG
Sbjct: 753 LSPREIASEEVAEECGYRVDPDDLIHVITFVVGAHQSGSAQHLYYAEIDESMKISEGGG 811
>ref|NP_505461.1| (NM_073060) C50F4.16.p [Caenorhabditis elegans]
emb|CAC42268.1| (Z70750) cDNA EST EMBL:AU111192 comes from this gene~cDNA EST
EMBL:AU115025 comes from this gene [Caenorhabditis
elegans]
Length = 450
Score = 101 bits (252), Expect = 5e-21
Identities = 55/145 (37%), Positives = 82/145 (55%), Gaps = 12/145 (8%)
Query: 35 MHYNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQD----Q 90
M Y + + + + + SV++LL+ ++ + F++V+QFRPAI+ +
Sbjct: 25 MSYTLDGNSRVSEFNQKMSSVSILLFHRDLEQFLLVRQFRPAIFTASISNSPENHGKEFD 84
Query: 91 TID--------GYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSAT 142
ID GYT ELCAGL+DK S EIA EE EECGY++ P +L + F
Sbjct: 85 KIDWSSYDSETGYTIELCAGLIDKEGLSPREIASEEVAEECGYRVDPDDLIHVITFVVGA 144
Query: 143 GLSGSLQTLYYAEVHKNLKVSKGGG 167
SGS Q LYYAE+ +++K+S+GGG
Sbjct: 145 HQSGSAQHLYYAEIDESMKISEGGG 169
>ref|NP_457011.1| (NC_003198) conserved hypothetical protein [Salmonella enterica
subsp. enterica serovar Typhi]
ref|NP_461412.1| (NC_003197) putative pyrophosphohydrolase [Salmonella typhimurium
LT2]
gb|AAL21371.1| (AE008811) putative pyrophosphohydrolase [Salmonella typhimurium
LT2]
emb|CAD07707.1| (AL627274) conserved hypothetical protein [Salmonella enterica
subsp. enterica serovar Typhi]
Length = 191
Score = 69.3 bits (168), Expect = 3e-11
Identities = 52/169 (30%), Positives = 81/169 (47%), Gaps = 22/169 (13%)
Query: 26 SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
S N+ L+ + Y + T++ ++I+ V +LLY V+V+QFR A +
Sbjct: 14 SDNYFTLRNITY--DLTRRNGEVIRHKREVYDRGNGATILLYNSTKKTVVLVRQFRVATW 71
Query: 79 ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
+ DG E CAGL+D N E +EA+EE GY + + I +
Sbjct: 72 V---------NGNQDGMLIETCAGLLD--NDEPEVCIRKEAIEETGYDVG--EVRKIFEL 118
Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
Y + G L + AE H + + S GGG++ E IEVL L S+AL+ +
Sbjct: 119 YMSPGGVTELIHFFIAEYHDSERASIGGGVEDEEIEVLELPFSRALEMV 167
>ref|NP_289019.1| (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7
EDL933]
ref|NP_311356.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
gb|AAG57576.1|AE005476_1 (AE005476) orf, hypothetical protein [Escherichia coli O157:H7
EDL933]
dbj|BAB36752.1| (AP002561) hypothetical protein [Escherichia coli O157:H7]
Length = 191
Score = 64.3 bits (155), Expect = 9e-10
Identities = 50/169 (29%), Positives = 80/169 (46%), Gaps = 22/169 (13%)
Query: 26 SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
S N+ L + Y + T+K ++I+ V +LLY + V+++QFR A +
Sbjct: 14 SDNYFTLHNITY--DLTRKDGEVIRHKREVYDRGNGATILLYNAKKKTVVLIRQFRVATW 71
Query: 79 ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
+ G E CAGL+D N E +EA+EE GY++ + + +
Sbjct: 72 V---------NGNESGQLIETCAGLLD--NDEPEVCIRKEAIEETGYEVG--EVRKLFEL 118
Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
Y + G L + AE N + + GGG++ E IEVL L S+AL+ I
Sbjct: 119 YMSPGGVTELIHFFIAEYSDNQRANAGGGVEDEDIEVLELPFSQALEMI 167
>ref|NP_416962.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
sp|P37128|YFFH_ECOLI Hypothetical protein yffH
pir||B65022 yffH protein - Escherichia coli (strain K-12)
gb|AAC75520.1| (AE000333) orf, hypothetical protein [Escherichia coli K12]
dbj|BAA16341.1| (D90875) similar to [SwissProt Accession Number P37128]
[Escherichia coli]
Length = 191
Score = 63.9 bits (154), Expect = 1e-09
Identities = 50/169 (29%), Positives = 80/169 (46%), Gaps = 22/169 (13%)
Query: 26 SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
S N+ L + Y + T+K ++I+ V +LLY + V+++QFR A +
Sbjct: 14 SDNYFTLHNITY--DLTRKDGEVIRHKREVYDRGNGATILLYNTKKKTVVLIRQFRVATW 71
Query: 79 ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
+ G E CAGL+D N E +EA+EE GY++ + + +
Sbjct: 72 V---------NGNESGQLIESCAGLLD--NDEPEVCIRKEAIEETGYEVG--EVRKLFEL 118
Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
Y + G L + AE N + + GGG++ E IEVL L S+AL+ I
Sbjct: 119 YMSPGGVTELIHFFIAEYSDNQRANAGGGVEDEDIEVLELPFSQALEMI 167
>ref|NP_348478.1| (NC_003030) Nudix (MutT) family hydrolase [Clostridium
acetobutylicum]
gb|AAK79818.1|AE007694_4 (AE007694) Nudix (MutT) family hydrolase [Clostridium
acetobutylicum]
Length = 172
Score = 59.7 bits (143), Expect = 2e-08
Identities = 45/151 (29%), Positives = 79/151 (51%), Gaps = 17/151 (11%)
Query: 37 YNEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYT 96
Y + +T+DI+K+ D+V+ + E D ++VKQFR AI D+T+
Sbjct: 15 YKRDVNGRTYDILKNYDAVSAFI-TNEFDDVLMVKQFRAAI----------MDETL---- 59
Query: 97 YELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEV 156
E+ AG +D A +S E+ E EE +I NL+ I + G S S+ LY+A++
Sbjct: 60 -EIPAGCLDIAGESPEDCLIREIKEETNLKIDKINLKKIISYKPTLGFSTSVLHLYHAKI 118
Query: 157 HKNLKVSKGGGIDTERIEVLFLERSKALDFI 187
K+ +S D + EVL+++++ +++I
Sbjct: 119 KKSDLISNKVN-DEDVNEVLWVDKNTFIEYI 148
>ref|NP_406525.1| (NC_003143) conserved hypothetical protein [Yersinia pestis]
emb|CAC92277.1| (AJ414155) conserved hypothetical protein [Yersinia pestis]
Length = 198
Score = 59.3 bits (142), Expect = 3e-08
Identities = 41/137 (29%), Positives = 65/137 (46%), Gaps = 13/137 (9%)
Query: 53 DSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLE 112
+ +LLY ++ V+++QFR Y + G E CAGL+D N S E
Sbjct: 46 NGATILLYNRQQGTVVLIEQFRMPTYV---------NGNASGMLLEACAGLLD--NDSPE 94
Query: 113 EIACEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTER 172
EA+EE GYQ+ ++ + + Y + G L + AE H + K++ G++ E
Sbjct: 95 ACIRREAMEETGYQVD--KVQKLFEAYMSPGGVTELVYFFAAEYHPDQKITDEVGVEDEV 152
Query: 173 IEVLFLERSKALDFIMD 189
IEV+ L AL + D
Sbjct: 153 IEVVELPFHDALAMVAD 169
>ref|XP_138284.1| (XM_138284) similar to RIKEN cDNA 1110030M18 gene [Mus musculus]
Length = 237
Score = 58.9 bits (141), Expect = 4e-08
Identities = 53/176 (30%), Positives = 78/176 (44%), Gaps = 10/176 (5%)
Query: 38 NEENTKKTWDIIKSLDSVAVLLYEKESDCFVIVKQFRPA-----IYARRFHFKCDQDQTI 92
+++ +K+WD +K+ DSV +L++ V+VKQFRPA AR D
Sbjct: 51 SQDGVQKSWDFMKTHDSVTILMFNSSRRSLVLVKQFRPAGIPNMHEARPLQGAGSVDIPS 110
Query: 93 DGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF--YSATGLSGSLQT 150
E G V + A EA ++ +Q L + S GL+ S QT
Sbjct: 111 PPLPREWYLGRVKLHRVLIIRKAEREAWKQGVWQ-GHTGLSAVPVLPDRSGVGLTSSRQT 169
Query: 151 LYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALDFIMDFQYAKTTGLSLAILW 204
++YAEV + GGG+ + E IEV+ L A F + KT G+ AI W
Sbjct: 170 MFYAEVTDAQRGGPGGGLAEEGELIEVIHLNLDDAQAFADNPDIPKTLGVIYAISW 225
>gb|AAB46945.1| (L34011) ORF; putative [Escherichia coli]
Length = 157
Score = 57.8 bits (138), Expect = 8e-08
Identities = 45/158 (28%), Positives = 73/158 (45%), Gaps = 22/158 (13%)
Query: 26 SSNFIELKRMHYNEENTKKTWDIIKSLDSV-------AVLLYEKESDCFVIVKQFRPAIY 78
S N+ L + Y + T+K ++I+ V +LLY + V+++QFR A +
Sbjct: 14 SDNYFTLHNITY--DLTRKDGEVIRHKREVYDRGNGATILLYNTKKKTVVLIRQFRVATW 71
Query: 79 ARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQF 138
+ G E CAGL+D N E +EA+EE GY++ + + +
Sbjct: 72 V---------NGNESGQLIESCAGLLD--NDEPEVCIRKEAIEETGYEVG--EVRKLFEL 118
Query: 139 YSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVL 176
Y + G L + AE N + + GGG++ E IEVL
Sbjct: 119 YMSPGGVTELIHFFIAEYSDNQRANAGGGVEDEDIEVL 156
>ref|NP_354155.1| (NC_003062) AGR_C_2106p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
gb|AAK86940.1| (AE008043) AGR_C_2106p [Agrobacterium tumefaciens str. C58
(Cereon)]
Length = 254
Score = 53.1 bits (126), Expect = 2e-06
Identities = 37/148 (25%), Positives = 78/148 (52%), Gaps = 19/148 (12%)
Query: 29 FIELKRMHYNEENTK-KTWDIIKSL----DSVAVLLYEKESDCFVIVKQFRPAIYARRFH 83
F+ ++++ +++ +T I++ + + A+LLY+ + D V+V+QFRPA +
Sbjct: 75 FVHMQKLIFDQRMPDGRTMRIVREVHDHGSAAAILLYDVKRDSVVMVRQFRPAAFV---- 130
Query: 84 FKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATG 143
+ DQ+ + E+ AGL+D + + EA+EE GY + + +E + Y++ G
Sbjct: 131 ---NGDQS---FMIEVPAGLLD--DDDAADAIRREAMEESGYAV--EKVEYLFDMYASPG 180
Query: 144 LSGSLQTLYYAEVHKNLKVSKGGGIDTE 171
+L+ A + +++ GGG++ E
Sbjct: 181 TLTEKVSLFVARIDLDVQAGNGGGLEDE 208
>ref|NP_531834.1| (NC_003304) NTP pyrophosphohydrolase, MutT family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL42150.1| (AE009076) NTP pyrophosphohydrolase, MutT family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 209
Score = 53.1 bits (126), Expect = 2e-06
Identities = 37/148 (25%), Positives = 78/148 (52%), Gaps = 19/148 (12%)
Query: 29 FIELKRMHYNEENTK-KTWDIIKSL----DSVAVLLYEKESDCFVIVKQFRPAIYARRFH 83
F+ ++++ +++ +T I++ + + A+LLY+ + D V+V+QFRPA +
Sbjct: 30 FVHMQKLIFDQRMPDGRTMRIVREVHDHGSAAAILLYDVKRDSVVMVRQFRPAAFV---- 85
Query: 84 FKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFYSATG 143
+ DQ+ + E+ AGL+D + + EA+EE GY + + +E + Y++ G
Sbjct: 86 ---NGDQS---FMIEVPAGLLD--DDDAADAIRREAMEESGYAV--EKVEYLFDMYASPG 135
Query: 144 LSGSLQTLYYAEVHKNLKVSKGGGIDTE 171
+L+ A + +++ GGG++ E
Sbjct: 136 TLTEKVSLFVARIDLDVQAGNGGGLEDE 163
>ref|NP_637907.1| (NC_003902) conserved hypothetical protein [Xanthomonas campestris
pv. campestris str. ATCC 33913]
gb|AAM41831.1| (AE012367) conserved hypothetical protein [Xanthomonas campestris
pv. campestris str. ATCC 33913]
Length = 199
Score = 50.1 bits (118), Expect = 2e-05
Identities = 51/192 (26%), Positives = 81/192 (41%), Gaps = 33/192 (17%)
Query: 26 SSNFIELKRMHYNEENTKKTWDIIKSL-----DSVAVLLYEKESDCFVIVKQFR-PAIYA 79
S N+ L+++ ++ + W + + +LLY + ++ +QFR P +
Sbjct: 20 SDNWYVLRKVTFDFQRKDGRWQTLSREAYDRGNGATILLYSRARQTVMLTRQFRLPTLL- 78
Query: 80 RRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQISPKNLETIGQFY 139
+ DG E CAGL+D+ + E +E EE GY+I +N+ + + +
Sbjct: 79 ---------NGNPDGMLIEACAGLLDQDDP--EACIRKETEEETGYRI--ENVRKVFEAF 125
Query: 140 SATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALDF----------- 186
+ G + E KVS GGG+ D E IEVL L AL
Sbjct: 126 MSPGSVTERLYFFVGEYVDGDKVSAGGGVEEDGEEIEVLELSLDAALAMIATGGIADAKT 185
Query: 187 IMDFQYAKTTGL 198
IM QYAK G+
Sbjct: 186 IMLLQYAKLHGV 197
>ref|NP_562814.1| (NC_003366) conserved hypothetical protein [Clostridium
perfringens]
dbj|BAB81604.1| (AP003192) conserved hypothetical protein [Clostridium perfringens]
Length = 184
Score = 50.1 bits (118), Expect = 2e-05
Identities = 40/141 (28%), Positives = 69/141 (48%), Gaps = 19/141 (13%)
Query: 63 ESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEE 122
E+D FV++KQFRPA I+ Y YE AGL+D ++ + A E EE
Sbjct: 63 ENDEFVLLKQFRPA---------------INDYIYEFPAGLIDNGEDAI-KAATRELFEE 106
Query: 123 CGYQISPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERSK 182
G ++ ++ I Y++ G+S + +V+ N +S + E IEV+ + R +
Sbjct: 107 TGL-LASESEYLIKPSYTSVGMSDESVAVVKMKVYGN--ISTENLEENEEIEVIKVPRKE 163
Query: 183 ALDFIMDFQYAKTTGLSLAIL 203
A +F+ + + T L L+ +
Sbjct: 164 AKNFVKENNVSIKTALVLSFM 184
>ref|NP_385285.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
emb|CAC45758.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
Length = 198
Score = 48.9 bits (115), Expect = 4e-05
Identities = 52/190 (27%), Positives = 91/190 (47%), Gaps = 33/190 (17%)
Query: 5 KNAFNQKSLIDDSSVYLEPCSSSNFIELKRMHYNEENTK-KTWDIIKSLD----SVAVLL 59
KNA + +ID +V+ FI L+++ +E + T +++ + + +LL
Sbjct: 3 KNADARFRIIDRKTVW------DGFINLEQITIEQEMSDGSTARLVREVHDHGRAATILL 56
Query: 60 YEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEA 119
++ E V+V+Q R ++ Q +T GY E AGL+D ++ E C EA
Sbjct: 57 FDPERQVVVLVRQLRLPVFL--------QGET--GYLLEAPAGLLD--GEAPEVAICREA 104
Query: 120 LEECGYQISPKNLETIGQFYSATGLSGSL---QTLYYAEVHKNLKVSKGGGI--DTERIE 174
+EE GY+I ET + A GS+ + + + + KV+ GGG+ + E IE
Sbjct: 105 MEETGYRI-----ETAMHLFDAYMSPGSITERTSFFLGLIDISKKVAAGGGLAHEGEDIE 159
Query: 175 VLFLERSKAL 184
VL + +A+
Sbjct: 160 VLEISFDEAV 169
>ref|NP_643040.1| (NC_003919) conserved hypothetical protein [Xanthomonas axonopodis
pv. citri str. 306]
gb|AAM37576.1| (AE011912) conserved hypothetical protein [Xanthomonas axonopodis
pv. citri str. 306]
Length = 206
Score = 46.6 bits (109), Expect = 2e-04
Identities = 50/194 (25%), Positives = 80/194 (40%), Gaps = 37/194 (19%)
Query: 26 SSNFIELKRMHYNEENTKKTWDIIKSL-----DSVAVLLYEKESDCFVIVKQFR-PAIYA 79
S N+ L+++ ++ + W + + +LLY + ++ +QFR P +
Sbjct: 20 SDNWYVLRKVTFDFQRKDGRWQSLSREAYDRGNGATILLYSRARQTVMLTRQFRLPTLL- 78
Query: 80 RRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIAC--EEALEECGYQISPKNLETIGQ 137
+ DG E CAGL+D+ + + C +E EE GY+I N+ + +
Sbjct: 79 ---------NGNPDGMLIEACAGLLDQD----DALTCIRKETEEETGYRID--NVRKVFE 123
Query: 138 FYSATGLSGSLQTLYYAEVHKNLKVSKGGGI--DTERIEVLFLERSKALDF--------- 186
+ + G + E KV GGG+ D E IEVL L AL
Sbjct: 124 AFMSPGSVTERLYFFVGEYFDADKVGDGGGLEEDGEEIEVLELSLDAALAMIGTGEIADA 183
Query: 187 --IMDFQYAKTTGL 198
IM QYAK G+
Sbjct: 184 KTIMLLQYAKLHGV 197
>ref|NP_558683.1| (NC_003364) ADP ribose hydrolase, putative (mutT/nudix family
protein) [Pyrobaculum aerophilum]
gb|AAD00531.1| (U82369) MutT homolog [Pyrobaculum aerophilum]
gb|AAL62865.1| (AE009773) ADP ribose hydrolase, putative (mutT/nudix family
protein) [Pyrobaculum aerophilum]
Length = 170
Score = 46.6 bits (109), Expect = 2e-04
Identities = 39/123 (31%), Positives = 62/123 (49%), Gaps = 23/123 (18%)
Query: 68 VIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEECGYQI 127
V+V+QFRPA+ + +T E+ AG +D +S EE A E +EE GY+
Sbjct: 47 VLVRQFRPALKS---------------WTIEIPAGTLD-GGESPEEAAVREMIEETGYK- 89
Query: 128 SPKNLETIGQFYSATGLSGSLQTLYYAEVHKNLKVSK--GGGIDTERIEVLFLERSKALD 185
P L + FY + G+S L ++YA+ + + + + G ID +EVL E + L
Sbjct: 90 -PLRLTPLLDFYPSPGISNELIRIFYADELEYVGIGRRDPGEID---MEVLLKEPGEVLR 145
Query: 186 FIM 188
I+
Sbjct: 146 AII 148
>ref|NP_253658.1| (NC_002516) conserved hypothetical protein [Pseudomonas aeruginosa]
pir||A83024 conserved hypothetical protein PA4971 [imported] - Pseudomonas
aeruginosa (strain PAO1)
gb|AAG08356.1|AE004910_3 (AE004910) conserved hypothetical protein [Pseudomonas aeruginosa]
Length = 205
Score = 44.3 bits (103), Expect = 0.001
Identities = 26/72 (36%), Positives = 40/72 (55%), Gaps = 11/72 (15%)
Query: 53 DSVAVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLE 112
D+V VL Y+ + DC V+++QFR + + + EL AGL+DK ++ E
Sbjct: 54 DAVCVLPYDPQRDCVVLIEQFRVGA----------MQKLANPWLLELVAGLIDK-DEQPE 102
Query: 113 EIACEEALEECG 124
E+A EA+EE G
Sbjct: 103 EVAHREAMEEAG 114
>ref|NP_350184.1| (NC_003030) Nudix (MutT) family hydrolase [Clostridium
acetobutylicum]
gb|AAK81524.1|AE007856_8 (AE007856) Nudix (MutT) family hydrolase [Clostridium
acetobutylicum]
Length = 202
Score = 41.6 bits (96), Expect = 0.006
Identities = 42/188 (22%), Positives = 76/188 (40%), Gaps = 37/188 (19%)
Query: 21 LEPCSSSNFIELKRMHY-NEENTKKTWDII-----------------KSLDSVAVLLYEK 62
L P + + F+ L + Y N++ + W + + D+ + + +
Sbjct: 9 LTPLAETKFLSLYDIEYKNKKQDTRHWTVASRKDYKALSDQYLNGAAEKTDAAIIAAFHE 68
Query: 63 ESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIACEEALEE 122
++ V +KQFR + + Y YEL AGL+D A + E A E EE
Sbjct: 69 DTHKIVCIKQFRVPL---------------NDYVYELPAGLID-AGEDFEAAARRELKEE 112
Query: 123 CGYQISPKNLE-TIGQFYSATGLSGSLQTLYYAEVHKNLKVSKGGGIDTERIEVLFLERS 181
G + N E + + Y++ G++ + + L SK D E IEV+ L ++
Sbjct: 113 TGLTLLDINYEKSKKRVYASAGMTDESAAMIFCTCSGTL--SKDYLEDDEDIEVMLLSKN 170
Query: 182 KALDFIMD 189
+ + D
Sbjct: 171 EVKKLLND 178
>ref|NP_343553.1| (NC_002754) Conserved hypothetical protein [Sulfolobus
solfataricus]
gb|AAK42343.1| (AE006823) Conserved hypothetical protein [Sulfolobus solfataricus]
Length = 166
Score = 41.6 bits (96), Expect = 0.006
Identities = 28/100 (28%), Positives = 49/100 (49%), Gaps = 18/100 (18%)
Query: 56 AVLLYEKESDCFVIVKQFRPAIYARRFHFKCDQDQTIDGYTYELCAGLVDKANKSLEEIA 115
+V++ K ++ ++++QFRP ID + YEL AG +++ L A
Sbjct: 34 SVVIIPKINNEIILIRQFRP---------------VIDKWIYELPAGTIEEGEDPL-NTA 77
Query: 116 CEEALEECGYQISPKNLETIGQFYSATGLSGSLQTLYYAE 155
E +EE GY+ ++ I FY++ G++ LY AE
Sbjct: 78 NRELIEEIGYEAG--KMKEIISFYASPGITTEYMRLYLAE 115
Database: /home/scwang/download_20020708_db/nr
Posted date: Aug 7, 2002 12:55 PM
Number of letters in database: 324,149,939
Number of sequences in database: 1,026,957
Lambda K H
0.319 0.135 0.393
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 128,754,802
Number of Sequences: 1026957
Number of extensions: 4930273
Number of successful extensions: 10555
Number of sequences better than 1.0e-02: 27
Number of HSP's better than 0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 10508
Number of HSP's gapped (non-prelim): 31
length of query: 212
length of database: 324,149,939
effective HSP length: 116
effective length of query: 96
effective length of database: 205,022,927
effective search space: 19682200992
effective search space used: 19682200992
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 95 (41.2 bits)