BLASTP 2.2.1 [Apr-13-2001]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= gi|15645190|ref|NP_207360.1| hypothetical protein
[Helicobacter pylori 26695]
(215 letters)
Database: /home/scwang/download_20020708_db/nr
1,026,957 sequences; 324,149,939 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_207360.1| (NC_000915) hypothetical protein [Helicoba... 435 e-121
ref|NP_223230.1| (NC_000921) putative [Helicobacter pylori ... 433 e-121
sp|P46928|YKGB_PASHA Hypothetical 23.0 kDa protein in PURT/... 155 2e-37
ref|NP_438387.1| (NC_000907) conserved hypothetical transme... 153 1e-36
ref|NP_459558.1| (NC_003197) putative inner membrane protei... 135 4e-31
ref|NP_455149.1| (NC_003198) putative membrane protein [Sal... 133 1e-30
ref|NP_414835.1| (NC_000913) orf, hypothetical protein [Esc... 131 5e-30
sp|P75685|YKGB_ECOLI Hypothetical protein ykgB 130 8e-30
ref|NP_286027.1| (NC_002655) orf, hypothetical protein [Esc... 130 1e-29
gb|AAB18029.1| (U73857) similar to H. influenzae protein HI... 103 2e-21
gb|AAK62495.1| (AY034092) MC21 [Micrococcus sp. 28] 66 2e-10
ref|NP_231227.1| (NC_002505) conserved hypothetical protein... 54 2e-06
>ref|NP_207360.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
pir||E64590 hypothetical protein HP0565 - Helicobacter pylori (strain 26695)
gb|AAD07629.1| (AE000570) H. pylori predicted coding region HP0565 [Helicobacter
pylori 26695]
Length = 215
Score = 435 bits (1119), Expect = e-121
Identities = 215/215 (100%), Positives = 215/215 (100%)
Query: 1 MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY
Sbjct: 1 MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
Query: 61 KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL
Sbjct: 61 KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180
GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF
Sbjct: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180
Query: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS
Sbjct: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
>ref|NP_223230.1| (NC_000921) putative [Helicobacter pylori J99]
pir||F71923 hypothetical protein jhp0512 - Helicobacter pylori (strain J99)
gb|AAD06088.1| (AE001484) putative [Helicobacter pylori J99]
Length = 215
Score = 433 bits (1114), Expect = e-121
Identities = 213/215 (99%), Positives = 214/215 (99%)
Query: 1 MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY
Sbjct: 1 MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
Query: 61 KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAE LGITIMILGILVLL
Sbjct: 61 KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEGLGITIMILGILVLL 120
Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180
GLWMPLMGV+GGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF
Sbjct: 121 GLWMPLMGVVGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180
Query: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS
Sbjct: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
>sp|P46928|YKGB_PASHA Hypothetical 23.0 kDa protein in PURT/MPA1 5'region
pir||B56691 mpa1 5'-region hypothetical protein - Pasteurella haemolytica
gb|AAB28916.1| (S68137) orf 5' of mpa1 [Mannheimia haemolytica]
Length = 211
Score = 155 bits (393), Expect = 2e-37
Identities = 85/194 (43%), Positives = 116/194 (58%), Gaps = 15/194 (7%)
Query: 10 VITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQ 69
++ +Q + IAI I+ +WIGGLK YEA+GIA FV+NSPFFS+MYK K
Sbjct: 14 IVAPMQRQFINFVRIAICIVMVWIGGLKVCQYEADGIAHFVSNSPFFSYMYK--KGPNLV 71
Query: 70 HKMSESQSMQEEMQDNPK---IVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPL 126
+ M+ + NP+ + +N EWHKEN TY + +G TI+ +G+L L G+W P+
Sbjct: 72 DDGTGKMVMEYTLHKNPEGKMVAKNIEWHKENGTYTASYIIGATIVTVGLLTLSGIWFPV 131
Query: 127 MGVIGGLLVAGMTITTLSFLFTTPEVFV----------NQHFPWLSGAGRLVVKDLALFA 176
G+ GGLL GM+I TLSF+ TTPE++V FP+LS GRL+VKD+ + A
Sbjct: 132 SGMAGGLLTFGMSIVTLSFMITTPEIWVPNLGGDMPTPAHGFPYLSAVGRLIVKDVIMMA 191
Query: 177 GGLFVAGFDAKRYL 190
GGL A A R L
Sbjct: 192 GGLVAAAECANRIL 205
>ref|NP_438387.1| (NC_000907) conserved hypothetical transmembrane protein
[Haemophilus influenzae Rd]
sp|P44577|YKGB_HAEIN Hypothetical protein HI0219
pir||E64145 hypothetical protein HI0219 - Haemophilus influenzae (strain Rd
KW20)
gb|AAC21887.1| (U32707) conserved hypothetical transmembrane protein [Haemophilus
influenzae Rd]
Length = 213
Score = 153 bits (386), Expect = 1e-36
Identities = 88/197 (44%), Positives = 116/197 (58%), Gaps = 15/197 (7%)
Query: 10 VITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQ 69
++ +Q + IAIFI+ WIGGLK YEA+GIA FV+NSPFFS+MY+ + P
Sbjct: 18 IVAPMQRQFINFIRIAIFIVMAWIGGLKVCQYEADGIAHFVSNSPFFSYMYE-KGPNLVP 76
Query: 70 HKMSESQSMQEEMQDNPK---IVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPL 126
+ E M+ + NP+ + +N EWHKEN TY + +G I+ +GIL L G+W
Sbjct: 77 NDKGELV-MEYTLHKNPEGKMVAKNIEWHKENGTYTASYIIGAIIVTVGILTLAGIWNAT 135
Query: 127 MGVIGGLLVAGMTITTLSFLFTTPEVFVNQ----------HFPWLSGAGRLVVKDLALFA 176
G+ GGLL GM+I TLSFL TTPE +V FP+LSG GRLV+KD+ + A
Sbjct: 136 AGLAGGLLTFGMSIVTLSFLITTPEAWVPNLGGDLPTPAYGFPYLSGVGRLVIKDIIMMA 195
Query: 177 GGLFVAGFDAKRYLEGK 193
GGL A A R L K
Sbjct: 196 GGLTAAAECANRILARK 212
>ref|NP_459558.1| (NC_003197) putative inner membrane protein [Salmonella typhimurium
LT2]
gb|AAL19517.1| (AE008722) putative inner membrane protein [Salmonella typhimurium
LT2]
Length = 186
Score = 135 bits (339), Expect = 4e-31
Identities = 78/189 (41%), Positives = 108/189 (56%), Gaps = 17/189 (8%)
Query: 8 LEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAY 67
L ++++ LG L+ ++I I+FIWIG LKFVPYEA+ I PFVANSPF SF Y+ + Y
Sbjct: 5 LRLLSQGDRLGLTLIRLSIAIVFIWIGLLKFVPYEADSITPFVANSPFMSFFYEHPE-EY 63
Query: 68 KQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLM 127
+QH E + EE + W N TY ++ LG+ +I+ LVL +
Sbjct: 64 RQHLTHEGELKPEE----------RAWQTANNTYAFSDGLGVVELIIAALVLANPVSRWL 113
Query: 128 GVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAGGLFV 181
G+ GG+L TLSFL TTPEV+V + FP+LSGAGRLV+KD + AG + +
Sbjct: 114 GLAGGVLAFLTPFVTLSFLITTPEVWVMPLGDAHYGFPYLSGAGRLVLKDTLMLAGAVMI 173
Query: 182 AGFDAKRYL 190
A+ L
Sbjct: 174 MADSARSLL 182
>ref|NP_455149.1| (NC_003198) putative membrane protein [Salmonella enterica subsp.
enterica serovar Typhi]
emb|CAD05048.1| (AL627267) putative membrane protein [Salmonella enterica subsp.
enterica serovar Typhi]
Length = 186
Score = 133 bits (335), Expect = 1e-30
Identities = 77/189 (40%), Positives = 107/189 (55%), Gaps = 17/189 (8%)
Query: 8 LEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAY 67
L ++++ LG L+ ++I I+FIWIG LKFVPYEA+ I PFVANSPF SF Y+ + Y
Sbjct: 5 LRLLSQGDRLGLTLIRLSIAIVFIWIGLLKFVPYEADSITPFVANSPFMSFFYEHPE-EY 63
Query: 68 KQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLM 127
+QH E + EE + W N TY ++ LG+ +I+ LVL +
Sbjct: 64 RQHLTHEGELKPEE----------RAWQTANNTYAFSDGLGVVELIIAALVLANPVSRWL 113
Query: 128 GVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAGGLFV 181
G+ GG+L TLSFL TTPE +V + FP+LSGAGRLV+KD + AG + +
Sbjct: 114 GLAGGVLAFLTPFVTLSFLITTPEAWVMPLGDAHYGFPYLSGAGRLVLKDTLMLAGAVMI 173
Query: 182 AGFDAKRYL 190
A+ L
Sbjct: 174 MADSARSLL 182
>ref|NP_414835.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
pir||E64756 membrane protein ykgB - Escherichia coli
gb|AAC73404.1| (AE000137) orf, hypothetical protein [Escherichia coli K12]
Length = 200
Score = 131 bits (330), Expect = 5e-30
Identities = 78/214 (36%), Positives = 119/214 (55%), Gaps = 21/214 (9%)
Query: 1 MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
M ++ L ++++ +G L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP SF Y
Sbjct: 1 MFTMEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFY 60
Query: 61 KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
+ + YKQ+ E + E + W N TY + LG+ +I+ +LVL
Sbjct: 61 EHPED-YKQYLTHEGEYKPEA----------RAWQTANNTYGFSNGLGVVEVIIALLVLA 109
Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLAL 174
+G++GGL+ + TLSFL TTPE +V + FP+LSGAGRLV+KD +
Sbjct: 110 NPVNRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLM 169
Query: 175 FAGGLFVAGFDAKRYLEGKGFCLMDRSSVGIKTK 208
AG + + A+ L+ + + SS +KT+
Sbjct: 170 LAGAVMIMADSAREILKQRS----NESSSTLKTE 199
>sp|P75685|YKGB_ECOLI Hypothetical protein ykgB
Length = 197
Score = 130 bits (328), Expect = 8e-30
Identities = 77/211 (36%), Positives = 118/211 (55%), Gaps = 21/211 (9%)
Query: 4 LKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFE 63
++ L ++++ +G L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP SF Y+
Sbjct: 1 MEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFYEHP 60
Query: 64 KPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLW 123
+ YKQ+ E + E + W N TY + LG+ +I+ +LVL
Sbjct: 61 ED-YKQYLTHEGEYKPEA----------RAWQTANNTYGFSNGLGVVEVIIALLVLANPV 109
Query: 124 MPLMGVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAG 177
+G++GGL+ + TLSFL TTPE +V + FP+LSGAGRLV+KD + AG
Sbjct: 110 NRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLMLAG 169
Query: 178 GLFVAGFDAKRYLEGKGFCLMDRSSVGIKTK 208
+ + A+ L+ + + SS +KT+
Sbjct: 170 AVMIMADSAREILKQRS----NESSSTLKTE 196
>ref|NP_286027.1| (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7
EDL933]
ref|NP_308366.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
gb|AAG54635.1|AE005208_3 (AE005208) orf, hypothetical protein [Escherichia coli O157:H7
EDL933]
dbj|BAB33762.1| (AP002551) hypothetical protein [Escherichia coli O157:H7]
Length = 197
Score = 130 bits (326), Expect = 1e-29
Identities = 73/194 (37%), Positives = 110/194 (56%), Gaps = 17/194 (8%)
Query: 4 LKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFE 63
++ L ++++ +G L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP SF Y+
Sbjct: 1 MEKYLHLLSRGDKIGLALIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFYEHP 60
Query: 64 KPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLW 123
+ YKQ+ E + E + W N TY + LG+ +I+ +LVL
Sbjct: 61 ED-YKQYLTHEGEYKPEA----------RAWQSANNTYGFSNGLGVVEVIIALLVLANPV 109
Query: 124 MPLMGVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAG 177
+G++GGL+ + TLSFL TTPE +V + FP+LSGAGRLV+KD + AG
Sbjct: 110 NRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLMLAG 169
Query: 178 GLFVAGFDAKRYLE 191
+ + A+ L+
Sbjct: 170 AVMIMADSAREILK 183
>gb|AAB18029.1| (U73857) similar to H. influenzae protein HI0219 [Escherichia coli]
Length = 163
Score = 103 bits (256), Expect = 2e-21
Identities = 57/151 (37%), Positives = 85/151 (55%), Gaps = 11/151 (7%)
Query: 1 MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
M ++ L ++++ +G L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP SF Y
Sbjct: 1 MFTMEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFY 60
Query: 61 KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
+ + YKQ+ E + E + W N TY + LG+ +I+ +LVL
Sbjct: 61 EHPED-YKQYLTHEGEYKPEA----------RAWQTANNTYGFSNGLGVVEVIIALLVLA 109
Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPE 151
+G++GGL+ + TLSFL TTPE
Sbjct: 110 NPVNRWLGLLGGLMAFTTPLVTLSFLITTPE 140
>gb|AAK62495.1| (AY034092) MC21 [Micrococcus sp. 28]
Length = 235
Score = 66.2 bits (160), Expect = 2e-10
Identities = 48/172 (27%), Positives = 72/172 (40%), Gaps = 36/172 (20%)
Query: 12 TKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQHK 71
T ++ +G ++ ++ +WIG LKF YE E I P V +SP FS + K
Sbjct: 89 TGIEQMGNSVLRYSLVTNLVWIGSLKFQDYEMENIRPLVTSSPLFSGVLK---------- 138
Query: 72 MSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLMGVIG 131
K+ E K A+ +G+ + +G L+ P +G
Sbjct: 139 ---------------KLGEKK----------TAQLIGVAEIGMGALIAAKPLAPRASALG 173
Query: 132 GLLVAGMTITTLSFLFTTPEVFVNQHFPW-LSGAGRLVVKDLALFAGGLFVA 182
L GM +TTLSF+ TTP V H LS G+ ++KD L + A
Sbjct: 174 SLGAVGMFVTTLSFMATTPGVQQENHGKTKLSMVGQFLLKDTVLLGASILTA 225
>ref|NP_231227.1| (NC_002505) conserved hypothetical protein [Vibrio cholerae]
pir||F82180 conserved hypothetical protein VC1587 [imported] - Vibrio cholerae
(group O1 strain N16961)
gb|AAF94741.1| (AE004236) conserved hypothetical protein [Vibrio cholerae]
Length = 146
Score = 53.5 bits (127), Expect = 2e-06
Identities = 40/136 (29%), Positives = 62/136 (45%), Gaps = 36/136 (26%)
Query: 19 GYLMHI-AIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQHKMSESQS 77
GYL+ + + +I IW+G KF P EA+ I P V N P ++Y+F S
Sbjct: 14 GYLIGVLGVSLILIWLGIYKFTPTEAKLIEPLVLNHPLMGWIYQF-------------LS 60
Query: 78 MQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLMGVIGGLLVAG 137
+Q V+ +G T +I+GI +L+GL + G+
Sbjct: 61 IQ----------------------AVSNLIGATEIIVGIGLLIGLRSSKVAYYSGIASMV 98
Query: 138 MTITTLSFLFTTPEVF 153
+ I+TLSFL TTP+ +
Sbjct: 99 IFISTLSFLITTPDTW 114
Database: /home/scwang/download_20020708_db/nr
Posted date: Aug 7, 2002 12:55 PM
Number of letters in database: 324,149,939
Number of sequences in database: 1,026,957
Lambda K H
0.326 0.142 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 136,177,885
Number of Sequences: 1026957
Number of extensions: 5605101
Number of successful extensions: 21484
Number of sequences better than 1.0e-02: 12
Number of HSP's better than 0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 21456
Number of HSP's gapped (non-prelim): 14
length of query: 215
length of database: 324,149,939
effective HSP length: 116
effective length of query: 99
effective length of database: 205,022,927
effective search space: 20297269773
effective search space used: 20297269773
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 95 (41.2 bits)