BLASTP 2.2.1 [Apr-13-2001]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|15645190|ref|NP_207360.1| hypothetical protein
[Helicobacter pylori 26695]
         (215 letters)

Database: /home/scwang/download_20020708_db/nr
           1,026,957 sequences; 324,149,939 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_207360.1|  (NC_000915) hypothetical protein [Helicoba...   435  e-121
ref|NP_223230.1|  (NC_000921) putative [Helicobacter pylori ...   433  e-121
sp|P46928|YKGB_PASHA  Hypothetical 23.0 kDa protein in PURT/...   155  2e-37
ref|NP_438387.1|  (NC_000907) conserved hypothetical transme...   153  1e-36
ref|NP_459558.1|  (NC_003197) putative inner membrane protei...   135  4e-31
ref|NP_455149.1|  (NC_003198) putative membrane protein [Sal...   133  1e-30
ref|NP_414835.1|  (NC_000913) orf, hypothetical protein [Esc...   131  5e-30
sp|P75685|YKGB_ECOLI  Hypothetical protein ykgB                   130  8e-30
ref|NP_286027.1|  (NC_002655) orf, hypothetical protein [Esc...   130  1e-29
gb|AAB18029.1|  (U73857) similar to H. influenzae protein HI...   103  2e-21
gb|AAK62495.1|  (AY034092) MC21 [Micrococcus sp. 28]               66  2e-10
ref|NP_231227.1|  (NC_002505) conserved hypothetical protein...    54  2e-06
>ref|NP_207360.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
 pir||E64590 hypothetical protein HP0565 - Helicobacter pylori (strain 26695)
 gb|AAD07629.1| (AE000570) H. pylori predicted coding region HP0565 [Helicobacter
           pylori 26695]
          Length = 215

 Score =  435 bits (1119), Expect = e-121
 Identities = 215/215 (100%), Positives = 215/215 (100%)

Query: 1   MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
           MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY
Sbjct: 1   MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60

Query: 61  KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
           KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL
Sbjct: 61  KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120

Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180
           GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF
Sbjct: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180

Query: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
           VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS
Sbjct: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
>ref|NP_223230.1| (NC_000921) putative [Helicobacter pylori J99]
 pir||F71923 hypothetical protein jhp0512 - Helicobacter pylori (strain J99)
 gb|AAD06088.1| (AE001484) putative [Helicobacter pylori J99]
          Length = 215

 Score =  433 bits (1114), Expect = e-121
 Identities = 213/215 (99%), Positives = 214/215 (99%)

Query: 1   MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
           MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY
Sbjct: 1   MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60

Query: 61  KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
           KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAE LGITIMILGILVLL
Sbjct: 61  KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEGLGITIMILGILVLL 120

Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180
           GLWMPLMGV+GGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF
Sbjct: 121 GLWMPLMGVVGGLLVAGMTITTLSFLFTTPEVFVNQHFPWLSGAGRLVVKDLALFAGGLF 180

Query: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
           VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS
Sbjct: 181 VAGFDAKRYLEGKGFCLMDRSSVGIKTKCSSGCCS 215
>sp|P46928|YKGB_PASHA Hypothetical 23.0 kDa protein in PURT/MPA1 5'region
 pir||B56691 mpa1 5'-region hypothetical protein - Pasteurella haemolytica
 gb|AAB28916.1| (S68137) orf 5' of mpa1 [Mannheimia haemolytica]
          Length = 211

 Score =  155 bits (393), Expect = 2e-37
 Identities = 85/194 (43%), Positives = 116/194 (58%), Gaps = 15/194 (7%)

Query: 10  VITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQ 69
           ++  +Q      + IAI I+ +WIGGLK   YEA+GIA FV+NSPFFS+MYK  K     
Sbjct: 14  IVAPMQRQFINFVRIAICIVMVWIGGLKVCQYEADGIAHFVSNSPFFSYMYK--KGPNLV 71

Query: 70  HKMSESQSMQEEMQDNPK---IVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPL 126
              +    M+  +  NP+   + +N EWHKEN TY  +  +G TI+ +G+L L G+W P+
Sbjct: 72  DDGTGKMVMEYTLHKNPEGKMVAKNIEWHKENGTYTASYIIGATIVTVGLLTLSGIWFPV 131

Query: 127 MGVIGGLLVAGMTITTLSFLFTTPEVFV----------NQHFPWLSGAGRLVVKDLALFA 176
            G+ GGLL  GM+I TLSF+ TTPE++V             FP+LS  GRL+VKD+ + A
Sbjct: 132 SGMAGGLLTFGMSIVTLSFMITTPEIWVPNLGGDMPTPAHGFPYLSAVGRLIVKDVIMMA 191

Query: 177 GGLFVAGFDAKRYL 190
           GGL  A   A R L
Sbjct: 192 GGLVAAAECANRIL 205
>ref|NP_438387.1| (NC_000907) conserved hypothetical transmembrane protein
           [Haemophilus influenzae Rd]
 sp|P44577|YKGB_HAEIN Hypothetical protein HI0219
 pir||E64145 hypothetical protein HI0219 - Haemophilus influenzae (strain Rd
           KW20)
 gb|AAC21887.1| (U32707) conserved hypothetical transmembrane protein [Haemophilus
           influenzae Rd]
          Length = 213

 Score =  153 bits (386), Expect = 1e-36
 Identities = 88/197 (44%), Positives = 116/197 (58%), Gaps = 15/197 (7%)

Query: 10  VITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQ 69
           ++  +Q      + IAIFI+  WIGGLK   YEA+GIA FV+NSPFFS+MY+ + P    
Sbjct: 18  IVAPMQRQFINFIRIAIFIVMAWIGGLKVCQYEADGIAHFVSNSPFFSYMYE-KGPNLVP 76

Query: 70  HKMSESQSMQEEMQDNPK---IVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPL 126
           +   E   M+  +  NP+   + +N EWHKEN TY  +  +G  I+ +GIL L G+W   
Sbjct: 77  NDKGELV-MEYTLHKNPEGKMVAKNIEWHKENGTYTASYIIGAIIVTVGILTLAGIWNAT 135

Query: 127 MGVIGGLLVAGMTITTLSFLFTTPEVFVNQ----------HFPWLSGAGRLVVKDLALFA 176
            G+ GGLL  GM+I TLSFL TTPE +V             FP+LSG GRLV+KD+ + A
Sbjct: 136 AGLAGGLLTFGMSIVTLSFLITTPEAWVPNLGGDLPTPAYGFPYLSGVGRLVIKDIIMMA 195

Query: 177 GGLFVAGFDAKRYLEGK 193
           GGL  A   A R L  K
Sbjct: 196 GGLTAAAECANRILARK 212
>ref|NP_459558.1| (NC_003197) putative inner membrane protein [Salmonella typhimurium
           LT2]
 gb|AAL19517.1| (AE008722) putative inner membrane protein [Salmonella typhimurium
           LT2]
          Length = 186

 Score =  135 bits (339), Expect = 4e-31
 Identities = 78/189 (41%), Positives = 108/189 (56%), Gaps = 17/189 (8%)

Query: 8   LEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAY 67
           L ++++   LG  L+ ++I I+FIWIG LKFVPYEA+ I PFVANSPF SF Y+  +  Y
Sbjct: 5   LRLLSQGDRLGLTLIRLSIAIVFIWIGLLKFVPYEADSITPFVANSPFMSFFYEHPE-EY 63

Query: 68  KQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLM 127
           +QH   E +   EE          + W   N TY  ++ LG+  +I+  LVL       +
Sbjct: 64  RQHLTHEGELKPEE----------RAWQTANNTYAFSDGLGVVELIIAALVLANPVSRWL 113

Query: 128 GVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAGGLFV 181
           G+ GG+L       TLSFL TTPEV+V      +  FP+LSGAGRLV+KD  + AG + +
Sbjct: 114 GLAGGVLAFLTPFVTLSFLITTPEVWVMPLGDAHYGFPYLSGAGRLVLKDTLMLAGAVMI 173

Query: 182 AGFDAKRYL 190
               A+  L
Sbjct: 174 MADSARSLL 182
>ref|NP_455149.1| (NC_003198) putative membrane protein [Salmonella enterica subsp.
           enterica serovar Typhi]
 emb|CAD05048.1| (AL627267) putative membrane protein [Salmonella enterica subsp.
           enterica serovar Typhi]
          Length = 186

 Score =  133 bits (335), Expect = 1e-30
 Identities = 77/189 (40%), Positives = 107/189 (55%), Gaps = 17/189 (8%)

Query: 8   LEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAY 67
           L ++++   LG  L+ ++I I+FIWIG LKFVPYEA+ I PFVANSPF SF Y+  +  Y
Sbjct: 5   LRLLSQGDRLGLTLIRLSIAIVFIWIGLLKFVPYEADSITPFVANSPFMSFFYEHPE-EY 63

Query: 68  KQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLM 127
           +QH   E +   EE          + W   N TY  ++ LG+  +I+  LVL       +
Sbjct: 64  RQHLTHEGELKPEE----------RAWQTANNTYAFSDGLGVVELIIAALVLANPVSRWL 113

Query: 128 GVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAGGLFV 181
           G+ GG+L       TLSFL TTPE +V      +  FP+LSGAGRLV+KD  + AG + +
Sbjct: 114 GLAGGVLAFLTPFVTLSFLITTPEAWVMPLGDAHYGFPYLSGAGRLVLKDTLMLAGAVMI 173

Query: 182 AGFDAKRYL 190
               A+  L
Sbjct: 174 MADSARSLL 182
>ref|NP_414835.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
 pir||E64756 membrane protein ykgB - Escherichia coli
 gb|AAC73404.1| (AE000137) orf, hypothetical protein [Escherichia coli K12]
          Length = 200

 Score =  131 bits (330), Expect = 5e-30
 Identities = 78/214 (36%), Positives = 119/214 (55%), Gaps = 21/214 (9%)

Query: 1   MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
           M  ++  L ++++   +G  L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP  SF Y
Sbjct: 1   MFTMEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFY 60

Query: 61  KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
           +  +  YKQ+   E +   E           + W   N TY  +  LG+  +I+ +LVL 
Sbjct: 61  EHPED-YKQYLTHEGEYKPEA----------RAWQTANNTYGFSNGLGVVEVIIALLVLA 109

Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLAL 174
                 +G++GGL+     + TLSFL TTPE +V      +  FP+LSGAGRLV+KD  +
Sbjct: 110 NPVNRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLM 169

Query: 175 FAGGLFVAGFDAKRYLEGKGFCLMDRSSVGIKTK 208
            AG + +    A+  L+ +     + SS  +KT+
Sbjct: 170 LAGAVMIMADSAREILKQRS----NESSSTLKTE 199
>sp|P75685|YKGB_ECOLI Hypothetical protein ykgB
          Length = 197

 Score =  130 bits (328), Expect = 8e-30
 Identities = 77/211 (36%), Positives = 118/211 (55%), Gaps = 21/211 (9%)

Query: 4   LKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFE 63
           ++  L ++++   +G  L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP  SF Y+  
Sbjct: 1   MEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFYEHP 60

Query: 64  KPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLW 123
           +  YKQ+   E +   E           + W   N TY  +  LG+  +I+ +LVL    
Sbjct: 61  ED-YKQYLTHEGEYKPEA----------RAWQTANNTYGFSNGLGVVEVIIALLVLANPV 109

Query: 124 MPLMGVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAG 177
              +G++GGL+     + TLSFL TTPE +V      +  FP+LSGAGRLV+KD  + AG
Sbjct: 110 NRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLMLAG 169

Query: 178 GLFVAGFDAKRYLEGKGFCLMDRSSVGIKTK 208
            + +    A+  L+ +     + SS  +KT+
Sbjct: 170 AVMIMADSAREILKQRS----NESSSTLKTE 196
>ref|NP_286027.1| (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7
           EDL933]
 ref|NP_308366.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
 gb|AAG54635.1|AE005208_3 (AE005208) orf, hypothetical protein [Escherichia coli O157:H7
           EDL933]
 dbj|BAB33762.1| (AP002551) hypothetical protein [Escherichia coli O157:H7]
          Length = 197

 Score =  130 bits (326), Expect = 1e-29
 Identities = 73/194 (37%), Positives = 110/194 (56%), Gaps = 17/194 (8%)

Query: 4   LKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFE 63
           ++  L ++++   +G  L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP  SF Y+  
Sbjct: 1   MEKYLHLLSRGDKIGLALIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFYEHP 60

Query: 64  KPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLW 123
           +  YKQ+   E +   E           + W   N TY  +  LG+  +I+ +LVL    
Sbjct: 61  ED-YKQYLTHEGEYKPEA----------RAWQSANNTYGFSNGLGVVEVIIALLVLANPV 109

Query: 124 MPLMGVIGGLLVAGMTITTLSFLFTTPEVFV------NQHFPWLSGAGRLVVKDLALFAG 177
              +G++GGL+     + TLSFL TTPE +V      +  FP+LSGAGRLV+KD  + AG
Sbjct: 110 NRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLMLAG 169

Query: 178 GLFVAGFDAKRYLE 191
            + +    A+  L+
Sbjct: 170 AVMIMADSAREILK 183
>gb|AAB18029.1| (U73857) similar to H. influenzae protein HI0219 [Escherichia coli]
          Length = 163

 Score =  103 bits (256), Expect = 2e-21
 Identities = 57/151 (37%), Positives = 85/151 (55%), Gaps = 11/151 (7%)

Query: 1   MQALKSLLEVITKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMY 60
           M  ++  L ++++   +G  L+ ++I I+F+WIG LKFVPYEA+ I PFVANSP  SF Y
Sbjct: 1   MFTMEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFY 60

Query: 61  KFEKPAYKQHKMSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLL 120
           +  +  YKQ+   E +   E           + W   N TY  +  LG+  +I+ +LVL 
Sbjct: 61  EHPED-YKQYLTHEGEYKPEA----------RAWQTANNTYGFSNGLGVVEVIIALLVLA 109

Query: 121 GLWMPLMGVIGGLLVAGMTITTLSFLFTTPE 151
                 +G++GGL+     + TLSFL TTPE
Sbjct: 110 NPVNRWLGLLGGLMAFTTPLVTLSFLITTPE 140
>gb|AAK62495.1| (AY034092) MC21 [Micrococcus sp. 28]
          Length = 235

 Score = 66.2 bits (160), Expect = 2e-10
 Identities = 48/172 (27%), Positives = 72/172 (40%), Gaps = 36/172 (20%)

Query: 12  TKLQNLGGYLMHIAIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQHK 71
           T ++ +G  ++  ++    +WIG LKF  YE E I P V +SP FS + K          
Sbjct: 89  TGIEQMGNSVLRYSLVTNLVWIGSLKFQDYEMENIRPLVTSSPLFSGVLK---------- 138

Query: 72  MSESQSMQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLMGVIG 131
                          K+ E K           A+ +G+  + +G L+      P    +G
Sbjct: 139 ---------------KLGEKK----------TAQLIGVAEIGMGALIAAKPLAPRASALG 173

Query: 132 GLLVAGMTITTLSFLFTTPEVFVNQHFPW-LSGAGRLVVKDLALFAGGLFVA 182
            L   GM +TTLSF+ TTP V    H    LS  G+ ++KD  L    +  A
Sbjct: 174 SLGAVGMFVTTLSFMATTPGVQQENHGKTKLSMVGQFLLKDTVLLGASILTA 225
>ref|NP_231227.1| (NC_002505) conserved hypothetical protein [Vibrio cholerae]
 pir||F82180 conserved hypothetical protein VC1587 [imported] - Vibrio cholerae
           (group O1 strain N16961)
 gb|AAF94741.1| (AE004236) conserved hypothetical protein [Vibrio cholerae]
          Length = 146

 Score = 53.5 bits (127), Expect = 2e-06
 Identities = 40/136 (29%), Positives = 62/136 (45%), Gaps = 36/136 (26%)

Query: 19  GYLMHI-AIFIIFIWIGGLKFVPYEAEGIAPFVANSPFFSFMYKFEKPAYKQHKMSESQS 77
           GYL+ +  + +I IW+G  KF P EA+ I P V N P   ++Y+F              S
Sbjct: 14  GYLIGVLGVSLILIWLGIYKFTPTEAKLIEPLVLNHPLMGWIYQF-------------LS 60

Query: 78  MQEEMQDNPKIVENKEWHKENRTYLVAEALGITIMILGILVLLGLWMPLMGVIGGLLVAG 137
           +Q                       V+  +G T +I+GI +L+GL    +    G+    
Sbjct: 61  IQ----------------------AVSNLIGATEIIVGIGLLIGLRSSKVAYYSGIASMV 98

Query: 138 MTITTLSFLFTTPEVF 153
           + I+TLSFL TTP+ +
Sbjct: 99  IFISTLSFLITTPDTW 114
  Database: /home/scwang/download_20020708_db/nr
    Posted date:  Aug 7, 2002 12:55 PM
  Number of letters in database: 324,149,939
  Number of sequences in database:  1,026,957
  
Lambda     K      H
   0.326    0.142    0.434 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 136,177,885
Number of Sequences: 1026957
Number of extensions: 5605101
Number of successful extensions: 21484
Number of sequences better than 1.0e-02: 12
Number of HSP's better than  0.0 without gapping: 11
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 21456
Number of HSP's gapped (non-prelim): 14
length of query: 215
length of database: 324,149,939
effective HSP length: 116
effective length of query: 99
effective length of database: 205,022,927
effective search space: 20297269773
effective search space used: 20297269773
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 95 (41.2 bits)