BLASTP 2.2.1 [Apr-13-2001]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|15645643|ref|NP_207819.1| hypothetical protein
[Helicobacter pylori 26695]
         (178 letters)

Database: /home/scwang/download_20020708_db/nr
           1,026,957 sequences; 324,149,939 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_207819.1|  (NC_000915) hypothetical protein [Helicoba...   354  3e-97
ref|NP_223114.1|  (NC_000921) putative [Helicobacter pylori ...   316  7e-86
ref|NP_347472.1|  (NC_003030) Putative beta-D-galactosidase ...    59  2e-08
ref|NP_346130.1|  (NC_003028) conserved hypothetical protein...    49  2e-05
ref|NP_246192.1|  (NC_002663) unknown [Pasteurella multocida...    46  2e-04
ref|NP_345785.1|  (NC_003028) conserved hypothetical protein...    45  3e-04
gb|AAK69522.1|AF282849_3  (AF282849) YiaL [Klebsiella oxytoca]     45  5e-04
ref|NP_462569.1|  (NC_003197) putative cytoplasmic protein [...    44  6e-04
ref|NP_461996.1|  (NC_003197) putative mannitol dehydrogenas...    44  8e-04
ref|NP_406511.1|  (NC_003143) conserved hypothetical protein...    43  0.001
ref|NP_458258.1|  (NC_003198) conserved hypothetical protein...    42  0.002
>ref|NP_207819.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
 pir||E64648 hypothetical protein HP1029 - Helicobacter pylori (strain 26695)
 gb|AAD08084.1| (AE000611) H. pylori predicted coding region HP1029 [Helicobacter
           pylori 26695]
          Length = 178

 Score =  354 bits (908), Expect = 3e-97
 Identities = 178/178 (100%), Positives = 178/178 (100%)

Query: 1   MAIFGELSSLGHLFKKTQELEILHEYLKEVMQKGSKANQRVLNLATNTEFQVPLGHGIFS 60
           MAIFGELSSLGHLFKKTQELEILHEYLKEVMQKGSKANQRVLNLATNTEFQVPLGHGIFS
Sbjct: 1   MAIFGELSSLGHLFKKTQELEILHEYLKEVMQKGSKANQRVLNLATNTEFQVPLGHGIFS 60

Query: 61  IEQSYCLEHAKESEKGFFESHKKYVDFQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVY 120
           IEQSYCLEHAKESEKGFFESHKKYVDFQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVY
Sbjct: 61  IEQSYCLEHAKESEKGFFESHKKYVDFQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVY 120

Query: 121 EPVSEASFLRLHAGMLAIFFENDAHALRFYGESFEKYREEPIFKAVVKAPKGLIKLKL 178
           EPVSEASFLRLHAGMLAIFFENDAHALRFYGESFEKYREEPIFKAVVKAPKGLIKLKL
Sbjct: 121 EPVSEASFLRLHAGMLAIFFENDAHALRFYGESFEKYREEPIFKAVVKAPKGLIKLKL 178
>ref|NP_223114.1| (NC_000921) putative [Helicobacter pylori J99]
 pir||A71939 hypothetical protein jhp0395 - Helicobacter pylori (strain J99)
 gb|AAD05970.1| (AE001473) putative [Helicobacter pylori J99]
          Length = 178

 Score =  316 bits (810), Expect = 7e-86
 Identities = 158/178 (88%), Positives = 165/178 (91%)

Query: 1   MAIFGELSSLGHLFKKTQELEILHEYLKEVMQKGSKANQRVLNLATNTEFQVPLGHGIFS 60
           MAIFGEL SLGHLFKKTQELEILH YL++VMQKGS+ANQRVLNLA NTEFQVPL HGIFS
Sbjct: 1   MAIFGELGSLGHLFKKTQELEILHGYLQDVMQKGSEANQRVLNLAINTEFQVPLEHGIFS 60

Query: 61  IEQSYCLEHAKESEKGFFESHKKYVDFQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVY 120
           IEQSYCLEHAKE EKGFFESH++YVDFQLI+KGVEGAK   IN+AVIK PYDEKRDLIVY
Sbjct: 61  IEQSYCLEHAKEGEKGFFESHRQYVDFQLIIKGVEGAKVADINRAVIKTPYDEKRDLIVY 120

Query: 121 EPVSEASFLRLHAGMLAIFFENDAHALRFYGESFEKYREEPIFKAVVKAPKGLIKLKL 178
           EPVSEASFL L AGMLAIF ENDAHALRFYGESFEKYREEPIFKAVVK PKGLIKLKL
Sbjct: 121 EPVSEASFLHLDAGMLAIFLENDAHALRFYGESFEKYREEPIFKAVVKMPKGLIKLKL 178
>ref|NP_347472.1| (NC_003030) Putative beta-D-galactosidase [Clostridium
           acetobutylicum]
 gb|AAK78812.1|AE007599_8 (AE007599) Putative beta-D-galactosidase [Clostridium
           acetobutylicum]
          Length = 152

 Score = 59.3 bits (142), Expect = 2e-08
 Identities = 38/119 (31%), Positives = 57/119 (46%), Gaps = 12/119 (10%)

Query: 35  SKANQRVLNLATNTEFQVPLGHGIFSIE--------QSYCLEHAKESEKGFFESHKKYVD 86
           +K  +R      NT+ Q  L  G + I+        QSY     K+  +  FESH+KY+D
Sbjct: 16  NKEMERAFEFLKNTDIQ-KLSDGKYEIDSDNVYASVQSY---ETKDKSEKKFESHEKYID 71

Query: 87  FQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVYEPVSEASFLRLHAGMLAIFFENDAH 145
            Q IVKG E  +   I    ++  Y +++D+I Y+    +S + L      IFF ND H
Sbjct: 72  IQYIVKGKEFIEWSPIQNLSVEEAYSDEKDVIFYKDGKLSSKINLEDNYFCIFFPNDGH 130
>ref|NP_346130.1| (NC_003028) conserved hypothetical protein [Streptococcus
           pneumoniae TIGR4]
 ref|NP_359128.1| (NC_003098) Hypothetical protein [Streptococcus pneumoniae R6]
 gb|AAC44392.1| (U43526) ORF-1 [Streptococcus pneumoniae]
 gb|AAK75770.1| (AE007462) conserved hypothetical protein [Streptococcus pneumoniae
           TIGR4]
 gb|AAL00339.1| (AE008522) Hypothetical protein [Streptococcus pneumoniae R6]
          Length = 150

 Score = 49.3 bits (116), Expect = 2e-05
 Identities = 29/104 (27%), Positives = 48/104 (45%), Gaps = 8/104 (7%)

Query: 66  CLEHAKESEKG-FFESHKKYVDFQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVYEPVS 124
           C  +  + + G FFE+H+KY+D  L+++  E           +   YDE++D+ +Y    
Sbjct: 50  CFTYLADGQAGAFFETHQKYLDIHLVLENEEAMAVTSPENVSVTQEYDEEKDIELYTGKV 109

Query: 125 EASFLRLHAGMLAIFFENDAHALRFYGESFEKYREEPIFKAVVK 168
           E   + L AG   I F  D H  +       +  +EP+ K V K
Sbjct: 110 E-QLVHLRAGECLITFPEDLHQPKV------RINDEPVKKVVFK 146
>ref|NP_246192.1| (NC_002663) unknown [Pasteurella multocida]
 gb|AAK03339.1| (AE006164) unknown [Pasteurella multocida]
          Length = 158

 Score = 45.8 bits (107), Expect = 2e-04
 Identities = 24/68 (35%), Positives = 34/68 (49%), Gaps = 1/68 (1%)

Query: 79  ESHKKYVDFQLIVKGVEG-AKAVGINQAVIKNPYDEKRDLIVYEPVSEASFLRLHAGMLA 137
           E H+ Y+D Q +  G E    A+ +    I  PYD  RD++ Y+ V     L +  G  A
Sbjct: 63  EVHRHYIDVQYLHSGFERIGVAIDLGNNDIAKPYDATRDILFYQNVENEVQLIMRPGHFA 122

Query: 138 IFFENDAH 145
           IFF +D H
Sbjct: 123 IFFPSDVH 130
>ref|NP_345785.1| (NC_003028) conserved hypothetical protein [Streptococcus
           pneumoniae TIGR4]
 gb|AAK75425.1| (AE007431) conserved hypothetical protein [Streptococcus pneumoniae
           TIGR4]
          Length = 152

 Score = 45.4 bits (106), Expect = 3e-04
 Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 8/104 (7%)

Query: 66  CLEHAKESEKG-FFESHKKYVDFQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVYEPVS 124
           C+ +  +   G  FE+HKKY+D  ++V+  E        +A  + P+ E++D+  Y+   
Sbjct: 50  CMTYLADGVPGDIFETHKKYLDIHIVVENTEKMAVTSPVRAQSRVPFSEEKDIAFYDS-K 108

Query: 125 EASFLRLHAGMLAIFFENDAHALRFYGESFEKYREEPIFKAVVK 168
           +   + L  G + + FE D H  + +        +E + K V+K
Sbjct: 109 DYQIVELLPGNMLVTFEEDLHQPKIH------CNDETVKKLVIK 146
>gb|AAK69522.1|AF282849_3 (AF282849) YiaL [Klebsiella oxytoca]
          Length = 154

 Score = 44.7 bits (104), Expect = 5e-04
 Identities = 38/138 (27%), Positives = 55/138 (39%), Gaps = 12/138 (8%)

Query: 37  ANQRVLNLATNTEFQVPLGHGIFSIEQSYCLEHA-----KESEKGFFESHKKYVDFQLIV 91
           A +R L+    T+F   L  G+  I+             +++ +   E H++Y+D Q + 
Sbjct: 17  AIERALDFLRTTDFHA-LAPGVVEIDGQNIFAQVIDLTTRDAAENRPEVHRRYLDIQFLA 75

Query: 92  KGVEGAK-AVGINQAVIKNPYDEKRDLIVYEPVSEASFLRLHAGMLAIFFENDAHALRFY 150
            G E    A+      I     E+RD+I Y      SF  +  G  AIFF  D H     
Sbjct: 76  SGEEKIGIAIDTGNNQISESLLEQRDIIFYHDSEHESFFEMTPGNYAIFFPQDVHR---- 131

Query: 151 GESFEKYREEPIFKAVVK 168
                K    PI K VVK
Sbjct: 132 -PGCNKTVATPIRKIVVK 148
>ref|NP_462569.1| (NC_003197) putative cytoplasmic protein [Salmonella typhimurium
           LT2]
 gb|AAL22528.1| (AE008870) putative cytoplasmic protein [Salmonella typhimurium
           LT2]
          Length = 154

 Score = 44.3 bits (103), Expect = 6e-04
 Identities = 37/138 (26%), Positives = 58/138 (41%), Gaps = 12/138 (8%)

Query: 37  ANQRVLNLATNTEFQVPLGHGIFSIEQSYCLEH-----AKESEKGFFESHKKYVDFQLIV 91
           A ++ L+   NT+F+  L  G+  I+             +++ +   E H++Y+D Q + 
Sbjct: 17  AIEQALDFLRNTDFRT-LEPGVVEIDGKNIFAQIIDMTTRDAAENRPEVHRRYLDIQFLA 75

Query: 92  KGVEG-AKAVGINQAVIKNPYDEKRDLIVYEPVSEASFLRLHAGMLAIFFENDAHALRFY 150
            G E    A+      I     E+RD+I Y      SF+ +  G  A+FF  D H     
Sbjct: 76  WGEEKIGVAIDTGNNQISESLLEQRDIIFYHDSEHESFIEMIPGSYALFFPQDVHR---- 131

Query: 151 GESFEKYREEPIFKAVVK 168
                K    PI K VVK
Sbjct: 132 -PGCNKSIATPIRKIVVK 148
>ref|NP_461996.1| (NC_003197) putative mannitol dehydrogenase [Salmonella typhimurium
           LT2]
 gb|AAL21955.1| (AE008841) putative mannitol dehydrogenase [Salmonella typhimurium
           LT2]
          Length = 157

 Score = 43.9 bits (102), Expect = 8e-04
 Identities = 33/121 (27%), Positives = 51/121 (41%), Gaps = 8/121 (6%)

Query: 53  PLGHGIFSIEQSYCLEHAKESEKGF---FESHKKYVDFQLIVKGVEGAKAVGINQAVIKN 109
           P G    S  +S+ +    E+        E HKKY+D Q+++ G E           IK 
Sbjct: 34  PAGRYELSFPESFLMISEGETHSSLNRKAELHKKYIDVQILLSGYEEIGYSNKIDTRIKE 93

Query: 110 PYDEKRDLIVYEPVSEASFLRLHAGMLAIFFENDAHALRFYGESFEKYREEPIFKAVVKA 169
                 D+I  E V+   F+ L+ G  A+F+ N  H          + +  P+ KA+VK 
Sbjct: 94  LEHLPDDIIFPECVANEQFVTLNPGDFALFYPNQVHR-----PLCTRGKPAPVKKAIVKI 148

Query: 170 P 170
           P
Sbjct: 149 P 149
>ref|NP_406511.1| (NC_003143) conserved hypothetical protein [Yersinia pestis]
 emb|CAC92262.1| (AJ414154) conserved hypothetical protein [Yersinia pestis]
          Length = 151

 Score = 43.1 bits (100), Expect = 0.001
 Identities = 23/74 (31%), Positives = 36/74 (48%), Gaps = 2/74 (2%)

Query: 72  ESEKGFFESHKKYVDFQLIVKGVEGAKAVGINQAVIKNPYDEKRDLIVYEPVSEASFLRL 131
           ES+K   E H+ Y D QL++ G+EG +   +       PY    D  +   + + S LR+
Sbjct: 54  ESKKA--ELHRTYADVQLLISGIEGIEYSTLTPTEHLEPYHPDDDYQLIADIPDKSQLRM 111

Query: 132 HAGMLAIFFENDAH 145
             GM A+F   + H
Sbjct: 112 LPGMFAVFLPGEPH 125
>ref|NP_458258.1| (NC_003198) conserved hypothetical protein [Salmonella enterica
           subsp. enterica serovar Typhi]
 emb|CAD07959.1| (AL627281) conserved hypothetical protein [Salmonella enterica
           subsp. enterica serovar Typhi]
          Length = 154

 Score = 42.4 bits (98), Expect = 0.002
 Identities = 29/91 (31%), Positives = 39/91 (41%), Gaps = 6/91 (6%)

Query: 79  ESHKKYVDFQLIVKGVEG-AKAVGINQAVIKNPYDEKRDLIVYEPVSEASFLRLHAGMLA 137
           E H++Y+D Q +  G E    A+      I     E+RD+I Y      SF+ +  G  A
Sbjct: 63  EVHRRYLDIQFLAWGEEKIGVAIDTGNNQISESLLEQRDIIFYHDSEHESFIEMIPGSYA 122

Query: 138 IFFENDAHALRFYGESFEKYREEPIFKAVVK 168
           +FF  D H          K    PI K VVK
Sbjct: 123 LFFPQDVHR-----PGCNKSIATPIRKIVVK 148
  Database: /home/scwang/download_20020708_db/nr
    Posted date:  Aug 7, 2002 12:55 PM
  Number of letters in database: 324,149,939
  Number of sequences in database:  1,026,957
  
Lambda     K      H
   0.320    0.138    0.385 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 105,117,762
Number of Sequences: 1026957
Number of extensions: 4077595
Number of successful extensions: 9850
Number of sequences better than 1.0e-02: 11
Number of HSP's better than  0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 9842
Number of HSP's gapped (non-prelim): 11
length of query: 178
length of database: 324,149,939
effective HSP length: 113
effective length of query: 65
effective length of database: 208,103,798
effective search space: 13526746870
effective search space used: 13526746870
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 93 (40.4 bits)
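
The statistics in the footer above are enough to re-derive the bit scores and E-values reported for each hit. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (bit score S' = (Lambda*S - ln K) / ln 2 and E = m*n * 2^-S') and uses the gapped Lambda and K printed above. The function names are illustrative, not BLAST's own. Because the footer prints rounded parameters, the results should only approximately match the report.

```python
import math

# Values copied from the report footer (gapped Karlin-Altschul parameters).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
QUERY_LENGTH = 178
EFFECTIVE_HSP_LENGTH = 113
EFFECTIVE_DB_LENGTH = 208_103_798


def effective_search_space(query_len, hsp_len, eff_db_len):
    """Effective query length times effective database length."""
    return (query_len - hsp_len) * eff_db_len


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# First hit in the report: raw score 908, reported as 354 bits, Expect = 3e-97.
space = effective_search_space(
    QUERY_LENGTH, EFFECTIVE_HSP_LENGTH, EFFECTIVE_DB_LENGTH
)
bits = bit_score(908, LAMBDA_GAPPED, K_GAPPED)
evalue = expect_value(bits, space)

print("effective search space:", space)
print("bit score:", round(bits, 1))
print("E-value: %.0e" % evalue)
```

The computed search space equals the "effective search space" line in the footer, and the bit score and E-value for raw score 908 land close to the 354 bits and 3e-97 reported for the self-hit. The same functions applied to raw score 142 give roughly the 59.3 bits shown for the Clostridium acetobutylicum hit.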