BLASTP 2.2.1 [Apr-13-2001]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|15645481|ref|NP_207656.1| hypothetical protein
[Helicobacter pylori 26695]
         (223 letters)

Database: /home/scwang/download_20020708_db/nr
           1,026,957 sequences; 324,149,939 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_207656.1|  (NC_000915) hypothetical protein [Helicoba...   448  e-125
ref|NP_223514.1|  (NC_000921) putative [Helicobacter pylori ...   434  e-121
ref|NP_281584.1|  (NC_002163) hypothetical protein Cj0394c [...   154  5e-37
ref|NP_228691.1|  (NC_000853) conserved hypothetical protein...    59  4e-08
ref|NP_252969.1|  (NC_002516) hypothetical protein [Pseudomo...    57  2e-07
ref|NP_603658.1|  (NC_003454) Bvg accessory factor [Fusobact...    50  2e-05
ref|NP_487936.1|  (NC_003272) hypothetical protein [Nostoc s...    49  4e-05
ref|NP_441440.1|  (NC_000911) unknown protein [Synechocystis...    49  5e-05
ref|NP_214321.1|  (NC_000918) putative protein [Aquifex aeol...    48  9e-05
ref|NP_299083.1|  (NC_002488) conserved hypothetical protein...    43  0.003
dbj|BAA21476.1|  (AB005550) simillar to Bacillus subtilis, h...    42  0.007
>ref|NP_207656.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
 pir||F64627 hypothetical protein HP0862 - Helicobacter pylori (strain 26695)
 gb|AAD07916.1| (AE000596) H. pylori predicted coding region HP0862 [Helicobacter
           pylori 26695]
          Length = 223

 Score =  448 bits (1153), Expect = e-125
 Identities = 223/223 (100%), Positives = 223/223 (100%)

Query: 1   MPARQSFTDLKNLVLCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKA 60
           MPARQSFTDLKNLVLCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKA
Sbjct: 1   MPARQSFTDLKNLVLCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKA 60

Query: 61  LLNCYPNAKNIAGFFHLETDYVGLGIDRQMACLAVNNGVVVDAGSAITIDLIKEGKHLGG 120
           LLNCYPNAKNIAGFFHLETDYVGLGIDRQMACLAVNNGVVVDAGSAITIDLIKEGKHLGG
Sbjct: 61  LLNCYPNAKNIAGFFHLETDYVGLGIDRQMACLAVNNGVVVDAGSAITIDLIKEGKHLGG 120

Query: 121 CILPGLAQYIHAYKKSAKILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIACIQHLAKNQ 180
           CILPGLAQYIHAYKKSAKILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIACIQHLAKNQ
Sbjct: 121 CILPGLAQYIHAYKKSAKILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIACIQHLAKNQ 180

Query: 181 KIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIALKKAGILECK 223
           KIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIALKKAGILECK
Sbjct: 181 KIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIALKKAGILECK 223
>ref|NP_223514.1| (NC_000921) putative [Helicobacter pylori J99]
 pir||G71887 hypothetical protein jhp0796 - Helicobacter pylori (strain J99)
 gb|AAD06372.1| (AE001509) putative [Helicobacter pylori J99]
          Length = 223

 Score =  434 bits (1115), Expect = e-121
 Identities = 210/223 (94%), Positives = 221/223 (98%)

Query: 1   MPARQSFTDLKNLVLCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKA 60
           MPARQSF DLK+L+LCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKA
Sbjct: 1   MPARQSFKDLKDLILCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKA 60

Query: 61  LLNCYPNAKNIAGFFHLETDYVGLGIDRQMACLAVNNGVVVDAGSAITIDLIKEGKHLGG 120
           LLNCYPNAKNIAGFFHLETDY+GLGIDRQMACLAV NGV+VDAGSAITIDL+KEGKHLGG
Sbjct: 61  LLNCYPNAKNIAGFFHLETDYIGLGIDRQMACLAVVNGVIVDAGSAITIDLVKEGKHLGG 120

Query: 121 CILPGLAQYIHAYKKSAKILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIACIQHLAKNQ 180
           CILPGLAQY+HAYKKSAKILEQPFKALDSLEVLPK+TRDAVNYGM+LS+I+CIQHLAK+Q
Sbjct: 121 CILPGLAQYVHAYKKSAKILEQPFKALDSLEVLPKNTRDAVNYGMILSIISCIQHLAKDQ 180

Query: 181 KIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIALKKAGILECK 223
           KIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIALKKAGILECK
Sbjct: 181 KIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIALKKAGILECK 223
>ref|NP_281584.1| (NC_002163) hypothetical protein Cj0394c [Campylobacter jejuni]
 pir||H81382 hypothetical protein Cj0394c [imported] - Campylobacter jejuni
           (strain NCTC 11168)
 emb|CAB74230.1| (AL139075) hypothetical protein Cj0394c [Campylobacter jejuni]
          Length = 209

 Score =  154 bits (390), Expect = 5e-37
 Identities = 80/204 (39%), Positives = 124/204 (60%), Gaps = 1/204 (0%)

Query: 13  LVLCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKALLNCYPNAKNIA 72
           ++LCDIGN+  +F  + + F+   +       +++IFYI+VNE  ++ L N   N  N+ 
Sbjct: 1   MLLCDIGNSNANFLDDNKYFTLNIDQFLEFKNEQKIFYINVNEHLKEHLKN-QKNFINLE 59

Query: 73  GFFHLETDYVGLGIDRQMACLAVNNGVVVDAGSAITIDLIKEGKHLGGCILPGLAQYIHA 132
            +F  +T Y GLGIDR  AC  + +GVVVDAGSAITID+I    HLGG ILPG+A Y   
Sbjct: 60  PYFLFDTIYQGLGIDRIAACYTIEDGVVVDAGSAITIDIISNSIHLGGFILPGIANYKKI 119

Query: 133 YKKSAKILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIACIQHLAKNQKIYLCGGDAKYL 192
           Y   +  L+  F    SL+  P+ T DA++YG+   +   I+  A+N+K+Y  GGD ++L
Sbjct: 120 YSHISPRLKSEFNTQVSLDAFPQKTMDALSYGVFKGIYLLIKDAAQNKKLYFTGGDGQFL 179

Query: 193 SAFLPHSVCKERLVFDGMEIALKK 216
           + +  H++  + L+F GM+  +K+
Sbjct: 180 ANYFDHAIYDKLLIFRGMKKIIKE 203
>ref|NP_228691.1| (NC_000853) conserved hypothetical protein [Thermotoga maritima]
 pir||D72320 conserved hypothetical protein - Thermotoga maritima (strain MSB8)
 gb|AAD35964.1|AE001754_1 (AE001754) conserved hypothetical protein [Thermotoga maritima]
          Length = 246

 Score = 58.9 bits (141), Expect = 4e-08
 Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 10/122 (8%)

Query: 97  NGVVVDAGSAITIDLIKEGKHLGGCILPGLAQYIHA-YKKSAKILEQPFKALDSLEVLPK 155
           NG+++D G+A T+DL+  G + GG ILPG    +H+ ++ +AK+     K  D   V+ K
Sbjct: 120 NGIIIDMGTATTVDLVVNGSYEGGAILPGFFMMVHSLFRGTAKLPLVEVKPADF--VVGK 177

Query: 156 STRDAVNYGMV-------LSVIACIQHLAKNQKIYLCGGDAKYLSAFLPHSVCKERLVFD 208
            T + +  G+V         +I  I+ +  +  + L GG +K +   + H +  E L   
Sbjct: 178 DTEENIRLGVVNGSVYALEGIIGRIKEVYGDLPVVLTGGQSKIVKDMIKHEIFDEDLTIK 237

Query: 209 GM 210
           G+
Sbjct: 238 GV 239
>ref|NP_252969.1| (NC_002516) hypothetical protein [Pseudomonas aeruginosa]
 pir||H83111 hypothetical protein PA4279 [imported] - Pseudomonas aeruginosa
           (strain PAO1)
 gb|AAG07667.1|AE004843_9 (AE004843) hypothetical protein [Pseudomonas aeruginosa]
          Length = 248

 Score = 57.0 bits (136), Expect = 2e-07
 Identities = 50/166 (30%), Positives = 86/166 (51%), Gaps = 21/166 (12%)

Query: 67  NAKNIAGFFHLETDYVGLGIDRQMACLAVNN-----GVVVDAGSAITIDLI-KEGKHLGG 120
           + K +AG  +   DY  LG+DR +A +A ++      +V+D G+A+T DL+  +G HLGG
Sbjct: 81  SGKQLAGVRNGYLDYQRLGLDRWLALVAAHHLAKKACLVIDLGTAVTSDLVAADGVHLGG 140

Query: 121 CILPGLAQYIHAYKKSAKILE----QPFKALDSLEVLPKSTRDAVNYGMVLSV------- 169
            I PG+       +   + +     +  +AL SL+   ++T +AV  G +L +       
Sbjct: 141 YICPGMTLMRSQLRTHTRRIRYDDAEARRALASLQP-GQATAEAVERGCLLMLRGFVREQ 199

Query: 170 --IACIQHLAKNQKIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIA 213
             +AC + L  + +I+L GGDA+ +   L  +     LVF G+ +A
Sbjct: 200 YAMAC-ELLGPDCEIFLTGGDAELVRDELAGARIMPDLVFVGLALA 244
>ref|NP_603658.1| (NC_003454) Bvg accessory factor [Fusobacterium nucleatum subsp.
           nucleatum ATCC 25586]
 gb|AAL94957.1| (AE010586) Bvg accessory factor [Fusobacterium nucleatum subsp.
           nucleatum ATCC 25586]
          Length = 256

 Score = 50.1 bits (118), Expect = 2e-05
 Identities = 42/130 (32%), Positives = 62/130 (47%), Gaps = 17/130 (13%)

Query: 80  DYVGLGIDR------QMACLAVNNGVVVDAGSAITIDLIKEGKHLGGCILPGLAQYIHA- 132
           +Y G G DR       M      N V+ D G+A T D++K+G ++GG ILPG+   I+A 
Sbjct: 104 NYTGFGADRIIDITEAMQKYPDKNLVIFDFGTATTYDVLKKGVYIGGGILPGIDMSINAL 163

Query: 133 YKKSAKILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIACIQHLAK------NQKIYL-- 184
           Y  +AK+    F    S  VL   T   +   +       I+H+ K      N++I++  
Sbjct: 164 YGNTAKLPRVKFTTPSS--VLGTDTMKQIQAAIFFGYAGQIKHIIKKINEELNEEIFVLA 221

Query: 185 CGGDAKYLSA 194
            GG  K LSA
Sbjct: 222 TGGLGKILSA 231
>ref|NP_487936.1| (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
 dbj|BAB75595.1| (AP003594) ORF_ID:alr3896~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 276

 Score = 48.9 bits (115), Expect = 4e-05
 Identities = 45/158 (28%), Positives = 77/158 (48%), Gaps = 23/158 (14%)

Query: 77  LETDYVGLGIDRQMACLAVNNG-----VVVDAGSAITIDLIKEGKHL-GGCILPGLA-QY 129
           L   Y  LGIDR +A            +V+DAG+A+T      GK+L GG ILPG+  Q+
Sbjct: 112 LNNIYPTLGIDRALALWGAGMSWGFPVLVIDAGTALTFTAADGGKNLVGGAILPGVGLQF 171

Query: 130 IHAYKKSAKILEQPFKALDSLEV-LPKSTRDAVNYGMVLSVIACIQ-------HLAKNQK 181
               +++ ++ +   +A+ SL      +T +A+  G++ ++IA ++        L  + K
Sbjct: 172 ASLGQQTGQLPQVEMEAIKSLPPRFALNTTEAIQSGVIYTLIAGMRDFTEEWLSLFPDGK 231

Query: 182 IYLCGGD----AKYLSAFLP----HSVCKERLVFDGME 211
           + + GGD      YL A  P      + +  L+F GM+
Sbjct: 232 VAIKGGDRILLLNYLQALYPDLAARLIVEPNLIFWGMQ 269
>ref|NP_441440.1| (NC_000911) unknown protein [Synechocystis sp. PCC 6803]
 pir||S75559 hypothetical protein slr0812 - Synechocystis sp. (strain PCC 6803)
 dbj|BAA18120.1| (D90911) ORF_ID:slr0812~unknown protein [Synechocystis sp. PCC
           6803]
          Length = 257

 Score = 48.5 bits (114), Expect = 5e-05
 Identities = 50/153 (32%), Positives = 69/153 (44%), Gaps = 25/153 (16%)

Query: 81  YVGLGIDRQMACLAVNNG-----VVVDAGSAITIDLIKEGKHL-GGCILPGLAQYIHAYK 134
           Y   GIDR +A L          +VVD G+A+TI    + K L GG ILPGL   +    
Sbjct: 96  YPSFGIDRALAGLGTGLTYGFPCLVVDGGTALTITGFDQDKKLVGGAILPGLGLQLATL- 154

Query: 135 KSAKILEQPFKALDSLEVLPK----STRDAVNYGMVLSVIACIQ-HLAKNQKIY------ 183
              ++   P   +D L  LP      T  A+  G+V  V+  +Q +L   QK++      
Sbjct: 155 -GDRLAALPKLEMDQLTELPDRWALDTPSAIFSGVVYGVLGALQSYLQDWQKLFPGAAMV 213

Query: 184 LCGGDAKYLSAFL-PHS-----VCKERLVFDGM 210
           + GGD K L  FL  HS        + L+F GM
Sbjct: 214 ITGGDGKILHGFLKEHSPNLSVAWDDNLIFLGM 246
>ref|NP_214321.1| (NC_000918) putative protein [Aquifex aeolicus]
 pir||E70465 hypothetical protein aq_1924 - Aquifex aeolicus
 gb|AAC07720.1| (AE000763) putative protein [Aquifex aeolicus]
          Length = 229

 Score = 47.8 bits (112), Expect = 9e-05
 Identities = 42/138 (30%), Positives = 67/138 (48%), Gaps = 19/138 (13%)

Query: 75  FHLETDYVG---LGIDRQMACLAVN-----NGVVVDAGSAITIDLIKEGKHLGGCILPGL 126
           F ++ DY     LG DR     +       N VV+ AG+A+ IDL+ EGK  GG I  GL
Sbjct: 72  FPIQVDYKTPETLGTDRVALAYSAKKFYGKNVVVISAGTALVIDLVLEGKFKGGFITLGL 131

Query: 127 AQYIHAYKKSAKILEQPFKALDSLEV-LPKSTRDAVNYG-------MVLSVIACIQHLAK 178
            + +      A+ + + F   + +E+ L +STR+ V  G        + S +   + + K
Sbjct: 132 GKKLKILSDLAEGIPEFFP--EEVEIFLGRSTRECVLGGAYRESTEFIKSTLKLWRKVFK 189

Query: 179 NQ-KIYLCGGDAKYLSAF 195
            + K+ + GG+ KY S F
Sbjct: 190 RKFKVVITGGEGKYFSKF 207
>ref|NP_299083.1| (NC_002488) conserved hypothetical protein [Xylella fastidiosa
           9a5c]
 pir||A82637 conserved hypothetical protein XF1795 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF84603.1|AE004001_8 (AE004001) conserved hypothetical protein [Xylella fastidiosa 9a5c]
          Length = 242

 Score = 42.7 bits (99), Expect = 0.003
 Identities = 45/143 (31%), Positives = 67/143 (46%), Gaps = 21/143 (14%)

Query: 85  GIDRQMACLAV---NNGVVVDAGSAITIDLIK-EGKHLGGCILPG---LAQYIHAYKKSA 137
           G+DR +A L      N +VV  G+A+TIDL+   G HLGG I      + Q +HA  +  
Sbjct: 98  GVDRFLALLGSYGEGNVLVVGVGTALTIDLLAANGCHLGGRISASPTLMRQALHARAE-- 155

Query: 138 KILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIACI--------QHLAKNQKIYLCGGDA 189
              + P    + LE   + T DA+  G   + +A I        Q L ++ ++ L GG  
Sbjct: 156 ---QLPLSGGNYLE-FAEDTEDALVSGCNGAAVALIERSLYEAHQRLDQSVRLLLHGGGV 211

Query: 190 KYLSAFLPHSVCKERLVFDGMEI 212
             L  +L   V +  LV DG+ I
Sbjct: 212 ASLLPWLGDVVHRPTLVLDGLAI 234
>dbj|BAA21476.1| (AB005550) simillar to Bacillus subtilis, hypothetical 26.2 KD
           protein in FTSH-CYSK intergenic region: Acc# P37564
           [Desulfovibrio vulgaris]
          Length = 212

 Score = 41.6 bits (96), Expect = 0.007
 Identities = 28/96 (29%), Positives = 47/96 (48%), Gaps = 8/96 (8%)

Query: 84  LGIDRQMACLAVN-------NGVVVDAGSAITIDLIKEGKHLGGCILPGLAQYIHAY-KK 135
           +G DR +A  A         + V VD G+A T D ++ G +LGG I PG+     A   +
Sbjct: 109 VGADRLVAAYAARRLYPGPRSLVSVDFGTATTFDCVEGGAYLGGLICPGVLSSAGALSSR 168

Query: 136 SAKILEQPFKALDSLEVLPKSTRDAVNYGMVLSVIA 171
           +AK+     +  +   V+ +ST  ++N+G +    A
Sbjct: 169 TAKLPRISLEVEEDSPVIGRSTTTSLNHGFIFGFAA 204
  Database: /home/scwang/download_20020708_db/nr
    Posted date:  Aug 7, 2002 12:55 PM
  Number of letters in database: 324,149,939
  Number of sequences in database:  1,026,957
  
Lambda     K      H
   0.322    0.139    0.408 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 138,241,154
Number of Sequences: 1026957
Number of extensions: 5469029
Number of successful extensions: 12094
Number of sequences better than 1.0e-02: 11
Number of HSP's better than  0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 12081
Number of HSP's gapped (non-prelim): 11
length of query: 223
length of database: 324,149,939
effective HSP length: 117
effective length of query: 106
effective length of database: 203,995,970
effective search space: 21623572820
effective search space used: 21623572820
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 95 (41.2 bits)