BLASTP 2.2.1 [Apr-13-2001]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= gi|15645306|ref|NP_207476.1| hypothetical protein
[Helicobacter pylori 26695]
(126 letters)
Database: /home/scwang/download_20020708_db/nr
1,026,957 sequences; 324,149,939 total letters
Searching..................................................done
                                                                Score     E
Sequences producing significant alignments:                     (bits)  Value
ref|NP_207476.1| (NC_000915) hypothetical protein [Helicoba...     265  9e-71
ref|NP_223341.1| (NC_000921) putative [Helicobacter pylori ...     228  9e-60
ref|NP_208080.1| (NC_000915) hypothetical protein [Helicoba...     167  1e-41
ref|NP_223926.1| (NC_000921) putative [Helicobacter pylori ...     140  3e-33
ref|NP_603031.1| (NC_003454) unknown [Fusobacterium nucleat...      50  3e-06
ref|NP_193981.1| (NM_118376) putative protein [Arabidopsis ...      47  3e-05
ref|NP_521271.1| (NC_003295) PUTATIVE TRANSMEMBRANE PROTEIN...      43  7e-04
ref|NP_662240.1| (NC_002932) OmpA family protein [Chlorobiu...      42  0.002
ref|NP_249316.1| (NC_002516) hypothetical protein [Pseudomo...      40  0.005
sp|P04261|K2C3_BOVIN Keratin, type II cytoskeletal 60 kDa, ...      40  0.005
ref|NP_245491.1| (NC_002663) Lpp [Pasteurella multocida] >g...      40  0.006
ref|NP_597834.1| (NC_001491) putative glycine-rich protein ...      40  0.006
>ref|NP_207476.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
pir||B64605 hypothetical protein HP0682 - Helicobacter pylori (strain 26695)
gb|AAD14888.1| (AE000581) H. pylori predicted coding region HP0682 [Helicobacter
pylori 26695]
Length = 126
Score = 265 bits (676), Expect = 9e-71
Identities = 126/126 (100%), Positives = 126/126 (100%)
Query: 1 MCQTCLESQFLNLNFMKGFVMSGLRTFSCVVVLCGAMANVAIASPKIEARGELGKFIGGG 60
MCQTCLESQFLNLNFMKGFVMSGLRTFSCVVVLCGAMANVAIASPKIEARGELGKFIGGG
Sbjct: 1 MCQTCLESQFLNLNFMKGFVMSGLRTFSCVVVLCGAMANVAIASPKIEARGELGKFIGGG 60
Query: 61 VGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRGVDREPQNKEPQAPREPIRDLYDYGY 120
VGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRGVDREPQNKEPQAPREPIRDLYDYGY
Sbjct: 61 VGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRGVDREPQNKEPQAPREPIRDLYDYGY 120
Query: 121 SFGHAW 126
SFGHAW
Sbjct: 121 SFGHAW 126
>ref|NP_223341.1| (NC_000921) putative [Helicobacter pylori J99]
pir||H71908 hypothetical protein jhp0623 - Helicobacter pylori (strain J99)
gb|AAD06203.1| (AE001494) putative [Helicobacter pylori J99]
Length = 116
Score = 228 bits (581), Expect = 9e-60
Identities = 105/116 (90%), Positives = 110/116 (94%)
Query: 11 LNLNFMKGFVMSGLRTFSCVVVLCGAMANVAIASPKIEARGELGKFIGGGVGGFVGDKMG 70
+NL+FMKGFVMSGLRTFSCVVVLCGAM NVA+A PKIEARGELGKF+GG VG FVGDKMG
Sbjct: 1 MNLHFMKGFVMSGLRTFSCVVVLCGAMVNVAVAGPKIEARGELGKFVGGAVGNFVGDKMG 60
Query: 71 GFVGGAIGGYIGSEIGDRVEDYIRGVDREPQNKEPQAPREPIRDLYDYGYSFGHAW 126
GFVGGAIGGYIGSE+GDRVEDYIRGVDREPQNKEPQ PREPIRD YDYGYSFGHAW
Sbjct: 61 GFVGGAIGGYIGSEVGDRVEDYIRGVDREPQNKEPQTPREPIRDFYDYGYSFGHAW 116
>ref|NP_208080.1| (NC_000915) hypothetical protein [Helicobacter pylori 26695]
pir||H64680 hypothetical protein HP1288 - Helicobacter pylori (strain 26695)
gb|AAD08364.1| (AE000633) H. pylori predicted coding region HP1288 [Helicobacter
pylori 26695]
Length = 132
Score = 167 bits (424), Expect = 1e-41
Identities = 88/130 (67%), Positives = 94/130 (71%), Gaps = 21/130 (16%)
Query: 1 MCQTCLESQFLNLNFMKGFVMSGLRTFSCVVVLCGAMANVAIASPKIEARGELGKFIGGG 60
MCQTCLE+QFLNLNFMKGFVMSGL+ FSCVVVLCGAMAN AIA PKIEARGE G+F GG
Sbjct: 1 MCQTCLETQFLNLNFMKGFVMSGLKAFSCVVVLCGAMANTAIAGPKIEARGEFGRFWGGA 60
Query: 61 VGGFVGDKMGGFVGGAIGG----------------YIGSEIGDRVEDYIRGVDREPQNKE 104
VGG +G +GG VGGA+GG G EIGDRVEDYIRGVDR E
Sbjct: 61 VGGAIGGGVGGAVGGAVGGPAGGWAGRLVGGSVGREFGREIGDRVEDYIRGVDR-----E 115
Query: 105 PQAPREPIRD 114
PQAPREP D
Sbjct: 116 PQAPREPTYD 125
>ref|NP_223926.1| (NC_000921) putative [Helicobacter pylori J99]
pir||H71831 hypothetical protein jhp1208 - Helicobacter pylori (strain J99)
gb|AAD06810.1| (AE001547) putative [Helicobacter pylori J99]
Length = 117
Score = 140 bits (353), Expect = 3e-33
Identities = 76/115 (66%), Positives = 81/115 (70%), Gaps = 21/115 (18%)
Query: 16 MKGFVMSGLRTFSCVVVLCGAMANVAIASPKIEARGELGKFIGGGVGGFVGDKMGGFVGG 75
MKGFVMSGLRTFSCVVVLCGAMANVAIA PKIEARGE G+F GG VGG +G +GG +GG
Sbjct: 1 MKGFVMSGLRTFSCVVVLCGAMANVAIAGPKIEARGEFGRFWGGAVGGAIGGGVGGAMGG 60
Query: 76 AIGG----------------YIGSEIGDRVEDYIRGVDREPQNKEPQAPREPIRD 114
A+GG G EIGDRVEDYIRGVDR EPQAPREP D
Sbjct: 61 AVGGPAGGWAGRLVGGSVGREFGREIGDRVEDYIRGVDR-----EPQAPREPTYD 110
>ref|NP_603031.1| (NC_003454) unknown [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
gb|AAL94330.1| (AE010526) unknown [Fusobacterium nucleatum subsp. nucleatum ATCC
25586]
Length = 130
Score = 50.4 bits (119), Expect = 3e-06
Identities = 25/53 (47%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 52 ELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVED-YIRGVDREPQNK 103
ELG +GG GG +G K GG GG GGYIG +IG R D + R +R +K
Sbjct: 59 ELGGIVGGAAGGTIGGKFGGSKGGLAGGYIGGKIGSRAGDSWERRTNRSAHDK 111
>ref|NP_193981.1| (NM_118376) putative protein [Arabidopsis thaliana]
pir||T05444 hypothetical protein F7K2.80 - Arabidopsis thaliana
emb|CAA22155.1| (AL033545) putative protein [Arabidopsis thaliana]
emb|CAB79205.1| (AL161557) putative protein [Arabidopsis thaliana]
Length = 490
Score = 47.4 bits (111), Expect = 3e-05
Identities = 22/55 (40%), Positives = 33/55 (60%)
Query: 32 VLCGAMANVAIASPKIEARGELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIG 86
V G + + + E+ GE+G +GGG+GG G ++GG +GG IGG G E+G
Sbjct: 174 VRVGGIGGLLGGGDRGESGGEVGGVLGGGIGGARGGELGGVLGGDIGGARGGEVG 228
Score = 43.1 bits (100), Expect = 5e-04
Identities = 21/46 (45%), Positives = 26/46 (55%)
Query: 51 GELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRGV 96
GE+G IGG GG G ++GG +GG IGG G E G + GV
Sbjct: 101 GEIGGVIGGDCGGARGGEVGGVMGGDIGGLFGGEFGGTIGGDTGGV 146
Score = 41.6 bits (96), Expect = 0.002
Identities = 18/46 (39%), Positives = 27/46 (58%)
Query: 51 GELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRGV 96
G +G+ +GGG GG G ++GG +GG GG +G + G + GV
Sbjct: 273 GAVGEILGGGTGGARGGEVGGVLGGEFGGVLGGDSGGARGGEVGGV 318
Score = 41.2 bits (95), Expect = 0.002
Identities = 18/46 (39%), Positives = 25/46 (54%)
Query: 51 GELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRGV 96
GE+G +GG +GG G + GG +GG GG G +G + GV
Sbjct: 117 GEVGGVMGGDIGGLFGGEFGGTIGGDTGGVCGGAVGGALGGVTGGV 162
Score = 40.8 bits (94), Expect = 0.003
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 51 GELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIG 86
GELG +GG +GG G ++GG +GG GG G E+G
Sbjct: 209 GELGGVLGGDIGGARGGEVGGVLGGDRGGARGGEVG 244
Score = 40.4 bits (93), Expect = 0.004
Identities = 20/46 (43%), Positives = 26/46 (56%)
Query: 50 RGELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRG 95
RGE G +GG +GG +G GG +GG +GG IG G V + G
Sbjct: 188 RGESGGEVGGVLGGGIGGARGGELGGVLGGDIGGARGGEVGGVLGG 233
Score = 39.7 bits (91), Expect = 0.006
Identities = 18/45 (40%), Positives = 24/45 (53%)
Query: 51 GELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRG 95
G++G GG GG +G GG GGA+GG +G G E + G
Sbjct: 125 GDIGGLFGGEFGGTIGGDTGGVCGGAVGGALGGVTGGVREGEVGG 169
Score = 39.3 bits (90), Expect = 0.008
Identities = 17/35 (48%), Positives = 23/35 (65%)
Query: 51 GELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEI 85
GE+G +GGG GG G ++GG +GG GG G E+
Sbjct: 313 GEVGGVLGGGTGGARGGEVGGVLGGDSGGARGGEV 347
>ref|NP_521271.1| (NC_003295) PUTATIVE TRANSMEMBRANE PROTEIN [Ralstonia solanacearum]
emb|CAD16859.1| (AL646073) PUTATIVE TRANSMEMBRANE PROTEIN [Ralstonia solanacearum]
Length = 143
Score = 42.7 bits (99), Expect = 7e-04
Identities = 24/75 (32%), Positives = 35/75 (46%)
Query: 32 VLCGAMANVAIASPKIEARGELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVED 91
V+ GA+ A + +R G IGG +GG VG G +GG GG +G+ G
Sbjct: 53 VIGGAIGGGAGGALTSNSRDRKGAIIGGAIGGGVGTAAGNAMGGRTGGIVGAAAGGGAGA 112
Query: 92 YIRGVDREPQNKEPQ 106
+ G + N EP+
Sbjct: 113 ALGGHIQRSSNAEPE 127
>ref|NP_662240.1| (NC_002932) OmpA family protein [Chlorobium tepidum TLS]
gb|AAM72582.1| (AE012894) OmpA family protein [Chlorobium tepidum TLS]
Length = 231
Score = 41.6 bits (96), Expect = 0.002
Identities = 25/74 (33%), Positives = 37/74 (49%), Gaps = 5/74 (6%)
Query: 54 GKFIGGGVGGFVGDKMGGFV-----GGAIGGYIGSEIGDRVEDYIRGVDREPQNKEPQAP 108
G GG +GG +G G +V G AIGG G+ IGD ++ + +E Q + +
Sbjct: 41 GAAAGGLIGGIIGSNNGSWVQGALIGAAIGGAAGAVIGDYMDKQADEIRQEVQGAKVERV 100
Query: 109 REPIRDLYDYGYSF 122
E IR ++D G F
Sbjct: 101 GEGIRVVFDTGLLF 114
>ref|NP_249316.1| (NC_002516) hypothetical protein [Pseudomonas aeruginosa]
pir||T44549 hypothetical protein [imported] - Pseudomonas aeruginosa
pir||G83567 hypothetical protein PA0625 [imported] - Pseudomonas aeruginosa
(strain PAO1)
dbj|BAA83164.1| (AB030825) PRF20~putative tail length determinator protein~similar
to yqbO gene of B. subtilis [Pseudomonas aeruginosa]
gb|AAG04014.1|AE004498_9 (AE004498) hypothetical protein [Pseudomonas aeruginosa]
Length = 745
Score = 40.0 bits (92), Expect = 0.005
Identities = 29/81 (35%), Positives = 37/81 (44%), Gaps = 17/81 (20%)
Query: 44 SPKIEARGELGKFIGGGVGG------------FVGDKMGGFVGGAIGGYIGSEIGDRVED 91
S K A GE G + G + G VG +GG VGGAIG + GSE+G R+
Sbjct: 606 SEKTVAYGEAGGSLAGSLAGAALGASIGSVVPVVGTLIGGLVGGAIGAWGGSELGGRLGR 665
Query: 92 YIRG-----VDREPQNKEPQA 107
+ G D +P PQA
Sbjct: 666 SLAGDPPAASDNKPAVAVPQA 686
>sp|P04261|K2C3_BOVIN Keratin, type II cytoskeletal 60 kDa, component III
pir||A02947 keratin, 60K type II cytoskeletal, component III - bovine
(fragment)
gb|AAA30602.1| (K03535) cytokeratin type II, component III [Bos taurus]
Length = 182
Score = 40.0 bits (92), Expect = 0.005
Identities = 17/48 (35%), Positives = 30/48 (62%)
Query: 39 NVAIASPKIEARGELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIG 86
N+++ + + + G GGG+GG +G +GG +GG +GG +GS +G
Sbjct: 72 NISVVTNTVSSGYGGGSGFGGGLGGGLGGGLGGGLGGGLGGGLGSGLG 119
>ref|NP_245491.1| (NC_002663) Lpp [Pasteurella multocida]
gb|AAK02638.1| (AE006091) Lpp [Pasteurella multocida]
Length = 154
Score = 39.7 bits (91), Expect = 0.006
Identities = 24/90 (26%), Positives = 41/90 (44%), Gaps = 4/90 (4%)
Query: 12 NLNFMKGFVMSGLRTFSCVVVLCGAMANVAIASPKIEARGELGKFIGGGVGGFVGDKMGG 71
N + G V G + + G + + + +++G LG F GG +GG VG +GG
Sbjct: 21 NTDIYSGNVYEGNQAKEVRSISYGTIVSSRPVKIQADSQGVLGGFGGGALGGIVGSGIGG 80
Query: 72 ----FVGGAIGGYIGSEIGDRVEDYIRGVD 97
+ +G G+ IG + E+ + VD
Sbjct: 81 GTGQMIATTVGAIAGAVIGAKAEEKLNQVD 110
>ref|NP_597834.1| (NC_001491) putative glycine-rich protein [Equine herpesvirus 1]
Length = 263
Score = 39.7 bits (91), Expect = 0.006
Identities = 20/62 (32%), Positives = 32/62 (51%)
Query: 51 GELGKFIGGGVGGFVGDKMGGFVGGAIGGYIGSEIGDRVEDYIRGVDREPQNKEPQAPRE 110
G +G +GG +GG +G MGG +GG +GG +G +G + + G+ P P
Sbjct: 95 GLMGGLMGGLMGGLMGGLMGGLMGGLMGGLMGGLMGGLMGGLMGGLMGGLMVPAPPLPAP 154
Query: 111 PI 112
P+
Sbjct: 155 PL 156
Database: /home/scwang/download_20020708_db/nr
Posted date: Aug 7, 2002 12:55 PM
Number of letters in database: 324,149,939
Number of sequences in database: 1,026,957
Lambda       K        H
 0.323   0.145    0.455
Gapped
Lambda       K        H
 0.267  0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 87,310,551
Number of Sequences: 1026957
Number of extensions: 3718221
Number of successful extensions: 27056
Number of sequences better than 1.0e-02: 12
Number of HSP's better than 0.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 26846
Number of HSP's gapped (non-prelim): 104
length of query: 126
length of database: 324,149,939
effective HSP length: 102
effective length of query: 24
effective length of database: 219,400,325
effective search space: 5265607800
effective search space used: 5265607800
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 90 (39.3 bits)