BLASTP 2.2.1 [Apr-13-2001]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|15645542|ref|NP_207718.1| conserved hypothetical
protein [Helicobacter pylori 26695]
         (381 letters)

Database: /home/scwang/download_20020708_db/nr
           1,026,957 sequences; 324,149,939 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_207718.1|  (NC_000915) conserved hypothetical protein...   763  0.0
ref|NP_223578.1|  (NC_000921) putative [Helicobacter pylori ...   724  0.0
ref|NP_282597.1|  (NC_002163) ypothetical protein Cj1457c [C...   305  4e-82
ref|NP_295714.1|  (NC_001263) conserved hypothetical protein...   159  7e-38
ref|NP_406822.1|  (NC_003143) conserved hypothetical protein...   123  4e-27
ref|NP_246549.1|  (NC_002663) unknown [Pasteurella multocida...   121  2e-26
ref|NP_438860.1|  (NC_000907) conserved hypothetical protein...   119  6e-26
ref|NP_289294.1|  (NC_002655) putative hydrogenase subunit [...   115  6e-25
ref|NP_230181.1|  (NC_002505) conserved hypothetical protein...   115  6e-25
ref|NP_457317.1|  (NC_003198) conserved hypothetical protein...   114  2e-24
ref|NP_417225.1|  (NC_000913) putative hydrogenase subunit [...   114  2e-24
ref|NP_461849.1|  (NC_003197) paral putative hydrogenase sub...   112  7e-24
ref|NP_637074.1|  (NC_003902) hydrogenase subunit [Xanthomon...   107  3e-22
ref|NP_642054.1|  (NC_003919) hydrogenase subunit [Xanthomon...   100  2e-20
ref|NP_252316.1|  (NC_002516) conserved hypothetical protein...   100  3e-20
ref|NP_248367.1|  (NC_000909) conserved hypothetical protein...    94  2e-18
ref|NP_070505.1|  (NC_000917) conserved hypothetical protein...    90  4e-17
ref|NP_214182.1|  (NC_000918) hypothetical protein [Aquifex ...    86  7e-16
ref|NP_276642.1|  (NC_000916) conserved protein [Methanother...    86  9e-16
ref|NP_618162.1|  (NC_003552) conserved hypothetical protein...    78  1e-13
ref|NP_112582.1|  (NM_031292) hypothetical protein DKFZp434G...    73  5e-12
ref|NP_648231.1|  (NM_139974) CG6745 gene product [Drosophil...    73  5e-12
ref|XP_128203.1|  (XM_128203) RIKEN cDNA 3000003F02 [Mus mus...    72  1e-11
ref|NP_579284.1|  (NC_003413) hypothetical protein [Pyrococc...    71  2e-11
ref|NP_632138.1|  (NC_003901) conserved protein [Methanosarc...    71  2e-11
dbj|BAB67790.1|  (AB067484) KIAA1897 protein [Homo sapiens]        70  4e-11
ref|NP_143398.1|  (NC_000961) hypothetical protein [Pyrococc...    70  5e-11
dbj|BAB89856.1|  (AP003295) hypothetical protein~similar to ...    67  3e-10
ref|NP_126313.1|  (NC_000868) hypothetical protein [Pyrococc...    67  3e-10
ref|NP_014886.1|  (NC_001147) Hypothetical ORF; Yor243cp [Sa...    67  3e-10
emb|CAA22009.1|  (AL033502) hypothetical protein [Candida al...    64  2e-09
ref|NP_247567.1|  (NC_000909) conserved hypothetical protein...    63  5e-09
ref|NP_595812.1|  (NC_003423) hypothetical protein [Schizosa...    63  6e-09
ref|NP_613961.1|  (NC_003551) Uncharacterized conserved prot...    60  5e-08
ref|NP_393692.1|  (NC_002578) conserved hypothetical protein...    58  2e-07
ref|NP_505653.1|  (NM_073252) Uncharacterized protein family...    57  5e-07
ref|NP_187133.1|  (NM_111354) unknown protein [Arabidopsis t...    55  1e-06
gb|EAA11729.1|  (AAAB01008960) ebiP4385 [Anopheles gambiae s...    55  2e-06
ref|NP_111899.1|  (NC_002689) Uncharacterized conserved prot...    54  4e-06
gb|AAG18841.1|  (AE004988) Vng0243c [Halobacterium sp. NRC-1]      50  4e-05
ref|NP_444184.1|  (NC_002607) Uncharacterized conserved prot...    50  4e-05
ref|NP_560515.1|  (NC_003364) conserved hypothetical protein...    48  2e-04
ref|XP_110124.1|  (XM_110124) RIKEN cDNA 3000003F02 [Mus mus...    44  0.003
ref|NP_376057.1|  (NC_003106) 363aa long conserved hypotheti...    43  0.007
ref|NP_341730.1|  (NC_002754) Conserved hypothetical protein...    43  0.007
>ref|NP_207718.1| (NC_000915) conserved hypothetical protein [Helicobacter pylori
           26695]
 sp|P55985|Y926_HELPY Hypothetical protein HP0926
 pir||F64635 conserved hypothetical protein HP0926 - Helicobacter pylori
           (strain 26695)
 gb|AAD07971.1| (AE000602) conserved hypothetical protein [Helicobacter pylori
           26695]
          Length = 381

 Score =  763 bits (1971), Expect = 0.0
 Identities = 381/381 (100%), Positives = 381/381 (100%)

Query: 1   MNLNFMPLLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEML 60
           MNLNFMPLLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEML
Sbjct: 1   MNLNFMPLLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEML 60

Query: 61  QIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYH 120
           QIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYH
Sbjct: 61  QIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYH 120

Query: 121 HNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEG 180
           HNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEG
Sbjct: 121 HNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEG 180

Query: 181 LKILQNQTKFAHQKLNAFLISSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLS 240
           LKILQNQTKFAHQKLNAFLISSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLS
Sbjct: 181 LKILQNQTKFAHQKLNAFLISSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLS 240

Query: 241 VDSDTLKTLKNQAHPFKILEGDVMCHYPYGKFFDALELEKEGERFLKKEVAPTGLLDGKK 300
           VDSDTLKTLKNQAHPFKILEGDVMCHYPYGKFFDALELEKEGERFLKKEVAPTGLLDGKK
Sbjct: 241 VDSDTLKTLKNQAHPFKILEGDVMCHYPYGKFFDALELEKEGERFLKKEVAPTGLLDGKK 300

Query: 301 ALYAKNLSLEIEKEFQHNLLSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKG 360
           ALYAKNLSLEIEKEFQHNLLSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKG
Sbjct: 301 ALYAKNLSLEIEKEFQHNLLSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKG 360

Query: 361 SYASALLKEIKHEKGENNDEF 381
           SYASALLKEIKHEKGENNDEF
Sbjct: 361 SYASALLKEIKHEKGENNDEF 381
>ref|NP_223578.1| (NC_000921) putative [Helicobacter pylori J99]
 sp|Q9ZKS5|Y926_HELPJ Hypothetical protein JHP0860
 pir||C71878 hypothetical protein jhp0860 - Helicobacter pylori (strain J99)
 gb|AAD06443.1| (AE001516) putative [Helicobacter pylori J99]
          Length = 381

 Score =  724 bits (1868), Expect = 0.0
 Identities = 359/381 (94%), Positives = 369/381 (96%)

Query: 1   MNLNFMPLLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEML 60
           MNLNFMPLLHAYNH SIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEML
Sbjct: 1   MNLNFMPLLHAYNHVSIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEML 60

Query: 61  QIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYH 120
           QIFSQILGV+IAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQE+NLKILSLNYH
Sbjct: 61  QIFSQILGVKIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQERNLKILSLNYH 120

Query: 121 HNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEG 180
           HNKIKLGHLKGNRFFMRFKKMTPLNAQKT+QVLEQIAQFGMPNYFGSQRFGKFNDNH+EG
Sbjct: 121 HNKIKLGHLKGNRFFMRFKKMTPLNAQKTEQVLEQIAQFGMPNYFGSQRFGKFNDNHKEG 180

Query: 181 LKILQNQTKFAHQKLNAFLISSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLS 240
           LKILQN+TKFAHQKLNAFLISSYQSYLFN+LLSKRLEISKIISAFSVKE+LEFFKQKNLS
Sbjct: 181 LKILQNETKFAHQKLNAFLISSYQSYLFNSLLSKRLEISKIISAFSVKESLEFFKQKNLS 240

Query: 241 VDSDTLKTLKNQAHPFKILEGDVMCHYPYGKFFDALELEKEGERFLKKEVAPTGLLDGKK 300
           V S+ LK LKNQAHPFKILEGDVM HYPYGKFFDALELEKE ERFL KE  PTGLLDGKK
Sbjct: 241 VHSNALKALKNQAHPFKILEGDVMRHYPYGKFFDALELEKESERFLNKEAVPTGLLDGKK 300

Query: 301 ALYAKNLSLEIEKEFQHNLLSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKG 360
           ALYAKNLSLEIEK FQHNLLS HAKTLGSRRFFWVFVEN+TSQY+KEKAQFEL FYLPKG
Sbjct: 301 ALYAKNLSLEIEKGFQHNLLSGHAKTLGSRRFFWVFVENITSQYIKEKAQFELEFYLPKG 360

Query: 361 SYASALLKEIKHEKGENNDEF 381
           SYASALLKEIKHEKGENNDEF
Sbjct: 361 SYASALLKEIKHEKGENNDEF 381
>ref|NP_282597.1| (NC_002163) ypothetical protein Cj1457c [Campylobacter jejuni]
 pir||H81291 hypothetical protein Cj1457c [imported] - Campylobacter jejuni
           (strain NCTC 11168)
 emb|CAB73880.1| (AL139078) ypothetical protein Cj1457c [Campylobacter jejuni]
          Length = 372

 Score =  305 bits (782), Expect = 4e-82
 Identities = 166/373 (44%), Positives = 238/373 (63%), Gaps = 12/373 (3%)

Query: 2   NLNFMPLLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQ 61
           N  F PL ++  H+ I+ +F+ ++ DF V E PLYEFS  GEH ++ + K  L+T E L+
Sbjct: 7   NTIFKPL-YSLKHSPINAYFSKNSDDFVVRERPLYEFSGKGEHLILHINKKDLTTNEALK 65

Query: 62  IFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHH 121
           I S+  GV+I + GYAGLKDK   T Q++S+PKK+   L    SNF    LKIL +  H 
Sbjct: 66  ILSEASGVKIRDFGYAGLKDKQGSTFQYLSMPKKFESFL----SNFSHPKLKILEIFTHE 121

Query: 122 NKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGL 181
           NK+++GHLKGN FF+R KK+ P +A K +Q L  + + G  NYFG QRFGKF DN++EGL
Sbjct: 122 NKLRIGHLKGNSFFIRLKKVLPSDALKLEQALMNLDKQGFANYFGYQRFGKFGDNYKEGL 181

Query: 182 KILQNQTKFAHQKLNAFLISSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSV 241
           +IL+ + K  + K+  FLIS++QS LFN  LSKR+E+S   + FS KE ++ +K     +
Sbjct: 182 EILRGK-KMKNVKMKEFLISAFQSELFNRYLSKRVELSHFANDFSEKELIQIYK-----I 235

Query: 242 DSDTLKTLKNQAHPFKILEGDVMCHYPYGKFFDALELEKEGERFLKKEVAPTGLLDGKKA 301
             +  K LK Q   FK+L+G+V+ HYP+GK F   +L  E  RF  ++++  GLL G KA
Sbjct: 236 SKEEAKELKKQEQFFKLLKGEVLGHYPFGKCFLCEDLSAELGRFKARDISAMGLLIGAKA 295

Query: 302 L-YAKNLSLEIEKEFQHNLLSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKG 360
               + L+L +E E   + L   AK  GSRRF W ++E +  +Y +EKA F + F+L KG
Sbjct: 296 YETGEGLALNLENEIFKDTLEFKAKMQGSRRFMWGYLEELKWRYDEEKAHFCIEFFLQKG 355

Query: 361 SYASALLKEIKHE 373
           SYA+ +L+EI H+
Sbjct: 356 SYATVVLEEILHK 368
>ref|NP_295714.1| (NC_001263) conserved hypothetical protein [Deinococcus
           radiodurans]
 pir||D75328 conserved hypothetical protein - Deinococcus radiodurans  (strain
           R1)
 gb|AAF11542.1|AE002037_3 (AE002037) conserved hypothetical protein [Deinococcus radiodurans]
          Length = 353

 Score =  159 bits (401), Expect = 7e-38
 Identities = 117/347 (33%), Positives = 176/347 (50%), Gaps = 44/347 (12%)

Query: 27  DFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALT 86
           DF V EVP Y    +GE+  + V K+  +T  +++     LGVR  ++G AGLKD++A+T
Sbjct: 24  DFQVQEVPAYLPGGSGEYLYLHVEKTRHTTAHVVRELCAQLGVRDRDVGVAGLKDRHAVT 83

Query: 87  TQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNA 146
           TQ++SLP K  P +     +F    ++IL  + H NK+ +GHL GNRF +R +    + A
Sbjct: 84  TQWLSLPAKVEPRM----GDFSLPGVRILETSRHTNKLGMGHLHGNRFVVRVRGAAGM-A 138

Query: 147 QKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLISSYQSY 206
           ++  + L  +AQ G+PNYFG QRFG    N +EGL++L+ +++    ++  FL SS QS 
Sbjct: 139 EQAGETLATLAQGGVPNYFGPQRFGLGGLNAEEGLRVLRGESELRDPRVRRFLTSSVQSA 198

Query: 207 LFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILEGDVMCH 266
           +FNAL+S RLE              E F                      ++L GD+   
Sbjct: 199 IFNALVSLRLE-------------REVFD---------------------RLLTGDMAKK 224

Query: 267 YPYGKFFDALELEKEGERFLKKEVAPTGLLDGKKA--LYAKNLSLEIEKEFQHNLLSS-H 323
           +  G  F   +   E  R  + EV+ TG L G+K   L A   +LE E      L     
Sbjct: 225 HDTGGVFLVEDAGAETPRAQRGEVSATGTLFGRKVKPLTADAGALEAEALALFGLSPQVF 284

Query: 324 AKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEI 370
           A   G RR   VF     ++   E   + L F LPKGS+A+++L+E+
Sbjct: 285 ASRKGDRRLIRVF--PAEAEVRPEDDGYVLAFTLPKGSFATSVLREV 329
>ref|NP_406822.1| (NC_003143) conserved hypothetical protein [Yersinia pestis]
 emb|CAC92589.1| (AJ414156) conserved hypothetical protein [Yersinia pestis]
          Length = 349

 Score =  123 bits (308), Expect = 4e-27
 Identities = 104/360 (28%), Positives = 173/360 (47%), Gaps = 50/360 (13%)

Query: 23  SSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDK 82
           ++  DF V E   +E    GEH ++++RK+G +T  +    ++   +    + YAGLKD+
Sbjct: 21  ANPEDFVVVEDLGFEPDGEGEHLLVRIRKNGCNTQFVADYLARFAKLHPRLVSYAGLKDR 80

Query: 83  NALTTQF--ISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKK 140
           +A+T Q+  + LP K AP L    + F+ +  ++L    H  K+++G LKGN F +  + 
Sbjct: 81  HAVTEQWFCLHLPGKEAPDL----ATFELEGCEVLEAVRHKRKLRIGSLKGNAFTLVLRH 136

Query: 141 MTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLI 200
           +T  + Q  +Q L+QIA  G+PNYFGSQRFG+  +N  +      N+ +   +   +F +
Sbjct: 137 IT--DRQDVEQRLQQIAAQGVPNYFGSQRFGRGGNNLVQARLWANNEIRVKERSKRSFYL 194

Query: 201 SSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILE 260
           S+ +S +FN + S RL                          +  L T         +LE
Sbjct: 195 SASRSAMFNLISSYRL--------------------------AQQLSTT--------VLE 220

Query: 261 GDVMCHYPYGKFF--DALELEKEGERFLKKEVAPTGLLDGKKALYAKNLSLEIEKEF--- 315
           GD +     G +F   A EL    +R    E+  T  L G   L     +L  E+     
Sbjct: 221 GDALQLSGRGSWFVAQADELAALQQRVTAGELNITAPLPGDSELGTHGEALAFEQACLAE 280

Query: 316 QHNLLS--SHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEIKHE 373
           Q  LLS     +  GSRR   +  +N+ S +  +    EL F+LP GS+A+++++EI ++
Sbjct: 281 QTELLSLIKRERVEGSRRAVLLKPQNMISNWWDD-VTLELSFWLPAGSFATSVVREIMNQ 339
>ref|NP_246549.1| (NC_002663) unknown [Pasteurella multocida]
 gb|AAK03694.1| (AE006198) unknown [Pasteurella multocida]
          Length = 335

 Score =  121 bits (303), Expect = 2e-26
 Identities = 101/372 (27%), Positives = 174/372 (46%), Gaps = 50/372 (13%)

Query: 6   MPLLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQ 65
           M L + +          +   DF V E   YE S  GE   ++VRK+  +TL + +  +Q
Sbjct: 1   MELAYLHTRPEQYARLKAECADFIVKENLGYEMSGDGEFVAVKVRKTDCNTLFVGEKLAQ 60

Query: 66  ILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIK 125
            +G+    +GYAGLKD+ A+T Q+  L     P    +  +FQ + + IL +  HH KI+
Sbjct: 61  FVGISERNMGYAGLKDRKAVTEQWFCLHMPGQP--TPDFRSFQLEGVDILEVTRHHRKIR 118

Query: 126 LGHLKGNRFFMRFKKMTPLNAQKTKQV---LEQIAQFGMPNYFGSQRFGKFNDNHQEGLK 182
            G L+GN F +  +      A++T ++   L  I Q G PNYF  QRFG+   N  + L+
Sbjct: 119 TGSLEGNHFEILLR-----GAKETDELNVRLNNIKQCGFPNYFTEQRFGRDGHNLTQALR 173

Query: 183 ILQNQTKFAHQKLNAFLISSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVD 242
             Q +     +K  +F +S+ +S +FN ++S+R+              L+  +Q      
Sbjct: 174 WAQGEINVKDRKKRSFYLSAARSEVFNLVVSERIA-------------LQLAQQ------ 214

Query: 243 SDTLKTLKNQAHPFKILEGDVMCHYPYGKFFDALELEKEG---ERFLKKEVAPTGLLDGK 299
                          +L GD++       +F A E E       R  ++++  T  L G+
Sbjct: 215 ---------------VLRGDMLQLQGSHSWFQADEKEDLNALQARLEQQDILLTAPLIGE 259

Query: 300 KALYAKNLSLEIEKEFQHNL-LSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLP 358
           +   A ++  ++ ++ Q  L L +  +   +RR   +  + +   +V E    +L FYLP
Sbjct: 260 QNPPATDIENQLVEQHQALLTLMAKERMKAARRPMLMQAQALQWAFVAE--GLKLAFYLP 317

Query: 359 KGSYASALLKEI 370
            GSYA+AL++E+
Sbjct: 318 AGSYATALVREV 329
>ref|NP_438860.1| (NC_000907) conserved hypothetical protein [Haemophilus influenzae
           Rd]
 sp|P44039|YGBO_HAEIN Protein HI0701
 pir||C64012 conserved hypothetical protein HI0701 - Haemophilus influenzae
           (strain Rd KW20)
 gb|AAC22360.1| (U32753) conserved hypothetical protein [Haemophilus influenzae Rd]
          Length = 339

 Score =  119 bits (298), Expect = 6e-26
 Identities = 100/355 (28%), Positives = 162/355 (45%), Gaps = 58/355 (16%)

Query: 27  DFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALT 86
           DF V E   YE S  GE   + VRK+  +TL + +  ++  GV    +GYAGLKD+ A+T
Sbjct: 26  DFIVKEHLGYEMSGDGEFVALYVRKTDCNTLFVGEKLAKFAGVSERNMGYAGLKDRRAVT 85

Query: 87  TQF--ISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPL 144
            Q+  + +P    P    + S F+   ++IL++  H+ KI+ G L+GN F +  +     
Sbjct: 86  EQWFCLQMPGMETP----DFSQFELDGVEILTVTRHNRKIRTGSLEGNYFDILLRGAEES 141

Query: 145 NAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLISSYQ 204
           +  K +  L+ +A FG PNYF  QRFG+   N  + L+  Q + K   +K  +F +S+ +
Sbjct: 142 DELKAR--LDFVANFGFPNYFTEQRFGRDGHNLTQALRWAQGEIKVKDRKKRSFYLSAAR 199

Query: 205 SYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILEGDVM 264
           S +FN +++ R+E S I                             NQ  P  I++  + 
Sbjct: 200 SEIFNLVVAARIEKSTI-----------------------------NQVLPNDIVQ--LA 228

Query: 265 CHYPYGKFFDALELEKEGERFLKKEVAPTGLLDGKKALYAKNLSLEIEKEFQ-HNLLSSH 323
             + + K  +  +L     R   +++  T  L G+  L A  +  EI  +    + L   
Sbjct: 229 GSHSWFKADEKEDLTALQVRLENQDILLTAPLIGEDILAASEIENEIVNQHSAFDPLMKQ 288

Query: 324 AKTLGSRR--------FFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEI 370
            +   +RR        F W F          E     L FYLP GSYA+AL++E+
Sbjct: 289 ERMKAARRPLLMKAKGFSWAF----------EPEGLRLKFYLPAGSYATALVREL 333
>ref|NP_289294.1| (NC_002655) putative hydrogenase subunit [Escherichia coli O157:H7
           EDL933]
 ref|NP_311626.1| (NC_002695) putative hydrogenase subunit [Escherichia coli O157:H7]
 gb|AAG57852.1|AE005502_6 (AE005502) putative hydrogenase subunit [Escherichia coli O157:H7
           EDL933]
 dbj|BAB37022.1| (AP002562) putative hydrogenase subunit [Escherichia coli O157:H7]
          Length = 349

 Score =  115 bits (289), Expect = 6e-25
 Identities = 97/365 (26%), Positives = 175/365 (47%), Gaps = 54/365 (14%)

Query: 23  SSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDK 82
           ++  DF V E   +E    GEH ++++ K+G +T  +    ++ L +   E+ +AG KDK
Sbjct: 22  ANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDK 81

Query: 83  NALTTQFIS--LPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKK 140
           +A+T Q++   +P K  P L    S FQ +  ++L    H  K++LG LKGN F +  ++
Sbjct: 82  HAVTEQWLCARVPGKEMPDL----SAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLRE 137

Query: 141 MTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLI 200
           ++  N    +Q L  I   G+PNYFG+QRFG    N Q  L+  Q  T    +   +F +
Sbjct: 138 VS--NRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGALRWAQTNTPVRDRNKRSFWL 195

Query: 201 SSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILE 260
           S+ +S LFN ++++RL+                                  +A   ++++
Sbjct: 196 SAARSALFNQIVAERLK----------------------------------KADVNQVVD 221

Query: 261 GDVMCHYPYGKFFDAL--ELEKEGERFLKKEVAPTGLLDG-------KKALYAKNLSLEI 311
           GD +     G +F A   EL +   R   KE+  T  L G       ++AL  +  ++  
Sbjct: 222 GDALQLAGRGSWFVATTEELVELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAA 281

Query: 312 EKEFQHNLLSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEIK 371
           E E Q  L+    K   +RR   ++ + ++  +  +    E+ F+LP GS+A+++++E+ 
Sbjct: 282 ETELQALLV--REKVEAARRAMLLYPQQLSWNW-WDDVTVEIRFWLPAGSFATSVVRELI 338

Query: 372 HEKGE 376
           +  G+
Sbjct: 339 NTTGD 343
>ref|NP_230181.1| (NC_002505) conserved hypothetical protein [Vibrio cholerae]
 pir||E82311 conserved hypothetical protein VC0530 [imported] - Vibrio cholerae
           (group O1 strain N16961)
 gb|AAF93698.1| (AE004139) conserved hypothetical protein [Vibrio cholerae]
          Length = 361

 Score =  115 bits (289), Expect = 6e-25
 Identities = 91/351 (25%), Positives = 171/351 (47%), Gaps = 44/351 (12%)

Query: 28  FCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTT 87
           F V+EV  Y  +  GEH ++++RK+G +T  +    ++  GV    + +AGLKD++A+T 
Sbjct: 28  FQVNEVLGYSLTGHGEHLMVRIRKTGENTSFVANELAKACGVPSRAVSWAGLKDRHAVTE 87

Query: 88  QFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQ 147
           Q++S+        + +    Q  +++IL +  H  K++ G L+GN F +   +++ + A 
Sbjct: 88  QWLSVHLPNGETPDFSAFLAQYPSIEILEVTRHDKKLRPGDLQGNEFVVTLSEVSDVAAV 147

Query: 148 KTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLISSYQSYL 207
            ++  LE +A+ G+PNYFGSQRFG+  +N  E  +  ++  +  +Q   +  +S+ +S++
Sbjct: 148 LSR--LETVAELGVPNYFGSQRFGRHGNNLSEARRWGRDNVRSRNQNQRSLYLSAARSWI 205

Query: 208 FNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILEGDVMCHY 267
           FN ++SKR+E  +   A  ++ ++   +Q+  +VD D                       
Sbjct: 206 FNQIVSKRIE--QGCFARFIEGDIALAEQQMFNVDGD----------------------- 240

Query: 268 PYGKFFDALELEKEGERFLKKEVAPTGLLDGKKALYAKNLSL-----EIEKEFQHNLLSS 322
                     L    +R    EVA +  L G  AL     +L     E++ E     L  
Sbjct: 241 ----------LALWDQRLQAGEVAISAALAGDNALPTSGQALPLEQAELDAEPDLMALIR 290

Query: 323 HAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEIKHE 373
             +    RR   +  +N++ Q   ++ Q  L F L  GS+A++L++E+  E
Sbjct: 291 GNRMRHDRRAIALKAQNLSWQV--QEDQITLRFSLDAGSFATSLVRELIEE 339
>ref|NP_457317.1| (NC_003198) conserved hypothetical protein [Salmonella enterica
           subsp. enterica serovar Typhi]
 emb|CAD06034.1| (AL627276) conserved hypothetical protein [Salmonella enterica
           subsp. enterica serovar Typhi]
          Length = 349

 Score =  114 bits (284), Expect = 2e-24
 Identities = 97/363 (26%), Positives = 174/363 (47%), Gaps = 50/363 (13%)

Query: 23  SSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDK 82
           ++  DF V E   +     GEH ++++ K+G +T  +  + ++ L +   E+ +AG KDK
Sbjct: 22  ANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADVLAKFLKIHAREVSFAGQKDK 81

Query: 83  NALTTQFIS--LPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKK 140
           +A+T Q++   +P K  P    + S FQ +  K+L    H  K++LG LKGN F +  ++
Sbjct: 82  HAVTEQWLCARVPGKEMP----DFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLRE 137

Query: 141 MTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLI 200
           ++     +T+  L+ I   G+PNYFG+QRFG    N Q  L+  Q+      +   +F +
Sbjct: 138 ISDRRDVETR--LQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWL 195

Query: 201 SSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILE 260
           S+ +S LFN ++ +RL+                             K   NQ     +++
Sbjct: 196 SAARSALFNQIVHQRLK-----------------------------KPDFNQ-----VVD 221

Query: 261 GDVMCHYPYGKFFDAL--ELEKEGERFLKKEVAPTGLLDGKKALYAKNLSLEIEKE--FQ 316
           GD +     G +F A   EL +   R  +KE+  T  L G      +  +L  E++   Q
Sbjct: 222 GDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQ 281

Query: 317 HNLLSS---HAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEIKHE 373
             +L S     K   SRR   ++ + ++  +  +    EL F+LP GS+A+++++E+ + 
Sbjct: 282 ETVLQSLLLREKVEASRRAMLLYPQQLSWNW-WDDVTVELRFWLPAGSFATSVVRELINT 340

Query: 374 KGE 376
            G+
Sbjct: 341 MGD 343
>ref|NP_417225.1| (NC_000913) putative hydrogenase subunit [Escherichia coli K12]
 sp|Q57261|YGBO_ECOLI Protein ygbO
 pir||I69731 hypothetical protein b2745 - Escherichia coli
 gb|AAA69255.1| (U29579) was ORF_f292 and ORF_f255 before splice; ORF_f349
           [Escherichia coli]
 gb|AAA79838.1| (L07942) ORF1 [Escherichia coli]
 gb|AAC75787.1| (AE000358) putative hydrogenase subunit [Escherichia coli K12]
          Length = 349

 Score =  114 bits (284), Expect = 2e-24
 Identities = 96/365 (26%), Positives = 174/365 (47%), Gaps = 54/365 (14%)

Query: 23  SSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDK 82
           ++  DF V E   +E    GEH ++++ K+G +T  +    ++ L +   E+ +AG KDK
Sbjct: 22  ANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDK 81

Query: 83  NALTTQFIS--LPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKK 140
           +A+T Q++   +P K  P L    S FQ +  ++L    H  K++LG LKGN F +  ++
Sbjct: 82  HAVTEQWLCARVPGKEMPDL----SAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLRE 137

Query: 141 MTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLI 200
           ++  N    +Q L  I   G+PNYFG+QRFG    N Q   +  Q  T    +   +F +
Sbjct: 138 VS--NRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWL 195

Query: 201 SSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILE 260
           S+ +S LFN ++++RL+                                  +A   ++++
Sbjct: 196 SAARSALFNQIVAERLK----------------------------------KADVNQVVD 221

Query: 261 GDVMCHYPYGKFFDAL--ELEKEGERFLKKEVAPTGLLDG-------KKALYAKNLSLEI 311
           GD +     G +F A   EL +   R   KE+  T  L G       ++AL  +  ++  
Sbjct: 222 GDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAA 281

Query: 312 EKEFQHNLLSSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEIK 371
           E E Q  L+    K   +RR   ++ + ++  +  +    E+ F+LP GS+A+++++E+ 
Sbjct: 282 ETELQALLV--REKVEAARRAMLLYPQQLSWNW-WDDVTVEIRFWLPAGSFATSVVRELI 338

Query: 372 HEKGE 376
           +  G+
Sbjct: 339 NTTGD 343
>ref|NP_461849.1| (NC_003197) paral putative hydrogenase subunit [Salmonella
           typhimurium LT2]
 gb|AAL21808.1| (AE008833) paral putative hydrogenase subunit [Salmonella
           typhimurium LT2]
          Length = 349

 Score =  112 bits (280), Expect = 7e-24
 Identities = 97/363 (26%), Positives = 173/363 (46%), Gaps = 50/363 (13%)

Query: 23  SSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDK 82
           ++  DF V E   +     GEH ++++ K+G +T  +    ++ L +   E+ +AG KDK
Sbjct: 22  ANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDK 81

Query: 83  NALTTQFIS--LPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKK 140
           +A+T Q++   +P K  P    + S FQ +  K+L    H  K++LG LKGN F +  ++
Sbjct: 82  HAVTEQWLCARVPGKEMP----DFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLRE 137

Query: 141 MTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLI 200
           ++     +T+  L+ I   G+PNYFG+QRFG    N Q  L+  Q+      +   +F +
Sbjct: 138 ISDRRDVETR--LQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWL 195

Query: 201 SSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILE 260
           S+ +S LFN ++ +RL+                             K   NQ     +++
Sbjct: 196 SAARSALFNQIVHQRLK-----------------------------KPDFNQ-----VVD 221

Query: 261 GDVMCHYPYGKFFDAL--ELEKEGERFLKKEVAPTGLLDGKKALYAKNLSLEIEKE--FQ 316
           GD +     G +F A   EL +   R  +KE+  T  L G      +  +L  E++   Q
Sbjct: 222 GDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQ 281

Query: 317 HNLLSS---HAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEIKHE 373
             +L S     K   SRR   ++ + ++  +  +    EL F+LP GS+A+++++E+ + 
Sbjct: 282 ETVLQSLLLREKVEASRRAMLLYPQQLSWNW-WDDVTVELRFWLPAGSFATSVVRELINT 340

Query: 374 KGE 376
            G+
Sbjct: 341 MGD 343
>ref|NP_637074.1| (NC_003902) hydrogenase subunit [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
 gb|AAM40998.1| (AE012272) hydrogenase subunit [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
          Length = 367

 Score =  107 bits (266), Expect = 3e-22
 Identities = 71/223 (31%), Positives = 116/223 (51%), Gaps = 18/223 (8%)

Query: 8   LLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQIL 67
           L  A+  A +     S A DF V E+P ++ S  GEH ++ VRK G +T  + +  +Q  
Sbjct: 7   LPRAHGAAVLTAAMRSVAEDFQVDELPAFDASGEGEHLLLTVRKRGQNTAYVAKRLAQWA 66

Query: 68  GVRIAELGYAGLKDKNALTTQFIS--LPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIK 125
           G+    +GYAGLKD++A+TTQ  S  LPK+ AP    + S   + +++++   +H+ K++
Sbjct: 67  GIAEMGIGYAGLKDRHAVTTQRFSVHLPKRIAP----DLSALDDDDMQVVEHTWHNRKLQ 122

Query: 126 LGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGK-----------FN 174
            G L GNRF +  +++    A    + L  IA  G+PN+FG QRFG+           F 
Sbjct: 123 RGALHGNRFVLTLREVVGDQAVIDAR-LHAIAARGIPNWFGEQRFGRDGGNVAAALAMFG 181

Query: 175 DNHQEGLKILQNQTKFAHQKLNAFLISSYQSYLFNALLSKRLE 217
              Q    +     +       + L+S+ +S LFN +L+ R+E
Sbjct: 182 HTRQPDGTLAPAPKRRLRNDQRSLLLSAARSALFNQVLTARVE 224
>ref|NP_642054.1| (NC_003919) hydrogenase subunit [Xanthomonas axonopodis pv. citri
           str. 306]
 gb|AAM36590.1| (AE011804) hydrogenase subunit [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 369

 Score =  100 bits (250), Expect = 2e-20
 Identities = 67/225 (29%), Positives = 117/225 (51%), Gaps = 22/225 (9%)

Query: 8   LLHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQIL 67
           L  A+  A +     S+  DF V E+P +E S  GEH ++ VRK G +T  + +  +   
Sbjct: 7   LPRAHGAAVLSAAMRSTPDDFQVDELPAFEPSGEGEHLLLTVRKRGQNTAYIAKKLAHWA 66

Query: 68  GVRIAELGYAGLKDKNALTTQFIS--LPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIK 125
           G+    + YAGLKD++A+TTQ  S  LP++ AP    + +   +  ++++  ++H+ K++
Sbjct: 67  GIAEMGVSYAGLKDRHAVTTQRFSVHLPRRIAP----DIAALDDTQMQVVESSWHNRKLQ 122

Query: 126 LGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKIL- 184
            G L GNRF +  +++        +Q L+ IA  G+PN+FG QRFG+   N    L +  
Sbjct: 123 RGALHGNRFVLTLRQVQG-ERDAIEQRLQAIAARGIPNWFGEQRFGRDGGNVAAALAMFG 181

Query: 185 -------------QNQTKFAHQKLNAFLISSYQSYLFNALLSKRL 216
                         ++ +  H +  + L+S+ +S LFN +L  R+
Sbjct: 182 HVQADDGTLLPAPTSRRRLRHDQ-RSMLLSAARSALFNRVLGARV 225
>ref|NP_252316.1| (NC_002516) conserved hypothetical protein [Pseudomonas aeruginosa]
 pir||H83193 conserved hypothetical protein PA3626 [imported] - Pseudomonas
           aeruginosa (strain PAO1)
 gb|AAG07014.1|AE004782_12 (AE004782) conserved hypothetical protein [Pseudomonas aeruginosa]
          Length = 355

 Score =  100 bits (249), Expect = 3e-20
 Identities = 101/355 (28%), Positives = 151/355 (42%), Gaps = 51/355 (14%)

Query: 25  ARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNA 84
           A DF V EV     S  GEH  + V K GL+T E  +   +  GV+   + YAGLKD+ A
Sbjct: 28  AEDFQVDEVLEIPLSGEGEHLWLWVEKRGLNTEEAARRLGRAAGVQQKNVSYAGLKDRQA 87

Query: 85  LTTQFIS--LPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMT 142
           LT Q+ S  LP K  P    +    +  +L+IL    H  K++ G    N F +R   +T
Sbjct: 88  LTRQWFSLHLPGKADP----DLGAAEGADLRILRCTRHSRKLQRGAHAANGFTLR---LT 140

Query: 143 PLNAQKT--KQVLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLI 200
            L A++      LE+IA  G+PNYFG QRFG    N  +     +     A++ L +  +
Sbjct: 141 GLRAERAPLDARLERIAADGVPNYFGLQRFGHGGGNLVDARSCAEQDLLPANRNLRSRFL 200

Query: 201 SSYQSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDTLKTLKNQAHPFKILE 260
           S+ +SYLFN LL++R+                                   +    +   
Sbjct: 201 SAGRSYLFNRLLAERVA----------------------------------EGSWNRAAV 226

Query: 261 GDVMCHYPYGKFFDALELEKEGERFLKKEVAPTGLL--DGKKALYAKNLSLEIEKEFQHN 318
           GD++       FF A E E    R    ++ PTG L  +G     A  L  E+    +  
Sbjct: 227 GDLLAFTDSRSFFLAGEEECRDARLAALDLHPTGPLWGEGDPPSGAGVLDRELALAGREP 286

Query: 319 LLSSHAKTLG---SRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEI 370
            L       G    RR   + ++ +   Y  E    +L F LP G +A+ +++EI
Sbjct: 287 ALCRWLAKAGMAHERRILRLPIQGLAWHY-PEPDVLQLEFVLPAGCFATVVVREI 340
>ref|NP_248367.1| (NC_000909) conserved hypothetical protein [Methanococcus
           jannaschii] [Methanocaldococcus jannaschii]
 sp|Q58759|YD64_METJA Hypothetical protein MJ1364
 pir||C64470 hypothetical protein MJ1364 - Methanococcus jannaschii
 gb|AAB99372.1| (U67576) conserved hypothetical protein [Methanococcus jannaschii]
           [Methanocaldococcus jannaschii]
          Length = 487

 Score = 94.4 bits (233), Expect = 2e-18
 Identities = 101/388 (26%), Positives = 171/388 (44%), Gaps = 59/388 (15%)

Query: 27  DFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALT 86
           DF V E+  +         + ++ K  + +L+     ++   + + ++GY GLKD++ALT
Sbjct: 100 DFIVEEIIDFNKIAGDRCYLYKLTKRNIESLKAFSYIAKKFKIPLKDIGYCGLKDRHALT 159

Query: 87  TQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNA 146
           TQ+IS+PKKY  L      +  E NLK L L      + LG L+GNRF +  + +   + 
Sbjct: 160 TQYISIPKKYGKL------SLDEPNLK-LELIGESKFLLLGDLEGNRFTITVRGLKKEDI 212

Query: 147 QKTKQVLEQIAQFGMPNYFGSQRFGKFND-----------NHQEGLKILQNQTKFAHQK- 194
            K K+ L+ + +FG PNYF SQRFG   D           N++E +KIL  + K + +K 
Sbjct: 213 PKIKENLKYL-EFGAPNYFDSQRFGSVFDKKFIAKEVIKGNYEEAVKILLTKYKKSEKKL 271

Query: 195 ---LNAFLISSY------QSYLFNALLSKRLEISKIISAFSVKENLEFFKQKNLSVDSDT 245
              L  F+  ++        Y+    +  RL ++ +     +K++ ++  +K LS   D 
Sbjct: 272 IKDLKRFIDKNWGDWDKIWEYIKENNIKSRLYVNMV---KELKKSNDY--KKALSYVDDR 326

Query: 246 LKTL--------------KNQAHPFKILEGDVMCHYPYGKFFDALELEKEGERFLKKEVA 291
           LK +              K     +   E  V   Y  G      ++++E    LK +  
Sbjct: 327 LKKIFVAAYQSYLWNECVKELLRKYVPEEDRVYYEYECGTLMFYKKMDEEVFNILKDKKF 386

Query: 292 PTGLLDGKKALYAKNLSLEIEKE-----FQHNLLSSHAKTLGSRRFFWVFVENV------ 340
           PT   D + +   K +  EI K       + N +    K + S R      +N+      
Sbjct: 387 PTIAPDIEYSGEEKEIIEEILKREGLTMEELNNIGELGKFIYSERKILSIPKNLKIGEFE 446

Query: 341 TSQYVKEKAQFELGFYLPKGSYASALLK 368
             +  K K +  L + L KGSYA+ ++K
Sbjct: 447 EDELNKGKYKITLSYELEKGSYATIIIK 474
>ref|NP_070505.1| (NC_000917) conserved hypothetical protein [Archaeoglobus fulgidus]
 pir||D69459 conserved hypothetical protein AF1677 - Archaeoglobus fulgidus
 gb|AAB89568.1| (AE000987) conserved hypothetical protein [Archaeoglobus fulgidus]
          Length = 411

 Score = 90.1 bits (222), Expect = 4e-17
 Identities = 104/389 (26%), Positives = 160/389 (40%), Gaps = 57/389 (14%)

Query: 27  DFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALT 86
           DF V E+  +  S+ G+  +I+V K    TL   ++ S  LG+    + +AG KDK ALT
Sbjct: 27  DFYVEEIAEFNLSDEGDFLIIRVEKKNWDTLNFARVLSNALGISQKRISFAGTKDKRALT 86

Query: 87  TQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNA 146
            Q+ S+      + ++       K+ KI  + Y    I+LG L GN  F R +     + 
Sbjct: 87  VQYFSI----YGVKKEEIERVNLKDAKIEVIGYARRAIQLGDLLGN--FFRIRVYGCRDG 140

Query: 147 QKTKQVLEQIAQFGMPNYFGSQRFGKFN-DNHQEGLKILQNQTKFA-------------- 191
           +  ++   ++ + G PN+FG QRFG      H+ G  ILQN  + A              
Sbjct: 141 EIFQETRNELMEKGTPNFFGLQRFGSIRFITHEVGKLILQNNYEEAFWVYVAKPFEGENE 200

Query: 192 -------------HQKLNAFLISSYQSYLFNALLSKRLEISKIISAFSVKENLE--FFKQ 236
                          KL    +  Y  Y  N L   R   S+  +  S+ +NL+  F   
Sbjct: 201 EVRKIREILWETRDAKLGLRELPKYLRYERNLLQKLREGKSEEEALLSLPKNLKMMFVHA 260

Query: 237 KNLSVDSDTLKTLKNQAHPFKIL-EGDVMCHYPY---GKFFDALELE--KEGERFLKKE- 289
               + +  L     Q    K L EGD  C+  +     F D  E+E  +   RFL KE 
Sbjct: 261 YQSYIFNRLLSERIRQFGSLKTLEEGDFACYLTFKTRPTFSDCSEVEVNEARVRFLVKER 320

Query: 290 VAPTGL----LDGKKALYAKNLSLEIEKEFQHNLLSSHAK-----TLGSRRFFWVFVENV 340
           VA   L     D K   +++ ++L+   E   +L S   K     + GS R     +E+ 
Sbjct: 321 VASLALPLVGYDTKLKGWSR-IALDFLSEDNLDLSSFKTKHKEFSSSGSYRPADTLIEHT 379

Query: 341 TSQYVKEKAQFELGFYLPKGSYASALLKE 369
              +          FYLP+G YA+  L+E
Sbjct: 380 GLSFTDS----TFSFYLPRGCYATVFLRE 404
>ref|NP_214182.1| (NC_000918) hypothetical protein [Aquifex aeolicus]
 pir||F70448 conserved hypothetical protein aq_1723 - Aquifex aeolicus
 gb|AAC07586.1| (AE000753) hypothetical protein [Aquifex aeolicus]
          Length = 385

 Score = 85.9 bits (211), Expect = 7e-16
 Identities = 43/155 (27%), Positives = 83/155 (52%), Gaps = 1/155 (0%)

Query: 17  IDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGY 76
           +D        +F V E+   +    G++A   ++K  ++TL+ ++  S   G+ +  +G+
Sbjct: 1   MDIRIKEKPEEFYVKEIKKLDLKEKGQYAYFLLKKKDMTTLDAVRHISHRFGIPLKNIGF 60

Query: 77  AGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFM 136
           AGLKDK A+T Q+IS+       + K    ++ +NL++  L +    ++LG ++GN F +
Sbjct: 61  AGLKDKKAVTEQYISVKDLNEEKIRK-MDGYRTENLELKFLGFSDKGLELGEIEGNYFEV 119

Query: 137 RFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFG 171
             + +T  + +   ++ E +  +G  NYFG QRFG
Sbjct: 120 VVRGVTKYHRRVFPRMKELVENYGCENYFGEQRFG 154
>ref|NP_276642.1| (NC_000916) conserved protein [Methanothermobacter
           thermautotrophicus] [Methanothermobacter
           thermautotrophicus str. Delta H]
 pir||H69070 conserved hypothetical protein MTH1529 - Methanobacterium
           thermoautotrophicum (strain Delta H)
 gb|AAB86003.1| (AE000913) conserved protein [Methanothermobacter
           thermautotrophicus str. Delta H]
          Length = 415

 Score = 85.5 bits (210), Expect = 9e-16
 Identities = 58/170 (34%), Positives = 92/170 (54%), Gaps = 8/170 (4%)

Query: 26  RDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNAL 85
           RDF V E+PL E S +G +  I + K G +TL++L   ++ L +    +G+AG+KDK A+
Sbjct: 23  RDFEVEEIPLTEPSGSGPNTWIWIEKEGRTTLDVLLDIARELHLDRRRMGFAGMKDKRAV 82

Query: 86  TTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKM---T 142
           T Q+I +    AP  E      + +N+K L +  +  K+++G L GNRF +  +      
Sbjct: 83  TRQWICV-SNTAP-SEVKAIEERIRNVKFLRVTANEKKLRMGQLLGNRFRILIRDTEIDE 140

Query: 143 PLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDN-HQEGLKILQNQTKFA 191
           PL  +  K  L+++   G+PNY+G QRFG    N H  G  ++    K A
Sbjct: 141 PL--ETAKATLQELEDKGVPNYYGWQRFGSPRANTHLVGRALVHGDVKGA 188
>ref|NP_618162.1| (NC_003552) conserved hypothetical protein [Methanosarcina
           acetivorans str. C2A] [Methanosarcina acetivorans C2A]
 gb|AAM06642.1| (AE011031) conserved hypothetical protein [Methanosarcina
           acetivorans str. C2A] [Methanosarcina acetivorans C2A]
          Length = 441

 Score = 78.2 bits (191), Expect = 1e-13
 Identities = 105/423 (24%), Positives = 172/423 (39%), Gaps = 70/423 (16%)

Query: 9   LHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILG 68
           L++ N   +         DF V E+   E    G + V+++ K    T    +  ++IL 
Sbjct: 17  LYSTNTEGLGGRLRQEVEDFIVKEITNREEGKDGRYLVLELTKRDWDTHHFTRTLAKILQ 76

Query: 69  VRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGH 128
           +    +  AG KDK ALTTQ IS+    A  +EK       K++++  L      ++LG 
Sbjct: 77  ISQKRISVAGTKDKRALTTQKISIFDIDALEIEK----IHLKDVELKVLGRSRKSVELGD 132

Query: 129 LKGNRFFMRFKKMTPLNAQKTKQVLEQ-----IAQFGMPNYFGSQRFGKFND-NHQEGLK 182
           L GN F +  + ++  + ++T+ +LE+     + Q G+PN+FG QRFG      H  G  
Sbjct: 133 LWGNEFIITIRDISS-SPEETRTILEKTNSKVLTQGGVPNFFGVQRFGSVRSVTHLVGKA 191

Query: 183 ILQ-------------------NQTKFAHQKLNAF-----------LISSYQSYLFNALL 212
           I++                    +TK A Q +              L   ++  + N L+
Sbjct: 192 IVEGNFEKAAMLYIAEPFPEEPEETKAARQFVKETRDFKEGLKTYPLRLGHERAMMNHLI 251

Query: 213 SKRLEISKIISAFSV--KENLEFFKQKNLSVDSDTL--KTLKNQAHPFKILEGDVMCHYP 268
           S   + S    AFSV  K     F     S   + +  + +++     + +EGD++C   
Sbjct: 252 SNPEDYS---GAFSVLPKNLYRMFVHAYQSYIYNMILCRRIESGISLNRAVEGDIVCFRN 308

Query: 269 YGKFFDALELEK-------EGERFLKKEVA-PTGLLDGKKALYAKNLSLEIEKEFQHNLL 320
                D+ + EK          R +K   A  T  L G    +A  +  EIE +    L 
Sbjct: 309 EAGLPDSSKTEKVTSETVNAMNRLIKHGRAFITAPLPGFNTEFASGVPGEIESKILKELR 368

Query: 321 SS----------HAKTLGSRRFFWVFVENVTSQYVKE----KAQFELGFYLPKGSYASAL 366
            S             + G+RR   + VE        E    K++  L F LPKGSYA+ +
Sbjct: 369 VSLEGFNVEEFPEMSSKGTRREVLLQVEPKFEVGEDELNPGKSKTVLEFMLPKGSYATTV 428

Query: 367 LKE 369
           L+E
Sbjct: 429 LRE 431
>ref|NP_112582.1| (NM_031292) hypothetical protein DKFZp434G1415 [Homo sapiens]
 emb|CAB66693.1| (AL136759) hypothetical protein [Homo sapiens]
          Length = 701

 Score = 73.2 bits (178), Expect = 5e-12
 Identities = 46/152 (30%), Positives = 76/152 (49%), Gaps = 8/152 (5%)

Query: 44  HAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKN 103
           +    +RK  L   E +   +  LGV  ++  YAGLKDK A+T Q + + K     L+  
Sbjct: 302 YTAFTLRKENLEMFEAIGFLAIKLGVIPSDFSYAGLKDKKAITYQAMVVRKVTPERLKNI 361

Query: 104 TSNFQEKNLKILSLNYHHNKIKLGHLKGNRF---FMRFKKMTPLNA---QKTKQVLEQIA 157
               ++K + + ++    + ++LG LKGN F       KK    +A   ++  + +E + 
Sbjct: 362 EKEIEKKRMNVFNIRSVDDSLRLGQLKGNHFDIVIRNLKKQINDSANLRERIMEAIENVK 421

Query: 158 QFGMPNYFGSQRFGKFNDNH--QEGLKILQNQ 187
           + G  NY+G QRFGK    H  Q GL +L+N+
Sbjct: 422 KKGFVNYYGPQRFGKGRKVHTDQIGLALLKNE 453
>ref|NP_648231.1| (NM_139974) CG6745 gene product [Drosophila melanogaster]
 gb|AAF50409.1| (AE003555) CG6745 gene product [Drosophila melanogaster]
 gb|AAL39337.1| (AY069192) GH24787p [Drosophila melanogaster]
          Length = 734

 Score = 73.2 bits (178), Expect = 5e-12
 Identities = 42/156 (26%), Positives = 83/156 (52%), Gaps = 4/156 (2%)

Query: 38  FSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYA 97
           +S   ++    V K+ L T ++    +  L +R +++ Y+G+KDK A TTQ  S+ ++  
Sbjct: 231 WSFPADYVTFLVHKTNLVTSDVASTLAARLNLRPSQVNYSGIKDKRAKTTQKFSVKRRTP 290

Query: 98  PLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIA 157
             +     +  ++N+ I +  +  N +KLG L+GNRF +  + +     ++ +Q L+ + 
Sbjct: 291 ESILVAARS--QRNVHIGNFGFESNTLKLGDLQGNRFRIALRHIAKEKREEIEQALQSLK 348

Query: 158 QFGMPNYFGSQRFGKFND--NHQEGLKILQNQTKFA 191
           + G  NY+G QRFG       ++ G+ +L++  K A
Sbjct: 349 ERGFINYYGLQRFGNSASVPTYEVGVALLKHDYKLA 384
>ref|XP_128203.1| (XM_128203) RIKEN cDNA 3000003F02 [Mus musculus]
          Length = 702

 Score = 72.0 bits (175), Expect = 1e-11
 Identities = 48/152 (31%), Positives = 79/152 (51%), Gaps = 8/152 (5%)

Query: 44  HAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKN 103
           +    ++K  L T E + + +  LGV  ++  YAGLKDK A+T Q + + K     L+  
Sbjct: 302 YTAFTLQKENLETFEAIGLLAVKLGVIPSDFSYAGLKDKRAITYQSMVVKKVTPERLKSI 361

Query: 104 TSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKM-TPLN--AQKTKQVLEQIAQF- 159
               ++K + + ++    + ++LG LKGN F +  + +   LN  A  T+++LE I    
Sbjct: 362 KEEIEKKRMNVFNIRSVGDCLRLGQLKGNHFEIIIRHLRNQLNDSANLTERILEAIENVK 421

Query: 160 --GMPNYFGSQRFGKFN--DNHQEGLKILQNQ 187
             G  NY+G QRFGK       Q GL +L+N+
Sbjct: 422 NKGFVNYYGPQRFGKGQKIQTDQIGLALLKNE 453
>ref|NP_579284.1| (NC_003413) hypothetical protein [Pyrococcus furiosus DSM 3638]
 gb|AAL81679.1| (AE010256) hypothetical protein [Pyrococcus furiosus DSM 3638]
          Length = 407

 Score = 71.2 bits (173), Expect = 2e-11
 Identities = 50/162 (30%), Positives = 84/162 (50%), Gaps = 16/162 (9%)

Query: 27  DFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALT 86
           DF V EV        G+  +  ++K    T+  ++  ++ +GV  +E+G+AG KD++A+T
Sbjct: 27  DFIVKEVIPKSIFKAGKCKIYILKKKNWETMAAIKEIAKRVGVHYSEIGFAGTKDRHAVT 86

Query: 87  TQFISLPKKYAPLLEKNTSNFQEKNLKILSLNY--HHNKIKLGHLKGNRFFMRFKKMTPL 144
            Q+IS+ +           N +E  ++ + L +  +   +KLG L GN F +R ++  P 
Sbjct: 87  YQYISICRDV---------NLEEVKIRDIELKFVGYGRPLKLGFLLGNFFKIRVRESNP- 136

Query: 145 NAQKTKQVLEQIAQ-FGMPNYFGSQRFG-KFNDNHQEGLKIL 184
                  V+E+  +  G PNYFG QRFG K + NH  G  +L
Sbjct: 137 --SLLPSVIEEAKEKGGFPNYFGIQRFGEKRSVNHVVGKLLL 176
>ref|NP_632138.1| (NC_003901) conserved protein [Methanosarcina mazei Goe1]
 gb|AAM29810.1| (AE013232) conserved protein [Methanosarcina mazei Goe1]
          Length = 438

 Score = 71.2 bits (173), Expect = 2e-11
 Identities = 50/168 (29%), Positives = 84/168 (49%), Gaps = 10/168 (5%)

Query: 9   LHAYNHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILG 68
           L++ +   +         DF V E+   E    G++ ++++ K    T  + +  S+IL 
Sbjct: 14  LYSTDTTGLGGQLRQEIEDFIVKEITNREEGEEGKYLIVELTKRDWDTHHLTRTLSRILQ 73

Query: 69  VRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGH 128
           V    +  AG KDK ALTTQ IS+    A  +EK       K++++  L      ++LG 
Sbjct: 74  VSQKRISVAGTKDKRALTTQKISIFDTDASEIEK----IHLKDIELKVLGRSRKSVELGD 129

Query: 129 LKGNRFFMRFKKMTPLNAQKTKQVL-----EQIAQFGMPNYFGSQRFG 171
           L GN F +  + +   + ++T+ +L     E +AQ G+PN+FG QRFG
Sbjct: 130 LWGNDFRITVRNIEN-SPEETEALLKKTTDEILAQGGVPNFFGIQRFG 176
>dbj|BAB67790.1| (AB067484) KIAA1897 protein [Homo sapiens]
          Length = 645

 Score = 70.1 bits (170), Expect = 4e-11
 Identities = 47/146 (32%), Positives = 73/146 (49%), Gaps = 4/146 (2%)

Query: 42  GEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLE 101
           G +    + K    T++ + + S+ L V+     Y G KDK A+T Q I++ K  A  L 
Sbjct: 239 GSYCHFVLYKENKDTMDAINVLSKYLRVKPNIFSYMGTKDKRAITVQEIAVLKITAQRLA 298

Query: 102 KNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGM 161
               N    N K+ + +Y  N +KLG L+GN F +  + +T  + Q  +Q +  + + G 
Sbjct: 299 H--LNKCLMNFKLGNFSYQKNPLKLGELQGNHFTVVLRNITGTDDQ-VQQAMNSLKEIGF 355

Query: 162 PNYFGSQRFGKFN-DNHQEGLKILQN 186
            NY+G QRFG      +Q G  ILQN
Sbjct: 356 INYYGMQRFGTTAVPTYQVGRAILQN 381
>ref|NP_143398.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
 pir||H71030 hypothetical protein PH1538 - Pyrococcus horikoshii
 dbj|BAA30648.1| (AP000006) 409aa long hypothetical protein [Pyrococcus horikoshii]
          Length = 409

 Score = 69.7 bits (169), Expect = 5e-11
 Identities = 50/160 (31%), Positives = 84/160 (52%), Gaps = 12/160 (7%)

Query: 27  DFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALT 86
           DF V EV        G   +  ++K    T+  ++  ++ +G+  +E+G+AG KD++A+T
Sbjct: 28  DFIVKEVIPKSIFRGGRCRIYLLKKKNWETMAAIKEIAKRIGIHYSEIGFAGTKDRHAVT 87

Query: 87  TQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNA 146
            Q+IS+ +      E +  N   K++ +  + Y    +KLG L GN F +R +  TP   
Sbjct: 88  YQYISICR------EVDLENVAIKDVTLRFVGY-GRPLKLGLLLGNFFKIRVRDTTP--- 137

Query: 147 QKTKQVLEQIAQ-FGMPNYFGSQRFG-KFNDNHQEGLKIL 184
           +    +L++  +  G PNYFG QRFG K + NH  G  +L
Sbjct: 138 ELLPNILKEAKEKGGFPNYFGIQRFGEKRSVNHIVGKLLL 177
>dbj|BAB89856.1| (AP003295) hypothetical protein~similar to Arabidopsis thaliana
           chromosome 3, At3g04820 [Oryza sativa (japonica
           cultivar-group)]
          Length = 606

 Score = 67.4 bits (163), Expect = 3e-10
 Identities = 52/181 (28%), Positives = 84/181 (45%), Gaps = 14/181 (7%)

Query: 69  VRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGH 128
           +R    G+AG KDK A+TTQ +++ K  A  L    S      +K+   +Y    + LG 
Sbjct: 185 IRPRSFGFAGTKDKRAVTTQQVTVFKVQASRLVALNSKLI--GIKVGDFSYVKEGLALGQ 242

Query: 129 LKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFG--SQRFGKFNDNHQEGLKILQN 186
           L GNRF +  + +   +    K  L+ +   G  NY+G   QR  K+  N+ + L  +  
Sbjct: 243 LMGNRFTITLRSVVTESEDVIKAALDGLITNGFINYYGLQLQRLKKYPGNYLQALMAIP- 301

Query: 187 QTKFAHQKLNAFLISSYQSYLFNALLSKRLE---ISKIISAFSVKENLEFFKQKNLSVDS 243
                 + L    + SYQSYL+N   S R++   IS+++    V +    F+Q  L   S
Sbjct: 302 ------RTLRLMYVHSYQSYLWNHAASMRVQKYGISRVVEGDLVYKKEAPFEQGALKATS 355

Query: 244 D 244
           +
Sbjct: 356 E 356
>ref|NP_126313.1| (NC_000868) hypothetical protein [Pyrococcus abyssi]
 pir||A75183 hypothetical protein PAB0430 - Pyrococcus abyssi (strain Orsay)
 emb|CAB49544.1| (AJ248284) hypothetical protein [Pyrococcus abyssi]
          Length = 406

 Score = 67.4 bits (163), Expect = 3e-10
 Identities = 49/164 (29%), Positives = 89/164 (53%), Gaps = 12/164 (7%)

Query: 27  DFCVHEV-PLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNAL 85
           DF V E+ P   F   G   +  ++K    T+  ++  ++ +G+  +E+G+AG KD++A+
Sbjct: 24  DFIVREIIPKSIFK--GNCQIYLMKKRNWETIAAIKEIAKRIGIHYSEIGFAGTKDRHAV 81

Query: 86  TTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLN 145
           T Q+IS+ +     +EK     ++  LK +    +   +KLG L GN F +R + +    
Sbjct: 82  TYQYISVCRDVRKEVEK--LKIRDVELKFVG---YGRPLKLGFLLGNFFLIRVRDVK--R 134

Query: 146 AQKTKQVLEQI-AQFGMPNYFGSQRFG-KFNDNHQEGLKILQNQ 187
            +   +++E++  + G PNYFG QRFG K + NH  G  +L+ +
Sbjct: 135 PELIPKIIEELKIKGGFPNYFGIQRFGEKRSVNHIVGKLLLEGK 178
>ref|NP_014886.1| (NC_001147) Hypothetical ORF; Yor243cp [Saccharomyces cerevisiae]
 sp|Q08647|YO7T_YEAST Hypothetical 77.0 kDa protein in HES1-SEC63 intergenic region
 pir||S67136 hypothetical protein YOR243c - yeast (Saccharomyces cerevisiae)
 emb|CAA99464.1| (Z75151) ORF YOR243c [Saccharomyces cerevisiae]
          Length = 676

 Score = 67.0 bits (162), Expect = 3e-10
 Identities = 45/149 (30%), Positives = 77/149 (51%), Gaps = 8/149 (5%)

Query: 49  VRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQ 108
           + K    T+E + + +++L V    + YAG KD+ A+T Q +S+ K    L   N  N  
Sbjct: 224 LHKENKDTMEAVNVITKLLRVPSRVIRYAGTKDRRAVTCQRVSISK--IGLDRLNALNRT 281

Query: 109 EKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVL-----EQIAQFGMPN 163
            K + I + N+    + LG LKGN F +  + +T  N++ + + +     + +++ G  N
Sbjct: 282 LKGMIIGNYNFSDASLNLGDLKGNEFVVVIRDVTTGNSEVSLEEIVSNGCKSLSENGFIN 341

Query: 164 YFGSQRFGKFN-DNHQEGLKILQNQTKFA 191
           YFG QRFG F+   H  G ++L +  K A
Sbjct: 342 YFGMQRFGTFSISTHTIGRELLLSNWKKA 370
>emb|CAA22009.1| (AL033502) hypothetical protein [Candida albicans]
          Length = 747

 Score = 64.3 bits (155), Expect = 2e-09
 Identities = 48/186 (25%), Positives = 85/186 (44%), Gaps = 23/186 (12%)

Query: 49  VRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQ 108
           V K    T+E+     ++L +    + YAG KD+   T Q  S+   +  +L  N  N  
Sbjct: 272 VYKQNRDTMEVANNIGKLLRINHKFINYAGTKDRRGATCQRFSI--NHGKVLRVNALNKS 329

Query: 109 EKN-LKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQ---------------- 151
           ++N   + S +Y  + +KLG LKGN F +  + + P + Q+ +Q                
Sbjct: 330 KRNGFTLGSFSYEDHPLKLGDLKGNEFTIVIRDIKPHHQQQQQQEQQPQDQSQSIESIVT 389

Query: 152 -VLEQIAQFGMPNYFGSQRFGKFNDNHQEGLKILQNQTKFAHQKLNAFLISSYQSYLFNA 210
              E + + G  NYFG QRFG F+ +  E  K + N+     Q+    L+S  +S    +
Sbjct: 390 SCFESLQKNGFINYFGMQRFGSFSISTHEFGKFILNEN---WQEFVELLLSDQESVAPGS 446

Query: 211 LLSKRL 216
           + ++++
Sbjct: 447 IEARKI 452
>ref|NP_247567.1| (NC_000909) conserved hypothetical protein [Methanococcus
           jannaschii] [Methanocaldococcus jannaschii]
 sp|Q58008|Y588_METJA Hypothetical protein MJ0588
 pir||D64373 hypothetical protein homolog MJ0588 - Methanococcus jannaschii
 gb|AAB98579.1| (U67507) conserved hypothetical protein [Methanococcus jannaschii]
           [Methanocaldococcus jannaschii]
          Length = 430

 Score = 63.2 bits (152), Expect = 5e-09
 Identities = 86/354 (24%), Positives = 154/354 (43%), Gaps = 36/354 (10%)

Query: 42  GEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLE 101
           G +    + K   +TL+ ++  +  +G +    G+AG KDK A+TTQ +     +   LE
Sbjct: 91  GNYIHFTLEKRNWTTLDAIREIANRVGKQRKHFGFAGNKDKYAVTTQRVGC---FNVKLE 147

Query: 102 KNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFG- 160
            +    + K + +      + KI+LG L GNRF +R ++   L  ++ ++ L ++ +   
Sbjct: 148 -DLMKVKIKGIILRDFQKTNRKIRLGDLWGNRFTIRVRE-PELKGKELEEALNKLCKLKY 205

Query: 161 MPNYFGSQRFGKFND-NHQEGLKILQNQTKFAHQKL--NAFLISSYQSYLFNALLSK--- 214
             NY+G QRFG      H  G  I++   + A              +S L   L+ +   
Sbjct: 206 FLNYYGVQRFGTTRPITHIVGRFIIERDWEGAFHAYCGTPLPYDDKKSKLARELVDEENF 265

Query: 215 RLEISKIISAF-----SVKENLEFFK-QKNLSVDSDTLKTLKNQAHPFKILEGDVMCHYP 268
           +    K   AF      +K  +E    QK   +    L+ +   A+   +    +   + 
Sbjct: 266 KEAYKKFPKAFFYERRMIKAYIETGSYQKAFMILPPYLRCMFINAYQSYLFNEIINRRFE 325

Query: 269 YGKFFDALELEKEGERFLKKEVAPTGLLDGKKALYAKNLSLEIEKEF--QHNL------L 320
           YG  F+ +    EG+  +  +  P+G L G K  +A  +  EIE+E   + NL      +
Sbjct: 326 YG--FEPM----EGDILI--DNVPSGALFGYKTRFASGIQGEIEREIYERENLSPEDFKI 377

Query: 321 SSHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEIKHEK 374
                 +G RR     + N+  +Y  E   + L F L KG+YA+++L+E   +K
Sbjct: 378 GEFGSFIGDRRAMIGKIYNM--KYWIEDDSYVLQFCLKKGNYATSVLREFIEKK 429
>ref|NP_595812.1| (NC_003423) hypothetical protein [Schizosaccharomyces pombe]
 sp|O74343|YH2X_SCHPO Hypothetical protein C1A4.09 in chromosome II
 pir||T39858 hypothetical protein SPBC1A4.09 - fission yeast
           (Schizosaccharomyces pombe)
 emb|CAA20114.1| (AL031174) hypothetical protein [Schizosaccharomyces pombe]
          Length = 680

 Score = 62.8 bits (151), Expect = 6e-09
 Identities = 45/155 (29%), Positives = 78/155 (50%), Gaps = 7/155 (4%)

Query: 42  GEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAP-LL 100
           GE+    + K    +++ L   +++L V    L  AG KD+  +T Q +++    A  L 
Sbjct: 238 GEYCHFHLYKENRDSMDCLGKIARLLKVPTRTLSIAGTKDRRGVTCQRVAIHHVRASRLA 297

Query: 101 EKNTSNFQEKNLKILSLNYHH--NKIKLGHLKGNRFFMRFKKM-TPLNAQKTKQVLEQIA 157
           + N+ + +      L  NY +  + ++LG LKGN F +  + + TP   +K  + L  + 
Sbjct: 298 QLNSGSLKNSTYGFLLGNYSYKNSNLRLGDLKGNEFHIVVRNVITP--KEKVVEALNSLK 355

Query: 158 QFGMPNYFGSQRFGKFN-DNHQEGLKILQNQTKFA 191
           + G  NYFG QRFG  +   H  G+++LQ+  K A
Sbjct: 356 EHGFINYFGLQRFGTSSVGTHTIGVRLLQSDWKGA 390
>ref|NP_613961.1| (NC_003551) Uncharacterized conserved protein [Methanopyrus
           kandleri AV19]
 gb|AAM01891.1| (AE010360) Uncharacterized conserved protein [Methanopyrus kandleri
           AV19]
          Length = 400

 Score = 59.7 bits (143), Expect = 5e-08
 Identities = 36/135 (26%), Positives = 68/135 (49%), Gaps = 6/135 (4%)

Query: 37  EFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKY 96
           +F   G H +  + K    T++ ++  +Q L       G AG+KDK A+T+Q +++    
Sbjct: 45  QFGGRGPHTLFYLEKYDWDTMKAVRRIAQALRKHHRHFGIAGMKDKRAVTSQRVTVRGVP 104

Query: 97  APLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQI 156
             +L    +  + ++LKI+ +     K++ G L GNRF +  +        +  + + ++
Sbjct: 105 PGVL----ARLRIRDLKIVPMGRARRKLRPGDLWGNRFVITVRGAKVRRLPEALRTVREL 160

Query: 157 AQFGMPNYFGSQRFG 171
              G+PNY+G QRFG
Sbjct: 161 G--GVPNYYGLQRFG 173
>ref|NP_393692.1| (NC_002578) conserved hypothetical protein [Thermoplasma
           acidophilum]
 emb|CAC11360.1| (AL445063) conserved hypothetical protein [Thermoplasma
           acidophilum]
          Length = 411

 Score = 57.8 bits (138), Expect = 2e-07
 Identities = 46/173 (26%), Positives = 76/173 (43%), Gaps = 9/173 (5%)

Query: 13  NHASIDFHFNSSARDFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIA 72
           N A        +  DF V EV     S+ G++ +I+       T  +++  +  L +   
Sbjct: 11  NRARPVIRIKENPEDFTVEEVADIPKSDNGKYTIIKAEIIDWDTNRIVEEIASALRISRK 70

Query: 73  ELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGN 132
            + YAG KDK A   Q+  +    AP+   + S    K  +I+      + ++LG L  N
Sbjct: 71  RISYAGTKDKRARKIQYFCI---NAPV---DVSVLAFKGFRIIDRFRSDHYLRLGDLTAN 124

Query: 133 RFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRFGKFNDN-HQEGLKIL 184
            F +RF+  +    ++  + + +    G PNYFG QRFG    N H  G  I+
Sbjct: 125 HFRIRFQGASDDYIEQRYEKMMECG--GFPNYFGQQRFGSRRRNTHDVGRLIV 175
>ref|NP_505653.1| (NM_073252) Uncharacterized protein family UPF0024 [Caenorhabditis
           elegans]
 sp|Q17426|YQ4B_CAEEL Hypothetical 64.6 kDa protein B0024.11 in chromosome V
 pir||T18646 hypothetical protein B0024.11 - Caenorhabditis elegans
 emb|CAA94883.1| (Z71178) Weak similarity with Haemophilus Influenzae protein HI0701
           (Swiss Prot accession number P44039), contains
           similarity to Pfam domain: PF01142 (Uncharacterized
           protein family UPF0024), Score=810.7, E-value=1.8e-240,
           N=1~cDNA EST yk256g10.3 comes>
          Length = 577

 Score = 56.6 bits (135), Expect = 5e-07
 Identities = 39/137 (28%), Positives = 69/137 (49%), Gaps = 5/137 (3%)

Query: 51  KSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYA-PLLEKNTSNFQE 109
           K    T    Q+ ++ L V    +   G+KDK A+T+Q +S+ K +   +L+ N+   + 
Sbjct: 158 KENKETSFACQLIAKFLNVGPNNIRTHGIKDKRAVTSQRVSVTKVHERTILDLNS---KL 214

Query: 110 KNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQR 169
           + +++    Y  + +++G   GNRF +  + +   + Q   Q LE     G  NYFG+QR
Sbjct: 215 RGIRVFGCEYKDDPVQMGAHWGNRFSIVLRSLPDDSEQLLHQRLETFQNTGFINYFGTQR 274

Query: 170 FGKFNDNHQE-GLKILQ 185
           FG  +    E GL I++
Sbjct: 275 FGSRSSTTAEIGLAIVK 291
>ref|NP_187133.1| (NM_111354) unknown protein [Arabidopsis thaliana]
 gb|AAG51420.1|AC009465_20 (AC009465) unknown protein; 78996-83414 [Arabidopsis thaliana]
          Length = 699

 Score = 55.1 bits (131), Expect = 1e-06
 Identities = 36/129 (27%), Positives = 63/129 (47%), Gaps = 2/129 (1%)

Query: 40  NTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPL 99
           + G+     + K    T E L +  ++LGV+    G++G KDK +++TQ +++ K+ A  
Sbjct: 224 HVGKFLRFHLYKENKDTQEALGLIGKMLGVQPKSFGFSGTKDKRSVSTQRVTVFKQQASK 283

Query: 100 LEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQF 159
           L     N +   +K+         + LG L GNRF +  + +   + +  KQ  E + + 
Sbjct: 284 LA--ALNKRLFGIKVGDFCNVKEGLLLGQLMGNRFTITLRGVVADSEETIKQSAESLGKD 341

Query: 160 GMPNYFGSQ 168
           G  NYFG Q
Sbjct: 342 GFINYFGLQ 350
>gb|EAA11729.1| (AAAB01008960) ebiP4385 [Anopheles gambiae str. PEST]
          Length = 558

 Score = 54.7 bits (130), Expect = 2e-06
 Identities = 39/143 (27%), Positives = 69/143 (47%), Gaps = 5/143 (3%)

Query: 51  KSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLEKNTSNFQEK 110
           K  L T++      Q L    + L YAG KD+ A TTQ++ +  +    +     +    
Sbjct: 164 KENLDTIQATMQLGQKLFCAPSVLTYAGTKDRRAKTTQWMCIKTREPAKIVAAARHI--P 221

Query: 111 NLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGMPNYFGSQRF 170
           N+ + +  +  + +KLG L+GNRF +  +++T  +       LE   + G  NY+G QRF
Sbjct: 222 NVSVGNFTFKPDTLKLGQLQGNRFRIALRQVT-ASDDTINACLEAFREKGFINYYGLQRF 280

Query: 171 GKFN--DNHQEGLKILQNQTKFA 191
           G       ++ G+++L+   K A
Sbjct: 281 GNSAAVPTYKIGIEMLKGNWKGA 303
>ref|NP_111899.1| (NC_002689) Uncharacterized conserved protein [Thermoplasma
           volcanium]
 dbj|BAB60548.1| (AP000996) hypothetical protein [Thermoplasma volcanium]
          Length = 409

 Score = 53.5 bits (127), Expect = 4e-06
 Identities = 50/209 (23%), Positives = 88/209 (41%), Gaps = 18/209 (8%)

Query: 27  DFCVHEVPLYEFSNTGEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALT 86
           DF V E+   E    G++ +I+ R     T  +    ++ L +    + +AG KDK A+ 
Sbjct: 19  DFSVEEIADIEPDPNGKYTIIKARVRDWDTNRIAAEIARRLHMSRKRVTFAGTKDKRAVK 78

Query: 87  TQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNA 146
            Q+  +      +     S    K+ +++      + + LG L  N F +RF  + P   
Sbjct: 79  LQYFCINSADVDV----ASLSGIKDFEVIESFKSSHYLTLGDLIANHFKIRFYGIDP--E 132

Query: 147 QKTKQVLEQIAQFGMPNYFGSQRFGKFNDN-HQEGLKILQNQTKFAHQKLNAFLISSYQS 205
              ++ +  I++ G PN+FG QRFG    N H+ G  I++ + + A +K           
Sbjct: 133 MFRERYVHIISKGGFPNFFGDQRFGSRRRNTHEIGKLIIKGEYEEAVKK----------- 181

Query: 206 YLFNALLSKRLEISKIISAFSVKENLEFF 234
           Y+++    K       I     K  LE F
Sbjct: 182 YIYDEKYDKESYRKHFIDTLDYKTALERF 210
>gb|AAG18841.1| (AE004988) Vng0243c [Halobacterium sp. NRC-1]
          Length = 575

 Score = 50.1 bits (118), Expect = 4e-05
 Identities = 48/180 (26%), Positives = 85/180 (46%), Gaps = 20/180 (11%)

Query: 24  SARDFCVHEVPLYEF----SNTGEHAVIQVRKS--GLSTLEMLQIFSQILGVRIAELGYA 77
           S  DF V E+  ++     + TG++  + VR +     T +  +  +  +G+    + +A
Sbjct: 171 SPADFRVRELEAFDTQPADAPTGDYPWLVVRATLHEWDTNDFARELANTVGMSRERVRWA 230

Query: 78  GLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMR 137
           G KD++A+TTQ  ++    A  +       + +N  I  +      ++ G L GN F + 
Sbjct: 231 GTKDRHAVTTQLFAVRDLDAAQVP------EIRNADIEVVGRAGRGLEFGDLAGNAFEIV 284

Query: 138 FKKMTPLNAQKTKQVLEQIAQFG-----MPNYFGSQRFG-KFNDNHQEGLKILQNQTKFA 191
            +   P   ++   V +++A FG      PNYFG QRFG K    H+ GL IL++  + A
Sbjct: 285 VRD--PDAPERAAAVADELAAFGGGTVGTPNYFGQQRFGSKRPVTHEVGLAILRDDWEAA 342
>ref|NP_444184.1| (NC_002607) Uncharacterized conserved protein [Halobacterium sp.
           NRC-1]
          Length = 434

 Score = 50.1 bits (118), Expect = 4e-05
 Identities = 48/180 (26%), Positives = 85/180 (46%), Gaps = 20/180 (11%)

Query: 24  SARDFCVHEVPLYEF----SNTGEHAVIQVRKS--GLSTLEMLQIFSQILGVRIAELGYA 77
           S  DF V E+  ++     + TG++  + VR +     T +  +  +  +G+    + +A
Sbjct: 30  SPADFRVRELEAFDTQPADAPTGDYPWLVVRATLHEWDTNDFARELANTVGMSRERVRWA 89

Query: 78  GLKDKNALTTQFISLPKKYAPLLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMR 137
           G KD++A+TTQ  ++    A  +       + +N  I  +      ++ G L GN F + 
Sbjct: 90  GTKDRHAVTTQLFAVRDLDAAQVP------EIRNADIEVVGRAGRGLEFGDLAGNAFEIV 143

Query: 138 FKKMTPLNAQKTKQVLEQIAQFG-----MPNYFGSQRFG-KFNDNHQEGLKILQNQTKFA 191
            +   P   ++   V +++A FG      PNYFG QRFG K    H+ GL IL++  + A
Sbjct: 144 VRD--PDAPERAAAVADELAAFGGGTVGTPNYFGQQRFGSKRPVTHEVGLAILRDDWEAA 201
>ref|NP_560515.1| (NC_003364) conserved hypothetical protein [Pyrobaculum aerophilum]
 gb|AAL64697.1| (AE009913) conserved hypothetical protein [Pyrobaculum aerophilum]
          Length = 413

 Score = 47.8 bits (112), Expect = 2e-04
 Identities = 35/130 (26%), Positives = 61/130 (46%), Gaps = 9/130 (6%)

Query: 42  GEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLE 101
           G    I V K  + T++++   ++ LG+   ++   GLKD  A+T+Q IS+    A L  
Sbjct: 58  GNWTWIHVVKRNVDTIKLVLRLARALGLSHRDVSVGGLKDTKAVTSQIISVRGPVANLP- 116

Query: 102 KNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGM 161
                 Q   ++ L +      I    + GNRF +  + +  ++    +  LE + +   
Sbjct: 117 ------QLPGVQFLGMWSMDKPITPSQIYGNRFTIILRDVERISC--AEGALEALKKTAA 168

Query: 162 PNYFGSQRFG 171
           PNY+G QRFG
Sbjct: 169 PNYYGYQRFG 178
>ref|XP_110124.1| (XM_110124) RIKEN cDNA 3000003F02 [Mus musculus]
          Length = 355

 Score = 43.9 bits (102), Expect = 0.003
 Identities = 33/103 (32%), Positives = 53/103 (51%), Gaps = 9/103 (8%)

Query: 94  KKYAP-LLEKNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKM-TPLN--AQKT 149
           KK  P  L+      ++K + + ++    + ++LG LKGN F +  + +   LN  A  T
Sbjct: 4   KKVTPERLKSIKEEIEKKRMNVFNIRSVGDCLRLGQLKGNHFEIIIRHLRNQLNDSANLT 63

Query: 150 KQVLEQIAQF---GMPNYFGSQRFGKFN--DNHQEGLKILQNQ 187
           +++LE I      G  NY+G QRFGK       Q GL +L+N+
Sbjct: 64  ERILEAIENVKNKGFVNYYGPQRFGKGQKIQTDQIGLALLKNE 106
>ref|NP_376057.1| (NC_003106) 363aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
 dbj|BAB65166.1| (AP000981) 363aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
          Length = 363

 Score = 42.7 bits (99), Expect = 0.007
 Identities = 35/130 (26%), Positives = 58/130 (43%), Gaps = 13/130 (10%)

Query: 42  GEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLE 101
           G++AV  +RK G+    ++   S+IL  +I    Y G+KD NA+T Q +     Y   ++
Sbjct: 46  GKYAVYLLRKRGIDHFTVISEISKILHSKIH---YIGIKDTNAITEQIV-----YTTNVK 97

Query: 102 KNTSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQFGM 161
                ++     +  L Y ++K    +  GN F +  +       +K    L  I    +
Sbjct: 98  NIIEKYENDKFLLTFLGYSNSKF---NHTGNIFEIEIETDDIKEFEKRVNKLRSIKY--L 152

Query: 162 PNYFGSQRFG 171
           P Y G QRFG
Sbjct: 153 PAYIGYQRFG 162
>ref|NP_341730.1| (NC_002754) Conserved hypothetical protein [Sulfolobus
           solfataricus]
 gb|AAK40520.1| (AE006655) Conserved hypothetical protein [Sulfolobus solfataricus]
          Length = 377

 Score = 42.7 bits (99), Expect = 0.007
 Identities = 80/349 (22%), Positives = 140/349 (39%), Gaps = 53/349 (15%)

Query: 42  GEHAVIQVRKSGLSTLEMLQIFSQILGVRIAELGYAGLKDKNALTTQFISLPKKYAPLLE 101
           G++AV  + K G+     +   S++  V  +++ Y G+KD NA+T+Q +     Y PL E
Sbjct: 53  GKYAVFLLTKWGIDHFSAI---SEVQKVLHSKVNYIGIKDANAITSQLV-----YIPLNE 104

Query: 102 KN--TSNFQEKNLKILSLNYHHNKIKLGHLKGNRFFMRFKKMTPLNAQKTKQVLEQIAQF 159
           K      +Q ++  +  L +  +  KL H  GN F +       L+      V+E+I Q 
Sbjct: 105 KQELIEKYQSRSFILKFLGF--SSKKLNH-TGNIFEI------SLSISDFDIVIERIEQI 155

Query: 160 G----MPNYFGSQRFGKFND-NHQEGLKILQNQTKFAHQKLNAFLISSYQSYLFNALLSK 214
                +P + G QRFG      H  G  +L+   + A      +LI +Y    F +   +
Sbjct: 156 KKNPYLPAFIGYQRFGTRRPITHLIGKYLLRRDWEKAF-----YLILTYP---FLSESKE 207

Query: 215 RLEISKIISAFSVKENLEF----FKQ-----KNLSVDSDTLKTLKNQAHPFKI----LEG 261
            ++I K+I     KE +      FKQ     KN    +     LK+   P  +     + 
Sbjct: 208 TIDIRKLIMEGDFKEAVRSIPSKFKQEKLLLKNYMRFNSYYLALKSSFIPISLYLDAYQS 267

Query: 262 DVMCHYPYGKFFDALELEKEGERFLKKEVAPTGLLDGKKALYAKNLSLEIEKEFQHNLLS 321
            +   Y   K  +   L  +    ++  +      D  K +Y       +++  + N   
Sbjct: 268 YLFNLYLSRKLDEYKNLNDKVNLLIRIPIYFNNCDDVCKEIY-------LDEGIERNFFK 320

Query: 322 SHAKTLGSRRFFWVFVENVTSQYVKEKAQFELGFYLPKGSYASALLKEI 370
                +  R        N+    V E+ +  + F L +G YA+ LL+EI
Sbjct: 321 LQEFKISLRDLVRKAFMNIRDLKVNEETK-TISFVLERGMYATILLREI 368
  Database: /home/scwang/download_20020708_db/nr
    Posted date:  Aug 7, 2002 12:55 PM
  Number of letters in database: 324,149,939
  Number of sequences in database:  1,026,957
  
Lambda     K      H
   0.320    0.136    0.386 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 232,436,030
Number of Sequences: 1026957
Number of extensions: 9636381
Number of successful extensions: 22440
Number of sequences better than 1.0e-02: 45
Number of HSP's better than  0.0 without gapping: 26
Number of HSP's successfully gapped in prelim test: 19
Number of HSP's that attempted gapping in prelim test: 22287
Number of HSP's gapped (non-prelim): 96
length of query: 381
length of database: 324,149,939
effective HSP length: 123
effective length of query: 258
effective length of database: 197,834,228
effective search space: 51041230824
effective search space used: 51041230824
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 98 (42.4 bits)