>ref|NP_223213.1| cag island protein, CYTOTOXICITY ASSOCIATED IMMUNODOMINANT ANTIGEN
[Helicobacter pylori J99]
Length = 1167
Score = 1996 bits (5171), Expect = 0.0
Identities = 1022/1191 (85%), Positives = 1078/1191 (89%), Gaps = 29/1191 (2%)
Query: 1    MTNETIDQTRTPDQTQSQTAFDPQQFINNLQVAFIKVDNVVASFDPDQKPIVDKNDRDNR 60
            MTNE I+Q     Q Q++ AF+PQQFINNLQVAFIKVDNVVASFDP+QKPIVDKNDRDNR
Sbjct: 1    MTNEAINQ-----QPQTEAAFNPQQFINNLQVAFIKVDNVVASFDPNQKPIVDKNDRDNR 55

Query: 61   QAFDGISQLREEYSNKAIKNPTKKNQYFSDFIDKSNDLINKDNLIDVESSTKSFQKFGDQ 120
            QAF+ ISQLREE++NKAIKNPTKKNQYFS FI KSNDLI+KDNLID  SS KSFQKFG Q
Sbjct: 56   QAFEKISQLREEFANKAIKNPTKKNQYFSSFISKSNDLIDKDNLIDTGSSIKSFQKFGTQ 115

Query: 121  RYQIFTSWVSHQKDPSKINTRSIRNFMENIIQPPIPDDKEKAEFLKSAKQSFAGIIIGNQ 180
            RYQIF +WVSHQ DPSKINT+ IR FMENIIQPPI DDKEKAEFL+SAKQ+FAGIIIGNQ
Sbjct: 116  RYQIFMNWVSHQNDPSKINTQKIRGFMENIIQPPISDDKEKAEFLRSAKQAFAGIIIGNQ 175

Query: 181  IRTDQKFMGVFDESLKERQEAEKNG----GPTGGDWLDIFLSFIFNKKQSSDVKEAINQE 236
            IR+DQKFMGVFDESLKERQEAEKNG     PTGGDWLDIFLSF+FNKKQSSD+KE +NQE
Sbjct: 176  IRSDQKFMGVFDESLKERQEAEKNGEPNGDPTGGDWLDIFLSFVFNKKQSSDLKETLNQE 235

Query: 237  PVPHVQPDIATTTTDIQGLPPEARDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQ 296
            PVPHVQPD+ATTTTDIQ LPPEARDLLDERGNFSKFTLGDM MLDVEGVADIDPNYKFNQ
Sbjct: 236  PVPHVQPDVATTTTDIQSLPPEARDLLDERGNFSKFTLGDMNMLDVEGVADIDPNYKFNQ 295

Query: 297  LLIHNNALSSVLMGSHNGIEPEKVSLLYAGNGGFGDKHDWNATVGYKDQQGNNVATLINV 356
            LLIHNNALSSVLMGSHNGIEPEKVSLLY  NGG   +HDWNATVGYK+Q+G+NVATLINV
Sbjct: 296  LLIHNNALSSVLMGSHNGIEPEKVSLLYGNNGGPEARHDWNATVGYKNQRGDNVATLINV 355

Query: 357  HMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIRNKVDFMEFLAQNNTKLD 416
            HMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEI+NKVDFMEFLAQNN KLD
Sbjct: 356  HMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIQNKVDFMEFLAQNNAKLD 415

Query: 417  NLSEKEKEKFQNEIEDFQKDSKAYLDALGNDRIAFVSKKDTKHSALITEFNNGDLSYTLK 476
            NLS+KEKEKFQNEIEDFQKDSKAYLDALGND IAFVSKKD KH AL+ EF NG+LSYTLK
Sbjct: 416  NLSKKEKEKFQNEIEDFQKDSKAYLDALGNDHIAFVSKKDKKHLALVAEFGNGELSYTLK 475

Query: 477  DYGKKADKALDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKNPNKGVGATNGVSHLEAG 536
            DYGKKADKALDRE   TLQGSLKHDGVMFVDYSNFKYTNASK+P+KGVGATNGVSHLEAG
Sbjct: 476  DYGKKADKALDREAKTTLQGSLKHDGVMFVDYSNFKYTNASKSPDKGVGATNGVSHLEAG 535

Query: 537  FNKVAVFNLPDLNNLAITSFVRRNLENKLTAKGLSLQEANKLIKDFLSSNKELAGKALNF 596
            F+KVAVFNLP+LNNLAITS VR++LE+KL AKGLS QEANKL+KDFLSSNKEL GKALNF
Sbjct: 536  FSKVAVFNLPNLNNLAITSVVRQDLEDKLIAKGLSPQEANKLVKDFLSSNKELVGKALNF 595

Query: 597  NKAVAEAKSTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQ 656
            NKAVAEAK+TGNYDEVK+AQKDLEKSL+KRE LEK+V K LESKSGNKNKMEAK+QANSQ
Sbjct: 596  NKAVAEAKNTGNYDEVKQAQKDLEKSLKKRERLEKDVAKNLESKSGNKNKMEAKSQANSQ 655

Query: 657  KDEIFALINKEANRDARAIAYTQNLKGIKRELSDKLEKISKDLKDFSKSFDEFKNGKNKD 716
            KDEIFALINKEANRDARAIAY QNLKGIKRELSDKLE I+KDLKDFSKSFDEFKNGKNKD
Sbjct: 656  KDEIFALINKEANRDARAIAYAQNLKGIKRELSDKLENINKDLKDFSKSFDEFKNGKNKD 715

Query: 717  FSKAEETLKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSV 776
            FSKAEETLKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENS+
Sbjct: 716  FSKAEETLKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSI 775

Query: 777  KDVIINQKVTDKVDNLNQAVSVAKAMGDFSRVEQVLADLKNFSKEQLAQQAQKNEDFNTG 836
            KDVIINQK+TDKVDNLNQAVSVAKA GDFS VEQ LADLKNFSKEQLAQQAQKNEDFNTG
Sbjct: 776  KDVIINQKITDKVDNLNQAVSVAKATGDFSGVEQALADLKNFSKEQLAQQAQKNEDFNTG 835

Query: 837  KNSELYQSVKNSVNKTLVGNGLSGIEATALAKNFSDIKKELNEKFKNF-NNNNNGLKNST 895
            KNS LYQSVKN VN TLVGNGLS  EAT L+KNFSDIKKELN K  NF NNNNNGL+NST
Sbjct: 836  KNSALYQSVKNGVNGTLVGNGLSKAEATTLSKNFSDIKKELNAKLGNFNNNNNNGLENST 895

Query: 896  EPIYAKVNKKKTGQVASPEEPIYTQVAKKVNAKIDRLNQIASGLGGVGQAAGFPLKRHDK 955
            E                   PIYTQVAKKV AKIDRL+QIASGLG VGQAA F LKRHDK
Sbjct: 896  E-------------------PIYTQVAKKVKAKIDRLDQIASGLGDVGQAASFLLKRHDK 936

Query: 956  VDDLSKVGLSASPEPIYATIDDLGGPFPLKRHDKVDDLSKVGRSRNQELAQKIDNLNQAV 1015
            VDDLSKVGLSA+ EPIYATIDDLGGPFPLKRHDKVDDLSKVG SR Q+L QKIDNLNQAV
Sbjct: 937  VDDLSKVGLSANHEPIYATIDDLGGPFPLKRHDKVDDLSKVGLSREQKLTQKIDNLNQAV 996

Query: 1016 SEAKAGFFGNLEQTIDKLKDSTKKNVMNLYVESAKKVPASLSAKLDNYAINSHTRINSNI 1075
            SEAKA  F NL+Q IDKLKDSTKKNV+NLYVESAKKVP SLSAKLDNYA NSHTRINSN+
Sbjct: 997  SEAKASHFDNLDQMIDKLKDSTKKNVVNLYVESAKKVPTSLSAKLDNYATNSHTRINSNV 1056

Query: 1076 QNGAINEKATGMLTQKNPEWLKLVNDKIVAHNVGSVSLSEYDKIGFNQKNMKDYSDSFKF 1135
            +NG INEKATGMLTQKN EWLKLVNDKIVAHNVGS  LS YDKIGFNQKNMKDYSDSFKF
Sbjct: 1057 KNGTINEKATGMLTQKNSEWLKLVNDKIVAHNVGSAPLSAYDKIGFNQKNMKDYSDSFKF 1116

Query: 1136 STKLNNAVKDIKSGFTHFLANAFSTGYYCLARENAEHGIKNVNTKGGFQKS 1186
            ST+L+NAVKDIKSGF  FL N FS G Y L + + EHG+KN NTKGGFQKS
Sbjct: 1117 STRLSNAVKDIKSGFVQFLTNIFSMGSYSLMKASVEHGVKNTNTKGGFQKS 1167