BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (295 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P36943 Putative attaching and effacing protein homolog ... 620 e-176 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 306 8e-82 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 303 3e-81 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 301 2e-80 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 293 3e-78 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 291 1e-77 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 290 6e-77 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 281 3e-74 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 278 2e-73 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 275 1e-72 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 274 3e-72 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 272 9e-72 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 267 3e-70 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 265 1e-69 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 259 6e-68 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 255 1e-66 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 250 5e-65 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 249 9e-65 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 246 6e-64 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 244 3e-63 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 242 9e-63 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 236 6e-61 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 236 8e-61 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 236 1e-60 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 233 4e-60 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 233 5e-60 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 233 8e-60 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 231 3e-59 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 229 1e-58 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 228 2e-58 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 228 3e-58 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 226 7e-58 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 225 1e-57 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 222 1e-56 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 221 2e-56 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 220 5e-56 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 219 7e-56 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 219 1e-55 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 218 2e-55 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 218 2e-55 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 212 1e-53 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 210 4e-53 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 210 5e-53 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 199 1e-49 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 199 1e-49 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 198 2e-49 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 193 6e-48 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 190 4e-47 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 182 9e-45 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 181 2e-44 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 177 4e-43 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 172 1e-41 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 170 6e-41 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 159 7e-38 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 157 3e-37 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 155 2e-36 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 150 7e-35 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 142 2e-32 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 121 3e-26 UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 118 2e-25 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 117 4e-25 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 115 2e-24 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 108 3e-22 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 94 7e-18 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 89 1e-16 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 87 6e-16 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 84 7e-15 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 74 7e-12 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 72 3e-11 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 69 2e-10 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 68 4e-10 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 66 1e-09 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 65 2e-09 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 65 4e-09 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 64 7e-09 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 64 9e-09 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 64 9e-09 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 62 2e-08 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 61 6e-08 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 60 8e-08 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 59 3e-07 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 57 6e-07 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 56 2e-06 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 52 3e-05 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 52 3e-05 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 50 1e-04 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 49 1e-04 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 48 4e-04 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 45 0.004 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 44 0.010 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 43 0.013 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 42 0.025 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 41 0.053 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 40 0.095 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 620 bits (1599), Expect = e-176, Method: Compositional matrix adjust. Identities = 295/295 (100%), Positives = 295/295 (100%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT Sbjct: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV Sbjct: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA Sbjct: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI Sbjct: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ Sbjct: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 306 bits (783), Expect = 8e-82, Method: Compositional matrix adjust. Identities = 152/282 (53%), Positives = 187/282 (66%), Gaps = 11/282 (3%) Query: 12 RFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN 71 RF S L R VA I QVLFP+A T R + TA Sbjct: 5 RFHSSRLTRAVASLCIVTQVLFPVAST----AGHRVAAPQAAPAVLSEQDATA-----AQ 55 Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 VA A L S +S G AT+ A QEWL ++GT RV L +D+DF+LK Sbjct: 56 VAGMTTQAAGMLQSGMNSRQAAEMARGYATSTAQSAFQEWLSQWGTVRVTLGLDEDFTLK 115 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 S+ ++L P +DTP N+LFTQ + HRTDDR Q N G GWRHF+ D+MAGVN F DHDL+ Sbjct: 116 GSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFA-PDYMAGVNLFFDHDLT 174 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWP 250 R H+R+G+G EYWRD LKL ANGY+R SGW+ +P+++ DY+ RPANGWD+RAEGYLPA+P Sbjct: 175 RYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARPANGWDVRAEGYLPAYP 234 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 QLGA+LMYEQYYGDEV LFGKDKRQ+DPHA +A ++YTPVPL Sbjct: 235 QLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPL 276 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 303 bits (777), Expect = 3e-81, Method: Compositional matrix adjust. Identities = 134/222 (60%), Positives = 172/222 (77%), Gaps = 2/222 (0%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DFSLK Sbjct: 145 VAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDFSLK 204 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SS E L+P Y+TP N++F+Q +HRTDDRTQ+N G GWR+F+ + WM+GVN FIDHDL+ Sbjct: 205 SSSFEFLHPWYETPDNLVFSQHTLHRTDDRTQTNHGIGWRYFT-SSWMSGVNMFIDHDLT 263 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWP 250 R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LPAWP Sbjct: 264 RYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLPAWP 323 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 QLG L+YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 324 QLGGKLVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPL 365 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 301 bits (770), Expect = 2e-80, Method: Compositional matrix adjust. Identities = 132/222 (59%), Positives = 172/222 (77%), Gaps = 2/222 (0%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DFSLK Sbjct: 145 VAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDFSLK 204 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SS E L+P Y+TP N++F+Q +HRTD+RTQ+N G GWR+F+ + WM+GVN FIDHDL+ Sbjct: 205 SSSFEFLHPWYETPDNLVFSQHTLHRTDNRTQTNHGIGWRYFT-SSWMSGVNMFIDHDLT 263 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWP 250 R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LPAWP Sbjct: 264 RYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLPAWP 323 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 QLG ++YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 324 QLGGKVVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPL 365 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 293 bits (751), Expect = 3e-78, Method: Compositional matrix adjust. Identities = 128/223 (57%), Positives = 169/223 (75%), Gaps = 1/223 (0%) Query: 70 KNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS 129 + +A A+ AG FL++ P DA + GMAT A+ E+Q+WL ++GTAR++L+VD FS Sbjct: 136 QKIAGIASQAGNFLANSPRGDAAASIARGMATGAASTEVQQWLSQFGTARLQLDVDNKFS 195 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHD 189 LK+S L++L P+Y+ P ++FTQG++HRTDDRTQ+N+G G R F+ + +M G NTF+D+D Sbjct: 196 LKNSQLDLLIPLYEQPDKLVFTQGSLHRTDDRTQTNLGMGMRWFN-DGYMLGGNTFLDYD 254 Query: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAW 249 LSR H R+G+G EYWRDYLK+ AN Y+R + W+ S D DYQERPANGWD+ EG++PA Sbjct: 255 LSRDHARMGMGVEYWRDYLKIGANNYLRLTNWRDSKDFADYQERPANGWDMSLEGWVPAL 314 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 PQLG +L YEQYYG EV LFGKD RQKDPHAI+ V YTP PL Sbjct: 315 PQLGGNLKYEQYYGKEVALFGKDNRQKDPHAITVGVNYTPFPL 357 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 291 bits (746), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 126/227 (55%), Positives = 172/227 (75%), Gaps = 2/227 (0%) Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 N+E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+ Sbjct: 136 NLEQQIASTSQQIGSLLAEDMNSEQAANMARGWASSQASGAMTDWLSRFGTARITLGVDE 195 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 DFSLK+S + L+P Y+TP N+ F+Q +HRTD+RTQ N G GWRHF+ WM+G+N F Sbjct: 196 DFSLKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTPT-WMSGINFFF 254 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGY 245 DHDLSR H+R G+GAEYWRDYLKLS+NGY+R + W+ +P+++ DY+ RPANGWD+RAEG+ Sbjct: 255 DHDLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWDVRAEGW 314 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 LPAWP LG L+YEQYYGDEV LF KD RQ +PHAI+A + YTP PL Sbjct: 315 LPAWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPL 361 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 290 bits (741), Expect = 6e-77, Method: Compositional matrix adjust. Identities = 132/246 (53%), Positives = 176/246 (71%), Gaps = 10/246 (4%) Query: 48 QHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQE 107 Q+AV P +N +E +AS + GT LS +S+ G A+++A+ Sbjct: 129 QNAVPP--------ANGENTLENQIASTSQRVGTLLSQDMNSEQASGMARGWASSEASGA 180 Query: 108 IQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIG 167 + +WL +GTA++ L VD+DFSLK+S + L+P YDTP +LF+Q +HRTDDRTQ N G Sbjct: 181 MTDWLNNFGTAKISLGVDEDFSLKNSQFDFLHPWYDTPDYLLFSQHTLHRTDDRTQINTG 240 Query: 168 FGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 GWRHF+ + WM+G+N F DHDLSR H+R G+GAEYWRDYLKLS+N YI +GW+ +P++ Sbjct: 241 LGWRHFTPS-WMSGINLFFDHDLSRYHSRAGLGAEYWRDYLKLSSNAYIGLTGWRSAPEL 299 Query: 228 E-DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVT 286 + DY+ RPANGWD+RAEG+LPAWPQLG L+YEQYYGDEV LF K+ RQ +PHAI+A + Sbjct: 300 DNDYEARPANGWDLRAEGWLPAWPQLGGKLVYEQYYGDEVALFDKNDRQSNPHAITAGLN 359 Query: 287 YTPVPL 292 YTP PL Sbjct: 360 YTPFPL 365 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 281 bits (718), Expect = 3e-74, Method: Compositional matrix adjust. Identities = 132/273 (48%), Positives = 186/273 (68%), Gaps = 12/273 (4%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + VAW+ I++Q L+P ++FTP ++ HA + S A+ + ++S AA A Sbjct: 18 KVVAWSTIALQALYPALLSFTPTIS----HASAVKASQA----AAEQQELRGLSSLAAQA 69 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 G + ++ +F A+A +E+ EWL KYG AR++LNVD FSLKDS+ + LY Sbjct: 70 GRSI----ENGHAGSFAANTVPAQATKEVVEWLQKYGNARIQLNVDDAFSLKDSAFDFLY 125 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P D ++LF+Q ++HRTDDRTQ+NIG G+R+F+ ++ M G N F D+DLSR H R+G Sbjct: 126 PWIDKKQHVLFSQTSLHRTDDRTQTNIGMGYRYFTADNSMLGANLFYDYDLSRHHARMGA 185 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 G EYWRDYL+ AN Y+R S WK S D++DYQERPA+GWDI +G+LP++PQLGASL YE Sbjct: 186 GVEYWRDYLRAGANAYLRLSKWKDSHDLDDYQERPADGWDIYTQGWLPSYPQLGASLKYE 245 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 +YYG VGLFG D Q++P+A + ++YTPVPL Sbjct: 246 KYYGKNVGLFGSDHLQENPYAFTGGISYTPVPL 278 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 278 bits (710), Expect = 2e-73, Method: Compositional matrix adjust. Identities = 125/225 (55%), Positives = 170/225 (75%), Gaps = 5/225 (2%) Query: 72 VASFAANAGTFLSSQPDSDA----TRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 +A A+ G L + P+S+A R+ A AKA QEI +WL G RVKL+ D+D Sbjct: 128 LAQSASQVGNTLQNNPNSEALNDLARSSALSAANAKAGQEISDWLNGKGKVRVKLDADRD 187 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FS+K+S L++L P++++ ++M+F+QG++HRTDDRTQSN+G G+R+F+ + + G NTF D Sbjct: 188 FSVKNSQLDLLVPLWESESHMIFSQGSVHRTDDRTQSNLGLGYRYFA-DSYALGANTFYD 246 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 HD SRSH+R+G+GAEY R++ KL+ NGY+R S WK SPD ++Y+ERPANGWDIRAEGYLP Sbjct: 247 HDWSRSHSRLGLGAEYQRNFFKLATNGYLRLSNWKDSPDFDNYEERPANGWDIRAEGYLP 306 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 ++P LGA L YEQYYGD VGLFGKD +QK+PHAI+ Y+P PL Sbjct: 307 SYPGLGAKLAYEQYYGDNVGLFGKDNQQKNPHAITFGGNYSPFPL 351 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 275 bits (704), Expect = 1e-72, Method: Compositional matrix adjust. Identities = 132/253 (52%), Positives = 175/253 (69%), Gaps = 2/253 (0%) Query: 41 PVMAARAQHAVQPRLSMGNTTVTADNNVE-KNVASFAANAGTFLSSQPDSDATRNFITGM 99 P+MAA+ + G + + N + + VA +A+ AG+FL+S SDA + M Sbjct: 111 PLMAAKDNKNASDAAAPGRSASAEEGNEQAQKVAGYASQAGSFLASSAKSDAAASMARNM 170 Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 AT +A Q+WL +GTARV+L+ DK+FSLK+S ++L P+YD N +FTQG++HRTD Sbjct: 171 ATVEAGGAFQQWLSHFGTARVQLDADKNFSLKNSQFDLLLPLYDQGDNFVFTQGSLHRTD 230 Query: 160 DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 RTQ+++G GWRH S + +M G N F D DLSR H R G G EYWR++LKL N Y+R S Sbjct: 231 SRTQASLGAGWRH-STSTYMLGGNLFGDFDLSRDHARAGAGLEYWRNFLKLGVNSYLRLS 289 Query: 220 GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPH 279 GWK SPD+EDYQERPANGWD+R + ++P+ PQLG L YEQYYG EV LFG D RQ++PH Sbjct: 290 GWKDSPDLEDYQERPANGWDVRGQAWVPSLPQLGGKLTYEQYYGKEVALFGVDSRQRNPH 349 Query: 280 AISAEVTYTPVPL 292 AI+ + YTPVPL Sbjct: 350 AITVGINYTPVPL 362 Score = 41.6 bits (96), Expect = 0.036, Method: Compositional matrix adjust. Identities = 19/34 (55%), Positives = 21/34 (61%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQP 53 R +AW NI+VQV FPLAV F P MA QP Sbjct: 20 RRLAWFNIAVQVAFPLAVAFPPAMAGEQHFLPQP 53 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 274 bits (700), Expect = 3e-72, Method: Compositional matrix adjust. Identities = 124/229 (54%), Positives = 169/229 (73%), Gaps = 1/229 (0%) Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 D+ + +A A+ AG FLS P+ DA + G TA+A+ ++Q+WL ++GTARV+L+ Sbjct: 162 GDDAGARKMADVASRAGAFLSDNPNGDAALSLARGEVTAEASGQLQQWLNQFGTARVQLD 221 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 D+ FS K+S ++L P+Y+ +++FTQG++HRTDDRTQ N+GFG R+F+ + +M G N Sbjct: 222 ADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQVNLGFGLRYFAPS-YMLGGN 280 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F D+DLSR+H+R G+G EYWRD+LKLSANGY+R S W S D +DYQERPANGWDIRA+ Sbjct: 281 IFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNSSDFKDYQERPANGWDIRAQ 340 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 +LP+ PQLG L YEQYYG V LFGK+ Q+DP AI+A V +TP PL Sbjct: 341 AWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAGVNFTPFPL 389 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 272 bits (696), Expect = 9e-72, Method: Compositional matrix adjust. Identities = 123/228 (53%), Positives = 169/228 (74%), Gaps = 1/228 (0%) Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 D+ +A A+ AGT LS+ PD DA + G +A A+ ++Q+WL ++GTARV+L Sbjct: 141 DDEGAMKMADMASRAGTLLSNSPDGDAALSMARGQISAVASGQVQQWLNQFGTARVQLEA 200 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+ FSLK+S +++L P Y+ +LFTQG++HRTDDRTQ+N+GFG R+F+ + +M G N Sbjct: 201 DEHFSLKNSQVDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAPS-YMLGGNI 259 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F D+DLS H+R G+G EYWRD+LKLSANGY+R S W+ SP++++YQERPANGWDIRA+ Sbjct: 260 FGDYDLSHEHSRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANGWDIRAQA 319 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 +LP+ PQLG L YEQYYG V LFGK+ Q++P AI+A V +TP PL Sbjct: 320 WLPSLPQLGGKLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPL 367 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 267 bits (683), Expect = 3e-70, Method: Compositional matrix adjust. Identities = 129/288 (44%), Positives = 182/288 (63%), Gaps = 13/288 (4%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTP-VMAARAQHAV--QPRLSMGNTTVTA 64 +K + + +++ V WANI +Q +FPL++ FTP VMAA A +PR + Sbjct: 16 NKNKQLNKTRISKSVVWANIVIQAIFPLSIAFTPAVMAAETVGASDEKPR---------S 66 Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 + E++ A+ A + L++ + + G A N+ +Q+W ++G+A+V+LN+ Sbjct: 67 ASQAEQSTANAATRLASILTNDDSAKQASSIARGTAANAGNEALQKWFNQFGSAKVQLNL 126 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+ SLK S L++L P+ D+P + FTQ DDR N+G G RHF M G N Sbjct: 127 DEKLSLKGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQ-MLGYNL 185 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+DHD S SHTRIGVGAEY RD++ L+ANGY SGWK SPD++ Y E+ ANG+D+R+E Sbjct: 186 FVDHDASYSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSEA 245 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 YLP PQLG L+YEQY+GDEVGLFG D RQK+P A++ V YTP+PL Sbjct: 246 YLPTLPQLGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPL 293 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 265 bits (677), Expect = 1e-69, Method: Compositional matrix adjust. Identities = 126/238 (52%), Positives = 160/238 (67%), Gaps = 3/238 (1%) Query: 55 LSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGK 114 L+ NT +T E+N+A A + L+S D A R + G+A ANQ WL Sbjct: 76 LAPENTALTDTQTTERNLAKTATTSAQMLNSG-DKAAARQ-LRGLAVGNANQAANSWLNN 133 Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 +GTAR++ NVD L S +ML P YDTP+ M FTQ I R D RT +N+G G RHF Sbjct: 134 FGTARLQANVDDRGDLDGSQFDMLMPFYDTPSQMAFTQFGIRRIDKRTTANLGIGIRHFI 193 Query: 175 GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERP 234 +DWM G N F+D D++R HTR+G GAEY RDYLKL+ANGY+R S W+ SPD Y ERP Sbjct: 194 -DDWMVGYNLFLDRDITRDHTRVGAGAEYARDYLKLAANGYLRLSDWRDSPDFSSYSERP 252 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 A G+D+RAE YLP+ PQLG LMYEQY+G++VGLFGKD RQ++P AI+A + YTP+PL Sbjct: 253 ATGFDLRAEAYLPSLPQLGGKLMYEQYFGNDVGLFGKDNRQQNPAAITAGINYTPIPL 310 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 259 bits (663), Expect = 6e-68, Method: Compositional matrix adjust. Identities = 120/224 (53%), Positives = 156/224 (69%) Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E++VA A AG L DA R +T A+ +A + +WL ++GTA+ +L+V DF Sbjct: 40 EQSVAQTAMEAGRVLQGSNSGDAARQMLTSQASGQAADAVTQWLNQFGTAKTQLSVVSDF 99 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK SSL++L P Y+TP N+LFTQ + D R +N G G R+F+ N WM G N F D Sbjct: 100 SLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGWMLGYNVFYDV 159 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D ++ R G+G E WRDYLKLSANGY R S W++SP + DY ERPA+GWDIRAEG+LPA Sbjct: 160 DWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGWDIRAEGWLPA 219 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 +PQLG L+YEQYYG+EV LFG+ +RQK+PHAI+A VT+TP L Sbjct: 220 YPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSL 263 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 255 bits (652), Expect = 1e-66, Method: Compositional matrix adjust. Identities = 116/214 (54%), Positives = 145/214 (67%), Gaps = 1/214 (0%) Query: 79 AGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEML 138 AG LS+ SDA N T + N Q+WL ++GTARV+LNVD DF L +S+L++L Sbjct: 168 AGKLLSNDNTSDAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLL 227 Query: 139 YPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIG 198 P+ D+ +++LFTQ + D R NIG G R + G DWM G NTF D+DL+ + R+G Sbjct: 228 VPLKDSESSLLFTQLGVRNKDSRNTVNIGAGIRQYQG-DWMYGANTFFDNDLTGKNRRVG 286 Query: 199 VGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMY 258 VGAE DYLK SAN Y +GW +S D Y ERPA+G+DIR E YLPA+PQLG LMY Sbjct: 287 VGAEVATDYLKFSANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMY 346 Query: 259 EQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 E+Y GDEV LFGKD RQKDPHA++ V YTPVPL Sbjct: 347 EKYRGDEVALFGKDDRQKDPHAVTLGVNYTPVPL 380 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 250 bits (638), Expect = 5e-65, Method: Compositional matrix adjust. Identities = 126/282 (44%), Positives = 174/282 (61%), Gaps = 16/282 (5%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F S+ ++ + W+ I +Q+LFPL F PV AA P + TTV + E + Sbjct: 12 FFLSLKSKLIIWSQIVLQILFPLFTVF-PVHAA-------PATTTKETTVAMPYSQELST 63 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 + + +GT D ++ TGMAT+ A +Q+WL ++GTARV+LNVD + + D Sbjct: 64 LASSTASGT--------DGAKSAATGMATSAAASSVQQWLSQFGTARVQLNVDDNGNWDD 115 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 S++++L P+YD +LFTQ + D RT N+G G R F +WM G N F D D + Sbjct: 116 SAVDLLAPLYDNKKAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFGGNVFFDDDFTG 175 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 + R+G GAE W +YLKLSAN Y+ + W S D DY E+PA+G+DIRAEGYLPA+PQL Sbjct: 176 KNRRVGFGAEAWTNYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIRAEGYLPAYPQL 235 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 GA LMYEQYYGD+V LF D Q +P A++ ++YTPVPL Q Sbjct: 236 GAKLMYEQYYGDKVALFDTDHLQSNPSAVTTGISYTPVPLVQ 277 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 249 bits (635), Expect = 9e-65, Method: Compositional matrix adjust. Identities = 110/205 (53%), Positives = 149/205 (72%), Gaps = 2/205 (0%) Query: 88 DSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN 147 D D T+ I+ ++ +K+NQ+I++WL ++G ARV L+ DK+ +LK+SS E+L P+Y+ Sbjct: 112 DIDVTQYAISQIS-SKSNQKIEQWLNQFGHARVSLSADKNLTLKNSSAELLIPLYEQKEK 170 Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 ++F Q HR D R+Q N G G+R+F+ +M G+N F DHDL+ H R+G+GAE WRDY Sbjct: 171 LIFAQTNYHRKDLRSQFNYGIGYRYFT-EKFMVGINGFYDHDLTHHHNRLGIGAEIWRDY 229 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 KLS+N Y R S W+ S +I DY ERPANGWDIR EGY PA+PQLG L++EQYYG EVG Sbjct: 230 FKLSSNHYHRLSSWRASNNILDYSERPANGWDIRTEGYFPAYPQLGTKLIFEQYYGKEVG 289 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPL 292 LFGKDKR K+PH + + YTP+PL Sbjct: 290 LFGKDKRDKNPHTYTLGINYTPIPL 314 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 246 bits (629), Expect = 6e-64, Method: Compositional matrix adjust. Identities = 117/229 (51%), Positives = 154/229 (67%), Gaps = 1/229 (0%) Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 AD +E +AS T L++ + ++ I A + N Q+WL ++GTARV++N Sbjct: 106 ADVLLENKLASHVQTGATALATSNAAKSSERMIRSAANNEFNSSAQQWLSQFGTARVQMN 165 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 V+ DF L S++++L PIYD ++LFTQ D+R NIG G R F N+WM GVN Sbjct: 166 VNDDFKLDGSAVDVLVPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQ-NNWMYGVN 224 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 TF D+D++ + R+GVGAE W DYLKLSAN YI S W +S D DY ERPANG+D+RAE Sbjct: 225 TFFDNDMTGKNRRVGVGAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRAE 284 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 YLP+ PQLG LMYE+Y G+EV LFGKD RQK+PHA++A V YTP+PL Sbjct: 285 AYLPSHPQLGGKLMYEKYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPL 333 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 244 bits (622), Expect = 3e-63, Method: Compositional matrix adjust. Identities = 106/224 (47%), Positives = 156/224 (69%), Gaps = 1/224 (0%) Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E VA A+ AG F S+ PD + T+ F + T A+ Q+W ++G++++ L DK F Sbjct: 140 EIRVAQLASQAGKFFSTNPDQEKTKAFARELLTTAASSYAQDWFNRFGSSQIHLEADKKF 199 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S +++L P Y+T N++F+Q ++HR + R ++N+G G R + G M G NTF D+ Sbjct: 200 SLKNSQIDLLMPWYETEDNLIFSQTSLHRKEGRIETNLGLGARWY-GEGQMIGGNTFFDY 258 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D+SR H+R+G+G EY RD+LKLSAN Y R SGW+ S D+ D+ RP+NGWD+RAEG+LP+ Sbjct: 259 DISRKHSRLGLGVEYRRDFLKLSANSYHRLSGWRSSRDLADHSARPSNGWDVRAEGWLPS 318 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 +P +G L YEQYYGD V LFG Q++P++I+A + YTP+PL Sbjct: 319 YPHIGGKLTYEQYYGDSVALFGTKNLQQNPYSITAGLNYTPIPL 362 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 242 bits (618), Expect = 9e-63, Method: Compositional matrix adjust. Identities = 117/234 (50%), Positives = 151/234 (64%), Gaps = 7/234 (2%) Query: 65 DNN------VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA 118 DNN VE +A A T LS+ + + + A+ + N Q+WL ++GTA Sbjct: 131 DNNKDNRLSVENTLAGHAVAGATALSNGDVAKSGERMVRSAASNEFNNSAQQWLSQFGTA 190 Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 RV+LN++ DF L S+ ++L P+YD ++LFTQ D R N+G G R F GN W Sbjct: 191 RVQLNINDDFHLDGSAADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQGN-W 249 Query: 179 MAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGW 238 M G NTF D+DL+ + RIGVGAE W DYLKLSAN Y + W +S D DY ERPANG+ Sbjct: 250 MYGANTFFDNDLTGKNRRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYNERPANGY 309 Query: 239 DIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 D+RAE YLP++PQLG MYE+Y GD+V LFGKD RQK+PHAI+A V YTP+PL Sbjct: 310 DLRAEAYLPSYPQLGGKAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIPL 363 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 236 bits (603), Expect = 6e-61, Method: Compositional matrix adjust. Identities = 124/290 (42%), Positives = 171/290 (58%), Gaps = 17/290 (5%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 H ++ ++ + S+L + V WA I +Q+ FPL V P A+ A + +S Sbjct: 2 HEQSIMEKNTLKISLLKKIVIWAQILLQIAFPLLVL--PAHASSGPGATETDMSD----- 54 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 AS + + ++Q +DA +N T +AT A ++EWL +GTA+V L Sbjct: 55 ----------ASTLSASLASSAAQNGADAMKNTATHLATTHAASTVEEWLSHFGTAQVTL 104 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 +VD + + +S+ + L P+YD ++LFTQ I D RT NIG G R F DWM G Sbjct: 105 DVDDNGNWDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIGLGVRTFYVRDWMFGG 164 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N F D D + + RIG GAE W +YLKLSAN YI S W S D ++Y E+PA+G+D+RA Sbjct: 165 NVFFDDDFTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGDFDNYNEKPADGYDVRA 224 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 EGYLP++PQLGA LMYEQYYGD V LF KD Q +P A++ + YTPVPL Sbjct: 225 EGYLPSFPQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLNYTPVPL 274 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 236 bits (602), Expect = 8e-61, Method: Compositional matrix adjust. Identities = 104/221 (47%), Positives = 153/221 (69%), Gaps = 3/221 (1%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 +AS A + G LS+ D+ + + + T K N IQ W +GTA ++L VDK+FSLK Sbjct: 135 IASIATDVGNILSN--DNISKNSALLNKITNKVNSHIQSWFENFGTAHIQLQVDKNFSLK 192 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 +S LE+L+P+++ + F+QG I DD+ SNIG G+R F N WM G N+FID+DL Sbjct: 193 NSQLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDN-WMLGGNSFIDYDLR 251 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQ 251 + H+R+G+G EYW+D LKL AN Y+R S W+ S +I DY+ERPANG D+ + +LP++PQ Sbjct: 252 KEHSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNIKSWLPSYPQ 311 Query: 252 LGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 +G + YE+YYGD+V LFG++ RQ++PH+ + ++YTP PL Sbjct: 312 IGGDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPL 352 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 236 bits (601), Expect = 1e-60, Method: Compositional matrix adjust. Identities = 109/198 (55%), Positives = 137/198 (69%), Gaps = 5/198 (2%) Query: 99 MATAKANQEI----QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 MA + N EI Q+WLG++GTAR++ N + DF S++++L P+YD ++ FTQ Sbjct: 162 MARSAVNNEISSSAQQWLGQFGTARIQFNTNDDFEFDSSAIDVLIPLYDNQKSLFFTQLG 221 Query: 155 IHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANG 214 D R NIG G R F N WM G NTF D+D++ ++ R+G+GAE W DYLKLSANG Sbjct: 222 GRNKDSRNTINIGAGVRAFLTN-WMYGANTFFDNDITGNNRRVGIGAEAWTDYLKLSANG 280 Query: 215 YIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKR 274 Y + W +S D DY ERPANG+D+RAE YLPA+PQLG LMYEQY GDEV LFGKDKR Sbjct: 281 YFGTTDWHQSRDFADYNERPANGYDLRAETYLPAYPQLGGKLMYEQYNGDEVALFGKDKR 340 Query: 275 QKDPHAISAEVTYTPVPL 292 QKDPHAI+ + YTPV L Sbjct: 341 QKDPHAITVGINYTPVSL 358 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 233 bits (595), Expect = 4e-60, Method: Compositional matrix adjust. Identities = 129/297 (43%), Positives = 180/297 (60%), Gaps = 15/297 (5%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTF-TPVMAARAQHAVQP-RLSMG 58 M +++ H + + R + +A I +Q+ P ++ + + V A A+ G Sbjct: 12 MLFFRSTHMRSKTR-----KLLACIQIVLQLAPPSSLIYLSSVFNANAEEITSSAEKEQG 66 Query: 59 NTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA 118 N +D N +VA A AG+ LSS SDA + + T KA QEWL ++GTA Sbjct: 67 NP---SDQNAS-SVAQTAVQAGSLLSSDNASDALGSAVVSAVTGKAASSAQEWLSQFGTA 122 Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 RV ++ D+ F+L DS L++L P+Y+ N+LFTQ R DDR N GFG+RHF+ + W Sbjct: 123 RVNISTDEHFTLSDSELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHFN-DGW 181 Query: 179 MAGVNTFIDHDLS-RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 M G N F D +S H R+G+ E DYL +SANGY+R S W S +DY ER A+G Sbjct: 182 MWGTNVFYDRQVSGNQHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADG 241 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFG--KDKRQKDPHAISAEVTYTPVPL 292 +DIRA GYLPA+PQLGA+++YEQY+GD VGLFG +D RQKDP+A++ + YTPVPL Sbjct: 242 FDIRATGYLPAYPQLGANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPL 298 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 233 bits (595), Expect = 5e-60, Method: Compositional matrix adjust. Identities = 118/224 (52%), Positives = 148/224 (66%), Gaps = 1/224 (0%) Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E VA G L+S S+A ATA AN EI +WL KYGTA+++LN+DK+F Sbjct: 172 ESRVAGQLMGVGRVLASPQSSNAASEMARSWATAAANDEIVKWLSKYGTAQLQLNIDKNF 231 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SL S+L+ L P YDTPT FTQ D R NIG G R S N+W+ GVN F DH Sbjct: 232 SLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTLNIGIGTRTLS-NNWLFGVNAFYDH 290 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 DLS ++R+G+G+E W DYL+LS NGY+R S W +S D+ DY ERPANG+D+RA ++P Sbjct: 291 DLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSRDLADYNERPANGFDVRANAWMPT 350 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 PQLG LMYEQY+GD VGLFGKD Q++P+A + V YTP PL Sbjct: 351 LPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGVNYTPFPL 394 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 233 bits (593), Expect = 8e-60, Method: Compositional matrix adjust. Identities = 111/209 (53%), Positives = 149/209 (71%), Gaps = 2/209 (0%) Query: 84 SSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYD 143 SS P + AT + + ++ AN+EIQ+WLG+YGTA+V+LNVD FSL++SSL+ L+ YD Sbjct: 163 SSDPTTVAT-DVVRSEVSSTANKEIQKWLGQYGTAQVRLNVDDKFSLRESSLDWLFSFYD 221 Query: 144 TPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEY 203 + + ++FTQ I D R +N+G G R GN W+ G NTF D+DL+ ++R+G GAE Sbjct: 222 SSSAIIFTQLGIRNKDHRNTANLGLGGRISMGN-WILGANTFYDNDLTGINSRLGFGAEA 280 Query: 204 WRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG 263 W DYL+LSAN Y+R + W +S D D+ ERPANG+DIR +LP PQLG LMYEQY G Sbjct: 281 WTDYLQLSANSYMRLNNWHQSRDFIDHDERPANGFDIRTNAWLPVLPQLGGKLMYEQYSG 340 Query: 264 DEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 D V LFGKDK QK+P+A++A +TYTP PL Sbjct: 341 DSVALFGKDKLQKNPYAVTAGITYTPFPL 369 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 231 bits (588), Expect = 3e-59, Method: Compositional matrix adjust. Identities = 120/278 (43%), Positives = 166/278 (59%), Gaps = 7/278 (2%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y + AR + ++ P+ ++ A +A P+LS + + V Sbjct: 119 YRIFARGFEHVGVGDEIDIPVDMSSLNTQAGQA-----PKLSSAMREPSRAEKEAQAVGQ 173 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 + T S++P S+A MAT AN+EIQ+WL KYGTARV+LN+DK+FSL +S+ Sbjct: 174 LMSVGATLSSTRP-SEAAAGMARSMATNAANEEIQQWLSKYGTARVQLNLDKNFSLSESA 232 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 L+ P++D+ FTQ D R N+G G R + WM GVN F DHDL+ + Sbjct: 233 LDWFIPVWDSANLTAFTQLGARNKDRRNTINLGVGARTLL-DRWMLGVNMFYDHDLTGHN 291 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 +R+G+GAE W DYL+LS NGY+R S W +S D DY ER ANG+DIRA +LPA PQLG Sbjct: 292 SRLGIGAEAWTDYLQLSTNGYMRLSNWHQSRDFADYDERAANGFDIRANAWLPALPQLGG 351 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 L+YEQY G+ V LFGK+ Q++P+A++A V YTP PL Sbjct: 352 KLVYEQYIGENVALFGKENLQRNPYALTAGVNYTPFPL 389 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 229 bits (583), Expect = 1e-58, Method: Compositional matrix adjust. Identities = 103/194 (53%), Positives = 134/194 (69%), Gaps = 1/194 (0%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRT 158 +A +AN Q WL ++GTARV+LN+D + SLK S+ +ML P+YD ++LF+Q + Sbjct: 83 VAVGEANDAAQHWLSQFGTARVQLNLDNNLSLKGSAFDMLLPLYDDQKSLLFSQFGLRNH 142 Query: 159 DDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRA 218 D R NIG G R N WM G N F D D++ + RIG GAE W DYLKLSAN Y+R Sbjct: 143 DSRNTINIGAGVRTLQDN-WMYGANVFFDRDITGKNNRIGFGAEAWTDYLKLSANSYLRL 201 Query: 219 SGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDP 278 + W +S D DY ERPANG+D+R E YLPA+PQ+G +L YEQY G+EV LFGKD RQK+P Sbjct: 202 TDWHQSRDFADYNERPANGYDLRVEAYLPAYPQIGTNLKYEQYKGNEVALFGKDDRQKNP 261 Query: 279 HAISAEVTYTPVPL 292 +A +A + YTP+PL Sbjct: 262 YAFTAGINYTPIPL 275 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 228 bits (582), Expect = 2e-58, Method: Compositional matrix adjust. Identities = 116/264 (43%), Positives = 158/264 (59%), Gaps = 17/264 (6%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 +Q+LFP V +A A QP +++ V V S A GT ++ Sbjct: 1 MQLLFPF------VTSAYTYAASQPPVAVP---------VPTQVTSLLAAGGT--ETENG 43 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 S+ ++ T MAT A ++EWL +GTA V LN D++ + +SS++ L P+YD ++ Sbjct: 44 SNGLKSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNKKSV 103 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 LFTQ + D RT NIG G R F+ +WM G N F D D + + R+G+GAE W DYL Sbjct: 104 LFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWTDYL 163 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 KL+AN YI + W S D DY E+PA+G+DIRAEGYLPA+PQLGA +MYEQYYG+ V L Sbjct: 164 KLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGENVAL 223 Query: 269 FGKDKRQKDPHAISAEVTYTPVPL 292 F KD Q DP A++ + YTP+ L Sbjct: 224 FDKDHLQNDPSAVTMGLNYTPISL 247 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 228 bits (580), Expect = 3e-58, Method: Compositional matrix adjust. Identities = 105/194 (54%), Positives = 136/194 (70%), Gaps = 1/194 (0%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRT 158 M NQEI++WL ++GTA+V LN DK+FSLK+SSL+ L P YD+ + + F+Q I Sbjct: 133 MVGDAVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLFFSQLGIRNK 192 Query: 159 DDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRA 218 D R N+G G R N W+ G+NTF D+DL+ + RIG+GAE W DYL+L+ANGY R Sbjct: 193 DSRNTLNLGVGIRTLE-NGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQLAANGYFRL 251 Query: 219 SGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDP 278 +GW S D DY+ERPA G D+RA YLPA PQLG LMYEQY G+ V LFGKD Q++P Sbjct: 252 NGWHSSRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALFGKDNLQRNP 311 Query: 279 HAISAEVTYTPVPL 292 +A++A + YTPVPL Sbjct: 312 YAVTAGINYTPVPL 325 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 226 bits (576), Expect = 7e-58, Method: Compositional matrix adjust. Identities = 129/284 (45%), Positives = 181/284 (63%), Gaps = 12/284 (4%) Query: 20 RCVAWANISVQVLFPLAVTFT---PVMAARAQHAVQPR----LSMGNTTVTADNNVEKNV 72 R A+ + +QV+F +F P AA Q + ++ +T ++ KN+ Sbjct: 6 RLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEAGKNL 65 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 A+ ++ G+ LS +DA N +A +IQ+WL ++GTA+V L +DKD SL + Sbjct: 66 AAILSSTGSMLSQDNKTDALINSAINNGSAYVTGQIQQWLQQFGTAKVNLGLDKDLSLDN 125 Query: 133 SSLEMLYPIYDTPT-NMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 +SL++L P+YD N+LFTQ R DDR N+G G+R+F+ + WM G+NTF D +S Sbjct: 126 ASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFA-DRWMWGINTFYDRQIS 184 Query: 192 -RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 +H R+G+G E +Y KLSANGY R SGWK S + EDYQER ANG+DIRAEGYLPAWP Sbjct: 185 DNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDIRAEGYLPAWP 244 Query: 251 QLGASLMYEQYYGDEVGLF--GKDKRQKDPHAISAEVTYTPVPL 292 QLGA L++EQYYGD+V LF +D RQ++P+A++A V YTP PL Sbjct: 245 QLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPL 288 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 225 bits (574), Expect = 1e-57, Method: Compositional matrix adjust. Identities = 116/262 (44%), Positives = 161/262 (61%), Gaps = 15/262 (5%) Query: 41 PVMAARAQHAVQPRL-SMGNTTVTADNNVEKNVASFAANAGTFLS--------SQPDSDA 91 PV+ A A+ L S+G+ + +NN E A + GTFLS SQ D Sbjct: 29 PVIPAYAKMLDNKELPSLGSDQIIDENNTEHLAAEYTKTVGTFLSQKKTMKDLSQIAQDY 88 Query: 92 TRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFT 151 RN ++ AT +EI+ WL K G ++ ++ DK FS+K+S + L P YD +LFT Sbjct: 89 ARNKVSSEAT----KEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLFT 144 Query: 152 QGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLS 211 Q +HR D+R +N G G R+F + G+N FIDHDLS +HTR+G+G EYW+DYLKL+ Sbjct: 145 QHTLHRYDERFHTNNGIGLRYFHEKSTI-GMNAFIDHDLSHAHTRVGLGVEYWQDYLKLN 203 Query: 212 ANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFG 270 AN Y + WK + ++ D+ +PA+GWDI+ EG+LP +P LG +L YEQYYGD V LFG Sbjct: 204 ANSYFGLTSWKSASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALFG 263 Query: 271 KDKRQKDPHAISAEVTYTPVPL 292 K KRQK+P+A + +TP PL Sbjct: 264 KTKRQKNPNAATIGANWTPFPL 285 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 222 bits (566), Expect = 1e-56, Method: Compositional matrix adjust. Identities = 107/222 (48%), Positives = 137/222 (61%), Gaps = 2/222 (0%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 VAS + G LSS+ A GM + + ++EWLG G A+VKL D Sbjct: 124 VASHLSQVGNSLSSEDRVGAFSRLAKGMLLSSTAKTVEEWLGHIGQAQVKLQADDKNDFS 183 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 S +++ P+YD P + F+Q R D R NIG G RH+ +DWM G N F D +S Sbjct: 184 GSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGLGQRHYV-SDWMFGYNIFFDQQIS 242 Query: 192 -RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 +H R+G G E RDY+KLSAN Y R GWK S +EDY ER ANG+DIR E YLP +P Sbjct: 243 GNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLEDYDERAANGYDIRTEAYLPHYP 302 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 QLG LMYEQY+GDEV LFG ++RQK+P A++A V+YTP+PL Sbjct: 303 QLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYTPIPL 344 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 221 bits (563), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 98/189 (51%), Positives = 130/189 (68%) Query: 104 ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQ 163 A +++WL ++GTARV+LNVD + DS+++ L P+YD+ MLFTQ + DDR Sbjct: 74 AGDSVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLAPLYDSQKAMLFTQLGLRAPDDRVT 133 Query: 164 SNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKK 223 N G G R F ++WM G N F D D + + R+G GAE W + LKLSAN Y+ + W Sbjct: 134 GNFGLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGFGAEAWTNNLKLSANTYLGTTNWHS 193 Query: 224 SPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISA 283 S D +DY E+PA+G+D+RAEGYLPA+PQLGA LMYEQYYGD+V LF KD Q +P A++ Sbjct: 194 SRDFDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYEQYYGDKVALFDKDDLQSNPSAVTV 253 Query: 284 EVTYTPVPL 292 V+YTPVPL Sbjct: 254 GVSYTPVPL 262 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 220 bits (560), Expect = 5e-56, Method: Compositional matrix adjust. Identities = 107/233 (45%), Positives = 151/233 (64%), Gaps = 6/233 (2%) Query: 65 DNNVEKNVASFAANAGTFLSSQPDS----DATRNFITGMATAKANQEIQEWLGKYGTARV 120 +++ E+ +A + NA F S + ++ D +++ A A EI WL K G AR+ Sbjct: 59 EDSTERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEITHWLSKAGNARL 118 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 +N+DK S+K S L+ L P Y+ +LF+Q +IHRTD R Q+N G G RHF N M Sbjct: 119 NINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLRHFQQNS-MI 177 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDYQERPANGWD 239 GVN F DHDLS H+R+G G EY +DY+++SAN Y+ S W+ + ++ +DY RPANGWD Sbjct: 178 GVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDYNARPANGWD 237 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 I+ EG+LP + LGA+L EQYYGD+V LFGK++RQKDP A + V ++P PL Sbjct: 238 IQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPFPL 290 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 219 bits (559), Expect = 7e-56, Method: Compositional matrix adjust. Identities = 105/222 (47%), Positives = 140/222 (63%), Gaps = 2/222 (0%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 VAS + G+ LSS+ +A G+ + + ++EWLG G A+VKL VD Sbjct: 85 VASHLSQIGSTLSSESRVEAFSRLAKGVLLSSTAKSVEEWLGHIGKAQVKLQVDDKNDFS 144 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 S L + P+Y+ P + F+Q R D R NIG G RH+ +DWM G N F+D +S Sbjct: 145 GSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYL-SDWMLGYNVFLDQQIS 203 Query: 192 -RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 +H R+G+G E RDY+KLSAN Y R GWK S +EDY ER A+G+DIR E YLP +P Sbjct: 204 GNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERAASGYDIRTEAYLPYYP 263 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 QLG LMYEQY+G+EV LFG ++RQK+P A++A V+YTP PL Sbjct: 264 QLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPL 305 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 219 bits (557), Expect = 1e-55, Method: Compositional matrix adjust. Identities = 105/213 (49%), Positives = 141/213 (66%), Gaps = 1/213 (0%) Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 G+ L++ DA + MAT+ N +I +WL +YGTAR++LN D+DFSL +S+L+ L Sbjct: 152 GSALAASGRVDALHHMARTMATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLL 211 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ T LFTQ D R +NIG G R F ++WM G N F D+D + + R+G+ Sbjct: 212 PLYDSQTLTLFTQQGFRNKDRRNIANIGIGTR-FIHHEWMMGGNAFYDNDFTGDNKRVGL 270 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W D +LSANGY R + W +S D DY ERPANG D+RA G+LPA P LG SL+YE Sbjct: 271 GAELWTDSFQLSANGYFRLTAWHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYE 330 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 Y+GD V LFGKD Q++P+AI+ +YTP L Sbjct: 331 HYFGDNVALFGKDHLQRNPYAITLGGSYTPFSL 363 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 218 bits (556), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 123/277 (44%), Positives = 166/277 (59%), Gaps = 11/277 (3%) Query: 26 NISVQVLFPLAVTFTPV--MAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFL 83 N +L P P+ +A +A + P L MGN V ++ E+ A+ A G Sbjct: 76 NAGESLLLPANSPLFPLDPLAGKAIASNLPELGMGNDPVPLVSSGEQKTAAAAHAVGAQN 135 Query: 84 SSQPDSDATRN----FITGMATAKA----NQEIQEWLGKYGTARVKLNVDKDFSLKDSSL 135 + SD +N + G A A+ Q+ QE LGK+G A+V L VD + SL S+ Sbjct: 136 WNNMTSDQMKNQAESWAKGQAKAQVVDPLRQQAQELLGKFGKAQVNLAVDDNGSLSKSAF 195 Query: 136 EMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHT 195 + P Y+ + F+Q +HR D+R N+G G R F DW+ G NTF+D D+SR+H+ Sbjct: 196 SLFSPWYENDAMVAFSQVGVHRQDNRMIGNLGAGVR-FDQGDWLFGANTFLDQDISRNHS 254 Query: 196 RIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGAS 255 R+G+G E+W D LKL++N Y SGWK S D +DY ERPA G+D+ A+GYLPA+ QLGAS Sbjct: 255 RLGLGLEWWADNLKLASNYYHPLSGWKDSKDFDDYLERPARGFDVHAQGYLPAYQQLGAS 314 Query: 256 LMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 +YEQYYGDEV LFGKD QKDPHA++ V YTP PL Sbjct: 315 AVYEQYYGDEVALFGKDNLQKDPHAVTVGVDYTPFPL 351 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 218 bits (556), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 106/236 (44%), Positives = 150/236 (63%), Gaps = 11/236 (4%) Query: 19 ARC-VAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAA 77 A C V W+ I+ Q+L P+ T P ++ + + A++ +AS AA Sbjct: 12 ASCAVTWSVIATQILSPVTFTLIPA------NSFASSANTESAQTNANDEYANELASLAA 65 Query: 78 NAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEM 137 NAG L++ + F +A+A +E+ +WL +YG AR+KLNVD+ F+LKD++ + Sbjct: 66 NAGQSLAN----NTAGRFAVDTLSAQATKEVVDWLQQYGNARIKLNVDESFTLKDAAFDF 121 Query: 138 LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRI 197 LYP D+ +LF+Q ++HRTDDR Q+NIG G RHF+ ++ M G N F D+DLSR H+R Sbjct: 122 LYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFYDYDLSRHHSRA 181 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 G+G EYWRDY++ AN Y S WK S DI+DY ERPANGWD+ AEG+LP +PQLG Sbjct: 182 GLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWLPVYPQLG 237 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 212 bits (540), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 101/199 (50%), Positives = 129/199 (64%), Gaps = 1/199 (0%) Query: 94 NFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG 153 N M ANQE++ WL ++GT +V +N DK FSLK+SSL+ L P YD+ + + F+Q Sbjct: 71 NITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKESSLDWLLPWYDSASYVFFSQL 130 Query: 154 AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSAN 213 I D R NIG G R F WM G NT D+D++ + RIGVGAE W DYL+LSAN Sbjct: 131 GIRNKDSRNTLNIGAGVRTFQ-QSWMYGFNTSYDNDMTGHNHRIGVGAEAWTDYLQLSAN 189 Query: 214 GYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDK 273 GY R +GW +S D DY ERPA+G DI + YLPA PQLG L YEQY G+ V LFGKD Sbjct: 190 GYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGERVALFGKDN 249 Query: 274 RQKDPHAISAEVTYTPVPL 292 Q +P+A++ + YTP+P Sbjct: 250 LQSNPYAVTTGLIYTPIPF 268 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 210 bits (535), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 97/185 (52%), Positives = 129/185 (69%), Gaps = 1/185 (0%) Query: 108 IQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIG 167 IQ+WL ++GT +L+ D SLK+SSL+ L PIYDT N F Q D R N+G Sbjct: 172 IQKWLSQWGTVESQLSFDSKASLKNSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLG 231 Query: 168 FGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 +G RH N WM G+N F D+D++ ++ R+G+G E DYL +++N Y+R + W +S D Sbjct: 232 WGVRHVY-NGWMYGLNNFFDYDITGNNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDF 290 Query: 228 EDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTY 287 DY ERPANG+D+R G+LPA+PQ+G L+YEQYYGDEVGLFGKD RQKDP AI+A V++ Sbjct: 291 YDYDERPANGFDMRVNGWLPAYPQIGGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSW 350 Query: 288 TPVPL 292 TP PL Sbjct: 351 TPFPL 355 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 210 bits (534), Expect = 5e-53, Method: Compositional matrix adjust. Identities = 113/257 (43%), Positives = 147/257 (57%), Gaps = 8/257 (3%) Query: 41 PVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDS-----DATRNF 95 P++AA +L+ + VT N + ++AA L SQ S D ++ Sbjct: 131 PLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDT 190 Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAI 155 G+A +A+ ++Q WL YGTA V L +F SSL+ L P YD+ + F Q Sbjct: 191 ALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD--GSSLDFLLPFYDSEKMLAFGQVGA 248 Query: 156 HRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGY 215 D R +N+G G R F + M G N FID D S +TR+G+G EYWRDY K S NGY Sbjct: 249 RYIDSRFTANLGAGQRFFLPEN-MLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGY 307 Query: 216 IRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275 R SGW +S + +DY ERPANG+DIR GYLP++P LGA LMYEQYYGD V LF DK Q Sbjct: 308 FRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQ 367 Query: 276 KDPHAISAEVTYTPVPL 292 +P A + V YTP+PL Sbjct: 368 SNPGAATVGVNYTPIPL 384 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 199 bits (505), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 96/189 (50%), Positives = 124/189 (65%), Gaps = 1/189 (0%) Query: 106 QEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSN 165 Q+ Q+ LG++G A+V L++D +L S+ + P YD+ +LF+Q IH D+R N Sbjct: 78 QQAQDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYLLFSQINIHHQDNRKIGN 137 Query: 166 IGFGWR-HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 G G R + + G N FIDHD SR H R G+GAE DYLK SAN Y S WK S Sbjct: 138 FGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADYLKFSANYYHPLSHWKDS 197 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 PD +DY ERPA G+D+R++GYLPA+PQLG S +YE Y+GDEV LFGK RQKDP A++ Sbjct: 198 PDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVALFGKSHRQKDPRALTLG 257 Query: 285 VTYTPVPLT 293 + YTPVPL Sbjct: 258 IDYTPVPLV 266 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 199 bits (505), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 95/212 (44%), Positives = 138/212 (65%), Gaps = 3/212 (1%) Query: 82 FLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPI 141 L+S P A +++I ++ Q+WL ++GTA++ LNVD L +SS+++L P Sbjct: 115 LLNSDPKKLA-QDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPF 173 Query: 142 YDTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVG 200 YD + ++++Q D R N+G G R F N+WM G NTF D+DL+ +++R +G Sbjct: 174 YDDKDHWLVYSQYGYRHKDSRDTVNLGIGTRLFI-NNWMYGANTFYDNDLTGNNSRFSLG 232 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 E W +YLK+SAN Y R S W + D+ +Y ERPANG+D+ A+ YLP+ P LGA + YEQ Sbjct: 233 GELWTNYLKMSANAYFRLSDWHNARDLVNYYERPANGYDLIADMYLPSMPSLGAKIKYEQ 292 Query: 261 YYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 Y+GD V LFGK+KRQKDP+A + V YTP+PL Sbjct: 293 YFGDNVALFGKNKRQKDPYAATIGVNYTPIPL 324 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 198 bits (503), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 92/201 (45%), Positives = 131/201 (65%), Gaps = 2/201 (0%) Query: 93 RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFT 151 +++I ++ Q+WL ++GTA++ LNVD L +SS+++L P YD + ++++ Sbjct: 132 QDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHWLIYS 191 Query: 152 QGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLS 211 Q D R N+G G R F NDWM G NTF D+DL+ +++R +G E W +YLK+S Sbjct: 192 QYGYRHKDSRDTVNLGIGTRLFI-NDWMYGANTFYDNDLTGNNSRFSLGGELWTNYLKMS 250 Query: 212 ANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK 271 AN Y R S W S D+ +Y ERPANG+D+ A+ YLPA P LGA + YEQY+GD V LFG Sbjct: 251 ANAYFRLSDWHNSRDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQYFGDNVALFGT 310 Query: 272 DKRQKDPHAISAEVTYTPVPL 292 + RQKDP+A + V YTP+PL Sbjct: 311 NNRQKDPYAATIGVNYTPIPL 331 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 193 bits (490), Expect = 6e-48, Method: Compositional matrix adjust. Identities = 102/230 (44%), Positives = 143/230 (62%), Gaps = 12/230 (5%) Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E NVAS A + + S A + +TG+A A+Q +WLG+YG ARV+LN + Sbjct: 131 EDNVASAATQLWGIMGNDNSSRAAESAVTGVAAGLASQAAADWLGQYGNARVQLNSN--- 187 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 S+ ++ ++L P+ +T N+LF Q + +RT +N+G G R F+ + WM GVNTF D+ Sbjct: 188 SIGNA--DVLIPLTETQNNLLFGQLGVRYNGERTTNNVGLGVRSFT-DSWMFGVNTFYDY 244 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS--PDIEDYQERPANGWDIRAEGYL 246 DL+ ++R+GVG E W D LK SANGY R + W +S D+EDY ERPANG+D+RAE YL Sbjct: 245 DLTGKNSRLGVGGEAWTDNLKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYL 304 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKD----PHAISAEVTYTPVPL 292 P++PQLG LMYE+Y+G V L D P A + + YTP+PL Sbjct: 305 PSYPQLGGRLMYEKYFGKGVALNSGSTSPDDLGDSPSAFTVGLNYTPIPL 354 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 190 bits (483), Expect = 4e-47, Method: Compositional matrix adjust. Identities = 92/200 (46%), Positives = 127/200 (63%), Gaps = 1/200 (0%) Query: 93 RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQ 152 R+++ T+ A I++ L YG R L++ + L SS++ P YD T + F+Q Sbjct: 80 RSYLQSQITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSIDYFVPWYDNQTTVYFSQ 139 Query: 153 GAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSA 212 + R +DRT NIG G R ++ + ++ G N F D+D +R H R+G+GAE W DYLK S Sbjct: 140 FSAQRKEDRTIGNIGLGVR-YNFDKYLLGGNIFYDYDFTRGHRRLGLGAEAWTDYLKFSG 198 Query: 213 NGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKD 272 N Y S WK S D + Y+ERPA GWDIRAE +LPA+PQLG +++EQYYG+EV LFG D Sbjct: 199 NYYHPLSDWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKIVFEQYYGNEVALFGTD 258 Query: 273 KRQKDPHAISAEVTYTPVPL 292 +KDP A++ V Y PVPL Sbjct: 259 SLEKDPFAVTLGVKYQPVPL 278 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 182 bits (463), Expect = 9e-45, Method: Compositional matrix adjust. Identities = 91/191 (47%), Positives = 119/191 (62%), Gaps = 2/191 (1%) Query: 103 KANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRT-DDR 161 +AN LG GTAR ++ +D DF++ + ++L P+ + +LFTQ + R DR Sbjct: 195 EANAYAASLLGAMGTARTRVTLDDDFNMVTAEADLLLPLAEEQQTLLFTQFGLRRNGQDR 254 Query: 162 TQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 T +N+G G RHF + WM G N F D+DL+ H R GVGAE WRDYLKL AN Y S W Sbjct: 255 TIANLGVGQRHFL-DRWMLGYNLFADYDLTNRHWRAGVGAEAWRDYLKLGANFYTPLSSW 313 Query: 222 KKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAI 281 + SP E +ER A G D+R E YLPA+PQ ASL EQY G+ VGL D+ ++DPHAI Sbjct: 314 RDSPRFEGMEERAARGMDVRLEAYLPAYPQWSASLTAEQYLGERVGLLDADQLERDPHAI 373 Query: 282 SAEVTYTPVPL 292 +A + Y P PL Sbjct: 374 TAGLHYNPFPL 384 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 181 bits (459), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 96/244 (39%), Positives = 142/244 (58%), Gaps = 14/244 (5%) Query: 56 SMGNTTVT--ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLG 113 ++G+T + +NN EK +SF G LSS D + N+ + NQ+I +WL Sbjct: 92 NLGSTKILPEENNNEEKFASSFTL-MGDILSSDNFVDNSINYAKSIGQGLVNQQINDWLN 150 Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF 173 +YG AR+ + DK+ S + L P+ D P N+LFTQ + DR N+G G+R + Sbjct: 151 QYGKARISFSSDKNISG-----DFLLPVIDEPNNLLFTQLGLRNNTDRNTINLGLGYRKY 205 Query: 174 SGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPD--IEDYQ 231 N WM G+NTF D+D + + R+GVG E W DYLKL+ NGY + W +S ++DY Sbjct: 206 WRN-WMFGINTFYDYDYTGGNARLGVGGEAWIDYLKLAINGYFGLTDWHQSKISVMDDYD 264 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL---FGKDKRQKDPHAISAEVTYT 288 ERPA G+D+RAE YLP +PQLG+S+ YE+Y+G + L + + D ++ + YT Sbjct: 265 ERPATGFDVRAEAYLPKYPQLGSSIKYEKYFGKGIHLGTGVNPEYLKDDAQSLIMGLNYT 324 Query: 289 PVPL 292 P+PL Sbjct: 325 PIPL 328 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 177 bits (449), Expect = 4e-43, Method: Compositional matrix adjust. Identities = 92/230 (40%), Positives = 137/230 (59%), Gaps = 13/230 (5%) Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 EK A L++ ++A N+ NQ+I +WL +YG ARV+++ Sbjct: 110 EKQFVQGATQIAQGLANNNATEAAINYARNRGEGLLNQKISDWLNQYGKARVQIS----- 164 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 S K ++L P+ D P ++LF+Q I + R+ +N+G G+R + N WM G+N+F D+ Sbjct: 165 SNKTGDADLLLPLIDKPNSLLFSQIGIRANEQRSTTNLGLGYRQYQQN-WMWGINSFYDY 223 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS--PDIEDYQERPANGWDIRAEGYL 246 D+S + R G+G E W YLKL+ NGY R + W +S ++ DY ERPANG+D+RAEGYL Sbjct: 224 DISGGNARFGLGGELWAYYLKLAVNGYFRLTDWHQSFLHEMRDYDERPANGFDLRAEGYL 283 Query: 247 PAWPQLGASLMYEQYYGDEVGL----FGKDKRQKDPHAISAEVTYTPVPL 292 P++P LGA YEQY+GD V L KD + +P A++ ++YTP PL Sbjct: 284 PSYPHLGAYAKYEQYFGDGVSLSHNPTAKDLKD-NPSAVTFGLSYTPFPL 332 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 172 bits (436), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 84/204 (41%), Positives = 126/204 (61%), Gaps = 3/204 (1%) Query: 92 TRNFITGMATAKANQEIQEWLGKYG-TARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-ML 149 R F A+++ + WL +G ++RV ++ ++F+ + + ++L P++++ + M+ Sbjct: 128 VRQFGQDQLNTLASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMI 187 Query: 150 FTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 F+Q + DDRT NIG G R+F G WM G N F D+D S S+ RIG+GAE D L+ Sbjct: 188 FSQLGVRHADDRTTGNIGLGARYF-GEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALR 246 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 L+ANGY + +GW S I D+ ERPANGWDI +LP +PQLG + YEQYYGD V L Sbjct: 247 LAANGYFKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALI 306 Query: 270 GKDKRQKDPHAISAEVTYTPVPLT 293 + + Q +P A + V +TP+PL Sbjct: 307 SRGRLQHNPSAATLGVNWTPIPLV 330 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 170 bits (430), Expect = 6e-41, Method: Compositional matrix adjust. Identities = 87/221 (39%), Positives = 125/221 (56%), Gaps = 6/221 (2%) Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS--LKD 132 + +A + L S P D + G + +Q I+ WL +YG AR+ LN D S L Sbjct: 18 YTKSAASLLKSGPAFD---QYAAGKISQLTSQAIEGWLKQYGNARITLNAQSDNSTALAG 74 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-NIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SS ++L+ +++ + + + Q H D N+G G R+F N M G N F D +++ Sbjct: 75 SSADLLFGLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNIN 134 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQ 251 +R GVG E WRDY K S NGY S W+ S +EDY E+ A+G+D++ E YLP + Q Sbjct: 135 SGVSRSGVGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQ 194 Query: 252 LGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 LG L YEQY+GD V LF + Q DP AI+ ++YTP+PL Sbjct: 195 LGGHLKYEQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPL 235 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 159 bits (403), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 84/205 (40%), Positives = 118/205 (57%), Gaps = 6/205 (2%) Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 SD +++ I M A E ++ G R L + D L S+++ YP+YD + + Sbjct: 90 SDISKSGIADMGFAALQPETEK---SAGEVRANLPL-SDGKLTSGSIDLFYPLYDGDSRL 145 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS-RSHTRIGVGAEYWRDY 207 F Q R D R N+G G R+F G DW G NTF D +S +H R+G G EYWRDY Sbjct: 146 FFGQVGARRFDGRNIVNLGIGQRYFQG-DWALGYNTFYDIQISGNAHQRLGFGLEYWRDY 204 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L LSANGY + W S ++ Y ER ANG+DIRA+G+ P +PQL L +EQY+GD++ Sbjct: 205 LYLSANGYFGLTDWYSSSALDGYAERAANGYDIRAQGWFPVYPQLSGKLKFEQYFGDDIA 264 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPL 292 L R K+P+A++ + YTP+ L Sbjct: 265 LLNHQNRYKNPYALTMGLEYTPIQL 289 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 157 bits (398), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 78/188 (41%), Positives = 113/188 (60%), Gaps = 2/188 (1%) Query: 106 QEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSN 165 Q ++ + +G + L+VD S +L P Y +++LF+Q ++++RT + Sbjct: 191 QNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSHVLFSQLGF-QSEERTIGH 249 Query: 166 IGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 IG G R + ++ G N FID+DL + H R+ +G E +Y KLS N Y + W+ S Sbjct: 250 IGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASNYFKLSTNYYWPITKWRDS 309 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 D+EDY ERPA G+DIR +GYLP +PQLG + YEQY+G EV LF K KRQK+P A+S Sbjct: 310 FDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEVALFNKTKRQKNPKAVSIG 369 Query: 285 VTYTPVPL 292 + Y P PL Sbjct: 370 IDYRPFPL 377 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 155 bits (391), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 86/233 (36%), Positives = 123/233 (52%), Gaps = 5/233 (2%) Query: 65 DNNVEKNVASFAANAG----TFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 D EK A+ A G +SSQ ++ G A++ Q+ QE L G A++ Sbjct: 66 DAEREKEWATMAKQLGERNLNNVSSQQVRTRAESYAVGQASSVLQQQAQELLSPLGNAKL 125 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 L + SS ++ P+YD + ++Q + + + + N G G R +G DW+ Sbjct: 126 SLVMSDQGDFSGSSGQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAG-DWLL 184 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 G NT +D D R H R +GAE W D+L+ SAN Y S + D + RPA+G+DI Sbjct: 185 GYNTVLDSDFERHHNRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDI 244 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +GYLP + Q+G SL YEQY+G+ V LFG K+Q DP A+ V YTPVPL Sbjct: 245 TTQGYLPFYRQIGGSLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLV 297 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 150 bits (378), Expect = 7e-35, Method: Compositional matrix adjust. Identities = 98/293 (33%), Positives = 150/293 (51%), Gaps = 32/293 (10%) Query: 18 LARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN------ 71 LAR +AW + Q+L P AA AQ A+ P + T AD++V+K Sbjct: 8 LARWLAWVLVGTQLLTP---------AALAQ-AMLPEI----TRSGADSSVDKTDQPEAE 53 Query: 72 -VASFAANAGTFLSSQPDSDATRNFITGMATAKAN----QEIQEWLGKYGTARVKLNVDK 126 +AS A++ G+ L SD +N I + AN I+ WL + R + ++ Sbjct: 54 WLASRASSLGSLLQEGNISDFAKNQIQALPQTIANDGITSGIKHWLPE-AQFRGGITLED 112 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAG 181 + + ++L P+Y + +++LF Q + D+ R N G GWR G DW+ G Sbjct: 113 ASKYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVNTGIGWRQDVG-DWLLG 171 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 +N+F+D D+ H R +G E +RD + L+ N Y S WK S + ERPA G D+R Sbjct: 172 INSFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKVQPLHDERPATGIDVR 231 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 +G LP+ P GA L +EQY+GD+V + G D +DP A + +T+ PVPL + Sbjct: 232 LKGALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAITWKPVPLVE 284 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 80/239 (33%), Positives = 122/239 (51%), Gaps = 9/239 (3%) Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA---RV 120 +DN ++ +A A++ L D + I + AN + E + R Sbjct: 29 SDNEIQSWIAGTASSISPHLQEGTLEDYAKGKIKALPGQAANHLVNEGIKSAFPEIIFRG 88 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSG 175 +N++ + S +M P+ +T +++LF Q D+ RT N+G G+R Sbjct: 89 GVNLEDGAKYRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGRTYVNVGMGYRQ-EV 147 Query: 176 NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 N W+ GVNTF+D D+ SH R G+G E ++D L S N Y +GWK S E + ERPA Sbjct: 148 NGWLLGVNTFLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSAAHELHDERPA 207 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 G+D+R +G LP +P L YEQYYGD+V L G ++P A A++ + PVPL + Sbjct: 208 YGFDLRTKGTLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGADLVWNPVPLLE 266 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 121 bits (304), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 83/241 (34%), Positives = 118/241 (48%), Gaps = 13/241 (5%) Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGM---------ATAKANQEIQEW 111 TV ++ + K +A A + G S DSD + G+ A +A E ++ Sbjct: 99 TVHDNDQLAKKIAEAAKSIGE-ASMNSDSDRSLREEAGIWVFNRFRDAAKQRAASEGEQL 157 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L YG A V L + D S SS +++ P D + + F+Q I +++ + N G G R Sbjct: 158 LSPYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQLGIEQSEYGSVGNAGLGQR 217 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 +G+ W G N F+D L R +GAE W YL+ SAN Y SG + + Sbjct: 218 WIAGS-WRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSANYYQPLSGCRNHSN--SAL 274 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 R A G+DI GYLP + QLG +L YEQY G+ V LF +P A+S + YTPVP Sbjct: 275 MRMARGYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGNAVANPAAVSLGINYTPVP 334 Query: 292 L 292 L Sbjct: 335 L 335 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 67/193 (34%), Positives = 101/193 (52%), Gaps = 3/193 (1%) Query: 101 TAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 + + NQ ++ WL +G A V + VD + S P+ D + ++Q + + D+ Sbjct: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 SN+G G R GN W+ G NTF D+ L + R G GAE W +YL+LSAN Y + Sbjct: 155 GLVSNVGVGQRWARGN-WLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 Query: 221 WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHA 280 W + ++ +R A G+D+ A +P + L S+ EQY+GD V LF +P A Sbjct: 214 WHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 Query: 281 ISAEVTYTPVPLT 293 +S + YTPVPL Sbjct: 272 LSLGLNYTPVPLV 284 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 117 bits (294), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 67/193 (34%), Positives = 102/193 (52%), Gaps = 3/193 (1%) Query: 101 TAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 +A+ NQ ++ WL +G A V + VD S P D + ++Q + R +D Sbjct: 48 SAQVNQHLESWLSPWGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQED 107 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 SN+G G R ++ + W+ G NTF D+ L R G+GAE W +YL+LSAN Y + Sbjct: 108 GLVSNVGIGQR-WARDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFAS 166 Query: 221 WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHA 280 W + ++ +R A G+D+ A+ +P + L + EQY+GD V LF K +P A Sbjct: 167 WHERSATQE--QRMARGYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLA 224 Query: 281 ISAEVTYTPVPLT 293 +S + YTPVPL Sbjct: 225 VSLGLNYTPVPLV 237 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 115 bits (287), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 65/193 (33%), Positives = 101/193 (52%), Gaps = 3/193 (1%) Query: 101 TAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 ++ E + L +G A V L VD++ + SS + P D + ++Q + + + Sbjct: 67 SSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGVSQQNQ 126 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 N G G R +G+ W+ G NTF D +R G GAE W DYL+LSAN Y G Sbjct: 127 GLVGNAGIGQRWTAGH-WLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQPLGG 185 Query: 221 WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHA 280 W+ + ++R A G+D+ A+ YLP + + S+ +EQY+GD+V LF +P A Sbjct: 186 WEHRAGL--LEQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYHNPVA 243 Query: 281 ISAEVTYTPVPLT 293 + ++YTPVPL Sbjct: 244 VKVGLSYTPVPLV 256 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 108 bits (269), Expect = 3e-22, Method: Composition-based stats. Identities = 84/257 (32%), Positives = 120/257 (46%), Gaps = 33/257 (12%) Query: 65 DNNVEKNVASFA-------ANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 DNN+ K F+ AG S+ DS + + MA++ N E L K Sbjct: 292 DNNLSKEDQEFSNKVMKVIQTAGAIYDSE-DSKSKEEIVKNMASSYLNTSANE-LAKEFI 349 Query: 118 ARVKLNVDKDFSLK-------DSSLEMLYPIY--DTPTNMLFTQGAIHR-TDDRTQSNIG 167 + +++ DFS + + L PI D P F Q I +DRT + G Sbjct: 350 DSLNTSINTDFSFNYNERSGFSGNAKALLPIVSEDNPKISYFLQSGIGEFANDRTIGHFG 409 Query: 168 FGWRHF--------SGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 G R++ SGN M G+N+ DHD SR H R+ +GAE D L +AN Y R S Sbjct: 410 GGIRYYPNATALNNSGN-IMLGLNSVYDHDFSRGHKRMSLGAEAMVDTLAFNANVYQRLS 468 Query: 220 GWKKSPDIE-DY-QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK---DKR 274 W S D + DY QERPANGWD + + P+ + Q+YG++VG+FG D Sbjct: 469 SWIDSYDFDKDYVQERPANGWDAKIKYAFPSLINVSFFAKMGQWYGNKVGIFGANSVDDL 528 Query: 275 QKDPHAISAEVTYTPVP 291 +K+P ++Y+P P Sbjct: 529 EKNPLIYEGGISYSPFP 545 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 93.6 bits (231), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 54/145 (37%), Positives = 80/145 (55%), Gaps = 3/145 (2%) Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 M ++Q + + D SN+G G R ++ + W+ G NTF D+ L + R G GAE W +Y Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQR-WAQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEY 59 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L+LSAN Y + W+ ++R A G+DI A+ LP + + S+ EQY+GD V Sbjct: 60 LRLSANYYQPFADWQTH--TATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVD 117 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPL 292 LF +P A+ + YTPVPL Sbjct: 118 LFDSGTGYHNPVALKLGLNYTPVPL 142 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 89.4 bits (220), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 65/222 (29%), Positives = 100/222 (45%), Gaps = 12/222 (5%) Query: 79 AGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA------RVKLNVDKDFSLKD 132 AG+ S++ D D + G A A+AN+ +QE + R++ V+ DFS KD Sbjct: 83 AGSRASARVDGD----LLKGQAEAQANELLQEGVRLANQTELPFLRRLQGGVNYDFSNKD 138 Query: 133 SSLEM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDL 190 SL++ + ++ + + Q + H + R N G RH G N F+D++ Sbjct: 139 LSLDLRTIDEVHRGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEF 198 Query: 191 SRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 ++H R +G E L N Y SGWK + E +ERPA+GWD+ A P Sbjct: 199 GKNHLRGSLGGEVIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALP 258 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 L Y ++ G V F + Q++ V Y PVPL Sbjct: 259 GLAIKGQYFRWSGAAVDYFDNGRPQRNARGYKYGVEYRPVPL 300 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 87.0 bits (214), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 51/135 (37%), Positives = 66/135 (48%), Gaps = 5/135 (3%) Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 R +++G G+R + + GVN F D+DLSR HTR+ VGAEY DY S N Y S Sbjct: 36 RDFAHLGLGYRQLDDSQFF-GVNVFFDYDLSRQHTRVSVGAEYGLDYGTFSTNAYFPLSN 94 Query: 221 WKKSPD----IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQK 276 WK SPD + E+ A GWD+ E YLP + L QY G V K Sbjct: 95 WKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLDTRWKFGLTAGQYLGRYVEHSDGSLPSK 154 Query: 277 DPHAISAEVTYTPVP 291 +P+ S + P P Sbjct: 155 NPYHFSLSTEFRPDP 169 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 83.6 bits (205), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 59/206 (28%), Positives = 96/206 (46%), Gaps = 8/206 (3%) Query: 95 FITGMATAKANQEIQE---WLGKYGTA---RVKLNVDKDFSLKDSSLEM--LYPIYDTPT 146 F+ A A+AN +Q+ W + G R++ NV DFS +D ++++ + ++ Sbjct: 105 FLRSQAQAQANVLVQQGVQWANETGLPWLRRLEGNVSYDFSGRDVAVDVRTIDALHLDQD 164 Query: 147 NMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRD 206 L Q H + R N G R +G+ + G N F+D+++ + H R +GAE Sbjct: 165 RALLLQLGGHNQNHRPTVNAGVVARSAAGSSLILGGNAFLDYEVGKRHLRGSLGAEAVAA 224 Query: 207 YLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 L N Y SGWK + E +ERPA GWD+ A L + Y ++ G +V Sbjct: 225 QFTLYGNVYAPLSGWKAAKRAERREERPAAGWDVGFTARPEAVQGLALNAQYFRWRGAQV 284 Query: 267 GLFGKDKRQKDPHAISAEVTYTPVPL 292 F + +++P + Y PVPL Sbjct: 285 DYFDDGRYRRNPSGFKYGIEYRPVPL 310 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 73.6 bits (179), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 52/174 (29%), Positives = 80/174 (45%), Gaps = 13/174 (7%) Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLG-----KYGTARVKLNVDKDFSLKDSSLEMLY 139 S DS+ ++ + T+ A+ + +G + T V N+ + S D + +L Sbjct: 28 SADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDTVEVSTNLKEGDSTVD--IGVLK 85 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRSHTRIG 198 D P + LF Q ++R D RT N+GFG+R + ++ WM GVN F DH+ H R G Sbjct: 86 AFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAFYDHEFPNDHKRNG 145 Query: 199 VGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 VG E L+ N Y +G+ I+D + D R G+ A P L Sbjct: 146 VGFEVVSSVLESRVNSYNGTTGY-----IKDKSGTDSKVLDGRDMGFKVALPYL 194 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 71.6 bits (174), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 51/169 (30%), Positives = 74/169 (43%), Gaps = 4/169 (2%) Query: 127 DFSLKDSSLEM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 DF +SL++ + +Y N Q H +DR +N G +R + M G N Sbjct: 153 DFESGRTSLQLNTIDEVYRAGRNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANG 212 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+D++ + H R VG E L N Y S WK + +E+PA+G D+ G Sbjct: 213 FLDYEFGKQHLRGSVGLEVIAPEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDV-GVG 271 Query: 245 YLPAW-PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 Y PA+ P L S + ++ G EV F + Q V Y PV L Sbjct: 272 YRPAFAPGLSLSATHFRWNGAEVDYFDNGRTQAGAKGFKVGVEYRPVSL 320 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 69.3 bits (168), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 42/136 (30%), Positives = 69/136 (50%), Gaps = 14/136 (10%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + ++ I+ Q FP+A++ TP + + A + +LS +NN + +A + Sbjct: 16 KIFSYFIIASQFSFPIALSLTPTIQSYAATVEENKLSTN-----TENNNGRWLAQQTSQL 70 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 GT LSS DA ++ A +K N+EI+ W +YG A++ L VDK F+LK L+ L+ Sbjct: 71 GTILSSDNTHDAASQYLINQANSKVNREIENWFNQYGKAQINLGVDKHFTLKTQKLKSLF 130 Query: 140 PIYDTPTNMLFTQGAI 155 LFT+ I Sbjct: 131 ---------LFTKQTI 137 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 67.8 bits (164), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 87/186 (46%), Gaps = 8/186 (4%) Query: 56 SMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQE----W 111 ++G+ +V ++ EK +A+ A + R+++ G A + +Q+ Sbjct: 59 NLGSASVN-ESGTEKKLATLARQMAEVNQDENTDQTWRSYLLGEAKDRVLDRLQQKSEAL 117 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML-FTQGAIHRTDDRTQSNIGFGW 170 L G V L+VD+ SS ++L P+ D T L ++Q + DD N+G Sbjct: 118 LSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRGLTYSQLGLQGVDDGVVGNMGLRQ 177 Query: 171 RHFSGNDWMAGVNTFIDHDLSRSHTRIG-VGAEYWRDYLKLSANGYIRASGWKKSPDIED 229 R +G W+ G N F D L++ +R G +GAE DYL LS+N Y SG + D ED Sbjct: 178 RWNAGR-WLLGYNVFYDQYLNQDASRRGSIGAEARSDYLTLSSNYYYPLSGMHAANDDED 236 Query: 230 YQERPA 235 R A Sbjct: 237 ELLRMA 242 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 66.2 bits (160), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 37/106 (34%), Positives = 54/106 (50%), Gaps = 3/106 (2%) Query: 143 DTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRSHTRIGVGA 201 D N FTQG++ D+RT N+G G+R S N + G+N F DH+ H R +G Sbjct: 108 DDIFNTYFTQGSVFYEDNRTTLNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGL 167 Query: 202 EYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 E +++AN Y + WK + +ER +G+DI A LP Sbjct: 168 EARTTVWEINANKYWATTKWKTGKN--GLEERALDGYDIEAGVPLP 211 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 65.5 bits (158), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 47/141 (33%), Positives = 67/141 (47%), Gaps = 7/141 (4%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP--TNMLFTQGAIHRTDD-RTQSNIGFGW 170 K+ TA + L+ + S L ++ PI D N++FTQ ++ +DD R N+GFG Sbjct: 8 KFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFGN 66 Query: 171 RHFSGND-WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIED 229 R +D + G N F DH+L H R +G E L AN Y SGWK + + Sbjct: 67 RKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKSG--LNN 124 Query: 230 YQERPANGWDIRAEGYLPAWP 250 E+ NG D+ LP P Sbjct: 125 INEKALNGSDVELGMPLPYLP 145 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 64.7 bits (156), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 45/171 (26%), Positives = 72/171 (42%), Gaps = 2/171 (1%) Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 L D D SL + + + L Q +H + R +N G R + + G Sbjct: 83 LRYDLDPGRLSFSLRTIDDLMVSERRALMLQAGLHNQNQRPTANTGIVLRQQASPGLIVG 142 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F+D++ + H R +G E + L AN Y SGWK + +ERPA G+D+ Sbjct: 143 SNAFLDYEFGKQHVRGSLGLEAIAPHYSLYANYYAPLSGWKGARRDSRREERPAAGYDL- 201 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 G L + L Y +++G + +F + Q++ V Y P L Sbjct: 202 -GGQLSSDAGLSLQAAYFRWHGAGIDVFDSGRAQRNASGFRYGVAYQPGAL 251 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 63.5 bits (153), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 48/169 (28%), Positives = 72/169 (42%), Gaps = 4/169 (2%) Query: 127 DFSLKDSSLEM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 DF +SLE+ + +Y N Q H ++R +N+G +R M G N Sbjct: 149 DFDNGRTSLELRTIDQVYRKGANTGLLQLGGHNQNNRPTANLGGVYRRDINERLMLGANA 208 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+D++ ++ H R +G E N Y SGW + +ERPA+G D+ + Sbjct: 209 FLDYEFAKQHLRGSLGVEAIAPEFSFYGNVYAPMSGWTGAKRDNRREERPASGMDLGMK- 267 Query: 245 YLPAW-PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 Y P + P L Y ++ G V F + Q V Y PVPL Sbjct: 268 YSPGFAPGLSLKANYFRWNGAAVDYFDNGRTQDRATGFKYGVQYKPVPL 316 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 33/109 (30%), Positives = 57/109 (52%), Gaps = 1/109 (0%) Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAI 155 + + + + NQ+++ WL +G+A V +NVD + S P+ D + ++Q + Sbjct: 131 VRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSWFIPLQDKQRYLTWSQLGL 190 Query: 156 HRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYW 204 + D SN+G G R ++ + W+ G NTF D+ L + R G GAE W Sbjct: 191 TQQTDGLVSNVGIGQR-WAQDGWLLGYNTFYDNLLDENLQRAGFGAEAW 238 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 63.5 bits (153), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 49/135 (36%), Positives = 62/135 (45%), Gaps = 15/135 (11%) Query: 165 NIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 N G G RH G + +AGVN + D+ + S SH+R G+G E + L L+ N YI +G Sbjct: 184 NAGLGLRHLIGEELLAGVNGYWDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIAGTGT 243 Query: 222 KK-SPDIEDYQERPANGWDIRAEGYLPAWPQL-----GASLMYEQYYGDEVGLFGKDKRQ 275 K S + DY ER GWD LP+ P + G Y D G GK Q Sbjct: 244 KTISTNNTDYYERVVPGWDFELGYRLPSNPNIAFFARGFRWDYRN-RNDNTGFQGKVTYQ 302 Query: 276 KDPHA-----ISAEV 285 PH IS EV Sbjct: 303 MTPHVRLDSWISNEV 317 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 62.0 bits (149), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 49/147 (33%), Positives = 75/147 (51%), Gaps = 13/147 (8%) Query: 145 PTNMLFTQGAIH-RTDDRTQSNIGFGWRHFSGNDWMAGVNTFID---HDLSRSHTRIGVG 200 P ++F Q + T + Q N+G G R G++ + G+N F D + S ++TR G+G Sbjct: 196 PMGLMFGQARVTLETSAQPQVNVGLGSRFRLGDEAIVGLNGFWDLRTTNYSTAYTRWGIG 255 Query: 201 AE-YWRDYLKLSANGYIRASGWKK-SPDIEDYQERPANGWDIRAEGYLPAWPQL-----G 253 AE +W+ + +L N YI S K + + DY ER GWD+ +P++PQL G Sbjct: 256 AEGFWKSF-ELRNNWYINGSADKNITINNIDYVERVVPGWDVEVGYRIPSYPQLAIFVRG 314 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHA 280 + Y Q + D G+ G Q PHA Sbjct: 315 FNWDY-QDHSDNSGIEGSVNWQATPHA 340 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 60.8 bits (146), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 4/127 (3%) Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS-GNDWMA 180 + +DK+ S + +Y + +T LF Q + ++RT N GFG RH + N + Sbjct: 90 MGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNNRTTINTGFGARHINDANTVIT 149 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 G N F D++L H R+G G E + AN Y S K+ QE +G+D Sbjct: 150 GYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVS---KTLTYNGIQETALDGYDA 206 Query: 241 RAEGYLP 247 + LP Sbjct: 207 KLTANLP 213 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 60.1 bits (144), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 46/161 (28%), Positives = 73/161 (45%), Gaps = 9/161 (5%) Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNV-DKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + A K +++I + G V L+ D D + S+ + I T + FTQ + Sbjct: 23 VASQALNKVSEKISNLIPGEGITEVSLDYNDGDEDQLNFSILGVRDIETTDNSNFFTQFS 82 Query: 155 IHR----TDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 + + R NIG G+R S + ++M G NTF D DL+ R+G+G E L Sbjct: 83 LMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRLGLGIEAKGSILD 142 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 L+AN Y + S S + +E+ +GWD +P P Sbjct: 143 LTANSYTKIS---NSEVVNGDREQVLSGWDFNLTSQIPRAP 180 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 32/86 (37%), Positives = 53/86 (61%), Gaps = 3/86 (3%) Query: 51 VQPRLSMGNTTVTADN---NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQE 107 VQ ++S N T N N+E+ +AS + G+ L+ +S+ N G A+++A+ Sbjct: 159 VQAQVSEKNLTPPPGNSSGNLEQQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGV 218 Query: 108 IQEWLGKYGTARVKLNVDKDFSLKDS 133 + +WL ++GTAR+ L VD+DFSLK+S Sbjct: 219 MTDWLSRFGTARITLGVDEDFSLKNS 244 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 57.4 bits (137), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 65/202 (32%), Positives = 95/202 (47%), Gaps = 25/202 (12%) Query: 99 MATAKANQEIQEWLGKYGTARVKLN--VDKDFSLKDSSLEMLYPIY-----DTPTNMLFT 151 A AKAN EIQ+ + + V ++ + D S +SL L + D T + F+ Sbjct: 94 FANAKANGEIQK-IPFFAQTSVNISGGTESDTSFSINSLMKLGELAKDDQGDLKT-LAFS 151 Query: 152 QG--AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEY-WR 205 Q A + + NIG G R+ + M G N F D+ D S +H+R+G+G EY W+ Sbjct: 152 QARFATATNAEGSTINIGLGIRNRPDDISMVGANAFWDYRMTDYSDAHSRLGLGGEYFWK 211 Query: 206 DYLKLSANGYIRASGWKKSPDIE--DYQERPANGWDIRAEGYLPAWPQL-----GASLMY 258 D+ + N Y+ + +K I+ DYQER GWD+ LP P+L G + Y Sbjct: 212 DF-EFRNNWYMAITN-EKDVIIKGVDYQERVVPGWDLEVGYRLPNNPELAFYIRGFNWDY 269 Query: 259 EQYYGDEVGLFGKDKRQKDPHA 280 +Y D GL G Q PH Sbjct: 270 -KYTQDNSGLEGAVSWQATPHV 290 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 32/114 (28%), Positives = 55/114 (48%), Gaps = 4/114 (3%) Query: 140 PIY--DTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF-SGNDWMAGVNTFIDHDLSRSHTR 196 P+Y + + +F +G D R + G+RH S N M G N H+ R+H R Sbjct: 68 PVYVSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQR 127 Query: 197 IGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 I GAE ++++N Y R + WK + +++ +E+ G+D+ +P P Sbjct: 128 ISYGAEIRTSVFEINSNYYHRLTDWKLT-GVDNNEEKARGGYDVELALAVPYVP 180 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 52.0 bits (123), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 37/135 (27%), Positives = 65/135 (48%), Gaps = 10/135 (7%) Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK----DFSLKDSSLEMLYPIYDTP 145 DA +N + + N + ++ ++G +++V K + S + + L P+ + Sbjct: 21 DAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKGDEASYSIITTQPLAPLSEDG 80 Query: 146 TNMLFTQGAI----HRTDDRTQSNIGFGWR-HFSGNDWMAGVNTFIDHDLSRSHTRIGVG 200 + LF QG++ D RT N+G G R G + G+N+F D++ S H R+ +G Sbjct: 81 SR-LFWQGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIVGINSFYDYEFSAKHKRMSLG 139 Query: 201 AEYWRDYLKLSANGY 215 EY R +LS N Y Sbjct: 140 GEYKRSNAELSVNKY 154 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 51.6 bits (122), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 37/117 (31%), Positives = 60/117 (51%), Gaps = 11/117 (9%) Query: 180 AGVNTFIDHDLSRSHTRIGVGAEYWR-DYL-KLSANGYIRASGWKKSPDIEDYQERPANG 237 GVN F D +++ + R+ +G++Y +Y+ LS+N Y SG D+ N Sbjct: 154 VGVNAFWDVEMNSGNHRLSLGSKYDDPNYIFNLSSNIYFPLSGKGSEDDL-------VNS 206 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 DIRAEG + Q +SL E ++GD++ + + H +A + YTP+PL Q Sbjct: 207 IDIRAEGAITPTVQFHSSL--EFFFGDDIQINDDYDPTNNSHKFTAGLDYTPIPLLQ 261 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 3/100 (3%) Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 +F Q +++ ++ N+G G R +D + G+N F D+ SH R G G E Sbjct: 97 IFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQRNGAGLEVLSSV 156 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 L +N Y SG + D E +GWD+R + +LP Sbjct: 157 FDLRSNIYDATSGIEAVSTSRD--EEAMDGWDMRLDYHLP 194 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 79/306 (25%), Positives = 114/306 (37%), Gaps = 62/306 (20%) Query: 46 RAQHAVQPRLSMGN-----TTVTADNNVEKNV-------ASFAANAGTFLSSQPDSDA-- 91 RA AV L G T V ADN V AS+A L+S SD Sbjct: 51 RACKAVAGALEAGQSVRCETLVDADNQSNSTVQKIFVTGASYATRIFPLLNSASLSDGIQ 110 Query: 92 ------TRNFITGMATAKANQEIQEWLGKYGTAR--VKLNVDKDFSLKDSSLEMLYPIYD 143 +++FI A N+ + + + V D D + +SL L + Sbjct: 111 KMLWMDSKSFIVSFAHDYLNEYVLKQIPFLSQTEFGVGFESDADMTYYLNSLISLAQLGS 170 Query: 144 T----PTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHT 195 P +LF QG+ +N+G G R ++ M G N F D+ + S S++ Sbjct: 171 DDNGYPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLRDNAMLGANAFWDYRFTNYSSSYS 230 Query: 196 RIGVGAEYWRDYLKLSANGYIRASGWKKSP-----------------------DIEDYQE 232 R G GAE W D KL+ N YI +G K+ + E Sbjct: 231 RWGAGAELWWDDFKLTNNWYIAGTGIKRITTSGRAYTDTTSLAAGTYDETTLLGANTFDE 290 Query: 233 RPANGWDIRAEGYLPAWPQLGASLMYEQY----YGDEVGLFGKDKRQKDPHA-----ISA 283 R GWD+ LP++PQL + ++ D G+ G Q PH IS+ Sbjct: 291 RVVPGWDVALNYRLPSYPQLSLGIRGFRWDYMRKSDNSGVEGSVNWQATPHTNLSAWISS 350 Query: 284 EVTYTP 289 E+ P Sbjct: 351 EIPAYP 356 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 47.8 bits (112), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 39/172 (22%), Positives = 75/172 (43%), Gaps = 10/172 (5%) Query: 130 LKDSSLEMLYPIYDT----PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 L++ S++M + Y N+ F Q IH N G G RH + + + G NTF Sbjct: 90 LQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKYAIGYNTF 149 Query: 186 IDHDLSRSHTR---IGVGAEYW-RDYLKLSANGYIRASGWKKSPDIEDYQ-ERPANGWDI 240 +S+ ++ I VG EYW + L + N Y + + ++ P +G + Sbjct: 150 YHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHYPRSGHQL 209 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 + P + + + EQ+ ++ +K+ D + +S ++ Y P+P+ Sbjct: 210 YIQTKFPRFFEFTGKIKLEQFIYEKKYKKIFNKKNSD-YYLSLDLNYQPIPM 260 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 38/113 (33%), Positives = 56/113 (49%), Gaps = 7/113 (6%) Query: 134 SLEMLYPI--YDTPT-NMLFTQGAIHRTDDR-TQSNIGFGWRHFSGNDW-MAGVNTFIDH 188 S+E + P+ YD + ++ FTQ I R D T NIG G+R S +D + G + F DH Sbjct: 122 SVETVQPLGHYDNSSRDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHLFYDH 181 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ-ERPANGWDI 240 H R+ G EY + N Y AS ++ D+ + ER ANG+ + Sbjct: 182 RFLNRHNRLSAGLEYMSGESEFRFNWYGSASD-ERVLDVNLHTLERVANGYTV 233 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 43.5 bits (101), Expect = 0.010, Method: Compositional matrix adjust. Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 12/88 (13%) Query: 137 MLYPIYDT--PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH------ 188 M++P + + P + A + ++G G RHFS N WM G+N + D+ Sbjct: 102 MIFPFFSSCRPFQIFLDGKAFLFDHGKWGGSVGIGLRHFSYNGWMVGLNGYYDYRRFNGW 161 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYI 216 DL+ ++G+G E D ++ NGY+ Sbjct: 162 DLN----QLGLGVELLGDCVEFRVNGYL 185 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 42.7 bits (99), Expect = 0.013, Method: Compositional matrix adjust. Identities = 37/109 (33%), Positives = 53/109 (48%), Gaps = 6/109 (5%) Query: 135 LEMLYPI--YD-TPTNMLFTQGAI-HRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHD 189 +E L P+ YD T ++ FTQ + + D T +N+G G+R + ND G N F DH Sbjct: 133 VETLQPLGHYDETSRHVWFTQERLANAADTGTTANVGIGYRRIAENDDHYYGGNLFYDHR 192 Query: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGW 238 +H R+ VG EY N Y SG ++S D E +NG+ Sbjct: 193 FRGNHGRMSVGLEYVSGIGAFRMNWYRGVSG-ERSLDGATRMENVSNGY 240 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 42.0 bits (97), Expect = 0.025, Method: Compositional matrix adjust. Identities = 39/188 (20%), Positives = 70/188 (37%), Gaps = 19/188 (10%) Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNM------LFTQGAIHRTDDRTQSNIGFGWRHFS 174 K N +K+ S++ + + + P NM F Q + + G G R Sbjct: 95 KYNNQSQIQIKNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVY 154 Query: 175 GNDWMAGVNTFIDHDLSRSHTR---IGVGAEYWRDYLKLSANGYIRASGW---KKSPDIE 228 + G N H +S ++ I +G EYW LK N Y + K+ Sbjct: 155 NKKHIIGYNACYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNH 214 Query: 229 DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE----VGLFGKDKRQKDPHAISAE 284 Y + P G+ I A+ P + + +EQ D+ + + + + H + Sbjct: 215 HYYQYPKIGYQICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVS 271 Query: 285 VTYTPVPL 292 + Y P+P+ Sbjct: 272 LEYQPIPM 279 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 40.8 bits (94), Expect = 0.053, Method: Compositional matrix adjust. Identities = 24/62 (38%), Positives = 33/62 (53%), Gaps = 1/62 (1%) Query: 165 NIGFGWRHFSGNDW-MAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKK 223 +IG G+R S N+ G+NTF D+ +R+G+G EY K+SAN Y S K Sbjct: 182 SIGAGYRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANVYHGLSEKKT 241 Query: 224 SP 225 P Sbjct: 242 KP 243 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 40.0 bits (92), Expect = 0.095, Method: Compositional matrix adjust. Identities = 24/65 (36%), Positives = 34/65 (52%), Gaps = 1/65 (1%) Query: 162 TQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 T +N+G G+R S ++ GVNTF DH S+ + RI G EY ++ AN Y + Sbjct: 142 TVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLNS 201 Query: 221 WKKSP 225 K P Sbjct: 202 TKSEP 206 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P36943 Putative attaching and effacing protein homolog ... 453 e-126 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 358 2e-97 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 355 7e-97 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 355 1e-96 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 353 4e-96 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 352 9e-96 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 351 2e-95 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 349 8e-95 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 348 2e-94 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 346 5e-94 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 345 1e-93 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 344 3e-93 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 343 5e-93 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 342 8e-93 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 342 9e-93 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 340 3e-92 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 338 1e-91 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 333 3e-90 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 332 7e-90 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 330 3e-89 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 329 7e-89 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 328 2e-88 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 326 4e-88 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 326 5e-88 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 325 1e-87 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 320 4e-86 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 316 6e-85 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 316 6e-85 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 315 9e-85 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 314 2e-84 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 314 2e-84 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 312 9e-84 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 311 1e-83 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 311 2e-83 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 310 5e-83 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 305 1e-81 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 304 2e-81 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 304 2e-81 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 303 7e-81 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 302 7e-81 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 301 1e-80 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 300 4e-80 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 296 6e-79 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 295 1e-78 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 293 5e-78 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 287 3e-76 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 283 4e-75 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 281 2e-74 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 280 3e-74 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 276 9e-73 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 270 3e-71 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 270 4e-71 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 267 3e-70 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 267 4e-70 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 264 2e-69 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 261 2e-68 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 259 1e-67 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 254 4e-66 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 248 2e-64 UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 239 1e-61 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 236 5e-61 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 233 5e-60 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 219 1e-55 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 213 5e-54 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 199 1e-49 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 194 4e-48 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 193 7e-48 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 186 9e-46 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 177 4e-43 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 155 2e-36 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 147 3e-34 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 147 4e-34 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 144 4e-33 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 143 5e-33 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 142 1e-32 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 142 1e-32 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 136 1e-30 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 135 1e-30 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 132 2e-29 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 128 2e-28 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 127 5e-28 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 126 1e-27 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 122 2e-26 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 114 3e-24 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 114 4e-24 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 111 3e-23 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 103 8e-21 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 83 1e-14 Sequences not found previously or not previously below threshold: UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured b... 97 7e-19 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 82 2e-14 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 82 2e-14 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 79 2e-13 UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylo... 75 3e-12 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 65 3e-09 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 65 4e-09 UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root Re... 63 1e-08 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 62 2e-08 UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthin... 62 2e-08 UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magneto... 60 8e-08 UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachl... 58 2e-07 UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuni... 58 5e-07 UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickett... 57 5e-07 UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Plancto... 56 1e-06 UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorick... 56 1e-06 UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus ... 56 2e-06 UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachl... 55 2e-06 UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=... 53 1e-05 UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microco... 53 1e-05 UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma p... 53 1e-05 UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachl... 53 1e-05 UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Plancto... 53 2e-05 UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=... 52 2e-05 UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodosp... 52 3e-05 UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillon... 52 3e-05 UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=... 50 1e-04 UniRef50_A8PQI7 Putative outer membrane autotransporter barrel d... 49 2e-04 UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Plancto... 49 2e-04 UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachl... 47 6e-04 UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Plancto... 47 6e-04 UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microco... 47 6e-04 UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryoc... 46 0.002 UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmone... 45 0.002 UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellu... 45 0.003 UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Ca... 45 0.004 UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legione... 44 0.006 UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microco... 43 0.017 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 453 bits (1166), Expect = e-126, Method: Composition-based stats. Identities = 295/295 (100%), Positives = 295/295 (100%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT Sbjct: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV Sbjct: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA Sbjct: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI Sbjct: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ Sbjct: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 358 bits (918), Expect = 2e-97, Method: Composition-based stats. Identities = 147/287 (51%), Positives = 185/287 (64%), Gaps = 11/287 (3%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 RF S L R VA I QVLFP+ A ++ + Sbjct: 1 MISARFHSSRLTRAVASLCIVTQVLFPV---------ASTAGHRVAAPQAAPAVLSEQDA 51 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 VA A L S +S G AT+ A QEWL ++GT RV L +D+D Sbjct: 52 TAAQVAGMTTQAAGMLQSGMNSRQAAEMARGYATSTAQSAFQEWLSQWGTVRVTLGLDED 111 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 F+LK S+ ++L P +DTP N+LFTQ + HRTDDR Q N G GWRHF+ + +MAGVN F D Sbjct: 112 FTLKGSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFAPD-YMAGVNLFFD 170 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYL 246 HDL+R H+R+G+G EYWRD LKL ANGY+R SGW+ +P+++ DY+ RPANGWD+RAEGYL Sbjct: 171 HDLTRYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARPANGWDVRAEGYL 230 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PA+PQLGA+LMYEQYYGDEV LFGKDKRQ+DPHA +A ++YTPVPL Sbjct: 231 PAYPQLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLI 277 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 355 bits (912), Expect = 7e-97, Method: Composition-based stats. Identities = 133/284 (46%), Positives = 183/284 (64%), Gaps = 10/284 (3%) Query: 10 QPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVE 69 + ++ AR ++ PL A + + + ++ Sbjct: 85 RKLNQFRTFARGFDHLQPGDELDVPL---------APLPAVTWAEETPVPASASKEDLQA 135 Query: 70 KNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS 129 + +A A+ AG FL++ P DA + GMAT A+ E+Q+WL ++GTAR++L+VD FS Sbjct: 136 QKIAGIASQAGNFLANSPRGDAAASIARGMATGAASTEVQQWLSQFGTARLQLDVDNKFS 195 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHD 189 LK+S L++L P+Y+ P ++FTQG++HRTDDRTQ+N+G G R F + +M G NTF+D+D Sbjct: 196 LKNSQLDLLIPLYEQPDKLVFTQGSLHRTDDRTQTNLGMGMRWF-NDGYMLGGNTFLDYD 254 Query: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAW 249 LSR H R+G+G EYWRDYLK+ AN Y+R + W+ S D DYQERPANGWD+ EG++PA Sbjct: 255 LSRDHARMGMGVEYWRDYLKIGANNYLRLTNWRDSKDFADYQERPANGWDMSLEGWVPAL 314 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG +L YEQYYG EV LFGKD RQKDPHAI+ V YTP PL Sbjct: 315 PQLGGNLKYEQYYGKEVALFGKDNRQKDPHAITVGVNYTPFPLL 358 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 355 bits (910), Expect = 1e-96, Method: Composition-based stats. Identities = 121/279 (43%), Positives = 165/279 (59%), Gaps = 7/279 (2%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y + AR + ++ P+ ++ A +A P+LS + + V Sbjct: 119 YRIFARGFEHVGVGDEIDIPVDMSSLNTQAGQA-----PKLSSAMREPSRAEKEAQAVGQ 173 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 + G LSS S+A MAT AN+EIQ+WL KYGTARV+LN+DK+FSL +S+ Sbjct: 174 LMS-VGATLSSTRPSEAAAGMARSMATNAANEEIQQWLSKYGTARVQLNLDKNFSLSESA 232 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 L+ P++D+ FTQ D R N+G G R + WM GVN F DHDL+ + Sbjct: 233 LDWFIPVWDSANLTAFTQLGARNKDRRNTINLGVGARTLL-DRWMLGVNMFYDHDLTGHN 291 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 +R+G+GAE W DYL+LS NGY+R S W +S D DY ER ANG+DIRA +LPA PQLG Sbjct: 292 SRLGIGAEAWTDYLQLSTNGYMRLSNWHQSRDFADYDERAANGFDIRANAWLPALPQLGG 351 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 L+YEQY G+ V LFGK+ Q++P+A++A V YTP PL Sbjct: 352 KLVYEQYIGENVALFGKENLQRNPYALTAGVNYTPFPLL 390 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 353 bits (906), Expect = 4e-96, Method: Composition-based stats. Identities = 124/296 (41%), Positives = 174/296 (58%), Gaps = 11/296 (3%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 M +++ H + + R + +A I +Q+ P ++ + + +A + S Sbjct: 12 MLFFRSTHMRSKTR-----KLLACIQIVLQLAPPSSLIYLSSV--FNANAEEITSSAEKE 64 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 + +VA A AG+ LSS SDA + + T KA QEWL ++GTARV Sbjct: 65 QGNPSDQNASSVAQTAVQAGSLLSSDNASDALGSAVVSAVTGKAASSAQEWLSQFGTARV 124 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 ++ D+ F+L DS L++L P+Y+ N+LFTQ R DDR N GFG+RHF + WM Sbjct: 125 NISTDEHFTLSDSELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHF-NDGWMW 183 Query: 181 GVNTFIDHDLS-RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWD 239 G N F D +S H R+G+ E DYL +SANGY+R S W S +DY ER A+G+D Sbjct: 184 GTNVFYDRQVSGNQHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFD 243 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 IRA GYLPA+PQLGA+++YEQY+GD VGLFG D RQKDP+A++ + YTPVPL Sbjct: 244 IRATGYLPAYPQLGANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLV 299 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 352 bits (903), Expect = 9e-96, Method: Composition-based stats. Identities = 126/289 (43%), Positives = 175/289 (60%), Gaps = 16/289 (5%) Query: 6 TGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD 65 + F S+ ++ + W+ I +Q+LFPL F PV AA P + TTV Sbjct: 5 SIKNNNSFFLSLKSKLIIWSQIVLQILFPLFTVF-PVHAA-------PATTTKETTVAMP 56 Query: 66 NNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVD 125 + E + + + +GT D ++ TGMAT+ A +Q+WL ++GTARV+LNVD Sbjct: 57 YSQELSTLASSTASGT--------DGAKSAATGMATSAAASSVQQWLSQFGTARVQLNVD 108 Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 + + DS++++L P+YD +LFTQ + D RT N+G G R F +WM G N F Sbjct: 109 DNGNWDDSAVDLLAPLYDNKKAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFGGNVF 168 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 D D + + R+G GAE W +YLKLSAN Y+ + W S D DY E+PA+G+DIRAEGY Sbjct: 169 FDDDFTGKNRRVGFGAEAWTNYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIRAEGY 228 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 LPA+PQLGA LMYEQYYGD+V LF D Q +P A++ ++YTPVPL Q Sbjct: 229 LPAYPQLGAKLMYEQYYGDKVALFDTDHLQSNPSAVTTGISYTPVPLVQ 277 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 351 bits (900), Expect = 2e-95, Method: Composition-based stats. Identities = 127/245 (51%), Positives = 176/245 (71%), Gaps = 2/245 (0%) Query: 50 AVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQ 109 A ++ + N+E+ +AS + G+ L+ +S+ N G A+++A+ + Sbjct: 119 AQVSENNLTPPPGNSSGNLEQQIASTSQQIGSLLAEDMNSEQAANMARGWASSQASGAMT 178 Query: 110 EWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFG 169 +WL ++GTAR+ L VD+DFSLK+S + L+P Y+TP N+ F+Q +HRTD+RTQ N G G Sbjct: 179 DWLSRFGTARITLGVDEDFSLKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLG 238 Query: 170 WRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE- 228 WRHF+ WM+G+N F DHDLSR H+R G+GAEYWRDYLKLS+NGY+R + W+ +P+++ Sbjct: 239 WRHFTP-TWMSGINFFFDHDLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDN 297 Query: 229 DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYT 288 DY+ RPANGWD+RAEG+LPAWP LG L+YEQYYGDEV LF KD RQ +PHAI+A + YT Sbjct: 298 DYEARPANGWDVRAEGWLPAWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYT 357 Query: 289 PVPLT 293 P PL Sbjct: 358 PFPLM 362 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 349 bits (895), Expect = 8e-95, Method: Composition-based stats. Identities = 135/281 (48%), Positives = 184/281 (65%), Gaps = 13/281 (4%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 ++ AR ++ P + Q+AV P +N +E +A Sbjct: 98 QFRTFARGFDNVRQGEELDVPATTLQK---SHEQQNAVPPA--------NGENTLENQIA 146 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 S + GT LS +S+ G A+++A+ + +WL +GTA++ L VD+DFSLK+S Sbjct: 147 STSQRVGTLLSQDMNSEQASGMARGWASSEASGAMTDWLNNFGTAKISLGVDEDFSLKNS 206 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 + L+P YDTP +LF+Q +HRTDDRTQ N G GWRHF+ + WM+G+N F DHDLSR Sbjct: 207 QFDFLHPWYDTPDYLLFSQHTLHRTDDRTQINTGLGWRHFTPS-WMSGINLFFDHDLSRY 265 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQL 252 H+R G+GAEYWRDYLKLS+N YI +GW+ +P+++ DY+ RPANGWD+RAEG+LPAWPQL Sbjct: 266 HSRAGLGAEYWRDYLKLSSNAYIGLTGWRSAPELDNDYEARPANGWDLRAEGWLPAWPQL 325 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G L+YEQYYGDEV LF K+ RQ +PHAI+A + YTP PL Sbjct: 326 GGKLVYEQYYGDEVALFDKNDRQSNPHAITAGLNYTPFPLL 366 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 348 bits (892), Expect = 2e-94, Method: Composition-based stats. Identities = 126/279 (45%), Positives = 177/279 (63%), Gaps = 6/279 (2%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 A ++ P + + + T D+ +A Sbjct: 96 LRTFAHGFDNLQPGDELDVPAVMP-----DGKPDSPAKTGDEQAATPPLKDDEGAMKMAD 150 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 A+ AGT LS+ PD DA + G +A A+ ++Q+WL ++GTARV+L D+ FSLK+S Sbjct: 151 MASRAGTLLSNSPDGDAALSMARGQISAVASGQVQQWLNQFGTARVQLEADEHFSLKNSQ 210 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 +++L P Y+ +LFTQG++HRTDDRTQ+N+GFG R+F+ + +M G N F D+DLS H Sbjct: 211 VDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAPS-YMLGGNIFGDYDLSHEH 269 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 +R G+G EYWRD+LKLSANGY+R S W+ SP++++YQERPANGWDIRA+ +LP+ PQLG Sbjct: 270 SRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANGWDIRAQAWLPSLPQLGG 329 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 L YEQYYG V LFGK+ Q++P AI+A V +TP PL Sbjct: 330 KLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPLL 368 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 346 bits (888), Expect = 5e-94, Method: Composition-based stats. Identities = 123/292 (42%), Positives = 171/292 (58%), Gaps = 17/292 (5%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 H ++ ++ + S+L + V WA I +Q+ FPL V P A+ A + +S +T Sbjct: 1 MHEQSIMEKNTLKISLLKKIVIWAQILLQIAFPLLV--LPAHASSGPGATETDMSDASTL 58 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 + + ++Q +DA +N T +AT A ++EWL +GTA+V Sbjct: 59 SASLASS---------------AAQNGADAMKNTATHLATTHAASTVEEWLSHFGTAQVT 103 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 L+VD + + +S+ + L P+YD ++LFTQ I D RT NIG G R F DWM G Sbjct: 104 LDVDDNGNWDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIGLGVRTFYVRDWMFG 163 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F D D + + RIG GAE W +YLKLSAN YI S W S D ++Y E+PA+G+D+R Sbjct: 164 GNVFFDDDFTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGDFDNYNEKPADGYDVR 223 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AEGYLP++PQLGA LMYEQYYGD V LF KD Q +P A++ + YTPVPL Sbjct: 224 AEGYLPSFPQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLNYTPVPLI 275 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 345 bits (885), Expect = 1e-93, Method: Composition-based stats. Identities = 129/274 (47%), Positives = 184/274 (67%), Gaps = 12/274 (4%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + VAW+ I++Q L+P ++FTP ++ ++ + A+ + ++S AA A Sbjct: 18 KVVAWSTIALQALYPALLSFTPTISH--------ASAVKASQAAAEQQELRGLSSLAAQA 69 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 G + + +F A+A +E+ EWL KYG AR++LNVD FSLKDS+ + LY Sbjct: 70 GRSIENG----HAGSFAANTVPAQATKEVVEWLQKYGNARIQLNVDDAFSLKDSAFDFLY 125 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P D ++LF+Q ++HRTDDRTQ+NIG G+R+F+ ++ M G N F D+DLSR H R+G Sbjct: 126 PWIDKKQHVLFSQTSLHRTDDRTQTNIGMGYRYFTADNSMLGANLFYDYDLSRHHARMGA 185 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 G EYWRDYL+ AN Y+R S WK S D++DYQERPA+GWDI +G+LP++PQLGASL YE Sbjct: 186 GVEYWRDYLRAGANAYLRLSKWKDSHDLDDYQERPADGWDIYTQGWLPSYPQLGASLKYE 245 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +YYG VGLFG D Q++P+A + ++YTPVPL Sbjct: 246 KYYGKNVGLFGSDHLQENPYAFTGGISYTPVPLV 279 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 344 bits (882), Expect = 3e-93, Method: Composition-based stats. Identities = 128/289 (44%), Positives = 181/289 (62%), Gaps = 9/289 (3%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTP-VMAARAQHAVQPRLSMGNTTVT 63 K +K + + +++ V WANI +Q +FPL++ FTP VMAA A + Sbjct: 13 KQLNKNKQLNKTRISKSVVWANIVIQAIFPLSIAFTPAVMAAETVGASDEK-------PR 65 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 + + E++ A+ A + L++ + + G A N+ +Q+W ++G+A+V+LN Sbjct: 66 SASQAEQSTANAATRLASILTNDDSAKQASSIARGTAANAGNEALQKWFNQFGSAKVQLN 125 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 +D+ SLK S L++L P+ D+P + FTQ DDR N+G G RHF M G N Sbjct: 126 LDEKLSLKGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQ-MLGYN 184 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F+DHD S SHTRIGVGAEY RD++ L+ANGY SGWK SPD++ Y E+ ANG+D+R+E Sbjct: 185 LFVDHDASYSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSE 244 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 YLP PQLG L+YEQY+GDEVGLFG D RQK+P A++ V YTP+PL Sbjct: 245 AYLPTLPQLGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPL 293 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 343 bits (880), Expect = 5e-93, Method: Composition-based stats. Identities = 143/281 (50%), Positives = 190/281 (67%), Gaps = 12/281 (4%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 ++ AR ++ PL + +P AR A+Q + + VA Sbjct: 97 QFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM--------QVA 146 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DFSLK S Sbjct: 147 EMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDFSLKSS 206 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 S E L+P Y+TP N++F+Q +HRTDDRTQ+N G GWR+F+ + WM+GVN FIDHDL+R Sbjct: 207 SFEFLHPWYETPDNLVFSQHTLHRTDDRTQTNHGIGWRYFT-SSWMSGVNMFIDHDLTRY 265 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQL 252 HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LPAWPQL Sbjct: 266 HTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLPAWPQL 325 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G L+YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 326 GGKLVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 342 bits (878), Expect = 8e-93, Method: Composition-based stats. Identities = 124/234 (52%), Positives = 170/234 (72%), Gaps = 1/234 (0%) Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTAR 119 + D+ + +A A+ AG FLS P+ DA + G TA+A+ ++Q+WL ++GTAR Sbjct: 158 ASALGDDAGARKMADVASRAGAFLSDNPNGDAALSLARGEVTAEASGQLQQWLNQFGTAR 217 Query: 120 VKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM 179 V+L+ D+ FS K+S ++L P+Y+ +++FTQG++HRTDDRTQ N+GFG R+F+ + +M Sbjct: 218 VQLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQVNLGFGLRYFAPS-YM 276 Query: 180 AGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWD 239 G N F D+DLSR+H+R G+G EYWRD+LKLSANGY+R S W S D +DYQERPANGWD Sbjct: 277 LGGNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNSSDFKDYQERPANGWD 336 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 IRA+ +LP+ PQLG L YEQYYG V LFGK+ Q+DP AI+A V +TP PL Sbjct: 337 IRAQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAGVNFTPFPLL 390 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 342 bits (877), Expect = 9e-93, Method: Composition-based stats. Identities = 141/281 (50%), Positives = 190/281 (67%), Gaps = 12/281 (4%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 ++ AR ++ PL + +P AR A+Q + + VA Sbjct: 97 QFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM--------QVA 146 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DFSLK S Sbjct: 147 EMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDFSLKSS 206 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 S E L+P Y+TP N++F+Q +HRTD+RTQ+N G GWR+F+ + WM+GVN FIDHDL+R Sbjct: 207 SFEFLHPWYETPDNLVFSQHTLHRTDNRTQTNHGIGWRYFT-SSWMSGVNMFIDHDLTRY 265 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQL 252 HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LPAWPQL Sbjct: 266 HTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLPAWPQL 325 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G ++YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 326 GGKVVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 340 bits (873), Expect = 3e-92, Method: Composition-based stats. Identities = 110/283 (38%), Positives = 157/283 (55%), Gaps = 14/283 (4%) Query: 11 PRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK 70 +FR ++ ++ P + + N + + ++ Sbjct: 8 NQFRS--FSKPFIQLGSGDEIDIPRITPLP-----------EKITTAENAKTVSSSQYKE 54 Query: 71 NVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSL 130 +A T L+ A + +A +AN Q WL ++GTARV+LN+D + SL Sbjct: 55 RLAHNLLKGATVLADDNTPLAAASMARSVAVGEANDAAQHWLSQFGTARVQLNLDNNLSL 114 Query: 131 KDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDL 190 K S+ +ML P+YD ++LF+Q + D R NIG G R ++WM G N F D D+ Sbjct: 115 KGSAFDMLLPLYDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQ-DNWMYGANVFFDRDI 173 Query: 191 SRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 + + RIG GAE W DYLKLSAN Y+R + W +S D DY ERPANG+D+R E YLPA+P Sbjct: 174 TGKNNRIGFGAEAWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGYDLRVEAYLPAYP 233 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 Q+G +L YEQY G+EV LFGKD RQK+P+A +A + YTP+PL Sbjct: 234 QIGTNLKYEQYKGNEVALFGKDDRQKNPYAFTAGINYTPIPLI 276 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 338 bits (867), Expect = 1e-91, Method: Composition-based stats. Identities = 124/291 (42%), Positives = 166/291 (57%), Gaps = 11/291 (3%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTA 64 ++ K+ L + A+ + F + + S + Sbjct: 100 QSIAKKYNITVDELKKLNAYRT--------FSKPFASLTTGDEIEVPRKESSFFSNNPNE 151 Query: 65 DNN--VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +N V+ +A A AG LS+ SDA N T + N Q+WL ++GTARV+L Sbjct: 152 NNKKDVDDLLARNAMGAGKLLSNDNTSDAASNMARSAVTNEINASSQQWLNQFGTARVQL 211 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD DF L +S+L++L P+ D+ +++LFTQ + D R NIG G R + G+ WM G Sbjct: 212 NVDSDFKLDNSALDLLVPLKDSESSLLFTQLGVRNKDSRNTVNIGAGIRQYQGD-WMYGA 270 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF D+DL+ + R+GVGAE DYLK SAN Y +GW +S D Y ERPA+G+DIR Sbjct: 271 NTFFDNDLTGKNRRVGVGAEVATDYLKFSANTYFGLTGWHQSRDFSSYDERPADGFDIRT 330 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 E YLPA+PQLG LMYE+Y GDEV LFGKD RQKDPHA++ V YTPVPL Sbjct: 331 EAYLPAYPQLGGKLMYEKYRGDEVALFGKDDRQKDPHAVTLGVNYTPVPLV 381 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 333 bits (855), Expect = 3e-90, Method: Composition-based stats. Identities = 117/242 (48%), Positives = 154/242 (63%), Gaps = 3/242 (1%) Query: 54 RLSMGNTTVTADN--NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEW 111 + S + DN +VE +A A T LS+ + + + A+ + N Q+W Sbjct: 124 KASPFSVDNNKDNRLSVENTLAGHAVAGATALSNGDVAKSGERMVRSAASNEFNNSAQQW 183 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L ++GTARV+LN++ DF L S+ ++L P+YD ++LFTQ D R N+G G R Sbjct: 184 LSQFGTARVQLNINDDFHLDGSAADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVR 243 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 F GN WM G NTF D+DL+ + RIGVGAE W DYLKLSAN Y + W +S D DY Sbjct: 244 TFQGN-WMYGANTFFDNDLTGKNRRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYN 302 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPANG+D+RAE YLP++PQLG MYE+Y GD+V LFGKD RQK+PHAI+A V YTP+P Sbjct: 303 ERPANGYDLRAEAYLPSYPQLGGKAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIP 362 Query: 292 LT 293 L Sbjct: 363 LV 364 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 332 bits (852), Expect = 7e-90, Method: Composition-based stats. Identities = 120/254 (47%), Positives = 153/254 (60%), Gaps = 1/254 (0%) Query: 40 TPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGM 99 P + P + + + E VA G L+S S+A Sbjct: 143 IPHSGSSLTKPGSPAAATPLSPHADTSERESRVAGQLMGVGRVLASPQSSNAASEMARSW 202 Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 ATA AN EI +WL KYGTA+++LN+DK+FSL S+L+ L P YDTPT FTQ D Sbjct: 203 ATAAANDEIVKWLSKYGTAQLQLNIDKNFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRD 262 Query: 160 DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 R NIG G R S N W+ GVN F DHDLS ++R+G+G+E W DYL+LS NGY+R S Sbjct: 263 HRNTLNIGIGTRTLSNN-WLFGVNAFYDHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLS 321 Query: 220 GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPH 279 W +S D+ DY ERPANG+D+RA ++P PQLG LMYEQY+GD VGLFGKD Q++P+ Sbjct: 322 DWHQSRDLADYNERPANGFDVRANAWMPTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPY 381 Query: 280 AISAEVTYTPVPLT 293 A + V YTP PL Sbjct: 382 AFTVGVNYTPFPLL 395 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 330 bits (847), Expect = 3e-89, Method: Composition-based stats. Identities = 127/279 (45%), Positives = 167/279 (59%), Gaps = 22/279 (7%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 S + R + +Q F + F V A E++VA Sbjct: 8 SSQVRRVAVYGLAGLQFFFQVTPAFAGVFQAD----------------------EQSVAQ 45 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 A AG L DA R +T A+ +A + +WL ++GTA+ +L+V DFSLK SS Sbjct: 46 TAMEAGRVLQGSNSGDAARQMLTSQASGQAADAVTQWLNQFGTAKTQLSVVSDFSLKGSS 105 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 L++L P Y+TP N+LFTQ + D R +N G G R+F+ N WM G N F D D ++ Sbjct: 106 LDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGWMLGYNVFYDVDWRNTN 165 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R G+G E WRDYLKLSANGY R S W++SP + DY ERPA+GWDIRAEG+LPA+PQLG Sbjct: 166 RRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGWDIRAEGWLPAYPQLGG 225 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 L+YEQYYG+EV LFG+ +RQK+PHAI+A VT+TP L Sbjct: 226 KLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLL 264 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 329 bits (843), Expect = 7e-89, Method: Composition-based stats. Identities = 119/279 (42%), Positives = 165/279 (59%), Gaps = 15/279 (5%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y ++ ++ P + + + + ++A D +E +AS Sbjct: 71 YRTFSKPFTALTSGDEIDIPRKASPFSIDSEKNKNA--------------DVLLENKLAS 116 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 T L++ + ++ I A + N Q+WL ++GTARV++NV+ DF L S+ Sbjct: 117 HVQTGATALATSNAAKSSERMIRSAANNEFNSSAQQWLSQFGTARVQMNVNDDFKLDGSA 176 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 +++L PIYD ++LFTQ D+R NIG G R F N WM GVNTF D+D++ + Sbjct: 177 VDVLVPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQNN-WMYGVNTFFDNDMTGKN 235 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R+GVGAE W DYLKLSAN YI S W +S D DY ERPANG+D+RAE YLP+ PQLG Sbjct: 236 RRVGVGAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRAEAYLPSHPQLGG 295 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYE+Y G+EV LFGKD RQK+PHA++A V YTP+PL Sbjct: 296 KLMYEKYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPLL 334 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 328 bits (840), Expect = 2e-88, Method: Composition-based stats. Identities = 137/284 (48%), Positives = 181/284 (63%), Gaps = 5/284 (1%) Query: 11 PRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD-NNVE 69 + R R A +Q P P+MAA+ + G + + N Sbjct: 84 NQLRELNQLRTFAHGLNGLQ---PGDDVDVPLMAAKDNKNASDAAAPGRSASAEEGNEQA 140 Query: 70 KNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS 129 + VA +A+ AG+FL+S SDA + MAT +A Q+WL +GTARV+L+ DK+FS Sbjct: 141 QKVAGYASQAGSFLASSAKSDAAASMARNMATVEAGGAFQQWLSHFGTARVQLDADKNFS 200 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHD 189 LK+S ++L P+YD N +FTQG++HRTD RTQ+++G GWRH S + +M G N F D D Sbjct: 201 LKNSQFDLLLPLYDQGDNFVFTQGSLHRTDSRTQASLGAGWRH-STSTYMLGGNLFGDFD 259 Query: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAW 249 LSR H R G G EYWR++LKL N Y+R SGWK SPD+EDYQERPANGWD+R + ++P+ Sbjct: 260 LSRDHARAGAGLEYWRNFLKLGVNSYLRLSGWKDSPDLEDYQERPANGWDVRGQAWVPSL 319 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG L YEQYYG EV LFG D RQ++PHAI+ + YTPVPL Sbjct: 320 PQLGGKLTYEQYYGKEVALFGVDSRQRNPHAITVGINYTPVPLI 363 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 326 bits (837), Expect = 4e-88, Method: Composition-based stats. Identities = 116/265 (43%), Positives = 158/265 (59%), Gaps = 17/265 (6%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 +Q+LFP V +A A QP +++ T V S A GT ++ Sbjct: 1 MQLLFPF------VTSAYTYAASQPPVAVPVPT---------QVTSLLAAGGT--ETENG 43 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 S+ ++ T MAT A ++EWL +GTA V LN D++ + +SS++ L P+YD ++ Sbjct: 44 SNGLKSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNKKSV 103 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 LFTQ + D RT NIG G R F+ +WM G N F D D + + R+G+GAE W DYL Sbjct: 104 LFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWTDYL 163 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 KL+AN YI + W S D DY E+PA+G+DIRAEGYLPA+PQLGA +MYEQYYG+ V L Sbjct: 164 KLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGENVAL 223 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 F KD Q DP A++ + YTP+ L Sbjct: 224 FDKDHLQNDPSAVTMGLNYTPISLV 248 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 326 bits (836), Expect = 5e-88, Method: Composition-based stats. Identities = 127/280 (45%), Positives = 166/280 (59%), Gaps = 16/280 (5%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 + ++ A ++ P Q L+ NT +T E+N+A Sbjct: 48 QLRTFSKPFAKLQAGDELEIP-------------QAQSNLGLAPENTALTDTQTTERNLA 94 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 A + L+S A + G+A ANQ WL +GTAR++ NVD L S Sbjct: 95 KTATTSAQMLNSGD--KAAARQLRGLAVGNANQAANSWLNNFGTARLQANVDDRGDLDGS 152 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 +ML P YDTP+ M FTQ I R D RT +N+G G RHF +DWM G N F+D D++R Sbjct: 153 QFDMLMPFYDTPSQMAFTQFGIRRIDKRTTANLGIGIRHFI-DDWMVGYNLFLDRDITRD 211 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 HTR+G GAEY RDYLKL+ANGY+R S W+ SPD Y ERPA G+D+RAE YLP+ PQLG Sbjct: 212 HTRVGAGAEYARDYLKLAANGYLRLSDWRDSPDFSSYSERPATGFDLRAEAYLPSLPQLG 271 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYEQY+G++VGLFGKD RQ++P AI+A + YTP+PL Sbjct: 272 GKLMYEQYFGNDVGLFGKDNRQQNPAAITAGINYTPIPLV 311 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 121/278 (43%), Positives = 164/278 (58%), Gaps = 11/278 (3%) Query: 25 ANISVQVLFPLAVTFTPV--MAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTF 82 N +L P P+ +A +A + P L MGN V ++ E+ A+ A G Sbjct: 75 LNAGESLLLPANSPLFPLDPLAGKAIASNLPELGMGNDPVPLVSSGEQKTAAAAHAVGAQ 134 Query: 83 LSSQPDSDATRNFITGMATAKA--------NQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 + SD +N A +A Q+ QE LGK+G A+V L VD + SL S+ Sbjct: 135 NWNNMTSDQMKNQAESWAKGQAKAQVVDPLRQQAQELLGKFGKAQVNLAVDDNGSLSKSA 194 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 + P Y+ + F+Q +HR D+R N+G G R G+ W+ G NTF+D D+SR+H Sbjct: 195 FSLFSPWYENDAMVAFSQVGVHRQDNRMIGNLGAGVRFDQGD-WLFGANTFLDQDISRNH 253 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 +R+G+G E+W D LKL++N Y SGWK S D +DY ERPA G+D+ A+GYLPA+ QLGA Sbjct: 254 SRLGLGLEWWADNLKLASNYYHPLSGWKDSKDFDDYLERPARGFDVHAQGYLPAYQQLGA 313 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 S +YEQYYGDEV LFGKD QKDPHA++ V YTP PL Sbjct: 314 SAVYEQYYGDEVALFGKDNLQKDPHAVTVGVDYTPFPL 351 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 320 bits (820), Expect = 4e-86, Method: Composition-based stats. Identities = 106/232 (45%), Positives = 159/232 (68%), Gaps = 1/232 (0%) Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 ++ + E VA A+ AG F S+ PD + T+ F + T A+ Q+W ++G++++ Sbjct: 133 PSSASENEIRVAQLASQAGKFFSTNPDQEKTKAFARELLTTAASSYAQDWFNRFGSSQIH 192 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 L DK FSLK+S +++L P Y+T N++F+Q ++HR + R ++N+G G R + G M G Sbjct: 193 LEADKKFSLKNSQIDLLMPWYETEDNLIFSQTSLHRKEGRIETNLGLGARWY-GEGQMIG 251 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 NTF D+D+SR H+R+G+G EY RD+LKLSAN Y R SGW+ S D+ D+ RP+NGWD+R Sbjct: 252 GNTFFDYDISRKHSRLGLGVEYRRDFLKLSANSYHRLSGWRSSRDLADHSARPSNGWDVR 311 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AEG+LP++P +G L YEQYYGD V LFG Q++P++I+A + YTP+PL Sbjct: 312 AEGWLPSYPHIGGKLTYEQYYGDSVALFGTKNLQQNPYSITAGLNYTPIPLV 363 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 316 bits (810), Expect = 6e-85, Method: Composition-based stats. Identities = 121/279 (43%), Positives = 157/279 (56%), Gaps = 19/279 (6%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y A+ + ++ P P N TV A+N VAS Sbjct: 100 YRTFAKPFTALTVGDEIDVP--------------RKKSPFTVDNNVTVPAENG----VAS 141 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 AA LS + + N + + Q+WLG++GTAR++ N + DF S+ Sbjct: 142 NAAAGAALLSHGDAAKSAENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDDFEFDSSA 201 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 +++L P+YD ++ FTQ D R NIG G R F N WM G NTF D+D++ ++ Sbjct: 202 IDVLIPLYDNQKSLFFTQLGGRNKDSRNTINIGAGVRAFLTN-WMYGANTFFDNDITGNN 260 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R+G+GAE W DYLKLSANGY + W +S D DY ERPANG+D+RAE YLPA+PQLG Sbjct: 261 RRVGIGAEAWTDYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLPAYPQLGG 320 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYEQY GDEV LFGKDKRQKDPHAI+ + YTPV L Sbjct: 321 KLMYEQYNGDEVALFGKDKRQKDPHAITVGINYTPVSLV 359 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 316 bits (810), Expect = 6e-85, Method: Composition-based stats. Identities = 129/289 (44%), Positives = 181/289 (62%), Gaps = 12/289 (4%) Query: 16 SVLARCVAWANISVQVLFPLAVTF---TPVMAARAQHAVQPR----LSMGNTTVTADNNV 68 + R A+ + +QV+F +F P AA Q + ++ +T ++ Sbjct: 2 RIYLRLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEA 61 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 KN+A+ ++ G+ LS +DA N +A +IQ+WL ++GTA+V L +DKD Sbjct: 62 GKNLAAILSSTGSMLSQDNKTDALINSAINNGSAYVTGQIQQWLQQFGTAKVNLGLDKDL 121 Query: 129 SLKDSSLEMLYPIYDTPT-NMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 SL ++SL++L P+YD N+LFTQ R DDR N+G G+R+F+ + WM G+NTF D Sbjct: 122 SLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFA-DRWMWGINTFYD 180 Query: 188 HDLS-RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 +S +H R+G+G E +Y KLSANGY R SGWK S + EDYQER ANG+DIRAEGYL Sbjct: 181 RQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDIRAEGYL 240 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 PAWPQLGA L++EQYYGD+V LF D RQ++P+A++A V YTP PL Sbjct: 241 PAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLV 289 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 315 bits (808), Expect = 9e-85, Method: Composition-based stats. Identities = 125/243 (51%), Positives = 176/243 (72%), Gaps = 5/243 (2%) Query: 56 SMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDA----TRNFITGMATAKANQEIQEW 111 + + + + + +A A+ G L + P+S+A R+ A AKA QEI +W Sbjct: 112 TTQSLPHSTSSPNDSLLAQSASQVGNTLQNNPNSEALNDLARSSALSAANAKAGQEISDW 171 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L G RVKL+ D+DFS+K+S L++L P++++ ++M+F+QG++HRTDDRTQSN+G G+R Sbjct: 172 LNGKGKVRVKLDADRDFSVKNSQLDLLVPLWESESHMIFSQGSVHRTDDRTQSNLGLGYR 231 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 +F+ + + G NTF DHD SRSH+R+G+GAEY R++ KL+ NGY+R S WK SPD ++Y+ Sbjct: 232 YFA-DSYALGANTFYDHDWSRSHSRLGLGAEYQRNFFKLATNGYLRLSNWKDSPDFDNYE 290 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPANGWDIRAEGYLP++P LGA L YEQYYGD VGLFGKD +QK+PHAI+ Y+P P Sbjct: 291 ERPANGWDIRAEGYLPSYPGLGAKLAYEQYYGDNVGLFGKDNQQKNPHAITFGGNYSPFP 350 Query: 292 LTQ 294 L + Sbjct: 351 LLK 353 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 314 bits (805), Expect = 2e-84, Method: Composition-based stats. Identities = 113/264 (42%), Positives = 154/264 (58%), Gaps = 10/264 (3%) Query: 36 AVTFTPVMAARAQHAVQPRLSMGNTTVTA------DNNVEKNVASFAANAGTFLSSQPDS 89 + F + + S +T A + E + + G L++ S Sbjct: 67 SSAFENLHPNNEMESSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLAA---S 123 Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML 149 + M NQEI++WL ++GTA+V LN DK+FSLK+SSL+ L P YD+ + + Sbjct: 124 GRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLF 183 Query: 150 FTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 F+Q I D R N+G G R N W+ G+NTF D+DL+ + RIG+GAE W DYL+ Sbjct: 184 FSQLGIRNKDSRNTLNLGVGIRTL-ENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQ 242 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 L+ANGY R +GW S D DY+ERPA G D+RA YLPA PQLG LMYEQY G+ V LF Sbjct: 243 LAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALF 302 Query: 270 GKDKRQKDPHAISAEVTYTPVPLT 293 GKD Q++P+A++A + YTPVPL Sbjct: 303 GKDNLQRNPYAVTAGINYTPVPLL 326 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 314 bits (805), Expect = 2e-84, Method: Composition-based stats. Identities = 111/263 (42%), Positives = 162/263 (61%), Gaps = 7/263 (2%) Query: 36 AVTFTPVMAARAQHAVQPRL-SMGNTTVTADNNVEKNVASFAANAGTFLSSQ----PDSD 90 + PV+ A A+ L S+G+ + +NN E A + GTFLS + S Sbjct: 24 SSAIMPVIPAYAKMLDNKELPSLGSDQIIDENNTEHLAAEYTKTVGTFLSQKKTMKDLSQ 83 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 +++ +++A +EI+ WL K G ++ ++ DK FS+K+S + L P YD +LF Sbjct: 84 IAQDYARNKVSSEATKEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLF 143 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 TQ +HR D+R +N G G R+F + G+N FIDHDLS +HTR+G+G EYW+DYLKL Sbjct: 144 TQHTLHRYDERFHTNNGIGLRYFHEKSTI-GMNAFIDHDLSHAHTRVGLGVEYWQDYLKL 202 Query: 211 SANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 +AN Y + WK + ++ D+ +PA+GWDI+ EG+LP +P LG +L YEQYYGD V LF Sbjct: 203 NANSYFGLTSWKSASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALF 262 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 GK KRQK+P+A + +TP PL Sbjct: 263 GKTKRQKNPNAATIGANWTPFPL 285 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 312 bits (800), Expect = 9e-84, Method: Composition-based stats. Identities = 106/223 (47%), Positives = 136/223 (60%), Gaps = 2/223 (0%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 VAS + G LSS+ A GM + + ++EWLG G A+VKL D Sbjct: 124 VASHLSQVGNSLSSEDRVGAFSRLAKGMLLSSTAKTVEEWLGHIGQAQVKLQADDKNDFS 183 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 S +++ P+YD P + F+Q R D R NIG G RH+ + WM G N F D +S Sbjct: 184 GSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGLGQRHYVSD-WMFGYNIFFDQQIS 242 Query: 192 RS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 + H R+G G E RDY+KLSAN Y R GWK S +EDY ER ANG+DIR E YLP +P Sbjct: 243 GNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLEDYDERAANGYDIRTEAYLPHYP 302 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 QLG LMYEQY+GDEV LFG ++RQK+P A++A V+YTP+PL Sbjct: 303 QLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYTPIPLV 345 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 311 bits (798), Expect = 1e-83, Method: Composition-based stats. Identities = 116/280 (41%), Positives = 166/280 (59%), Gaps = 7/280 (2%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 ++ + + ++ P+ A V P + N V +NN + Sbjct: 98 QFRTFPQGFEQVSSGEEIDIPV--PIIAEQGATKVSVVTP--NEVNCPVGIENNPQTK-- 151 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 + L+S + + + ++ AN+EIQ+WLG+YGTA+V+LNVD FSL++S Sbjct: 152 EYVKRVSALLASSDPTTVATDVVRSEVSSTANKEIQKWLGQYGTAQVRLNVDDKFSLRES 211 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 SL+ L+ YD+ + ++FTQ I D R +N+G G R GN W+ G NTF D+DL+ Sbjct: 212 SLDWLFSFYDSSSAIIFTQLGIRNKDHRNTANLGLGGRISMGN-WILGANTFYDNDLTGI 270 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 ++R+G GAE W DYL+LSAN Y+R + W +S D D+ ERPANG+DIR +LP PQLG Sbjct: 271 NSRLGFGAEAWTDYLQLSANSYMRLNNWHQSRDFIDHDERPANGFDIRTNAWLPVLPQLG 330 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYEQY GD V LFGKDK QK+P+A++A +TYTP PL Sbjct: 331 GKLMYEQYSGDSVALFGKDKLQKNPYAVTAGITYTPFPLL 370 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 311 bits (796), Expect = 2e-83, Method: Composition-based stats. Identities = 116/282 (41%), Positives = 153/282 (54%), Gaps = 19/282 (6%) Query: 28 SVQVLFPL-----------AVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFA 76 Q++ PL + P++AA +L+ + VT N + ++A Sbjct: 107 GQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYA 166 Query: 77 ANAGTFLSSQPDS-----DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 A L SQ S D ++ G+A +A+ ++Q WL YGTA V L +F Sbjct: 167 AQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD-- 224 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SSL+ L P YD+ + F Q D R +N+G G R F + M G N FID D S Sbjct: 225 GSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPEN-MLGYNVFIDQDFS 283 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQ 251 +TR+G+G EYWRDY K S NGY R SGW +S + +DY ERPANG+DIR GYLP++P Sbjct: 284 GDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPA 343 Query: 252 LGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LGA LMYEQYYGD V LF DK Q +P A + V YTP+PL Sbjct: 344 LGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLV 385 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 310 bits (793), Expect = 5e-83, Method: Composition-based stats. Identities = 117/274 (42%), Positives = 160/274 (58%), Gaps = 12/274 (4%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + +A I +Q P+A++ + + A LS + DN A A Sbjct: 2 KSMAIMQILLQTALPVALSMSATVRA-------AELSQNTHSADKDNINSPYSAQM-TQA 53 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 T LSS + A MA+ A +++WL ++GTARV+LNVD + DS+++ L Sbjct: 54 ATALSSGNAAGAG----ASMASGYAGDSVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLA 109 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ MLFTQ + DDR N G G R F ++WM G N F D D + + R+G Sbjct: 110 PLYDSQKAMLFTQLGLRAPDDRVTGNFGLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGF 169 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W + LKLSAN Y+ + W S D +DY E+PA+G+D+RAEGYLPA+PQLGA LMYE Sbjct: 170 GAEAWTNNLKLSANTYLGTTNWHSSRDFDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYE 229 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 QYYGD+V LF KD Q +P A++ V+YTPVPL Sbjct: 230 QYYGDKVALFDKDDLQSNPSAVTVGVSYTPVPLI 263 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 305 bits (781), Expect = 1e-81, Method: Composition-based stats. Identities = 110/237 (46%), Positives = 153/237 (64%), Gaps = 4/237 (1%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 + N + + +A + L+ D + ++K+NQ+I++WL ++G Sbjct: 83 LNNQDEAIPSTEGEELAKIIVDNSFLLNKDID---VTQYAISQISSKSNQKIEQWLNQFG 139 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 ARV L+ DK+ +LK+SS E+L P+Y+ ++F Q HR D R+Q N G G+R+F+ Sbjct: 140 HARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFAQTNYHRKDLRSQFNYGIGYRYFT-E 198 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 +M G+N F DHDL+ H R+G+GAE WRDY KLS+N Y R S W+ S +I DY ERPAN Sbjct: 199 KFMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSSNHYHRLSSWRASNNILDYSERPAN 258 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 GWDIR EGY PA+PQLG L++EQYYG EVGLFGKDKR K+PH + + YTP+PL Sbjct: 259 GWDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKDKRDKNPHTYTLGINYTPIPLV 315 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 304 bits (779), Expect = 2e-81, Method: Composition-based stats. Identities = 103/224 (45%), Positives = 153/224 (68%), Gaps = 3/224 (1%) Query: 70 KNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS 129 + +AS A + G LS+ S + + T K N IQ W +GTA ++L VDK+FS Sbjct: 133 QQIASIATDVGNILSNDNISK--NSALLNKITNKVNSHIQSWFENFGTAHIQLQVDKNFS 190 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHD 189 LK+S LE+L+P+++ + F+QG I DD+ SNIG G+R F ++WM G N+FID+D Sbjct: 191 LKNSQLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFY-DNWMLGGNSFIDYD 249 Query: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAW 249 L + H+R+G+G EYW+D LKL AN Y+R S W+ S +I DY+ERPANG D+ + +LP++ Sbjct: 250 LRKEHSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNIKSWLPSY 309 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQ+G + YE+YYGD+V LFG++ RQ++PH+ + ++YTP PL Sbjct: 310 PQIGGDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLM 353 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 304 bits (778), Expect = 2e-81, Method: Composition-based stats. Identities = 104/228 (45%), Positives = 139/228 (60%), Gaps = 2/228 (0%) Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 VAS + G+ LSS+ +A G+ + + ++EWLG G A+VKL VD Sbjct: 80 QQASLVASHLSQIGSTLSSESRVEAFSRLAKGVLLSSTAKSVEEWLGHIGKAQVKLQVDD 139 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 S L + P+Y+ P + F+Q R D R NIG G RH+ + WM G N F+ Sbjct: 140 KNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYLSD-WMLGYNVFL 198 Query: 187 DHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 D +S + H R+G+G E RDY+KLSAN Y R GWK S +EDY ER A+G+DIR E Y Sbjct: 199 DQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERAASGYDIRTEAY 258 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LP +PQLG LMYEQY+G+EV LFG ++RQK+P A++A V+YTP PL Sbjct: 259 LPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPLV 306 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 303 bits (775), Expect = 7e-81, Method: Composition-based stats. Identities = 108/281 (38%), Positives = 150/281 (53%), Gaps = 16/281 (5%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F + + ++ +S+ ++F + A++ N + Sbjct: 5 FNTLTVTKIISRLILSIGLIFGIFTYGFSQQHYFNSEALENPAE--------HNEAFNKI 56 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 S + S N M ANQE++ WL ++GT +V +N DK FSLK+ Sbjct: 57 ISTGTSLA-------VSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKE 109 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 SSL+ L P YD+ + + F+Q I D R NIG G R F + WM G NT D+D++ Sbjct: 110 SSLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQS-WMYGFNTSYDNDMTG 168 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 + RIGVGAE W DYL+LSANGY R +GW +S D DY ERPA+G DI + YLPA PQL Sbjct: 169 HNHRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQL 228 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G L YEQY G+ V LFGKD Q +P+A++ + YTP+P Sbjct: 229 GGKLKYEQYRGERVALFGKDNLQSNPYAVTTGLIYTPIPFI 269 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 302 bits (774), Expect = 7e-81, Method: Composition-based stats. Identities = 114/269 (42%), Positives = 156/269 (57%), Gaps = 6/269 (2%) Query: 25 ANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLS 84 AN V P+ + RA + G+ T D ++ G+ L+ Sbjct: 102 ANAGELVDSPINDAIAININ-RASQNNKNNAGAGSLTKEQDPMDSLSI----RGVGSALA 156 Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDT 144 + DA + MAT+ N +I +WL +YGTAR++LN D+DFSL +S+L+ L P+YD+ Sbjct: 157 ASGRVDALHHMARTMATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLLPLYDS 216 Query: 145 PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYW 204 T LFTQ D R +NIG G R F ++WM G N F D+D + + R+G+GAE W Sbjct: 217 QTLTLFTQQGFRNKDRRNIANIGIGTR-FIHHEWMMGGNAFYDNDFTGDNKRVGLGAELW 275 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD 264 D +LSANGY R + W +S D DY ERPANG D+RA G+LPA P LG SL+YE Y+GD Sbjct: 276 TDSFQLSANGYFRLTAWHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYEHYFGD 335 Query: 265 EVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V LFGKD Q++P+AI+ +YTP L Sbjct: 336 NVALFGKDHLQRNPYAITLGGSYTPFSLL 364 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 301 bits (772), Expect = 1e-80, Method: Composition-based stats. Identities = 93/212 (43%), Positives = 134/212 (63%), Gaps = 2/212 (0%) Query: 83 LSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIY 142 L ++ +++I ++ Q+WL ++GTA++ LNVD L +SS+++L P Y Sbjct: 122 LLNKDPKKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFY 181 Query: 143 DTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGA 201 D + ++++Q D R N+G G R F NDWM G NTF D+DL+ +++R +G Sbjct: 182 DDKDHWLIYSQYGYRHKDSRDTVNLGIGTRLFI-NDWMYGANTFYDNDLTGNNSRFSLGG 240 Query: 202 EYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 E W +YLK+SAN Y R S W S D+ +Y ERPANG+D+ A+ YLPA P LGA + YEQY Sbjct: 241 ELWTNYLKMSANAYFRLSDWHNSRDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQY 300 Query: 262 YGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +GD V LFG + RQKDP+A + V YTP+PL Sbjct: 301 FGDNVALFGTNNRQKDPYAATIGVNYTPIPLI 332 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 300 bits (768), Expect = 4e-80, Method: Composition-based stats. Identities = 95/223 (42%), Positives = 138/223 (61%), Gaps = 3/223 (1%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 + S L+S P +++I ++ Q+WL ++GTA++ LNVD L Sbjct: 105 ILSHGTKILGLLNSDPK-KLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLD 163 Query: 132 DSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDL 190 +SS+++L P YD + ++++Q D R N+G G R F N WM G NTF D+DL Sbjct: 164 ESSVDLLVPFYDDKDHWLVYSQYGYRHKDSRDTVNLGIGTRLFINN-WMYGANTFYDNDL 222 Query: 191 SRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWP 250 + +++R +G E W +YLK+SAN Y R S W + D+ +Y ERPANG+D+ A+ YLP+ P Sbjct: 223 TGNNSRFSLGGELWTNYLKMSANAYFRLSDWHNARDLVNYYERPANGYDLIADMYLPSMP 282 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LGA + YEQY+GD V LFGK+KRQKDP+A + V YTP+PL Sbjct: 283 SLGAKIKYEQYFGDNVALFGKNKRQKDPYAATIGVNYTPIPLI 325 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 296 bits (758), Expect = 6e-79, Method: Composition-based stats. Identities = 115/299 (38%), Positives = 169/299 (56%), Gaps = 23/299 (7%) Query: 6 TGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRL------SMGN 59 + +P+ S+L W+ + P++ + AQ L ++ Sbjct: 5 SSKLKPKLPNSLLLSTAIWSTAIL-----------PMVPSYAQIVHLDDLPTLGGQAIQF 53 Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQ----PDSDATRNFITGMATAKANQEIQEWLGKY 115 +++ E+ +A + NA F S + +D +++ A A EI WL K Sbjct: 54 EGTQPEDSTERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEITHWLSKA 113 Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSG 175 G AR+ +N+DK S+K S L+ L P Y+ +LF+Q +IHRTD R Q+N G G RHF Sbjct: 114 GNARLNINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLRHFQQ 173 Query: 176 NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDYQERP 234 N M GVN F DHDLS H+R+G G EY +DY+++SAN Y+ S W+ + ++ +DY RP Sbjct: 174 NS-MIGVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDYNARP 232 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 ANGWDI+ EG+LP + LGA+L EQYYGD+V LFGK++RQKDP A + V ++P PL Sbjct: 233 ANGWDIQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPFPLL 291 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 295 bits (755), Expect = 1e-78, Method: Composition-based stats. Identities = 99/254 (38%), Positives = 139/254 (54%), Gaps = 9/254 (3%) Query: 48 QHAVQPRLSMGNTTVTADNNVEKNVASF--------AANAGTFLSSQPDSDATRNFITGM 99 Q+ P L N K++ A L+ + R+++ Sbjct: 27 QNESLPDLGSQAAQQDEQTNKGKSLKERGADYVINSATQGFENLTPEALKSQARSYLQSQ 86 Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 T+ A I++ L YG R L++ + L SS++ P YD T + F+Q + R + Sbjct: 87 ITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSIDYFVPWYDNQTTVYFSQFSAQRKE 146 Query: 160 DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 DRT NIG G R+ + + ++ G N F D+D +R H R+G+GAE W DYLK S N Y S Sbjct: 147 DRTIGNIGLGVRY-NFDKYLLGGNIFYDYDFTRGHRRLGLGAEAWTDYLKFSGNYYHPLS 205 Query: 220 GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPH 279 WK S D + Y+ERPA GWDIRAE +LPA+PQLG +++EQYYG+EV LFG D +KDP Sbjct: 206 DWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKIVFEQYYGNEVALFGTDSLEKDPF 265 Query: 280 AISAEVTYTPVPLT 293 A++ V Y PVPL Sbjct: 266 AVTLGVKYQPVPLI 279 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 293 bits (750), Expect = 5e-78, Method: Composition-based stats. Identities = 108/236 (45%), Positives = 145/236 (61%), Gaps = 2/236 (0%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 T +N E A+ A S S + MA++ AN IQ+WL ++GT Sbjct: 123 NRPLDTKVDNNENYSANKTKAAVNV-SESNKSPEALGVASSMASSAANNAIQKWLSQWGT 181 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND 177 +L+ D SLK+SSL+ L PIYDT N F Q D R N+G+G RH N Sbjct: 182 VESQLSFDSKASLKNSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLGWGVRHVY-NG 240 Query: 178 WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 WM G+N F D+D++ ++ R+G+G E DYL +++N Y+R + W +S D DY ERPANG Sbjct: 241 WMYGLNNFFDYDITGNNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDFYDYDERPANG 300 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +D+R G+LPA+PQ+G L+YEQYYGDEVGLFGKD RQKDP AI+A V++TP PL Sbjct: 301 FDMRVNGWLPAYPQIGGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSWTPFPLL 356 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 99/240 (41%), Positives = 133/240 (55%), Gaps = 1/240 (0%) Query: 55 LSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGK 114 +D + L+ T + + Q+ Q+ LG+ Sbjct: 27 AETSGAKPISDQQFADWGKNLGGQDWNTLNRDKAQSKTTQWAKEKIISPLQQQAQDLLGR 86 Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 +G A+V L++D +L S+ + P YD+ +LF+Q IH D+R N G G R Sbjct: 87 FGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYLLFSQINIHHQDNRKIGNFGLGHRIEL 146 Query: 175 GN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQER 233 + + + G N FIDHD SR H R G+GAE DYLK SAN Y S WK SPD +DY ER Sbjct: 147 PSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADYLKFSANYYHPLSHWKDSPDFDDYLER 206 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PA G+D+R++GYLPA+PQLG S +YE Y+GDEV LFGK RQKDP A++ + YTPVPL Sbjct: 207 PAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVALFGKSHRQKDPRALTLGIDYTPVPLV 266 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 283 bits (725), Expect = 4e-75, Method: Composition-based stats. Identities = 89/241 (36%), Positives = 137/241 (56%), Gaps = 11/241 (4%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 + + EK A L++ ++A N+ NQ+I +WL +YG Sbjct: 99 ETSQAKQVESAEKQFVQGATQIAQGLANNNATEAAINYARNRGEGLLNQKISDWLNQYGK 158 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND 177 ARV+++ +K ++L P+ D P ++LF+Q I + R+ +N+G G+R + N Sbjct: 159 ARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQIGIRANEQRSTTNLGLGYRQYQQN- 212 Query: 178 WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS--PDIEDYQERPA 235 WM G+N+F D+D+S + R G+G E W YLKL+ NGY R + W +S ++ DY ERPA Sbjct: 213 WMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVNGYFRLTDWHQSFLHEMRDYDERPA 272 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK---DKRQKDPHAISAEVTYTPVPL 292 NG+D+RAEGYLP++P LGA YEQY+GD V L + +P A++ ++YTP PL Sbjct: 273 NGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHNPTAKDLKDNPSAVTFGLSYTPFPL 332 Query: 293 T 293 Sbjct: 333 L 333 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 281 bits (719), Expect = 2e-74, Method: Composition-based stats. Identities = 110/271 (40%), Positives = 152/271 (56%), Gaps = 18/271 (6%) Query: 34 PLAVTFTPVMAARAQH---AVQPRLSMGNTTVTADNN---VEKNVASFAANAGTFLSSQP 87 PL + TP+ A P L + +N E NVAS A + + Sbjct: 90 PLNGSTTPLFAPEETSKSITELPDLGSIQNDIDVNNKLPVTEDNVASAATQLWGIMGNDN 149 Query: 88 DSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN 147 S A + +TG+A A+Q +WLG+YG ARV+LN S + ++L P+ +T N Sbjct: 150 SSRAAESAVTGVAAGLASQAAADWLGQYGNARVQLN-----SNSIGNADVLIPLTETQNN 204 Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 +LF Q + +RT +N+G G R F+ + WM GVNTF D+DL+ ++R+GVG E W D Sbjct: 205 LLFGQLGVRYNGERTTNNVGLGVRSFT-DSWMFGVNTFYDYDLTGKNSRLGVGGEAWTDN 263 Query: 208 LKLSANGYIRASGWKKS--PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 LK SANGY R + W +S D+EDY ERPANG+D+RAE YLP++PQLG LMYE+Y+G Sbjct: 264 LKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYLPSYPQLGGRLMYEKYFGKG 323 Query: 266 VGLFGK----DKRQKDPHAISAEVTYTPVPL 292 V L D P A + + YTP+PL Sbjct: 324 VALNSGSTSPDDLGDSPSAFTVGLNYTPIPL 354 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 280 bits (717), Expect = 3e-74, Method: Composition-based stats. Identities = 90/253 (35%), Positives = 135/253 (53%), Gaps = 16/253 (6%) Query: 43 MAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATA 102 + A A L+ V V+ V + R F Sbjct: 92 IPAAKPAATTLPLAPATVQVAKPGKVDGKV-------------DDKTTNVRQFGQDQLNT 138 Query: 103 KANQEIQEWLGKYG-TARVKLNVDKDFSLKDSSLEMLYPIYDT-PTNMLFTQGAIHRTDD 160 A+++ + WL +G ++RV ++ ++F+ + + ++L P++++ M+F+Q + DD Sbjct: 139 LASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMIFSQLGVRHADD 198 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 RT NIG G R+F G WM G N F D+D S S+ RIG+GAE D L+L+ANGY + +G Sbjct: 199 RTTGNIGLGARYF-GEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRLAANGYFKLTG 257 Query: 221 WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHA 280 W S I D+ ERPANGWDI +LP +PQLG + YEQYYGD V L + + Q +P A Sbjct: 258 WHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALISRGRLQHNPSA 317 Query: 281 ISAEVTYTPVPLT 293 + V +TP+PL Sbjct: 318 ATLGVNWTPIPLV 330 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 276 bits (705), Expect = 9e-73, Method: Composition-based stats. Identities = 95/244 (38%), Positives = 141/244 (57%), Gaps = 12/244 (4%) Query: 56 SMGNTTV-TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGK 114 ++G+T + +NN E+ AS G LSS D + N+ + NQ+I +WL + Sbjct: 92 NLGSTKILPEENNNEEKFASSFTLMGDILSSDNFVDNSINYAKSIGQGLVNQQINDWLNQ 151 Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 YG AR+ + DK+ S + L P+ D P N+LFTQ + DR N+G G+R + Sbjct: 152 YGKARISFSSDKNI-----SGDFLLPVIDEPNNLLFTQLGLRNNTDRNTINLGLGYRKYW 206 Query: 175 GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP--DIEDYQE 232 N WM G+NTF D+D + + R+GVG E W DYLKL+ NGY + W +S ++DY E Sbjct: 207 RN-WMFGINTFYDYDYTGGNARLGVGGEAWIDYLKLAINGYFGLTDWHQSKISVMDDYDE 265 Query: 233 RPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL---FGKDKRQKDPHAISAEVTYTP 289 RPA G+D+RAE YLP +PQLG+S+ YE+Y+G + L + + D ++ + YTP Sbjct: 266 RPATGFDVRAEAYLPKYPQLGSSIKYEKYFGKGIHLGTGVNPEYLKDDAQSLIMGLNYTP 325 Query: 290 VPLT 293 +PL Sbjct: 326 IPLL 329 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 270 bits (691), Expect = 3e-71, Method: Composition-based stats. Identities = 105/247 (42%), Positives = 150/247 (60%), Gaps = 11/247 (4%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + R + V W+ I+ Q+L P+ T P ++ + + A++ Sbjct: 2 TMVNKKLR-RKASCAVTWSVIATQILSPVTFTLIPA------NSFASSANTESAQTNAND 54 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 +AS AANAG L++ F +A+A +E+ +WL +YG AR+KLNVD+ Sbjct: 55 EYANELASLAANAGQSLANNT----AGRFAVDTLSAQATKEVVDWLQQYGNARIKLNVDE 110 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 F+LKD++ + LYP D+ +LF+Q ++HRTDDR Q+NIG G RHF+ ++ M G N F Sbjct: 111 SFTLKDAAFDFLYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFY 170 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 D+DLSR H+R G+G EYWRDY++ AN Y S WK S DI+DY ERPANGWD+ AEG+L Sbjct: 171 DYDLSRHHSRAGLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWL 230 Query: 247 PAWPQLG 253 P +PQLG Sbjct: 231 PVYPQLG 237 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 270 bits (690), Expect = 4e-71, Method: Composition-based stats. Identities = 89/291 (30%), Positives = 144/291 (49%), Gaps = 16/291 (5%) Query: 12 RFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN 71 + + LAR +AW + Q+L P A+ A+A R ++ D + Sbjct: 2 KLTPTPLARWLAWVLVGTQLLTPAAL-------AQAMLPEITRSGADSSVDKTDQPEAEW 54 Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKY---GTARVKLNVDKDF 128 +AS A++ G+ L SD +N I + AN I + + R + ++ Sbjct: 55 LASRASSLGSLLQEGNISDFAKNQIQALPQTIANDGITSGIKHWLPEAQFRGGITLEDAS 114 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVN 183 + + ++L P+Y + +++LF Q + D+ R N G GWR G+ W+ G+N Sbjct: 115 KYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVNTGIGWRQDVGD-WLLGIN 173 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 +F+D D+ H R +G E +RD + L+ N Y S WK S + ERPA G D+R + Sbjct: 174 SFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKVQPLHDERPATGIDVRLK 233 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 G LP+ P GA L +EQY+GD+V + G D +DP A + +T+ PVPL + Sbjct: 234 GALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAITWKPVPLVE 284 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 267 bits (683), Expect = 3e-70, Method: Composition-based stats. Identities = 93/218 (42%), Positives = 127/218 (58%), Gaps = 2/218 (0%) Query: 78 NAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEM 137 + + S+ + R + +AN LG GTAR ++ +D DF++ + ++ Sbjct: 170 TSASRYGSEQEVQYWRQQLATQFEEEANAYAASLLGAMGTARTRVTLDDDFNMVTAEADL 229 Query: 138 LYPIYDTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTR 196 L P+ + +LFTQ + R DRT +N+G G RHF + WM G N F D+DL+ H R Sbjct: 230 LLPLAEEQQTLLFTQFGLRRNGQDRTIANLGVGQRHFL-DRWMLGYNLFADYDLTNRHWR 288 Query: 197 IGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASL 256 GVGAE WRDYLKL AN Y S W+ SP E +ER A G D+R E YLPA+PQ ASL Sbjct: 289 AGVGAEAWRDYLKLGANFYTPLSSWRDSPRFEGMEERAARGMDVRLEAYLPAYPQWSASL 348 Query: 257 MYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 EQY G+ VGL D+ ++DPHAI+A + Y P PL + Sbjct: 349 TAEQYLGERVGLLDADQLERDPHAITAGLHYNPFPLLK 386 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 267 bits (682), Expect = 4e-70, Method: Composition-based stats. Identities = 90/239 (37%), Positives = 132/239 (55%), Gaps = 8/239 (3%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 +TT T + + + + +A + L S P D + G + +Q I+ WL +YG Sbjct: 3 SSTTQTGETISDSTL--YTKSAASLLKSGPAFD---QYAAGKISQLTSQAIEGWLKQYGN 57 Query: 118 ARVKLNVDKDFS--LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRT-QSNIGFGWRHFS 174 AR+ LN D S L SS ++L+ +++ + + + Q H D N+G G R+F Sbjct: 58 ARITLNAQSDNSTALAGSSADLLFGLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFM 117 Query: 175 GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERP 234 N M G N F D +++ +R GVG E WRDY K S NGY S W+ S +EDY E+ Sbjct: 118 TNKTMLGYNVFYDRNINSGVSRSGVGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKA 177 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A+G+D++ E YLP + QLG L YEQY+GD V LF + Q DP AI+ ++YTP+PL Sbjct: 178 ADGYDMQIEAYLPTYAQLGGHLKYEQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPLI 236 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 264 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 92/279 (32%), Positives = 135/279 (48%), Gaps = 10/279 (3%) Query: 19 ARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAAN 78 A +AW ++ P P + Q +G D EK A+ A Sbjct: 25 AMGLAWLCGAL----PAYAESPPAPDSVVQQPANDLPELGGNASN-DAEREKEWATMAKQ 79 Query: 79 AG----TFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 G +SSQ ++ G A++ Q+ QE L G A++ L + SS Sbjct: 80 LGERNLNNVSSQQVRTRAESYAVGQASSVLQQQAQELLSPLGNAKLSLVMSDQGDFSGSS 139 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 ++ P+YD + ++Q + + + + N G G R +G+ W+ G NT +D D R H Sbjct: 140 GQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAGD-WLLGYNTVLDSDFERHH 198 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R +GAE W D+L+ SAN Y S + D + RPA+G+DI +GYLP + Q+G Sbjct: 199 NRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDITTQGYLPFYRQIGG 258 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 SL YEQY+G+ V LFG K+Q DP A+ V YTPVPL Sbjct: 259 SLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLV 297 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. Identities = 79/203 (38%), Positives = 114/203 (56%), Gaps = 2/203 (0%) Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 N Q ++ + +G + L+VD S +L P Y +++LF Sbjct: 176 YIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSHVLF 235 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 +Q ++++RT +IG G R + ++ G N FID+DL + H R+ +G E +Y K Sbjct: 236 SQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASNYFK 294 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 LS N Y + W+ S D+EDY ERPA G+DIR +GYLP +PQLG + YEQY+G EV LF Sbjct: 295 LSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEVALF 354 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 K KRQK+P A+S + Y P PL Sbjct: 355 NKTKRQKNPKAVSIGIDYRPFPL 377 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 259 bits (661), Expect = 1e-67, Method: Composition-based stats. Identities = 83/206 (40%), Positives = 118/206 (57%), Gaps = 6/206 (2%) Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 SD +++ I M A E ++ G R L + D L S+++ YP+YD + + Sbjct: 90 SDISKSGIADMGFAALQPETEK---SAGEVRANLPL-SDGKLTSGSIDLFYPLYDGDSRL 145 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDY 207 F Q R D R N+G G R+F G+ W G NTF D +S + H R+G G EYWRDY Sbjct: 146 FFGQVGARRFDGRNIVNLGIGQRYFQGD-WALGYNTFYDIQISGNAHQRLGFGLEYWRDY 204 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L LSANGY + W S ++ Y ER ANG+DIRA+G+ P +PQL L +EQY+GD++ Sbjct: 205 LYLSANGYFGLTDWYSSSALDGYAERAANGYDIRAQGWFPVYPQLSGKLKFEQYFGDDIA 264 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 L R K+P+A++ + YTP+ L Sbjct: 265 LLNHQNRYKNPYALTMGLEYTPIQLI 290 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 254 bits (648), Expect = 4e-66, Method: Composition-based stats. Identities = 85/290 (29%), Positives = 134/290 (46%), Gaps = 34/290 (11%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F + + + WA ++ Q+ P+ +DN ++ + Sbjct: 3 FSKKPITKYITWAIVTSQIPLPVI-------------------------ADSDNEIQSWI 37 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDFS 129 A A++ L D + I + AN + E + R +N++ Sbjct: 38 AGTASSISPHLQEGTLEDYAKGKIKALPGQAANHLVNEGIKSAFPEIIFRGGVNLEDGAK 97 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVNT 184 + S +M P+ +T +++LF Q D+ RT N+G G+R N W+ GVNT Sbjct: 98 YRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGRTYVNVGMGYRQ-EVNGWLLGVNT 156 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+D D+ SH R G+G E ++D L S N Y +GWK S E + ERPA G+D+R +G Sbjct: 157 FLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSAAHELHDERPAYGFDLRTKG 216 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 LP +P L YEQYYGD+V L G ++P A A++ + PVPL + Sbjct: 217 TLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGADLVWNPVPLLE 266 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 72/238 (30%), Positives = 110/238 (46%), Gaps = 10/238 (4%) Query: 63 TADNNVEKNVASFAANAGTFLSSQP---DSDATRNFI----TGMATAKANQEIQEWLGKY 115 + EK A G + + + F +A+ NQ ++ WL + Sbjct: 3 PESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLSPW 62 Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSG 175 G A V + VD S P D + ++Q + R +D SN+G G R + Sbjct: 63 GNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRW-AR 121 Query: 176 NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 + W+ G NTF D+ L R G+GAE W +YL+LSAN Y + W + + ++R A Sbjct: 122 DGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWHERSATQ--EQRMA 179 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G+D+ A+ +P + L + EQY+GD V LF K +P A+S + YTPVPL Sbjct: 180 RGYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLV 237 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 239 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 72/239 (30%), Positives = 113/239 (47%), Gaps = 10/239 (4%) Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRN-------FITGMATAKANQEIQEWLGK 114 +++ EK+ A + G + D + + + NQ ++ WL Sbjct: 49 APENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSP 108 Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 +G A V + VD + S P+ D + ++Q + + D+ SN+G G R Sbjct: 109 WGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR 168 Query: 175 GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERP 234 GN W+ G NTF D+ L + R G GAE W +YL+LSAN Y + W + + ++R Sbjct: 169 GN-WLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQ--EQRM 225 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A G+D+ A +P + L S+ EQY+GD V LF +P A+S + YTPVPL Sbjct: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 236 bits (603), Expect = 5e-61, Method: Composition-based stats. Identities = 72/239 (30%), Positives = 112/239 (46%), Gaps = 10/239 (4%) Query: 62 VTADNNVEKNVASFAANAGTFLSSQPD---SDATRNFI----TGMATAKANQEIQEWLGK 114 +N EK+ A G + R+F ++ E + L Sbjct: 21 APENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTRLRDAVSSSITSEAESLLSP 80 Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 +G A V L VD++ + SS + P D + ++Q + + + N G G R + Sbjct: 81 WGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGVSQQNQGLVGNAGIGQRWTA 140 Query: 175 GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERP 234 G+ W+ G NTF D +R G GAE W DYL+LSAN Y GW+ + ++R Sbjct: 141 GH-WLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQPLGGWEHRAGL--LEQRM 197 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A G+D+ A+ YLP + + S+ +EQY+GD+V LF +P A+ ++YTPVPL Sbjct: 198 ARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYHNPVAVKVGLSYTPVPLV 256 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 233 bits (594), Expect = 5e-60, Method: Composition-based stats. Identities = 79/240 (32%), Positives = 114/240 (47%), Gaps = 11/240 (4%) Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFI--------TGMATAKANQEIQEWL 112 TV ++ + K +A A + G + + R A +A E ++ L Sbjct: 99 TVHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVFNRFRDAAKQRAASEGEQLL 158 Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRH 172 YG A V L + D S SS +++ P D + + F+Q I +++ + N G G R Sbjct: 159 SPYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQLGIEQSEYGSVGNAGLGQRW 218 Query: 173 FSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQE 232 +G+ W G N F+D L R +GAE W YL+ SAN Y SG + + Sbjct: 219 IAGS-WRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSANYYQPLSGCRNHSNSA--LM 275 Query: 233 RPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 R A G+DI GYLP + QLG +L YEQY G+ V LF +P A+S + YTPVPL Sbjct: 276 RMARGYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGNAVANPAAVSLGINYTPVPL 335 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 73/275 (26%), Positives = 118/275 (42%), Gaps = 15/275 (5%) Query: 27 ISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQ 86 +++Q + P P +AR A R ++ + + +A AG+ S++ Sbjct: 34 LTLQTVAPAFAQGAPSFSAR--PAQADRQDAADSAMLRVAQTARQLAQR-QAAGSRASAR 90 Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTA------RVKLNVDKDFSLKDSSLEM--L 138 D D + G A A+AN+ +QE + R++ V+ DFS KD SL++ + Sbjct: 91 VDGD----LLKGQAEAQANELLQEGVRLANQTELPFLRRLQGGVNYDFSNKDLSLDLRTI 146 Query: 139 YPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIG 198 ++ + + Q + H + R N G RH G N F+D++ ++H R Sbjct: 147 DEVHRGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEFGKNHLRGS 206 Query: 199 VGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMY 258 +G E L N Y SGWK + E +ERPA+GWD+ A P L Y Sbjct: 207 LGGEVIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALPGLAIKGQY 266 Query: 259 EQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 ++ G V F + Q++ V Y PVPL Sbjct: 267 FRWSGAAVDYFDNGRPQRNARGYKYGVEYRPVPLV 301 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 213 bits (543), Expect = 5e-54, Method: Composition-based stats. Identities = 72/302 (23%), Positives = 124/302 (41%), Gaps = 12/302 (3%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTF-TPVMAA-RAQHAVQPRLSMGN 59 +H ++ +R A +++Q P+A P +A + A ++ Sbjct: 12 AHLPARGRRHWYRRHRAGAAGMSAVLAMQAAAPVAYGQGAPTFSATQVADAASNAVAQPG 71 Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQE---WLGKYG 116 T + +A G + D F+ A A+AN +Q+ W + G Sbjct: 72 AVETRVAQTIQALAQAREAGGARQDGRASLDG--QFLRSQAQAQANVLVQQGVQWANETG 129 Query: 117 TA---RVKLNVDKDFSLKDSSLEM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 R++ NV DFS +D ++++ + ++ L Q H + R N G R Sbjct: 130 LPWLRRLEGNVSYDFSGRDVAVDVRTIDALHLDQDRALLLQLGGHNQNHRPTVNAGVVAR 189 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 +G+ + G N F+D+++ + H R +GAE L N Y SGWK + E + Sbjct: 190 SAAGSSLILGGNAFLDYEVGKRHLRGSLGAEAVAAQFTLYGNVYAPLSGWKAAKRAERRE 249 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA GWD+ A L + Y ++ G +V F + +++P + Y PVP Sbjct: 250 ERPAAGWDVGFTARPEAVQGLALNAQYFRWRGAQVDYFDDGRYRRNPSGFKYGIEYRPVP 309 Query: 292 LT 293 L Sbjct: 310 LI 311 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 56/268 (20%), Positives = 89/268 (33%), Gaps = 9/268 (3%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQP-----DS 89 LA P+ A + D + +A+ A + + + D Sbjct: 54 LAQALLPLSALAQGAPTLRPARVAQEEAGQDAAWTRKLAAQAESLARRQAERQPGARVDG 113 Query: 90 DATRNFITGMATAKANQEI----QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP 145 D + + + L + L+ D + L + +Y Sbjct: 114 DYLKREAQAQVNDVLRDGVNLARESGLPFLRNLQGGLSHDFESGRTSLQLNTIDEVYRAG 173 Query: 146 TNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 N Q H +DR +N G +R + M G N F+D++ + H R VG E Sbjct: 174 RNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLEVIA 233 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L N Y S WK + +E+PA+G D+ P L S + ++ G E Sbjct: 234 PEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWNGAE 293 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V F + Q V Y PV L Sbjct: 294 VDYFDNGRTQAGAKGFKVGVEYRPVSLV 321 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 194 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 63/280 (22%), Positives = 108/280 (38%), Gaps = 18/280 (6%) Query: 23 AWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTF 82 ++ +Q PLAV A+ + R G+ + V VA A + Sbjct: 47 VCLSLGMQAAAPLAVL------AQGAPEMTNRPEAGDIVPSD---VLTQVAVRAQDLARR 97 Query: 83 LSSQPDSDAT-RNFITGMATAKANQEIQEWLGKYGTA------RVKLNVDKDFSLKDSSL 135 + + + +++ A+ NQ +QE + + ++ ++ DF +SL Sbjct: 98 QADRREGAQVDADYLKQQGQAQFNQFLQEGVRAANESGLRFLRNLQGDLRHDFDNGRTSL 157 Query: 136 EM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 E+ + +Y N Q H ++R +N+G +R M G N F+D++ ++ Sbjct: 158 ELRTIDQVYRKGANTGLLQLGGHNQNNRPTANLGGVYRRDINERLMLGANAFLDYEFAKQ 217 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 H R +G E N Y SGW + +ERPA+G D+ + P L Sbjct: 218 HLRGSLGVEAIAPEFSFYGNVYAPMSGWTGAKRDNRREERPASGMDLGMKYSPGFAPGLS 277 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 Y ++ G V F + Q V Y PVPL Sbjct: 278 LKANYFRWNGAAVDYFDNGRTQDRATGFKYGVQYKPVPLL 317 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 193 bits (490), Expect = 7e-48, Method: Composition-based stats. Identities = 77/255 (30%), Positives = 114/255 (44%), Gaps = 24/255 (9%) Query: 59 NTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA 118 N D V AG S+ DS + + MA++ N E ++ Sbjct: 293 NNLSKEDQEFSNKVMKVIQTAGAIYDSE-DSKSKEEIVKNMASSYLNTSANELAKEF-ID 350 Query: 119 RVKLNVDKDFSLK-------DSSLEMLYPIY--DTPTNMLFTQGAIHR-TDDRTQSNIGF 168 + +++ DFS + + L PI D P F Q I +DRT + G Sbjct: 351 SLNTSINTDFSFNYNERSGFSGNAKALLPIVSEDNPKISYFLQSGIGEFANDRTIGHFGG 410 Query: 169 GWRHFSG-------NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 G R++ + M G+N+ DHD SR H R+ +GAE D L +AN Y R S W Sbjct: 411 GIRYYPNATALNNSGNIMLGLNSVYDHDFSRGHKRMSLGAEAMVDTLAFNANVYQRLSSW 470 Query: 222 KKSPDIE-DY-QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK---DKRQK 276 S D + DY QERPANGWD + + P+ + Q+YG++VG+FG D +K Sbjct: 471 IDSYDFDKDYVQERPANGWDAKIKYAFPSLINVSFFAKMGQWYGNKVGIFGANSVDDLEK 530 Query: 277 DPHAISAEVTYTPVP 291 +P ++Y+P P Sbjct: 531 NPLIYEGGISYSPFP 545 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 186 bits (472), Expect = 9e-46, Method: Composition-based stats. Identities = 54/146 (36%), Positives = 79/146 (54%), Gaps = 3/146 (2%) Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 M ++Q + + D SN+G G R + + W+ G NTF D+ L + R G GAE W +Y Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEY 59 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L+LSAN Y + W+ ++R A G+DI A+ LP + + S+ EQY+GD V Sbjct: 60 LRLSANYYQPFADWQTHT--ATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVD 117 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LF +P A+ + YTPVPL Sbjct: 118 LFDSGTGYHNPVALKLGLNYTPVPLL 143 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 177 bits (449), Expect = 4e-43, Method: Composition-based stats. Identities = 57/251 (22%), Positives = 88/251 (35%), Gaps = 6/251 (2%) Query: 43 MAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATA 102 M + A+ V + +E VA N T S A Sbjct: 6 MPSPARLLTLLLCPTLLPPVAYGSAIESEVA---RNLWTRAQHPDTSPGLAQSALDAGVA 62 Query: 103 K-ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR 161 Q L L D D SL + + + L Q +H + R Sbjct: 63 AGLQASRQTGLPWLRHLDGGLRYDLDPGRLSFSLRTIDDLMVSERRALMLQAGLHNQNQR 122 Query: 162 TQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 +N G R + + G N F+D++ + H R +G E + L AN Y SGW Sbjct: 123 PTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLGLEAIAPHYSLYANYYAPLSGW 182 Query: 222 KKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAI 281 K + +ERPA G+D+ G L + L Y +++G + +F + Q++ Sbjct: 183 KGARRDSRREERPAAGYDL--GGQLSSDAGLSLQAAYFRWHGAGIDVFDSGRAQRNASGF 240 Query: 282 SAEVTYTPVPL 292 V Y P L Sbjct: 241 RYGVAYQPGAL 251 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 57/210 (27%), Positives = 87/210 (41%), Gaps = 7/210 (3%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 P A T A + Q L + ++ EK +A+ A + Sbjct: 35 LPAAAWVTQPENDAALLSQQQALPNLGSASVNESGTEKKLATLARQMAEVNQDENTDQTW 94 Query: 93 RNFITGMATAKANQEIQE----WLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN- 147 R+++ G A + +Q+ L G V L+VD+ SS ++L P+ D T Sbjct: 95 RSYLLGEAKDRVLDRLQQKSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRG 154 Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRD 206 + ++Q + DD N+G R +G W+ G N F D L++ R +GAE D Sbjct: 155 LTYSQLGLQGVDDGVVGNMGLRQRWNAG-RWLLGYNVFYDQYLNQDASRRGSIGAEARSD 213 Query: 207 YLKLSANGYIRASGWKKSPDIEDYQERPAN 236 YL LS+N Y SG + D ED R A Sbjct: 214 YLTLSSNYYYPLSGMHAANDDEDELLRMAR 243 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 147 bits (372), Expect = 3e-34, Method: Composition-based stats. Identities = 67/289 (23%), Positives = 108/289 (37%), Gaps = 39/289 (13%) Query: 28 SVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK--------NVASFAANA 79 Q L + + F +++ A + +N K A+++ Sbjct: 3 ISQALTSITLVFGSILSVSANEYKFEEIKFNQIPNEQNNYEPKDKLDEYIIKGANYSTKF 62 Query: 80 GTFLSSQPDSDATRNFITG------------MATAKANQEIQEWLGKYGTARVKLN--VD 125 +++ D + A AKAN EIQ+ + + V ++ + Sbjct: 63 VPLMNNGAKGDEYTGIMADDLNRLLVDAGFDFANAKANGEIQK-IPFFAQTSVNISGGTE 121 Query: 126 KDFSLKDSSLEMLYPIYDTP----TNMLFTQGAIHRTDD--RTQSNIGFGWRHFSGNDWM 179 D S +SL L + + F+Q + + NIG G R+ + M Sbjct: 122 SDTSFSINSLMKLGELAKDDQGDLKTLAFSQARFATATNAEGSTINIGLGIRNRPDDISM 181 Query: 180 AGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDYQERPA 235 G N F D+ D S +H+R+G+G EY+ + N Y+ + K DYQER Sbjct: 182 VGANAFWDYRMTDYSDAHSRLGLGGEYFWKDFEFRNNWYMAITNEKDVIIKGVDYQERVV 241 Query: 236 NGWDIRAEGYLPAWPQL-----GASLMYEQYYGDEVGLFGKDKRQKDPH 279 GWD+ LP P+L G + Y +Y D GL G Q PH Sbjct: 242 PGWDLEVGYRLPNNPELAFYIRGFNWDY-KYTQDNSGLEGAVSWQATPH 289 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 147 bits (372), Expect = 4e-34, Method: Composition-based stats. Identities = 74/291 (25%), Positives = 108/291 (37%), Gaps = 57/291 (19%) Query: 46 RAQHAVQPRLSMGN-----TTVTADNNVEKNV-------ASFAANAGTFLSSQPDSDA-- 91 RA AV L G T V ADN V AS+A L+S SD Sbjct: 51 RACKAVAGALEAGQSVRCETLVDADNQSNSTVQKIFVTGASYATRIFPLLNSASLSDGIQ 110 Query: 92 ------TRNFITGMATAKANQEIQEWLGKYGTAR--VKLNVDKDFSLKDSSLEMLYPIYD 143 +++FI A N+ + + + V D D + +SL L + Sbjct: 111 KMLWMDSKSFIVSFAHDYLNEYVLKQIPFLSQTEFGVGFESDADMTYYLNSLISLAQLGS 170 Query: 144 TPTN----MLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHT 195 +LF QG+ + +N+G G R ++ M G N F D+ + S S++ Sbjct: 171 DDNGYPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLRDNAMLGANAFWDYRFTNYSSSYS 230 Query: 196 RIGVGAEYWRDYLKLSANGYIRASGWKKSPD-----------------------IEDYQE 232 R G GAE W D KL+ N YI +G K+ + E Sbjct: 231 RWGAGAELWWDDFKLTNNWYIAGTGIKRITTSGRAYTDTTSLAAGTYDETTLLGANTFDE 290 Query: 233 RPANGWDIRAEGYLPAWPQLGASLMYEQY----YGDEVGLFGKDKRQKDPH 279 R GWD+ LP++PQL + ++ D G+ G Q PH Sbjct: 291 RVVPGWDVALNYRLPSYPQLSLGIRGFRWDYMRKSDNSGVEGSVNWQATPH 341 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 144 bits (363), Expect = 4e-33, Method: Composition-based stats. Identities = 63/234 (26%), Positives = 103/234 (44%), Gaps = 25/234 (10%) Query: 70 KNVASFAANAGTFLSSQPDSD-------ATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + A++AA G + + D + + AN++I++ + + + L Sbjct: 109 QKGANYAALYGPSMVNSNGVDLGGLIQTELSRTLISSGVSYANKQIKK-IPFFAQTTLGL 167 Query: 123 N--VDKDFSLKDSSLEMLYPI-YDT---PTNMLFTQGAIH-RTDDRTQSNIGFGWRHFSG 175 + D + S L I YD P ++F Q + T + Q N+G G R G Sbjct: 168 DAATSSDLTGYLDSFMRLKTIGYDNEGDPMGLMFGQARVTLETSAQPQVNVGLGSRFRLG 227 Query: 176 NDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKK-SPDIEDYQ 231 ++ + G+N F D + S ++TR G+GAE + +L N YI S K + + DY Sbjct: 228 DEAIVGLNGFWDLRTTNYSTAYTRWGIGAEGFWKSFELRNNWYINGSADKNITINNIDYV 287 Query: 232 ERPANGWDIRAEGYLPAWPQL-----GASLMYEQYYGDEVGLFGKDKRQKDPHA 280 ER GWD+ +P++PQL G + Y Q + D G+ G Q PHA Sbjct: 288 ERVVPGWDVEVGYRIPSYPQLAIFVRGFNWDY-QDHSDNSGIEGSVNWQATPHA 340 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 75/180 (41%), Gaps = 10/180 (5%) Query: 123 NVDKDFSLKDSSLEMLYPIYDT----PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 L++ S++M + Y N+ F Q IH N G G RH + + + Sbjct: 83 TYKSKMQLQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKY 142 Query: 179 MAGVNTFIDHDLSRSHTR---IGVGAEYW-RDYLKLSANGYIRASGWKKSPDIEDYQ-ER 233 G NTF +S+ ++ I VG EYW + L + N Y + + ++ Sbjct: 143 AIGYNTFYHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHY 202 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P +G + + P + + + EQ+ ++ +K+ D + +S ++ Y P+P+ Sbjct: 203 PRSGHQLYIQTKFPRFFEFTGKIKLEQFIYEKKYKKIFNKKNSD-YYLSLDLNYQPIPML 261 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 55/198 (27%), Positives = 88/198 (44%), Gaps = 11/198 (5%) Query: 73 ASFAANAGTFLS-SQPDSDATRNFITGMATAKANQEIQEWL-----GKYGTARVKLNVDK 126 A + G LS S DS+ ++ + T+ A+ + + + T V N+ + Sbjct: 15 AVLTMSLGFSLSVSADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDTVEVSTNLKE 74 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTF 185 S D + +L D P + LF Q ++R D RT N+GFG+R + ++ WM GVN F Sbjct: 75 GDSTVD--IGVLKAFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAF 132 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 DH+ H R GVG E L+ N Y +G+ D + +G D+ + Sbjct: 133 YDHEFPNDHKRNGVGFEVVSSVLESRVNSYNGTTGY--IKDKSGTDSKVLDGRDMGFKVA 190 Query: 246 LPAWPQLGASLMYEQYYG 263 LP P + + Q+ G Sbjct: 191 LPYLPGMMFGMNAVQWKG 208 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. Identities = 51/136 (37%), Positives = 66/136 (48%), Gaps = 5/136 (3%) Query: 160 DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 R +++G G+R + + GVN F D+DLSR HTR+ VGAEY DY S N Y S Sbjct: 35 GRDFAHLGLGYRQLDDSQF-FGVNVFFDYDLSRQHTRVSVGAEYGLDYGTFSTNAYFPLS 93 Query: 220 GWKKSPD----IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275 WK SPD + E+ A GWD+ E YLP + L QY G V Sbjct: 94 NWKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLDTRWKFGLTAGQYLGRYVEHSDGSLPS 153 Query: 276 KDPHAISAEVTYTPVP 291 K+P+ S + P P Sbjct: 154 KNPYHFSLSTEFRPDP 169 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 58/233 (24%), Positives = 96/233 (41%), Gaps = 11/233 (4%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 A V L+ + +A + + + T L + D +A +N K Sbjct: 11 APLIVAVSLTQADALKSALEMQDAQDKAEIMDLSTMLLAG-DVEALKNTAIDGVVEKGVG 69 Query: 107 EIQEWLGKYGTARVKLNVD-KDFSLKDSSLEMLYPIYDTPT--NMLFTQGAIHRTDDRTQ 163 + +L +Y V+LN + S L ++ P+ D N FTQG++ D+RT Sbjct: 70 VTKSFLEQYFPT-VELNFGAQGGSKPSGGLLVVAPLSDPDDIFNTYFTQGSVFYEDNRTT 128 Query: 164 SNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 N+G G+R S N + G+N F DH+ H R +G E +++AN Y + WK Sbjct: 129 LNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGLEARTTVWEINANKYWATTKWK 188 Query: 223 KSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD---EVGLFGKD 272 + +ER +G+DI A LP + Q+ + + G D Sbjct: 189 TGKN--GLEERALDGYDIEAGVPLPYMNWATVFVKNFQWDSEISGSKDIKGND 239 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 44/171 (25%), Positives = 71/171 (41%), Gaps = 8/171 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSD---ATRNFI-- 96 + A+AQ + + EK+ A A D D R F Sbjct: 70 TIRAQAQDPFDQNRLPDLGMMPESHEGEKHFAEMAKAFSEASMKNNDLDTGEQARQFAFG 129 Query: 97 --TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + + + NQ+++ WL +G+A V +NVD + S P+ D + ++Q Sbjct: 130 QVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSWFIPLQDKQRYLTWSQLG 189 Query: 155 IHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 + + D SN+G G R + + W+ G NTF D+ L + R G GAE W Sbjct: 190 LTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWG 239 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 49/172 (28%), Positives = 77/172 (44%), Gaps = 11/172 (6%) Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNV-DKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + A K +++I + G V L+ D D + S+ + I T + FTQ + Sbjct: 23 VASQALNKVSEKISNLIPGEGITEVSLDYNDGDEDQLNFSILGVRDIETTDNSNFFTQFS 82 Query: 155 IHRTD----DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 + + R NIG G+R S + ++M G NTF D DL+ R+G+G E L Sbjct: 83 LMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRLGLGIEAKGSILD 142 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 L+AN Y + S S + +E+ +GWD +P P A + Y Y Sbjct: 143 LTANSYTKIS---NSEVVNGDREQVLSGWDFNLTSQIPRAPW--ARINYNGY 189 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 128 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 35/150 (23%), Positives = 64/150 (42%), Gaps = 6/150 (4%) Query: 140 PIY--DTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF-SGNDWMAGVNTFIDHDLSRSHTR 196 P+Y + + +F +G D R + G+RH S N M G N H+ R+H R Sbjct: 68 PVYVSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQR 127 Query: 197 IGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASL 256 I GAE ++++N Y R + WK +++ +E+ G+D+ +P P + Sbjct: 128 ISYGAEIRTSVFEINSNYYHRLTDWKL-TGVDNNEEKARGGYDVELALAVPYVPSAHFRV 186 Query: 257 MYEQYYGDEVGLFGKDKRQKDPHAISAEVT 286 + + G + + D + V+ Sbjct: 187 KHFCWNG--IASNDSNNPIDDLKGNTFSVS 214 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 127 bits (319), Expect = 5e-28, Method: Composition-based stats. Identities = 48/156 (30%), Positives = 69/156 (44%), Gaps = 7/156 (4%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYD--TPTNMLFTQGAIHRTDD-RTQSNIGFGW 170 K+ TA + L+ + S L ++ PI D N++FTQ ++ +DD R N+GFG Sbjct: 8 KFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFGN 66 Query: 171 RHFSGND-WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIED 229 R +D + G N F DH+L H R +G E L AN Y SGWK + + Sbjct: 67 RKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKS--GLNN 124 Query: 230 YQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 E+ NG D+ LP P + G Sbjct: 125 INEKALNGSDVELGMPLPYLPWTNLYYRSFNWEGAS 160 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 57/222 (25%), Positives = 94/222 (42%), Gaps = 23/222 (10%) Query: 80 GTFLSSQPD---SDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDFSLKDS 133 T L+++ S+ N +A+ K + + + KY A V ++ + + + Sbjct: 86 WTSLNNKNGIEWSNQISNLALNLASNKLSDYATKTIQKYPFVLGASVNFDIRTEGA-TNI 144 Query: 134 SLEMLYPIYD-------TPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTF 185 ++L+ I D + + F + + + N G G RH G + +AGVN + Sbjct: 145 GGDVLFKIADFGLKDDESRDGIAFLHTKYTGSLSNDSTWNAGLGLRHLIGEELLAGVNGY 204 Query: 186 IDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKK-SPDIEDYQERPANGWDIR 241 D+ + S SH+R G+G E + L L+ N YI +G K S + DY ER GWD Sbjct: 205 WDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIAGTGTKTISTNNTDYYERVVPGWDFE 264 Query: 242 AEGYLPAWPQLGASLMYEQY----YGDEVGLFGKDKRQKDPH 279 LP+ P + ++ D G GK Q PH Sbjct: 265 LGYRLPSNPNIAFFARGFRWDYRNRNDNTGFQGKVTYQMTPH 306 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 46/191 (24%), Positives = 80/191 (41%), Gaps = 15/191 (7%) Query: 71 NVASFAANAGTFLSSQPDSDATRNFITGMATAKAN---QEIQEWL---GKYGTARVK--- 121 A+ N G L+ +A + + +A AN ++++ + + + Sbjct: 28 KFATIVKNIGNALNIGQGEEAVESEVNTLAVDAANAGLDQVEDKVLSTSNFTHFELSVGS 87 Query: 122 --LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSG-NDW 178 + +DK+ S + +Y + +T LF Q + ++RT N GFG RH + N Sbjct: 88 DTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNNRTTINTGFGARHINDANTV 147 Query: 179 MAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGW 238 + G N F D++L H R+G G E + AN Y S K+ QE +G+ Sbjct: 148 ITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVS---KTLTYNGIQETALDGY 204 Query: 239 DIRAEGYLPAW 249 D + LP + Sbjct: 205 DAKLTANLPYF 215 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 14/149 (9%) Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 L Q I ++ + G G + + GVN F D +++ + R+ +G++Y Sbjct: 124 LVWQANIDYKNEDILISNGIGI--LPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPN 181 Query: 208 --LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 LS+N Y SG D+ N DIRAEG + Q +SL E ++GD+ Sbjct: 182 YIFNLSSNIYFPLSGKGSEDDL-------VNSIDIRAEGAITPTVQFHSSL--EFFFGDD 232 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 + + + H +A + YTP+PL Q Sbjct: 233 IQINDDYDPTNNSHKFTAGLDYTPIPLLQ 261 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 114 bits (285), Expect = 4e-24, Method: Composition-based stats. Identities = 38/132 (28%), Positives = 64/132 (48%), Gaps = 5/132 (3%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + ++ I+ Q FP+A++ TP + + A + +LS +NN Sbjct: 4 MNNTLLDKLRKKKIFSYFIIASQFSFPIALSLTPTIQSYAATVEENKLST-----NTENN 58 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + +A + GT LSS DA ++ A +K N+EI+ W +YG A++ L VDK Sbjct: 59 NGRWLAQQTSQLGTILSSDNTHDAASQYLINQANSKVNREIENWFNQYGKAQINLGVDKH 118 Query: 128 FSLKDSSLEMLY 139 F+LK L+ L+ Sbjct: 119 FTLKTQKLKSLF 130 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 38/151 (25%), Positives = 68/151 (45%), Gaps = 12/151 (7%) Query: 76 AANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV----DKDFSLK 131 A G +S DA +N + + N + ++ ++G +++V + S Sbjct: 9 ATAGGKGVSE--VLDAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKGDEASYS 66 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAI----HRTDDRTQSNIGFGWRHFS-GNDWMAGVNTFI 186 + + L P+ + + + F QG++ D RT N+G G R G + G+N+F Sbjct: 67 IITTQPLAPLSEDGSRL-FWQGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIVGINSFY 125 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIR 217 D++ S H R+ +G EY R +LS N Y Sbjct: 126 DYEFSAKHKRMSLGGEYKRSNAELSVNKYWG 156 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 103 bits (257), Expect = 8e-21, Method: Composition-based stats. Identities = 40/170 (23%), Positives = 73/170 (42%), Gaps = 8/170 (4%) Query: 82 FLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTAR---VKLNVDKDFSLKDSSLEML 138 S+ + ++ G+A++ + LG+ + + L V + F SL + Sbjct: 29 ISSASSLENRVTSYFNGLASSLGTS-VSSLLGENSRVKYLDLNLGVQEHFK-PTISLTNV 86 Query: 139 YPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF-SGNDWMAGVNTFIDHDLSRSHTRI 197 I + + +F Q +++ ++ N+G G R + + + G+N F D+ SH R Sbjct: 87 NMISEYGNSAIFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQRN 146 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 G G E L +N Y SG + D E +GWD+R + +LP Sbjct: 147 GAGLEVLSSVFDLRSNIYDATSGIEAVSTSRD--EEAMDGWDMRLDYHLP 194 >UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN04_9BACT Length = 301 Score = 97.0 bits (240), Expect = 7e-19, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 19/200 (9%) Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV---------DKDFSLKDSSL 135 + + ++ A + + I+ W AR L ++ + Sbjct: 22 ASKAVNQIKDSAINKAFSYGDSAIESW------ARDNLTSLRLIEIETRSREGAKPTFRA 75 Query: 136 EMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH 194 L+ I N + +Q + DD N G +R + + + G+N F DH + H Sbjct: 76 ISLFEIGGNDFNKILSQLSYSTFDDDETINAGLIYRMMNSDMTVIYGLNIFYDHQFNTGH 135 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R G+G E ++ N Y + ++ E A G+D +P P Sbjct: 136 ARTGLGFEMKSSVYDVNINFYEAQTEIHH---VDGVPEVAAGGYDAEIGAQVPYLPWAKV 192 Query: 255 SLMYEQYYGDEVGLFGKDKR 274 Q+ + + + + Sbjct: 193 YYKAYQWNNETLNIKDGETL 212 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 82.8 bits (203), Expect = 1e-14, Method: Composition-based stats. Identities = 27/88 (30%), Positives = 49/88 (55%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 A ++ + N+E+ +AS + G+ L+ +S+ N G A+++A+ Sbjct: 158 DVQAQVSEKNLTPPPGNSSGNLEQQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASG 217 Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSS 134 + +WL ++GTAR+ L VD+DFSLK+S Sbjct: 218 VMTDWLSRFGTARITLGVDEDFSLKNSR 245 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 82.4 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 65/178 (36%), Gaps = 19/178 (10%) Query: 131 KDSSLEML-----YPI-YDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 K+ S++ YP + F Q + + G G R + G N Sbjct: 105 KNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVYNKKHIIGYNA 164 Query: 185 FIDHDLSRSHTR---IGVGAEYWRDYLKLSANGYIRAS-GWKKSPDIED--YQERPANGW 238 H +S ++ I +G EYW LK N Y + + +I + Y + P G+ Sbjct: 165 CYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNHHYYQYPKIGY 224 Query: 239 DIRAEGYLPAWPQLGASLMYEQYYGD----EVGLFGKDKRQKDPHAISAEVTYTPVPL 292 I A+ P + + +EQ D + + + + H + + Y P+P+ Sbjct: 225 QICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVSLEYQPIPM 279 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 82.0 bits (201), Expect = 2e-14, Method: Composition-based stats. Identities = 42/163 (25%), Positives = 70/163 (42%), Gaps = 17/163 (10%) Query: 87 PDSDATRNFITGMA-----TAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYP 140 D+DA + + + +A + + W+ R L++ + K +E L P Sbjct: 84 SDTDAVNSALQAVVMTGVHSAMHGSKAKPWMQ-----RTVLSLRFQKNWKPLYGVETLQP 138 Query: 141 IY---DTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHT 195 + +T ++ FTQ + D T +N+G G+R + +D G N F DH +H Sbjct: 139 LGHYDETSRHVWFTQERLANAADTGTTANVGIGYRRIAENDDHYYGGNLFYDHRFRGNHG 198 Query: 196 RIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGW 238 R+ VG EY N Y SG ++S D E +NG+ Sbjct: 199 RMSVGLEYVSGIGAFRMNWYRGVSG-ERSLDGATRMENVSNGY 240 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 78.5 bits (192), Expect = 2e-13, Method: Composition-based stats. Identities = 44/161 (27%), Positives = 65/161 (40%), Gaps = 6/161 (3%) Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPI--YD 143 + DA I +A + + + GK R L+ K S+E + P+ YD Sbjct: 74 SNVDAVNRAINAVAMSNVSNAMYGAKGKPWMRRTTLSFQFQEGWKPLYSVETVQPLGHYD 133 Query: 144 TPTN-MLFTQGAI-HRTDDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVG 200 + + FTQ I +D T NIG G+R S + + G + F DH H R+ G Sbjct: 134 NSSRDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHLFYDHRFLNRHNRLSAG 193 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 EY + N Y AS + ER ANG+ + Sbjct: 194 LEYMSGESEFRFNWYGSASDERVLDVNLHTLERVANGYTVE 234 >UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N7C0_9GAMM Length = 546 Score = 75.0 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 32/150 (21%), Positives = 51/150 (34%), Gaps = 17/150 (11%) Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIH-RTDDRTQSNIGFGWRHFSG 175 R+ + ++L P++ ++LF DD + NIG RH Sbjct: 32 NPRIDFEGKLGNDRSIAEADLLIPLWQNNDSLLFANIRGRLDNDDSYEGNIGLALRHMLD 91 Query: 176 NDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRA---SGWKKSPDIED 229 N W G + D ++ +G E L AN YI S + S D D Sbjct: 92 NGWNLGGYGYFDRRKSPYDNFFNQVTLGVEALSLNWDLRANTYIPVGESSYAEDSLDTVD 151 Query: 230 Y----------QERPANGWDIRAEGYLPAW 249 + +ER G+D +P + Sbjct: 152 FSGTTITYRAGEERSMRGYDAEVGWRIPVF 181 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 65.0 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 36/140 (25%), Positives = 55/140 (39%), Gaps = 17/140 (12%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM-----LFTQGAIHRTDD-RTQSNIGFG 169 G K++ D ++ +SS P Y + +G + D +IG G Sbjct: 127 GNGGEKISSDAYWNGGESSYIGDDPKYKAAARLAQQPSYLDKGETVQHDSLGVVGSIGAG 186 Query: 170 WRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI- 227 +R S N+ G+NTF D+ +R+G+G EY K+SAN Y S K P Sbjct: 187 YRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANVYHGLSEKKTKPYYF 246 Query: 228 ---------EDYQERPANGW 238 D P +G+ Sbjct: 247 ENSLVIVPRADEFHYPEDGY 266 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 64.6 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 25/96 (26%), Positives = 41/96 (42%), Gaps = 13/96 (13%) Query: 164 SNIGFGWRHFSGNDWMAGVNTFIDH-DLSR-SHTRIGVGAEYWRDYLKLSANGYIRASG- 220 ++G G RHFS N WM G+N + D+ + ++G+G E D ++ NGY+ + Sbjct: 131 GSVGIGLRHFSYNGWMVGLNGYYDYRRFNGWDLNQLGLGVELLGDCVEFRVNGYLPVNKN 190 Query: 221 -W-------KKSPDIEDYQER--PANGWDIRAEGYL 246 W +ER +G D +L Sbjct: 191 RWDQCCLFNYSGSYFATLRERGYVWSGLDTEIGTWL 226 >UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root RepID=B0C4D7_ACAM1 Length = 3597 Score = 62.7 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 45/260 (17%), Positives = 79/260 (30%), Gaps = 32/260 (12%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 F + T A ++ G T + N T + SD + Sbjct: 148 FTASPPRTLAEAGWTTAPQVVAINKGTTPSNLPAATSHRLVQAEPNVPTDTKTGEKSDTS 207 Query: 93 RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQ 152 + +A+ + + + + + F + L P + + F Sbjct: 208 NDT-----NTEADTSTNLGIPYFVDTEFRGSTRRQFGGINLRL----PFWQDDQSFAFAD 258 Query: 153 GAIHRTDDRTQ-SNIGFGWRHFSG----NDWMAGVNTFIDHDLSR---SHTRIGVGAEYW 204 + T N+G +R N W+ G + F D S + + +GAE Sbjct: 259 VHFEGGSNETFLGNLGLAYRRILNTSNENPWILGTHAFYDSKRSENGFQYHQGSLGAELV 318 Query: 205 RDYLKLSANGYIRAS-----GWKKSPDIEDYQERPANGW-------DIRAEGYLPAWPQL 252 + NGY+ S G + + Q R ANG + E A Sbjct: 319 NKKFEFRVNGYLPGSNPNVVGQRTINGVLGIQPR-ANGLGTNIVQQTLTLEARERALAGF 377 Query: 253 GASLMYEQYYGDEV--GLFG 270 + ++ D+V GLFG Sbjct: 378 DFEAGHRHHFNDKVSLGLFG 397 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 44/169 (26%), Positives = 65/169 (38%), Gaps = 45/169 (26%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 T +N+G G+R S ++ GVNTF DH S+ + RI G EY ++ AN Y + Sbjct: 141 GTVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLN 200 Query: 220 GWKKSPD----IEDYQE----------------RPANGWDIR----------AEGYLPAW 249 K P E Y E + +G+D+ A Y+ A+ Sbjct: 201 STKSEPYNVPLYEGYFEFLLDGGPAGYTVYKSQKALSGYDVSYARTFKNARWARAYVGAY 260 Query: 250 PQLGASLMYEQYYGDEVGL---FGKDK-------RQKDPHAISAEVTYT 288 G + +G+ L GK Q PH +S +V YT Sbjct: 261 HWNGLGVKT---HGEGPALALNVGKSHGWQAGTTLQLTPH-VSLDVGYT 305 >UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T1E3_JANMA Length = 553 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 49/169 (28%), Gaps = 23/169 (13%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 A A A QE Y + L + P+ ++ F + Sbjct: 22 AGAYAQNAGQEKWSTY----LDLEGKVGSKRDIGEANLFIPVVQDARSLYFANVRARMAN 77 Query: 160 DRTQ-SNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGY 215 ++G G RH W G F+D + S+ + +G E AN Y Sbjct: 78 GGDFEGSLGGGMRHMLETGWNLGAYGFVDRRRTTYNNSYDQATLGVEALGRQFDWRANVY 137 Query: 216 IRASGWKKSPDIED---------------YQERPANGWDIRAEGYLPAW 249 + + +ER G+DI A LP + Sbjct: 138 QPFGKKSTTLSSSNTGSVSGGSLFVTTTAQEERALPGFDIEAGWRLPVF 186 >UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4TV20_9PROT Length = 732 Score = 60.0 bits (144), Expect = 8e-08, Method: Composition-based stats. Identities = 32/151 (21%), Positives = 52/151 (34%), Gaps = 23/151 (15%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG--AIHRTDDRTQSNIGFGWRHFSGN 176 V ++ + + + + PI +N+LF + + R N G G+R + Sbjct: 35 SVDVSGKAGETRRIGEVNLFLPIAQDDSNLLFLDLRTSFDNLEQRE-GNFGLGYRAMQDS 93 Query: 177 DWMAGVNTFIDHDLSR-SH--TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY--- 230 W G F D S H ++I G E N Y+ +KS ++ED Sbjct: 94 GWNLGAYAFYDRRRSSEGHYFSQITTGLEALGQDFDARINAYLPIG--RKSYEVEDSARV 151 Query: 231 ------------QERPANGWDIRAEGYLPAW 249 ER +G D LP + Sbjct: 152 DLSGGSIQILSGLERAYHGGDAELGWRLPVF 182 >UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA50_9CHLA Length = 531 Score = 58.5 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 50/174 (28%), Gaps = 19/174 (10%) Query: 112 LGKYGTARVKLNVDKDF---SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIG 167 ++G R + + + P+ F H + R +N+G Sbjct: 266 FSEFGYVRGAYTFGEGIGIRHNYSTLTALFAPLVPYDDYYPFLDLRAHYIKNKRWAANVG 325 Query: 168 FGWRHFS-GNDWMAGVNTFIDHDLSRSH--TRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 G R ++ G N + D+ + + G G E++ + ++ N Y Sbjct: 326 GGLRWRDCMTGFIFGANLYYDYRNTTQTDFNQFGFGLEFFTNCFEMRLNAYFPVGDVTHC 385 Query: 225 PD--IEDY----------QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 D DY E G D+ P YY +V Sbjct: 386 EDHVFSDYIGPYYAVCGLTEIAQKGVDLEVGHTFWKCPYFSVFGAIGGYYYTDV 439 >UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BR71_9GAMM Length = 851 Score = 57.7 bits (138), Expect = 5e-07, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 46/144 (31%), Gaps = 17/144 (11%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRT-QSNIGFGWRHFSGND 177 +++ D S + + +L PIY T + +LFT+ D + + N+ G+R N Sbjct: 36 ESGVSIGTDNSSRGEAA-LLLPIYQTDSGLLFTELRGKLFDAGSKEGNLALGYRKMINNR 94 Query: 178 WMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG------------WK 222 W G+ D + + G E N Y S Sbjct: 95 WAIGMWVGRDIRTSEYGNRFHQEAWGLEALHPNWDFRINAYNALSSAQAYPQPVEAELIG 154 Query: 223 KSPDIEDYQERPANGWDIRAEGYL 246 I E P +G+D Sbjct: 155 NQLFITSAAEVPLSGYDFELGHRF 178 >UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickettsiella grylli RepID=A8PQA2_9COXI Length = 642 Score = 57.3 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 61/201 (30%), Gaps = 44/201 (21%) Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 + ++P+ + L+ A+ TD++ Q ++G G+R + + G F Sbjct: 43 DYTVGQADAMFPLSGDMSRNLYVDPALSYGTDNQNQFDVGLGYRWITNQAAIVGGYFFGG 102 Query: 188 HDLSRSHTRIGV---GAEYWRDYLKLSANGYIRASGWKKSPD------IEDYQE------ 232 + ++ R+ + G E + N YI + + E Sbjct: 103 YSRVDNNARLWIANPGIEAFGSRWDAHLNAYIPMGDRHYTAGTEIVHFFTGHSEFGRVFL 162 Query: 233 ---RPANGWDIRAEGYLPAWPQ--------------------LGASLMYEQYYGDEVGLF 269 +G DI+A L +P G + E + V L Sbjct: 163 MHQYAGSGADIKAGYQL--FPHSSLKGYLGSYYFSPAETNNVWGGAAGLEYWLTQGVKLI 220 Query: 270 GK---DKRQKDPHAISAEVTY 287 G D +A + + Sbjct: 221 GSYSYDNLHHSTYAFGIGLEW 241 >UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C087_9PLAN Length = 849 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 48/152 (31%), Gaps = 20/152 (13%) Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFID- 187 + + P+ ++ F + DD + + N G +R + W+AG+ F D Sbjct: 53 NDNGQGLLFIPLAQDEESLFFADLRGNIFDDSSAEGNFGLAYRRMVNDQWIAGMYGFYDV 112 Query: 188 --HDLSRSHTRIGVGAEYWRDYLKLSANGYIR--------------ASGWKKSPDIEDYQ 231 S + G E NGY+ SG + + + Sbjct: 113 RRSQYSNIFRQGSFGFELLSIEWDFRVNGYVPSQKQQRVDSLNTAYLSG--NNIVMRAGE 170 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYG 263 ER G D L ++P+ Y G Sbjct: 171 ERAYWGTDFEVGRLLKSFPESNLDAELRGYVG 202 >UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDV5_NEOSM Length = 696 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 57/176 (32%), Gaps = 17/176 (9%) Query: 101 TAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 + N + +G T + + ++ S L P+ N+++ D Sbjct: 148 QSDLNNTSRHTVGARFTVTNEFSDSNGGAVSMSEFGALLPLLSKVDNLIYIDLKSKLYDA 207 Query: 161 RT-QSNIGFGWRHFSGNDWMAGVNTFIDHDL-SRSHTRI-GVGAEYWRDYLKLSANGY-- 215 + + + G +R G+N F D + R +G E + L+ N Y Sbjct: 208 KEGEVSTGIVFRRQMSPLLTGGINVFTDVRFLPEGNYRWYSLGGEIFFKSFSLNGNYYRS 267 Query: 216 IRASGWKKSPDIEDY-----------QERPA-NGWDIRAEGYLPAWPQLGASLMYE 259 + + E + ER A NG+D+ L + + S + Sbjct: 268 NKKTTISSVKSFEFHDPDPGKAVIVLDERAAGNGYDLGLGLTLNKYINIHGSAFFF 323 >UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN12_PLALI Length = 2615 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 27/109 (24%), Positives = 46/109 (42%), Gaps = 5/109 (4%) Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQ-SNIGFGWR-H 172 Y R + N + + + L P+ + ++ Q + TD N+G R + Sbjct: 47 YFDVRNQSNSGVGYQHGFTQIGALTPLLNDGQFLIAPQARLLITDTSKIGVNVGLIGRVY 106 Query: 173 FSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRA 218 +G D + G N + D+D S +++IG G E L L AN Y+ Sbjct: 107 DAGRDRIWGANVYYDNDETTYSNRYSQIGFGFESLGQNLDLRANAYLPT 155 >UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RBA5_9CHLA Length = 306 Score = 55.4 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 48/120 (40%), Gaps = 7/120 (5%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKD--SSLEML-YPIYDTPTNMLFTQGAIHR-TDDRT 162 + EW+ A ++ V K ++ +S + P+ D+ + F IH +R Sbjct: 44 QANEWVFPPTLAYLQGVVGKGIGEQNGYASFGIFTIPLLDSNGQLFFD-ARIHNLRHERW 102 Query: 163 QSNIGFGWRHFSG-NDWMAGVNTFIDHDLSR-SHTRIGVGAEYWRDYLKLSANGYIRASG 220 +N+G G R + G+N F D+ +R + ++G G E NGY Sbjct: 103 AANVGVGTRIAIPCTNLFFGINFFYDYRRTRHDYHQLGPGLELIHPCWAFRINGYFPICD 162 >UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746965 Length = 1076 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 49/142 (34%), Gaps = 30/142 (21%) Query: 120 VKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGWR 171 V V + D + ++ P++ + +LF + + + ++G G+R Sbjct: 51 VNAGVKSSDAYTDGNFSIVAPVWSSLGAEGTLSGGVLFLEPYTSYGEGGEIAASLGLGYR 110 Query: 172 H-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYLK 209 + F G N F +D + ++GVG E+ YL+ Sbjct: 111 YLFGAQPISALTRKDAPQAGFFEEGVFVGTNVFIDMLDTEADNQFWQLGVGVEFGNRYLE 170 Query: 210 LSANGYIRASGWKKSPDIEDYQ 231 N YI S + + + + Sbjct: 171 FRGNYYIPLSDKQVAEQFKTRE 192 >UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VI48_9CYAN Length = 908 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 58/173 (33%), Gaps = 27/173 (15%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD--SSLEMLYPIYDTP-TNMLFTQGAI 155 +A +A E + L R+ + D + LE P+ TP N+ F +G + Sbjct: 22 LAQTEAESETADTLRI--KPRLGIGHTSSGGGFDGFTRLEGFVPLLQTPGKNLTFLEGRL 79 Query: 156 --HRTDDRTQSNIGFGWRHFSGNDW-MAGVNTFIDHDLSRSHT--RIGVGAEYWRDYLKL 210 D N+ G+R +S N + G D+ + +T ++G+G E Sbjct: 80 FLDNDDANLGGNLILGYRTYSANSHRIWGGYMSYDNRHTGHNTFNQLGLGIESLGTVWDF 139 Query: 211 SANGYIRASGWKKSPDIED-----------------YQERPANGWDIRAEGYL 246 NGY+ ++ +E GWD L Sbjct: 140 RVNGYLPIGDTRQGVGDAGVRDIFFRRNFLILEQGQNKEAAMGGWDAEVGAKL 192 >UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSX1_9GAMM Length = 808 Score = 53.1 bits (126), Expect = 1e-05, Method: Composition-based stats. Identities = 24/124 (19%), Positives = 39/124 (31%), Gaps = 10/124 (8%) Query: 102 AKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-D 160 EW + D S + L P Y + + +D D Sbjct: 19 GSVQAADSEWKP---NTQAYFAAGDDRSYFGLAG--LIPFYQDGKRLGYADLRYSSSDVD 73 Query: 161 RTQSNIGFGWRHFSGND-WMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYI 216 + N+G G+R + N+ + G D S R + ++ GAE D +N Y Sbjct: 74 TDEINLGAGFRSLNENETAIYGFYGSYDLRKSATERDYRQLTFGAELLTDTWDYRSNFYF 133 Query: 217 RASG 220 Sbjct: 134 PTGD 137 >UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A8_9CHLA Length = 225 Score = 52.7 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 3/69 (4%) Query: 154 AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH---TRIGVGAEYWRDYLKL 210 D + ++ G G R + + G+NT+ D+ R ++GVG E D + Sbjct: 14 GYRFNDGKWGASTGIGIRKELSDGCVLGLNTYYDYLRGRGRFSFHQVGVGFEMLSDCFDV 73 Query: 211 SANGYIRAS 219 NGY+ S Sbjct: 74 RINGYLPVS 82 >UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLE3_PLALI Length = 1304 Score = 52.7 bits (125), Expect = 2e-05, Method: Composition-based stats. Identities = 23/84 (27%), Positives = 32/84 (38%), Gaps = 4/84 (4%) Query: 138 LYPIYDTPTNMLFTQG-AIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH- 194 L P MLF DR +N+G G R++ N D + G N + D+D + Sbjct: 95 LMPYGFIENFMLFGDLRGFRSNSDRYGANVGGGARYYLENYDRIIGANAYFDYDETSGAP 154 Query: 195 -TRIGVGAEYWRDYLKLSANGYIR 217 +G G E Y N Y Sbjct: 155 FRDVGFGIETLGRYWDARVNAYFP 178 >UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174607D Length = 975 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 20/77 (25%), Positives = 29/77 (37%), Gaps = 22/77 (28%) Query: 166 IGFGWRH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEY 203 +G GWRH F + G N F +D + + ++GVG E Sbjct: 146 LGLGWRHLFGSQPVSALTRKDAPQASFLEEGFFVGANLFIDMLDTEANNQFWQLGVGIEA 205 Query: 204 WRDYLKLSANGYIRASG 220 YL++ N YI S Sbjct: 206 GTRYLEVRGNYYIPLSD 222 >UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6INS3_RHOCS Length = 922 Score = 51.9 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 30/138 (21%), Positives = 43/138 (31%), Gaps = 21/138 (15%) Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFID 187 + S+ + P+ D+ F D DR +NIG G R G + G + D Sbjct: 25 DGAEGSIAVAIPLADSDAARTFLDLRGSIDDADRRVANIGIGHRFRLG-AVVLGGAVYYD 83 Query: 188 H---DLSRSHTRIGVGAEYWRDYLKLSANGYIR----------------ASGWKKSPDIE 228 DL ++ V + L L AN Y SG I Sbjct: 84 RVRTDLESDFSQATVSLDLMTADLDLRANYYAPLDDEESVGTTVAGAPRLSGNHIVRSIF 143 Query: 229 DYQERPANGWDIRAEGYL 246 +E G+D L Sbjct: 144 QPREVTLKGFDAEVGYRL 161 >UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillonella dispar ATCC 17748 RepID=C4FS48_9FIRM Length = 421 Score = 51.6 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 20/75 (26%), Positives = 32/75 (42%), Gaps = 4/75 (5%) Query: 150 FTQGAIHRTDDRT---QSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWR 205 +T+ T ++G G+R S N+ GVN F+D + ++ RI G EY Sbjct: 154 YTKINTDEKSSETLGIIGSVGIGYRRLSRNEHAYVGVNAFVDRAFTGNYNRISGGVEYVN 213 Query: 206 DYLKLSANGYIRASG 220 ++ AN Y Sbjct: 214 GLNEVYANVYRGLGD 228 >UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E34 Length = 1016 Score = 49.6 bits (117), Expect = 1e-04, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 53/147 (36%), Gaps = 32/147 (21%) Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQSN-I 166 GT L ++ S+ + P+Y T ++LF + + + ++ + Sbjct: 52 LGTVTAGLKTSDAYTDGHFSI--VAPLYSTLGADATLEGSVLFIEPYVSYGEGGEIASSL 109 Query: 167 GFGWRH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYW 204 G G+RH F G + F +D + + ++GVG E Sbjct: 110 GLGFRHLFGSQPLTALSANNTAQAGFLDEGVFVGSSVFVDMLDTEANNQFWQLGVGIEAG 169 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDYQ 231 Y+++ N YI S + + + + Sbjct: 170 TRYVEVRGNYYIPLSDKQLAEETRTRE 196 >UniRef50_A8PQI7 Putative outer membrane autotransporter barrel domain n=5 Tax=Rickettsiella grylli RepID=A8PQI7_9COXI Length = 1171 Score = 49.2 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 58/197 (29%), Gaps = 38/197 (19%) Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGN 176 AR NV + + P+ + + A+ + ++G G+R Sbjct: 34 ARFSGNVYGSTKYVVGQADAMLPLVGDAQHNFYIDPALTSGSNWEGHGDLGLGYRWIQNG 93 Query: 177 DWMAGVNTFIDHDLSRSHTRI---GVGAEYWRDYLKLSANGYI----------------R 217 + G F +++ ++ RI G E NGY R Sbjct: 94 SAILGGYLFGEYNRMDNNVRIWTMNPGIEALGSRWDAHLNGYFVMDNRSKVVGTDLEFVR 153 Query: 218 ASGWKKSPDIEDYQERPANGWDIRAEGYL-PAWPQ-----------------LGASLMYE 259 G ++ D + NG D++ L P P LG ++ E Sbjct: 154 FRGHSAVYNLFDVTQNVGNGGDVKLGYQLFPKTPLKAFVGSYFFSPAETKNILGGAVGLE 213 Query: 260 QYYGDEVGLFGKDKRQK 276 + V +F K Sbjct: 214 YWANRNVKVFASYTYDK 230 >UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFX4_PLALI Length = 1567 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 57/158 (36%), Gaps = 24/158 (15%) Query: 133 SSLEMLYPIYDTPTNMLFTQG-AIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDL 190 +S+ P + +++FT + + +N+G G+R F D + GV+ + D D Sbjct: 108 TSVGGFLPFFRDENSLIFTDIRGLMTNGGKGGANVGVGYRQFVPELDRIFGVSGWYDFD- 166 Query: 191 SRSHT----RIGVGAEYWRDYLKLSANGYIRASGWKKSPD------------IEDYQERP 234 H + GV E YL NGY+ ++ + I + R Sbjct: 167 -NGHREAFNQFGVSFESIGRYLDWRVNGYLPVEDNEEISNQILGAAGFQNNFILLNRGRS 225 Query: 235 AN----GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 + G+D G P + G S YY + Sbjct: 226 VDSAYKGFDTEIGGPFPILGRYGMSGYVGMYYYANTDV 263 >UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA61_9CHLA Length = 188 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 15/53 (28%), Positives = 24/53 (45%), Gaps = 2/53 (3%) Query: 164 SNIGFGWRHFSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANG 214 N G G+R + N F DH S + ++G+G E + + +L NG Sbjct: 77 VNAGVGFRKIYAPQTIWDANLFYDHPKSSYDHYNQVGLGLELFHELWELRLNG 129 >UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8X2_9PLAN Length = 1606 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 39/155 (25%), Positives = 54/155 (34%), Gaps = 22/155 (14%) Query: 133 SSLEMLYPIYDTPT-NMLFTQGAIHRTDDRTQ-SNIGFGWRHFSGN-DWMAGVNTFIDHD 189 S+L +L P P +MLF TD N+G GWR ++ N D + V + D+D Sbjct: 142 SNLGVLMPFTINPEQSMLFLDLRAMVTDQGAGGVNLGAGWRAYNDNLDKIFTVAGWYDYD 201 Query: 190 LSR--SHTRIGVGAEYWRDYLKLSANGYIR-----------ASGWKKSPDIEDYQERPAN 236 + ++G+ E YL NGY SG Y R Sbjct: 202 DGHYQDYHQLGLSGEVIGQYLTTRVNGYFPINNNEIIISNNLSGSAYFQTDRIYLNRTRR 261 Query: 237 ------GWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 G D G LP + G YY + Sbjct: 262 SESSYGGVDAEVGGPLPVLGKFGIDGYVGGYYYNS 296 >UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPY3_9CYAN Length = 1370 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 43/114 (37%), Gaps = 7/114 (6%) Query: 117 TARVKLNVDKDFSLKD--SSLEMLYPIYDTP-TNMLFTQGAIHRTDDRTQS-NIGFGWRH 172 R + + D + L+ P+ P + + F +G + + N+ FG R Sbjct: 74 KPRWGIGYSTSGAGYDGFTRLDSFLPLLQNPGSTLTFLEGRLQLDNSANVGGNLLFGHRF 133 Query: 173 FSGN-DWMAGVNTFIDHDLSRSH--TRIGVGAEYWRDYLKLSANGYIRASGWKK 223 ++ + + + G D + + ++GVG E + + NGY + Sbjct: 134 YNQSLNRIFGGYLGFDRRDTGNSTFHQLGVGVETLGEVWDVRLNGYFPLGDTRD 187 >UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLP1_ACAM1 Length = 1022 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 29/153 (18%), Positives = 60/153 (39%), Gaps = 11/153 (7%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAIHR 157 + +A+ + ++G + N + + LE P++ P + F +G + Sbjct: 28 QPSTQASDL--RFSPRFG---IGANSPSSGTNTTTRLETFVPVWQKPGRALTFFEGRLLL 82 Query: 158 TDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH--TRIGVGAEYWRDYLKLSAN 213 D NI FG+R +S + + G + D + ++ ++ +G E + L N Sbjct: 83 DDQGNPGGNILFGFRQYSDDLKRIFGGHLGFDIRNTDNNTFQQLSLGIESLGKDVDLHLN 142 Query: 214 GYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 GY ++ ++ NG D R G + Sbjct: 143 GYWPVGSTRRQTRQRIFEVLQLNG-DPRFTGNI 174 >UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MK14_SALAR Length = 110 Score = 45.4 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 14/29 (48%), Positives = 19/29 (65%), Gaps = 2/29 (6%) Query: 267 GLFGKD--KRQKDPHAISAEVTYTPVPLT 293 G+FG RQ++PHAI+ + Y PVPL Sbjct: 3 GIFGDGEADRQRNPHAIALGLNYPPVPLV 31 >UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellular organisms RepID=B4D818_9BACT Length = 5429 Score = 45.4 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 43/103 (41%), Gaps = 4/103 (3%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGND 177 RV ++ D SL+ L P+ +L+ + +D +IGFG+RH Sbjct: 74 RVTFGLEFYEHQIDESLDTLVPLATPQNGVLYFNPKLSLSDRLNPSVSIGFGYRHLLKAR 133 Query: 178 WMAGVNTFIDHDLSR-SHT--RIGVGAEYWRDYLKLSANGYIR 217 + T + D + H + GVGAE ++ AN Y+ Sbjct: 134 RSSSGETSLRSDYTNFDHHVNQFGVGAEVMSRWVDFRANYYLP 176 >UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Campylobacter RepID=Q4HGX9_CAMCO Length = 267 Score = 44.6 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 62/169 (36%), Gaps = 15/169 (8%) Query: 50 AVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQ 109 ++ + N N +K A T D + N I N Sbjct: 13 SLLNADELDNALKNNQNKWQKFNYQATQKAPTIKEENIDFKSALNGILSNVLENKN---- 68 Query: 110 EWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFG 169 G + N+D F ++ ++ L +Y+ N L Q + T D + G Sbjct: 69 ------GIDKTDGNLD--FQNENVQIKNLNSLYEGENNSLLFQKEFYATQDSYNYSGGLI 120 Query: 170 WRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEY-WRDYLKLSANGYIR 217 R+ +D++ G+N FID + ++ GAE + ++K +N Y+ Sbjct: 121 NRY-EKDDFLLGINGFIDGQKEQKESK-SFGAELGYYQFVKAYSNYYVP 167 >UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZT8_9GAMM Length = 785 Score = 43.8 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 74/210 (35%), Gaps = 34/210 (16%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNI 166 ++ W G + R LNV ++D + L P+ ML+ GA+ T T + Sbjct: 31 QVWAWGGPW-KPRQTLNVQGGHGMQDY-YDALLPLSGNAERMLYANGALAATHHETGGEL 88 Query: 167 GFGWRHFS-GNDWMAGVNTFIDHDLSRSH---TRIGVGAEYWRDYLKLSANGYIRAS--- 219 G G+RH N+++ G + + H ++ +G E++ + A+ Y+ S Sbjct: 89 GLGYRHIILNNEYVIGGFALMGRYQTNYHNMFNQLTLGTEFFGSIWEGRAHLYLPVSRRT 148 Query: 220 -------------GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 G K E G D+ +P P+L YY + + Sbjct: 149 KFVRSRSEGLSFQGHKLFGIQTTTYEHAEGGADVEIGHVIPGIPKLRGFAG---YYNNGL 205 Query: 267 GLFGKD---------KRQKDPHAISAEVTY 287 G K+ R + + +Y Sbjct: 206 GNEHKNINGGYGRFEYRYNNHFTFTLGDSY 235 >UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZV2_9CYAN Length = 1059 Score = 42.7 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 32/135 (23%), Positives = 53/135 (39%), Gaps = 21/135 (15%) Query: 133 SSLEMLYPIYDTP-TNMLFTQGAIHRTDDRTQS-NIGFGWRHF-SGNDWMAGVNTFIDHD 189 SLE PI P + + F +G + D T I G R + S + + G D Sbjct: 67 FSLEGFVPITQNPGSTVTFLEGQLRLFTDSTMGGTILLGQRFYNSTQNRILGGYLSYDTR 126 Query: 190 LSRSH--TRIGVGAEYWRDYLKLSANGYIR-------------ASGWKKSPDIEDYQER- 233 + + +IG G E D L N Y+ G++++ + ++++R Sbjct: 127 DTGNSLFHQIGAGFERLGDDWDLRVNAYLPVGERRPEVDESFSLRGFQENNLLLNHRQRF 186 Query: 234 --PANGWDIRAEGYL 246 G+DI A G L Sbjct: 187 EAAMAGFDIEAGGRL 201 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P36943 Putative attaching and effacing protein homolog ... 385 e-105 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 345 9e-94 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 337 3e-91 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 336 4e-91 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 335 9e-91 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 335 1e-90 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 333 4e-90 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 330 4e-89 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 330 5e-89 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 327 3e-88 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 324 2e-87 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 324 2e-87 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 323 4e-87 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 322 7e-87 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 321 2e-86 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 321 2e-86 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 320 3e-86 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 320 5e-86 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 314 2e-84 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 314 3e-84 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 313 5e-84 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 311 2e-83 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 310 4e-83 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 308 2e-82 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 306 5e-82 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 303 5e-81 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 303 5e-81 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 302 9e-81 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 300 6e-80 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 297 3e-79 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 296 7e-79 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 290 3e-77 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 289 6e-77 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 289 9e-77 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 288 2e-76 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 287 3e-76 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 287 3e-76 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 287 3e-76 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 286 6e-76 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 284 2e-75 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 284 3e-75 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 283 6e-75 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 282 1e-74 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 280 4e-74 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 279 6e-74 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 278 2e-73 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 273 7e-72 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 271 2e-71 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 268 1e-70 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 263 5e-69 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 258 1e-67 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 257 3e-67 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 253 8e-66 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 248 1e-64 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 248 3e-64 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 244 2e-63 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 244 2e-63 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 240 4e-62 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 238 1e-61 UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 232 1e-59 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 229 7e-59 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 228 2e-58 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 220 5e-56 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 212 2e-53 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 209 7e-53 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 206 6e-52 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 191 2e-47 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 185 2e-45 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 177 3e-43 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 158 2e-37 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 157 3e-37 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 149 9e-35 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 148 3e-34 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 145 2e-33 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 137 4e-31 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 137 5e-31 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 133 5e-30 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 133 7e-30 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 133 9e-30 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 132 2e-29 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 131 2e-29 UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured b... 131 2e-29 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 130 6e-29 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 119 1e-25 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 118 3e-25 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 117 4e-25 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 117 6e-25 UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root Re... 115 1e-24 UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylo... 115 2e-24 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 113 5e-24 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 112 1e-23 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 107 4e-22 UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthin... 107 4e-22 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 107 6e-22 UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachl... 106 1e-21 UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickett... 103 5e-21 UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magneto... 102 2e-20 UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuni... 100 5e-20 UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Plancto... 99 1e-19 UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microco... 99 2e-19 UniRef50_A8PQI7 Putative outer membrane autotransporter barrel d... 98 5e-19 UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Plancto... 93 1e-17 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 92 2e-17 UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachl... 92 2e-17 UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodosp... 92 2e-17 UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Plancto... 92 3e-17 UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorick... 91 4e-17 UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microco... 89 1e-16 UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryoc... 87 5e-16 UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachl... 85 2e-15 UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma p... 85 2e-15 UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus ... 85 3e-15 UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Plancto... 81 6e-14 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 80 1e-13 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 80 1e-13 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 78 3e-13 UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=... 72 3e-11 UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=... 70 1e-10 UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachl... 68 3e-10 UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=... 66 1e-09 UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillon... 62 3e-08 Sequences not found previously or not previously below threshold: UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microco... 75 3e-12 UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum ... 75 4e-12 UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 74 6e-12 UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickett... 71 4e-11 UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Plancto... 70 7e-11 UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Plancto... 70 1e-10 UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastop... 70 1e-10 UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legione... 67 6e-10 UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanoth... 62 2e-08 UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyano... 62 2e-08 UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellu... 57 6e-07 UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=... 54 7e-06 UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Plancto... 53 9e-06 UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6... 49 2e-04 UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmone... 45 0.004 UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=... 44 0.006 UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicib... 43 0.009 UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Ca... 43 0.015 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 295/295 (100%), Positives = 295/295 (100%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT Sbjct: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV Sbjct: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA Sbjct: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI Sbjct: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ Sbjct: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 345 bits (886), Expect = 9e-94, Method: Composition-based stats. Identities = 134/291 (46%), Positives = 184/291 (63%), Gaps = 10/291 (3%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 H + ++ AR ++ PL A + + Sbjct: 78 HMSLEALRKLNQFRTFARGFDHLQPGDELDVPL---------APLPAVTWAEETPVPASA 128 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + ++ + +A A+ AG FL++ P DA + GMAT A+ E+Q+WL ++GTAR++L Sbjct: 129 SKEDLQAQKIAGIASQAGNFLANSPRGDAAASIARGMATGAASTEVQQWLSQFGTARLQL 188 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 +VD FSLK+S L++L P+Y+ P ++FTQG++HRTDDRTQ+N+G G R F + +M G Sbjct: 189 DVDNKFSLKNSQLDLLIPLYEQPDKLVFTQGSLHRTDDRTQTNLGMGMRWF-NDGYMLGG 247 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF+D+DLSR H R+G+G EYWRDYLK+ AN Y+R + W+ S D DYQERPANGWD+ Sbjct: 248 NTFLDYDLSRDHARMGMGVEYWRDYLKIGANNYLRLTNWRDSKDFADYQERPANGWDMSL 307 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG++PA PQLG +L YEQYYG EV LFGKD RQKDPHAI+ V YTP PL Sbjct: 308 EGWVPALPQLGGNLKYEQYYGKEVALFGKDNRQKDPHAITVGVNYTPFPLL 358 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 337 bits (864), Expect = 3e-91, Method: Composition-based stats. Identities = 126/286 (44%), Positives = 177/286 (61%), Gaps = 6/286 (2%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + A ++ P + + + T D+ Sbjct: 89 ALRKLNVLRTFAHGFDNLQPGDELDVPAVMP-----DGKPDSPAKTGDEQAATPPLKDDE 143 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 +A A+ AGT LS+ PD DA + G +A A+ ++Q+WL ++GTARV+L D+ Sbjct: 144 GAMKMADMASRAGTLLSNSPDGDAALSMARGQISAVASGQVQQWLNQFGTARVQLEADEH 203 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FSLK+S +++L P Y+ +LFTQG++HRTDDRTQ+N+GFG R+F+ +M G N F D Sbjct: 204 FSLKNSQVDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAP-SYMLGGNIFGD 262 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 +DLS H+R G+G EYWRD+LKLSANGY+R S W+ SP++++YQERPANGWDIRA+ +LP Sbjct: 263 YDLSHEHSRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANGWDIRAQAWLP 322 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + PQLG L YEQYYG V LFGK+ Q++P AI+A V +TP PL Sbjct: 323 SLPQLGGKLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPLL 368 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 336 bits (863), Expect = 4e-91, Method: Composition-based stats. Identities = 130/286 (45%), Positives = 184/286 (64%), Gaps = 17/286 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ P A ++ + N+ Sbjct: 93 LRKLNQFRTFARGFDNVRQGDELDVP---------------AQVSENNLTPPPGNSSGNL 137 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+DF Sbjct: 138 EQQIASTSQQIGSLLAEDMNSEQAANMARGWASSQASGAMTDWLSRFGTARITLGVDEDF 197 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S + L+P Y+TP N+ F+Q +HRTD+RTQ N G GWRHF+ WM+G+N F DH Sbjct: 198 SLKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTP-TWMSGINFFFDH 256 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DLSR H+R G+GAEYWRDYLKLS+NGY+R + W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 257 DLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWDVRAEGWLP 316 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWP LG L+YEQYYGDEV LF KD RQ +PHAI+A + YTP PL Sbjct: 317 AWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPLM 362 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 335 bits (860), Expect = 9e-91, Method: Composition-based stats. Identities = 109/280 (38%), Positives = 155/280 (55%), Gaps = 12/280 (4%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 ++ ++ ++ P + + N + + ++ +A Sbjct: 9 QFRSFSKPFIQLGSGDEIDIPRITPLP-----------EKITTAENAKTVSSSQYKERLA 57 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 T L+ A + +A +AN Q WL ++GTARV+LN+D + SLK S Sbjct: 58 HNLLKGATVLADDNTPLAAASMARSVAVGEANDAAQHWLSQFGTARVQLNLDNNLSLKGS 117 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 + +ML P+YD ++LF+Q + D R NIG G R N WM G N F D D++ Sbjct: 118 AFDMLLPLYDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQDN-WMYGANVFFDRDITGK 176 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 + RIG GAE W DYLKLSAN Y+R + W +S D DY ERPANG+D+R E YLPA+PQ+G Sbjct: 177 NNRIGFGAEAWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGYDLRVEAYLPAYPQIG 236 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +L YEQY G+EV LFGKD RQK+P+A +A + YTP+PL Sbjct: 237 TNLKYEQYKGNEVALFGKDDRQKNPYAFTAGINYTPIPLI 276 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 335 bits (859), Expect = 1e-90, Method: Composition-based stats. Identities = 132/286 (46%), Positives = 179/286 (62%), Gaps = 13/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ P A +N + Sbjct: 93 LRRLNQFRTFARGFDNVRQGEELDVPATTLQKSHEQQNAV-----------PPANGENTL 141 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E +AS + GT LS +S+ G A+++A+ + +WL +GTA++ L VD+DF Sbjct: 142 ENQIASTSQRVGTLLSQDMNSEQASGMARGWASSEASGAMTDWLNNFGTAKISLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S + L+P YDTP +LF+Q +HRTDDRTQ N G GWRHF+ WM+G+N F DH Sbjct: 202 SLKNSQFDFLHPWYDTPDYLLFSQHTLHRTDDRTQINTGLGWRHFTP-SWMSGINLFFDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DLSR H+R G+GAEYWRDYLKLS+N YI +GW+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLSRYHSRAGLGAEYWRDYLKLSSNAYIGLTGWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG L+YEQYYGDEV LF K+ RQ +PHAI+A + YTP PL Sbjct: 321 AWPQLGGKLVYEQYYGDEVALFDKNDRQSNPHAITAGLNYTPFPLL 366 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 333 bits (854), Expect = 4e-90, Method: Composition-based stats. Identities = 118/279 (42%), Positives = 159/279 (56%), Gaps = 7/279 (2%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y + AR + ++ P+ ++ A +A E Sbjct: 119 YRIFARGFEHVGVGDEIDIPVDMSSLNTQAGQAPKLSSAMREPSRA------EKEAQAVG 172 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 + G LSS S+A MAT AN+EIQ+WL KYGTARV+LN+DK+FSL +S+ Sbjct: 173 QLMSVGATLSSTRPSEAAAGMARSMATNAANEEIQQWLSKYGTARVQLNLDKNFSLSESA 232 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 L+ P++D+ FTQ D R N+G G R + WM GVN F DHDL+ + Sbjct: 233 LDWFIPVWDSANLTAFTQLGARNKDRRNTINLGVGARTLL-DRWMLGVNMFYDHDLTGHN 291 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 +R+G+GAE W DYL+LS NGY+R S W +S D DY ER ANG+DIRA +LPA PQLG Sbjct: 292 SRLGIGAEAWTDYLQLSTNGYMRLSNWHQSRDFADYDERAANGFDIRANAWLPALPQLGG 351 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 L+YEQY G+ V LFGK+ Q++P+A++A V YTP PL Sbjct: 352 KLVYEQYIGENVALFGKENLQRNPYALTAGVNYTPFPLL 390 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 330 bits (845), Expect = 4e-89, Method: Composition-based stats. Identities = 143/286 (50%), Positives = 191/286 (66%), Gaps = 12/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 92 LRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM------ 143 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DF Sbjct: 144 --QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK SS E L+P Y+TP N++F+Q +HRTDDRTQ+N G GWR+F+ + WM+GVN FIDH Sbjct: 202 SLKSSSFEFLHPWYETPDNLVFSQHTLHRTDDRTQTNHGIGWRYFT-SSWMSGVNMFIDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG L+YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 321 AWPQLGGKLVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 330 bits (845), Expect = 5e-89, Method: Composition-based stats. Identities = 141/286 (49%), Positives = 191/286 (66%), Gaps = 12/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 92 LRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM------ 143 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DF Sbjct: 144 --QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK SS E L+P Y+TP N++F+Q +HRTD+RTQ+N G GWR+F+ + WM+GVN FIDH Sbjct: 202 SLKSSSFEFLHPWYETPDNLVFSQHTLHRTDNRTQTNHGIGWRYFT-SSWMSGVNMFIDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG ++YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 321 AWPQLGGKVVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 327 bits (838), Expect = 3e-88, Method: Composition-based stats. Identities = 147/287 (51%), Positives = 185/287 (64%), Gaps = 11/287 (3%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 RF S L R VA I QVLFP+ A ++ + Sbjct: 1 MISARFHSSRLTRAVASLCIVTQVLFPV---------ASTAGHRVAAPQAAPAVLSEQDA 51 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 VA A L S +S G AT+ A QEWL ++GT RV L +D+D Sbjct: 52 TAAQVAGMTTQAAGMLQSGMNSRQAAEMARGYATSTAQSAFQEWLSQWGTVRVTLGLDED 111 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 F+LK S+ ++L P +DTP N+LFTQ + HRTDDR Q N G GWRHF+ + +MAGVN F D Sbjct: 112 FTLKGSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFAPD-YMAGVNLFFD 170 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYL 246 HDL+R H+R+G+G EYWRD LKL ANGY+R SGW+ +P+++ DY+ RPANGWD+RAEGYL Sbjct: 171 HDLTRYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARPANGWDVRAEGYL 230 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PA+PQLGA+LMYEQYYGDEV LFGKDKRQ+DPHA +A ++YTPVPL Sbjct: 231 PAYPQLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLI 277 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 324 bits (831), Expect = 2e-87, Method: Composition-based stats. Identities = 122/285 (42%), Positives = 161/285 (56%), Gaps = 16/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + Y ++ A ++ P + N +V Sbjct: 113 LKKLNAYRTFSKPFASLTTGDEIEVPRKESSFF---------------SNNPNENNKKDV 157 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 + +A A AG LS+ SDA N T + N Q+WL ++GTARV+LNVD DF Sbjct: 158 DDLLARNAMGAGKLLSNDNTSDAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDF 217 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 L +S+L++L P+ D+ +++LFTQ + D R NIG G R + G+ WM G NTF D+ Sbjct: 218 KLDNSALDLLVPLKDSESSLLFTQLGVRNKDSRNTVNIGAGIRQYQGD-WMYGANTFFDN 276 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 DL+ + R+GVGAE DYLK SAN Y +GW +S D Y ERPA+G+DIR E YLPA Sbjct: 277 DLTGKNRRVGVGAEVATDYLKFSANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPA 336 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +PQLG LMYE+Y GDEV LFGKD RQKDPHA++ V YTPVPL Sbjct: 337 YPQLGGKLMYEKYRGDEVALFGKDDRQKDPHAVTLGVNYTPVPLV 381 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 324 bits (830), Expect = 2e-87, Method: Composition-based stats. Identities = 124/296 (41%), Positives = 174/296 (58%), Gaps = 11/296 (3%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 M +++ H + + R + +A I +Q+ P ++ + + +A + S Sbjct: 12 MLFFRSTHMRSKTR-----KLLACIQIVLQLAPPSSLIYLS--SVFNANAEEITSSAEKE 64 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 + +VA A AG+ LSS SDA + + T KA QEWL ++GTARV Sbjct: 65 QGNPSDQNASSVAQTAVQAGSLLSSDNASDALGSAVVSAVTGKAASSAQEWLSQFGTARV 124 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 ++ D+ F+L DS L++L P+Y+ N+LFTQ R DDR N GFG+RHF + WM Sbjct: 125 NISTDEHFTLSDSELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHF-NDGWMW 183 Query: 181 GVNTFIDHDLS-RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWD 239 G N F D +S H R+G+ E DYL +SANGY+R S W S +DY ER A+G+D Sbjct: 184 GTNVFYDRQVSGNQHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFD 243 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 IRA GYLPA+PQLGA+++YEQY+GD VGLFG D RQKDP+A++ + YTPVPL Sbjct: 244 IRATGYLPAYPQLGANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLV 299 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 323 bits (829), Expect = 4e-87, Method: Composition-based stats. Identities = 121/287 (42%), Positives = 161/287 (56%), Gaps = 8/287 (2%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + ++ A + ++ P + + P + + + Sbjct: 117 SQLKKINQFRKFAHGIDKIGAGDEIDIPHSGSSL-------TKPGSPAAATPLSPHADTS 169 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 E VA G L+S S+A ATA AN EI +WL KYGTA+++LN+DK Sbjct: 170 ERESRVAGQLMGVGRVLASPQSSNAASEMARSWATAAANDEIVKWLSKYGTAQLQLNIDK 229 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 +FSL S+L+ L P YDTPT FTQ D R NIG G R S N W+ GVN F Sbjct: 230 NFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTLNIGIGTRTLSNN-WLFGVNAFY 288 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 DHDLS ++R+G+G+E W DYL+LS NGY+R S W +S D+ DY ERPANG+D+RA ++ Sbjct: 289 DHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSRDLADYNERPANGFDVRANAWM 348 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P PQLG LMYEQY+GD VGLFGKD Q++P+A + V YTP PL Sbjct: 349 PTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGVNYTPFPLL 395 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 322 bits (826), Expect = 7e-87, Method: Composition-based stats. Identities = 128/309 (41%), Positives = 180/309 (58%), Gaps = 19/309 (6%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLA-VTFTPVMAARAQHAVQPRLSM---- 57 H + + ++ P + P + A + Sbjct: 83 HLTPEALRKLNQRRTFTYGFDNLQPGDKLNVPAIKLDDEPDVPAARLDNKANLPAARLDN 142 Query: 58 -------------GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA 104 + D+ + +A A+ AG FLS P+ DA + G TA+A Sbjct: 143 KPDVPAIIWGQEGSAASALGDDAGARKMADVASRAGAFLSDNPNGDAALSLARGEVTAEA 202 Query: 105 NQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS 164 + ++Q+WL ++GTARV+L+ D+ FS K+S ++L P+Y+ +++FTQG++HRTDDRTQ Sbjct: 203 SGQLQQWLNQFGTARVQLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQV 262 Query: 165 NIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 N+GFG R+F+ +M G N F D+DLSR+H+R G+G EYWRD+LKLSANGY+R S W S Sbjct: 263 NLGFGLRYFAP-SYMLGGNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNS 321 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 D +DYQERPANGWDIRA+ +LP+ PQLG L YEQYYG V LFGK+ Q+DP AI+A Sbjct: 322 SDFKDYQERPANGWDIRAQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAG 381 Query: 285 VTYTPVPLT 293 V +TP PL Sbjct: 382 VNFTPFPLL 390 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. Identities = 117/279 (41%), Positives = 156/279 (55%), Gaps = 17/279 (6%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y +R ++ P + V + E +A Sbjct: 103 YRTFSRPFTALTTGDEIDIPRKASPFSVDNNKDNRLSV----------------ENTLAG 146 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 A T LS+ + + + A+ + N Q+WL ++GTARV+LN++ DF L S+ Sbjct: 147 HAVAGATALSNGDVAKSGERMVRSAASNEFNNSAQQWLSQFGTARVQLNINDDFHLDGSA 206 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 ++L P+YD ++LFTQ D R N+G G R F GN WM G NTF D+DL+ + Sbjct: 207 ADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQGN-WMYGANTFFDNDLTGKN 265 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 RIGVGAE W DYLKLSAN Y + W +S D DY ERPANG+D+RAE YLP++PQLG Sbjct: 266 RRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYNERPANGYDLRAEAYLPSYPQLGG 325 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 MYE+Y GD+V LFGKD RQK+PHAI+A V YTP+PL Sbjct: 326 KAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIPLV 364 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 321 bits (822), Expect = 2e-86, Method: Composition-based stats. Identities = 124/289 (42%), Positives = 172/289 (59%), Gaps = 16/289 (5%) Query: 6 TGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD 65 + F S+ ++ + W+ I +Q+LFPL F PV AA A + T Sbjct: 5 SIKNNNSFFLSLKSKLIIWSQIVLQILFPLFTVF-PVHAAPAT------TTKETTVAMPY 57 Query: 66 NNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVD 125 + +AS S+ +D ++ TGMAT+ A +Q+WL ++GTARV+LNVD Sbjct: 58 SQELSTLAS---------STASGTDGAKSAATGMATSAAASSVQQWLSQFGTARVQLNVD 108 Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 + + DS++++L P+YD +LFTQ + D RT N+G G R F +WM G N F Sbjct: 109 DNGNWDDSAVDLLAPLYDNKKAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFGGNVF 168 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 D D + + R+G GAE W +YLKLSAN Y+ + W S D DY E+PA+G+DIRAEGY Sbjct: 169 FDDDFTGKNRRVGFGAEAWTNYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIRAEGY 228 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 LPA+PQLGA LMYEQYYGD+V LF D Q +P A++ ++YTPVPL Q Sbjct: 229 LPAYPQLGAKLMYEQYYGDKVALFDTDHLQSNPSAVTTGISYTPVPLVQ 277 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 320 bits (821), Expect = 3e-86, Method: Composition-based stats. Identities = 122/292 (41%), Positives = 168/292 (57%), Gaps = 17/292 (5%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 H ++ ++ + S+L + V WA I +Q+ FPL V P A+ A + +S +T Sbjct: 1 MHEQSIMEKNTLKISLLKKIVIWAQILLQIAFPLLV--LPAHASSGPGATETDMSDASTL 58 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 + + +DA +N T +AT A ++EWL +GTA+V Sbjct: 59 SASLASSAAQ---------------NGADAMKNTATHLATTHAASTVEEWLSHFGTAQVT 103 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 L+VD + + +S+ + L P+YD ++LFTQ I D RT NIG G R F DWM G Sbjct: 104 LDVDDNGNWDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIGLGVRTFYVRDWMFG 163 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F D D + + RIG GAE W +YLKLSAN YI S W S D ++Y E+PA+G+D+R Sbjct: 164 GNVFFDDDFTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGDFDNYNEKPADGYDVR 223 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AEGYLP++PQLGA LMYEQYYGD V LF KD Q +P A++ + YTPVPL Sbjct: 224 AEGYLPSFPQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLNYTPVPLI 275 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 320 bits (819), Expect = 5e-86, Method: Composition-based stats. Identities = 118/279 (42%), Positives = 164/279 (58%), Gaps = 15/279 (5%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y ++ ++ P + + + + ++A +E +AS Sbjct: 71 YRTFSKPFTALTSGDEIDIPRKASPFSIDSEKNKNADVL--------------LENKLAS 116 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 T L++ + ++ I A + N Q+WL ++GTARV++NV+ DF L S+ Sbjct: 117 HVQTGATALATSNAAKSSERMIRSAANNEFNSSAQQWLSQFGTARVQMNVNDDFKLDGSA 176 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 +++L PIYD ++LFTQ D+R NIG G R F N WM GVNTF D+D++ + Sbjct: 177 VDVLVPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQNN-WMYGVNTFFDNDMTGKN 235 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R+GVGAE W DYLKLSAN YI S W +S D DY ERPANG+D+RAE YLP+ PQLG Sbjct: 236 RRVGVGAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRAEAYLPSHPQLGG 295 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYE+Y G+EV LFGKD RQK+PHA++A V YTP+PL Sbjct: 296 KLMYEKYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPLL 334 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 314 bits (805), Expect = 2e-84, Method: Composition-based stats. Identities = 130/291 (44%), Positives = 187/291 (64%), Gaps = 17/291 (5%) Query: 8 HKQPRFRYS-----VLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + S + + VAW+ I++Q L+P ++FTP ++ ++ + Sbjct: 1 MSNKKISRSNGATGPVNKVVAWSTIALQALYPALLSFTPTISH--------ASAVKASQA 52 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 A+ + ++S AA AG + + +F A+A +E+ EWL KYG AR++L Sbjct: 53 AAEQQELRGLSSLAAQAGRSIENG----HAGSFAANTVPAQATKEVVEWLQKYGNARIQL 108 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD FSLKDS+ + LYP D ++LF+Q ++HRTDDRTQ+NIG G+R+F+ ++ M G Sbjct: 109 NVDDAFSLKDSAFDFLYPWIDKKQHVLFSQTSLHRTDDRTQTNIGMGYRYFTADNSMLGA 168 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N F D+DLSR H R+G G EYWRDYL+ AN Y+R S WK S D++DYQERPA+GWDI Sbjct: 169 NLFYDYDLSRHHARMGAGVEYWRDYLRAGANAYLRLSKWKDSHDLDDYQERPADGWDIYT 228 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +G+LP++PQLGASL YE+YYG VGLFG D Q++P+A + ++YTPVPL Sbjct: 229 QGWLPSYPQLGASLKYEKYYGKNVGLFGSDHLQENPYAFTGGISYTPVPLV 279 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 314 bits (804), Expect = 3e-84, Method: Composition-based stats. Identities = 124/288 (43%), Positives = 179/288 (62%), Gaps = 7/288 (2%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTA 64 K +K + + +++ V WANI +Q +FPL++ FTP + A + + + Sbjct: 13 KQLNKNKQLNKTRISKSVVWANIVIQAIFPLSIAFTPAVMAAET------VGASDEKPRS 66 Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 + E++ A+ A + L++ + + G A N+ +Q+W ++G+A+V+LN+ Sbjct: 67 ASQAEQSTANAATRLASILTNDDSAKQASSIARGTAANAGNEALQKWFNQFGSAKVQLNL 126 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+ SLK S L++L P+ D+P + FTQ DDR N+G G RHF M G N Sbjct: 127 DEKLSLKGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQ-MLGYNL 185 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+DHD S SHTRIGVGAEY RD++ L+ANGY SGWK SPD++ Y E+ ANG+D+R+E Sbjct: 186 FVDHDASYSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSEA 245 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 YLP PQLG L+YEQY+GDEVGLFG D RQK+P A++ V YTP+PL Sbjct: 246 YLPTLPQLGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPL 293 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 313 bits (802), Expect = 5e-84, Method: Composition-based stats. Identities = 126/285 (44%), Positives = 166/285 (58%), Gaps = 16/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + + ++ A ++ P Q L+ NT +T Sbjct: 43 LKKINQLRTFSKPFAKLQAGDELEIP-------------QAQSNLGLAPENTALTDTQTT 89 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+N+A A + L+S A + G+A ANQ WL +GTAR++ NVD Sbjct: 90 ERNLAKTATTSAQMLNSGD--KAAARQLRGLAVGNANQAANSWLNNFGTARLQANVDDRG 147 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 L S +ML P YDTP+ M FTQ I R D RT +N+G G RHF + WM G N F+D Sbjct: 148 DLDGSQFDMLMPFYDTPSQMAFTQFGIRRIDKRTTANLGIGIRHFIDD-WMVGYNLFLDR 206 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D++R HTR+G GAEY RDYLKL+ANGY+R S W+ SPD Y ERPA G+D+RAE YLP+ Sbjct: 207 DITRDHTRVGAGAEYARDYLKLAANGYLRLSDWRDSPDFSSYSERPATGFDLRAEAYLPS 266 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG LMYEQY+G++VGLFGKD RQ++P AI+A + YTP+PL Sbjct: 267 LPQLGGKLMYEQYFGNDVGLFGKDNRQQNPAAITAGINYTPIPLV 311 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 311 bits (796), Expect = 2e-83, Method: Composition-based stats. Identities = 127/286 (44%), Positives = 168/286 (58%), Gaps = 22/286 (7%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + S + R + +Q F + F V A Sbjct: 1 MVKTNPSSSQVRRVAVYGLAGLQFFFQVTPAFAGVFQAD--------------------- 39 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 E++VA A AG L DA R +T A+ +A + +WL ++GTA+ +L+V D Sbjct: 40 -EQSVAQTAMEAGRVLQGSNSGDAARQMLTSQASGQAADAVTQWLNQFGTAKTQLSVVSD 98 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FSLK SSL++L P Y+TP N+LFTQ + D R +N G G R+F+ N WM G N F D Sbjct: 99 FSLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGWMLGYNVFYD 158 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 D ++ R G+G E WRDYLKLSANGY R S W++SP + DY ERPA+GWDIRAEG+LP Sbjct: 159 VDWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGWDIRAEGWLP 218 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A+PQLG L+YEQYYG+EV LFG+ +RQK+PHAI+A VT+TP L Sbjct: 219 AYPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLL 264 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 310 bits (794), Expect = 4e-83, Method: Composition-based stats. Identities = 108/285 (37%), Positives = 169/285 (59%), Gaps = 10/285 (3%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ ++ ++ P+ A ++ + Sbjct: 89 LRKLNQFRTFSQNFENLQPGDELDIPM---------APLPIVEWDDDKPEIVLPSSASEN 139 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E VA A+ AG F S+ PD + T+ F + T A+ Q+W ++G++++ L DK F Sbjct: 140 EIRVAQLASQAGKFFSTNPDQEKTKAFARELLTTAASSYAQDWFNRFGSSQIHLEADKKF 199 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S +++L P Y+T N++F+Q ++HR + R ++N+G G R + G M G NTF D+ Sbjct: 200 SLKNSQIDLLMPWYETEDNLIFSQTSLHRKEGRIETNLGLGARWY-GEGQMIGGNTFFDY 258 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D+SR H+R+G+G EY RD+LKLSAN Y R SGW+ S D+ D+ RP+NGWD+RAEG+LP+ Sbjct: 259 DISRKHSRLGLGVEYRRDFLKLSANSYHRLSGWRSSRDLADHSARPSNGWDVRAEGWLPS 318 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +P +G L YEQYYGD V LFG Q++P++I+A + YTP+PL Sbjct: 319 YPHIGGKLTYEQYYGDSVALFGTKNLQQNPYSITAGLNYTPIPLV 363 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. Identities = 118/279 (42%), Positives = 154/279 (55%), Gaps = 19/279 (6%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y A+ + ++ P + V A E VAS Sbjct: 100 YRTFAKPFTALTVGDEIDVPRKKSPFTVDNNVTVPA------------------ENGVAS 141 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 AA LS + + N + + Q+WLG++GTAR++ N + DF S+ Sbjct: 142 NAAAGAALLSHGDAAKSAENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDDFEFDSSA 201 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 +++L P+YD ++ FTQ D R NIG G R F N WM G NTF D+D++ ++ Sbjct: 202 IDVLIPLYDNQKSLFFTQLGGRNKDSRNTINIGAGVRAFLTN-WMYGANTFFDNDITGNN 260 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R+G+GAE W DYLKLSANGY + W +S D DY ERPANG+D+RAE YLPA+PQLG Sbjct: 261 RRVGIGAEAWTDYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLPAYPQLGG 320 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYEQY GDEV LFGKDKRQKDPHAI+ + YTPV L Sbjct: 321 KLMYEQYNGDEVALFGKDKRQKDPHAITVGINYTPVSLV 359 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 306 bits (784), Expect = 5e-82, Method: Composition-based stats. Identities = 125/290 (43%), Positives = 183/290 (63%), Gaps = 20/290 (6%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + + ++ ++ PL + + + + Sbjct: 80 LKKLNQLRKFSKPFEALTTGDEIDIPLIG---------------NNFTTQSLPHSTSSPN 124 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN----QEIQEWLGKYGTARVKLNV 124 + +A A+ G L + P+S+A + A + AN QEI +WL G RVKL+ Sbjct: 125 DSLLAQSASQVGNTLQNNPNSEALNDLARSSALSAANAKAGQEISDWLNGKGKVRVKLDA 184 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+DFS+K+S L++L P++++ ++M+F+QG++HRTDDRTQSN+G G+R+F+ + + G NT Sbjct: 185 DRDFSVKNSQLDLLVPLWESESHMIFSQGSVHRTDDRTQSNLGLGYRYFA-DSYALGANT 243 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F DHD SRSH+R+G+GAEY R++ KL+ NGY+R S WK SPD ++Y+ERPANGWDIRAEG Sbjct: 244 FYDHDWSRSHSRLGLGAEYQRNFFKLATNGYLRLSNWKDSPDFDNYEERPANGWDIRAEG 303 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 YLP++P LGA L YEQYYGD VGLFGKD +QK+PHAI+ Y+P PL + Sbjct: 304 YLPSYPGLGAKLAYEQYYGDNVGLFGKDNQQKNPHAITFGGNYSPFPLLK 353 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 303 bits (776), Expect = 5e-81, Method: Composition-based stats. Identities = 116/281 (41%), Positives = 169/281 (60%), Gaps = 9/281 (3%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAAR-AQHAVQPRLSMGNTTVTADNNVEKNV 72 ++ + + ++ P+ P++A + A + N V +NN + Sbjct: 98 QFRTFPQGFEQVSSGEEIDIPV-----PIIAEQGATKVSVVTPNEVNCPVGIENNPQTK- 151 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 + L+S + + + ++ AN+EIQ+WLG+YGTA+V+LNVD FSL++ Sbjct: 152 -EYVKRVSALLASSDPTTVATDVVRSEVSSTANKEIQKWLGQYGTAQVRLNVDDKFSLRE 210 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 SSL+ L+ YD+ + ++FTQ I D R +N+G G R GN W+ G NTF D+DL+ Sbjct: 211 SSLDWLFSFYDSSSAIIFTQLGIRNKDHRNTANLGLGGRISMGN-WILGANTFYDNDLTG 269 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 ++R+G GAE W DYL+LSAN Y+R + W +S D D+ ERPANG+DIR +LP PQL Sbjct: 270 INSRLGFGAEAWTDYLQLSANSYMRLNNWHQSRDFIDHDERPANGFDIRTNAWLPVLPQL 329 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G LMYEQY GD V LFGKDK QK+P+A++A +TYTP PL Sbjct: 330 GGKLMYEQYSGDSVALFGKDKLQKNPYAVTAGITYTPFPLL 370 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 303 bits (776), Expect = 5e-81, Method: Composition-based stats. Identities = 116/265 (43%), Positives = 158/265 (59%), Gaps = 17/265 (6%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 +Q+LFP V +A A QP +++ T V S A GT ++ Sbjct: 1 MQLLFPF------VTSAYTYAASQPPVAVPVPT---------QVTSLLAAGGT--ETENG 43 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 S+ ++ T MAT A ++EWL +GTA V LN D++ + +SS++ L P+YD ++ Sbjct: 44 SNGLKSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNKKSV 103 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 LFTQ + D RT NIG G R F+ +WM G N F D D + + R+G+GAE W DYL Sbjct: 104 LFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWTDYL 163 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 KL+AN YI + W S D DY E+PA+G+DIRAEGYLPA+PQLGA +MYEQYYG+ V L Sbjct: 164 KLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGENVAL 223 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 F KD Q DP A++ + YTP+ L Sbjct: 224 FDKDHLQNDPSAVTMGLNYTPISLV 248 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 302 bits (774), Expect = 9e-81, Method: Composition-based stats. Identities = 137/284 (48%), Positives = 181/284 (63%), Gaps = 5/284 (1%) Query: 11 PRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD-NNVE 69 + R R A +Q P P+MAA+ + G + + N Sbjct: 84 NQLRELNQLRTFAHGLNGLQ---PGDDVDVPLMAAKDNKNASDAAAPGRSASAEEGNEQA 140 Query: 70 KNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS 129 + VA +A+ AG+FL+S SDA + MAT +A Q+WL +GTARV+L+ DK+FS Sbjct: 141 QKVAGYASQAGSFLASSAKSDAAASMARNMATVEAGGAFQQWLSHFGTARVQLDADKNFS 200 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHD 189 LK+S ++L P+YD N +FTQG++HRTD RTQ+++G GWRH S + +M G N F D D Sbjct: 201 LKNSQFDLLLPLYDQGDNFVFTQGSLHRTDSRTQASLGAGWRH-STSTYMLGGNLFGDFD 259 Query: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAW 249 LSR H R G G EYWR++LKL N Y+R SGWK SPD+EDYQERPANGWD+R + ++P+ Sbjct: 260 LSRDHARAGAGLEYWRNFLKLGVNSYLRLSGWKDSPDLEDYQERPANGWDVRGQAWVPSL 319 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG L YEQYYG EV LFG D RQ++PHAI+ + YTPVPL Sbjct: 320 PQLGGKLTYEQYYGKEVALFGVDSRQRNPHAITVGINYTPVPLI 363 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 300 bits (767), Expect = 6e-80, Method: Composition-based stats. Identities = 123/304 (40%), Positives = 167/304 (54%), Gaps = 18/304 (5%) Query: 6 TGHKQPRFRYSVLARCV-------AWANISVQVLFPLAVTFTPV--MAARAQHAVQPRLS 56 Q L + N +L P P+ +A +A + P L Sbjct: 49 QIALQSGLDLRTLRKLNNGSLDKRDELNAGESLLLPANSPLFPLDPLAGKAIASNLPELG 108 Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA--------NQEI 108 MGN V ++ E+ A+ A G + SD +N A +A Q+ Sbjct: 109 MGNDPVPLVSSGEQKTAAAAHAVGAQNWNNMTSDQMKNQAESWAKGQAKAQVVDPLRQQA 168 Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 QE LGK+G A+V L VD + SL S+ + P Y+ + F+Q +HR D+R N+G Sbjct: 169 QELLGKFGKAQVNLAVDDNGSLSKSAFSLFSPWYENDAMVAFSQVGVHRQDNRMIGNLGA 228 Query: 169 GWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 G R G+ W+ G NTF+D D+SR+H+R+G+G E+W D LKL++N Y SGWK S D + Sbjct: 229 GVRFDQGD-WLFGANTFLDQDISRNHSRLGLGLEWWADNLKLASNYYHPLSGWKDSKDFD 287 Query: 229 DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYT 288 DY ERPA G+D+ A+GYLPA+ QLGAS +YEQYYGDEV LFGKD QKDPHA++ V YT Sbjct: 288 DYLERPARGFDVHAQGYLPAYQQLGASAVYEQYYGDEVALFGKDNLQKDPHAVTVGVDYT 347 Query: 289 PVPL 292 P PL Sbjct: 348 PFPL 351 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 297 bits (760), Expect = 3e-79, Method: Composition-based stats. Identities = 113/264 (42%), Positives = 154/264 (58%), Gaps = 10/264 (3%) Query: 36 AVTFTPVMAARAQHAVQPRLSMGNTTVTA------DNNVEKNVASFAANAGTFLSSQPDS 89 + F + + S +T A + E + + G L++ S Sbjct: 67 SSAFENLHPNNEMESSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLAA---S 123 Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML 149 + M NQEI++WL ++GTA+V LN DK+FSLK+SSL+ L P YD+ + + Sbjct: 124 GRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLF 183 Query: 150 FTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 F+Q I D R N+G G R N W+ G+NTF D+DL+ + RIG+GAE W DYL+ Sbjct: 184 FSQLGIRNKDSRNTLNLGVGIRTL-ENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQ 242 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 L+ANGY R +GW S D DY+ERPA G D+RA YLPA PQLG LMYEQY G+ V LF Sbjct: 243 LAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALF 302 Query: 270 GKDKRQKDPHAISAEVTYTPVPLT 293 GKD Q++P+A++A + YTPVPL Sbjct: 303 GKDNLQRNPYAVTAGINYTPVPLL 326 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 296 bits (757), Expect = 7e-79, Method: Composition-based stats. Identities = 117/274 (42%), Positives = 160/274 (58%), Gaps = 12/274 (4%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + +A I +Q P+A++ + + A LS + DN A A Sbjct: 2 KSMAIMQILLQTALPVALSMSATVRAA-------ELSQNTHSADKDNINSPYSAQM-TQA 53 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 T LSS + A MA+ A +++WL ++GTARV+LNVD + DS+++ L Sbjct: 54 ATALSSGNAAGAGA----SMASGYAGDSVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLA 109 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ MLFTQ + DDR N G G R F ++WM G N F D D + + R+G Sbjct: 110 PLYDSQKAMLFTQLGLRAPDDRVTGNFGLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGF 169 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W + LKLSAN Y+ + W S D +DY E+PA+G+D+RAEGYLPA+PQLGA LMYE Sbjct: 170 GAEAWTNNLKLSANTYLGTTNWHSSRDFDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYE 229 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 QYYGD+V LF KD Q +P A++ V+YTPVPL Sbjct: 230 QYYGDKVALFDKDDLQSNPSAVTVGVSYTPVPLI 263 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 290 bits (743), Expect = 3e-77, Method: Composition-based stats. Identities = 110/280 (39%), Positives = 166/280 (59%), Gaps = 16/280 (5%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 + V + ++ P+ V+F P+ T + + +A Sbjct: 90 QGRVFLNGIKNIKEGDEINVPV-VSFAPIKWGEE------------ETKEQGSGNLQQIA 136 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 S A + G LS+ S + + T K N IQ W +GTA ++L VDK+FSLK+S Sbjct: 137 SIATDVGNILSNDNISK--NSALLNKITNKVNSHIQSWFENFGTAHIQLQVDKNFSLKNS 194 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 LE+L+P+++ + F+QG I DD+ SNIG G+R F N WM G N+FID+DL + Sbjct: 195 QLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDN-WMLGGNSFIDYDLRKE 253 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 H+R+G+G EYW+D LKL AN Y+R S W+ S +I DY+ERPANG D+ + +LP++PQ+G Sbjct: 254 HSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNIKSWLPSYPQIG 313 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + YE+YYGD+V LFG++ RQ++PH+ + ++YTP PL Sbjct: 314 GDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLM 353 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 289 bits (740), Expect = 6e-77, Method: Composition-based stats. Identities = 129/289 (44%), Positives = 180/289 (62%), Gaps = 12/289 (4%) Query: 16 SVLARCVAWANISVQVLFPLAVTF---TPVMAARAQHAVQP----RLSMGNTTVTADNNV 68 + R A+ + +QV+F +F P AA Q ++ +T ++ Sbjct: 2 RIYLRLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEA 61 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 KN+A+ ++ G+ LS +DA N +A +IQ+WL ++GTA+V L +DKD Sbjct: 62 GKNLAAILSSTGSMLSQDNKTDALINSAINNGSAYVTGQIQQWLQQFGTAKVNLGLDKDL 121 Query: 129 SLKDSSLEMLYPIYDTPT-NMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 SL ++SL++L P+YD N+LFTQ R DDR N+G G+R+F+ + WM G+NTF D Sbjct: 122 SLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFA-DRWMWGINTFYD 180 Query: 188 HDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 +S + H R+G+G E +Y KLSANGY R SGWK S + EDYQER ANG+DIRAEGYL Sbjct: 181 RQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDIRAEGYL 240 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 PAWPQLGA L++EQYYGD+V LF D RQ++P+A++A V YTP PL Sbjct: 241 PAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLV 289 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 289 bits (739), Expect = 9e-77, Method: Composition-based stats. Identities = 110/236 (46%), Positives = 152/236 (64%), Gaps = 4/236 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 N + + +A + L+ D + ++K+NQ+I++WL ++G Sbjct: 84 NNQDEAIPSTEGEELAKIIVDNSFLLNKDID---VTQYAISQISSKSNQKIEQWLNQFGH 140 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND 177 ARV L+ DK+ +LK+SS E+L P+Y+ ++F Q HR D R+Q N G G+R+F+ Sbjct: 141 ARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFAQTNYHRKDLRSQFNYGIGYRYFT-EK 199 Query: 178 WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 +M G+N F DHDL+ H R+G+GAE WRDY KLS+N Y R S W+ S +I DY ERPANG Sbjct: 200 FMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSSNHYHRLSSWRASNNILDYSERPANG 259 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 WDIR EGY PA+PQLG L++EQYYG EVGLFGKDKR K+PH + + YTP+PL Sbjct: 260 WDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKDKRDKNPHTYTLGINYTPIPLV 315 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 108/281 (38%), Positives = 149/281 (53%), Gaps = 16/281 (5%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F + + ++ +S+ ++F + A++ N + Sbjct: 5 FNTLTVTKIISRLILSIGLIFGIFTYGFSQQHYFNSEALENPA--------EHNEAFNKI 56 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 S + S N M ANQE++ WL ++GT +V +N DK FSLK+ Sbjct: 57 ISTGTSLA-------VSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKE 109 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 SSL+ L P YD+ + + F+Q I D R NIG G R F WM G NT D+D++ Sbjct: 110 SSLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQ-SWMYGFNTSYDNDMTG 168 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 + RIGVGAE W DYL+LSANGY R +GW +S D DY ERPA+G DI + YLPA PQL Sbjct: 169 HNHRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQL 228 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G L YEQY G+ V LFGKD Q +P+A++ + YTP+P Sbjct: 229 GGKLKYEQYRGERVALFGKDNLQSNPYAVTTGLIYTPIPFI 269 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 111/263 (42%), Positives = 162/263 (61%), Gaps = 7/263 (2%) Query: 36 AVTFTPVMAARAQHAVQPRL-SMGNTTVTADNNVEKNVASFAANAGTFLSSQ----PDSD 90 + PV+ A A+ L S+G+ + +NN E A + GTFLS + S Sbjct: 24 SSAIMPVIPAYAKMLDNKELPSLGSDQIIDENNTEHLAAEYTKTVGTFLSQKKTMKDLSQ 83 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 +++ +++A +EI+ WL K G ++ ++ DK FS+K+S + L P YD +LF Sbjct: 84 IAQDYARNKVSSEATKEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLF 143 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 TQ +HR D+R +N G G R+F + G+N FIDHDLS +HTR+G+G EYW+DYLKL Sbjct: 144 TQHTLHRYDERFHTNNGIGLRYFHEKSTI-GMNAFIDHDLSHAHTRVGLGVEYWQDYLKL 202 Query: 211 SANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 +AN Y + WK + ++ D+ +PA+GWDI+ EG+LP +P LG +L YEQYYGD V LF Sbjct: 203 NANSYFGLTSWKSASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALF 262 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 GK KRQK+P+A + +TP PL Sbjct: 263 GKTKRQKNPNAATIGANWTPFPL 285 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 106/238 (44%), Positives = 136/238 (57%), Gaps = 2/238 (0%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 VAS + G LSS+ A GM + + ++EWLG G Sbjct: 109 PNQEEEQQATQQASMVASHLSQVGNSLSSEDRVGAFSRLAKGMLLSSTAKTVEEWLGHIG 168 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A+VKL D S +++ P+YD P + F+Q R D R NIG G RH+ + Sbjct: 169 QAQVKLQADDKNDFSGSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGLGQRHYVSD 228 Query: 177 DWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 WM G N F D +S + H R+G G E RDY+KLSAN Y R GWK S +EDY ER A Sbjct: 229 -WMFGYNIFFDQQISGNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLEDYDERAA 287 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 NG+DIR E YLP +PQLG LMYEQY+GDEV LFG ++RQK+P A++A V+YTP+PL Sbjct: 288 NGYDIRTEAYLPHYPQLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYTPIPLV 345 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 117/288 (40%), Positives = 157/288 (54%), Gaps = 19/288 (6%) Query: 22 VAWANISVQVLFPL-----------AVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK 70 + A Q++ PL + P++AA +L+ + VT N + Sbjct: 101 MMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD 160 Query: 71 NV----ASFAANAGTFLSSQP-DSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVD 125 A AA+ G+ L S+ + D ++ G+A +A+ ++Q WL YGTA V L Sbjct: 161 KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSG 220 Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 +F SSL+ L P YD+ + F Q D R +N+G G R F + M G N F Sbjct: 221 NNFD--GSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPEN-MLGYNVF 277 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 ID D S +TR+G+G EYWRDY K S NGY R SGW +S + +DY ERPANG+DIR GY Sbjct: 278 IDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGY 337 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LP++P LGA LMYEQYYGD V LF DK Q +P A + V YTP+PL Sbjct: 338 LPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLV 385 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 286 bits (732), Expect = 6e-76, Method: Composition-based stats. Identities = 114/269 (42%), Positives = 156/269 (57%), Gaps = 6/269 (2%) Query: 25 ANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLS 84 AN V P+ + RA + G+ T D ++ G+ L+ Sbjct: 102 ANAGELVDSPINDAIAININ-RASQNNKNNAGAGSLTKEQDPMDSLSI----RGVGSALA 156 Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDT 144 + DA + MAT+ N +I +WL +YGTAR++LN D+DFSL +S+L+ L P+YD+ Sbjct: 157 ASGRVDALHHMARTMATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLLPLYDS 216 Query: 145 PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYW 204 T LFTQ D R +NIG G R F ++WM G N F D+D + + R+G+GAE W Sbjct: 217 QTLTLFTQQGFRNKDRRNIANIGIGTR-FIHHEWMMGGNAFYDNDFTGDNKRVGLGAELW 275 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD 264 D +LSANGY R + W +S D DY ERPANG D+RA G+LPA P LG SL+YE Y+GD Sbjct: 276 TDSFQLSANGYFRLTAWHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYEHYFGD 335 Query: 265 EVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V LFGKD Q++P+AI+ +YTP L Sbjct: 336 NVALFGKDHLQRNPYAITLGGSYTPFSLL 364 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 284 bits (727), Expect = 2e-75, Method: Composition-based stats. Identities = 91/212 (42%), Positives = 133/212 (62%), Gaps = 2/212 (0%) Query: 83 LSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIY 142 L ++ +++I ++ Q+WL ++GTA++ LNVD L +SS+++L P Y Sbjct: 122 LLNKDPKKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFY 181 Query: 143 DTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGA 201 D + ++++Q D R N+G G R F + WM G NTF D+DL+ +++R +G Sbjct: 182 DDKDHWLIYSQYGYRHKDSRDTVNLGIGTRLFIND-WMYGANTFYDNDLTGNNSRFSLGG 240 Query: 202 EYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 E W +YLK+SAN Y R S W S D+ +Y ERPANG+D+ A+ YLPA P LGA + YEQY Sbjct: 241 ELWTNYLKMSANAYFRLSDWHNSRDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQY 300 Query: 262 YGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +GD V LFG + RQKDP+A + V YTP+PL Sbjct: 301 FGDNVALFGTNNRQKDPYAATIGVNYTPIPLI 332 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 284 bits (726), Expect = 3e-75, Method: Composition-based stats. Identities = 100/265 (37%), Positives = 140/265 (52%), Gaps = 9/265 (3%) Query: 37 VTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASF--------AANAGTFLSSQPD 88 F Q+ P L N K++ A L+ + Sbjct: 16 AAFAAPEINVKQNESLPDLGSQAAQQDEQTNKGKSLKERGADYVINSATQGFENLTPEAL 75 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 R+++ T+ A I++ L YG R L++ + L SS++ P YD T + Sbjct: 76 KSQARSYLQSQITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSIDYFVPWYDNQTTV 135 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 F+Q + R +DRT NIG G R+ + + ++ G N F D+D +R H R+G+GAE W DYL Sbjct: 136 YFSQFSAQRKEDRTIGNIGLGVRY-NFDKYLLGGNIFYDYDFTRGHRRLGLGAEAWTDYL 194 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 K S N Y S WK S D + Y+ERPA GWDIRAE +LPA+PQLG +++EQYYG+EV L Sbjct: 195 KFSGNYYHPLSDWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKIVFEQYYGNEVAL 254 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 FG D +KDP A++ V Y PVPL Sbjct: 255 FGTDSLEKDPFAVTLGVKYQPVPLI 279 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 283 bits (724), Expect = 6e-75, Method: Composition-based stats. Identities = 109/289 (37%), Positives = 153/289 (52%), Gaps = 17/289 (5%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTA 64 ++ + + ++ P + T Sbjct: 85 PLSKLYKLNQFRSFHKSFYDLSGGDEIDIPAS---------------NNYSFENRPLDTK 129 Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 +N E A+ A S S + MA++ AN IQ+WL ++GT +L+ Sbjct: 130 VDNNENYSANKTKAAVNV-SESNKSPEALGVASSMASSAANNAIQKWLSQWGTVESQLSF 188 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D SLK+SSL+ L PIYDT N F Q D R N+G+G RH N WM G+N Sbjct: 189 DSKASLKNSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLGWGVRHVY-NGWMYGLNN 247 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F D+D++ ++ R+G+G E DYL +++N Y+R + W +S D DY ERPANG+D+R G Sbjct: 248 FFDYDITGNNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDFYDYDERPANGFDMRVNG 307 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LPA+PQ+G L+YEQYYGDEVGLFGKD RQKDP AI+A V++TP PL Sbjct: 308 WLPAYPQIGGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSWTPFPLL 356 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 282 bits (721), Expect = 1e-74, Method: Composition-based stats. Identities = 104/238 (43%), Positives = 139/238 (58%), Gaps = 2/238 (0%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 VAS + G+ LSS+ +A G+ + + ++EWLG G Sbjct: 70 PNREEEQKATQQASLVASHLSQIGSTLSSESRVEAFSRLAKGVLLSSTAKSVEEWLGHIG 129 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A+VKL VD S L + P+Y+ P + F+Q R D R NIG G RH+ + Sbjct: 130 KAQVKLQVDDKNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYLSD 189 Query: 177 DWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 WM G N F+D +S + H R+G+G E RDY+KLSAN Y R GWK S +EDY ER A Sbjct: 190 -WMLGYNVFLDQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERAA 248 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +G+DIR E YLP +PQLG LMYEQY+G+EV LFG ++RQK+P A++A V+YTP PL Sbjct: 249 SGYDIRTEAYLPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPLV 306 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 280 bits (716), Expect = 4e-74, Method: Composition-based stats. Identities = 100/266 (37%), Positives = 138/266 (51%), Gaps = 1/266 (0%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 ++ + P + +D + L+ Sbjct: 1 MENEIGGTLINKPGHDMPKLPDMAIMAETSGAKPISDQQFADWGKNLGGQDWNTLNRDKA 60 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 T + + Q+ Q+ LG++G A+V L++D +L S+ + P YD+ + Sbjct: 61 QSKTTQWAKEKIISPLQQQAQDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYL 120 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 LF+Q IH D+R N G G R + + + G N FIDHD SR H R G+GAE DY Sbjct: 121 LFSQINIHHQDNRKIGNFGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADY 180 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 LK SAN Y S WK SPD +DY ERPA G+D+R++GYLPA+PQLG S +YE Y+GDEV Sbjct: 181 LKFSANYYHPLSHWKDSPDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVA 240 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LFGK RQKDP A++ + YTPVPL Sbjct: 241 LFGKSHRQKDPRALTLGIDYTPVPLV 266 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 279 bits (715), Expect = 6e-74, Method: Composition-based stats. Identities = 95/237 (40%), Positives = 143/237 (60%), Gaps = 3/237 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 ++ ++ + + S L+S P +++I ++ Q+WL ++GT Sbjct: 91 EDSHKDGNHPLPPLILSHGTKILGLLNSDPK-KLAQDYIVNKLNSQITSNTQKWLSQFGT 149 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A++ LNVD L +SS+++L P YD + ++++Q D R N+G G R F N Sbjct: 150 AKINLNVDHRGRLDESSVDLLVPFYDDKDHWLVYSQYGYRHKDSRDTVNLGIGTRLFINN 209 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 WM G NTF D+DL+ +++R +G E W +YLK+SAN Y R S W + D+ +Y ERPAN Sbjct: 210 -WMYGANTFYDNDLTGNNSRFSLGGELWTNYLKMSANAYFRLSDWHNARDLVNYYERPAN 268 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G+D+ A+ YLP+ P LGA + YEQY+GD V LFGK+KRQKDP+A + V YTP+PL Sbjct: 269 GYDLIADMYLPSMPSLGAKIKYEQYFGDNVALFGKNKRQKDPYAATIGVNYTPIPLI 325 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 278 bits (710), Expect = 2e-73, Method: Composition-based stats. Identities = 115/303 (37%), Positives = 168/303 (55%), Gaps = 23/303 (7%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 + + +P+ S+L W+ + P++ + AQ L Sbjct: 1 MNPPSSKLKPKLPNSLLLSTAIWSTAIL-----------PMVPSYAQIVHLDDLPTLGGQ 49 Query: 62 VTA------DNNVEKNVASFAANAGTFLSSQ----PDSDATRNFITGMATAKANQEIQEW 111 +++ E+ +A + NA F S + +D +++ A A EI W Sbjct: 50 AIQFEGTQPEDSTERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEITHW 109 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L K G AR+ +N+DK S+K S L+ L P Y+ +LF+Q +IHRTD R Q+N G G R Sbjct: 110 LSKAGNARLNINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLR 169 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDY 230 HF N M GVN F DHDLS H+R+G G EY +DY+++SAN Y+ S W+ + ++ +DY Sbjct: 170 HFQQNS-MIGVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDY 228 Query: 231 QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPV 290 RPANGWDI+ EG+LP + LGA+L EQYYGD+V LFGK++RQKDP A + V ++P Sbjct: 229 NARPANGWDIQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPF 288 Query: 291 PLT 293 PL Sbjct: 289 PLL 291 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 273 bits (697), Expect = 7e-72, Method: Composition-based stats. Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 11/242 (4%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 + + EK A L++ ++A N+ NQ+I +WL +YG Sbjct: 98 KETSQAKQVESAEKQFVQGATQIAQGLANNNATEAAINYARNRGEGLLNQKISDWLNQYG 157 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 ARV+++ +K ++L P+ D P ++LF+Q I + R+ +N+G G+R + N Sbjct: 158 KARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQIGIRANEQRSTTNLGLGYRQYQQN 212 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS--PDIEDYQERP 234 WM G+N+F D+D+S + R G+G E W YLKL+ NGY R + W +S ++ DY ERP Sbjct: 213 -WMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVNGYFRLTDWHQSFLHEMRDYDERP 271 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK---DKRQKDPHAISAEVTYTPVP 291 ANG+D+RAEGYLP++P LGA YEQY+GD V L + +P A++ ++YTP P Sbjct: 272 ANGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHNPTAKDLKDNPSAVTFGLSYTPFP 331 Query: 292 LT 293 L Sbjct: 332 LL 333 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 91/261 (34%), Positives = 136/261 (52%), Gaps = 16/261 (6%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRN 94 T + A A L+ V V+ V + R Sbjct: 84 ATAGQTIWIPAAKPAATTLPLAPATVQVAKPGKVDGKV-------------DDKTTNVRQ 130 Query: 95 FITGMATAKANQEIQEWLGKYG-TARVKLNVDKDFSLKDSSLEMLYPIYDT-PTNMLFTQ 152 F A+++ + WL +G ++RV ++ ++F+ + + ++L P++++ M+F+Q Sbjct: 131 FGQDQLNTLASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMIFSQ 190 Query: 153 GAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSA 212 + DDRT NIG G R+F G WM G N F D+D S S+ RIG+GAE D L+L+A Sbjct: 191 LGVRHADDRTTGNIGLGARYF-GEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRLAA 249 Query: 213 NGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKD 272 NGY + +GW S I D+ ERPANGWDI +LP +PQLG + YEQYYGD V L + Sbjct: 250 NGYFKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALISRG 309 Query: 273 KRQKDPHAISAEVTYTPVPLT 293 + Q +P A + V +TP+PL Sbjct: 310 RLQHNPSAATLGVNWTPIPLV 330 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 268 bits (686), Expect = 1e-70, Method: Composition-based stats. Identities = 110/272 (40%), Positives = 152/272 (55%), Gaps = 18/272 (6%) Query: 33 FPLAVTFTPVMAARAQH---AVQPRLSMGNTTVTADNN---VEKNVASFAANAGTFLSSQ 86 PL + TP+ A P L + +N E NVAS A + + Sbjct: 89 APLNGSTTPLFAPEETSKSITELPDLGSIQNDIDVNNKLPVTEDNVASAATQLWGIMGND 148 Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPT 146 S A + +TG+A A+Q +WLG+YG ARV+LN S + ++L P+ +T Sbjct: 149 NSSRAAESAVTGVAAGLASQAAADWLGQYGNARVQLN-----SNSIGNADVLIPLTETQN 203 Query: 147 NMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRD 206 N+LF Q + +RT +N+G G R F+ + WM GVNTF D+DL+ ++R+GVG E W D Sbjct: 204 NLLFGQLGVRYNGERTTNNVGLGVRSFT-DSWMFGVNTFYDYDLTGKNSRLGVGGEAWTD 262 Query: 207 YLKLSANGYIRASGWKKS--PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD 264 LK SANGY R + W +S D+EDY ERPANG+D+RAE YLP++PQLG LMYE+Y+G Sbjct: 263 NLKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYLPSYPQLGGRLMYEKYFGK 322 Query: 265 EVGLFGK----DKRQKDPHAISAEVTYTPVPL 292 V L D P A + + YTP+PL Sbjct: 323 GVALNSGSTSPDDLGDSPSAFTVGLNYTPIPL 354 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 263 bits (673), Expect = 5e-69, Method: Composition-based stats. Identities = 89/291 (30%), Positives = 144/291 (49%), Gaps = 16/291 (5%) Query: 12 RFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN 71 + + LAR +AW + Q+L P A+ A+A R ++ D + Sbjct: 2 KLTPTPLARWLAWVLVGTQLLTPAAL-------AQAMLPEITRSGADSSVDKTDQPEAEW 54 Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKY---GTARVKLNVDKDF 128 +AS A++ G+ L SD +N I + AN I + + R + ++ Sbjct: 55 LASRASSLGSLLQEGNISDFAKNQIQALPQTIANDGITSGIKHWLPEAQFRGGITLEDAS 114 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVN 183 + + ++L P+Y + +++LF Q + D+ R N G GWR G+ W+ G+N Sbjct: 115 KYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVNTGIGWRQDVGD-WLLGIN 173 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 +F+D D+ H R +G E +RD + L+ N Y S WK S + ERPA G D+R + Sbjct: 174 SFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKVQPLHDERPATGIDVRLK 233 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 G LP+ P GA L +EQY+GD+V + G D +DP A + +T+ PVPL + Sbjct: 234 GALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAITWKPVPLVE 284 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 258 bits (660), Expect = 1e-67, Method: Composition-based stats. Identities = 90/279 (32%), Positives = 133/279 (47%), Gaps = 10/279 (3%) Query: 19 ARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAAN 78 A +AW ++ P P + Q +G D EK A+ A Sbjct: 25 AMGLAWLCGAL----PAYAESPPAPDSVVQQPANDLPELGGNASN-DAEREKEWATMAKQ 79 Query: 79 AGTFLSSQPDSDA----TRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 G + S ++ G A++ Q+ QE L G A++ L + SS Sbjct: 80 LGERNLNNVSSQQVRTRAESYAVGQASSVLQQQAQELLSPLGNAKLSLVMSDQGDFSGSS 139 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 ++ P+YD + ++Q + + + + N G G R +G+ W+ G NT +D D R H Sbjct: 140 GQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAGD-WLLGYNTVLDSDFERHH 198 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R +GAE W D+L+ SAN Y S + D + RPA+G+DI +GYLP + Q+G Sbjct: 199 NRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDITTQGYLPFYRQIGG 258 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 SL YEQY+G+ V LFG K+Q DP A+ V YTPVPL Sbjct: 259 SLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLV 297 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 257 bits (657), Expect = 3e-67, Method: Composition-based stats. Identities = 93/236 (39%), Positives = 135/236 (57%), Gaps = 11/236 (4%) Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +NN E+ AS G LSS D + N+ + NQ+I +WL +YG AR+ Sbjct: 100 PEENNNEEKFASSFTLMGDILSSDNFVDNSINYAKSIGQGLVNQQINDWLNQYGKARISF 159 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 + D K+ S + L P+ D P N+LFTQ + DR N+G G+R + N WM G+ Sbjct: 160 SSD-----KNISGDFLLPVIDEPNNLLFTQLGLRNNTDRNTINLGLGYRKYWRN-WMFGI 213 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP--DIEDYQERPANGWDI 240 NTF D+D + + R+GVG E W DYLKL+ NGY + W +S ++DY ERPA G+D+ Sbjct: 214 NTFYDYDYTGGNARLGVGGEAWIDYLKLAINGYFGLTDWHQSKISVMDDYDERPATGFDV 273 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGL---FGKDKRQKDPHAISAEVTYTPVPLT 293 RAE YLP +PQLG+S+ YE+Y+G + L + + D ++ + YTP+PL Sbjct: 274 RAEAYLPKYPQLGSSIKYEKYFGKGIHLGTGVNPEYLKDDAQSLIMGLNYTPIPLL 329 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 253 bits (645), Expect = 8e-66, Method: Composition-based stats. Identities = 93/222 (41%), Positives = 127/222 (57%), Gaps = 2/222 (0%) Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 + + S+ + R + +AN LG GTAR ++ +D DF++ + Sbjct: 166 EQVPTSASRYGSEQEVQYWRQQLATQFEEEANAYAASLLGAMGTARTRVTLDDDFNMVTA 225 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 ++L P+ + +LFTQ + R DRT +N+G G RHF + WM G N F D+DL+ Sbjct: 226 EADLLLPLAEEQQTLLFTQFGLRRNGQDRTIANLGVGQRHFL-DRWMLGYNLFADYDLTN 284 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 H R GVGAE WRDYLKL AN Y S W+ SP E +ER A G D+R E YLPA+PQ Sbjct: 285 RHWRAGVGAEAWRDYLKLGANFYTPLSSWRDSPRFEGMEERAARGMDVRLEAYLPAYPQW 344 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 ASL EQY G+ VGL D+ ++DPHAI+A + Y P PL + Sbjct: 345 SASLTAEQYLGERVGLLDADQLERDPHAITAGLHYNPFPLLK 386 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 248 bits (634), Expect = 1e-64, Method: Composition-based stats. Identities = 105/247 (42%), Positives = 150/247 (60%), Gaps = 11/247 (4%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + R + V W+ I+ Q+L P+ T P ++ + + A++ Sbjct: 2 TMVNKKLR-RKASCAVTWSVIATQILSPVTFTLIPA------NSFASSANTESAQTNAND 54 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 +AS AANAG L++ F +A+A +E+ +WL +YG AR+KLNVD+ Sbjct: 55 EYANELASLAANAGQSLANN----TAGRFAVDTLSAQATKEVVDWLQQYGNARIKLNVDE 110 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 F+LKD++ + LYP D+ +LF+Q ++HRTDDR Q+NIG G RHF+ ++ M G N F Sbjct: 111 SFTLKDAAFDFLYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFY 170 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 D+DLSR H+R G+G EYWRDY++ AN Y S WK S DI+DY ERPANGWD+ AEG+L Sbjct: 171 DYDLSRHHSRAGLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWL 230 Query: 247 PAWPQLG 253 P +PQLG Sbjct: 231 PVYPQLG 237 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 248 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 79/203 (38%), Positives = 114/203 (56%), Gaps = 2/203 (0%) Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 N Q ++ + +G + L+VD S +L P Y +++LF Sbjct: 176 YIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSHVLF 235 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 +Q ++++RT +IG G R + ++ G N FID+DL + H R+ +G E +Y K Sbjct: 236 SQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASNYFK 294 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 LS N Y + W+ S D+EDY ERPA G+DIR +GYLP +PQLG + YEQY+G EV LF Sbjct: 295 LSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEVALF 354 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 K KRQK+P A+S + Y P PL Sbjct: 355 NKTKRQKNPKAVSIGIDYRPFPL 377 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 244 bits (624), Expect = 2e-63, Method: Composition-based stats. Identities = 87/222 (39%), Positives = 126/222 (56%), Gaps = 6/222 (2%) Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS--LKD 132 + +A + L S P D + G + +Q I+ WL +YG AR+ LN D S L Sbjct: 18 YTKSAASLLKSGPAFD---QYAAGKISQLTSQAIEGWLKQYGNARITLNAQSDNSTALAG 74 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SS ++L+ +++ + + + Q H D + N+G G R+F N M G N F D +++ Sbjct: 75 SSADLLFGLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNIN 134 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQ 251 +R GVG E WRDY K S NGY S W+ S +EDY E+ A+G+D++ E YLP + Q Sbjct: 135 SGVSRSGVGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQ 194 Query: 252 LGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LG L YEQY+GD V LF + Q DP AI+ ++YTP+PL Sbjct: 195 LGGHLKYEQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPLI 236 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 244 bits (623), Expect = 2e-63, Method: Composition-based stats. Identities = 85/246 (34%), Positives = 122/246 (49%), Gaps = 10/246 (4%) Query: 49 HAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEI 108 + + K N SD +++ I M A E Sbjct: 54 QHQTDDDATQGGDIPKSAMSGKRWLQHQTNDDVM----QGSDISKSGIADMGFAALQPET 109 Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 ++ G R L + D L S+++ YP+YD + + F Q R D R N+G Sbjct: 110 EK---SAGEVRANLPL-SDGKLTSGSIDLFYPLYDGDSRLFFGQVGARRFDGRNIVNLGI 165 Query: 169 GWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 G R+F G+ W G NTF D +S + H R+G G EYWRDYL LSANGY + W S + Sbjct: 166 GQRYFQGD-WALGYNTFYDIQISGNAHQRLGFGLEYWRDYLYLSANGYFGLTDWYSSSAL 224 Query: 228 EDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTY 287 + Y ER ANG+DIRA+G+ P +PQL L +EQY+GD++ L R K+P+A++ + Y Sbjct: 225 DGYAERAANGYDIRAQGWFPVYPQLSGKLKFEQYFGDDIALLNHQNRYKNPYALTMGLEY 284 Query: 288 TPVPLT 293 TP+ L Sbjct: 285 TPIQLI 290 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 240 bits (613), Expect = 4e-62, Method: Composition-based stats. Identities = 85/290 (29%), Positives = 134/290 (46%), Gaps = 34/290 (11%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F + + + WA ++ Q+ P+ +DN ++ + Sbjct: 3 FSKKPITKYITWAIVTSQIPLPVI-------------------------ADSDNEIQSWI 37 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDFS 129 A A++ L D + I + AN + E + R +N++ Sbjct: 38 AGTASSISPHLQEGTLEDYAKGKIKALPGQAANHLVNEGIKSAFPEIIFRGGVNLEDGAK 97 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVNT 184 + S +M P+ +T +++LF Q D+ RT N+G G+R N W+ GVNT Sbjct: 98 YRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGRTYVNVGMGYRQEV-NGWLLGVNT 156 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+D D+ SH R G+G E ++D L S N Y +GWK S E + ERPA G+D+R +G Sbjct: 157 FLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSAAHELHDERPAYGFDLRTKG 216 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 LP +P L YEQYYGD+V L G ++P A A++ + PVPL + Sbjct: 217 TLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGADLVWNPVPLLE 266 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 238 bits (608), Expect = 1e-61, Method: Composition-based stats. Identities = 73/239 (30%), Positives = 109/239 (45%), Gaps = 10/239 (4%) Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSD---ATRNFI----TGMATAKANQEIQEWLGK 114 + + EK A G + D + F +A+ NQ ++ WL Sbjct: 2 MPESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLSP 61 Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 +G A V + VD S P D + ++Q + R +D SN+G G R + Sbjct: 62 WGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRW-A 120 Query: 175 GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERP 234 + W+ G NTF D+ L R G+GAE W +YL+LSAN Y + W ++R Sbjct: 121 RDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWH--ERSATQEQRM 178 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A G+D+ A+ +P + L + EQY+GD V LF K +P A+S + YTPVPL Sbjct: 179 ARGYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLV 237 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 75/286 (26%), Positives = 120/286 (41%), Gaps = 10/286 (3%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 + R + + + + T A +++ EK+ A Sbjct: 2 SRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAE 61 Query: 75 FAANAGTFLSSQPDSDATRNF-------ITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + G + D + + + NQ ++ WL +G A V + VD + Sbjct: 62 IVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNE 121 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 S P+ D + ++Q + + D+ SN+G G R GN W+ G NTF D Sbjct: 122 GHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLVGYNTFYD 180 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 + L + R G GAE W +YL+LSAN Y + W + ++R A G+D+ A +P Sbjct: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMP 238 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + L S+ EQY+GD V LF +P A+S + YTPVPL Sbjct: 239 FYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 229 bits (585), Expect = 7e-59, Method: Composition-based stats. Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 10/257 (3%) Query: 44 AARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD---SDATRNFI---- 96 A+ +N EK+ A G + R+F Sbjct: 3 QAQNPFDENGDNLPDLGLAPENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTRL 62 Query: 97 TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIH 156 ++ E + L +G A V L VD++ + SS + P D + ++Q + Sbjct: 63 RDAVSSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGVS 122 Query: 157 RTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYI 216 + + N G G R +G+ W+ G NTF D +R G GAE W DYL+LSAN Y Sbjct: 123 QQNQGLVGNAGIGQRWTAGH-WLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQ 181 Query: 217 RASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQK 276 GW+ + ++R A G+D+ A+ YLP + + S+ +EQY+GD+V LF Sbjct: 182 PLGGWEHRAGLL--EQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYH 239 Query: 277 DPHAISAEVTYTPVPLT 293 +P A+ ++YTPVPL Sbjct: 240 NPVAVKVGLSYTPVPLV 256 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 70/302 (23%), Positives = 122/302 (40%), Gaps = 12/302 (3%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTF-TPVMAA-RAQHAVQPRLSMGN 59 +H ++ +R A +++Q P+A P +A + A ++ Sbjct: 12 AHLPARGRRHWYRRHRAGAAGMSAVLAMQAAAPVAYGQGAPTFSATQVADAASNAVAQPG 71 Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA- 118 T + +A G + D F+ A A+AN +Q+ + Sbjct: 72 AVETRVAQTIQALAQAREAGGARQDGRASLDG--QFLRSQAQAQANVLVQQGVQWANETG 129 Query: 119 -----RVKLNVDKDFSLKDSSLEM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 R++ NV DFS +D ++++ + ++ L Q H + R N G R Sbjct: 130 LPWLRRLEGNVSYDFSGRDVAVDVRTIDALHLDQDRALLLQLGGHNQNHRPTVNAGVVAR 189 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 +G+ + G N F+D+++ + H R +GAE L N Y SGWK + E + Sbjct: 190 SAAGSSLILGGNAFLDYEVGKRHLRGSLGAEAVAAQFTLYGNVYAPLSGWKAAKRAERRE 249 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA GWD+ A L + Y ++ G +V F + +++P + Y PVP Sbjct: 250 ERPAAGWDVGFTARPEAVQGLALNAQYFRWRGAQVDYFDDGRYRRNPSGFKYGIEYRPVP 309 Query: 292 LT 293 L Sbjct: 310 LI 311 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 220 bits (560), Expect = 5e-56, Method: Composition-based stats. Identities = 82/259 (31%), Positives = 118/259 (45%), Gaps = 11/259 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFI----- 96 A A+ A + TV ++ + K +A A + G + + R Sbjct: 80 PFADPARFAKMQQQLPELGTVHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVF 139 Query: 97 ---TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG 153 A +A E ++ L YG A V L + D S SS +++ P D + + F+Q Sbjct: 140 NRFRDAAKQRAASEGEQLLSPYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQL 199 Query: 154 AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSAN 213 I +++ + N G G R +G W G N F+D L R +GAE W YL+ SAN Sbjct: 200 GIEQSEYGSVGNAGLGQRWIAG-SWRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSAN 258 Query: 214 GYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDK 273 Y SG + + R A G+DI GYLP + QLG +L YEQY G+ V LF Sbjct: 259 YYQPLSGCRNHSNSA--LMRMARGYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGN 316 Query: 274 RQKDPHAISAEVTYTPVPL 292 +P A+S + YTPVPL Sbjct: 317 AVANPAAVSLGINYTPVPL 335 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 56/268 (20%), Positives = 89/268 (33%), Gaps = 9/268 (3%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQP-----DS 89 LA P+ A + D + +A+ A + + + D Sbjct: 54 LAQALLPLSALAQGAPTLRPARVAQEEAGQDAAWTRKLAAQAESLARRQAERQPGARVDG 113 Query: 90 DATRNFITGMATAKANQEI----QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP 145 D + + + L + L+ D + L + +Y Sbjct: 114 DYLKREAQAQVNDVLRDGVNLARESGLPFLRNLQGGLSHDFESGRTSLQLNTIDEVYRAG 173 Query: 146 TNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 N Q H +DR +N G +R + M G N F+D++ + H R VG E Sbjct: 174 RNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLEVIA 233 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L N Y S WK + +E+PA+G D+ P L S + ++ G E Sbjct: 234 PEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWNGAE 293 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V F + Q V Y PV L Sbjct: 294 VDYFDNGRTQAGAKGFKVGVEYRPVSLV 321 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 209 bits (533), Expect = 7e-53, Method: Composition-based stats. Identities = 73/276 (26%), Positives = 118/276 (42%), Gaps = 15/276 (5%) Query: 26 NISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSS 85 +++Q + P P +AR A R ++ + + +A AG+ S+ Sbjct: 33 VLTLQTVAPAFAQGAPSFSAR--PAQADRQDAADSAMLRVAQTARQLAQR-QAAGSRASA 89 Query: 86 QPDSDATRNFITGMATAKANQEIQEWLGKYGTA------RVKLNVDKDFSLKDSSLEM-- 137 + D D + G A A+AN+ +QE + R++ V+ DFS KD SL++ Sbjct: 90 RVDGD----LLKGQAEAQANELLQEGVRLANQTELPFLRRLQGGVNYDFSNKDLSLDLRT 145 Query: 138 LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRI 197 + ++ + + Q + H + R N G RH G N F+D++ ++H R Sbjct: 146 IDEVHRGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEFGKNHLRG 205 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLM 257 +G E L N Y SGWK + E +ERPA+GWD+ A P L Sbjct: 206 SLGGEVIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALPGLAIKGQ 265 Query: 258 YEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 Y ++ G V F + Q++ V Y PVPL Sbjct: 266 YFRWSGAAVDYFDNGRPQRNARGYKYGVEYRPVPLV 301 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 206 bits (525), Expect = 6e-52, Method: Composition-based stats. Identities = 69/302 (22%), Positives = 109/302 (36%), Gaps = 18/302 (5%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 +SH K + R R A ++ +Q PLAV A+ + R G+ Sbjct: 25 VSHAKGSGRNRRRRAQRAASSAVCLSLGMQAAAPLAVL------AQGAPEMTNRPEAGDI 78 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDAT-RNFITGMATAKANQEIQEW-------- 111 + V VA A + + + + +++ A+ NQ +QE Sbjct: 79 VPSD---VLTQVAVRAQDLARRQADRREGAQVDADYLKQQGQAQFNQFLQEGVRAANESG 135 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L + L D D L + +Y N Q H ++R +N+G +R Sbjct: 136 LRFLRNLQGDLRHDFDNGRTSLELRTIDQVYRKGANTGLLQLGGHNQNNRPTANLGGVYR 195 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 M G N F+D++ ++ H R +G E N Y SGW + + Sbjct: 196 RDINERLMLGANAFLDYEFAKQHLRGSLGVEAIAPEFSFYGNVYAPMSGWTGAKRDNRRE 255 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA+G D+ + P L Y ++ G V F + Q V Y PVP Sbjct: 256 ERPASGMDLGMKYSPGFAPGLSLKANYFRWNGAAVDYFDNGRTQDRATGFKYGVQYKPVP 315 Query: 292 LT 293 L Sbjct: 316 LL 317 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 76/268 (28%), Positives = 112/268 (41%), Gaps = 22/268 (8%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 N D V AG S+ S + + MA++ N Sbjct: 281 KTQKALNDNKKDNNLSKEDQEFSNKVMKVIQTAGAIYDSED-SKSKEEIVKNMASSYLNT 339 Query: 107 EIQEWLGKY-GTARVKLNVDKDFSLK-----DSSLEMLYPIY--DTPTNMLFTQGAIHRT 158 E ++ + +N D F+ + + L PI D P F Q I Sbjct: 340 SANELAKEFIDSLNTSINTDFSFNYNERSGFSGNAKALLPIVSEDNPKISYFLQSGIGEF 399 Query: 159 -DDRTQSNIGFGWRHFSG-------NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 +DRT + G G R++ + M G+N+ DHD SR H R+ +GAE D L Sbjct: 400 ANDRTIGHFGGGIRYYPNATALNNSGNIMLGLNSVYDHDFSRGHKRMSLGAEAMVDTLAF 459 Query: 211 SANGYIRASGWKKSPDIE-DY-QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 +AN Y R S W S D + DY QERPANGWD + + P+ + Q+YG++VG+ Sbjct: 460 NANVYQRLSSWIDSYDFDKDYVQERPANGWDAKIKYAFPSLINVSFFAKMGQWYGNKVGI 519 Query: 269 FGK---DKRQKDPHAISAEVTYTPVPLT 293 FG D +K+P ++Y+P P Sbjct: 520 FGANSVDDLEKNPLIYEGGISYSPFPAL 547 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 57/252 (22%), Positives = 88/252 (34%), Gaps = 6/252 (2%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMAT 101 M + A+ V + +E VA N T S Sbjct: 5 SMPSPARLLTLLLCPTLLPPVAYGSAIESEVA---RNLWTRAQHPDTSPGLAQSALDAGV 61 Query: 102 AK-ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 A Q L L D D SL + + + L Q +H + Sbjct: 62 AAGLQASRQTGLPWLRHLDGGLRYDLDPGRLSFSLRTIDDLMVSERRALMLQAGLHNQNQ 121 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 R +N G R + + G N F+D++ + H R +G E + L AN Y SG Sbjct: 122 RPTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLGLEAIAPHYSLYANYYAPLSG 181 Query: 221 WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHA 280 WK + +ERPA G+D+ G L + L Y +++G + +F + Q++ Sbjct: 182 WKGARRDSRREERPAAGYDL--GGQLSSDAGLSLQAAYFRWHGAGIDVFDSGRAQRNASG 239 Query: 281 ISAEVTYTPVPL 292 V Y P L Sbjct: 240 FRYGVAYQPGAL 251 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 54/146 (36%), Positives = 79/146 (54%), Gaps = 3/146 (2%) Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 M ++Q + + D SN+G G R + + W+ G NTF D+ L + R G GAE W +Y Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEY 59 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L+LSAN Y + W+ ++R A G+DI A+ LP + + S+ EQY+GD V Sbjct: 60 LRLSANYYQPFADWQTHT--ATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVD 117 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LF +P A+ + YTPVPL Sbjct: 118 LFDSGTGYHNPVALKLGLNYTPVPLL 143 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 57/233 (24%), Positives = 95/233 (40%), Gaps = 11/233 (4%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 A V L+ + +A + + + T L + +A +N K Sbjct: 11 APLIVAVSLTQADALKSALEMQDAQDKAEIMDLSTMLLAGD-VEALKNTAIDGVVEKGVG 69 Query: 107 EIQEWLGKYGTARVKLNVD-KDFSLKDSSLEMLYPIYDTPT--NMLFTQGAIHRTDDRTQ 163 + +L +Y V+LN + S L ++ P+ D N FTQG++ D+RT Sbjct: 70 VTKSFLEQYF-PTVELNFGAQGGSKPSGGLLVVAPLSDPDDIFNTYFTQGSVFYEDNRTT 128 Query: 164 SNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 N+G G+R S N + G+N F DH+ H R +G E +++AN Y + WK Sbjct: 129 LNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGLEARTTVWEINANKYWATTKWK 188 Query: 223 KSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD---EVGLFGKD 272 + +ER +G+DI A LP + Q+ + + G D Sbjct: 189 TGKN--GLEERALDGYDIEAGVPLPYMNWATVFVKNFQWDSEISGSKDIKGND 239 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 157 bits (398), Expect = 3e-37, Method: Composition-based stats. Identities = 64/288 (22%), Positives = 105/288 (36%), Gaps = 37/288 (12%) Query: 28 SVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK--------NVASFAANA 79 Q L + + F +++ A + +N K A+++ Sbjct: 3 ISQALTSITLVFGSILSVSANEYKFEEIKFNQIPNEQNNYEPKDKLDEYIIKGANYSTKF 62 Query: 80 GTFLSSQPDSDATRNFITG------------MATAKANQEIQEWLGKYGTARVKLN--VD 125 +++ D + A AKAN EIQ+ + + V ++ + Sbjct: 63 VPLMNNGAKGDEYTGIMADDLNRLLVDAGFDFANAKANGEIQK-IPFFAQTSVNISGGTE 121 Query: 126 KDFSLKDSSLEMLYPIYDTP----TNMLFTQGAIHRTDD--RTQSNIGFGWRHFSGNDWM 179 D S +SL L + + F+Q + + NIG G R+ + M Sbjct: 122 SDTSFSINSLMKLGELAKDDQGDLKTLAFSQARFATATNAEGSTINIGLGIRNRPDDISM 181 Query: 180 AGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDYQERPA 235 G N F D+ D S +H+R+G+G EY+ + N Y+ + K DYQER Sbjct: 182 VGANAFWDYRMTDYSDAHSRLGLGGEYFWKDFEFRNNWYMAITNEKDVIIKGVDYQERVV 241 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPH 279 GWD+ LP P+L + + D GL G Q PH Sbjct: 242 PGWDLEVGYRLPNNPELAFYIRGFNWDYKYTQDNSGLEGAVSWQATPH 289 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 149 bits (377), Expect = 9e-35, Method: Composition-based stats. Identities = 55/210 (26%), Positives = 84/210 (40%), Gaps = 7/210 (3%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 P A T A + Q L + ++ EK +A+ A + Sbjct: 35 LPAAAWVTQPENDAALLSQQQALPNLGSASVNESGTEKKLATLARQMAEVNQDENTDQTW 94 Query: 93 RNF----ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN- 147 R++ + Q+ + L G V L+VD+ SS ++L P+ D T Sbjct: 95 RSYLLGEAKDRVLDRLQQKSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRG 154 Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRD 206 + ++Q + DD N+G R +G W+ G N F D L++ R +GAE D Sbjct: 155 LTYSQLGLQGVDDGVVGNMGLRQRWNAG-RWLLGYNVFYDQYLNQDASRRGSIGAEARSD 213 Query: 207 YLKLSANGYIRASGWKKSPDIEDYQERPAN 236 YL LS+N Y SG + D ED R A Sbjct: 214 YLTLSSNYYYPLSGMHAANDDEDELLRMAR 243 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 60/233 (25%), Positives = 100/233 (42%), Gaps = 23/233 (9%) Query: 70 KNVASFAANAGTFLSSQPDSD-------ATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + A++AA G + + D + + AN++I++ + + + L Sbjct: 109 QKGANYAALYGPSMVNSNGVDLGGLIQTELSRTLISSGVSYANKQIKK-IPFFAQTTLGL 167 Query: 123 N--VDKDFSLKDSSLEMLYPI-YDT---PTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSG 175 + D + S L I YD P ++F Q + T + Q N+G G R G Sbjct: 168 DAATSSDLTGYLDSFMRLKTIGYDNEGDPMGLMFGQARVTLETSAQPQVNVGLGSRFRLG 227 Query: 176 NDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-DIEDYQ 231 ++ + G+N F D + S ++TR G+GAE + +L N YI S K + DY Sbjct: 228 DEAIVGLNGFWDLRTTNYSTAYTRWGIGAEGFWKSFELRNNWYINGSADKNITINNIDYV 287 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQY----YGDEVGLFGKDKRQKDPHA 280 ER GWD+ +P++PQL + + + D G+ G Q PHA Sbjct: 288 ERVVPGWDVEVGYRIPSYPQLAIFVRGFNWDYQDHSDNSGIEGSVNWQATPHA 340 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 11/198 (5%) Query: 73 ASFAANAGTFLS-SQPDSDATRNFITGMATAKANQEIQEWL-----GKYGTARVKLNVDK 126 A + G LS S DS+ ++ + T+ A+ + + + T V N+ + Sbjct: 15 AVLTMSLGFSLSVSADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDTVEVSTNLKE 74 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTF 185 S D + +L D P + LF Q ++R D RT N+GFG+R + ++ WM GVN F Sbjct: 75 GDSTVD--IGVLKAFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAF 132 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 DH+ H R GVG E L+ N Y +G D + +G D+ + Sbjct: 133 YDHEFPNDHKRNGVGFEVVSSVLESRVNSYNGTTG--YIKDKSGTDSKVLDGRDMGFKVA 190 Query: 246 LPAWPQLGASLMYEQYYG 263 LP P + + Q+ G Sbjct: 191 LPYLPGMMFGMNAVQWKG 208 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 44/171 (25%), Positives = 71/171 (41%), Gaps = 8/171 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSD---ATRNFI-- 96 + A+AQ + + EK+ A A D D R F Sbjct: 70 TIRAQAQDPFDQNRLPDLGMMPESHEGEKHFAEMAKAFSEASMKNNDLDTGEQARQFAFG 129 Query: 97 --TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + + + NQ+++ WL +G+A V +NVD + S P+ D + ++Q Sbjct: 130 QVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSWFIPLQDKQRYLTWSQLG 189 Query: 155 IHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 + + D SN+G G R + + W+ G NTF D+ L + R G GAE W Sbjct: 190 LTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWG 239 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 51/136 (37%), Positives = 65/136 (47%), Gaps = 5/136 (3%) Query: 160 DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 R +++G G+R + GVN F D+DLSR HTR+ VGAEY DY S N Y S Sbjct: 35 GRDFAHLGLGYRQL-DDSQFFGVNVFFDYDLSRQHTRVSVGAEYGLDYGTFSTNAYFPLS 93 Query: 220 GWKKSPD----IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275 WK SPD + E+ A GWD+ E YLP + L QY G V Sbjct: 94 NWKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLDTRWKFGLTAGQYLGRYVEHSDGSLPS 153 Query: 276 KDPHAISAEVTYTPVP 291 K+P+ S + P P Sbjct: 154 KNPYHFSLSTEFRPDP 169 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 133 bits (336), Expect = 5e-30, Method: Composition-based stats. Identities = 48/156 (30%), Positives = 69/156 (44%), Gaps = 7/156 (4%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYD--TPTNMLFTQGAIHRTDD-RTQSNIGFGW 170 K+ TA + L+ + S L ++ PI D N++FTQ ++ +DD R N+GFG Sbjct: 8 KFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFGN 66 Query: 171 RHFSGND-WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIED 229 R +D + G N F DH+L H R +G E L AN Y SGWK + + Sbjct: 67 RKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKS--GLNN 124 Query: 230 YQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 E+ NG D+ LP P + G Sbjct: 125 INEKALNGSDVELGMPLPYLPWTNLYYRSFNWEGAS 160 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 133 bits (334), Expect = 7e-30, Method: Composition-based stats. Identities = 57/222 (25%), Positives = 93/222 (41%), Gaps = 23/222 (10%) Query: 80 GTFLSSQPD---SDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDFSLKDS 133 T L+++ S+ N +A+ K + + + KY A V ++ + + + Sbjct: 86 WTSLNNKNGIEWSNQISNLALNLASNKLSDYATKTIQKYPFVLGASVNFDIRTEGA-TNI 144 Query: 134 SLEMLYPIYD-------TPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTF 185 ++L+ I D + + F + + + N G G RH G + +AGVN + Sbjct: 145 GGDVLFKIADFGLKDDESRDGIAFLHTKYTGSLSNDSTWNAGLGLRHLIGEELLAGVNGY 204 Query: 186 IDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKK-SPDIEDYQERPANGWDIR 241 D+ S SH+R G+G E + L L+ N YI +G K S + DY ER GWD Sbjct: 205 WDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIAGTGTKTISTNNTDYYERVVPGWDFE 264 Query: 242 AEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPH 279 LP+ P + ++ D G GK Q PH Sbjct: 265 LGYRLPSNPNIAFFARGFRWDYRNRNDNTGFQGKVTYQMTPH 306 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 133 bits (334), Expect = 9e-30, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 81/217 (37%), Gaps = 15/217 (6%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN---QEIQEWL- 112 + A+ N G L+ +A + + +A AN ++++ + Sbjct: 14 SALPLSAQEVAKSGKFATIVKNIGNALNIGQGEEAVESEVNTLAVDAANAGLDQVEDKVL 73 Query: 113 --GKYGTARVKLNVD-----KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSN 165 + + + D K+ S + +Y + +T LF Q + ++RT N Sbjct: 74 STSNFTHFELSVGSDTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNNRTTIN 133 Query: 166 IGFGWRHFSG-NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 GFG RH + N + G N F D++L H R+G G E + AN Y S K+ Sbjct: 134 TGFGARHINDANTVITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVS---KT 190 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 QE +G+D + LP + + Sbjct: 191 LTYNGIQETALDGYDAKLTANLPYFYSSNLYGKLSNW 227 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 64/287 (22%), Positives = 102/287 (35%), Gaps = 49/287 (17%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNV----ASFAANAGTFLSSQPDSDAT----- 92 +A + R N+ + + AS+A L+S SD Sbjct: 55 AVAGALEAGQSVRCETLVDADNQSNSTVQKIFVTGASYATRIFPLLNSASLSDGIQKMLW 114 Query: 93 ---RNFITGMATAKANQEIQEWLGKYGTAR--VKLNVDKDFSLKDSSLEMLYPIYDTPTN 147 ++FI A N+ + + + V D D + +SL L + Sbjct: 115 MDSKSFIVSFAHDYLNEYVLKQIPFLSQTEFGVGFESDADMTYYLNSLISLAQLGSDDNG 174 Query: 148 ----MLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDHDLSR---SHTRIGV 199 +LF QG+ + +N+G G R ++ M G N F D+ + S++R G Sbjct: 175 YPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLRDNAMLGANAFWDYRFTNYSSSYSRWGA 234 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDI-----------------------EDYQERPAN 236 GAE W D KL+ N YI +G K+ + ER Sbjct: 235 GAELWWDDFKLTNNWYIAGTGIKRITTSGRAYTDTTSLAAGTYDETTLLGANTFDERVVP 294 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPH 279 GWD+ LP++PQL + ++ D G+ G Q PH Sbjct: 295 GWDVALNYRLPSYPQLSLGIRGFRWDYMRKSDNSGVEGSVNWQATPH 341 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 35/150 (23%), Positives = 64/150 (42%), Gaps = 6/150 (4%) Query: 140 PIY--DTPTNMLFTQGAIHRTDDRTQSNIGFGWRH-FSGNDWMAGVNTFIDHDLSRSHTR 196 P+Y + + +F +G D R + G+RH S N M G N H+ R+H R Sbjct: 68 PVYVSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQR 127 Query: 197 IGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASL 256 I GAE ++++N Y R + WK +++ +E+ G+D+ +P P + Sbjct: 128 ISYGAEIRTSVFEINSNYYHRLTDWK-LTGVDNNEEKARGGYDVELALAVPYVPSAHFRV 186 Query: 257 MYEQYYGDEVGLFGKDKRQKDPHAISAEVT 286 + + G + + D + V+ Sbjct: 187 KHFCWNG--IASNDSNNPIDDLKGNTFSVS 214 >UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN04_9BACT Length = 301 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 19/200 (9%) Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV---------DKDFSLKDSSL 135 + + ++ A + + I+ W AR L ++ + Sbjct: 22 ASKAVNQIKDSAINKAFSYGDSAIESW------ARDNLTSLRLIEIETRSREGAKPTFRA 75 Query: 136 EMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH 194 L+ I N + +Q + DD N G +R + + + G+N F DH + H Sbjct: 76 ISLFEIGGNDFNKILSQLSYSTFDDDETINAGLIYRMMNSDMTVIYGLNIFYDHQFNTGH 135 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R G+G E ++ N Y + ++ E A G+D +P P Sbjct: 136 ARTGLGFEMKSSVYDVNINFYEAQTEIHH---VDGVPEVAAGGYDAEIGAQVPYLPWAKV 192 Query: 255 SLMYEQYYGDEVGLFGKDKR 274 Q+ + + + + Sbjct: 193 YYKAYQWNNETLNIKDGETL 212 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. Identities = 49/201 (24%), Positives = 88/201 (43%), Gaps = 13/201 (6%) Query: 92 TRNFITGMATAKANQEIQEWLGKYGTARVKLNV-DKDFSLKDSSLEMLYPIYDTPTNMLF 150 + A K +++I + G V L+ D D + S+ + I T + F Sbjct: 19 ANADVASQALNKVSEKISNLIPGEGITEVSLDYNDGDEDQLNFSILGVRDIETTDNSNFF 78 Query: 151 TQGAIHRTD----DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 TQ ++ + R NIG G+R S + ++M G NTF D DL+ R+G+G E Sbjct: 79 TQFSLMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRLGLGIEAKG 138 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L L+AN Y + S S + +E+ +GWD +P P + ++ ++ Sbjct: 139 SILDLTANSYTKIS---NSEVVNGDREQVLSGWDFNLTSQIPRAPWARINYNGYKWETEK 195 Query: 266 VGLFGKDKRQKDPHAISAEVT 286 G ++ + +++ +VT Sbjct: 196 ----GSADQKGNIYSLELDVT 212 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 19/186 (10%) Query: 123 NVDKDFSLKDSSLEMLY-----PI-YDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 N +K+ S++ + P + F Q + + G G R Sbjct: 97 NNQSQIQIKNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVYNK 156 Query: 177 DWMAGVNTFIDHDLSRSHTR---IGVGAEYWRDYLKLSANGYIRASGWKKSP---DIEDY 230 + G N H +S ++ I +G EYW LK N Y + S Y Sbjct: 157 KHIIGYNACYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNHHY 216 Query: 231 QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD----EVGLFGKDKRQKDPHAISAEVT 286 + P G+ I A+ P + + +EQ D + + + + H + + Sbjct: 217 YQYPKIGYQICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVSLE 273 Query: 287 YTPVPL 292 Y P+P+ Sbjct: 274 YQPIPM 279 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 52/211 (24%), Positives = 80/211 (37%), Gaps = 11/211 (5%) Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPI--YD 143 + DA I +A + + + GK R L+ K S+E + P+ YD Sbjct: 74 SNVDAVNRAINAVAMSNVSNAMYGAKGKPWMRRTTLSFQFQEGWKPLYSVETVQPLGHYD 133 Query: 144 TPTN-MLFTQGAIHR-TDDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVG 200 + + FTQ I R +D T NIG G+R S + + G + F DH H R+ G Sbjct: 134 NSSRDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHLFYDHRFLNRHNRLSAG 193 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 EY + N Y AS + ER ANG+ + + Sbjct: 194 LEYMSGESEFRFNWYGSASDERVLDVNLHTLERVANGYTVEYGKTFKNARWARVYVEGYH 253 Query: 261 YYG----DEVGLFGKDKRQKDPHAISAEVTY 287 + D+ GL + Q P +S ++ Y Sbjct: 254 WNQERQADKNGLRVGSELQLTPR-VSVDMGY 283 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 117 bits (294), Expect = 4e-25, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 14/149 (9%) Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 L Q I ++ + G G + + GVN F D +++ + R+ +G++Y Sbjct: 124 LVWQANIDYKNEDILISNGIGI--LPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPN 181 Query: 208 --LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 LS+N Y SG D+ N DIRAEG + Q +SL E ++GD+ Sbjct: 182 YIFNLSSNIYFPLSGKGSEDDL-------VNSIDIRAEGAITPTVQFHSSL--EFFFGDD 232 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 + + + H +A + YTP+PL Q Sbjct: 233 IQINDDYDPTNNSHKFTAGLDYTPIPLLQ 261 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 117 bits (292), Expect = 6e-25, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 75/180 (41%), Gaps = 10/180 (5%) Query: 123 NVDKDFSLKDSSLEMLYPIYDT----PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 L++ S++M + Y N+ F Q IH N G G RH + + + Sbjct: 83 TYKSKMQLQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKY 142 Query: 179 MAGVNTFIDHDLSRSHTR---IGVGAEYW-RDYLKLSANGYIRASGWKKSPDIEDYQ-ER 233 G NTF +S+ ++ I VG EYW + L + N Y + + ++ Sbjct: 143 AIGYNTFYHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHY 202 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P +G + + P + + + EQ+ ++ +K+ D + +S ++ Y P+P+ Sbjct: 203 PRSGHQLYIQTKFPRFFEFTGKIKLEQFIYEKKYKKIFNKKNSD-YYLSLDLNYQPIPML 261 >UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root RepID=B0C4D7_ACAM1 Length = 3597 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 80/262 (30%), Gaps = 32/262 (12%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 F + T A ++ G T + N T + SD + Sbjct: 148 FTASPPRTLAEAGWTTAPQVVAINKGTTPSNLPAATSHRLVQAEPNVPTDTKTGEKSDTS 207 Query: 93 RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQ 152 + +A+ + + + + + F + L P + + F Sbjct: 208 NDT-----NTEADTSTNLGIPYFVDTEFRGSTRRQFGGINLRL----PFWQDDQSFAFAD 258 Query: 153 GAIHRTDDRTQ-SNIGFGWRHFSG----NDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 + T N+G +R N W+ G + F D S + + + +GAE Sbjct: 259 VHFEGGSNETFLGNLGLAYRRILNTSNENPWILGTHAFYDSKRSENGFQYHQGSLGAELV 318 Query: 205 RDYLKLSANGYIRAS-----GWKKSPDIEDYQERPANGW-------DIRAEGYLPAWPQL 252 + NGY+ S G + + Q R ANG + E A Sbjct: 319 NKKFEFRVNGYLPGSNPNVVGQRTINGVLGIQPR-ANGLGTNIVQQTLTLEARERALAGF 377 Query: 253 GASLMYEQYYGDEV--GLFGKD 272 + ++ D+V GLFG Sbjct: 378 DFEAGHRHHFNDKVSLGLFGGY 399 >UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N7C0_9GAMM Length = 546 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 48/151 (31%), Gaps = 17/151 (11%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIH-RTDDRTQSNIGFGWRHFS 174 R+ + ++L P++ ++LF DD + NIG RH Sbjct: 31 WNPRIDFEGKLGNDRSIAEADLLIPLWQNNDSLLFANIRGRLDNDDSYEGNIGLALRHML 90 Query: 175 GNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY- 230 N W G + D ++ +G E L AN YI + D D Sbjct: 91 DNGWNLGGYGYFDRRKSPYDNFFNQVTLGVEALSLNWDLRANTYIPVGESSYAEDSLDTV 150 Query: 231 ------------QERPANGWDIRAEGYLPAW 249 +ER G+D +P + Sbjct: 151 DFSGTTITYRAGEERSMRGYDAEVGWRIPVF 181 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 113 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 49/211 (23%), Positives = 81/211 (38%), Gaps = 12/211 (5%) Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPIY--- 142 D+DA + + + + + K R L++ + K +E L P+ Sbjct: 84 SDTDAVNSALQAVVMTGVHSAMHGSKAKPWMQRTVLSLRFQKNWKPLYGVETLQPLGHYD 143 Query: 143 DTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVG 200 +T ++ FTQ + D T +N+G G+R + +D G N F DH +H R+ VG Sbjct: 144 ETSRHVWFTQERLANAADTGTTANVGIGYRRIAENDDHYYGGNLFYDHRFRGNHGRMSVG 203 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 EY N Y SG + S D E +NG+ + + Sbjct: 204 LEYVSGIGAFRMNWYRGVSGER-SLDGATRMENVSNGYTAEYGTSFKNARWARVYMEAYR 262 Query: 261 Y----YGDEVGLFGKDKRQKDPHAISAEVTY 287 + D+ GL + Q P IS ++ Y Sbjct: 263 WQLRRSADKHGLRIGTELQLTPR-ISVDMGY 292 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 41/170 (24%), Positives = 73/170 (42%), Gaps = 8/170 (4%) Query: 82 FLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTAR---VKLNVDKDFSLKDSSLEML 138 S+ + ++ G+A++ + LG+ + + L V + F SL + Sbjct: 29 ISSASSLENRVTSYFNGLASSLGTS-VSSLLGENSRVKYLDLNLGVQEHFK-PTISLTNV 86 Query: 139 YPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRSHTRI 197 I + + +F Q +++ ++ N+G G R +D + G+N F D+ SH R Sbjct: 87 NMISEYGNSAIFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQRN 146 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 G G E L +N Y SG + D E +GWD+R + +LP Sbjct: 147 GAGLEVLSSVFDLRSNIYDATSGIEAVSTSRD--EEAMDGWDMRLDYHLP 194 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 7/143 (4%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + ++ I+ Q FP+A++ TP + + A + +LS +NN Sbjct: 4 MNNTLLDKLRKKKIFSYFIIASQFSFPIALSLTPTIQSYAATVEENKLST-----NTENN 58 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + +A + GT LSS DA ++ A +K N+EI+ W +YG A++ L VDK Sbjct: 59 NGRWLAQQTSQLGTILSSDNTHDAASQYLINQANSKVNREIENWFNQYGKAQINLGVDKH 118 Query: 128 FSLKDSSLEMLYPIYDTPTNMLF 150 F+LK L+ L T ++F Sbjct: 119 FTLKTQKLKSL--FLFTKQTIIF 139 >UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T1E3_JANMA Length = 553 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 50/169 (29%), Gaps = 23/169 (13%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 A A A QE Y + L + P+ ++ F + Sbjct: 22 AGAYAQNAGQEKWSTY----LDLEGKVGSKRDIGEANLFIPVVQDARSLYFANVRARMAN 77 Query: 160 DRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGY 215 + ++G G RH W G F+D + S+ + +G E AN Y Sbjct: 78 GGDFEGSLGGGMRHMLETGWNLGAYGFVDRRRTTYNNSYDQATLGVEALGRQFDWRANVY 137 Query: 216 IRASGWKKSPDIEDY---------------QERPANGWDIRAEGYLPAW 249 + + +ER G+DI A LP + Sbjct: 138 QPFGKKSTTLSSSNTGSVSGGSLFVTTTAQEERALPGFDIEAGWRLPVF 186 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 67/151 (44%), Gaps = 12/151 (7%) Query: 76 AANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV----DKDFSLK 131 A G +S DA +N + + N + ++ ++G +++V + S Sbjct: 9 ATAGGKGVSE--VLDAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKGDEASYS 66 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAI----HRTDDRTQSNIGFGWRHFSG-NDWMAGVNTFI 186 + + L P+ + + + F QG++ D RT N+G G R + G+N+F Sbjct: 67 IITTQPLAPLSEDGSRL-FWQGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIVGINSFY 125 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIR 217 D++ S H R+ +G EY R +LS N Y Sbjct: 126 DYEFSAKHKRMSLGGEYKRSNAELSVNKYWG 156 >UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA50_9CHLA Length = 531 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 50/174 (28%), Gaps = 19/174 (10%) Query: 112 LGKYGTARVKLNVDKDF---SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIG 167 ++G R + + + P+ F H + R +N+G Sbjct: 266 FSEFGYVRGAYTFGEGIGIRHNYSTLTALFAPLVPYDDYYPFLDLRAHYIKNKRWAANVG 325 Query: 168 FGWRHFS-GNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 G R ++ G N + D+ + + G G E++ + ++ N Y Sbjct: 326 GGLRWRDCMTGFIFGANLYYDYRNTTQTDFNQFGFGLEFFTNCFEMRLNAYFPVGDVTHC 385 Query: 225 PD--IEDY----------QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 D DY E G D+ P YY +V Sbjct: 386 EDHVFSDYIGPYYAVCGLTEIAQKGVDLEVGHTFWKCPYFSVFGAIGGYYYTDV 439 >UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickettsiella grylli RepID=A8PQA2_9COXI Length = 642 Score = 103 bits (258), Expect = 5e-21, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 61/201 (30%), Gaps = 44/201 (21%) Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 + ++P+ + L+ A+ TD++ Q ++G G+R + + G F Sbjct: 43 DYTVGQADAMFPLSGDMSRNLYVDPALSYGTDNQNQFDVGLGYRWITNQAAIVGGYFFGG 102 Query: 188 HDLSRSHTRIGV---GAEYWRDYLKLSANGYIRASGWKKSPD------IEDYQE------ 232 + ++ R+ + G E + N YI + + E Sbjct: 103 YSRVDNNARLWIANPGIEAFGSRWDAHLNAYIPMGDRHYTAGTEIVHFFTGHSEFGRVFL 162 Query: 233 ---RPANGWDIRAEGYLPAWPQ--------------------LGASLMYEQYYGDEVGLF 269 +G DI+A L +P G + E + V L Sbjct: 163 MHQYAGSGADIKAGYQL--FPHSSLKGYLGSYYFSPAETNNVWGGAAGLEYWLTQGVKLI 220 Query: 270 GK---DKRQKDPHAISAEVTY 287 G D +A + + Sbjct: 221 GSYSYDNLHHSTYAFGIGLEW 241 >UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4TV20_9PROT Length = 732 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 52/151 (34%), Gaps = 21/151 (13%) Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGN 176 V ++ + + + + PI +N+LF + ++ + N G G+R + Sbjct: 34 PSVDVSGKAGETRRIGEVNLFLPIAQDDSNLLFLDLRTSFDNLEQREGNFGLGYRAMQDS 93 Query: 177 DWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY--- 230 W G F D S ++I G E N Y+ +KS ++ED Sbjct: 94 GWNLGAYAFYDRRRSSEGHYFSQITTGLEALGQDFDARINAYLPIG--RKSYEVEDSARV 151 Query: 231 ------------QERPANGWDIRAEGYLPAW 249 ER +G D LP + Sbjct: 152 DLSGGSIQILSGLERAYHGGDAELGWRLPVF 182 >UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BR71_9GAMM Length = 851 Score = 100 bits (250), Expect = 5e-20, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 49/171 (28%), Gaps = 19/171 (11%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 F + + + + + + V + D +L PIY T + +LFT+ Sbjct: 14 FALSITFTEHSLASSDKWDPWLESGVSIGTDNS---SRGEAALLLPIYQTDSGLLFTELR 70 Query: 155 IHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKL 210 D + + N+ G+R N W G+ D + + G E Sbjct: 71 GKLFDAGSKEGNLALGYRKMINNRWAIGMWVGRDIRTSEYGNRFHQEAWGLEALHPNWDF 130 Query: 211 SANGYIRASG------------WKKSPDIEDYQERPANGWDIRAEGYLPAW 249 N Y S I E P +G+D Sbjct: 131 RINAYNALSSAQAYPQPVEAELIGNQLFITSAAEVPLSGYDFELGHRFSVL 181 >UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C087_9PLAN Length = 849 Score = 99.3 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 56/185 (30%), Gaps = 23/185 (12%) Query: 102 AKANQEIQEWLGKYGT-------ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + A + E+ ++ A + + P+ ++ F Sbjct: 18 SYAQDPVPEYQPEWFQEEDYLYRAYFDFTGQAGGVNDNGQGLLFIPLAQDEESLFFADLR 77 Query: 155 IHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKL 210 + DD + + N G +R + W+AG+ F D S + G E Sbjct: 78 GNIFDDSSAEGNFGLAYRRMVNDQWIAGMYGFYDVRRSQYSNIFRQGSFGFELLSIEWDF 137 Query: 211 SANGYIRASGWKKSPDIEDY------------QERPANGWDIRAEGYLPAWPQLGASLMY 258 NGY+ + ++ + +ER G D L ++P+ Sbjct: 138 RVNGYVPSQKQQRVDSLNTAYLSGNNIVMRAGEERAYWGTDFEVGRLLKSFPESNLDAEL 197 Query: 259 EQYYG 263 Y G Sbjct: 198 RGYVG 202 >UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VI48_9CYAN Length = 908 Score = 98.9 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 59/188 (31%), Gaps = 26/188 (13%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAI-- 155 +A +A E + L + + LE P+ TP N+ F +G + Sbjct: 22 LAQTEAESETADTLRIKPRLGIGHTSSGGGFDGFTRLEGFVPLLQTPGKNLTFLEGRLFL 81 Query: 156 HRTDDRTQSNIGFGWRHFSGNDW-MAGVNTFIDHDLSRSH--TRIGVGAEYWRDYLKLSA 212 D N+ G+R +S N + G D+ + + ++G+G E Sbjct: 82 DNDDANLGGNLILGYRTYSANSHRIWGGYMSYDNRHTGHNTFNQLGLGIESLGTVWDFRV 141 Query: 213 NGYIRASGWKKSPDIED-----------------YQERPANGWDIRAEGYLPAWPQLGAS 255 NGY+ ++ +E GWD L ++G Sbjct: 142 NGYLPIGDTRQGVGDAGVRDIFFRRNFLILEQGQNKEAAMGGWDAEVGAKL---ARIGID 198 Query: 256 LMYEQYYG 263 Y G Sbjct: 199 GDLRGYGG 206 >UniRef50_A8PQI7 Putative outer membrane autotransporter barrel domain n=5 Tax=Rickettsiella grylli RepID=A8PQI7_9COXI Length = 1171 Score = 97.8 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 58/197 (29%), Gaps = 38/197 (19%) Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGN 176 AR NV + + P+ + + A+ + ++G G+R Sbjct: 34 ARFSGNVYGSTKYVVGQADAMLPLVGDAQHNFYIDPALTSGSNWEGHGDLGLGYRWIQNG 93 Query: 177 DWMAGVNTFIDHDLSRSHTRIG---VGAEYWRDYLKLSANGYI----------------R 217 + G F +++ ++ RI G E NGY R Sbjct: 94 SAILGGYLFGEYNRMDNNVRIWTMNPGIEALGSRWDAHLNGYFVMDNRSKVVGTDLEFVR 153 Query: 218 ASGWKKSPDIEDYQERPANGWDIRAEGYL-PAWPQ-----------------LGASLMYE 259 G ++ D + NG D++ L P P LG ++ E Sbjct: 154 FRGHSAVYNLFDVTQNVGNGGDVKLGYQLFPKTPLKAFVGSYFFSPAETKNILGGAVGLE 213 Query: 260 QYYGDEVGLFGKDKRQK 276 + V +F K Sbjct: 214 YWANRNVKVFASYTYDK 230 >UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFX4_PLALI Length = 1567 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 57/182 (31%), Gaps = 20/182 (10%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-N 165 + E + + +S+ P + +++FT T+ N Sbjct: 82 SVDEIFNPIFRVDARGGQLYGYDEGYTSVGGFLPFFRDENSLIFTDIRGLMTNGGKGGAN 141 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R F D + GV+ + D D + GV E YL NGY+ + Sbjct: 142 VGVGYRQFVPELDRIFGVSGWYDFDNGHREAFNQFGVSFESIGRYLDWRVNGYLPVEDNE 201 Query: 223 KSPDI----EDYQER------------PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 + + +Q G+D G P + G S YY Sbjct: 202 EISNQILGAAGFQNNFILLNRGRSVDSAYKGFDTEIGGPFPILGRYGMSGYVGMYYYANT 261 Query: 267 GL 268 + Sbjct: 262 DV 263 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 68/184 (36%), Gaps = 26/184 (14%) Query: 120 VKLNVDKDFSL-----KDSSLEMLYPIYDTPTNM-LFTQGAIHRTDDR-TQSNIGFGWRH 172 + LN SL + M++P + + +F G D ++G G RH Sbjct: 80 LNLNYTFGKSLGCQKSYGTFGGMIFPFFSSCRPFQIFLDGKAFLFDHGKWGGSVGIGLRH 139 Query: 173 FSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWK-------- 222 FS N WM G+N + D+ ++G+G E D ++ NGY+ + + Sbjct: 140 FSYNGWMVGLNGYYDYRRFNGWDLNQLGLGVELLGDCVEFRVNGYLPVNKNRWDQCCLFN 199 Query: 223 -KSPDIEDYQER--PANGWDIRAEGYL--PAWPQ-LGASLMYEQYYGDEV---GLFGKDK 273 +ER +G D +L P+ Q +G + YY F D+ Sbjct: 200 YSGSYFATLRERGYVWSGLDTEIGTWLVKPSCCQDIGLYVAAGPYYYRRSHDQDFFFHDQ 259 Query: 274 RQKD 277 + Sbjct: 260 KHHT 263 >UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RBA5_9CHLA Length = 306 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 62/178 (34%), Gaps = 22/178 (12%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDS--SLEML-YPIYDTPTNMLFTQGAIHR-TDDRT 162 + EW+ A ++ V K ++ S + P+ D+ + F IH +R Sbjct: 44 QANEWVFPPTLAYLQGVVGKGIGEQNGYASFGIFTIPLLDSNGQLFF-DARIHNLRHERW 102 Query: 163 QSNIGFGWRHFSG-NDWMAGVNTFIDHDLSR-SHTRIGVGAEYWRDYLKLSANGYIRASG 220 +N+G G R + G+N F D+ +R + ++G G E NGY Sbjct: 103 AANVGVGTRIAIPCTNLFFGINFFYDYRRTRHDYHQLGPGLELIHPCWAFRINGYFPICD 162 Query: 221 ---WKKSPDIEDYQ---------ERPANGWDIRAEGYLPAW-PQLGASLMY--EQYYG 263 K + + +G D+ E L W P L + Y+ Sbjct: 163 RSLRKHPKVFRFHDNLFAACTQIQNSLSGGDLELETSLRRWDPCLCFDVYIAPGGYFY 220 >UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6INS3_RHOCS Length = 922 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 58/191 (30%), Gaps = 32/191 (16%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 G +A A+ + +++ + GT + S+ + P+ D+ F Sbjct: 2 TALGAGSAAADPALMDFVLRPGT-----------DGAEGSIAVAIPLADSDAARTFLDLR 50 Query: 155 IHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKL 210 D DR +NIG G R G + G + D DL ++ V + L L Sbjct: 51 GSIDDADRRVANIGIGHRFRLG-AVVLGGAVYYDRVRTDLESDFSQATVSLDLMTADLDL 109 Query: 211 SANGYIRA----------------SGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 AN Y SG I +E G+D L A Sbjct: 110 RANYYAPLDDEESVGTTVAGAPRLSGNHIVRSIFQPREVTLKGFDAEVGYRLGAIEGYDV 169 Query: 255 SLMYEQYYGDE 265 Y + Sbjct: 170 RAFAGGYRYTD 180 >UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8X2_9PLAN Length = 1606 Score = 91.6 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 39/155 (25%), Positives = 54/155 (34%), Gaps = 22/155 (14%) Query: 133 SSLEMLYPIYDTPT-NMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHD 189 S+L +L P P +MLF TD N+G GWR ++ N D + V + D+D Sbjct: 142 SNLGVLMPFTINPEQSMLFLDLRAMVTDQGAGGVNLGAGWRAYNDNLDKIFTVAGWYDYD 201 Query: 190 LSR--SHTRIGVGAEYWRDYLKLSANGYIR-----------ASGWKKSPDIEDYQERPAN 236 + ++G+ E YL NGY SG Y R Sbjct: 202 DGHYQDYHQLGLSGEVIGQYLTTRVNGYFPINNNEIIISNNLSGSAYFQTDRIYLNRTRR 261 Query: 237 ------GWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 G D G LP + G YY + Sbjct: 262 SESSYGGVDAEVGGPLPVLGKFGIDGYVGGYYYNS 296 >UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDV5_NEOSM Length = 696 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 57/177 (32%), Gaps = 17/177 (9%) Query: 101 TAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 + N + +G T + + ++ S L P+ N+++ D Sbjct: 148 QSDLNNTSRHTVGARFTVTNEFSDSNGGAVSMSEFGALLPLLSKVDNLIYIDLKSKLYDA 207 Query: 161 RT-QSNIGFGWRHFSGNDWMAGVNTFIDHDL-SRSHTRI-GVGAEYWRDYLKLSANGY-- 215 + + + G +R G+N F D + R +G E + L+ N Y Sbjct: 208 KEGEVSTGIVFRRQMSPLLTGGINVFTDVRFLPEGNYRWYSLGGEIFFKSFSLNGNYYRS 267 Query: 216 IRASGWKKSPDIEDY-----------QERPA-NGWDIRAEGYLPAWPQLGASLMYEQ 260 + + E + ER A NG+D+ L + + S + Sbjct: 268 NKKTTISSVKSFEFHDPDPGKAVIVLDERAAGNGYDLGLGLTLNKYINIHGSAFFFY 324 >UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPY3_9CYAN Length = 1370 Score = 89.3 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 30/261 (11%), Positives = 75/261 (28%), Gaps = 56/261 (21%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 +A N V++L+ ++ A P L + + ++ + + ++ Sbjct: 1 MAIACMNSLVRLLWTSFCFTPLLIPAAIAQTEIPSLPKADAVPESHPSLGSPLQAQTPDS 60 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 + + + ++G + + + L+ Sbjct: 61 PPSTTPDLTTLQIK-------------------PRWG---IGYSTSGAGYDGFTRLDSFL 98 Query: 140 PIYDTP-TNMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--H 194 P+ P + + F +G + + N+ FG R ++ + + + G D + + Sbjct: 99 PLLQNPGSTLTFLEGRLQLDNSANVGGNLLFGHRFYNQSLNRIFGGYLGFDRRDTGNSTF 158 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKK-----------------------------SP 225 ++GVG E + + NGY + Sbjct: 159 HQLGVGVETLGEVWDVRLNGYFPLGDTRDLVDETAFDTGFQLTDRFFSDHFLVIQGKRQR 218 Query: 226 DIEDYQERPANGWDIRAEGYL 246 + E G+D+ L Sbjct: 219 GQVRHFEAAMTGFDLEVGARL 239 >UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLP1_ACAM1 Length = 1022 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 60/157 (38%), Gaps = 11/157 (7%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQG 153 + +A+ + ++G + N + + LE P++ P + F +G Sbjct: 24 IAEPQPSTQASDL--RFSPRFG---IGANSPSSGTNTTTRLETFVPVWQKPGRALTFFEG 78 Query: 154 AIHRTDDRT-QSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH--TRIGVGAEYWRDYLK 209 + D NI FG+R +S + + G + D + ++ ++ +G E + Sbjct: 79 RLLLDDQGNPGGNILFGFRQYSDDLKRIFGGHLGFDIRNTDNNTFQQLSLGIESLGKDVD 138 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 L NGY ++ ++ NG D R G + Sbjct: 139 LHLNGYWPVGSTRRQTRQRIFEVLQLNG-DPRFTGNI 174 >UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A8_9CHLA Length = 225 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 41/131 (31%), Gaps = 17/131 (12%) Query: 149 LFTQG-AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 +F D + ++ G G R + + G+NT+ D+ R ++GVG E Sbjct: 8 VFIDLDGYRFNDGKWGASTGIGIRKELSDGCVLGLNTYYDYLRGRGRFSFHQVGVGFEML 67 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDY-------------QERPANGWDIRAEGYLPAWPQ 251 D + NGY+ S S + E G D L + Sbjct: 68 SDCFDVRINGYLPVSEKVHSHQCLSFHYSGTDFHASRCKLEYAYGGLDAEIGKPLLTYYD 127 Query: 252 LGASLMYEQYY 262 YY Sbjct: 128 FDLYGAVGPYY 138 >UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSX1_9GAMM Length = 808 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 51/191 (26%), Gaps = 36/191 (18%) Query: 102 AKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-D 160 EW + D S L L P Y + + +D D Sbjct: 19 GSVQAADSEWKP---NTQAYFAAGDDRSYFG--LAGLIPFYQDGKRLGYADLRYSSSDVD 73 Query: 161 RTQSNIGFGWRHFSGND-WMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYI 216 + N+G G+R + N+ + G D S R + ++ GAE D +N Y Sbjct: 74 TDEINLGAGFRSLNENETAIYGFYGSYDLRKSATERDYRQLTFGAELLTDTWDYRSNFYF 133 Query: 217 RASGWKKS-----------PDIEDYQ--------------ERPANGWDIRAEGYLPAWPQ 251 + + E +G DI G L + Sbjct: 134 PTGDDSYQVGNAEDDVTVESEFVGHDLVRTTTTVGGGTIFEEALSGADIEV-GRLLNFDN 192 Query: 252 LGASLMYEQYY 262 Y+ Sbjct: 193 FEMRGYLGAYH 203 >UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN12_PLALI Length = 2615 Score = 84.7 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 57/176 (32%), Gaps = 26/176 (14%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-NIGFGWR 171 G Y R + N + + + L P+ + ++ Q + TD N+G R Sbjct: 45 GTYFDVRNQSNSGVGYQHGFTQIGALTPLLNDGQFLIAPQARLLITDTSKIGVNVGLIGR 104 Query: 172 -HFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASG------- 220 + +G D + G N + D+D S +++IG G E L L AN Y+ Sbjct: 105 VYDAGRDRIWGANVYYDNDETTYSNRYSQIGFGFESLGQNLDLRANAYLPTGSSDKVIGP 164 Query: 221 ---------WKKSPDIED--YQERPANGWDIRAEGYLPAWPQLG-ASLMYEQYYGD 264 + E G D +P + Y+ D Sbjct: 165 NGLSNTLFYTGNQLNFTGSYLSEEALRGADFELG--IPVTQNMSWLRAYGGGYFYD 218 >UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLE3_PLALI Length = 1304 Score = 80.8 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 44/131 (33%), Gaps = 22/131 (16%) Query: 138 LYPIYDTPTNMLFTQG-AIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS-- 193 L P MLF DR +N+G G R++ N D + G N + D+D + Sbjct: 95 LMPYGFIENFMLFGDLRGFRSNSDRYGANVGGGARYYLENYDRIIGANAYFDYDETSGAP 154 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKK---------SPDIEDY-----QER----PA 235 +G G E Y N Y ++ S +D +ER Sbjct: 155 FRDVGFGIETLGRYWDARVNAYFPVGPTEQLLSQSVVTGSQRFQDTRILFDRERIVGLAP 214 Query: 236 NGWDIRAEGYL 246 G+D L Sbjct: 215 KGFDAEFGMPL 225 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 59/166 (35%), Gaps = 39/166 (23%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 T +N+G G+R S ++ GVNTF DH S+ + RI G EY ++ AN Y + Sbjct: 141 GTVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLN 200 Query: 220 GWKKSPD----IEDYQE----------------RPANGWDIRAEGYLPAWPQLGASLMYE 259 K P E Y E + +G+D+ A + Sbjct: 201 STKSEPYNVPLYEGYFEFLLDGGPAGYTVYKSQKALSGYDVSYARTFKNARWARAYVGAY 260 Query: 260 QYYGDEVGLFGKD-----------------KRQKDPHAISAEVTYT 288 + G V G+ Q PH +S +V YT Sbjct: 261 HWNGLGVKTHGEGPALALNVGKSHGWQAGTTLQLTPH-VSLDVGYT 305 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 35/140 (25%), Positives = 53/140 (37%), Gaps = 17/140 (12%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML----FTQGAIHRTDD--RTQSNIGFG 169 G K++ D ++ +SS P Y + + D +IG G Sbjct: 127 GNGGEKISSDAYWNGGESSYIGDDPKYKAAARLAQQPSYLDKGETVQHDSLGVVGSIGAG 186 Query: 170 WRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI- 227 +R S N+ G+NTF D+ +R+G+G EY K+SAN Y S K P Sbjct: 187 YRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANVYHGLSEKKTKPYYF 246 Query: 228 ---------EDYQERPANGW 238 D P +G+ Sbjct: 247 ENSLVIVPRADEFHYPEDGY 266 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 78.1 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 55/126 (43%), Gaps = 18/126 (14%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ R ++ A ++ + N+ Sbjct: 138 LRKLNQFRTFVR---NVRPGDELDV---------------QAQVSEKNLTPPPGNSSGNL 179 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+DF Sbjct: 180 EQQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDF 239 Query: 129 SLKDSS 134 SLK+S Sbjct: 240 SLKNSR 245 >UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZV2_9CYAN Length = 1059 Score = 75.1 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 47/145 (32%), Gaps = 21/145 (14%) Query: 123 NVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAIHRTDDRTQS-NIGFGWRHFSG-NDWM 179 + + SLE PI P + + F +G + D T I G R ++ + + Sbjct: 57 SEGAGYQDPFFSLEGFVPITQNPGSTVTFLEGQLRLFTDSTMGGTILLGQRFYNSTQNRI 116 Query: 180 AGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPD----------- 226 G D + + +IG G E D L N Y+ + D Sbjct: 117 LGGYLSYDTRDTGNSLFHQIGAGFERLGDDWDLRVNAYLPVGERRPEVDESFSLRGFQEN 176 Query: 227 -----IEDYQERPANGWDIRAEGYL 246 E G+DI A G L Sbjct: 177 NLLLNHRQRFEAAMAGFDIEAGGRL 201 >UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQN0_PARL1 Length = 675 Score = 74.7 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 56/180 (31%), Gaps = 28/180 (15%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR-TQSNIGFGW 170 G + A L+ ++D P++ + ++LF + T+ N G+ Sbjct: 31 WGPWIEAGGFLSTERD----RGEATAFMPLFQSGESLLFADVKGKLFSEGVTEGNFALGY 86 Query: 171 RHFSGNDWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 R + D G+ D S + + G E NG++ + K +P + Sbjct: 87 RRMTAWDVNLGLWGGYDIRESVSGNTFDQAAFGIEALAADYDFRLNGFVPLADGKAAPGM 146 Query: 228 EDYQ------------ERPANGWDIRAEGYLPAWPQLG-------ASLMYEQYYGDEVGL 268 + E G++ LP LG L Y D+ L Sbjct: 147 ARVELSGSQILLTGGRELVLGGFEGEVGWRLP-LEALGADRERHEFRLYAGGYRFDDSDL 205 >UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YK40_ANASP Length = 1487 Score = 73.9 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 61/186 (32%), Gaps = 23/186 (12%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS--SLEMLYPIYDTP-TNMLFTQGAIH 156 A+ + Q + T RV + + + +S S E P+ P ++ F QG + Sbjct: 19 ASTVSAQTPASTTAQVFTPRVGVRYTTEGAGYESFSSFEGFLPVLQIPGNSLTFLQGKLL 78 Query: 157 RTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSA 212 +D + NI G R FS + + G + + ++G+G E Sbjct: 79 LDNDSNLATNILLGHRIFSEEANRVIGGYISYSTRDTGKSNFDQLGLGFETLG-VWDFRF 137 Query: 213 NGYIRASGWKKSPDIED---------------YQERPANGWDIRAEGYLPAWPQLGASLM 257 N Y+ +G + + + + E +G D L + Sbjct: 138 NAYLPLNGSENQVEQANLPFFQGDSLMVQRSRFLEVAMSGVDAEVGTRLASLGSGDLRGY 197 Query: 258 YEQYYG 263 YY Sbjct: 198 AGVYYY 203 >UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746965 Length = 1076 Score = 72.0 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 49/143 (34%), Gaps = 30/143 (20%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + D + ++ P++ + +LF + + + ++G G+ Sbjct: 50 TVNAGVKSSDAYTDGNFSIVAPVWSSLGAEGTLSGGVLFLEPYTSYGEGGEIAASLGLGY 109 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 R+ F G N F +D + ++GVG E+ YL Sbjct: 110 RYLFGAQPISALTRKDAPQAGFFEEGVFVGTNVFIDMLDTEADNQFWQLGVGVEFGNRYL 169 Query: 209 KLSANGYIRASGWKKSPDIEDYQ 231 + N YI S + + + + Sbjct: 170 EFRGNYYIPLSDKQVAEQFKTRE 192 >UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PN48_9COXI Length = 607 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 60/224 (26%), Gaps = 51/224 (22%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG-AIHRTDDRTQSN 165 + +E L +A V +++ + + L+ + TD + Sbjct: 28 QAREPLPPRFSAEAYTGV-----YTVGRADLMVSLDGDGQHNLYVDPQGGYGTDQEWYGD 82 Query: 166 IGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R S + + G F H + S G E N YI +G Sbjct: 83 VGLGYRWISNDAAIVGWYVFAGHSCVENSSGFWITNPGVEIMGSRWDARINAYIPVAGRS 142 Query: 223 K------------------------SPDIEDYQERPANGWDIRAEGYLPA---------- 248 S + ++ NG D R L + Sbjct: 143 DDLGGIESTTAGPSFFTGHSELRTVSFTAFNEVQQVGNGADARVGYQLFSGVPLKAVVGA 202 Query: 249 ----WPQL----GASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 P G + ++ D V +F + H+ Sbjct: 203 YFFEIPHAENVRGGGAGVDYWFDDYVRVFARYNYDNRQHSQVVG 246 >UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK3_9PLAN Length = 967 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 55/201 (27%), Gaps = 36/201 (17%) Query: 98 GMATAKANQEIQEWLG---KYGTARVKLNVDKDFSLKD------SSLEMLYPIYD--TPT 146 G+ + N ++ E G +G R + SS + +P+ + Sbjct: 31 GVPQEEINGDVSELFGDSGWFGRYRPHFGYRYEAGDTIGRIGGLSSFDAFFPLLEGEDSD 90 Query: 147 NMLFTQGAIHRTDDR--TQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGA 201 + F + DD SN+G G R + G + D + S ++ G Sbjct: 91 WLTFIDARLLLGDDNHNLGSNVGVGARQYIPEYQRTIGAYIYYDTRDAGYASFDQVSGGI 150 Query: 202 EYWRDYLKLSANGYIRASGWKKSP--------------------DIEDYQERPANGWDIR 241 E D N Y+ + Y + G D+ Sbjct: 151 ETLGDIWDARLNWYVPTGQTRNQYATTHTSGGSYKFVGHYLTGGTFTRYYQAAMKGLDME 210 Query: 242 AEGYLPAWPQLGASLMYEQYY 262 A + + Y+ Sbjct: 211 AGAKFYSNESMDLRAYAGWYH 231 >UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E34 Length = 1016 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 52/148 (35%), Gaps = 32/148 (21%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQSN- 165 GT L + D ++ P+Y T ++LF + + + ++ Sbjct: 51 YLGTVTAGLKTSD--AYTDGHFSIVAPLYSTLGADATLEGSVLFIEPYVSYGEGGEIASS 108 Query: 166 IGFGWRH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEY 203 +G G+RH F G + F +D + + ++GVG E Sbjct: 109 LGLGFRHLFGSQPLTALSANNTAQAGFLDEGVFVGSSVFVDMLDTEANNQFWQLGVGIEA 168 Query: 204 WRDYLKLSANGYIRASGWKKSPDIEDYQ 231 Y+++ N YI S + + + + Sbjct: 169 GTRYVEVRGNYYIPLSDKQLAEETRTRE 196 >UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK4_9PLAN Length = 786 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 46/158 (29%), Gaps = 28/158 (17%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPI--YDTPTNMLFTQGAI--HRTDDRTQSNIGF 168 +G R + SSL+ P+ + + F + + SN+GF Sbjct: 53 PHFGY-RYQAGDTIGRIGGLSSLDGFLPLLEAEDGNWLTFLDARLLLDDQNQNLGSNVGF 111 Query: 169 GWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP 225 G R + G + D + R+ +++ G E D N Y+ + Sbjct: 112 GARQYLPEWGRTIGGYVYYDTRDTGTRNFSQVSGGIETLGDLWDARLNWYVPTGSRRSLV 171 Query: 226 D--------------------IEDYQERPANGWDIRAE 243 + Y + G D+ A Sbjct: 172 GTSHTVGGPSQFIGHYLYGGILTRYYQAAMTGVDMEAG 209 >UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZRN5_9PLAN Length = 792 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 61/190 (32%), Gaps = 28/190 (14%) Query: 84 SSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEML---YP 140 S+Q D + + + A+ G+Y R+ + + + D S P Sbjct: 24 SAQQAGDDIQPGLISGTSTFASPYANGQGGEYF-PRISVQHRTEGAGYDYSFTDFRAWVP 82 Query: 141 IYD--TPTNMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHT- 195 +Y+ ++ F GA +D+ N G R +S N G D+ + + T Sbjct: 83 LYESYDSKSLTFFDGAFLLANDQNVGMNAVVGQRFYSDNYGRTFGGYVGYDNRDTGNQTV 142 Query: 196 -RIGVGAEYWRDYLKLSANGYIRAS-----------------GWKKSPDIEDYQERPANG 237 ++ G E + NGY + G+ E G Sbjct: 143 GQVVTGFESLGR-IDFRVNGYFPTTSDPTMTGQTGFFDPTYVGYNIQLSQLTQYEVAMKG 201 Query: 238 WDIRAEGYLP 247 +D G LP Sbjct: 202 FDAEIGGALP 211 >UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA61_9CHLA Length = 188 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 38/111 (34%), Gaps = 12/111 (10%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLY------PIYDTPTNMLFTQGAIH-RTDDRTQSN 165 +Y + D S+ L P+ +F+ H T N Sbjct: 22 NEYFKTYLSYKGGNDGLGYHSNYASLDLMCFPLPL---EDITIFSDLKGHWLTRHHYAVN 78 Query: 166 IGFGWRHFSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANG 214 G G+R + N F DH S + ++G+G E + + +L NG Sbjct: 79 AGVGFRKIYAPQTIWDANLFYDHPKSSYDHYNQVGLGLELFHELWELRLNG 129 >UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZT8_9GAMM Length = 785 Score = 67.4 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 66/205 (32%), Gaps = 33/205 (16%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 G R LNV + + L P+ ML+ GA+ T T +G G+R Sbjct: 35 WGGPWKPRQTLNV-QGGHGMQDYYDALLPLSGNAERMLYANGALAATHHETGGELGLGYR 93 Query: 172 H-FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS-------- 219 H N+++ G + ++ +G E++ + A+ Y+ S Sbjct: 94 HIILNNEYVIGGFALMGRYQTNYHNMFNQLTLGTEFFGSIWEGRAHLYLPVSRRTKFVRS 153 Query: 220 --------GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK 271 G K E G D+ +P P+L YY + +G K Sbjct: 154 RSEGLSFQGHKLFGIQTTTYEHAEGGADVEIGHVIPGIPKLRGFA---GYYNNGLGNEHK 210 Query: 272 D---------KRQKDPHAISAEVTY 287 + R + + +Y Sbjct: 211 NINGGYGRFEYRYNNHFTFTLGDSY 235 >UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174607D Length = 975 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 50/143 (34%), Gaps = 30/143 (20%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + + ++ P++ T ++++ + + + ++G GW Sbjct: 91 TVTSGVKTSDVYTEGNFSIVAPVFSTLGADATLSGDVIYLEPYTSSGEGGEIAASLGLGW 150 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 RH F + G N F +D + + ++GVG E YL Sbjct: 151 RHLFGSQPVSALTRKDAPQASFLEEGFFVGANLFIDMLDTEANNQFWQLGVGIEAGTRYL 210 Query: 209 KLSANGYIRASGWKKSPDIEDYQ 231 ++ N YI S + + + Sbjct: 211 EVRGNYYIPLSDKQLAEQTRTRE 233 >UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QR03_CYAP0 Length = 1985 Score = 62.4 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 45/132 (34%), Gaps = 11/132 (8%) Query: 114 KYGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAI--HRTDDRTQ-SN 165 +Y T RV + ++ + E +PI + FT+G + D +N Sbjct: 97 RYFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLASTHDAEDIRAN 156 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 G R +S + D + G D + + GVG E + N YI + Sbjct: 157 FLVGHRLYSQDHDRVYGAYIGYDLRDTKYNKFNQFGVGLETLGSFWDARFNAYIPLGTTQ 216 Query: 223 KSPDIEDYQERP 234 + + P Sbjct: 217 QQIGQTNTDLNP 228 >UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K1T2_CYAP8 Length = 1873 Score = 62.4 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 45/127 (35%), Gaps = 11/127 (8%) Query: 114 KYGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAI--HRTDDRTQ-SN 165 +Y T RV + ++ + E +PI + FT+G + D +N Sbjct: 97 RYFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLASTHDAEDIRAN 156 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 G R +S + + + G D + + GVG E D+ N YI + Sbjct: 157 FLVGHRLYSQDHNRVYGAYIGYDLRDTKYNKFNQFGVGIETLGDFWDARFNAYIPLGTTQ 216 Query: 223 KSPDIED 229 + + Sbjct: 217 QQIGQTN 223 >UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillonella dispar ATCC 17748 RepID=C4FS48_9FIRM Length = 421 Score = 61.6 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 ++G G+R S N+ GVN F+D + ++ RI G EY ++ AN Y Sbjct: 168 GIIGSVGIGYRRLSRNEHAYVGVNAFVDRAFTGNYNRISGGVEYVNGLNEVYANVYRGLG 227 Query: 220 GW 221 Sbjct: 228 DK 229 >UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellular organisms RepID=B4D818_9BACT Length = 5429 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 43/103 (41%), Gaps = 4/103 (3%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGND 177 RV ++ D SL+ L P+ +L+ + +D +IGFG+RH Sbjct: 74 RVTFGLEFYEHQIDESLDTLVPLATPQNGVLYFNPKLSLSDRLNPSVSIGFGYRHLLKAR 133 Query: 178 WMAGVNTFIDHDLSR-SHT--RIGVGAEYWRDYLKLSANGYIR 217 + T + D + H + GVGAE ++ AN Y+ Sbjct: 134 RSSSGETSLRSDYTNFDHHVNQFGVGAEVMSRWVDFRANYYLP 176 >UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=27 Tax=Cyanobacteria RepID=Q0IAR8_SYNS3 Length = 401 Score = 53.9 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 37/294 (12%), Positives = 83/294 (28%), Gaps = 50/294 (17%) Query: 31 VLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSD 90 + L + V + A ++ + + +S S Sbjct: 5 LSLGLLASAISVASLPAIAQEDGGAALLRQQRDKLLEQIEQLKQRKEQLEAQIS---GSA 61 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 ++ + N ++ + + + + + P+ ++ F Sbjct: 62 QGKDDAFDLQEISLNDAVK------FNWGFQGALQGAGTPNQAGIGGFLPLSVGENSVWF 115 Query: 151 TQGAI-----HRTDDRTQSNIG-----------FGWRHFSGND-WMAGVNTFIDH----- 188 ++ + N G+R +G+ WM G+N D Sbjct: 116 LDALANANFSDYENNSSIINTDVAGTTISTSSRLGYRWLNGDRSWMYGLNAGYDSRPMNT 175 Query: 189 ------------DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 + S ++ V AE + L+A I ++ + YQ N Sbjct: 176 GGTDTGINVSGTEKSAFFQQVVVNAEAVSNDWNLNAYALIPIGDTEQDLN-SFYQGGALN 234 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPH----AISAEVT 286 + + ++ P+L AS+ Y GD G + + ++A V Sbjct: 235 TYGLDVGYFI--TPELNASVGYYYQNGDLGSADGSGVLGRVAYEISNGLTAGVN 286 >UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C500_9PLAN Length = 1337 Score = 53.5 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 47/142 (33%), Gaps = 24/142 (16%) Query: 144 TPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGV 199 M+F + RT+ G G+R ++ + D + G + + D D S ++ + Sbjct: 145 DDAGMMFGNFRLWRTNRGNLGGGAGLGYRFYNYDTDRIFGTSFYYDRDDSTDKIFQQLAL 204 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ------------------ERPANGWDIR 241 E Y + N Y+ ++ ++E + G+D Sbjct: 205 NVETMGRYWDANGNFYLPIGNREQQLNLEFNDGSQRFSGFNVLYDQTRTIGKSMRGFDAE 264 Query: 242 AEGYLPAWPQLGASLMYEQYYG 263 +P W +L Y G Sbjct: 265 IG--VPIWGELAQQFQARAYAG 284 >UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6_SALET Length = 197 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 13/88 (14%), Positives = 27/88 (30%), Gaps = 4/88 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + AR ++ P P+ + + ++ Sbjct: 67 LKKLNGLRTFARGFDHLQAGDELDVPAV----PLTGGKGDNNRHDARGPFAADRENEDAQ 122 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFI 96 + + A+ AG+FL+S PD A + Sbjct: 123 AQQMVGMASQAGSFLASHPDGQAAAGMV 150 >UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MK14_SALAR Length = 110 Score = 44.6 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 14/29 (48%), Positives = 19/29 (65%), Gaps = 2/29 (6%) Query: 267 GLFGKD--KRQKDPHAISAEVTYTPVPLT 293 G+FG RQ++PHAI+ + Y PVPL Sbjct: 3 GIFGDGEADRQRNPHAIALGLNYPPVPLV 31 >UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=1 Tax=Synechococcus sp. RS9916 RepID=Q05XC6_9SYNE Length = 404 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 65/223 (29%), Gaps = 46/223 (20%) Query: 112 LGKYGTARVKLNVDKDFSLK--DSSLEMLYPIYDTPTNMLFTQG--AIHRTDDRTQSNI- 166 + K R + + + ++ PI T ++ F D + S+I Sbjct: 29 IRKTTKFRWNVFSKSQGAGTPNQAGGQVFIPISTTRKSIFFLDALATADFGDALSTSSIV 88 Query: 167 -----G--------FGWRHFSGNDWM-AGVNTFID-HDLSRS------------------ 193 G G+R + N + GVN D +S Sbjct: 89 NTPVEGTTFSTSSRIGYRWLNDNGDILFGVNAGYDSRPISTGIPSRYSWAPRSLLQPQDV 148 Query: 194 -HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 +I GAE + + + + + ++ Y + + I E L Sbjct: 149 FFQQIAFGAELVTNNIAIKPYALVPVGKTEDVLNLF-YSGGALDTYGIDIEHSFDEL--L 205 Query: 253 GASLMYEQYYGDEVGLFGKDKRQK---DPHA-ISAEVTYTPVP 291 AS+ Y GD G + +P S V YT P Sbjct: 206 TASIGYYYQQGDLTYANGSGLKSTIAINPAGSFSMGVEYTYDP 248 >UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CT24_9RHOB Length = 771 Score = 43.5 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 38/132 (28%), Gaps = 17/132 (12%) Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDH-- 188 + + + +P + + R + Q +I R + W GV F D Sbjct: 34 STGIALSFPFAIEENRATIARLSYGRDEGHNAQLSIEAMRRMTLAHGWTVGVGVFADSST 93 Query: 189 -DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-------------DIEDYQERP 234 D+ +++G+ + R + + N Y+ + + + Sbjct: 94 DDIGNRFSQVGMSGDLQRGIFQANLNAYLPVGTKSHADARYDALAEMDGTIRFKGGRSLA 153 Query: 235 ANGWDIRAEGYL 246 G D Sbjct: 154 LRGLDAEVGARF 165 >UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Campylobacter RepID=Q4HGX9_CAMCO Length = 267 Score = 42.7 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 63/168 (37%), Gaps = 14/168 (8%) Query: 51 VQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQE 110 + + + N + A + + ++ + G+ + Sbjct: 13 SLLNADELDNALKNNQNKWQKFNYQATQKAPTIKEENID--FKSALNGILSNVLE----- 65 Query: 111 WLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGW 170 K G + N+D F ++ ++ L +Y+ N L Q + T D + G Sbjct: 66 --NKNGIDKTDGNLD--FQNENVQIKNLNSLYEGENNSLLFQKEFYATQDSYNYSGGLIN 121 Query: 171 RHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEY-WRDYLKLSANGYIR 217 R+ +D++ G+N FID + ++ GAE + ++K +N Y+ Sbjct: 122 RY-EKDDFLLGINGFIDGQKEQKESK-SFGAELGYYQFVKAYSNYYVP 167 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.311 0.137 0.379 Lambda K H 0.267 0.0416 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,705,662,067 Number of Sequences: 3077464 Number of extensions: 79554209 Number of successful extensions: 152570 Number of sequences better than 1.0e-01: 143 Number of HSP's better than 0.1 without gapping: 293 Number of HSP's successfully gapped in prelim test: 73 Number of HSP's that attempted gapping in prelim test: 151575 Number of HSP's gapped (non-prelim): 465 length of query: 295 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 167 effective length of database: 646,480,964 effective search space: 107962320988 effective search space used: 107962320988 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 92 (40.0 bits)