BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (464 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 951 0.0 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 699 0.0 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 607 e-172 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 535 e-150 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 432 e-119 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 387 e-106 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 275 4e-72 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 271 4e-71 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 254 4e-66 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 228 3e-58 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 223 2e-56 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 219 2e-55 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 215 2e-54 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 212 2e-53 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 211 4e-53 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 211 4e-53 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 207 6e-52 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 207 9e-52 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 206 1e-51 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 202 3e-50 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 199 1e-49 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 198 3e-49 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 197 7e-49 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 194 5e-48 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 194 5e-48 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 194 6e-48 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 193 1e-47 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 191 4e-47 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 190 8e-47 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 190 1e-46 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 189 1e-46 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 189 2e-46 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 189 2e-46 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 189 2e-46 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 188 3e-46 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 187 9e-46 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 187 1e-45 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 186 1e-45 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 186 2e-45 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 185 3e-45 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 184 6e-45 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 184 8e-45 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 183 1e-44 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 182 2e-44 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 182 2e-44 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 177 8e-43 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 177 8e-43 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 177 1e-42 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 177 1e-42 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 176 2e-42 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 175 3e-42 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 174 4e-42 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 174 7e-42 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 173 1e-41 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 171 6e-41 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 169 2e-40 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 167 8e-40 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 164 6e-39 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 164 9e-39 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 160 7e-38 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 157 8e-37 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 148 5e-34 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 137 8e-31 UniRef50_P36943 Putative attaching and effacing protein homolog ... 118 5e-25 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 94 2e-17 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 85 5e-15 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 74 9e-12 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 69 3e-10 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 68 6e-10 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 67 1e-09 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 67 1e-09 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 57 1e-06 UniRef50_C0B2E7 Putative uncharacterized protein n=1 Tax=Proteus... 55 8e-06 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 50 1e-04 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 49 4e-04 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 49 5e-04 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 48 7e-04 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 45 0.004 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 45 0.006 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 44 0.012 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 42 0.048 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 951 bits (2459), Expect = 0.0, Method: Compositional matrix adjust. Identities = 464/464 (100%), Positives = 464/464 (100%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA Sbjct: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN Sbjct: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD Sbjct: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 Query: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY 240 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY Sbjct: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY 240 Query: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN Sbjct: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 Query: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP 360 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP Sbjct: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP 360 Query: 361 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG 420 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG Sbjct: 361 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG 420 Query: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP Sbjct: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust. Identities = 329/415 (79%), Positives = 376/415 (90%) Query: 48 MAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 M PE+H+GEK FAE+VK FGE SM DNGLDTGEQAK FA +VRDALS QVNQH+ESWLS Sbjct: 1 MMPESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLS 60 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 PWGNASV+V+VDN+G F GSRGSWF+P QDN RYLTWSQLGLT+Q++GLVSNVG+GQRWA Sbjct: 61 PWGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRWA 120 Query: 168 RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMAR 227 R WL+GYNTFYDNLLDE+LQRAG GAEAWGEYLRLSAN+YQPFA+WHE++ATQEQRMAR Sbjct: 121 RDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWHERSATQEQRMAR 180 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 GYD++A+MRMPFYQHL+T VS+EQYFGD VDLF+SG GYHNP+A+SLGLNYTPVPLVTVT Sbjct: 181 GYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLVTVT 240 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 AQHKQGESG +QNNLGLNLNYRFGVPLKKQL+A EVAES+SLRGSRYD+PQRN+LP +EY Sbjct: 241 AQHKQGESGVSQNNLGLNLNYRFGVPLKKQLAASEVAESKSLRGSRYDSPQRNSLPVIEY 300 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 RQRKTL+VFLATPPWDL+PGETVPLKLQ+RS +GIR + WQGDTQ LSLT GA A+S +G Sbjct: 301 RQRKTLSVFLATPPWDLQPGETVPLKLQVRSLHGIRHVSWQGDTQALSLTAGANADSIDG 360 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRW 462 WT+IMP W + EGA + WRLSVVVED +GQRVSSNEITL L EPF A+S+D+ RW Sbjct: 361 WTIIMPTWDSSEGAIHRWRLSVVVEDEKGQRVSSNEITLALTEPFMAMSDDDPRW 415 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 607 bits (1564), Expect = e-172, Method: Compositional matrix adjust. Identities = 286/434 (65%), Positives = 344/434 (79%) Query: 30 QKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGK 89 ++A NPFD N D LPDLG+APEN+ EKHFA ++K FGE S D+ L G+QA+ FA + Sbjct: 2 RQAQNPFDENGDNLPDLGLAPENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTR 61 Query: 90 VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGL 149 +RDA+S + ES LSPWGNA+VD+ VD EG+F GS GS F P QDN+RYLTWSQ+G+ Sbjct: 62 LRDAVSSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGV 121 Query: 150 TQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQ 209 +QQ+ GLV N G+GQRW G+WL+GYNTFYD L D++ RAGFGAEAWG+YLRLSAN+YQ Sbjct: 122 SQQNQGLVGNAGIGQRWTAGHWLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQ 181 Query: 210 PFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 P W + EQRMARGYD+TA+ +PFYQH+NTSVS EQYFGD+V+LF+SG+GYHNP Sbjct: 182 PLGGWEHRAGLLEQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYHNP 241 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 VA+ +GL+YTPVPLVTV+A H+QGESG +QN+LGL LNYRFGVPL KQLS EVA S+SL Sbjct: 242 VAVKVGLSYTPVPLVTVSAHHRQGESGVSQNDLGLKLNYRFGVPLNKQLSPDEVAASRSL 301 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 RGSRYD +R N+P +E+RQRKTL+VFLATPPWDL GETV LKLQ+RSR+GIRQL WQG Sbjct: 302 RGSRYDRVERTNVPVMEFRQRKTLSVFLATPPWDLSAGETVALKLQVRSRHGIRQLSWQG 361 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 DTQ LSLTP + SA+GWT+IMP W N GASN WRLSV VED QGQRV+SN ITL L Sbjct: 362 DTQALSLTPPIDSTSADGWTVIMPAWDNSPGASNSWRLSVTVEDEQGQRVTSNWITLKLS 421 Query: 450 EPFDALSNDELRWE 463 P L D+ R+E Sbjct: 422 VPVQTLPQDDPRYE 435 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 535 bits (1379), Expect = e-150, Method: Compositional matrix adjust. Identities = 255/323 (78%), Positives = 283/323 (87%) Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 +TWSQLGLTQQ +GLVSNVG+GQRWA+ WL+GYNTFYDNLLDENLQRAGFGAEAWGEYL Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRWAQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEYL 60 Query: 202 RLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFN 261 RLSAN+YQPFA W TAT EQRMARGYD+ A++R+PFYQH+NTSVSLEQYFGD VDLF+ Sbjct: 61 RLSANYYQPFADWQTHTATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVDLFD 120 Query: 262 SGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAG 321 SGTGYHNPVAL LGLNYTPVPL+T+TAQHKQGESG +QNNLGL LNYRFGVPLKKQL+A Sbjct: 121 SGTGYHNPVALKLGLNYTPVPLLTMTAQHKQGESGVSQNNLGLTLNYRFGVPLKKQLAAS 180 Query: 322 EVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG 381 EVA+SQSLRGSRYD PQRN+LPT+EYRQRKTLTVFLATPPWDL PGETV LKLQ+RS +G Sbjct: 181 EVAQSQSLRGSRYDTPQRNSLPTMEYRQRKTLTVFLATPPWDLTPGETVALKLQVRSVHG 240 Query: 382 IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSS 441 IR L WQGDTQ LSLT G S EGWT+IMP W + EGA+N WRLSVVVED +GQRVSS Sbjct: 241 IRHLSWQGDTQALSLTAGTDTRSTEGWTIIMPAWDHREGAANRWRLSVVVEDEKGQRVSS 300 Query: 442 NEITLTLVEPFDALSNDELRWEP 464 NEITL L EPF + +D W+P Sbjct: 301 NEITLALTEPFITMPDDNPHWQP 323 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 432 bits (1110), Expect = e-119, Method: Compositional matrix adjust. Identities = 228/472 (48%), Positives = 293/472 (62%), Gaps = 13/472 (2%) Query: 1 MSRFVPRIIPFYLL---LLVAGGTANAQSTFEQKA-------ANP--FDNNNDGLPDLGM 48 MS+F + LL L+V G T N F ++A A+P F LP+LG Sbjct: 40 MSQFYRYLTLSCLLPAVLVVGGFTLNDALAFTEQARVDDAPFADPARFAKMQQQLPELGT 99 Query: 49 APENHDGEKHFAEIVKDFGETSMN-DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 +N K AE K GE SMN D+ E+A + + RDA Q+ E LS Sbjct: 100 VHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVFNRFRDAAKQRAASEGEQLLS 159 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 P+G ASV + + ++G F GS P QDN YLT+SQLG+ Q + G V N G+GQRW Sbjct: 160 PYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQLGIEQSEYGSVGNAGLGQRWI 219 Query: 168 RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMAR 227 G+W VGYN F D+LL + QR GAEAWG+YLR SAN+YQP + + + RMAR Sbjct: 220 AGSWRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSANYYQPLSGCRNHSNSALMRMAR 279 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 GYD+T R +PFY+ L ++S EQY G+ VDLFNSG NP A+SLG+NYTPVPL T++ Sbjct: 280 GYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGNAVANPAAVSLGINYTPVPLFTLS 339 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 A HK+G+ GE+Q+ L +NYR GV L +QLSA VA +QSL GSRYD RNN P + + Sbjct: 340 ASHKEGDGGESQDKFALKMNYRLGVALSQQLSADNVAAAQSLSGSRYDGVNRNNSPVMAF 399 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 RQ KTL+VFLATPPW L+PGET+PLKLQI I+ + WQGDTQ LSLTP +G Sbjct: 400 RQLKTLSVFLATPPWQLQPGETLPLKLQIAHSNAIKAVSWQGDTQALSLTPPPNNVDPQG 459 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 W++IMP W + +GA+N W LSV +ED++ QRV+SN ITL L P + D Sbjct: 460 WSIIMPAWNSQQGANNSWHLSVTLEDSKHQRVTSNWITLKLSPPMTLQAADR 511 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 387 bits (993), Expect = e-106, Method: Compositional matrix adjust. Identities = 207/463 (44%), Positives = 291/463 (62%), Gaps = 14/463 (3%) Query: 7 RIIPFYLLLLVAGGTANAQSTFEQKAANP------FDNNNDGLPDLG-MAPENHDGEKHF 59 +++PF L A G A A +P + LP+LG A + + EK + Sbjct: 14 KVVPFATGCLPAMGLAWLCGALPAYAESPPAPDSVVQQPANDLPELGGNASNDAEREKEW 73 Query: 60 AEIVKDFGETSMND-NGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKV 118 A + K GE ++N+ + +A+++A+G+ L QQ + LSP GNA + + + Sbjct: 74 ATMAKQLGERNLNNVSSQQVRTRAESYAVGQASSVLQQQA----QELLSPLGNAKLSLVM 129 Query: 119 DNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTF 178 ++G F+GS G F PL D + LT+SQLGL QQ G + N G+GQRW G+WL+GYNT Sbjct: 130 SDQGDFSGSSGQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAGDWLLGYNTV 189 Query: 179 YDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMR 236 D+ + + RA GAEAWG++LR SAN+Y P +A +Q + R A GYD+T + Sbjct: 190 LDSDFERHHNRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDITTQGY 249 Query: 237 MPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESG 296 +PFY+ + S+S EQY+G+ VDLF SG ++P A+ LG+NYTPVPLVTV A HK GE G Sbjct: 250 LPFYRQIGGSLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLVTVKALHKMGEGG 309 Query: 297 ENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVF 356 +Q+ + L LNYR GVPL KQ+S VA+++SLRGSRYDN +R N+P + ++QRKTL VF Sbjct: 310 VSQDQVELALNYRLGVPLVKQISPEYVAQAKSLRGSRYDNIERKNVPVMAFKQRKTLQVF 369 Query: 357 LATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ 416 LATPPW L+PGET+PL L+I++ I ++ WQGDTQ LSLTP +N GW+LI+P W Sbjct: 370 LATPPWRLQPGETLPLVLEIKTTNKITRVSWQGDTQALSLTPSQNSNDPHGWSLIVPQWD 429 Query: 417 NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 + A+N + LSV +ED++ Q V+SN I L + P S E Sbjct: 430 DSPDAANRYHLSVTLEDDKQQLVTSNWIQLQVTPPLTVSSEIE 472 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 275 bits (702), Expect = 4e-72, Method: Compositional matrix adjust. Identities = 149/411 (36%), Positives = 233/411 (56%), Gaps = 7/411 (1%) Query: 43 LPDLGM----APENHDGEKHFAEIVKDFGETSMNDNGLD-TGEQAKAFALGKVRDALSQQ 97 LP+LGM P GE+ A G + N+ D QA+++A G+ + + Sbjct: 104 LPELGMGNDPVPLVSSGEQKTAAAAHAVGAQNWNNMTSDQMKNQAESWAKGQAKAQVVDP 163 Query: 98 VNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLV 157 + Q + L +G A V++ VD+ G + S S F P +ND + +SQ+G+ +QDN ++ Sbjct: 164 LRQQAQELLGKFGKAQVNLAVDDNGSLSKSAFSLFSPWYENDAMVAFSQVGVHRQDNRMI 223 Query: 158 SNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ 217 N+G G R+ +G+WL G NTF D + N R G G E W + L+L++N+Y P + W + Sbjct: 224 GNLGAGVRFDQGDWLFGANTFLDQDISRNHSRLGLGLEWWADNLKLASNYYHPLSGWKDS 283 Query: 218 TATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLG 275 + +R ARG+D+ A+ +P YQ L S EQY+GD V LF +P A+++G Sbjct: 284 KDFDDYLERPARGFDVHAQGYLPAYQQLGASAVYEQYYGDEVALFGKDNLQKDPHAVTVG 343 Query: 276 LNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYD 335 ++YTP PL T+ HK G+ G+N LGL ++Y+ G L+KQL G VA +SL+GSRYD Sbjct: 344 VDYTPFPLATLKVSHKMGKDGKNNTELGLQVSYQIGTALEKQLDPGNVAAMRSLKGSRYD 403 Query: 336 NPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILS 395 RN LEY+++ L++ LA P L G+ ++ +RS+Y I + W GD L Sbjct: 404 LVDRNYDIVLEYKEKAVLSLDLAAVPMTLLEGDVYMMQPLVRSKYRITSVSWHGDAVPLL 463 Query: 396 LTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITL 446 L P A AN+ +GW + +P W GA+N + LS+ + D +G + +SN++ + Sbjct: 464 LVPTAGANNPQGWQITLPAWDATPGATNLYTLSISIVDEKGHQATSNDVEI 514 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 271 bits (693), Expect = 4e-71, Method: Compositional matrix adjust. Identities = 139/198 (70%), Positives = 157/198 (79%), Gaps = 8/198 (4%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 +SR V R LLLL A GT AQ A +PFD N LPDLGM PE+H+GEKHFA Sbjct: 50 LSRIVFRSFSLSLLLLAASGTIRAQ------AQDPFDQNR--LPDLGMMPESHEGEKHFA 101 Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 E+ K F E SM +N LDTGEQA+ FA G+VRD +S+QVNQ +ESWLS WG+ASVD+ VDN Sbjct: 102 EMAKAFSEASMKNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAWGSASVDINVDN 161 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 EGHF GSRGSWF+PLQD RYLTWSQLGLTQQ +GLVSNVG+GQRWA+ WL+GYNTFYD Sbjct: 162 EGHFNGSRGSWFIPLQDKQRYLTWSQLGLTQQTDGLVSNVGIGQRWAQDGWLLGYNTFYD 221 Query: 181 NLLDENLQRAGFGAEAWG 198 NLLDENLQRAGFGAEAWG Sbjct: 222 NLLDENLQRAGFGAEAWG 239 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 254 bits (650), Expect = 4e-66, Method: Compositional matrix adjust. Identities = 151/415 (36%), Positives = 229/415 (55%), Gaps = 13/415 (3%) Query: 43 LPDLGMAPENHDG----EKHFAEIVKDFGETSMNDNGLDTGE-QAKAFALGKVRDALSQQ 97 LPD+ + E ++ FA+ K+ G N D + + +A K+ L QQ Sbjct: 20 LPDMAIMAETSGAKPISDQQFADWGKNLGGQDWNTLNRDKAQSKTTQWAKEKIISPLQQQ 79 Query: 98 VNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLV 157 + L +G A V++ +DN+G+ S S F P D+++YL +SQ+ + QDN + Sbjct: 80 A----QDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYLLFSQINIHHQDNRKI 135 Query: 158 SNVGVGQR--WARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH 215 N G+G R N L+GYN F D+ RAG GAEA +YL+ SAN+Y P + W Sbjct: 136 GNFGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADYLKFSANYYHPLSHWK 195 Query: 216 EQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALS 273 + + +R A+GYDL ++ +P Y L S E YFGD V LF +P AL+ Sbjct: 196 DSPDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVALFGKSHRQKDPRALT 255 Query: 274 LGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSR 333 LG++YTPVPLVT+ A+HK G+ G+ + + Y+FG PL QL V + +SL+GSR Sbjct: 256 LGIDYTPVPLVTLGAKHKYGQQGKKDTQIDVAFRYQFGSPLSAQLDPDNVNQLRSLKGSR 315 Query: 334 YDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQI 393 YD RNN LEY++++ L LA P L GE+ L+ ++S+Y I LIW GD Sbjct: 316 YDLVDRNNDIVLEYKEKQVLFADLAAVPDSLMEGESYILRPLVKSKYPIIDLIWLGDLLP 375 Query: 394 LSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 L L A +++ +GW + +P W + GASN ++L++ +ED + RV++N I + + Sbjct: 376 LQLLATAGSHNPQGWQITLPAWSSVAGASNRYQLALSLEDQKNHRVTTNTIEIQV 430 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 228 bits (582), Expect = 3e-58, Method: Compositional matrix adjust. Identities = 138/390 (35%), Positives = 209/390 (53%), Gaps = 25/390 (6%) Query: 67 GETSMNDNG--LDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 G T + D+ L A++ A+G+ DA QH WLS +G A V + +DN Sbjct: 63 GATVLADDNTPLAAASMARSVAVGEANDAA-----QH---WLSQFGTARVQLNLDNNLSL 114 Query: 125 TGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLD 184 GS +PL D+ + L +SQ GL D+ N+G G R + NW+ G N F+D + Sbjct: 115 KGSAFDMLLPLYDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQDNWMYGANVFFDRDIT 174 Query: 185 ENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQH 242 R GFGAEAW +YL+LSAN Y WH+ A +R A GYDL +P Y Sbjct: 175 GKNNRIGFGAEAWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGYDLRVEAYLPAYPQ 234 Query: 243 LNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNL 302 + T++ EQY G+ V LF NP A + G+NYTP+PL+T+ A+ + G+ G N N+ Sbjct: 235 IGTNLKYEQYKGNEVALFGKDDRQKNPYAFTAGINYTPIPLITIGAEQRAGKGGRNDTNI 294 Query: 303 GLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPW 362 + LNYR G P + Q+ VA S++L GSRYD +RNN LEY+++ + + L P Sbjct: 295 SIQLNYRLGEPWQSQIDPSAVAASRTLAGSRYDLVERNNNIVLEYQKQDLIQLVL---PN 351 Query: 363 DLKPG--ETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQA-NSAEGWTLIMPDWQNGE 419 + E + ++ Q+ ++YG++++ W DT ++ G S++ ++ +P + G Sbjct: 352 QMTGSAFEIIKVEAQVTAKYGLKRIDW--DTAVIVAAGGVVTQTSSQNISIKLPPYTAG- 408 Query: 420 GASNHWRLSVVVEDNQGQRV--SSNEITLT 447 SN + LS V DNQG S+ +IT+T Sbjct: 409 --SNVYMLSAVAYDNQGNTSNHSTTQITVT 436 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 223 bits (567), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 128/379 (33%), Positives = 202/379 (53%), Gaps = 8/379 (2%) Query: 90 VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGL 149 VR A S + N + WLS +G A V + ++++ H GS +PL DN++ + ++QLG Sbjct: 168 VRSAASNEFNNSAQQWLSQFGTARVQLNINDDFHLDGSAADVLIPLYDNEKSILFTQLGA 227 Query: 150 TQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQ 209 +D+ N+G G R +GNW+ G NTF+DN L +R G GAEAW +YL+LSAN Y Sbjct: 228 RNKDSRNTVNMGAGVRTFQGNWMYGANTFFDNDLTGKNRRIGVGAEAWTDYLKLSANNYF 287 Query: 210 PFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 WH+ +R A GYDL A +P Y L E+Y GD V LF Sbjct: 288 GITDWHQSRDFIDYNERPANGYDLRAEAYLPSYPQLGGKAMYEKYRGDDVALFGKDNRQK 347 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 NP A++ G+NYTP+PLVT+ A+H+ G+ G+N +N+ LNYR G + + VA S+ Sbjct: 348 NPHAITAGVNYTPIPLVTIGAEHRAGKGGQNDSNINFQLNYRLGETWQSHIDPSAVAASR 407 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLK--PGETVPLKLQIRSRYGIRQL 385 +L GSRYD +RNN L+Y+++ + + L P L P + + Q+ + +G+ ++ Sbjct: 408 TLAGSRYDLVERNNHIVLDYQKQNLVRLSL---PDSLAGDPFSQLSVTAQVTATHGLERI 464 Query: 386 IWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEIT 445 WQ ++++ + S G + +P++Q N + L+ + D QG S + Sbjct: 465 DWQ-SAELMAAGGVLKQTSKNGLEITLPEYQMNRTGGNSYILNAIAYDTQGNASSQASML 523 Query: 446 LTLVEPFDALSNDELRWEP 464 +T+ ++N L P Sbjct: 524 ITVNAQKINIANSTLVAVP 542 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 219 bits (557), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 120/382 (31%), Positives = 205/382 (53%), Gaps = 6/382 (1%) Query: 72 NDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSW 131 N N +AK + R+ L Q+V+++ +G +++ VDN+G F SR Sbjct: 163 NINREKIKSEAKFYIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLL 222 Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLV--GYNTFYDNLLDENLQR 189 P N+ ++ +SQLG Q + + ++G+GQR+ + + GYN F D LD+ +R Sbjct: 223 LTPWYKNNSHVLFSQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKR 281 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSV 247 G EA Y +LS N+Y P W + ++ +R A G+D+ + +P Y L + Sbjct: 282 MSIGTEAASNYFKLSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKM 341 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 EQYFG V LFN NP A+S+G++Y P PL ++ HK G++ + LGL LN Sbjct: 342 KYEQYFGKEVALFNKTKRQKNPKAVSIGIDYRPFPLASIYVDHKLGQNHHRETKLGLTLN 401 Query: 308 YRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPG 367 Y+FG PL QL + E+++L+ +R RN +EY++++ L+V L ++ G Sbjct: 402 YQFGTPLSSQLDPNNLNEARNLKQNRLAPVDRNYNIVMEYKEKQLLSVDLPAMDKNILEG 461 Query: 368 ETVPLKLQIRSRYGIRQLIWQGDT-QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWR 426 + ++ I+++Y I+ + W GD Q+ + A NS GW +I+P+W + + A N +R Sbjct: 462 DIYVIRPLIKNKYPIKTVSWLGDVSQLSLSSSSADKNSPVGWKIILPEWNSEKDAKNTYR 521 Query: 427 LSVVVEDNQGQRVSSNEITLTL 448 L++ +ED +G + SN + + + Sbjct: 522 LAIQIEDTKGHQAISNYMDIVV 543 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 215 bits (548), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 124/366 (33%), Positives = 202/366 (55%), Gaps = 8/366 (2%) Query: 90 VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGL 149 +R A + + N + WLS +G A V + V+++ GS VP+ DN + + ++QLG Sbjct: 138 IRSAANNEFNSSAQQWLSQFGTARVQMNVNDDFKLDGSAVDVLVPIYDNQKSILFTQLGA 197 Query: 150 TQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQ 209 +DN N+G G R + NW+ G NTF+DN + +R G GAEAW +YL+LSAN Y Sbjct: 198 RNKDNRNTVNIGAGVRTFQNNWMYGVNTFFDNDMTGKNRRVGVGAEAWTDYLKLSANSYI 257 Query: 210 PFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 + WH+ A +R A GYD+ A +P + L + E+Y G+ V LF Sbjct: 258 GTSDWHQSRDFADYNERPANGYDVRAEAYLPSHPQLGGKLMYEKYRGEEVALFGKDNRQK 317 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 NP A++ G+NYTP+PL+TV A+H+ G+ +N +++ NYR G + ++ VA ++ Sbjct: 318 NPHAVTAGVNYTPIPLLTVGAEHRAGKGSKNDSSINFQFNYRLGESWQSHINPSAVAATR 377 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 +L GSRYD +RNN L+Y++++ + + L + K G+ + Q+ S+YG+ ++ W Sbjct: 378 TLAGSRYDLVERNNNIVLDYQKQELIRLSLP-ERVEGKAGDIATVNAQVTSKYGLERIDW 436 Query: 388 QGDTQILSLTPGAQAN-SAEGWTLIMPDWQNGEGAS-NHWRLSVVVEDNQGQRVSSNEIT 445 D+ L G + S+ ++ +P +Q G + N + LS + D QG R +S+ T Sbjct: 437 --DSAALIAAGGTLSKGSSNSISITLPPYQASVGNTPNSYTLSAIAFDTQGNRSNSSS-T 493 Query: 446 LTLVEP 451 L V P Sbjct: 494 LINVSP 499 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. Identities = 134/419 (31%), Positives = 215/419 (51%), Gaps = 29/419 (6%) Query: 34 NPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDA 93 NP +NN + DL A G+ NDN D A R A Sbjct: 148 NPNENNKKDVDDL------------LARNAMGAGKLLSNDNTSDA-------ASNMARSA 188 Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 ++ ++N + WL+ +G A V + VD++ S VPL+D++ L ++QLG+ +D Sbjct: 189 VTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLLVPLKDSESSLLFTQLGVRNKD 248 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 + N+G G R +G+W+ G NTF+DN L +R G GAE +YL+ SAN Y Sbjct: 249 SRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLKFSANTYFGLTG 308 Query: 214 WHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 WH+ ++ ++R A G+D+ +P Y L + E+Y GD V LF +P A Sbjct: 309 WHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALFGKDDRQKDPHA 368 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 ++LG+NYTPVPLVT+ A+H++G+ N ++ + LNYR G P Q+ VA +++L G Sbjct: 369 VTLGVNYTPVPLVTIGAEHREGKGNNNNTSVNVQLNYRMGQPWNDQIDQSAVAANRTLAG 428 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDT 391 SRYD +RNN L+Y++++ + + L G + L Q+R++YG ++ W D Sbjct: 429 SRYDLVERNNNIVLDYKKQELIHLVLPD-RISGSGGGAITLTAQVRAKYGFSRIEW--DA 485 Query: 392 QILSLTPGAQAN-SAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQG----QRVSSNEIT 445 L G+ + + ++ +P +Q+ SN +S V D QG + V+S E+T Sbjct: 486 TPLENAGGSTSPLTQSSLSVTLPFYQHILRTSNTHTISAVAYDAQGNASNRAVTSIEVT 544 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 211 bits (538), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 128/361 (35%), Positives = 189/361 (52%), Gaps = 11/361 (3%) Query: 83 KAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPL-QDNDRY 141 K A + + L+ Q+ + + WLS +G A +++ VD+ G S VP D D + Sbjct: 128 KKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHW 187 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 L +SQ G +D+ N+G+G R +W+ G NTFYDN L N R G E W YL Sbjct: 188 LIYSQYGYRHKDSRDTVNLGIGTRLFINDWMYGANTFYDNDLTGNNSRFSLGGELWTNYL 247 Query: 202 RLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 ++SAN Y + WH +R A GYDL A M +P L + EQYFGD V L Sbjct: 248 KMSANAYFRLSDWHNSRDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQYFGDNVAL 307 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F + +P A ++G+NYTP+PL+T +K G+ G++ LN+NYRFGVPL +QLS Sbjct: 308 FGTNNRQKDPYAATIGVNYTPIPLITAGVDYKLGKEGKSDGIFSLNMNYRFGVPLSEQLS 367 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEY-RQRKTLTVFLATPPWDLKPGETVPLKLQIRS 378 V +SL GSRYD +RNN L Y +++K + + GE P +QI+S Sbjct: 368 PENVGSLRSLAGSRYDLVERNNNIILNYLKKQKHFRLLVPVIEIIGYGGEIKP--IQIQS 425 Query: 379 RYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR 438 ++ +IW D L G + G+T+ +P++Q N + ++ +D+Q QR Sbjct: 426 DTPLKNIIW--DMPELFQKNGGIIKNTNGYTIQLPEYQ--PDGKNDYTITGTSKDDQ-QR 480 Query: 439 V 439 V Sbjct: 481 V 481 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 211 bits (538), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 138/454 (30%), Positives = 218/454 (48%), Gaps = 45/454 (9%) Query: 40 NDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLD-----TGEQAKAFALGKVRDAL 94 N+ LPDLG D + + + +K+ G + ++ T E K+ A ++ + Sbjct: 28 NESLPDLGSQAAQQDEQTNKGKSLKERGADYVINSATQGFENLTPEALKSQARSYLQSQI 87 Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 + ++E LSP+G ++ + G GS +FVP DN + +SQ ++++ Sbjct: 88 TSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSIDYFVPWYDNQTTVYFSQFSAQRKED 147 Query: 155 GLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW 214 + N+G+G R+ +L+G N FYD +R G GAEAW +YL+ S N+Y P + W Sbjct: 148 RTIGNIGLGVRYNFDKYLLGGNIFYDYDFTRGHRRLGLGAEAWTDYLKFSGNYYHPLSDW 207 Query: 215 H--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 E E+R ARG+D+ A +P Y L + EQY+G+ V LF + + +P A+ Sbjct: 208 KDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKIVFEQYYGNEVALFGTDSLEKDPFAV 267 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS 332 +LG+ Y PVPL+ V K G ++ LNY+FGVPLK QL +V+ + SL GS Sbjct: 268 TLGVKYQPVPLIVVGTDFKAGTGDNTDLSVNATLNYQFGVPLKDQLDPDKVSAAHSLMGS 327 Query: 333 RYDNPQRNNLPTLEYRQRKTLTVFL--------ATPPWDLK--PGETVPLKLQ------- 375 R+D +RNN LEY+++ L V L P +K P E + L+ Sbjct: 328 RHDFVERNNFIVLEYKEKDPLYVTLWLKADVTNEHPECVIKDTPEEAIGLEKCKWTINAL 387 Query: 376 IRSRYGIRQLIWQGDTQILS-----------LTPGAQANS-AEG----WTLIMPDWQNGE 419 I Y I WQ S + P + N+ EG W L++P WQ Sbjct: 388 INHHYKIVAASWQAKNNAASWQAKNNAARTLVMPVIKENTLTEGNNNHWNLVLPAWQYSS 447 Query: 420 GAS-----NHWRLSVVVEDNQGQRVSSNEITLTL 448 + N WR+ + +ED +G R +S + +T+ Sbjct: 448 DQAEQEKLNTWRVRLALEDEKGNRQNSGVVEITV 481 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 207 bits (527), Expect = 6e-52, Method: Compositional matrix adjust. Identities = 127/360 (35%), Positives = 185/360 (51%), Gaps = 14/360 (3%) Query: 83 KAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPL-QDNDRY 141 K A + + L+ Q+ + + WLS +G A +++ VD+ G S VP D D + Sbjct: 121 KKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHW 180 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 L +SQ G +D+ N+G+G R NW+ G NTFYDN L N R G E W YL Sbjct: 181 LVYSQYGYRHKDSRDTVNLGIGTRLFINNWMYGANTFYDNDLTGNNSRFSLGGELWTNYL 240 Query: 202 RLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 ++SAN Y + WH +R A GYDL A M +P L + EQYFGD V L Sbjct: 241 KMSANAYFRLSDWHNARDLVNYYERPANGYDLIADMYLPSMPSLGAKIKYEQYFGDNVAL 300 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F +P A ++G+NYTP+PL+T +K G+ G++ N+NYRFGVPL +QLS Sbjct: 301 FGKNKRQKDPYAATIGVNYTPIPLITAGIDYKLGKEGKSDGIFSFNVNYRFGVPLSEQLS 360 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKP--GETVPLKLQIR 377 V+ +SL GSRYD +RNN L Y +++ L P ++ GE P +QI+ Sbjct: 361 PENVSSLRSLAGSRYDLVERNNNIILNYLKKQQHFRLLV-PVIEISSYGGEVKP--IQIQ 417 Query: 378 SRYGIRQLIWQGDTQILSLTPGAQAN--SAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQ 435 S + + W D L G N S G+T+ +P++Q N + ++ +D+Q Sbjct: 418 SDTPFKNVTW--DIPELFQKNGGMINIESTHGYTIQLPEYQ--PDGKNDYTITGTSKDDQ 473 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 207 bits (526), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 114/374 (30%), Positives = 200/374 (53%), Gaps = 9/374 (2%) Query: 80 EQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND 139 E+ KAFA R+ L+ + + + W + +G++ + ++ D + S+ +P + + Sbjct: 161 EKTKAFA----RELLTTAASSYAQDWFNRFGSSQIHLEADKKFSLKNSQIDLLMPWYETE 216 Query: 140 RYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGE 199 L +SQ L +++ + +N+G+G RW ++G NTF+D + R G G E + Sbjct: 217 DNLIFSQTSLHRKEGRIETNLGLGARWYGEGQMIGGNTFFDYDISRKHSRLGLGVEYRRD 276 Query: 200 YLRLSANFYQPFAAWHEQ--TATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRV 257 +L+LSAN Y + W A R + G+D+ A +P Y H+ ++ EQY+GD V Sbjct: 277 FLKLSANSYHRLSGWRSSRDLADHSARPSNGWDVRAEGWLPSYPHIGGKLTYEQYYGDSV 336 Query: 258 DLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQ 317 LF + NP +++ GLNYTP+PLVT A+H+QG++ + + GL LNY+FG K+ Sbjct: 337 ALFGTKNLQQNPYSITAGLNYTPIPLVTFNAEHRQGKASKQDSRFGLQLNYQFGKTWKQH 396 Query: 318 LSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIR 377 L G V +SL G+RYD RNN LEY++ + + +A GE +PL + Sbjct: 397 LDPGSVTTFRSLMGNRYDFVSRNNHIVLEYKKNDVIQLNIANSITGYA-GEKIPLSFTVA 455 Query: 378 SRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQ 437 S+YG+ L W +T + + Q N ++L++P ++N ++N++ +S V D +G Sbjct: 456 SKYGLSHLKWNAETLVAAGGHIVQENGK--YSLVLPAYRNDAKSANNYTISAVAIDKKGN 513 Query: 438 RVSSNEITLTLVEP 451 + + + + +P Sbjct: 514 ISPNTMLRVVVTQP 527 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 206 bits (524), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 129/398 (32%), Positives = 204/398 (51%), Gaps = 17/398 (4%) Query: 68 ETSMNDNGLDTGEQAKAFALGKVRDA----LSQQVNQHVESWLSPWGNASVDVKVDNEGH 123 ET + + TG A+ A G+ D + VNQ ++ WL+ +G A V++ D Sbjct: 106 ETEAVNKMISTG--ARLAASGRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKNFS 163 Query: 124 FTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLL 183 S W P D+ +L +SQLG+ +D+ N+GVG R WL G NTFYDN L Sbjct: 164 LKESSLDWLAPWYDSASFLFFSQLGIRNKDSRNTLNLGVGIRTLENGWLYGLNTFYDNDL 223 Query: 184 DENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQ 241 + R G GAEAW +YL+L+AN Y WH + ++R A G DL A +P Sbjct: 224 TGHNHRIGLGAEAWTDYLQLAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPALP 283 Query: 242 HLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNN 301 L + EQY G+RV LF NP A++ G+NYTPVPL+TV + G+S +++ Sbjct: 284 QLGGKLMYEQYTGERVALFGKDNLQRNPYAVTAGINYTPVPLLTVGVDQRMGKSSKHETQ 343 Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPP 361 L +NYR G + QLS VA ++ L SRY+ RNN LEY++++ + + L+ Sbjct: 344 WNLQMNYRLGESFQSQLSPSAVAGTRLLAESRYNLVDRNNNIVLEYQKQQVVKLTLSPAT 403 Query: 362 WDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGA 421 PG+ + Q++ +R+++W D ++++ S + L++P ++ Sbjct: 404 ISGLPGQVYQVNAQVQGASAVREIVWS-DAELIAAGGTLTPLSTTQFNLVLPPYKRTAQV 462 Query: 422 S--------NHWRLSVVVEDNQGQRVSSNEITLTLVEP 451 S N + LS + D+QG R +S +++T+ +P Sbjct: 463 SRVTDDLTANFYSLSALAVDHQGNRSNSFTLSVTVQQP 500 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 202 bits (513), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 122/362 (33%), Positives = 192/362 (53%), Gaps = 9/362 (2%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ AK+ A R A++ +++ + WL +G A + +++ F S +PL DN Sbjct: 153 GDAAKS-AENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDDFEFDSSAIDVLIPLYDN 211 Query: 139 DRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 + L ++QLG +D+ N+G G R NW+ G NTF+DN + N +R G GAEAW Sbjct: 212 QKSLFFTQLGGRNKDSRNTINIGAGVRAFLTNWMYGANTFFDNDITGNNRRVGIGAEAWT 271 Query: 199 EYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 +YL+LSAN Y WH+ A +R A GYDL A +P Y L + EQY GD Sbjct: 272 DYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLPAYPQLGGKLMYEQYNGDE 331 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LF +P A+++G+NYTPV LVTV H+ G+S ++ +++ L NYR + Sbjct: 332 VALFGKDKRQKDPHAITVGINYTPVSLVTVGIDHRAGKSSKSDSSINLQFNYRLSNSWQS 391 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDL--KPGETVPLKL 374 + VA +++L GSR D +RNN L+Y++++ L + L P L G+ L Sbjct: 392 HIDPSAVAVTRTLAGSRQDLVERNNNIVLDYQKQELLRLSL---PEQLTGSAGDNAILTA 448 Query: 375 QIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDN 434 QI S+Y ++++ W ++ +++ S + T+ P +Q G SN + LS + D Sbjct: 449 QIESKYEVQRVEWDANS-LIAAGGNISTTSQKDVTITFPPYQYQVGVSNIYALSAIAYDV 507 Query: 435 QG 436 G Sbjct: 508 NG 509 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 199 bits (507), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 120/364 (32%), Positives = 190/364 (52%), Gaps = 13/364 (3%) Query: 77 DTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQ 136 D+ +QA + A G +A N+ ++ W + +G+A V + +D + GS+ +PL Sbjct: 89 DSAKQASSIARGTAANA----GNEALQKWFNQFGSAKVQLNLDEKLSLKGSQLDVLLPLT 144 Query: 137 DNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEA 196 D+ LT++QLG D+ + NVG+GQR ++GYN F D+ + R G GAE Sbjct: 145 DSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQMLGYNLFVDHDASYSHTRIGVGAEY 204 Query: 197 WGEYLRLSANFYQPFAAWHEQTATQ--EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFG 254 +++ L+AN Y + W ++++A G+DL + +P L + EQYFG Sbjct: 205 GRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSEAYLPTLPQLGGKLIYEQYFG 264 Query: 255 DRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPL 314 D V LF NP+A++LG+NYTP+PL TV HK G +G N L NY FG PL Sbjct: 265 DEVGLFGVDNRQKNPLAVTLGVNYTPIPLFTVGVDHKMGRAGMNDTRFNLGFNYAFGTPL 324 Query: 315 KKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPG--ETVPL 372 QL + VA +SL GSRY+ RNN ++YR++ +T+ L P + +T+PL Sbjct: 325 AHQLDSDAVAIKRSLMGSRYNLVDRNNQIVMKYRKQNRVTLEL---PARVSGAARQTMPL 381 Query: 373 KLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVE 432 ++ GI ++ W+ L+L G S W + +P + +G +N +R+S + Sbjct: 382 VANATAQQGIDRIEWEASA--LTLAGGKITGSGNNWQITLPSYLSGGEGNNTYRISAIAY 439 Query: 433 DNQG 436 D G Sbjct: 440 DTLG 443 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 198 bits (504), Expect = 3e-49, Method: Compositional matrix adjust. Identities = 117/375 (31%), Positives = 194/375 (51%), Gaps = 8/375 (2%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ A + A G A S +V Q WLS +G A + + VDN+ S+ +PL + Sbjct: 155 GDAAASIARGMATGAASTEVQQ----WLSQFGTARLQLDVDNKFSLKNSQLDLLIPLYEQ 210 Query: 139 DRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 L ++Q L + D+ +N+G+G RW +++G NTF D L + R G G E W Sbjct: 211 PDKLVFTQGSLHRTDDRTQTNLGMGMRWFNDGYMLGGNTFLDYDLSRDHARMGMGVEYWR 270 Query: 199 EYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 +YL++ AN Y W + A ++R A G+D++ +P L ++ EQY+G Sbjct: 271 DYLKIGANNYLRLTNWRDSKDFADYQERPANGWDMSLEGWVPALPQLGGNLKYEQYYGKE 330 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LF +P A+++G+NYTP PL+T +A +QG++G+N LG+ LN + G P + Sbjct: 331 VALFGKDNRQKDPHAITVGVNYTPFPLLTFSADQRQGKAGQNDTRLGVQLNIQLGTPWQH 390 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376 QL V ++L GSRYD RNN LEYR+++ + ++ A GE L + I Sbjct: 391 QLDTSAVGAMRTLAGSRYDLVDRNNNIVLEYRKKEVIHLYTADHLAGY-AGEQKSLNVSI 449 Query: 377 RSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQG 436 ++YG+ ++ W ++L+ S + +++++PD+ N + +S V D G Sbjct: 450 NTKYGLERIDWSA-PELLAAGGKIVQESIDNYSIVLPDYNFDSANGNVYEISGVAIDTHG 508 Query: 437 QRVSSNEITLTLVEP 451 + TLT+ +P Sbjct: 509 NVSKKAKTTLTVTQP 523 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 197 bits (501), Expect = 7e-49, Method: Compositional matrix adjust. Identities = 134/429 (31%), Positives = 212/429 (49%), Gaps = 27/429 (6%) Query: 45 DLGMAPEN------HDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQV 98 +LG+APEN E++ A+ + L++G++A A L R Sbjct: 73 NLGLAPENTALTDTQTTERNLAKTATTSAQM------LNSGDKAAARQL---RGLAVGNA 123 Query: 99 NQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVS 158 NQ SWL+ +G A + VD+ G GS+ +P D + ++Q G+ + D + Sbjct: 124 NQAANSWLNNFGTARLQANVDDRGDLDGSQFDMLMPFYDTPSQMAFTQFGIRRIDKRTTA 183 Query: 159 NVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT 218 N+G+G R +W+VGYN F D + + R G GAE +YL+L+AN Y + W + Sbjct: 184 NLGIGIRHFIDDWMVGYNLFLDRDITRDHTRVGAGAEYARDYLKLAANGYLRLSDWRDSP 243 Query: 219 --ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGL 276 ++ +R A G+DL A +P L + EQYFG+ V LF NP A++ G+ Sbjct: 244 DFSSYSERPATGFDLRAEAYLPSLPQLGGKLMYEQYFGNDVGLFGKDNRQQNPAAITAGI 303 Query: 277 NYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDN 336 NYTP+PLVTV KQG +G + L +NY G P KQ+S V ++L+GSR D Sbjct: 304 NYTPIPLVTVGIDRKQGSAGNGETLFNLGVNYEVGTPWAKQISPDAVNARRTLQGSRNDL 363 Query: 337 PQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSL 396 +RNN LEY+++ + ++++ + ET L + + S+YG+R + Q D L+ Sbjct: 364 VERNNQIVLEYKKQDVINLYVSN-NVSGRAAETKQLVVSVTSKYGLRNI--QFDQGALAA 420 Query: 397 TPGAQANSAEG-WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDAL 455 G + L +P +G N+W +S + D +G +SN +TLV+ D Sbjct: 421 AGGKIIPQGPSQFALQLPPQPSG---GNNWTISAIASDVKGN--TSNR-AVTLVQLQDTP 474 Query: 456 SNDELRWEP 464 + W P Sbjct: 475 ATISGTWTP 483 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 194 bits (494), Expect = 5e-48, Method: Compositional matrix adjust. Identities = 117/380 (30%), Positives = 192/380 (50%), Gaps = 16/380 (4%) Query: 86 ALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWS 145 A G R + N+ ++ WLS +G A V + +D + S WF+P+ D+ ++ Sbjct: 190 AAGMARSMATNAANEEIQQWLSKYGTARVQLNLDKNFSLSESALDWFIPVWDSANLTAFT 249 Query: 146 QLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSA 205 QLG +D N+GVG R W++G N FYD+ L + R G GAEAW +YL+LS Sbjct: 250 QLGARNKDRRNTINLGVGARTLLDRWMLGVNMFYDHDLTGHNSRLGIGAEAWTDYLQLST 309 Query: 206 NFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 N Y + WH+ A ++R A G+D+ A +P L + EQY G+ V LF Sbjct: 310 NGYMRLSNWHQSRDFADYDERAANGFDIRANAWLPALPQLGGKLVYEQYIGENVALFGKE 369 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV 323 NP AL+ G+NYTP PL+TV + G++G N + L+YR G+ + Q+ V Sbjct: 370 NLQRNPYALTAGVNYTPFPLLTVGVDERLGKAGRNDTQFSIQLSYRPGLSWQSQIDPSSV 429 Query: 324 AESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIR 383 A + + SRY+ RNN LEY++++ + + L+ + G + ++S+Y + Sbjct: 430 AAIRQIAESRYNLVDRNNDIVLEYKKQEVIKLALSHHAINDLAGAVYTVSANLKSKYALD 489 Query: 384 QLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ------------NGEGASNHWRLSVVV 431 Q+ WQ D +++ ++L++P ++ E A+N ++L V Sbjct: 490 QVSWQ-DGGLVAAGGQLTVIDKNHFSLMLPPYRPAQAKSDAHQTSTAEIAANTYQLIAVA 548 Query: 432 EDNQGQRVSSNEITLTLVEP 451 DNQG + S++E +V+P Sbjct: 549 FDNQGNQ-SNSETLRVVVQP 567 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 194 bits (494), Expect = 5e-48, Method: Compositional matrix adjust. Identities = 117/347 (33%), Positives = 185/347 (53%), Gaps = 15/347 (4%) Query: 97 QVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL 156 Q + V WL +GNA + + VD+ S + P D +++ +SQ L + D+ Sbjct: 89 QATKEVVEWLQKYGNARIQLNVDDAFSLKDSAFDFLYPWIDKKQHVLFSQTSLHRTDDRT 148 Query: 157 VSNVGVGQRW-ARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW- 214 +N+G+G R+ N ++G N FYD L + R G G E W +YLR AN Y + W Sbjct: 149 QTNIGMGYRYFTADNSMLGANLFYDYDLSRHHARMGAGVEYWRDYLRAGANAYLRLSKWK 208 Query: 215 --HEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 H+ QE R A G+D+ + +P Y L S+ E+Y+G V LF S NP A Sbjct: 209 DSHDLDDYQE-RPADGWDIYTQGWLPSYPQLGASLKYEKYYGKNVGLFGSDHLQENPYAF 267 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS 332 + G++YTPVPLVT++A+HKQG+S + + G+ +NYR G+PL KQL + VA + ++ Sbjct: 268 TGGISYTPVPLVTLSAEHKQGQSNTHDSRFGIEINYRPGIPLAKQLDSDNVALMREVQHG 327 Query: 333 RYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLK--PGETVPLKLQI-RSRYGIRQLIWQG 389 RYD +RNN LEYR++ L + L P ++ G +P+ + + +S +GI+ + W Sbjct: 328 RYDFVERNNNIVLEYRKKSVLKIRL---PESVQGEGGAVIPVTISLDKSHWGIQSVEW-- 382 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQG 436 + + G + S W L +P + G +NHW++ D +G Sbjct: 383 NDSAFTAAGGRISGSGTSWQLTLPAYT--PGGTNHWQIGATARDVKG 427 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 194 bits (493), Expect = 6e-48, Method: Compositional matrix adjust. Identities = 120/380 (31%), Positives = 193/380 (50%), Gaps = 16/380 (4%) Query: 74 NGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFV 133 N L + ++ AF+ + L + VE WL G A V ++ D++ F+GS F+ Sbjct: 133 NSLSSEDRVGAFSR-LAKGMLLSSTAKTVEEWLGHIGQAQVKLQADDKNDFSGSEVDLFI 191 Query: 134 PLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENL-QRAGF 192 PL D L +SQ G + D + N+G+GQR +W+ GYN F+D + N +R GF Sbjct: 192 PLYDQPEKLAFSQFGFRRIDQRNIMNIGLGQRHYVSDWMFGYNIFFDQQISGNAHRRVGF 251 Query: 193 GAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLE 250 G E +Y++LSAN Y W T ++ +R A GYD+ +P Y L + E Sbjct: 252 GGELARDYVKLSANSYHRLGGWKNSTRLEDYDERAANGYDIRTEAYLPHYPQLGGKLMYE 311 Query: 251 QYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRF 310 QYFGD V LF NP AL+ G++YTP+PLV++ H G G+ + + + +NY Sbjct: 312 QYFGDEVALFGINERQKNPSALTAGVSYTPIPLVSLGLDHTIGNGGKKKTGVNVAVNYEI 371 Query: 311 GVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDL--KPGE 368 P +KQ+ V +++L GSR D RNN LEYR+++ +T+ L P + K Sbjct: 372 NTPWQKQIDPAAVQATRTLAGSRMDLVDRNNNIVLEYRKQQVVTLNL---PEKISGKEAL 428 Query: 369 TVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAE--GWTLIMPDWQNGEGASNHWR 426 +P+ +R+G+ ++ W I + G Q +S + + +P + +GA+N + Sbjct: 429 VLPINYTFNARHGLDRIEWDAADVIQA---GGQVSSQGNLAYHVALPPYI--DGAANAYV 483 Query: 427 LSVVVEDNQGQRVSSNEITL 446 LS D +G +S+ + Sbjct: 484 LSGRAVDKKGNYSTSSSTNI 503 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 193 bits (490), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 115/374 (30%), Positives = 198/374 (52%), Gaps = 8/374 (2%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ A + A G++ S QV Q WL+ +G A V ++ D S+ +P + Sbjct: 165 GDAALSMARGQISAVASGQVQQ----WLNQFGTARVQLEADEHFSLKNSQVDLLIPFYEQ 220 Query: 139 DRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 + L ++Q L + D+ +N+G G R+ ++++G N F D L R G G E W Sbjct: 221 NDELLFTQGSLHRTDDRTQANLGFGLRYFAPSYMLGGNIFGDYDLSHEHSRTGIGVEYWR 280 Query: 199 EYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 ++L+LSAN Y + W + +E +R A G+D+ A+ +P L ++ EQY+G Sbjct: 281 DFLKLSANGYLRLSDWRDSPNMKEYQERPANGWDIRAQAWLPSLPQLGGKLTYEQYYGKG 340 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LF NP A++ G+N+TP PL+ + A+H+QG SG+N + + +YR G+P ++ Sbjct: 341 VALFGKENLQQNPRAITAGVNFTPFPLLMLGAEHRQGASGKNDKRISADFSYRLGLPWQQ 400 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376 Q++ VA +SL GSRYD +RNN L+YR+++T+ + GE L + + Sbjct: 401 QINPQAVATMRSLAGSRYDLVERNNHILLQYRKKETVRLHTVDRVTGYA-GEKKSLGVSV 459 Query: 377 RSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQG 436 S+YG+ ++ W + +L+ A W++I+P++Q G A N W +S V D +G Sbjct: 460 NSQYGLERIDWSA-SSLLACGGQLVREDAGNWSVILPEYQPGAQAVNTWTVSGVAVDKKG 518 Query: 437 QRVSSNEITLTLVE 450 + + +T+ + Sbjct: 519 NVSARADTQVTVAQ 532 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 191 bits (486), Expect = 4e-47, Method: Compositional matrix adjust. Identities = 144/453 (31%), Positives = 224/453 (49%), Gaps = 46/453 (10%) Query: 10 PFYLLLLVAGGTANAQ---STFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDF 66 P L+ L + ANA+ S+ E++ NP D N A+ Sbjct: 40 PSSLIYLSSVFNANAEEITSSAEKEQGNPSDQN----------------ASSVAQTAVQA 83 Query: 67 GETSMNDNGLDTGEQAKAFALGK-VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFT 125 G +DN D ALG V A++ + + WLS +G A V++ D + Sbjct: 84 GSLLSSDNASD--------ALGSAVVSAVTGKAASSAQEWLSQFGTARVNISTDEHFTLS 135 Query: 126 GSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDE 185 S VPL + + L ++QLG + D+ + N G G R W+ G N FYD + Sbjct: 136 DSELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHFNDGWMWGTNVFYDRQVSG 195 Query: 186 NL-QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQH 242 N QR G E +YL +SAN Y + W ++ Q+ +R+A G+D+ A +P Y Sbjct: 196 NQHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFDIRATGYLPAYPQ 255 Query: 243 LNTSVSLEQYFGDRVDLF--NSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 L ++ EQYFGD V LF + +P A+++GLNYTPVPLVT+ K G+SGEN Sbjct: 256 LGANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLVTMGLNQKMGKSGENDT 315 Query: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP 360 + L L + GVPL QL +VA ++L+G R D RNN LEYR+++ +++ L Sbjct: 316 QVNLGLTWTPGVPLSAQLDPSQVALRRTLQGGRLDLVDRNNNIVLEYRKQELISLAL--- 372 Query: 361 PWDLKPGETV--PLKLQIRSRYGIRQLIWQGDTQIL---SLTPGAQANSAEGWTLIMPDW 415 P +L+ E P+ +++++YG+ ++ WQGD+ +TPG+ E + +P W Sbjct: 373 PAELEGAEQSKRPVTAKVKAKYGLDRIEWQGDSFFSHGGKITPGSN---PEQVVMTLPVW 429 Query: 416 QNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 G G SN + LS D +G ++ + +T+ Sbjct: 430 V-GSG-SNSYTLSATAWDKKGNASAAERVNVTV 460 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 190 bits (483), Expect = 8e-47, Method: Compositional matrix adjust. Identities = 116/363 (31%), Positives = 191/363 (52%), Gaps = 11/363 (3%) Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFT--GSRGSWFVPL-QDNDRYLTWSQL 147 +D L+ ++ E+WL+ +G +S V + + +F G +PL + ++ +SQL Sbjct: 133 QDQLNTLASEQAETWLNGFGGSS-RVAISSTQNFAKYNYAGDVLLPLWNSREDFMIFSQL 191 Query: 148 GLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANF 207 G+ D+ N+G+G R+ W++G N F+DN + +R G GAE + LRL+AN Sbjct: 192 GVRHADDRTTGNIGLGARYFGEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRLAANG 251 Query: 208 YQPFAAWHEQ--TATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTG 265 Y WH+ A ++R A G+D+ +P Y L V EQY+GD V L + G Sbjct: 252 YFKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALISRGRL 311 Query: 266 YHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAE 325 HNP A +LG+N+TP+PLV++ A H+ +GLNLN+ FG L LS V Sbjct: 312 QHNPSAATLGVNWTPIPLVSIDAGHRMSMQRGEDTTVGLNLNWNFGRSLDWHLSPDAVET 371 Query: 326 SQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQL 385 +SL GSRYD RNN ++YR++ +T LA ++ T L + + +++G+ ++ Sbjct: 372 QRSLAGSRYDLVSRNNEIVMDYREQTVITFSLANAIQGVE-STTHSLGVSVWAKHGLGKI 430 Query: 386 IWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEIT 445 +W D L G S L++P ++ +GA N + LS + DN+G+ ++ Sbjct: 431 VW--DDATLVNAGGKIVGSGANSVLVLPAYK--DGADNRYTLSAIAYDNKGKASPRAQVQ 486 Query: 446 LTL 448 +T+ Sbjct: 487 ITV 489 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 190 bits (482), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 119/370 (32%), Positives = 184/370 (49%), Gaps = 20/370 (5%) Query: 99 NQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVS 158 N + WLS +G A + + +D GS W +P D T++QLG +D+ Sbjct: 208 NDEIVKWLSKYGTAQLQLNIDKNFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTL 267 Query: 159 NVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ- 217 N+G+G R NWL G N FYD+ L R G G+EAW +YL+LS N Y + WH+ Sbjct: 268 NIGIGTRTLSNNWLFGVNAFYDHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSR 327 Query: 218 -TATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGL 276 A +R A G+D+ A MP L + EQYFGD V LF NP A ++G+ Sbjct: 328 DLADYNERPANGFDVRANAWMPTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGV 387 Query: 277 NYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDN 336 NYTP PL+T+ + G++ E+ + LNYR G + Q+ V S+ + SRY+ Sbjct: 388 NYTPFPLLTLGVDQRLGKNSEHDTQFNVQLNYRIGDDWRAQVDPSAVPHSRLISESRYNL 447 Query: 337 PQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSL 396 +RNN LEY+++ + + L + +PG + ++++YG++ ++WQ D + +S Sbjct: 448 VERNNNIVLEYQKQNIMHLSLPSDTLSGQPGSEHMISAILQTKYGLQDIVWQ-DAEFISA 506 Query: 397 TPGAQANSAEGWTLIMPDWQ--------------NGEGASNHWRLSVVVEDNQGQRVSSN 442 Q + L +P ++ E A+N + LS D +G + SN Sbjct: 507 GGKLQRQDKTHFNLTLPSYRYSATARRSGSHATAQAEIAANTYHLSATAFDTKGNQ--SN 564 Query: 443 EITLTL-VEP 451 I LT+ VEP Sbjct: 565 TINLTVTVEP 574 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 189 bits (481), Expect = 1e-46, Method: Compositional matrix adjust. Identities = 126/389 (32%), Positives = 206/389 (52%), Gaps = 25/389 (6%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ AK ALG + S Q+ ++WL +G A V+++ N +F GS + +P D+ Sbjct: 184 GDYAKDTALGIAGNQASSQL----QAWLQHYGTAEVNLQSGN--NFDGSSLDFLLPFYDS 237 Query: 139 DRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 ++ L + Q+G D+ +N+G GQR+ ++GYN F D + R G G E W Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297 Query: 199 EYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 +Y + S N Y + WHE ++ +R A G+D+ +P Y L + EQY+GD Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P + Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGE--TVPLKL 374 Q+ V E ++L GSRYD QRNN LEY+++ L++ + P D+ E T ++L Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI---PHDINGTERSTQKIQL 474 Query: 375 QIRSRYGIRQLIWQGDTQILSLTPGAQ---ANSAEGWTLIMPDWQNGEGASNHWRLSVVV 431 ++S+YG+ +++W D+ + S Q + SA+ + I+P + +G SN ++++ Sbjct: 475 IVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAYV--QGGSNVYKVTARA 531 Query: 432 EDNQGQRVSSNEITLTLVEPFDALSNDEL 460 D G SSN + LT+ LSN ++ Sbjct: 532 YDRNGN--SSNNVLLTIT----VLSNGQV 554 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 117/374 (31%), Positives = 190/374 (50%), Gaps = 18/374 (4%) Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 R ++ NQ V+ WL+ +G V+V D + S W +P D+ Y+ +SQLG+ Sbjct: 74 RSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKESSLDWLLPWYDSASYVFFSQLGIR 133 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 +D+ N+G G R + +W+ G+NT YDN + + R G GAEAW +YL+LSAN Y Sbjct: 134 NKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHNHRIGVGAEAWTDYLQLSANGYFR 193 Query: 211 FAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHN 268 WH+ A +R A G D+ + +P L + EQY G+RV LF N Sbjct: 194 LNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGGKLKYEQYRGERVALFGKDNLQSN 253 Query: 269 PVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQS 328 P A++ GL YTP+P +T+ + G+S +++ L ++YR G + Q S VA ++ Sbjct: 254 PYAVTTGLIYTPIPFITLGVDQRMGKSRQHEIQWNLQMDYRLGESFRSQFSPAVVAGTRL 313 Query: 329 LRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQ 388 L SRY+ +RN LEY+++ T+ + + PG+ + QI+S+ +++++W Sbjct: 314 LAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGLPGQVYSVSAQIQSQSALQRILWN 373 Query: 389 GDTQILSLTPGAQANSAEGWTLIMPDWQ-------------NGEGASNHWRLSVVVEDNQ 435 D Q ++ SA + +++P ++ E A N + LS DN Sbjct: 374 -DAQWVAAGGKLIPVSATDYNVVLPPYKPMAPASRTVGKTGESEAAVNTYTLSATAIDNH 432 Query: 436 GQRVSSNEITLTLV 449 G SSN TLT++ Sbjct: 433 GN--SSNPATLTVI 444 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 189 bits (480), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 111/357 (31%), Positives = 180/357 (50%), Gaps = 4/357 (1%) Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 +S + NQ +E WL+ +G+A V + D S +PL + L ++Q ++D Sbjct: 123 ISSKSNQKIEQWLNQFGHARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFAQTNYHRKD 182 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 N G+G R+ ++VG N FYD+ L + R G GAE W +Y +LS+N Y ++ Sbjct: 183 LRSQFNYGIGYRYFTEKFMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSSNHYHRLSS 242 Query: 214 WHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 W + +R A G+D+ P Y L T + EQY+G V LF NP Sbjct: 243 WRASNNILDYSERPANGWDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKDKRDKNPHT 302 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 +LG+NYTP+PLVT+ A+ + G NNL +NL+YR G L QL+ V ++L G Sbjct: 303 YTLGINYTPIPLVTLNAERRIGLHDRADNNLNINLSYRIGESLASQLNPDNVKAIRTLAG 362 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDT 391 SRYD RNN LEY+ ++TL + E L++Q++++Y + + W + Sbjct: 363 SRYDFVNRNNDMILEYK-KETLVFLSMVDSINGYAKEERDLQVQVKTKYPLANIEWSA-S 420 Query: 392 QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 ++ + + + +T+I+P +Q G N + +S V D G R + + T+ + Sbjct: 421 KLNAQGGQIKHHGGTHYTVILPQYQIGAIEKNSYIISAVAIDTHGNRSAPVQTTVIV 477 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 189 bits (480), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 124/428 (28%), Positives = 206/428 (48%), Gaps = 29/428 (6%) Query: 42 GLPDLGM-----APENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQ 96 LP LG+ A + EK F + + N+N + A+ R+ Sbjct: 91 ALPTLGIKETSQAKQVESAEKQFVQGATQIAQGLANNNATEA-------AINYARNRGEG 143 Query: 97 QVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL 156 +NQ + WL+ +G A V + + G +PL D L +SQ+G+ + Sbjct: 144 LLNQKISDWLNQYGKARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQIGIRANEQRS 198 Query: 157 VSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 +N+G+G R + NW+ G N+FYD + R G G E W YL+L+ N Y WH+ Sbjct: 199 TTNLGLGYRQYQQNWMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVNGYFRLTDWHQ 258 Query: 217 ----QTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTG---YHNP 269 + ++R A G+DL A +P Y HL EQYFGD V L ++ T NP Sbjct: 259 SFLHEMRDYDERPANGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHNPTAKDLKDNP 318 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 A++ GL+YTP PL+T+ Q QG+S N + +G+ YRFG+PL QL+ V +SL Sbjct: 319 SAVTFGLSYTPFPLLTLKTQVSQGDS--NDSLIGMEFAYRFGIPLAAQLNPDNVDLMRSL 376 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI-RSRYGIRQLIWQ 388 G+RYD RN ++YR+++ L + L + +T+ +K + +++YG+ +++W Sbjct: 377 AGNRYDFVDRNYNIVMQYRKQEILAISLPDSAM-AEAAQTIAIKATVQKAKYGLNKILWS 435 Query: 389 GDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 ++L+ S L++P + S + LS V DN+G + + + + + Sbjct: 436 A-PELLAKGGKINETSTTTIDLVLPAYDEDNQGSKAYTLSAVGVDNEGNKSKAAVMVIHV 494 Query: 449 VEPFDALS 456 + D + Sbjct: 495 TQSKDGFA 502 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 188 bits (478), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 103/293 (35%), Positives = 159/293 (54%), Gaps = 8/293 (2%) Query: 100 QHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSN 159 + VE WL G A V ++VD++ F+GS FVPL + L +SQ G + D + N Sbjct: 119 KSVEEWLGHIGKAQVKLQVDDKNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMN 178 Query: 160 VGVGQRWARGNWLVGYNTFYDNLLDENL-QRAGFGAEAWGEYLRLSANFYQPFAAWHEQT 218 +G+GQR +W++GYN F D + N +R G G E +Y++LSAN Y W T Sbjct: 179 IGLGQRHYLSDWMLGYNVFLDQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNST 238 Query: 219 ATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGL 276 ++ +R A GYD+ +P+Y L + EQYFG+ V LF NP AL+ + Sbjct: 239 RLEDYDERAASGYDIRTEAYLPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASV 298 Query: 277 NYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDN 336 +YTP PLV + +H G SG+N+ + L +NY P +KQ+ V +++L GSR D Sbjct: 299 SYTPFPLVNLALEHTIGNSGKNKTGVNLAVNYEINTPWQKQIDPAAVKATRTLAGSRMDL 358 Query: 337 PQRNNLPTLEYRQRKTLTVFLATPPWDL--KPGETVPLKLQIRSRYGIRQLIW 387 RNN LEYR+++ +T+ L P + K + +P+ +R+G+ ++ W Sbjct: 359 VDRNNNIVLEYRKQQVVTLNL---PAKVSGKEKQVLPINYTFNARHGLDRIEW 408 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 187 bits (474), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 140/438 (31%), Positives = 214/438 (48%), Gaps = 19/438 (4%) Query: 35 PFDNNND----GLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKV 90 P N+N L + M + D + AE+ + G D +D+ EQA + A G V Sbjct: 117 PLINSNSPEARNLKAMQMERDGKDPQMQVAEMAQQSGTLLARD--MDS-EQAASMARGWV 173 Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 + S Q WLS WG A V + VD + S + P + L +SQ L Sbjct: 174 ASSASAQATD----WLSRWGTARVSLGVDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLH 229 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 + DN +N G+G R+ +W+ G N F D+ L R G G E W +YL+LS N Y Sbjct: 230 RTDNRTQTNHGIGWRYFTSSWMSGVNMFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLR 289 Query: 211 FAAWH---EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 + W E E R A G+DL A +P + L V EQY+GD V LF + Sbjct: 290 LSNWRSAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKVVYEQYYGDEVALFGKDERQN 349 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 +P A++ GL+YTPVPL++ +A+ +QG+ GEN +G+ L + G L+KQL EVA + Sbjct: 350 DPHAITAGLSYTPVPLISFSAEQRQGKQGENDTRIGMELTLQPGHSLQKQLDPAEVAARR 409 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 SL GSRYD RNN LEYR+++ + + L T P KPGE L ++++Y ++ + Sbjct: 410 SLVGSRYDLVDRNNNIVLEYRKKELVRLTL-TDPLKGKPGEVKSLVSSLQTKYALKG--Y 466 Query: 388 QGDTQILSLTPGAQANSAEGWTLIMPDWQNGE--GASNHWRLSVVVEDNQGQRVSSNEIT 445 + L G A S + + +P ++ N + ++V ED++G E Sbjct: 467 DIEAASLQSAGGKVAVSGKDIQVTIPPYRFTAMPETDNTYPIAVTAEDSKGNFSRREESM 526 Query: 446 LTLVEPFDALSNDELRWE 463 + + +P +L+ L + Sbjct: 527 VVVEKPTLSLAGSTLSVD 544 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 187 bits (474), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 121/383 (31%), Positives = 186/383 (48%), Gaps = 16/383 (4%) Query: 74 NGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFV 133 N E A+ +A + A + WLS WG V + +D + GS + Sbjct: 72 NSRQAAEMARGYATSTAQSAFQE--------WLSQWGTVRVTLGLDEDFTLKGSAFDLLL 123 Query: 134 PLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFG 193 P D L ++Q + D+ N G G R +++ G N F+D+ L R G G Sbjct: 124 PWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFAPDYMAGVNLFFDHDLTRYHSRMGLG 183 Query: 194 AEAWGEYLRLSANFYQPFAAWHEQTATQ---EQRMARGYDLTARMRMPFYQHLNTSVSLE 250 E W + L+L AN Y + W + E R A G+D+ A +P Y L ++ E Sbjct: 184 GEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARPANGWDVRAEGYLPAYPQLGATLMYE 243 Query: 251 QYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRF 310 QY+GD V LF +P A + GL+YTPVPL++++A+ KQG+ GEN LNL Y Sbjct: 244 QYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLISLSAEQKQGKGGENDTRFALNLTYTP 303 Query: 311 GVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETV 370 GV L QL VA +SL GSR+D +RNN LEYR+++ + + L P K GE Sbjct: 304 GVSLAHQLDPDAVAYRRSLAGSRHDLVERNNNIVLEYRKKELVKLQLHDPVTG-KGGEQK 362 Query: 371 PLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG--ASNHWRLS 428 PL ++S+Y ++ L + + L G A T+ +P+++ N +R++ Sbjct: 363 PLVASLQSKYALKTL--RAEAAELQSAGGVVNTEANQVTVTLPEYRYTATPQTDNVYRVA 420 Query: 429 VVVEDNQGQRVSSNEITLTLVEP 451 V ED +G R + E ++ ++ P Sbjct: 421 VTAEDEKGNRSNREEASVVVLAP 443 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 186 bits (473), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 116/380 (30%), Positives = 193/380 (50%), Gaps = 18/380 (4%) Query: 93 ALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQ 152 A + + Q + WL+ G V + D + S+ VPL +++ ++ +SQ + + Sbjct: 159 AANAKAGQEISDWLNGKGKVRVKLDADRDFSVKNSQLDLLVPLWESESHMIFSQGSVHRT 218 Query: 153 DNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 D+ SN+G+G R+ ++ +G NTFYD+ + R G GAE + +L+ N Y + Sbjct: 219 DDRTQSNLGLGYRYFADSYALGANTFYDHDWSRSHSRLGLGAEYQRNFFKLATNGYLRLS 278 Query: 213 AWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPV 270 W + E+R A G+D+ A +P Y L ++ EQY+GD V LF NP Sbjct: 279 NWKDSPDFDNYEERPANGWDIRAEGYLPSYPGLGAKLAYEQYYGDNVGLFGKDNQQKNPH 338 Query: 271 ALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLR 330 A++ G NY+P PL+ + +QG+ G+N G++LNY G PL QL ++ S+SL Sbjct: 339 AITFGGNYSPFPLLKFSVDQRQGKGGQNDTRFGIDLNYTLGTPLSHQLDRNQLIASRSLI 398 Query: 331 GSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGD 390 +RYD RNN LEYR++ TL++ LA GE L++ + S G+ ++ W Sbjct: 399 ANRYDFVDRNNNIVLEYRKKNTLSLKLAQQVSGYT-GERKSLEVSVNSSNGLERIDWDAP 457 Query: 391 T------QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEI 444 QI+ PG +++I+P++Q G GA+N + ++ DN G +S + Sbjct: 458 ELLSNGGQIIQEGPGL-------YSVIVPEFQYGVGAANQYIVNATAYDNSGN--ASQQA 508 Query: 445 TLTLVEPFDALSNDELRWEP 464 + T+V A+S + P Sbjct: 509 STTVVVTASAVSTTHSEFTP 528 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 186 bits (471), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 138/438 (31%), Positives = 215/438 (49%), Gaps = 19/438 (4%) Query: 35 PFDNNND----GLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKV 90 P N+N L + M + D + AE+ + G D +D+ EQA + A G V Sbjct: 117 PLINSNSPEARNLKAMQMERDGKDPQMQVAEMAQQSGTLLARD--MDS-EQAASMARGWV 173 Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 + S Q WLS WG A V + VD + S + P + L +SQ L Sbjct: 174 ASSASAQATD----WLSRWGTARVSLGVDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLH 229 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 + D+ +N G+G R+ +W+ G N F D+ L R G G E W +YL+LS N Y Sbjct: 230 RTDDRTQTNHGIGWRYFTSSWMSGVNMFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLR 289 Query: 211 FAAWH---EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 + W E E R A G+DL A +P + L + EQY+GD V LF + Sbjct: 290 LSNWRSAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKLVYEQYYGDEVALFGKDERQN 349 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 +P A++ GL+YTPVPL++ +A+ +QG+ GEN +G+ L + G L+KQL EVA + Sbjct: 350 DPHAITAGLSYTPVPLISFSAEQRQGKQGENDTRIGMELTLQPGHSLQKQLDPAEVAARR 409 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 SL GSRYD RNN LEYR+++ + + L T P KPGE L ++++Y ++ + Sbjct: 410 SLVGSRYDLVDRNNNIVLEYRKKELVRLTL-TDPLKGKPGEVKSLVSSLQTKYALKG--Y 466 Query: 388 QGDTQILSLTPGAQANSAEGWTLIMPDWQNGE--GASNHWRLSVVVEDNQGQRVSSNEIT 445 + L G A S + + +P ++ N + ++V ED++G E Sbjct: 467 DIEAASLQSAGGKVAVSGKDIQVTIPPYRFTAMPETDNTYPIAVTAEDSKGNFSRREESM 526 Query: 446 LTLVEPFDALSNDELRWE 463 + + +P +L++ L + Sbjct: 527 VVVEKPTLSLTDSTLSVD 544 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 185 bits (469), Expect = 3e-45, Method: Compositional matrix adjust. Identities = 124/402 (30%), Positives = 210/402 (52%), Gaps = 21/402 (5%) Query: 55 GEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASV 114 G + A++ G ++DN G+ A + A G+V S Q+ Q WL+ +G A V Sbjct: 166 GARKMADVASRAG-AFLSDN--PNGDAALSLARGEVTAEASGQLQQ----WLNQFGTARV 218 Query: 115 DVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVG 174 + D F S+ PL + L ++Q L + D+ N+G G R+ ++++G Sbjct: 219 QLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQVNLGFGLRYFAPSYMLG 278 Query: 175 YNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLT 232 N F D L R G G E W ++L+LSAN Y + W+ + ++ +R A G+D+ Sbjct: 279 GNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNSSDFKDYQERPANGWDIR 338 Query: 233 ARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQ 292 A+ +P L ++ EQY+G V LF +P A++ G+N+TP PL+T+ A+H+Q Sbjct: 339 AQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAGVNFTPFPLLTLNAEHRQ 398 Query: 293 GESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKT 352 G SG+N LG++ +Y+ G+P ++Q++ VA +SL GSRYD +RNN L+YR+++ Sbjct: 399 GASGKNDKRLGVDFSYQLGMPWQQQINPQAVATMRSLAGSRYDLVERNNHILLQYRKKEV 458 Query: 353 L---TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG-W 408 + TV T GE L + + S YG+ ++ W + L G EG W Sbjct: 459 IRLHTVGRVTG----YAGERKSLGVSVNSSYGLERIDWSASS--LLAAGGKLVRENEGSW 512 Query: 409 TLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVE 450 ++I+P+ + GE +N W ++ V D +G + + +T+ + Sbjct: 513 SVILPEHKPGE--ANSWTITGVAVDKKGNVSTGADTQVTVAQ 552 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 184 bits (467), Expect = 6e-45, Method: Compositional matrix adjust. Identities = 107/363 (29%), Positives = 182/363 (50%), Gaps = 5/363 (1%) Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 R + VN + WL+ +G A + + D + S W +PL D+ ++Q G Sbjct: 169 RTMATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLLPLYDSQTLTLFTQQGFR 228 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 +D ++N+G+G R+ W++G N FYDN + +R G GAE W + +LSAN Y Sbjct: 229 NKDRRNIANIGIGTRFIHHEWMMGGNAFYDNDFTGDNKRVGLGAELWTDSFQLSANGYFR 288 Query: 211 FAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHN 268 AWH+ + +R A G DL A +P HL S+ E YFGD V LF N Sbjct: 289 LTAWHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYEHYFGDNVALFGKDHLQRN 348 Query: 269 PVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQS 328 P A++LG +YTP L+T+ + + G+ G LGL++NYR G L QL + +++ Sbjct: 349 PYAITLGGSYTPFSLLTLEVKQRLGKQGNQDTQLGLHINYRLGADLPAQLDPAALVAART 408 Query: 329 LRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQ 388 + +RYD +RN+ L+Y++++ L + +T + PG + + ++ S+YG+R L W Sbjct: 409 IAKTRYDLVERNHNIVLQYQEQQRLKI-KSTEYLEGYPGNSSEIYAEVVSKYGVRNLQWM 467 Query: 389 GDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 ++ G Q L + + N + + V+ D +GQ + + +++ Sbjct: 468 NVAAFVA--AGGQIMELPNNRLKITYPPYNDNGDNRYHIDVMAYDTRGQSSNISTTQISV 525 Query: 449 VEP 451 ++P Sbjct: 526 LKP 528 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 184 bits (466), Expect = 8e-45, Method: Composition-based stats. Identities = 113/377 (29%), Positives = 191/377 (50%), Gaps = 15/377 (3%) Query: 57 KHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDV 116 + A I D G NDN +K AL + ++ +VN H++SW +G A + + Sbjct: 133 QQIASIATDVGNILSNDN------ISKNSAL---LNKITNKVNSHIQSWFENFGTAHIQL 183 Query: 117 KVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYN 176 +VD S+ P+ ++D L +SQ G++ D+ +SN+G+G R NW++G N Sbjct: 184 QVDKNFSLKNSQLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDNWMLGGN 243 Query: 177 TFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTAR 234 +F D L + R G G E W + L+L AN Y + W + E+R A G DL + Sbjct: 244 SFIDYDLRKEHSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNIK 303 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 +P Y + + E+Y+GD V LF NP + +LG++YTP PL++ A+HK G Sbjct: 304 SWLPSYPQIGGDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLMSFKAEHKMGS 363 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 + N + +G +NY+ P + Q++ + + L G RYD +RNN L+YR+++ + Sbjct: 364 NNINDSRIGFEINYQIHTPWESQINPVLIPAMRKLAGQRYDLVERNNNIILDYRKKEIIK 423 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSA-EGWTLIMP 413 + GE L +++ S+Y + ++ W +T I + G N +++I+P Sbjct: 424 IDGVDVISGFS-GEKKRLDIRVNSKYPVDRIDWLANTFIAN--GGKIINEGLHNYSIILP 480 Query: 414 DWQNGEGASNHWRLSVV 430 D++N E S LS + Sbjct: 481 DYRNQENNSYTIDLSAI 497 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 183 bits (464), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 112/368 (30%), Positives = 187/368 (50%), Gaps = 23/368 (6%) Query: 76 LDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNA--SVDVKVDNEGHFTGSRGSWFV 133 L +G +A GK+ SQ +Q +E WL +GNA +++ + DN GS Sbjct: 26 LKSGPAFDQYAAGKI----SQLTSQAIEGWLKQYGNARITLNAQSDNSTALAGSSADLLF 81 Query: 134 PLQDNDRYLTWSQLGLTQQDN-GLVSNVGVGQRWARGN-WLVGYNTFYDNLLDENLQRAG 191 L + D L + Q QD ++ NVG+GQR+ N ++GYN FYD ++ + R+G Sbjct: 82 GLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNINSGVSRSG 141 Query: 192 FGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSL 249 G E W +Y + S N Y + W EQ +++ A GYD+ +P Y L + Sbjct: 142 VGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQLGGHLKY 201 Query: 250 EQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYR 309 EQYFGD V LF++ +P A+++G++YTP+PL+T +K+G + ++ +NY Sbjct: 202 EQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPLITFALDYKKGNDSLDDTSISAAINYA 261 Query: 310 FGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGET 369 GVP +Q+S+ V +SL GSR+D RNN ++YR++ + + L T + + + Sbjct: 262 IGVPWSQQISSDYVQTRRSLVGSRFDFVSRNNDIVMQYRKQDVIKLILPT-QLNGQATQQ 320 Query: 370 VPLKLQIRSRYGIRQLIWQGDTQIL----SLTPGAQANSAEGWTLIMPDWQ-----NGEG 420 +PL + ++ G+ + W + +L ++ PG+ A +T+ +P NG Sbjct: 321 LPLVATVEAKNGLDHIQWDSSSSLLQAGGTVIPGSDATH---FTVSLPATAGQYVLNGTA 377 Query: 421 ASNHWRLS 428 NH S Sbjct: 378 YDNHHNAS 385 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 182 bits (463), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 122/370 (32%), Positives = 193/370 (52%), Gaps = 9/370 (2%) Query: 86 ALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWS 145 A G R S + + + WL+ +G A + + VD + S+ + P D YL +S Sbjct: 165 ASGMARGWASSEASGAMTDWLNNFGTAKISLGVDEDFSLKNSQFDFLHPWYDTPDYLLFS 224 Query: 146 QLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSA 205 Q L + D+ N G+G R +W+ G N F+D+ L RAG GAE W +YL+LS+ Sbjct: 225 QHTLHRTDDRTQINTGLGWRHFTPSWMSGINLFFDHDLSRYHSRAGLGAEYWRDYLKLSS 284 Query: 206 NFYQPFAAWH---EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNS 262 N Y W E E R A G+DL A +P + L + EQY+GD V LF+ Sbjct: 285 NAYIGLTGWRSAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKLVYEQYYGDEVALFDK 344 Query: 263 GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGE 322 NP A++ GLNYTP PL+T++A+ +QG+ GEN ++L ++ ++KQL+ E Sbjct: 345 NDRQSNPHAITAGLNYTPFPLLTLSAEQRQGKQGENDTRFAVDLTWQPSSSMQKQLNPDE 404 Query: 323 VAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGI 382 VA +SL GSRYD RNN LEYR+++ + + L P K GE PL ++++Y + Sbjct: 405 VAGRRSLAGSRYDLIDRNNNIVLEYRKKELIRLSLL-DPVKGKSGEIKPLVSSLQTKYAL 463 Query: 383 RQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ--NGEGASNHWRLSVVVEDNQGQRVS 440 + + + L G + S + T+ +P ++ N N W + V ED +G +S Sbjct: 464 KG--YNIEAAALEAAGGKVSTSGKDITVTLPGYRFTNTPETDNTWSIDVTAEDVKG-NLS 520 Query: 441 SNEITLTLVE 450 +E ++ +++ Sbjct: 521 RHEQSMVVIQ 530 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 182 bits (462), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 111/363 (30%), Positives = 189/363 (52%), Gaps = 11/363 (3%) Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 R+ + + + WLS +G A V + D S+ +PL D ++Q L Sbjct: 168 RNMATVEAGGAFQQWLSHFGTARVQLDADKNFSLKNSQFDLLLPLYDQGDNFVFTQGSLH 227 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 + D+ +++G G R + +++G N F D L + RAG G E W +L+L N Y Sbjct: 228 RTDSRTQASLGAGWRHSTSTYMLGGNLFGDFDLSRDHARAGAGLEYWRNFLKLGVNSYLR 287 Query: 211 FAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHN 268 + W + ++ +R A G+D+ + +P L ++ EQY+G V LF + N Sbjct: 288 LSGWKDSPDLEDYQERPANGWDVRGQAWVPSLPQLGGKLTYEQYYGKEVALFGVDSRQRN 347 Query: 269 PVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQS 328 P A+++G+NYTPVPL+T+ A+ +QG+SG++ L LN+NY GVP + Q+ VA +S Sbjct: 348 PHAITVGINYTPVPLITLGAEQRQGQSGKSDTRLTLNMNYHLGVPWRAQVDPTAVAAMRS 407 Query: 329 LRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLK---PGETVPLKLQIRSRYGIRQL 385 L GS+YD +RNN LEYR+++ + + A DL GE L + + SR+G+ ++ Sbjct: 408 LAGSQYDLVERNNNIVLEYRKKEIVRLKTA----DLVTGYTGEQKSLGVSVNSRHGLERI 463 Query: 386 IWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEIT 445 W D L+ G + + +++P +Q+ N + +S V D +G R S ++ Sbjct: 464 DW--DASALNAAGGKIVQNGRDYAVVLPAYQSSAQGVNTYTVSGVAVDTKGNRSSRSDTQ 521 Query: 446 LTL 448 +T+ Sbjct: 522 VTV 524 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 177 bits (449), Expect = 8e-43, Method: Compositional matrix adjust. Identities = 113/340 (33%), Positives = 176/340 (51%), Gaps = 8/340 (2%) Query: 120 NEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFY 179 ++G T F PL D D L + Q+G + D + N+G+GQR+ +G+W +GYNTFY Sbjct: 123 SDGKLTSGSIDLFYPLYDGDSRLFFGQVGARRFDGRNIVNLGIGQRYFQGDWALGYNTFY 182 Query: 180 DNLLDENL-QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQ--EQRMARGYDLTARMR 236 D + N QR GFG E W +YL LSAN Y W+ +A +R A GYD+ A+ Sbjct: 183 DIQISGNAHQRLGFGLEYWRDYLYLSANGYFGLTDWYSSSALDGYAERAANGYDIRAQGW 242 Query: 237 MPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESG 296 P Y L+ + EQYFGD + L N Y NP AL++GL YTP+ L+++ G Sbjct: 243 FPVYPQLSGKLKFEQYFGDDIALLNHQNRYKNPYALTMGLEYTPIQLISLGIDRTFSHRG 302 Query: 297 ENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVF 356 ++ + L+ NY+ GVPL +Q+ ++L +RY +RNN L++R+R L+++ Sbjct: 303 KDDTKVNLSFNYQLGVPLSQQIDPTVAPVKRTLADNRYHLVERNNNIVLKHRERAQLSLY 362 Query: 357 LATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ 416 L T GE + +Y ++ + W D + + A S + + P++ Sbjct: 363 LPTGLSGF-GGERKLINFSFNGKYRLKHIQW-NDGALRARGGRIIALSNNSYVVQFPNYS 420 Query: 417 NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 + SNH +S V D QG +S+E+ + + P ALS Sbjct: 421 RQQ--SNHITISAVAHDEQGNVSNSSEMGVLINVPV-ALS 457 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 177 bits (449), Expect = 8e-43, Method: Compositional matrix adjust. Identities = 104/297 (35%), Positives = 161/297 (54%), Gaps = 11/297 (3%) Query: 101 HVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNV 160 VE WLS +G A V + VD++G++ S + PL D+ + + ++QLGL D+ + N Sbjct: 77 SVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLAPLYDSQKAMLFTQLGLRAPDDRVTGNF 136 Query: 161 GVGQR-WARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA 219 G+G R + NW+ G N F+D+ + +R GFGAEAW L+LSAN Y WH Sbjct: 137 GLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGFGAEAWTNNLKLSANTYLGTTNWHSSRD 196 Query: 220 TQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLN 277 + ++ A G+D+ A +P Y L + EQY+GD+V LF+ NP A+++G++ Sbjct: 197 FDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYEQYYGDKVALFDKDDLQSNPSAVTVGVS 256 Query: 278 YTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNP 337 YTPVPL+T +++G+ ++ + G+N Y FG L QLS+ EV +SL GSRYD Sbjct: 257 YTPVPLITAAVDYRRGQDSMDETHFGVNFRYNFGQSLSSQLSSSEVQNLRSLAGSRYDLV 316 Query: 338 QRNNLPTLEYRQRKT--------LTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI 386 +RNN L+Y+++K LT P D TV ++ +R + Sbjct: 317 ERNNEIVLQYKEKKQNNAVADMLLTTVKDNSPADGVTANTVTVRATTSDGTPVRNTV 373 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 177 bits (448), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 111/371 (29%), Positives = 186/371 (50%), Gaps = 13/371 (3%) Query: 86 ALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWS 145 A VR +S N+ ++ WL +G A V + VD++ S W D+ + ++ Sbjct: 170 ATDVVRSEVSSTANKEIQKWLGQYGTAQVRLNVDDKFSLRESSLDWLFSFYDSSSAIIFT 229 Query: 146 QLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSA 205 QLG+ +D+ +N+G+G R + GNW++G NTFYDN L R GFGAEAW +YL+LSA Sbjct: 230 QLGIRNKDHRNTANLGLGGRISMGNWILGANTFYDNDLTGINSRLGFGAEAWTDYLQLSA 289 Query: 206 NFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 N Y WH+ ++R A G+D+ +P L + EQY GD V LF Sbjct: 290 NSYMRLNNWHQSRDFIDHDERPANGFDIRTNAWLPVLPQLGGKLMYEQYSGDSVALFGKD 349 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV 323 NP A++ G+ YTP PL+T ++G++G++ + L+Y G V Sbjct: 350 KLQKNPYAVTAGITYTPFPLLTFGIDERRGKAGKSDTQFNIQLSYHLGESWLSLTDPSAV 409 Query: 324 AESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIR 383 A ++ L +RY+ RNN LEY+++ L + +T G+ + +I S++ + Sbjct: 410 AGTRQLAEARYNLVDRNNNIVLEYQKQDILNI-TSTEQLRGYSGDNGIILTKIVSKHNVE 468 Query: 384 QLIWQGDTQILSLTPGAQANSAE----GWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRV 439 ++ W + +L+ A NS E + P +Q +N + + VV D++G R Sbjct: 469 RVEWINISALLA----AGGNSVELPGRKLAITYPPYQ--IDGNNTYHVDVVAYDSRGNRS 522 Query: 440 SSNEITLTLVE 450 + + +T+++ Sbjct: 523 NISTTAITVLQ 533 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 177 bits (448), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 123/410 (30%), Positives = 197/410 (48%), Gaps = 20/410 (4%) Query: 39 NNDGLPDLG---MAPENHDGEKHFAEIVKDFGE-TSMNDNGLDTGEQAKAFALGKVRDAL 94 +N LP LG + EN + E AE K G S D + A+ +A R+ + Sbjct: 39 DNKELPSLGSDQIIDEN-NTEHLAAEYTKTVGTFLSQKKTMKDLSQIAQDYA----RNKV 93 Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 S + + +E WLS GN +++ D + S+ W +P D + L ++Q L + D Sbjct: 94 SSEATKEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLFTQHTLHRYDE 153 Query: 155 GLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW 214 +N G+G R+ +G N F D+ L R G G E W +YL+L+AN Y +W Sbjct: 154 RFHTNNGIGLRYFHEKSTIGMNAFIDHDLSHAHTRVGLGVEYWQDYLKLNANSYFGLTSW 213 Query: 215 HEQTATQEQ---RMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 + + A G+D+ +P Y HL ++ EQY+GD V LF NP A Sbjct: 214 KSASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALFGKTKRQKNPNA 273 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 ++G N+TP PL T+ A HK G + + L + FG L L +VAE++ L G Sbjct: 274 ATIGANWTPFPLFTLNASHKLGSEKQVETQAKLQFTWTFGKNLAHHLDPTKVAETRRLSG 333 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDT 391 +RYD +RNN L Y+++ L + L + + G++VPL S+Y ++ + WQ Sbjct: 334 NRYDFVERNNNIILNYQKKTVLHLSLPSKIQGIT-GQSVPLVKSFTSKYPLKHIEWQAP- 391 Query: 392 QILSLTPGAQANSAEGWTLIMPDWQNGEGAS-----NHWRLSVVVEDNQG 436 + L++ G+ ++ + TL +P +Q A N +RL + D +G Sbjct: 392 EFLAV-GGSISSDDQTATLTLPSYQTSNAAKDVQRINRYRLRAIAYDIKG 440 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 176 bits (445), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 127/426 (29%), Positives = 211/426 (49%), Gaps = 43/426 (10%) Query: 43 LPDLG---MAPENHDGEKHFAEIVKDFGETSMNDNGLDTG-EQAKAFALGKVRDALSQQV 98 LP+LG + PE ++ E+ FA G+ +DN +D AK+ G V Sbjct: 90 LPNLGSTKILPEENNNEEKFASSFTLMGDILSSDNFVDNSINYAKSIGQG--------LV 141 Query: 99 NQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVS 158 NQ + WL+ +G A + D G + +P+ D L ++QLGL + Sbjct: 142 NQQINDWLNQYGKARISFSSD-----KNISGDFLLPVIDEPNNLLFTQLGLRNNTDRNTI 196 Query: 159 NVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT 218 N+G+G R NW+ G NTFYD R G G EAW +YL+L+ N Y WH+ Sbjct: 197 NLGLGYRKYWRNWMFGINTFYDYDYTGGNARLGVGGEAWIDYLKLAINGYFGLTDWHQSK 256 Query: 219 AT----QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH------N 268 + ++R A G+D+ A +P Y L +S+ E+YFG + L GTG + + Sbjct: 257 ISVMDDYDERPATGFDVRAEAYLPKYPQLGSSIKYEKYFGKGIHL---GTGVNPEYLKDD 313 Query: 269 PVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQS 328 +L +GLNYTP+PL+T+ A+ G+ N + L++NYRFGVPL +QL+ V +S Sbjct: 314 AQSLIMGLNYTPIPLLTLKAERSIGD--RNDTKISLDVNYRFGVPLSQQLNPDAVDVMRS 371 Query: 329 LRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDL--KPGETVPLKLQI-RSRYGIRQL 385 L G++YD RN ++YR++ L +FL P ++ + +T + + + +++YG++ + Sbjct: 372 LVGNKYDFVDRNYDIVMQYRKQDLLNIFL---PREIVGEARDTHRINVTVNKTKYGLKNI 428 Query: 386 IWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASN---HWRLSVVVEDNQGQRVSSN 442 W D +++ + S + P + + +N + +S + DN G SN Sbjct: 429 KWIIDPKLIEDKGHFKQISQTEGIITFPIYNSLNEKNNLPAEYYISAIGTDNNGNE--SN 486 Query: 443 EITLTL 448 + T + Sbjct: 487 KATTII 492 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 175 bits (444), Expect = 3e-42, Method: Compositional matrix adjust. Identities = 105/283 (37%), Positives = 153/283 (54%), Gaps = 13/283 (4%) Query: 102 VESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVG 161 VE WLS +G A V + VD+ G++ S + PL DN + + ++QLG+ D N+G Sbjct: 90 VEEWLSHFGTAQVTLDVDDNGNWDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIG 149 Query: 162 VGQR--WARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA 219 +G R + R +W+ G N F+D+ +R GFGAEAW YL+LSAN Y + WH Sbjct: 150 LGVRTFYVR-DWMFGGNVFFDDDFTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGD 208 Query: 220 --TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLN 277 ++ A GYD+ A +P + L + EQY+GD V LF+ NP A+++GLN Sbjct: 209 FDNYNEKPADGYDVRAEGYLPSFPQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLN 268 Query: 278 YTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNP 337 YTPVPL+T +K+G+ ++ LN +Y + Q+S +VA +SL GSRYD Sbjct: 269 YTPVPLITAGIDYKRGQDSMDEMKFSLNFHYALDSSWQSQISPEQVATRRSLAGSRYDLV 328 Query: 338 QRNNLPTLEYRQRKTLTVF----LAT----PPWDLKPGETVPL 372 RNN L+Y+++ T LAT P D +TV L Sbjct: 329 DRNNEIILQYKKKATSKAVADMTLATIKNNSPADGTSADTVTL 371 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 174 bits (442), Expect = 4e-42, Method: Compositional matrix adjust. Identities = 100/274 (36%), Positives = 148/274 (54%), Gaps = 7/274 (2%) Query: 80 EQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND 139 + AK+ A G A + V Q WLS +G A V + VD+ G++ S PL DN Sbjct: 73 DGAKSAATGMATSAAASSVQQ----WLSQFGTARVQLNVDDNGNWDDSAVDLLAPLYDNK 128 Query: 140 RYLTWSQLGLTQQDNGLVSNVGVGQR-WARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 + + ++QLGL D N+G+G R + NW+ G N F+D+ +R GFGAEAW Sbjct: 129 KAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFGGNVFFDDDFTGKNRRVGFGAEAWT 188 Query: 199 EYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 YL+LSAN Y WH + ++ A GYD+ A +P Y L + EQY+GD+ Sbjct: 189 NYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIRAEGYLPAYPQLGAKLMYEQYYGDK 248 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LF++ NP A++ G++YTPVPLV + +K+G+ + +N Y FG + Sbjct: 249 VALFDTDHLQSNPSAVTTGISYTPVPLVQLAVDYKRGQDSMDDTQFQVNFRYDFGHDWRY 308 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQR 350 Q+ V +SL GSRYD +RNN L+Y+++ Sbjct: 309 QIDPENVKAERSLAGSRYDLVERNNQIVLQYKKK 342 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 174 bits (441), Expect = 7e-42, Method: Compositional matrix adjust. Identities = 103/286 (36%), Positives = 153/286 (53%), Gaps = 13/286 (4%) Query: 68 ETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGS 127 ET NGL + A + A G ++ VE WLS +G A V++ D G++ S Sbjct: 39 ETENGSNGLKS--TATSMATGAAANS--------VEEWLSHFGTAEVNLNTDENGNWDNS 88 Query: 128 RGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQR-WARGNWLVGYNTFYDNLLDEN 186 + PL DN + + ++QLGL D N+G+G R + NW+ G N F+D+ Sbjct: 89 SIDFLAPLYDNKKSVLFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGK 148 Query: 187 LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLN 244 +R G GAEAW +YL+L+AN Y WH A ++ A G+D+ A +P Y L Sbjct: 149 NRRVGIGAEAWTDYLKLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLG 208 Query: 245 TSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGL 304 V EQY+G+ V LF+ ++P A+++GLNYTP+ LVT +K+G+ ++ L Sbjct: 209 AKVMYEQYYGENVALFDKDHLQNDPSAVTMGLNYTPISLVTAGIDYKRGQDSQDDVKFSL 268 Query: 305 NLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQR 350 N Y G +Q SA +VA +SL GSRYD RNN L+Y+++ Sbjct: 269 NFRYAIGESWSQQTSADQVALRRSLAGSRYDLVNRNNEIILQYKKK 314 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 173 bits (438), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 125/362 (34%), Positives = 179/362 (49%), Gaps = 36/362 (9%) Query: 105 WLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQ 164 WL +GNA V + ++ G+ +PL + L + QLG+ +NVG+G Sbjct: 173 WLGQYGNARVQLNSNSIGN-----ADVLIPLTETQNNLLFGQLGVRYNGERTTNNVGLGV 227 Query: 165 RWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT-ATQE- 222 R +W+ G NTFYD L R G G EAW + L+ SAN Y WH+ A E Sbjct: 228 RSFTDSWMFGVNTFYDYDLTGKNSRLGVGGEAWTDNLKFSANGYFRLTDWHQSVLADMED 287 Query: 223 --QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGY-----HNPVALSLG 275 +R A G+D+ A +P Y L + E+YFG V L NSG+ +P A ++G Sbjct: 288 YNERPANGFDVRAEAYLPSYPQLGGRLMYEKYFGKGVAL-NSGSTSPDDLGDSPSAFTVG 346 Query: 276 LNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYD 335 LNYTP+PL TV HK+G++ N+ LGLN NYRFGVP Q++ V +SL GSRYD Sbjct: 347 LNYTPIPLFTVDVAHKKGQNTNNELQLGLNFNYRFGVPWVDQINKNAVGLMRSLMGSRYD 406 Query: 336 NPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKL--QIRSRYGIRQLIWQGDT-- 391 RN ++Y ++ + + L P L L L I ++YG ++ W Sbjct: 407 IVDRNYNIVMQYEKQDLIKLTL---PETLAAYAITNLSLTGNITAKYGAERMEWSAPALM 463 Query: 392 ----QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLT 447 I+ LT E ++ +P +Q + A N +++S V D +G R SN T T Sbjct: 464 AAGGSIIPLT-------MESASVTLPPYQQVQTA-NSYQISAVAYDVRGNR--SNTATTT 513 Query: 448 LV 449 LV Sbjct: 514 LV 515 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 171 bits (432), Expect = 6e-41, Method: Compositional matrix adjust. Identities = 120/409 (29%), Positives = 201/409 (49%), Gaps = 17/409 (4%) Query: 51 ENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWG 110 E + K+ A I+ G DN D A + + S V ++ WL +G Sbjct: 57 EEDEAGKNLAAILSSTGSMLSQDNKTD------ALINSAINNG-SAYVTGQIQQWLQQFG 109 Query: 111 NASVDVKVDNEGHF-TGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARG 169 A V++ +D + S D + L ++Q G + D+ + NVG+G R+ Sbjct: 110 TAKVNLGLDKDLSLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFAD 169 Query: 170 NWLVGYNTFYDNLLDENL-QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMA 226 W+ G NTFYD + +N +R G G E Y +LSAN Y+ + W + + ++ +R+A Sbjct: 170 RWMWGINTFYDRQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVA 229 Query: 227 RGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTG--YHNPVALSLGLNYTPVPLV 284 GYD+ A +P + L + EQY+GD V LF+ NP A++ G+NYTP PLV Sbjct: 230 NGYDIRAEGYLPAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLV 289 Query: 285 TVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 ++ K G+ N + L +N+ G LK QL + V ++L GSR D RNN Sbjct: 290 SIGLNQKMGKGNHNDTQIDLAVNWMLGSSLKSQLDSDAVKARRTLLGSRLDLINRNNNIV 349 Query: 345 LEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANS 404 LEYR++ +++ + + ET+P+ + ++S+Y + + W+ D L G + + Sbjct: 350 LEYRKQDLISLKVQNKVTGTE-SETLPVSVNVKSKYPLDHISWEDDN--LVKNGGKISEN 406 Query: 405 AEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFD 453 W++ +P +Q G N + +S DNQG + +++ +T+ V FD Sbjct: 407 NGSWSVTLPHYQQNSGEKNLYVVSATAWDNQGNKSNASHMTVE-VSGFD 454 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 169 bits (428), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 117/374 (31%), Positives = 187/374 (50%), Gaps = 30/374 (8%) Query: 56 EKHFAEIVKDFGETSMNDNGLDTGEQA-KAFALGKVRDALSQQVNQHVESWLSPWGNASV 114 E+ A+ + G N D Q + A G+ DA++Q WL+ +G A Sbjct: 40 EQSVAQTAMEAGRVLQGSNSGDAARQMLTSQASGQAADAVTQ--------WLNQFGTAKT 91 Query: 115 DVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLV 173 + V ++ GS +P + + + ++QLG+ D +N G+G R+ N W++ Sbjct: 92 QLSVVSDFSLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGWML 151 Query: 174 GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ-TATQ-EQRMARGYDL 231 GYN FYD +R G G EAW +YL+LSAN Y+ + W + T T ++R A G+D+ Sbjct: 152 GYNVFYDVDWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGWDI 211 Query: 232 TARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHK 291 A +P Y L + EQY+G+ V LF NP A++ G+ +TP L+T ++ Sbjct: 212 RAEGWLPAYPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLLTAGVDYR 271 Query: 292 QGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRK 351 +G++G + L L L YR G PL QL + V +SL +R + RNN LEYR++ Sbjct: 272 RGKNGADDTRLNLGLTYRIGEPLAHQLDSSRVGAQRSLAANRLELVNRNNDVVLEYRKQT 331 Query: 352 TLTVFLATPPWDLKPGE--TVPLKLQIRSRYGIRQL------IWQGDTQILSLTPGAQAN 403 +T+ L PP D+ E TV L Q+ ++YG+ ++ + Q +I+S N Sbjct: 332 LITLQL--PP-DVYGAELTTVTLTPQVNAKYGLSRIELDDAELRQAGGKIIS-------N 381 Query: 404 SAEGWTLIMPDWQN 417 + TL +P W + Sbjct: 382 TGNQITLQLPAWSS 395 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 167 bits (423), Expect = 8e-40, Method: Compositional matrix adjust. Identities = 115/361 (31%), Positives = 182/361 (50%), Gaps = 9/361 (2%) Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 S Q + + WLS +G A + + VD + S+ + P + L +SQ L + D Sbjct: 170 SSQASGAMTDWLSRFGTARITLGVDEDFSLKNSQFDFLHPWYETPDNLFFSQHTLHRTDE 229 Query: 155 GLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW 214 N G+G R W+ G N F+D+ L RAG GAE W +YL+LS+N Y W Sbjct: 230 RTQINNGLGWRHFTPTWMSGINFFFDHDLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNW 289 Query: 215 H---EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 E E R A G+D+ A +P + HL + EQY+GD V LF+ NP A Sbjct: 290 RSAPELDNDYEARPANGWDVRAEGWLPAWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHA 349 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 ++ GLNYTP PL+T +A+ +QG+ GEN ++ ++ G ++KQL EV +SL G Sbjct: 350 ITAGLNYTPFPLMTFSAEQRQGKQGENDTRFAVDFTWQPGSAMQKQLDPNEVDARRSLAG 409 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDT 391 SR+D RNN LEYR+++ + + L T P K GE L ++++Y ++ + + Sbjct: 410 SRFDLVDRNNNIVLEYRKKELVRLTL-TDPVTGKSGEVKSLVSSLQTKYALKG--YNVEA 466 Query: 392 QILSLTPGAQANSAEGWTLIMPDWQ--NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 L G + + + +P ++ + N W + V ED +G S+ E ++ +V Sbjct: 467 TALEAAGGKVVTTGKDILVTLPAYRFTSTPETDNTWPIEVTAEDVKG-NFSNREQSMVVV 525 Query: 450 E 450 + Sbjct: 526 Q 526 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 164 bits (415), Expect = 6e-39, Method: Compositional matrix adjust. Identities = 108/349 (30%), Positives = 169/349 (48%), Gaps = 9/349 (2%) Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQ-- 151 ++ N + S L G A V +D++ + + +PL + + L ++Q GL + Sbjct: 192 FEEEANAYAASLLGAMGTARTRVTLDDDFNMVTAEADLLLPLAEEQQTLLFTQFGLRRNG 251 Query: 152 QDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPF 211 QD ++N+GVGQR W++GYN F D L RAG GAEAW +YL+L ANFY P Sbjct: 252 QDR-TIANLGVGQRHFLDRWMLGYNLFADYDLTNRHWRAGVGAEAWRDYLKLGANFYTPL 310 Query: 212 AAWHEQTATQ--EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 ++W + + E+R ARG D+ +P Y + S++ EQY G+RV L ++ +P Sbjct: 311 SSWRDSPRFEGMEERAARGMDVRLEAYLPAYPQWSASLTAEQYLGERVGLLDADQLERDP 370 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 A++ GL+Y P PL+ + + + ++ L L ++ G L L+ V +SL Sbjct: 371 HAITAGLHYNPFPLLKMDVEQVEASGRQHDTRFTLGLEWKLGATLWDMLNPSSV--DKSL 428 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 G R+D +RNN LEYR + L L + G+ + L L I+ I + W G Sbjct: 429 AGMRHDLIERNNDMVLEYRDKVLLKASL-NDQYSAVEGQALTLTLNIQHSRQIASIQWLG 487 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGE-GASNHWRLSVVVEDNQGQ 437 D LS A + L +P G SN + + +V D G Sbjct: 488 DVLGLSGLSPADTAGQDKRALTLPSLPTYRIGQSNQYPVVAIVTDIDGH 536 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 164 bits (414), Expect = 9e-39, Method: Compositional matrix adjust. Identities = 101/360 (28%), Positives = 180/360 (50%), Gaps = 16/360 (4%) Query: 41 DGLPDLGMAPENHDG---EKHFAEIVKDFGETSMN-----DNGLDTGEQAKAFALGKVRD 92 D LP LG +G E + ++G+ + N N + + A+ +A K + Sbjct: 41 DDLPTLGGQAIQFEGTQPEDSTERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAAN 100 Query: 93 ALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQ 152 + ++ WLS GNA +++ +D + S+ W VP + L +SQ + + Sbjct: 101 MATDEITH----WLSKAGNARLNINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRT 156 Query: 153 DNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 D L +N G+G R + N ++G N F+D+ L R GFG E +Y+R+SAN Y + Sbjct: 157 DGRLQTNNGIGLRHFQQNSMIGVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLS 216 Query: 213 AWHEQTATQEQ---RMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 W + + R A G+D+ +P Y +L ++ LEQY+GD V LF +P Sbjct: 217 TWRSASELADDYNARPANGWDIQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDP 276 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 +A ++G+N++P PL+ + A+HK G SG N+ N + N+ G L + L VA ++ + Sbjct: 277 MAATVGVNWSPFPLLAINAEHKIGNSGTNETNAKVAFNWLLGRSLAQHLDTSAVAATRHI 336 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 +RYD RNN LEY+++ +++ L + GE + + + ++Y + +++ + Sbjct: 337 STNRYDFINRNNNIVLEYQKKSLISLSLPKVIQGMT-GEELSIIRNLTTKYPLEKIVIEA 395 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 160 bits (406), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 114/376 (30%), Positives = 176/376 (46%), Gaps = 15/376 (3%) Query: 85 FALGKVRDALSQQVNQHVESWLS---PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY 141 +A GK++ Q N V + P V +++ + S F+P+Q+ Sbjct: 56 YAKGKIKALPGQAANHLVNEGIKSAFPEIIFRGGVNLEDGAKYRSSEFDMFIPVQETTSS 115 Query: 142 LTWSQLGLTQQDNGLVS-----NVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEA 196 L + QLG DN NVG+G R WL+G NTF D + + R G G E Sbjct: 116 LLFGQLGFRDHDNSSFDGRTYVNVGMGYRQEVNGWLLGVNTFLDADIRYSHLRGGIGGEV 175 Query: 197 WGEYLRLSANFYQPFAAWHEQTATQ--EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFG 254 + + L S N+Y P W A + ++R A G+DL + +P + + ++ EQY+G Sbjct: 176 YKDSLAFSGNYYFPLTGWKTSAAHELHDERPAYGFDLRTKGTLPDFPWFSGELTYEQYYG 235 Query: 255 DRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPL 314 D+VDL +GT NP A L + PVPL+ V A ++ +G +Q GL +NY FG PL Sbjct: 236 DKVDLLGNGTLSRNPRAAGADLVWNPVPLLEVRAGYRDAGNGGSQAEGGLRVNYSFGTPL 295 Query: 315 KKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKL 374 +QL V + S +R RN + YR++ + A P L G V L Sbjct: 296 HEQLDYRNVG-APSNTTNRRAFVDRNYDIVMAYREQASKIRITAMPVSGLS-GTLVTLMA 353 Query: 375 QIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDN 434 + SRY I ++ W GD ++L+ G Q + G LI+P + L + V D+ Sbjct: 354 TVDSRYPIEKVEWSGDAELLA---GLQLQGSLGSGLILPQLPLTVTDGQEYSLYLTVTDS 410 Query: 435 QGQRVSSNEITLTLVE 450 +G RV+S I + + + Sbjct: 411 RGTRVTSERIPVRVTQ 426 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 157 bits (397), Expect = 8e-37, Method: Compositional matrix adjust. Identities = 107/381 (28%), Positives = 182/381 (47%), Gaps = 23/381 (6%) Query: 98 VNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNG-- 155 + ++ WL P + +++ + + +PL + + + QLGL DN Sbjct: 91 ITSGIKHWL-PEAQFRGGITLEDASKYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSF 149 Query: 156 ---LVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 N G+G R G+WL+G N+F D + + R G E + + + L+ N+Y P + Sbjct: 150 NGRFFVNTGIGWRQDVGDWLLGINSFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLS 209 Query: 213 AWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPV 270 W ++R A G D+ + +P ++ EQYFGD+VD+ + + +P Sbjct: 210 DWKASKVQPLHDERPATGIDVRLKGALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPA 269 Query: 271 ALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV--AESQS 328 A + + + PVPLV + A +K S +Q GLNLNY FGVPL+ QL +V A + + Sbjct: 270 AFTGAITWKPVPLVEIKAGYKDAGSSGSQTEAGLNLNYTFGVPLRAQLDPSQVRPASNTT 329 Query: 329 LRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQ 388 R + D RN +EYR++ + A+P + + G+TV L I SRY + ++ W Sbjct: 330 NRTAFVD---RNYNIVMEYREQASRIRVYASPV-NGQSGDTVTLSATINSRYPVERIEWT 385 Query: 389 GDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 GD +++ G Q L +PD + + L + V D++G V+S I +T+ Sbjct: 386 GDAELIG---GLQQQGNVNSGLRLPDLSLDVTENKEYSLYLKVTDSRGNSVTSERIPVTV 442 Query: 449 ------VEPFDALSNDELRWE 463 P+ + +DE+R E Sbjct: 443 SINPESFTPYLNVLHDEVRRE 463 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 148 bits (373), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 100/378 (26%), Positives = 174/378 (46%), Gaps = 13/378 (3%) Query: 86 ALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWS 145 ALG S N ++ WLS WG + D++ S W +P+ D D + Sbjct: 156 ALGVASSMASSAANNAIQKWLSQWGTVESQLSFDSKASLKNSSLDWLIPIYDTDENTWFI 215 Query: 146 QLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSA 205 Q G +D+ N+G G R W+ G N F+D + N +R G G EA +YL +++ Sbjct: 216 QAGGRNKDSRNTVNLGWGVRHVYNGWMYGLNNFFDYDITGNNRRLGLGVEARTDYLSIAS 275 Query: 206 NFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 N Y WH+ ++R A G+D+ +P Y + + EQY+GD V LF Sbjct: 276 NAYLRMNNWHQSRDFYDYDERPANGFDMRVNGWLPAYPQIGGKLVYEQYYGDEVGLFGKD 335 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV 323 +P A++ G+++TP PL+++ HK G++G++ ++ L L +R L QL V Sbjct: 336 DRQKDPKAITAGVSWTPFPLLSLGVDHKIGQAGKHDTSVNLQLTWRPSDSLSSQLMPDNV 395 Query: 324 AESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIR 383 A S+ L SRYD RNN LEYR+++ +++ L+ + G + + + ++ G+ Sbjct: 396 AASRLLSKSRYDLVDRNNNIVLEYRKQQLISLKLSHGEINAPGGTSHTIIATVAAKSGLS 455 Query: 384 QLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQN----------GEGASNHWRLSVVVED 433 + W ++ +A + + +P + N G N + L V + Sbjct: 456 DITWNA-ANFIAAGGKIKAIDKTVFAITLPPYINQGSDRKTQKSGAQGGNAYTLIAVAQS 514 Query: 434 NQGQRVSSNEITLTLVEP 451 + G E+ + ++ P Sbjct: 515 DDGSISEPKELHVNVLPP 532 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 137 bits (345), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 77/191 (40%), Positives = 112/191 (58%), Gaps = 8/191 (4%) Query: 42 GLPDLGMAPENHDG-EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQ 100 LP+LG A N G EK A + + E + ++N T + +++ LG+ +D + ++ Q Sbjct: 56 ALPNLGSASVNESGTEKKLATLARQMAEVNQDEN---TDQTWRSYLLGEAKDRVLDRLQQ 112 Query: 101 HVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND-RYLTWSQLGLTQQDNGLVSN 159 E+ LSP G +V + VD G F GS G +PL D R LT+SQLGL D+G+V N Sbjct: 113 KSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRGLTYSQLGLQGVDDGVVGN 172 Query: 160 VGVGQRWARGNWLVGYNTFYDNLLDENLQRAG-FGAEAWGEYLRLSANFYQPFAAWHEQT 218 +G+ QRW G WL+GYN FYD L+++ R G GAEA +YL LS+N+Y P + H Sbjct: 173 MGLRQRWNAGRWLLGYNVFYDQYLNQDASRRGSIGAEARSDYLTLSSNYYYPLSGMHAAN 232 Query: 219 ATQEQ--RMAR 227 +++ RMAR Sbjct: 233 DDEDELLRMAR 243 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 118 bits (295), Expect = 5e-25, Method: Compositional matrix adjust. Identities = 67/193 (34%), Positives = 101/193 (52%), Gaps = 3/193 (1%) Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 + + NQ ++ WL +G A V + VD + S P+ D + ++Q + + D+ Sbjct: 101 TAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 Query: 155 GLVSNVGVGQRWARGN-WLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 SN+G G R GN W+ G NTF D+ L + R G GAE W +YL+LSAN Y + Sbjct: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 Query: 214 WHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 W + ++ +R A G+D+ A +P + L S+ EQY+GD V LF +P A Sbjct: 221 WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHA 280 Query: 272 LSLGLNYTPVPLV 284 +S + YTPVPL Sbjct: 281 ISAEVTYTPVPLT 293 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 93.6 bits (231), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 71/222 (31%), Positives = 105/222 (47%), Gaps = 6/222 (2%) Query: 137 DNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLV-GYNTFYDNLLDENLQRAGFGAE 195 D DR L QLG Q++ N GV R A G+ L+ G N F D + + R GAE Sbjct: 162 DQDRALLL-QLGGHNQNHRPTVNAGVVARSAAGSSLILGGNAFLDYEVGKRHLRGSLGAE 220 Query: 196 AWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYF 253 A L N Y P + W ++ +E+R A G+D+ R Q L + ++ Sbjct: 221 AVAAQFTLYGNVYAPLSGWKAAKRAERREERPAAGWDVGFTARPEAVQGLALNAQYFRWR 280 Query: 254 GDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVP 313 G +VD F+ G NP G+ Y PVPL+ V + + +SGE Q ++ L + G P Sbjct: 281 GAQVDYFDDGRYRRNPSGFKYGIEYRPVPLIGVGVEQARLQSGERQTSVQLGVRLNLGEP 340 Query: 314 LKKQLSAGEVAESQSL-RGSRY-DNPQRNNLPTLEYRQRKTL 353 L +QL G + RG+R D +R N L+ R++K + Sbjct: 341 LSRQLRRGAQDTAPPFDRGARLQDFVRRENRIVLDTRRKKIV 382 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 85.1 bits (209), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 3/155 (1%) Query: 92 DALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQ 151 D LS Q + V WL +GNA + + VD + + P D+ Y+ +SQ L + Sbjct: 82 DTLSAQATKEVVDWLQQYGNARIKLNVDESFTLKDAAFDFLYPWMDSKDYVLFSQTSLHR 141 Query: 152 QDNGLVSNVGVGQR-WARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 D+ +N+G+G R + N ++G N FYD L + RAG G E W +Y+R AN Y Sbjct: 142 TDDRNQANIGLGLRHFTTDNAMLGANIFYDYDLSRHHSRAGLGVEYWRDYMRFGANTYFG 201 Query: 211 FAAWHEQTATQE--QRMARGYDLTARMRMPFYQHL 243 + W + + +R A G+D++A +P Y L Sbjct: 202 LSDWKDSRDIDDYFERPANGWDVSAEGWLPVYPQL 236 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats. Identities = 72/334 (21%), Positives = 148/334 (44%), Gaps = 32/334 (9%) Query: 40 NDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVN 99 ND D ++ E+ + ++++ G +++ E K A + + ++ Sbjct: 287 NDNKKDNNLSKEDQEFSNKVMKVIQTAGAIYDSEDSKSKEEIVKNMASSYLNTSANELAK 346 Query: 100 QHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPL--QDNDRYLTWSQLGLTQ-QDNGL 156 + ++S L+ N + F+G+ + +P+ +DN + + Q G+ + ++ Sbjct: 347 EFIDS-LNTSINTDFSFNYNERSGFSGNAKA-LLPIVSEDNPKISYFLQSGIGEFANDRT 404 Query: 157 VSNVGVGQRW--------ARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFY 208 + + G G R+ GN ++G N+ YD+ +R GAEA + L +AN Y Sbjct: 405 IGHFGGGIRYYPNATALNNSGNIMLGLNSVYDHDFSRGHKRMSLGAEAMVDTLAFNANVY 464 Query: 209 QPFAAW-----HEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 Q ++W ++ QE R A G+D + P +++ + Q++G++V +F + Sbjct: 465 QRLSSWIDSYDFDKDYVQE-RPANGWDAKIKYAFPSLINVSFFAKMGQWYGNKVGIFGAN 523 Query: 264 TG---YHNPVALSLGLNYTPVPLVTVTAQH-KQGESGENQNNLGLNLNYRFGVPLKKQ-- 317 + NP+ G++Y+P P +T T H + ES + ++ N+N +PL ++ Sbjct: 524 SVDDLEKNPLIYEGGISYSPFPALTFTLSHSRSAESSKKNTSINANIN----IPLDEKAM 579 Query: 318 ---LSAGEVAESQSLRGSRYDNPQRNNLPTLEYR 348 S ++ G+R R+ LEYR Sbjct: 580 KLAFEPKLAGISNTIEGTRTQFIDRDYSMVLEYR 613 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 44/151 (29%), Positives = 65/151 (43%), Gaps = 2/151 (1%) Query: 170 NWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMAR 227 + VG N F D +N R G E L N Y P + W ++ +E+R A Sbjct: 185 HMAVGANAFLDYEFGKNHLRGSLGGEVIAPQFTLYGNVYAPMSGWKAAKRAERREERPAS 244 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 G+D+ R++ L ++ G VD F++G N G+ Y PVPLV V Sbjct: 245 GWDVGVRLQPEALPGLAIKGQYFRWSGAAVDYFDNGRPQRNARGYKYGVEYRPVPLVAVG 304 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQL 318 + + G Q + L +N G PL +QL Sbjct: 305 LEQTKVLGGARQTTVQLGVNLSLGEPLSRQL 335 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 68.2 bits (165), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 59/214 (27%), Positives = 93/214 (43%), Gaps = 8/214 (3%) Query: 146 QLGLTQQDNGLVSNVG-VGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLS 204 QLG Q++ +N G V +R +VG N F D + R G E L Sbjct: 180 QLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLEVIAPEFSLY 239 Query: 205 ANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNS 262 N Y P + W ++ +E++ A G D+ R F L+ S + ++ G VD F++ Sbjct: 240 GNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWNGAEVDYFDN 299 Query: 263 GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQL---S 319 G +G+ Y PV LV+V + + G + + L LN PL KQL + Sbjct: 300 GRTQAGAKGFKVGVEYRPVSLVSVGLEQTKVIGGGRETRMQLGLNINLSEPLSKQLRRDA 359 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTL 353 +G A S R R+ +R N L R+++ + Sbjct: 360 SGTPAFSPDAR--RHALVERENRIVLNTRRKEII 391 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 65/233 (27%), Positives = 102/233 (43%), Gaps = 20/233 (8%) Query: 135 LQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDENLQRAGFG 193 L ++R Q GL Q+ +N G+ R A +VG N F D + R G Sbjct: 102 LMVSERRALMLQAGLHNQNQRPTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLG 161 Query: 194 AEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQ 251 EA + L AN+Y P + W + + +E+R A GYDL ++ + +SL+ Sbjct: 162 LEAIAPHYSLYANYYAPLSGWKGARRDSRREERPAAGYDLGGQLSS------DAGLSLQA 215 Query: 252 -YF---GDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 YF G +D+F+SG N G+ Y P L + + G+ Q ++ LN+ Sbjct: 216 AYFRWHGAGIDVFDSGRAQRNASGFRYGVAYQPGALFNIGLNQTRTLDGQKQTSVQLNVR 275 Query: 308 YRFGVPLKKQLSAGEVAESQ--SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLA 358 P +QL ESQ +L R+ +R + L R RK +T+ L+ Sbjct: 276 INLQEPPSRQLR----RESQPFNLTSRRHQWVERESRIVLNTR-RKAITLPLS 323 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 3/208 (1%) Query: 146 QLGLTQQDNGLVSNVG-VGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLS 204 QLG Q+N +N+G V +R ++G N F D + R G EA Sbjct: 176 QLGGHNQNNRPTANLGGVYRRDINERLMLGANAFLDYEFAKQHLRGSLGVEAIAPEFSFY 235 Query: 205 ANFYQPFAAW--HEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNS 262 N Y P + W ++ +E+R A G DL + F L+ + ++ G VD F++ Sbjct: 236 GNVYAPMSGWTGAKRDNRREERPASGMDLGMKYSPGFAPGLSLKANYFRWNGAAVDYFDN 295 Query: 263 GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGE 322 G G+ Y PVPL+++ + + G +Q ++ L + PL KQL G Sbjct: 296 GRTQDRATGFKYGVQYKPVPLLSLGVEQTRVIGGASQTSVQLGVALNLSEPLSKQLRRGG 355 Query: 323 VAESQSLRGSRYDNPQRNNLPTLEYRQR 350 +L R +R N L RQ+ Sbjct: 356 ETPVFNLDAHRNALVERENRIVLNTRQK 383 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 73/305 (23%), Positives = 122/305 (40%), Gaps = 24/305 (7%) Query: 158 SNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ 217 +++G+G R + G N F+D L R GAE +Y S N Y P + W + Sbjct: 39 AHLGLGYRQLDDSQFFGVNVFFDYDLSRQHTRVSVGAEYGLDYGTFSTNAYFPLSNWKDS 98 Query: 218 TATQE------QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 E ++ A+G+DL +P ++ QY G V+ + NP Sbjct: 99 PDHYEGMNSLVEKAAKGWDLNLETYLPLDTRWKFGLTAGQYLGRYVEHSDGSLPSKNPYH 158 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL-- 329 SL + P P + ++ + + Q G+N + + L QSL Sbjct: 159 FSLSTEFRPDPAWAFSLGYQTEQGAKEQWIAGIN----YSLSLSGLYEGERRLSQQSLLP 214 Query: 330 RGSRY-DNPQRNNLPTLEYRQRKTLTVFLATPPWDLK---PGETVPLKLQIRSRYGIRQL 385 + R D QR++ LEY+Q K + + P L + + ++++ I Sbjct: 215 KPERLTDFVQRDHNMVLEYKQ-KFAEISIRLPESALVTELSQQMLSSWMEVKGGADIVSY 273 Query: 386 IWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEIT 445 WQGD + QA+S T I P ++ A+N LSV + GQ SN + Sbjct: 274 QWQGDAA--NYLNDIQASSP---TFIAPAYRY--DANNTLSLSVSYKLRSGQIKQSNTMK 326 Query: 446 LTLVE 450 +T+ + Sbjct: 327 ITVTD 331 >UniRef50_C0B2E7 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E7_9ENTR Length = 815 Score = 54.7 bits (130), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 43/151 (28%), Positives = 71/151 (47%), Gaps = 9/151 (5%) Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPP 361 +GL+ NY+FG P+ QL + + L S+YD RNN +Y+++ L+ L TP Sbjct: 1 MGLSFNYQFGTPINAQLDPNNIKPLRLLENSKYDFVDRNNNIVFDYQEQSYLS--LKTP- 57 Query: 362 WDLKPG---ETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNG 418 DL G E + + + S G+ + G ++ L + L +P + Sbjct: 58 -DLIEGYSNEQKTVTISVESSAGLDYIDIDG-SRFLQHGGRIIEQGQNSYLLYLPYYDQQ 115 Query: 419 EGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 EGA+N + + D +G R SS+E T +V Sbjct: 116 EGATNTYNIVATAYDKKG-RASSSETTKVVV 145 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 35/111 (31%), Positives = 53/111 (47%), Gaps = 7/111 (6%) Query: 133 VPLQDNDRYLTWSQLGLTQQDNG---LVSNVGVGQRWAR--GNWLVGYNTFYDNLLDENL 187 + DN + T L + Q+ N ++ N+G+G R N++ G NTFYD L E Sbjct: 69 IETTDNSNFFTQFSL-MNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQ 127 Query: 188 QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMP 238 R G G EA G L L+AN Y + +EQ ++ G+D ++P Sbjct: 128 DRLGLGIEAKGSILDLTANSYTKISNSEVVNGDREQVLS-GWDFNLTSQIP 177 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 69/287 (24%), Positives = 121/287 (42%), Gaps = 19/287 (6%) Query: 77 DTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVD-NEGHFTGSRGSWFVPL 135 D EQ K+ + ++ + S V+ + + LSP + +V+V + EG T G Sbjct: 30 DDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFD-TVEVSTNLKEGDSTVDIG-VLKAF 87 Query: 136 QDNDRYLTWSQLGLTQQDNGLVSNVGVGQRW--ARGNWLVGYNTFYDNLLDENLQRAGFG 193 DN ++Q+ L + D N+G G R A W+ G N FYD+ + +R G G Sbjct: 88 GDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAFYDHEFPNDHKRNGVG 147 Query: 194 AEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYF 253 E L N Y + + + + ++ G D+ ++ +P+ + ++ Q+ Sbjct: 148 FEVVSSVLESRVNSYNGTTGYIKDKSGTDSKVLDGRDMGFKVALPYLPGMMFGMNAVQWK 207 Query: 254 GDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFG-- 311 G +D + SL N + L V +K + ++ +++ LN + FG Sbjct: 208 G--IDGLKDQKMRKYSLGGSLSDN---LSLSYVRTDYKDA-AKKDIDSISLNYTWAFGQE 261 Query: 312 ---VPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTV 355 P LS + E + L RYD +R N L ++ TLTV Sbjct: 262 KHVRPTLFALS-DKAYEFKKLGAERYDLVKREN--NLVKKKSGTLTV 305 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 34/111 (30%), Positives = 52/111 (46%), Gaps = 5/111 (4%) Query: 134 PLQD--NDRYLTWSQLGLTQQDNGLVS-NVGVGQRWARGN--WLVGYNTFYDNLLDENLQ 188 P+ D ++ + ++Q L D+ + N+G G R + LVGYN FYD+ LD + Q Sbjct: 33 PISDPSDNENIIFTQASLFLSDDSRETINLGFGNRKLINDDTLLVGYNLFYDHELDYDHQ 92 Query: 189 RAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPF 239 RA G EA L AN Y + W ++ G D+ M +P+ Sbjct: 93 RASIGIEAISSVGSLRANQYYGLSGWKSGLNNINEKALNGSDVELGMPLPY 143 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 48.1 bits (113), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 43/197 (21%), Positives = 87/197 (44%), Gaps = 9/197 (4%) Query: 130 SWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQR-WARGNWLVGYNTFYDNLLDENLQ 188 S++ + + L++ QLG+ + + N G G+R + +GYNTFY + + Sbjct: 100 SFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKYAIGYNTFYHCPISKQSS 159 Query: 189 R---AGFGAEAW-GEYLRLSANFYQPFAAWHEQTATQEQRMA---RGYDLTARMRMPFYQ 241 + G E W L + N+Y ++ +T+ Q+ + G+ L + + P + Sbjct: 160 QPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHYPRSGHQLYIQTKFPRFF 219 Query: 242 HLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNN 301 + LEQ+ ++ ++ LSL LNY P+P++ + + N Sbjct: 220 EFTGKIKLEQFIYEKKYK-KIFNKKNSDYYLSLDLNYQPIPMLGFSINNIFVNKQYNSTI 278 Query: 302 LGLNLNYRFGVPLKKQL 318 + + Y+FG P+ +Q+ Sbjct: 279 CRVLIAYQFGTPIIEQI 295 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust. Identities = 65/268 (24%), Positives = 99/268 (36%), Gaps = 31/268 (11%) Query: 59 FAEIVKDFGETSMNDNGLDTGEQAKAFALGKVR-DALSQQVNQHVESWLSPWGNASVDVK 117 FA IVK+ G N + GE+A + + DA + ++Q + LS ++ Sbjct: 29 FATIVKNIG----NALNIGQGEEAVESEVNTLAVDAANAGLDQVEDKVLSTSNFTHFELS 84 Query: 118 V-------DNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR-- 168 V D T + L++ + ++Q +N N G G R Sbjct: 85 VGSDTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNNRTTINTGFGARHINDA 144 Query: 169 GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARG 228 + GYN FYD L +R G G E AN YQ + QE + G Sbjct: 145 NTVITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVSKTLTYNGIQETAL-DG 203 Query: 229 YDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVT--V 286 YD +P++ N L + D + T ++ G+N P +T V Sbjct: 204 YDAKLTANLPYFYSSNLYGKLSNW----KDAASYETEHYEA-----GINAEIAPNLTLRV 254 Query: 287 TAQHKQGESGENQNNLGLNLNYRFGVPL 314 AQHK+ N NN + + VPL Sbjct: 255 AAQHKK-----NSNNTEAVASINYSVPL 277 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust. Identities = 33/110 (30%), Positives = 48/110 (43%), Gaps = 4/110 (3%) Query: 134 PLQDNDRYLT--WSQLGLTQQDNGLVSNVGVGQRWARGN--WLVGYNTFYDNLLDENLQR 189 PL D D ++Q + +DN N+G+G R N L G N FYD+ + R Sbjct: 103 PLSDPDDIFNTYFTQGSVFYEDNRTTLNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGR 162 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPF 239 G EA ++AN Y W E+R GYD+ A + +P+ Sbjct: 163 TSIGLEARTTVWEINANKYWATTKWKTGKNGLEERALDGYDIEAGVPLPY 212 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust. Identities = 39/189 (20%), Positives = 73/189 (38%), Gaps = 16/189 (8%) Query: 140 RYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLVGYNTFYDN---LLDENLQRAGFGAE 195 + L + Q+G+ + G G+R ++GYN Y + + G E Sbjct: 126 KILYFLQIGMKNFTENKMIVFGSGKRLVYNKKHIIGYNACYHHPISTIQSQPYSINIGGE 185 Query: 196 AWGEYLRLSANFY----QPFAAWHEQTATQEQRMAR-GYDLTARMRMPFYQHLNTSVSLE 250 W L+ N Y + F ++ + + + GY + A+ P+ + E Sbjct: 186 YWYRNLKFIFNNYYNINEIFYSYKNISNHHYYQYPKIGYQICAKSNFPYISEFIGQIKFE 245 Query: 251 QYFGDR----VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNL 306 Q D+ + +N+ H L + L Y P+P+ ++ ++ + L Sbjct: 246 QCVYDKTRNNIRFWNANNKNH---ILCVSLEYQPIPMFNLSINNRFIYKKYCNTFFTITL 302 Query: 307 NYRFGVPLK 315 NY+F VPLK Sbjct: 303 NYQFHVPLK 311 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 42.0 bits (97), Expect = 0.048, Method: Compositional matrix adjust. Identities = 43/179 (24%), Positives = 76/179 (42%), Gaps = 10/179 (5%) Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGA--EAWGE 199 L W + ++ L+SN G+G VG N F+D ++ R G+ + Sbjct: 124 LVWQANIDYKNEDILISN-GIGILPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPNY 182 Query: 200 YLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 LS+N Y P + E + D+ A + ++S LE +FGD + + Sbjct: 183 IFNLSSNIYFPLSG-----KGSEDDLVNSIDIRAEGAITPTVQFHSS--LEFFFGDDIQI 235 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQL 318 + +N + GL+YTP+PL+ + + + + + + L NY PL +QL Sbjct: 236 NDDYDPTNNSHKFTAGLDYTPIPLLQLGVEATKVQDHDVGYGVYLYFNYDPWRPLNEQL 294 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 607 e-172 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 543 e-153 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 535 e-150 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 531 e-149 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 530 e-149 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 529 e-149 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 525 e-147 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 524 e-147 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 521 e-146 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 517 e-145 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 516 e-145 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 513 e-144 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 512 e-144 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 512 e-143 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 509 e-143 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 507 e-142 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 505 e-141 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 504 e-141 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 504 e-141 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 503 e-141 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 502 e-140 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 502 e-140 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 499 e-139 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 498 e-139 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 492 e-138 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 490 e-137 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 489 e-137 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 482 e-134 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 481 e-134 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 479 e-133 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 476 e-133 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 474 e-132 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 473 e-132 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 472 e-131 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 470 e-131 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 467 e-130 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 467 e-130 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 462 e-129 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 461 e-128 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 460 e-128 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 460 e-128 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 456 e-127 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 454 e-126 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 452 e-125 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 450 e-125 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 449 e-125 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 443 e-123 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 442 e-122 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 441 e-122 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 439 e-121 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 430 e-119 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 416 e-114 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 413 e-114 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 408 e-112 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 407 e-112 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 403 e-111 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 398 e-109 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 397 e-109 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 391 e-107 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 376 e-103 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 373 e-102 UniRef50_P36943 Putative attaching and effacing protein homolog ... 324 4e-87 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 264 5e-69 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 246 1e-63 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 245 3e-63 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 239 1e-61 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 239 3e-61 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 237 5e-61 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 234 5e-60 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 220 7e-56 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 211 5e-53 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 198 3e-49 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 185 3e-45 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 177 1e-42 UniRef50_C0B2E7 Putative uncharacterized protein n=1 Tax=Proteus... 153 1e-35 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 117 7e-25 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 115 3e-24 Sequences not found previously or not previously below threshold: UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 133 1e-29 UniRef50_Q4ACI6 Invasin (Fragment) n=1 Tax=Edwardsiella tarda Re... 125 5e-27 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 108 5e-22 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 100 1e-19 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 100 2e-19 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 96 2e-18 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 95 7e-18 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 89 4e-16 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 88 9e-16 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 87 1e-15 UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured b... 85 7e-15 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 82 7e-14 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 81 9e-14 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 77 2e-12 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 74 1e-11 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 65 7e-09 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 61 1e-07 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 60 2e-07 UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Ca... 59 3e-07 UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylo... 59 3e-07 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 57 2e-06 UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmone... 57 2e-06 UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachl... 56 3e-06 UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthin... 55 4e-06 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 55 5e-06 UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Plancto... 53 2e-05 UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodosp... 52 3e-05 UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachl... 52 6e-05 UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Plancto... 50 1e-04 UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuni... 50 1e-04 UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magneto... 50 3e-04 UniRef50_UPI0000E0F7DB beta-glycosidase-like protein n=1 Tax=Gla... 48 5e-04 UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachl... 48 6e-04 UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillon... 48 6e-04 UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=... 48 8e-04 UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus ... 46 0.002 UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Plancto... 46 0.002 UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=... 46 0.003 UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root Re... 46 0.004 UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Plancto... 46 0.004 UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorick... 45 0.005 UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma p... 45 0.007 UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microco... 44 0.011 UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickett... 42 0.049 UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microco... 42 0.056 UniRef50_A1AQZ5 Fibronectin, type III domain protein n=2 Tax=Des... 41 0.064 UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=... 41 0.074 UniRef50_B9XJ25 Na-Ca exchanger/integrin-beta4 n=1 Tax=bacterium... 41 0.098 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 607 bits (1566), Expect = e-172, Method: Composition-based stats. Identities = 464/464 (100%), Positives = 464/464 (100%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA Sbjct: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN Sbjct: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD Sbjct: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 Query: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY 240 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY Sbjct: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY 240 Query: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN Sbjct: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 Query: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP 360 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP Sbjct: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP 360 Query: 361 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG 420 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG Sbjct: 361 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG 420 Query: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP Sbjct: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 543 bits (1399), Expect = e-153, Method: Composition-based stats. Identities = 286/434 (65%), Positives = 344/434 (79%) Query: 30 QKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGK 89 ++A NPFD N D LPDLG+APEN+ EKHFA ++K FGE S D+ L G+QA+ FA + Sbjct: 2 RQAQNPFDENGDNLPDLGLAPENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTR 61 Query: 90 VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGL 149 +RDA+S + ES LSPWGNA+VD+ VD EG+F GS GS F P QDN+RYLTWSQ+G+ Sbjct: 62 LRDAVSSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGV 121 Query: 150 TQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQ 209 +QQ+ GLV N G+GQRW G+WL+GYNTFYD L D++ RAGFGAEAWG+YLRLSAN+YQ Sbjct: 122 SQQNQGLVGNAGIGQRWTAGHWLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQ 181 Query: 210 PFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 P W + EQRMARGYD+TA+ +PFYQH+NTSVS EQYFGD+V+LF+SG+GYHNP Sbjct: 182 PLGGWEHRAGLLEQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYHNP 241 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 VA+ +GL+YTPVPLVTV+A H+QGESG +QN+LGL LNYRFGVPL KQLS EVA S+SL Sbjct: 242 VAVKVGLSYTPVPLVTVSAHHRQGESGVSQNDLGLKLNYRFGVPLNKQLSPDEVAASRSL 301 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 RGSRYD +R N+P +E+RQRKTL+VFLATPPWDL GETV LKLQ+RSR+GIRQL WQG Sbjct: 302 RGSRYDRVERTNVPVMEFRQRKTLSVFLATPPWDLSAGETVALKLQVRSRHGIRQLSWQG 361 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 DTQ LSLTP + SA+GWT+IMP W N GASN WRLSV VED QGQRV+SN ITL L Sbjct: 362 DTQALSLTPPIDSTSADGWTVIMPAWDNSPGASNSWRLSVTVEDEQGQRVTSNWITLKLS 421 Query: 450 EPFDALSNDELRWE 463 P L D+ R+E Sbjct: 422 VPVQTLPQDDPRYE 435 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 535 bits (1379), Expect = e-150, Method: Composition-based stats. Identities = 120/427 (28%), Positives = 201/427 (47%), Gaps = 11/427 (2%) Query: 35 PFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDAL 94 P + P A + + A I G N D A R Sbjct: 114 PAVTWAEETPVPASASKEDLQAQKIAGIASQAGNFLANSPRGDA-------AASIARGMA 166 Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 + + V+ WLS +G A + + VDN+ S+ +PL + L ++Q L + D+ Sbjct: 167 TGAASTEVQQWLSQFGTARLQLDVDNKFSLKNSQLDLLIPLYEQPDKLVFTQGSLHRTDD 226 Query: 155 GLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW 214 +N+G+G RW +++G NTF D L + R G G E W +YL++ AN Y W Sbjct: 227 RTQTNLGMGMRWFNDGYMLGGNTFLDYDLSRDHARMGMGVEYWRDYLKIGANNYLRLTNW 286 Query: 215 HEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 + ++R A G+D++ +P L ++ EQY+G V LF +P A+ Sbjct: 287 RDSKDFADYQERPANGWDMSLEGWVPALPQLGGNLKYEQYYGKEVALFGKDNRQKDPHAI 346 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS 332 ++G+NYTP PL+T +A +QG++G+N LG+ LN + G P + QL V ++L GS Sbjct: 347 TVGVNYTPFPLLTFSADQRQGKAGQNDTRLGVQLNIQLGTPWQHQLDTSAVGAMRTLAGS 406 Query: 333 RYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQ 392 RYD RNN LEYR+++ + ++ A GE L + I ++YG+ ++ W + Sbjct: 407 RYDLVDRNNNIVLEYRKKEVIHLYTADH-LAGYAGEQKSLNVSINTKYGLERIDWSAP-E 464 Query: 393 ILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPF 452 +L+ S + +++++PD+ N + +S V D G + TLT+ +P Sbjct: 465 LLAAGGKIVQESIDNYSIVLPDYNFDSANGNVYEISGVAIDTHGNVSKKAKTTLTVTQPA 524 Query: 453 DALSNDE 459 + E Sbjct: 525 INTTTSE 531 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 531 bits (1367), Expect = e-149, Method: Composition-based stats. Identities = 329/415 (79%), Positives = 376/415 (90%) Query: 48 MAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 M PE+H+GEK FAE+VK FGE SM DNGLDTGEQAK FA +VRDALS QVNQH+ESWLS Sbjct: 1 MMPESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLS 60 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 PWGNASV+V+VDN+G F GSRGSWF+P QDN RYLTWSQLGLT+Q++GLVSNVG+GQRWA Sbjct: 61 PWGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRWA 120 Query: 168 RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMAR 227 R WL+GYNTFYDNLLDE+LQRAG GAEAWGEYLRLSAN+YQPFA+WHE++ATQEQRMAR Sbjct: 121 RDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWHERSATQEQRMAR 180 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 GYD++A+MRMPFYQHL+T VS+EQYFGD VDLF+SG GYHNP+A+SLGLNYTPVPLVTVT Sbjct: 181 GYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLVTVT 240 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 AQHKQGESG +QNNLGLNLNYRFGVPLKKQL+A EVAES+SLRGSRYD+PQRN+LP +EY Sbjct: 241 AQHKQGESGVSQNNLGLNLNYRFGVPLKKQLAASEVAESKSLRGSRYDSPQRNSLPVIEY 300 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 RQRKTL+VFLATPPWDL+PGETVPLKLQ+RS +GIR + WQGDTQ LSLT GA A+S +G Sbjct: 301 RQRKTLSVFLATPPWDLQPGETVPLKLQVRSLHGIRHVSWQGDTQALSLTAGANADSIDG 360 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRW 462 WT+IMP W + EGA + WRLSVVVED +GQRVSSNEITL L EPF A+S+D+ RW Sbjct: 361 WTIIMPTWDSSEGAIHRWRLSVVVEDEKGQRVSSNEITLALTEPFMAMSDDDPRW 415 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 530 bits (1365), Expect = e-149, Method: Composition-based stats. Identities = 128/415 (30%), Positives = 203/415 (48%), Gaps = 14/415 (3%) Query: 52 NHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGN 111 + ++ A + +DN A R + N + WLS +G Sbjct: 49 SSQYKERLAHNLLKGATVLADDNTPLA-------AASMARSVAVGEANDAAQHWLSQFGT 101 Query: 112 ASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNW 171 A V + +DN GS +PL D+ + L +SQ GL D+ N+G G R + NW Sbjct: 102 ARVQLNLDNNLSLKGSAFDMLLPLYDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQDNW 161 Query: 172 LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGY 229 + G N F+D + R GFGAEAW +YL+LSAN Y WH+ +R A GY Sbjct: 162 MYGANVFFDRDITGKNNRIGFGAEAWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGY 221 Query: 230 DLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQ 289 DL +P Y + T++ EQY G+ V LF NP A + G+NYTP+PL+T+ A+ Sbjct: 222 DLRVEAYLPAYPQIGTNLKYEQYKGNEVALFGKDDRQKNPYAFTAGINYTPIPLITIGAE 281 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 + G+ G N N+ + LNYR G P + Q+ VA S++L GSRYD +RNN LEY++ Sbjct: 282 QRAGKGGRNDTNISIQLNYRLGEPWQSQIDPSAVAASRTLAGSRYDLVERNNNIVLEYQK 341 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 + + + L E + ++ Q+ ++YG++++ W I++ S++ + Sbjct: 342 QDLIQLVLPN-QMTGSAFEIIKVEAQVTAKYGLKRIDWDT-AVIVAAGGVVTQTSSQNIS 399 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + +P + SN + LS V DNQG + + +T+ + + N L+ P Sbjct: 400 IKLPPY---TAGSNVYMLSAVAYDNQGNTSNHSTTQITVTQQSVSHLNSTLQVSP 451 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 529 bits (1364), Expect = e-149, Method: Composition-based stats. Identities = 117/407 (28%), Positives = 203/407 (49%), Gaps = 11/407 (2%) Query: 52 NHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGN 111 + +G A++ G N D AL R +S + V+ WL+ +G Sbjct: 141 DDEGAMKMADMASRAGTLLSNSPDGDA-------ALSMARGQISAVASGQVQQWLNQFGT 193 Query: 112 ASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNW 171 A V ++ D S+ +P + + L ++Q L + D+ +N+G G R+ ++ Sbjct: 194 ARVQLEADEHFSLKNSQVDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAPSY 253 Query: 172 LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT--QEQRMARGY 229 ++G N F D L R G G E W ++L+LSAN Y + W + ++R A G+ Sbjct: 254 MLGGNIFGDYDLSHEHSRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANGW 313 Query: 230 DLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQ 289 D+ A+ +P L ++ EQY+G V LF NP A++ G+N+TP PL+ + A+ Sbjct: 314 DIRAQAWLPSLPQLGGKLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPLLMLGAE 373 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 H+QG SG+N + + +YR G+P ++Q++ VA +SL GSRYD +RNN L+YR+ Sbjct: 374 HRQGASGKNDKRISADFSYRLGLPWQQQINPQAVATMRSLAGSRYDLVERNNHILLQYRK 433 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 ++T+ + GE L + + S+YG+ ++ W + +L+ A W+ Sbjct: 434 KETVRLHTVDRVT-GYAGEKKSLGVSVNSQYGLERIDWSA-SSLLACGGQLVREDAGNWS 491 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 +I+P++Q G A N W +S V D +G + + +T+ + S Sbjct: 492 VILPEYQPGAQAVNTWTVSGVAVDKKGNVSARADTQVTVAQSAIDAS 538 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 525 bits (1352), Expect = e-147, Method: Composition-based stats. Identities = 136/438 (31%), Positives = 211/438 (48%), Gaps = 19/438 (4%) Query: 35 PFDNNND----GLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKV 90 P N+N L + M + D + AE+ + G D EQA + A G V Sbjct: 117 PLINSNSPEARNLKAMQMERDGKDPQMQVAEMAQQSGTLLARDMD---SEQAASMARGWV 173 Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 + S Q WLS WG A V + VD + S + P + L +SQ L Sbjct: 174 ASSASAQAT----DWLSRWGTARVSLGVDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLH 229 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 + D+ +N G+G R+ +W+ G N F D+ L R G G E W +YL+LS N Y Sbjct: 230 RTDDRTQTNHGIGWRYFTSSWMSGVNMFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLR 289 Query: 211 FAAWHEQT---ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 + W E R A G+DL A +P + L + EQY+GD V LF + Sbjct: 290 LSNWRSAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKLVYEQYYGDEVALFGKDERQN 349 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 +P A++ GL+YTPVPL++ +A+ +QG+ GEN +G+ L + G L+KQL EVA + Sbjct: 350 DPHAITAGLSYTPVPLISFSAEQRQGKQGENDTRIGMELTLQPGHSLQKQLDPAEVAARR 409 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 SL GSRYD RNN LEYR+++ + + L T P KPGE L ++++Y ++ + Sbjct: 410 SLVGSRYDLVDRNNNIVLEYRKKELVRLTL-TDPLKGKPGEVKSLVSSLQTKYALKG--Y 466 Query: 388 QGDTQILSLTPGAQANSAEGWTLIMPDWQNG--EGASNHWRLSVVVEDNQGQRVSSNEIT 445 + L G A S + + +P ++ N + ++V ED++G E Sbjct: 467 DIEAASLQSAGGKVAVSGKDIQVTIPPYRFTAMPETDNTYPIAVTAEDSKGNFSRREESM 526 Query: 446 LTLVEPFDALSNDELRWE 463 + + +P +L++ L + Sbjct: 527 VVVEKPTLSLTDSTLSVD 544 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 524 bits (1350), Expect = e-147, Method: Composition-based stats. Identities = 138/438 (31%), Positives = 210/438 (47%), Gaps = 19/438 (4%) Query: 35 PFDNNND----GLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKV 90 P N+N L + M + D + AE+ + G D EQA + A G V Sbjct: 117 PLINSNSPEARNLKAMQMERDGKDPQMQVAEMAQQSGTLLARDMD---SEQAASMARGWV 173 Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 + S Q WLS WG A V + VD + S + P + L +SQ L Sbjct: 174 ASSASAQAT----DWLSRWGTARVSLGVDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLH 229 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 + DN +N G+G R+ +W+ G N F D+ L R G G E W +YL+LS N Y Sbjct: 230 RTDNRTQTNHGIGWRYFTSSWMSGVNMFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLR 289 Query: 211 FAAWHEQT---ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 + W E R A G+DL A +P + L V EQY+GD V LF + Sbjct: 290 LSNWRSAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKVVYEQYYGDEVALFGKDERQN 349 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 +P A++ GL+YTPVPL++ +A+ +QG+ GEN +G+ L + G L+KQL EVA + Sbjct: 350 DPHAITAGLSYTPVPLISFSAEQRQGKQGENDTRIGMELTLQPGHSLQKQLDPAEVAARR 409 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 SL GSRYD RNN LEYR+++ + + L T P KPGE L ++++Y ++ + Sbjct: 410 SLVGSRYDLVDRNNNIVLEYRKKELVRLTL-TDPLKGKPGEVKSLVSSLQTKYALKG--Y 466 Query: 388 QGDTQILSLTPGAQANSAEGWTLIMPDWQNG--EGASNHWRLSVVVEDNQGQRVSSNEIT 445 + L G A S + + +P ++ N + ++V ED++G E Sbjct: 467 DIEAASLQSAGGKVAVSGKDIQVTIPPYRFTAMPETDNTYPIAVTAEDSKGNFSRREESM 526 Query: 446 LTLVEPFDALSNDELRWE 463 + + +P +L+ L + Sbjct: 527 VVVEKPTLSLAGSTLSVD 544 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 521 bits (1342), Expect = e-146, Method: Composition-based stats. Identities = 129/426 (30%), Positives = 211/426 (49%), Gaps = 23/426 (5%) Query: 32 AANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVR 91 + NP +NN + DL A G+ NDN D A R Sbjct: 146 SNNPNENNKKDVDDL------------LARNAMGAGKLLSNDNTSDA-------ASNMAR 186 Query: 92 DALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQ 151 A++ ++N + WL+ +G A V + VD++ S VPL+D++ L ++QLG+ Sbjct: 187 SAVTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLLVPLKDSESSLLFTQLGVRN 246 Query: 152 QDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPF 211 +D+ N+G G R +G+W+ G NTF+DN L +R G GAE +YL+ SAN Y Sbjct: 247 KDSRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLKFSANTYFGL 306 Query: 212 AAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 WH+ ++R A G+D+ +P Y L + E+Y GD V LF +P Sbjct: 307 TGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALFGKDDRQKDP 366 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 A++LG+NYTPVPLVT+ A+H++G+ N ++ + LNYR G P Q+ VA +++L Sbjct: 367 HAVTLGVNYTPVPLVTIGAEHREGKGNNNNTSVNVQLNYRMGQPWNDQIDQSAVAANRTL 426 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 GSRYD +RNN L+Y++++ + + L G + L Q+R++YG ++ W Sbjct: 427 AGSRYDLVERNNNIVLDYKKQELIHLVLP-DRISGSGGGAITLTAQVRAKYGFSRIEWDA 485 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 T + + + ++ +P +Q+ SN +S V D QG + ++ + Sbjct: 486 -TPLENAGGSTSPLTQSSLSVTLPFYQHILRTSNTHTISAVAYDAQGNASNRAVTSIEVT 544 Query: 450 EPFDAL 455 P + Sbjct: 545 RPETMV 550 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 517 bits (1332), Expect = e-145, Method: Composition-based stats. Identities = 122/421 (28%), Positives = 194/421 (46%), Gaps = 17/421 (4%) Query: 49 APENHDGEKHFAEIVKDFGETSMND-NGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 E A + + N E A+ +A + + WLS Sbjct: 46 LSEQDATAAQVAGMTTQAAGMLQSGMNSRQAAEMARGYA--------TSTAQSAFQEWLS 97 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 WG V + +D + GS +P D L ++Q + D+ N G G R Sbjct: 98 QWGTVRVTLGLDEDFTLKGSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHF 157 Query: 168 RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA---TQEQR 224 +++ G N F+D+ L R G G E W + L+L AN Y + W + E R Sbjct: 158 APDYMAGVNLFFDHDLTRYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEAR 217 Query: 225 MARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 A G+D+ A +P Y L ++ EQY+GD V LF +P A + GL+YTPVPL+ Sbjct: 218 PANGWDVRAEGYLPAYPQLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLI 277 Query: 285 TVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 +++A+ KQG+ GEN LNL Y GV L QL VA +SL GSR+D +RNN Sbjct: 278 SLSAEQKQGKGGENDTRFALNLTYTPGVSLAHQLDPDAVAYRRSLAGSRHDLVERNNNIV 337 Query: 345 LEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANS 404 LEYR+++ + + L P K GE PL ++S+Y ++ L + + L G Sbjct: 338 LEYRKKELVKLQL-HDPVTGKGGEQKPLVASLQSKYALKTL--RAEAAELQSAGGVVNTE 394 Query: 405 AEGWTLIMPDWQN--GEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRW 462 A T+ +P+++ N +R++V ED +G R + E ++ ++ P + + ++ Sbjct: 395 ANQVTVTLPEYRYTATPQTDNVYRVAVTAEDEKGNRSNREEASVVVLAPQLSAQHSQVTS 454 Query: 463 E 463 + Sbjct: 455 D 455 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 516 bits (1329), Expect = e-145, Method: Composition-based stats. Identities = 123/412 (29%), Positives = 209/412 (50%), Gaps = 12/412 (2%) Query: 56 EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVD 115 E A V+ N + E+ +R A + + N + WLS +G A V Sbjct: 111 ENKLASHVQTGATALATSNAAKSSER-------MIRSAANNEFNSSAQQWLSQFGTARVQ 163 Query: 116 VKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGY 175 + V+++ GS VP+ DN + + ++QLG +DN N+G G R + NW+ G Sbjct: 164 MNVNDDFKLDGSAVDVLVPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQNNWMYGV 223 Query: 176 NTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTA 233 NTF+DN + +R G GAEAW +YL+LSAN Y + WH+ +R A GYD+ A Sbjct: 224 NTFFDNDMTGKNRRVGVGAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRA 283 Query: 234 RMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQG 293 +P + L + E+Y G+ V LF NP A++ G+NYTP+PL+TV A+H+ G Sbjct: 284 EAYLPSHPQLGGKLMYEKYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPLLTVGAEHRAG 343 Query: 294 ESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTL 353 + +N +++ NYR G + ++ VA +++L GSRYD +RNN L+Y++++ + Sbjct: 344 KGSKNDSSINFQFNYRLGESWQSHINPSAVAATRTLAGSRYDLVERNNNIVLDYQKQELI 403 Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP 413 + L + K G+ + Q+ S+YG+ ++ W +++ S+ ++ +P Sbjct: 404 RLSLPERV-EGKAGDIATVNAQVTSKYGLERIDWD-SAALIAAGGTLSKGSSNSISITLP 461 Query: 414 DWQNGEGAS-NHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 +Q G + N + LS + D QG R +S+ + + + N + P Sbjct: 462 PYQASVGNTPNSYTLSAIAFDTQGNRSNSSSTLINVSPQNLSTGNSLMTATP 513 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 513 bits (1322), Expect = e-144, Method: Composition-based stats. Identities = 113/414 (27%), Positives = 206/414 (49%), Gaps = 13/414 (3%) Query: 52 NHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGN 111 + G + A++ G ++ D AL R ++ + + ++ WL+ +G Sbjct: 163 DDAGARKMADVASRAGAFLSDNPNGDA-------ALSLARGEVTAEASGQLQQWLNQFGT 215 Query: 112 ASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNW 171 A V + D F S+ PL + L ++Q L + D+ N+G G R+ ++ Sbjct: 216 ARVQLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQVNLGFGLRYFAPSY 275 Query: 172 LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGY 229 ++G N F D L R G G E W ++L+LSAN Y + W+ + ++R A G+ Sbjct: 276 MLGGNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNSSDFKDYQERPANGW 335 Query: 230 DLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQ 289 D+ A+ +P L ++ EQY+G V LF +P A++ G+N+TP PL+T+ A+ Sbjct: 336 DIRAQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAGVNFTPFPLLTLNAE 395 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 H+QG SG+N LG++ +Y+ G+P ++Q++ VA +SL GSRYD +RNN L+YR+ Sbjct: 396 HRQGASGKNDKRLGVDFSYQLGMPWQQQINPQAVATMRSLAGSRYDLVERNNHILLQYRK 455 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 ++ + + GE L + + S YG+ ++ W + +L+ + W+ Sbjct: 456 KEVIRLHTVGRVT-GYAGERKSLGVSVNSSYGLERIDWSA-SSLLAAGGKLVRENEGSWS 513 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 +I+P+ + G +N W ++ V D +G + + +T+ + S + E Sbjct: 514 VILPEHK--PGEANSWTITGVAVDKKGNVSTGADTQVTVAQAAIDASMSPVTPE 565 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 512 bits (1320), Expect = e-144, Method: Composition-based stats. Identities = 149/422 (35%), Positives = 235/422 (55%), Gaps = 7/422 (1%) Query: 40 NDGLPDLGM----APENHDGEKHFAEIVKDFGETSMNDNGLD-TGEQAKAFALGKVRDAL 94 LP+LGM P GE+ A G + N+ D QA+++A G+ + + Sbjct: 101 ASNLPELGMGNDPVPLVSSGEQKTAAAAHAVGAQNWNNMTSDQMKNQAESWAKGQAKAQV 160 Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 + Q + L +G A V++ VD+ G + S S F P +ND + +SQ+G+ +QDN Sbjct: 161 VDPLRQQAQELLGKFGKAQVNLAVDDNGSLSKSAFSLFSPWYENDAMVAFSQVGVHRQDN 220 Query: 155 GLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW 214 ++ N+G G R+ +G+WL G NTF D + N R G G E W + L+L++N+Y P + W Sbjct: 221 RMIGNLGAGVRFDQGDWLFGANTFLDQDISRNHSRLGLGLEWWADNLKLASNYYHPLSGW 280 Query: 215 HEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 + + +R ARG+D+ A+ +P YQ L S EQY+GD V LF +P A+ Sbjct: 281 KDSKDFDDYLERPARGFDVHAQGYLPAYQQLGASAVYEQYYGDEVALFGKDNLQKDPHAV 340 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS 332 ++G++YTP PL T+ HK G+ G+N LGL ++Y+ G L+KQL G VA +SL+GS Sbjct: 341 TVGVDYTPFPLATLKVSHKMGKDGKNNTELGLQVSYQIGTALEKQLDPGNVAAMRSLKGS 400 Query: 333 RYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQ 392 RYD RN LEY+++ L++ LA P L G+ ++ +RS+Y I + W GD Sbjct: 401 RYDLVDRNYDIVLEYKEKAVLSLDLAAVPMTLLEGDVYMMQPLVRSKYRITSVSWHGDAV 460 Query: 393 ILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPF 452 L L P A AN+ +GW + +P W GA+N + LS+ + D +G + +SN++ + + + Sbjct: 461 PLLLVPTAGANNPQGWQITLPAWDATPGATNLYTLSISIVDEKGHQATSNDVEIRVGQQR 520 Query: 453 DA 454 Sbjct: 521 LG 522 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 512 bits (1319), Expect = e-143, Method: Composition-based stats. Identities = 131/417 (31%), Positives = 212/417 (50%), Gaps = 5/417 (1%) Query: 50 PENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPW 109 +++ + + G L G+ AK+ VR A S + N + WLS + Sbjct: 129 SVDNNKDNRLSVENTLAGHAVAGATALSNGDVAKS-GERMVRSAASNEFNNSAQQWLSQF 187 Query: 110 GNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARG 169 G A V + ++++ H GS +PL DN++ + ++QLG +D+ N+G G R +G Sbjct: 188 GTARVQLNINDDFHLDGSAADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQG 247 Query: 170 NWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMAR 227 NW+ G NTF+DN L +R G GAEAW +YL+LSAN Y WH+ +R A Sbjct: 248 NWMYGANTFFDNDLTGKNRRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYNERPAN 307 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 GYDL A +P Y L E+Y GD V LF NP A++ G+NYTP+PLVT+ Sbjct: 308 GYDLRAEAYLPSYPQLGGKAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIPLVTIG 367 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 A+H+ G+ G+N +N+ LNYR G + + VA S++L GSRYD +RNN L+Y Sbjct: 368 AEHRAGKGGQNDSNINFQLNYRLGETWQSHIDPSAVAASRTLAGSRYDLVERNNHIVLDY 427 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 +++ + + L P + + Q+ + +G+ ++ WQ ++++ + S G Sbjct: 428 QKQNLVRLSLPDS-LAGDPFSQLSVTAQVTATHGLERIDWQ-SAELMAAGGVLKQTSKNG 485 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + +P++Q N + L+ + D QG S + +T+ ++N L P Sbjct: 486 LEITLPEYQMNRTGGNSYILNAIAYDTQGNASSQASMLITVNAQKINIANSTLVAVP 542 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 509 bits (1312), Expect = e-143, Method: Composition-based stats. Identities = 118/413 (28%), Positives = 191/413 (46%), Gaps = 15/413 (3%) Query: 54 DGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNAS 113 + E+ A + G D + A R S Q + + WLS +G A Sbjct: 136 NLEQQIASTSQQIGSLLAEDMNSE-------QAANMARGWASSQASGAMTDWLSRFGTAR 188 Query: 114 VDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLV 173 + + VD + S+ + P + L +SQ L + D N G+G R W+ Sbjct: 189 ITLGVDEDFSLKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTPTWMS 248 Query: 174 GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT---ATQEQRMARGYD 230 G N F+D+ L RAG GAE W +YL+LS+N Y W E R A G+D Sbjct: 249 GINFFFDHDLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWD 308 Query: 231 LTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQH 290 + A +P + HL + EQY+GD V LF+ NP A++ GLNYTP PL+T +A+ Sbjct: 309 VRAEGWLPAWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPLMTFSAEQ 368 Query: 291 KQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQR 350 +QG+ GEN ++ ++ G ++KQL EV +SL GSR+D RNN LEYR++ Sbjct: 369 RQGKQGENDTRFAVDFTWQPGSAMQKQLDPNEVDARRSLAGSRFDLVDRNNNIVLEYRKK 428 Query: 351 KTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTL 410 + + + L T P K GE L ++++Y ++ + + L G + + + Sbjct: 429 ELVRLTL-TDPVTGKSGEVKSLVSSLQTKYALKG--YNVEATALEAAGGKVVTTGKDILV 485 Query: 411 IMPDWQ--NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELR 461 +P ++ + N W + V ED +G + + + + P + + + Sbjct: 486 TLPAYRFTSTPETDNTWPIEVTAEDVKGNFSNREQSMVVVQAPTLSQKDSSVS 538 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 507 bits (1307), Expect = e-142, Method: Composition-based stats. Identities = 120/425 (28%), Positives = 198/425 (46%), Gaps = 19/425 (4%) Query: 44 PDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVE 103 P L A + A+ V + E A G R + N+ ++ Sbjct: 152 PKLSSAMREPSRAEKEAQAVGQLMSVGATLSSTRPSEA----AAGMARSMATNAANEEIQ 207 Query: 104 SWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVG 163 WLS +G A V + +D + S WF+P+ D+ ++QLG +D N+GVG Sbjct: 208 QWLSKYGTARVQLNLDKNFSLSESALDWFIPVWDSANLTAFTQLGARNKDRRNTINLGVG 267 Query: 164 QRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQ 221 R W++G N FYD+ L + R G GAEAW +YL+LS N Y + WH+ Sbjct: 268 ARTLLDRWMLGVNMFYDHDLTGHNSRLGIGAEAWTDYLQLSTNGYMRLSNWHQSRDFADY 327 Query: 222 EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPV 281 ++R A G+D+ A +P L + EQY G+ V LF NP AL+ G+NYTP Sbjct: 328 DERAANGFDIRANAWLPALPQLGGKLVYEQYIGENVALFGKENLQRNPYALTAGVNYTPF 387 Query: 282 PLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNN 341 PL+TV + G++G N + L+YR G+ + Q+ VA + + SRY+ RNN Sbjct: 388 PLLTVGVDERLGKAGRNDTQFSIQLSYRPGLSWQSQIDPSSVAAIRQIAESRYNLVDRNN 447 Query: 342 LPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQ 401 LEY++++ + + L+ + G + ++S+Y + Q+ WQ D +++ Sbjct: 448 DIVLEYKKQEVIKLALSHHAINDLAGAVYTVSANLKSKYALDQVSWQ-DGGLVAAGGQLT 506 Query: 402 ANSAEGWTLIMPDWQ------------NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 ++L++P ++ E A+N ++L V DNQG + +S + + + Sbjct: 507 VIDKNHFSLMLPPYRPAQAKSDAHQTSTAEIAANTYQLIAVAFDNQGNQSNSETLRVVVQ 566 Query: 450 EPFDA 454 P Sbjct: 567 PPQVT 571 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 505 bits (1300), Expect = e-141, Method: Composition-based stats. Identities = 114/415 (27%), Positives = 195/415 (46%), Gaps = 12/415 (2%) Query: 47 GMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWL 106 A E ++ + A G + D R+ + + + WL Sbjct: 131 ASAEEGNEQAQKVAGYASQAGSFLASSAKSDAAASMA-------RNMATVEAGGAFQQWL 183 Query: 107 SPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRW 166 S +G A V + D S+ +PL D ++Q L + D+ +++G G R Sbjct: 184 SHFGTARVQLDADKNFSLKNSQFDLLLPLYDQGDNFVFTQGSLHRTDSRTQASLGAGWRH 243 Query: 167 ARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQR 224 + +++G N F D L + RAG G E W +L+L N Y + W + ++R Sbjct: 244 STSTYMLGGNLFGDFDLSRDHARAGAGLEYWRNFLKLGVNSYLRLSGWKDSPDLEDYQER 303 Query: 225 MARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 A G+D+ + +P L ++ EQY+G V LF + NP A+++G+NYTPVPL+ Sbjct: 304 PANGWDVRGQAWVPSLPQLGGKLTYEQYYGKEVALFGVDSRQRNPHAITVGINYTPVPLI 363 Query: 285 TVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 T+ A+ +QG+SG++ L LN+NY GVP + Q+ VA +SL GS+YD +RNN Sbjct: 364 TLGAEQRQGQSGKSDTRLTLNMNYHLGVPWRAQVDPTAVAAMRSLAGSQYDLVERNNNIV 423 Query: 345 LEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANS 404 LEYR+++ + + A GE L + + SR+G+ ++ W D L+ G + Sbjct: 424 LEYRKKEIVRLKTADLVT-GYTGEQKSLGVSVNSRHGLERIDW--DASALNAAGGKIVQN 480 Query: 405 AEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 + +++P +Q+ N + +S V D +G R S ++ +T+ Sbjct: 481 GRDYAVVLPAYQSSAQGVNTYTVSGVAVDTKGNRSSRSDTQVTVQATEVNKQTST 535 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 504 bits (1299), Expect = e-141, Method: Composition-based stats. Identities = 228/476 (47%), Positives = 294/476 (61%), Gaps = 13/476 (2%) Query: 1 MSRFVPRIIPFYLL---LLVAGGTANAQSTFEQKA-------ANP--FDNNNDGLPDLGM 48 MS+F + LL L+V G T N F ++A A+P F LP+LG Sbjct: 40 MSQFYRYLTLSCLLPAVLVVGGFTLNDALAFTEQARVDDAPFADPARFAKMQQQLPELGT 99 Query: 49 APENHDGEKHFAEIVKDFGETSMN-DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 +N K AE K GE SMN D+ E+A + + RDA Q+ E LS Sbjct: 100 VHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVFNRFRDAAKQRAASEGEQLLS 159 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 P+G ASV + + ++G F GS P QDN YLT+SQLG+ Q + G V N G+GQRW Sbjct: 160 PYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQLGIEQSEYGSVGNAGLGQRWI 219 Query: 168 RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMAR 227 G+W VGYN F D+LL + QR GAEAWG+YLR SAN+YQP + + + RMAR Sbjct: 220 AGSWRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSANYYQPLSGCRNHSNSALMRMAR 279 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 GYD+T R +PFY+ L ++S EQY G+ VDLFNSG NP A+SLG+NYTPVPL T++ Sbjct: 280 GYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGNAVANPAAVSLGINYTPVPLFTLS 339 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 A HK+G+ GE+Q+ L +NYR GV L +QLSA VA +QSL GSRYD RNN P + + Sbjct: 340 ASHKEGDGGESQDKFALKMNYRLGVALSQQLSADNVAAAQSLSGSRYDGVNRNNSPVMAF 399 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 RQ KTL+VFLATPPW L+PGET+PLKLQI I+ + WQGDTQ LSLTP +G Sbjct: 400 RQLKTLSVFLATPPWQLQPGETLPLKLQIAHSNAIKAVSWQGDTQALSLTPPPNNVDPQG 459 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 W++IMP W + +GA+N W LSV +ED++ QRV+SN ITL L P + D + Sbjct: 460 WSIIMPAWNSQQGANNSWHLSVTLEDSKHQRVTSNWITLKLSPPMTLQAADRGNFS 515 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 504 bits (1297), Expect = e-141, Method: Composition-based stats. Identities = 206/469 (43%), Positives = 287/469 (61%), Gaps = 12/469 (2%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANP------FDNNNDGLPDLG-MAPENHDGEK 57 + +++PF L A G A A +P + LP+LG A + + EK Sbjct: 12 LKKVVPFATGCLPAMGLAWLCGALPAYAESPPAPDSVVQQPANDLPELGGNASNDAEREK 71 Query: 58 HFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVK 117 +A + K GE ++N+ +Q + A S + Q + LSP GNA + + Sbjct: 72 EWATMAKQLGERNLNNVS---SQQVRTRAESYAVGQASSVLQQQAQELLSPLGNAKLSLV 128 Query: 118 VDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNT 177 + ++G F+GS G F PL D + LT+SQLGL QQ G + N G+GQRW G+WL+GYNT Sbjct: 129 MSDQGDFSGSSGQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAGDWLLGYNT 188 Query: 178 FYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARM 235 D+ + + RA GAEAWG++LR SAN+Y P +A +Q + R A GYD+T + Sbjct: 189 VLDSDFERHHNRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDITTQG 248 Query: 236 RMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGES 295 +PFY+ + S+S EQY+G+ VDLF SG ++P A+ LG+NYTPVPLVTV A HK GE Sbjct: 249 YLPFYRQIGGSLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLVTVKALHKMGEG 308 Query: 296 GENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTV 355 G +Q+ + L LNYR GVPL KQ+S VA+++SLRGSRYDN +R N+P + ++QRKTL V Sbjct: 309 GVSQDQVELALNYRLGVPLVKQISPEYVAQAKSLRGSRYDNIERKNVPVMAFKQRKTLQV 368 Query: 356 FLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDW 415 FLATPPW L+PGET+PL L+I++ I ++ WQGDTQ LSLTP +N GW+LI+P W Sbjct: 369 FLATPPWRLQPGETLPLVLEIKTTNKITRVSWQGDTQALSLTPSQNSNDPHGWSLIVPQW 428 Query: 416 QNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + A+N + LSV +ED++ Q V+SN I L + P S E P Sbjct: 429 DDSPDAANRYHLSVTLEDDKQQLVTSNWIQLQVTPPLTVSSEIEQGLPP 477 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 503 bits (1295), Expect = e-141, Method: Composition-based stats. Identities = 141/490 (28%), Positives = 229/490 (46%), Gaps = 35/490 (7%) Query: 1 MSRFVPRIIPF--------YLLLLVAGGTANAQSTFEQ---KAANPFDNNNDGLPDLGMA 49 MS + +II F + L+ A A ++ + P+ ++ +L Sbjct: 17 MSMYFNKIISFNIISRIVICIFLICGMFMAGASEKYDANAPQQVQPYSVSSSAFENLHPN 76 Query: 50 PENH---------DGEKHFA--EIVKDFGETSMNDNGLDTGE--QAKAFALGKVRDALSQ 96 E D E++ A + ET + + TG A A + Sbjct: 77 NEMESSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLAASGRASDVAHSMVGD 136 Query: 97 QVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL 156 VNQ ++ WL+ +G A V++ D S W P D+ +L +SQLG+ +D+ Sbjct: 137 AVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLFFSQLGIRNKDSRN 196 Query: 157 VSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 N+GVG R WL G NTFYDN L + R G GAEAW +YL+L+AN Y WH Sbjct: 197 TLNLGVGIRTLENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQLAANGYFRLNGWHS 256 Query: 217 QTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSL 274 ++R A G DL A +P L + EQY G+RV LF NP A++ Sbjct: 257 SRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALFGKDNLQRNPYAVTA 316 Query: 275 GLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRY 334 G+NYTPVPL+TV + G+S +++ L +NYR G + QLS VA ++ L SRY Sbjct: 317 GINYTPVPLLTVGVDQRMGKSSKHETQWNLQMNYRLGESFQSQLSPSAVAGTRLLAESRY 376 Query: 335 DNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQIL 394 + RNN LEY++++ + + L+ PG+ + Q++ +R+++W D +++ Sbjct: 377 NLVDRNNNIVLEYQKQQVVKLTLSPATISGLPGQVYQVNAQVQGASAVREIVWS-DAELI 435 Query: 395 SLTPGAQANSAEGWTLIMPDWQNGEG--------ASNHWRLSVVVEDNQGQRVSSNEITL 446 + S + L++P ++ +N + LS + D+QG R +S +++ Sbjct: 436 AAGGTLTPLSTTQFNLVLPPYKRTAQVSRVTDDLTANFYSLSALAVDHQGNRSNSFTLSV 495 Query: 447 TLVEPFDALS 456 T+ +P L+ Sbjct: 496 TVQQPQLTLT 505 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 502 bits (1293), Expect = e-140, Method: Composition-based stats. Identities = 133/448 (29%), Positives = 211/448 (47%), Gaps = 34/448 (7%) Query: 10 PFYLLLLVAGGTANAQ---STFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDF 66 P L+ L + ANA+ S+ E++ NP D N A+ Sbjct: 40 PSSLIYLSSVFNANAEEITSSAEKEQGNPSDQN----------------ASSVAQTAVQA 83 Query: 67 GETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTG 126 G +DN D A V A++ + + WLS +G A V++ D + Sbjct: 84 GSLLSSDNASDALGSA-------VVSAVTGKAASSAQEWLSQFGTARVNISTDEHFTLSD 136 Query: 127 SRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDEN 186 S VPL + + L ++QLG + D+ + N G G R W+ G N FYD + N Sbjct: 137 SELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHFNDGWMWGTNVFYDRQVSGN 196 Query: 187 -LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHL 243 QR G E +YL +SAN Y + W ++ ++R+A G+D+ A +P Y L Sbjct: 197 QHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFDIRATGYLPAYPQL 256 Query: 244 NTSVSLEQYFGDRVDLFNS--GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNN 301 ++ EQYFGD V LF +P A+++GLNYTPVPLVT+ K G+SGEN Sbjct: 257 GANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLVTMGLNQKMGKSGENDTQ 316 Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPP 361 + L L + GVPL QL +VA ++L+G R D RNN LEYR+++ +++ L Sbjct: 317 VNLGLTWTPGVPLSAQLDPSQVALRRTLQGGRLDLVDRNNNIVLEYRKQELISLALPA-E 375 Query: 362 WDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGA 421 + P+ +++++YG+ ++ WQGD+ ++ E + +P W Sbjct: 376 LEGAEQSKRPVTAKVKAKYGLDRIEWQGDSFFSHGGKITPGSNPEQVVMTLPVWVGS--G 433 Query: 422 SNHWRLSVVVEDNQGQRVSSNEITLTLV 449 SN + LS D +G ++ + +T+ Sbjct: 434 SNSYTLSATAWDKKGNASAAERVNVTVN 461 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 502 bits (1292), Expect = e-140, Method: Composition-based stats. Identities = 119/379 (31%), Positives = 194/379 (51%), Gaps = 5/379 (1%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ AK+ A R A++ +++ + WL +G A + +++ F S +PL DN Sbjct: 153 GDAAKS-AENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDDFEFDSSAIDVLIPLYDN 211 Query: 139 DRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 + L ++QLG +D+ N+G G R NW+ G NTF+DN + N +R G GAEAW Sbjct: 212 QKSLFFTQLGGRNKDSRNTINIGAGVRAFLTNWMYGANTFFDNDITGNNRRVGIGAEAWT 271 Query: 199 EYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 +YL+LSAN Y WH+ +R A GYDL A +P Y L + EQY GD Sbjct: 272 DYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLPAYPQLGGKLMYEQYNGDE 331 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LF +P A+++G+NYTPV LVTV H+ G+S ++ +++ L NYR + Sbjct: 332 VALFGKDKRQKDPHAITVGINYTPVSLVTVGIDHRAGKSSKSDSSINLQFNYRLSNSWQS 391 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376 + VA +++L GSR D +RNN L+Y++++ L + L G+ L QI Sbjct: 392 HIDPSAVAVTRTLAGSRQDLVERNNNIVLDYQKQELLRLSLP-EQLTGSAGDNAILTAQI 450 Query: 377 RSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQG 436 S+Y ++++ W ++ +++ S + T+ P +Q G SN + LS + D G Sbjct: 451 ESKYEVQRVEWDANS-LIAAGGNISTTSQKDVTITFPPYQYQVGVSNIYALSAIAYDVNG 509 Query: 437 QRVSSNEITLTLVEPFDAL 455 + + + + + Sbjct: 510 NISNRATTQIHVSQSSTII 528 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 499 bits (1284), Expect = e-139, Method: Composition-based stats. Identities = 118/419 (28%), Positives = 193/419 (46%), Gaps = 24/419 (5%) Query: 50 PENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPW 109 + + E A + G + Q+ A R + N + WLS + Sbjct: 166 ADTSERESRVAGQLMGVGRVLAS-------PQSSNAASEMARSWATAAANDEIVKWLSKY 218 Query: 110 GNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARG 169 G A + + +D GS W +P D T++QLG +D+ N+G+G R Sbjct: 219 GTAQLQLNIDKNFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTLNIGIGTRTLSN 278 Query: 170 NWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMAR 227 NWL G N FYD+ L R G G+EAW +YL+LS N Y + WH+ +R A Sbjct: 279 NWLFGVNAFYDHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSRDLADYNERPAN 338 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 G+D+ A MP L + EQYFGD V LF NP A ++G+NYTP PL+T+ Sbjct: 339 GFDVRANAWMPTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGVNYTPFPLLTLG 398 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 + G++ E+ + LNYR G + Q+ V S+ + SRY+ +RNN LEY Sbjct: 399 VDQRLGKNSEHDTQFNVQLNYRIGDDWRAQVDPSAVPHSRLISESRYNLVERNNNIVLEY 458 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 +++ + + L + +PG + ++++YG++ ++WQ D + +S Q Sbjct: 459 QKQNIMHLSLPSDTLSGQPGSEHMISAILQTKYGLQDIVWQ-DAEFISAGGKLQRQDKTH 517 Query: 408 WTLIMPDWQNGEG--------------ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPF 452 + L +P ++ A+N + LS D +G + ++ +T+T+ P Sbjct: 518 FNLTLPSYRYSATARRSGSHATAQAEIAANTYHLSATAFDTKGNQSNTINLTVTVEPPT 576 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 498 bits (1283), Expect = e-139, Method: Composition-based stats. Identities = 129/433 (29%), Positives = 202/433 (46%), Gaps = 19/433 (4%) Query: 37 DNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQ 96 + N P G + E A + G D + A G R S Sbjct: 127 EQQNAVPPANG----ENTLENQIASTSQRVGTLLSQDMNSE-------QASGMARGWASS 175 Query: 97 QVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL 156 + + + WL+ +G A + + VD + S+ + P D YL +SQ L + D+ Sbjct: 176 EASGAMTDWLNNFGTAKISLGVDEDFSLKNSQFDFLHPWYDTPDYLLFSQHTLHRTDDRT 235 Query: 157 VSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 N G+G R +W+ G N F+D+ L RAG GAE W +YL+LS+N Y W Sbjct: 236 QINTGLGWRHFTPSWMSGINLFFDHDLSRYHSRAGLGAEYWRDYLKLSSNAYIGLTGWRS 295 Query: 217 QT---ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALS 273 E R A G+DL A +P + L + EQY+GD V LF+ NP A++ Sbjct: 296 APELDNDYEARPANGWDLRAEGWLPAWPQLGGKLVYEQYYGDEVALFDKNDRQSNPHAIT 355 Query: 274 LGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSR 333 GLNYTP PL+T++A+ +QG+ GEN ++L ++ ++KQL+ EVA +SL GSR Sbjct: 356 AGLNYTPFPLLTLSAEQRQGKQGENDTRFAVDLTWQPSSSMQKQLNPDEVAGRRSLAGSR 415 Query: 334 YDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQI 393 YD RNN LEYR+++ + + L P K GE PL ++++Y ++ + + Sbjct: 416 YDLIDRNNNIVLEYRKKELIRLSL-LDPVKGKSGEIKPLVSSLQTKYALKG--YNIEAAA 472 Query: 394 LSLTPGAQANSAEGWTLIMPDWQ--NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEP 451 L G + S + T+ +P ++ N N W + V ED +G + + + P Sbjct: 473 LEAAGGKVSTSGKDITVTLPGYRFTNTPETDNTWSIDVTAEDVKGNLSRHEQSMVVIQAP 532 Query: 452 FDALSNDELRWEP 464 + + L P Sbjct: 533 TLSQKDSLLSVNP 545 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 492 bits (1268), Expect = e-138, Method: Composition-based stats. Identities = 119/421 (28%), Positives = 213/421 (50%), Gaps = 12/421 (2%) Query: 35 PFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDAL 94 P +D P++ + + E A++ G+ + E+ KAFA R+ L Sbjct: 119 PIVEWDDDKPEIVLPSSASENEIRVAQLASQAGKFFSTN---PDQEKTKAFA----RELL 171 Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 + + + + W + +G++ + ++ D + S+ +P + + L +SQ L +++ Sbjct: 172 TTAASSYAQDWFNRFGSSQIHLEADKKFSLKNSQIDLLMPWYETEDNLIFSQTSLHRKEG 231 Query: 155 GLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW 214 + +N+G+G RW ++G NTF+D + R G G E ++L+LSAN Y + W Sbjct: 232 RIETNLGLGARWYGEGQMIGGNTFFDYDISRKHSRLGLGVEYRRDFLKLSANSYHRLSGW 291 Query: 215 HEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 + R + G+D+ A +P Y H+ ++ EQY+GD V LF + NP ++ Sbjct: 292 RSSRDLADHSARPSNGWDVRAEGWLPSYPHIGGKLTYEQYYGDSVALFGTKNLQQNPYSI 351 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS 332 + GLNYTP+PLVT A+H+QG++ + + GL LNY+FG K+ L G V +SL G+ Sbjct: 352 TAGLNYTPIPLVTFNAEHRQGKASKQDSRFGLQLNYQFGKTWKQHLDPGSVTTFRSLMGN 411 Query: 333 RYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQ 392 RYD RNN LEY++ + + +A GE +PL + S+YG+ L W +T Sbjct: 412 RYDFVSRNNHIVLEYKKNDVIQLNIANSIT-GYAGEKIPLSFTVASKYGLSHLKWNAET- 469 Query: 393 ILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPF 452 L G ++L++P ++N ++N++ +S V D +G + + + + +P Sbjct: 470 -LVAAGGHIVQENGKYSLVLPAYRNDAKSANNYTISAVAIDKKGNISPNTMLRVVVTQPA 528 Query: 453 D 453 Sbjct: 529 I 529 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 490 bits (1262), Expect = e-137, Method: Composition-based stats. Identities = 125/444 (28%), Positives = 207/444 (46%), Gaps = 20/444 (4%) Query: 24 AQSTFEQKAANPFDNNNDGL-PDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQA 82 A+ + P +N GL P+ + E++ A+ + ++G++A Sbjct: 57 AKLQAGDELEIPQAQSNLGLAPENTALTDTQTTERNLAKTATTSAQML------NSGDKA 110 Query: 83 KAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYL 142 A ++R NQ SWL+ +G A + VD+ G GS+ +P D + Sbjct: 111 ---AARQLRGLAVGNANQAANSWLNNFGTARLQANVDDRGDLDGSQFDMLMPFYDTPSQM 167 Query: 143 TWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLR 202 ++Q G+ + D +N+G+G R +W+VGYN F D + + R G GAE +YL+ Sbjct: 168 AFTQFGIRRIDKRTTANLGIGIRHFIDDWMVGYNLFLDRDITRDHTRVGAGAEYARDYLK 227 Query: 203 LSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLF 260 L+AN Y + W + +R A G+DL A +P L + EQYFG+ V LF Sbjct: 228 LAANGYLRLSDWRDSPDFSSYSERPATGFDLRAEAYLPSLPQLGGKLMYEQYFGNDVGLF 287 Query: 261 NSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSA 320 NP A++ G+NYTP+PLVTV KQG +G + L +NY G P KQ+S Sbjct: 288 GKDNRQQNPAAITAGINYTPIPLVTVGIDRKQGSAGNGETLFNLGVNYEVGTPWAKQISP 347 Query: 321 GEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRY 380 V ++L+GSR D +RNN LEY+++ + ++++ + ET L + + S+Y Sbjct: 348 DAVNARRTLQGSRNDLVERNNQIVLEYKKQDVINLYVSNNV-SGRAAETKQLVVSVTSKY 406 Query: 381 GIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVS 440 G+R + + + + + L +P N+W +S + D +G + Sbjct: 407 GLRNIQFD-QGALAAAGGKIIPQGPSQFALQLPP---QPSGGNNWTISAIASDVKGNTSN 462 Query: 441 SNEITLTLVEPFDALSNDELRWEP 464 +TLV+ D + W P Sbjct: 463 RA---VTLVQLQDTPATISGTWTP 483 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 489 bits (1260), Expect = e-137, Method: Composition-based stats. Identities = 124/435 (28%), Positives = 204/435 (46%), Gaps = 16/435 (3%) Query: 35 PFDNNN---DGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVR 91 P NN LP +P + A+ G T N+ E A Sbjct: 105 PLIGNNFTTQSLPHSTSSPNDS----LLAQSASQVGNTLQNN---PNSEALNDLARSSAL 157 Query: 92 DALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQ 151 A + + Q + WL+ G V + D + S+ VPL +++ ++ +SQ + + Sbjct: 158 SAANAKAGQEISDWLNGKGKVRVKLDADRDFSVKNSQLDLLVPLWESESHMIFSQGSVHR 217 Query: 152 QDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPF 211 D+ SN+G+G R+ ++ +G NTFYD+ + R G GAE + +L+ N Y Sbjct: 218 TDDRTQSNLGLGYRYFADSYALGANTFYDHDWSRSHSRLGLGAEYQRNFFKLATNGYLRL 277 Query: 212 AAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 + W + E+R A G+D+ A +P Y L ++ EQY+GD V LF NP Sbjct: 278 SNWKDSPDFDNYEERPANGWDIRAEGYLPSYPGLGAKLAYEQYYGDNVGLFGKDNQQKNP 337 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 A++ G NY+P PL+ + +QG+ G+N G++LNY G PL QL ++ S+SL Sbjct: 338 HAITFGGNYSPFPLLKFSVDQRQGKGGQNDTRFGIDLNYTLGTPLSHQLDRNQLIASRSL 397 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 +RYD RNN LEYR++ TL++ LA GE L++ + S G+ ++ W Sbjct: 398 IANRYDFVDRNNNIVLEYRKKNTLSLKLAQQV-SGYTGERKSLEVSVNSSNGLERIDWDA 456 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 ++LS +++I+P++Q G GA+N + ++ DN G T+ + Sbjct: 457 P-ELLSNGGQIIQEGPGLYSVIVPEFQYGVGAANQYIVNATAYDNSGNASQQASTTVVVT 515 Query: 450 EPFDALSNDELRWEP 464 A+S + P Sbjct: 516 --ASAVSTTHSEFTP 528 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 482 bits (1240), Expect = e-134, Method: Composition-based stats. Identities = 126/414 (30%), Positives = 198/414 (47%), Gaps = 12/414 (2%) Query: 53 HDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNA 112 E+ A ND D+ +QA + A G +A N+ ++ W + +G+A Sbjct: 68 SQAEQSTANAATRLASILTND---DSAKQASSIARGTAANAG----NEALQKWFNQFGSA 120 Query: 113 SVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWL 172 V + +D + GS+ +PL D+ LT++QLG D+ + NVG+GQR + Sbjct: 121 KVQLNLDEKLSLKGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQM 180 Query: 173 VGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT--QEQRMARGYD 230 +GYN F D+ + R G GAE +++ L+AN Y + W ++++A G+D Sbjct: 181 LGYNLFVDHDASYSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFD 240 Query: 231 LTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQH 290 L + +P L + EQYFGD V LF NP+A++LG+NYTP+PL TV H Sbjct: 241 LRSEAYLPTLPQLGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPLFTVGVDH 300 Query: 291 KQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQR 350 K G +G N L NY FG PL QL + VA +SL GSRY+ RNN ++YR++ Sbjct: 301 KMGRAGMNDTRFNLGFNYAFGTPLAHQLDSDAVAIKRSLMGSRYNLVDRNNQIVMKYRKQ 360 Query: 351 KTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTL 410 +T+ L +T+PL ++ GI ++ W+ L+L G S W + Sbjct: 361 NRVTLELPARV-SGAARQTMPLVANATAQQGIDRIEWEASA--LTLAGGKITGSGNNWQI 417 Query: 411 IMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 +P + +G +N +R+S + D G L + + L P Sbjct: 418 TLPSYLSGGEGNNTYRISAIAYDTLGNASPVAYSDLVVDSHGVNTNASGLTAAP 471 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 481 bits (1237), Expect = e-134, Method: Composition-based stats. Identities = 115/406 (28%), Positives = 195/406 (48%), Gaps = 16/406 (3%) Query: 68 ETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGS 127 E T A R ++ NQ V+ WL+ +G V+V D + S Sbjct: 51 EAFNKIISTGTSLAVSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKES 110 Query: 128 RGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENL 187 W +P D+ Y+ +SQLG+ +D+ N+G G R + +W+ G+NT YDN + + Sbjct: 111 SLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHN 170 Query: 188 QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNT 245 R G GAEAW +YL+LSAN Y WH+ +R A G D+ + +P L Sbjct: 171 HRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGG 230 Query: 246 SVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLN 305 + EQY G+RV LF NP A++ GL YTP+P +T+ + G+S +++ L Sbjct: 231 KLKYEQYRGERVALFGKDNLQSNPYAVTTGLIYTPIPFITLGVDQRMGKSRQHEIQWNLQ 290 Query: 306 LNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLK 365 ++YR G + Q S VA ++ L SRY+ +RN LEY+++ T+ + + Sbjct: 291 MDYRLGESFRSQFSPAVVAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGL 350 Query: 366 PGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ--------- 416 PG+ + QI+S+ +++++W D Q ++ SA + +++P ++ Sbjct: 351 PGQVYSVSAQIQSQSALQRILWN-DAQWVAAGGKLIPVSATDYNVVLPPYKPMAPASRTV 409 Query: 417 ----NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSND 458 E A N + LS DN G + +T+ + +P ++++ Sbjct: 410 GKTGESEAAVNTYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSE 455 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 479 bits (1232), Expect = e-133, Method: Composition-based stats. Identities = 113/418 (27%), Positives = 194/418 (46%), Gaps = 12/418 (2%) Query: 44 PDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVE 103 P+ P + E VK + + A VR +S N+ ++ Sbjct: 135 PNEVNCPVGIENNPQTKEYVKRVSALLASSDPT-------TVATDVVRSEVSSTANKEIQ 187 Query: 104 SWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVG 163 WL +G A V + VD++ S W D+ + ++QLG+ +D+ +N+G+G Sbjct: 188 KWLGQYGTAQVRLNVDDKFSLRESSLDWLFSFYDSSSAIIFTQLGIRNKDHRNTANLGLG 247 Query: 164 QRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQ 221 R + GNW++G NTFYDN L R GFGAEAW +YL+LSAN Y WH+ Sbjct: 248 GRISMGNWILGANTFYDNDLTGINSRLGFGAEAWTDYLQLSANSYMRLNNWHQSRDFIDH 307 Query: 222 EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPV 281 ++R A G+D+ +P L + EQY GD V LF NP A++ G+ YTP Sbjct: 308 DERPANGFDIRTNAWLPVLPQLGGKLMYEQYSGDSVALFGKDKLQKNPYAVTAGITYTPF 367 Query: 282 PLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNN 341 PL+T ++G++G++ + L+Y G VA ++ L +RY+ RNN Sbjct: 368 PLLTFGIDERRGKAGKSDTQFNIQLSYHLGESWLSLTDPSAVAGTRQLAEARYNLVDRNN 427 Query: 342 LPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQ 401 LEY+++ L + +T G+ + +I S++ + ++ W + +L+ + Sbjct: 428 NIVLEYQKQDILNIT-STEQLRGYSGDNGIILTKIVSKHNVERVEWINISALLAAGGNSV 486 Query: 402 ANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 + P +Q +N + + VV D++G R + + +T+++ + S Sbjct: 487 ELPGRKLAITYPPYQ--IDGNNTYHVDVVAYDSRGNRSNISTTAITVLQKENTPSTVN 542 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 476 bits (1225), Expect = e-133, Method: Composition-based stats. Identities = 147/416 (35%), Positives = 230/416 (55%), Gaps = 11/416 (2%) Query: 43 LPDLGMAPENHDG----EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQV 98 LPD+ + E ++ FA+ K+ G N D +A++ ++ + + Sbjct: 20 LPDMAIMAETSGAKPISDQQFADWGKNLGGQDWNTLNRD---KAQSKTTQWAKEKIISPL 76 Query: 99 NQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVS 158 Q + L +G A V++ +DN+G+ S S F P D+++YL +SQ+ + QDN + Sbjct: 77 QQQAQDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYLLFSQINIHHQDNRKIG 136 Query: 159 NVGVGQRWARGNW--LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 N G+G R + L+GYN F D+ RAG GAEA +YL+ SAN+Y P + W + Sbjct: 137 NFGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADYLKFSANYYHPLSHWKD 196 Query: 217 QTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSL 274 + +R A+GYDL ++ +P Y L S E YFGD V LF +P AL+L Sbjct: 197 SPDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVALFGKSHRQKDPRALTL 256 Query: 275 GLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRY 334 G++YTPVPLVT+ A+HK G+ G+ + + Y+FG PL QL V + +SL+GSRY Sbjct: 257 GIDYTPVPLVTLGAKHKYGQQGKKDTQIDVAFRYQFGSPLSAQLDPDNVNQLRSLKGSRY 316 Query: 335 DNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQIL 394 D RNN LEY++++ L LA P L GE+ L+ ++S+Y I LIW GD L Sbjct: 317 DLVDRNNDIVLEYKEKQVLFADLAAVPDSLMEGESYILRPLVKSKYPIIDLIWLGDLLPL 376 Query: 395 SLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVE 450 L A +++ +GW + +P W + GASN ++L++ +ED + RV++N I + + + Sbjct: 377 QLLATAGSHNPQGWQITLPAWSSVAGASNRYQLALSLEDQKNHRVTTNTIEIQVGQ 432 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 474 bits (1221), Expect = e-132, Method: Composition-based stats. Identities = 119/421 (28%), Positives = 192/421 (45%), Gaps = 16/421 (3%) Query: 39 NNDGLPDLGMAP--ENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQ 96 +N LP LG + ++ E AE K G + Q A R+ +S Sbjct: 39 DNKELPSLGSDQIIDENNTEHLAAEYTKTVGTFLSQKKTMKDLSQ---IAQDYARNKVSS 95 Query: 97 QVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL 156 + + +E WLS GN +++ D + S+ W +P D + L ++Q L + D Sbjct: 96 EATKEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLFTQHTLHRYDERF 155 Query: 157 VSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 +N G+G R+ +G N F D+ L R G G E W +YL+L+AN Y +W Sbjct: 156 HTNNGIGLRYFHEKSTIGMNAFIDHDLSHAHTRVGLGVEYWQDYLKLNANSYFGLTSWKS 215 Query: 217 QTA---TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALS 273 + + A G+D+ +P Y HL ++ EQY+GD V LF NP A + Sbjct: 216 ASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALFGKTKRQKNPNAAT 275 Query: 274 LGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSR 333 +G N+TP PL T+ A HK G + + L + FG L L +VAE++ L G+R Sbjct: 276 IGANWTPFPLFTLNASHKLGSEKQVETQAKLQFTWTFGKNLAHHLDPTKVAETRRLSGNR 335 Query: 334 YDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQI 393 YD +RNN L Y+++ L + L + + G++VPL S+Y ++ + WQ + Sbjct: 336 YDFVERNNNIILNYQKKTVLHLSLPSKIQGIT-GQSVPLVKSFTSKYPLKHIEWQAPEFL 394 Query: 394 LSLTPGAQANSAEGWTLIMPDWQNGEGAS-----NHWRLSVVVEDNQGQRVSSNEITLTL 448 G+ ++ + TL +P +Q A N +RL + D +G E + + Sbjct: 395 --AVGGSISSDDQTATLTLPSYQTSNAAKDVQRINRYRLRAIAYDIKGNVSPVAETLIEI 452 Query: 449 V 449 Sbjct: 453 T 453 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 473 bits (1217), Expect = e-132, Method: Composition-based stats. Identities = 120/419 (28%), Positives = 202/419 (48%), Gaps = 21/419 (5%) Query: 35 PFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDAL 94 P ++ + A E + + + + G + N A + + Sbjct: 39 PTISHASAVKASQAAAEQQEL-RGLSSLAAQAGRSIEN-----------GHAGSFAANTV 86 Query: 95 SQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 Q + V WL +GNA + + VD+ S + P D +++ +SQ L + D+ Sbjct: 87 PAQATKEVVEWLQKYGNARIQLNVDDAFSLKDSAFDFLYPWIDKKQHVLFSQTSLHRTDD 146 Query: 155 GLVSNVGVGQRWAR-GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 +N+G+G R+ N ++G N FYD L + R G G E W +YLR AN Y + Sbjct: 147 RTQTNIGMGYRYFTADNSMLGANLFYDYDLSRHHARMGAGVEYWRDYLRAGANAYLRLSK 206 Query: 214 WHEQ--TATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 W + ++R A G+D+ + +P Y L S+ E+Y+G V LF S NP A Sbjct: 207 WKDSHDLDDYQERPADGWDIYTQGWLPSYPQLGASLKYEKYYGKNVGLFGSDHLQENPYA 266 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 + G++YTPVPLVT++A+HKQG+S + + G+ +NYR G+PL KQL + VA + ++ Sbjct: 267 FTGGISYTPVPLVTLSAEHKQGQSNTHDSRFGIEINYRPGIPLAKQLDSDNVALMREVQH 326 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI-RSRYGIRQLIWQGD 390 RYD +RNN LEYR++ L + L + G +P+ + + +S +GI+ + W Sbjct: 327 GRYDFVERNNNIVLEYRKKSVLKIRLPESV-QGEGGAVIPVTISLDKSHWGIQSVEWN-- 383 Query: 391 TQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 + G + S W L +P + G +NHW++ D +G + + +T+ Sbjct: 384 DSAFTAAGGRISGSGTSWQLTLPAY--TPGGTNHWQIGATARDVKGNVSNYAVMNVTVT 440 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 472 bits (1214), Expect = e-131, Method: Composition-based stats. Identities = 120/425 (28%), Positives = 198/425 (46%), Gaps = 14/425 (3%) Query: 43 LPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHV 102 +P+ + A + G N L + ++ AF+ + L + V Sbjct: 108 IPNQEEEQQATQQASMVASHLSQVG------NSLSSEDRVGAFSR-LAKGMLLSSTAKTV 160 Query: 103 ESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGV 162 E WL G A V ++ D++ F+GS F+PL D L +SQ G + D + N+G+ Sbjct: 161 EEWLGHIGQAQVKLQADDKNDFSGSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGL 220 Query: 163 GQRWARGNWLVGYNTFYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--A 219 GQR +W+ GYN F+D + N +R GFG E +Y++LSAN Y W T Sbjct: 221 GQRHYVSDWMFGYNIFFDQQISGNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLE 280 Query: 220 TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYT 279 ++R A GYD+ +P Y L + EQYFGD V LF NP AL+ G++YT Sbjct: 281 DYDERAANGYDIRTEAYLPHYPQLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYT 340 Query: 280 PVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQR 339 P+PLV++ H G G+ + + + +NY P +KQ+ V +++L GSR D R Sbjct: 341 PIPLVSLGLDHTIGNGGKKKTGVNVAVNYEINTPWQKQIDPAAVQATRTLAGSRMDLVDR 400 Query: 340 NNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPG 399 NN LEYR+++ +T+ L K +P+ +R+G+ ++ W ++ Sbjct: 401 NNNIVLEYRKQQVVTLNLPEKI-SGKEALVLPINYTFNARHGLDRIEWDA-ADVIQAGGQ 458 Query: 400 AQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 + + + +P + +GA+N + LS D +G +S+ + + + N Sbjct: 459 VSSQGNLAYHVALPPY--IDGAANAYVLSGRAVDKKGNYSTSSSTNIYVTGVNISSVNSV 516 Query: 460 LRWEP 464 P Sbjct: 517 SSLTP 521 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 470 bits (1210), Expect = e-131, Method: Composition-based stats. Identities = 110/372 (29%), Positives = 181/372 (48%), Gaps = 4/372 (1%) Query: 89 KVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLG 148 +S + NQ +E WL+ +G+A V + D S +PL + L ++Q Sbjct: 118 YAISQISSKSNQKIEQWLNQFGHARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFAQTN 177 Query: 149 LTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFY 208 ++D N G+G R+ ++VG N FYD+ L + R G GAE W +Y +LS+N Y Sbjct: 178 YHRKDLRSQFNYGIGYRYFTEKFMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSSNHY 237 Query: 209 QPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGY 266 ++W +R A G+D+ P Y L T + EQY+G V LF Sbjct: 238 HRLSSWRASNNILDYSERPANGWDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKDKRD 297 Query: 267 HNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAES 326 NP +LG+NYTP+PLVT+ A+ + G NNL +NL+YR G L QL+ V Sbjct: 298 KNPHTYTLGINYTPIPLVTLNAERRIGLHDRADNNLNINLSYRIGESLASQLNPDNVKAI 357 Query: 327 QSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI 386 ++L GSRYD RNN LEY++ + + + + E L++Q++++Y + + Sbjct: 358 RTLAGSRYDFVNRNNDMILEYKKETLVFLSMVDSI-NGYAKEERDLQVQVKTKYPLANIE 416 Query: 387 WQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITL 446 W +++ + + + +T+I+P +Q G N + +S V D G R + + T+ Sbjct: 417 WSA-SKLNAQGGQIKHHGGTHYTVILPQYQIGAIEKNSYIISAVAIDTHGNRSAPVQTTV 475 Query: 447 TLVEPFDALSND 458 + + N Sbjct: 476 IVDKSLINTRNS 487 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 467 bits (1203), Expect = e-130, Method: Composition-based stats. Identities = 115/417 (27%), Positives = 190/417 (45%), Gaps = 18/417 (4%) Query: 53 HDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNA 112 E+ A+ + G N D A + S Q V WL+ +G A Sbjct: 37 QADEQSVAQTAMEAGRVLQGSNSGDA-------ARQMLTSQASGQAADAVTQWLNQFGTA 89 Query: 113 SVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-W 171 + V ++ GS +P + + + ++QLG+ D +N G+G R+ N W Sbjct: 90 KTQLSVVSDFSLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGW 149 Query: 172 LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGY 229 ++GYN FYD +R G G EAW +YL+LSAN Y+ + W + ++R A G+ Sbjct: 150 MLGYNVFYDVDWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGW 209 Query: 230 DLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQ 289 D+ A +P Y L + EQY+G+ V LF NP A++ G+ +TP L+T Sbjct: 210 DIRAEGWLPAYPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLLTAGVD 269 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 +++G++G + L L L YR G PL QL + V +SL +R + RNN LEYR+ Sbjct: 270 YRRGKNGADDTRLNLGLTYRIGEPLAHQLDSSRVGAQRSLAANRLELVNRNNDVVLEYRK 329 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 + +T+ L + + TV L Q+ ++YG+ ++ D ++ +N+ T Sbjct: 330 QTLITLQLPPDVYGAEL-TTVTLTPQVNAKYGLSRIE-LDDAELRQAGGKIISNTGNQIT 387 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL---VEPFDALSNDELRWE 463 L +P W + + LS D +G + + V+ A+S D+ Sbjct: 388 LQLPAWSSDRQSV---TLSGRARDTRGNLSDIARTRILVSPAVQQQLAVSTDKTTAT 441 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 467 bits (1201), Expect = e-130, Method: Composition-based stats. Identities = 108/395 (27%), Positives = 195/395 (49%), Gaps = 15/395 (3%) Query: 57 KHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDV 116 + A I D G NDN + + ++ +VN H++SW +G A + + Sbjct: 133 QQIASIATDVGNILSNDNISKN---------SALLNKITNKVNSHIQSWFENFGTAHIQL 183 Query: 117 KVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYN 176 +VD S+ P+ ++D L +SQ G++ D+ +SN+G+G R NW++G N Sbjct: 184 QVDKNFSLKNSQLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDNWMLGGN 243 Query: 177 TFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTAR 234 +F D L + R G G E W + L+L AN Y + W + E+R A G DL + Sbjct: 244 SFIDYDLRKEHSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNIK 303 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 +P Y + + E+Y+GD V LF NP + +LG++YTP PL++ A+HK G Sbjct: 304 SWLPSYPQIGGDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLMSFKAEHKMGS 363 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 + N + +G +NY+ P + Q++ + + L G RYD +RNN L+YR+++ + Sbjct: 364 NNINDSRIGFEINYQIHTPWESQINPVLIPAMRKLAGQRYDLVERNNNIILDYRKKEIIK 423 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPD 414 + GE L +++ S+Y + ++ W +T ++ +++I+PD Sbjct: 424 ID-GVDVISGFSGEKKRLDIRVNSKYPVDRIDWLANT-FIANGGKIINEGLHNYSIILPD 481 Query: 415 WQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 ++N E +N + + + D +G + I + ++ Sbjct: 482 YRNQE--NNSYTIDLSAIDIKGHTSNRKTIKIDVL 514 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 462 bits (1190), Expect = e-129, Method: Composition-based stats. Identities = 127/378 (33%), Positives = 194/378 (51%), Gaps = 14/378 (3%) Query: 83 KAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY- 141 K A + + L+ Q+ + + WLS +G A +++ VD+ G S VP D+ + Sbjct: 128 KKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHW 187 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 L +SQ G +D+ N+G+G R +W+ G NTFYDN L N R G E W YL Sbjct: 188 LIYSQYGYRHKDSRDTVNLGIGTRLFINDWMYGANTFYDNDLTGNNSRFSLGGELWTNYL 247 Query: 202 RLSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 ++SAN Y + WH +R A GYDL A M +P L + EQYFGD V L Sbjct: 248 KMSANAYFRLSDWHNSRDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQYFGDNVAL 307 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F + +P A ++G+NYTP+PL+T +K G+ G++ LN+NYRFGVPL +QLS Sbjct: 308 FGTNNRQKDPYAATIGVNYTPIPLITAGVDYKLGKEGKSDGIFSLNMNYRFGVPLSEQLS 367 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEY-RQRKTLTVFLATPPWDLKPGETVPLKLQIRS 378 V +SL GSRYD +RNN L Y +++K + + GE P+ QI+S Sbjct: 368 PENVGSLRSLAGSRYDLVERNNNIILNYLKKQKHFRLLVPVIEIIGYGGEIKPI--QIQS 425 Query: 379 RYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR 438 ++ +IW D L G + G+T+ +P++Q N + ++ +D+Q Sbjct: 426 DTPLKNIIW--DMPELFQKNGGIIKNTNGYTIQLPEYQ--PDGKNDYTITGTSKDDQ--- 478 Query: 439 VSSNEITLTLVEPFDALS 456 +I +++ +LS Sbjct: 479 -QRVQIQTHVLQRNISLS 495 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 461 bits (1186), Expect = e-128, Method: Composition-based stats. Identities = 111/465 (23%), Positives = 203/465 (43%), Gaps = 24/465 (5%) Query: 9 IPFYLLLLVAGGTANAQSTFEQKAANPFD---NNNDGLPDLGMAPENHDGEKHFAEIVKD 65 IP L + + +S ++ + D +NN + + + + E + A K Sbjct: 84 IPLSKLYKLNQFRSFHKSFYDLSGGDEIDIPASNNYSFENRPLDTKVDNNENYSANKTKA 143 Query: 66 FGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFT 125 S ++ + ALG S N ++ WLS WG + D++ Sbjct: 144 AVNVSESNKSPE--------ALGVASSMASSAANNAIQKWLSQWGTVESQLSFDSKASLK 195 Query: 126 GSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDE 185 S W +P+ D D + Q G +D+ N+G G R W+ G N F+D + Sbjct: 196 NSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLGWGVRHVYNGWMYGLNNFFDYDITG 255 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHL 243 N +R G G EA +YL +++N Y WH+ ++R A G+D+ +P Y + Sbjct: 256 NNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDFYDYDERPANGFDMRVNGWLPAYPQI 315 Query: 244 NTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLG 303 + EQY+GD V LF +P A++ G+++TP PL+++ HK G++G++ ++ Sbjct: 316 GGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSWTPFPLLSLGVDHKIGQAGKHDTSVN 375 Query: 304 LNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWD 363 L L +R L QL VA S+ L SRYD RNN LEYR+++ +++ L+ + Sbjct: 376 LQLTWRPSDSLSSQLMPDNVAASRLLSKSRYDLVDRNNNIVLEYRKQQLISLKLSHGEIN 435 Query: 364 LKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDW-------- 415 G + + + ++ G+ + W ++ +A + + +P + Sbjct: 436 APGGTSHTIIATVAAKSGLSDITWNA-ANFIAAGGKIKAIDKTVFAITLPPYINQGSDRK 494 Query: 416 --QNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSND 458 ++G N + L V + + G E+ + ++ P + D Sbjct: 495 TQKSGAQGGNAYTLIAVAQSDDGSISEPKELHVNVLPPNINFNGD 539 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 460 bits (1185), Expect = e-128, Method: Composition-based stats. Identities = 125/460 (27%), Positives = 213/460 (46%), Gaps = 23/460 (5%) Query: 3 RFVPRIIPFYLLLLVAGGTANAQS--TFEQKAA-NPFDNNNDGLPDLGMA----PENHDG 55 R R+ ++ L++ +F AA NP N ++ E + Sbjct: 2 RIYLRLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEA 61 Query: 56 EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVD 115 K+ A I+ G DN D + + S V ++ WL +G A V+ Sbjct: 62 GKNLAAILSSTGSMLSQDNKTDA-------LINSAINNGSAYVTGQIQQWLQQFGTAKVN 114 Query: 116 VKVDNEGHFTG-SRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVG 174 + +D + S D + L ++Q G + D+ + NVG+G R+ W+ G Sbjct: 115 LGLDKDLSLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFADRWMWG 174 Query: 175 YNTFYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDL 231 NTFYD + +N +R G G E Y +LSAN Y+ + W + + ++R+A GYD+ Sbjct: 175 INTFYDRQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDI 234 Query: 232 TARMRMPFYQHLNTSVSLEQYFGDRVDLFNS--GTGYHNPVALSLGLNYTPVPLVTVTAQ 289 A +P + L + EQY+GD V LF+ NP A++ G+NYTP PLV++ Sbjct: 235 RAEGYLPAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLVSIGLN 294 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 K G+ N + L +N+ G LK QL + V ++L GSR D RNN LEYR+ Sbjct: 295 QKMGKGNHNDTQIDLAVNWMLGSSLKSQLDSDAVKARRTLLGSRLDLINRNNNIVLEYRK 354 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 + +++ + ET+P+ + ++S+Y + + W+ D L G + + W+ Sbjct: 355 QDLISLKVQNKVT-GTESETLPVSVNVKSKYPLDHISWEDDN--LVKNGGKISENNGSWS 411 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 + +P +Q G N + +S DNQG + +++ +T+ + Sbjct: 412 VTLPHYQQNSGEKNLYVVSATAWDNQGNKSNASHMTVEVS 451 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 460 bits (1185), Expect = e-128, Method: Composition-based stats. Identities = 118/419 (28%), Positives = 194/419 (46%), Gaps = 8/419 (1%) Query: 49 APENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSP 108 P + +K + S + L + + +AF+ + L + VE WL Sbjct: 69 VPNREEEQKATQQASLVASHLSQIGSTLSSESRVEAFSR-LAKGVLLSSTAKSVEEWLGH 127 Query: 109 WGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR 168 G A V ++VD++ F+GS FVPL + L +SQ G + D + N+G+GQR Sbjct: 128 IGKAQVKLQVDDKNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYL 187 Query: 169 GNWLVGYNTFYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRM 225 +W++GYN F D + N +R G G E +Y++LSAN Y W T ++R Sbjct: 188 SDWMLGYNVFLDQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERA 247 Query: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVT 285 A GYD+ +P+Y L + EQYFG+ V LF NP AL+ ++YTP PLV Sbjct: 248 ASGYDIRTEAYLPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPLVN 307 Query: 286 VTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 + +H G SG+N+ + L +NY P +KQ+ V +++L GSR D RNN L Sbjct: 308 LALEHTIGNSGKNKTGVNLAVNYEINTPWQKQIDPAAVKATRTLAGSRMDLVDRNNNIVL 367 Query: 346 EYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSA 405 EYR+++ +T+ L K + +P+ +R+G+ ++ W +++ Sbjct: 368 EYRKQQVVTLNLPAKV-SGKEKQVLPINYTFNARHGLDRIEWDA-ADVINAGGNISDQGN 425 Query: 406 EGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + + P + +G N + L+ D +G SN + + + P Sbjct: 426 LAYHITFPPY--IDGGDNAYVLAGRAVDKKGNYSVSNSTNIYVTGVNINSVKSTITLTP 482 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 456 bits (1174), Expect = e-127, Method: Composition-based stats. Identities = 112/420 (26%), Positives = 191/420 (45%), Gaps = 14/420 (3%) Query: 34 NPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDA 93 N NN G + D ++ G +D R Sbjct: 121 NRASQNNKNNAGAGSLTKEQDPMDSL--SIRGVGSALAASGRVDA-------LHHMARTM 171 Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 + VN + WL+ +G A + + D + S W +PL D+ ++Q G +D Sbjct: 172 ATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLLPLYDSQTLTLFTQQGFRNKD 231 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 ++N+G+G R+ W++G N FYDN + +R G GAE W + +LSAN Y A Sbjct: 232 RRNIANIGIGTRFIHHEWMMGGNAFYDNDFTGDNKRVGLGAELWTDSFQLSANGYFRLTA 291 Query: 214 WHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 WH+ +R A G DL A +P HL S+ E YFGD V LF NP A Sbjct: 292 WHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYEHYFGDNVALFGKDHLQRNPYA 351 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 ++LG +YTP L+T+ + + G+ G LGL++NYR G L QL + ++++ Sbjct: 352 ITLGGSYTPFSLLTLEVKQRLGKQGNQDTQLGLHINYRLGADLPAQLDPAALVAARTIAK 411 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDT 391 +RYD +RN+ L+Y++++ L + +T + PG + + ++ S+YG+R L W Sbjct: 412 TRYDLVERNHNIVLQYQEQQRLKIK-STEYLEGYPGNSSEIYAEVVSKYGVRNLQWMNVA 470 Query: 392 QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEP 451 ++ + P + + N + + V+ D +GQ + + +++++P Sbjct: 471 AFVAAGGQIMELPNNRLKITYPPYN--DNGDNRYHIDVMAYDTRGQSSNISTTQISVLKP 528 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 454 bits (1168), Expect = e-126, Method: Composition-based stats. Identities = 125/440 (28%), Positives = 208/440 (47%), Gaps = 29/440 (6%) Query: 35 PFDNNNDGLPDLGM-----APENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGK 89 P ++ LP LG+ A + EK F + + N+N + A+ Sbjct: 84 PSVDHRRALPTLGIKETSQAKQVESAEKQFVQGATQIAQGLANNNATEA-------AINY 136 Query: 90 VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGL 149 R+ +NQ + WL+ +G A V + + G +PL D L +SQ+G+ Sbjct: 137 ARNRGEGLLNQKISDWLNQYGKARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQIGI 191 Query: 150 TQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQ 209 + +N+G+G R + NW+ G N+FYD + R G G E W YL+L+ N Y Sbjct: 192 RANEQRSTTNLGLGYRQYQQNWMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVNGYF 251 Query: 210 PFAAWHEQ----TATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNS--- 262 WH+ ++R A G+DL A +P Y HL EQYFGD V L ++ Sbjct: 252 RLTDWHQSFLHEMRDYDERPANGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHNPTA 311 Query: 263 GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGE 322 NP A++ GL+YTP PL+T+ Q QG+S N + +G+ YRFG+PL QL+ Sbjct: 312 KDLKDNPSAVTFGLSYTPFPLLTLKTQVSQGDS--NDSLIGMEFAYRFGIPLAAQLNPDN 369 Query: 323 VAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI-RSRYG 381 V +SL G+RYD RN ++YR+++ L + L + +T+ +K + +++YG Sbjct: 370 VDLMRSLAGNRYDFVDRNYNIVMQYRKQEILAISLPDSAMA-EAAQTIAIKATVQKAKYG 428 Query: 382 IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSS 441 + +++W ++L+ S L++P + S + LS V DN+G + + Sbjct: 429 LNKILWSAP-ELLAKGGKINETSTTTIDLVLPAYDEDNQGSKAYTLSAVGVDNEGNKSKA 487 Query: 442 NEITLTLVEPFDALSNDELR 461 + + + + D + L Sbjct: 488 AVMVIHVTQSKDGFAYFTLE 507 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 452 bits (1163), Expect = e-125, Method: Composition-based stats. Identities = 126/441 (28%), Positives = 215/441 (48%), Gaps = 27/441 (6%) Query: 15 LLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDN 74 L+ AGG A + + + + +N L A A+ G + + Sbjct: 132 LVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYA----------AQQAASLGSQLQSRS 181 Query: 75 GLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVP 134 G+ AK ALG + S Q +++WL +G A V+++ N GS + +P Sbjct: 182 L--NGDYAKDTALGIAGNQASSQ----LQAWLQHYGTAEVNLQSGNNF--DGSSLDFLLP 233 Query: 135 LQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGA 194 D+++ L + Q+G D+ +N+G GQR+ ++GYN F D + R G G Sbjct: 234 FYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGG 293 Query: 195 EAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQY 252 E W +Y + S N Y + WHE ++R A G+D+ +P Y L + EQY Sbjct: 294 EYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQY 353 Query: 253 FGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGV 312 +GD V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F Sbjct: 354 YGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDK 413 Query: 313 PLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPL 372 P +Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T + Sbjct: 414 PWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDI-NGTERSTQKI 472 Query: 373 KLQIRSRYGIRQLIWQGDTQILSLTPGAQ---ANSAEGWTLIMPDWQNGEGASNHWRLSV 429 +L ++S+YG+ +++W D+ + S Q + SA+ + I+P + +G SN ++++ Sbjct: 473 QLIVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAY--VQGGSNVYKVTA 529 Query: 430 VVEDNQGQRVSSNEITLTLVE 450 D G ++ +T+T++ Sbjct: 530 RAYDRNGNSSNNVLLTITVLS 550 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 450 bits (1159), Expect = e-125, Method: Composition-based stats. Identities = 120/357 (33%), Positives = 180/357 (50%), Gaps = 8/357 (2%) Query: 83 KAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY- 141 K A + + L+ Q+ + + WLS +G A +++ VD+ G S VP D+ + Sbjct: 121 KKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHW 180 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 L +SQ G +D+ N+G+G R NW+ G NTFYDN L N R G E W YL Sbjct: 181 LVYSQYGYRHKDSRDTVNLGIGTRLFINNWMYGANTFYDNDLTGNNSRFSLGGELWTNYL 240 Query: 202 RLSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 ++SAN Y + WH +R A GYDL A M +P L + EQYFGD V L Sbjct: 241 KMSANAYFRLSDWHNARDLVNYYERPANGYDLIADMYLPSMPSLGAKIKYEQYFGDNVAL 300 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F +P A ++G+NYTP+PL+T +K G+ G++ N+NYRFGVPL +QLS Sbjct: 301 FGKNKRQKDPYAATIGVNYTPIPLITAGIDYKLGKEGKSDGIFSFNVNYRFGVPLSEQLS 360 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEY-RQRKTLTVFLATPPWDLKPGETVPLKLQIRS 378 V+ +SL GSRYD +RNN L Y ++++ + + GE P+ QI+S Sbjct: 361 PENVSSLRSLAGSRYDLVERNNNIILNYLKKQQHFRLLVPVIEISSYGGEVKPI--QIQS 418 Query: 379 RYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQ 435 + + W S G+T+ +P++Q N + ++ +D+Q Sbjct: 419 DTPFKNVTWDIPELFQKNGGMINIESTHGYTIQLPEYQ--PDGKNDYTITGTSKDDQ 473 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 449 bits (1156), Expect = e-125, Method: Composition-based stats. Identities = 131/481 (27%), Positives = 214/481 (44%), Gaps = 49/481 (10%) Query: 15 LLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDN 74 ++ + A + N+ LPDLG D + + + +K+ G + ++ Sbjct: 7 CIILTFISGAAFAAPEI----NVKQNESLPDLGSQAAQQDEQTNKGKSLKERGADYVINS 62 Query: 75 GLD-----TGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRG 129 T E K+ A ++ ++ ++E LSP+G ++ + G GS Sbjct: 63 ATQGFENLTPEALKSQARSYLQSQITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSI 122 Query: 130 SWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQR 189 +FVP DN + +SQ ++++ + N+G+G R+ +L+G N FYD +R Sbjct: 123 DYFVPWYDNQTTVYFSQFSAQRKEDRTIGNIGLGVRYNFDKYLLGGNIFYDYDFTRGHRR 182 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSV 247 G GAEAW +YL+ S N+Y P + W + E+R ARG+D+ A +P Y L + Sbjct: 183 LGLGAEAWTDYLKFSGNYYHPLSDWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKI 242 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 EQY+G+ V LF + + +P A++LG+ Y PVPL+ V K G ++ LN Sbjct: 243 VFEQYYGNEVALFGTDSLEKDPFAVTLGVKYQPVPLIVVGTDFKAGTGDNTDLSVNATLN 302 Query: 308 YRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFL---------- 357 Y+FGVPLK QL +V+ + SL GSR+D +RNN LEY+++ L V L Sbjct: 303 YQFGVPLKDQLDPDKVSAAHSLMGSRHDFVERNNFIVLEYKEKDPLYVTLWLKADVTNEH 362 Query: 358 ATPPWDLKPGETV-------PLKLQIRSRYGIRQLIWQGDTQILS--------------- 395 P E + + I Y I WQ S Sbjct: 363 PECVIKDTPEEAIGLEKCKWTINALINHHYKIVAASWQAKNNAASWQAKNNAARTLVMPV 422 Query: 396 -LTPGAQANSAEGWTLIMPDWQNGEGAS-----NHWRLSVVVEDNQGQRVSSNEITLTLV 449 + W L++P WQ + N WR+ + +ED +G R +S + +T+ Sbjct: 423 IKENTLTEGNNNHWNLVLPAWQYSSDQAEQEKLNTWRVRLALEDEKGNRQNSGVVEITVQ 482 Query: 450 E 450 + Sbjct: 483 Q 483 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 443 bits (1140), Expect = e-123, Method: Composition-based stats. Identities = 134/462 (29%), Positives = 203/462 (43%), Gaps = 31/462 (6%) Query: 10 PFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHD-------GEKHFAEI 62 PF+ L + N ST A + LPDLG + D E + A Sbjct: 79 PFFAPSLPSEAPLNG-STTPLFAPEETSKSITELPDLGSIQNDIDVNNKLPVTEDNVASA 137 Query: 63 VKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEG 122 NDN E A V + +Q WL +GNA V + ++ G Sbjct: 138 ATQLWGIMGNDNSSRAAESA-------VTGVAAGLASQAAADWLGQYGNARVQLNSNSIG 190 Query: 123 HFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNL 182 + +PL + L + QLG+ +NVG+G R +W+ G NTFYD Sbjct: 191 N-----ADVLIPLTETQNNLLFGQLGVRYNGERTTNNVGLGVRSFTDSWMFGVNTFYDYD 245 Query: 183 LDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ----TATQEQRMARGYDLTARMRMP 238 L R G G EAW + L+ SAN Y WH+ +R A G+D+ A +P Sbjct: 246 LTGKNSRLGVGGEAWTDNLKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYLP 305 Query: 239 FYQHLNTSVSLEQYFGDRVDLFNS----GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 Y L + E+YFG V L + +P A ++GLNYTP+PL TV HK+G+ Sbjct: 306 SYPQLGGRLMYEKYFGKGVALNSGSTSPDDLGDSPSAFTVGLNYTPIPLFTVDVAHKKGQ 365 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 + N+ LGLN NYRFGVP Q++ V +SL GSRYD RN ++Y ++ + Sbjct: 366 NTNNELQLGLNFNYRFGVPWVDQINKNAVGLMRSLMGSRYDIVDRNYNIVMQYEKQDLIK 425 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPD 414 + L + L I ++YG ++ W +++ + E ++ +P Sbjct: 426 LTLP-ETLAAYAITNLSLTGNITAKYGAERMEWSAPA-LMAAGGSIIPLTMESASVTLPP 483 Query: 415 WQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 +Q + A N +++S V D +G R ++ TL + E +S Sbjct: 484 YQQVQTA-NSYQISAVAYDVRGNRSNTATTTLVVQESPQQIS 524 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 119/464 (25%), Positives = 211/464 (45%), Gaps = 27/464 (5%) Query: 2 SRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDG---EKH 58 S+ P++ P LLL A + A + D LP LG +G E Sbjct: 6 SKLKPKL-PNSLLLSTAIWSTAILPMVPSYAQ---IVHLDDLPTLGGQAIQFEGTQPEDS 61 Query: 59 FAEIVKDFGETSMN-----DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNAS 113 + ++G+ + N N + + A+ +A K + + ++ WLS GNA Sbjct: 62 TERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEIT----HWLSKAGNAR 117 Query: 114 VDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLV 173 +++ +D + S+ W VP + L +SQ + + D L +N G+G R + N ++ Sbjct: 118 LNINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLRHFQQNSMI 177 Query: 174 GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT---ATQEQRMARGYD 230 G N F+D+ L R GFG E +Y+R+SAN Y + W + R A G+D Sbjct: 178 GVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDYNARPANGWD 237 Query: 231 LTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQH 290 + +P Y +L ++ LEQY+GD V LF +P+A ++G+N++P PL+ + A+H Sbjct: 238 IQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPFPLLAINAEH 297 Query: 291 KQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQR 350 K G SG N+ N + N+ G L + L VA ++ + +RYD RNN LEY+++ Sbjct: 298 KIGNSGTNETNAKVAFNWLLGRSLAQHLDTSAVAATRHISTNRYDFINRNNNIVLEYQKK 357 Query: 351 KTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTL 410 +++ L GE + + + ++Y + +++ + L G + ++ Sbjct: 358 SLISLSLP-KVIQGMTGEELSIIRNLTTKYPLEKIV--IEAPELIAAGGEIHLNGRESSV 414 Query: 411 IMPDWQ-----NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 +P ++ N +RL+V D G E + + Sbjct: 415 KLPSYKIANHSFKNSQLNLYRLTVTAYDINGNVSPQAETLIEVT 458 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 441 bits (1135), Expect = e-122, Method: Composition-based stats. Identities = 114/371 (30%), Positives = 193/371 (52%), Gaps = 9/371 (2%) Query: 91 RDALSQQVNQHVESWLSPWG-NASVDVKVDNEGHFTGSRGSWFVPLQDN-DRYLTWSQLG 148 +D L+ ++ E+WL+ +G ++ V + G +PL ++ + ++ +SQLG Sbjct: 133 QDQLNTLASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMIFSQLG 192 Query: 149 LTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFY 208 + D+ N+G+G R+ W++G N F+DN + +R G GAE + LRL+AN Y Sbjct: 193 VRHADDRTTGNIGLGARYFGEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRLAANGY 252 Query: 209 QPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGY 266 WH+ A ++R A G+D+ +P Y L V EQY+GD V L + G Sbjct: 253 FKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALISRGRLQ 312 Query: 267 HNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAES 326 HNP A +LG+N+TP+PLV++ A H+ +GLNLN+ FG L LS V Sbjct: 313 HNPSAATLGVNWTPIPLVSIDAGHRMSMQRGEDTTVGLNLNWNFGRSLDWHLSPDAVETQ 372 Query: 327 QSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI 386 +SL GSRYD RNN ++YR++ +T LA ++ T L + + +++G+ +++ Sbjct: 373 RSLAGSRYDLVSRNNEIVMDYREQTVITFSLANAIQGVES-TTHSLGVSVWAKHGLGKIV 431 Query: 387 WQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITL 446 W D L G S L++P ++ +GA N + LS + DN+G+ ++ + Sbjct: 432 W--DDATLVNAGGKIVGSGANSVLVLPAYK--DGADNRYTLSAIAYDNKGKASPRAQVQI 487 Query: 447 TLVEPFDALSN 457 T+ + + + Sbjct: 488 TVEKAQQVVPD 498 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 439 bits (1128), Expect = e-121, Method: Composition-based stats. Identities = 128/430 (29%), Positives = 219/430 (50%), Gaps = 13/430 (3%) Query: 31 KAANPFDNNND-----GLPDLG-MAPENHDGEKHFAEIVKDFGETSMND-NGLDTGEQAK 83 N F + LP LG E + + ++K N+ N +AK Sbjct: 115 YQGNSFVKKENIKIYHDLPTLGHNQNEQVNHDIDVYNMIKPLIHKDWNNINREKIKSEAK 174 Query: 84 AFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLT 143 + R+ L Q+V+++ +G +++ VDN+G F SR P N+ ++ Sbjct: 175 FYIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSHVL 234 Query: 144 WSQLGLTQQDNGLVSNVGVGQRWARGNWLV--GYNTFYDNLLDENLQRAGFGAEAWGEYL 201 +SQLG Q + + ++G+GQR+ + + GYN F D LD+ +R G EA Y Sbjct: 235 FSQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASNYF 293 Query: 202 RLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 +LS N+Y P W + ++ +R A G+D+ + +P Y L + EQYFG V L Sbjct: 294 KLSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEVAL 353 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 FN NP A+S+G++Y P PL ++ HK G++ + LGL LNY+FG PL QL Sbjct: 354 FNKTKRQKNPKAVSIGIDYRPFPLASIYVDHKLGQNHHRETKLGLTLNYQFGTPLSSQLD 413 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSR 379 + E+++L+ +R RN +EY++++ L+V L ++ G+ ++ I+++ Sbjct: 414 PNNLNEARNLKQNRLAPVDRNYNIVMEYKEKQLLSVDLPAMDKNILEGDIYVIRPLIKNK 473 Query: 380 YGIRQLIWQGDT-QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR 438 Y I+ + W GD Q+ + A NS GW +I+P+W + + A N +RL++ +ED +G + Sbjct: 474 YPIKTVSWLGDVSQLSLSSSSADKNSPVGWKIILPEWNSEKDAKNTYRLAIQIEDTKGHQ 533 Query: 439 VSSNEITLTL 448 SN + + + Sbjct: 534 AISNYMDIVV 543 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 430 bits (1105), Expect = e-119, Method: Composition-based stats. Identities = 124/450 (27%), Positives = 210/450 (46%), Gaps = 31/450 (6%) Query: 25 QSTFEQ--KAANPFDNNNDGLPDLG---MAPENHDGEKHFAEIVKDFGETSMNDNGLDTG 79 Q +F + K + +N LP+LG + PE ++ E+ FA G+ +DN +D Sbjct: 70 QDSFPENIKNNDNVENITKYLPNLGSTKILPEENNNEEKFASSFTLMGDILSSDNFVDNS 129 Query: 80 EQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND 139 + + VNQ + WL+ +G A + D G + +P+ D Sbjct: 130 -------INYAKSIGQGLVNQQINDWLNQYGKARISFSSD-----KNISGDFLLPVIDEP 177 Query: 140 RYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGE 199 L ++QLGL + N+G+G R NW+ G NTFYD R G G EAW + Sbjct: 178 NNLLFTQLGLRNNTDRNTINLGLGYRKYWRNWMFGINTFYDYDYTGGNARLGVGGEAWID 237 Query: 200 YLRLSANFYQPFAAWHEQT----ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGD 255 YL+L+ N Y WH+ ++R A G+D+ A +P Y L +S+ E+YFG Sbjct: 238 YLKLAINGYFGLTDWHQSKISVMDDYDERPATGFDVRAEAYLPKYPQLGSSIKYEKYFGK 297 Query: 256 RVDL---FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGV 312 + L N + +L +GLNYTP+PL+T+ A+ G+ N + L++NYRFGV Sbjct: 298 GIHLGTGVNPEYLKDDAQSLIMGLNYTPIPLLTLKAERSIGD--RNDTKISLDVNYRFGV 355 Query: 313 PLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPL 372 PL +QL+ V +SL G++YD RN ++YR++ L +FL + +T + Sbjct: 356 PLSQQLNPDAVDVMRSLVGNKYDFVDRNYDIVMQYRKQDLLNIFLPREIV-GEARDTHRI 414 Query: 373 KLQIR-SRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASN---HWRLS 428 + + ++YG++ + W D +++ + S + P + + +N + +S Sbjct: 415 NVTVNKTKYGLKNIKWIIDPKLIEDKGHFKQISQTEGIITFPIYNSLNEKNNLPAEYYIS 474 Query: 429 VVVEDNQGQRVSSNEITLTLVEPFDALSND 458 + DN G + + + + S D Sbjct: 475 AIGTDNNGNESNKATTIIRVNRSTNDFSGD 504 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 416 bits (1069), Expect = e-114, Method: Composition-based stats. Identities = 255/323 (78%), Positives = 283/323 (87%) Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 +TWSQLGLTQQ +GLVSNVG+GQRWA+ WL+GYNTFYDNLLDENLQRAGFGAEAWGEYL Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRWAQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEYL 60 Query: 202 RLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFN 261 RLSAN+YQPFA W TAT EQRMARGYD+ A++R+PFYQH+NTSVSLEQYFGD VDLF+ Sbjct: 61 RLSANYYQPFADWQTHTATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVDLFD 120 Query: 262 SGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAG 321 SGTGYHNPVAL LGLNYTPVPL+T+TAQHKQGESG +QNNLGL LNYRFGVPLKKQL+A Sbjct: 121 SGTGYHNPVALKLGLNYTPVPLLTMTAQHKQGESGVSQNNLGLTLNYRFGVPLKKQLAAS 180 Query: 322 EVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG 381 EVA+SQSLRGSRYD PQRN+LPT+EYRQRKTLTVFLATPPWDL PGETV LKLQ+RS +G Sbjct: 181 EVAQSQSLRGSRYDTPQRNSLPTMEYRQRKTLTVFLATPPWDLTPGETVALKLQVRSVHG 240 Query: 382 IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSS 441 IR L WQGDTQ LSLT G S EGWT+IMP W + EGA+N WRLSVVVED +GQRVSS Sbjct: 241 IRHLSWQGDTQALSLTAGTDTRSTEGWTIIMPAWDHREGAANRWRLSVVVEDEKGQRVSS 300 Query: 442 NEITLTLVEPFDALSNDELRWEP 464 NEITL L EPF + +D W+P Sbjct: 301 NEITLALTEPFITMPDDNPHWQP 323 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. Identities = 118/402 (29%), Positives = 193/402 (48%), Gaps = 15/402 (3%) Query: 58 HFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVK 117 A K + + ND+ + + +K+ AL + + G ++ Sbjct: 69 KSAMSGKRWLQHQTNDDVMQGSDISKSGIADMGFAALQPETEKSA-------GEVRANLP 121 Query: 118 VDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNT 177 + + G T F PL D D L + Q+G + D + N+G+GQR+ +G+W +GYNT Sbjct: 122 LSD-GKLTSGSIDLFYPLYDGDSRLFFGQVGARRFDGRNIVNLGIGQRYFQGDWALGYNT 180 Query: 178 FYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT--QEQRMARGYDLTAR 234 FYD + N QR GFG E W +YL LSAN Y W+ +A +R A GYD+ A+ Sbjct: 181 FYDIQISGNAHQRLGFGLEYWRDYLYLSANGYFGLTDWYSSSALDGYAERAANGYDIRAQ 240 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 P Y L+ + EQYFGD + L N Y NP AL++GL YTP+ L+++ Sbjct: 241 GWFPVYPQLSGKLKFEQYFGDDIALLNHQNRYKNPYALTMGLEYTPIQLISLGIDRTFSH 300 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 G++ + L+ NY+ GVPL +Q+ ++L +RY +RNN L++R+R L+ Sbjct: 301 RGKDDTKVNLSFNYQLGVPLSQQIDPTVAPVKRTLADNRYHLVERNNNIVLKHRERAQLS 360 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPD 414 ++L T GE + +Y ++ + W D + + A S + + P+ Sbjct: 361 LYLPTG-LSGFGGERKLINFSFNGKYRLKHIQWN-DGALRARGGRIIALSNNSYVVQFPN 418 Query: 415 WQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 + + SNH +S V D QG +S+E+ + + P + Sbjct: 419 YSRQQ--SNHITISAVAHDEQGNVSNSSEMGVLINVPVALSA 458 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 119/395 (30%), Positives = 177/395 (44%), Gaps = 28/395 (7%) Query: 86 ALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWS 145 A + V+ WLS +G A V + VD+ G++ S PL DN + + ++ Sbjct: 75 AKSAATGMATSAAASSVQQWLSQFGTARVQLNVDDNGNWDDSAVDLLAPLYDNKKAVLFT 134 Query: 146 QLGLTQQDNGLVSNVGVGQRWAR-GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLS 204 QLGL D N+G+G R NW+ G N F+D+ +R GFGAEAW YL+LS Sbjct: 135 QLGLRAPDGRTTGNLGMGVRTFYLENWMFGGNVFFDDDFTGKNRRVGFGAEAWTNYLKLS 194 Query: 205 ANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNS 262 AN Y WH ++ A GYD+ A +P Y L + EQY+GD+V LF++ Sbjct: 195 ANTYVGTTNWHSSRDFTDYNEKPADGYDIRAEGYLPAYPQLGAKLMYEQYYGDKVALFDT 254 Query: 263 GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGE 322 NP A++ G++YTPVPLV + +K+G+ + +N Y FG + Q+ Sbjct: 255 DHLQSNPSAVTTGISYTPVPLVQLAVDYKRGQDSMDDTQFQVNFRYDFGHDWRYQIDPEN 314 Query: 323 VAESQSLRGSRYDNPQRNNLPTLEYRQRK-------TLTVFLATPPWDLKPGETVPLKLQ 375 V +SL GSRYD +RNN L+Y+++ TL P D T+ + Sbjct: 315 VKAERSLAGSRYDLVERNNQIVLQYKKKDEQGVSKLTLQTVADNAPADGLTPNTLQVLAT 374 Query: 376 IRSRYGIRQ--LIW--QGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVV 431 S +R + W GD ++ SLT A L N +V V Sbjct: 375 NSSNEPVRNASIAWSTSGDAKLDSLTAVTNAQGIAVVNLT-----------NTSPATVQV 423 Query: 432 EDNQGQRVS---SNEITLTLVEPFDALSNDELRWE 463 G + S+ ++T+ AL D + Sbjct: 424 TAKSGNVSAMQDSHFNSVTVSHLILALDKDGSVAD 458 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 407 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 107/382 (28%), Positives = 184/382 (48%), Gaps = 18/382 (4%) Query: 74 NGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKV--DNEGHFTGSRGSW 131 + L +G +A GK+ SQ +Q +E WL +GNA + + DN GS Sbjct: 24 SLLKSGPAFDQYAAGKI----SQLTSQAIEGWLKQYGNARITLNAQSDNSTALAGSSADL 79 Query: 132 FVPLQDNDRYLTWSQLGLTQQD-NGLVSNVGVGQRWA-RGNWLVGYNTFYDNLLDENLQR 189 L + D L + Q QD ++ NVG+GQR+ ++GYN FYD ++ + R Sbjct: 80 LFGLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNINSGVSR 139 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQ--TATQEQRMARGYDLTARMRMPFYQHLNTSV 247 +G G E W +Y + S N Y + W +++ A GYD+ +P Y L + Sbjct: 140 SGVGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQLGGHL 199 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 EQYFGD V LF++ +P A+++G++YTP+PL+T +K+G + ++ +N Sbjct: 200 KYEQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPLITFALDYKKGNDSLDDTSISAAIN 259 Query: 308 YRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPG 367 Y GVP +Q+S+ V +SL GSR+D RNN ++YR++ + + L T + + Sbjct: 260 YAIGVPWSQQISSDYVQTRRSLVGSRFDFVSRNNDIVMQYRKQDVIKLILPT-QLNGQAT 318 Query: 368 ETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANS-AEGWTLIMPDWQNGEGASNHWR 426 + +PL + ++ G+ + W + +L S A +T+ +P + + Sbjct: 319 QQLPLVATVEAKNGLDHIQWDSSSSLLQAGGTVIPGSDATHFTVSLPA------TAGQYV 372 Query: 427 LSVVVEDNQGQRVSSNEITLTL 448 L+ DN +S + + Sbjct: 373 LNGTAYDNHHNASNSAQTRFIV 394 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 403 bits (1035), Expect = e-111, Method: Composition-based stats. Identities = 107/340 (31%), Positives = 158/340 (46%), Gaps = 15/340 (4%) Query: 87 LGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQ 146 + VE WLS +G A V++ D G++ S + PL DN + + ++Q Sbjct: 48 KSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNKKSVLFTQ 107 Query: 147 LGLTQQDNGLVSNVGVGQRWAR-GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSA 205 LGL D N+G+G R NW+ G N F+D+ +R G GAEAW +YL+L+A Sbjct: 108 LGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWTDYLKLAA 167 Query: 206 NFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 N Y WH ++ A G+D+ A +P Y L V EQY+G+ V LF+ Sbjct: 168 NSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGENVALFDKD 227 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV 323 ++P A+++GLNYTP+ LVT +K+G+ ++ LN Y G +Q SA +V Sbjct: 228 HLQNDPSAVTMGLNYTPISLVTAGIDYKRGQDSQDDVKFSLNFRYAIGESWSQQTSADQV 287 Query: 324 AESQSLRGSRYDNPQRNNLPTLEYRQRK--------TLTVFLATPPWDLKPGETVPLKLQ 375 A +SL GSRYD RNN L+Y+++ TL P D V L+ Sbjct: 288 ALRRSLAGSRYDLVNRNNEIILQYKKKDAELVLADMTLVATKDHSPADGTTANMVTLQAI 347 Query: 376 IRSRYGI--RQLIW--QGDTQILSLTPGAQANSAEGWTLI 411 + + W G Q+ S AN +L Sbjct: 348 TSDHKPVPGATIAWAVTGGAQLSSKNSVTDANGDASVSLT 387 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 398 bits (1024), Expect = e-109, Method: Composition-based stats. Identities = 108/340 (31%), Positives = 160/340 (47%), Gaps = 15/340 (4%) Query: 87 LGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQ 146 + VE WLS +G A V + VD+ G++ S + PL DN + + ++Q Sbjct: 75 KNTATHLATTHAASTVEEWLSHFGTAQVTLDVDDNGNWDNSAFDFLAPLYDNKKSVLFTQ 134 Query: 147 LGLTQQDNGLVSNVGVGQRWAR-GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSA 205 LG+ D N+G+G R +W+ G N F+D+ +R GFGAEAW YL+LSA Sbjct: 135 LGIRAPDGRTTGNIGLGVRTFYVRDWMFGGNVFFDDDFTGENRRIGFGAEAWTNYLKLSA 194 Query: 206 NFYQPFAAWHEQ--TATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 N Y + WH ++ A GYD+ A +P + L + EQY+GD V LF+ Sbjct: 195 NTYIGTSQWHNSGDFDNYNEKPADGYDVRAEGYLPSFPQLGAKLMYEQYYGDNVALFDKD 254 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV 323 NP A+++GLNYTPVPL+T +K+G+ ++ LN +Y + Q+S +V Sbjct: 255 HLQSNPSAVTVGLNYTPVPLITAGIDYKRGQDSMDEMKFSLNFHYALDSSWQSQISPEQV 314 Query: 324 AESQSLRGSRYDNPQRNNLPTLEYRQRK--------TLTVFLATPPWDLKPGETVPLKLQ 375 A +SL GSRYD RNN L+Y+++ TL P D +TV L Sbjct: 315 ATRRSLAGSRYDLVDRNNEIILQYKKKATSKAVADMTLATIKNNSPADGTSADTVTLHAV 374 Query: 376 IRSRYGIRQ--LIW--QGDTQILSLTPGAQANSAEGWTLI 411 ++W G+ + S AN L Sbjct: 375 TADGKPAAHAAIVWTVSGNAALSSTNSVTDANGNTSVNLT 414 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 397 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 113/412 (27%), Positives = 188/412 (45%), Gaps = 13/412 (3%) Query: 62 IVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNE 121 + GE EQ + ++ ++ N + S L G A V +D++ Sbjct: 160 SIAASGEQVPTSASRYGSEQEVQYWRQQLATQFEEEANAYAASLLGAMGTARTRVTLDDD 219 Query: 122 GHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQ-DNGLVSNVGVGQRWARGNWLVGYNTFYD 180 + + +PL + + L ++Q GL + + ++N+GVGQR W++GYN F D Sbjct: 220 FNMVTAEADLLLPLAEEQQTLLFTQFGLRRNGQDRTIANLGVGQRHFLDRWMLGYNLFAD 279 Query: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQ--EQRMARGYDLTARMRMP 238 L RAG GAEAW +YL+L ANFY P ++W + + E+R ARG D+ +P Sbjct: 280 YDLTNRHWRAGVGAEAWRDYLKLGANFYTPLSSWRDSPRFEGMEERAARGMDVRLEAYLP 339 Query: 239 FYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGEN 298 Y + S++ EQY G+RV L ++ +P A++ GL+Y P PL+ + + + ++ Sbjct: 340 AYPQWSASLTAEQYLGERVGLLDADQLERDPHAITAGLHYNPFPLLKMDVEQVEASGRQH 399 Query: 299 QNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLA 358 L L ++ G L L+ V +SL G R+D +RNN LEYR + L L Sbjct: 400 DTRFTLGLEWKLGATLWDMLNPSSVD--KSLAGMRHDLIERNNDMVLEYRDKVLLKASL- 456 Query: 359 TPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ-N 417 + G+ + L L I+ I + W GD LS A + L +P Sbjct: 457 NDQYSAVEGQALTLTLNIQHSRQIASIQWLGDVLGLSGLSPADTAGQDKRALTLPSLPTY 516 Query: 418 GEGASNHWRLSVVVEDNQGQRVSSNEITLTL-----VEPFDALSNDELRWEP 464 G SN + + +V D G + + + + ++P L+ ++ P Sbjct: 517 RIGQSNQYPVVAIVTDIDGHEAIAEGV-VAVSEDSGLQPAIQLAEHFVQLLP 567 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 112/432 (25%), Positives = 192/432 (44%), Gaps = 22/432 (5%) Query: 45 DLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVES 104 D + + + A G N + AK + + + ++ Sbjct: 41 DSSVDKTDQPEAEWLASRASSLGSLLQEGN---ISDFAKNQIQALPQTIANDGITSGIKH 97 Query: 105 WLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN-----GLVSN 159 WL P + +++ + + +PL + + + QLGL DN N Sbjct: 98 WL-PEAQFRGGITLEDASKYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVN 156 Query: 160 VGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA 219 G+G R G+WL+G N+F D + + R G E + + + L+ N+Y P + W Sbjct: 157 TGIGWRQDVGDWLLGINSFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKV 216 Query: 220 --TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLN 277 ++R A G D+ + +P ++ EQYFGD+VD+ + + +P A + + Sbjct: 217 QPLHDERPATGIDVRLKGALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAIT 276 Query: 278 YTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNP 337 + PVPLV + A +K S +Q GLNLNY FGVPL+ QL +V + S +R Sbjct: 277 WKPVPLVEIKAGYKDAGSSGSQTEAGLNLNYTFGVPLRAQLDPSQVRPA-SNTTNRTAFV 335 Query: 338 QRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLT 397 RN +EYR++ + A+P + + G+TV L I SRY + ++ W GD +++ Sbjct: 336 DRNYNIVMEYREQASRIRVYASPV-NGQSGDTVTLSATINSRYPVERIEWTGDAELI--- 391 Query: 398 PGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV------EP 451 G Q L +PD + + L + V D++G V+S I +T+ P Sbjct: 392 GGLQQQGNVNSGLRLPDLSLDVTENKEYSLYLKVTDSRGNSVTSERIPVTVSINPESFTP 451 Query: 452 FDALSNDELRWE 463 + + +DE+R E Sbjct: 452 YLNVLHDEVRRE 463 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 105/304 (34%), Positives = 161/304 (52%), Gaps = 13/304 (4%) Query: 98 VNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLV 157 VE WLS +G A V + VD++G++ S + PL D+ + + ++QLGL D+ + Sbjct: 74 AGDSVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLAPLYDSQKAMLFTQLGLRAPDDRVT 133 Query: 158 SNVGVGQRWA-RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 N G+G R NW+ G N F+D+ + +R GFGAEAW L+LSAN Y WH Sbjct: 134 GNFGLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGFGAEAWTNNLKLSANTYLGTTNWHS 193 Query: 217 QTATQE--QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSL 274 + ++ A G+D+ A +P Y L + EQY+GD+V LF+ NP A+++ Sbjct: 194 SRDFDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYEQYYGDKVALFDKDDLQSNPSAVTV 253 Query: 275 GLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRY 334 G++YTPVPL+T +++G+ ++ + G+N Y FG L QLS+ EV +SL GSRY Sbjct: 254 GVSYTPVPLITAAVDYRRGQDSMDETHFGVNFRYNFGQSLSSQLSSSEVQNLRSLAGSRY 313 Query: 335 DNPQRNNLPTLEYRQRK--------TLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQ-- 384 D +RNN L+Y+++K LT P D TV ++ +R Sbjct: 314 DLVERNNEIVLQYKEKKQNNAVADMLLTTVKDNSPADGVTANTVTVRATTSDGTPVRNTV 373 Query: 385 LIWQ 388 + W Sbjct: 374 ISWS 377 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 116/420 (27%), Positives = 184/420 (43%), Gaps = 22/420 (5%) Query: 41 DGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQ 100 +P +A +++ + A D +A GK++ Q N Sbjct: 19 SQIPLPVIADSDNEIQSWIAGTASSISPHLQEGTLED-------YAKGKIKALPGQAANH 71 Query: 101 HVESWLS---PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN--- 154 V + P V +++ + S F+P+Q+ L + QLG DN Sbjct: 72 LVNEGIKSAFPEIIFRGGVNLEDGAKYRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSF 131 Query: 155 --GLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 NVG+G R WL+G NTF D + + R G G E + + L S N+Y P Sbjct: 132 DGRTYVNVGMGYRQEVNGWLLGVNTFLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLT 191 Query: 213 AWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPV 270 W A ++R A G+DL + +P + + ++ EQY+GD+VDL +GT NP Sbjct: 192 GWKTSAAHELHDERPAYGFDLRTKGTLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPR 251 Query: 271 ALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLR 330 A L + PVPL+ V A ++ +G +Q GL +NY FG PL +QL V S Sbjct: 252 AAGADLVWNPVPLLEVRAGYRDAGNGGSQAEGGLRVNYSFGTPLHEQLDYRNVGA-PSNT 310 Query: 331 GSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGD 390 +R RN + YR++ + + + P G V L + SRY I ++ W GD Sbjct: 311 TNRRAFVDRNYDIVMAYREQAS-KIRITAMPVSGLSGTLVTLMATVDSRYPIEKVEWSGD 369 Query: 391 TQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVE 450 ++L+ G Q + G LI+P + L + V D++G RV+S I + + + Sbjct: 370 AELLA---GLQLQGSLGSGLILPQLPLTVTDGQEYSLYLTVTDSRGTRVTSERIPVRVTQ 426 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 324 bits (831), Expect = 4e-87, Method: Composition-based stats. Identities = 72/239 (30%), Positives = 112/239 (46%), Gaps = 10/239 (4%) Query: 48 MAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 +++ EK+ A + G + D + + + NQ ++ WL Sbjct: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDAT-------RNFITGMATAKANQEIQEWLG 113 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 +G A V + VD + S P+ D + ++Q + + D+ SN+G G R Sbjct: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF 173 Query: 168 RGN-WLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQR 224 GN W+ G NTF D+ L + R G GAE W +YL+LSAN Y + W + ++R Sbjct: 174 SGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQER 233 Query: 225 MARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPL 283 A G+D+ A +P + L S+ EQY+GD V LF +P A+S + YTPVPL Sbjct: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 264 bits (675), Expect = 5e-69, Method: Composition-based stats. Identities = 74/359 (20%), Positives = 150/359 (41%), Gaps = 26/359 (7%) Query: 10 PFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGET 69 PF + L AN + + ND D ++ E+ + ++++ G Sbjct: 261 PFETVYLENPTNANYYNENLKTQKA----LNDNKKDNNLSKEDQEFSNKVMKVIQTAGAI 316 Query: 70 SMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRG 129 +++ E K A + + ++ + ++S L+ N + F+G+ Sbjct: 317 YDSEDSKSKEEIVKNMASSYLNTSANELAKEFIDS-LNTSINTDFSFNYNERSGFSGNAK 375 Query: 130 SWFVPL--QDNDRYLTWSQLGLTQ-QDNGLVSNVGVGQRWA--------RGNWLVGYNTF 178 + P+ +DN + + Q G+ + ++ + + G G R+ GN ++G N+ Sbjct: 376 ALL-PIVSEDNPKISYFLQSGIGEFANDRTIGHFGGGIRYYPNATALNNSGNIMLGLNSV 434 Query: 179 YDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE----QRMARGYDLTAR 234 YD+ +R GAEA + L +AN YQ ++W + + +R A G+D + Sbjct: 435 YDHDFSRGHKRMSLGAEAMVDTLAFNANVYQRLSSWIDSYDFDKDYVQERPANGWDAKIK 494 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNS---GTGYHNPVALSLGLNYTPVPLVTVTAQH- 290 P +++ + Q++G++V +F + NP+ G++Y+P P +T T H Sbjct: 495 YAFPSLINVSFFAKMGQWYGNKVGIFGANSVDDLEKNPLIYEGGISYSPFPALTFTLSHS 554 Query: 291 KQGESGENQNNLGLNLNYRFGV-PLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYR 348 + ES + ++ N+N +K S ++ G+R R+ LEYR Sbjct: 555 RSAESSKKNTSINANINIPLDEKAMKLAFEPKLAGISNTIEGTRTQFIDRDYSMVLEYR 613 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 246 bits (629), Expect = 1e-63, Method: Composition-based stats. Identities = 73/315 (23%), Positives = 123/315 (39%), Gaps = 24/315 (7%) Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 +++G+G R + G N F+D L R GAE +Y S N Y P + Sbjct: 35 GRDFAHLGLGYRQLDDSQFFGVNVFFDYDLSRQHTRVSVGAEYGLDYGTFSTNAYFPLSN 94 Query: 214 WHEQTATQE------QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 W + E ++ A+G+DL +P ++ QY G V+ + Sbjct: 95 WKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLDTRWKFGLTAGQYLGRYVEHSDGSLPSK 154 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 NP SL + P P + ++ + + Q G+N + + L Q Sbjct: 155 NPYHFSLSTEFRPDPAWAFSLGYQTEQGAKEQWIAGIN----YSLSLSGLYEGERRLSQQ 210 Query: 328 SL--RGSRY-DNPQRNNLPTLEYRQRKTLTVFLATPPWDLK---PGETVPLKLQIRSRYG 381 SL + R D QR++ LEY ++K + + P L + + ++++ Sbjct: 211 SLLPKPERLTDFVQRDHNMVLEY-KQKFAEISIRLPESALVTELSQQMLSSWMEVKGGAD 269 Query: 382 IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSS 441 I WQGD + QA+S T I P ++ A+N LSV + GQ S Sbjct: 270 IVSYQWQGDAA--NYLNDIQASSP---TFIAPAYRY--DANNTLSLSVSYKLRSGQIKQS 322 Query: 442 NEITLTLVEPFDALS 456 N + +T+ + S Sbjct: 323 NTMKITVTDSKVLES 337 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 245 bits (625), Expect = 3e-63, Method: Composition-based stats. Identities = 93/418 (22%), Positives = 154/418 (36%), Gaps = 37/418 (8%) Query: 56 EKHFAEIVKDFGETSMNDNGLDTGEQA--KAFALGKVRDALSQQVNQHVESWLSPWGNA- 112 E A+ ++ + G + F + + + V Q V+ W + G Sbjct: 74 ETRVAQTIQALAQAREAGGARQDGRASLDGQFLRSQAQAQANVLVQQGVQ-WANETGLPW 132 Query: 113 --SVDVKVDNEGHFTGSRGSWFV--PLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR 168 ++ V + L + QLG Q++ N GV R A Sbjct: 133 LRRLEGNVSYDFSGRDVAVDVRTIDALHLDQDRALLLQLGGHNQNHRPTVNAGVVARSAA 192 Query: 169 GNWL-VGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRM 225 G+ L +G N F D + + R GAEA L N Y P + W ++ +E+R Sbjct: 193 GSSLILGGNAFLDYEVGKRHLRGSLGAEAVAAQFTLYGNVYAPLSGWKAAKRAERREERP 252 Query: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVT 285 A G+D+ R Q L + ++ G +VD F+ G NP G+ Y PVPL+ Sbjct: 253 AAGWDVGFTARPEAVQGLALNAQYFRWRGAQVDYFDDGRYRRNPSGFKYGIEYRPVPLIG 312 Query: 286 VTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL-RGSRY-DNPQRNNLP 343 V + + +SGE Q ++ L + G PL +QL G + RG+R D +R N Sbjct: 313 VGVEQARLQSGERQTSVQLGVRLNLGEPLSRQLRRGAQDTAPPFDRGARLQDFVRRENRI 372 Query: 344 TLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI-WQGDTQI----LSLTP 398 L+ R++K + + L P + + + W D + Sbjct: 373 VLDTRRKKIV-LALRIAEVRTDPATGRITVYGVTE--PLADVQLWLPDGTATSVRANAAG 429 Query: 399 GAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 G +A+SA T + + D G +S E+T + D + Sbjct: 430 GFEASSAGDMTSGL--------------IRARATDRYGD--TSQEVTYAYTDTVDKTA 471 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. Identities = 80/360 (22%), Positives = 128/360 (35%), Gaps = 13/360 (3%) Query: 26 STFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDT------- 78 + +A P G P L A + A + + + Sbjct: 51 GSILAQALLPLSALAQGAPTLRPARVAQEEAGQDAAWTRKLAAQAESLARRQAERQPGAR 110 Query: 79 --GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQ 136 G+ K A +V D L VN ES L N + D E T + + + Sbjct: 111 VDGDYLKREAQAQVNDVLRDGVNLARESGLPFLRNLQGGLSHDFESGRTSLQLNTIDEVY 170 Query: 137 DNDRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDENLQRAGFGAE 195 R QLG Q++ +N G R +VG N F D + R G E Sbjct: 171 RAGRNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLE 230 Query: 196 AWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYF 253 L N Y P + W ++ +E++ A G D+ R F L+ S + ++ Sbjct: 231 VIAPEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWN 290 Query: 254 GDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVP 313 G VD F++G +G+ Y PV LV+V + + G + + L LN P Sbjct: 291 GAEVDYFDNGRTQAGAKGFKVGVEYRPVSLVSVGLEQTKVIGGGRETRMQLGLNINLSEP 350 Query: 314 LKKQLSAGEVAE-SQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPL 372 L KQL + S R+ +R N L R+++ + + + L+ V + Sbjct: 351 LSKQLRRDASGTPAFSPDARRHALVERENRIVLNTRRKEIILPLVVSEVSTLQADGRVTV 410 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 239 bits (609), Expect = 3e-61, Method: Composition-based stats. Identities = 78/382 (20%), Positives = 130/382 (34%), Gaps = 44/382 (11%) Query: 25 QSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKA 84 + +AA P G P++ PE G+ +D +A+ Sbjct: 48 CLSLGMQAAAPLAVLAQGAPEMTNRPE--------------AGDIVPSDVLTQVAVRAQD 93 Query: 85 FALGKVRDALSQQVNQHVESWLSPWGNASVD--------------------VKVDNEGHF 124 A + QV+ +L G A + ++ D F Sbjct: 94 LARRQADRREGAQVDA---DYLKQQGQAQFNQFLQEGVRAANESGLRFLRNLQGDLRHDF 150 Query: 125 TGSRGSWFV----PLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFY 179 R S + + QLG Q+N +N+G R ++G N F Sbjct: 151 DNGRTSLELRTIDQVYRKGANTGLLQLGGHNQNNRPTANLGGVYRRDINERLMLGANAFL 210 Query: 180 DNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW--HEQTATQEQRMARGYDLTARMRM 237 D + R G EA N Y P + W ++ +E+R A G DL + Sbjct: 211 DYEFAKQHLRGSLGVEAIAPEFSFYGNVYAPMSGWTGAKRDNRREERPASGMDLGMKYSP 270 Query: 238 PFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGE 297 F L+ + ++ G VD F++G G+ Y PVPL+++ + + G Sbjct: 271 GFAPGLSLKANYFRWNGAAVDYFDNGRTQDRATGFKYGVQYKPVPLLSLGVEQTRVIGGA 330 Query: 298 NQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFL 357 +Q ++ L + PL KQL G +L R +R N L RQ+ + Sbjct: 331 SQTSVQLGVALNLSEPLSKQLRRGGETPVFNLDAHRNALVERENRIVLNTRQKLIILPLT 390 Query: 358 ATPPWDLKPGETVPLKLQIRSR 379 T + L Q +++ Sbjct: 391 VTTVLTDSVSGRITLVGQTQAQ 412 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 237 bits (606), Expect = 5e-61, Method: Composition-based stats. Identities = 61/243 (25%), Positives = 98/243 (40%), Gaps = 24/243 (9%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 V + +L T ++F A N N + A + Sbjct: 16 VTWSVIATQILSPVTFTLIPANSFASSANTESAQTN----------ANDEYANELASLAA 65 Query: 65 DFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 + G++ N+ A D LS Q + V WL +GNA + + VD Sbjct: 66 NAGQSLANNT-----------AGRFAVDTLSAQATKEVVDWLQQYGNARIKLNVDESFTL 114 Query: 125 TGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA-RGNWLVGYNTFYDNLL 183 + + P D+ Y+ +SQ L + D+ +N+G+G R N ++G N FYD L Sbjct: 115 KDAAFDFLYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFYDYDL 174 Query: 184 DENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARMRMPFYQ 241 + RAG G E W +Y+R AN Y + W + + +R A G+D++A +P Y Sbjct: 175 SRHHSRAGLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWLPVYP 234 Query: 242 HLN 244 L Sbjct: 235 QLG 237 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 234 bits (597), Expect = 5e-60, Method: Composition-based stats. Identities = 75/346 (21%), Positives = 124/346 (35%), Gaps = 14/346 (4%) Query: 13 LLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMN 72 +L L A AQ +A P + D M A + Sbjct: 33 VLTLQTVAPAFAQGA-PSFSARPAQADRQDAADSAMLRVA-----QTARQLAQRQAAGSR 86 Query: 73 DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWF 132 + G+ K A + + L + V ++ L P+ + V+ + Sbjct: 87 ASARVDGDLLKGQAEAQANELLQEGVRLANQTEL-PFLR-RLQGGVNYDFSNKDLSLDLR 144 Query: 133 V--PLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWL-VGYNTFYDNLLDENLQR 189 + +R QL +++ N GV R A + VG N F D +N R Sbjct: 145 TIDEVHRGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEFGKNHLR 204 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSV 247 G E L N Y P + W ++ +E+R A G+D+ R++ L Sbjct: 205 GSLGGEVIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALPGLAIKG 264 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 ++ G VD F++G N G+ Y PVPLV V + + G Q + L +N Sbjct: 265 QYFRWSGAAVDYFDNGRPQRNARGYKYGVEYRPVPLVAVGLEQTKVLGGARQTTVQLGVN 324 Query: 308 YRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTL 353 G PL +QL + L+ + +R N L+ R++ + Sbjct: 325 LSLGEPLSRQLRHQS-GPAFDLQARMGEFVERENRIVLQTRRKHVV 369 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 220 bits (562), Expect = 7e-56, Method: Composition-based stats. Identities = 138/198 (69%), Positives = 157/198 (79%), Gaps = 8/198 (4%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 +SR V R LLLL A GT A +A +PFD N LPDLGM PE+H+GEKHFA Sbjct: 50 LSRIVFRSFSLSLLLLAASGTIRA------QAQDPFDQNR--LPDLGMMPESHEGEKHFA 101 Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 E+ K F E SM +N LDTGEQA+ FA G+VRD +S+QVNQ +ESWLS WG+ASVD+ VDN Sbjct: 102 EMAKAFSEASMKNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAWGSASVDINVDN 161 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 EGHF GSRGSWF+PLQD RYLTWSQLGLTQQ +GLVSNVG+GQRWA+ WL+GYNTFYD Sbjct: 162 EGHFNGSRGSWFIPLQDKQRYLTWSQLGLTQQTDGLVSNVGIGQRWAQDGWLLGYNTFYD 221 Query: 181 NLLDENLQRAGFGAEAWG 198 NLLDENLQRAGFGAEAWG Sbjct: 222 NLLDENLQRAGFGAEAWG 239 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 211 bits (537), Expect = 5e-53, Method: Composition-based stats. Identities = 69/294 (23%), Positives = 112/294 (38%), Gaps = 13/294 (4%) Query: 77 DTGEQAKAFALGKVRDALSQQVNQHVESWLSPW-GNASVDVKVDNEGHFTGSRGSWFVPL 135 DT AL A Q Q WL G D+ F+ + Sbjct: 47 DTSPGLAQSALDAGVAAGLQASRQTGLPWLRHLDGGLRYDLDPG-RLSFSLRTIDDLM-- 103 Query: 136 QDNDRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDENLQRAGFGA 194 ++R Q GL Q+ +N G+ R A +VG N F D + R G Sbjct: 104 -VSERRALMLQAGLHNQNQRPTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLGL 162 Query: 195 EAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQY 252 EA + L AN+Y P + W + + +E+R A GYDL ++ L+ + ++ Sbjct: 163 EAIAPHYSLYANYYAPLSGWKGARRDSRREERPAAGYDLG--GQLSSDAGLSLQAAYFRW 220 Query: 253 FGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGV 312 G +D+F+SG N G+ Y P L + + G+ Q ++ LN+ Sbjct: 221 HGAGIDVFDSGRAQRNASGFRYGVAYQPGALFNIGLNQTRTLDGQKQTSVQLNVRINLQE 280 Query: 313 PLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKP 366 P +QL +L R+ +R + L R++ +T+ L+ P Sbjct: 281 PPSRQLRRESQPF--NLTSRRHQWVERESRIVLNTRRKA-ITLPLSIAQLRGDP 331 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 61/293 (20%), Positives = 113/293 (38%), Gaps = 25/293 (8%) Query: 75 GLDTGEQAKAFALGKVRDALSQQVNQHVESWLSP-WGNASVDVKVDNEGHFTGSRGS--W 131 D EQ K+ + ++ + S V+ + + LSP + V + S Sbjct: 28 SADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDTVEVSTNLKEG----DSTVDIGV 83 Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR--GNWLVGYNTFYDNLLDENLQR 189 DN ++Q+ L + D N+G G R W+ G N FYD+ + +R Sbjct: 84 LKAFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAFYDHEFPNDHKR 143 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSL 249 G G E L N Y + + + + ++ G D+ ++ +P+ + ++ Sbjct: 144 NGVGFEVVSSVLESRVNSYNGTTGYIKDKSGTDSKVLDGRDMGFKVALPYLPGMMFGMNA 203 Query: 250 EQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYR 309 Q+ G +D + SL N + L V +K + ++ +++ LN + Sbjct: 204 VQWKG--IDGLKDQKMRKYSLGGSLSDN---LSLSYVRTDYKDA-AKKDIDSISLNYTWA 257 Query: 310 FGVPLKKQLSA------GEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVF 356 FG +K + + E + L RYD +R N ++ TLTV Sbjct: 258 FGQ--EKHVRPTLFALSDKAYEFKKLGAERYDLVKRENNLV--KKKSGTLTVT 306 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 185 bits (470), Expect = 3e-45, Method: Composition-based stats. Identities = 76/193 (39%), Positives = 111/193 (57%), Gaps = 8/193 (4%) Query: 40 NDGLPDLGMAPENHDG-EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQV 98 LP+LG A N G EK A + + E + ++N T + +++ LG+ +D + ++ Sbjct: 54 QQALPNLGSASVNESGTEKKLATLARQMAEVNQDEN---TDQTWRSYLLGEAKDRVLDRL 110 Query: 99 NQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY-LTWSQLGLTQQDNGLV 157 Q E+ LSP G +V + VD G F GS G +PL D LT+SQLGL D+G+V Sbjct: 111 QQKSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRGLTYSQLGLQGVDDGVV 170 Query: 158 SNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAG-FGAEAWGEYLRLSANFYQPFAAWHE 216 N+G+ QRW G WL+GYN FYD L+++ R G GAEA +YL LS+N+Y P + H Sbjct: 171 GNMGLRQRWNAGRWLLGYNVFYDQYLNQDASRRGSIGAEARSDYLTLSSNYYYPLSGMHA 230 Query: 217 QTATQEQ--RMAR 227 +++ RMAR Sbjct: 231 ANDDEDELLRMAR 243 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 177 bits (448), Expect = 1e-42, Method: Composition-based stats. Identities = 59/332 (17%), Positives = 126/332 (37%), Gaps = 46/332 (13%) Query: 120 NEGHFTGSRGSWFVPLQ----DNDRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVG 174 ++ F + + L++ QLG+ + + N G G+R + +G Sbjct: 86 SKMQLQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKYAIG 145 Query: 175 YNTFYDNLLDENLQR---AGFGAEAW-GEYLRLSANFYQPFAAWHEQTATQEQR---MAR 227 YNTFY + + + G E W L + N+Y ++ +T+ Q+ Sbjct: 146 YNTFYHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHYPRS 205 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 G+ L + + P + + LEQ+ ++ ++ LSL LNY P+P++ + Sbjct: 206 GHQLYIQTKFPRFFEFTGKIKLEQFIYEK-KYKKIFNKKNSDYYLSLDLNYQPIPMLGFS 264 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS--------------R 333 + N + + Y+FG P+ +Q+ E++S+ + Sbjct: 265 INNIFVNKQYNSTICRVLIAYQFGTPIIEQIHYTN-NENKSILNNLDTIIQPFIPTIIPH 323 Query: 334 YDNP---QRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGD 390 +D N+LP+L+ Q+ PGE +K+ + + + W + Sbjct: 324 HDYISINDHNHLPSLQRTQK-----------ITGYPGEIKIIKINDNNN---KYVRWDLE 369 Query: 391 TQILSLTPGAQANSAEGWTLIMPDWQNGEGAS 422 + + + A + + L P++ + + Sbjct: 370 S-LENHGGNIVAITNNTYALYFPNYPIIQENN 400 >UniRef50_C0B2E7 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E7_9ENTR Length = 815 Score = 153 bits (387), Expect = 1e-35, Method: Composition-based stats. Identities = 33/149 (22%), Positives = 66/149 (44%), Gaps = 2/149 (1%) Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPP 361 +GL+ NY+FG P+ QL + + L S+YD RNN +Y+++ L++ Sbjct: 1 MGLSFNYQFGTPINAQLDPNNIKPLRLLENSKYDFVDRNNNIVFDYQEQSYLSLKTP-DL 59 Query: 362 WDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGA 421 + E + + + S G+ + G ++ L + L +P + EGA Sbjct: 60 IEGYSNEQKTVTISVESSAGLDYIDIDG-SRFLQHGGRIIEQGQNSYLLYLPYYDQQEGA 118 Query: 422 SNHWRLSVVVEDNQGQRVSSNEITLTLVE 450 +N + + D +G+ SS + +++ Sbjct: 119 TNTYNIVATAYDKKGRASSSETTKVVVLK 147 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 133 bits (336), Expect = 1e-29, Method: Composition-based stats. Identities = 66/325 (20%), Positives = 116/325 (35%), Gaps = 26/325 (8%) Query: 39 NNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQV 98 N L +AP A+ +K E + + + + G V + + Sbjct: 1 MNKKQKILLIAPLIVAVSLTQADALKSALEMQDAQDKAEIMDLSTMLLAGDVEALKNTAI 60 Query: 99 NQHVE-------SWLSPWGNASVDVKVDNEGHFTGSRGSWFV-PLQDNDR--YLTWSQLG 148 + VE S+L + +V++ +G S G V PL D D ++Q Sbjct: 61 DGVVEKGVGVTKSFLEQYF-PTVELNFGAQGGSKPSGGLLVVAPLSDPDDIFNTYFTQGS 119 Query: 149 LTQQDNGLVSNVGVGQRWARGNWLV--GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSAN 206 + +DN N+G+G R N ++ G N FYD+ + R G EA ++AN Sbjct: 120 VFYEDNRTTLNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGLEARTTVWEINAN 179 Query: 207 FYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGY 266 Y W E+R GYD+ A + +P+ V Q+ ++ S Sbjct: 180 KYWATTKWKTGKNGLEERALDGYDIEAGVPLPYMNWATVFVKNFQW---DSEISGSKDIK 236 Query: 267 HNPVALSLGLNYTP-VPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLK------KQLS 319 N + L Y P + + + A + +N+ Y Q Sbjct: 237 GNDLQLRA---YIPGITGLEIQAGRTFFSDSSGTDENYINIFYNVTQLFADKPRYNHQWI 293 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPT 344 + + + +S+ RY+ +R N Sbjct: 294 SKDAYKLESMEDRRYEKVRRTNNIV 318 >UniRef50_Q4ACI6 Invasin (Fragment) n=1 Tax=Edwardsiella tarda RepID=Q4ACI6_EDWTA Length = 270 Score = 125 bits (313), Expect = 5e-27, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 62/140 (44%), Gaps = 13/140 (9%) Query: 327 QSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI 386 + + Y+ RNN LEY++++ + + L+ + G + ++S+Y + Q+ Sbjct: 4 RQIAEIPYNLVDRNNDLVLEYKKQEVIKLALSHHAINDLAGAVYTVSANLKSKYALDQVS 63 Query: 387 WQGDTQILSLTPGAQANSAEGWTLIMPDWQ------------NGEGASNHWRLSVVVEDN 434 WQ D +++ ++L++P ++ E A+N ++L V DN Sbjct: 64 WQ-DGGLVAAGGQLTVIDKNHFSLMLPPYRPAQAKSDAHQTSTAEIAANTYQLIAVAFDN 122 Query: 435 QGQRVSSNEITLTLVEPFDA 454 QG + +S + + + P Sbjct: 123 QGNQSNSETLRVVVQPPQVT 142 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 117 bits (294), Expect = 7e-25, Method: Composition-based stats. Identities = 50/244 (20%), Positives = 86/244 (35%), Gaps = 19/244 (7%) Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQD--NDRYLTWSQLGLTQQDN-GLVSNVGVGQ 164 + A + + TGS P+ D ++ + ++Q L D+ N+G G Sbjct: 8 KFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFGN 66 Query: 165 RW--ARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE 222 R LVGYN FYD+ LD + QRA G EA L AN Y + W Sbjct: 67 RKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKSGLNNIN 126 Query: 223 QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVP 282 ++ G D+ M +P+ N + G + + ++L Sbjct: 127 EKALNGSDVELGMPLPYLPWTNLYYRSFNWEGAS----GAADLEGDEISLEA-------K 175 Query: 283 LVTVTAQ-HKQGESGENQNNLGLNLNYRFGVPLKKQLS-AGEVAESQSLRGSRYDNPQRN 340 L + K+ G ++ L + Y ++ + S+ ++ +R Sbjct: 176 LTNFNIEIGKRSNDGVTEDEEFLKITYTCCNNSNNEIGISDTAYNLTSVSDQKFAKVRRQ 235 Query: 341 NLPT 344 NL Sbjct: 236 NLIV 239 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 115 bits (289), Expect = 3e-24, Method: Composition-based stats. Identities = 55/278 (19%), Positives = 97/278 (34%), Gaps = 22/278 (7%) Query: 81 QAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWF----VPLQ 136 A V +V++ + + + G V + ++ G S + Sbjct: 14 IITTVANADVASQALNKVSEKISNLIPGEGITEVSLDYND-GDEDQLNFSILGVRDIETT 72 Query: 137 DNDRYLTWSQLGLTQQD----NGLVSNVGVGQRWARG--NWLVGYNTFYDNLLDENLQRA 190 DN ++Q L Q+ ++ N+G+G R N++ G NTFYD L E R Sbjct: 73 DNSN--FFTQFSLMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRL 130 Query: 191 GFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLE 250 G G EA G L L+AN Y + +EQ + G+D ++P ++ Sbjct: 131 GLGIEAKGSILDLTANSYTKISNSEVVNGDREQ-VLSGWDFNLTSQIPRAPW--ARINYN 187 Query: 251 QYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRF 310 Y + G+ SL L+ T V + +++ +L +N Y Sbjct: 188 GYKWET----EKGSADQKGNIYSLELDVTNSVEVVASLDKSSLNGVDDETSLSINYIYPP 243 Query: 311 GVPLKKQLSA-GEVAESQSLRGSRY-DNPQRNNLPTLE 346 +S + + +R N +E Sbjct: 244 KEKSMVMSDGLSNDMFEKSNMEQKLKEKVRRRNKLVME 281 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 108 bits (270), Expect = 5e-22, Method: Composition-based stats. Identities = 65/318 (20%), Positives = 102/318 (32%), Gaps = 32/318 (10%) Query: 42 GLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQH 101 L L ++ + FA IVK+ G N E ++ DA + ++Q Sbjct: 12 ALSALPLSAQEVAKSGKFATIVKNIGNAL---NIGQGEEAVESEVNTLAVDAANAGLDQV 68 Query: 102 VESWLSPWGNASVDVKV-------DNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 + LS ++ V D T + L++ + ++Q +N Sbjct: 69 EDKVLSTSNFTHFELSVGSDTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNN 128 Query: 155 GLVSNVGVGQRWARGNWLV--GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 N G G R V GYN FYD L +R G G E AN YQ + Sbjct: 129 RTTINTGFGARHINDANTVITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVS 188 Query: 213 AWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 Q + GYD +P++ N L + + Sbjct: 189 KTLTYNGIQ-ETALDGYDAKLTANLPYFYSSNLYGKLSNW---------KDAASYETEHY 238 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAE------- 325 G+N P +T+ Q + N ++NY VPL +V + Sbjct: 239 EAGINAEIAPNLTLRVA-AQHKKNSNNTEAVASINY--SVPLGGANQPAKVKQDGDWSTK 295 Query: 326 SQSLRGSRYDNPQRNNLP 343 + +R Y QR N Sbjct: 296 FEPIREKLYRPVQRENRI 313 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. Identities = 50/327 (15%), Positives = 100/327 (30%), Gaps = 56/327 (17%) Query: 124 FTGSRGSWF-----VPLQDN-DRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYN 176 +F P + L + Q+G+ + G G+R ++GYN Sbjct: 104 IKNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVYNKKHIIGYN 163 Query: 177 TFYDNLLDENLQR---AGFGAEAWGEYLRLSANFYQPFA----AWHE-QTATQEQRMARG 228 Y + + + G E W L+ N Y ++ Q G Sbjct: 164 ACYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNHHYYQYPKIG 223 Query: 229 YDLTARMRMPFYQHLNTSVSLEQYFGD----RVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 Y + A+ P+ + EQ D + +N+ L + L Y P+P+ Sbjct: 224 YQICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVSLEYQPIPMF 280 Query: 285 TVTAQHKQGESGENQNNLGLNLNYRFGVPLKK------------------QLSAGEVAES 326 ++ ++ + LNY+F VPLK+ L++ Sbjct: 281 NLSINNRFIYKKYCNTFFTITLNYQFHVPLKQQLNNVNNNIQQNKIIFNNHLNSILNPFI 340 Query: 327 QSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI 386 S+ + NN + + + PGE ++++ Y + Sbjct: 341 PSIDPY---FIKTNNS--------NEILLTQSNNEIIGYPGERKFIQIE---DYK-TNIQ 385 Query: 387 WQGDTQILSLTPGAQANSAEGWTLIMP 413 W ++ + + + +P Sbjct: 386 WNYES-LKKQGGKIFHIHNNLYEIHLP 411 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 99.7 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 30/171 (17%), Positives = 62/171 (36%), Gaps = 7/171 (4%) Query: 134 PLQ--DNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQR 189 P+ +N + + G QD + +G R ++G N Y + N QR Sbjct: 68 PVYVSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQR 127 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWH-EQTATQEQRMARGYDLTARMRMPFYQHLNTSVS 248 +GAE +++N+Y W E++ GYD+ + +P+ + V Sbjct: 128 ISYGAEIRTSVFEINSNYYHRLTDWKLTGVDNNEEKARGGYDVELALAVPYVPSAHFRVK 187 Query: 249 LEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQ 299 + G + +S + + ++ + ++V + SG Sbjct: 188 HFCWNG--IASNDSNNPIDDLKGNTFSVSGSVYDGLSVEVGYIDYTSGNAD 236 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 96.2 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 72/176 (40%), Gaps = 9/176 (5%) Query: 146 QLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEY--LRL 203 Q + ++ ++ + G+G VG N F+D ++ R G++ L Sbjct: 127 QANIDYKNEDILISNGIGILPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPNYIFNL 186 Query: 204 SANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 S+N Y P + + D+ A + + SLE +FGD + + + Sbjct: 187 SSNIYFPLSGKGSEDDL-----VNSIDIRAEGAI--TPTVQFHSSLEFFFGDDIQINDDY 239 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 +N + GL+YTP+PL+ + + + + + + L NY PL +QL Sbjct: 240 DPTNNSHKFTAGLDYTPIPLLQLGVEATKVQDHDVGYGVYLYFNYDPWRPLNEQLE 295 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 94.6 bits (234), Expect = 7e-18, Method: Composition-based stats. Identities = 57/301 (18%), Positives = 106/301 (35%), Gaps = 36/301 (11%) Query: 73 DNGLDTGEQAKAFALGKVR-------DALSQQVNQHVESWLSPWGNASVDVK--VDNEGH 123 +NG E A R D + + N ++ + + SV++ +++ Sbjct: 67 NNGAKGDEYTGIMADDLNRLLVDAGFDFANAKANGEIQK-IPFFAQTSVNISGGTESDTS 125 Query: 124 FTGSRGSWFVPLQDND----RYLTWSQLGLTQQDN--GLVSNVGVGQRWARGN-WLVGYN 176 F+ + L +D + L +SQ N G N+G+G R + +VG N Sbjct: 126 FSINSLMKLGELAKDDQGDLKTLAFSQARFATATNAEGSTINIGLGIRNRPDDISMVGAN 185 Query: 177 TFYDN---LLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE---QTATQEQRMARGYD 230 F+D + R G G E + + N+Y + + ++R+ G+D Sbjct: 186 AFWDYRMTDYSDAHSRLGLGGEYFWKDFEFRNNWYMAITNEKDVIIKGVDYQERVVPGWD 245 Query: 231 LTARMRMPFYQHLNTSVSLE----QYFGDRVDLFNSGTGYHNPVALSLGLN-YTPVPLVT 285 L R+P L + +Y D L + + P +GL Y + Sbjct: 246 LEVGYRLPNNPELAFYIRGFNWDYKYTQDNSGLEGAVSWQATPH---VGLEAYVSNEISA 302 Query: 286 VTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 + G ++N GL +N G P+K + +++ +R L Sbjct: 303 ASTTANTDLPGTDENFFGLRMNIT-GNPVKF----EKSNYKKNMVTQMTQPVKRKYDVLL 357 Query: 346 E 346 E Sbjct: 358 E 358 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 88.9 bits (219), Expect = 4e-16, Method: Composition-based stats. Identities = 51/287 (17%), Positives = 97/287 (33%), Gaps = 34/287 (11%) Query: 47 GMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWL 106 G+ E+ + +++ + N NG++ Q AL + LS + ++ + Sbjct: 66 GIWHESPLASRVIDKLLIRNWTSLNNKNGIEWSNQISNLALNLASNKLSDYATKTIQKYP 125 Query: 107 SPWGNASVDVKVDNEGHFTGSRGSWFVPLQD-------NDRYLTWSQLGLTQQD-NGLVS 158 G ASV+ + EG T G + D + + + T N Sbjct: 126 FVLG-ASVNFDIRTEGA-TNIGGDVLFKIADFGLKDDESRDGIAFLHTKYTGSLSNDSTW 183 Query: 159 NVGVGQRWARGNWLV-GYNTFYDN---LLDENLQRAGFGAEAWGEYLRLSANFYQPFAAW 214 N G+G R G L+ G N ++D + R G G E + + L L+ N+Y Sbjct: 184 NAGLGLRHLIGEELLAGVNGYWDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIAGTGT 243 Query: 215 H---EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 +R+ G+D R+P ++ ++ ++ Sbjct: 244 KTISTNNTDYYERVVPGWDFELGYRLPSNPNIAFFARGFRWDYRN---------RNDNTG 294 Query: 272 LSLGLNYTPVPLVTVT--------AQHKQGESGENQNNLGLNLNYRF 310 + Y P V + A Q + N++ + LN+ Sbjct: 295 FQGKVTYQMTPHVRLDSWISNEVPANQTQTNGELDNNDITIGLNFTL 341 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 87.7 bits (216), Expect = 9e-16, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 54/157 (34%), Gaps = 11/157 (7%) Query: 64 KDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKV----D 119 G + G + K D + VN + + + +G + ++ V Sbjct: 1 MALGLSLNATAGGKGVSEVLDAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKG 60 Query: 120 NEGHFTGSRGSWFVPLQDNDRYLTWSQLGL----TQQDNGLVSNVGVGQRWARGN--WLV 173 +E ++ PL ++ L W Q L D N+G+G RW +V Sbjct: 61 DEASYSIITTQPLAPLSEDGSRLFW-QGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIV 119 Query: 174 GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 G N+FYD +R G E LS N Y Sbjct: 120 GINSFYDYEFSAKHKRMSLGGEYKRSNAELSVNKYWG 156 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 87.3 bits (215), Expect = 1e-15, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 60/169 (35%), Gaps = 7/169 (4%) Query: 75 GLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNAS---VDVKVDNEGHFTGSRGSW 131 G+ + + S V S L +++ V T S + Sbjct: 28 GISSASSLENRVTSYFNGLASSLGTS-VSSLLGENSRVKYLDLNLGVQEHFKPTISLTNV 86 Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRW--ARGNWLVGYNTFYDNLLDENLQR 189 + + + ++Q L +N N+G+G R + G N F+D D++ QR Sbjct: 87 NM-ISEYGNSAIFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQR 145 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMP 238 G G E L +N Y + + ++++ G+D+ +P Sbjct: 146 NGAGLEVLSSVFDLRSNIYDATSGIEAVSTSRDEEAMDGWDMRLDYHLP 194 >UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN04_9BACT Length = 301 Score = 84.6 bits (208), Expect = 7e-15, Method: Composition-based stats. Identities = 50/288 (17%), Positives = 89/288 (30%), Gaps = 45/288 (15%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKV---------DNEGHFTGSRG 129 +A + + +ESW A ++ EG R Sbjct: 22 ASKAVNQIKDSAINKAFSYGDSAIESW------ARDNLTSLRLIEIETRSREGAKPTFRA 75 Query: 130 SWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNW--LVGYNTFYDNLLDENL 187 + ND SQL + D+ N G+ R + + G N FYD+ + Sbjct: 76 ISLFEIGGNDFNKILSQLSYSTFDDDETINAGLIYRMMNSDMTVIYGLNIFYDHQFNTGH 135 Query: 188 QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSV 247 R G G E ++ NFY+ H + A GYD ++P+ Sbjct: 136 ARTGLGFEMKSSVYDVNINFYEAQTEIH-HVDGVPEVAAGGYDAEIGAQVPYLPWAKVYY 194 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 Q+ + +++ + +L L P P ++ + + L LN Sbjct: 195 KAYQWNNETLNIKDGE---------TLSLYMMPTP--RLSVEFGTQDDSTMSTKSFLKLN 243 Query: 308 YRF-------GVPL----KKQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 Y PL + + G++ + Y+ +R N Sbjct: 244 YVLCCGETTKSAPLFTVSNQAFNYGKIDNQRM-----YEKVRRENNII 286 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 81.6 bits (200), Expect = 7e-14, Method: Composition-based stats. Identities = 48/234 (20%), Positives = 84/234 (35%), Gaps = 21/234 (8%) Query: 57 KHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDV 116 + A +G + +N NG+D G + + + N+ ++ + + ++ + Sbjct: 109 QKGANYAALYGPSMVNSNGVDLGGLIQTELSRTLISSGVSYANKQIKK-IPFFAQTTLGL 167 Query: 117 KVDNEGHFTGSRGSWFVPL----QDN---DRYLTWSQLGLTQQDN-GLVSNVGVGQRW-A 167 TG F+ L DN L + Q +T + + NVG+G R+ Sbjct: 168 DAATSSDLTGY-LDSFMRLKTIGYDNEGDPMGLMFGQARVTLETSAQPQVNVGLGSRFRL 226 Query: 168 RGNWLVGYNTFYD---NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE---QTATQ 221 +VG N F+D R G GAE + + L N+Y +A Sbjct: 227 GDEAIVGLNGFWDLRTTNYSTAYTRWGIGAEGFWKSFELRNNWYINGSADKNITINNIDY 286 Query: 222 EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFG----DRVDLFNSGTGYHNPVA 271 +R+ G+D+ R+P Y L V + D + S P A Sbjct: 287 VERVVPGWDVEVGYRIPSYPQLAIFVRGFNWDYQDHSDNSGIEGSVNWQATPHA 340 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 81.2 bits (199), Expect = 9e-14, Method: Composition-based stats. Identities = 55/322 (17%), Positives = 111/322 (34%), Gaps = 66/322 (20%) Query: 82 AKAFALGKVRDALSQQVNQHVESWLSP--WGNASVDVKVDNEGHFTGSRGSWFVPLQDND 139 +K+F + D L++ V + + +LS +G V + D + + + L +D Sbjct: 117 SKSFIVSFAHDYLNEYVLKQI-PFLSQTEFG---VGFESDADMTYYLNSLISLAQLGSDD 172 Query: 140 RY----LTWSQLGLT--QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQ---RA 190 L ++Q + + + +R R N ++G N F+D R Sbjct: 173 NGYPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLRDNAMLGANAFWDYRFTNYSSSYSRW 232 Query: 191 GFGAEAWGEYLRLSANFYQPFAAWHE-------------------------QTATQEQRM 225 G GAE W + +L+ N+Y T ++R+ Sbjct: 233 GAGAELWWDDFKLTNNWYIAGTGIKRITTSGRAYTDTTSLAAGTYDETTLLGANTFDERV 292 Query: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVT 285 G+D+ R+P Y L+ + ++ + + +N+ P Sbjct: 293 VPGWDVALNYRLPSYPQLSLGIRGFRW---------DYMRKSDNSGVEGSVNWQATPHTN 343 Query: 286 VTA----------QHKQGE-SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRY 334 ++A + S + +G+ N + P+ + + + E +L Sbjct: 344 LSAWISSEIPAYPAQSNAQLSSGDDVYVGVRFNVQL-KPVTYKTGSNRIRE--NLLTQMR 400 Query: 335 DNPQRNNLPTLEY---RQRKTL 353 QR N LE +Q+KT+ Sbjct: 401 QPVQRRNDVLLERWKPKQKKTI 422 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 76.9 bits (188), Expect = 2e-12, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 55/158 (34%), Gaps = 13/158 (8%) Query: 137 DNDRYLTWSQLGLTQQ-DNGLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQRAGFG 193 + R++ ++Q L D G +NVG+G R + G N FYD+ N R G Sbjct: 144 ETSRHVWFTQERLANAADTGTTANVGIGYRRIAENDDHYYGGNLFYDHRFRGNHGRMSVG 203 Query: 194 AEAWGEYLRLSANFYQPFAAWHEQT-ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQY 252 E N+Y+ + AT+ + ++ GY + V +E Y Sbjct: 204 LEYVSGIGAFRMNWYRGVSGERSLDGATRMENVSNGYTAEYGTSFKNARW--ARVYMEAY 261 Query: 253 FGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQH 290 + L +G P ++V + Sbjct: 262 RWQL-------RRSADKHGLRIGTELQLTPRISVDMGY 292 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 74.2 bits (181), Expect = 1e-11, Method: Composition-based stats. Identities = 39/218 (17%), Positives = 72/218 (33%), Gaps = 18/218 (8%) Query: 136 QDNDRY-LTWSQLGL-TQQDNGLVSNVGVGQRWAR--GNWLVGYNTFYDNLLDENLQRAG 191 DN + ++Q + D G N+GVG R L G + FYD+ R Sbjct: 132 YDNSSRDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHLFYDHRFLNRHNRLS 191 Query: 192 FGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSL 249 G E N+Y + + +R+A GY + + Sbjct: 192 AGLEYMSGESEFRFNWYGSASDERVLDVNLHTLERVANGYTV----------EYGKTFKN 241 Query: 250 EQYFGDRVDLFN-SGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNY 308 ++ V+ ++ + + L +G P V+V + + E + Sbjct: 242 ARWARVYVEGYHWNQERQADKNGLRVGSELQLTPRVSVDMGYNKPEHSSGGVYGKITFRL 301 Query: 309 RFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLE 346 G P+ + ES ++R + +R N +E Sbjct: 302 A-GAPMAWYGGKHRLEESATVRDKMLNLVRRTNTIFVE 338 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 65.0 bits (157), Expect = 7e-09, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 33/87 (37%), Gaps = 9/87 (10%) Query: 44 PDLGMAPENHDG--EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQH 101 +L P N G E+ A + G D EQA A G S Q + Sbjct: 166 KNLTPPPGNSSGNLEQQIASTSQLIGSLLAEDMN---SEQAANIARGWA----SSQASGV 218 Query: 102 VESWLSPWGNASVDVKVDNEGHFTGSR 128 + WLS +G A + + VD + SR Sbjct: 219 MTDWLSRFGTARITLGVDEDFSLKNSR 245 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 61.1 bits (147), Expect = 1e-07, Method: Composition-based stats. Identities = 47/241 (19%), Positives = 84/241 (34%), Gaps = 53/241 (21%) Query: 151 QQDN-GLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANF 207 Q D+ G+V ++G G R + VG NTFYD + L R G G E ++SAN Sbjct: 173 QHDSLGVVGSIGAGYRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANV 232 Query: 208 YQPFAAWHEQTATQE------------------------------QRMARGYDLTARMRM 237 Y + + E + + GY++ Sbjct: 233 YHGLSEKKTKPYYFENSLVIVPRADEFHYPEDGYPNGFTKIRYAYENVLDGYNVRYTRDY 292 Query: 238 PFYQHLNTSVSLEQYFGDR-----VDLFN-SGTGYHNPVALSLG--LNYTPVPLVTVTAQ 289 + ++T V + VD+F + + + L LG LN TP ++ Sbjct: 293 KNARWISTYVEGYHWKTKSPSEHPVDMFYLNQHKWKSISGLKLGATLNITP----HISID 348 Query: 290 HKQGESGENQNNLGLNLNYRFGVP----LKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 ++ + +++ Y G L + S V+ ++S D +R + + Sbjct: 349 LGFNKNNISSGEPYVSVMYTLGKSRYAYLGGKHSEDTVSTARSKM---LDKVKR-HDMVV 404 Query: 346 E 346 E Sbjct: 405 E 405 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 59.6 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 46/134 (34%), Gaps = 10/134 (7%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 + + F ++ + + + + + L EN++G + A+ Sbjct: 12 LRKKKIFSYFIIASQFSFPIALSLTPTIQSYAATVEENK--LSTNTENNNG-RWLAQQTS 68 Query: 65 DFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 G +DN D Q + + + +VN+ +E+W + +G A +++ VD Sbjct: 69 QLGTILSSDNTHDAASQ-------YLINQANSKVNREIENWFNQYGKAQINLGVDKHFTL 121 Query: 125 TGSRGSWFVPLQDN 138 + Sbjct: 122 KTQKLKSLFLFTKQ 135 >UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Campylobacter RepID=Q4HGX9_CAMCO Length = 267 Score = 59.2 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 76/227 (33%), Gaps = 39/227 (17%) Query: 118 VDNEGHFTGSRGSW--FVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGY 175 D F L + + Q + + G+ R+ + ++L+G Sbjct: 73 TDGNLDFQNENVQIKNLNSLYEGENNSLLFQKEFYATQDSYNYSGGLINRYEKDDFLLGI 132 Query: 176 NTFYDNLLDENLQRAGFGAEA-WGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTAR 234 N F D ++ + FGAE + ++++ +N+Y P ++ + Sbjct: 133 NGFIDGQKEQKESK-SFGAELGYYQFVKAYSNYYVP--------NEADENL----QFGVS 179 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQ-- 292 +P Y +S + ++ ++Y+P ++++ + Sbjct: 180 FTIPSYSAFIFDIS------------------KDSEKINYQVSYSPYSVLSLKILRRDFS 221 Query: 293 GESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQR 339 + L + ++ F KQL + A +RYD QR Sbjct: 222 ANEAIDDTVLQVGFSFNFNESFVKQLRKKDNALQ---EVNRYDFLQR 265 >UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N7C0_9GAMM Length = 546 Score = 59.2 bits (142), Expect = 3e-07, Method: Composition-based stats. Identities = 30/134 (22%), Positives = 50/134 (37%), Gaps = 20/134 (14%) Query: 127 SRGSWFVPLQDNDRYLTWSQLGLT-QQDNGLVSNVGVGQRWARGN-WLVGYNTFYDN--- 181 + +PL N+ L ++ + D+ N+G+ R N W +G ++D Sbjct: 48 AEADLLIPLWQNNDSLLFANIRGRLDNDDSYEGNIGLALRHMLDNGWNLGGYGYFDRRKS 107 Query: 182 LLDENLQRAGFGAEAWGEYLRLSANFYQPF--AAWHEQTATQ-------------EQRMA 226 D + G EA L AN Y P +++ E + E+R Sbjct: 108 PYDNFFNQVTLGVEALSLNWDLRANTYIPVGESSYAEDSLDTVDFSGTTITYRAGEERSM 167 Query: 227 RGYDLTARMRMPFY 240 RGYD R+P + Sbjct: 168 RGYDAEVGWRIPVF 181 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 56.9 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 56/187 (29%), Gaps = 30/187 (16%) Query: 155 GLVSNVGVGQRWAR--GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 G V+NVG+G R + VG NTFYD+ + R G E + AN Y+ Sbjct: 141 GTVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLN 200 Query: 213 AWHEQTAT----------------------QEQRMARGYDLTARMRMPFYQHLNTSVSLE 250 + + + Q+ GYD++ + V Sbjct: 201 STKSEPYNVPLYEGYFEFLLDGGPAGYTVYKSQKALSGYDVSYARTFKNARWARAYVGAY 260 Query: 251 QYFGDRVDLFNSGTG----YHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNL 306 + G V G G P V++ + ++ + G + Sbjct: 261 HWNGLGVKTHGEGPALALNVGKSHGWQAGTTLQLTPHVSLDVGYT-SDNNHSSGAYGF-V 318 Query: 307 NYRFGVP 313 Y G Sbjct: 319 KYTLGTS 325 >UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MK14_SALAR Length = 110 Score = 56.9 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 2/40 (5%) Query: 258 DLFNSG--TGYHNPVALSLGLNYTPVPLVTVTAQHKQGES 295 +F G NP A++LGLNY PVPLVT+ + G++ Sbjct: 3 GIFGDGEADRQRNPHAIALGLNYPPVPLVTIGVNQRMGQN 42 >UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RBA5_9CHLA Length = 306 Score = 55.7 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 34/92 (36%), Gaps = 7/92 (7%) Query: 127 SRGSWFVPLQDNDRYLTWSQ--LGLTQQDNGLVSNVGVGQRWARG--NWLVGYNTFYDNL 182 S G + +PL D++ L + L +NVGVG R A N G N FYD Sbjct: 73 SFGIFTIPLLDSNGQLFFDARIHNLRH--ERWAANVGVGTRIAIPCTNLFFGINFFYDYR 130 Query: 183 LDEN-LQRAGFGAEAWGEYLRLSANFYQPFAA 213 + + G G E N Y P Sbjct: 131 RTRHDYHQLGPGLELIHPCWAFRINGYFPICD 162 >UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T1E3_JANMA Length = 553 Score = 55.4 bits (132), Expect = 4e-06, Method: Composition-based stats. Identities = 39/211 (18%), Positives = 80/211 (37%), Gaps = 29/211 (13%) Query: 128 RGSWFVPLQDNDRYLTWSQLGLTQQDNG-LVSNVGVGQRWARG-NWLVGYNTFYDNLLDE 185 + F+P+ + R L ++ + + G ++G G R W +G F D Sbjct: 52 EANLFIPVVQDARSLYFANVRARMANGGDFEGSLGGGMRHMLETGWNLGAYGFVDRRRTT 111 Query: 186 NLQ---RAGFGAEAWGEYLRLSANFYQP-------FAAWHEQTAT----------QEQRM 225 +A G EA G AN YQP ++ + + + QE+R Sbjct: 112 YNNSYDQATLGVEALGRQFDWRANVYQPFGKKSTTLSSSNTGSVSGGSLFVTTTAQEERA 171 Query: 226 ARGYDLTARMRMPFY-----QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTP 280 G+D+ A R+P + + + ++ ++ D + + + +A + Sbjct: 172 LPGFDIEAGWRLPVFDEEDTRQVRAYLAGYRFSDDGLKVQGTRVRAEYVMA-EFSDTWKG 230 Query: 281 VPLVTVTAQHKQGESGENQNNLGLNLNYRFG 311 L T+ A+++ + +Q+ + L L G Sbjct: 231 AQL-TIGAEYQDDNARGSQSFVALRLRIPLG 260 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 55.4 bits (132), Expect = 5e-06, Method: Composition-based stats. Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 3/58 (5%) Query: 158 SNVGVGQRWAR-GNWLVGYNTFYDNL-LDE-NLQRAGFGAEAWGEYLRLSANFYQPFA 212 +VG+G R W+VG N +YD + +L + G G E G+ + N Y P Sbjct: 131 GSVGIGLRHFSYNGWMVGLNGYYDYRRFNGWDLNQLGLGVELLGDCVEFRVNGYLPVN 188 >UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLE3_PLALI Length = 1304 Score = 53.4 bits (127), Expect = 2e-05, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 5/84 (5%) Query: 132 FVPLQDNDRYLTWSQL-GLTQQDNGLVSNVGVGQRWARGNW--LVGYNTFYDNLLDENL- 187 +P + ++ + L G + +NVG G R+ N+ ++G N ++D Sbjct: 95 LMPYGFIENFMLFGDLRGFRSNSDRYGANVGGGARYYLENYDRIIGANAYFDYDETSGAP 154 Query: 188 -QRAGFGAEAWGEYLRLSANFYQP 210 + GFG E G Y N Y P Sbjct: 155 FRDVGFGIETLGRYWDARVNAYFP 178 >UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6INS3_RHOCS Length = 922 Score = 52.3 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 41/243 (16%), Positives = 79/243 (32%), Gaps = 51/243 (20%) Query: 128 RGSWFVPLQDNDRYLTWSQLGLTQQD-NGLVSNVGVGQRWARGNWLVGYNTFYDN---LL 183 + +PL D+D T+ L + D + V+N+G+G R+ G ++G +YD L Sbjct: 30 SIAVAIPLADSDAARTFLDLRGSIDDADRRVANIGIGHRFRLGAVVLGGAVYYDRVRTDL 89 Query: 184 DENLQRAGFGAEAWGEYLRLSANFYQP----------------FAAWHEQTATQEQR--M 225 + + +A + L L AN+Y P + H + + R Sbjct: 90 ESDFSQATVSLDLMTADLDLRANYYAPLDDEESVGTTVAGAPRLSGNHIVRSIFQPREVT 149 Query: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGL------NYT 279 +G+D R+ + G V F G Y + A ++ ++ Sbjct: 150 LKGFDAEVGYRLGAIE------------GYDVRAFAGGYRYTDDEAPTVDGVKGRLEAWS 197 Query: 280 PVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDN-PQ 338 + + + ++ R G+ + SR D Sbjct: 198 QDGRFSFGIEVRD--DDQDDTQAFATFRMRLGL--------FSEPARREGTASRLDWPVL 247 Query: 339 RNN 341 R + Sbjct: 248 RES 250 >UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA50_9CHLA Length = 531 Score = 51.5 bits (122), Expect = 6e-05, Method: Composition-based stats. Identities = 23/88 (26%), Positives = 33/88 (37%), Gaps = 5/88 (5%) Query: 131 WFVPLQDNDRYLTWSQLGLTQ-QDNGLVSNVGVGQRW--ARGNWLVGYNTFYDNLLDENL 187 F PL D Y + L ++ +NVG G RW ++ G N +YD Sbjct: 294 LFAPLVPYDDYYPFLDLRAHYIKNKRWAANVGGGLRWRDCMTGFIFGANLYYDYRNTTQT 353 Query: 188 --QRAGFGAEAWGEYLRLSANFYQPFAA 213 + GFG E + + N Y P Sbjct: 354 DFNQFGFGLEFFTNCFEMRLNAYFPVGD 381 >UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C087_9PLAN Length = 849 Score = 50.3 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 59/195 (30%), Gaps = 22/195 (11%) Query: 125 TGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL-VSNVGVGQRW-ARGNWLVGYNTFYD-- 180 +G F+PL ++ L ++ L D+ N G+ R W+ G FYD Sbjct: 54 DNGQGLLFIPLAQDEESLFFADLRGNIFDDSSAEGNFGLAYRRMVNDQWIAGMYGFYDVR 113 Query: 181 -NLLDENLQRAGFGAEAWGEYLRLSANFYQP--------------FAAWHEQTATQEQRM 225 + ++ FG E N Y P + + E+R Sbjct: 114 RSQYSNIFRQGSFGFELLSIEWDFRVNGYVPSQKQQRVDSLNTAYLSGNNIVMRAGEERA 173 Query: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTG-YHNPVALSLGLNYTPVPLV 284 G D + + N L Y G F++ + + Y L Sbjct: 174 YWGTDFEVGRLLKSFPESNLDAELRGYVGG--YYFDNSAPGFKEMTGPRARVEYRMFDLP 231 Query: 285 TVTAQHKQGESGENQ 299 + + +G+ Q Sbjct: 232 WLGNGSRVVLAGQYQ 246 >UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BR71_9GAMM Length = 851 Score = 50.3 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 46/271 (16%), Positives = 77/271 (28%), Gaps = 47/271 (17%) Query: 103 ESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL-VSNVG 161 + W PW + V + DN + +P+ D L +++L D G N+ Sbjct: 29 DKW-DPWLESGVSIGTDNSSR---GEAALLLPIYQTDSGLLFTELRGKLFDAGSKEGNLA 84 Query: 162 VGQRW-ARGNWLVGYNTFYDNLLDENLQRA---GFGAEAWGEYLRLSANFYQPFAAWHEQ 217 +G R W +G D E R +G EA N Y ++ Sbjct: 85 LGYRKMINNRWAIGMWVGRDIRTSEYGNRFHQEAWGLEALHPNWDFRINAYNALSSAQAY 144 Query: 218 TATQE--------------QRMARGYDLTARMRMPFYQHLNTSVSLEQ--YFGDRVDLFN 261 E + GYD R SV +Q + F+ Sbjct: 145 PQPVEAELIGNQLFITSAAEVPLSGYDFELGHRF--------SVLSDQDIWLYAGAFSFD 196 Query: 262 SGTGYHNPVALSLGLNYT--------PVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVP 313 L + P +T A + + +++ GL L G Sbjct: 197 DELVSTPVEGPKLRAEWRWNNILNDIPGSSLTAEAGYSHDKVRDDKWEAGLKLTIPLGGK 256 Query: 314 LKKQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 K+ + E + L +R+ Sbjct: 257 AKRSI-PLSALEKRLLSA-----VERDTDIV 281 >UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4TV20_9PROT Length = 732 Score = 49.6 bits (117), Expect = 3e-04, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 43/136 (31%), Gaps = 26/136 (19%) Query: 128 RGSWFVPLQDNDRYLTWSQL--GLTQQDNGLVSNVGVGQRWARG-NWLVGYNTFYDNLLD 184 + F+P+ +D L + L + N G+G R + W +G FYD Sbjct: 50 EVNLFLPIAQDDSNLLFLDLRTSFDNLEQR-EGNFGLGYRAMQDSGWNLGAYAFYDRRRS 108 Query: 185 ---ENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE-----------------QR 224 + G EA G+ N Y P + ++ +R Sbjct: 109 SEGHYFSQITTGLEALGQDFDARINAYLPIG--RKSYEVEDSARVDLSGGSIQILSGLER 166 Query: 225 MARGYDLTARMRMPFY 240 G D R+P + Sbjct: 167 AYHGGDAELGWRLPVF 182 >UniRef50_UPI0000E0F7DB beta-glycosidase-like protein n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F7DB Length = 744 Score = 48.4 bits (114), Expect = 5e-04, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 55/167 (32%), Gaps = 24/167 (14%) Query: 298 NQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFL 357 + +N +R P +++ E + + S N P V L Sbjct: 557 SDSNAPYIFTWRPSTPGTHEITVKAFKEDGTEKTSATTLVSLNGEPL-------VFDVSL 609 Query: 358 ATP-PWDLKPGETVPLKLQIRSRYG-IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDW 415 A P ++ GE++ L + S YG + ++ + + + ++ P + Sbjct: 610 AQPSTSEMTAGESLTLDASVSSNYGKVSKVDFHVNGNFV-------------FSSTTPPY 656 Query: 416 QN--GEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDEL 460 + + L V + G S IT+T+ P + + Sbjct: 657 TFAWQPATAGEYTLDAVAAKSDGSIKQSESITITVNAPAPTPTPVQP 703 >UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A8_9CHLA Length = 225 Score = 48.4 bits (114), Expect = 6e-04, Method: Composition-based stats. Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 4/69 (5%) Query: 148 GLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDENL---QRAGFGAEAWGEYLRL 203 G D ++ G+G R ++G NT+YD L + G G E + + Sbjct: 14 GYRFNDGKWGASTGIGIRKELSDGCVLGLNTYYDYLRGRGRFSFHQVGVGFEMLSDCFDV 73 Query: 204 SANFYQPFA 212 N Y P + Sbjct: 74 RINGYLPVS 82 >UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillonella dispar ATCC 17748 RepID=C4FS48_9FIRM Length = 421 Score = 48.4 bits (114), Expect = 6e-04, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 24/61 (39%), Gaps = 2/61 (3%) Query: 155 GLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 G++ +VG+G R + VG N F D N R G E + AN Y+ Sbjct: 168 GIIGSVGIGYRRLSRNEHAYVGVNAFVDRAFTGNYNRISGGVEYVNGLNEVYANVYRGLG 227 Query: 213 A 213 Sbjct: 228 D 228 >UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174607D Length = 975 Score = 48.0 bits (113), Expect = 8e-04, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 28/79 (35%), Gaps = 23/79 (29%) Query: 158 SNVGVGQRWARG--------------------NWLVGYNTF---YDNLLDENLQRAGFGA 194 +++G+G R G + VG N F D + + G G Sbjct: 144 ASLGLGWRHLFGSQPVSALTRKDAPQASFLEEGFFVGANLFIDMLDTEANNQFWQLGVGI 203 Query: 195 EAWGEYLRLSANFYQPFAA 213 EA YL + N+Y P + Sbjct: 204 EAGTRYLEVRGNYYIPLSD 222 >UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN12_PLALI Length = 2615 Score = 46.5 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 26/86 (30%), Positives = 38/86 (44%), Gaps = 6/86 (6%) Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVS-NVGVGQRWARG--NWLVGYNTFYDNL---LDE 185 PL ++ ++L Q L D + NVG+ R + + G N +YDN Sbjct: 70 LTPLLNDGQFLIAPQARLLITDTSKIGVNVGLIGRVYDAGRDRIWGANVYYDNDETTYSN 129 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPF 211 + GFG E+ G+ L L AN Y P Sbjct: 130 RYSQIGFGFESLGQNLDLRANAYLPT 155 >UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8X2_9PLAN Length = 1606 Score = 46.5 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 43/265 (16%), Positives = 73/265 (27%), Gaps = 41/265 (15%) Query: 127 SRGSWFVPLQDNDR-YLTWSQLGLTQQDNGLVS-NVGVGQRWARGNW--LVGYNTFYDNL 182 S +P N + + L D G N+G G R N + +YD Sbjct: 142 SNLGVLMPFTINPEQSMLFLDLRAMVTDQGAGGVNLGAGWRAYNDNLDKIFTVAGWYDYD 201 Query: 183 LDE--NLQRAGFGAEAWGEYLRLSANFYQPF-------------AAWHEQTATQEQRMAR 227 + + G E G+YL N Y P +A+ + R R Sbjct: 202 DGHYQDYHQLGLSGEVIGQYLTTRVNGYFPINNNEIIISNNLSGSAYFQTDRIYLNRTRR 261 Query: 228 ------GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPV 281 G D +P + Y+ + + Sbjct: 262 SESSYGGVDAEVGGPLPVLGKFGIDGYVGGYYY-------NSDHDKSAAGAKFRAEANIN 314 Query: 282 PLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNN 341 ++ + + + + + L+ G K + ++L+ Y RN Sbjct: 315 DWWQMSVSYAKDSVFGSNAWMNVTLSIPEGRS-DKWMRP------KTLQQRMYQPMNRNY 367 Query: 342 LPTLEYRQRKTLTVFLATPPWDLKP 366 +Q T T LA P D P Sbjct: 368 RVVANVKQ--TTTNELAINPDDGLP 390 >UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746965 Length = 1076 Score = 46.1 bits (108), Expect = 0.003, Method: Composition-based stats. Identities = 17/79 (21%), Positives = 25/79 (31%), Gaps = 23/79 (29%) Query: 158 SNVGVGQRW--------------------ARGNWLVGYNTF---YDNLLDENLQRAGFGA 194 +++G+G R+ VG N F D D + G G Sbjct: 103 ASLGLGYRYLFGAQPISALTRKDAPQAGFFEEGVFVGTNVFIDMLDTEADNQFWQLGVGV 162 Query: 195 EAWGEYLRLSANFYQPFAA 213 E YL N+Y P + Sbjct: 163 EFGNRYLEFRGNYYIPLSD 181 >UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root RepID=B0C4D7_ACAM1 Length = 3597 Score = 45.7 bits (107), Expect = 0.004, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 35/90 (38%), Gaps = 9/90 (10%) Query: 133 VPLQDNDRYLTWSQLGLTQ-QDNGLVSNVGVGQRWARGN-----WLVGYNTFYDNLLDEN 186 +P +D+ ++ + + + N+G+ R W++G + FYD+ EN Sbjct: 245 LPFWQDDQSFAFADVHFEGGSNETFLGNLGLAYRRILNTSNENPWILGTHAFYDSKRSEN 304 Query: 187 ---LQRAGFGAEAWGEYLRLSANFYQPFAA 213 + GAE + N Y P + Sbjct: 305 GFQYHQGSLGAELVNKKFEFRVNGYLPGSN 334 >UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFX4_PLALI Length = 1567 Score = 45.7 bits (107), Expect = 0.004, Method: Composition-based stats. Identities = 51/273 (18%), Positives = 83/273 (30%), Gaps = 45/273 (16%) Query: 132 FVPLQDNDRYLTWSQL-GLTQQDNGLVSNVGVGQRWARGNW--LVGYNTFYDNLLDENL- 187 F+P ++ L ++ + GL +NVGVG R + G + +YD Sbjct: 113 FLPFFRDENSLIFTDIRGLMTNGGKGGANVGVGYRQFVPELDRIFGVSGWYDFDNGHREA 172 Query: 188 -QRAGFGAEAWGEYLRLSANFYQPF-------------AAWHEQTATQE-----QRMARG 228 + G E+ G YL N Y P A + +G Sbjct: 173 FNQFGVSFESIGRYLDWRVNGYLPVEDNEEISNQILGAAGFQNNFILLNRGRSVDSAYKG 232 Query: 229 YDLTARMRMPFYQHLNTSVSLEQYFGDR--VDLF-NSGTGYHNPVALSLGLNYTPVPLVT 285 +D P S + Y+ V F + V L +N VT Sbjct: 233 FDTEIGGPFPILGRYGMSGYVGMYYYANTDVGSFTGVSGRFQQRVNEDLTIN------VT 286 Query: 286 VTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 VT H G + + Q + + A + +R D RN T+ Sbjct: 287 VTDDHTFGTNAQIQVIADIPNGF-----------PSRWAREKRVRDRLRDPVMRNYRVTV 335 Query: 346 EYRQRKTLTVFLATPPWDLKPGETVPLKLQIRS 378 ++R ++ A P D P + + Sbjct: 336 --QERLLVSQEFAIDPEDGNPYFVSHINPNLAG 366 >UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDV5_NEOSM Length = 696 Score = 45.3 bits (106), Expect = 0.005, Method: Composition-based stats. Identities = 53/350 (15%), Positives = 99/350 (28%), Gaps = 68/350 (19%) Query: 25 QSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKA 84 + Q P D+ D L A ++ A + +++N + Sbjct: 86 EGNAIQFNCTPHDSRGDSLQSAIQAGKSQGRVSELARNLPQAERSTLN------AYRVNV 139 Query: 85 FALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTW 144 FA KV +N + + + N G + S +PL L + Sbjct: 140 FAPEKVVTQ--SDLNNTSRHTVGARFTVTNEFSDSNGGAVSMSEFGALLPLLSKVDNLIY 197 Query: 145 SQLGLTQQD-NGLVSNVGVGQRWARGNWLVGY-NTFYD-NLLDENLQR-AGFGAEAWGEY 200 L D + G+ R L G N F D L E R G E + + Sbjct: 198 IDLKSKLYDAKEGEVSTGIVFRRQMSPLLTGGINVFTDVRFLPEGNYRWYSLGGEIFFKS 257 Query: 201 LRLSANFY---------------QPFAAWHEQTATQEQRMA-RGYDLTARMRMPFYQHLN 244 L+ N+Y + ++R A GYDL + + Y +++ Sbjct: 258 FSLNGNYYRSNKKTTISSVKSFEFHDPDPGKAVIVLDERAAGNGYDLGLGLTLNKYINIH 317 Query: 245 TSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQ---------GES 295 S + + F G++ +++ + +S Sbjct: 318 GSAFFFYSPYNTEEKF---------SGYRAGVD------LSLYLNERFSVLVSPEFVADS 362 Query: 296 GENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 N+ + + N G ++ L +R+ L Sbjct: 363 KRNRFLVNVGFNLPVGRDY-----------TRLLGH-----VRRDRDIVL 396 >UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSX1_9GAMM Length = 808 Score = 45.0 bits (105), Expect = 0.007, Method: Composition-based stats. Identities = 19/88 (21%), Positives = 33/88 (37%), Gaps = 6/88 (6%) Query: 132 FVPLQDNDRYLTWSQLGLTQQD-NGLVSNVGVGQRWARGNWLVGYNTFYDNLLD-----E 185 +P + + L ++ L + D + N+G G R N Y + L Sbjct: 50 LIPFYQDGKRLGYADLRYSSSDVDTDEINLGAGFRSLNENETAIYGFYGSYDLRKSATER 109 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPFAA 213 + ++ FGAE + +NFY P Sbjct: 110 DYRQLTFGAELLTDTWDYRSNFYFPTGD 137 >UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VI48_9CYAN Length = 908 Score = 44.2 bits (103), Expect = 0.011, Method: Composition-based stats. Identities = 29/111 (26%), Positives = 41/111 (36%), Gaps = 8/111 (7%) Query: 118 VDNEGHFTG-SRGSWFVPLQDND-RYLTW--SQLGLTQQDNGLVSNVGVGQRWARGNW-- 171 + G F G +R FVPL + LT+ +L L D L N+ +G R N Sbjct: 46 TSSGGGFDGFTRLEGFVPLLQTPGKNLTFLEGRLFLDNDDANLGGNLILGYRTYSANSHR 105 Query: 172 LVGYNTFYDNLLDENLQ--RAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT 220 + G YDN + + G G E+ G N Y P + Sbjct: 106 IWGGYMSYDNRHTGHNTFNQLGLGIESLGTVWDFRVNGYLPIGDTRQGVGD 156 >UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickettsiella grylli RepID=A8PQA2_9COXI Length = 642 Score = 41.9 bits (97), Expect = 0.049, Method: Composition-based stats. Identities = 38/199 (19%), Positives = 58/199 (29%), Gaps = 43/199 (21%) Query: 123 HFTGSRGSWFVPLQDNDRYLTWSQLGLTQ-QDNGLVSNVGVGQRWARGNWLV-GYNTFYD 180 +T + PL + + L+ DN +VG+G RW + G F Sbjct: 43 DYTVGQADAMFPLSGDMSRNLYVDPALSYGTDNQNQFDVGLGYRWITNQAAIVGGYFFGG 102 Query: 181 NLLDENLQRAGF---GAEAWGEYLRLSANFYQPFAAWHEQTATQ---------------- 221 +N R G EA+G N Y P H T+ Sbjct: 103 YSRVDNNARLWIANPGIEAFGSRWDAHLNAYIPMGDRHYTAGTEIVHFFTGHSEFGRVFL 162 Query: 222 -EQRMARGYDLTA----------RMRMPFY---QH-----LNTSVSLEQYFGDRVDLFNS 262 Q G D+ A + + Y + LE + V L S Sbjct: 163 MHQYAGSGADIKAGYQLFPHSSLKGYLGSYYFSPAETNNVWGGAAGLEYWLTQGVKLIGS 222 Query: 263 ---GTGYHNPVALSLGLNY 278 +H+ A +GL + Sbjct: 223 YSYDNLHHSTYAFGIGLEW 241 >UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPY3_9CYAN Length = 1370 Score = 41.9 bits (97), Expect = 0.056, Method: Composition-based stats. Identities = 23/95 (24%), Positives = 36/95 (37%), Gaps = 6/95 (6%) Query: 128 RGSWFVPLQDND-RYLTWSQLGLTQQDNGLVS-NVGVGQRWARGNW--LVGYNTFYDNLL 183 R F+PL N LT+ + L ++ V N+ G R+ + + G +D Sbjct: 93 RLDSFLPLLQNPGSTLTFLEGRLQLDNSANVGGNLLFGHRFYNQSLNRIFGGYLGFDRRD 152 Query: 184 DENLQ--RAGFGAEAWGEYLRLSANFYQPFAAWHE 216 N + G G E GE + N Y P + Sbjct: 153 TGNSTFHQLGVGVETLGEVWDVRLNGYFPLGDTRD 187 >UniRef50_A1AQZ5 Fibronectin, type III domain protein n=2 Tax=Desulfuromonadales RepID=A1AQZ5_PELPD Length = 1141 Score = 41.5 bits (96), Expect = 0.064, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 39/125 (31%), Gaps = 16/125 (12%) Query: 340 NNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPG 399 NN TV L P + T+ + G+ + + +L + Sbjct: 388 NNDI-------TAPTVALTAPLSNSIVSGTITVSAGASDNVGVNMVEVYANGALLFASN- 439 Query: 400 AQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 S +T W + A+ + L+ D+ G +S+ +T+ + + Sbjct: 440 ---ASPFNFT-----WDTTQVANGSYTLTARAVDSSGNIGTSSTVTVNVQNADTTAPSIS 491 Query: 460 LRWEP 464 P Sbjct: 492 AFSLP 496 >UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E34 Length = 1016 Score = 41.5 bits (96), Expect = 0.074, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 47/135 (34%), Gaps = 33/135 (24%) Query: 110 GNASVDVKVDNEGHFTGSRGSWFVPLQDN-------DRYLTWSQLGLTQQDNGLVSN-VG 161 G + +K + +T S PL + + + + ++ + G +++ +G Sbjct: 53 GTVTAGLKTSD--AYTDGHFSIVAPLYSTLGADATLEGSVLFIEPYVSYGEGGEIASSLG 110 Query: 162 VGQRWARGNW--------------------LVGYNTF---YDNLLDENLQRAGFGAEAWG 198 +G R G+ VG + F D + + G G EA Sbjct: 111 LGFRHLFGSQPLTALSANNTAQAGFLDEGVFVGSSVFVDMLDTEANNQFWQLGVGIEAGT 170 Query: 199 EYLRLSANFYQPFAA 213 Y+ + N+Y P + Sbjct: 171 RYVEVRGNYYIPLSD 185 >UniRef50_B9XJ25 Na-Ca exchanger/integrin-beta4 n=1 Tax=bacterium Ellin514 RepID=B9XJ25_9BACT Length = 888 Score = 41.1 bits (95), Expect = 0.098, Method: Composition-based stats. Identities = 12/97 (12%), Positives = 31/97 (31%), Gaps = 1/97 (1%) Query: 353 LTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIM 412 + + T V L + + + + T L Q+ T + Sbjct: 148 IRLLYPTNNQTFTAPTNVTLYASVTDSNLVTTVQFFAGTNNLGTVTNTQSAPPTNATSSI 207 Query: 413 PDWQNGEGA-SNHWRLSVVVEDNQGQRVSSNEITLTL 448 ++ + + L+ + D+ G +S I++ + Sbjct: 208 TFYKIWSNVLAGTYTLTAIATDSTGHTATSAPISIVV 244 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 505 e-141 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 500 e-140 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 488 e-136 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 486 e-136 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 477 e-133 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 475 e-132 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 473 e-132 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 471 e-131 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 469 e-131 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 467 e-130 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 464 e-129 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 463 e-129 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 462 e-128 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 460 e-128 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 459 e-128 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 458 e-127 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 458 e-127 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 457 e-127 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 456 e-127 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 456 e-127 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 454 e-126 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 454 e-126 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 453 e-126 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 451 e-125 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 451 e-125 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 450 e-125 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 443 e-123 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 442 e-122 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 437 e-121 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 437 e-121 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 436 e-121 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 433 e-120 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 432 e-120 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 432 e-119 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 431 e-119 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 431 e-119 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 431 e-119 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 430 e-119 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 429 e-118 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 429 e-118 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 428 e-118 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 423 e-117 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 421 e-116 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 421 e-116 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 420 e-116 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 417 e-115 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 416 e-114 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 414 e-114 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 411 e-113 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 409 e-113 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 409 e-112 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 407 e-112 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 376 e-103 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 374 e-102 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 374 e-102 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 372 e-101 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 370 e-101 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 360 6e-98 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 355 2e-96 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 355 2e-96 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 333 6e-90 UniRef50_P36943 Putative attaching and effacing protein homolog ... 296 1e-78 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 281 4e-74 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 268 3e-70 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 267 7e-70 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 262 2e-68 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 250 1e-64 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 233 9e-60 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 233 1e-59 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 217 8e-55 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 205 2e-51 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 184 5e-45 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 183 1e-44 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 176 1e-42 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 173 1e-41 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 173 1e-41 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 173 2e-41 UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured b... 164 8e-39 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 164 9e-39 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 162 3e-38 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 160 8e-38 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 156 1e-36 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 154 6e-36 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 154 9e-36 UniRef50_C0B2E7 Putative uncharacterized protein n=1 Tax=Proteus... 152 3e-35 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 150 1e-34 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 145 4e-33 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 141 8e-32 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 140 1e-31 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 137 6e-31 UniRef50_Q4ACI6 Invasin (Fragment) n=1 Tax=Edwardsiella tarda Re... 133 2e-29 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 122 3e-26 UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuni... 120 7e-26 UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthin... 118 6e-25 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 115 5e-24 UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylo... 113 1e-23 UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Plancto... 112 3e-23 UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodosp... 107 6e-22 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 107 7e-22 UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Ca... 94 9e-18 UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magneto... 92 4e-17 UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillon... 91 1e-16 UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachl... 90 1e-16 UniRef50_UPI0000E0F7DB beta-glycosidase-like protein n=1 Tax=Gla... 89 2e-16 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 84 8e-15 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 84 9e-15 UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Plancto... 84 1e-14 UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachl... 82 3e-14 UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachl... 80 2e-13 UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus ... 72 5e-11 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 66 4e-09 UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=... 64 1e-08 UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmone... 49 5e-04 Sequences not found previously or not previously below threshold: UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum ... 86 2e-15 UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microco... 85 7e-15 UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma p... 79 3e-13 UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Plancto... 74 1e-11 UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryoc... 74 2e-11 UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Plancto... 71 8e-11 UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microco... 69 5e-10 UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root Re... 67 1e-09 UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastop... 65 7e-09 UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickett... 64 9e-09 UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorick... 62 5e-08 UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=... 61 8e-08 UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legione... 61 1e-07 UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microco... 60 2e-07 UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Plancto... 59 4e-07 UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 59 4e-07 UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Plancto... 59 4e-07 UniRef50_B9KFW6 Putative uncharacterized protein n=1 Tax=Campylo... 57 2e-06 UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Plancto... 57 2e-06 UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachl... 57 2e-06 UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=... 57 2e-06 UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanoth... 56 3e-06 UniRef50_A8PQI7 Putative outer membrane autotransporter barrel d... 56 3e-06 UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickett... 54 9e-06 UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=... 54 1e-05 UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyano... 53 3e-05 UniRef50_Q11VX9 CHU large protein; candidate pectate lyase, poly... 47 0.001 UniRef50_A1AQZ5 Fibronectin, type III domain protein n=2 Tax=Des... 47 0.001 UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellu... 47 0.002 UniRef50_A5G3Y8 Fibronectin, type III domain protein n=2 Tax=Geo... 47 0.002 UniRef50_B9XJ25 Na-Ca exchanger/integrin-beta4 n=1 Tax=bacterium... 45 0.005 UniRef50_Q08MX9 Chitinase c n=1 Tax=Stigmatella aurantiaca DW4/3... 45 0.005 UniRef50_Q3A4E5 Pectate lyase protein n=2 Tax=Pelobacter carbino... 45 0.006 UniRef50_A5G3Y9 Fibronectin, type III domain protein n=1 Tax=Geo... 45 0.007 UniRef50_A9B6Z4 Penicillin-binding protein, 1A family n=1 Tax=He... 44 0.014 UniRef50_B3BT68 Large repetitive protein n=16 Tax=Bacteria RepID... 44 0.016 UniRef50_Q1IW67 PPC, peptidase containing PKD repeats n=1 Tax=De... 43 0.026 UniRef50_B7LJF8 Adhesin for cattle intestine colonization n=14 T... 42 0.038 UniRef50_Q11W88 Endoglucanase-related protein n=1 Tax=Cytophaga ... 42 0.047 UniRef50_B3E8T7 Multicopper oxidase type 2 n=3 Tax=Geobacteracea... 42 0.071 UniRef50_Q1H368 Phosphate-selective porin O and P n=1 Tax=Methyl... 41 0.087 UniRef50_C9XYC9 Putative uncharacterized protein n=1 Tax=Cronoba... 41 0.087 UniRef50_C4SD59 Autotransporter adhesin n=1 Tax=Yersinia mollare... 41 0.091 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 505 bits (1301), Expect = e-141, Method: Composition-based stats. Identities = 464/464 (100%), Positives = 464/464 (100%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA Sbjct: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN Sbjct: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD Sbjct: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 Query: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY 240 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY Sbjct: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFY 240 Query: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN Sbjct: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 Query: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP 360 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP Sbjct: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATP 360 Query: 361 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG 420 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG Sbjct: 361 PWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG 420 Query: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP Sbjct: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 500 bits (1286), Expect = e-140, Method: Composition-based stats. Identities = 121/433 (27%), Positives = 202/433 (46%), Gaps = 11/433 (2%) Query: 32 AANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVR 91 A P + P A + + A I G N D A R Sbjct: 111 APLPAVTWAEETPVPASASKEDLQAQKIAGIASQAGNFLANSPRGDA-------AASIAR 163 Query: 92 DALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQ 151 + + V+ WLS +G A + + VDN+ S+ +PL + L ++Q L + Sbjct: 164 GMATGAASTEVQQWLSQFGTARLQLDVDNKFSLKNSQLDLLIPLYEQPDKLVFTQGSLHR 223 Query: 152 QDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPF 211 D+ +N+G+G RW +++G NTF D L + R G G E W +YL++ AN Y Sbjct: 224 TDDRTQTNLGMGMRWFNDGYMLGGNTFLDYDLSRDHARMGMGVEYWRDYLKIGANNYLRL 283 Query: 212 AAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 W + ++R A G+D++ +P L ++ EQY+G V LF +P Sbjct: 284 TNWRDSKDFADYQERPANGWDMSLEGWVPALPQLGGNLKYEQYYGKEVALFGKDNRQKDP 343 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 A+++G+NYTP PL+T +A +QG++G+N LG+ LN + G P + QL V ++L Sbjct: 344 HAITVGVNYTPFPLLTFSADQRQGKAGQNDTRLGVQLNIQLGTPWQHQLDTSAVGAMRTL 403 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 GSRYD RNN LEYR+++ + ++ A GE L + I ++YG+ ++ W Sbjct: 404 AGSRYDLVDRNNNIVLEYRKKEVIHLYTADH-LAGYAGEQKSLNVSINTKYGLERIDWSA 462 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 ++L+ S + +++++PD+ N + +S V D G + TLT+ Sbjct: 463 P-ELLAAGGKIVQESIDNYSIVLPDYNFDSANGNVYEISGVAIDTHGNVSKKAKTTLTVT 521 Query: 450 EPFDALSNDELRW 462 +P + E Sbjct: 522 QPAINTTTSEFTP 534 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 488 bits (1255), Expect = e-136, Method: Composition-based stats. Identities = 129/451 (28%), Positives = 214/451 (47%), Gaps = 9/451 (1%) Query: 17 VAGGTANAQSTFEQKAANPFDNNN-DGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNG 75 V + ++ + + + D LP+ + + ++ + + Sbjct: 7 VNQFRSFSKPFIQLGSGDEIDIPRITPLPE-KITTAENAKTVSSSQYKERLAHNLLKGAT 65 Query: 76 LDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPL 135 + + A R + N + WLS +G A V + +DN GS +PL Sbjct: 66 VLADDNTPLAAASMARSVAVGEANDAAQHWLSQFGTARVQLNLDNNLSLKGSAFDMLLPL 125 Query: 136 QDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAE 195 D+ + L +SQ GL D+ N+G G R + NW+ G N F+D + R GFGAE Sbjct: 126 YDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQDNWMYGANVFFDRDITGKNNRIGFGAE 185 Query: 196 AWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYF 253 AW +YL+LSAN Y WH+ +R A GYDL +P Y + T++ EQY Sbjct: 186 AWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGYDLRVEAYLPAYPQIGTNLKYEQYK 245 Query: 254 GDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVP 313 G+ V LF NP A + G+NYTP+PL+T+ A+ + G+ G N N+ + LNYR G P Sbjct: 246 GNEVALFGKDDRQKNPYAFTAGINYTPIPLITIGAEQRAGKGGRNDTNISIQLNYRLGEP 305 Query: 314 LKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLK 373 + Q+ VA S++L GSRYD +RNN LEY+++ + + L E + ++ Sbjct: 306 WQSQIDPSAVAASRTLAGSRYDLVERNNNIVLEYQKQDLIQLVLPN-QMTGSAFEIIKVE 364 Query: 374 LQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVED 433 Q+ ++YG++++ W I++ S++ ++ +P + SN + LS V D Sbjct: 365 AQVTAKYGLKRIDWDT-AVIVAAGGVVTQTSSQNISIKLPPYT---AGSNVYMLSAVAYD 420 Query: 434 NQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 NQG + + +T+ + + N L+ P Sbjct: 421 NQGNTSNHSTTQITVTQQSVSHLNSTLQVSP 451 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 486 bits (1252), Expect = e-136, Method: Composition-based stats. Identities = 117/412 (28%), Positives = 205/412 (49%), Gaps = 11/412 (2%) Query: 51 ENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWG 110 ++ +G A++ G N D AL R +S + V+ WL+ +G Sbjct: 140 KDDEGAMKMADMASRAGTLLSNSPDGDA-------ALSMARGQISAVASGQVQQWLNQFG 192 Query: 111 NASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN 170 A V ++ D S+ +P + + L ++Q L + D+ +N+G G R+ + Sbjct: 193 TARVQLEADEHFSLKNSQVDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAPS 252 Query: 171 WLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARG 228 +++G N F D L R G G E W ++L+LSAN Y + W + ++R A G Sbjct: 253 YMLGGNIFGDYDLSHEHSRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANG 312 Query: 229 YDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTA 288 +D+ A+ +P L ++ EQY+G V LF NP A++ G+N+TP PL+ + A Sbjct: 313 WDIRAQAWLPSLPQLGGKLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPLLMLGA 372 Query: 289 QHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYR 348 +H+QG SG+N + + +YR G+P ++Q++ VA +SL GSRYD +RNN L+YR Sbjct: 373 EHRQGASGKNDKRISADFSYRLGLPWQQQINPQAVATMRSLAGSRYDLVERNNHILLQYR 432 Query: 349 QRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGW 408 +++T+ + GE L + + S+YG+ ++ W + +L+ A W Sbjct: 433 KKETVRLHTVDRV-TGYAGEKKSLGVSVNSQYGLERIDWSA-SSLLACGGQLVREDAGNW 490 Query: 409 TLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDEL 460 ++I+P++Q G A N W +S V D +G + + +T+ + S + Sbjct: 491 SVILPEYQPGAQAVNTWTVSGVAVDKKGNVSARADTQVTVAQSAIDASMSPV 542 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 477 bits (1227), Expect = e-133, Method: Composition-based stats. Identities = 119/441 (26%), Positives = 201/441 (45%), Gaps = 19/441 (4%) Query: 34 NPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDA 93 + + P L A + A+ V + E A G R Sbjct: 142 SSLNTQAGQAPKLSSAMREPSRAEKEAQAVGQLMSVGATLSSTRPSEA----AAGMARSM 197 Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 + N+ ++ WLS +G A V + +D + S WF+P+ D+ ++QLG +D Sbjct: 198 ATNAANEEIQQWLSKYGTARVQLNLDKNFSLSESALDWFIPVWDSANLTAFTQLGARNKD 257 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 N+GVG R W++G N FYD+ L + R G GAEAW +YL+LS N Y + Sbjct: 258 RRNTINLGVGARTLLDRWMLGVNMFYDHDLTGHNSRLGIGAEAWTDYLQLSTNGYMRLSN 317 Query: 214 WHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 WH+ ++R A G+D+ A +P L + EQY G+ V LF NP A Sbjct: 318 WHQSRDFADYDERAANGFDIRANAWLPALPQLGGKLVYEQYIGENVALFGKENLQRNPYA 377 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 L+ G+NYTP PL+TV + G++G N + L+YR G+ + Q+ VA + + Sbjct: 378 LTAGVNYTPFPLLTVGVDERLGKAGRNDTQFSIQLSYRPGLSWQSQIDPSSVAAIRQIAE 437 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDT 391 SRY+ RNN LEY++++ + + L+ + G + ++S+Y + Q+ WQ D Sbjct: 438 SRYNLVDRNNDIVLEYKKQEVIKLALSHHAINDLAGAVYTVSANLKSKYALDQVSWQ-DG 496 Query: 392 QILSLTPGAQANSAEGWTLIMPDWQNGEG------------ASNHWRLSVVVEDNQGQRV 439 +++ ++L++P ++ + A+N ++L V DNQG + Sbjct: 497 GLVAAGGQLTVIDKNHFSLMLPPYRPAQAKSDAHQTSTAEIAANTYQLIAVAFDNQGNQS 556 Query: 440 SSNEITLTLVEPFDALSNDEL 460 +S + + + P + Sbjct: 557 NSETLRVVVQPPQVTAQGTFV 577 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 475 bits (1223), Expect = e-132, Method: Composition-based stats. Identities = 113/414 (27%), Positives = 206/414 (49%), Gaps = 13/414 (3%) Query: 52 NHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGN 111 + G + A++ G ++ D AL R ++ + + ++ WL+ +G Sbjct: 163 DDAGARKMADVASRAGAFLSDNPNGDA-------ALSLARGEVTAEASGQLQQWLNQFGT 215 Query: 112 ASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNW 171 A V + D F S+ PL + L ++Q L + D+ N+G G R+ ++ Sbjct: 216 ARVQLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQVNLGFGLRYFAPSY 275 Query: 172 LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGY 229 ++G N F D L R G G E W ++L+LSAN Y + W+ + ++R A G+ Sbjct: 276 MLGGNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNSSDFKDYQERPANGW 335 Query: 230 DLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQ 289 D+ A+ +P L ++ EQY+G V LF +P A++ G+N+TP PL+T+ A+ Sbjct: 336 DIRAQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAGVNFTPFPLLTLNAE 395 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 H+QG SG+N LG++ +Y+ G+P ++Q++ VA +SL GSRYD +RNN L+YR+ Sbjct: 396 HRQGASGKNDKRLGVDFSYQLGMPWQQQINPQAVATMRSLAGSRYDLVERNNHILLQYRK 455 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 ++ + + GE L + + S YG+ ++ W + +L+ + W+ Sbjct: 456 KEVIRLHTVGRV-TGYAGERKSLGVSVNSSYGLERIDWSA-SSLLAAGGKLVRENEGSWS 513 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 +I+P+ + G +N W ++ V D +G + + +T+ + S + E Sbjct: 514 VILPEHK--PGEANSWTITGVAVDKKGNVSTGADTQVTVAQAAIDASMSPVTPE 565 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 473 bits (1217), Expect = e-132, Method: Composition-based stats. Identities = 120/420 (28%), Positives = 189/420 (45%), Gaps = 15/420 (3%) Query: 49 APENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSP 108 E A + + A R + + WLS Sbjct: 46 LSEQDATAAQVAGMTTQAAGMLQSGMNS-------RQAAEMARGYATSTAQSAFQEWLSQ 98 Query: 109 WGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR 168 WG V + +D + GS +P D L ++Q + D+ N G G R Sbjct: 99 WGTVRVTLGLDEDFTLKGSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFA 158 Query: 169 GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA---TQEQRM 225 +++ G N F+D+ L R G G E W + L+L AN Y + W + E R Sbjct: 159 PDYMAGVNLFFDHDLTRYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARP 218 Query: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVT 285 A G+D+ A +P Y L ++ EQY+GD V LF +P A + GL+YTPVPL++ Sbjct: 219 ANGWDVRAEGYLPAYPQLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLIS 278 Query: 286 VTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 ++A+ KQG+ GEN LNL Y GV L QL VA +SL GSR+D +RNN L Sbjct: 279 LSAEQKQGKGGENDTRFALNLTYTPGVSLAHQLDPDAVAYRRSLAGSRHDLVERNNNIVL 338 Query: 346 EYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSA 405 EYR+++ + + L P K GE PL ++S+Y ++ L + L G A Sbjct: 339 EYRKKELVKLQL-HDPVTGKGGEQKPLVASLQSKYALKTLR--AEAAELQSAGGVVNTEA 395 Query: 406 EGWTLIMPDWQN--GEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 T+ +P+++ N +R++V ED +G R + E ++ ++ P + + ++ + Sbjct: 396 NQVTVTLPEYRYTATPQTDNVYRVAVTAEDEKGNRSNREEASVVVLAPQLSAQHSQVTSD 455 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 471 bits (1213), Expect = e-131, Method: Composition-based stats. Identities = 138/490 (28%), Positives = 222/490 (45%), Gaps = 35/490 (7%) Query: 1 MSRFVPRIIPF--------YLLLLVAGGTANAQSTFEQ---KAANPFDNNNDGLPDLGMA 49 MS + +II F + L+ A A ++ + P+ ++ +L Sbjct: 17 MSMYFNKIISFNIISRIVICIFLICGMFMAGASEKYDANAPQQVQPYSVSSSAFENLHPN 76 Query: 50 PENHDGEKHF-AEIVKDFGETSMNDNGLDTGEQ------------AKAFALGKVRDALSQ 96 E F A + N E A A + Sbjct: 77 NEMESSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLAASGRASDVAHSMVGD 136 Query: 97 QVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL 156 VNQ ++ WL+ +G A V++ D S W P D+ +L +SQLG+ +D+ Sbjct: 137 AVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLFFSQLGIRNKDSRN 196 Query: 157 VSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 N+GVG R WL G NTFYDN L + R G GAEAW +YL+L+AN Y WH Sbjct: 197 TLNLGVGIRTLENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQLAANGYFRLNGWHS 256 Query: 217 QTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSL 274 ++R A G DL A +P L + EQY G+RV LF NP A++ Sbjct: 257 SRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALFGKDNLQRNPYAVTA 316 Query: 275 GLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRY 334 G+NYTPVPL+TV + G+S +++ L +NYR G + QLS VA ++ L SRY Sbjct: 317 GINYTPVPLLTVGVDQRMGKSSKHETQWNLQMNYRLGESFQSQLSPSAVAGTRLLAESRY 376 Query: 335 DNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQIL 394 + RNN LEY++++ + + L+ PG+ + Q++ +R+++W D +++ Sbjct: 377 NLVDRNNNIVLEYQKQQVVKLTLSPATISGLPGQVYQVNAQVQGASAVREIVWS-DAELI 435 Query: 395 SLTPGAQANSAEGWTLIMPDWQNGEG--------ASNHWRLSVVVEDNQGQRVSSNEITL 446 + S + L++P ++ +N + LS + D+QG R +S +++ Sbjct: 436 AAGGTLTPLSTTQFNLVLPPYKRTAQVSRVTDDLTANFYSLSALAVDHQGNRSNSFTLSV 495 Query: 447 TLVEPFDALS 456 T+ +P L+ Sbjct: 496 TVQQPQLTLT 505 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 469 bits (1207), Expect = e-131, Method: Composition-based stats. Identities = 127/455 (27%), Positives = 220/455 (48%), Gaps = 16/455 (3%) Query: 17 VAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDG----EKHFAEIVKDFGETSMN 72 V ++ + + D P + +N + E A V+ Sbjct: 68 VNIYRTFSKPFTALTSGDEIDIPRKASPFSIDSEKNKNADVLLENKLASHVQTGATALAT 127 Query: 73 DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWF 132 N + E+ +R A + + N + WLS +G A V + V+++ GS Sbjct: 128 SNAAKSSER-------MIRSAANNEFNSSAQQWLSQFGTARVQMNVNDDFKLDGSAVDVL 180 Query: 133 VPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGF 192 VP+ DN + + ++QLG +DN N+G G R + NW+ G NTF+DN + +R G Sbjct: 181 VPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQNNWMYGVNTFFDNDMTGKNRRVGV 240 Query: 193 GAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLE 250 GAEAW +YL+LSAN Y + WH+ +R A GYD+ A +P + L + E Sbjct: 241 GAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRAEAYLPSHPQLGGKLMYE 300 Query: 251 QYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRF 310 +Y G+ V LF NP A++ G+NYTP+PL+TV A+H+ G+ +N +++ NYR Sbjct: 301 KYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPLLTVGAEHRAGKGSKNDSSINFQFNYRL 360 Query: 311 GVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETV 370 G + ++ VA +++L GSRYD +RNN L+Y++++ + + L + K G+ Sbjct: 361 GESWQSHINPSAVAATRTLAGSRYDLVERNNNIVLDYQKQELIRLSLPERV-EGKAGDIA 419 Query: 371 PLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGAS-NHWRLSV 429 + Q+ S+YG+ ++ W +++ S+ ++ +P +Q G + N + LS Sbjct: 420 TVNAQVTSKYGLERIDWD-SAALIAAGGTLSKGSSNSISITLPPYQASVGNTPNSYTLSA 478 Query: 430 VVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + D QG R +S+ + + + N + P Sbjct: 479 IAFDTQGNRSNSSSTLINVSPQNLSTGNSLMTATP 513 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 467 bits (1201), Expect = e-130, Method: Composition-based stats. Identities = 123/463 (26%), Positives = 201/463 (43%), Gaps = 18/463 (3%) Query: 9 IPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENH---DGEKHFAEIVKD 65 I L + A+ + + D + P + + E+ A + Sbjct: 88 ISVAELRKLNQFRTFARGFDNVRQGDELDVPAQVSENNLTPPPGNSSGNLEQQIASTSQQ 147 Query: 66 FGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFT 125 G D + A R S Q + + WLS +G A + + VD + Sbjct: 148 IGSLLAEDMNSE-------QAANMARGWASSQASGAMTDWLSRFGTARITLGVDEDFSLK 200 Query: 126 GSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDE 185 S+ + P + L +SQ L + D N G+G R W+ G N F+D+ L Sbjct: 201 NSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTPTWMSGINFFFDHDLSR 260 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT---ATQEQRMARGYDLTARMRMPFYQH 242 RAG GAE W +YL+LS+N Y W E R A G+D+ A +P + H Sbjct: 261 YHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWDVRAEGWLPAWPH 320 Query: 243 LNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNL 302 L + EQY+GD V LF+ NP A++ GLNYTP PL+T +A+ +QG+ GEN Sbjct: 321 LGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPLMTFSAEQRQGKQGENDTRF 380 Query: 303 GLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPW 362 ++ ++ G ++KQL EV +SL GSR+D RNN LEYR+++ + + L T P Sbjct: 381 AVDFTWQPGSAMQKQLDPNEVDARRSLAGSRFDLVDRNNNIVLEYRKKELVRLTL-TDPV 439 Query: 363 DLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQN--GEG 420 K GE L ++++Y ++ + + L G + + + +P ++ Sbjct: 440 TGKSGEVKSLVSSLQTKYALKG--YNVEATALEAAGGKVVTTGKDILVTLPAYRFTSTPE 497 Query: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 N W + V ED +G + + + + P + + + Sbjct: 498 TDNTWPIEVTAEDVKGNFSNREQSMVVVQAPTLSQKDSSVSLS 540 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 464 bits (1193), Expect = e-129, Method: Composition-based stats. Identities = 286/434 (65%), Positives = 344/434 (79%) Query: 30 QKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGK 89 ++A NPFD N D LPDLG+APEN+ EKHFA ++K FGE S D+ L G+QA+ FA + Sbjct: 2 RQAQNPFDENGDNLPDLGLAPENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTR 61 Query: 90 VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGL 149 +RDA+S + ES LSPWGNA+VD+ VD EG+F GS GS F P QDN+RYLTWSQ+G+ Sbjct: 62 LRDAVSSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGV 121 Query: 150 TQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQ 209 +QQ+ GLV N G+GQRW G+WL+GYNTFYD L D++ RAGFGAEAWG+YLRLSAN+YQ Sbjct: 122 SQQNQGLVGNAGIGQRWTAGHWLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQ 181 Query: 210 PFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNP 269 P W + EQRMARGYD+TA+ +PFYQH+NTSVS EQYFGD+V+LF+SG+GYHNP Sbjct: 182 PLGGWEHRAGLLEQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYHNP 241 Query: 270 VALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL 329 VA+ +GL+YTPVPLVTV+A H+QGESG +QN+LGL LNYRFGVPL KQLS EVA S+SL Sbjct: 242 VAVKVGLSYTPVPLVTVSAHHRQGESGVSQNDLGLKLNYRFGVPLNKQLSPDEVAASRSL 301 Query: 330 RGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQG 389 RGSRYD +R N+P +E+RQRKTL+VFLATPPWDL GETV LKLQ+RSR+GIRQL WQG Sbjct: 302 RGSRYDRVERTNVPVMEFRQRKTLSVFLATPPWDLSAGETVALKLQVRSRHGIRQLSWQG 361 Query: 390 DTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 DTQ LSLTP + SA+GWT+IMP W N GASN WRLSV VED QGQRV+SN ITL L Sbjct: 362 DTQALSLTPPIDSTSADGWTVIMPAWDNSPGASNSWRLSVTVEDEQGQRVTSNWITLKLS 421 Query: 450 EPFDALSNDELRWE 463 P L D+ R+E Sbjct: 422 VPVQTLPQDDPRYE 435 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 463 bits (1190), Expect = e-129, Method: Composition-based stats. Identities = 134/435 (30%), Positives = 208/435 (47%), Gaps = 15/435 (3%) Query: 34 NPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDA 93 N L + M + D + AE+ + G D EQA + A G V + Sbjct: 120 NSNSPEARNLKAMQMERDGKDPQMQVAEMAQQSGTLLARDM---DSEQAASMARGWVASS 176 Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 S Q WLS WG A V + VD + S + P + L +SQ L + D Sbjct: 177 ASAQAT----DWLSRWGTARVSLGVDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLHRTD 232 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 + +N G+G R+ +W+ G N F D+ L R G G E W +YL+LS N Y + Sbjct: 233 DRTQTNHGIGWRYFTSSWMSGVNMFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSN 292 Query: 214 WHEQT---ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPV 270 W E R A G+DL A +P + L + EQY+GD V LF ++P Sbjct: 293 WRSAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKLVYEQYYGDEVALFGKDERQNDPH 352 Query: 271 ALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLR 330 A++ GL+YTPVPL++ +A+ +QG+ GEN +G+ L + G L+KQL EVA +SL Sbjct: 353 AITAGLSYTPVPLISFSAEQRQGKQGENDTRIGMELTLQPGHSLQKQLDPAEVAARRSLV 412 Query: 331 GSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGD 390 GSRYD RNN LEYR+++ + + L T P KPGE L ++++Y ++ + + Sbjct: 413 GSRYDLVDRNNNIVLEYRKKELVRLTL-TDPLKGKPGEVKSLVSSLQTKYALKG--YDIE 469 Query: 391 TQILSLTPGAQANSAEGWTLIMPDWQNG--EGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 L G A S + + +P ++ N + ++V ED++G E + + Sbjct: 470 AASLQSAGGKVAVSGKDIQVTIPPYRFTAMPETDNTYPIAVTAEDSKGNFSRREESMVVV 529 Query: 449 VEPFDALSNDELRWE 463 +P +L++ L + Sbjct: 530 EKPTLSLTDSTLSVD 544 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 462 bits (1189), Expect = e-128, Method: Composition-based stats. Identities = 123/446 (27%), Positives = 205/446 (45%), Gaps = 14/446 (3%) Query: 23 NAQSTFEQKAANPFDNNN---DGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTG 79 T + P NN LP +P + A+ G T N+ Sbjct: 93 FEALTTGDEIDIPLIGNNFTTQSLPHSTSSPNDSL----LAQSASQVGNTLQNN---PNS 145 Query: 80 EQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND 139 E A A + + Q + WL+ G V + D + S+ VPL +++ Sbjct: 146 EALNDLARSSALSAANAKAGQEISDWLNGKGKVRVKLDADRDFSVKNSQLDLLVPLWESE 205 Query: 140 RYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGE 199 ++ +SQ + + D+ SN+G+G R+ ++ +G NTFYD+ + R G GAE Sbjct: 206 SHMIFSQGSVHRTDDRTQSNLGLGYRYFADSYALGANTFYDHDWSRSHSRLGLGAEYQRN 265 Query: 200 YLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRV 257 + +L+ N Y + W + E+R A G+D+ A +P Y L ++ EQY+GD V Sbjct: 266 FFKLATNGYLRLSNWKDSPDFDNYEERPANGWDIRAEGYLPSYPGLGAKLAYEQYYGDNV 325 Query: 258 DLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQ 317 LF NP A++ G NY+P PL+ + +QG+ G+N G++LNY G PL Q Sbjct: 326 GLFGKDNQQKNPHAITFGGNYSPFPLLKFSVDQRQGKGGQNDTRFGIDLNYTLGTPLSHQ 385 Query: 318 LSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIR 377 L ++ S+SL +RYD RNN LEYR++ TL++ LA GE L++ + Sbjct: 386 LDRNQLIASRSLIANRYDFVDRNNNIVLEYRKKNTLSLKLAQ-QVSGYTGERKSLEVSVN 444 Query: 378 SRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQ 437 S G+ ++ W ++LS +++I+P++Q G GA+N + ++ DN G Sbjct: 445 SSNGLERIDWDAP-ELLSNGGQIIQEGPGLYSVIVPEFQYGVGAANQYIVNATAYDNSGN 503 Query: 438 RVSSNEITLTLVEPFDALSNDELRWE 463 T+ + + ++ E Sbjct: 504 ASQQASTTVVVTASAVSTTHSEFTPT 529 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 460 bits (1183), Expect = e-128, Method: Composition-based stats. Identities = 136/435 (31%), Positives = 207/435 (47%), Gaps = 15/435 (3%) Query: 34 NPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDA 93 N L + M + D + AE+ + G D EQA + A G V + Sbjct: 120 NSNSPEARNLKAMQMERDGKDPQMQVAEMAQQSGTLLARDM---DSEQAASMARGWVASS 176 Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 S Q WLS WG A V + VD + S + P + L +SQ L + D Sbjct: 177 ASAQAT----DWLSRWGTARVSLGVDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLHRTD 232 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 N +N G+G R+ +W+ G N F D+ L R G G E W +YL+LS N Y + Sbjct: 233 NRTQTNHGIGWRYFTSSWMSGVNMFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSN 292 Query: 214 WHEQT---ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPV 270 W E R A G+DL A +P + L V EQY+GD V LF ++P Sbjct: 293 WRSAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKVVYEQYYGDEVALFGKDERQNDPH 352 Query: 271 ALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLR 330 A++ GL+YTPVPL++ +A+ +QG+ GEN +G+ L + G L+KQL EVA +SL Sbjct: 353 AITAGLSYTPVPLISFSAEQRQGKQGENDTRIGMELTLQPGHSLQKQLDPAEVAARRSLV 412 Query: 331 GSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGD 390 GSRYD RNN LEYR+++ + + L T P KPGE L ++++Y ++ + + Sbjct: 413 GSRYDLVDRNNNIVLEYRKKELVRLTL-TDPLKGKPGEVKSLVSSLQTKYALKG--YDIE 469 Query: 391 TQILSLTPGAQANSAEGWTLIMPDWQNG--EGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 L G A S + + +P ++ N + ++V ED++G E + + Sbjct: 470 AASLQSAGGKVAVSGKDIQVTIPPYRFTAMPETDNTYPIAVTAEDSKGNFSRREESMVVV 529 Query: 449 VEPFDALSNDELRWE 463 +P +L+ L + Sbjct: 530 EKPTLSLAGSTLSVD 544 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 459 bits (1182), Expect = e-128, Method: Composition-based stats. Identities = 129/433 (29%), Positives = 212/433 (48%), Gaps = 23/433 (5%) Query: 31 KAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKV 90 + NP +NN + DL A G+ NDN D A Sbjct: 145 FSNNPNENNKKDVDDL------------LARNAMGAGKLLSNDNTSDA-------ASNMA 185 Query: 91 RDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT 150 R A++ ++N + WL+ +G A V + VD++ S VPL+D++ L ++QLG+ Sbjct: 186 RSAVTNEINASSQQWLNQFGTARVQLNVDSDFKLDNSALDLLVPLKDSESSLLFTQLGVR 245 Query: 151 QQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 +D+ N+G G R +G+W+ G NTF+DN L +R G GAE +YL+ SAN Y Sbjct: 246 NKDSRNTVNIGAGIRQYQGDWMYGANTFFDNDLTGKNRRVGVGAEVATDYLKFSANTYFG 305 Query: 211 FAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHN 268 WH+ ++R A G+D+ +P Y L + E+Y GD V LF + Sbjct: 306 LTGWHQSRDFSSYDERPADGFDIRTEAYLPAYPQLGGKLMYEKYRGDEVALFGKDDRQKD 365 Query: 269 PVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQS 328 P A++LG+NYTPVPLVT+ A+H++G+ N ++ + LNYR G P Q+ VA +++ Sbjct: 366 PHAVTLGVNYTPVPLVTIGAEHREGKGNNNNTSVNVQLNYRMGQPWNDQIDQSAVAANRT 425 Query: 329 LRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQ 388 L GSRYD +RNN L+Y++++ + + L G + L Q+R++YG ++ W Sbjct: 426 LAGSRYDLVERNNNIVLDYKKQELIHLVLPDR-ISGSGGGAITLTAQVRAKYGFSRIEWD 484 Query: 389 GDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 T + + + ++ +P +Q+ SN +S V D QG + ++ + Sbjct: 485 A-TPLENAGGSTSPLTQSSLSVTLPFYQHILRTSNTHTISAVAYDAQGNASNRAVTSIEV 543 Query: 449 VEPFDALSNDELR 461 P + + Sbjct: 544 TRPETMVISHLAT 556 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 116/435 (26%), Positives = 209/435 (48%), Gaps = 12/435 (2%) Query: 30 QKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGK 89 A P +D P++ + + E A++ G+ + + Sbjct: 114 PMAPLPIVEWDDDKPEIVLPSSASENEIRVAQLASQAGKFFSTNPDQEKT-------KAF 166 Query: 90 VRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGL 149 R+ L+ + + + W + +G++ + ++ D + S+ +P + + L +SQ L Sbjct: 167 ARELLTTAASSYAQDWFNRFGSSQIHLEADKKFSLKNSQIDLLMPWYETEDNLIFSQTSL 226 Query: 150 TQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQ 209 +++ + +N+G+G RW ++G NTF+D + R G G E ++L+LSAN Y Sbjct: 227 HRKEGRIETNLGLGARWYGEGQMIGGNTFFDYDISRKHSRLGLGVEYRRDFLKLSANSYH 286 Query: 210 PFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 + W R + G+D+ A +P Y H+ ++ EQY+GD V LF + Sbjct: 287 RLSGWRSSRDLADHSARPSNGWDVRAEGWLPSYPHIGGKLTYEQYYGDSVALFGTKNLQQ 346 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 NP +++ GLNYTP+PLVT A+H+QG++ + + GL LNY+FG K+ L G V + Sbjct: 347 NPYSITAGLNYTPIPLVTFNAEHRQGKASKQDSRFGLQLNYQFGKTWKQHLDPGSVTTFR 406 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 SL G+RYD RNN LEY++ + + +A GE +PL + S+YG+ L W Sbjct: 407 SLMGNRYDFVSRNNHIVLEYKKNDVIQLNIANS-ITGYAGEKIPLSFTVASKYGLSHLKW 465 Query: 388 QGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLT 447 +T L G ++L++P ++N ++N++ +S V D +G + + + Sbjct: 466 NAET--LVAAGGHIVQENGKYSLVLPAYRNDAKSANNYTISAVAIDKKGNISPNTMLRVV 523 Query: 448 LVEPFDALSNDELRW 462 + +P L Sbjct: 524 VTQPAIYPIKSALTP 538 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 458 bits (1178), Expect = e-127, Method: Composition-based stats. Identities = 329/415 (79%), Positives = 376/415 (90%) Query: 48 MAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 M PE+H+GEK FAE+VK FGE SM DNGLDTGEQAK FA +VRDALS QVNQH+ESWLS Sbjct: 1 MMPESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLS 60 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 PWGNASV+V+VDN+G F GSRGSWF+P QDN RYLTWSQLGLT+Q++GLVSNVG+GQRWA Sbjct: 61 PWGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRWA 120 Query: 168 RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMAR 227 R WL+GYNTFYDNLLDE+LQRAG GAEAWGEYLRLSAN+YQPFA+WHE++ATQEQRMAR Sbjct: 121 RDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWHERSATQEQRMAR 180 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 GYD++A+MRMPFYQHL+T VS+EQYFGD VDLF+SG GYHNP+A+SLGLNYTPVPLVTVT Sbjct: 181 GYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLVTVT 240 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 AQHKQGESG +QNNLGLNLNYRFGVPLKKQL+A EVAES+SLRGSRYD+PQRN+LP +EY Sbjct: 241 AQHKQGESGVSQNNLGLNLNYRFGVPLKKQLAASEVAESKSLRGSRYDSPQRNSLPVIEY 300 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 RQRKTL+VFLATPPWDL+PGETVPLKLQ+RS +GIR + WQGDTQ LSLT GA A+S +G Sbjct: 301 RQRKTLSVFLATPPWDLQPGETVPLKLQVRSLHGIRHVSWQGDTQALSLTAGANADSIDG 360 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRW 462 WT+IMP W + EGA + WRLSVVVED +GQRVSSNEITL L EPF A+S+D+ RW Sbjct: 361 WTIIMPTWDSSEGAIHRWRLSVVVEDEKGQRVSSNEITLALTEPFMAMSDDDPRW 415 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 457 bits (1175), Expect = e-127, Method: Composition-based stats. Identities = 114/419 (27%), Positives = 195/419 (46%), Gaps = 12/419 (2%) Query: 46 LGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESW 105 A E ++ + A G + D R+ + + + W Sbjct: 130 SASAEEGNEQAQKVAGYASQAGSFLASSAKSDAAASMA-------RNMATVEAGGAFQQW 182 Query: 106 LSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQR 165 LS +G A V + D S+ +PL D ++Q L + D+ +++G G R Sbjct: 183 LSHFGTARVQLDADKNFSLKNSQFDLLLPLYDQGDNFVFTQGSLHRTDSRTQASLGAGWR 242 Query: 166 WARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQ 223 + +++G N F D L + RAG G E W +L+L N Y + W + ++ Sbjct: 243 HSTSTYMLGGNLFGDFDLSRDHARAGAGLEYWRNFLKLGVNSYLRLSGWKDSPDLEDYQE 302 Query: 224 RMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPL 283 R A G+D+ + +P L ++ EQY+G V LF + NP A+++G+NYTPVPL Sbjct: 303 RPANGWDVRGQAWVPSLPQLGGKLTYEQYYGKEVALFGVDSRQRNPHAITVGINYTPVPL 362 Query: 284 VTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLP 343 +T+ A+ +QG+SG++ L LN+NY GVP + Q+ VA +SL GS+YD +RNN Sbjct: 363 ITLGAEQRQGQSGKSDTRLTLNMNYHLGVPWRAQVDPTAVAAMRSLAGSQYDLVERNNNI 422 Query: 344 TLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQAN 403 LEYR+++ + + A GE L + + SR+G+ ++ W D L+ G Sbjct: 423 VLEYRKKEIVRLKTA-DLVTGYTGEQKSLGVSVNSRHGLERIDW--DASALNAAGGKIVQ 479 Query: 404 SAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRW 462 + + +++P +Q+ N + +S V D +G R S ++ +T+ Sbjct: 480 NGRDYAVVLPAYQSSAQGVNTYTVSGVAVDTKGNRSSRSDTQVTVQATEVNKQTSTFTP 538 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 456 bits (1174), Expect = e-127, Method: Composition-based stats. Identities = 150/434 (34%), Positives = 236/434 (54%), Gaps = 9/434 (2%) Query: 29 EQKAANPFD--NNNDGLPDLGM----APENHDGEKHFAEIVKDFGETSMNDNGLD-TGEQ 81 +P LP+LGM P GE+ A G + N+ D Q Sbjct: 88 PLFPLDPLAGKAIASNLPELGMGNDPVPLVSSGEQKTAAAAHAVGAQNWNNMTSDQMKNQ 147 Query: 82 AKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY 141 A+++A G+ + + + Q + L +G A V++ VD+ G + S S F P +ND Sbjct: 148 AESWAKGQAKAQVVDPLRQQAQELLGKFGKAQVNLAVDDNGSLSKSAFSLFSPWYENDAM 207 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 + +SQ+G+ +QDN ++ N+G G R+ +G+WL G NTF D + N R G G E W + L Sbjct: 208 VAFSQVGVHRQDNRMIGNLGAGVRFDQGDWLFGANTFLDQDISRNHSRLGLGLEWWADNL 267 Query: 202 RLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 +L++N+Y P + W + +R ARG+D+ A+ +P YQ L S EQY+GD V L Sbjct: 268 KLASNYYHPLSGWKDSKDFDDYLERPARGFDVHAQGYLPAYQQLGASAVYEQYYGDEVAL 327 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F +P A+++G++YTP PL T+ HK G+ G+N LGL ++Y+ G L+KQL Sbjct: 328 FGKDNLQKDPHAVTVGVDYTPFPLATLKVSHKMGKDGKNNTELGLQVSYQIGTALEKQLD 387 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSR 379 G VA +SL+GSRYD RN LEY+++ L++ LA P L G+ ++ +RS+ Sbjct: 388 PGNVAAMRSLKGSRYDLVDRNYDIVLEYKEKAVLSLDLAAVPMTLLEGDVYMMQPLVRSK 447 Query: 380 YGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRV 439 Y I + W GD L L P A AN+ +GW + +P W GA+N + LS+ + D +G + Sbjct: 448 YRITSVSWHGDAVPLLLVPTAGANNPQGWQITLPAWDATPGATNLYTLSISIVDEKGHQA 507 Query: 440 SSNEITLTLVEPFD 453 +SN++ + + + Sbjct: 508 TSNDVEIRVGQQRL 521 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 456 bits (1173), Expect = e-127, Method: Composition-based stats. Identities = 133/452 (29%), Positives = 217/452 (48%), Gaps = 13/452 (2%) Query: 17 VAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENH--DGEKHFAEIVKDFGETSMNDN 74 + ++ + D P +++ E A N + Sbjct: 100 INIYRTFSRPFTALTTGDEIDIPRKASPFSVDNNKDNRLSVENTLAGHAVAGATALSNGD 159 Query: 75 GLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVP 134 +GE+ VR A S + N + WLS +G A V + ++++ H GS +P Sbjct: 160 VAKSGER-------MVRSAASNEFNNSAQQWLSQFGTARVQLNINDDFHLDGSAADVLIP 212 Query: 135 LQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGA 194 L DN++ + ++QLG +D+ N+G G R +GNW+ G NTF+DN L +R G GA Sbjct: 213 LYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQGNWMYGANTFFDNDLTGKNRRIGVGA 272 Query: 195 EAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQY 252 EAW +YL+LSAN Y WH+ +R A GYDL A +P Y L E+Y Sbjct: 273 EAWTDYLKLSANNYFGITDWHQSRDFIDYNERPANGYDLRAEAYLPSYPQLGGKAMYEKY 332 Query: 253 FGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGV 312 GD V LF NP A++ G+NYTP+PLVT+ A+H+ G+ G+N +N+ LNYR G Sbjct: 333 RGDDVALFGKDNRQKNPHAITAGVNYTPIPLVTIGAEHRAGKGGQNDSNINFQLNYRLGE 392 Query: 313 PLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPL 372 + + VA S++L GSRYD +RNN L+Y+++ + + L P + + Sbjct: 393 TWQSHIDPSAVAASRTLAGSRYDLVERNNHIVLDYQKQNLVRLSLPDS-LAGDPFSQLSV 451 Query: 373 KLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVE 432 Q+ + +G+ ++ WQ ++++ + S G + +P++Q N + L+ + Sbjct: 452 TAQVTATHGLERIDWQ-SAELMAAGGVLKQTSKNGLEITLPEYQMNRTGGNSYILNAIAY 510 Query: 433 DNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 D QG S + +T+ ++N L P Sbjct: 511 DTQGNASSQASMLITVNAQKINIANSTLVAVP 542 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 454 bits (1168), Expect = e-126, Method: Composition-based stats. Identities = 124/454 (27%), Positives = 202/454 (44%), Gaps = 11/454 (2%) Query: 2 SRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAE 61 S I L + A+ + D P E A Sbjct: 82 SVAKKYAISVDELKRINIYRTFAKPFTALTVGDEIDVPRKKSPFTVDNNVTVPAENGVAS 141 Query: 62 IVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNE 121 + + + A R A++ +++ + WL +G A + +++ Sbjct: 142 NAAAGAALLSHGDAAKS-------AENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDD 194 Query: 122 GHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDN 181 F S +PL DN + L ++QLG +D+ N+G G R NW+ G NTF+DN Sbjct: 195 FEFDSSAIDVLIPLYDNQKSLFFTQLGGRNKDSRNTINIGAGVRAFLTNWMYGANTFFDN 254 Query: 182 LLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPF 239 + N +R G GAEAW +YL+LSAN Y WH+ +R A GYDL A +P Sbjct: 255 DITGNNRRVGIGAEAWTDYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLPA 314 Query: 240 YQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQ 299 Y L + EQY GD V LF +P A+++G+NYTPV LVTV H+ G+S ++ Sbjct: 315 YPQLGGKLMYEQYNGDEVALFGKDKRQKDPHAITVGINYTPVSLVTVGIDHRAGKSSKSD 374 Query: 300 NNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLAT 359 +++ L NYR + + VA +++L GSR D +RNN L+Y++++ L + L Sbjct: 375 SSINLQFNYRLSNSWQSHIDPSAVAVTRTLAGSRQDLVERNNNIVLDYQKQELLRLSLP- 433 Query: 360 PPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGE 419 G+ L QI S+Y ++++ W ++ +++ S + T+ P +Q Sbjct: 434 EQLTGSAGDNAILTAQIESKYEVQRVEWDANS-LIAAGGNISTTSQKDVTITFPPYQYQV 492 Query: 420 GASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFD 453 G SN + LS + D G + + + + Sbjct: 493 GVSNIYALSAIAYDVNGNISNRATTQIHVSQSST 526 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 454 bits (1167), Expect = e-126, Method: Composition-based stats. Identities = 123/445 (27%), Positives = 203/445 (45%), Gaps = 20/445 (4%) Query: 23 NAQSTFEQKAANPFDNNNDGL-PDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQ 81 A+ + P +N GL P+ + E++ A+ + + + Sbjct: 56 FAKLQAGDELEIPQAQSNLGLAPENTALTDTQTTERNLAKTATTSAQMLNSGDKA----- 110 Query: 82 AKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY 141 A ++R NQ SWL+ +G A + VD+ G GS+ +P D Sbjct: 111 ----AARQLRGLAVGNANQAANSWLNNFGTARLQANVDDRGDLDGSQFDMLMPFYDTPSQ 166 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 + ++Q G+ + D +N+G+G R +W+VGYN F D + + R G GAE +YL Sbjct: 167 MAFTQFGIRRIDKRTTANLGIGIRHFIDDWMVGYNLFLDRDITRDHTRVGAGAEYARDYL 226 Query: 202 RLSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 +L+AN Y + W + +R A G+DL A +P L + EQYFG+ V L Sbjct: 227 KLAANGYLRLSDWRDSPDFSSYSERPATGFDLRAEAYLPSLPQLGGKLMYEQYFGNDVGL 286 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F NP A++ G+NYTP+PLVTV KQG +G + L +NY G P KQ+S Sbjct: 287 FGKDNRQQNPAAITAGINYTPIPLVTVGIDRKQGSAGNGETLFNLGVNYEVGTPWAKQIS 346 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSR 379 V ++L+GSR D +RNN LEY+++ + ++++ + ET L + + S+ Sbjct: 347 PDAVNARRTLQGSRNDLVERNNQIVLEYKKQDVINLYVSNNV-SGRAAETKQLVVSVTSK 405 Query: 380 YGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRV 439 YG+R + + + + + L +P N+W +S + D +G Sbjct: 406 YGLRNIQFD-QGALAAAGGKIIPQGPSQFALQLPP---QPSGGNNWTISAIASDVKGNTS 461 Query: 440 SSNEITLTLVEPFDALSNDELRWEP 464 + +TLV+ D + W P Sbjct: 462 NRA---VTLVQLQDTPATISGTWTP 483 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 453 bits (1165), Expect = e-126, Method: Composition-based stats. Identities = 120/432 (27%), Positives = 197/432 (45%), Gaps = 24/432 (5%) Query: 46 LGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESW 105 L + + E A + G + Q+ A R + N + W Sbjct: 162 LSPHADTSERESRVAGQLMGVGRVLAS-------PQSSNAASEMARSWATAAANDEIVKW 214 Query: 106 LSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQR 165 LS +G A + + +D GS W +P D T++QLG +D+ N+G+G R Sbjct: 215 LSKYGTAQLQLNIDKNFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTLNIGIGTR 274 Query: 166 WARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQ 223 NWL G N FYD+ L R G G+EAW +YL+LS N Y + WH+ + Sbjct: 275 TLSNNWLFGVNAFYDHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSRDLADYNE 334 Query: 224 RMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPL 283 R A G+D+ A MP L + EQYFGD V LF NP A ++G+NYTP PL Sbjct: 335 RPANGFDVRANAWMPTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGVNYTPFPL 394 Query: 284 VTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLP 343 +T+ + G++ E+ + LNYR G + Q+ V S+ + SRY+ +RNN Sbjct: 395 LTLGVDQRLGKNSEHDTQFNVQLNYRIGDDWRAQVDPSAVPHSRLISESRYNLVERNNNI 454 Query: 344 TLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQAN 403 LEY+++ + + L + +PG + ++++YG++ ++WQ D + +S Q Sbjct: 455 VLEYQKQNIMHLSLPSDTLSGQPGSEHMISAILQTKYGLQDIVWQ-DAEFISAGGKLQRQ 513 Query: 404 SAEGWTLIMPDWQNGEG--------------ASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 + L +P ++ A+N + LS D +G + ++ +T+T+ Sbjct: 514 DKTHFNLTLPSYRYSATARRSGSHATAQAEIAANTYHLSATAFDTKGNQSNTINLTVTVE 573 Query: 450 EPFDALSNDELR 461 P E++ Sbjct: 574 PPTVFQGKFEVQ 585 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 451 bits (1160), Expect = e-125, Method: Composition-based stats. Identities = 126/434 (29%), Positives = 202/434 (46%), Gaps = 15/434 (3%) Query: 36 FDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALS 95 +++ + A + E A + G D + A G R S Sbjct: 122 LQKSHEQQNAVPPANGENTLENQIASTSQRVGTLLSQDMNSE-------QASGMARGWAS 174 Query: 96 QQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNG 155 + + + WL+ +G A + + VD + S+ + P D YL +SQ L + D+ Sbjct: 175 SEASGAMTDWLNNFGTAKISLGVDEDFSLKNSQFDFLHPWYDTPDYLLFSQHTLHRTDDR 234 Query: 156 LVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH 215 N G+G R +W+ G N F+D+ L RAG GAE W +YL+LS+N Y W Sbjct: 235 TQINTGLGWRHFTPSWMSGINLFFDHDLSRYHSRAGLGAEYWRDYLKLSSNAYIGLTGWR 294 Query: 216 EQT---ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 E R A G+DL A +P + L + EQY+GD V LF+ NP A+ Sbjct: 295 SAPELDNDYEARPANGWDLRAEGWLPAWPQLGGKLVYEQYYGDEVALFDKNDRQSNPHAI 354 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS 332 + GLNYTP PL+T++A+ +QG+ GEN ++L ++ ++KQL+ EVA +SL GS Sbjct: 355 TAGLNYTPFPLLTLSAEQRQGKQGENDTRFAVDLTWQPSSSMQKQLNPDEVAGRRSLAGS 414 Query: 333 RYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQ 392 RYD RNN LEYR+++ + + L P K GE PL ++++Y ++ + + Sbjct: 415 RYDLIDRNNNIVLEYRKKELIRLSL-LDPVKGKSGEIKPLVSSLQTKYALKG--YNIEAA 471 Query: 393 ILSLTPGAQANSAEGWTLIMPDWQN--GEGASNHWRLSVVVEDNQGQRVSSNEITLTLVE 450 L G + S + T+ +P ++ N W + V ED +G + + + Sbjct: 472 ALEAAGGKVSTSGKDITVTLPGYRFTNTPETDNTWSIDVTAEDVKGNLSRHEQSMVVIQA 531 Query: 451 PFDALSNDELRWEP 464 P + + L P Sbjct: 532 PTLSQKDSLLSVNP 545 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 451 bits (1160), Expect = e-125, Method: Composition-based stats. Identities = 133/463 (28%), Positives = 211/463 (45%), Gaps = 34/463 (7%) Query: 10 PFYLLLLVAGGTANAQ---STFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDF 66 P L+ L + ANA+ S+ E++ NP D N A+ Sbjct: 40 PSSLIYLSSVFNANAEEITSSAEKEQGNPSDQNASS----------------VAQTAVQA 83 Query: 67 GETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTG 126 G +DN D V A++ + + WLS +G A V++ D + Sbjct: 84 GSLLSSDNASDA-------LGSAVVSAVTGKAASSAQEWLSQFGTARVNISTDEHFTLSD 136 Query: 127 SRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDEN 186 S VPL + + L ++QLG + D+ + N G G R W+ G N FYD + N Sbjct: 137 SELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHFNDGWMWGTNVFYDRQVSGN 196 Query: 187 -LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHL 243 QR G E +YL +SAN Y + W ++ ++R+A G+D+ A +P Y L Sbjct: 197 QHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFDIRATGYLPAYPQL 256 Query: 244 NTSVSLEQYFGDRVDLFNSG--TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNN 301 ++ EQYFGD V LF +P A+++GLNYTPVPLVT+ K G+SGEN Sbjct: 257 GANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLVTMGLNQKMGKSGENDTQ 316 Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPP 361 + L L + GVPL QL +VA ++L+G R D RNN LEYR+++ +++ L Sbjct: 317 VNLGLTWTPGVPLSAQLDPSQVALRRTLQGGRLDLVDRNNNIVLEYRKQELISLALP-AE 375 Query: 362 WDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGA 421 + P+ +++++YG+ ++ WQGD+ ++ E + +P W Sbjct: 376 LEGAEQSKRPVTAKVKAKYGLDRIEWQGDSFFSHGGKITPGSNPEQVVMTLPVWVGS--G 433 Query: 422 SNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 SN + LS D +G ++ + +T+ P Sbjct: 434 SNSYTLSATAWDKKGNASAAERVNVTVNGIDVNTLLSATTVSP 476 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 450 bits (1157), Expect = e-125, Method: Composition-based stats. Identities = 206/469 (43%), Positives = 286/469 (60%), Gaps = 12/469 (2%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANP------FDNNNDGLPDLG-MAPENHDGEK 57 + +++PF L A G A A +P + LP+LG A + + EK Sbjct: 12 LKKVVPFATGCLPAMGLAWLCGALPAYAESPPAPDSVVQQPANDLPELGGNASNDAEREK 71 Query: 58 HFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVK 117 +A + K GE ++N+ Q + A S + Q + LSP GNA + + Sbjct: 72 EWATMAKQLGERNLNNVSSQ---QVRTRAESYAVGQASSVLQQQAQELLSPLGNAKLSLV 128 Query: 118 VDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNT 177 + ++G F+GS G F PL D + LT+SQLGL QQ G + N G+GQRW G+WL+GYNT Sbjct: 129 MSDQGDFSGSSGQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAGDWLLGYNT 188 Query: 178 FYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE--QRMARGYDLTARM 235 D+ + + RA GAEAWG++LR SAN+Y P +A +Q + R A GYD+T + Sbjct: 189 VLDSDFERHHNRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDITTQG 248 Query: 236 RMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGES 295 +PFY+ + S+S EQY+G+ VDLF SG ++P A+ LG+NYTPVPLVTV A HK GE Sbjct: 249 YLPFYRQIGGSLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLVTVKALHKMGEG 308 Query: 296 GENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTV 355 G +Q+ + L LNYR GVPL KQ+S VA+++SLRGSRYDN +R N+P + ++QRKTL V Sbjct: 309 GVSQDQVELALNYRLGVPLVKQISPEYVAQAKSLRGSRYDNIERKNVPVMAFKQRKTLQV 368 Query: 356 FLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDW 415 FLATPPW L+PGET+PL L+I++ I ++ WQGDTQ LSLTP +N GW+LI+P W Sbjct: 369 FLATPPWRLQPGETLPLVLEIKTTNKITRVSWQGDTQALSLTPSQNSNDPHGWSLIVPQW 428 Query: 416 QNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + A+N + LSV +ED++ Q V+SN I L + P S E P Sbjct: 429 DDSPDAANRYHLSVTLEDDKQQLVTSNWIQLQVTPPLTVSSEIEQGLPP 477 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 443 bits (1139), Expect = e-123, Method: Composition-based stats. Identities = 127/462 (27%), Positives = 206/462 (44%), Gaps = 18/462 (3%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 + + + + +++ A + T A +D P E+ A Sbjct: 26 ISKSVVWANIVIQAIFPLSIAFT-PAVMAAETVGASDEKP-----RSASQAEQSTANAAT 79 Query: 65 DFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 ND+ A R + N+ ++ W + +G+A V + +D + Sbjct: 80 RLASILTNDDSAK-------QASSIARGTAANAGNEALQKWFNQFGSAKVQLNLDEKLSL 132 Query: 125 TGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLD 184 GS+ +PL D+ LT++QLG D+ + NVG+GQR ++GYN F D+ Sbjct: 133 KGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQMLGYNLFVDHDAS 192 Query: 185 ENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQH 242 + R G GAE +++ L+AN Y + W ++++A G+DL + +P Sbjct: 193 YSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSEAYLPTLPQ 252 Query: 243 LNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNL 302 L + EQYFGD V LF NP+A++LG+NYTP+PL TV HK G +G N Sbjct: 253 LGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPLFTVGVDHKMGRAGMNDTRF 312 Query: 303 GLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPW 362 L NY FG PL QL + VA +SL GSRY+ RNN ++YR++ +T+ L Sbjct: 313 NLGFNYAFGTPLAHQLDSDAVAIKRSLMGSRYNLVDRNNQIVMKYRKQNRVTLELPARV- 371 Query: 363 DLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGAS 422 +T+PL ++ GI ++ W + L+L G S W + +P + +G + Sbjct: 372 SGAARQTMPLVANATAQQGIDRIEW--EASALTLAGGKITGSGNNWQITLPSYLSGGEGN 429 Query: 423 NHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 N +R+S + D G L + + L P Sbjct: 430 NTYRISAIAYDTLGNASPVAYSDLVVDSHGVNTNASGLTAAP 471 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 442 bits (1137), Expect = e-122, Method: Composition-based stats. Identities = 125/463 (26%), Positives = 216/463 (46%), Gaps = 28/463 (6%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 V +++ + + L A A T P ++ + A E + + + + Sbjct: 16 VNKVVAWSTIALQALYPALLSFT-------PTISHASAVKASQAAAEQQEL-RGLSSLAA 67 Query: 65 DFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 G + N A + + Q + V WL +GNA + + VD+ Sbjct: 68 QAGRSIENG-----------HAGSFAANTVPAQATKEVVEWLQKYGNARIQLNVDDAFSL 116 Query: 125 TGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR-GNWLVGYNTFYDNLL 183 S + P D +++ +SQ L + D+ +N+G+G R+ N ++G N FYD L Sbjct: 117 KDSAFDFLYPWIDKKQHVLFSQTSLHRTDDRTQTNIGMGYRYFTADNSMLGANLFYDYDL 176 Query: 184 DENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQ 241 + R G G E W +YLR AN Y + W + ++R A G+D+ + +P Y Sbjct: 177 SRHHARMGAGVEYWRDYLRAGANAYLRLSKWKDSHDLDDYQERPADGWDIYTQGWLPSYP 236 Query: 242 HLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNN 301 L S+ E+Y+G V LF S NP A + G++YTPVPLVT++A+HKQG+S + + Sbjct: 237 QLGASLKYEKYYGKNVGLFGSDHLQENPYAFTGGISYTPVPLVTLSAEHKQGQSNTHDSR 296 Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPP 361 G+ +NYR G+PL KQL + VA + ++ RYD +RNN LEYR++ L + L Sbjct: 297 FGIEINYRPGIPLAKQLDSDNVALMREVQHGRYDFVERNNNIVLEYRKKSVLKIRLPESV 356 Query: 362 WDLKPGETVPLKLQI-RSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG 420 + G +P+ + + +S +GI+ + W D+ + + S W L +P + G Sbjct: 357 -QGEGGAVIPVTISLDKSHWGIQSVEWN-DSAFTAAGGRI-SGSGTSWQLTLPAY--TPG 411 Query: 421 ASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 +NHW++ D +G + + +T+ ++ + Sbjct: 412 GTNHWQIGATARDVKGNVSNYAVMNVTVTGSSASVGTMDFTLN 454 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 437 bits (1124), Expect = e-121, Method: Composition-based stats. Identities = 110/375 (29%), Positives = 181/375 (48%), Gaps = 4/375 (1%) Query: 86 ALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWS 145 +S + NQ +E WL+ +G+A V + D S +PL + L ++ Sbjct: 115 VTQYAISQISSKSNQKIEQWLNQFGHARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFA 174 Query: 146 QLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSA 205 Q ++D N G+G R+ ++VG N FYD+ L + R G GAE W +Y +LS+ Sbjct: 175 QTNYHRKDLRSQFNYGIGYRYFTEKFMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSS 234 Query: 206 NFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 N Y ++W +R A G+D+ P Y L T + EQY+G V LF Sbjct: 235 NHYHRLSSWRASNNILDYSERPANGWDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKD 294 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV 323 NP +LG+NYTP+PLVT+ A+ + G NNL +NL+YR G L QL+ V Sbjct: 295 KRDKNPHTYTLGINYTPIPLVTLNAERRIGLHDRADNNLNINLSYRIGESLASQLNPDNV 354 Query: 324 AESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIR 383 ++L GSRYD RNN LEY++ + + + + E L++Q++++Y + Sbjct: 355 KAIRTLAGSRYDFVNRNNDMILEYKKETLVFLSMVDS-INGYAKEERDLQVQVKTKYPLA 413 Query: 384 QLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNE 443 + W +++ + + + +T+I+P +Q G N + +S V D G R + + Sbjct: 414 NIEWSA-SKLNAQGGQIKHHGGTHYTVILPQYQIGAIEKNSYIISAVAIDTHGNRSAPVQ 472 Query: 444 ITLTLVEPFDALSND 458 T+ + + N Sbjct: 473 TTVIVDKSLINTRNS 487 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 437 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 122/457 (26%), Positives = 199/457 (43%), Gaps = 19/457 (4%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAP--ENHDGEKHFAEI 62 V + + ++ A ++ A +N LP LG + ++ E AE Sbjct: 8 VKQKLANGFVIFTAIWSSAIMPVIPAYAK---MLDNKELPSLGSDQIIDENNTEHLAAEY 64 Query: 63 VKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEG 122 K G + Q A R+ +S + + +E WLS GN +++ D + Sbjct: 65 TKTVGTFLSQKKTMKDLSQI---AQDYARNKVSSEATKEIEHWLSKAGNVKLNIDFDKKF 121 Query: 123 HFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNL 182 S+ W +P D + L ++Q L + D +N G+G R+ +G N F D+ Sbjct: 122 SIKNSQFDWLIPWYDQEDILLFTQHTLHRYDERFHTNNGIGLRYFHEKSTIGMNAFIDHD 181 Query: 183 LDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ---TATQEQRMARGYDLTARMRMPF 239 L R G G E W +YL+L+AN Y +W + A G+D+ +P Sbjct: 182 LSHAHTRVGLGVEYWQDYLKLNANSYFGLTSWKSASELNHDFNAKPAHGWDIQVEGWLPN 241 Query: 240 YQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQ 299 Y HL ++ EQY+GD V LF NP A ++G N+TP PL T+ A HK G + + Sbjct: 242 YPHLGGNLRYEQYYGDSVALFGKTKRQKNPNAATIGANWTPFPLFTLNASHKLGSEKQVE 301 Query: 300 NNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLAT 359 L + FG L L +VAE++ L G+RYD +RNN L Y+++ L + L + Sbjct: 302 TQAKLQFTWTFGKNLAHHLDPTKVAETRRLSGNRYDFVERNNNIILNYQKKTVLHLSLPS 361 Query: 360 PPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGE 419 G++VPL S+Y ++ + WQ + G+ ++ + TL +P +Q Sbjct: 362 K-IQGITGQSVPLVKSFTSKYPLKHIEWQAPEFL--AVGGSISSDDQTATLTLPSYQTSN 418 Query: 420 GAS-----NHWRLSVVVEDNQGQRVSSNEITLTLVEP 451 A N +RL + D +G E + + Sbjct: 419 AAKDVQRINRYRLRAIAYDIKGNVSPVAETLIEITHS 455 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 436 bits (1122), Expect = e-121, Method: Composition-based stats. Identities = 114/434 (26%), Positives = 196/434 (45%), Gaps = 12/434 (2%) Query: 28 FEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFAL 87 A + P+ P + E VK + + A Sbjct: 119 VPIIAEQGATKVSVVTPNEVNCPVGIENNPQTKEYVKRVSALLASSDPT-------TVAT 171 Query: 88 GKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQL 147 VR +S N+ ++ WL +G A V + VD++ S W D+ + ++QL Sbjct: 172 DVVRSEVSSTANKEIQKWLGQYGTAQVRLNVDDKFSLRESSLDWLFSFYDSSSAIIFTQL 231 Query: 148 GLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANF 207 G+ +D+ +N+G+G R + GNW++G NTFYDN L R GFGAEAW +YL+LSAN Sbjct: 232 GIRNKDHRNTANLGLGGRISMGNWILGANTFYDNDLTGINSRLGFGAEAWTDYLQLSANS 291 Query: 208 YQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTG 265 Y WH+ ++R A G+D+ +P L + EQY GD V LF Sbjct: 292 YMRLNNWHQSRDFIDHDERPANGFDIRTNAWLPVLPQLGGKLMYEQYSGDSVALFGKDKL 351 Query: 266 YHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAE 325 NP A++ G+ YTP PL+T ++G++G++ + L+Y G VA Sbjct: 352 QKNPYAVTAGITYTPFPLLTFGIDERRGKAGKSDTQFNIQLSYHLGESWLSLTDPSAVAG 411 Query: 326 SQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQL 385 ++ L +RY+ RNN LEY+++ L + +T G+ + +I S++ + ++ Sbjct: 412 TRQLAEARYNLVDRNNNIVLEYQKQDILNIT-STEQLRGYSGDNGIILTKIVSKHNVERV 470 Query: 386 IWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEIT 445 W + +L+ + + P +Q +N + + VV D++G R + + Sbjct: 471 EWINISALLAAGGNSVELPGRKLAITYPPYQI--DGNNTYHVDVVAYDSRGNRSNISTTA 528 Query: 446 LTLVEPFDALSNDE 459 +T+++ + S Sbjct: 529 ITVLQKENTPSTVN 542 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 433 bits (1114), Expect = e-120, Method: Composition-based stats. Identities = 226/476 (47%), Positives = 291/476 (61%), Gaps = 13/476 (2%) Query: 1 MSRFVPRIIPFYLL---LLVAGGTANAQSTFEQKA---------ANPFDNNNDGLPDLGM 48 MS+F + LL L+V G T N F ++A F LP+LG Sbjct: 40 MSQFYRYLTLSCLLPAVLVVGGFTLNDALAFTEQARVDDAPFADPARFAKMQQQLPELGT 99 Query: 49 APENHDGEKHFAEIVKDFGETSMN-DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLS 107 +N K AE K GE SMN D+ E+A + + RDA Q+ E LS Sbjct: 100 VHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVFNRFRDAAKQRAASEGEQLLS 159 Query: 108 PWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA 167 P+G ASV + + ++G F GS P QDN YLT+SQLG+ Q + G V N G+GQRW Sbjct: 160 PYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQLGIEQSEYGSVGNAGLGQRWI 219 Query: 168 RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMAR 227 G+W VGYN F D+LL + QR GAEAWG+YLR SAN+YQP + + + RMAR Sbjct: 220 AGSWRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSANYYQPLSGCRNHSNSALMRMAR 279 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 GYD+T R +PFY+ L ++S EQY G+ VDLFNSG NP A+SLG+NYTPVPL T++ Sbjct: 280 GYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGNAVANPAAVSLGINYTPVPLFTLS 339 Query: 288 AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 A HK+G+ GE+Q+ L +NYR GV L +QLSA VA +QSL GSRYD RNN P + + Sbjct: 340 ASHKEGDGGESQDKFALKMNYRLGVALSQQLSADNVAAAQSLSGSRYDGVNRNNSPVMAF 399 Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 RQ KTL+VFLATPPW L+PGET+PLKLQI I+ + WQGDTQ LSLTP +G Sbjct: 400 RQLKTLSVFLATPPWQLQPGETLPLKLQIAHSNAIKAVSWQGDTQALSLTPPPNNVDPQG 459 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 W++IMP W + +GA+N W LSV +ED++ QRV+SN ITL L P + D + Sbjct: 460 WSIIMPAWNSQQGANNSWHLSVTLEDSKHQRVTSNWITLKLSPPMTLQAADRGNFS 515 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 432 bits (1112), Expect = e-120, Method: Composition-based stats. Identities = 116/425 (27%), Positives = 196/425 (46%), Gaps = 14/425 (3%) Query: 43 LPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHV 102 +P+ + A + G + +++ + + + L + V Sbjct: 108 IPNQEEEQQATQQASMVASHLSQVGNSLSSEDRVGAFSRL-------AKGMLLSSTAKTV 160 Query: 103 ESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGV 162 E WL G A V ++ D++ F+GS F+PL D L +SQ G + D + N+G+ Sbjct: 161 EEWLGHIGQAQVKLQADDKNDFSGSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGL 220 Query: 163 GQRWARGNWLVGYNTFYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--A 219 GQR +W+ GYN F+D + N +R GFG E +Y++LSAN Y W T Sbjct: 221 GQRHYVSDWMFGYNIFFDQQISGNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLE 280 Query: 220 TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYT 279 ++R A GYD+ +P Y L + EQYFGD V LF NP AL+ G++YT Sbjct: 281 DYDERAANGYDIRTEAYLPHYPQLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYT 340 Query: 280 PVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQR 339 P+PLV++ H G G+ + + + +NY P +KQ+ V +++L GSR D R Sbjct: 341 PIPLVSLGLDHTIGNGGKKKTGVNVAVNYEINTPWQKQIDPAAVQATRTLAGSRMDLVDR 400 Query: 340 NNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPG 399 NN LEYR+++ +T+ L K +P+ +R+G+ ++ W ++ Sbjct: 401 NNNIVLEYRKQQVVTLNLPEK-ISGKEALVLPINYTFNARHGLDRIEWDA-ADVIQAGGQ 458 Query: 400 AQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 + + + +P + +GA+N + LS D +G +S+ + + + N Sbjct: 459 VSSQGNLAYHVALPPY--IDGAANAYVLSGRAVDKKGNYSTSSSTNIYVTGVNISSVNSV 516 Query: 460 LRWEP 464 P Sbjct: 517 SSLTP 521 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 432 bits (1111), Expect = e-119, Method: Composition-based stats. Identities = 111/465 (23%), Positives = 203/465 (43%), Gaps = 24/465 (5%) Query: 9 IPFYLLLLVAGGTANAQSTFEQKAANPFD---NNNDGLPDLGMAPENHDGEKHFAEIVKD 65 IP L + + +S ++ + D +NN + + + + E + A K Sbjct: 84 IPLSKLYKLNQFRSFHKSFYDLSGGDEIDIPASNNYSFENRPLDTKVDNNENYSANKTKA 143 Query: 66 FGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFT 125 S ++ + ALG S N ++ WLS WG + D++ Sbjct: 144 AVNVSESNKSPE--------ALGVASSMASSAANNAIQKWLSQWGTVESQLSFDSKASLK 195 Query: 126 GSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDE 185 S W +P+ D D + Q G +D+ N+G G R W+ G N F+D + Sbjct: 196 NSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLGWGVRHVYNGWMYGLNNFFDYDITG 255 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHL 243 N +R G G EA +YL +++N Y WH+ ++R A G+D+ +P Y + Sbjct: 256 NNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDFYDYDERPANGFDMRVNGWLPAYPQI 315 Query: 244 NTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLG 303 + EQY+GD V LF +P A++ G+++TP PL+++ HK G++G++ ++ Sbjct: 316 GGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSWTPFPLLSLGVDHKIGQAGKHDTSVN 375 Query: 304 LNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWD 363 L L +R L QL VA S+ L SRYD RNN LEYR+++ +++ L+ + Sbjct: 376 LQLTWRPSDSLSSQLMPDNVAASRLLSKSRYDLVDRNNNIVLEYRKQQLISLKLSHGEIN 435 Query: 364 LKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDW-------- 415 G + + + ++ G+ + W ++ +A + + +P + Sbjct: 436 APGGTSHTIIATVAAKSGLSDITWNA-ANFIAAGGKIKAIDKTVFAITLPPYINQGSDRK 494 Query: 416 --QNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSND 458 ++G N + L V + + G E+ + ++ P + D Sbjct: 495 TQKSGAQGGNAYTLIAVAQSDDGSISEPKELHVNVLPPNINFNGD 539 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 125/460 (27%), Positives = 213/460 (46%), Gaps = 23/460 (5%) Query: 3 RFVPRIIPFYLLLLVAGGTANAQS--TFEQKAA-NPFDNNNDGLPDLGMA----PENHDG 55 R R+ ++ L++ +F AA NP N ++ E + Sbjct: 2 RIYLRLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEA 61 Query: 56 EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVD 115 K+ A I+ G DN D + + S V ++ WL +G A V+ Sbjct: 62 GKNLAAILSSTGSMLSQDNKTDA-------LINSAINNGSAYVTGQIQQWLQQFGTAKVN 114 Query: 116 VKVDNEGHFTG-SRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVG 174 + +D + S D + L ++Q G + D+ + NVG+G R+ W+ G Sbjct: 115 LGLDKDLSLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFADRWMWG 174 Query: 175 YNTFYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDL 231 NTFYD + +N +R G G E Y +LSAN Y+ + W + + ++R+A GYD+ Sbjct: 175 INTFYDRQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDI 234 Query: 232 TARMRMPFYQHLNTSVSLEQYFGDRVDLFNS--GTGYHNPVALSLGLNYTPVPLVTVTAQ 289 A +P + L + EQY+GD V LF+ NP A++ G+NYTP PLV++ Sbjct: 235 RAEGYLPAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLVSIGLN 294 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 K G+ N + L +N+ G LK QL + V ++L GSR D RNN LEYR+ Sbjct: 295 QKMGKGNHNDTQIDLAVNWMLGSSLKSQLDSDAVKARRTLLGSRLDLINRNNNIVLEYRK 354 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 + +++ + ET+P+ + ++S+Y + + W+ D L G + + W+ Sbjct: 355 QDLISLKVQNKV-TGTESETLPVSVNVKSKYPLDHISWEDDN--LVKNGGKISENNGSWS 411 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 + +P +Q G N + +S DNQG + +++ +T+ + Sbjct: 412 VTLPHYQQNSGEKNLYVVSATAWDNQGNKSNASHMTVEVS 451 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 114/417 (27%), Positives = 188/417 (45%), Gaps = 18/417 (4%) Query: 53 HDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNA 112 E+ A+ + G N D A + S Q V WL+ +G A Sbjct: 37 QADEQSVAQTAMEAGRVLQGSNSGDA-------ARQMLTSQASGQAADAVTQWLNQFGTA 89 Query: 113 SVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-W 171 + V ++ GS +P + + + ++QLG+ D +N G+G R+ N W Sbjct: 90 KTQLSVVSDFSLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGW 149 Query: 172 LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGY 229 ++GYN FYD +R G G EAW +YL+LSAN Y+ + W + ++R A G+ Sbjct: 150 MLGYNVFYDVDWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGW 209 Query: 230 DLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQ 289 D+ A +P Y L + EQY+G+ V LF NP A++ G+ +TP L+T Sbjct: 210 DIRAEGWLPAYPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLLTAGVD 269 Query: 290 HKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 +++G++G + L L L YR G PL QL + V +SL +R + RNN LEYR+ Sbjct: 270 YRRGKNGADDTRLNLGLTYRIGEPLAHQLDSSRVGAQRSLAANRLELVNRNNDVVLEYRK 329 Query: 350 RKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWT 409 + +T+ L + TV L Q+ ++YG+ ++ D ++ +N+ T Sbjct: 330 QTLITLQLPPDVY-GAELTTVTLTPQVNAKYGLSRIE-LDDAELRQAGGKIISNTGNQIT 387 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV---EPFDALSNDELRWE 463 L +P W + + LS D +G + + + A+S D+ Sbjct: 388 LQLPAWSSDRQSV---TLSGRARDTRGNLSDIARTRILVSPAVQQQLAVSTDKTTAT 441 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 115/416 (27%), Positives = 187/416 (44%), Gaps = 7/416 (1%) Query: 52 NHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGN 111 N + E+ + + E + L + VE WL G Sbjct: 71 NREEEQKATQQASLVASHLSQIGSTLSSESRVEAFSRLAKGVLLSSTAKSVEEWLGHIGK 130 Query: 112 ASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNW 171 A V ++VD++ F+GS FVPL + L +SQ G + D + N+G+GQR +W Sbjct: 131 AQVKLQVDDKNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYLSDW 190 Query: 172 LVGYNTFYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARG 228 ++GYN F D + N +R G G E +Y++LSAN Y W T ++R A G Sbjct: 191 MLGYNVFLDQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERAASG 250 Query: 229 YDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTA 288 YD+ +P+Y L + EQYFG+ V LF NP AL+ ++YTP PLV + Sbjct: 251 YDIRTEAYLPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPLVNLAL 310 Query: 289 QHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYR 348 +H G SG+N+ + L +NY P +KQ+ V +++L GSR D RNN LEYR Sbjct: 311 EHTIGNSGKNKTGVNLAVNYEINTPWQKQIDPAAVKATRTLAGSRMDLVDRNNNIVLEYR 370 Query: 349 QRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGW 408 +++ +T+ L K + +P+ +R+G+ ++ W +++ + Sbjct: 371 KQQVVTLNLPAKV-SGKEKQVLPINYTFNARHGLDRIEWDA-ADVINAGGNISDQGNLAY 428 Query: 409 TLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + P + +G N + L+ D +G SN + + + P Sbjct: 429 HITFPPY--IDGGDNAYVLAGRAVDKKGNYSVSNSTNIYVTGVNINSVKSTITLTP 482 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 430 bits (1105), Expect = e-119, Method: Composition-based stats. Identities = 109/406 (26%), Positives = 196/406 (48%), Gaps = 15/406 (3%) Query: 57 KHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDV 116 + A I D G NDN + ++ +VN H++SW +G A + + Sbjct: 133 QQIASIATDVGNILSNDNISKNSALL---------NKITNKVNSHIQSWFENFGTAHIQL 183 Query: 117 KVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYN 176 +VD S+ P+ ++D L +SQ G++ D+ +SN+G+G R NW++G N Sbjct: 184 QVDKNFSLKNSQLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDNWMLGGN 243 Query: 177 TFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTAR 234 +F D L + R G G E W + L+L AN Y + W + E+R A G DL + Sbjct: 244 SFIDYDLRKEHSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNIK 303 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 +P Y + + E+Y+GD V LF NP + +LG++YTP PL++ A+HK G Sbjct: 304 SWLPSYPQIGGDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLMSFKAEHKMGS 363 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 + N + +G +NY+ P + Q++ + + L G RYD +RNN L+YR+++ + Sbjct: 364 NNINDSRIGFEINYQIHTPWESQINPVLIPAMRKLAGQRYDLVERNNNIILDYRKKEIIK 423 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPD 414 + GE L +++ S+Y + ++ W +T ++ +++I+PD Sbjct: 424 ID-GVDVISGFSGEKKRLDIRVNSKYPVDRIDWLANT-FIANGGKIINEGLHNYSIILPD 481 Query: 415 WQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDEL 460 ++N E +N + + + D +G + I + ++ + L Sbjct: 482 YRNQE--NNSYTIDLSAIDIKGHTSNRKTIKIDVLYMDIDPTISSL 525 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 115/406 (28%), Positives = 195/406 (48%), Gaps = 16/406 (3%) Query: 68 ETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGS 127 E T A R ++ NQ V+ WL+ +G V+V D + S Sbjct: 51 EAFNKIISTGTSLAVSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKES 110 Query: 128 RGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENL 187 W +P D+ Y+ +SQLG+ +D+ N+G G R + +W+ G+NT YDN + + Sbjct: 111 SLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQSWMYGFNTSYDNDMTGHN 170 Query: 188 QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNT 245 R G GAEAW +YL+LSAN Y WH+ +R A G D+ + +P L Sbjct: 171 HRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQLGG 230 Query: 246 SVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLN 305 + EQY G+RV LF NP A++ GL YTP+P +T+ + G+S +++ L Sbjct: 231 KLKYEQYRGERVALFGKDNLQSNPYAVTTGLIYTPIPFITLGVDQRMGKSRQHEIQWNLQ 290 Query: 306 LNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLK 365 ++YR G + Q S VA ++ L SRY+ +RN LEY+++ T+ + + Sbjct: 291 MDYRLGESFRSQFSPAVVAGTRLLAESRYNLVERNPNIVLEYQKQNTIKLAFSPAVLSGL 350 Query: 366 PGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ--------- 416 PG+ + QI+S+ +++++W D Q ++ SA + +++P ++ Sbjct: 351 PGQVYSVSAQIQSQSALQRILWN-DAQWVAAGGKLIPVSATDYNVVLPPYKPMAPASRTV 409 Query: 417 ----NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSND 458 E A N + LS DN G + +T+ + +P ++++ Sbjct: 410 GKTGESEAAVNTYTLSATAIDNHGNSSNPATLTVIVQQPQFVITSE 455 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 429 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 147/417 (35%), Positives = 229/417 (54%), Gaps = 11/417 (2%) Query: 42 GLPDLGMAPENHDG----EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQ 97 LPD+ + E ++ FA+ K+ G N D +A++ ++ + Sbjct: 19 KLPDMAIMAETSGAKPISDQQFADWGKNLGGQDWNTLNRD---KAQSKTTQWAKEKIISP 75 Query: 98 VNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLV 157 + Q + L +G A V++ +DN+G+ S S F P D+++YL +SQ+ + QDN + Sbjct: 76 LQQQAQDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYLLFSQINIHHQDNRKI 135 Query: 158 SNVGVGQRWARGNW--LVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH 215 N G+G R + L+GYN F D+ RAG GAEA +YL+ SAN+Y P + W Sbjct: 136 GNFGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADYLKFSANYYHPLSHWK 195 Query: 216 EQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALS 273 + +R A+GYDL ++ +P Y L S E YFGD V LF +P AL+ Sbjct: 196 DSPDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVALFGKSHRQKDPRALT 255 Query: 274 LGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSR 333 LG++YTPVPLVT+ A+HK G+ G+ + + Y+FG PL QL V + +SL+GSR Sbjct: 256 LGIDYTPVPLVTLGAKHKYGQQGKKDTQIDVAFRYQFGSPLSAQLDPDNVNQLRSLKGSR 315 Query: 334 YDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQI 393 YD RNN LEY++++ L LA P L GE+ L+ ++S+Y I LIW GD Sbjct: 316 YDLVDRNNDIVLEYKEKQVLFADLAAVPDSLMEGESYILRPLVKSKYPIIDLIWLGDLLP 375 Query: 394 LSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVE 450 L L A +++ +GW + +P W + GASN ++L++ +ED + RV++N I + + + Sbjct: 376 LQLLATAGSHNPQGWQITLPAWSSVAGASNRYQLALSLEDQKNHRVTTNTIEIQVGQ 432 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 133/462 (28%), Positives = 202/462 (43%), Gaps = 31/462 (6%) Query: 10 PFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHD-------GEKHFAEI 62 PF+ L + N ST A + LPDLG + D E + A Sbjct: 79 PFFAPSLPSEAPLNG-STTPLFAPEETSKSITELPDLGSIQNDIDVNNKLPVTEDNVASA 137 Query: 63 VKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEG 122 NDN E A V + +Q WL +GNA V + ++ G Sbjct: 138 ATQLWGIMGNDNSSRAAESA-------VTGVAAGLASQAAADWLGQYGNARVQLNSNSIG 190 Query: 123 HFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNL 182 + +PL + L + QLG+ +NVG+G R +W+ G NTFYD Sbjct: 191 N-----ADVLIPLTETQNNLLFGQLGVRYNGERTTNNVGLGVRSFTDSWMFGVNTFYDYD 245 Query: 183 LDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ----TATQEQRMARGYDLTARMRMP 238 L R G G EAW + L+ SAN Y WH+ +R A G+D+ A +P Sbjct: 246 LTGKNSRLGVGGEAWTDNLKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYLP 305 Query: 239 FYQHLNTSVSLEQYFGDRVDLFNS----GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 Y L + E+YFG V L + +P A ++GLNYTP+PL TV HK+G+ Sbjct: 306 SYPQLGGRLMYEKYFGKGVALNSGSTSPDDLGDSPSAFTVGLNYTPIPLFTVDVAHKKGQ 365 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 + N+ LGLN NYRFGVP Q++ V +SL GSRYD RN ++Y ++ + Sbjct: 366 NTNNELQLGLNFNYRFGVPWVDQINKNAVGLMRSLMGSRYDIVDRNYNIVMQYEKQDLIK 425 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPD 414 + L + L I ++YG ++ W +++ + E ++ +P Sbjct: 426 LTLP-ETLAAYAITNLSLTGNITAKYGAERMEWSAPA-LMAAGGSIIPLTMESASVTLPP 483 Query: 415 WQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 +Q +N +++S V D +G R ++ TL + E +S Sbjct: 484 YQ-QVQTANSYQISAVAYDVRGNRSNTATTTLVVQESPQQIS 524 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 423 bits (1088), Expect = e-117, Method: Composition-based stats. Identities = 112/423 (26%), Positives = 191/423 (45%), Gaps = 14/423 (3%) Query: 34 NPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDA 93 N NN G + D ++ G +D R Sbjct: 121 NRASQNNKNNAGAGSLTKEQDPMDSL--SIRGVGSALAASGRVDA-------LHHMARTM 171 Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 + VN + WL+ +G A + + D + S W +PL D+ ++Q G +D Sbjct: 172 ATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLLPLYDSQTLTLFTQQGFRNKD 231 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAA 213 ++N+G+G R+ W++G N FYDN + +R G GAE W + +LSAN Y A Sbjct: 232 RRNIANIGIGTRFIHHEWMMGGNAFYDNDFTGDNKRVGLGAELWTDSFQLSANGYFRLTA 291 Query: 214 WHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 WH+ +R A G DL A +P HL S+ E YFGD V LF NP A Sbjct: 292 WHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYEHYFGDNVALFGKDHLQRNPYA 351 Query: 272 LSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 ++LG +YTP L+T+ + + G+ G LGL++NYR G L QL + ++++ Sbjct: 352 ITLGGSYTPFSLLTLEVKQRLGKQGNQDTQLGLHINYRLGADLPAQLDPAALVAARTIAK 411 Query: 332 SRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDT 391 +RYD +RN+ L+Y++++ L + +T + PG + + ++ S+YG+R L W Sbjct: 412 TRYDLVERNHNIVLQYQEQQRLKIK-STEYLEGYPGNSSEIYAEVVSKYGVRNLQWMNVA 470 Query: 392 QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEP 451 ++ + P + + N + + V+ D +GQ + + +++++P Sbjct: 471 AFVAAGGQIMELPNNRLKITYPPYN--DNGDNRYHIDVMAYDTRGQSSNISTTQISVLKP 528 Query: 452 FDA 454 Sbjct: 529 EMD 531 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 120/455 (26%), Positives = 210/455 (46%), Gaps = 27/455 (5%) Query: 12 YLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSM 71 L+ AGG A + + + + +N L A A+ G Sbjct: 129 SAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYA----------AQQAASLGSQLQ 178 Query: 72 NDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSW 131 + + +A Q + +++WL +G A V+++ N GS + Sbjct: 179 SRSLN------GDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD--GSSLDF 230 Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAG 191 +P D+++ L + Q+G D+ +N+G GQR+ ++GYN F D + R G Sbjct: 231 LLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLG 290 Query: 192 FGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSL 249 G E W +Y + S N Y + WHE ++R A G+D+ +P Y L + Sbjct: 291 IGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMY 350 Query: 250 EQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYR 309 EQY+GD V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+ Sbjct: 351 EQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQ 410 Query: 310 FGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGET 369 F P +Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T Sbjct: 411 FDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHD-INGTERST 469 Query: 370 VPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQA---NSAEGWTLIMPDWQNGEGASNHWR 426 ++L ++S+YG+ +++W D+ + S Q SA+ + I+P + +G SN ++ Sbjct: 470 QKIQLIVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAY--VQGGSNVYK 526 Query: 427 LSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELR 461 ++ D G ++ +T+T++ + + Sbjct: 527 VTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVT 561 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 421 bits (1082), Expect = e-116, Method: Composition-based stats. Identities = 124/378 (32%), Positives = 192/378 (50%), Gaps = 14/378 (3%) Query: 83 KAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY- 141 K A + + L+ Q+ + + WLS +G A +++ VD+ G S VP D+ + Sbjct: 128 KKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHW 187 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 L +SQ G +D+ N+G+G R +W+ G NTFYDN L N R G E W YL Sbjct: 188 LIYSQYGYRHKDSRDTVNLGIGTRLFINDWMYGANTFYDNDLTGNNSRFSLGGELWTNYL 247 Query: 202 RLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 ++SAN Y + WH +R A GYDL A M +P L + EQYFGD V L Sbjct: 248 KMSANAYFRLSDWHNSRDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQYFGDNVAL 307 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F + +P A ++G+NYTP+PL+T +K G+ G++ LN+NYRFGVPL +QLS Sbjct: 308 FGTNNRQKDPYAATIGVNYTPIPLITAGVDYKLGKEGKSDGIFSLNMNYRFGVPLSEQLS 367 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEY-RQRKTLTVFLATPPWDLKPGETVPLKLQIRS 378 V +SL GSRYD +RNN L Y +++K + + GE P+ QI+S Sbjct: 368 PENVGSLRSLAGSRYDLVERNNNIILNYLKKQKHFRLLVPVIEIIGYGGEIKPI--QIQS 425 Query: 379 RYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR 438 ++ +IW + + G+T+ +P++Q N + ++ +D+Q Sbjct: 426 DTPLKNIIWDMPELFQKNGGIIK--NTNGYTIQLPEYQ--PDGKNDYTITGTSKDDQ--- 478 Query: 439 VSSNEITLTLVEPFDALS 456 +I +++ +LS Sbjct: 479 -QRVQIQTHVLQRNISLS 495 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 420 bits (1079), Expect = e-116, Method: Composition-based stats. Identities = 125/443 (28%), Positives = 208/443 (46%), Gaps = 29/443 (6%) Query: 32 AANPFDNNNDGLPDLGM-----APENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFA 86 P ++ LP LG+ A + EK F + + N+N + A Sbjct: 81 NNKPSVDHRRALPTLGIKETSQAKQVESAEKQFVQGATQIAQGLANNNATEA-------A 133 Query: 87 LGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQ 146 + R+ +NQ + WL+ +G A V + + G +PL D L +SQ Sbjct: 134 INYARNRGEGLLNQKISDWLNQYGKARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQ 188 Query: 147 LGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSAN 206 +G+ + +N+G+G R + NW+ G N+FYD + R G G E W YL+L+ N Sbjct: 189 IGIRANEQRSTTNLGLGYRQYQQNWMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVN 248 Query: 207 FYQPFAAWHEQ----TATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNS 262 Y WH+ ++R A G+DL A +P Y HL EQYFGD V L ++ Sbjct: 249 GYFRLTDWHQSFLHEMRDYDERPANGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHN 308 Query: 263 ---GTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 NP A++ GL+YTP PL+T+ Q QG+S N + +G+ YRFG+PL QL+ Sbjct: 309 PTAKDLKDNPSAVTFGLSYTPFPLLTLKTQVSQGDS--NDSLIGMEFAYRFGIPLAAQLN 366 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIR-S 378 V +SL G+RYD RN ++YR+++ L + L + +T+ +K ++ + Sbjct: 367 PDNVDLMRSLAGNRYDFVDRNYNIVMQYRKQEILAISLPDSAMA-EAAQTIAIKATVQKA 425 Query: 379 RYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR 438 +YG+ +++W ++L+ S L++P + S + LS V DN+G + Sbjct: 426 KYGLNKILWSAP-ELLAKGGKINETSTTTIDLVLPAYDEDNQGSKAYTLSAVGVDNEGNK 484 Query: 439 VSSNEITLTLVEPFDALSNDELR 461 + + + + + D + L Sbjct: 485 SKAAVMVIHVTQSKDGFAYFTLE 507 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. Identities = 130/481 (27%), Positives = 213/481 (44%), Gaps = 49/481 (10%) Query: 15 LLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDN 74 ++ + A + N+ LPDLG D + + + +K+ G + ++ Sbjct: 7 CIILTFISGAAFAAPEI----NVKQNESLPDLGSQAAQQDEQTNKGKSLKERGADYVINS 62 Query: 75 GLDT-----GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRG 129 E K+ A ++ ++ ++E LSP+G ++ + G GS Sbjct: 63 ATQGFENLTPEALKSQARSYLQSQITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSI 122 Query: 130 SWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQR 189 +FVP DN + +SQ ++++ + N+G+G R+ +L+G N FYD +R Sbjct: 123 DYFVPWYDNQTTVYFSQFSAQRKEDRTIGNIGLGVRYNFDKYLLGGNIFYDYDFTRGHRR 182 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSV 247 G GAEAW +YL+ S N+Y P + W + E+R ARG+D+ A +P Y L + Sbjct: 183 LGLGAEAWTDYLKFSGNYYHPLSDWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKI 242 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 EQY+G+ V LF + + +P A++LG+ Y PVPL+ V K G ++ LN Sbjct: 243 VFEQYYGNEVALFGTDSLEKDPFAVTLGVKYQPVPLIVVGTDFKAGTGDNTDLSVNATLN 302 Query: 308 YRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFL---------- 357 Y+FGVPLK QL +V+ + SL GSR+D +RNN LEY+++ L V L Sbjct: 303 YQFGVPLKDQLDPDKVSAAHSLMGSRHDFVERNNFIVLEYKEKDPLYVTLWLKADVTNEH 362 Query: 358 ATPPWDLKPGETV-------PLKLQIRSRYGIRQLIWQGDTQILS--------------- 395 P E + + I Y I WQ S Sbjct: 363 PECVIKDTPEEAIGLEKCKWTINALINHHYKIVAASWQAKNNAASWQAKNNAARTLVMPV 422 Query: 396 -LTPGAQANSAEGWTLIMPDWQNGEGAS-----NHWRLSVVVEDNQGQRVSSNEITLTLV 449 + W L++P WQ + N WR+ + +ED +G R +S + +T+ Sbjct: 423 IKENTLTEGNNNHWNLVLPAWQYSSDQAEQEKLNTWRVRLALEDEKGNRQNSGVVEITVQ 482 Query: 450 E 450 + Sbjct: 483 Q 483 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 416 bits (1069), Expect = e-114, Method: Composition-based stats. Identities = 117/473 (24%), Positives = 205/473 (43%), Gaps = 19/473 (4%) Query: 2 SRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAP---ENHDGEKH 58 S+ P++ P LLL A + A + D LP LG E E Sbjct: 6 SKLKPKL-PNSLLLSTAIWSTAILPMVPSYAQ---IVHLDDLPTLGGQAIQFEGTQPEDS 61 Query: 59 FAEIVKDFGETSMN-DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVK 117 + ++G+ + N + + A R + + WLS GNA +++ Sbjct: 62 TERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEITHWLSKAGNARLNIN 121 Query: 118 VDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNT 177 +D + S+ W VP + L +SQ + + D L +N G+G R + N ++G N Sbjct: 122 LDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLRHFQQNSMIGVNA 181 Query: 178 FYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT---ATQEQRMARGYDLTAR 234 F+D+ L R GFG E +Y+R+SAN Y + W + R A G+D+ Sbjct: 182 FFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDYNARPANGWDIQLE 241 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 +P Y +L ++ LEQY+GD V LF +P+A ++G+N++P PL+ + A+HK G Sbjct: 242 GWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPFPLLAINAEHKIGN 301 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 SG N+ N + N+ G L + L VA ++ + +RYD RNN LEY+++ ++ Sbjct: 302 SGTNETNAKVAFNWLLGRSLAQHLDTSAVAATRHISTNRYDFINRNNNIVLEYQKKSLIS 361 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPD 414 + L GE + + + ++Y + +++ + L G + ++ +P Sbjct: 362 LSLP-KVIQGMTGEELSIIRNLTTKYPLEKIV--IEAPELIAAGGEIHLNGRESSVKLPS 418 Query: 415 WQ-----NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRW 462 ++ N +RL+V D G E + + + Sbjct: 419 YKIANHSFKNSQLNLYRLTVTAYDINGNVSPQAETLIEVTNTGALTITHKNTA 471 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 112/377 (29%), Positives = 193/377 (51%), Gaps = 9/377 (2%) Query: 86 ALGKVRDALSQQVNQHVESWLSPWG-NASVDVKVDNEGHFTGSRGSWFVPLQDN-DRYLT 143 +D L+ ++ E+WL+ +G ++ V + G +PL ++ + ++ Sbjct: 128 VRQFGQDQLNTLASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMI 187 Query: 144 WSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRL 203 +SQLG+ D+ N+G+G R+ W++G N F+DN + +R G GAE + LRL Sbjct: 188 FSQLGVRHADDRTTGNIGLGARYFGEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRL 247 Query: 204 SANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFN 261 +AN Y WH+ A ++R A G+D+ +P Y L V EQY+GD V L + Sbjct: 248 AANGYFKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALIS 307 Query: 262 SGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAG 321 G HNP A +LG+N+TP+PLV++ A H+ +GLNLN+ FG L LS Sbjct: 308 RGRLQHNPSAATLGVNWTPIPLVSIDAGHRMSMQRGEDTTVGLNLNWNFGRSLDWHLSPD 367 Query: 322 EVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG 381 V +SL GSRYD RNN ++YR++ +T LA T L + + +++G Sbjct: 368 AVETQRSLAGSRYDLVSRNNEIVMDYREQTVITFSLANA-IQGVESTTHSLGVSVWAKHG 426 Query: 382 IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSS 441 + +++W D +++ + A L++P ++ +GA N + LS + DN+G+ Sbjct: 427 LGKIVWD-DATLVNAGGKIVGSGANSV-LVLPAYK--DGADNRYTLSAIAYDNKGKASPR 482 Query: 442 NEITLTLVEPFDALSND 458 ++ +T+ + + + Sbjct: 483 AQVQITVEKAQQVVPDV 499 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 411 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 123/378 (32%), Positives = 188/378 (49%), Gaps = 12/378 (3%) Query: 83 KAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY- 141 K A + + L+ Q+ + + WLS +G A +++ VD+ G S VP D+ + Sbjct: 121 KKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHW 180 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 L +SQ G +D+ N+G+G R NW+ G NTFYDN L N R G E W YL Sbjct: 181 LVYSQYGYRHKDSRDTVNLGIGTRLFINNWMYGANTFYDNDLTGNNSRFSLGGELWTNYL 240 Query: 202 RLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDL 259 ++SAN Y + WH +R A GYDL A M +P L + EQYFGD V L Sbjct: 241 KMSANAYFRLSDWHNARDLVNYYERPANGYDLIADMYLPSMPSLGAKIKYEQYFGDNVAL 300 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 F +P A ++G+NYTP+PL+T +K G+ G++ N+NYRFGVPL +QLS Sbjct: 301 FGKNKRQKDPYAATIGVNYTPIPLITAGIDYKLGKEGKSDGIFSFNVNYRFGVPLSEQLS 360 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEY-RQRKTLTVFLATPPWDLKPGETVPLKLQIRS 378 V+ +SL GSRYD +RNN L Y ++++ + + GE P+ QI+S Sbjct: 361 PENVSSLRSLAGSRYDLVERNNNIILNYLKKQQHFRLLVPVIEISSYGGEVKPI--QIQS 418 Query: 379 RYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR 438 + + W S G+T+ +P++Q N + ++ +D+Q Sbjct: 419 DTPFKNVTWDIPELFQKNGGMINIESTHGYTIQLPEYQ--PDGKNDYTITGTSKDDQ--- 473 Query: 439 VSSNEITLTLVEPFDALS 456 +I +++ +LS Sbjct: 474 -LRVQIQAHVLQRNISLS 490 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 409 bits (1052), Expect = e-113, Method: Composition-based stats. Identities = 126/458 (27%), Positives = 212/458 (46%), Gaps = 32/458 (6%) Query: 23 NAQSTFEQ--KAANPFDNNNDGLPDLG---MAPENHDGEKHFAEIVKDFGETSMNDNGLD 77 Q +F + K + +N LP+LG + PE ++ E+ FA G+ +DN +D Sbjct: 68 GYQDSFPENIKNNDNVENITKYLPNLGSTKILPEENNNEEKFASSFTLMGDILSSDNFVD 127 Query: 78 TGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQD 137 + + VNQ + WL+ +G A + D G + +P+ D Sbjct: 128 NS-------INYAKSIGQGLVNQQINDWLNQYGKARISFSSD-----KNISGDFLLPVID 175 Query: 138 NDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAW 197 L ++QLGL + N+G+G R NW+ G NTFYD R G G EAW Sbjct: 176 EPNNLLFTQLGLRNNTDRNTINLGLGYRKYWRNWMFGINTFYDYDYTGGNARLGVGGEAW 235 Query: 198 GEYLRLSANFYQPFAAWHEQT----ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYF 253 +YL+L+ N Y WH+ ++R A G+D+ A +P Y L +S+ E+YF Sbjct: 236 IDYLKLAINGYFGLTDWHQSKISVMDDYDERPATGFDVRAEAYLPKYPQLGSSIKYEKYF 295 Query: 254 GDRVDL---FNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRF 310 G + L N + +L +GLNYTP+PL+T+ A+ G+ N + L++NYRF Sbjct: 296 GKGIHLGTGVNPEYLKDDAQSLIMGLNYTPIPLLTLKAERSIGD--RNDTKISLDVNYRF 353 Query: 311 GVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETV 370 GVPL +QL+ V +SL G++YD RN ++YR++ L +FL + +T Sbjct: 354 GVPLSQQLNPDAVDVMRSLVGNKYDFVDRNYDIVMQYRKQDLLNIFLP-REIVGEARDTH 412 Query: 371 PLKLQIR-SRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASN---HWR 426 + + + ++YG++ + W D +++ + S + P + + +N + Sbjct: 413 RINVTVNKTKYGLKNIKWIIDPKLIEDKGHFKQISQTEGIITFPIYNSLNEKNNLPAEYY 472 Query: 427 LSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 +S + DN G + + + + S D L P Sbjct: 473 ISAIGTDNNGNESNKATTIIRVNRSTNDFSGD-LTISP 509 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 409 bits (1050), Expect = e-112, Method: Composition-based stats. Identities = 117/404 (28%), Positives = 193/404 (47%), Gaps = 15/404 (3%) Query: 58 HFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVK 117 A K + + ND+ + + +K+ AL + + G ++ Sbjct: 69 KSAMSGKRWLQHQTNDDVMQGSDISKSGIADMGFAALQPETEKSA-------GEVRANLP 121 Query: 118 VDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNT 177 + ++G T F PL D D L + Q+G + D + N+G+GQR+ +G+W +GYNT Sbjct: 122 L-SDGKLTSGSIDLFYPLYDGDSRLFFGQVGARRFDGRNIVNLGIGQRYFQGDWALGYNT 180 Query: 178 FYDNLLDEN-LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTAR 234 FYD + N QR GFG E W +YL LSAN Y W+ + +R A GYD+ A+ Sbjct: 181 FYDIQISGNAHQRLGFGLEYWRDYLYLSANGYFGLTDWYSSSALDGYAERAANGYDIRAQ 240 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 P Y L+ + EQYFGD + L N Y NP AL++GL YTP+ L+++ Sbjct: 241 GWFPVYPQLSGKLKFEQYFGDDIALLNHQNRYKNPYALTMGLEYTPIQLISLGIDRTFSH 300 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLT 354 G++ + L+ NY+ GVPL +Q+ ++L +RY +RNN L++R+R L+ Sbjct: 301 RGKDDTKVNLSFNYQLGVPLSQQIDPTVAPVKRTLADNRYHLVERNNNIVLKHRERAQLS 360 Query: 355 VFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPD 414 ++L T GE + +Y ++ + W D + + A S + + P+ Sbjct: 361 LYLPTG-LSGFGGERKLINFSFNGKYRLKHIQWN-DGALRARGGRIIALSNNSYVVQFPN 418 Query: 415 WQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSND 458 + + SNH +S V D QG +S+E+ + + P + Sbjct: 419 YSRQQ--SNHITISAVAHDEQGNVSNSSEMGVLINVPVALSAPV 460 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 407 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 126/432 (29%), Positives = 213/432 (49%), Gaps = 8/432 (1%) Query: 24 AQSTFEQKAANPFDNNNDGLPDLG-MAPENHDGEKHFAEIVKDFGETSMND-NGLDTGEQ 81 LP LG E + + ++K N+ N + Sbjct: 113 HCYQGNSFVKKENIKIYHDLPTLGHNQNEQVNHDIDVYNMIKPLIHKDWNNINREKIKSE 172 Query: 82 AKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRY 141 AK + R+ L Q+V+++ +G +++ VDN+G F SR P N+ + Sbjct: 173 AKFYIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSH 232 Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRW--ARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGE 199 + +SQLG Q + + ++G+GQR+ +GYN F D LD+ +R G EA Sbjct: 233 VLFSQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASN 291 Query: 200 YLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRV 257 Y +LS N+Y P W + +R A G+D+ + +P Y L + EQYFG V Sbjct: 292 YFKLSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEV 351 Query: 258 DLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQ 317 LFN NP A+S+G++Y P PL ++ HK G++ + LGL LNY+FG PL Q Sbjct: 352 ALFNKTKRQKNPKAVSIGIDYRPFPLASIYVDHKLGQNHHRETKLGLTLNYQFGTPLSSQ 411 Query: 318 LSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIR 377 L + E+++L+ +R RN +EY++++ L+V L ++ G+ ++ I+ Sbjct: 412 LDPNNLNEARNLKQNRLAPVDRNYNIVMEYKEKQLLSVDLPAMDKNILEGDIYVIRPLIK 471 Query: 378 SRYGIRQLIWQGDT-QILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQG 436 ++Y I+ + W GD Q+ + A NS GW +I+P+W + + A N +RL++ +ED +G Sbjct: 472 NKYPIKTVSWLGDVSQLSLSSSSADKNSPVGWKIILPEWNSEKDAKNTYRLAIQIEDTKG 531 Query: 437 QRVSSNEITLTL 448 + SN + + + Sbjct: 532 HQAISNYMDIVV 543 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 129/477 (27%), Positives = 194/477 (40%), Gaps = 51/477 (10%) Query: 4 FVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIV 63 ++I + ++L F A AP E A Sbjct: 16 LKSKLIIWSQIVLQILFPLFTV--FPVHA----------------APATTTKETTVAMPY 57 Query: 64 KDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGH 123 T + T A + V+ WLS +G A V + VD+ G+ Sbjct: 58 SQELSTLASSTASGT-----DGAKSAATGMATSAAASSVQQWLSQFGTARVQLNVDDNGN 112 Query: 124 FTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQR-WARGNWLVGYNTFYDNL 182 + S PL DN + + ++QLGL D N+G+G R + NW+ G N F+D+ Sbjct: 113 WDDSAVDLLAPLYDNKKAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFGGNVFFDDD 172 Query: 183 LDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFY 240 +R GFGAEAW YL+LSAN Y WH ++ A GYD+ A +P Y Sbjct: 173 FTGKNRRVGFGAEAWTNYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIRAEGYLPAY 232 Query: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 L + EQY+GD+V LF++ NP A++ G++YTPVPLV + +K+G+ + Sbjct: 233 PQLGAKLMYEQYYGDKVALFDTDHLQSNPSAVTTGISYTPVPLVQLAVDYKRGQDSMDDT 292 Query: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRK-------TL 353 +N Y FG + Q+ V +SL GSRYD +RNN L+Y+++ TL Sbjct: 293 QFQVNFRYDFGHDWRYQIDPENVKAERSLAGSRYDLVERNNQIVLQYKKKDEQGVSKLTL 352 Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQ--LIW--QGDTQILSLTPGAQANSAEGWT 409 P D T+ + S +R + W GD ++ SLT A Sbjct: 353 QTVADNAPADGLTPNTLQVLATNSSNEPVRNASIAWSTSGDAKLDSLTAVTNAQGIAVVN 412 Query: 410 LIMPDWQNGEGASNHWRLSVVVEDNQGQRVS---SNEITLTLVEPFDALSNDELRWE 463 L N +V V G + S+ ++T+ AL D + Sbjct: 413 LT-----------NTSPATVQVTAKSGNVSAMQDSHFNSVTVSHLILALDKDGSVAD 458 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 102/368 (27%), Positives = 176/368 (47%), Gaps = 14/368 (3%) Query: 88 GKVRDALSQQVNQHVESWLSPWGNASVDVK--VDNEGHFTGSRGSWFVPLQDNDRYLTWS 145 +SQ +Q +E WL +GNA + + DN GS L + D L + Sbjct: 34 QYAAGKISQLTSQAIEGWLKQYGNARITLNAQSDNSTALAGSSADLLFGLHNQDSRLDYI 93 Query: 146 QLGLTQQD-NGLVSNVGVGQRWA-RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRL 203 Q QD ++ NVG+GQR+ ++GYN FYD ++ + R+G G E W +Y + Sbjct: 94 QFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNINSGVSRSGVGFELWRDYFKF 153 Query: 204 SANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFN 261 S N Y + W +++ A GYD+ +P Y L + EQYFGD V LF+ Sbjct: 154 SGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQLGGHLKYEQYFGDNVALFD 213 Query: 262 SGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAG 321 + +P A+++G++YTP+PL+T +K+G + ++ +NY GVP +Q+S+ Sbjct: 214 TNHLQTDPSAITVGMSYTPIPLITFALDYKKGNDSLDDTSISAAINYAIGVPWSQQISSD 273 Query: 322 EVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG 381 V +SL GSR+D RNN ++YR++ + + L T + + + +PL + ++ G Sbjct: 274 YVQTRRSLVGSRFDFVSRNNDIVMQYRKQDVIKLILPT-QLNGQATQQLPLVATVEAKNG 332 Query: 382 IRQLIWQGDTQILSLTPGAQANS-AEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVS 440 + + W + +L S A +T+ +P + + L+ DN + Sbjct: 333 LDHIQWDSSSSLLQAGGTVIPGSDATHFTVSLPA------TAGQYVLNGTAYDNHHNASN 386 Query: 441 SNEITLTL 448 S + + Sbjct: 387 SAQTRFIV 394 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 117/476 (24%), Positives = 205/476 (43%), Gaps = 33/476 (6%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 ++R++ ++ LL A +A P D + + + A Sbjct: 8 LARWLAWVLVGTQLLTPA---------ALAQAMLPEIT--RSGADSSVDKTDQPEAEWLA 56 Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 G N + AK + + + ++ WL P + +++ Sbjct: 57 SRASSLGSLLQEGN---ISDFAKNQIQALPQTIANDGITSGIKHWL-PEAQFRGGITLED 112 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN-----GLVSNVGVGQRWARGNWLVGY 175 + + +PL + + + QLGL DN N G+G R G+WL+G Sbjct: 113 ASKYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVNTGIGWRQDVGDWLLGI 172 Query: 176 NTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTA 233 N+F D + + R G E + + + L+ N+Y P + W ++R A G D+ Sbjct: 173 NSFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKVQPLHDERPATGIDVRL 232 Query: 234 RMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQG 293 + +P ++ EQYFGD+VD+ + + +P A + + + PVPLV + A +K Sbjct: 233 KGALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAITWKPVPLVEIKAGYKDA 292 Query: 294 ESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTL 353 S +Q GLNLNY FGVPL+ QL +V + S +R RN +EYR++ + Sbjct: 293 GSSGSQTEAGLNLNYTFGVPLRAQLDPSQVRPA-SNTTNRTAFVDRNYNIVMEYREQAS- 350 Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP 413 + + P + + G+TV L I SRY + ++ W GD +++ G Q L +P Sbjct: 351 RIRVYASPVNGQSGDTVTLSATINSRYPVERIEWTGDAELI---GGLQQQGNVNSGLRLP 407 Query: 414 DWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV------EPFDALSNDELRWE 463 D + + L + V D++G V+S I +T+ P+ + +DE+R E Sbjct: 408 DLSLDVTENKEYSLYLKVTDSRGNSVTSERIPVTVSINPESFTPYLNVLHDEVRRE 463 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 372 bits (956), Expect = e-101, Method: Composition-based stats. Identities = 113/413 (27%), Positives = 187/413 (45%), Gaps = 13/413 (3%) Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 + GE EQ + ++ ++ N + S L G A V +D+ Sbjct: 159 NSIAASGEQVPTSASRYGSEQEVQYWRQQLATQFEEEANAYAASLLGAMGTARTRVTLDD 218 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQ-DNGLVSNVGVGQRWARGNWLVGYNTFY 179 + + + +PL + + L ++Q GL + + ++N+GVGQR W++GYN F Sbjct: 219 DFNMVTAEADLLLPLAEEQQTLLFTQFGLRRNGQDRTIANLGVGQRHFLDRWMLGYNLFA 278 Query: 180 DNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQ--EQRMARGYDLTARMRM 237 D L RAG GAEAW +YL+L ANFY P ++W + + E+R ARG D+ + Sbjct: 279 DYDLTNRHWRAGVGAEAWRDYLKLGANFYTPLSSWRDSPRFEGMEERAARGMDVRLEAYL 338 Query: 238 PFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGE 297 P Y + S++ EQY G+RV L ++ +P A++ GL+Y P PL+ + + + + Sbjct: 339 PAYPQWSASLTAEQYLGERVGLLDADQLERDPHAITAGLHYNPFPLLKMDVEQVEASGRQ 398 Query: 298 NQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFL 357 + L L ++ G L L+ V +SL G R+D +RNN LEYR + L L Sbjct: 399 HDTRFTLGLEWKLGATLWDMLNPSSVD--KSLAGMRHDLIERNNDMVLEYRDKVLLKASL 456 Query: 358 ATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ- 416 + G+ + L L I+ I + W GD LS A + L +P Sbjct: 457 -NDQYSAVEGQALTLTLNIQHSRQIASIQWLGDVLGLSGLSPADTAGQDKRALTLPSLPT 515 Query: 417 NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV-----EPFDALSNDELRWEP 464 G SN + + +V D G + + + + +P L+ ++ P Sbjct: 516 YRIGQSNQYPVVAIVTDIDGHEAIAEGV-VAVSEDSGLQPAIQLAEHFVQLLP 567 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 255/323 (78%), Positives = 283/323 (87%) Query: 142 LTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYL 201 +TWSQLGLTQQ +GLVSNVG+GQRWA+ WL+GYNTFYDNLLDENLQRAGFGAEAWGEYL Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRWAQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEYL 60 Query: 202 RLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFN 261 RLSAN+YQPFA W TAT EQRMARGYD+ A++R+PFYQH+NTSVSLEQYFGD VDLF+ Sbjct: 61 RLSANYYQPFADWQTHTATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVDLFD 120 Query: 262 SGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAG 321 SGTGYHNPVAL LGLNYTPVPL+T+TAQHKQGESG +QNNLGL LNYRFGVPLKKQL+A Sbjct: 121 SGTGYHNPVALKLGLNYTPVPLLTMTAQHKQGESGVSQNNLGLTLNYRFGVPLKKQLAAS 180 Query: 322 EVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG 381 EVA+SQSLRGSRYD PQRN+LPT+EYRQRKTLTVFLATPPWDL PGETV LKLQ+RS +G Sbjct: 181 EVAQSQSLRGSRYDTPQRNSLPTMEYRQRKTLTVFLATPPWDLTPGETVALKLQVRSVHG 240 Query: 382 IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSS 441 IR L WQGDTQ LSLT G S EGWT+IMP W + EGA+N WRLSVVVED +GQRVSS Sbjct: 241 IRHLSWQGDTQALSLTAGTDTRSTEGWTIIMPAWDHREGAANRWRLSVVVEDEKGQRVSS 300 Query: 442 NEITLTLVEPFDALSNDELRWEP 464 NEITL L EPF + +D W+P Sbjct: 301 NEITLALTEPFITMPDDNPHWQP 323 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 360 bits (924), Expect = 6e-98, Method: Composition-based stats. Identities = 108/347 (31%), Positives = 159/347 (45%), Gaps = 15/347 (4%) Query: 80 EQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND 139 E + VE WLS +G A V++ D G++ S + PL DN Sbjct: 41 ENGSNGLKSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNK 100 Query: 140 RYLTWSQLGLTQQDNGLVSNVGVGQRWAR-GNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 + + ++QLGL D N+G+G R NW+ G N F+D+ +R G GAEAW Sbjct: 101 KSVLFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWT 160 Query: 199 EYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDR 256 +YL+L+AN Y WH ++ A G+D+ A +P Y L V EQY+G+ Sbjct: 161 DYLKLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGEN 220 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LF+ ++P A+++GLNYTP+ LVT +K+G+ ++ LN Y G + Sbjct: 221 VALFDKDHLQNDPSAVTMGLNYTPISLVTAGIDYKRGQDSQDDVKFSLNFRYAIGESWSQ 280 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRK--------TLTVFLATPPWDLKPGE 368 Q SA +VA +SL GSRYD RNN L+Y+++ TL P D Sbjct: 281 QTSADQVALRRSLAGSRYDLVNRNNEIILQYKKKDAELVLADMTLVATKDHSPADGTTAN 340 Query: 369 TVPLKLQIRSRYGI--RQLIW--QGDTQILSLTPGAQANSAEGWTLI 411 V L+ + + W G Q+ S AN +L Sbjct: 341 MVTLQAITSDHKPVPGATIAWAVTGGAQLSSKNSVTDANGDASVSLT 387 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 355 bits (911), Expect = 2e-96, Method: Composition-based stats. Identities = 115/422 (27%), Positives = 185/422 (43%), Gaps = 16/422 (3%) Query: 41 DGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQ 100 +P +A +++ + A + AK A + VN+ Sbjct: 19 SQIPLPVIADSDNEIQSWIAGTASSISPHLQEGTL---EDYAKGKIKALPGQAANHLVNE 75 Query: 101 HVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN-----G 155 ++S P V +++ + S F+P+Q+ L + QLG DN Sbjct: 76 GIKS-AFPEIIFRGGVNLEDGAKYRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGR 134 Query: 156 LVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH 215 NVG+G R WL+G NTF D + + R G G E + + L S N+Y P W Sbjct: 135 TYVNVGMGYRQEVNGWLLGVNTFLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWK 194 Query: 216 EQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALS 273 A ++R A G+DL + +P + + ++ EQY+GD+VDL +GT NP A Sbjct: 195 TSAAHELHDERPAYGFDLRTKGTLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAG 254 Query: 274 LGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSR 333 L + PVPL+ V A ++ +G +Q GL +NY FG PL +QL V S +R Sbjct: 255 ADLVWNPVPLLEVRAGYRDAGNGGSQAEGGLRVNYSFGTPLHEQLDYRNVGA-PSNTTNR 313 Query: 334 YDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQI 393 RN + YR++ + + + P G V L + SRY I ++ W GD ++ Sbjct: 314 RAFVDRNYDIVMAYREQAS-KIRITAMPVSGLSGTLVTLMATVDSRYPIEKVEWSGDAEL 372 Query: 394 LSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFD 453 L+ G Q + G LI+P + L + V D++G RV+S I + + + Sbjct: 373 LA---GLQLQGSLGSGLILPQLPLTVTDGQEYSLYLTVTDSRGTRVTSERIPVRVTQDET 429 Query: 454 AL 455 + Sbjct: 430 SF 431 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 355 bits (911), Expect = 2e-96, Method: Composition-based stats. Identities = 126/477 (26%), Positives = 193/477 (40%), Gaps = 52/477 (10%) Query: 4 FVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIV 63 + +I+ + +LL A P + + A + Sbjct: 16 LLKKIVIWAQILLQIAFPL---LVLPAHA--------------SSGPGATETDMSDASTL 58 Query: 64 KDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGH 123 +S NG D + VE WLS +G A V + VD+ G+ Sbjct: 59 SASLASSAAQNGADAM-------KNTATHLATTHAASTVEEWLSHFGTAQVTLDVDDNGN 111 Query: 124 FTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR-GNWLVGYNTFYDNL 182 + S + PL DN + + ++QLG+ D N+G+G R +W+ G N F+D+ Sbjct: 112 WDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIGLGVRTFYVRDWMFGGNVFFDDD 171 Query: 183 LDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ--TATQEQRMARGYDLTARMRMPFY 240 +R GFGAEAW YL+LSAN Y + WH ++ A GYD+ A +P + Sbjct: 172 FTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGDFDNYNEKPADGYDVRAEGYLPSF 231 Query: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQN 300 L + EQY+GD V LF+ NP A+++GLNYTPVPL+T +K+G+ ++ Sbjct: 232 PQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLNYTPVPLITAGIDYKRGQDSMDEM 291 Query: 301 NLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRK--------T 352 LN +Y + Q+S +VA +SL GSRYD RNN L+Y+++ T Sbjct: 292 KFSLNFHYALDSSWQSQISPEQVATRRSLAGSRYDLVDRNNEIILQYKKKATSKAVADMT 351 Query: 353 LTVFLATPPWDLKPGETVPLKLQIRSRYGIRQ--LIW--QGDTQILSLTPGAQANSAEGW 408 L P D +TV L ++W G+ + S AN Sbjct: 352 LATIKNNSPADGTSADTVTLHAVTADGKPAAHAAIVWTVSGNAALSSTNSVTDANGNTSV 411 Query: 409 TLIMPDWQNGEGASNHWRLSVVVEDNQGQRV--SSNEITLTLVEPFDALSNDELRWE 463 L N V+V G V +S L + ++ D + Sbjct: 412 NLT-----------NTTAGQVIVTATSGSVVRTTSAAFNLLVANLDLVVTKDNSIAD 457 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 333 bits (855), Expect = 6e-90, Method: Composition-based stats. Identities = 116/396 (29%), Positives = 185/396 (46%), Gaps = 32/396 (8%) Query: 7 RIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDF 66 + + +LL + +AA N + D +P + A++ + Sbjct: 2 KSMAIMQILLQTALPVALSMSATVRAAELSQNTHSADKDNINSP-------YSAQMTQAA 54 Query: 67 GETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTG 126 S + A +A VE WLS +G A V + VD++G++ Sbjct: 55 TALSSGNAAGAGASMASGYAGD------------SVEKWLSQFGTARVQLNVDDKGNWDD 102 Query: 127 SRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQR-WARGNWLVGYNTFYDNLLDE 185 S + PL D+ + + ++QLGL D+ + N G+G R + NW+ G N F+D+ Sbjct: 103 SAIDFLAPLYDSQKAMLFTQLGLRAPDDRVTGNFGLGVRTFYTDNWMFGGNVFFDDDFTG 162 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFYQHL 243 + +R GFGAEAW L+LSAN Y WH ++ A G+D+ A +P Y L Sbjct: 163 DNRRVGFGAEAWTNNLKLSANTYLGTTNWHSSRDFDDYYEKPADGFDVRAEGYLPAYPQL 222 Query: 244 NTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLG 303 + EQY+GD+V LF+ NP A+++G++YTPVPL+T +++G+ ++ + G Sbjct: 223 GAKLMYEQYYGDKVALFDKDDLQSNPSAVTVGVSYTPVPLITAAVDYRRGQDSMDETHFG 282 Query: 304 LNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRK--------TLTV 355 +N Y FG L QLS+ EV +SL GSRYD +RNN L+Y+++K LT Sbjct: 283 VNFRYNFGQSLSSQLSSSEVQNLRSLAGSRYDLVERNNEIVLQYKEKKQNNAVADMLLTT 342 Query: 356 FLATPPWDLKPGETVPLKLQIRSRYGIRQ--LIWQG 389 P D TV ++ +R + W Sbjct: 343 VKDNSPADGVTANTVTVRATTSDGTPVRNTVISWSI 378 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 296 bits (758), Expect = 1e-78, Method: Composition-based stats. Identities = 75/283 (26%), Positives = 120/283 (42%), Gaps = 10/283 (3%) Query: 4 FVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIV 63 + R + + + + T A +++ EK+ A Sbjct: 17 VLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFA 76 Query: 64 KDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGH 123 + G + D + + + NQ ++ WL +G A V + VD + Sbjct: 77 ANAGTFLSSQPDSDAT-------RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS 129 Query: 124 FTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLVGYNTFYDNL 182 S P+ D + ++Q + + D+ SN+G G R GN W+ G NTF D+ Sbjct: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHD 189 Query: 183 LDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA--TQEQRMARGYDLTARMRMPFY 240 L + R G GAE W +YL+LSAN Y + W + ++R A G+D+ A +P + Sbjct: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAW 249 Query: 241 QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPL 283 L S+ EQY+GD V LF +P A+S + YTPVPL Sbjct: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 281 bits (719), Expect = 4e-74, Method: Composition-based stats. Identities = 80/365 (21%), Positives = 129/365 (35%), Gaps = 13/365 (3%) Query: 26 STFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDT------- 78 + +A P G P L A + A + + + Sbjct: 51 GSILAQALLPLSALAQGAPTLRPARVAQEEAGQDAAWTRKLAAQAESLARRQAERQPGAR 110 Query: 79 --GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQ 136 G+ K A +V D L VN ES L N + D E T + + + Sbjct: 111 VDGDYLKREAQAQVNDVLRDGVNLARESGLPFLRNLQGGLSHDFESGRTSLQLNTIDEVY 170 Query: 137 DNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA-RGNWLVGYNTFYDNLLDENLQRAGFGAE 195 R QLG Q++ +N G R +VG N F D + R G E Sbjct: 171 RAGRNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLE 230 Query: 196 AWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYF 253 L N Y P + W ++ +E++ A G D+ R F L+ S + ++ Sbjct: 231 VIAPEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWN 290 Query: 254 GDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVP 313 G VD F++G +G+ Y PV LV+V + + G + + L LN P Sbjct: 291 GAEVDYFDNGRTQAGAKGFKVGVEYRPVSLVSVGLEQTKVIGGGRETRMQLGLNINLSEP 350 Query: 314 LKKQLSAGEVAE-SQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPL 372 L KQL + S R+ +R N L R+++ + + + L+ V + Sbjct: 351 LSKQLRRDASGTPAFSPDARRHALVERENRIVLNTRRKEIILPLVVSEVSTLQADGRVTV 410 Query: 373 KLQIR 377 + Sbjct: 411 IGATQ 415 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 80/458 (17%), Positives = 142/458 (31%), Gaps = 57/458 (12%) Query: 13 LLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMN 72 +L L A AQ +A P + D M A + Sbjct: 33 VLTLQTVAPAFAQGA-PSFSARPAQADRQDAADSAMLRVA-----QTARQLAQRQAAGSR 86 Query: 73 DNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWF 132 + G+ K A + + L + V ++ L + V+ + Sbjct: 87 ASARVDGDLLKGQAEAQANELLQEGVRLANQTELPFLR--RLQGGVNYDFSNKDLSLDLR 144 Query: 133 V--PLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWL-VGYNTFYDNLLDENLQR 189 + +R QL +++ N GV R A + VG N F D +N R Sbjct: 145 TIDEVHRGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEFGKNHLR 204 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARMRMPFYQHLNTSV 247 G E L N Y P + W ++ +E+R A G+D+ R++ L Sbjct: 205 GSLGGEVIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALPGLAIKG 264 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 ++ G VD F++G N G+ Y PVPLV V + + G Q + L +N Sbjct: 265 QYFRWSGAAVDYFDNGRPQRNARGYKYGVEYRPVPLVAVGLEQTKVLGGARQTTVQLGVN 324 Query: 308 YRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPG 367 G PL +QL + L+ + +R N L+ R++ + Sbjct: 325 LSLGEPLSRQLRHQS-GPAFDLQARMGEFVERENRIVLQTRRKHVVLPLTIARVDTDPAT 383 Query: 368 ETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ---NGEGASNH 424 + + +L +P+ + S Sbjct: 384 GRITVTG--------------------------VTEPGAQVSLGLPNGEVVVAQADGSGT 417 Query: 425 WR-----------LSVVVEDNQGQRV---SSNEITLTL 448 +R + + G R + + + + + Sbjct: 418 YRATSARDMVGGPVRARATNRHGDRSREVTHHYVDVAV 455 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 267 bits (682), Expect = 7e-70, Method: Composition-based stats. Identities = 95/471 (20%), Positives = 159/471 (33%), Gaps = 37/471 (7%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 R + A + A F + E A+ ++ Sbjct: 23 YRRHRAGAAGMSAVLAMQAAAPVAYGQGAPTFSATQVADAASNAVAQPGAVETRVAQTIQ 82 Query: 65 DFGETSMNDNGLDTGEQA--KAFALGKVRDALSQQVNQHVESWLSPWGNA---SVDVKVD 119 + G + F + + + V Q V+ W + G ++ V Sbjct: 83 ALAQAREAGGARQDGRASLDGQFLRSQAQAQANVLVQQGVQ-WANETGLPWLRRLEGNVS 141 Query: 120 NEGHFTGSRGSWFV--PLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLVGYN 176 + L + QLG Q++ N GV R A G+ ++G N Sbjct: 142 YDFSGRDVAVDVRTIDALHLDQDRALLLQLGGHNQNHRPTVNAGVVARSAAGSSLILGGN 201 Query: 177 TFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTAR 234 F D + + R GAEA L N Y P + W ++ +E+R A G+D+ Sbjct: 202 AFLDYEVGKRHLRGSLGAEAVAAQFTLYGNVYAPLSGWKAAKRAERREERPAAGWDVGFT 261 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 R Q L + ++ G +VD F+ G NP G+ Y PVPL+ V + + + Sbjct: 262 ARPEAVQGLALNAQYFRWRGAQVDYFDDGRYRRNPSGFKYGIEYRPVPLIGVGVEQARLQ 321 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSL-RGSRY-DNPQRNNLPTLEYRQRKT 352 SGE Q ++ L + G PL +QL G + RG+R D +R N L+ R++K Sbjct: 322 SGERQTSVQLGVRLNLGEPLSRQLRRGAQDTAPPFDRGARLQDFVRRENRIVLDTRRKKI 381 Query: 353 LTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI-WQGDTQI----LSLTPGAQANSAEG 407 + + L P + + + W D + G +A+SA Sbjct: 382 V-LALRIAEVRTDPATGRITVYGVTE--PLADVQLWLPDGTATSVRANAAGGFEASSAGD 438 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSND 458 T + + D G E+T + D + Sbjct: 439 MTSGL--------------IRARATDRYGDTSQ--EVTYAYTDTVDKTAPV 473 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. Identities = 79/376 (21%), Positives = 130/376 (34%), Gaps = 13/376 (3%) Query: 25 QSTFEQKAANPFDNNNDGLPDLGMAPEN-----HDGEKHFAEIVKDFGETSMN--DNGLD 77 + +AA P G P++ PE D A +D + + Sbjct: 48 CLSLGMQAAAPLAVLAQGAPEMTNRPEAGDIVPSDVLTQVAVRAQDLARRQADRREGAQV 107 Query: 78 TGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQD 137 + K + L + V ES L N D++ D + T + Sbjct: 108 DADYLKQQGQAQFNQFLQEGVRAANESGLRFLRNLQGDLRHDFDNGRTSLELRTIDQVYR 167 Query: 138 NDRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDENLQRAGFGAEA 196 QLG Q+N +N+G R ++G N F D + R G EA Sbjct: 168 KGANTGLLQLGGHNQNNRPTANLGGVYRRDINERLMLGANAFLDYEFAKQHLRGSLGVEA 227 Query: 197 WGEYLRLSANFYQPFAAW--HEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFG 254 N Y P + W ++ +E+R A G DL + F L+ + ++ G Sbjct: 228 IAPEFSFYGNVYAPMSGWTGAKRDNRREERPASGMDLGMKYSPGFAPGLSLKANYFRWNG 287 Query: 255 DRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPL 314 VD F++G G+ Y PVPL+++ + + G +Q ++ L + PL Sbjct: 288 AAVDYFDNGRTQDRATGFKYGVQYKPVPLLSLGVEQTRVIGGASQTSVQLGVALNLSEPL 347 Query: 315 KKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKL 374 KQL G +L R +R N L RQ+ + T + L Sbjct: 348 SKQLRRGGETPVFNLDAHRNALVERENRIVLNTRQKLIILPLTVTTVLTDSVSGRITLVG 407 Query: 375 QIRSRYGIRQLIWQGD 390 Q +++ + W Sbjct: 408 QTQAQ---ATVNWTLP 420 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 250 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 73/360 (20%), Positives = 148/360 (41%), Gaps = 24/360 (6%) Query: 8 IIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFG 67 PF + L AN + + ND D ++ E+ + ++++ G Sbjct: 259 KKPFETVYLENPTNANYYNENLKTQKA----LNDNKKDNNLSKEDQEFSNKVMKVIQTAG 314 Query: 68 ETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGS 127 +++ E K A + + ++ + ++S L+ N + F+G+ Sbjct: 315 AIYDSEDSKSKEEIVKNMASSYLNTSANELAKEFIDS-LNTSINTDFSFNYNERSGFSGN 373 Query: 128 RGSWFVPLQ-DNDRYLTWSQLGLTQQ-DNGLVSNVGVGQRWA--------RGNWLVGYNT 177 + + DN + + Q G+ + ++ + + G G R+ GN ++G N+ Sbjct: 374 AKALLPIVSEDNPKISYFLQSGIGEFANDRTIGHFGGGIRYYPNATALNNSGNIMLGLNS 433 Query: 178 FYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT----QEQRMARGYDLTA 233 YD+ +R GAEA + L +AN YQ ++W + ++R A G+D Sbjct: 434 VYDHDFSRGHKRMSLGAEAMVDTLAFNANVYQRLSSWIDSYDFDKDYVQERPANGWDAKI 493 Query: 234 RMRMPFYQHLNTSVSLEQYFGDRVDLFNS---GTGYHNPVALSLGLNYTPVPLVTVTAQH 290 + P +++ + Q++G++V +F + NP+ G++Y+P P +T T H Sbjct: 494 KYAFPSLINVSFFAKMGQWYGNKVGIFGANSVDDLEKNPLIYEGGISYSPFPALTFTLSH 553 Query: 291 -KQGESGENQNNLGLNLNYRFGVP-LKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYR 348 + ES + ++ N+N +K S ++ G+R R+ LEYR Sbjct: 554 SRSAESSKKNTSINANINIPLDEKAMKLAFEPKLAGISNTIEGTRTQFIDRDYSMVLEYR 613 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 233 bits (595), Expect = 9e-60, Method: Composition-based stats. Identities = 74/340 (21%), Positives = 127/340 (37%), Gaps = 24/340 (7%) Query: 130 SWFVPLQDNDRYLTW--SQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENL 187 L + L + + G +++G+G R + G N F+D L Sbjct: 9 DTLHELDQPLKKLAYVSNHWGPLLFHGRDFAHLGLGYRQLDDSQFFGVNVFFDYDLSRQH 68 Query: 188 QRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE------QRMARGYDLTARMRMPFYQ 241 R GAE +Y S N Y P + W + E ++ A+G+DL +P Sbjct: 69 TRVSVGAEYGLDYGTFSTNAYFPLSNWKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLDT 128 Query: 242 HLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNN 301 ++ QY G V+ + NP SL + P P + ++ + + Q Sbjct: 129 RWKFGLTAGQYLGRYVEHSDGSLPSKNPYHFSLSTEFRPDPAWAFSLGYQTEQGAKEQWI 188 Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRG---SRYDNPQRNNLPTLEYRQR-KTLTVFL 357 G+N + + L QSL D QR++ LEY+Q+ +++ L Sbjct: 189 AGIN----YSLSLSGLYEGERRLSQQSLLPKPERLTDFVQRDHNMVLEYKQKFAEISIRL 244 Query: 358 ATPPW-DLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQ 416 + + ++++ I WQGD + QA+S T I P ++ Sbjct: 245 PESALVTELSQQMLSSWMEVKGGADIVSYQWQGDAA--NYLNDIQASSP---TFIAPAYR 299 Query: 417 NGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 A+N LSV + GQ SN + +T+ + S Sbjct: 300 Y--DANNTLSLSVSYKLRSGQIKQSNTMKITVTDSKVLES 337 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 233 bits (594), Expect = 1e-59, Method: Composition-based stats. Identities = 60/243 (24%), Positives = 96/243 (39%), Gaps = 24/243 (9%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 V + +L T ++F A N + A + Sbjct: 16 VTWSVIATQILSPVTFTLIPANSFASSANTESAQTNAN----------DEYANELASLAA 65 Query: 65 DFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 + G++ N+ A D LS Q + V WL +GNA + + VD Sbjct: 66 NAGQSLANNT-----------AGRFAVDTLSAQATKEVVDWLQQYGNARIKLNVDESFTL 114 Query: 125 TGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA-RGNWLVGYNTFYDNLL 183 + + P D+ Y+ +SQ L + D+ +N+G+G R N ++G N FYD L Sbjct: 115 KDAAFDFLYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFYDYDL 174 Query: 184 DENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMPFYQ 241 + RAG G E W +Y+R AN Y + W + +R A G+D++A +P Y Sbjct: 175 SRHHSRAGLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWLPVYP 234 Query: 242 HLN 244 L Sbjct: 235 QLG 237 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 217 bits (552), Expect = 8e-55, Method: Composition-based stats. Identities = 69/294 (23%), Positives = 110/294 (37%), Gaps = 13/294 (4%) Query: 77 DTGEQAKAFALGKVRDALSQQVNQHVESWLSPW-GNASVDVKVDNEGHFTGSRGSWFVPL 135 DT AL A Q Q WL G D+ F+ + Sbjct: 47 DTSPGLAQSALDAGVAAGLQASRQTGLPWLRHLDGGLRYDLDPG-RLSFSLRTIDDLM-- 103 Query: 136 QDNDRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDENLQRAGFGA 194 ++R Q GL Q+ +N G+ R A +VG N F D + R G Sbjct: 104 -VSERRALMLQAGLHNQNQRPTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLGL 162 Query: 195 EAWGEYLRLSANFYQPFAAWHEQTAT--QEQRMARGYDLTARMRMPFYQHLNTSVSLEQY 252 EA + L AN+Y P + W +E+R A GYDL ++ L+ + ++ Sbjct: 163 EAIAPHYSLYANYYAPLSGWKGARRDSRREERPAAGYDL--GGQLSSDAGLSLQAAYFRW 220 Query: 253 FGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGV 312 G +D+F+SG N G+ Y P L + + G+ Q ++ LN+ Sbjct: 221 HGAGIDVFDSGRAQRNASGFRYGVAYQPGALFNIGLNQTRTLDGQKQTSVQLNVRINLQE 280 Query: 313 PLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKP 366 P +QL +L R+ +R + L R++ +T+ L+ P Sbjct: 281 PPSRQLRRESQPF--NLTSRRHQWVERESRIVLNTRRKA-ITLPLSIAQLRGDP 331 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. Identities = 66/329 (20%), Positives = 118/329 (35%), Gaps = 26/329 (7%) Query: 39 NNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQV 98 N L +AP A+ +K E + + + + G V + + Sbjct: 1 MNKKQKILLIAPLIVAVSLTQADALKSALEMQDAQDKAEIMDLSTMLLAGDVEALKNTAI 60 Query: 99 NQHVE-------SWLSPWGNASVDVKVDNEGHFTGSRGSWFV-PLQDNDR--YLTWSQLG 148 + VE S+L + +V++ +G S G V PL D D ++Q Sbjct: 61 DGVVEKGVGVTKSFLEQY-FPTVELNFGAQGGSKPSGGLLVVAPLSDPDDIFNTYFTQGS 119 Query: 149 LTQQDNGLVSNVGVGQRWARGNWLV--GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSAN 206 + +DN N+G+G R N ++ G N FYD+ + R G EA ++AN Sbjct: 120 VFYEDNRTTLNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGLEARTTVWEINAN 179 Query: 207 FYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGY 266 Y W E+R GYD+ A + +P+ V Q+ ++ S Sbjct: 180 KYWATTKWKTGKNGLEERALDGYDIEAGVPLPYMNWATVFVKNFQW---DSEISGSKDIK 236 Query: 267 HNPVALSLGLNYTP-VPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPL------KKQLS 319 N + L Y P + + + A + +N+ Y Q Sbjct: 237 GNDLQLRA---YIPGITGLEIQAGRTFFSDSSGTDENYINIFYNVTQLFADKPRYNHQWI 293 Query: 320 AGEVAESQSLRGSRYDNPQRNNLPTLEYR 348 + + + +S+ RY+ +R N + + Sbjct: 294 SKDAYKLESMEDRRYEKVRRTNNIVKQIK 322 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 184 bits (468), Expect = 5e-45, Method: Composition-based stats. Identities = 63/305 (20%), Positives = 120/305 (39%), Gaps = 21/305 (6%) Query: 60 AEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVD 119 A + G + D EQ K+ + ++ + S V+ + + LSP + V+V + Sbjct: 15 AVLTMSLGFSLS--VSADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDT-VEVSTN 71 Query: 120 NEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWAR--GNWLVGYNT 177 + + DN ++Q+ L + D N+G G R W+ G N Sbjct: 72 LKEGDSTVDIGVLKAFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNA 131 Query: 178 FYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRM 237 FYD+ + +R G G E L N Y + + + + ++ G D+ ++ + Sbjct: 132 FYDHEFPNDHKRNGVGFEVVSSVLESRVNSYNGTTGYIKDKSGTDSKVLDGRDMGFKVAL 191 Query: 238 PFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGE 297 P+ + ++ Q+ G +D + SL N + L V +K + + Sbjct: 192 PYLPGMMFGMNAVQWKG--IDGLKDQKMRKYSLGGSLSDNLS---LSYVRTDYKDA-AKK 245 Query: 298 NQNNLGLNLNYRFGVPLKKQLSA------GEVAESQSLRGSRYDNPQRNNLPTLEYRQRK 351 + +++ LN + FG +K + + E + L RYD +R N ++ Sbjct: 246 DIDSISLNYTWAFGQ--EKHVRPTLFALSDKAYEFKKLGAERYDLVKRENNLV--KKKSG 301 Query: 352 TLTVF 356 TLTV Sbjct: 302 TLTVT 306 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 183 bits (464), Expect = 1e-44, Method: Composition-based stats. Identities = 138/198 (69%), Positives = 157/198 (79%), Gaps = 8/198 (4%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFA 60 +SR V R LLLL A GT A +A +PFD N LPDLGM PE+H+GEKHFA Sbjct: 50 LSRIVFRSFSLSLLLLAASGTIRA------QAQDPFDQNR--LPDLGMMPESHEGEKHFA 101 Query: 61 EIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDN 120 E+ K F E SM +N LDTGEQA+ FA G+VRD +S+QVNQ +ESWLS WG+ASVD+ VDN Sbjct: 102 EMAKAFSEASMKNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAWGSASVDINVDN 161 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYD 180 EGHF GSRGSWF+PLQD RYLTWSQLGLTQQ +GLVSNVG+GQRWA+ WL+GYNTFYD Sbjct: 162 EGHFNGSRGSWFIPLQDKQRYLTWSQLGLTQQTDGLVSNVGIGQRWAQDGWLLGYNTFYD 221 Query: 181 NLLDENLQRAGFGAEAWG 198 NLLDENLQRAGFGAEAWG Sbjct: 222 NLLDENLQRAGFGAEAWG 239 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 176 bits (447), Expect = 1e-42, Method: Composition-based stats. Identities = 59/335 (17%), Positives = 125/335 (37%), Gaps = 46/335 (13%) Query: 117 KVDNEGHFTGSRGSWFVPLQ----DNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA-RGNW 171 ++ F + + L++ QLG+ + + N G G+R + Sbjct: 83 TYKSKMQLQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKY 142 Query: 172 LVGYNTFYDNLLDENLQR---AGFGAEAW-GEYLRLSANFYQPFAAWHEQTATQEQR--- 224 +GYNTFY + + + G E W L + N+Y ++ +T+ Q+ Sbjct: 143 AIGYNTFYHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHY 202 Query: 225 MARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 G+ L + + P + + LEQ+ ++ + LSL LNY P+P++ Sbjct: 203 PRSGHQLYIQTKFPRFFEFTGKIKLEQFIYEKKYKKIFNKKNSD-YYLSLDLNYQPIPML 261 Query: 285 TVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGS------------ 332 + + N + + Y+FG P+ +Q+ E++S+ + Sbjct: 262 GFSINNIFVNKQYNSTICRVLIAYQFGTPIIEQIHYTN-NENKSILNNLDTIIQPFIPTI 320 Query: 333 --RYDNP---QRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 +D N+LP+L+ Q+ PGE +K+ + + + W Sbjct: 321 IPHHDYISINDHNHLPSLQRTQK-----------ITGYPGEIKIIKINDNNN---KYVRW 366 Query: 388 QGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGAS 422 ++ + + A + + L P++ + + Sbjct: 367 DLES-LENHGGNIVAITNNTYALYFPNYPIIQENN 400 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. Identities = 63/318 (19%), Positives = 102/318 (32%), Gaps = 32/318 (10%) Query: 42 GLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQH 101 L L ++ + FA IVK+ G N E ++ DA + ++Q Sbjct: 12 ALSALPLSAQEVAKSGKFATIVKNIGNAL---NIGQGEEAVESEVNTLAVDAANAGLDQV 68 Query: 102 VESWLSPWGNASVDVKV-------DNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN 154 + LS ++ V D T + L++ + ++Q +N Sbjct: 69 EDKVLSTSNFTHFELSVGSDTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNN 128 Query: 155 GLVSNVGVGQRWARGN--WLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 N G G R + GYN FYD L +R G G E AN YQ + Sbjct: 129 RTTINTGFGARHINDANTVITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVS 188 Query: 213 AWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 ++ GYD +P++ N L + + Sbjct: 189 KTLTYNGI-QETALDGYDAKLTANLPYFYSSNLYGKLSNW---------KDAASYETEHY 238 Query: 273 SLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAE------- 325 G+N P +T+ Q + N ++NY VPL +V + Sbjct: 239 EAGINAEIAPNLTLRVA-AQHKKNSNNTEAVASINY--SVPLGGANQPAKVKQDGDWSTK 295 Query: 326 SQSLRGSRYDNPQRNNLP 343 + +R Y QR N Sbjct: 296 FEPIREKLYRPVQRENRI 313 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 173 bits (438), Expect = 1e-41, Method: Composition-based stats. Identities = 52/373 (13%), Positives = 106/373 (28%), Gaps = 61/373 (16%) Query: 120 NEGHFTGSRGSWFVPLQDN-DRYLTWSQLGLTQQDNGLVSNVGVGQRW-ARGNWLVGYNT 177 P + L + Q+G+ + G G+R ++GYN Sbjct: 105 KNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVYNKKHIIGYNA 164 Query: 178 FYDNLLDENLQR---AGFGAEAWGEYLRLSANFYQPFAAW-----HEQTATQEQRMARGY 229 Y + + + G E W L+ N Y + Q GY Sbjct: 165 CYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNHHYYQYPKIGY 224 Query: 230 DLTARMRMPFYQHLNTSVSLEQYFGD----RVDLFNSGTGYHNPVALSLGLNYTPVPLVT 285 + A+ P+ + EQ D + +N+ L + L Y P+P+ Sbjct: 225 QICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVSLEYQPIPMFN 281 Query: 286 VTAQHKQGESGENQNNLGLNLNYRFGVPL------------------KKQLSAGEVAESQ 327 ++ ++ + LNY+F VPL L++ Sbjct: 282 LSINNRFIYKKYCNTFFTITLNYQFHVPLKQQLNNVNNNIQQNKIIFNNHLNSILNPFIP 341 Query: 328 SLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIW 387 S+ + NN + + + PGE ++++ Y + W Sbjct: 342 SIDPY---FIKTNNS--------NEILLTQSNNEIIGYPGERKFIQIE---DYK-TNIQW 386 Query: 388 QGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLT 447 ++ + + + +P + +S + DN T+ Sbjct: 387 NYES-LKKQGGKIFHIHNNLYEIHLPKSTHSITDIFLTYISELNIDN----------TIH 435 Query: 448 LVEPFDALSNDEL 460 + ++ Sbjct: 436 YTQQKISIITKNP 448 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 173 bits (438), Expect = 2e-41, Method: Composition-based stats. Identities = 59/349 (16%), Positives = 113/349 (32%), Gaps = 39/349 (11%) Query: 24 AQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVKDFGETSMNDNGLD-----T 78 + + P + NN D + + A F N D Sbjct: 25 YKFEEIKFNQIPNEQNNYEPKD-----KLDEYIIKGANYSTKFVPLMNNGAKGDEYTGIM 79 Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVK--VDNEGHFTGSRGSWFVPLQ 136 + + D + + N ++ + + SV++ +++ F+ + L Sbjct: 80 ADDLNRLLVDAGFDFANAKANGEIQK-IPFFAQTSVNISGGTESDTSFSINSLMKLGELA 138 Query: 137 DND----RYLTWSQLGLTQQDN--GLVSNVGVGQRWARGNW-LVGYNTFYDN---LLDEN 186 +D + L +SQ N G N+G+G R + +VG N F+D + Sbjct: 139 KDDQGDLKTLAFSQARFATATNAEGSTINIGLGIRNRPDDISMVGANAFWDYRMTDYSDA 198 Query: 187 LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ---TATQEQRMARGYDLTARMRMPFYQHL 243 R G G E + + N+Y + ++R+ G+DL R+P L Sbjct: 199 HSRLGLGGEYFWKDFEFRNNWYMAITNEKDVIIKGVDYQERVVPGWDLEVGYRLPNNPEL 258 Query: 244 NTSVSLEQYFG----DRVDLFNSGTGYHNPVALSLGLN-YTPVPLVTVTAQHKQGESGEN 298 + + D L + + P +GL Y + + G + Sbjct: 259 AFYIRGFNWDYKYTQDNSGLEGAVSWQATPH---VGLEAYVSNEISAASTTANTDLPGTD 315 Query: 299 QNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 +N GL +N G P+K + +++ +R LE Sbjct: 316 ENFFGLRMNIT-GNPVKF----EKSNYKKNMVTQMTQPVKRKYDVLLER 359 >UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN04_9BACT Length = 301 Score = 164 bits (414), Expect = 8e-39, Method: Composition-based stats. Identities = 50/298 (16%), Positives = 89/298 (29%), Gaps = 45/298 (15%) Query: 69 TSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKV---------D 119 +A + + +ESW A ++ Sbjct: 12 ILSTSIYAGEASKAVNQIKDSAINKAFSYGDSAIESW------ARDNLTSLRLIEIETRS 65 Query: 120 NEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN--WLVGYNT 177 EG R + ND SQL + D+ N G+ R + + G N Sbjct: 66 REGAKPTFRAISLFEIGGNDFNKILSQLSYSTFDDDETINAGLIYRMMNSDMTVIYGLNI 125 Query: 178 FYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRM 237 FYD+ + R G G E ++ NFY+ H + A GYD ++ Sbjct: 126 FYDHQFNTGHARTGLGFEMKSSVYDVNINFYEAQTEIH-HVDGVPEVAAGGYDAEIGAQV 184 Query: 238 PFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGE 297 P+ Q+ + +++ + +L L P P ++ + + Sbjct: 185 PYLPWAKVYYKAYQWNNETLNIKDGE---------TLSLYMMPTPR--LSVEFGTQDDST 233 Query: 298 NQNNLGLNLNYRF-------GVPL----KKQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 L LNY PL + + G++ + Y+ +R N Sbjct: 234 MSTKSFLKLNYVLCCGETTKSAPLFTVSNQAFNYGKIDNQR-----MYEKVRRENNII 286 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 164 bits (414), Expect = 9e-39, Method: Composition-based stats. Identities = 56/324 (17%), Positives = 105/324 (32%), Gaps = 37/324 (11%) Query: 47 GMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWL 106 G+ E+ + +++ + N NG++ Q AL + LS + ++ + Sbjct: 66 GIWHESPLASRVIDKLLIRNWTSLNNKNGIEWSNQISNLALNLASNKLSDYATKTIQKYP 125 Query: 107 SPWGNASVDVKVDNEGHFTGSRGSWFVPLQD-------NDRYLTWSQLGLTQQ-DNGLVS 158 G ASV+ + EG T G + D + + + T N Sbjct: 126 FVLG-ASVNFDIRTEGA-TNIGGDVLFKIADFGLKDDESRDGIAFLHTKYTGSLSNDSTW 183 Query: 159 NVGVGQRWARGNWLV-GYNTFYDNLLDEN---LQRAGFGAEAWGEYLRLSANFYQPFAAW 214 N G+G R G L+ G N ++D R G G E + + L L+ N+Y Sbjct: 184 NAGLGLRHLIGEELLAGVNGYWDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIAGTGT 243 Query: 215 HE---QTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVA 271 +R+ G+D R+P ++ ++ ++ Sbjct: 244 KTISTNNTDYYERVVPGWDFELGYRLPSNPNIAFFARGFRWDYRN---------RNDNTG 294 Query: 272 LSLGLNYTPVPLVTVT--------AQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEV 323 + Y P V + A Q + N++ + LN+ ++ Sbjct: 295 FQGKVTYQMTPHVRLDSWISNEVPANQTQTNGELDNNDITIGLNFTLTANP---VTYKTN 351 Query: 324 AESQSLRGSRYDNPQRNNLPTLEY 347 Q L+ +R LE Sbjct: 352 NIKQILQQEMVKPVRRRYDVLLER 375 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 162 bits (409), Expect = 3e-38, Method: Composition-based stats. Identities = 51/257 (19%), Positives = 92/257 (35%), Gaps = 21/257 (8%) Query: 107 SPWGNASVDVKVDNEGHFTGSRGSWFVPLQD--NDRYLTWSQLGLTQQDN-GLVSNVGVG 163 + A + + TGS P+ D ++ + ++Q L D+ N+G G Sbjct: 7 DKFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFG 65 Query: 164 QRWA--RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQ 221 R LVGYN FYD+ LD + QRA G EA L AN Y + W Sbjct: 66 NRKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKSGLNNI 125 Query: 222 EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPV 281 ++ G D+ M +P+ N + G + + ++L L Sbjct: 126 NEKALNGSDVELGMPLPYLPWTNLYYRSFNWEGAS----GAADLEGDEISLEAKLT---- 177 Query: 282 PLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS-AGEVAESQSLRGSRYDNPQRN 340 + K+ G ++ L + Y ++ + S+ ++ +R Sbjct: 178 -NFNIEIG-KRSNDGVTEDEEFLKITYTCCNNSNNEIGISDTAYNLTSVSDQKFAKVRRQ 235 Query: 341 NLPTLEYRQRKTLTVFL 357 NL ++K + + + Sbjct: 236 NLIV----KQKEMDLTV 248 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 160 bits (405), Expect = 8e-38, Method: Composition-based stats. Identities = 77/195 (39%), Positives = 112/195 (57%), Gaps = 8/195 (4%) Query: 38 NNNDGLPDLGMAPENHDG-EKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQ 96 + LP+LG A N G EK A + + E + ++N T +++ LG+ +D + Sbjct: 52 SQQQALPNLGSASVNESGTEKKLATLARQMAEVNQDENTDQT---WRSYLLGEAKDRVLD 108 Query: 97 QVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND-RYLTWSQLGLTQQDNG 155 ++ Q E+ LSP G +V + VD G F GS G +PL D R LT+SQLGL D+G Sbjct: 109 RLQQKSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRGLTYSQLGLQGVDDG 168 Query: 156 LVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAG-FGAEAWGEYLRLSANFYQPFAAW 214 +V N+G+ QRW G WL+GYN FYD L+++ R G GAEA +YL LS+N+Y P + Sbjct: 169 VVGNMGLRQRWNAGRWLLGYNVFYDQYLNQDASRRGSIGAEARSDYLTLSSNYYYPLSGM 228 Query: 215 HEQTATQEQ--RMAR 227 H +++ RMAR Sbjct: 229 HAANDDEDELLRMAR 243 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. Identities = 53/322 (16%), Positives = 104/322 (32%), Gaps = 42/322 (13%) Query: 57 KHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDV 116 + A +G + +N NG+D G + + + N+ ++ + + ++ + Sbjct: 109 QKGANYAALYGPSMVNSNGVDLGGLIQTELSRTLISSGVSYANKQIKK-IPFFAQTTLGL 167 Query: 117 KVDNEGHFTGSRGSWFVPLQ-------DNDRYLTWSQLGLTQQDN-GLVSNVGVGQRW-A 167 TG F+ L+ + L + Q +T + + NVG+G R+ Sbjct: 168 DAATSSDLTGY-LDSFMRLKTIGYDNEGDPMGLMFGQARVTLETSAQPQVNVGLGSRFRL 226 Query: 168 RGNWLVGYNTFYDNLLDEN---LQRAGFGAEAWGEYLRLSANFYQPFAAWHE---QTATQ 221 +VG N F+D R G GAE + + L N+Y +A Sbjct: 227 GDEAIVGLNGFWDLRTTNYSTAYTRWGIGAEGFWKSFELRNNWYINGSADKNITINNIDY 286 Query: 222 EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPV 281 +R+ G+D+ R+P Y L V + + + + +N+ Sbjct: 287 VERVVPGWDVEVGYRIPSYPQLAIFVRGFNWDYQD---------HSDNSGIEGSVNWQAT 337 Query: 282 PLVTVTA-----------QHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLR 330 P + + +G + G P+ + Q+L Sbjct: 338 PHANLELWVSNEIPAYPTDSNDTIGNQPGPYIGARVRLT-GRPVVF----TKSNTKQNLL 392 Query: 331 GSRYDNPQRNNLPTLEYRQRKT 352 +R LE + T Sbjct: 393 TQMTQPVRRRYEVLLERVKEPT 414 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 154 bits (389), Expect = 6e-36, Method: Composition-based stats. Identities = 57/347 (16%), Positives = 115/347 (33%), Gaps = 67/347 (19%) Query: 60 AEIVKDFGETSMNDNGLDTGEQ-----AKAFALGKVRDALSQQVNQHVESWLSPWGNASV 114 A + + D ++ +K+F + D L++ V + + +LS V Sbjct: 90 ASYATRIFPLLNSASLSDGIQKMLWMDSKSFIVSFAHDYLNEYVLKQI-PFLSQT-EFGV 147 Query: 115 DVKVDNEGHFTGSRGSWFVPLQDNDRY----LTWSQLGLT--QQDNGLVSNVGVGQRWAR 168 + D + + + L +D L ++Q + + + +R R Sbjct: 148 GFESDADMTYYLNSLISLAQLGSDDNGYPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLR 207 Query: 169 GNWLVGYNTFYDNLLDEN---LQRAGFGAEAWGEYLRLSANFYQPFAAWHE--------- 216 N ++G N F+D R G GAE W + +L+ N+Y Sbjct: 208 DNAMLGANAFWDYRFTNYSSSYSRWGAGAELWWDDFKLTNNWYIAGTGIKRITTSGRAYT 267 Query: 217 ----------------QTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLF 260 T ++R+ G+D+ R+P Y L+ + ++ Sbjct: 268 DTTSLAAGTYDETTLLGANTFDERVVPGWDVALNYRLPSYPQLSLGIRGFRWDY------ 321 Query: 261 NSGTGYHNPVALSLGLNYTPVPLVTVT-----------AQHKQGESGENQNNLGLNLNYR 309 + + +N+ P ++ AQ S + +G+ N + Sbjct: 322 ---MRKSDNSGVEGSVNWQATPHTNLSAWISSEIPAYPAQSNAQLSSGDDVYVGVRFNVQ 378 Query: 310 FGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY---RQRKTL 353 P+ + + + E +L QR N LE +Q+KT+ Sbjct: 379 L-KPVTYKTGSNRIRE--NLLTQMRQPVQRRNDVLLERWKPKQKKTI 422 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 154 bits (388), Expect = 9e-36, Method: Composition-based stats. Identities = 53/278 (19%), Positives = 99/278 (35%), Gaps = 18/278 (6%) Query: 81 QAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFV--PLQDN 138 A V +V++ + + + G V + ++ G S ++ Sbjct: 14 IITTVANADVASQALNKVSEKISNLIPGEGITEVSLDYND-GDEDQLNFSILGVRDIETT 72 Query: 139 DRYLTWSQLGLTQQD----NGLVSNVGVGQRWARG--NWLVGYNTFYDNLLDENLQRAGF 192 D ++Q L Q+ ++ N+G+G R N++ G NTFYD L E R G Sbjct: 73 DNSNFFTQFSLMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRLGL 132 Query: 193 GAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQY 252 G EA G L L+AN Y + +EQ + G+D ++P + + ++ Sbjct: 133 GIEAKGSILDLTANSYTKISNSEVVNGDREQ-VLSGWDFNLTSQIPRAPWARINYNGYKW 191 Query: 253 FGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGV 312 + G+ SL L+ T V + +++ +L +N Y Sbjct: 192 ETE------KGSADQKGNIYSLELDVTNSVEVVASLDKSSLNGVDDETSLSINYIYPPKE 245 Query: 313 PLKKQLSA-GEVAESQSLRGSRY-DNPQRNNLPTLEYR 348 +S + + +R N +E + Sbjct: 246 KSMVMSDGLSNDMFEKSNMEQKLKEKVRRRNKLVMEIQ 283 >UniRef50_C0B2E7 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E7_9ENTR Length = 815 Score = 152 bits (383), Expect = 3e-35, Method: Composition-based stats. Identities = 34/163 (20%), Positives = 69/163 (42%), Gaps = 3/163 (1%) Query: 302 LGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPP 361 +GL+ NY+FG P+ QL + + L S+YD RNN +Y+++ L++ Sbjct: 1 MGLSFNYQFGTPINAQLDPNNIKPLRLLENSKYDFVDRNNNIVFDYQEQSYLSLKTP-DL 59 Query: 362 WDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGA 421 + E + + + S G+ + G ++ L + L +P + EGA Sbjct: 60 IEGYSNEQKTVTISVESSAGLDYIDIDG-SRFLQHGGRIIEQGQNSYLLYLPYYDQQEGA 118 Query: 422 SNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 +N + + D +G+ SS + +++ + + P Sbjct: 119 TNTYNIVATAYDKKGRASSSETTKVVVLKSGI-VQKAAISATP 160 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 150 bits (379), Expect = 1e-34, Method: Composition-based stats. Identities = 45/291 (15%), Positives = 90/291 (30%), Gaps = 22/291 (7%) Query: 67 GETSMNDNGLDTGEQAKAFALGKVRDAL------SQQVNQHVESWLSPWGNASVDVKVDN 120 G M+ T + + A+ + +A+ + + W+ S + Sbjct: 59 GPLVMDRQETKTVQYSNVDAVNRAINAVAMSNVSNAMYGAKGKPWMRRT-TLSFQFQEGW 117 Query: 121 EGHFTGSRGSWFVPLQDNDRYLTWSQLGL-TQQDNGLVSNVGVGQRWA--RGNWLVGYNT 177 + ++ ++ R + ++Q + D G N+GVG R L G + Sbjct: 118 KPLYSVETVQPLGHYDNSSRDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHL 177 Query: 178 FYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWH--EQTATQEQRMARGYDLTARM 235 FYD+ R G E N+Y + + +R+A GY + Sbjct: 178 FYDHRFLNRHNRLSAGLEYMSGESEFRFNWYGSASDERVLDVNLHTLERVANGYTVEYGK 237 Query: 236 RMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGES 295 + V + + + L +G P V+V + + E Sbjct: 238 TFKNARWARVYVEGYHW---------NQERQADKNGLRVGSELQLTPRVSVDMGYNKPEH 288 Query: 296 GENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLE 346 + G P+ + ES ++R + +R N +E Sbjct: 289 SSGGVYGKITFRLA-GAPMAWYGGKHRLEESATVRDKMLNLVRRTNTIFVE 338 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 145 bits (365), Expect = 4e-33, Method: Composition-based stats. Identities = 36/218 (16%), Positives = 76/218 (34%), Gaps = 16/218 (7%) Query: 135 LQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQRAGF 192 + +N + + G QD + +G R ++G N Y + N QR + Sbjct: 71 VSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQRISY 130 Query: 193 GAEAWGEYLRLSANFYQPFAAWH-EQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQ 251 GAE +++N+Y W E++ GYD+ + +P+ + V Sbjct: 131 GAEIRTSVFEINSNYYHRLTDWKLTGVDNNEEKARGGYDVELALAVPYVPSAHFRVKHFC 190 Query: 252 YFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQ-----NNLGLNL 306 + G + +S + + ++ + ++V + SG L + Sbjct: 191 WNG--IASNDSNNPIDDLKGNTFSVSGSVYDGLSVEVGYIDYTSGNADYSKAGGERFLKV 248 Query: 307 NYR---FGVPLKKQLSA---GEVAESQSLRGSRYDNPQ 338 +Y FG + K E + + R++ + Sbjct: 249 SYNFDIFGTHVNKATKPRFSNTPYEFERMDDRRFEKIR 286 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 141 bits (354), Expect = 8e-32, Method: Composition-based stats. Identities = 48/357 (13%), Positives = 106/357 (29%), Gaps = 31/357 (8%) Query: 1 MSRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPEN--HDGEKH 58 M + + + ++ G ++ + +++ L + + Sbjct: 1 MKYIIVLMSALCMFVMPVSGEQVNKAEQPMQGTTVKQVHSESKTILSDDSDTFHVNSSHT 60 Query: 59 FAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDAL------SQQVNQHVESWLSPWGNA 112 ++ G + T + A+ A+ S + W+ Sbjct: 61 PDHVIGQGGLEMPDSTDKKTTRYSDTDAVNSALQAVVMTGVHSAMHGSKAKPWMQ----- 115 Query: 113 SVDVKVDNEGHFTG-SRGSWFVPLQDND---RYLTWSQLGLTQQ-DNGLVSNVGVGQRWA 167 + + + ++ PL D R++ ++Q L D G +NVG+G R Sbjct: 116 RTVLSLRFQKNWKPLYGVETLQPLGHYDETSRHVWFTQERLANAADTGTTANVGIGYRRI 175 Query: 168 --RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT-ATQEQR 224 + G N FYD+ N R G E N+Y+ + AT+ + Sbjct: 176 AENDDHYYGGNLFYDHRFRGNHGRMSVGLEYVSGIGAFRMNWYRGVSGERSLDGATRMEN 235 Query: 225 MARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 ++ GY + + ++ + L +G P + Sbjct: 236 VSNGYTAEYGTSFKNARWARVYMEAYRW---------QLRRSADKHGLRIGTELQLTPRI 286 Query: 285 TVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNN 341 +V + + E + GV + S+R + +N +R + Sbjct: 287 SVDMGYNKPEHEHGSPYGKIMFRLA-GVDTAWFGGNHRLDTKTSVRANMLENVRRQH 342 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 140 bits (353), Expect = 1e-31, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 72/176 (40%), Gaps = 9/176 (5%) Query: 146 QLGLTQQDNGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEY--LRL 203 Q + ++ ++ + G+G VG N F+D ++ R G++ L Sbjct: 127 QANIDYKNEDILISNGIGILPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPNYIFNL 186 Query: 204 SANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 S+N Y P + + D+ A + + SLE +FGD + + + Sbjct: 187 SSNIYFPLSGKGSEDDL-----VNSIDIRAEGAI--TPTVQFHSSLEFFFGDDIQINDDY 239 Query: 264 TGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLS 319 +N + GL+YTP+PL+ + + + + + + L NY PL +QL Sbjct: 240 DPTNNSHKFTAGLDYTPIPLLQLGVEATKVQDHDVGYGVYLYFNYDPWRPLNEQLE 295 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 137 bits (346), Expect = 6e-31, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 87/274 (31%), Gaps = 16/274 (5%) Query: 75 GLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNAS---VDVKVDNEGHFTGSRGSW 131 G+ + + S V S L +++ V T S + Sbjct: 28 GISSASSLENRVTSYFNGLASSLGTS-VSSLLGENSRVKYLDLNLGVQEHFKPTISLTNV 86 Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQR 189 + + + ++Q L +N N+G+G R + G N F+D D++ QR Sbjct: 87 NM-ISEYGNSAIFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQR 145 Query: 190 AGFGAEAWGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSL 249 G G E L +N Y + + ++++ G+D+ +P + V L Sbjct: 146 NGAGLEVLSSVFDLRSNIYDATSGIEAVSTSRDEEAMDGWDMRLDYHLPIKTNARLFVGL 205 Query: 250 EQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYR 309 + F + G + GLN + + + + L+ Sbjct: 206 FE--------FENAAGSYEVEGEKYGLN-VLSKNFDLEVGYIDDNKTGDGSFANLSYILP 256 Query: 310 FGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLP 343 ++ E S+ Y+ +R N Sbjct: 257 LSSKSSFSNNSSNYFEYTSVAERLYEPVKRENKI 290 >UniRef50_Q4ACI6 Invasin (Fragment) n=1 Tax=Edwardsiella tarda RepID=Q4ACI6_EDWTA Length = 270 Score = 133 bits (334), Expect = 2e-29, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 63/146 (43%), Gaps = 13/146 (8%) Query: 327 QSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI 386 + + Y+ RNN LEY++++ + + L+ + G + ++S+Y + Q+ Sbjct: 4 RQIAEIPYNLVDRNNDLVLEYKKQEVIKLALSHHAINDLAGAVYTVSANLKSKYALDQVS 63 Query: 387 WQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEG------------ASNHWRLSVVVEDN 434 WQ D +++ ++L++P ++ + A+N ++L V DN Sbjct: 64 WQ-DGGLVAAGGQLTVIDKNHFSLMLPPYRPAQAKSDAHQTSTAEIAANTYQLIAVAFDN 122 Query: 435 QGQRVSSNEITLTLVEPFDALSNDEL 460 QG + +S + + + P + Sbjct: 123 QGNQSNSETLRVVVQPPQVTAQGTFV 148 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 122 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 57/214 (26%), Gaps = 31/214 (14%) Query: 155 GLVSNVGVGQRWAR--GNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 G V+NVG+G R + VG NTFYD+ + R G E + AN Y+ Sbjct: 141 GTVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLN 200 Query: 213 AWHEQTATQ----------------------EQRMARGYDLTARMRMPFYQHLNTSVSLE 250 + + Q+ GYD++ + V Sbjct: 201 STKSEPYNVPLYEGYFEFLLDGGPAGYTVYKSQKALSGYDVSYARTFKNARWARAYVGAY 260 Query: 251 QYFGDRVDLFNSGTG----YHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNL 306 + G V G G P V++ + + + + Sbjct: 261 HWNGLGVKTHGEGPALALNVGKSHGWQAGTTLQLTPHVSLDVGYTSDNNHSSGAYGFVK- 319 Query: 307 NYRFGVP-LKKQLSAGEVAESQSLRGSRYDNPQR 339 Y G + R D R Sbjct: 320 -YTLGTSKFAWHGGKHSDDIITNARARMLDKVDR 352 >UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BR71_9GAMM Length = 851 Score = 120 bits (302), Expect = 7e-26, Method: Composition-based stats. Identities = 43/269 (15%), Positives = 74/269 (27%), Gaps = 43/269 (15%) Query: 103 ESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL-VSNVG 161 + W PW + V + DN + +P+ D L +++L D G N+ Sbjct: 29 DKW-DPWLESGVSIGTDNSSR---GEAALLLPIYQTDSGLLFTELRGKLFDAGSKEGNLA 84 Query: 162 VGQRWA-RGNWLVGYNTFYDNLLDENLQRA---GFGAEAWGEYLRLSANFYQPFAAWHEQ 217 +G R W +G D E R +G EA N Y ++ Sbjct: 85 LGYRKMINNRWAIGMWVGRDIRTSEYGNRFHQEAWGLEALHPNWDFRINAYNALSSAQAY 144 Query: 218 TATQE--------------QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSG 263 E + GYD R + + F+ Sbjct: 145 PQPVEAELIGNQLFITSAAEVPLSGYDFELGHRFSVLSDQDI------WLYAGAFSFDDE 198 Query: 264 TGYHNPVALSLGLNYT--------PVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLK 315 L + P +T A + + +++ GL L G K Sbjct: 199 LVSTPVEGPKLRAEWRWNNILNDIPGSSLTAEAGYSHDKVRDDKWEAGLKLTIPLGGKAK 258 Query: 316 KQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 + + E + L +R+ Sbjct: 259 RSI-PLSALEKRLLSA-----VERDTDIV 281 >UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T1E3_JANMA Length = 553 Score = 118 bits (295), Expect = 6e-25, Method: Composition-based stats. Identities = 48/317 (15%), Positives = 105/317 (33%), Gaps = 46/317 (14%) Query: 93 ALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQ 152 + + S + +D++ + F+P+ + R L ++ + Sbjct: 21 SAGAYAQNAGQEKWSTY----LDLEGKVGSKRDIGEANLFIPVVQDARSLYFANVRARMA 76 Query: 153 DNG-LVSNVGVGQRWARG-NWLVGYNTFYDNLLDENLQ---RAGFGAEAWGEYLRLSANF 207 + G ++G G R W +G F D +A G EA G AN Sbjct: 77 NGGDFEGSLGGGMRHMLETGWNLGAYGFVDRRRTTYNNSYDQATLGVEALGRQFDWRANV 136 Query: 208 YQP-------FAAWHEQTAT----------QEQRMARGYDLTARMRMPFY-----QHLNT 245 YQP ++ + + + QE+R G+D+ A R+P + + + Sbjct: 137 YQPFGKKSTTLSSSNTGSVSGGSLFVTTTAQEERALPGFDIEAGWRLPVFDEEDTRQVRA 196 Query: 246 SVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLN 305 ++ ++ D + + + +A + L T+ A+++ + +Q+ + L Sbjct: 197 YLAGYRFSDDGLKVQGTRVRAEYVMA-EFSDTWKGAQL-TIGAEYQDDNARGSQSFVALR 254 Query: 306 LNYRFGVPLKKQLSAGEVAESQSLRGSR-YDNPQRNNLPTLEYRQRKTLTVFLATPPWDL 364 L G + A+ + + R R+ + T +A Sbjct: 255 LRIPLG-------NVASSAQRMTAQERRMTAPIVRDVDIV-----SQVGTRQIAHEKAST 302 Query: 365 KPGETVPLKLQIRSRYG 381 G + + + G Sbjct: 303 TAGGQAIVAISSETTTG 319 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 115 bits (287), Expect = 5e-24, Method: Composition-based stats. Identities = 42/234 (17%), Positives = 79/234 (33%), Gaps = 48/234 (20%) Query: 155 GLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 G+V ++G G R + VG NTFYD + L R G G E ++SAN Y + Sbjct: 178 GVVGSIGAGYRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANVYHGLS 237 Query: 213 AWHEQTATQE------------------------------QRMARGYDLTARMRMPFYQH 242 + E + + GY++ + Sbjct: 238 EKKTKPYYFENSLVIVPRADEFHYPEDGYPNGFTKIRYAYENVLDGYNVRYTRDYKNARW 297 Query: 243 LNTSVSLEQYFGDR-----VDLFN-SGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESG 296 ++T V + VD+F + + + L LG P +++ ++ Sbjct: 298 ISTYVEGYHWKTKSPSEHPVDMFYLNQHKWKSISGLKLGATLNITPHISIDLGFN--KNN 355 Query: 297 ENQNNLGLNLNYRFGVP----LKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLE 346 + +++ Y G L + S V+ ++S D +R + +E Sbjct: 356 ISSGEPYVSVMYTLGKSRYAYLGGKHSEDTVSTARS---KMLDKVKR-HDMVVE 405 >UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N7C0_9GAMM Length = 546 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 42/269 (15%), Positives = 79/269 (29%), Gaps = 47/269 (17%) Query: 111 NASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLT-QQDNGLVSNVGVGQRWARG 169 N +D + + + +PL N+ L ++ + D+ N+G+ R Sbjct: 32 NPRIDFEGKLGNDRSIAEADLLIPLWQNNDSLLFANIRGRLDNDDSYEGNIGLALRHMLD 91 Query: 170 N-WLVGYNTFYDNL---LDENLQRAGFGAEAWGEYLRLSANFYQPF-------------- 211 N W +G ++D D + G EA L AN Y P Sbjct: 92 NGWNLGGYGYFDRRKSPYDNFFNQVTLGVEALSLNWDLRANTYIPVGESSYAEDSLDTVD 151 Query: 212 -AAWHEQTATQEQRMARGYDLTARMRMPFYQ-----HLNTSVSLEQYFGDRVDLFNSGTG 265 + E+R RGYD R+P + L ++ + + Sbjct: 152 FSGTTITYRAGEERSMRGYDAEVGWRIPVFSPEADKQLRIYAGGYRF---------TDSK 202 Query: 266 YHNPVALSLGLNYT----PV----PLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQ 317 L P +++ +++ + +Q+ L L+ G K Sbjct: 203 ADTIQGPRARLEMKFNELPFLSRSSRLSLGLEYQHDDPRGSQSFAVLRLSIPLGGSKAKA 262 Query: 318 LSAGEVAESQSLRGSRYDNPQRNNLPTLE 346 E + D R+ + Sbjct: 263 GRRLTPMEQR-----MTDPIVRDVDIVSQ 286 >UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C087_9PLAN Length = 849 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 39/231 (16%), Positives = 65/231 (28%), Gaps = 25/231 (10%) Query: 92 DALSQQVNQHVESWLSPWG---NASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLG 148 V ++ W A D G +G F+PL ++ L ++ L Sbjct: 18 SYAQDPVPEYQPEWFQEEDYLYRAYFDFTGQAGGVNDNGQGLLFIPLAQDEESLFFADLR 77 Query: 149 LTQQDNGL-VSNVGVGQRWA-RGNWLVGYNTFYDNL---LDENLQRAGFGAEAWGEYLRL 203 D+ N G+ R W+ G FYD ++ FG E Sbjct: 78 GNIFDDSSAEGNFGLAYRRMVNDQWIAGMYGFYDVRRSQYSNIFRQGSFGFELLSIEWDF 137 Query: 204 SANFYQP--------------FAAWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSL 249 N Y P + + E+R G D + + N L Sbjct: 138 RVNGYVPSQKQQRVDSLNTAYLSGNNIVMRAGEERAYWGTDFEVGRLLKSFPESNLDAEL 197 Query: 250 EQYFGDRVDLFNSGTG-YHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQ 299 Y G F++ + + Y L + + +G+ Q Sbjct: 198 RGYVGGY--YFDNSAPGFKEMTGPRARVEYRMFDLPWLGNGSRVVLAGQYQ 246 >UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6INS3_RHOCS Length = 922 Score = 107 bits (268), Expect = 6e-22, Method: Composition-based stats. Identities = 41/248 (16%), Positives = 79/248 (31%), Gaps = 51/248 (20%) Query: 123 HFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD-NGLVSNVGVGQRWARGNWLVGYNTFYDN 181 + +PL D+D T+ L + D + V+N+G+G R+ G ++G +YD Sbjct: 25 DGAEGSIAVAIPLADSDAARTFLDLRGSIDDADRRVANIGIGHRFRLGAVVLGGAVYYDR 84 Query: 182 ---LLDENLQRAGFGAEAWGEYLRLSANFYQP----------------FAAWHEQTATQE 222 L+ + +A + L L AN+Y P + H + + Sbjct: 85 VRTDLESDFSQATVSLDLMTADLDLRANYYAPLDDEESVGTTVAGAPRLSGNHIVRSIFQ 144 Query: 223 QR--MARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGL---- 276 R +G+D R+ + G V F G Y + A ++ Sbjct: 145 PREVTLKGFDAEVGYRLGAIE------------GYDVRAFAGGYRYTDDEAPTVDGVKGR 192 Query: 277 --NYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRY 334 ++ + + + ++ R G+ + SR Sbjct: 193 LEAWSQDGRFSFGIEVRD--DDQDDTQAFATFRMRLGL--------FSEPARREGTASRL 242 Query: 335 DN-PQRNN 341 D R + Sbjct: 243 DWPVLRES 250 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 107 bits (268), Expect = 7e-22, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 54/157 (34%), Gaps = 11/157 (7%) Query: 64 KDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKV----D 119 G + G + K D + VN + + + +G + ++ V Sbjct: 1 MALGLSLNATAGGKGVSEVLDAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKG 60 Query: 120 NEGHFTGSRGSWFVPLQDNDRYLTWSQLGL----TQQDNGLVSNVGVGQRWARGN--WLV 173 +E ++ PL ++ L W Q L D N+G+G RW +V Sbjct: 61 DEASYSIITTQPLAPLSEDGSRLFW-QGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIV 119 Query: 174 GYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQP 210 G N+FYD +R G E LS N Y Sbjct: 120 GINSFYDYEFSAKHKRMSLGGEYKRSNAELSVNKYWG 156 >UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Campylobacter RepID=Q4HGX9_CAMCO Length = 267 Score = 94.4 bits (233), Expect = 9e-18, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 76/227 (33%), Gaps = 39/227 (17%) Query: 118 VDNEGHFTGSRGSW--FVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGNWLVGY 175 D F L + + Q + + G+ R+ + ++L+G Sbjct: 73 TDGNLDFQNENVQIKNLNSLYEGENNSLLFQKEFYATQDSYNYSGGLINRYEKDDFLLGI 132 Query: 176 NTFYDNLLDENLQRAGFGAEA-WGEYLRLSANFYQPFAAWHEQTATQEQRMARGYDLTAR 234 N F D ++ + FGAE + ++++ +N+Y P ++ + Sbjct: 133 NGFIDGQKEQKESK-SFGAELGYYQFVKAYSNYYVP--------NEADENL----QFGVS 179 Query: 235 MRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQ-- 292 +P Y +S + ++ ++Y+P ++++ + Sbjct: 180 FTIPSYSAFIFDIS------------------KDSEKINYQVSYSPYSVLSLKILRRDFS 221 Query: 293 GESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQR 339 + L + ++ F KQL + A +RYD QR Sbjct: 222 ANEAIDDTVLQVGFSFNFNESFVKQLRKKDNALQ---EVNRYDFLQR 265 >UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4TV20_9PROT Length = 732 Score = 92.1 bits (227), Expect = 4e-17, Method: Composition-based stats. Identities = 53/348 (15%), Positives = 95/348 (27%), Gaps = 55/348 (15%) Query: 112 ASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD-NGLVSNVGVGQRWARG- 169 SVDV + F+P+ +D L + L + + N G+G R + Sbjct: 34 PSVDVSGKAGETRRIGEVNLFLPIAQDDSNLLFLDLRTSFDNLEQREGNFGLGYRAMQDS 93 Query: 170 NWLVGYNTFYDNLLD---ENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE---- 222 W +G FYD + G EA G+ N Y P + ++ Sbjct: 94 GWNLGAYAFYDRRRSSEGHYFSQITTGLEALGQDFDARINAYLPIG--RKSYEVEDSARV 151 Query: 223 -------------QRMARGYDLTARMRMPFY-----QHLNTSVSLEQYFGDRVDLFNSGT 264 +R G D R+P + + +F +G Sbjct: 152 DLSGGSIQILSGLERAYHGGDAELGWRLPVFATDQDSEIRVYGGGY-WFDAESSEAVAGP 210 Query: 265 GYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVA 324 + L + P VT + + ++ E+ Q+ LGL L Sbjct: 211 RGRIELRLYDPIEALPGSRVTFSGELQRDEARGTQHFLGLKLRIPL----------QAEN 260 Query: 325 ESQSLRGS---RYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG 381 ++ L D R+ + G+TV I Sbjct: 261 TARRLSPQERRMTDPLIRDVDIVTQ---------SATVSEKATLNGQTVASVTNITDGAN 311 Query: 382 IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSV 429 + + D + + L + Q+ + L+ Sbjct: 312 AQSV---IDGAASNTLLVVNGTTTVTSQLSLRQGQSLSSGGSTITLTG 356 >UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillonella dispar ATCC 17748 RepID=C4FS48_9FIRM Length = 421 Score = 90.9 bits (224), Expect = 1e-16, Method: Composition-based stats. Identities = 30/242 (12%), Positives = 60/242 (24%), Gaps = 57/242 (23%) Query: 155 GLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENLQRAGFGAEAWGEYLRLSANFYQPFA 212 G++ +VG+G R + VG N F D N R G E + AN Y+ Sbjct: 168 GIIGSVGIGYRRLSRNEHAYVGVNAFVDRAFTGNYNRISGGVEYVNGLNEVYANVYRGLG 227 Query: 213 AW-------------HEQTATQ----------EQR--------MARGYDLTARMRMPFYQ 241 + + GY++ + Sbjct: 228 DKDLVKGGGGNPYPKRLYPNGYPDTFPYNTIPSENYNTYVGGGVLDGYEIGIVRSFKNAR 287 Query: 242 HLNTSVSLEQYFGDRVDLFNSGTGYHNPV---------------ALSLGLNYTPVPLVTV 286 V+ ++ G+ + +G P +++ Sbjct: 288 WARAYVNGYRWNGNGFSHKQEYNWGRPGHWSVPWFTSRNANHYKGIKIGAELQLTPHISL 347 Query: 287 TAQHKQGESGENQNNLGLNLNYRFGVP----LKKQLSAGEVAESQSLRGSRYDNPQRNNL 342 + + L Y G + S + ++S D +R ++ Sbjct: 348 DIGYNNANNMSKGMYGTLK--YTLGTSKFAFWGGKHSDDTITTARS---KMLDKVRRQDM 402 Query: 343 PT 344 Sbjct: 403 IV 404 >UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA50_9CHLA Length = 531 Score = 90.2 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 37/264 (14%), Positives = 65/264 (24%), Gaps = 34/264 (12%) Query: 106 LSPWGNASVDVKVDNEGHFT---GSRGSWFVPLQDNDRYLTWSQLGLTQ-QDNGLVSNVG 161 S +G + + F PL D Y + L ++ +NVG Sbjct: 266 FSEFGYVRGAYTFGEGIGIRHNYSTLTALFAPLVPYDDYYPFLDLRAHYIKNKRWAANVG 325 Query: 162 VGQRWAR--GNWLVGYNTFYDNLLDE--NLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ 217 G RW ++ G N +YD + + GFG E + + N Y P Sbjct: 326 GGLRWRDCMTGFIFGANLYYDYRNTTQTDFNQFGFGLEFFTNCFEMRLNAYFPVGDVTHC 385 Query: 218 TATQ--------------EQRMARGYDLTARMRM---PFYQHLNTSVSLEQYFGDRVDLF 260 + +G DL P++ Y+ D Sbjct: 386 EDHVFSDYIGPYYAVCGLTEIAQKGVDLEVGHTFWKCPYFSVFGAIGGY--YYTDVCGHR 443 Query: 261 NSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSA 320 + L+++ A+ + L+ L Sbjct: 444 HHNHNNEKRWGEKARACINVGSLLSLQARFFHDNHQNSFWQGMAMLSIPLDFCFGIALRE 503 Query: 321 GEVAESQSLRGSRYDNPQRNNLPT 344 +RN + Sbjct: 504 NYNRVF-------TQPVERNEMIV 520 >UniRef50_UPI0000E0F7DB beta-glycosidase-like protein n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F7DB Length = 744 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 55/170 (32%), Gaps = 24/170 (14%) Query: 298 NQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFL 357 + +N +R P +++ E + + S N P V L Sbjct: 557 SDSNAPYIFTWRPSTPGTHEITVKAFKEDGTEKTSATTLVSLNGEPL-------VFDVSL 609 Query: 358 ATP-PWDLKPGETVPLKLQIRSRYG-IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDW 415 A P ++ GE++ L + S YG + ++ + + + ++ P + Sbjct: 610 AQPSTSEMTAGESLTLDASVSSNYGKVSKVDFHVNGNFV-------------FSSTTPPY 656 Query: 416 QN--GEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 + + L V + G S IT+T+ P + + Sbjct: 657 TFAWQPATAGEYTLDAVAAKSDGSIKQSESITITVNAPAPTPTPVQPAPP 706 >UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQN0_PARL1 Length = 675 Score = 86.3 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 49/381 (12%), Positives = 95/381 (24%), Gaps = 60/381 (15%) Query: 104 SWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLV-SNVGV 162 W PW A + + + + F+PL + L ++ + G+ N + Sbjct: 30 KW-GPWIEAGGFLSTERDRGEATA----FMPLFQSGESLLFADVKGKLFSEGVTEGNFAL 84 Query: 163 GQRWARG-NWLVGYNTFYDNLLD---ENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT 218 G R + +G YD +A FG EA N + P A Sbjct: 85 GYRRMTAWDVNLGLWGGYDIRESVSGNTFDQAAFGIEALAADYDFRLNGFVPLADGKAAP 144 Query: 219 ATQ--------------EQRMARGYDLTARMRMPFYQHLNT-SVSLEQYFGDRVDLFNSG 263 + + G++ R+P + L E F+ Sbjct: 145 GMARVELSGSQILLTGGRELVLGGFEGEVGWRLP-LEALGADRERHEFRLYAGGYRFDDS 203 Query: 264 TGYHNPVALSLGLNYTPV--------PLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLK 315 L + + ++ + + +Q G L Sbjct: 204 DLAKPVQGPRLRAEWRILDAVPGLAGSRLSFESSFQHDSYRHDQWEAGFRLRIPL----- 258 Query: 316 KQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQ 375 + + R+ + + + E V Sbjct: 259 --YGGDAAKTLSPVERRMAEPIIRDTDIVTAPSRAEKV-----ADALTGTVFENVVRVDS 311 Query: 376 IRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQ 435 + Y +L + GA A S +++ + G + L V Sbjct: 312 TQDLYAATHAAGANSLVLLDGSAGAVAASG----IVLDGDRTLAGGGSQLALRGVA---- 363 Query: 436 GQRVSSNEITLTLVEPFDALS 456 + I +T P A + Sbjct: 364 ------SGIAVTYTVPGAAPT 378 >UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VI48_9CYAN Length = 908 Score = 84.8 bits (208), Expect = 7e-15, Method: Composition-based stats. Identities = 46/302 (15%), Positives = 83/302 (27%), Gaps = 44/302 (14%) Query: 93 ALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSW--FVPLQDND-RYLTWSQLGL 149 + L + + + G FVPL + LT+ + L Sbjct: 22 LAQTEAESETADTLR--IKPRLGIGHTSSGGGFDGFTRLEGFVPLLQTPGKNLTFLEGRL 79 Query: 150 --TQQDNGLVSNVGVGQRWA--RGNWLVGYNTFYDNLLDENL--QRAGFGAEAWGEYLRL 203 D L N+ +G R + + G YDN + + G G E+ G Sbjct: 80 FLDNDDANLGGNLILGYRTYSANSHRIWGGYMSYDNRHTGHNTFNQLGLGIESLGTVWDF 139 Query: 204 SANFYQPFAAWHEQTATQ-------------------EQRMARGYDLTARMRMPFYQHLN 244 N Y P + ++ G+D ++ ++ Sbjct: 140 RVNGYLPIGDTRQGVGDAGVRDIFFRRNFLILEQGQNKEAAMGGWDAEVGAKLARI-GID 198 Query: 245 TSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGL 304 + G + G + L P + S +N + G Sbjct: 199 GDLR-----GYGGLYWYDAEGSSEIWGWRVRLEARPSDNFNLGL------SLQNDDLFGT 247 Query: 305 NLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDL 364 NL + G G E + ++ QR N +++ + AT P Sbjct: 248 NLVFTVGATFPGSRPQGLGDEDDQVLARVAESVQRTNAIVIDH--QDDFQDVPATNPETG 305 Query: 365 KP 366 +P Sbjct: 306 EP 307 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 84.4 bits (207), Expect = 8e-15, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 46/134 (34%), Gaps = 10/134 (7%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 + + F ++ + + + + + L EN++G + A+ Sbjct: 12 LRKKKIFSYFIIASQFSFPIALSLTPTIQSYAATVEENK--LSTNTENNNG-RWLAQQTS 68 Query: 65 DFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 G +DN D Q + + + +VN+ +E+W + +G A +++ VD Sbjct: 69 QLGTILSSDNTHDAASQ-------YLINQANSKVNREIENWFNQYGKAQINLGVDKHFTL 121 Query: 125 TGSRGSWFVPLQDN 138 + Sbjct: 122 KTQKLKSLFLFTKQ 135 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 84.4 bits (207), Expect = 9e-15, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 71/233 (30%), Gaps = 36/233 (15%) Query: 143 TWSQLGLTQQDNG-LVSNVGVGQRWAR-GNWLVGYNTFYDNL-LDE-NLQRAGFGAEAWG 198 + D+G +VG+G R W+VG N +YD + +L + G G E G Sbjct: 115 IFLDGKAFLFDHGKWGGSVGIGLRHFSYNGWMVGLNGYYDYRRFNGWDLNQLGLGVELLG 174 Query: 199 EYLRLSANFYQPFAAWH-----------EQTATQEQR--MARGYDLTARMRM--PFYQH- 242 + + N Y P AT +R + G D + P Sbjct: 175 DCVEFRVNGYLPVNKNRWDQCCLFNYSGSYFATLRERGYVWSGLDTEIGTWLVKPSCCQD 234 Query: 243 LNTSVSLEQYFG----DRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE---- 294 + V+ Y+ D+ F+ H+ + + T + ++ Sbjct: 235 IGLYVAAGPYYYRRSHDQDFFFHDQ--KHHTIGGKARILATLGDFIELSMAATHDSVWHT 292 Query: 295 --SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 G + + + Y P+ Q + SL R + + Sbjct: 293 RVQGRAEIIVPFDYFYTLFNPVHDQSTCLVTPSHSSL----TRPVYRQDSIVV 341 >UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLE3_PLALI Length = 1304 Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats. Identities = 42/262 (16%), Positives = 82/262 (31%), Gaps = 42/262 (16%) Query: 132 FVPLQDNDRYLTWSQL-GLTQQDNGLVSNVGVGQRWARGNW--LVGYNTFYDNLLDEN-- 186 +P + ++ + L G + +NVG G R+ N+ ++G N ++D Sbjct: 95 LMPYGFIENFMLFGDLRGFRSNSDRYGANVGGGARYYLENYDRIIGANAYFDYDETSGAP 154 Query: 187 LQRAGFGAEAWGEYLRLSANFYQPFAAWHE---------QTATQEQR-----------MA 226 + GFG E G Y N Y P + Q+ R Sbjct: 155 FRDVGFGIETLGRYWDARVNAYFPVGPTEQLLSQSVVTGSQRFQDTRILFDRERIVGLAP 214 Query: 227 RGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPV--ALSLGLNYTPVPLV 284 +G+D M + + E++ F + P L +P + Sbjct: 215 KGFDAEFGMPL------FFNSFFERHDLRAFGGFYHYQSENLPTLWGWKGRLAADVIPSI 268 Query: 285 TVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPT 344 V + + + N+ N+ + FG +Q + + + +R+ Sbjct: 269 NVGLEVAHDQ--VFKTNVAFNVQWTFG--GFRQGESERITQI----NRMTTPVRRSYNVV 320 Query: 345 LEYRQRKTLTVFLATPPWDLKP 366 + + R +A P P Sbjct: 321 VA-QNRVVDEDVVAINPATGNP 341 >UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RBA5_9CHLA Length = 306 Score = 82.5 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 40/235 (17%), Positives = 64/235 (27%), Gaps = 28/235 (11%) Query: 101 HVESWLSPWGNASVDVKVDNEGHFTG---SRGSWFVPLQDNDRYLTWSQLGLTQ-QDNGL 156 W+ P A + V S G + +PL D++ L + + + Sbjct: 44 QANEWVFPPTLAYLQGVVGKGIGEQNGYASFGIFTIPLLDSNGQLFF-DARIHNLRHERW 102 Query: 157 VSNVGVGQRWARG--NWLVGYNTFYDNLLDE-NLQRAGFGAEAWGEYLRLSANFYQPFAA 213 +NVGVG R A N G N FYD + + G G E N Y P Sbjct: 103 AANVGVGTRIAIPCTNLFFGINFFYDYRRTRHDYHQLGPGLELIHPCWAFRINGYFPICD 162 Query: 214 ---------WHEQTATQE-----QRMARGYDLTARMRMPFY-QHLNTSVSLEQYFGDRVD 258 + Q G DL + + L V + Sbjct: 163 RSLRKHPKVFRFHDNLFAACTQIQNSLSGGDLELETSLRRWDPCLCFDV-----YIAPGG 217 Query: 259 LFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVP 313 F + + L + + + + + + FG P Sbjct: 218 YFYHIRHHRDITGGRLRIGAVLFDYLGLEVRGSYDHYYKGTVQGVAYVEIPFGGP 272 >UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A8_9CHLA Length = 225 Score = 79.8 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 69/233 (29%), Gaps = 39/233 (16%) Query: 137 DNDRYLTWSQL-GLTQQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDEN---LQRAG 191 + + L G D ++ G+G R ++G NT+YD L + G Sbjct: 2 EIKDLDVFIDLDGYRFNDGKWGASTGIGIRKELSDGCVLGLNTYYDYLRGRGRFSFHQVG 61 Query: 192 FGAEAWGEYLRLSANFYQPFA---------AWHEQTATQE------QRMARGYDLTARMR 236 G E + + N Y P + ++H + G D Sbjct: 62 VGFEMLSDCFDVRINGYLPVSEKVHSHQCLSFHYSGTDFHASRCKLEYAYGGLDAEIGKP 121 Query: 237 MPFYQHLNTS--VSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGE 294 + Y + V ++ F G L +++ + V V A + + Sbjct: 122 LLTYYDFDLYGAVGPYYFYRRNFKHFCGGYA-------RLEVDWKSILSVGVQASYDKFN 174 Query: 295 SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEY 347 + Q +++ F + Q+ +RN + ++ Sbjct: 175 AIRLQGIFAVSIPLDFCK--IGAICEDSSLFLQN--------VRRNGVILTDH 217 >UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSX1_9GAMM Length = 808 Score = 79.0 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 35/246 (14%), Positives = 66/246 (26%), Gaps = 52/246 (21%) Query: 132 FVPLQDNDRYLTWSQLGLTQQD-NGLVSNVGVGQRWARGN--WLVGYNTFYDNLLD---E 185 +P + + L ++ L + D + N+G G R N + G+ YD Sbjct: 50 LIPFYQDGKRLGYADLRYSSSDVDTDEINLGAGFRSLNENETAIYGFYGSYDLRKSATER 109 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPFAA--WHEQTATQE--------------------- 222 + ++ FGAE + +NFY P + A + Sbjct: 110 DYRQLTFGAELLTDTWDYRSNFYFPTGDDSYQVGNAEDDVTVESEFVGHDLVRTTTTVGG 169 Query: 223 ----QRMARGYDLTARMRMPF-YQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLN 277 + G D+ + F + + + D + L Sbjct: 170 GTIFEEALSGADIEVGRLLNFDNFEMRGYLGAYHFSADVIGG---------TTGTRARLE 220 Query: 278 YTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNP 337 P L + + + + FG + G V +SL Sbjct: 221 MRP--LKNFNLNFVIEDDDLYRTRGLMEFRWAFG----WDATPGGV---RSLHERMTQFV 271 Query: 338 QRNNLP 343 R+ Sbjct: 272 YRDIDI 277 >UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8X2_9PLAN Length = 1606 Score = 74.0 bits (180), Expect = 1e-11, Method: Composition-based stats. Identities = 39/265 (14%), Positives = 71/265 (26%), Gaps = 41/265 (15%) Query: 127 SRGSWFVPL-QDNDRYLTWSQLGLTQQDNGLVS-NVGVGQRWARGNW--LVGYNTFYDNL 182 S +P + ++ + + L D G N+G G R N + +YD Sbjct: 142 SNLGVLMPFTINPEQSMLFLDLRAMVTDQGAGGVNLGAGWRAYNDNLDKIFTVAGWYDYD 201 Query: 183 LDEN--LQRAGFGAEAWGEYLRLSANFYQP-----------FAA--------WHEQTATQ 221 + G E G+YL N Y P + + + Sbjct: 202 DGHYQDYHQLGLSGEVIGQYLTTRVNGYFPINNNEIIISNNLSGSAYFQTDRIYLNRTRR 261 Query: 222 EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPV 281 + G D +P + Y+ + + Sbjct: 262 SESSYGGVDAEVGGPLPVLGKFGIDGYVGGYYY-------NSDHDKSAAGAKFRAEANIN 314 Query: 282 PLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNN 341 ++ + + + + + L+ G K ++L+ Y RN Sbjct: 315 DWWQMSVSYAKDSVFGSNAWMNVTLSIPEGRSDKWM-------RPKTLQQRMYQPMNRNY 367 Query: 342 LPTLEYRQRKTLTVFLATPPWDLKP 366 +Q T T LA P D P Sbjct: 368 RVVANVKQ--TTTNELAINPDDGLP 390 >UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLP1_ACAM1 Length = 1022 Score = 73.6 bits (179), Expect = 2e-11, Method: Composition-based stats. Identities = 48/298 (16%), Positives = 87/298 (29%), Gaps = 49/298 (16%) Query: 106 LSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDND-RYLTWSQLGLTQQDNGL-VSNVGVG 163 L + + G T +R FVP+ R LT+ + L D G N+ G Sbjct: 36 LRFSPRFGIGANSPSSGTNTTTRLETFVPVWQKPGRALTFFEGRLLLDDQGNPGGNILFG 95 Query: 164 QRWARGNW--LVGYNTFYDNLLDENL--QRAGFGAEAWGEYLRLSANFYQPFAAWHEQT- 218 R + + G + +D +N Q+ G E+ G+ + L N Y P + QT Sbjct: 96 FRQYSDDLKRIFGGHLGFDIRNTDNNTFQQLSLGIESLGKDVDLHLNGYWPVGSTRRQTR 155 Query: 219 ------------------------------ATQEQRMARGYDLTARMRMPFYQHLNTSVS 248 Q + G D ++ +++ + Sbjct: 156 QRIFEVLQLNGDPRFTGNILLLDLLRRRLITRQFEEALAGVDFEVGKQLLSFKN-GGDLR 214 Query: 249 LEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNY 308 Y G F + N + L L P P ++ + + + Sbjct: 215 A--YLG---PYFLHSSIRGNTIGGRLRLQVRPTPNISGGIGIQHDDFFGTHVLGNITFTL 269 Query: 309 RFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKP 366 P V+ + ++ D+ RNN ++ L + P Sbjct: 270 SVNQP------QSPVSPANNVVARMGDSVIRNNSIIVDTPNTTELLETETQTVAAMNP 321 >UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN12_PLALI Length = 2615 Score = 71.7 bits (174), Expect = 5e-11, Method: Composition-based stats. Identities = 41/238 (17%), Positives = 67/238 (28%), Gaps = 48/238 (20%) Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVS-NVGVGQRWARG--NWLVGYNTFYDNL---LDE 185 PL ++ ++L Q L D + NVG+ R + + G N +YDN Sbjct: 70 LTPLLNDGQFLIAPQARLLITDTSKIGVNVGLIGRVYDAGRDRIWGANVYYDNDETTYSN 129 Query: 186 NLQRAGFGAEAWGEYLRLSANFYQPF--AAWHEQTAT------------------QEQRM 225 + GFG E+ G+ L L AN Y P + + Sbjct: 130 RYSQIGFGFESLGQNLDLRANAYLPTGSSDKVIGPNGLSNTLFYTGNQLNFTGSYLSEEA 189 Query: 226 ARGYDLTARMRMP---FYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVP 282 RG D +P L Y +N + L+ Sbjct: 190 LRGADFELG--IPVTQNMSWLRAYGGGYFY----------DATQNNVSGVRGRLDAQLST 237 Query: 283 LVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRN 340 +T + + + +RF + + + L QRN Sbjct: 238 DLTAGVIATH--DSTFKTRVNAYVEWRFAGFVPTVWFPRLNTQERMLTS-----VQRN 288 >UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFX4_PLALI Length = 1567 Score = 71.3 bits (173), Expect = 8e-11, Method: Composition-based stats. Identities = 49/260 (18%), Positives = 81/260 (31%), Gaps = 43/260 (16%) Query: 132 FVPLQDNDRYLTWSQLGLTQQDNGLVS-NVGVGQRWARG--NWLVGYNTFYDNLLDEN-- 186 F+P ++ L ++ + + G NVGVG R + + G + +YD Sbjct: 113 FLPFFRDENSLIFTDIRGLMTNGGKGGANVGVGYRQFVPELDRIFGVSGWYDFDNGHREA 172 Query: 187 LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTA------------------TQEQRMARG 228 + G E+ G YL N Y P E + +G Sbjct: 173 FNQFGVSFESIGRYLDWRVNGYLPVEDNEEISNQILGAAGFQNNFILLNRGRSVDSAYKG 232 Query: 229 YDLTARMRMPFY--QHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTV 286 +D P ++ V + Y V F +G +N VTV Sbjct: 233 FDTEIGGPFPILGRYGMSGYVGMYYYANTDVGSFTGVSGR-----FQQRVNEDLTINVTV 287 Query: 287 TAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLE 346 T H G + + Q + + A + +R D RN T++ Sbjct: 288 TDDHTFGTNAQIQVIADIPNGF-----------PSRWAREKRVRDRLRDPVMRNYRVTVQ 336 Query: 347 YRQRKTLTVFLATPPWDLKP 366 +R ++ A P D P Sbjct: 337 --ERLLVSQEFAIDPEDGNP 354 >UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPY3_9CYAN Length = 1370 Score = 68.6 bits (166), Expect = 5e-10, Method: Composition-based stats. Identities = 43/277 (15%), Positives = 77/277 (27%), Gaps = 55/277 (19%) Query: 111 NASVDVKVDNEGHFTG--SRGSWFVPLQDND-RYLTWSQLGLTQQDNGLVS-NVGVGQRW 166 + G +R F+PL N LT+ + L ++ V N+ G R+ Sbjct: 74 KPRWGIGYSTSGAGYDGFTRLDSFLPLLQNPGSTLTFLEGRLQLDNSANVGGNLLFGHRF 133 Query: 167 ARG--NWLVGYNTFYDNLLDEN--LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT-- 220 N + G +D N + G G E GE + N Y P + Sbjct: 134 YNQSLNRIFGGYLGFDRRDTGNSTFHQLGVGVETLGEVWDVRLNGYFPLGDTRDLVDETA 193 Query: 221 -----------------------------QEQRMARGYDLTARMRMPFYQHLNTSVSLEQ 251 + G+DL R+ + Sbjct: 194 FDTGFQLTDRFFSDHFLVIQGKRQRGQVRHFEAAMTGFDLEVGARLAQWGEGGGLRG--- 250 Query: 252 YFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFG 311 + TG + + + L P + ++ + G N+ G Sbjct: 251 ---YGGLYYYDATGSDSSLGWRMRLEVQPTDSLNFGVALQEDQ------IFGTNVIVSVG 301 Query: 312 VPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYR 348 K S+G S+ D +R + T++ + Sbjct: 302 AIFGKTRSSGNA----SILSRLGDGVERISSITVDSQ 334 >UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root RepID=B0C4D7_ACAM1 Length = 3597 Score = 67.4 bits (163), Expect = 1e-09, Method: Composition-based stats. Identities = 35/274 (12%), Positives = 77/274 (28%), Gaps = 54/274 (19%) Query: 87 LGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQ 146 + + + + + + + + + +P +D+ ++ Sbjct: 203 KSDTSNDTNTEADTSTNLGIPYFVDTEFRGSTRRQFG----GINLRLPFWQDDQSFAFAD 258 Query: 147 LGLTQ-QDNGLVSNVGVGQRWARGN-----WLVGYNTFYDNLLDEN---LQRAGFGAEAW 197 + + + N+G+ R W++G + FYD+ EN + GAE Sbjct: 259 VHFEGGSNETFLGNLGLAYRRILNTSNENPWILGTHAFYDSKRSENGFQYHQGSLGAELV 318 Query: 198 GEYLRLSANFYQPFAA---------------------------WHEQTATQEQRMARGYD 230 + N Y P + T +R G+D Sbjct: 319 NKKFEFRVNGYLPGSNPNVVGQRTINGVLGIQPRANGLGTNIVQQTLTLEARERALAGFD 378 Query: 231 LTARMR--MPFYQHLNTSVSLEQYFGDRVDLFNSGTGY-----HNPVALSLGLNYTPVPL 283 A R L + + +P+ ++ G L Sbjct: 379 FEAGHRHHFNDKVSLGLFGGYFFFDSRETLSIDGPMARTQLEVQDPLGMNGG-------L 431 Query: 284 VTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQ 317 + V + + E+ ++ + L FG P + Q Sbjct: 432 LQVGGRFRFDETRGSELEGFVRLGIPFGGPKRSQ 465 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 40/124 (32%), Gaps = 7/124 (5%) Query: 5 VPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAEIVK 64 V +II LL + ++ + ++ + + E+ A + Sbjct: 129 VQQIIDTPLLRKLNQFRTFVRNVRPGDELDVQAQVSEKNLTPPPGNSSGNLEQQIASTSQ 188 Query: 65 DFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHF 124 G D + A R S Q + + WLS +G A + + VD + Sbjct: 189 LIGSLLAEDMNSE-------QAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDFSL 241 Query: 125 TGSR 128 SR Sbjct: 242 KNSR 245 >UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZRN5_9PLAN Length = 792 Score = 64.7 bits (156), Expect = 7e-09, Method: Composition-based stats. Identities = 49/330 (14%), Positives = 93/330 (28%), Gaps = 50/330 (15%) Query: 78 TGEQAKAFALGKVRDALSQQVNQHVESWLSPWG---NASVDVKVDNEGHFTGSRGSWF-- 132 T + A ++ L + + + G + V+ EG + F Sbjct: 20 TAPASAQQAGDDIQPGLISGTSTFASPYANGQGGEYFPRISVQHRTEGAGYDYSFTDFRA 79 Query: 133 -VPLQD--NDRYLTWSQLGLTQQDNGLVS-NVGVGQRWARGNWL--VGYNTFYDNLLDEN 186 VPL + + + LT+ ++ V N VGQR+ N+ G YDN N Sbjct: 80 WVPLYESYDSKSLTFFDGAFLLANDQNVGMNAVVGQRFYSDNYGRTFGGYVGYDNRDTGN 139 Query: 187 LQ--RAGFGAEAWGEYLRLSANFYQPFA-------------------AWHEQTATQEQRM 225 + G E+ G + N Y P TQ + Sbjct: 140 QTVGQVVTGFESLGR-IDFRVNGYFPTTSDPTMTGQTGFFDPTYVGYNIQLSQLTQYEVA 198 Query: 226 ARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVT 285 +G+D +P H+ + Y G + SG+ + Sbjct: 199 MKGFDAEIGGALP---HVGDYLRA--YLGAY-NFQGSGSPQA--WGWKTRFESHVTDRMR 250 Query: 286 VTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTL 345 + + + G + G + V + + + RN + Sbjct: 251 LYLTVSDDQVFDTNVVFGAAFFF-PGSSAR------RVPQYDNYVNKMDEPIIRNEAIVV 303 Query: 346 EYRQRKTLTVFLATPPWDLKPGETVPLKLQ 375 + ++ AT P + + + Sbjct: 304 NKTNQS--SIVNATNPVSGADQQVIHVDPN 331 >UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickettsiella grylli RepID=A8PQA2_9COXI Length = 642 Score = 64.4 bits (155), Expect = 9e-09, Method: Composition-based stats. Identities = 51/326 (15%), Positives = 91/326 (27%), Gaps = 51/326 (15%) Query: 122 GHFTGSRGSWFVPLQDNDRYLTWSQLGLTQ-QDNGLVSNVGVGQRWARGN-WLVGYNTFY 179 +T + PL + + L+ DN +VG+G RW +VG F Sbjct: 42 SDYTVGQADAMFPLSGDMSRNLYVDPALSYGTDNQNQFDVGLGYRWITNQAAIVGGYFFG 101 Query: 180 DNLLDENLQRA---GFGAEAWGEYLRLSANFYQPFAAWHEQTATQ--------------- 221 +N R G EA+G N Y P H T+ Sbjct: 102 GYSRVDNNARLWIANPGIEAFGSRWDAHLNAYIPMGDRHYTAGTEIVHFFTGHSEFGRVF 161 Query: 222 --EQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYT 279 Q G D+ A ++ + H + L Y+ + +N + GL Y Sbjct: 162 LMHQYAGSGADIKAGYQL--FPHSSLKGYLGSYYFSPAET-------NNVWGGAAGLEYW 212 Query: 280 PVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDNPQR 339 V + + + G+ L + G + L D +R Sbjct: 213 LTQGVKLIGSYSYDNLHHSTYAFGIGLEW--GGLRAHRADPE-------LEERLTDPVER 263 Query: 340 NNLPTLEYRQRKTLTVFL-ATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTP 398 + E + T+ L A P V + L + + + + Sbjct: 264 H---VAELGRGSTIPTRLKAKPLITGDQPTGVIILLG-------NNIAFFSEAGGPNNGG 313 Query: 399 GAQANSAEGWTLIMPDWQNGEGASNH 424 + + + + + N Sbjct: 314 VGLSLANCTFENPCGPTDFSQASVNT 339 >UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174607D Length = 975 Score = 64.0 bits (154), Expect = 1e-08, Method: Composition-based stats. Identities = 42/304 (13%), Positives = 77/304 (25%), Gaps = 91/304 (29%) Query: 158 SNVGVGQRWA--------------------RGNWLVGYNTFYD---NLLDENLQRAGFGA 194 +++G+G R + VG N F D + + G G Sbjct: 144 ASLGLGWRHLFGSQPVSALTRKDAPQASFLEEGFFVGANLFIDMLDTEANNQFWQLGVGI 203 Query: 195 EAWGEYLRLSANFYQPFAAWH--------------EQTAT-------------------- 220 EA YL + N+Y P + T Sbjct: 204 EAGTRYLEVRGNYYIPLSDKQLAEQTRTREILRNSSSRDTTTVSALSDPYATGNTVSQDV 263 Query: 221 --------------------QEQRMARGYDLTARMRMPFY-QHLNTSVSLEQYFGDRVDL 259 + + G+D + +P ++ + + Y D Sbjct: 264 SYRTQRTTTTTTTTIERLFSRYEEGMEGWDTEVAVLVPGLDKYFDLRLIGGYYSFDNQPF 323 Query: 260 FNSGTGYHNPVALSLGLNYTPVPLVTV-TAQHKQGESGENQNNLGLNLNYRF-------G 311 G N G+ PVP V + ++ + G+ L F G Sbjct: 324 GPQTGGTGNVEGWKAGVEVRPVPAVILTGTWYEDDRLTGSDWTAGVQLQLPFELGDLGDG 383 Query: 312 VPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPT-----LEYRQRKTLTVFLATPPWDLKP 366 ++ + L + R N ++ + + +V T P Sbjct: 384 KGFWGRIGDSFKPRRRHLAERLAEPVHRQNAAIKVANTVDVDTKTSTSVQKVTKVVSQTP 443 Query: 367 GETV 370 G+ V Sbjct: 444 GKIV 447 >UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDV5_NEOSM Length = 696 Score = 62.0 bits (149), Expect = 5e-08, Method: Composition-based stats. Identities = 36/260 (13%), Positives = 73/260 (28%), Gaps = 52/260 (20%) Query: 111 NASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGL-VSNVGVGQRW-AR 168 + + N G + S +PL L + L D + G+ R Sbjct: 164 TVTNEFSDSNGGAVSMSEFGALLPLLSKVDNLIYIDLKSKLYDAKEGEVSTGIVFRRQMS 223 Query: 169 GNWLVGYNTFYDNLL--DENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQ----- 221 G N F D + N + G E + + L+ N+Y+ + ++ Sbjct: 224 PLLTGGINVFTDVRFLPEGNYRWYSLGGEIFFKSFSLNGNYYR--SNKKTTISSVKSFEF 281 Query: 222 ------------EQRMA-RGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHN 268 ++R A GYDL + + Y +++ S ++ Sbjct: 282 HDPDPGKAVIVLDERAAGNGYDLGLGLTLNKYINIHGSAFFF---------YSPYNTEEK 332 Query: 269 PVALSLGLNYTPVPLVTVTAQHK---QGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAE 325 G++ + + +S N+ + + N G + L Sbjct: 333 FSGYRAGVDLSLYLNERFSVLVSPEFVADSKRNRFLVNVGFNLPVGRDYTRLLGH----- 387 Query: 326 SQSLRGSRYDNPQRNNLPTL 345 +R+ L Sbjct: 388 -----------VRRDRDIVL 396 >UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E34 Length = 1016 Score = 61.3 bits (147), Expect = 8e-08, Method: Composition-based stats. Identities = 57/454 (12%), Positives = 118/454 (25%), Gaps = 111/454 (24%) Query: 109 WGNASVDVKVDNEGHFTGSRGSWFVPLQDN-------DRYLTWSQLGLTQQDNGLVSN-V 160 G + +K + +T S PL + + + + ++ + G +++ + Sbjct: 52 LGTVTAGLKTSD--AYTDGHFSIVAPLYSTLGADATLEGSVLFIEPYVSYGEGGEIASSL 109 Query: 161 GVGQRWA--------------------RGNWLVGYNTF---YDNLLDENLQRAGFGAEAW 197 G+G R VG + F D + + G G EA Sbjct: 110 GLGFRHLFGSQPLTALSANNTAQAGFLDEGVFVGSSVFVDMLDTEANNQFWQLGVGIEAG 169 Query: 198 GEYLRLSANFYQPFAAW---------------HEQTATQE-------------------- 222 Y+ + N+Y P + ++ + Sbjct: 170 TRYVEVRGNYYIPLSDKQLAEETRTRETIRNSRSRSTSYLTGVSDPYATGNTIAQDAAFT 229 Query: 223 ------------QRMARGYDLTARMR-------MPFY-QHLNTSVSLEQYFGDRVDLFNS 262 +R+ R Y+ +P ++L+ V Y D Sbjct: 230 TRTTTTTYTTTIERLFRRYEEGMEGWDAEVAVLVPGLDRYLDVRVIGGYYSFDNQPFGPQ 289 Query: 263 GTGYHNPVALSLGLNYTPVPLVTV-TAQHKQGESGENQNNLGLNLNYRF-------GVPL 314 G N GL PVP V + ++ + +G+ L F G Sbjct: 290 QGGTGNVEGWKAGLELRPVPAVILTGTWYEDARLTGSDWTVGVQLQIPFEAGDLGDGKNF 349 Query: 315 KKQLSAGEVAESQSLRGSRYDNPQRNNLPT-LEYRQRKTLTVFLATPPWDLKPGETVPLK 373 ++ + L + +R N L + T + Sbjct: 350 LSRVGDAFKPRRRHLAERMAEPVRRQNAAVKLASTVESSSRSQTTRNTDTSTSQSTKMIV 409 Query: 374 LQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP-DWQNGEGASNHWRLSVVV- 431 L +++ + + A S G + + +N + Sbjct: 410 LT-------DDVVFVNNGDAVGNGIQAGDTSGNGANGTAERPYNSLVDGANTASIRSNAS 462 Query: 432 -----EDNQGQRVSSNEITLTLVEPFDALSNDEL 460 QG ++ + D +S+ E+ Sbjct: 463 GQIWKVYTQGDTDLPYTGSVIVTGSTDFISSFEV 496 >UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZT8_9GAMM Length = 785 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 32/236 (13%), Positives = 56/236 (23%), Gaps = 35/236 (14%) Query: 101 HVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNV 160 V +W PW + V +PL N + ++ L + + Sbjct: 31 QVWAWGGPW-KPRQTLNVQGGHGMQDYY-DALLPLSGNAERMLYANGALAATHHETGGEL 88 Query: 161 GVGQRW--ARGNWLVGYNTFYDNLLDENL---QRAGFGAEAWGEYLRLSANFYQPFA--- 212 G+G R +++G + G E +G A+ Y P + Sbjct: 89 GLGYRHIILNNEYVIGGFALMGRYQTNYHNMFNQLTLGTEFFGSIWEGRAHLYLPVSRRT 148 Query: 213 AWHEQTA---------------TQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRV 257 + + T + G D+ +P L Sbjct: 149 KFVRSRSEGLSFQGHKLFGIQTTTYEHAEGGADVEIGHVIPGIPKLRGFAGYYN------ 202 Query: 258 DLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVP 313 G + N Y T T N + + G P Sbjct: 203 --NGLGNEHKNINGGYGRFEYRYNNHFTFTLG--DSYDRYQGNFFAIGVRMNIGSP 254 >UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZV2_9CYAN Length = 1059 Score = 59.7 bits (143), Expect = 2e-07, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 59/187 (31%), Gaps = 31/187 (16%) Query: 132 FVPLQDND-RYLTWSQLGLTQQDNGLVS-NVGVGQRWARG--NWLVGYNTFYDNLLDEN- 186 FVP+ N +T+ + L + + + +GQR+ N ++G YD N Sbjct: 72 FVPITQNPGSTVTFLEGQLRLFTDSTMGGTILLGQRFYNSTQNRILGGYLSYDTRDTGNS 131 Query: 187 -LQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT------------------QEQRMAR 227 + G G E G+ L N Y P + + + Sbjct: 132 LFHQIGAGFERLGDDWDLRVNAYLPVGERRPEVDESFSLRGFQENNLLLNHRQRFEAAMA 191 Query: 228 GYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVT 287 G+D+ A R+ + G + G G N + + L P + + Sbjct: 192 GFDIEAGGRL-------LRLGAGDLRGYAGFYYYGGEGTDNAIGIRGRLEAHPTDYLNLG 244 Query: 288 AQHKQGE 294 + + Sbjct: 245 LSVQTDQ 251 >UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C500_9PLAN Length = 1337 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 41/257 (15%), Positives = 76/257 (29%), Gaps = 40/257 (15%) Query: 136 QDNDRYLTWSQLGLTQQD-NGLVSNVGVGQRWAR--GNWLVGYNTFYDNLLDEN--LQRA 190 +D + + L + + L G+G R+ + + G + +YD + Q+ Sbjct: 143 NPDDAGMMFGNFRLWRTNRGNLGGGAGLGYRFYNYDTDRIFGTSFYYDRDDSTDKIFQQL 202 Query: 191 GFGAEAWGEYLRLSANFYQPFAAWHEQTA---------------TQEQ-----RMARGYD 230 E G Y + NFY P +Q +Q + RG+D Sbjct: 203 ALNVETMGRYWDANGNFYLPIGNREQQLNLEFNDGSQRFSGFNVLYDQTRTIGKSMRGFD 262 Query: 231 LTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQH 290 +P + L Y G + + L PL V + Sbjct: 263 AEIG--VPIWGELAQQFQARAYAGTYGF---QASESADVWGWRGRL--QAYPLPNVLTEL 315 Query: 291 KQGESGENQNNLGLNLNYRF-GVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQ 349 N+ N+ + F G P Q+ + ++N +E + Sbjct: 316 SITSDDTFNTNVFFNVTWTFAGRPEWNQMEKSTQMYR------MAERVRKNYNVVVE-QS 368 Query: 350 RKTLTVFLATPPWDLKP 366 + + +A P P Sbjct: 369 KVVDSGLVAINPETGLP 385 >UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YK40_ANASP Length = 1487 Score = 59.0 bits (141), Expect = 4e-07, Method: Composition-based stats. Identities = 46/304 (15%), Positives = 78/304 (25%), Gaps = 39/304 (12%) Query: 93 ALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSW--FVPLQDND-RYLTWSQLGL 149 + S Q S + V V+ EG S S+ F+P+ LT+ Q L Sbjct: 18 SASTVSAQTPASTTAQVFTPRVGVRYTTEGAGYESFSSFEGFLPVLQIPGNSLTFLQGKL 77 Query: 150 TQQDNGLVS-NVGVGQRWARG--NWLVGYNTFYDNLLDENLQ--RAGFGAEAWGEYLRLS 204 ++ ++ N+ +G R N ++G Y + G G E G Sbjct: 78 LLDNDSNLATNILLGHRIFSEEANRVIGGYISYSTRDTGKSNFDQLGLGFETLG-VWDFR 136 Query: 205 ANFYQPFAAWHE-----------------QTATQEQRMARGYDLTARMRMPFYQHLNTSV 247 N Y P Q + + G D R+ S Sbjct: 137 FNAYLPLNGSENQVEQANLPFFQGDSLMVQRSRFLEVAMSGVDAEVGTRLASL----GSG 192 Query: 248 SLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLN 307 L Y G + SG + P + + S +N + L Sbjct: 193 DLRGYAGV---YYYSGGESREAFGWKTRIEARPNDFLGFSL------SLQNDDLFDTRLV 243 Query: 308 YRFGVPLKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPG 367 + G + S + + R + + P + Sbjct: 244 FSIGANFPGSGARRGKPSKNSALTRMAQSVDNQATILVAVENRTDVFAAMVEPEEEGSTN 303 Query: 368 ETVP 371 Sbjct: 304 SIDT 307 >UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK4_9PLAN Length = 786 Score = 58.6 bits (140), Expect = 4e-07, Method: Composition-based stats. Identities = 26/98 (26%), Positives = 38/98 (38%), Gaps = 8/98 (8%) Query: 127 SRGSWFVPL--QDNDRYLTWSQLGL--TQQDNGLVSNVGVGQRWARGNWL--VGYNTFYD 180 S F+PL ++ +LT+ L Q+ L SNVG G R W +G +YD Sbjct: 72 SSLDGFLPLLEAEDGNWLTFLDARLLLDDQNQNLGSNVGFGARQYLPEWGRTIGGYVYYD 131 Query: 181 NLLD--ENLQRAGFGAEAWGEYLRLSANFYQPFAAWHE 216 N + G E G+ N+Y P + Sbjct: 132 TRDTGTRNFSQVSGGIETLGDLWDARLNWYVPTGSRRS 169 >UniRef50_B9KFW6 Putative uncharacterized protein n=1 Tax=Campylobacter lari RM2100 RepID=B9KFW6_CAMLR Length = 276 Score = 57.0 bits (136), Expect = 2e-06, Method: Composition-based stats. Identities = 35/250 (14%), Positives = 79/250 (31%), Gaps = 33/250 (13%) Query: 94 LSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQD 153 LS + + L+ + +D + D + + Sbjct: 58 LSSFASNSLGQVLNLSSKDKTEANLDYGA--LNVNIKNINSILDYENATLLLEKQARIGQ 115 Query: 154 NGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEA-WGEYLRLSANFYQPFA 212 ++G+ R+ + +G+N F D E ++ FGAE + Y + N Y Sbjct: 116 LEQAYSLGLINRYEFDEFNLGFNYFND-QYKEAYEKNSFGAEFQFSRYFKAYVNHY---N 171 Query: 213 AWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVAL 272 + + L +P+ LN + N + Sbjct: 172 IKENDSEDSTE-------LGLMFDLPYLNILNVN-------------SNIKELQN----- 206 Query: 273 SLGLNYTPVPLVTVTAQHKQGE-SGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRG 331 + Y+P+ ++ ++ ++ + S ++Q + + + L KQ ++ + Sbjct: 207 QYNITYSPISILDLSLNYQDEKTSAKDQTAMWVRFRLNYEQSLSKQFYNSLYRKNNIGKF 266 Query: 332 SRYDNPQRNN 341 +RYD R Sbjct: 267 NRYDFATRTY 276 >UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK3_9PLAN Length = 967 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 40/270 (14%), Positives = 76/270 (28%), Gaps = 42/270 (15%) Query: 127 SRGSWFVPL--QDNDRYLTWSQLGLTQQDNGLVS--NVGVGQRWARGNW--LVGYNTFYD 180 S F PL ++ +LT+ L D+ NVGVG R + +G +YD Sbjct: 75 SSFDAFFPLLEGEDSDWLTFIDARLLLGDDNHNLGSNVGVGARQYIPEYQRTIGAYIYYD 134 Query: 181 NLLDENLQ--RAGFGAEAWGEYLRLSANFYQPFAAWHEQTAT------------------ 220 + G E G+ N+Y P Q AT Sbjct: 135 TRDAGYASFDQVSGGIETLGDIWDARLNWYVPTGQTRNQYATTHTSGGSYKFVGHYLTGG 194 Query: 221 ----QEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGL 276 Q +G D+ A + + ++ ++ G+ + Sbjct: 195 TFTRYYQAAMKGLDMEAGAKFYSNESMDLRA-YAGWY----HFQAKGSEQA--WGWKSRI 247 Query: 277 NYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRYDN 336 +V++ + N + + + L+ + A + Sbjct: 248 ESRISDMVSLNLGVQNDRVFNTTVNFAVGIQWPSITGLRGGPRSDLKAWDRLGES----- 302 Query: 337 PQRNNLPTLEYRQRKTLTVFLATPPWDLKP 366 P+R + ++ + L P P Sbjct: 303 PERLRSIVVANQEIQDSDGGLVIDPTTGLP 332 >UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA61_9CHLA Length = 188 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 17/73 (23%), Positives = 24/73 (32%), Gaps = 4/73 (5%) Query: 139 DRYLTWSQLGLT-QQDNGLVSNVGVGQRW-ARGNWLVGYNTFYDNLLDEN--LQRAGFGA 194 + +S L + N GVG R + N FYD+ + G G Sbjct: 57 EDITIFSDLKGHWLTRHHYAVNAGVGFRKIYAPQTIWDANLFYDHPKSSYDHYNQVGLGL 116 Query: 195 EAWGEYLRLSANF 207 E + E L N Sbjct: 117 ELFHELWELRLNG 129 >UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746965 Length = 1076 Score = 56.7 bits (135), Expect = 2e-06, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 43/134 (32%), Gaps = 31/134 (23%) Query: 113 SVDVKVDNEGHFTGSRGSWFVPLQDN-------DRYLTWSQLGLTQQDNGLVS-NVGVGQ 164 +V+ V + +T S P+ + + + + + + G ++ ++G+G Sbjct: 50 TVNAGVKSSDAYTDGNFSIVAPVWSSLGAEGTLSGGVLFLEPYTSYGEGGEIAASLGLGY 109 Query: 165 RW--------------------ARGNWLVGYNTFYD---NLLDENLQRAGFGAEAWGEYL 201 R+ VG N F D D + G G E YL Sbjct: 110 RYLFGAQPISALTRKDAPQAGFFEEGVFVGTNVFIDMLDTEADNQFWQLGVGVEFGNRYL 169 Query: 202 RLSANFYQPFAAWH 215 N+Y P + Sbjct: 170 EFRGNYYIPLSDKQ 183 >UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QR03_CYAP0 Length = 1985 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 47/315 (14%), Positives = 80/315 (25%), Gaps = 73/315 (23%) Query: 135 LQDNDRYLTWSQLGLTQQDNGLV---SNVGVGQRWARGNW--LVGYNTFYDNLLDENL-- 187 LQ ++ LT+++ + + +N VG R + + G YD + Sbjct: 129 LQIDENQLTFTEGRVLASTHDAEDIRANFLVGHRLYSQDHDRVYGAYIGYDLRDTKYNKF 188 Query: 188 QRAGFGAEAWGEYLRLSANFYQPFAAW--------------------------------- 214 + G G E G + N Y P Sbjct: 189 NQFGVGLETLGSFWDARFNAYIPLGTTQQQIGQTNTDLNPIINTISVDKFGFQRNFLVFE 248 Query: 215 --------HEQTATQEQRMARG--YDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGT 264 Q + G +D+ A+ + L Y G GT Sbjct: 249 GVTIQQQRQNQITRTYETALFGLDWDVGAK-----ILQIGGQGDLRGYVG-GYYYEMQGT 302 Query: 265 GYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVA 324 + L L P V V S ++ + G N+ +R G + Sbjct: 303 QEDDVWGWRLRLEAKPTDTVRVGL------SVQDDDTFGTNVVFRVGANFPG--TGPRDT 354 Query: 325 ESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQ 384 + + D R + + + + V P E + L I + Sbjct: 355 KVNEVWARMGDWVTRQDNIVINEFEESEIIVS---------PVEFLTLSESIVAINPTTN 405 Query: 385 LIWQGDTQILSLTPG 399 W L G Sbjct: 406 QPWIFRHVNLGQGGG 420 >UniRef50_A8PQI7 Putative outer membrane autotransporter barrel domain n=5 Tax=Rickettsiella grylli RepID=A8PQI7_9COXI Length = 1171 Score = 55.9 bits (133), Expect = 3e-06, Method: Composition-based stats. Identities = 43/253 (16%), Positives = 75/253 (29%), Gaps = 43/253 (16%) Query: 112 ASVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQ-QDNGLVSNVGVGQRWA-RG 169 A V + + +PL + ++ + LT + ++G+G RW G Sbjct: 34 ARFSGNVYGSTKYVVGQADAMLPLVGDAQHNFYIDPALTSGSNWEGHGDLGLGYRWIQNG 93 Query: 170 NWLVGYNTFYDNLLDENLQRA---GFGAEAWGEYLRLSANFYQ----------------P 210 + ++G F + +N R G EA G N Y Sbjct: 94 SAILGGYLFGEYNRMDNNVRIWTMNPGIEALGSRWDAHLNGYFVMDNRSKVVGTDLEFVR 153 Query: 211 FAAWHEQTATQE--QRMARGYDLTARMRM-PFYQHLNTSVSLEQYFGDRVDLFNSGTGYH 267 F + Q + G D+ ++ P S S Sbjct: 154 FRGHSAVYNLFDVTQNVGNGGDVKLGYQLFPKTPLKAFVGSYFF----------SPAETK 203 Query: 268 NPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQ 327 N + ++GL Y V V A + + + +LGL + G + Sbjct: 204 NILGGAVGLEYWANRNVKVFASYTYDKLRRSVGSLGLGV--ELGGTHVHRSDP------- 254 Query: 328 SLRGSRYDNPQRN 340 S+ D +RN Sbjct: 255 SIEERITDPLERN 267 >UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PN48_9COXI Length = 607 Score = 54.3 bits (129), Expect = 9e-06, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 70/251 (27%), Gaps = 48/251 (19%) Query: 120 NEGHFTGSRGSWFVPLQDNDRYLTWSQ-LGLTQQDNGLVSNVGVGQRWA-RGNWLVGYNT 177 G +T R V L + ++ + G D +VG+G RW +VG+ Sbjct: 42 YTGVYTVGRADLMVSLDGDGQHNLYVDPQGGYGTDQEWYGDVGLGYRWISNDAAIVGWYV 101 Query: 178 FYDN---LLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQTATQE------------ 222 F + G E G N Y P A + E Sbjct: 102 FAGHSCVENSSGFWITNPGVEIMGSRWDARINAYIPVAGRSDDLGGIESTTAGPSFFTGH 161 Query: 223 --------------QRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGYHN 268 Q++ G D ++ + + + YF + N Sbjct: 162 SELRTVSFTAFNEVQQVGNGADARVGYQL--FSGVPLKAVVGAYFFEIPH-------AEN 212 Query: 269 PVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQS 328 G++Y V V A++ +Q GL ++ FG S Sbjct: 213 VRGGGAGVDYWFDDYVRVFARYNYDNRQHSQVVGGLGIS--FGGVRNGHW------ADPS 264 Query: 329 LRGSRYDNPQR 339 L D +R Sbjct: 265 LSERLTDPVER 275 >UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=27 Tax=Cyanobacteria RepID=Q0IAR8_SYNS3 Length = 401 Score = 54.0 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 44/314 (14%), Positives = 84/314 (26%), Gaps = 62/314 (19%) Query: 46 LGMAPENHDGEKHFAEIVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESW 105 LG+ A +D G + EQ + ++ L Q++ + Sbjct: 7 LGLLASAISVASLPAIAQEDGGAALLRQQRDKLLEQIEQLKQR--KEQLEAQISGSAQGK 64 Query: 106 LSPWGNASVDVKVDNEGHF------------TGSRGSWFVPLQDNDRYLTWSQLGL---- 149 + + + + ++ + F+PL + + + Sbjct: 65 DDAFDLQEISLNDAVKFNWGFQGALQGAGTPNQAGIGGFLPLSVGENSVWFLDALANANF 124 Query: 150 -TQQDNGLVSNVG-----------VGQRWARGN--WLVGYNTFYDN-------------- 181 ++N + N +G RW G+ W+ G N YD+ Sbjct: 125 SDYENNSSIINTDVAGTTISTSSRLGYRWLNGDRSWMYGLNAGYDSRPMNTGGTDTGINV 184 Query: 182 ---LLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQ-TATQEQRMARGYDLTARMRM 237 Q+ AEA L+A P + + + Y L + Sbjct: 185 SGTEKSAFFQQVVVNAEAVSNDWNLNAYALIPIGDTEQDLNSFYQGGALNTYGLDVGYFI 244 Query: 238 PFYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGE 297 LN SV N G + + + Y +T Sbjct: 245 --TPELNASVGYY--------YQNGDLGSADGSGVLGRVAYEISNGLTAGVN--ISYDEA 292 Query: 298 NQNNLGLNLNYRFG 311 + + +L RFG Sbjct: 293 FETRVSADLKVRFG 306 >UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K1T2_CYAP8 Length = 1873 Score = 52.8 bits (125), Expect = 3e-05, Method: Composition-based stats. Identities = 39/270 (14%), Positives = 71/270 (26%), Gaps = 60/270 (22%) Query: 135 LQDNDRYLTWSQLGLTQQDNGLV---SNVGVGQRWARGNW--LVGYNTFYDNLLDENL-- 187 LQ ++ LT+++ + + +N VG R + + G YD + Sbjct: 129 LQIDENQLTFTEGRVLASTHDAEDIRANFLVGHRLYSQDHNRVYGAYIGYDLRDTKYNKF 188 Query: 188 QRAGFGAEAWGEYLRLSANFYQPFA----------------------------------- 212 + G G E G++ N Y P Sbjct: 189 NQFGVGIETLGDFWDARFNAYIPLGTTQQQIGQTNTALNPIINTITVNQFGFQRNFLVFE 248 Query: 213 ------AWHEQTATQEQRMARGYDLTARMRMPFYQHLNTSVSLEQYFGDRVDLFNSGTGY 266 Q + + G D ++ + L Y G GT Sbjct: 249 EVTIQQQKQNQITRRYETALFGLDWDVGAKI---LQIGEQGDLRGYVG-GYYYEMQGTQE 304 Query: 267 HNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAES 326 + L L P V V S ++ + G N+ +R G + + Sbjct: 305 DDVWGWRLRLEAKPTDTVRVGL------SVQDDDTFGTNVVFRVGANFPG--TRPRDTKV 356 Query: 327 QSLRGSRYDNPQRNNLPTLEYRQRKTLTVF 356 + D R + + + + V Sbjct: 357 NEVWARMGDWVTRQDNIVINEFEESEIIVS 386 >UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MK14_SALAR Length = 110 Score = 48.6 bits (114), Expect = 5e-04, Method: Composition-based stats. Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 2/40 (5%) Query: 258 DLFNSG--TGYHNPVALSLGLNYTPVPLVTVTAQHKQGES 295 +F G NP A++LGLNY PVPLVT+ + G++ Sbjct: 3 GIFGDGEADRQRNPHAIALGLNYPPVPLVTIGVNQRMGQN 42 >UniRef50_Q11VX9 CHU large protein; candidate pectate lyase, polysaccharide lyase family 1 protein n=2 Tax=Bacteroidetes RepID=Q11VX9_CYTH3 Length = 991 Score = 47.4 bits (111), Expect = 0.001, Method: Composition-based stats. Identities = 23/127 (18%), Positives = 43/127 (33%), Gaps = 21/127 (16%) Query: 340 NNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG-IRQLIWQGDTQILSLTP 398 NN P T T+ G + L + G I+++ + T +L Sbjct: 350 NNNP--------TATLSAPANGASGCIGTAITLTASAQDSDGTIQKVDFYNGTALL---- 397 Query: 399 GAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVE-PFDALSN 457 G+ S +++ A+ + DN G +S T+T+ P ++S+ Sbjct: 398 GSDNTSP--YSIT-----YTPTAAGTLSIKATATDNAGGTGTSATNTVTVSALPTASVSS 450 Query: 458 DELRWEP 464 P Sbjct: 451 STTNLCP 457 >UniRef50_A1AQZ5 Fibronectin, type III domain protein n=2 Tax=Desulfuromonadales RepID=A1AQZ5_PELPD Length = 1141 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 16/125 (12%) Query: 340 NNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPG 399 NN T LT L+ T+ + G+ + + +L Sbjct: 388 NNDITAPT---VALTAPLSNSIVSG----TITVSAGASDNVGVNMVEVYANGALL----- 435 Query: 400 AQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDE 459 A++A + W + A+ + L+ D+ G +S+ +T+ + + Sbjct: 436 -FASNASPFNFT---WDTTQVANGSYTLTARAVDSSGNIGTSSTVTVNVQNADTTAPSIS 491 Query: 460 LRWEP 464 P Sbjct: 492 AFSLP 496 >UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellular organisms RepID=B4D818_9BACT Length = 5429 Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 21/103 (20%), Positives = 35/103 (33%), Gaps = 5/103 (4%) Query: 113 SVDVKVDNEGHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDN-GLVSNVGVGQRWARG-- 169 V ++ H VPL + + L+ D ++G G R Sbjct: 74 RVTFGLEFYEHQIDESLDTLVPLATPQNGVLYFNPKLSLSDRLNPSVSIGFGYRHLLKAR 133 Query: 170 NWLVGYNTFY-DN-LLDENLQRAGFGAEAWGEYLRLSANFYQP 210 G + D D ++ + G GAE ++ AN+Y P Sbjct: 134 RSSSGETSLRSDYTNFDHHVNQFGVGAEVMSRWVDFRANYYLP 176 >UniRef50_A5G3Y8 Fibronectin, type III domain protein n=2 Tax=Geobacter RepID=A5G3Y8_GEOUR Length = 482 Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 18/110 (16%), Positives = 33/110 (30%), Gaps = 9/110 (8%) Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP 413 TV L P + TV + G+ ++ + + +LS A + Sbjct: 220 TVSLTAPGNNTTVSGTVVITASAGDNVGVGRVEFYANGVLLSAGNVAPYSYN-------- 271 Query: 414 DWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWE 463 W A+ + L D G S T+T+ + + Sbjct: 272 -WNTAAVANGSYTLVAKAYDAAGNVGQSTVATVTVNNTVADTTAPTVSIS 320 Score = 42.4 bits (98), Expect = 0.037, Method: Composition-based stats. Identities = 13/96 (13%), Positives = 33/96 (34%), Gaps = 9/96 (9%) Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP 413 TV ++ P T + G+ ++ + + +++ S ++ Sbjct: 316 TVSISAPANGATVSGTASVTASSGDNVGVTKVEFYVNGSLMA----TDTASPYSFS---- 367 Query: 414 DWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 W A+ + L+ D G + +T+T+ Sbjct: 368 -WNTASAANGSYTLTAKAYDAAGNVGQATAVTVTVS 402 >UniRef50_B9XJ25 Na-Ca exchanger/integrin-beta4 n=1 Tax=bacterium Ellin514 RepID=B9XJ25_9BACT Length = 888 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 12/97 (12%), Positives = 31/97 (31%), Gaps = 1/97 (1%) Query: 353 LTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIM 412 + + T V L + + + + T L Q+ T + Sbjct: 148 IRLLYPTNNQTFTAPTNVTLYASVTDSNLVTTVQFFAGTNNLGTVTNTQSAPPTNATSSI 207 Query: 413 PDWQN-GEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 ++ + + L+ + D+ G +S I++ + Sbjct: 208 TFYKIWSNVLAGTYTLTAIATDSTGHTATSAPISIVV 244 >UniRef50_Q08MX9 Chitinase c n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08MX9_STIAU Length = 454 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 30/97 (30%), Gaps = 5/97 (5%) Query: 369 TVPLKLQIRSRYG-IRQLIWQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRL 427 L G + ++ + ++ NS E ++ W + A L Sbjct: 76 RRTLLATAEDDSGKVAKVEFYVSGALVCTDG-TDRNSGEAFSC---AWDSASTAQGSHSL 131 Query: 428 SVVVEDNQGQRVSSNEITLTLVEPFDALSNDELRWEP 464 + D G SS I ++ P A + + P Sbjct: 132 TARAYDAAGNASSSEPIAFSVPAPNRAPTVTGVTASP 168 >UniRef50_Q3A4E5 Pectate lyase protein n=2 Tax=Pelobacter carbinolicus DSM 2380 RepID=Q3A4E5_PELCD Length = 1031 Score = 45.1 bits (105), Expect = 0.006, Method: Composition-based stats. Identities = 10/98 (10%), Positives = 36/98 (36%), Gaps = 9/98 (9%) Query: 352 TLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLI 411 + ++ +T+ + + + ++ + + + ++ ++ Sbjct: 396 VAAITTPLSGLEVTSAQTLTIVAEASDNVAVSKVEFYDGSNL------IGTDTTNTYSAS 449 Query: 412 MPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLV 449 + E + L+VV D G + +S+ +T+T+ Sbjct: 450 L---DVSETDNGTHNLTVVAYDEAGNQTTSSPVTVTVN 484 >UniRef50_A5G3Y9 Fibronectin, type III domain protein n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5G3Y9_GEOUR Length = 675 Score = 44.7 bits (104), Expect = 0.007, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 40/108 (37%), Gaps = 9/108 (8%) Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP 413 TV +++P + TV + G+ ++ + + + + ++A +T+ Sbjct: 403 TVSISSPAGNTNVSGTVTVSTSASDNVGVTRVEFYVNGVLQA------TDTASPYTV--- 453 Query: 414 DWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELR 461 W A+ + L+ D G S +T+T+ + + Sbjct: 454 SWNATAVANGTYILTAKAYDAAGNVGQSGNVTVTVNSTAADTTPPTVT 501 Score = 43.9 bits (102), Expect = 0.012, Method: Composition-based stats. Identities = 21/108 (19%), Positives = 36/108 (33%), Gaps = 9/108 (8%) Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP 413 TV L P + TV + G+ ++ + G+ +LS A + Sbjct: 215 TVSLTAPGNNATVSGTVAITASAGDNVGVSKVEFYGNGVLLSAGNVAPYSYN-------- 266 Query: 414 DWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELR 461 W A+ + L+ V D G S +T+TL + Sbjct: 267 -WNTASVANGGYTLTAKVYDAAGNVGQSGNVTVTLNNTAADTMPPTVT 313 Score = 43.6 bits (101), Expect = 0.017, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 65/201 (32%), Gaps = 18/201 (8%) Query: 271 ALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQS-- 328 A ++ L + P + A +K ++ V + + + ++ Sbjct: 36 ATNVALQWNPNTDANL-AGYKVYYQADSSTTPFNGTGSPMDVSNQTTATVSNLDPGRTYY 94 Query: 329 LRGSRYDN--PQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLI 386 S YD + + + + TV L+ P + T+ + G+ ++ Sbjct: 95 FAVSAYDTSGVESSYSNIVTVPESVLPTVSLSYPANNTTASGTLSVTASAGDNVGVTKVE 154 Query: 387 WQGDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITL 446 + + + G ++ ++ W A+ ++ L D G S+ +++ Sbjct: 155 FYVNGVL----NGTDTSTPYIYS-----WNTSSLAAGNYTLMAKAYDAAGNVDQSSNVSV 205 Query: 447 TL----VEPFDALSNDELRWE 463 T+ P +L+ Sbjct: 206 TVVNDTTPPTVSLTAPGNNAT 226 >UniRef50_A9B6Z4 Penicillin-binding protein, 1A family n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B6Z4_HERA2 Length = 952 Score = 43.9 bits (102), Expect = 0.014, Method: Composition-based stats. Identities = 12/101 (11%), Positives = 33/101 (32%), Gaps = 15/101 (14%) Query: 348 RQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEG 407 R T+++ L +P + + + + ++ W + Sbjct: 866 RTAPTISINLPASALRGQP---LQFQATAQDDRQLAKVEWTVN-------GEVFMRDQAP 915 Query: 408 WTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTL 448 ++L + ++R+ D G R +S+ L++ Sbjct: 916 YSL-----DFSPTQAGNYRVVATAIDQAGNRATSSVSVLSV 951 >UniRef50_B3BT68 Large repetitive protein n=16 Tax=Bacteria RepID=B3BT68_ECO57 Length = 5188 Score = 43.6 bits (101), Expect = 0.016, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 23/61 (37%), Gaps = 1/61 (1%) Query: 399 GAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR-VSSNEITLTLVEPFDALSN 457 + W+L +P A+N + L+ V D G +S +T+ P + + Sbjct: 3280 QTTVQTDGSWSLTLPASDLTALANNGYTLTATVSDLAGNLGSASKGVTVDTTAPVISFNT 3339 Query: 458 D 458 Sbjct: 3340 V 3340 >UniRef50_Q1IW67 PPC, peptidase containing PKD repeats n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1IW67_DEIGD Length = 343 Score = 42.8 bits (99), Expect = 0.026, Method: Composition-based stats. Identities = 21/108 (19%), Positives = 39/108 (36%), Gaps = 10/108 (9%) Query: 354 TVFLATPPWDLKPGETVPLKLQIRSRYGIRQLIWQGDTQILSLTPGAQANSAEGWTLIMP 413 TV ++ P L TV L G+ ++ + + Q+++ GA +++ + Sbjct: 126 TVSVSAQPSSLTLPGTVTLTATASDDRGVTKVEFYDNGQLVATDTGAPYTASQTYGF--- 182 Query: 414 DWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALSNDELR 461 + + +SV D QG S TL + ND Sbjct: 183 ------ADNGNHIISVKAYDEQGNVGESA-TTLNVAISDANEPNDNPT 223 >UniRef50_B7LJF8 Adhesin for cattle intestine colonization n=14 Tax=Enterobacteriaceae RepID=B7LJF8_ESCF3 Length = 7222 Score = 42.4 bits (98), Expect = 0.038, Method: Composition-based stats. Identities = 12/61 (19%), Positives = 23/61 (37%), Gaps = 1/61 (1%) Query: 399 GAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQR-VSSNEITLTLVEPFDALSN 457 + W+L +P A+N + L+ V D G +S +T+ P + + Sbjct: 5208 QTTVQADGSWSLTLPASDLTALANNGYTLTATVSDLAGNPGSASKGVTVDTTAPVISFNT 5267 Query: 458 D 458 Sbjct: 5268 V 5268 >UniRef50_Q11W88 Endoglucanase-related protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11W88_CYTH3 Length = 1295 Score = 42.0 bits (97), Expect = 0.047, Method: Composition-based stats. Identities = 7/39 (17%), Positives = 17/39 (43%) Query: 414 DWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPF 452 + A + ++ + DN G + +S +T+ + P Sbjct: 760 SYSWTNVAVGTYTITAIATDNSGNKKTSAPVTIKVNVPQ 798 >UniRef50_B3E8T7 Multicopper oxidase type 2 n=3 Tax=Geobacteraceae RepID=B3E8T7_GEOLS Length = 1601 Score = 41.6 bits (96), Expect = 0.071, Method: Composition-based stats. Identities = 24/128 (18%), Positives = 47/128 (36%), Gaps = 13/128 (10%) Query: 339 RNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRSRYG--IRQLIWQGDTQILSL 396 R+N+ ++ Q T+ + T PG T+PL + G I ++ + T +LS Sbjct: 1105 RSNIISVTAVQAPTVDLTSPTNGTVYWPGTTIPLAATATAPNGSTITKVEFYDGTTLLS- 1163 Query: 397 TPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 +S + S L+ + G V+S+ + ++L P + + Sbjct: 1164 ---TDTSSPYS-------YNWATATSGSHSLTAKAYASNGAVVTSSAVAISLTAPVSSAT 1213 Query: 457 NDELRWEP 464 P Sbjct: 1214 LSATPSSP 1221 >UniRef50_Q1H368 Phosphate-selective porin O and P n=1 Tax=Methylobacillus flagellatus KT RepID=Q1H368_METFK Length = 473 Score = 41.2 bits (95), Expect = 0.087, Method: Composition-based stats. Identities = 19/99 (19%), Positives = 35/99 (35%), Gaps = 10/99 (10%) Query: 222 EQRMARGYDLTARMRMPFYQHLNTSVSLEQY-FGDRVDLFNSGTGYHNPVALSLGLNYTP 280 E+ G + A R+ + + ++ + D F S ++ G + P Sbjct: 375 EETSLNGGYIQAMYRITDFFGYGVMLPFVKWQYYDGAQKFESNAPQNHVNDWEAGFEWQP 434 Query: 281 VPLVTVTAQHKQGE---------SGENQNNLGLNLNYRF 310 VP + TA + + + S N + L L Y F Sbjct: 435 VPEIEFTAYYSKLDRNNLATAPYSKYNTDILRFQLQYNF 473 >UniRef50_C9XYC9 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9XYC9_CROTZ Length = 3864 Score = 41.2 bits (95), Expect = 0.087, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 62/192 (32%), Gaps = 17/192 (8%) Query: 275 GLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKKQLSAGEVAESQSLRGSRY 334 +N+ P +T+T KQ + + ++ VP L + + ++ G+ Sbjct: 1129 SVNFAPQTTLTITLNGKQYTAATGPDG-----SWSVTVPRVDALDISDGKATLTVSGA-- 1181 Query: 335 DNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQIRS------RYGIRQLIWQ 388 N + +T L + + + ++ + G+ Sbjct: 1182 ---DENGAVVSGNQSFTIITTDLPDVTLNTPFTDGIISAAEVSAGGALSGSTGVNGAGQT 1238 Query: 389 GDTQILSLTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSS-NEITLT 447 Q T A +S+ W + +P L V D G + +S + IT+ Sbjct: 1239 VTVQFNGATYNAVVDSSGNWAVTLPPAALQGLTEGETPLVVTATDAAGNQNTSQSTITVD 1298 Query: 448 LVEPFDALSNDE 459 L P +++ Sbjct: 1299 LSAPVLTVNDIT 1310 >UniRef50_C4SD59 Autotransporter adhesin n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SD59_YERMO Length = 4235 Score = 41.2 bits (95), Expect = 0.091, Method: Composition-based stats. Identities = 31/172 (18%), Positives = 53/172 (30%), Gaps = 18/172 (10%) Query: 294 ESGENQNNLGLNLNYRFGVP---LKKQLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQR 350 +G ++ +P L L +L S D+ N P + Sbjct: 1333 NGKNYTATVGAGGSWSVSLPKADLALLLDGKA-----TLTASATDS---NGNPVSTSSEL 1384 Query: 351 KTLTVFLATPPWDLKPGETVPLKLQI---RSRYGIRQLIWQGDTQILSLTPGAQA---NS 404 L + G+++ K + +S G + G T I++L S Sbjct: 1385 GIYIHNLPNVTLNGPFGDSILSKAEAGISQSLSGTTGITGSGQTVIVTLGGKTYPALVGS 1444 Query: 405 AEGWTLIMPDWQNGEGASNHWRLSVVVEDNQGQRVSSNEITLTLVEPFDALS 456 W+L +P A ++V V D G S +T+ LS Sbjct: 1445 DGNWSLTLPTSVLQGLAQGPQSITVQVTDGGGNTS-SKVTPITVDTVAPELS 1495 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.130 0.341 Lambda K H 0.267 0.0399 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,645,911,485 Number of Sequences: 3077464 Number of extensions: 122313886 Number of successful extensions: 272268 Number of sequences better than 1.0e-01: 173 Number of HSP's better than 0.1 without gapping: 271 Number of HSP's successfully gapped in prelim test: 118 Number of HSP's that attempted gapping in prelim test: 270423 Number of HSP's gapped (non-prelim): 1080 length of query: 464 length of database: 1,040,396,356 effective HSP length: 132 effective length of query: 332 effective length of database: 634,171,108 effective search space: 210544807856 effective search space used: 210544807856 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 95 (41.2 bits)