BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (295 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P36943 Putative attaching and effacing protein homolog ... 385 e-105 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 345 9e-94 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 337 3e-91 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 336 4e-91 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 335 9e-91 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 335 1e-90 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 333 4e-90 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 330 4e-89 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 330 5e-89 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 327 3e-88 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 324 2e-87 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 324 2e-87 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 323 4e-87 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 322 7e-87 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 321 2e-86 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 321 2e-86 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 320 3e-86 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 320 5e-86 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 314 2e-84 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 314 3e-84 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 313 5e-84 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 311 2e-83 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 310 4e-83 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 308 2e-82 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 306 5e-82 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 303 5e-81 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 303 5e-81 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 302 9e-81 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 300 6e-80 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 297 3e-79 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 296 7e-79 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 290 3e-77 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 289 6e-77 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 289 9e-77 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 288 2e-76 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 287 3e-76 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 287 3e-76 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 287 3e-76 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 286 6e-76 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 284 2e-75 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 284 3e-75 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 283 6e-75 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 282 1e-74 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 280 4e-74 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 279 6e-74 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 278 2e-73 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 273 7e-72 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 271 2e-71 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 268 1e-70 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 263 5e-69 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 258 1e-67 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 257 3e-67 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 253 8e-66 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 248 1e-64 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 248 3e-64 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 244 2e-63 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 244 2e-63 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 240 4e-62 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 238 1e-61 UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 232 1e-59 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 229 7e-59 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 228 2e-58 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 220 5e-56 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 212 2e-53 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 209 7e-53 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 206 6e-52 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 191 2e-47 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 185 2e-45 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 177 3e-43 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 158 2e-37 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 157 3e-37 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 149 9e-35 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 148 3e-34 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 145 2e-33 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 137 4e-31 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 137 5e-31 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 133 5e-30 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 133 7e-30 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 133 9e-30 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 132 2e-29 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 131 2e-29 UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured b... 131 2e-29 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 130 6e-29 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 119 1e-25 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 118 3e-25 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 117 4e-25 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 117 6e-25 UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root Re... 115 1e-24 UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylo... 115 2e-24 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 113 5e-24 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 112 1e-23 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 107 4e-22 UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthin... 107 4e-22 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 107 6e-22 UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachl... 106 1e-21 UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickett... 103 5e-21 UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magneto... 102 2e-20 UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuni... 100 5e-20 UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Plancto... 99 1e-19 UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microco... 99 2e-19 UniRef50_A8PQI7 Putative outer membrane autotransporter barrel d... 98 5e-19 UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Plancto... 93 1e-17 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 92 2e-17 UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachl... 92 2e-17 UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodosp... 92 2e-17 UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Plancto... 92 3e-17 UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorick... 91 4e-17 UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microco... 89 1e-16 UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryoc... 87 5e-16 UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachl... 85 2e-15 UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma p... 85 2e-15 UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus ... 85 3e-15 UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Plancto... 81 6e-14 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 80 1e-13 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 80 1e-13 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 78 3e-13 UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microco... 75 3e-12 UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum ... 75 4e-12 UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 74 6e-12 UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=... 72 3e-11 UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickett... 71 4e-11 UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Plancto... 70 7e-11 UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=... 70 1e-10 UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Plancto... 70 1e-10 UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastop... 70 1e-10 UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachl... 68 3e-10 UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legione... 67 6e-10 UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=... 66 1e-09 UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanoth... 62 2e-08 UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyano... 62 2e-08 UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillon... 62 3e-08 UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellu... 57 6e-07 UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=... 54 7e-06 UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Plancto... 53 9e-06 UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6... 49 2e-04 UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmone... 45 0.004 UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=... 44 0.006 UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicib... 43 0.009 UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Ca... 43 0.015 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 295/295 (100%), Positives = 295/295 (100%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT Sbjct: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV Sbjct: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA Sbjct: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI Sbjct: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ Sbjct: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 345 bits (886), Expect = 9e-94, Method: Composition-based stats. Identities = 134/291 (46%), Positives = 184/291 (63%), Gaps = 10/291 (3%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 H + ++ AR ++ PL A + + Sbjct: 78 HMSLEALRKLNQFRTFARGFDHLQPGDELDVPL---------APLPAVTWAEETPVPASA 128 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + ++ + +A A+ AG FL++ P DA + GMAT A+ E+Q+WL ++GTAR++L Sbjct: 129 SKEDLQAQKIAGIASQAGNFLANSPRGDAAASIARGMATGAASTEVQQWLSQFGTARLQL 188 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 +VD FSLK+S L++L P+Y+ P ++FTQG++HRTDDRTQ+N+G G R F + +M G Sbjct: 189 DVDNKFSLKNSQLDLLIPLYEQPDKLVFTQGSLHRTDDRTQTNLGMGMRWF-NDGYMLGG 247 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF+D+DLSR H R+G+G EYWRDYLK+ AN Y+R + W+ S D DYQERPANGWD+ Sbjct: 248 NTFLDYDLSRDHARMGMGVEYWRDYLKIGANNYLRLTNWRDSKDFADYQERPANGWDMSL 307 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG++PA PQLG +L YEQYYG EV LFGKD RQKDPHAI+ V YTP PL Sbjct: 308 EGWVPALPQLGGNLKYEQYYGKEVALFGKDNRQKDPHAITVGVNYTPFPLL 358 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 337 bits (864), Expect = 3e-91, Method: Composition-based stats. Identities = 126/286 (44%), Positives = 177/286 (61%), Gaps = 6/286 (2%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + A ++ P + + + T D+ Sbjct: 89 ALRKLNVLRTFAHGFDNLQPGDELDVPAVMP-----DGKPDSPAKTGDEQAATPPLKDDE 143 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 +A A+ AGT LS+ PD DA + G +A A+ ++Q+WL ++GTARV+L D+ Sbjct: 144 GAMKMADMASRAGTLLSNSPDGDAALSMARGQISAVASGQVQQWLNQFGTARVQLEADEH 203 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FSLK+S +++L P Y+ +LFTQG++HRTDDRTQ+N+GFG R+F+ +M G N F D Sbjct: 204 FSLKNSQVDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAP-SYMLGGNIFGD 262 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 +DLS H+R G+G EYWRD+LKLSANGY+R S W+ SP++++YQERPANGWDIRA+ +LP Sbjct: 263 YDLSHEHSRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANGWDIRAQAWLP 322 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + PQLG L YEQYYG V LFGK+ Q++P AI+A V +TP PL Sbjct: 323 SLPQLGGKLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPLL 368 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 336 bits (863), Expect = 4e-91, Method: Composition-based stats. Identities = 130/286 (45%), Positives = 184/286 (64%), Gaps = 17/286 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ P A ++ + N+ Sbjct: 93 LRKLNQFRTFARGFDNVRQGDELDVP---------------AQVSENNLTPPPGNSSGNL 137 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+DF Sbjct: 138 EQQIASTSQQIGSLLAEDMNSEQAANMARGWASSQASGAMTDWLSRFGTARITLGVDEDF 197 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S + L+P Y+TP N+ F+Q +HRTD+RTQ N G GWRHF+ WM+G+N F DH Sbjct: 198 SLKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTP-TWMSGINFFFDH 256 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DLSR H+R G+GAEYWRDYLKLS+NGY+R + W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 257 DLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWDVRAEGWLP 316 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWP LG L+YEQYYGDEV LF KD RQ +PHAI+A + YTP PL Sbjct: 317 AWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPLM 362 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 335 bits (860), Expect = 9e-91, Method: Composition-based stats. Identities = 109/280 (38%), Positives = 155/280 (55%), Gaps = 12/280 (4%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 ++ ++ ++ P + + N + + ++ +A Sbjct: 9 QFRSFSKPFIQLGSGDEIDIPRITPLP-----------EKITTAENAKTVSSSQYKERLA 57 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 T L+ A + +A +AN Q WL ++GTARV+LN+D + SLK S Sbjct: 58 HNLLKGATVLADDNTPLAAASMARSVAVGEANDAAQHWLSQFGTARVQLNLDNNLSLKGS 117 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 + +ML P+YD ++LF+Q + D R NIG G R N WM G N F D D++ Sbjct: 118 AFDMLLPLYDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQDN-WMYGANVFFDRDITGK 176 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 + RIG GAE W DYLKLSAN Y+R + W +S D DY ERPANG+D+R E YLPA+PQ+G Sbjct: 177 NNRIGFGAEAWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGYDLRVEAYLPAYPQIG 236 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +L YEQY G+EV LFGKD RQK+P+A +A + YTP+PL Sbjct: 237 TNLKYEQYKGNEVALFGKDDRQKNPYAFTAGINYTPIPLI 276 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 335 bits (859), Expect = 1e-90, Method: Composition-based stats. Identities = 132/286 (46%), Positives = 179/286 (62%), Gaps = 13/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ P A +N + Sbjct: 93 LRRLNQFRTFARGFDNVRQGEELDVPATTLQKSHEQQNAV-----------PPANGENTL 141 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E +AS + GT LS +S+ G A+++A+ + +WL +GTA++ L VD+DF Sbjct: 142 ENQIASTSQRVGTLLSQDMNSEQASGMARGWASSEASGAMTDWLNNFGTAKISLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S + L+P YDTP +LF+Q +HRTDDRTQ N G GWRHF+ WM+G+N F DH Sbjct: 202 SLKNSQFDFLHPWYDTPDYLLFSQHTLHRTDDRTQINTGLGWRHFTP-SWMSGINLFFDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DLSR H+R G+GAEYWRDYLKLS+N YI +GW+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLSRYHSRAGLGAEYWRDYLKLSSNAYIGLTGWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG L+YEQYYGDEV LF K+ RQ +PHAI+A + YTP PL Sbjct: 321 AWPQLGGKLVYEQYYGDEVALFDKNDRQSNPHAITAGLNYTPFPLL 366 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 333 bits (854), Expect = 4e-90, Method: Composition-based stats. Identities = 118/279 (42%), Positives = 159/279 (56%), Gaps = 7/279 (2%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y + AR + ++ P+ ++ A +A E Sbjct: 119 YRIFARGFEHVGVGDEIDIPVDMSSLNTQAGQAPKLSSAMREPSRA------EKEAQAVG 172 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 + G LSS S+A MAT AN+EIQ+WL KYGTARV+LN+DK+FSL +S+ Sbjct: 173 QLMSVGATLSSTRPSEAAAGMARSMATNAANEEIQQWLSKYGTARVQLNLDKNFSLSESA 232 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 L+ P++D+ FTQ D R N+G G R + WM GVN F DHDL+ + Sbjct: 233 LDWFIPVWDSANLTAFTQLGARNKDRRNTINLGVGARTLL-DRWMLGVNMFYDHDLTGHN 291 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 +R+G+GAE W DYL+LS NGY+R S W +S D DY ER ANG+DIRA +LPA PQLG Sbjct: 292 SRLGIGAEAWTDYLQLSTNGYMRLSNWHQSRDFADYDERAANGFDIRANAWLPALPQLGG 351 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 L+YEQY G+ V LFGK+ Q++P+A++A V YTP PL Sbjct: 352 KLVYEQYIGENVALFGKENLQRNPYALTAGVNYTPFPLL 390 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 330 bits (845), Expect = 4e-89, Method: Composition-based stats. Identities = 143/286 (50%), Positives = 191/286 (66%), Gaps = 12/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 92 LRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM------ 143 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DF Sbjct: 144 --QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK SS E L+P Y+TP N++F+Q +HRTDDRTQ+N G GWR+F+ + WM+GVN FIDH Sbjct: 202 SLKSSSFEFLHPWYETPDNLVFSQHTLHRTDDRTQTNHGIGWRYFT-SSWMSGVNMFIDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG L+YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 321 AWPQLGGKLVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 330 bits (845), Expect = 5e-89, Method: Composition-based stats. Identities = 141/286 (49%), Positives = 191/286 (66%), Gaps = 12/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 92 LRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM------ 143 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DF Sbjct: 144 --QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK SS E L+P Y+TP N++F+Q +HRTD+RTQ+N G GWR+F+ + WM+GVN FIDH Sbjct: 202 SLKSSSFEFLHPWYETPDNLVFSQHTLHRTDNRTQTNHGIGWRYFT-SSWMSGVNMFIDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG ++YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 321 AWPQLGGKVVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 327 bits (838), Expect = 3e-88, Method: Composition-based stats. Identities = 147/287 (51%), Positives = 185/287 (64%), Gaps = 11/287 (3%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 RF S L R VA I QVLFP+ A ++ + Sbjct: 1 MISARFHSSRLTRAVASLCIVTQVLFPV---------ASTAGHRVAAPQAAPAVLSEQDA 51 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 VA A L S +S G AT+ A QEWL ++GT RV L +D+D Sbjct: 52 TAAQVAGMTTQAAGMLQSGMNSRQAAEMARGYATSTAQSAFQEWLSQWGTVRVTLGLDED 111 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 F+LK S+ ++L P +DTP N+LFTQ + HRTDDR Q N G GWRHF+ + +MAGVN F D Sbjct: 112 FTLKGSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFAPD-YMAGVNLFFD 170 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYL 246 HDL+R H+R+G+G EYWRD LKL ANGY+R SGW+ +P+++ DY+ RPANGWD+RAEGYL Sbjct: 171 HDLTRYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARPANGWDVRAEGYL 230 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PA+PQLGA+LMYEQYYGDEV LFGKDKRQ+DPHA +A ++YTPVPL Sbjct: 231 PAYPQLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLI 277 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 324 bits (831), Expect = 2e-87, Method: Composition-based stats. Identities = 122/285 (42%), Positives = 161/285 (56%), Gaps = 16/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + Y ++ A ++ P + N +V Sbjct: 113 LKKLNAYRTFSKPFASLTTGDEIEVPRKESSFF---------------SNNPNENNKKDV 157 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 + +A A AG LS+ SDA N T + N Q+WL ++GTARV+LNVD DF Sbjct: 158 DDLLARNAMGAGKLLSNDNTSDAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDF 217 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 L +S+L++L P+ D+ +++LFTQ + D R NIG G R + G+ WM G NTF D+ Sbjct: 218 KLDNSALDLLVPLKDSESSLLFTQLGVRNKDSRNTVNIGAGIRQYQGD-WMYGANTFFDN 276 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 DL+ + R+GVGAE DYLK SAN Y +GW +S D Y ERPA+G+DIR E YLPA Sbjct: 277 DLTGKNRRVGVGAEVATDYLKFSANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPA 336 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +PQLG LMYE+Y GDEV LFGKD RQKDPHA++ V YTPVPL Sbjct: 337 YPQLGGKLMYEKYRGDEVALFGKDDRQKDPHAVTLGVNYTPVPLV 381 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 324 bits (830), Expect = 2e-87, Method: Composition-based stats. Identities = 124/296 (41%), Positives = 174/296 (58%), Gaps = 11/296 (3%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 M +++ H + + R + +A I +Q+ P ++ + + +A + S Sbjct: 12 MLFFRSTHMRSKTR-----KLLACIQIVLQLAPPSSLIYLS--SVFNANAEEITSSAEKE 64 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 + +VA A AG+ LSS SDA + + T KA QEWL ++GTARV Sbjct: 65 QGNPSDQNASSVAQTAVQAGSLLSSDNASDALGSAVVSAVTGKAASSAQEWLSQFGTARV 124 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 ++ D+ F+L DS L++L P+Y+ N+LFTQ R DDR N GFG+RHF + WM Sbjct: 125 NISTDEHFTLSDSELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHF-NDGWMW 183 Query: 181 GVNTFIDHDLS-RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWD 239 G N F D +S H R+G+ E DYL +SANGY+R S W S +DY ER A+G+D Sbjct: 184 GTNVFYDRQVSGNQHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFD 243 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 IRA GYLPA+PQLGA+++YEQY+GD VGLFG D RQKDP+A++ + YTPVPL Sbjct: 244 IRATGYLPAYPQLGANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLV 299 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 323 bits (829), Expect = 4e-87, Method: Composition-based stats. Identities = 121/287 (42%), Positives = 161/287 (56%), Gaps = 8/287 (2%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + ++ A + ++ P + + P + + + Sbjct: 117 SQLKKINQFRKFAHGIDKIGAGDEIDIPHSGSSL-------TKPGSPAAATPLSPHADTS 169 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 E VA G L+S S+A ATA AN EI +WL KYGTA+++LN+DK Sbjct: 170 ERESRVAGQLMGVGRVLASPQSSNAASEMARSWATAAANDEIVKWLSKYGTAQLQLNIDK 229 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 +FSL S+L+ L P YDTPT FTQ D R NIG G R S N W+ GVN F Sbjct: 230 NFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTLNIGIGTRTLSNN-WLFGVNAFY 288 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 DHDLS ++R+G+G+E W DYL+LS NGY+R S W +S D+ DY ERPANG+D+RA ++ Sbjct: 289 DHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSRDLADYNERPANGFDVRANAWM 348 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P PQLG LMYEQY+GD VGLFGKD Q++P+A + V YTP PL Sbjct: 349 PTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGVNYTPFPLL 395 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 322 bits (826), Expect = 7e-87, Method: Composition-based stats. Identities = 128/309 (41%), Positives = 180/309 (58%), Gaps = 19/309 (6%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLA-VTFTPVMAARAQHAVQPRLSM---- 57 H + + ++ P + P + A + Sbjct: 83 HLTPEALRKLNQRRTFTYGFDNLQPGDKLNVPAIKLDDEPDVPAARLDNKANLPAARLDN 142 Query: 58 -------------GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA 104 + D+ + +A A+ AG FLS P+ DA + G TA+A Sbjct: 143 KPDVPAIIWGQEGSAASALGDDAGARKMADVASRAGAFLSDNPNGDAALSLARGEVTAEA 202 Query: 105 NQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS 164 + ++Q+WL ++GTARV+L+ D+ FS K+S ++L P+Y+ +++FTQG++HRTDDRTQ Sbjct: 203 SGQLQQWLNQFGTARVQLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQV 262 Query: 165 NIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 N+GFG R+F+ +M G N F D+DLSR+H+R G+G EYWRD+LKLSANGY+R S W S Sbjct: 263 NLGFGLRYFAP-SYMLGGNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNS 321 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 D +DYQERPANGWDIRA+ +LP+ PQLG L YEQYYG V LFGK+ Q+DP AI+A Sbjct: 322 SDFKDYQERPANGWDIRAQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAG 381 Query: 285 VTYTPVPLT 293 V +TP PL Sbjct: 382 VNFTPFPLL 390 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. Identities = 117/279 (41%), Positives = 156/279 (55%), Gaps = 17/279 (6%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y +R ++ P + V + E +A Sbjct: 103 YRTFSRPFTALTTGDEIDIPRKASPFSVDNNKDNRLSV----------------ENTLAG 146 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 A T LS+ + + + A+ + N Q+WL ++GTARV+LN++ DF L S+ Sbjct: 147 HAVAGATALSNGDVAKSGERMVRSAASNEFNNSAQQWLSQFGTARVQLNINDDFHLDGSA 206 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 ++L P+YD ++LFTQ D R N+G G R F GN WM G NTF D+DL+ + Sbjct: 207 ADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQGN-WMYGANTFFDNDLTGKN 265 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 RIGVGAE W DYLKLSAN Y + W +S D DY ERPANG+D+RAE YLP++PQLG Sbjct: 266 RRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYNERPANGYDLRAEAYLPSYPQLGG 325 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 MYE+Y GD+V LFGKD RQK+PHAI+A V YTP+PL Sbjct: 326 KAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIPLV 364 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 321 bits (822), Expect = 2e-86, Method: Composition-based stats. Identities = 124/289 (42%), Positives = 172/289 (59%), Gaps = 16/289 (5%) Query: 6 TGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD 65 + F S+ ++ + W+ I +Q+LFPL F PV AA A + T Sbjct: 5 SIKNNNSFFLSLKSKLIIWSQIVLQILFPLFTVF-PVHAAPAT------TTKETTVAMPY 57 Query: 66 NNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVD 125 + +AS S+ +D ++ TGMAT+ A +Q+WL ++GTARV+LNVD Sbjct: 58 SQELSTLAS---------STASGTDGAKSAATGMATSAAASSVQQWLSQFGTARVQLNVD 108 Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 + + DS++++L P+YD +LFTQ + D RT N+G G R F +WM G N F Sbjct: 109 DNGNWDDSAVDLLAPLYDNKKAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFGGNVF 168 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 D D + + R+G GAE W +YLKLSAN Y+ + W S D DY E+PA+G+DIRAEGY Sbjct: 169 FDDDFTGKNRRVGFGAEAWTNYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIRAEGY 228 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 LPA+PQLGA LMYEQYYGD+V LF D Q +P A++ ++YTPVPL Q Sbjct: 229 LPAYPQLGAKLMYEQYYGDKVALFDTDHLQSNPSAVTTGISYTPVPLVQ 277 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 320 bits (821), Expect = 3e-86, Method: Composition-based stats. Identities = 122/292 (41%), Positives = 168/292 (57%), Gaps = 17/292 (5%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 H ++ ++ + S+L + V WA I +Q+ FPL V P A+ A + +S +T Sbjct: 1 MHEQSIMEKNTLKISLLKKIVIWAQILLQIAFPLLV--LPAHASSGPGATETDMSDASTL 58 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 + + +DA +N T +AT A ++EWL +GTA+V Sbjct: 59 SASLASSAAQ---------------NGADAMKNTATHLATTHAASTVEEWLSHFGTAQVT 103 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 L+VD + + +S+ + L P+YD ++LFTQ I D RT NIG G R F DWM G Sbjct: 104 LDVDDNGNWDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIGLGVRTFYVRDWMFG 163 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F D D + + RIG GAE W +YLKLSAN YI S W S D ++Y E+PA+G+D+R Sbjct: 164 GNVFFDDDFTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGDFDNYNEKPADGYDVR 223 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AEGYLP++PQLGA LMYEQYYGD V LF KD Q +P A++ + YTPVPL Sbjct: 224 AEGYLPSFPQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLNYTPVPLI 275 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 320 bits (819), Expect = 5e-86, Method: Composition-based stats. Identities = 118/279 (42%), Positives = 164/279 (58%), Gaps = 15/279 (5%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y ++ ++ P + + + + ++A +E +AS Sbjct: 71 YRTFSKPFTALTSGDEIDIPRKASPFSIDSEKNKNADVL--------------LENKLAS 116 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 T L++ + ++ I A + N Q+WL ++GTARV++NV+ DF L S+ Sbjct: 117 HVQTGATALATSNAAKSSERMIRSAANNEFNSSAQQWLSQFGTARVQMNVNDDFKLDGSA 176 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 +++L PIYD ++LFTQ D+R NIG G R F N WM GVNTF D+D++ + Sbjct: 177 VDVLVPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQNN-WMYGVNTFFDNDMTGKN 235 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R+GVGAE W DYLKLSAN YI S W +S D DY ERPANG+D+RAE YLP+ PQLG Sbjct: 236 RRVGVGAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRAEAYLPSHPQLGG 295 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYE+Y G+EV LFGKD RQK+PHA++A V YTP+PL Sbjct: 296 KLMYEKYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPLL 334 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 314 bits (805), Expect = 2e-84, Method: Composition-based stats. Identities = 130/291 (44%), Positives = 187/291 (64%), Gaps = 17/291 (5%) Query: 8 HKQPRFRYS-----VLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + S + + VAW+ I++Q L+P ++FTP ++ ++ + Sbjct: 1 MSNKKISRSNGATGPVNKVVAWSTIALQALYPALLSFTPTISH--------ASAVKASQA 52 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 A+ + ++S AA AG + + +F A+A +E+ EWL KYG AR++L Sbjct: 53 AAEQQELRGLSSLAAQAGRSIENG----HAGSFAANTVPAQATKEVVEWLQKYGNARIQL 108 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD FSLKDS+ + LYP D ++LF+Q ++HRTDDRTQ+NIG G+R+F+ ++ M G Sbjct: 109 NVDDAFSLKDSAFDFLYPWIDKKQHVLFSQTSLHRTDDRTQTNIGMGYRYFTADNSMLGA 168 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N F D+DLSR H R+G G EYWRDYL+ AN Y+R S WK S D++DYQERPA+GWDI Sbjct: 169 NLFYDYDLSRHHARMGAGVEYWRDYLRAGANAYLRLSKWKDSHDLDDYQERPADGWDIYT 228 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +G+LP++PQLGASL YE+YYG VGLFG D Q++P+A + ++YTPVPL Sbjct: 229 QGWLPSYPQLGASLKYEKYYGKNVGLFGSDHLQENPYAFTGGISYTPVPLV 279 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 314 bits (804), Expect = 3e-84, Method: Composition-based stats. Identities = 124/288 (43%), Positives = 179/288 (62%), Gaps = 7/288 (2%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTA 64 K +K + + +++ V WANI +Q +FPL++ FTP + A + + + Sbjct: 13 KQLNKNKQLNKTRISKSVVWANIVIQAIFPLSIAFTPAVMAAET------VGASDEKPRS 66 Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 + E++ A+ A + L++ + + G A N+ +Q+W ++G+A+V+LN+ Sbjct: 67 ASQAEQSTANAATRLASILTNDDSAKQASSIARGTAANAGNEALQKWFNQFGSAKVQLNL 126 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+ SLK S L++L P+ D+P + FTQ DDR N+G G RHF M G N Sbjct: 127 DEKLSLKGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQ-MLGYNL 185 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+DHD S SHTRIGVGAEY RD++ L+ANGY SGWK SPD++ Y E+ ANG+D+R+E Sbjct: 186 FVDHDASYSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSEA 245 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 YLP PQLG L+YEQY+GDEVGLFG D RQK+P A++ V YTP+PL Sbjct: 246 YLPTLPQLGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPL 293 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 313 bits (802), Expect = 5e-84, Method: Composition-based stats. Identities = 126/285 (44%), Positives = 166/285 (58%), Gaps = 16/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + + ++ A ++ P Q L+ NT +T Sbjct: 43 LKKINQLRTFSKPFAKLQAGDELEIP-------------QAQSNLGLAPENTALTDTQTT 89 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+N+A A + L+S A + G+A ANQ WL +GTAR++ NVD Sbjct: 90 ERNLAKTATTSAQMLNSGD--KAAARQLRGLAVGNANQAANSWLNNFGTARLQANVDDRG 147 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 L S +ML P YDTP+ M FTQ I R D RT +N+G G RHF + WM G N F+D Sbjct: 148 DLDGSQFDMLMPFYDTPSQMAFTQFGIRRIDKRTTANLGIGIRHFIDD-WMVGYNLFLDR 206 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D++R HTR+G GAEY RDYLKL+ANGY+R S W+ SPD Y ERPA G+D+RAE YLP+ Sbjct: 207 DITRDHTRVGAGAEYARDYLKLAANGYLRLSDWRDSPDFSSYSERPATGFDLRAEAYLPS 266 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG LMYEQY+G++VGLFGKD RQ++P AI+A + YTP+PL Sbjct: 267 LPQLGGKLMYEQYFGNDVGLFGKDNRQQNPAAITAGINYTPIPLV 311 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 311 bits (796), Expect = 2e-83, Method: Composition-based stats. Identities = 127/286 (44%), Positives = 168/286 (58%), Gaps = 22/286 (7%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + S + R + +Q F + F V A Sbjct: 1 MVKTNPSSSQVRRVAVYGLAGLQFFFQVTPAFAGVFQAD--------------------- 39 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 E++VA A AG L DA R +T A+ +A + +WL ++GTA+ +L+V D Sbjct: 40 -EQSVAQTAMEAGRVLQGSNSGDAARQMLTSQASGQAADAVTQWLNQFGTAKTQLSVVSD 98 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FSLK SSL++L P Y+TP N+LFTQ + D R +N G G R+F+ N WM G N F D Sbjct: 99 FSLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGWMLGYNVFYD 158 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 D ++ R G+G E WRDYLKLSANGY R S W++SP + DY ERPA+GWDIRAEG+LP Sbjct: 159 VDWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGWDIRAEGWLP 218 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A+PQLG L+YEQYYG+EV LFG+ +RQK+PHAI+A VT+TP L Sbjct: 219 AYPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLL 264 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 310 bits (794), Expect = 4e-83, Method: Composition-based stats. Identities = 108/285 (37%), Positives = 169/285 (59%), Gaps = 10/285 (3%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ ++ ++ P+ A ++ + Sbjct: 89 LRKLNQFRTFSQNFENLQPGDELDIPM---------APLPIVEWDDDKPEIVLPSSASEN 139 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E VA A+ AG F S+ PD + T+ F + T A+ Q+W ++G++++ L DK F Sbjct: 140 EIRVAQLASQAGKFFSTNPDQEKTKAFARELLTTAASSYAQDWFNRFGSSQIHLEADKKF 199 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S +++L P Y+T N++F+Q ++HR + R ++N+G G R + G M G NTF D+ Sbjct: 200 SLKNSQIDLLMPWYETEDNLIFSQTSLHRKEGRIETNLGLGARWY-GEGQMIGGNTFFDY 258 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D+SR H+R+G+G EY RD+LKLSAN Y R SGW+ S D+ D+ RP+NGWD+RAEG+LP+ Sbjct: 259 DISRKHSRLGLGVEYRRDFLKLSANSYHRLSGWRSSRDLADHSARPSNGWDVRAEGWLPS 318 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +P +G L YEQYYGD V LFG Q++P++I+A + YTP+PL Sbjct: 319 YPHIGGKLTYEQYYGDSVALFGTKNLQQNPYSITAGLNYTPIPLV 363 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. Identities = 118/279 (42%), Positives = 154/279 (55%), Gaps = 19/279 (6%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 Y A+ + ++ P + V A E VAS Sbjct: 100 YRTFAKPFTALTVGDEIDVPRKKSPFTVDNNVTVPA------------------ENGVAS 141 Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 AA LS + + N + + Q+WLG++GTAR++ N + DF S+ Sbjct: 142 NAAAGAALLSHGDAAKSAENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDDFEFDSSA 201 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 +++L P+YD ++ FTQ D R NIG G R F N WM G NTF D+D++ ++ Sbjct: 202 IDVLIPLYDNQKSLFFTQLGGRNKDSRNTINIGAGVRAFLTN-WMYGANTFFDNDITGNN 260 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R+G+GAE W DYLKLSANGY + W +S D DY ERPANG+D+RAE YLPA+PQLG Sbjct: 261 RRVGIGAEAWTDYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLPAYPQLGG 320 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LMYEQY GDEV LFGKDKRQKDPHAI+ + YTPV L Sbjct: 321 KLMYEQYNGDEVALFGKDKRQKDPHAITVGINYTPVSLV 359 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 306 bits (784), Expect = 5e-82, Method: Composition-based stats. Identities = 125/290 (43%), Positives = 183/290 (63%), Gaps = 20/290 (6%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + + ++ ++ PL + + + + Sbjct: 80 LKKLNQLRKFSKPFEALTTGDEIDIPLIG---------------NNFTTQSLPHSTSSPN 124 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN----QEIQEWLGKYGTARVKLNV 124 + +A A+ G L + P+S+A + A + AN QEI +WL G RVKL+ Sbjct: 125 DSLLAQSASQVGNTLQNNPNSEALNDLARSSALSAANAKAGQEISDWLNGKGKVRVKLDA 184 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+DFS+K+S L++L P++++ ++M+F+QG++HRTDDRTQSN+G G+R+F+ + + G NT Sbjct: 185 DRDFSVKNSQLDLLVPLWESESHMIFSQGSVHRTDDRTQSNLGLGYRYFA-DSYALGANT 243 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F DHD SRSH+R+G+GAEY R++ KL+ NGY+R S WK SPD ++Y+ERPANGWDIRAEG Sbjct: 244 FYDHDWSRSHSRLGLGAEYQRNFFKLATNGYLRLSNWKDSPDFDNYEERPANGWDIRAEG 303 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 YLP++P LGA L YEQYYGD VGLFGKD +QK+PHAI+ Y+P PL + Sbjct: 304 YLPSYPGLGAKLAYEQYYGDNVGLFGKDNQQKNPHAITFGGNYSPFPLLK 353 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 303 bits (776), Expect = 5e-81, Method: Composition-based stats. Identities = 116/281 (41%), Positives = 169/281 (60%), Gaps = 9/281 (3%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAAR-AQHAVQPRLSMGNTTVTADNNVEKNV 72 ++ + + ++ P+ P++A + A + N V +NN + Sbjct: 98 QFRTFPQGFEQVSSGEEIDIPV-----PIIAEQGATKVSVVTPNEVNCPVGIENNPQTK- 151 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 + L+S + + + ++ AN+EIQ+WLG+YGTA+V+LNVD FSL++ Sbjct: 152 -EYVKRVSALLASSDPTTVATDVVRSEVSSTANKEIQKWLGQYGTAQVRLNVDDKFSLRE 210 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 SSL+ L+ YD+ + ++FTQ I D R +N+G G R GN W+ G NTF D+DL+ Sbjct: 211 SSLDWLFSFYDSSSAIIFTQLGIRNKDHRNTANLGLGGRISMGN-WILGANTFYDNDLTG 269 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 ++R+G GAE W DYL+LSAN Y+R + W +S D D+ ERPANG+DIR +LP PQL Sbjct: 270 INSRLGFGAEAWTDYLQLSANSYMRLNNWHQSRDFIDHDERPANGFDIRTNAWLPVLPQL 329 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G LMYEQY GD V LFGKDK QK+P+A++A +TYTP PL Sbjct: 330 GGKLMYEQYSGDSVALFGKDKLQKNPYAVTAGITYTPFPLL 370 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 303 bits (776), Expect = 5e-81, Method: Composition-based stats. Identities = 116/265 (43%), Positives = 158/265 (59%), Gaps = 17/265 (6%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 +Q+LFP V +A A QP +++ T V S A GT ++ Sbjct: 1 MQLLFPF------VTSAYTYAASQPPVAVPVPT---------QVTSLLAAGGT--ETENG 43 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 S+ ++ T MAT A ++EWL +GTA V LN D++ + +SS++ L P+YD ++ Sbjct: 44 SNGLKSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNKKSV 103 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 LFTQ + D RT NIG G R F+ +WM G N F D D + + R+G+GAE W DYL Sbjct: 104 LFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWTDYL 163 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 KL+AN YI + W S D DY E+PA+G+DIRAEGYLPA+PQLGA +MYEQYYG+ V L Sbjct: 164 KLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGENVAL 223 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 F KD Q DP A++ + YTP+ L Sbjct: 224 FDKDHLQNDPSAVTMGLNYTPISLV 248 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 302 bits (774), Expect = 9e-81, Method: Composition-based stats. Identities = 137/284 (48%), Positives = 181/284 (63%), Gaps = 5/284 (1%) Query: 11 PRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD-NNVE 69 + R R A +Q P P+MAA+ + G + + N Sbjct: 84 NQLRELNQLRTFAHGLNGLQ---PGDDVDVPLMAAKDNKNASDAAAPGRSASAEEGNEQA 140 Query: 70 KNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS 129 + VA +A+ AG+FL+S SDA + MAT +A Q+WL +GTARV+L+ DK+FS Sbjct: 141 QKVAGYASQAGSFLASSAKSDAAASMARNMATVEAGGAFQQWLSHFGTARVQLDADKNFS 200 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHD 189 LK+S ++L P+YD N +FTQG++HRTD RTQ+++G GWRH S + +M G N F D D Sbjct: 201 LKNSQFDLLLPLYDQGDNFVFTQGSLHRTDSRTQASLGAGWRH-STSTYMLGGNLFGDFD 259 Query: 190 LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAW 249 LSR H R G G EYWR++LKL N Y+R SGWK SPD+EDYQERPANGWD+R + ++P+ Sbjct: 260 LSRDHARAGAGLEYWRNFLKLGVNSYLRLSGWKDSPDLEDYQERPANGWDVRGQAWVPSL 319 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG L YEQYYG EV LFG D RQ++PHAI+ + YTPVPL Sbjct: 320 PQLGGKLTYEQYYGKEVALFGVDSRQRNPHAITVGINYTPVPLI 363 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 300 bits (767), Expect = 6e-80, Method: Composition-based stats. Identities = 123/304 (40%), Positives = 167/304 (54%), Gaps = 18/304 (5%) Query: 6 TGHKQPRFRYSVLARCV-------AWANISVQVLFPLAVTFTPV--MAARAQHAVQPRLS 56 Q L + N +L P P+ +A +A + P L Sbjct: 49 QIALQSGLDLRTLRKLNNGSLDKRDELNAGESLLLPANSPLFPLDPLAGKAIASNLPELG 108 Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA--------NQEI 108 MGN V ++ E+ A+ A G + SD +N A +A Q+ Sbjct: 109 MGNDPVPLVSSGEQKTAAAAHAVGAQNWNNMTSDQMKNQAESWAKGQAKAQVVDPLRQQA 168 Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 QE LGK+G A+V L VD + SL S+ + P Y+ + F+Q +HR D+R N+G Sbjct: 169 QELLGKFGKAQVNLAVDDNGSLSKSAFSLFSPWYENDAMVAFSQVGVHRQDNRMIGNLGA 228 Query: 169 GWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 G R G+ W+ G NTF+D D+SR+H+R+G+G E+W D LKL++N Y SGWK S D + Sbjct: 229 GVRFDQGD-WLFGANTFLDQDISRNHSRLGLGLEWWADNLKLASNYYHPLSGWKDSKDFD 287 Query: 229 DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYT 288 DY ERPA G+D+ A+GYLPA+ QLGAS +YEQYYGDEV LFGKD QKDPHA++ V YT Sbjct: 288 DYLERPARGFDVHAQGYLPAYQQLGASAVYEQYYGDEVALFGKDNLQKDPHAVTVGVDYT 347 Query: 289 PVPL 292 P PL Sbjct: 348 PFPL 351 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 297 bits (760), Expect = 3e-79, Method: Composition-based stats. Identities = 113/264 (42%), Positives = 154/264 (58%), Gaps = 10/264 (3%) Query: 36 AVTFTPVMAARAQHAVQPRLSMGNTTVTA------DNNVEKNVASFAANAGTFLSSQPDS 89 + F + + S +T A + E + + G L++ S Sbjct: 67 SSAFENLHPNNEMESSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLAA---S 123 Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML 149 + M NQEI++WL ++GTA+V LN DK+FSLK+SSL+ L P YD+ + + Sbjct: 124 GRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLF 183 Query: 150 FTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 F+Q I D R N+G G R N W+ G+NTF D+DL+ + RIG+GAE W DYL+ Sbjct: 184 FSQLGIRNKDSRNTLNLGVGIRTL-ENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQ 242 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 L+ANGY R +GW S D DY+ERPA G D+RA YLPA PQLG LMYEQY G+ V LF Sbjct: 243 LAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALF 302 Query: 270 GKDKRQKDPHAISAEVTYTPVPLT 293 GKD Q++P+A++A + YTPVPL Sbjct: 303 GKDNLQRNPYAVTAGINYTPVPLL 326 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 296 bits (757), Expect = 7e-79, Method: Composition-based stats. Identities = 117/274 (42%), Positives = 160/274 (58%), Gaps = 12/274 (4%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + +A I +Q P+A++ + + A LS + DN A A Sbjct: 2 KSMAIMQILLQTALPVALSMSATVRAA-------ELSQNTHSADKDNINSPYSAQM-TQA 53 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 T LSS + A MA+ A +++WL ++GTARV+LNVD + DS+++ L Sbjct: 54 ATALSSGNAAGAGA----SMASGYAGDSVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLA 109 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ MLFTQ + DDR N G G R F ++WM G N F D D + + R+G Sbjct: 110 PLYDSQKAMLFTQLGLRAPDDRVTGNFGLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGF 169 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W + LKLSAN Y+ + W S D +DY E+PA+G+D+RAEGYLPA+PQLGA LMYE Sbjct: 170 GAEAWTNNLKLSANTYLGTTNWHSSRDFDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYE 229 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 QYYGD+V LF KD Q +P A++ V+YTPVPL Sbjct: 230 QYYGDKVALFDKDDLQSNPSAVTVGVSYTPVPLI 263 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 290 bits (743), Expect = 3e-77, Method: Composition-based stats. Identities = 110/280 (39%), Positives = 166/280 (59%), Gaps = 16/280 (5%) Query: 14 RYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVA 73 + V + ++ P+ V+F P+ T + + +A Sbjct: 90 QGRVFLNGIKNIKEGDEINVPV-VSFAPIKWGEE------------ETKEQGSGNLQQIA 136 Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 S A + G LS+ S + + T K N IQ W +GTA ++L VDK+FSLK+S Sbjct: 137 SIATDVGNILSNDNISK--NSALLNKITNKVNSHIQSWFENFGTAHIQLQVDKNFSLKNS 194 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 LE+L+P+++ + F+QG I DD+ SNIG G+R F N WM G N+FID+DL + Sbjct: 195 QLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDN-WMLGGNSFIDYDLRKE 253 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLG 253 H+R+G+G EYW+D LKL AN Y+R S W+ S +I DY+ERPANG D+ + +LP++PQ+G Sbjct: 254 HSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNIKSWLPSYPQIG 313 Query: 254 ASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + YE+YYGD+V LFG++ RQ++PH+ + ++YTP PL Sbjct: 314 GDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLM 353 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 289 bits (740), Expect = 6e-77, Method: Composition-based stats. Identities = 129/289 (44%), Positives = 180/289 (62%), Gaps = 12/289 (4%) Query: 16 SVLARCVAWANISVQVLFPLAVTF---TPVMAARAQHAVQP----RLSMGNTTVTADNNV 68 + R A+ + +QV+F +F P AA Q ++ +T ++ Sbjct: 2 RIYLRLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEA 61 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 KN+A+ ++ G+ LS +DA N +A +IQ+WL ++GTA+V L +DKD Sbjct: 62 GKNLAAILSSTGSMLSQDNKTDALINSAINNGSAYVTGQIQQWLQQFGTAKVNLGLDKDL 121 Query: 129 SLKDSSLEMLYPIYDTPT-NMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 SL ++SL++L P+YD N+LFTQ R DDR N+G G+R+F+ + WM G+NTF D Sbjct: 122 SLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFA-DRWMWGINTFYD 180 Query: 188 HDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 +S + H R+G+G E +Y KLSANGY R SGWK S + EDYQER ANG+DIRAEGYL Sbjct: 181 RQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDIRAEGYL 240 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 PAWPQLGA L++EQYYGD+V LF D RQ++P+A++A V YTP PL Sbjct: 241 PAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLV 289 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 289 bits (739), Expect = 9e-77, Method: Composition-based stats. Identities = 110/236 (46%), Positives = 152/236 (64%), Gaps = 4/236 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 N + + +A + L+ D + ++K+NQ+I++WL ++G Sbjct: 84 NNQDEAIPSTEGEELAKIIVDNSFLLNKDID---VTQYAISQISSKSNQKIEQWLNQFGH 140 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND 177 ARV L+ DK+ +LK+SS E+L P+Y+ ++F Q HR D R+Q N G G+R+F+ Sbjct: 141 ARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFAQTNYHRKDLRSQFNYGIGYRYFT-EK 199 Query: 178 WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 +M G+N F DHDL+ H R+G+GAE WRDY KLS+N Y R S W+ S +I DY ERPANG Sbjct: 200 FMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSSNHYHRLSSWRASNNILDYSERPANG 259 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 WDIR EGY PA+PQLG L++EQYYG EVGLFGKDKR K+PH + + YTP+PL Sbjct: 260 WDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKDKRDKNPHTYTLGINYTPIPLV 315 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 108/281 (38%), Positives = 149/281 (53%), Gaps = 16/281 (5%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F + + ++ +S+ ++F + A++ N + Sbjct: 5 FNTLTVTKIISRLILSIGLIFGIFTYGFSQQHYFNSEALENPA--------EHNEAFNKI 56 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 S + S N M ANQE++ WL ++GT +V +N DK FSLK+ Sbjct: 57 ISTGTSLA-------VSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKFSLKE 109 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 SSL+ L P YD+ + + F+Q I D R NIG G R F WM G NT D+D++ Sbjct: 110 SSLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQ-SWMYGFNTSYDNDMTG 168 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 + RIGVGAE W DYL+LSANGY R +GW +S D DY ERPA+G DI + YLPA PQL Sbjct: 169 HNHRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPALPQL 228 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G L YEQY G+ V LFGKD Q +P+A++ + YTP+P Sbjct: 229 GGKLKYEQYRGERVALFGKDNLQSNPYAVTTGLIYTPIPFI 269 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 111/263 (42%), Positives = 162/263 (61%), Gaps = 7/263 (2%) Query: 36 AVTFTPVMAARAQHAVQPRL-SMGNTTVTADNNVEKNVASFAANAGTFLSSQ----PDSD 90 + PV+ A A+ L S+G+ + +NN E A + GTFLS + S Sbjct: 24 SSAIMPVIPAYAKMLDNKELPSLGSDQIIDENNTEHLAAEYTKTVGTFLSQKKTMKDLSQ 83 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 +++ +++A +EI+ WL K G ++ ++ DK FS+K+S + L P YD +LF Sbjct: 84 IAQDYARNKVSSEATKEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLF 143 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 TQ +HR D+R +N G G R+F + G+N FIDHDLS +HTR+G+G EYW+DYLKL Sbjct: 144 TQHTLHRYDERFHTNNGIGLRYFHEKSTI-GMNAFIDHDLSHAHTRVGLGVEYWQDYLKL 202 Query: 211 SANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 +AN Y + WK + ++ D+ +PA+GWDI+ EG+LP +P LG +L YEQYYGD V LF Sbjct: 203 NANSYFGLTSWKSASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALF 262 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 GK KRQK+P+A + +TP PL Sbjct: 263 GKTKRQKNPNAATIGANWTPFPL 285 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 106/238 (44%), Positives = 136/238 (57%), Gaps = 2/238 (0%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 VAS + G LSS+ A GM + + ++EWLG G Sbjct: 109 PNQEEEQQATQQASMVASHLSQVGNSLSSEDRVGAFSRLAKGMLLSSTAKTVEEWLGHIG 168 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A+VKL D S +++ P+YD P + F+Q R D R NIG G RH+ + Sbjct: 169 QAQVKLQADDKNDFSGSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGLGQRHYVSD 228 Query: 177 DWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 WM G N F D +S + H R+G G E RDY+KLSAN Y R GWK S +EDY ER A Sbjct: 229 -WMFGYNIFFDQQISGNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLEDYDERAA 287 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 NG+DIR E YLP +PQLG LMYEQY+GDEV LFG ++RQK+P A++A V+YTP+PL Sbjct: 288 NGYDIRTEAYLPHYPQLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYTPIPLV 345 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 117/288 (40%), Positives = 157/288 (54%), Gaps = 19/288 (6%) Query: 22 VAWANISVQVLFPL-----------AVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK 70 + A Q++ PL + P++AA +L+ + VT N + Sbjct: 101 MMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD 160 Query: 71 NV----ASFAANAGTFLSSQP-DSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVD 125 A AA+ G+ L S+ + D ++ G+A +A+ ++Q WL YGTA V L Sbjct: 161 KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSG 220 Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 +F SSL+ L P YD+ + F Q D R +N+G G R F + M G N F Sbjct: 221 NNFD--GSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPEN-MLGYNVF 277 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 ID D S +TR+G+G EYWRDY K S NGY R SGW +S + +DY ERPANG+DIR GY Sbjct: 278 IDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGY 337 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LP++P LGA LMYEQYYGD V LF DK Q +P A + V YTP+PL Sbjct: 338 LPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLV 385 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 286 bits (732), Expect = 6e-76, Method: Composition-based stats. Identities = 114/269 (42%), Positives = 156/269 (57%), Gaps = 6/269 (2%) Query: 25 ANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLS 84 AN V P+ + RA + G+ T D ++ G+ L+ Sbjct: 102 ANAGELVDSPINDAIAININ-RASQNNKNNAGAGSLTKEQDPMDSLSI----RGVGSALA 156 Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDT 144 + DA + MAT+ N +I +WL +YGTAR++LN D+DFSL +S+L+ L P+YD+ Sbjct: 157 ASGRVDALHHMARTMATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLLPLYDS 216 Query: 145 PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYW 204 T LFTQ D R +NIG G R F ++WM G N F D+D + + R+G+GAE W Sbjct: 217 QTLTLFTQQGFRNKDRRNIANIGIGTR-FIHHEWMMGGNAFYDNDFTGDNKRVGLGAELW 275 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD 264 D +LSANGY R + W +S D DY ERPANG D+RA G+LPA P LG SL+YE Y+GD Sbjct: 276 TDSFQLSANGYFRLTAWHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYEHYFGD 335 Query: 265 EVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V LFGKD Q++P+AI+ +YTP L Sbjct: 336 NVALFGKDHLQRNPYAITLGGSYTPFSLL 364 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 284 bits (727), Expect = 2e-75, Method: Composition-based stats. Identities = 91/212 (42%), Positives = 133/212 (62%), Gaps = 2/212 (0%) Query: 83 LSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIY 142 L ++ +++I ++ Q+WL ++GTA++ LNVD L +SS+++L P Y Sbjct: 122 LLNKDPKKLAQDYIVNKLNSQITSNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFY 181 Query: 143 DTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGA 201 D + ++++Q D R N+G G R F + WM G NTF D+DL+ +++R +G Sbjct: 182 DDKDHWLIYSQYGYRHKDSRDTVNLGIGTRLFIND-WMYGANTFYDNDLTGNNSRFSLGG 240 Query: 202 EYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 E W +YLK+SAN Y R S W S D+ +Y ERPANG+D+ A+ YLPA P LGA + YEQY Sbjct: 241 ELWTNYLKMSANAYFRLSDWHNSRDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQY 300 Query: 262 YGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +GD V LFG + RQKDP+A + V YTP+PL Sbjct: 301 FGDNVALFGTNNRQKDPYAATIGVNYTPIPLI 332 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 284 bits (726), Expect = 3e-75, Method: Composition-based stats. Identities = 100/265 (37%), Positives = 140/265 (52%), Gaps = 9/265 (3%) Query: 37 VTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASF--------AANAGTFLSSQPD 88 F Q+ P L N K++ A L+ + Sbjct: 16 AAFAAPEINVKQNESLPDLGSQAAQQDEQTNKGKSLKERGADYVINSATQGFENLTPEAL 75 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 R+++ T+ A I++ L YG R L++ + L SS++ P YD T + Sbjct: 76 KSQARSYLQSQITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSIDYFVPWYDNQTTV 135 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 F+Q + R +DRT NIG G R+ + + ++ G N F D+D +R H R+G+GAE W DYL Sbjct: 136 YFSQFSAQRKEDRTIGNIGLGVRY-NFDKYLLGGNIFYDYDFTRGHRRLGLGAEAWTDYL 194 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 K S N Y S WK S D + Y+ERPA GWDIRAE +LPA+PQLG +++EQYYG+EV L Sbjct: 195 KFSGNYYHPLSDWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKIVFEQYYGNEVAL 254 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 FG D +KDP A++ V Y PVPL Sbjct: 255 FGTDSLEKDPFAVTLGVKYQPVPLI 279 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 283 bits (724), Expect = 6e-75, Method: Composition-based stats. Identities = 109/289 (37%), Positives = 153/289 (52%), Gaps = 17/289 (5%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTA 64 ++ + + ++ P + T Sbjct: 85 PLSKLYKLNQFRSFHKSFYDLSGGDEIDIPAS---------------NNYSFENRPLDTK 129 Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 +N E A+ A S S + MA++ AN IQ+WL ++GT +L+ Sbjct: 130 VDNNENYSANKTKAAVNV-SESNKSPEALGVASSMASSAANNAIQKWLSQWGTVESQLSF 188 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D SLK+SSL+ L PIYDT N F Q D R N+G+G RH N WM G+N Sbjct: 189 DSKASLKNSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLGWGVRHVY-NGWMYGLNN 247 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F D+D++ ++ R+G+G E DYL +++N Y+R + W +S D DY ERPANG+D+R G Sbjct: 248 FFDYDITGNNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDFYDYDERPANGFDMRVNG 307 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LPA+PQ+G L+YEQYYGDEVGLFGKD RQKDP AI+A V++TP PL Sbjct: 308 WLPAYPQIGGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSWTPFPLL 356 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 282 bits (721), Expect = 1e-74, Method: Composition-based stats. Identities = 104/238 (43%), Positives = 139/238 (58%), Gaps = 2/238 (0%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 VAS + G+ LSS+ +A G+ + + ++EWLG G Sbjct: 70 PNREEEQKATQQASLVASHLSQIGSTLSSESRVEAFSRLAKGVLLSSTAKSVEEWLGHIG 129 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A+VKL VD S L + P+Y+ P + F+Q R D R NIG G RH+ + Sbjct: 130 KAQVKLQVDDKNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYLSD 189 Query: 177 DWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 WM G N F+D +S + H R+G+G E RDY+KLSAN Y R GWK S +EDY ER A Sbjct: 190 -WMLGYNVFLDQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERAA 248 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +G+DIR E YLP +PQLG LMYEQY+G+EV LFG ++RQK+P A++A V+YTP PL Sbjct: 249 SGYDIRTEAYLPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPLV 306 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 280 bits (716), Expect = 4e-74, Method: Composition-based stats. Identities = 100/266 (37%), Positives = 138/266 (51%), Gaps = 1/266 (0%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 ++ + P + +D + L+ Sbjct: 1 MENEIGGTLINKPGHDMPKLPDMAIMAETSGAKPISDQQFADWGKNLGGQDWNTLNRDKA 60 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 T + + Q+ Q+ LG++G A+V L++D +L S+ + P YD+ + Sbjct: 61 QSKTTQWAKEKIISPLQQQAQDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYL 120 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 LF+Q IH D+R N G G R + + + G N FIDHD SR H R G+GAE DY Sbjct: 121 LFSQINIHHQDNRKIGNFGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADY 180 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 LK SAN Y S WK SPD +DY ERPA G+D+R++GYLPA+PQLG S +YE Y+GDEV Sbjct: 181 LKFSANYYHPLSHWKDSPDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVA 240 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LFGK RQKDP A++ + YTPVPL Sbjct: 241 LFGKSHRQKDPRALTLGIDYTPVPLV 266 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 279 bits (715), Expect = 6e-74, Method: Composition-based stats. Identities = 95/237 (40%), Positives = 143/237 (60%), Gaps = 3/237 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 ++ ++ + + S L+S P +++I ++ Q+WL ++GT Sbjct: 91 EDSHKDGNHPLPPLILSHGTKILGLLNSDPK-KLAQDYIVNKLNSQITSNTQKWLSQFGT 149 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A++ LNVD L +SS+++L P YD + ++++Q D R N+G G R F N Sbjct: 150 AKINLNVDHRGRLDESSVDLLVPFYDDKDHWLVYSQYGYRHKDSRDTVNLGIGTRLFINN 209 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 WM G NTF D+DL+ +++R +G E W +YLK+SAN Y R S W + D+ +Y ERPAN Sbjct: 210 -WMYGANTFYDNDLTGNNSRFSLGGELWTNYLKMSANAYFRLSDWHNARDLVNYYERPAN 268 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G+D+ A+ YLP+ P LGA + YEQY+GD V LFGK+KRQKDP+A + V YTP+PL Sbjct: 269 GYDLIADMYLPSMPSLGAKIKYEQYFGDNVALFGKNKRQKDPYAATIGVNYTPIPLI 325 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 278 bits (710), Expect = 2e-73, Method: Composition-based stats. Identities = 115/303 (37%), Positives = 168/303 (55%), Gaps = 23/303 (7%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 + + +P+ S+L W+ + P++ + AQ L Sbjct: 1 MNPPSSKLKPKLPNSLLLSTAIWSTAIL-----------PMVPSYAQIVHLDDLPTLGGQ 49 Query: 62 VTA------DNNVEKNVASFAANAGTFLSSQ----PDSDATRNFITGMATAKANQEIQEW 111 +++ E+ +A + NA F S + +D +++ A A EI W Sbjct: 50 AIQFEGTQPEDSTERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEITHW 109 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L K G AR+ +N+DK S+K S L+ L P Y+ +LF+Q +IHRTD R Q+N G G R Sbjct: 110 LSKAGNARLNINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLR 169 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDY 230 HF N M GVN F DHDLS H+R+G G EY +DY+++SAN Y+ S W+ + ++ +DY Sbjct: 170 HFQQNS-MIGVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDY 228 Query: 231 QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPV 290 RPANGWDI+ EG+LP + LGA+L EQYYGD+V LFGK++RQKDP A + V ++P Sbjct: 229 NARPANGWDIQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPF 288 Query: 291 PLT 293 PL Sbjct: 289 PLL 291 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 273 bits (697), Expect = 7e-72, Method: Composition-based stats. Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 11/242 (4%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 + + EK A L++ ++A N+ NQ+I +WL +YG Sbjct: 98 KETSQAKQVESAEKQFVQGATQIAQGLANNNATEAAINYARNRGEGLLNQKISDWLNQYG 157 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 ARV+++ +K ++L P+ D P ++LF+Q I + R+ +N+G G+R + N Sbjct: 158 KARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQIGIRANEQRSTTNLGLGYRQYQQN 212 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS--PDIEDYQERP 234 WM G+N+F D+D+S + R G+G E W YLKL+ NGY R + W +S ++ DY ERP Sbjct: 213 -WMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVNGYFRLTDWHQSFLHEMRDYDERP 271 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK---DKRQKDPHAISAEVTYTPVP 291 ANG+D+RAEGYLP++P LGA YEQY+GD V L + +P A++ ++YTP P Sbjct: 272 ANGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHNPTAKDLKDNPSAVTFGLSYTPFP 331 Query: 292 LT 293 L Sbjct: 332 LL 333 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 91/261 (34%), Positives = 136/261 (52%), Gaps = 16/261 (6%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRN 94 T + A A L+ V V+ V + R Sbjct: 84 ATAGQTIWIPAAKPAATTLPLAPATVQVAKPGKVDGKV-------------DDKTTNVRQ 130 Query: 95 FITGMATAKANQEIQEWLGKYG-TARVKLNVDKDFSLKDSSLEMLYPIYDT-PTNMLFTQ 152 F A+++ + WL +G ++RV ++ ++F+ + + ++L P++++ M+F+Q Sbjct: 131 FGQDQLNTLASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMIFSQ 190 Query: 153 GAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSA 212 + DDRT NIG G R+F G WM G N F D+D S S+ RIG+GAE D L+L+A Sbjct: 191 LGVRHADDRTTGNIGLGARYF-GEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRLAA 249 Query: 213 NGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKD 272 NGY + +GW S I D+ ERPANGWDI +LP +PQLG + YEQYYGD V L + Sbjct: 250 NGYFKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALISRG 309 Query: 273 KRQKDPHAISAEVTYTPVPLT 293 + Q +P A + V +TP+PL Sbjct: 310 RLQHNPSAATLGVNWTPIPLV 330 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 268 bits (686), Expect = 1e-70, Method: Composition-based stats. Identities = 110/272 (40%), Positives = 152/272 (55%), Gaps = 18/272 (6%) Query: 33 FPLAVTFTPVMAARAQH---AVQPRLSMGNTTVTADNN---VEKNVASFAANAGTFLSSQ 86 PL + TP+ A P L + +N E NVAS A + + Sbjct: 89 APLNGSTTPLFAPEETSKSITELPDLGSIQNDIDVNNKLPVTEDNVASAATQLWGIMGND 148 Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPT 146 S A + +TG+A A+Q +WLG+YG ARV+LN S + ++L P+ +T Sbjct: 149 NSSRAAESAVTGVAAGLASQAAADWLGQYGNARVQLN-----SNSIGNADVLIPLTETQN 203 Query: 147 NMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRD 206 N+LF Q + +RT +N+G G R F+ + WM GVNTF D+DL+ ++R+GVG E W D Sbjct: 204 NLLFGQLGVRYNGERTTNNVGLGVRSFT-DSWMFGVNTFYDYDLTGKNSRLGVGGEAWTD 262 Query: 207 YLKLSANGYIRASGWKKS--PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD 264 LK SANGY R + W +S D+EDY ERPANG+D+RAE YLP++PQLG LMYE+Y+G Sbjct: 263 NLKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYLPSYPQLGGRLMYEKYFGK 322 Query: 265 EVGLFGK----DKRQKDPHAISAEVTYTPVPL 292 V L D P A + + YTP+PL Sbjct: 323 GVALNSGSTSPDDLGDSPSAFTVGLNYTPIPL 354 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 263 bits (673), Expect = 5e-69, Method: Composition-based stats. Identities = 89/291 (30%), Positives = 144/291 (49%), Gaps = 16/291 (5%) Query: 12 RFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN 71 + + LAR +AW + Q+L P A+ A+A R ++ D + Sbjct: 2 KLTPTPLARWLAWVLVGTQLLTPAAL-------AQAMLPEITRSGADSSVDKTDQPEAEW 54 Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKY---GTARVKLNVDKDF 128 +AS A++ G+ L SD +N I + AN I + + R + ++ Sbjct: 55 LASRASSLGSLLQEGNISDFAKNQIQALPQTIANDGITSGIKHWLPEAQFRGGITLEDAS 114 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVN 183 + + ++L P+Y + +++LF Q + D+ R N G GWR G+ W+ G+N Sbjct: 115 KYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVNTGIGWRQDVGD-WLLGIN 173 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 +F+D D+ H R +G E +RD + L+ N Y S WK S + ERPA G D+R + Sbjct: 174 SFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKVQPLHDERPATGIDVRLK 233 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 G LP+ P GA L +EQY+GD+V + G D +DP A + +T+ PVPL + Sbjct: 234 GALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAITWKPVPLVE 284 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 258 bits (660), Expect = 1e-67, Method: Composition-based stats. Identities = 90/279 (32%), Positives = 133/279 (47%), Gaps = 10/279 (3%) Query: 19 ARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAAN 78 A +AW ++ P P + Q +G D EK A+ A Sbjct: 25 AMGLAWLCGAL----PAYAESPPAPDSVVQQPANDLPELGGNASN-DAEREKEWATMAKQ 79 Query: 79 AGTFLSSQPDSDA----TRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 G + S ++ G A++ Q+ QE L G A++ L + SS Sbjct: 80 LGERNLNNVSSQQVRTRAESYAVGQASSVLQQQAQELLSPLGNAKLSLVMSDQGDFSGSS 139 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 ++ P+YD + ++Q + + + + N G G R +G+ W+ G NT +D D R H Sbjct: 140 GQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAGD-WLLGYNTVLDSDFERHH 198 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R +GAE W D+L+ SAN Y S + D + RPA+G+DI +GYLP + Q+G Sbjct: 199 NRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDITTQGYLPFYRQIGG 258 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 SL YEQY+G+ V LFG K+Q DP A+ V YTPVPL Sbjct: 259 SLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLV 297 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 257 bits (657), Expect = 3e-67, Method: Composition-based stats. Identities = 93/236 (39%), Positives = 135/236 (57%), Gaps = 11/236 (4%) Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +NN E+ AS G LSS D + N+ + NQ+I +WL +YG AR+ Sbjct: 100 PEENNNEEKFASSFTLMGDILSSDNFVDNSINYAKSIGQGLVNQQINDWLNQYGKARISF 159 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 + D K+ S + L P+ D P N+LFTQ + DR N+G G+R + N WM G+ Sbjct: 160 SSD-----KNISGDFLLPVIDEPNNLLFTQLGLRNNTDRNTINLGLGYRKYWRN-WMFGI 213 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP--DIEDYQERPANGWDI 240 NTF D+D + + R+GVG E W DYLKL+ NGY + W +S ++DY ERPA G+D+ Sbjct: 214 NTFYDYDYTGGNARLGVGGEAWIDYLKLAINGYFGLTDWHQSKISVMDDYDERPATGFDV 273 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGL---FGKDKRQKDPHAISAEVTYTPVPLT 293 RAE YLP +PQLG+S+ YE+Y+G + L + + D ++ + YTP+PL Sbjct: 274 RAEAYLPKYPQLGSSIKYEKYFGKGIHLGTGVNPEYLKDDAQSLIMGLNYTPIPLL 329 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 253 bits (645), Expect = 8e-66, Method: Composition-based stats. Identities = 93/222 (41%), Positives = 127/222 (57%), Gaps = 2/222 (0%) Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 + + S+ + R + +AN LG GTAR ++ +D DF++ + Sbjct: 166 EQVPTSASRYGSEQEVQYWRQQLATQFEEEANAYAASLLGAMGTARTRVTLDDDFNMVTA 225 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 ++L P+ + +LFTQ + R DRT +N+G G RHF + WM G N F D+DL+ Sbjct: 226 EADLLLPLAEEQQTLLFTQFGLRRNGQDRTIANLGVGQRHFL-DRWMLGYNLFADYDLTN 284 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 H R GVGAE WRDYLKL AN Y S W+ SP E +ER A G D+R E YLPA+PQ Sbjct: 285 RHWRAGVGAEAWRDYLKLGANFYTPLSSWRDSPRFEGMEERAARGMDVRLEAYLPAYPQW 344 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 ASL EQY G+ VGL D+ ++DPHAI+A + Y P PL + Sbjct: 345 SASLTAEQYLGERVGLLDADQLERDPHAITAGLHYNPFPLLK 386 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 248 bits (634), Expect = 1e-64, Method: Composition-based stats. Identities = 105/247 (42%), Positives = 150/247 (60%), Gaps = 11/247 (4%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + R + V W+ I+ Q+L P+ T P ++ + + A++ Sbjct: 2 TMVNKKLR-RKASCAVTWSVIATQILSPVTFTLIPA------NSFASSANTESAQTNAND 54 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 +AS AANAG L++ F +A+A +E+ +WL +YG AR+KLNVD+ Sbjct: 55 EYANELASLAANAGQSLANN----TAGRFAVDTLSAQATKEVVDWLQQYGNARIKLNVDE 110 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 F+LKD++ + LYP D+ +LF+Q ++HRTDDR Q+NIG G RHF+ ++ M G N F Sbjct: 111 SFTLKDAAFDFLYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFY 170 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 D+DLSR H+R G+G EYWRDY++ AN Y S WK S DI+DY ERPANGWD+ AEG+L Sbjct: 171 DYDLSRHHSRAGLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWL 230 Query: 247 PAWPQLG 253 P +PQLG Sbjct: 231 PVYPQLG 237 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 248 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 79/203 (38%), Positives = 114/203 (56%), Gaps = 2/203 (0%) Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 N Q ++ + +G + L+VD S +L P Y +++LF Sbjct: 176 YIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSHVLF 235 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 +Q ++++RT +IG G R + ++ G N FID+DL + H R+ +G E +Y K Sbjct: 236 SQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASNYFK 294 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 LS N Y + W+ S D+EDY ERPA G+DIR +GYLP +PQLG + YEQY+G EV LF Sbjct: 295 LSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEVALF 354 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 K KRQK+P A+S + Y P PL Sbjct: 355 NKTKRQKNPKAVSIGIDYRPFPL 377 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 244 bits (624), Expect = 2e-63, Method: Composition-based stats. Identities = 87/222 (39%), Positives = 126/222 (56%), Gaps = 6/222 (2%) Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS--LKD 132 + +A + L S P D + G + +Q I+ WL +YG AR+ LN D S L Sbjct: 18 YTKSAASLLKSGPAFD---QYAAGKISQLTSQAIEGWLKQYGNARITLNAQSDNSTALAG 74 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SS ++L+ +++ + + + Q H D + N+G G R+F N M G N F D +++ Sbjct: 75 SSADLLFGLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNIN 134 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQ 251 +R GVG E WRDY K S NGY S W+ S +EDY E+ A+G+D++ E YLP + Q Sbjct: 135 SGVSRSGVGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQ 194 Query: 252 LGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LG L YEQY+GD V LF + Q DP AI+ ++YTP+PL Sbjct: 195 LGGHLKYEQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPLI 236 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 244 bits (623), Expect = 2e-63, Method: Composition-based stats. Identities = 85/246 (34%), Positives = 122/246 (49%), Gaps = 10/246 (4%) Query: 49 HAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEI 108 + + K N SD +++ I M A E Sbjct: 54 QHQTDDDATQGGDIPKSAMSGKRWLQHQTNDDVM----QGSDISKSGIADMGFAALQPET 109 Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 ++ G R L + D L S+++ YP+YD + + F Q R D R N+G Sbjct: 110 EK---SAGEVRANLPL-SDGKLTSGSIDLFYPLYDGDSRLFFGQVGARRFDGRNIVNLGI 165 Query: 169 GWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 G R+F G+ W G NTF D +S + H R+G G EYWRDYL LSANGY + W S + Sbjct: 166 GQRYFQGD-WALGYNTFYDIQISGNAHQRLGFGLEYWRDYLYLSANGYFGLTDWYSSSAL 224 Query: 228 EDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTY 287 + Y ER ANG+DIRA+G+ P +PQL L +EQY+GD++ L R K+P+A++ + Y Sbjct: 225 DGYAERAANGYDIRAQGWFPVYPQLSGKLKFEQYFGDDIALLNHQNRYKNPYALTMGLEY 284 Query: 288 TPVPLT 293 TP+ L Sbjct: 285 TPIQLI 290 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 240 bits (613), Expect = 4e-62, Method: Composition-based stats. Identities = 85/290 (29%), Positives = 134/290 (46%), Gaps = 34/290 (11%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F + + + WA ++ Q+ P+ +DN ++ + Sbjct: 3 FSKKPITKYITWAIVTSQIPLPVI-------------------------ADSDNEIQSWI 37 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDFS 129 A A++ L D + I + AN + E + R +N++ Sbjct: 38 AGTASSISPHLQEGTLEDYAKGKIKALPGQAANHLVNEGIKSAFPEIIFRGGVNLEDGAK 97 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVNT 184 + S +M P+ +T +++LF Q D+ RT N+G G+R N W+ GVNT Sbjct: 98 YRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGRTYVNVGMGYRQEV-NGWLLGVNT 156 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+D D+ SH R G+G E ++D L S N Y +GWK S E + ERPA G+D+R +G Sbjct: 157 FLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSAAHELHDERPAYGFDLRTKG 216 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 LP +P L YEQYYGD+V L G ++P A A++ + PVPL + Sbjct: 217 TLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGADLVWNPVPLLE 266 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 238 bits (608), Expect = 1e-61, Method: Composition-based stats. Identities = 73/239 (30%), Positives = 109/239 (45%), Gaps = 10/239 (4%) Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSD---ATRNFI----TGMATAKANQEIQEWLGK 114 + + EK A G + D + F +A+ NQ ++ WL Sbjct: 2 MPESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLSP 61 Query: 115 YGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 +G A V + VD S P D + ++Q + R +D SN+G G R + Sbjct: 62 WGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRW-A 120 Query: 175 GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERP 234 + W+ G NTF D+ L R G+GAE W +YL+LSAN Y + W ++R Sbjct: 121 RDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWH--ERSATQEQRM 178 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A G+D+ A+ +P + L + EQY+GD V LF K +P A+S + YTPVPL Sbjct: 179 ARGYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLV 237 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 75/286 (26%), Positives = 120/286 (41%), Gaps = 10/286 (3%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 + R + + + + T A +++ EK+ A Sbjct: 2 SRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAE 61 Query: 75 FAANAGTFLSSQPDSDATRNF-------ITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + G + D + + + NQ ++ WL +G A V + VD + Sbjct: 62 IVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNE 121 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 S P+ D + ++Q + + D+ SN+G G R GN W+ G NTF D Sbjct: 122 GHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLVGYNTFYD 180 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 + L + R G GAE W +YL+LSAN Y + W + ++R A G+D+ A +P Sbjct: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMP 238 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + L S+ EQY+GD V LF +P A+S + YTPVPL Sbjct: 239 FYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 229 bits (585), Expect = 7e-59, Method: Composition-based stats. Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 10/257 (3%) Query: 44 AARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD---SDATRNFI---- 96 A+ +N EK+ A G + R+F Sbjct: 3 QAQNPFDENGDNLPDLGLAPENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTRL 62 Query: 97 TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIH 156 ++ E + L +G A V L VD++ + SS + P D + ++Q + Sbjct: 63 RDAVSSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGVS 122 Query: 157 RTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYI 216 + + N G G R +G+ W+ G NTF D +R G GAE W DYL+LSAN Y Sbjct: 123 QQNQGLVGNAGIGQRWTAGH-WLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQ 181 Query: 217 RASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQK 276 GW+ + ++R A G+D+ A+ YLP + + S+ +EQY+GD+V LF Sbjct: 182 PLGGWEHRAGLL--EQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYH 239 Query: 277 DPHAISAEVTYTPVPLT 293 +P A+ ++YTPVPL Sbjct: 240 NPVAVKVGLSYTPVPLV 256 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 70/302 (23%), Positives = 122/302 (40%), Gaps = 12/302 (3%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTF-TPVMAA-RAQHAVQPRLSMGN 59 +H ++ +R A +++Q P+A P +A + A ++ Sbjct: 12 AHLPARGRRHWYRRHRAGAAGMSAVLAMQAAAPVAYGQGAPTFSATQVADAASNAVAQPG 71 Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA- 118 T + +A G + D F+ A A+AN +Q+ + Sbjct: 72 AVETRVAQTIQALAQAREAGGARQDGRASLDG--QFLRSQAQAQANVLVQQGVQWANETG 129 Query: 119 -----RVKLNVDKDFSLKDSSLEM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 R++ NV DFS +D ++++ + ++ L Q H + R N G R Sbjct: 130 LPWLRRLEGNVSYDFSGRDVAVDVRTIDALHLDQDRALLLQLGGHNQNHRPTVNAGVVAR 189 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 +G+ + G N F+D+++ + H R +GAE L N Y SGWK + E + Sbjct: 190 SAAGSSLILGGNAFLDYEVGKRHLRGSLGAEAVAAQFTLYGNVYAPLSGWKAAKRAERRE 249 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA GWD+ A L + Y ++ G +V F + +++P + Y PVP Sbjct: 250 ERPAAGWDVGFTARPEAVQGLALNAQYFRWRGAQVDYFDDGRYRRNPSGFKYGIEYRPVP 309 Query: 292 LT 293 L Sbjct: 310 LI 311 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 220 bits (560), Expect = 5e-56, Method: Composition-based stats. Identities = 82/259 (31%), Positives = 118/259 (45%), Gaps = 11/259 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFI----- 96 A A+ A + TV ++ + K +A A + G + + R Sbjct: 80 PFADPARFAKMQQQLPELGTVHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVF 139 Query: 97 ---TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG 153 A +A E ++ L YG A V L + D S SS +++ P D + + F+Q Sbjct: 140 NRFRDAAKQRAASEGEQLLSPYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQL 199 Query: 154 AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSAN 213 I +++ + N G G R +G W G N F+D L R +GAE W YL+ SAN Sbjct: 200 GIEQSEYGSVGNAGLGQRWIAG-SWRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSAN 258 Query: 214 GYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDK 273 Y SG + + R A G+DI GYLP + QLG +L YEQY G+ V LF Sbjct: 259 YYQPLSGCRNHSNSA--LMRMARGYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGN 316 Query: 274 RQKDPHAISAEVTYTPVPL 292 +P A+S + YTPVPL Sbjct: 317 AVANPAAVSLGINYTPVPL 335 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 56/268 (20%), Positives = 89/268 (33%), Gaps = 9/268 (3%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQP-----DS 89 LA P+ A + D + +A+ A + + + D Sbjct: 54 LAQALLPLSALAQGAPTLRPARVAQEEAGQDAAWTRKLAAQAESLARRQAERQPGARVDG 113 Query: 90 DATRNFITGMATAKANQEI----QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP 145 D + + + L + L+ D + L + +Y Sbjct: 114 DYLKREAQAQVNDVLRDGVNLARESGLPFLRNLQGGLSHDFESGRTSLQLNTIDEVYRAG 173 Query: 146 TNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 N Q H +DR +N G +R + M G N F+D++ + H R VG E Sbjct: 174 RNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLEVIA 233 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L N Y S WK + +E+PA+G D+ P L S + ++ G E Sbjct: 234 PEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWNGAE 293 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V F + Q V Y PV L Sbjct: 294 VDYFDNGRTQAGAKGFKVGVEYRPVSLV 321 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 209 bits (533), Expect = 7e-53, Method: Composition-based stats. Identities = 73/276 (26%), Positives = 118/276 (42%), Gaps = 15/276 (5%) Query: 26 NISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSS 85 +++Q + P P +AR A R ++ + + +A AG+ S+ Sbjct: 33 VLTLQTVAPAFAQGAPSFSAR--PAQADRQDAADSAMLRVAQTARQLAQR-QAAGSRASA 89 Query: 86 QPDSDATRNFITGMATAKANQEIQEWLGKYGTA------RVKLNVDKDFSLKDSSLEM-- 137 + D D + G A A+AN+ +QE + R++ V+ DFS KD SL++ Sbjct: 90 RVDGD----LLKGQAEAQANELLQEGVRLANQTELPFLRRLQGGVNYDFSNKDLSLDLRT 145 Query: 138 LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRI 197 + ++ + + Q + H + R N G RH G N F+D++ ++H R Sbjct: 146 IDEVHRGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEFGKNHLRG 205 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLM 257 +G E L N Y SGWK + E +ERPA+GWD+ A P L Sbjct: 206 SLGGEVIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALPGLAIKGQ 265 Query: 258 YEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 Y ++ G V F + Q++ V Y PVPL Sbjct: 266 YFRWSGAAVDYFDNGRPQRNARGYKYGVEYRPVPLV 301 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 206 bits (525), Expect = 6e-52, Method: Composition-based stats. Identities = 69/302 (22%), Positives = 109/302 (36%), Gaps = 18/302 (5%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 +SH K + R R A ++ +Q PLAV A+ + R G+ Sbjct: 25 VSHAKGSGRNRRRRAQRAASSAVCLSLGMQAAAPLAVL------AQGAPEMTNRPEAGDI 78 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDAT-RNFITGMATAKANQEIQEW-------- 111 + V VA A + + + + +++ A+ NQ +QE Sbjct: 79 VPSD---VLTQVAVRAQDLARRQADRREGAQVDADYLKQQGQAQFNQFLQEGVRAANESG 135 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L + L D D L + +Y N Q H ++R +N+G +R Sbjct: 136 LRFLRNLQGDLRHDFDNGRTSLELRTIDQVYRKGANTGLLQLGGHNQNNRPTANLGGVYR 195 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 M G N F+D++ ++ H R +G E N Y SGW + + Sbjct: 196 RDINERLMLGANAFLDYEFAKQHLRGSLGVEAIAPEFSFYGNVYAPMSGWTGAKRDNRRE 255 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA+G D+ + P L Y ++ G V F + Q V Y PVP Sbjct: 256 ERPASGMDLGMKYSPGFAPGLSLKANYFRWNGAAVDYFDNGRTQDRATGFKYGVQYKPVP 315 Query: 292 LT 293 L Sbjct: 316 LL 317 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 76/268 (28%), Positives = 112/268 (41%), Gaps = 22/268 (8%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 N D V AG S+ S + + MA++ N Sbjct: 281 KTQKALNDNKKDNNLSKEDQEFSNKVMKVIQTAGAIYDSED-SKSKEEIVKNMASSYLNT 339 Query: 107 EIQEWLGKY-GTARVKLNVDKDFSLK-----DSSLEMLYPIY--DTPTNMLFTQGAIHRT 158 E ++ + +N D F+ + + L PI D P F Q I Sbjct: 340 SANELAKEFIDSLNTSINTDFSFNYNERSGFSGNAKALLPIVSEDNPKISYFLQSGIGEF 399 Query: 159 -DDRTQSNIGFGWRHFSG-------NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 +DRT + G G R++ + M G+N+ DHD SR H R+ +GAE D L Sbjct: 400 ANDRTIGHFGGGIRYYPNATALNNSGNIMLGLNSVYDHDFSRGHKRMSLGAEAMVDTLAF 459 Query: 211 SANGYIRASGWKKSPDIE-DY-QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 +AN Y R S W S D + DY QERPANGWD + + P+ + Q+YG++VG+ Sbjct: 460 NANVYQRLSSWIDSYDFDKDYVQERPANGWDAKIKYAFPSLINVSFFAKMGQWYGNKVGI 519 Query: 269 FGK---DKRQKDPHAISAEVTYTPVPLT 293 FG D +K+P ++Y+P P Sbjct: 520 FGANSVDDLEKNPLIYEGGISYSPFPAL 547 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 185 bits (469), Expect = 2e-45, Method: Composition-based stats. Identities = 57/252 (22%), Positives = 88/252 (34%), Gaps = 6/252 (2%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMAT 101 M + A+ V + +E VA N T S Sbjct: 5 SMPSPARLLTLLLCPTLLPPVAYGSAIESEVA---RNLWTRAQHPDTSPGLAQSALDAGV 61 Query: 102 AK-ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 A Q L L D D SL + + + L Q +H + Sbjct: 62 AAGLQASRQTGLPWLRHLDGGLRYDLDPGRLSFSLRTIDDLMVSERRALMLQAGLHNQNQ 121 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASG 220 R +N G R + + G N F+D++ + H R +G E + L AN Y SG Sbjct: 122 RPTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLGLEAIAPHYSLYANYYAPLSG 181 Query: 221 WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHA 280 WK + +ERPA G+D+ G L + L Y +++G + +F + Q++ Sbjct: 182 WKGARRDSRREERPAAGYDL--GGQLSSDAGLSLQAAYFRWHGAGIDVFDSGRAQRNASG 239 Query: 281 ISAEVTYTPVPL 292 V Y P L Sbjct: 240 FRYGVAYQPGAL 251 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 54/146 (36%), Positives = 79/146 (54%), Gaps = 3/146 (2%) Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 M ++Q + + D SN+G G R + + W+ G NTF D+ L + R G GAE W +Y Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEY 59 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L+LSAN Y + W+ ++R A G+DI A+ LP + + S+ EQY+GD V Sbjct: 60 LRLSANYYQPFADWQTHT--ATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVD 117 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LF +P A+ + YTPVPL Sbjct: 118 LFDSGTGYHNPVALKLGLNYTPVPLL 143 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 57/233 (24%), Positives = 95/233 (40%), Gaps = 11/233 (4%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 A V L+ + +A + + + T L + +A +N K Sbjct: 11 APLIVAVSLTQADALKSALEMQDAQDKAEIMDLSTMLLAGD-VEALKNTAIDGVVEKGVG 69 Query: 107 EIQEWLGKYGTARVKLNVD-KDFSLKDSSLEMLYPIYDTPT--NMLFTQGAIHRTDDRTQ 163 + +L +Y V+LN + S L ++ P+ D N FTQG++ D+RT Sbjct: 70 VTKSFLEQYF-PTVELNFGAQGGSKPSGGLLVVAPLSDPDDIFNTYFTQGSVFYEDNRTT 128 Query: 164 SNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 N+G G+R S N + G+N F DH+ H R +G E +++AN Y + WK Sbjct: 129 LNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGLEARTTVWEINANKYWATTKWK 188 Query: 223 KSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD---EVGLFGKD 272 + +ER +G+DI A LP + Q+ + + G D Sbjct: 189 TGKN--GLEERALDGYDIEAGVPLPYMNWATVFVKNFQWDSEISGSKDIKGND 239 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 157 bits (398), Expect = 3e-37, Method: Composition-based stats. Identities = 64/288 (22%), Positives = 105/288 (36%), Gaps = 37/288 (12%) Query: 28 SVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK--------NVASFAANA 79 Q L + + F +++ A + +N K A+++ Sbjct: 3 ISQALTSITLVFGSILSVSANEYKFEEIKFNQIPNEQNNYEPKDKLDEYIIKGANYSTKF 62 Query: 80 GTFLSSQPDSDATRNFITG------------MATAKANQEIQEWLGKYGTARVKLN--VD 125 +++ D + A AKAN EIQ+ + + V ++ + Sbjct: 63 VPLMNNGAKGDEYTGIMADDLNRLLVDAGFDFANAKANGEIQK-IPFFAQTSVNISGGTE 121 Query: 126 KDFSLKDSSLEMLYPIYDTP----TNMLFTQGAIHRTDD--RTQSNIGFGWRHFSGNDWM 179 D S +SL L + + F+Q + + NIG G R+ + M Sbjct: 122 SDTSFSINSLMKLGELAKDDQGDLKTLAFSQARFATATNAEGSTINIGLGIRNRPDDISM 181 Query: 180 AGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDYQERPA 235 G N F D+ D S +H+R+G+G EY+ + N Y+ + K DYQER Sbjct: 182 VGANAFWDYRMTDYSDAHSRLGLGGEYFWKDFEFRNNWYMAITNEKDVIIKGVDYQERVV 241 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPH 279 GWD+ LP P+L + + D GL G Q PH Sbjct: 242 PGWDLEVGYRLPNNPELAFYIRGFNWDYKYTQDNSGLEGAVSWQATPH 289 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 149 bits (377), Expect = 9e-35, Method: Composition-based stats. Identities = 55/210 (26%), Positives = 84/210 (40%), Gaps = 7/210 (3%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 P A T A + Q L + ++ EK +A+ A + Sbjct: 35 LPAAAWVTQPENDAALLSQQQALPNLGSASVNESGTEKKLATLARQMAEVNQDENTDQTW 94 Query: 93 RNF----ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN- 147 R++ + Q+ + L G V L+VD+ SS ++L P+ D T Sbjct: 95 RSYLLGEAKDRVLDRLQQKSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRG 154 Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRD 206 + ++Q + DD N+G R +G W+ G N F D L++ R +GAE D Sbjct: 155 LTYSQLGLQGVDDGVVGNMGLRQRWNAG-RWLLGYNVFYDQYLNQDASRRGSIGAEARSD 213 Query: 207 YLKLSANGYIRASGWKKSPDIEDYQERPAN 236 YL LS+N Y SG + D ED R A Sbjct: 214 YLTLSSNYYYPLSGMHAANDDEDELLRMAR 243 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 60/233 (25%), Positives = 100/233 (42%), Gaps = 23/233 (9%) Query: 70 KNVASFAANAGTFLSSQPDSD-------ATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + A++AA G + + D + + AN++I++ + + + L Sbjct: 109 QKGANYAALYGPSMVNSNGVDLGGLIQTELSRTLISSGVSYANKQIKK-IPFFAQTTLGL 167 Query: 123 N--VDKDFSLKDSSLEMLYPI-YDT---PTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSG 175 + D + S L I YD P ++F Q + T + Q N+G G R G Sbjct: 168 DAATSSDLTGYLDSFMRLKTIGYDNEGDPMGLMFGQARVTLETSAQPQVNVGLGSRFRLG 227 Query: 176 NDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-DIEDYQ 231 ++ + G+N F D + S ++TR G+GAE + +L N YI S K + DY Sbjct: 228 DEAIVGLNGFWDLRTTNYSTAYTRWGIGAEGFWKSFELRNNWYINGSADKNITINNIDYV 287 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQY----YGDEVGLFGKDKRQKDPHA 280 ER GWD+ +P++PQL + + + D G+ G Q PHA Sbjct: 288 ERVVPGWDVEVGYRIPSYPQLAIFVRGFNWDYQDHSDNSGIEGSVNWQATPHA 340 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 11/198 (5%) Query: 73 ASFAANAGTFLS-SQPDSDATRNFITGMATAKANQEIQEWL-----GKYGTARVKLNVDK 126 A + G LS S DS+ ++ + T+ A+ + + + T V N+ + Sbjct: 15 AVLTMSLGFSLSVSADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDTVEVSTNLKE 74 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTF 185 S D + +L D P + LF Q ++R D RT N+GFG+R + ++ WM GVN F Sbjct: 75 GDSTVD--IGVLKAFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAF 132 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 DH+ H R GVG E L+ N Y +G D + +G D+ + Sbjct: 133 YDHEFPNDHKRNGVGFEVVSSVLESRVNSYNGTTG--YIKDKSGTDSKVLDGRDMGFKVA 190 Query: 246 LPAWPQLGASLMYEQYYG 263 LP P + + Q+ G Sbjct: 191 LPYLPGMMFGMNAVQWKG 208 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 44/171 (25%), Positives = 71/171 (41%), Gaps = 8/171 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSD---ATRNFI-- 96 + A+AQ + + EK+ A A D D R F Sbjct: 70 TIRAQAQDPFDQNRLPDLGMMPESHEGEKHFAEMAKAFSEASMKNNDLDTGEQARQFAFG 129 Query: 97 --TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + + + NQ+++ WL +G+A V +NVD + S P+ D + ++Q Sbjct: 130 QVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSWFIPLQDKQRYLTWSQLG 189 Query: 155 IHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 + + D SN+G G R + + W+ G NTF D+ L + R G GAE W Sbjct: 190 LTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWG 239 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 51/136 (37%), Positives = 65/136 (47%), Gaps = 5/136 (3%) Query: 160 DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 R +++G G+R + GVN F D+DLSR HTR+ VGAEY DY S N Y S Sbjct: 35 GRDFAHLGLGYRQL-DDSQFFGVNVFFDYDLSRQHTRVSVGAEYGLDYGTFSTNAYFPLS 93 Query: 220 GWKKSPD----IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275 WK SPD + E+ A GWD+ E YLP + L QY G V Sbjct: 94 NWKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLDTRWKFGLTAGQYLGRYVEHSDGSLPS 153 Query: 276 KDPHAISAEVTYTPVP 291 K+P+ S + P P Sbjct: 154 KNPYHFSLSTEFRPDP 169 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 133 bits (336), Expect = 5e-30, Method: Composition-based stats. Identities = 48/156 (30%), Positives = 69/156 (44%), Gaps = 7/156 (4%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYD--TPTNMLFTQGAIHRTDD-RTQSNIGFGW 170 K+ TA + L+ + S L ++ PI D N++FTQ ++ +DD R N+GFG Sbjct: 8 KFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFGN 66 Query: 171 RHFSGND-WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIED 229 R +D + G N F DH+L H R +G E L AN Y SGWK + + Sbjct: 67 RKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKS--GLNN 124 Query: 230 YQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 E+ NG D+ LP P + G Sbjct: 125 INEKALNGSDVELGMPLPYLPWTNLYYRSFNWEGAS 160 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 133 bits (334), Expect = 7e-30, Method: Composition-based stats. Identities = 57/222 (25%), Positives = 93/222 (41%), Gaps = 23/222 (10%) Query: 80 GTFLSSQPD---SDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDFSLKDS 133 T L+++ S+ N +A+ K + + + KY A V ++ + + + Sbjct: 86 WTSLNNKNGIEWSNQISNLALNLASNKLSDYATKTIQKYPFVLGASVNFDIRTEGA-TNI 144 Query: 134 SLEMLYPIYD-------TPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTF 185 ++L+ I D + + F + + + N G G RH G + +AGVN + Sbjct: 145 GGDVLFKIADFGLKDDESRDGIAFLHTKYTGSLSNDSTWNAGLGLRHLIGEELLAGVNGY 204 Query: 186 IDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKK-SPDIEDYQERPANGWDIR 241 D+ S SH+R G+G E + L L+ N YI +G K S + DY ER GWD Sbjct: 205 WDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIAGTGTKTISTNNTDYYERVVPGWDFE 264 Query: 242 AEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPH 279 LP+ P + ++ D G GK Q PH Sbjct: 265 LGYRLPSNPNIAFFARGFRWDYRNRNDNTGFQGKVTYQMTPH 306 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 133 bits (334), Expect = 9e-30, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 81/217 (37%), Gaps = 15/217 (6%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN---QEIQEWL- 112 + A+ N G L+ +A + + +A AN ++++ + Sbjct: 14 SALPLSAQEVAKSGKFATIVKNIGNALNIGQGEEAVESEVNTLAVDAANAGLDQVEDKVL 73 Query: 113 --GKYGTARVKLNVD-----KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSN 165 + + + D K+ S + +Y + +T LF Q + ++RT N Sbjct: 74 STSNFTHFELSVGSDTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNNRTTIN 133 Query: 166 IGFGWRHFSG-NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 GFG RH + N + G N F D++L H R+G G E + AN Y S K+ Sbjct: 134 TGFGARHINDANTVITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVS---KT 190 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 QE +G+D + LP + + Sbjct: 191 LTYNGIQETALDGYDAKLTANLPYFYSSNLYGKLSNW 227 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 64/287 (22%), Positives = 102/287 (35%), Gaps = 49/287 (17%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNV----ASFAANAGTFLSSQPDSDAT----- 92 +A + R N+ + + AS+A L+S SD Sbjct: 55 AVAGALEAGQSVRCETLVDADNQSNSTVQKIFVTGASYATRIFPLLNSASLSDGIQKMLW 114 Query: 93 ---RNFITGMATAKANQEIQEWLGKYGTAR--VKLNVDKDFSLKDSSLEMLYPIYDTPTN 147 ++FI A N+ + + + V D D + +SL L + Sbjct: 115 MDSKSFIVSFAHDYLNEYVLKQIPFLSQTEFGVGFESDADMTYYLNSLISLAQLGSDDNG 174 Query: 148 ----MLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDHDLSR---SHTRIGV 199 +LF QG+ + +N+G G R ++ M G N F D+ + S++R G Sbjct: 175 YPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLRDNAMLGANAFWDYRFTNYSSSYSRWGA 234 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDI-----------------------EDYQERPAN 236 GAE W D KL+ N YI +G K+ + ER Sbjct: 235 GAELWWDDFKLTNNWYIAGTGIKRITTSGRAYTDTTSLAAGTYDETTLLGANTFDERVVP 294 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPH 279 GWD+ LP++PQL + ++ D G+ G Q PH Sbjct: 295 GWDVALNYRLPSYPQLSLGIRGFRWDYMRKSDNSGVEGSVNWQATPH 341 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 35/150 (23%), Positives = 64/150 (42%), Gaps = 6/150 (4%) Query: 140 PIY--DTPTNMLFTQGAIHRTDDRTQSNIGFGWRH-FSGNDWMAGVNTFIDHDLSRSHTR 196 P+Y + + +F +G D R + G+RH S N M G N H+ R+H R Sbjct: 68 PVYVSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQR 127 Query: 197 IGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASL 256 I GAE ++++N Y R + WK +++ +E+ G+D+ +P P + Sbjct: 128 ISYGAEIRTSVFEINSNYYHRLTDWK-LTGVDNNEEKARGGYDVELALAVPYVPSAHFRV 186 Query: 257 MYEQYYGDEVGLFGKDKRQKDPHAISAEVT 286 + + G + + D + V+ Sbjct: 187 KHFCWNG--IASNDSNNPIDDLKGNTFSVS 214 >UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN04_9BACT Length = 301 Score = 131 bits (330), Expect = 2e-29, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 67/200 (33%), Gaps = 19/200 (9%) Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV---------DKDFSLKDSSL 135 + + ++ A + + I+ W AR L ++ + Sbjct: 22 ASKAVNQIKDSAINKAFSYGDSAIESW------ARDNLTSLRLIEIETRSREGAKPTFRA 75 Query: 136 EMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH 194 L+ I N + +Q + DD N G +R + + + G+N F DH + H Sbjct: 76 ISLFEIGGNDFNKILSQLSYSTFDDDETINAGLIYRMMNSDMTVIYGLNIFYDHQFNTGH 135 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R G+G E ++ N Y + ++ E A G+D +P P Sbjct: 136 ARTGLGFEMKSSVYDVNINFYEAQTEIHH---VDGVPEVAAGGYDAEIGAQVPYLPWAKV 192 Query: 255 SLMYEQYYGDEVGLFGKDKR 274 Q+ + + + + Sbjct: 193 YYKAYQWNNETLNIKDGETL 212 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 130 bits (327), Expect = 6e-29, Method: Composition-based stats. Identities = 49/201 (24%), Positives = 88/201 (43%), Gaps = 13/201 (6%) Query: 92 TRNFITGMATAKANQEIQEWLGKYGTARVKLNV-DKDFSLKDSSLEMLYPIYDTPTNMLF 150 + A K +++I + G V L+ D D + S+ + I T + F Sbjct: 19 ANADVASQALNKVSEKISNLIPGEGITEVSLDYNDGDEDQLNFSILGVRDIETTDNSNFF 78 Query: 151 TQGAIHRTD----DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 TQ ++ + R NIG G+R S + ++M G NTF D DL+ R+G+G E Sbjct: 79 TQFSLMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRLGLGIEAKG 138 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L L+AN Y + S S + +E+ +GWD +P P + ++ ++ Sbjct: 139 SILDLTANSYTKIS---NSEVVNGDREQVLSGWDFNLTSQIPRAPWARINYNGYKWETEK 195 Query: 266 VGLFGKDKRQKDPHAISAEVT 286 G ++ + +++ +VT Sbjct: 196 ----GSADQKGNIYSLELDVT 212 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 19/186 (10%) Query: 123 NVDKDFSLKDSSLEMLY-----PI-YDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 N +K+ S++ + P + F Q + + G G R Sbjct: 97 NNQSQIQIKNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVYNK 156 Query: 177 DWMAGVNTFIDHDLSRSHTR---IGVGAEYWRDYLKLSANGYIRASGWKKSP---DIEDY 230 + G N H +S ++ I +G EYW LK N Y + S Y Sbjct: 157 KHIIGYNACYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNHHY 216 Query: 231 QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD----EVGLFGKDKRQKDPHAISAEVT 286 + P G+ I A+ P + + +EQ D + + + + H + + Sbjct: 217 YQYPKIGYQICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVSLE 273 Query: 287 YTPVPL 292 Y P+P+ Sbjct: 274 YQPIPM 279 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 52/211 (24%), Positives = 80/211 (37%), Gaps = 11/211 (5%) Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPI--YD 143 + DA I +A + + + GK R L+ K S+E + P+ YD Sbjct: 74 SNVDAVNRAINAVAMSNVSNAMYGAKGKPWMRRTTLSFQFQEGWKPLYSVETVQPLGHYD 133 Query: 144 TPTN-MLFTQGAIHR-TDDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVG 200 + + FTQ I R +D T NIG G+R S + + G + F DH H R+ G Sbjct: 134 NSSRDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHLFYDHRFLNRHNRLSAG 193 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 EY + N Y AS + ER ANG+ + + Sbjct: 194 LEYMSGESEFRFNWYGSASDERVLDVNLHTLERVANGYTVEYGKTFKNARWARVYVEGYH 253 Query: 261 YYG----DEVGLFGKDKRQKDPHAISAEVTY 287 + D+ GL + Q P +S ++ Y Sbjct: 254 WNQERQADKNGLRVGSELQLTPR-VSVDMGY 283 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 117 bits (294), Expect = 4e-25, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 14/149 (9%) Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 L Q I ++ + G G + + GVN F D +++ + R+ +G++Y Sbjct: 124 LVWQANIDYKNEDILISNGIGI--LPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPN 181 Query: 208 --LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 LS+N Y SG D+ N DIRAEG + Q +SL E ++GD+ Sbjct: 182 YIFNLSSNIYFPLSGKGSEDDL-------VNSIDIRAEGAITPTVQFHSSL--EFFFGDD 232 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 + + + H +A + YTP+PL Q Sbjct: 233 IQINDDYDPTNNSHKFTAGLDYTPIPLLQ 261 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 117 bits (292), Expect = 6e-25, Method: Composition-based stats. Identities = 39/180 (21%), Positives = 75/180 (41%), Gaps = 10/180 (5%) Query: 123 NVDKDFSLKDSSLEMLYPIYDT----PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 L++ S++M + Y N+ F Q IH N G G RH + + + Sbjct: 83 TYKSKMQLQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKY 142 Query: 179 MAGVNTFIDHDLSRSHTR---IGVGAEYW-RDYLKLSANGYIRASGWKKSPDIEDYQ-ER 233 G NTF +S+ ++ I VG EYW + L + N Y + + ++ Sbjct: 143 AIGYNTFYHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHY 202 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P +G + + P + + + EQ+ ++ +K+ D + +S ++ Y P+P+ Sbjct: 203 PRSGHQLYIQTKFPRFFEFTGKIKLEQFIYEKKYKKIFNKKNSD-YYLSLDLNYQPIPML 261 >UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root RepID=B0C4D7_ACAM1 Length = 3597 Score = 115 bits (289), Expect = 1e-24, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 80/262 (30%), Gaps = 32/262 (12%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 F + T A ++ G T + N T + SD + Sbjct: 148 FTASPPRTLAEAGWTTAPQVVAINKGTTPSNLPAATSHRLVQAEPNVPTDTKTGEKSDTS 207 Query: 93 RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQ 152 + +A+ + + + + + F + L P + + F Sbjct: 208 NDT-----NTEADTSTNLGIPYFVDTEFRGSTRRQFGGINLRL----PFWQDDQSFAFAD 258 Query: 153 GAIHRTDDRTQ-SNIGFGWRHFSG----NDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 + T N+G +R N W+ G + F D S + + + +GAE Sbjct: 259 VHFEGGSNETFLGNLGLAYRRILNTSNENPWILGTHAFYDSKRSENGFQYHQGSLGAELV 318 Query: 205 RDYLKLSANGYIRAS-----GWKKSPDIEDYQERPANGW-------DIRAEGYLPAWPQL 252 + NGY+ S G + + Q R ANG + E A Sbjct: 319 NKKFEFRVNGYLPGSNPNVVGQRTINGVLGIQPR-ANGLGTNIVQQTLTLEARERALAGF 377 Query: 253 GASLMYEQYYGDEV--GLFGKD 272 + ++ D+V GLFG Sbjct: 378 DFEAGHRHHFNDKVSLGLFGGY 399 >UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N7C0_9GAMM Length = 546 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 48/151 (31%), Gaps = 17/151 (11%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIH-RTDDRTQSNIGFGWRHFS 174 R+ + ++L P++ ++LF DD + NIG RH Sbjct: 31 WNPRIDFEGKLGNDRSIAEADLLIPLWQNNDSLLFANIRGRLDNDDSYEGNIGLALRHML 90 Query: 175 GNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY- 230 N W G + D ++ +G E L AN YI + D D Sbjct: 91 DNGWNLGGYGYFDRRKSPYDNFFNQVTLGVEALSLNWDLRANTYIPVGESSYAEDSLDTV 150 Query: 231 ------------QERPANGWDIRAEGYLPAW 249 +ER G+D +P + Sbjct: 151 DFSGTTITYRAGEERSMRGYDAEVGWRIPVF 181 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 113 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 49/211 (23%), Positives = 81/211 (38%), Gaps = 12/211 (5%) Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPIY--- 142 D+DA + + + + + K R L++ + K +E L P+ Sbjct: 84 SDTDAVNSALQAVVMTGVHSAMHGSKAKPWMQRTVLSLRFQKNWKPLYGVETLQPLGHYD 143 Query: 143 DTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVG 200 +T ++ FTQ + D T +N+G G+R + +D G N F DH +H R+ VG Sbjct: 144 ETSRHVWFTQERLANAADTGTTANVGIGYRRIAENDDHYYGGNLFYDHRFRGNHGRMSVG 203 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 EY N Y SG + S D E +NG+ + + Sbjct: 204 LEYVSGIGAFRMNWYRGVSGER-SLDGATRMENVSNGYTAEYGTSFKNARWARVYMEAYR 262 Query: 261 Y----YGDEVGLFGKDKRQKDPHAISAEVTY 287 + D+ GL + Q P IS ++ Y Sbjct: 263 WQLRRSADKHGLRIGTELQLTPR-ISVDMGY 292 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 41/170 (24%), Positives = 73/170 (42%), Gaps = 8/170 (4%) Query: 82 FLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTAR---VKLNVDKDFSLKDSSLEML 138 S+ + ++ G+A++ + LG+ + + L V + F SL + Sbjct: 29 ISSASSLENRVTSYFNGLASSLGTS-VSSLLGENSRVKYLDLNLGVQEHFK-PTISLTNV 86 Query: 139 YPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRSHTRI 197 I + + +F Q +++ ++ N+G G R +D + G+N F D+ SH R Sbjct: 87 NMISEYGNSAIFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQRN 146 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 G G E L +N Y SG + D E +GWD+R + +LP Sbjct: 147 GAGLEVLSSVFDLRSNIYDATSGIEAVSTSRD--EEAMDGWDMRLDYHLP 194 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 7/143 (4%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + ++ I+ Q FP+A++ TP + + A + +LS +NN Sbjct: 4 MNNTLLDKLRKKKIFSYFIIASQFSFPIALSLTPTIQSYAATVEENKLST-----NTENN 58 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + +A + GT LSS DA ++ A +K N+EI+ W +YG A++ L VDK Sbjct: 59 NGRWLAQQTSQLGTILSSDNTHDAASQYLINQANSKVNREIENWFNQYGKAQINLGVDKH 118 Query: 128 FSLKDSSLEMLYPIYDTPTNMLF 150 F+LK L+ L T ++F Sbjct: 119 FTLKTQKLKSL--FLFTKQTIIF 139 >UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T1E3_JANMA Length = 553 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 50/169 (29%), Gaps = 23/169 (13%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 A A A QE Y + L + P+ ++ F + Sbjct: 22 AGAYAQNAGQEKWSTY----LDLEGKVGSKRDIGEANLFIPVVQDARSLYFANVRARMAN 77 Query: 160 DRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGY 215 + ++G G RH W G F+D + S+ + +G E AN Y Sbjct: 78 GGDFEGSLGGGMRHMLETGWNLGAYGFVDRRRTTYNNSYDQATLGVEALGRQFDWRANVY 137 Query: 216 IRASGWKKSPDIEDY---------------QERPANGWDIRAEGYLPAW 249 + + +ER G+DI A LP + Sbjct: 138 QPFGKKSTTLSSSNTGSVSGGSLFVTTTAQEERALPGFDIEAGWRLPVF 186 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 107 bits (267), Expect = 6e-22, Method: Composition-based stats. Identities = 37/151 (24%), Positives = 67/151 (44%), Gaps = 12/151 (7%) Query: 76 AANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV----DKDFSLK 131 A G +S DA +N + + N + ++ ++G +++V + S Sbjct: 9 ATAGGKGVSE--VLDAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKGDEASYS 66 Query: 132 DSSLEMLYPIYDTPTNMLFTQGAI----HRTDDRTQSNIGFGWRHFSG-NDWMAGVNTFI 186 + + L P+ + + + F QG++ D RT N+G G R + G+N+F Sbjct: 67 IITTQPLAPLSEDGSRL-FWQGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIVGINSFY 125 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIR 217 D++ S H R+ +G EY R +LS N Y Sbjct: 126 DYEFSAKHKRMSLGGEYKRSNAELSVNKYWG 156 >UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA50_9CHLA Length = 531 Score = 106 bits (264), Expect = 1e-21, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 50/174 (28%), Gaps = 19/174 (10%) Query: 112 LGKYGTARVKLNVDKDF---SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIG 167 ++G R + + + P+ F H + R +N+G Sbjct: 266 FSEFGYVRGAYTFGEGIGIRHNYSTLTALFAPLVPYDDYYPFLDLRAHYIKNKRWAANVG 325 Query: 168 FGWRHFS-GNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 G R ++ G N + D+ + + G G E++ + ++ N Y Sbjct: 326 GGLRWRDCMTGFIFGANLYYDYRNTTQTDFNQFGFGLEFFTNCFEMRLNAYFPVGDVTHC 385 Query: 225 PD--IEDY----------QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 D DY E G D+ P YY +V Sbjct: 386 EDHVFSDYIGPYYAVCGLTEIAQKGVDLEVGHTFWKCPYFSVFGAIGGYYYTDV 439 >UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickettsiella grylli RepID=A8PQA2_9COXI Length = 642 Score = 103 bits (258), Expect = 5e-21, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 61/201 (30%), Gaps = 44/201 (21%) Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 + ++P+ + L+ A+ TD++ Q ++G G+R + + G F Sbjct: 43 DYTVGQADAMFPLSGDMSRNLYVDPALSYGTDNQNQFDVGLGYRWITNQAAIVGGYFFGG 102 Query: 188 HDLSRSHTRIGV---GAEYWRDYLKLSANGYIRASGWKKSPD------IEDYQE------ 232 + ++ R+ + G E + N YI + + E Sbjct: 103 YSRVDNNARLWIANPGIEAFGSRWDAHLNAYIPMGDRHYTAGTEIVHFFTGHSEFGRVFL 162 Query: 233 ---RPANGWDIRAEGYLPAWPQ--------------------LGASLMYEQYYGDEVGLF 269 +G DI+A L +P G + E + V L Sbjct: 163 MHQYAGSGADIKAGYQL--FPHSSLKGYLGSYYFSPAETNNVWGGAAGLEYWLTQGVKLI 220 Query: 270 GK---DKRQKDPHAISAEVTY 287 G D +A + + Sbjct: 221 GSYSYDNLHHSTYAFGIGLEW 241 >UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4TV20_9PROT Length = 732 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 30/151 (19%), Positives = 52/151 (34%), Gaps = 21/151 (13%) Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGN 176 V ++ + + + + PI +N+LF + ++ + N G G+R + Sbjct: 34 PSVDVSGKAGETRRIGEVNLFLPIAQDDSNLLFLDLRTSFDNLEQREGNFGLGYRAMQDS 93 Query: 177 DWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY--- 230 W G F D S ++I G E N Y+ +KS ++ED Sbjct: 94 GWNLGAYAFYDRRRSSEGHYFSQITTGLEALGQDFDARINAYLPIG--RKSYEVEDSARV 151 Query: 231 ------------QERPANGWDIRAEGYLPAW 249 ER +G D LP + Sbjct: 152 DLSGGSIQILSGLERAYHGGDAELGWRLPVF 182 >UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BR71_9GAMM Length = 851 Score = 100 bits (250), Expect = 5e-20, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 49/171 (28%), Gaps = 19/171 (11%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 F + + + + + + V + D +L PIY T + +LFT+ Sbjct: 14 FALSITFTEHSLASSDKWDPWLESGVSIGTDNS---SRGEAALLLPIYQTDSGLLFTELR 70 Query: 155 IHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKL 210 D + + N+ G+R N W G+ D + + G E Sbjct: 71 GKLFDAGSKEGNLALGYRKMINNRWAIGMWVGRDIRTSEYGNRFHQEAWGLEALHPNWDF 130 Query: 211 SANGYIRASG------------WKKSPDIEDYQERPANGWDIRAEGYLPAW 249 N Y S I E P +G+D Sbjct: 131 RINAYNALSSAQAYPQPVEAELIGNQLFITSAAEVPLSGYDFELGHRFSVL 181 >UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C087_9PLAN Length = 849 Score = 99.3 bits (246), Expect = 1e-19, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 56/185 (30%), Gaps = 23/185 (12%) Query: 102 AKANQEIQEWLGKYGT-------ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + A + E+ ++ A + + P+ ++ F Sbjct: 18 SYAQDPVPEYQPEWFQEEDYLYRAYFDFTGQAGGVNDNGQGLLFIPLAQDEESLFFADLR 77 Query: 155 IHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKL 210 + DD + + N G +R + W+AG+ F D S + G E Sbjct: 78 GNIFDDSSAEGNFGLAYRRMVNDQWIAGMYGFYDVRRSQYSNIFRQGSFGFELLSIEWDF 137 Query: 211 SANGYIRASGWKKSPDIEDY------------QERPANGWDIRAEGYLPAWPQLGASLMY 258 NGY+ + ++ + +ER G D L ++P+ Sbjct: 138 RVNGYVPSQKQQRVDSLNTAYLSGNNIVMRAGEERAYWGTDFEVGRLLKSFPESNLDAEL 197 Query: 259 EQYYG 263 Y G Sbjct: 198 RGYVG 202 >UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VI48_9CYAN Length = 908 Score = 98.9 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 34/188 (18%), Positives = 59/188 (31%), Gaps = 26/188 (13%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAI-- 155 +A +A E + L + + LE P+ TP N+ F +G + Sbjct: 22 LAQTEAESETADTLRIKPRLGIGHTSSGGGFDGFTRLEGFVPLLQTPGKNLTFLEGRLFL 81 Query: 156 HRTDDRTQSNIGFGWRHFSGNDW-MAGVNTFIDHDLSRSH--TRIGVGAEYWRDYLKLSA 212 D N+ G+R +S N + G D+ + + ++G+G E Sbjct: 82 DNDDANLGGNLILGYRTYSANSHRIWGGYMSYDNRHTGHNTFNQLGLGIESLGTVWDFRV 141 Query: 213 NGYIRASGWKKSPDIED-----------------YQERPANGWDIRAEGYLPAWPQLGAS 255 NGY+ ++ +E GWD L ++G Sbjct: 142 NGYLPIGDTRQGVGDAGVRDIFFRRNFLILEQGQNKEAAMGGWDAEVGAKL---ARIGID 198 Query: 256 LMYEQYYG 263 Y G Sbjct: 199 GDLRGYGG 206 >UniRef50_A8PQI7 Putative outer membrane autotransporter barrel domain n=5 Tax=Rickettsiella grylli RepID=A8PQI7_9COXI Length = 1171 Score = 97.8 bits (242), Expect = 5e-19, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 58/197 (29%), Gaps = 38/197 (19%) Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGN 176 AR NV + + P+ + + A+ + ++G G+R Sbjct: 34 ARFSGNVYGSTKYVVGQADAMLPLVGDAQHNFYIDPALTSGSNWEGHGDLGLGYRWIQNG 93 Query: 177 DWMAGVNTFIDHDLSRSHTRIG---VGAEYWRDYLKLSANGYI----------------R 217 + G F +++ ++ RI G E NGY R Sbjct: 94 SAILGGYLFGEYNRMDNNVRIWTMNPGIEALGSRWDAHLNGYFVMDNRSKVVGTDLEFVR 153 Query: 218 ASGWKKSPDIEDYQERPANGWDIRAEGYL-PAWPQ-----------------LGASLMYE 259 G ++ D + NG D++ L P P LG ++ E Sbjct: 154 FRGHSAVYNLFDVTQNVGNGGDVKLGYQLFPKTPLKAFVGSYFFSPAETKNILGGAVGLE 213 Query: 260 QYYGDEVGLFGKDKRQK 276 + V +F K Sbjct: 214 YWANRNVKVFASYTYDK 230 >UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFX4_PLALI Length = 1567 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 57/182 (31%), Gaps = 20/182 (10%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-N 165 + E + + +S+ P + +++FT T+ N Sbjct: 82 SVDEIFNPIFRVDARGGQLYGYDEGYTSVGGFLPFFRDENSLIFTDIRGLMTNGGKGGAN 141 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R F D + GV+ + D D + GV E YL NGY+ + Sbjct: 142 VGVGYRQFVPELDRIFGVSGWYDFDNGHREAFNQFGVSFESIGRYLDWRVNGYLPVEDNE 201 Query: 223 KSPDI----EDYQER------------PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 + + +Q G+D G P + G S YY Sbjct: 202 EISNQILGAAGFQNNFILLNRGRSVDSAYKGFDTEIGGPFPILGRYGMSGYVGMYYYANT 261 Query: 267 GL 268 + Sbjct: 262 DV 263 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 68/184 (36%), Gaps = 26/184 (14%) Query: 120 VKLNVDKDFSL-----KDSSLEMLYPIYDTPTNM-LFTQGAIHRTDDR-TQSNIGFGWRH 172 + LN SL + M++P + + +F G D ++G G RH Sbjct: 80 LNLNYTFGKSLGCQKSYGTFGGMIFPFFSSCRPFQIFLDGKAFLFDHGKWGGSVGIGLRH 139 Query: 173 FSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWK-------- 222 FS N WM G+N + D+ ++G+G E D ++ NGY+ + + Sbjct: 140 FSYNGWMVGLNGYYDYRRFNGWDLNQLGLGVELLGDCVEFRVNGYLPVNKNRWDQCCLFN 199 Query: 223 -KSPDIEDYQER--PANGWDIRAEGYL--PAWPQ-LGASLMYEQYYGDEV---GLFGKDK 273 +ER +G D +L P+ Q +G + YY F D+ Sbjct: 200 YSGSYFATLRERGYVWSGLDTEIGTWLVKPSCCQDIGLYVAAGPYYYRRSHDQDFFFHDQ 259 Query: 274 RQKD 277 + Sbjct: 260 KHHT 263 >UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RBA5_9CHLA Length = 306 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 62/178 (34%), Gaps = 22/178 (12%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDS--SLEML-YPIYDTPTNMLFTQGAIHR-TDDRT 162 + EW+ A ++ V K ++ S + P+ D+ + F IH +R Sbjct: 44 QANEWVFPPTLAYLQGVVGKGIGEQNGYASFGIFTIPLLDSNGQLFF-DARIHNLRHERW 102 Query: 163 QSNIGFGWRHFSG-NDWMAGVNTFIDHDLSR-SHTRIGVGAEYWRDYLKLSANGYIRASG 220 +N+G G R + G+N F D+ +R + ++G G E NGY Sbjct: 103 AANVGVGTRIAIPCTNLFFGINFFYDYRRTRHDYHQLGPGLELIHPCWAFRINGYFPICD 162 Query: 221 ---WKKSPDIEDYQ---------ERPANGWDIRAEGYLPAW-PQLGASLMY--EQYYG 263 K + + +G D+ E L W P L + Y+ Sbjct: 163 RSLRKHPKVFRFHDNLFAACTQIQNSLSGGDLELETSLRRWDPCLCFDVYIAPGGYFY 220 >UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6INS3_RHOCS Length = 922 Score = 92.4 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 58/191 (30%), Gaps = 32/191 (16%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 G +A A+ + +++ + GT + S+ + P+ D+ F Sbjct: 2 TALGAGSAAADPALMDFVLRPGT-----------DGAEGSIAVAIPLADSDAARTFLDLR 50 Query: 155 IHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKL 210 D DR +NIG G R G + G + D DL ++ V + L L Sbjct: 51 GSIDDADRRVANIGIGHRFRLG-AVVLGGAVYYDRVRTDLESDFSQATVSLDLMTADLDL 109 Query: 211 SANGYIRA----------------SGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 AN Y SG I +E G+D L A Sbjct: 110 RANYYAPLDDEESVGTTVAGAPRLSGNHIVRSIFQPREVTLKGFDAEVGYRLGAIEGYDV 169 Query: 255 SLMYEQYYGDE 265 Y + Sbjct: 170 RAFAGGYRYTD 180 >UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8X2_9PLAN Length = 1606 Score = 91.6 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 39/155 (25%), Positives = 54/155 (34%), Gaps = 22/155 (14%) Query: 133 SSLEMLYPIYDTPT-NMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHD 189 S+L +L P P +MLF TD N+G GWR ++ N D + V + D+D Sbjct: 142 SNLGVLMPFTINPEQSMLFLDLRAMVTDQGAGGVNLGAGWRAYNDNLDKIFTVAGWYDYD 201 Query: 190 LSR--SHTRIGVGAEYWRDYLKLSANGYIR-----------ASGWKKSPDIEDYQERPAN 236 + ++G+ E YL NGY SG Y R Sbjct: 202 DGHYQDYHQLGLSGEVIGQYLTTRVNGYFPINNNEIIISNNLSGSAYFQTDRIYLNRTRR 261 Query: 237 ------GWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 G D G LP + G YY + Sbjct: 262 SESSYGGVDAEVGGPLPVLGKFGIDGYVGGYYYNS 296 >UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDV5_NEOSM Length = 696 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 57/177 (32%), Gaps = 17/177 (9%) Query: 101 TAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD 160 + N + +G T + + ++ S L P+ N+++ D Sbjct: 148 QSDLNNTSRHTVGARFTVTNEFSDSNGGAVSMSEFGALLPLLSKVDNLIYIDLKSKLYDA 207 Query: 161 RT-QSNIGFGWRHFSGNDWMAGVNTFIDHDL-SRSHTRI-GVGAEYWRDYLKLSANGY-- 215 + + + G +R G+N F D + R +G E + L+ N Y Sbjct: 208 KEGEVSTGIVFRRQMSPLLTGGINVFTDVRFLPEGNYRWYSLGGEIFFKSFSLNGNYYRS 267 Query: 216 IRASGWKKSPDIEDY-----------QERPA-NGWDIRAEGYLPAWPQLGASLMYEQ 260 + + E + ER A NG+D+ L + + S + Sbjct: 268 NKKTTISSVKSFEFHDPDPGKAVIVLDERAAGNGYDLGLGLTLNKYINIHGSAFFFY 324 >UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPY3_9CYAN Length = 1370 Score = 89.3 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 30/261 (11%), Positives = 75/261 (28%), Gaps = 56/261 (21%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 +A N V++L+ ++ A P L + + ++ + + ++ Sbjct: 1 MAIACMNSLVRLLWTSFCFTPLLIPAAIAQTEIPSLPKADAVPESHPSLGSPLQAQTPDS 60 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 + + + ++G + + + L+ Sbjct: 61 PPSTTPDLTTLQIK-------------------PRWG---IGYSTSGAGYDGFTRLDSFL 98 Query: 140 PIYDTP-TNMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--H 194 P+ P + + F +G + + N+ FG R ++ + + + G D + + Sbjct: 99 PLLQNPGSTLTFLEGRLQLDNSANVGGNLLFGHRFYNQSLNRIFGGYLGFDRRDTGNSTF 158 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKK-----------------------------SP 225 ++GVG E + + NGY + Sbjct: 159 HQLGVGVETLGEVWDVRLNGYFPLGDTRDLVDETAFDTGFQLTDRFFSDHFLVIQGKRQR 218 Query: 226 DIEDYQERPANGWDIRAEGYL 246 + E G+D+ L Sbjct: 219 GQVRHFEAAMTGFDLEVGARL 239 >UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLP1_ACAM1 Length = 1022 Score = 87.4 bits (215), Expect = 5e-16, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 60/157 (38%), Gaps = 11/157 (7%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQG 153 + +A+ + ++G + N + + LE P++ P + F +G Sbjct: 24 IAEPQPSTQASDL--RFSPRFG---IGANSPSSGTNTTTRLETFVPVWQKPGRALTFFEG 78 Query: 154 AIHRTDDRT-QSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH--TRIGVGAEYWRDYLK 209 + D NI FG+R +S + + G + D + ++ ++ +G E + Sbjct: 79 RLLLDDQGNPGGNILFGFRQYSDDLKRIFGGHLGFDIRNTDNNTFQQLSLGIESLGKDVD 138 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 L NGY ++ ++ NG D R G + Sbjct: 139 LHLNGYWPVGSTRRQTRQRIFEVLQLNG-DPRFTGNI 174 >UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A8_9CHLA Length = 225 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 41/131 (31%), Gaps = 17/131 (12%) Query: 149 LFTQG-AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 +F D + ++ G G R + + G+NT+ D+ R ++GVG E Sbjct: 8 VFIDLDGYRFNDGKWGASTGIGIRKELSDGCVLGLNTYYDYLRGRGRFSFHQVGVGFEML 67 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDY-------------QERPANGWDIRAEGYLPAWPQ 251 D + NGY+ S S + E G D L + Sbjct: 68 SDCFDVRINGYLPVSEKVHSHQCLSFHYSGTDFHASRCKLEYAYGGLDAEIGKPLLTYYD 127 Query: 252 LGASLMYEQYY 262 YY Sbjct: 128 FDLYGAVGPYY 138 >UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSX1_9GAMM Length = 808 Score = 85.5 bits (210), Expect = 2e-15, Method: Composition-based stats. Identities = 32/191 (16%), Positives = 51/191 (26%), Gaps = 36/191 (18%) Query: 102 AKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-D 160 EW + D S L L P Y + + +D D Sbjct: 19 GSVQAADSEWKP---NTQAYFAAGDDRSYFG--LAGLIPFYQDGKRLGYADLRYSSSDVD 73 Query: 161 RTQSNIGFGWRHFSGND-WMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYI 216 + N+G G+R + N+ + G D S R + ++ GAE D +N Y Sbjct: 74 TDEINLGAGFRSLNENETAIYGFYGSYDLRKSATERDYRQLTFGAELLTDTWDYRSNFYF 133 Query: 217 RASGWKKS-----------PDIEDYQ--------------ERPANGWDIRAEGYLPAWPQ 251 + + E +G DI G L + Sbjct: 134 PTGDDSYQVGNAEDDVTVESEFVGHDLVRTTTTVGGGTIFEEALSGADIEV-GRLLNFDN 192 Query: 252 LGASLMYEQYY 262 Y+ Sbjct: 193 FEMRGYLGAYH 203 >UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN12_PLALI Length = 2615 Score = 84.7 bits (208), Expect = 3e-15, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 57/176 (32%), Gaps = 26/176 (14%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-NIGFGWR 171 G Y R + N + + + L P+ + ++ Q + TD N+G R Sbjct: 45 GTYFDVRNQSNSGVGYQHGFTQIGALTPLLNDGQFLIAPQARLLITDTSKIGVNVGLIGR 104 Query: 172 -HFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASG------- 220 + +G D + G N + D+D S +++IG G E L L AN Y+ Sbjct: 105 VYDAGRDRIWGANVYYDNDETTYSNRYSQIGFGFESLGQNLDLRANAYLPTGSSDKVIGP 164 Query: 221 ---------WKKSPDIED--YQERPANGWDIRAEGYLPAWPQLG-ASLMYEQYYGD 264 + E G D +P + Y+ D Sbjct: 165 NGLSNTLFYTGNQLNFTGSYLSEEALRGADFELG--IPVTQNMSWLRAYGGGYFYD 218 >UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLE3_PLALI Length = 1304 Score = 80.8 bits (198), Expect = 6e-14, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 44/131 (33%), Gaps = 22/131 (16%) Query: 138 LYPIYDTPTNMLFTQG-AIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS-- 193 L P MLF DR +N+G G R++ N D + G N + D+D + Sbjct: 95 LMPYGFIENFMLFGDLRGFRSNSDRYGANVGGGARYYLENYDRIIGANAYFDYDETSGAP 154 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKK---------SPDIEDY-----QER----PA 235 +G G E Y N Y ++ S +D +ER Sbjct: 155 FRDVGFGIETLGRYWDARVNAYFPVGPTEQLLSQSVVTGSQRFQDTRILFDRERIVGLAP 214 Query: 236 NGWDIRAEGYL 246 G+D L Sbjct: 215 KGFDAEFGMPL 225 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 59/166 (35%), Gaps = 39/166 (23%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 T +N+G G+R S ++ GVNTF DH S+ + RI G EY ++ AN Y + Sbjct: 141 GTVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLN 200 Query: 220 GWKKSPD----IEDYQE----------------RPANGWDIRAEGYLPAWPQLGASLMYE 259 K P E Y E + +G+D+ A + Sbjct: 201 STKSEPYNVPLYEGYFEFLLDGGPAGYTVYKSQKALSGYDVSYARTFKNARWARAYVGAY 260 Query: 260 QYYGDEVGLFGKD-----------------KRQKDPHAISAEVTYT 288 + G V G+ Q PH +S +V YT Sbjct: 261 HWNGLGVKTHGEGPALALNVGKSHGWQAGTTLQLTPH-VSLDVGYT 305 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 35/140 (25%), Positives = 53/140 (37%), Gaps = 17/140 (12%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML----FTQGAIHRTDD--RTQSNIGFG 169 G K++ D ++ +SS P Y + + D +IG G Sbjct: 127 GNGGEKISSDAYWNGGESSYIGDDPKYKAAARLAQQPSYLDKGETVQHDSLGVVGSIGAG 186 Query: 170 WRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI- 227 +R S N+ G+NTF D+ +R+G+G EY K+SAN Y S K P Sbjct: 187 YRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANVYHGLSEKKTKPYYF 246 Query: 228 ---------EDYQERPANGW 238 D P +G+ Sbjct: 247 ENSLVIVPRADEFHYPEDGY 266 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 78.1 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 55/126 (43%), Gaps = 18/126 (14%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ R ++ A ++ + N+ Sbjct: 138 LRKLNQFRTFVR---NVRPGDELDV---------------QAQVSEKNLTPPPGNSSGNL 179 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+DF Sbjct: 180 EQQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDF 239 Query: 129 SLKDSS 134 SLK+S Sbjct: 240 SLKNSR 245 >UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZV2_9CYAN Length = 1059 Score = 75.1 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 47/145 (32%), Gaps = 21/145 (14%) Query: 123 NVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAIHRTDDRTQS-NIGFGWRHFSG-NDWM 179 + + SLE PI P + + F +G + D T I G R ++ + + Sbjct: 57 SEGAGYQDPFFSLEGFVPITQNPGSTVTFLEGQLRLFTDSTMGGTILLGQRFYNSTQNRI 116 Query: 180 AGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPD----------- 226 G D + + +IG G E D L N Y+ + D Sbjct: 117 LGGYLSYDTRDTGNSLFHQIGAGFERLGDDWDLRVNAYLPVGERRPEVDESFSLRGFQEN 176 Query: 227 -----IEDYQERPANGWDIRAEGYL 246 E G+DI A G L Sbjct: 177 NLLLNHRQRFEAAMAGFDIEAGGRL 201 >UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQN0_PARL1 Length = 675 Score = 74.7 bits (182), Expect = 4e-12, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 56/180 (31%), Gaps = 28/180 (15%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR-TQSNIGFGW 170 G + A L+ ++D P++ + ++LF + T+ N G+ Sbjct: 31 WGPWIEAGGFLSTERD----RGEATAFMPLFQSGESLLFADVKGKLFSEGVTEGNFALGY 86 Query: 171 RHFSGNDWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 R + D G+ D S + + G E NG++ + K +P + Sbjct: 87 RRMTAWDVNLGLWGGYDIRESVSGNTFDQAAFGIEALAADYDFRLNGFVPLADGKAAPGM 146 Query: 228 EDYQ------------ERPANGWDIRAEGYLPAWPQLG-------ASLMYEQYYGDEVGL 268 + E G++ LP LG L Y D+ L Sbjct: 147 ARVELSGSQILLTGGRELVLGGFEGEVGWRLP-LEALGADRERHEFRLYAGGYRFDDSDL 205 >UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YK40_ANASP Length = 1487 Score = 73.9 bits (180), Expect = 6e-12, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 61/186 (32%), Gaps = 23/186 (12%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS--SLEMLYPIYDTP-TNMLFTQGAIH 156 A+ + Q + T RV + + + +S S E P+ P ++ F QG + Sbjct: 19 ASTVSAQTPASTTAQVFTPRVGVRYTTEGAGYESFSSFEGFLPVLQIPGNSLTFLQGKLL 78 Query: 157 RTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSA 212 +D + NI G R FS + + G + + ++G+G E Sbjct: 79 LDNDSNLATNILLGHRIFSEEANRVIGGYISYSTRDTGKSNFDQLGLGFETLG-VWDFRF 137 Query: 213 NGYIRASGWKKSPDIED---------------YQERPANGWDIRAEGYLPAWPQLGASLM 257 N Y+ +G + + + + E +G D L + Sbjct: 138 NAYLPLNGSENQVEQANLPFFQGDSLMVQRSRFLEVAMSGVDAEVGTRLASLGSGDLRGY 197 Query: 258 YEQYYG 263 YY Sbjct: 198 AGVYYY 203 >UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746965 Length = 1076 Score = 72.0 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 49/143 (34%), Gaps = 30/143 (20%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + D + ++ P++ + +LF + + + ++G G+ Sbjct: 50 TVNAGVKSSDAYTDGNFSIVAPVWSSLGAEGTLSGGVLFLEPYTSYGEGGEIAASLGLGY 109 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 R+ F G N F +D + ++GVG E+ YL Sbjct: 110 RYLFGAQPISALTRKDAPQAGFFEEGVFVGTNVFIDMLDTEADNQFWQLGVGVEFGNRYL 169 Query: 209 KLSANGYIRASGWKKSPDIEDYQ 231 + N YI S + + + + Sbjct: 170 EFRGNYYIPLSDKQVAEQFKTRE 192 >UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PN48_9COXI Length = 607 Score = 71.2 bits (173), Expect = 4e-11, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 60/224 (26%), Gaps = 51/224 (22%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG-AIHRTDDRTQSN 165 + +E L +A V +++ + + L+ + TD + Sbjct: 28 QAREPLPPRFSAEAYTGV-----YTVGRADLMVSLDGDGQHNLYVDPQGGYGTDQEWYGD 82 Query: 166 IGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R S + + G F H + S G E N YI +G Sbjct: 83 VGLGYRWISNDAAIVGWYVFAGHSCVENSSGFWITNPGVEIMGSRWDARINAYIPVAGRS 142 Query: 223 K------------------------SPDIEDYQERPANGWDIRAEGYLPA---------- 248 S + ++ NG D R L + Sbjct: 143 DDLGGIESTTAGPSFFTGHSELRTVSFTAFNEVQQVGNGADARVGYQLFSGVPLKAVVGA 202 Query: 249 ----WPQL----GASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 P G + ++ D V +F + H+ Sbjct: 203 YFFEIPHAENVRGGGAGVDYWFDDYVRVFARYNYDNRQHSQVVG 246 >UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK3_9PLAN Length = 967 Score = 70.4 bits (171), Expect = 7e-11, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 55/201 (27%), Gaps = 36/201 (17%) Query: 98 GMATAKANQEIQEWLG---KYGTARVKLNVDKDFSLKD------SSLEMLYPIYD--TPT 146 G+ + N ++ E G +G R + SS + +P+ + Sbjct: 31 GVPQEEINGDVSELFGDSGWFGRYRPHFGYRYEAGDTIGRIGGLSSFDAFFPLLEGEDSD 90 Query: 147 NMLFTQGAIHRTDDR--TQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGA 201 + F + DD SN+G G R + G + D + S ++ G Sbjct: 91 WLTFIDARLLLGDDNHNLGSNVGVGARQYIPEYQRTIGAYIYYDTRDAGYASFDQVSGGI 150 Query: 202 EYWRDYLKLSANGYIRASGWKKSP--------------------DIEDYQERPANGWDIR 241 E D N Y+ + Y + G D+ Sbjct: 151 ETLGDIWDARLNWYVPTGQTRNQYATTHTSGGSYKFVGHYLTGGTFTRYYQAAMKGLDME 210 Query: 242 AEGYLPAWPQLGASLMYEQYY 262 A + + Y+ Sbjct: 211 AGAKFYSNESMDLRAYAGWYH 231 >UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E34 Length = 1016 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 52/148 (35%), Gaps = 32/148 (21%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQSN- 165 GT L + D ++ P+Y T ++LF + + + ++ Sbjct: 51 YLGTVTAGLKTSD--AYTDGHFSIVAPLYSTLGADATLEGSVLFIEPYVSYGEGGEIASS 108 Query: 166 IGFGWRH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEY 203 +G G+RH F G + F +D + + ++GVG E Sbjct: 109 LGLGFRHLFGSQPLTALSANNTAQAGFLDEGVFVGSSVFVDMLDTEANNQFWQLGVGIEA 168 Query: 204 WRDYLKLSANGYIRASGWKKSPDIEDYQ 231 Y+++ N YI S + + + + Sbjct: 169 GTRYVEVRGNYYIPLSDKQLAEETRTRE 196 >UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK4_9PLAN Length = 786 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 46/158 (29%), Gaps = 28/158 (17%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPI--YDTPTNMLFTQGAI--HRTDDRTQSNIGF 168 +G R + SSL+ P+ + + F + + SN+GF Sbjct: 53 PHFGY-RYQAGDTIGRIGGLSSLDGFLPLLEAEDGNWLTFLDARLLLDDQNQNLGSNVGF 111 Query: 169 GWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP 225 G R + G + D + R+ +++ G E D N Y+ + Sbjct: 112 GARQYLPEWGRTIGGYVYYDTRDTGTRNFSQVSGGIETLGDLWDARLNWYVPTGSRRSLV 171 Query: 226 D--------------------IEDYQERPANGWDIRAE 243 + Y + G D+ A Sbjct: 172 GTSHTVGGPSQFIGHYLYGGILTRYYQAAMTGVDMEAG 209 >UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZRN5_9PLAN Length = 792 Score = 69.7 bits (169), Expect = 1e-10, Method: Composition-based stats. Identities = 35/190 (18%), Positives = 61/190 (32%), Gaps = 28/190 (14%) Query: 84 SSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEML---YP 140 S+Q D + + + A+ G+Y R+ + + + D S P Sbjct: 24 SAQQAGDDIQPGLISGTSTFASPYANGQGGEYF-PRISVQHRTEGAGYDYSFTDFRAWVP 82 Query: 141 IYD--TPTNMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHT- 195 +Y+ ++ F GA +D+ N G R +S N G D+ + + T Sbjct: 83 LYESYDSKSLTFFDGAFLLANDQNVGMNAVVGQRFYSDNYGRTFGGYVGYDNRDTGNQTV 142 Query: 196 -RIGVGAEYWRDYLKLSANGYIRAS-----------------GWKKSPDIEDYQERPANG 237 ++ G E + NGY + G+ E G Sbjct: 143 GQVVTGFESLGR-IDFRVNGYFPTTSDPTMTGQTGFFDPTYVGYNIQLSQLTQYEVAMKG 201 Query: 238 WDIRAEGYLP 247 +D G LP Sbjct: 202 FDAEIGGALP 211 >UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA61_9CHLA Length = 188 Score = 68.1 bits (165), Expect = 3e-10, Method: Composition-based stats. Identities = 23/111 (20%), Positives = 38/111 (34%), Gaps = 12/111 (10%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLY------PIYDTPTNMLFTQGAIH-RTDDRTQSN 165 +Y + D S+ L P+ +F+ H T N Sbjct: 22 NEYFKTYLSYKGGNDGLGYHSNYASLDLMCFPLPL---EDITIFSDLKGHWLTRHHYAVN 78 Query: 166 IGFGWRHFSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANG 214 G G+R + N F DH S + ++G+G E + + +L NG Sbjct: 79 AGVGFRKIYAPQTIWDANLFYDHPKSSYDHYNQVGLGLELFHELWELRLNG 129 >UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZT8_9GAMM Length = 785 Score = 67.4 bits (163), Expect = 6e-10, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 66/205 (32%), Gaps = 33/205 (16%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 G R LNV + + L P+ ML+ GA+ T T +G G+R Sbjct: 35 WGGPWKPRQTLNV-QGGHGMQDYYDALLPLSGNAERMLYANGALAATHHETGGELGLGYR 93 Query: 172 H-FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS-------- 219 H N+++ G + ++ +G E++ + A+ Y+ S Sbjct: 94 HIILNNEYVIGGFALMGRYQTNYHNMFNQLTLGTEFFGSIWEGRAHLYLPVSRRTKFVRS 153 Query: 220 --------GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK 271 G K E G D+ +P P+L YY + +G K Sbjct: 154 RSEGLSFQGHKLFGIQTTTYEHAEGGADVEIGHVIPGIPKLRGFA---GYYNNGLGNEHK 210 Query: 272 D---------KRQKDPHAISAEVTY 287 + R + + +Y Sbjct: 211 NINGGYGRFEYRYNNHFTFTLGDSY 235 >UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174607D Length = 975 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 50/143 (34%), Gaps = 30/143 (20%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + + ++ P++ T ++++ + + + ++G GW Sbjct: 91 TVTSGVKTSDVYTEGNFSIVAPVFSTLGADATLSGDVIYLEPYTSSGEGGEIAASLGLGW 150 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 RH F + G N F +D + + ++GVG E YL Sbjct: 151 RHLFGSQPVSALTRKDAPQASFLEEGFFVGANLFIDMLDTEANNQFWQLGVGIEAGTRYL 210 Query: 209 KLSANGYIRASGWKKSPDIEDYQ 231 ++ N YI S + + + Sbjct: 211 EVRGNYYIPLSDKQLAEQTRTRE 233 >UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QR03_CYAP0 Length = 1985 Score = 62.4 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 45/132 (34%), Gaps = 11/132 (8%) Query: 114 KYGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAI--HRTDDRTQ-SN 165 +Y T RV + ++ + E +PI + FT+G + D +N Sbjct: 97 RYFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLASTHDAEDIRAN 156 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 G R +S + D + G D + + GVG E + N YI + Sbjct: 157 FLVGHRLYSQDHDRVYGAYIGYDLRDTKYNKFNQFGVGLETLGSFWDARFNAYIPLGTTQ 216 Query: 223 KSPDIEDYQERP 234 + + P Sbjct: 217 QQIGQTNTDLNP 228 >UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K1T2_CYAP8 Length = 1873 Score = 62.4 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 45/127 (35%), Gaps = 11/127 (8%) Query: 114 KYGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAI--HRTDDRTQ-SN 165 +Y T RV + ++ + E +PI + FT+G + D +N Sbjct: 97 RYFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLASTHDAEDIRAN 156 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 G R +S + + + G D + + GVG E D+ N YI + Sbjct: 157 FLVGHRLYSQDHNRVYGAYIGYDLRDTKYNKFNQFGVGIETLGDFWDARFNAYIPLGTTQ 216 Query: 223 KSPDIED 229 + + Sbjct: 217 QQIGQTN 223 >UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillonella dispar ATCC 17748 RepID=C4FS48_9FIRM Length = 421 Score = 61.6 bits (148), Expect = 3e-08, Method: Composition-based stats. Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 ++G G+R S N+ GVN F+D + ++ RI G EY ++ AN Y Sbjct: 168 GIIGSVGIGYRRLSRNEHAYVGVNAFVDRAFTGNYNRISGGVEYVNGLNEVYANVYRGLG 227 Query: 220 GW 221 Sbjct: 228 DK 229 >UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellular organisms RepID=B4D818_9BACT Length = 5429 Score = 57.3 bits (137), Expect = 6e-07, Method: Composition-based stats. Identities = 26/103 (25%), Positives = 43/103 (41%), Gaps = 4/103 (3%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGND 177 RV ++ D SL+ L P+ +L+ + +D +IGFG+RH Sbjct: 74 RVTFGLEFYEHQIDESLDTLVPLATPQNGVLYFNPKLSLSDRLNPSVSIGFGYRHLLKAR 133 Query: 178 WMAGVNTFIDHDLSR-SHT--RIGVGAEYWRDYLKLSANGYIR 217 + T + D + H + GVGAE ++ AN Y+ Sbjct: 134 RSSSGETSLRSDYTNFDHHVNQFGVGAEVMSRWVDFRANYYLP 176 >UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=27 Tax=Cyanobacteria RepID=Q0IAR8_SYNS3 Length = 401 Score = 53.9 bits (128), Expect = 7e-06, Method: Composition-based stats. Identities = 37/294 (12%), Positives = 83/294 (28%), Gaps = 50/294 (17%) Query: 31 VLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSD 90 + L + V + A ++ + + +S S Sbjct: 5 LSLGLLASAISVASLPAIAQEDGGAALLRQQRDKLLEQIEQLKQRKEQLEAQIS---GSA 61 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 ++ + N ++ + + + + + P+ ++ F Sbjct: 62 QGKDDAFDLQEISLNDAVK------FNWGFQGALQGAGTPNQAGIGGFLPLSVGENSVWF 115 Query: 151 TQGAI-----HRTDDRTQSNIG-----------FGWRHFSGND-WMAGVNTFIDH----- 188 ++ + N G+R +G+ WM G+N D Sbjct: 116 LDALANANFSDYENNSSIINTDVAGTTISTSSRLGYRWLNGDRSWMYGLNAGYDSRPMNT 175 Query: 189 ------------DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 + S ++ V AE + L+A I ++ + YQ N Sbjct: 176 GGTDTGINVSGTEKSAFFQQVVVNAEAVSNDWNLNAYALIPIGDTEQDLN-SFYQGGALN 234 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPH----AISAEVT 286 + + ++ P+L AS+ Y GD G + + ++A V Sbjct: 235 TYGLDVGYFI--TPELNASVGYYYQNGDLGSADGSGVLGRVAYEISNGLTAGVN 286 >UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C500_9PLAN Length = 1337 Score = 53.5 bits (127), Expect = 9e-06, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 47/142 (33%), Gaps = 24/142 (16%) Query: 144 TPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGV 199 M+F + RT+ G G+R ++ + D + G + + D D S ++ + Sbjct: 145 DDAGMMFGNFRLWRTNRGNLGGGAGLGYRFYNYDTDRIFGTSFYYDRDDSTDKIFQQLAL 204 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ------------------ERPANGWDIR 241 E Y + N Y+ ++ ++E + G+D Sbjct: 205 NVETMGRYWDANGNFYLPIGNREQQLNLEFNDGSQRFSGFNVLYDQTRTIGKSMRGFDAE 264 Query: 242 AEGYLPAWPQLGASLMYEQYYG 263 +P W +L Y G Sbjct: 265 IG--VPIWGELAQQFQARAYAG 284 >UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6_SALET Length = 197 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 13/88 (14%), Positives = 27/88 (30%), Gaps = 4/88 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + AR ++ P P+ + + ++ Sbjct: 67 LKKLNGLRTFARGFDHLQAGDELDVPAV----PLTGGKGDNNRHDARGPFAADRENEDAQ 122 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFI 96 + + A+ AG+FL+S PD A + Sbjct: 123 AQQMVGMASQAGSFLASHPDGQAAAGMV 150 >UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MK14_SALAR Length = 110 Score = 44.6 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 14/29 (48%), Positives = 19/29 (65%), Gaps = 2/29 (6%) Query: 267 GLFGKD--KRQKDPHAISAEVTYTPVPLT 293 G+FG RQ++PHAI+ + Y PVPL Sbjct: 3 GIFGDGEADRQRNPHAIALGLNYPPVPLV 31 >UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=1 Tax=Synechococcus sp. RS9916 RepID=Q05XC6_9SYNE Length = 404 Score = 44.2 bits (103), Expect = 0.006, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 65/223 (29%), Gaps = 46/223 (20%) Query: 112 LGKYGTARVKLNVDKDFSLK--DSSLEMLYPIYDTPTNMLFTQG--AIHRTDDRTQSNI- 166 + K R + + + ++ PI T ++ F D + S+I Sbjct: 29 IRKTTKFRWNVFSKSQGAGTPNQAGGQVFIPISTTRKSIFFLDALATADFGDALSTSSIV 88 Query: 167 -----G--------FGWRHFSGNDWM-AGVNTFID-HDLSRS------------------ 193 G G+R + N + GVN D +S Sbjct: 89 NTPVEGTTFSTSSRIGYRWLNDNGDILFGVNAGYDSRPISTGIPSRYSWAPRSLLQPQDV 148 Query: 194 -HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 +I GAE + + + + + ++ Y + + I E L Sbjct: 149 FFQQIAFGAELVTNNIAIKPYALVPVGKTEDVLNLF-YSGGALDTYGIDIEHSFDEL--L 205 Query: 253 GASLMYEQYYGDEVGLFGKDKRQK---DPHA-ISAEVTYTPVP 291 AS+ Y GD G + +P S V YT P Sbjct: 206 TASIGYYYQQGDLTYANGSGLKSTIAINPAGSFSMGVEYTYDP 248 >UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CT24_9RHOB Length = 771 Score = 43.5 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 38/132 (28%), Gaps = 17/132 (12%) Query: 132 DSSLEMLYPIYDTPTNMLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDH-- 188 + + + +P + + R + Q +I R + W GV F D Sbjct: 34 STGIALSFPFAIEENRATIARLSYGRDEGHNAQLSIEAMRRMTLAHGWTVGVGVFADSST 93 Query: 189 -DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-------------DIEDYQERP 234 D+ +++G+ + R + + N Y+ + + + Sbjct: 94 DDIGNRFSQVGMSGDLQRGIFQANLNAYLPVGTKSHADARYDALAEMDGTIRFKGGRSLA 153 Query: 235 ANGWDIRAEGYL 246 G D Sbjct: 154 LRGLDAEVGARF 165 >UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Campylobacter RepID=Q4HGX9_CAMCO Length = 267 Score = 42.7 bits (99), Expect = 0.015, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 63/168 (37%), Gaps = 14/168 (8%) Query: 51 VQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQE 110 + + + N + A + + ++ + G+ + Sbjct: 13 SLLNADELDNALKNNQNKWQKFNYQATQKAPTIKEENID--FKSALNGILSNVLE----- 65 Query: 111 WLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGW 170 K G + N+D F ++ ++ L +Y+ N L Q + T D + G Sbjct: 66 --NKNGIDKTDGNLD--FQNENVQIKNLNSLYEGENNSLLFQKEFYATQDSYNYSGGLIN 121 Query: 171 RHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEY-WRDYLKLSANGYIR 217 R+ +D++ G+N FID + ++ GAE + ++K +N Y+ Sbjct: 122 RY-EKDDFLLGINGFIDGQKEQKESK-SFGAELGYYQFVKAYSNYYVP 167 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P36943 Putative attaching and effacing protein homolog ... 369 e-101 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 359 6e-98 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 346 6e-94 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 344 2e-93 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 344 2e-93 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 340 5e-92 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 339 6e-92 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 339 8e-92 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 339 8e-92 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 336 4e-91 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 335 1e-90 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 333 4e-90 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 329 8e-89 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 328 2e-88 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 323 4e-87 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 321 2e-86 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 317 4e-85 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 316 5e-85 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 313 7e-84 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 312 8e-84 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 311 2e-83 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 310 3e-83 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 310 4e-83 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 306 4e-82 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 306 4e-82 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 305 1e-81 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 301 2e-80 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 300 4e-80 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 298 1e-79 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 295 1e-78 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 295 1e-78 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 292 1e-77 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 290 3e-77 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 286 8e-76 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 286 9e-76 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 285 2e-75 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 283 4e-75 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 283 6e-75 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 283 6e-75 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 282 1e-74 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 281 2e-74 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 280 3e-74 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 280 4e-74 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 279 1e-73 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 278 2e-73 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 275 1e-72 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 270 4e-71 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 268 2e-70 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 261 1e-68 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 256 8e-67 UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 251 3e-65 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 249 8e-65 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 249 9e-65 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 247 4e-64 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 245 2e-63 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 243 4e-63 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 241 2e-62 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 241 3e-62 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 239 1e-61 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 236 7e-61 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 229 6e-59 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 220 4e-56 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 218 1e-55 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 206 6e-52 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 206 6e-52 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 203 7e-51 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 196 1e-48 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 182 2e-44 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 178 2e-43 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 160 5e-38 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 155 2e-36 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 151 2e-35 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 145 2e-33 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 140 4e-32 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 140 7e-32 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 139 2e-31 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 138 3e-31 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 137 4e-31 UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured b... 137 5e-31 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 136 8e-31 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 136 1e-30 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 136 1e-30 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 133 6e-30 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 131 3e-29 UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylo... 127 3e-28 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 127 6e-28 UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microco... 123 9e-27 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 121 3e-26 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 115 2e-24 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 115 2e-24 UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthin... 115 2e-24 UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachl... 114 3e-24 UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microco... 114 3e-24 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 114 4e-24 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 114 5e-24 UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legione... 114 5e-24 UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magneto... 112 1e-23 UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Plancto... 112 2e-23 UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root Re... 111 3e-23 UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuni... 110 6e-23 UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Plancto... 109 2e-22 UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickett... 109 2e-22 UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachl... 104 3e-21 UniRef50_A8PQI7 Putative outer membrane autotransporter barrel d... 103 7e-21 UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=... 103 7e-21 UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodosp... 103 9e-21 UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickett... 102 1e-20 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 102 2e-20 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 102 2e-20 UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus ... 101 3e-20 UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Plancto... 101 4e-20 UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 100 6e-20 UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma p... 100 8e-20 UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachl... 99 2e-19 UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum ... 98 3e-19 UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorick... 96 1e-18 UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryoc... 96 1e-18 UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Plancto... 96 1e-18 UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Plancto... 95 3e-18 UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastop... 95 3e-18 UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microco... 93 1e-17 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 93 1e-17 UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Plancto... 91 5e-17 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 91 6e-17 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 84 8e-15 UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyano... 79 2e-13 UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanoth... 79 2e-13 UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachl... 77 9e-13 UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=... 76 1e-12 UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellu... 76 2e-12 UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=... 75 2e-12 UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Plancto... 74 7e-12 UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=... 71 3e-11 UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillon... 67 5e-10 UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6... 66 1e-09 Sequences not found previously or not previously below threshold: UniRef50_A5GVG9 Uncharacterized conserved secreted protein n=15 ... 67 5e-10 UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=... 63 1e-08 UniRef50_A5GVB4 Uncharacterized conserved secreted protein n=1 T... 57 9e-07 UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicib... 49 2e-04 UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmone... 44 0.004 UniRef50_Q28JV0 Putative uncharacterized protein n=1 Tax=Jannasc... 44 0.008 UniRef50_Q7V422 Prochlorococcus marinus MIT9313 complete genome ... 43 0.010 UniRef50_Q0IBW0 Putative uncharacterized protein n=1 Tax=Synecho... 43 0.011 UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Ca... 42 0.018 UniRef50_Q0I6I6 Unnamed protein product n=3 Tax=Synechococcus Re... 42 0.019 UniRef50_B3JG90 Putative uncharacterized protein n=1 Tax=Bactero... 41 0.049 UniRef50_C4RKT9 Secreted protease n=4 Tax=Actinomycetales RepID=... 40 0.071 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 369 bits (946), Expect = e-101, Method: Composition-based stats. Identities = 295/295 (100%), Positives = 295/295 (100%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT Sbjct: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV Sbjct: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA Sbjct: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI Sbjct: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ Sbjct: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 359 bits (922), Expect = 6e-98, Method: Composition-based stats. Identities = 134/291 (46%), Positives = 184/291 (63%), Gaps = 10/291 (3%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 H + ++ AR ++ PL A + + Sbjct: 78 HMSLEALRKLNQFRTFARGFDHLQPGDELDVPL---------APLPAVTWAEETPVPASA 128 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + ++ + +A A+ AG FL++ P DA + GMAT A+ E+Q+WL ++GTAR++L Sbjct: 129 SKEDLQAQKIAGIASQAGNFLANSPRGDAAASIARGMATGAASTEVQQWLSQFGTARLQL 188 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 +VD FSLK+S L++L P+Y+ P ++FTQG++HRTDDRTQ+N+G G R F + +M G Sbjct: 189 DVDNKFSLKNSQLDLLIPLYEQPDKLVFTQGSLHRTDDRTQTNLGMGMRWF-NDGYMLGG 247 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF+D+DLSR H R+G+G EYWRDYLK+ AN Y+R + W+ S D DYQERPANGWD+ Sbjct: 248 NTFLDYDLSRDHARMGMGVEYWRDYLKIGANNYLRLTNWRDSKDFADYQERPANGWDMSL 307 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG++PA PQLG +L YEQYYG EV LFGKD RQKDPHAI+ V YTP PL Sbjct: 308 EGWVPALPQLGGNLKYEQYYGKEVALFGKDNRQKDPHAITVGVNYTPFPLL 358 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 346 bits (887), Expect = 6e-94, Method: Composition-based stats. Identities = 129/286 (45%), Positives = 183/286 (63%), Gaps = 17/286 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ P ++ + N+ Sbjct: 93 LRKLNQFRTFARGFDNVRQGDELDVPA---------------QVSENNLTPPPGNSSGNL 137 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+DF Sbjct: 138 EQQIASTSQQIGSLLAEDMNSEQAANMARGWASSQASGAMTDWLSRFGTARITLGVDEDF 197 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S + L+P Y+TP N+ F+Q +HRTD+RTQ N G GWRHF+ WM+G+N F DH Sbjct: 198 SLKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTP-TWMSGINFFFDH 256 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DLSR H+R G+GAEYWRDYLKLS+NGY+R + W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 257 DLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWDVRAEGWLP 316 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWP LG L+YEQYYGDEV LF KD RQ +PHAI+A + YTP PL Sbjct: 317 AWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPLM 362 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 344 bits (882), Expect = 2e-93, Method: Composition-based stats. Identities = 126/289 (43%), Positives = 177/289 (61%), Gaps = 6/289 (2%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTA 64 + A ++ P + + + T Sbjct: 86 TPKALRKLNVLRTFAHGFDNLQPGDELDVPAVMP-----DGKPDSPAKTGDEQAATPPLK 140 Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 D+ +A A+ AGT LS+ PD DA + G +A A+ ++Q+WL ++GTARV+L Sbjct: 141 DDEGAMKMADMASRAGTLLSNSPDGDAALSMARGQISAVASGQVQQWLNQFGTARVQLEA 200 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+ FSLK+S +++L P Y+ +LFTQG++HRTDDRTQ+N+GFG R+F+ +M G N Sbjct: 201 DEHFSLKNSQVDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAP-SYMLGGNI 259 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F D+DLS H+R G+G EYWRD+LKLSANGY+R S W+ SP++++YQERPANGWDIRA+ Sbjct: 260 FGDYDLSHEHSRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANGWDIRAQA 319 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LP+ PQLG L YEQYYG V LFGK+ Q++P AI+A V +TP PL Sbjct: 320 WLPSLPQLGGKLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPLL 368 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 344 bits (882), Expect = 2e-93, Method: Composition-based stats. Identities = 132/286 (46%), Positives = 179/286 (62%), Gaps = 13/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ P A +N + Sbjct: 93 LRRLNQFRTFARGFDNVRQGEELDVPATTLQKSHEQQNAV-----------PPANGENTL 141 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E +AS + GT LS +S+ G A+++A+ + +WL +GTA++ L VD+DF Sbjct: 142 ENQIASTSQRVGTLLSQDMNSEQASGMARGWASSEASGAMTDWLNNFGTAKISLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+S + L+P YDTP +LF+Q +HRTDDRTQ N G GWRHF+ WM+G+N F DH Sbjct: 202 SLKNSQFDFLHPWYDTPDYLLFSQHTLHRTDDRTQINTGLGWRHFTP-SWMSGINLFFDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DLSR H+R G+GAEYWRDYLKLS+N YI +GW+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLSRYHSRAGLGAEYWRDYLKLSSNAYIGLTGWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG L+YEQYYGDEV LF K+ RQ +PHAI+A + YTP PL Sbjct: 321 AWPQLGGKLVYEQYYGDEVALFDKNDRQSNPHAITAGLNYTPFPLL 366 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 340 bits (871), Expect = 5e-92, Method: Composition-based stats. Identities = 141/286 (49%), Positives = 190/286 (66%), Gaps = 12/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 92 LRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM------ 143 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DF Sbjct: 144 --QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK SS E L+P Y+TP N++F+Q +HRTD+RTQ+N G GWR+F+ WM+GVN FIDH Sbjct: 202 SLKSSSFEFLHPWYETPDNLVFSQHTLHRTDNRTQTNHGIGWRYFTS-SWMSGVNMFIDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG ++YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 321 AWPQLGGKVVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 339 bits (870), Expect = 6e-92, Method: Composition-based stats. Identities = 143/286 (50%), Positives = 190/286 (66%), Gaps = 12/286 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 92 LRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM------ 143 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L VD+DF Sbjct: 144 --QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLGVDEDF 201 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK SS E L+P Y+TP N++F+Q +HRTDDRTQ+N G GWR+F+ WM+GVN FIDH Sbjct: 202 SLKSSSFEFLHPWYETPDNLVFSQHTLHRTDDRTQTNHGIGWRYFTS-SWMSGVNMFIDH 260 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLP 247 DL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RAEG+LP Sbjct: 261 DLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRAEGWLP 320 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AWPQLG L+YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 321 AWPQLGGKLVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 339 bits (869), Expect = 8e-92, Method: Composition-based stats. Identities = 118/286 (41%), Positives = 160/286 (55%), Gaps = 7/286 (2%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + Y + AR + ++ P+ ++ A +A Sbjct: 112 QLKQVNAYRIFARGFEHVGVGDEIDIPVDMSSLNTQAGQAPKLSSAMREPSRA------E 165 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 E + G LSS S+A MAT AN+EIQ+WL KYGTARV+LN+DK+ Sbjct: 166 KEAQAVGQLMSVGATLSSTRPSEAAAGMARSMATNAANEEIQQWLSKYGTARVQLNLDKN 225 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FSL +S+L+ P++D+ FTQ D R N+G G R + WM GVN F D Sbjct: 226 FSLSESALDWFIPVWDSANLTAFTQLGARNKDRRNTINLGVGARTLL-DRWMLGVNMFYD 284 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 HDL+ ++R+G+GAE W DYL+LS NGY+R S W +S D DY ER ANG+DIRA +LP Sbjct: 285 HDLTGHNSRLGIGAEAWTDYLQLSTNGYMRLSNWHQSRDFADYDERAANGFDIRANAWLP 344 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A PQLG L+YEQY G+ V LFGK+ Q++P+A++A V YTP PL Sbjct: 345 ALPQLGGKLVYEQYIGENVALFGKENLQRNPYALTAGVNYTPFPLL 390 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 339 bits (869), Expect = 8e-92, Method: Composition-based stats. Identities = 109/285 (38%), Positives = 156/285 (54%), Gaps = 12/285 (4%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ ++ ++ P + + N + + Sbjct: 4 LKEVNQFRSFSKPFIQLGSGDEIDIPRITPLP-----------EKITTAENAKTVSSSQY 52 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 ++ +A T L+ A + +A +AN Q WL ++GTARV+LN+D + Sbjct: 53 KERLAHNLLKGATVLADDNTPLAAASMARSVAVGEANDAAQHWLSQFGTARVQLNLDNNL 112 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK S+ +ML P+YD ++LF+Q + D R NIG G R N WM G N F D Sbjct: 113 SLKGSAFDMLLPLYDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQDN-WMYGANVFFDR 171 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D++ + RIG GAE W DYLKLSAN Y+R + W +S D DY ERPANG+D+R E YLPA Sbjct: 172 DITGKNNRIGFGAEAWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGYDLRVEAYLPA 231 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +PQ+G +L YEQY G+EV LFGKD RQK+P+A +A + YTP+PL Sbjct: 232 YPQIGTNLKYEQYKGNEVALFGKDDRQKNPYAFTAGINYTPIPLI 276 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 336 bits (862), Expect = 4e-91, Method: Composition-based stats. Identities = 128/309 (41%), Positives = 180/309 (58%), Gaps = 19/309 (6%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLA-VTFTPVMAARAQHAVQPRLSM---- 57 H + + ++ P + P + A + Sbjct: 83 HLTPEALRKLNQRRTFTYGFDNLQPGDKLNVPAIKLDDEPDVPAARLDNKANLPAARLDN 142 Query: 58 -------------GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA 104 + D+ + +A A+ AG FLS P+ DA + G TA+A Sbjct: 143 KPDVPAIIWGQEGSAASALGDDAGARKMADVASRAGAFLSDNPNGDAALSLARGEVTAEA 202 Query: 105 NQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS 164 + ++Q+WL ++GTARV+L+ D+ FS K+S ++L P+Y+ +++FTQG++HRTDDRTQ Sbjct: 203 SGQLQQWLNQFGTARVQLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQV 262 Query: 165 NIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 N+GFG R+F+ +M G N F D+DLSR+H+R G+G EYWRD+LKLSANGY+R S W S Sbjct: 263 NLGFGLRYFAP-SYMLGGNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNS 321 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 D +DYQERPANGWDIRA+ +LP+ PQLG L YEQYYG V LFGK+ Q+DP AI+A Sbjct: 322 SDFKDYQERPANGWDIRAQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAG 381 Query: 285 VTYTPVPLT 293 V +TP PL Sbjct: 382 VNFTPFPLL 390 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 335 bits (859), Expect = 1e-90, Method: Composition-based stats. Identities = 121/287 (42%), Positives = 161/287 (56%), Gaps = 8/287 (2%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + ++ A + ++ P + + P + + + Sbjct: 117 SQLKKINQFRKFAHGIDKIGAGDEIDIPHSGSSL-------TKPGSPAAATPLSPHADTS 169 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 E VA G L+S S+A ATA AN EI +WL KYGTA+++LN+DK Sbjct: 170 ERESRVAGQLMGVGRVLASPQSSNAASEMARSWATAAANDEIVKWLSKYGTAQLQLNIDK 229 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 +FSL S+L+ L P YDTPT FTQ D R NIG G R S N W+ GVN F Sbjct: 230 NFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTLNIGIGTRTLSNN-WLFGVNAFY 288 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 DHDLS ++R+G+G+E W DYL+LS NGY+R S W +S D+ DY ERPANG+D+RA ++ Sbjct: 289 DHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSRDLADYNERPANGFDVRANAWM 348 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P PQLG LMYEQY+GD VGLFGKD Q++P+A + V YTP PL Sbjct: 349 PTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGVNYTPFPLL 395 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 333 bits (854), Expect = 4e-90, Method: Composition-based stats. Identities = 122/285 (42%), Positives = 161/285 (56%), Gaps = 16/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + Y ++ A ++ P + N +V Sbjct: 113 LKKLNAYRTFSKPFASLTTGDEIEVPRKESSFF---------------SNNPNENNKKDV 157 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 + +A A AG LS+ SDA N T + N Q+WL ++GTARV+LNVD DF Sbjct: 158 DDLLARNAMGAGKLLSNDNTSDAASNMARSAVTNEINASSQQWLNQFGTARVQLNVDSDF 217 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 L +S+L++L P+ D+ +++LFTQ + D R NIG G R + G+ WM G NTF D+ Sbjct: 218 KLDNSALDLLVPLKDSESSLLFTQLGVRNKDSRNTVNIGAGIRQYQGD-WMYGANTFFDN 276 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 DL+ + R+GVGAE DYLK SAN Y +GW +S D Y ERPA+G+DIR E YLPA Sbjct: 277 DLTGKNRRVGVGAEVATDYLKFSANTYFGLTGWHQSRDFSSYDERPADGFDIRTEAYLPA 336 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +PQLG LMYE+Y GDEV LFGKD RQKDPHA++ V YTPVPL Sbjct: 337 YPQLGGKLMYEKYRGDEVALFGKDDRQKDPHAVTLGVNYTPVPLV 381 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 329 bits (843), Expect = 8e-89, Method: Composition-based stats. Identities = 117/285 (41%), Positives = 157/285 (55%), Gaps = 17/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + Y +R ++ P + V + Sbjct: 97 LKKINIYRTFSRPFTALTTGDEIDIPRKASPFSVDNNKDNRLSV---------------- 140 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E +A A T LS+ + + + A+ + N Q+WL ++GTARV+LN++ DF Sbjct: 141 ENTLAGHAVAGATALSNGDVAKSGERMVRSAASNEFNNSAQQWLSQFGTARVQLNINDDF 200 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 L S+ ++L P+YD ++LFTQ D R N+G G R F GN WM G NTF D+ Sbjct: 201 HLDGSAADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQGN-WMYGANTFFDN 259 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 DL+ + RIGVGAE W DYLKLSAN Y + W +S D DY ERPANG+D+RAE YLP+ Sbjct: 260 DLTGKNRRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYNERPANGYDLRAEAYLPS 319 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +PQLG MYE+Y GD+V LFGKD RQK+PHAI+A V YTP+PL Sbjct: 320 YPQLGGKAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIPLV 364 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 328 bits (840), Expect = 2e-88, Method: Composition-based stats. Identities = 108/291 (37%), Positives = 170/291 (58%), Gaps = 10/291 (3%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + + ++ ++ ++ P+ A Sbjct: 83 NINLQQLRKLNQFRTFSQNFENLQPGDELDIPM---------APLPIVEWDDDKPEIVLP 133 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 ++ + E VA A+ AG F S+ PD + T+ F + T A+ Q+W ++G++++ L Sbjct: 134 SSASENEIRVAQLASQAGKFFSTNPDQEKTKAFARELLTTAASSYAQDWFNRFGSSQIHL 193 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 DK FSLK+S +++L P Y+T N++F+Q ++HR + R ++N+G G R + G M G Sbjct: 194 EADKKFSLKNSQIDLLMPWYETEDNLIFSQTSLHRKEGRIETNLGLGARWY-GEGQMIGG 252 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF D+D+SR H+R+G+G EY RD+LKLSAN Y R SGW+ S D+ D+ RP+NGWD+RA Sbjct: 253 NTFFDYDISRKHSRLGLGVEYRRDFLKLSANSYHRLSGWRSSRDLADHSARPSNGWDVRA 312 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG+LP++P +G L YEQYYGD V LFG Q++P++I+A + YTP+PL Sbjct: 313 EGWLPSYPHIGGKLTYEQYYGDSVALFGTKNLQQNPYSITAGLNYTPIPLV 363 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 323 bits (828), Expect = 4e-87, Method: Composition-based stats. Identities = 118/285 (41%), Positives = 165/285 (57%), Gaps = 15/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + Y ++ ++ P + + + + ++A + Sbjct: 65 LKRVNIYRTFSKPFTALTSGDEIDIPRKASPFSIDSEKNKNADVL--------------L 110 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E +AS T L++ + ++ I A + N Q+WL ++GTARV++NV+ DF Sbjct: 111 ENKLASHVQTGATALATSNAAKSSERMIRSAANNEFNSSAQQWLSQFGTARVQMNVNDDF 170 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 L S++++L PIYD ++LFTQ D+R NIG G R F N WM GVNTF D+ Sbjct: 171 KLDGSAVDVLVPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQNN-WMYGVNTFFDN 229 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D++ + R+GVGAE W DYLKLSAN YI S W +S D DY ERPANG+D+RAE YLP+ Sbjct: 230 DMTGKNRRVGVGAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRAEAYLPS 289 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG LMYE+Y G+EV LFGKD RQK+PHA++A V YTP+PL Sbjct: 290 HPQLGGKLMYEKYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPLL 334 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. Identities = 126/291 (43%), Positives = 167/291 (57%), Gaps = 16/291 (5%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + + + ++ A ++ P Q L+ NT + Sbjct: 37 NLTLAQLKKINQLRTFSKPFAKLQAGDELEIP-------------QAQSNLGLAPENTAL 83 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 T E+N+A A + L+S A + G+A ANQ WL +GTAR++ Sbjct: 84 TDTQTTERNLAKTATTSAQMLNSGD--KAAARQLRGLAVGNANQAANSWLNNFGTARLQA 141 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD L S +ML P YDTP+ M FTQ I R D RT +N+G G RHF + WM G Sbjct: 142 NVDDRGDLDGSQFDMLMPFYDTPSQMAFTQFGIRRIDKRTTANLGIGIRHFIDD-WMVGY 200 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N F+D D++R HTR+G GAEY RDYLKL+ANGY+R S W+ SPD Y ERPA G+D+RA Sbjct: 201 NLFLDRDITRDHTRVGAGAEYARDYLKLAANGYLRLSDWRDSPDFSSYSERPATGFDLRA 260 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 E YLP+ PQLG LMYEQY+G++VGLFGKD RQ++P AI+A + YTP+PL Sbjct: 261 EAYLPSLPQLGGKLMYEQYFGNDVGLFGKDNRQQNPAAITAGINYTPIPLV 311 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 317 bits (811), Expect = 4e-85, Method: Composition-based stats. Identities = 147/287 (51%), Positives = 185/287 (64%), Gaps = 11/287 (3%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 RF S L R VA I QVLFP+ A ++ + Sbjct: 1 MISARFHSSRLTRAVASLCIVTQVLFPV---------ASTAGHRVAAPQAAPAVLSEQDA 51 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 VA A L S +S G AT+ A QEWL ++GT RV L +D+D Sbjct: 52 TAAQVAGMTTQAAGMLQSGMNSRQAAEMARGYATSTAQSAFQEWLSQWGTVRVTLGLDED 111 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 F+LK S+ ++L P +DTP N+LFTQ + HRTDDR Q N G GWRHF+ + +MAGVN F D Sbjct: 112 FTLKGSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFAPD-YMAGVNLFFD 170 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYL 246 HDL+R H+R+G+G EYWRD LKL ANGY+R SGW+ +P+++ DY+ RPANGWD+RAEGYL Sbjct: 171 HDLTRYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARPANGWDVRAEGYL 230 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PA+PQLGA+LMYEQYYGDEV LFGKDKRQ+DPHA +A ++YTPVPL Sbjct: 231 PAYPQLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLI 277 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 316 bits (810), Expect = 5e-85, Method: Composition-based stats. Identities = 124/296 (41%), Positives = 175/296 (59%), Gaps = 11/296 (3%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 M +++ H + + R + +A I +Q+ P ++ + + +A + S Sbjct: 12 MLFFRSTHMRSKTR-----KLLACIQIVLQLAPPSSLIYLS--SVFNANAEEITSSAEKE 64 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 + +VA A AG+ LSS SDA + + T KA QEWL ++GTARV Sbjct: 65 QGNPSDQNASSVAQTAVQAGSLLSSDNASDALGSAVVSAVTGKAASSAQEWLSQFGTARV 124 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 ++ D+ F+L DS L++L P+Y+ N+LFTQ R DDR N GFG+RHF + WM Sbjct: 125 NISTDEHFTLSDSELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHF-NDGWMW 183 Query: 181 GVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWD 239 G N F D +S + H R+G+ E DYL +SANGY+R S W S +DY ER A+G+D Sbjct: 184 GTNVFYDRQVSGNQHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFD 243 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 IRA GYLPA+PQLGA+++YEQY+GD VGLFG D RQKDP+A++ + YTPVPL Sbjct: 244 IRATGYLPAYPQLGANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLV 299 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 313 bits (801), Expect = 7e-84, Method: Composition-based stats. Identities = 125/290 (43%), Positives = 183/290 (63%), Gaps = 20/290 (6%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + + ++ ++ PL + + + + Sbjct: 80 LKKLNQLRKFSKPFEALTTGDEIDIPLIG---------------NNFTTQSLPHSTSSPN 124 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN----QEIQEWLGKYGTARVKLNV 124 + +A A+ G L + P+S+A + A + AN QEI +WL G RVKL+ Sbjct: 125 DSLLAQSASQVGNTLQNNPNSEALNDLARSSALSAANAKAGQEISDWLNGKGKVRVKLDA 184 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D+DFS+K+S L++L P++++ ++M+F+QG++HRTDDRTQSN+G G+R+F+ + + G NT Sbjct: 185 DRDFSVKNSQLDLLVPLWESESHMIFSQGSVHRTDDRTQSNLGLGYRYFA-DSYALGANT 243 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F DHD SRSH+R+G+GAEY R++ KL+ NGY+R S WK SPD ++Y+ERPANGWDIRAEG Sbjct: 244 FYDHDWSRSHSRLGLGAEYQRNFFKLATNGYLRLSNWKDSPDFDNYEERPANGWDIRAEG 303 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 YLP++P LGA L YEQYYGD VGLFGKD +QK+PHAI+ Y+P PL + Sbjct: 304 YLPSYPGLGAKLAYEQYYGDNVGLFGKDNQQKNPHAITFGGNYSPFPLLK 353 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 312 bits (800), Expect = 8e-84, Method: Composition-based stats. Identities = 124/293 (42%), Positives = 172/293 (58%), Gaps = 16/293 (5%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 + F S+ ++ + W+ I +Q+LFPL F PV AA A + T Sbjct: 1 MMKSSIKNNNSFFLSLKSKLIIWSQIVLQILFPLFTVF-PVHAAPAT------TTKETTV 53 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 + +AS S+ +D ++ TGMAT+ A +Q+WL ++GTARV+ Sbjct: 54 AMPYSQELSTLAS---------STASGTDGAKSAATGMATSAAASSVQQWLSQFGTARVQ 104 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 LNVD + + DS++++L P+YD +LFTQ + D RT N+G G R F +WM G Sbjct: 105 LNVDDNGNWDDSAVDLLAPLYDNKKAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFG 164 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F D D + + R+G GAE W +YLKLSAN Y+ + W S D DY E+PA+G+DIR Sbjct: 165 GNVFFDDDFTGKNRRVGFGAEAWTNYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIR 224 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 AEGYLPA+PQLGA LMYEQYYGD+V LF D Q +P A++ ++YTPVPL Q Sbjct: 225 AEGYLPAYPQLGAKLMYEQYYGDKVALFDTDHLQSNPSAVTTGISYTPVPLVQ 277 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 311 bits (796), Expect = 2e-83, Method: Composition-based stats. Identities = 118/285 (41%), Positives = 155/285 (54%), Gaps = 19/285 (6%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + Y A+ + ++ P + V A Sbjct: 94 LKRINIYRTFAKPFTALTVGDEIDVPRKKSPFTVDNNVTVPA------------------ 135 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E VAS AA LS + + N + + Q+WLG++GTAR++ N + DF Sbjct: 136 ENGVASNAAAGAALLSHGDAAKSAENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDDF 195 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 S++++L P+YD ++ FTQ D R NIG G R F N WM G NTF D+ Sbjct: 196 EFDSSAIDVLIPLYDNQKSLFFTQLGGRNKDSRNTINIGAGVRAFLTN-WMYGANTFFDN 254 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D++ ++ R+G+GAE W DYLKLSANGY + W +S D DY ERPANG+D+RAE YLPA Sbjct: 255 DITGNNRRVGIGAEAWTDYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLPA 314 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +PQLG LMYEQY GDEV LFGKDKRQKDPHAI+ + YTPV L Sbjct: 315 YPQLGGKLMYEQYNGDEVALFGKDKRQKDPHAITVGINYTPVSLV 359 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 310 bits (794), Expect = 3e-83, Method: Composition-based stats. Identities = 122/292 (41%), Positives = 168/292 (57%), Gaps = 17/292 (5%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 H ++ ++ + S+L + V WA I +Q+ FPL V P A+ A + +S +T Sbjct: 1 MHEQSIMEKNTLKISLLKKIVIWAQILLQIAFPLLV--LPAHASSGPGATETDMSDASTL 58 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 + + +DA +N T +AT A ++EWL +GTA+V Sbjct: 59 SASLASSAAQ---------------NGADAMKNTATHLATTHAASTVEEWLSHFGTAQVT 103 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 L+VD + + +S+ + L P+YD ++LFTQ I D RT NIG G R F DWM G Sbjct: 104 LDVDDNGNWDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIGLGVRTFYVRDWMFG 163 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F D D + + RIG GAE W +YLKLSAN YI S W S D ++Y E+PA+G+D+R Sbjct: 164 GNVFFDDDFTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGDFDNYNEKPADGYDVR 223 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AEGYLP++PQLGA LMYEQYYGD V LF KD Q +P A++ + YTPVPL Sbjct: 224 AEGYLPSFPQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLNYTPVPLI 275 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 310 bits (794), Expect = 4e-83, Method: Composition-based stats. Identities = 116/291 (39%), Positives = 170/291 (58%), Gaps = 9/291 (3%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAAR-AQHAVQPRLSMGNTTV 62 + ++ + + ++ P+ P++A + A + N V Sbjct: 88 LTLAQLEQINQFRTFPQGFEQVSSGEEIDIPV-----PIIAEQGATKVSVVTPNEVNCPV 142 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +NN + + L+S + + + ++ AN+EIQ+WLG+YGTA+V+L Sbjct: 143 GIENNPQTK--EYVKRVSALLASSDPTTVATDVVRSEVSSTANKEIQKWLGQYGTAQVRL 200 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD FSL++SSL+ L+ YD+ + ++FTQ I D R +N+G G R GN W+ G Sbjct: 201 NVDDKFSLRESSLDWLFSFYDSSSAIIFTQLGIRNKDHRNTANLGLGGRISMGN-WILGA 259 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF D+DL+ ++R+G GAE W DYL+LSAN Y+R + W +S D D+ ERPANG+DIR Sbjct: 260 NTFYDNDLTGINSRLGFGAEAWTDYLQLSANSYMRLNNWHQSRDFIDHDERPANGFDIRT 319 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LP PQLG LMYEQY GD V LFGKDK QK+P+A++A +TYTP PL Sbjct: 320 NAWLPVLPQLGGKLMYEQYSGDSVALFGKDKLQKNPYAVTAGITYTPFPLL 370 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 306 bits (785), Expect = 4e-82, Method: Composition-based stats. Identities = 130/291 (44%), Positives = 187/291 (64%), Gaps = 17/291 (5%) Query: 8 HKQPRFRYS-----VLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + S + + VAW+ I++Q L+P ++FTP ++ ++ + Sbjct: 1 MSNKKISRSNGATGPVNKVVAWSTIALQALYPALLSFTPTISHA--------SAVKASQA 52 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 A+ + ++S AA AG + + +F A+A +E+ EWL KYG AR++L Sbjct: 53 AAEQQELRGLSSLAAQAGRSIENG----HAGSFAANTVPAQATKEVVEWLQKYGNARIQL 108 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD FSLKDS+ + LYP D ++LF+Q ++HRTDDRTQ+NIG G+R+F+ ++ M G Sbjct: 109 NVDDAFSLKDSAFDFLYPWIDKKQHVLFSQTSLHRTDDRTQTNIGMGYRYFTADNSMLGA 168 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N F D+DLSR H R+G G EYWRDYL+ AN Y+R S WK S D++DYQERPA+GWDI Sbjct: 169 NLFYDYDLSRHHARMGAGVEYWRDYLRAGANAYLRLSKWKDSHDLDDYQERPADGWDIYT 228 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +G+LP++PQLGASL YE+YYG VGLFG D Q++P+A + ++YTPVPL Sbjct: 229 QGWLPSYPQLGASLKYEKYYGKNVGLFGSDHLQENPYAFTGGISYTPVPLV 279 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 306 bits (785), Expect = 4e-82, Method: Composition-based stats. Identities = 124/289 (42%), Positives = 179/289 (61%), Gaps = 7/289 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 K +K + + +++ V WANI +Q +FPL++ FTP + A + + Sbjct: 12 AKQLNKNKQLNKTRISKSVVWANIVIQAIFPLSIAFTPAVMAAET------VGASDEKPR 65 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 + + E++ A+ A + L++ + + G A N+ +Q+W ++G+A+V+LN Sbjct: 66 SASQAEQSTANAATRLASILTNDDSAKQASSIARGTAANAGNEALQKWFNQFGSAKVQLN 125 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 +D+ SLK S L++L P+ D+P + FTQ DDR N+G G RHF M G N Sbjct: 126 LDEKLSLKGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQ-MLGYN 184 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F+DHD S SHTRIGVGAEY RD++ L+ANGY SGWK SPD++ Y E+ ANG+D+R+E Sbjct: 185 LFVDHDASYSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSE 244 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 YLP PQLG L+YEQY+GDEVGLFG D RQK+P A++ V YTP+PL Sbjct: 245 AYLPTLPQLGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPL 293 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 135/291 (46%), Positives = 179/291 (61%), Gaps = 9/291 (3%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + + A + V PL MAA+ + G + Sbjct: 81 LTLNQLRELNQLRTFAHGLNGLQPGDDVDVPL-------MAAKDNKNASDAAAPGRSASA 133 Query: 64 AD-NNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + N + VA +A+ AG+FL+S SDA + MAT +A Q+WL +GTARV+L Sbjct: 134 EEGNEQAQKVAGYASQAGSFLASSAKSDAAASMARNMATVEAGGAFQQWLSHFGTARVQL 193 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 + DK+FSLK+S ++L P+YD N +FTQG++HRTD RTQ+++G GWRH S + +M G Sbjct: 194 DADKNFSLKNSQFDLLLPLYDQGDNFVFTQGSLHRTDSRTQASLGAGWRH-STSTYMLGG 252 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N F D DLSR H R G G EYWR++LKL N Y+R SGWK SPD+EDYQERPANGWD+R Sbjct: 253 NLFGDFDLSRDHARAGAGLEYWRNFLKLGVNSYLRLSGWKDSPDLEDYQERPANGWDVRG 312 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + ++P+ PQLG L YEQYYG EV LFG D RQ++PHAI+ + YTPVPL Sbjct: 313 QAWVPSLPQLGGKLTYEQYYGKEVALFGVDSRQRNPHAITVGINYTPVPLI 363 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 301 bits (771), Expect = 2e-80, Method: Composition-based stats. Identities = 127/286 (44%), Positives = 168/286 (58%), Gaps = 22/286 (7%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + S + R + +Q F + F V A Sbjct: 1 MVKTNPSSSQVRRVAVYGLAGLQFFFQVTPAFAGVFQA---------------------- 38 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 E++VA A AG L DA R +T A+ +A + +WL ++GTA+ +L+V D Sbjct: 39 DEQSVAQTAMEAGRVLQGSNSGDAARQMLTSQASGQAADAVTQWLNQFGTAKTQLSVVSD 98 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FSLK SSL++L P Y+TP N+LFTQ + D R +N G G R+F+ N WM G N F D Sbjct: 99 FSLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGWMLGYNVFYD 158 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 D ++ R G+G E WRDYLKLSANGY R S W++SP + DY ERPA+GWDIRAEG+LP Sbjct: 159 VDWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGWDIRAEGWLP 218 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A+PQLG L+YEQYYG+EV LFG+ +RQK+PHAI+A VT+TP L Sbjct: 219 AYPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLL 264 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 300 bits (768), Expect = 4e-80, Method: Composition-based stats. Identities = 110/291 (37%), Positives = 169/291 (58%), Gaps = 16/291 (5%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + + + + V + ++ P+ V+F P+ T Sbjct: 79 NIELSELERINQGRVFLNGIKNIKEGDEINVPV-VSFAPIKWG------------EEETK 125 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + + +AS A + G LS+ S + + T K N IQ W +GTA ++L Sbjct: 126 EQGSGNLQQIASIATDVGNILSNDNISK--NSALLNKITNKVNSHIQSWFENFGTAHIQL 183 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 VDK+FSLK+S LE+L+P+++ + F+QG I DD+ SNIG G+R F N WM G Sbjct: 184 QVDKNFSLKNSQLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDN-WMLGG 242 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N+FID+DL + H+R+G+G EYW+D LKL AN Y+R S W+ S +I DY+ERPANG D+ Sbjct: 243 NSFIDYDLRKEHSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNI 302 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + +LP++PQ+G + YE+YYGD+V LFG++ RQ++PH+ + ++YTP PL Sbjct: 303 KSWLPSYPQIGGDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLM 353 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 298 bits (764), Expect = 1e-79, Method: Composition-based stats. Identities = 109/289 (37%), Positives = 153/289 (52%), Gaps = 17/289 (5%) Query: 5 KTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTA 64 ++ + + ++ P + T Sbjct: 85 PLSKLYKLNQFRSFHKSFYDLSGGDEIDIPAS---------------NNYSFENRPLDTK 129 Query: 65 DNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV 124 +N E A+ A S S + MA++ AN IQ+WL ++GT +L+ Sbjct: 130 VDNNENYSANKTKAAVNV-SESNKSPEALGVASSMASSAANNAIQKWLSQWGTVESQLSF 188 Query: 125 DKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNT 184 D SLK+SSL+ L PIYDT N F Q D R N+G+G RH N WM G+N Sbjct: 189 DSKASLKNSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLGWGVRHVY-NGWMYGLNN 247 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F D+D++ ++ R+G+G E DYL +++N Y+R + W +S D DY ERPANG+D+R G Sbjct: 248 FFDYDITGNNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDFYDYDERPANGFDMRVNG 307 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LPA+PQ+G L+YEQYYGDEVGLFGKD RQKDP AI+A V++TP PL Sbjct: 308 WLPAYPQIGGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSWTPFPLL 356 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. Identities = 122/304 (40%), Positives = 165/304 (54%), Gaps = 18/304 (5%) Query: 6 TGHKQPRFRYSVLARCV-------AWANISVQVLFPLAVTFTPVMA--ARAQHAVQPRLS 56 Q L + N +L P P+ +A + P L Sbjct: 49 QIALQSGLDLRTLRKLNNGSLDKRDELNAGESLLLPANSPLFPLDPLAGKAIASNLPELG 108 Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA--------NQEI 108 MGN V ++ E+ A+ A G + SD +N A +A Q+ Sbjct: 109 MGNDPVPLVSSGEQKTAAAAHAVGAQNWNNMTSDQMKNQAESWAKGQAKAQVVDPLRQQA 168 Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 QE LGK+G A+V L VD + SL S+ + P Y+ + F+Q +HR D+R N+G Sbjct: 169 QELLGKFGKAQVNLAVDDNGSLSKSAFSLFSPWYENDAMVAFSQVGVHRQDNRMIGNLGA 228 Query: 169 GWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 G R G+ W+ G NTF+D D+SR+H+R+G+G E+W D LKL++N Y SGWK S D + Sbjct: 229 GVRFDQGD-WLFGANTFLDQDISRNHSRLGLGLEWWADNLKLASNYYHPLSGWKDSKDFD 287 Query: 229 DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYT 288 DY ERPA G+D+ A+GYLPA+ QLGAS +YEQYYGDEV LFGKD QKDPHA++ V YT Sbjct: 288 DYLERPARGFDVHAQGYLPAYQQLGASAVYEQYYGDEVALFGKDNLQKDPHAVTVGVDYT 347 Query: 289 PVPL 292 P PL Sbjct: 348 PFPL 351 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 295 bits (756), Expect = 1e-78, Method: Composition-based stats. Identities = 115/265 (43%), Positives = 157/265 (59%), Gaps = 17/265 (6%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 +Q+LFP + A A QP +++ T V S A GT ++ Sbjct: 1 MQLLFPFVTS------AYTYAASQPPVAVPVPT---------QVTSLLAAGGT--ETENG 43 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 S+ ++ T MAT A ++EWL +GTA V LN D++ + +SS++ L P+YD ++ Sbjct: 44 SNGLKSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNKKSV 103 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 LFTQ + D RT NIG G R F+ +WM G N F D D + + R+G+GAE W DYL Sbjct: 104 LFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWTDYL 163 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 KL+AN YI + W S D DY E+PA+G+DIRAEGYLPA+PQLGA +MYEQYYG+ V L Sbjct: 164 KLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGENVAL 223 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 F KD Q DP A++ + YTP+ L Sbjct: 224 FDKDHLQNDPSAVTMGLNYTPISLV 248 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 292 bits (747), Expect = 1e-77, Method: Composition-based stats. Identities = 117/274 (42%), Positives = 160/274 (58%), Gaps = 12/274 (4%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + +A I +Q P+A++ + + A LS + DN A A Sbjct: 2 KSMAIMQILLQTALPVALSMSATVRAA-------ELSQNTHSADKDNINSPYSAQM-TQA 53 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 T LSS + A MA+ A +++WL ++GTARV+LNVD + DS+++ L Sbjct: 54 ATALSSGNAAGAGA----SMASGYAGDSVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLA 109 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ MLFTQ + DDR N G G R F ++WM G N F D D + + R+G Sbjct: 110 PLYDSQKAMLFTQLGLRAPDDRVTGNFGLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGF 169 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W + LKLSAN Y+ + W S D +DY E+PA+G+D+RAEGYLPA+PQLGA LMYE Sbjct: 170 GAEAWTNNLKLSANTYLGTTNWHSSRDFDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYE 229 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 QYYGD+V LF KD Q +P A++ V+YTPVPL Sbjct: 230 QYYGDKVALFDKDDLQSNPSAVTVGVSYTPVPLI 263 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 290 bits (743), Expect = 3e-77, Method: Composition-based stats. Identities = 113/264 (42%), Positives = 154/264 (58%), Gaps = 10/264 (3%) Query: 36 AVTFTPVMAARAQHAVQPRLSMGNTTVTA------DNNVEKNVASFAANAGTFLSSQPDS 89 + F + + S +T A + E + + G L++ S Sbjct: 67 SSAFENLHPNNEMESSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLAA---S 123 Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML 149 + M NQEI++WL ++GTA+V LN DK+FSLK+SSL+ L P YD+ + + Sbjct: 124 GRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLF 183 Query: 150 FTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 F+Q I D R N+G G R N W+ G+NTF D+DL+ + RIG+GAE W DYL+ Sbjct: 184 FSQLGIRNKDSRNTLNLGVGIRTL-ENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQ 242 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 L+ANGY R +GW S D DY+ERPA G D+RA YLPA PQLG LMYEQY G+ V LF Sbjct: 243 LAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALF 302 Query: 270 GKDKRQKDPHAISAEVTYTPVPLT 293 GKD Q++P+A++A + YTPVPL Sbjct: 303 GKDNLQRNPYAVTAGINYTPVPLL 326 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 286 bits (731), Expect = 8e-76, Method: Composition-based stats. Identities = 106/238 (44%), Positives = 136/238 (57%), Gaps = 2/238 (0%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 VAS + G LSS+ A GM + + ++EWLG G Sbjct: 109 PNQEEEQQATQQASMVASHLSQVGNSLSSEDRVGAFSRLAKGMLLSSTAKTVEEWLGHIG 168 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A+VKL D S +++ P+YD P + F+Q R D R NIG G RH+ + Sbjct: 169 QAQVKLQADDKNDFSGSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGLGQRHYVSD 228 Query: 177 DWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 WM G N F D +S + H R+G G E RDY+KLSAN Y R GWK S +EDY ER A Sbjct: 229 -WMFGYNIFFDQQISGNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLEDYDERAA 287 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 NG+DIR E YLP +PQLG LMYEQY+GDEV LFG ++RQK+P A++A V+YTP+PL Sbjct: 288 NGYDIRTEAYLPHYPQLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYTPIPLV 345 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 286 bits (731), Expect = 9e-76, Method: Composition-based stats. Identities = 108/285 (37%), Positives = 149/285 (52%), Gaps = 16/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 F + + ++ +S+ ++F + A++ N Sbjct: 1 MYSFFNTLTVTKIISRLILSIGLIFGIFTYGFSQQHYFNSEALEN--------PAEHNEA 52 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 + S + S N M ANQE++ WL ++GT +V +N DK F Sbjct: 53 FNKIISTGTSLA-------VSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKF 105 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+SSL+ L P YD+ + + F+Q I D R NIG G R F WM G NT D+ Sbjct: 106 SLKESSLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQ-SWMYGFNTSYDN 164 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D++ + RIGVGAE W DYL+LSANGY R +GW +S D DY ERPA+G DI + YLPA Sbjct: 165 DMTGHNHRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPA 224 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG L YEQY G+ V LFGKD Q +P+A++ + YTP+P Sbjct: 225 LPQLGGKLKYEQYRGERVALFGKDNLQSNPYAVTTGLIYTPIPFI 269 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 285 bits (728), Expect = 2e-75, Method: Composition-based stats. Identities = 100/266 (37%), Positives = 138/266 (51%), Gaps = 1/266 (0%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 ++ + P + +D + L+ Sbjct: 1 MENEIGGTLINKPGHDMPKLPDMAIMAETSGAKPISDQQFADWGKNLGGQDWNTLNRDKA 60 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 T + + Q+ Q+ LG++G A+V L++D +L S+ + P YD+ + Sbjct: 61 QSKTTQWAKEKIISPLQQQAQDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYL 120 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 LF+Q IH D+R N G G R + + + G N FIDHD SR H R G+GAE DY Sbjct: 121 LFSQINIHHQDNRKIGNFGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADY 180 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 LK SAN Y S WK SPD +DY ERPA G+D+R++GYLPA+PQLG S +YE Y+GDEV Sbjct: 181 LKFSANYYHPLSHWKDSPDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVA 240 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LFGK RQKDP A++ + YTPVPL Sbjct: 241 LFGKSHRQKDPRALTLGIDYTPVPLV 266 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 283 bits (724), Expect = 4e-75, Method: Composition-based stats. Identities = 129/289 (44%), Positives = 180/289 (62%), Gaps = 12/289 (4%) Query: 16 SVLARCVAWANISVQVLFPLAVTF---TPVMAARAQHAVQP----RLSMGNTTVTADNNV 68 + R A+ + +QV+F +F P AA Q ++ +T ++ Sbjct: 2 RIYLRLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEA 61 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 KN+A+ ++ G+ LS +DA N +A +IQ+WL ++GTA+V L +DKD Sbjct: 62 GKNLAAILSSTGSMLSQDNKTDALINSAINNGSAYVTGQIQQWLQQFGTAKVNLGLDKDL 121 Query: 129 SLKDSSLEMLYPIYD-TPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 SL ++SL++L P+YD N+LFTQ R DDR N+G G+R+F+ + WM G+NTF D Sbjct: 122 SLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFA-DRWMWGINTFYD 180 Query: 188 HDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 +S + H R+G+G E +Y KLSANGY R SGWK S + EDYQER ANG+DIRAEGYL Sbjct: 181 RQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDIRAEGYL 240 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 PAWPQLGA L++EQYYGD+V LF D RQ++P+A++A V YTP PL Sbjct: 241 PAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLV 289 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 283 bits (724), Expect = 6e-75, Method: Composition-based stats. Identities = 115/274 (41%), Positives = 157/274 (57%), Gaps = 6/274 (2%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 V AN V P+ + RA + G+ T D ++ Sbjct: 97 HNVEDANAGELVDSPINDAIAININ-RASQNNKNNAGAGSLTKEQDPMDSLSI----RGV 151 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 G+ L++ DA + MAT+ N +I +WL +YGTAR++LN D+DFSL +S+L+ L Sbjct: 152 GSALAASGRVDALHHMARTMATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLL 211 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ T LFTQ D R +NIG G R F ++WM G N F D+D + + R+G+ Sbjct: 212 PLYDSQTLTLFTQQGFRNKDRRNIANIGIGTR-FIHHEWMMGGNAFYDNDFTGDNKRVGL 270 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W D +LSANGY R + W +S D DY ERPANG D+RA G+LPA P LG SL+YE Sbjct: 271 GAELWTDSFQLSANGYFRLTAWHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYE 330 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 Y+GD V LFGKD Q++P+AI+ +YTP L Sbjct: 331 HYFGDNVALFGKDHLQRNPYAITLGGSYTPFSLL 364 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 283 bits (723), Expect = 6e-75, Method: Composition-based stats. Identities = 110/236 (46%), Positives = 152/236 (64%), Gaps = 4/236 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 N + + +A + L+ D + ++K+NQ+I++WL ++G Sbjct: 84 NNQDEAIPSTEGEELAKIIVDNSFLLNKDID---VTQYAISQISSKSNQKIEQWLNQFGH 140 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND 177 ARV L+ DK+ +LK+SS E+L P+Y+ ++F Q HR D R+Q N G G+R+F+ Sbjct: 141 ARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFAQTNYHRKDLRSQFNYGIGYRYFT-EK 199 Query: 178 WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 +M G+N F DHDL+ H R+G+GAE WRDY KLS+N Y R S W+ S +I DY ERPANG Sbjct: 200 FMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSSNHYHRLSSWRASNNILDYSERPANG 259 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 WDIR EGY PA+PQLG L++EQYYG EVGLFGKDKR K+PH + + YTP+PL Sbjct: 260 WDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKDKRDKNPHTYTLGINYTPIPLV 315 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 282 bits (721), Expect = 1e-74, Method: Composition-based stats. Identities = 100/265 (37%), Positives = 140/265 (52%), Gaps = 9/265 (3%) Query: 37 VTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASF--------AANAGTFLSSQPD 88 F Q+ P L N K++ A L+ + Sbjct: 16 AAFAAPEINVKQNESLPDLGSQAAQQDEQTNKGKSLKERGADYVINSATQGFENLTPEAL 75 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 R+++ T+ A I++ L YG R L++ + L SS++ P YD T + Sbjct: 76 KSQARSYLQSQITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSIDYFVPWYDNQTTV 135 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 F+Q + R +DRT NIG G R ++ + ++ G N F D+D +R H R+G+GAE W DYL Sbjct: 136 YFSQFSAQRKEDRTIGNIGLGVR-YNFDKYLLGGNIFYDYDFTRGHRRLGLGAEAWTDYL 194 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 K S N Y S WK S D + Y+ERPA GWDIRAE +LPA+PQLG +++EQYYG+EV L Sbjct: 195 KFSGNYYHPLSDWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKIVFEQYYGNEVAL 254 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 FG D +KDP A++ V Y PVPL Sbjct: 255 FGTDSLEKDPFAVTLGVKYQPVPLI 279 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 281 bits (718), Expect = 2e-74, Method: Composition-based stats. Identities = 108/296 (36%), Positives = 148/296 (50%), Gaps = 7/296 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANIS-----VQVLFPLAVTFTPVMAARAQHAVQPRLSMG 58 K R + ++ + +A A Sbjct: 12 AKKTALFKRLHTLTATDTLESVASGYGLSVDELWALNINLYNNRVAFDAIKYGAVVYVPN 71 Query: 59 NTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA 118 VAS + G+ LSS+ +A G+ + + ++EWLG G A Sbjct: 72 REEEQKATQQASLVASHLSQIGSTLSSESRVEAFSRLAKGVLLSSTAKSVEEWLGHIGKA 131 Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 +VKL VD S L + P+Y+ P + F+Q R D R NIG G RH+ + W Sbjct: 132 QVKLQVDDKNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYLSD-W 190 Query: 179 MAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 M G N F+D +S + H R+G+G E RDY+KLSAN Y R GWK S +EDY ER A+G Sbjct: 191 MLGYNVFLDQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERAASG 250 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +DIR E YLP +PQLG LMYEQY+G+EV LFG ++RQK+P A++A V+YTP PL Sbjct: 251 YDIRTEAYLPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPLV 306 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 280 bits (717), Expect = 3e-74, Method: Composition-based stats. Identities = 94/238 (39%), Positives = 136/238 (57%), Gaps = 3/238 (1%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 N + S L+ P +++I ++ Q+WL ++G Sbjct: 97 KDENHKEDSQNTPPLILSRGPEFLGLLNKDPK-KLAQDYIVNKLNSQITSNTQKWLSQFG 155 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSG 175 TA++ LNVD L +SS+++L P YD + ++++Q D R N+G G R F Sbjct: 156 TAKINLNVDHRGRLDESSVDLLVPFYDDKDHWLIYSQYGYRHKDSRDTVNLGIGTRLFIN 215 Query: 176 NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPA 235 + WM G NTF D+DL+ +++R +G E W +YLK+SAN Y R S W S D+ +Y ERPA Sbjct: 216 D-WMYGANTFYDNDLTGNNSRFSLGGELWTNYLKMSANAYFRLSDWHNSRDLTNYYERPA 274 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 NG+D+ A+ YLPA P LGA + YEQY+GD V LFG + RQKDP+A + V YTP+PL Sbjct: 275 NGYDLIADMYLPAMPSLGAKIKYEQYFGDNVALFGTNNRQKDPYAATIGVNYTPIPLI 332 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 280 bits (716), Expect = 4e-74, Method: Composition-based stats. Identities = 117/288 (40%), Positives = 157/288 (54%), Gaps = 19/288 (6%) Query: 22 VAWANISVQVLFPLA-----------VTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK 70 + A Q++ PL + P++AA +L+ + VT N + Sbjct: 101 MMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD 160 Query: 71 NV----ASFAANAGTFLSSQP-DSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVD 125 A AA+ G+ L S+ + D ++ G+A +A+ ++Q WL YGTA V L Sbjct: 161 KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSG 220 Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 +F SSL+ L P YD+ + F Q D R +N+G G R F + M G N F Sbjct: 221 NNFD--GSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPEN-MLGYNVF 277 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 ID D S +TR+G+G EYWRDY K S NGY R SGW +S + +DY ERPANG+DIR GY Sbjct: 278 IDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGY 337 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LP++P LGA LMYEQYYGD V LF DK Q +P A + V YTP+PL Sbjct: 338 LPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLV 385 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 279 bits (713), Expect = 1e-73, Method: Composition-based stats. Identities = 110/263 (41%), Positives = 160/263 (60%), Gaps = 7/263 (2%) Query: 36 AVTFTPVMAARAQHAVQPRLSM-GNTTVTADNNVEKNVASFAANAGTFLSSQ----PDSD 90 + PV+ A A+ L G+ + +NN E A + GTFLS + S Sbjct: 24 SSAIMPVIPAYAKMLDNKELPSLGSDQIIDENNTEHLAAEYTKTVGTFLSQKKTMKDLSQ 83 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 +++ +++A +EI+ WL K G ++ ++ DK FS+K+S + L P YD +LF Sbjct: 84 IAQDYARNKVSSEATKEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLF 143 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 TQ +HR D+R +N G G R+F + G+N FIDHDLS +HTR+G+G EYW+DYLKL Sbjct: 144 TQHTLHRYDERFHTNNGIGLRYFHEKSTI-GMNAFIDHDLSHAHTRVGLGVEYWQDYLKL 202 Query: 211 SANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 +AN Y + WK + ++ D+ +PA+GWDI+ EG+LP +P LG +L YEQYYGD V LF Sbjct: 203 NANSYFGLTSWKSASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALF 262 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 GK KRQK+P+A + +TP PL Sbjct: 263 GKTKRQKNPNAATIGANWTPFPL 285 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 278 bits (710), Expect = 2e-73, Method: Composition-based stats. Identities = 95/237 (40%), Positives = 143/237 (60%), Gaps = 3/237 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 ++ ++ + + S L+S P +++I ++ Q+WL ++GT Sbjct: 91 EDSHKDGNHPLPPLILSHGTKILGLLNSDPK-KLAQDYIVNKLNSQITSNTQKWLSQFGT 149 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A++ LNVD L +SS+++L P YD + ++++Q D R N+G G R F N Sbjct: 150 AKINLNVDHRGRLDESSVDLLVPFYDDKDHWLVYSQYGYRHKDSRDTVNLGIGTRLFINN 209 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 WM G NTF D+DL+ +++R +G E W +YLK+SAN Y R S W + D+ +Y ERPAN Sbjct: 210 -WMYGANTFYDNDLTGNNSRFSLGGELWTNYLKMSANAYFRLSDWHNARDLVNYYERPAN 268 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G+D+ A+ YLP+ P LGA + YEQY+GD V LFGK+KRQKDP+A + V YTP+PL Sbjct: 269 GYDLIADMYLPSMPSLGAKIKYEQYFGDNVALFGKNKRQKDPYAATIGVNYTPIPLI 325 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 275 bits (703), Expect = 1e-72, Method: Composition-based stats. Identities = 115/303 (37%), Positives = 168/303 (55%), Gaps = 23/303 (7%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 + + +P+ S+L W+ + P++ + AQ L Sbjct: 1 MNPPSSKLKPKLPNSLLLSTAIWSTAIL-----------PMVPSYAQIVHLDDLPTLGGQ 49 Query: 62 VTA------DNNVEKNVASFAANAGTFLSSQ----PDSDATRNFITGMATAKANQEIQEW 111 +++ E+ +A + NA F S + +D +++ A A EI W Sbjct: 50 AIQFEGTQPEDSTERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEITHW 109 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L K G AR+ +N+DK S+K S L+ L P Y+ +LF+Q +IHRTD R Q+N G G R Sbjct: 110 LSKAGNARLNINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLR 169 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDY 230 HF N M GVN F DHDLS H+R+G G EY +DY+++SAN Y+ S W+ + ++ +DY Sbjct: 170 HFQQNS-MIGVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDY 228 Query: 231 QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPV 290 RPANGWDI+ EG+LP + LGA+L EQYYGD+V LFGK++RQKDP A + V ++P Sbjct: 229 NARPANGWDIQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPF 288 Query: 291 PLT 293 PL Sbjct: 289 PLL 291 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 270 bits (690), Expect = 4e-71, Method: Composition-based stats. Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 11/242 (4%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 + + EK A L++ ++A N+ NQ+I +WL +YG Sbjct: 98 KETSQAKQVESAEKQFVQGATQIAQGLANNNATEAAINYARNRGEGLLNQKISDWLNQYG 157 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 ARV+++ +K ++L P+ D P ++LF+Q I + R+ +N+G G+R + N Sbjct: 158 KARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQIGIRANEQRSTTNLGLGYRQYQQN 212 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS--PDIEDYQERP 234 WM G+N+F D+D+S + R G+G E W YLKL+ NGY R + W +S ++ DY ERP Sbjct: 213 -WMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVNGYFRLTDWHQSFLHEMRDYDERP 271 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK---DKRQKDPHAISAEVTYTPVP 291 ANG+D+RAEGYLP++P LGA YEQY+GD V L + +P A++ ++YTP P Sbjct: 272 ANGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHNPTAKDLKDNPSAVTFGLSYTPFP 331 Query: 292 LT 293 L Sbjct: 332 LL 333 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 268 bits (685), Expect = 2e-70, Method: Composition-based stats. Identities = 91/261 (34%), Positives = 136/261 (52%), Gaps = 16/261 (6%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRN 94 T + A A L+ V V+ V + R Sbjct: 84 ATAGQTIWIPAAKPAATTLPLAPATVQVAKPGKVDGKV-------------DDKTTNVRQ 130 Query: 95 FITGMATAKANQEIQEWLGKYG-TARVKLNVDKDFSLKDSSLEMLYPIYDT-PTNMLFTQ 152 F A+++ + WL +G ++RV ++ ++F+ + + ++L P++++ M+F+Q Sbjct: 131 FGQDQLNTLASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMIFSQ 190 Query: 153 GAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSA 212 + DDRT NIG G R+F G WM G N F D+D S S+ RIG+GAE D L+L+A Sbjct: 191 LGVRHADDRTTGNIGLGARYF-GEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRLAA 249 Query: 213 NGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKD 272 NGY + +GW S I D+ ERPANGWDI +LP +PQLG + YEQYYGD V L + Sbjct: 250 NGYFKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALISRG 309 Query: 273 KRQKDPHAISAEVTYTPVPLT 293 + Q +P A + V +TP+PL Sbjct: 310 RLQHNPSAATLGVNWTPIPLV 330 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 261 bits (668), Expect = 1e-68, Method: Composition-based stats. Identities = 108/273 (39%), Positives = 149/273 (54%), Gaps = 18/273 (6%) Query: 32 LFPLAVTFTPVMAARAQHAVQPRLSMGNT------TVTADNNVEKNVASFAANAGTFLSS 85 PL + TP+ A L + E NVAS A + + Sbjct: 88 EAPLNGSTTPLFAPEETSKSITELPDLGSIQNDIDVNNKLPVTEDNVASAATQLWGIMGN 147 Query: 86 QPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP 145 S A + +TG+A A+Q +WLG+YG ARV+LN S + ++L P+ +T Sbjct: 148 DNSSRAAESAVTGVAAGLASQAAADWLGQYGNARVQLN-----SNSIGNADVLIPLTETQ 202 Query: 146 TNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 N+LF Q + +RT +N+G G R F+ + WM GVNTF D+DL+ ++R+GVG E W Sbjct: 203 NNLLFGQLGVRYNGERTTNNVGLGVRSFT-DSWMFGVNTFYDYDLTGKNSRLGVGGEAWT 261 Query: 206 DYLKLSANGYIRASGWKKS--PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG 263 D LK SANGY R + W +S D+EDY ERPANG+D+RAE YLP++PQLG LMYE+Y+G Sbjct: 262 DNLKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYLPSYPQLGGRLMYEKYFG 321 Query: 264 DEVGLFGK----DKRQKDPHAISAEVTYTPVPL 292 V L D P A + + YTP+PL Sbjct: 322 KGVALNSGSTSPDDLGDSPSAFTVGLNYTPIPL 354 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 256 bits (653), Expect = 8e-67, Method: Composition-based stats. Identities = 90/279 (32%), Positives = 133/279 (47%), Gaps = 10/279 (3%) Query: 19 ARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAAN 78 A +AW ++ P P + Q L + D EK A+ A Sbjct: 25 AMGLAWLCGAL----PAYAESPPAPDSVVQQPA-NDLPELGGNASNDAEREKEWATMAKQ 79 Query: 79 AGTFLSSQPDSDA----TRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSS 134 G + S ++ G A++ Q+ QE L G A++ L + SS Sbjct: 80 LGERNLNNVSSQQVRTRAESYAVGQASSVLQQQAQELLSPLGNAKLSLVMSDQGDFSGSS 139 Query: 135 LEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSH 194 ++ P+YD + ++Q + + + + N G G R +G+ W+ G NT +D D R H Sbjct: 140 GQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWVAGD-WLLGYNTVLDSDFERHH 198 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R +GAE W D+L+ SAN Y S + D + RPA+G+DI +GYLP + Q+G Sbjct: 199 NRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSRPASGYDITTQGYLPFYRQIGG 258 Query: 255 SLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 SL YEQY+G+ V LFG K+Q DP A+ V YTPVPL Sbjct: 259 SLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLV 297 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 251 bits (640), Expect = 3e-65, Method: Composition-based stats. Identities = 75/286 (26%), Positives = 120/286 (41%), Gaps = 10/286 (3%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 + R + + + + T A +++ EK+ A Sbjct: 2 SRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAE 61 Query: 75 FAANAGTFLSSQPDSDATRNF-------ITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + G + D + + + NQ ++ WL +G A V + VD + Sbjct: 62 IVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNE 121 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 S P+ D + ++Q + + D+ SN+G G R GN W+ G NTF D Sbjct: 122 GHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLVGYNTFYD 180 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 + L + R G GAE W +YL+LSAN Y + W + ++R A G+D+ A +P Sbjct: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMP 238 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + L S+ EQY+GD V LF +P A+S + YTPVPL Sbjct: 239 FYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 249 bits (636), Expect = 8e-65, Method: Composition-based stats. Identities = 93/236 (39%), Positives = 135/236 (57%), Gaps = 11/236 (4%) Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +NN E+ AS G LSS D + N+ + NQ+I +WL +YG AR+ Sbjct: 100 PEENNNEEKFASSFTLMGDILSSDNFVDNSINYAKSIGQGLVNQQINDWLNQYGKARISF 159 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 + D K+ S + L P+ D P N+LFTQ + DR N+G G+R + N WM G+ Sbjct: 160 SSD-----KNISGDFLLPVIDEPNNLLFTQLGLRNNTDRNTINLGLGYRKYWRN-WMFGI 213 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP--DIEDYQERPANGWDI 240 NTF D+D + + R+GVG E W DYLKL+ NGY + W +S ++DY ERPA G+D+ Sbjct: 214 NTFYDYDYTGGNARLGVGGEAWIDYLKLAINGYFGLTDWHQSKISVMDDYDERPATGFDV 273 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGL---FGKDKRQKDPHAISAEVTYTPVPLT 293 RAE YLP +PQLG+S+ YE+Y+G + L + + D ++ + YTP+PL Sbjct: 274 RAEAYLPKYPQLGSSIKYEKYFGKGIHLGTGVNPEYLKDDAQSLIMGLNYTPIPLL 329 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 249 bits (635), Expect = 9e-65, Method: Composition-based stats. Identities = 89/291 (30%), Positives = 144/291 (49%), Gaps = 16/291 (5%) Query: 12 RFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN 71 + + LAR +AW + Q+L P A+ A+A R ++ D + Sbjct: 2 KLTPTPLARWLAWVLVGTQLLTPAAL-------AQAMLPEITRSGADSSVDKTDQPEAEW 54 Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKY---GTARVKLNVDKDF 128 +AS A++ G+ L SD +N I + AN I + + R + ++ Sbjct: 55 LASRASSLGSLLQEGNISDFAKNQIQALPQTIANDGITSGIKHWLPEAQFRGGITLEDAS 114 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVN 183 + + ++L P+Y + +++LF Q + D+ R N G GWR G+ W+ G+N Sbjct: 115 KYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVNTGIGWRQDVGD-WLLGIN 173 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 +F+D D+ H R +G E +RD + L+ N Y S WK S + ERPA G D+R + Sbjct: 174 SFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKVQPLHDERPATGIDVRLK 233 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 G LP+ P GA L +EQY+GD+V + G D +DP A + +T+ PVPL + Sbjct: 234 GALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAITWKPVPLVE 284 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 247 bits (630), Expect = 4e-64, Method: Composition-based stats. Identities = 93/222 (41%), Positives = 127/222 (57%), Gaps = 2/222 (0%) Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 + + S+ + R + +AN LG GTAR ++ +D DF++ + Sbjct: 166 EQVPTSASRYGSEQEVQYWRQQLATQFEEEANAYAASLLGAMGTARTRVTLDDDFNMVTA 225 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 ++L P+ + +LFTQ + R DRT +N+G G RHF + WM G N F D+DL+ Sbjct: 226 EADLLLPLAEEQQTLLFTQFGLRRNGQDRTIANLGVGQRHFL-DRWMLGYNLFADYDLTN 284 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 H R GVGAE WRDYLKL AN Y S W+ SP E +ER A G D+R E YLPA+PQ Sbjct: 285 RHWRAGVGAEAWRDYLKLGANFYTPLSSWRDSPRFEGMEERAARGMDVRLEAYLPAYPQW 344 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 ASL EQY G+ VGL D+ ++DPHAI+A + Y P PL + Sbjct: 345 SASLTAEQYLGERVGLLDADQLERDPHAITAGLHYNPFPLLK 386 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 245 bits (625), Expect = 2e-63, Method: Composition-based stats. Identities = 85/246 (34%), Positives = 122/246 (49%), Gaps = 10/246 (4%) Query: 49 HAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEI 108 + + K N SD +++ I M A E Sbjct: 54 QHQTDDDATQGGDIPKSAMSGKRWLQHQTNDDVM----QGSDISKSGIADMGFAALQPET 109 Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 ++ G R L + D L S+++ YP+YD + + F Q R D R N+G Sbjct: 110 EK---SAGEVRANLPL-SDGKLTSGSIDLFYPLYDGDSRLFFGQVGARRFDGRNIVNLGI 165 Query: 169 GWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 G R+F G+ W G NTF D +S + H R+G G EYWRDYL LSANGY + W S + Sbjct: 166 GQRYFQGD-WALGYNTFYDIQISGNAHQRLGFGLEYWRDYLYLSANGYFGLTDWYSSSAL 224 Query: 228 EDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTY 287 + Y ER ANG+DIRA+G+ P +PQL L +EQY+GD++ L R K+P+A++ + Y Sbjct: 225 DGYAERAANGYDIRAQGWFPVYPQLSGKLKFEQYFGDDIALLNHQNRYKNPYALTMGLEY 284 Query: 288 TPVPLT 293 TP+ L Sbjct: 285 TPIQLI 290 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 243 bits (621), Expect = 4e-63, Method: Composition-based stats. Identities = 87/222 (39%), Positives = 126/222 (56%), Gaps = 6/222 (2%) Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS--LKD 132 + +A + L S P D + G + +Q I+ WL +YG AR+ LN D S L Sbjct: 18 YTKSAASLLKSGPAFD---QYAAGKISQLTSQAIEGWLKQYGNARITLNAQSDNSTALAG 74 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SS ++L+ +++ + + + Q H D + N+G G R+F N M G N F D +++ Sbjct: 75 SSADLLFGLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNIN 134 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQ 251 +R GVG E WRDY K S NGY S W+ S +EDY E+ A+G+D++ E YLP + Q Sbjct: 135 SGVSRSGVGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQ 194 Query: 252 LGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LG L YEQY+GD V LF + Q DP AI+ ++YTP+PL Sbjct: 195 LGGHLKYEQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPLI 236 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 72/240 (30%), Positives = 108/240 (45%), Gaps = 10/240 (4%) Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRN-------FITGMATAKANQEIQEWLG 113 + + EK A G + D + +A+ NQ ++ WL Sbjct: 1 MMPESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLS 60 Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF 173 +G A V + VD S P D + ++Q + R +D SN+G G R Sbjct: 61 PWGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRW- 119 Query: 174 SGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQER 233 + + W+ G NTF D+ L R G+GAE W +YL+LSAN Y + W ++R Sbjct: 120 ARDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWH--ERSATQEQR 177 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A G+D+ A+ +P + L + EQY+GD V LF K +P A+S + YTPVPL Sbjct: 178 MARGYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLV 237 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 241 bits (614), Expect = 3e-62, Method: Composition-based stats. Identities = 105/247 (42%), Positives = 150/247 (60%), Gaps = 11/247 (4%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + R + V W+ I+ Q+L P+ T P ++ + + A++ Sbjct: 2 TMVNKKLR-RKASCAVTWSVIATQILSPVTFTLIPA------NSFASSANTESAQTNAND 54 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 +AS AANAG L++ F +A+A +E+ +WL +YG AR+KLNVD+ Sbjct: 55 EYANELASLAANAGQSLANN----TAGRFAVDTLSAQATKEVVDWLQQYGNARIKLNVDE 110 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 F+LKD++ + LYP D+ +LF+Q ++HRTDDR Q+NIG G RHF+ ++ M G N F Sbjct: 111 SFTLKDAAFDFLYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFY 170 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 D+DLSR H+R G+G EYWRDY++ AN Y S WK S DI+DY ERPANGWD+ AEG+L Sbjct: 171 DYDLSRHHSRAGLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWL 230 Query: 247 PAWPQLG 253 P +PQLG Sbjct: 231 PVYPQLG 237 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 239 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 79/203 (38%), Positives = 114/203 (56%), Gaps = 2/203 (0%) Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 N Q ++ + +G + L+VD S +L P Y +++LF Sbjct: 176 YIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSHVLF 235 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 +Q ++++RT +IG G R + ++ G N FID+DL + H R+ +G E +Y K Sbjct: 236 SQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASNYFK 294 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 LS N Y + W+ S D+EDY ERPA G+DIR +GYLP +PQLG + YEQY+G EV LF Sbjct: 295 LSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEVALF 354 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 K KRQK+P A+S + Y P PL Sbjct: 355 NKTKRQKNPKAVSIGIDYRPFPL 377 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 236 bits (602), Expect = 7e-61, Method: Composition-based stats. Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 10/257 (3%) Query: 44 AARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD---SDATRNFI---- 96 A+ +N EK+ A G + R+F Sbjct: 3 QAQNPFDENGDNLPDLGLAPENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTRL 62 Query: 97 TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIH 156 ++ E + L +G A V L VD++ + SS + P D + ++Q + Sbjct: 63 RDAVSSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGVS 122 Query: 157 RTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYI 216 + + N G G R +G+ W+ G NTF D +R G GAE W DYL+LSAN Y Sbjct: 123 QQNQGLVGNAGIGQRWTAGH-WLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQ 181 Query: 217 RASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQK 276 GW+ + ++R A G+D+ A+ YLP + + S+ +EQY+GD+V LF Sbjct: 182 PLGGWEHRAGLL--EQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYH 239 Query: 277 DPHAISAEVTYTPVPLT 293 +P A+ ++YTPVPL Sbjct: 240 NPVAVKVGLSYTPVPLV 256 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 229 bits (585), Expect = 6e-59, Method: Composition-based stats. Identities = 85/290 (29%), Positives = 134/290 (46%), Gaps = 34/290 (11%) Query: 13 FRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNV 72 F + + + WA ++ Q+ P+ +DN ++ + Sbjct: 3 FSKKPITKYITWAIVTSQIPLPVIA-------------------------DSDNEIQSWI 37 Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDFS 129 A A++ L D + I + AN + E + R +N++ Sbjct: 38 AGTASSISPHLQEGTLEDYAKGKIKALPGQAANHLVNEGIKSAFPEIIFRGGVNLEDGAK 97 Query: 130 LKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVNT 184 + S +M P+ +T +++LF Q D+ RT N+G G+R N W+ GVNT Sbjct: 98 YRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGRTYVNVGMGYRQEV-NGWLLGVNT 156 Query: 185 FIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEG 244 F+D D+ SH R G+G E ++D L S N Y +GWK S E + ERPA G+D+R +G Sbjct: 157 FLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSAAHELHDERPAYGFDLRTKG 216 Query: 245 YLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 LP +P L YEQYYGD+V L G ++P A A++ + PVPL + Sbjct: 217 TLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGADLVWNPVPLLE 266 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 220 bits (561), Expect = 4e-56, Method: Composition-based stats. Identities = 70/302 (23%), Positives = 122/302 (40%), Gaps = 12/302 (3%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTF-TPVMAA-RAQHAVQPRLSMGN 59 +H ++ +R A +++Q P+A P +A + A ++ Sbjct: 12 AHLPARGRRHWYRRHRAGAAGMSAVLAMQAAAPVAYGQGAPTFSATQVADAASNAVAQPG 71 Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGK----- 114 T + +A G + D F+ A A+AN +Q+ + Sbjct: 72 AVETRVAQTIQALAQAREAGGARQDGRASLDG--QFLRSQAQAQANVLVQQGVQWANETG 129 Query: 115 -YGTARVKLNVDKDFSLKDSSLEM--LYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 R++ NV DFS +D ++++ + ++ L Q H + R N G R Sbjct: 130 LPWLRRLEGNVSYDFSGRDVAVDVRTIDALHLDQDRALLLQLGGHNQNHRPTVNAGVVAR 189 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 +G+ + G N F+D+++ + H R +GAE L N Y SGWK + E + Sbjct: 190 SAAGSSLILGGNAFLDYEVGKRHLRGSLGAEAVAAQFTLYGNVYAPLSGWKAAKRAERRE 249 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA GWD+ A L + Y ++ G +V F + +++P + Y PVP Sbjct: 250 ERPAAGWDVGFTARPEAVQGLALNAQYFRWRGAQVDYFDDGRYRRNPSGFKYGIEYRPVP 309 Query: 292 LT 293 L Sbjct: 310 LI 311 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 218 bits (556), Expect = 1e-55, Method: Composition-based stats. Identities = 82/259 (31%), Positives = 118/259 (45%), Gaps = 11/259 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFI----- 96 A A+ A + TV ++ + K +A A + G + + R Sbjct: 80 PFADPARFAKMQQQLPELGTVHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVF 139 Query: 97 ---TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG 153 A +A E ++ L YG A V L + D S SS +++ P D + + F+Q Sbjct: 140 NRFRDAAKQRAASEGEQLLSPYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQL 199 Query: 154 AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSAN 213 I +++ + N G G R +G W G N F+D L R +GAE W YL+ SAN Sbjct: 200 GIEQSEYGSVGNAGLGQRWIAG-SWRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSAN 258 Query: 214 GYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDK 273 Y SG + + R A G+DI GYLP + QLG +L YEQY G+ V LF Sbjct: 259 YYQPLSGCRNHSNSA--LMRMARGYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGN 316 Query: 274 RQKDPHAISAEVTYTPVPL 292 +P A+S + YTPVPL Sbjct: 317 AVANPAAVSLGINYTPVPL 335 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 206 bits (525), Expect = 6e-52, Method: Composition-based stats. Identities = 57/268 (21%), Positives = 89/268 (33%), Gaps = 9/268 (3%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQP-----DS 89 LA P+ A + D + +A+ A + + + D Sbjct: 54 LAQALLPLSALAQGAPTLRPARVAQEEAGQDAAWTRKLAAQAESLARRQAERQPGARVDG 113 Query: 90 DATRNFITGMATAK----ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP 145 D + N + L + L+ D + L + +Y Sbjct: 114 DYLKREAQAQVNDVLRDGVNLARESGLPFLRNLQGGLSHDFESGRTSLQLNTIDEVYRAG 173 Query: 146 TNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 N Q H +DR +N G +R + M G N F+D++ + H R VG E Sbjct: 174 RNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLEVIA 233 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L N Y S WK + +E+PA+G D+ P L S + ++ G E Sbjct: 234 PEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWNGAE 293 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V F + Q V Y PV L Sbjct: 294 VDYFDNGRTQAGAKGFKVGVEYRPVSLV 321 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 206 bits (525), Expect = 6e-52, Method: Composition-based stats. Identities = 63/272 (23%), Positives = 103/272 (37%), Gaps = 7/272 (2%) Query: 26 NISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSS 85 +++Q + P P +AR A R ++ + + +A AG+ S+ Sbjct: 33 VLTLQTVAPAFAQGAPSFSAR--PAQADRQDAADSAMLRVAQTARQLAQR-QAAGSRASA 89 Query: 86 QPDSDATRNFITGMATAKANQEI----QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPI 141 + D D + A + + Q L + +N D L + + Sbjct: 90 RVDGDLLKGQAEAQANELLQEGVRLANQTELPFLRRLQGGVNYDFSNKDLSLDLRTIDEV 149 Query: 142 YDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGA 201 + + + Q + H + R N G RH G N F+D++ ++H R +G Sbjct: 150 HRGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEFGKNHLRGSLGG 209 Query: 202 EYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 E L N Y SGWK + E +ERPA+GWD+ A P L Y ++ Sbjct: 210 EVIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALPGLAIKGQYFRW 269 Query: 262 YGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G V F + Q++ V Y PVPL Sbjct: 270 SGAAVDYFDNGRPQRNARGYKYGVEYRPVPLV 301 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 203 bits (516), Expect = 7e-51, Method: Composition-based stats. Identities = 69/302 (22%), Positives = 110/302 (36%), Gaps = 18/302 (5%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 +SH K + R R A ++ +Q PLAV A+ + R G+ Sbjct: 25 VSHAKGSGRNRRRRAQRAASSAVCLSLGMQAAAPLAVL------AQGAPEMTNRPEAGDI 78 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDAT-RNFITGMATAKANQEIQEW-------- 111 ++V VA A + + + + +++ A+ NQ +QE Sbjct: 79 VP---SDVLTQVAVRAQDLARRQADRREGAQVDADYLKQQGQAQFNQFLQEGVRAANESG 135 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L + L D D L + +Y N Q H ++R +N+G +R Sbjct: 136 LRFLRNLQGDLRHDFDNGRTSLELRTIDQVYRKGANTGLLQLGGHNQNNRPTANLGGVYR 195 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 M G N F+D++ ++ H R +G E N Y SGW + + Sbjct: 196 RDINERLMLGANAFLDYEFAKQHLRGSLGVEAIAPEFSFYGNVYAPMSGWTGAKRDNRRE 255 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA+G D+ + P L Y ++ G V F + Q V Y PVP Sbjct: 256 ERPASGMDLGMKYSPGFAPGLSLKANYFRWNGAAVDYFDNGRTQDRATGFKYGVQYKPVP 315 Query: 292 LT 293 L Sbjct: 316 LL 317 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 196 bits (497), Expect = 1e-48, Method: Composition-based stats. Identities = 76/268 (28%), Positives = 112/268 (41%), Gaps = 22/268 (8%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 N D V AG S+ S + + MA++ N Sbjct: 281 KTQKALNDNKKDNNLSKEDQEFSNKVMKVIQTAGAIYDSED-SKSKEEIVKNMASSYLNT 339 Query: 107 EIQEWLGKY-GTARVKLNVDKDFSLK-----DSSLEMLYPIY--DTPTNMLFTQGAI-HR 157 E ++ + +N D F+ + + L PI D P F Q I Sbjct: 340 SANELAKEFIDSLNTSINTDFSFNYNERSGFSGNAKALLPIVSEDNPKISYFLQSGIGEF 399 Query: 158 TDDRTQSNIGFGWRHFSG-------NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 +DRT + G G R++ + M G+N+ DHD SR H R+ +GAE D L Sbjct: 400 ANDRTIGHFGGGIRYYPNATALNNSGNIMLGLNSVYDHDFSRGHKRMSLGAEAMVDTLAF 459 Query: 211 SANGYIRASGWKKSPDIE-DY-QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 +AN Y R S W S D + DY QERPANGWD + + P+ + Q+YG++VG+ Sbjct: 460 NANVYQRLSSWIDSYDFDKDYVQERPANGWDAKIKYAFPSLINVSFFAKMGQWYGNKVGI 519 Query: 269 FGK---DKRQKDPHAISAEVTYTPVPLT 293 FG D +K+P ++Y+P P Sbjct: 520 FGANSVDDLEKNPLIYEGGISYSPFPAL 547 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 182 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 57/251 (22%), Positives = 88/251 (35%), Gaps = 6/251 (2%) Query: 43 MAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATA 102 M + A+ V + +E VA N T S A Sbjct: 6 MPSPARLLTLLLCPTLLPPVAYGSAIESEVA---RNLWTRAQHPDTSPGLAQSALDAGVA 62 Query: 103 K-ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR 161 Q L L D D SL + + + L Q +H + R Sbjct: 63 AGLQASRQTGLPWLRHLDGGLRYDLDPGRLSFSLRTIDDLMVSERRALMLQAGLHNQNQR 122 Query: 162 TQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 +N G R + + G N F+D++ + H R +G E + L AN Y SGW Sbjct: 123 PTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLGLEAIAPHYSLYANYYAPLSGW 182 Query: 222 KKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAI 281 K + +ERPA G+D+ G L + L Y +++G + +F + Q++ Sbjct: 183 KGARRDSRREERPAAGYDL--GGQLSSDAGLSLQAAYFRWHGAGIDVFDSGRAQRNASGF 240 Query: 282 SAEVTYTPVPL 292 V Y P L Sbjct: 241 RYGVAYQPGAL 251 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 54/146 (36%), Positives = 79/146 (54%), Gaps = 3/146 (2%) Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 M ++Q + + D SN+G G R + + W+ G NTF D+ L + R G GAE W +Y Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEY 59 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L+LSAN Y + W+ ++R A G+DI A+ LP + + S+ EQY+GD V Sbjct: 60 LRLSANYYQPFADWQTHT--ATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVD 117 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LF +P A+ + YTPVPL Sbjct: 118 LFDSGTGYHNPVALKLGLNYTPVPLL 143 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 160 bits (405), Expect = 5e-38, Method: Composition-based stats. Identities = 64/288 (22%), Positives = 105/288 (36%), Gaps = 37/288 (12%) Query: 28 SVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK--------NVASFAANA 79 Q L + + F +++ A + +N K A+++ Sbjct: 3 ISQALTSITLVFGSILSVSANEYKFEEIKFNQIPNEQNNYEPKDKLDEYIIKGANYSTKF 62 Query: 80 GTFLSSQPDSDATRNFITG------------MATAKANQEIQEWLGKYGTARVKLN--VD 125 +++ D + A AKAN EIQ+ + + V ++ + Sbjct: 63 VPLMNNGAKGDEYTGIMADDLNRLLVDAGFDFANAKANGEIQK-IPFFAQTSVNISGGTE 121 Query: 126 KDFSLKDSSLEMLYPIYDTPTN----MLFTQGAIHRTDD--RTQSNIGFGWRHFSGNDWM 179 D S +SL L + + F+Q + + NIG G R+ + M Sbjct: 122 SDTSFSINSLMKLGELAKDDQGDLKTLAFSQARFATATNAEGSTINIGLGIRNRPDDISM 181 Query: 180 AGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDYQERPA 235 G N F D+ D S +H+R+G+G EY+ + N Y+ + K DYQER Sbjct: 182 VGANAFWDYRMTDYSDAHSRLGLGGEYFWKDFEFRNNWYMAITNEKDVIIKGVDYQERVV 241 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPH 279 GWD+ LP P+L + + D GL G Q PH Sbjct: 242 PGWDLEVGYRLPNNPELAFYIRGFNWDYKYTQDNSGLEGAVSWQATPH 289 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 57/239 (23%), Positives = 96/239 (40%), Gaps = 10/239 (4%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 A V L+ + +A + + + T L + +A +N K Sbjct: 11 APLIVAVSLTQADALKSALEMQDAQDKAEIMDLSTMLLAGD-VEALKNTAIDGVVEKGVG 69 Query: 107 EIQEWLGKYGTARVKLNVD-KDFSLKDSSLEMLYPIYDTPT--NMLFTQGAIHRTDDRTQ 163 + +L +Y V+LN + S L ++ P+ D N FTQG++ D+RT Sbjct: 70 VTKSFLEQYF-PTVELNFGAQGGSKPSGGLLVVAPLSDPDDIFNTYFTQGSVFYEDNRTT 128 Query: 164 SNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 N+G G+R S N + G+N F DH+ H R +G E +++AN Y + WK Sbjct: 129 LNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGLEARTTVWEINANKYWATTKWK 188 Query: 223 KSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFG--KDKRQKDPH 279 + +ER +G+DI A LP + Q+ + G + Q + Sbjct: 189 TGKN--GLEERALDGYDIEAGVPLPYMNWATVFVKNFQWDSEISGSKDIKGNDLQLRAY 245 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 58/233 (24%), Positives = 97/233 (41%), Gaps = 23/233 (9%) Query: 70 KNVASFAANAGTFLSSQPDSD-------ATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + A++AA G + + D + + AN++I++ + + + L Sbjct: 109 QKGANYAALYGPSMVNSNGVDLGGLIQTELSRTLISSGVSYANKQIKK-IPFFAQTTLGL 167 Query: 123 N--VDKDFSLKDSSLEMLYPIY----DTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSG 175 + D + S L I P ++F Q + T + Q N+G G R G Sbjct: 168 DAATSSDLTGYLDSFMRLKTIGYDNEGDPMGLMFGQARVTLETSAQPQVNVGLGSRFRLG 227 Query: 176 NDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-DIEDYQ 231 ++ + G+N F D + S ++TR G+GAE + +L N YI S K + DY Sbjct: 228 DEAIVGLNGFWDLRTTNYSTAYTRWGIGAEGFWKSFELRNNWYINGSADKNITINNIDYV 287 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPHA 280 ER GWD+ +P++PQL + + D G+ G Q PHA Sbjct: 288 ERVVPGWDVEVGYRIPSYPQLAIFVRGFNWDYQDHSDNSGIEGSVNWQATPHA 340 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 54/210 (25%), Positives = 83/210 (39%), Gaps = 7/210 (3%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 P A T A + Q L + ++ EK +A+ A + Sbjct: 35 LPAAAWVTQPENDAALLSQQQALPNLGSASVNESGTEKKLATLARQMAEVNQDENTDQTW 94 Query: 93 RNF----ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TN 147 R++ + Q+ + L G V L+VD+ SS ++L P+ D Sbjct: 95 RSYLLGEAKDRVLDRLQQKSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRG 154 Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRD 206 + ++Q + DD N+G R +G W+ G N F D L++ R +GAE D Sbjct: 155 LTYSQLGLQGVDDGVVGNMGLRQRWNAG-RWLLGYNVFYDQYLNQDASRRGSIGAEARSD 213 Query: 207 YLKLSANGYIRASGWKKSPDIEDYQERPAN 236 YL LS+N Y SG + D ED R A Sbjct: 214 YLTLSSNYYYPLSGMHAANDDEDELLRMAR 243 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 140 bits (354), Expect = 4e-32, Method: Composition-based stats. Identities = 42/171 (24%), Positives = 70/171 (40%), Gaps = 8/171 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRN------- 94 + A+AQ + + EK+ A A D D Sbjct: 70 TIRAQAQDPFDQNRLPDLGMMPESHEGEKHFAEMAKAFSEASMKNNDLDTGEQARQFAFG 129 Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + + + + NQ+++ WL +G+A V +NVD + S P+ D + ++Q Sbjct: 130 QVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSWFIPLQDKQRYLTWSQLG 189 Query: 155 IHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 + + D SN+G G R + + W+ G NTF D+ L + R G GAE W Sbjct: 190 LTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWG 239 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 140 bits (352), Expect = 7e-32, Method: Composition-based stats. Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 11/198 (5%) Query: 73 ASFAANAGTFLS-SQPDSDATRNFITGMATAKANQEIQEWL-----GKYGTARVKLNVDK 126 A + G LS S DS+ ++ + T+ A+ + + + T V N+ + Sbjct: 15 AVLTMSLGFSLSVSADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDTVEVSTNLKE 74 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTF 185 S D + +L D P + LF Q ++R D RT N+GFG+R + ++ WM GVN F Sbjct: 75 GDSTVD--IGVLKAFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAF 132 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 DH+ H R GVG E L+ N Y +G D + +G D+ + Sbjct: 133 YDHEFPNDHKRNGVGFEVVSSVLESRVNSYNGTTG--YIKDKSGTDSKVLDGRDMGFKVA 190 Query: 246 LPAWPQLGASLMYEQYYG 263 LP P + + Q+ G Sbjct: 191 LPYLPGMMFGMNAVQWKG 208 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 66/307 (21%), Positives = 117/307 (38%), Gaps = 37/307 (12%) Query: 9 KQPRFRYSVLARCVAWANISV---QVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD 65 + R + A+ + A ++ F L + + A + + + Sbjct: 1 MRTTNRLLLSAKHIKQAMSGSVSFKLAFSLIASGLTLQCLPASAESTQKNFTERGSHSLY 60 Query: 66 NNV------EKNVASFAAN-----AGTFLSSQPD---SDATRNFITGMATAKANQEIQEW 111 ++ E +AS + T L+++ S+ N +A+ K + + Sbjct: 61 SSHSKGIWHESPLASRVIDKLLIRNWTSLNNKNGIEWSNQISNLALNLASNKLSDYATKT 120 Query: 112 LGKYG---TARVKLNVDKDFSLKDSSLEMLYPIYD-------TPTNMLFTQGAIHRT-DD 160 + KY A V ++ + + + ++L+ I D + + F + + Sbjct: 121 IQKYPFVLGASVNFDIRTEGA-TNIGGDVLFKIADFGLKDDESRDGIAFLHTKYTGSLSN 179 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIR 217 + N G G RH G + +AGVN + D+ S SH+R G+G E + L L+ N YI Sbjct: 180 DSTWNAGLGLRHLIGEELLAGVNGYWDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIA 239 Query: 218 ASGWKK-SPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKD 272 +G K S + DY ER GWD LP+ P + ++ D G GK Sbjct: 240 GTGTKTISTNNTDYYERVVPGWDFELGYRLPSNPNIAFFARGFRWDYRNRNDNTGFQGKV 299 Query: 273 KRQKDPH 279 Q PH Sbjct: 300 TYQMTPH 306 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 138 bits (347), Expect = 3e-31, Method: Composition-based stats. Identities = 60/268 (22%), Positives = 92/268 (34%), Gaps = 14/268 (5%) Query: 30 QVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDS 89 Q LAV +P + ++ T V S+ Sbjct: 20 QREHTLAVATSPREEHIKAVSEDFHITPAATPDHVIGEGGPLVMDRQETKTVQYSN---V 76 Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPI--YDTPT 146 DA I +A + + + GK R L+ K S+E + P+ YD + Sbjct: 77 DAVNRAINAVAMSNVSNAMYGAKGKPWMRRTTLSFQFQEGWKPLYSVETVQPLGHYDNSS 136 Query: 147 N-MLFTQGAIHR-TDDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVGAEY 203 + FTQ I R +D T NIG G+R S + + G + F DH H R+ G EY Sbjct: 137 RDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHLFYDHRFLNRHNRLSAGLEY 196 Query: 204 WRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYY- 262 + N Y AS + ER ANG+ + + + Sbjct: 197 MSGESEFRFNWYGSASDERVLDVNLHTLERVANGYTVEYGKTFKNARWARVYVEGYHWNQ 256 Query: 263 ---GDEVGLFGKDKRQKDPHAISAEVTY 287 D+ GL + Q P +S ++ Y Sbjct: 257 ERQADKNGLRVGSELQLTPR-VSVDMGY 283 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 137 bits (346), Expect = 4e-31, Method: Composition-based stats. Identities = 49/211 (23%), Positives = 81/211 (38%), Gaps = 12/211 (5%) Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPIY--- 142 D+DA + + + + + K R L++ + K +E L P+ Sbjct: 84 SDTDAVNSALQAVVMTGVHSAMHGSKAKPWMQRTVLSLRFQKNWKPLYGVETLQPLGHYD 143 Query: 143 DTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVG 200 +T ++ FTQ + D T +N+G G+R + +D G N F DH +H R+ VG Sbjct: 144 ETSRHVWFTQERLANAADTGTTANVGIGYRRIAENDDHYYGGNLFYDHRFRGNHGRMSVG 203 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 EY N Y SG + S D E +NG+ + + Sbjct: 204 LEYVSGIGAFRMNWYRGVSGER-SLDGATRMENVSNGYTAEYGTSFKNARWARVYMEAYR 262 Query: 261 Y----YGDEVGLFGKDKRQKDPHAISAEVTY 287 + D+ GL + Q P IS ++ Y Sbjct: 263 WQLRRSADKHGLRIGTELQLTPR-ISVDMGY 292 >UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN04_9BACT Length = 301 Score = 137 bits (345), Expect = 5e-31, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 66/200 (33%), Gaps = 19/200 (9%) Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV---------DKDFSLKDSSL 135 + + ++ A + + I+ W AR L ++ + Sbjct: 22 ASKAVNQIKDSAINKAFSYGDSAIESW------ARDNLTSLRLIEIETRSREGAKPTFRA 75 Query: 136 EMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH 194 L+ I N + +Q + DD N G +R + + + G+N F DH + H Sbjct: 76 ISLFEIGGNDFNKILSQLSYSTFDDDETINAGLIYRMMNSDMTVIYGLNIFYDHQFNTGH 135 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R G+G E ++ N Y + + E A G+D +P P Sbjct: 136 ARTGLGFEMKSSVYDVNINFYEAQTEIHHV---DGVPEVAAGGYDAEIGAQVPYLPWAKV 192 Query: 255 SLMYEQYYGDEVGLFGKDKR 274 Q+ + + + + Sbjct: 193 YYKAYQWNNETLNIKDGETL 212 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 136 bits (343), Expect = 8e-31, Method: Composition-based stats. Identities = 52/162 (32%), Positives = 71/162 (43%), Gaps = 7/162 (4%) Query: 136 EMLYPIYDTPTNMLFT--QGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 + L+ + + + R +++G G+R + GVN F D+DLSR Sbjct: 9 DTLHELDQPLKKLAYVSNHWGPLLFHGRDFAHLGLGYRQL-DDSQFFGVNVFFDYDLSRQ 67 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPD----IEDYQERPANGWDIRAEGYLPAW 249 HTR+ VGAEY DY S N Y S WK SPD + E+ A GWD+ E YLP Sbjct: 68 HTRVSVGAEYGLDYGTFSTNAYFPLSNWKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLD 127 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 + L QY G V K+P+ S + P P Sbjct: 128 TRWKFGLTAGQYLGRYVEHSDGSLPSKNPYHFSLSTEFRPDP 169 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 79/217 (36%), Gaps = 15/217 (6%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN---QEIQEWL- 112 + A+ N G L+ +A + + +A AN ++++ + Sbjct: 14 SALPLSAQEVAKSGKFATIVKNIGNALNIGQGEEAVESEVNTLAVDAANAGLDQVEDKVL 73 Query: 113 --GKYGTARVKLNVD-----KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSN 165 + + + D K+ S + +Y + +T LF Q + ++RT N Sbjct: 74 STSNFTHFELSVGSDTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNNRTTIN 133 Query: 166 IGFGWRHFSG-NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 GFG RH + N + G N F D++L H R+G G E + AN Y S Sbjct: 134 TGFGARHINDANTVITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVSKT--- 190 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 QE +G+D + LP + + Sbjct: 191 LTYNGIQETALDGYDAKLTANLPYFYSSNLYGKLSNW 227 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 49/201 (24%), Positives = 88/201 (43%), Gaps = 13/201 (6%) Query: 92 TRNFITGMATAKANQEIQEWLGKYGTARVKLNV-DKDFSLKDSSLEMLYPIYDTPTNMLF 150 + A K +++I + G V L+ D D + S+ + I T + F Sbjct: 19 ANADVASQALNKVSEKISNLIPGEGITEVSLDYNDGDEDQLNFSILGVRDIETTDNSNFF 78 Query: 151 TQGAIHRTD----DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 TQ ++ + R NIG G+R S + ++M G NTF D DL+ R+G+G E Sbjct: 79 TQFSLMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRLGLGIEAKG 138 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L L+AN Y + S S + +E+ +GWD +P P + ++ ++ Sbjct: 139 SILDLTANSYTKIS---NSEVVNGDREQVLSGWDFNLTSQIPRAPWARINYNGYKWETEK 195 Query: 266 VGLFGKDKRQKDPHAISAEVT 286 G ++ + +++ +VT Sbjct: 196 ----GSADQKGNIYSLELDVT 212 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 133 bits (335), Expect = 6e-30, Method: Composition-based stats. Identities = 48/158 (30%), Positives = 69/158 (43%), Gaps = 7/158 (4%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYD--TPTNMLFTQGAIHRTDD-RTQSNIGFGW 170 K+ TA + L+ + S L ++ PI D N++FTQ ++ +DD R N+GFG Sbjct: 8 KFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFGN 66 Query: 171 RHFSGND-WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIED 229 R +D + G N F DH+L H R +G E L AN Y SGWK + + Sbjct: 67 RKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKS--GLNN 124 Query: 230 YQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 E+ NG D+ LP P + G Sbjct: 125 INEKALNGSDVELGMPLPYLPWTNLYYRSFNWEGASGA 162 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 131 bits (329), Expect = 3e-29, Method: Composition-based stats. Identities = 64/288 (22%), Positives = 102/288 (35%), Gaps = 49/288 (17%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNV----ASFAANAGTFLSSQPDSDAT----- 92 +A + R N+ + + AS+A L+S SD Sbjct: 55 AVAGALEAGQSVRCETLVDADNQSNSTVQKIFVTGASYATRIFPLLNSASLSDGIQKMLW 114 Query: 93 ---RNFITGMATAKANQEIQEWLGKYGTAR--VKLNVDKDFSLKDSSLEMLYPIYDTPTN 147 ++FI A N+ + + + V D D + +SL L + Sbjct: 115 MDSKSFIVSFAHDYLNEYVLKQIPFLSQTEFGVGFESDADMTYYLNSLISLAQLGSDDNG 174 Query: 148 ----MLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDHDLSR---SHTRIGV 199 +LF QG+ + +N+G G R ++ M G N F D+ + S++R G Sbjct: 175 YPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLRDNAMLGANAFWDYRFTNYSSSYSRWGA 234 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDI-----------------------EDYQERPAN 236 GAE W D KL+ N YI +G K+ + ER Sbjct: 235 GAELWWDDFKLTNNWYIAGTGIKRITTSGRAYTDTTSLAAGTYDETTLLGANTFDERVVP 294 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPHA 280 GWD+ LP++PQL + ++ D G+ G Q PH Sbjct: 295 GWDVALNYRLPSYPQLSLGIRGFRWDYMRKSDNSGVEGSVNWQATPHT 342 >UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N7C0_9GAMM Length = 546 Score = 127 bits (320), Expect = 3e-28, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 53/172 (30%), Gaps = 20/172 (11%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFS 174 R+ + ++L P++ ++LF DD + NIG RH Sbjct: 31 WNPRIDFEGKLGNDRSIAEADLLIPLWQNNDSLLFANIRGRLDNDDSYEGNIGLALRHML 90 Query: 175 GNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY- 230 N W G + D ++ +G E L AN YI + D D Sbjct: 91 DNGWNLGGYGYFDRRKSPYDNFFNQVTLGVEALSLNWDLRANTYIPVGESSYAEDSLDTV 150 Query: 231 ------------QERPANGWDIRAEGYLPAW-PQLG--ASLMYEQYYGDEVG 267 +ER G+D +P + P+ + Y + Sbjct: 151 DFSGTTITYRAGEERSMRGYDAEVGWRIPVFSPEADKQLRIYAGGYRFTDSK 202 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 127 bits (318), Expect = 6e-28, Method: Composition-based stats. Identities = 35/150 (23%), Positives = 64/150 (42%), Gaps = 6/150 (4%) Query: 140 PIY--DTPTNMLFTQGAIHRTDDRTQSNIGFGWRH-FSGNDWMAGVNTFIDHDLSRSHTR 196 P+Y + + +F +G D R + G+RH S N M G N H+ R+H R Sbjct: 68 PVYVSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQR 127 Query: 197 IGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASL 256 I GAE ++++N Y R + WK +++ +E+ G+D+ +P P + Sbjct: 128 ISYGAEIRTSVFEINSNYYHRLTDWK-LTGVDNNEEKARGGYDVELALAVPYVPSAHFRV 186 Query: 257 MYEQYYGDEVGLFGKDKRQKDPHAISAEVT 286 + + G + + D + V+ Sbjct: 187 KHFCWNG--IASNDSNNPIDDLKGNTFSVS 214 >UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPY3_9CYAN Length = 1370 Score = 123 bits (308), Expect = 9e-27, Method: Composition-based stats. Identities = 31/264 (11%), Positives = 76/264 (28%), Gaps = 56/264 (21%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 +A N V++L+ ++ A P L + + ++ + + ++ Sbjct: 1 MAIACMNSLVRLLWTSFCFTPLLIPAAIAQTEIPSLPKADAVPESHPSLGSPLQAQTPDS 60 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 + + + ++G + + + L+ Sbjct: 61 PPSTTPDLTTLQIK-------------------PRWG---IGYSTSGAGYDGFTRLDSFL 98 Query: 140 PIYDTP-TNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--H 194 P+ P + + F +G + + N+ FG R ++ + + + G D + + Sbjct: 99 PLLQNPGSTLTFLEGRLQLDNSANVGGNLLFGHRFYNQSLNRIFGGYLGFDRRDTGNSTF 158 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKK-----------------------------SP 225 ++GVG E + + NGY + Sbjct: 159 HQLGVGVETLGEVWDVRLNGYFPLGDTRDLVDETAFDTGFQLTDRFFSDHFLVIQGKRQR 218 Query: 226 DIEDYQERPANGWDIRAEGYLPAW 249 + E G+D+ L W Sbjct: 219 GQVRHFEAAMTGFDLEVGARLAQW 242 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 121 bits (303), Expect = 3e-26, Method: Composition-based stats. Identities = 37/187 (19%), Positives = 64/187 (34%), Gaps = 19/187 (10%) Query: 122 LNVDKDFSLKDSSLEML-----YPI-YDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSG 175 N +K+ S++ YP + F Q + + G G R Sbjct: 96 YNNQSQIQIKNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVYN 155 Query: 176 NDWMAGVNTFIDHDLSRSHTR---IGVGAEYWRDYLKLSANGYIRASGWKKSP---DIED 229 + G N H +S ++ I +G EYW LK N Y + S Sbjct: 156 KKHIIGYNACYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNHH 215 Query: 230 YQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD----EVGLFGKDKRQKDPHAISAEV 285 Y + P G+ I A+ P + + +EQ D + + + + H + + Sbjct: 216 YYQYPKIGYQICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVSL 272 Query: 286 TYTPVPL 292 Y P+P+ Sbjct: 273 EYQPIPM 279 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 79/198 (39%), Gaps = 8/198 (4%) Query: 82 FLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTAR---VKLNVDKDFSLKDSSLEML 138 S+ + ++ G+A++ + LG+ + + L V + F SL + Sbjct: 29 ISSASSLENRVTSYFNGLASSLGTS-VSSLLGENSRVKYLDLNLGVQEHFK-PTISLTNV 86 Query: 139 YPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRSHTRI 197 I + + +F Q +++ ++ N+G G R +D + G+N F D+ SH R Sbjct: 87 NMISEYGNSAIFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQRN 146 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLM 257 G G E L +N Y SG + D E +GWD+R + +LP + Sbjct: 147 GAGLEVLSSVFDLRSNIYDATSGIEAVSTSRD--EEAMDGWDMRLDYHLPIKTNARLFVG 204 Query: 258 YEQYYGDEVGLFGKDKRQ 275 ++ + ++ Sbjct: 205 LFEFENAAGSYEVEGEKY 222 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 88/240 (36%), Gaps = 28/240 (11%) Query: 81 TFLSSQPDSDATRNFITGMATAK-----ANQEIQEWLGKYGTARVKL------------- 122 L+ S+ + N K + I L Y KL Sbjct: 23 NTLTEGIKSNVSNNIFQDDLYQKEMKLHTHDHIHHTLNFYPYTTNKLRVHAYNYRPPFSS 82 Query: 123 NVDKDFSLKDSSLEMLYPIYDT----PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 L++ S++M + Y N+ F Q IH N G G RH + + + Sbjct: 83 TYKSKMQLQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKY 142 Query: 179 MAGVNTFIDHDLSRSHTR---IGVGAEYW-RDYLKLSANGYIRASGWKKSPDIEDYQ-ER 233 G NTF +S+ ++ I VG EYW + L + N Y + + ++ Sbjct: 143 AIGYNTFYHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHY 202 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P +G + + P + + + EQ+ ++ +K+ D + +S ++ Y P+P+ Sbjct: 203 PRSGHQLYIQTKFPRFFEFTGKIKLEQFIYEKKYKKIFNKKNSD-YYLSLDLNYQPIPML 261 >UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T1E3_JANMA Length = 553 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 60/195 (30%), Gaps = 28/195 (14%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 A A A QE Y + L + P+ ++ F + Sbjct: 22 AGAYAQNAGQEKWSTY----LDLEGKVGSKRDIGEANLFIPVVQDARSLYFANVRARMAN 77 Query: 160 DRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGY 215 + ++G G RH W G F+D + S+ + +G E AN Y Sbjct: 78 GGDFEGSLGGGMRHMLETGWNLGAYGFVDRRRTTYNNSYDQATLGVEALGRQFDWRANVY 137 Query: 216 IRASGWKKSPDIEDY---------------QERPANGWDIRAEGYLPAW-----PQLGAS 255 + + +ER G+DI A LP + Q+ A Sbjct: 138 QPFGKKSTTLSSSNTGSVSGGSLFVTTTAQEERALPGFDIEAGWRLPVFDEEDTRQVRAY 197 Query: 256 LMYEQYYGDEVGLFG 270 L ++ D + + G Sbjct: 198 LAGYRFSDDGLKVQG 212 >UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA50_9CHLA Length = 531 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 52/186 (27%), Gaps = 23/186 (12%) Query: 112 LGKYGTARVKLNVDKD---FSLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIG 167 ++G R + + + P+ F H + R +N+G Sbjct: 266 FSEFGYVRGAYTFGEGIGIRHNYSTLTALFAPLVPYDDYYPFLDLRAHYIKNKRWAANVG 325 Query: 168 FGWRHFS-GNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 G R ++ G N + D+ + + G G E++ + ++ N Y Sbjct: 326 GGLRWRDCMTGFIFGANLYYDYRNTTQTDFNQFGFGLEFFTNCFEMRLNAYFPVGDVTHC 385 Query: 225 PD--IEDY----------QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKD 272 D DY E G D+ P YY +V Sbjct: 386 EDHVFSDYIGPYYAVCGLTEIAQKGVDLEVGHTFWKCPYFSVFGAIGGYYYTDV----CG 441 Query: 273 KRQKDP 278 R + Sbjct: 442 HRHHNH 447 >UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VI48_9CYAN Length = 908 Score = 114 bits (286), Expect = 3e-24, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 56/195 (28%), Gaps = 26/195 (13%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAI-- 155 +A +A E + L + + LE P+ TP N+ F +G + Sbjct: 22 LAQTEAESETADTLRIKPRLGIGHTSSGGGFDGFTRLEGFVPLLQTPGKNLTFLEGRLFL 81 Query: 156 HRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSA 212 D N+ G+R +S N + G D+ + ++G+G E Sbjct: 82 DNDDANLGGNLILGYRTYSANSHRIWGGYMSYDNRHTGHNTFNQLGLGIESLGTVWDFRV 141 Query: 213 NGYIRASGWKKSPDIEDYQ-----------------ERPANGWDIRAEGYLPAW---PQL 252 NGY+ ++ + E GWD L L Sbjct: 142 NGYLPIGDTRQGVGDAGVRDIFFRRNFLILEQGQNKEAAMGGWDAEVGAKLARIGIDGDL 201 Query: 253 GASLMYEQYYGDEVG 267 Y + Sbjct: 202 RGYGGLYWYDAEGSS 216 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 114 bits (285), Expect = 4e-24, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 14/149 (9%) Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 L Q I ++ + G G + + GVN F D +++ + R+ +G++Y Sbjct: 124 LVWQANIDYKNEDILISNGIGI--LPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPN 181 Query: 208 --LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 LS+N Y SG D+ N DIRAEG + Q +SL E ++GD+ Sbjct: 182 YIFNLSSNIYFPLSGKGSEDDL-------VNSIDIRAEGAITPTVQFHSSL--EFFFGDD 232 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 + + + H +A + YTP+PL Q Sbjct: 233 IQINDDYDPTNNSHKFTAGLDYTPIPLLQ 261 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 114 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 68/184 (36%), Gaps = 26/184 (14%) Query: 120 VKLNVDKDFSL-----KDSSLEMLYPIYDTPTNM-LFTQGAIHRTDDR-TQSNIGFGWRH 172 + LN SL + M++P + + +F G D ++G G RH Sbjct: 80 LNLNYTFGKSLGCQKSYGTFGGMIFPFFSSCRPFQIFLDGKAFLFDHGKWGGSVGIGLRH 139 Query: 173 FSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWK-------- 222 FS N WM G+N + D+ ++G+G E D ++ NGY+ + + Sbjct: 140 FSYNGWMVGLNGYYDYRRFNGWDLNQLGLGVELLGDCVEFRVNGYLPVNKNRWDQCCLFN 199 Query: 223 -KSPDIEDYQER--PANGWDIRAEGYL--PAWPQ-LGASLMYEQYYGDEV---GLFGKDK 273 +ER +G D +L P+ Q +G + YY F D+ Sbjct: 200 YSGSYFATLRERGYVWSGLDTEIGTWLVKPSCCQDIGLYVAAGPYYYRRSHDQDFFFHDQ 259 Query: 274 RQKD 277 + Sbjct: 260 KHHT 263 >UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZT8_9GAMM Length = 785 Score = 114 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 65/205 (31%), Gaps = 33/205 (16%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 G R LNV + L P+ ML+ GA+ T T +G G+R Sbjct: 35 WGGPWKPRQTLNVQ-GGHGMQDYYDALLPLSGNAERMLYANGALAATHHETGGELGLGYR 93 Query: 172 H-FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS-------- 219 H N+++ G + ++ +G E++ + A+ Y+ S Sbjct: 94 HIILNNEYVIGGFALMGRYQTNYHNMFNQLTLGTEFFGSIWEGRAHLYLPVSRRTKFVRS 153 Query: 220 --------GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK 271 G K E G D+ +P P+L YY + +G K Sbjct: 154 RSEGLSFQGHKLFGIQTTTYEHAEGGADVEIGHVIPGIPKLRGFA---GYYNNGLGNEHK 210 Query: 272 D---------KRQKDPHAISAEVTY 287 + R + + +Y Sbjct: 211 NINGGYGRFEYRYNNHFTFTLGDSY 235 >UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4TV20_9PROT Length = 732 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 50/172 (29%), Gaps = 20/172 (11%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWR 171 V ++ + + + + PI +N+LF + ++ + N G G+R Sbjct: 29 QPKWAPSVDVSGKAGETRRIGEVNLFLPIAQDDSNLLFLDLRTSFDNLEQREGNFGLGYR 88 Query: 172 HFSGNDWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 + W G F D S ++I G E N Y+ + Sbjct: 89 AMQDSGWNLGAYAFYDRRRSSEGHYFSQITTGLEALGQDFDARINAYLPIGRKSYEVEDS 148 Query: 229 DY-------------QERPANGWDIRAEGYLPAW---PQLGASLMYEQYYGD 264 ER +G D LP + + Y+ D Sbjct: 149 ARVDLSGGSIQILSGLERAYHGGDAELGWRLPVFATDQDSEIRVYGGGYWFD 200 >UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C087_9PLAN Length = 849 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 56/186 (30%), Gaps = 23/186 (12%) Query: 101 TAKANQEIQEWLGKYGT-------ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG 153 + A + E+ ++ A + + P+ ++ F Sbjct: 17 FSYAQDPVPEYQPEWFQEEDYLYRAYFDFTGQAGGVNDNGQGLLFIPLAQDEESLFFADL 76 Query: 154 AIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLK 209 + DD + + N G +R + W+AG+ F D S + G E Sbjct: 77 RGNIFDDSSAEGNFGLAYRRMVNDQWIAGMYGFYDVRRSQYSNIFRQGSFGFELLSIEWD 136 Query: 210 LSANGYIRASGWKKSPDIEDY------------QERPANGWDIRAEGYLPAWPQLGASLM 257 NGY+ + ++ + +ER G D L ++P+ Sbjct: 137 FRVNGYVPSQKQQRVDSLNTAYLSGNNIVMRAGEERAYWGTDFEVGRLLKSFPESNLDAE 196 Query: 258 YEQYYG 263 Y G Sbjct: 197 LRGYVG 202 >UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root RepID=B0C4D7_ACAM1 Length = 3597 Score = 111 bits (277), Expect = 3e-23, Method: Composition-based stats. Identities = 45/262 (17%), Positives = 80/262 (30%), Gaps = 32/262 (12%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 F + T A ++ G T + N T + SD + Sbjct: 148 FTASPPRTLAEAGWTTAPQVVAINKGTTPSNLPAATSHRLVQAEPNVPTDTKTGEKSDTS 207 Query: 93 RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQ 152 + +A+ + + + + + F + L P + + F Sbjct: 208 NDT-----NTEADTSTNLGIPYFVDTEFRGSTRRQFGGINLRL----PFWQDDQSFAFAD 258 Query: 153 GAIHRTDDRT-QSNIGFGWRHFSG----NDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 + T N+G +R N W+ G + F D S + + + +GAE Sbjct: 259 VHFEGGSNETFLGNLGLAYRRILNTSNENPWILGTHAFYDSKRSENGFQYHQGSLGAELV 318 Query: 205 RDYLKLSANGYIRAS-----GWKKSPDIEDYQERPANGW-------DIRAEGYLPAWPQL 252 + NGY+ S G + + Q R ANG + E A Sbjct: 319 NKKFEFRVNGYLPGSNPNVVGQRTINGVLGIQPR-ANGLGTNIVQQTLTLEARERALAGF 377 Query: 253 GASLMYEQYYGDEV--GLFGKD 272 + ++ D+V GLFG Sbjct: 378 DFEAGHRHHFNDKVSLGLFGGY 399 >UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BR71_9GAMM Length = 851 Score = 110 bits (275), Expect = 6e-23, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 53/187 (28%), Gaps = 19/187 (10%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 F + + + + + + V + D +L PIY T + +LFT+ Sbjct: 14 FALSITFTEHSLASSDKWDPWLESGVSIGTDNS---SRGEAALLLPIYQTDSGLLFTELR 70 Query: 155 IHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKL 210 D + + N+ G+R N W G+ D + + G E Sbjct: 71 GKLFDAGSKEGNLALGYRKMINNRWAIGMWVGRDIRTSEYGNRFHQEAWGLEALHPNWDF 130 Query: 211 SANGYIRASG------------WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMY 258 N Y S I E P +G+D L Sbjct: 131 RINAYNALSSAQAYPQPVEAELIGNQLFITSAAEVPLSGYDFELGHRFSVLSDQDIWLYA 190 Query: 259 EQYYGDE 265 + D+ Sbjct: 191 GAFSFDD 197 >UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFX4_PLALI Length = 1567 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 57/182 (31%), Gaps = 20/182 (10%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-N 165 + E + + +S+ P + +++FT T+ N Sbjct: 82 SVDEIFNPIFRVDARGGQLYGYDEGYTSVGGFLPFFRDENSLIFTDIRGLMTNGGKGGAN 141 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R F D + GV+ + D D + GV E YL NGY+ + Sbjct: 142 VGVGYRQFVPELDRIFGVSGWYDFDNGHREAFNQFGVSFESIGRYLDWRVNGYLPVEDNE 201 Query: 223 KSPDI----EDYQER------------PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 + + +Q G+D G P + G S YY Sbjct: 202 EISNQILGAAGFQNNFILLNRGRSVDSAYKGFDTEIGGPFPILGRYGMSGYVGMYYYANT 261 Query: 267 GL 268 + Sbjct: 262 DV 263 >UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickettsiella grylli RepID=A8PQA2_9COXI Length = 642 Score = 109 bits (271), Expect = 2e-22, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 61/201 (30%), Gaps = 44/201 (21%) Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 + ++P+ + L+ A+ TD++ Q ++G G+R + + G F Sbjct: 43 DYTVGQADAMFPLSGDMSRNLYVDPALSYGTDNQNQFDVGLGYRWITNQAAIVGGYFFGG 102 Query: 188 HDLSRSHTRIGV---GAEYWRDYLKLSANGYIRASGWKKSPD------IEDYQE------ 232 + ++ R+ + G E + N YI + + E Sbjct: 103 YSRVDNNARLWIANPGIEAFGSRWDAHLNAYIPMGDRHYTAGTEIVHFFTGHSEFGRVFL 162 Query: 233 ---RPANGWDIRAEGYLPAWPQ--------------------LGASLMYEQYYGDEVGLF 269 +G DI+A L +P G + E + V L Sbjct: 163 MHQYAGSGADIKAGYQL--FPHSSLKGYLGSYYFSPAETNNVWGGAAGLEYWLTQGVKLI 220 Query: 270 GK---DKRQKDPHAISAEVTY 287 G D +A + + Sbjct: 221 GSYSYDNLHHSTYAFGIGLEW 241 >UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RBA5_9CHLA Length = 306 Score = 104 bits (260), Expect = 3e-21, Method: Composition-based stats. Identities = 42/226 (18%), Positives = 69/226 (30%), Gaps = 42/226 (18%) Query: 107 EIQEWLGKYGTARVKLNVDKDF---SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRT 162 + EW+ A ++ V K + S P+ D+ + F IH +R Sbjct: 44 QANEWVFPPTLAYLQGVVGKGIGEQNGYASFGIFTIPLLDSNGQLFF-DARIHNLRHERW 102 Query: 163 QSNIGFGWRHFSG-NDWMAGVNTFIDHDLSR-SHTRIGVGAEYWRDYLKLSANGYIRASG 220 +N+G G R + G+N F D+ +R + ++G G E NGY Sbjct: 103 AANVGVGTRIAIPCTNLFFGINFFYDYRRTRHDYHQLGPGLELIHPCWAFRINGYFPICD 162 Query: 221 ---WKKSPDIEDYQ---------ERPANGWDIRAEGYLPAW-PQLGASLMY--EQYYG-- 263 K + + +G D+ E L W P L + Y+ Sbjct: 163 RSLRKHPKVFRFHDNLFAACTQIQNSLSGGDLELETSLRRWDPCLCFDVYIAPGGYFYHI 222 Query: 264 ------------------DEVGLFGKDKRQKDPHAISAEVTYTPVP 291 D +GL + V Y +P Sbjct: 223 RHHRDITGGRLRIGAVLFDYLGLEVRGSYDHYYKGTVQGVAYVEIP 268 >UniRef50_A8PQI7 Putative outer membrane autotransporter barrel domain n=5 Tax=Rickettsiella grylli RepID=A8PQI7_9COXI Length = 1171 Score = 103 bits (257), Expect = 7e-21, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 58/197 (29%), Gaps = 38/197 (19%) Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGN 176 AR NV + + P+ + + A+ + ++G G+R Sbjct: 34 ARFSGNVYGSTKYVVGQADAMLPLVGDAQHNFYIDPALTSGSNWEGHGDLGLGYRWIQNG 93 Query: 177 DWMAGVNTFIDHDLSRSHTRIG---VGAEYWRDYLKLSANGYI----------------R 217 + G F +++ ++ RI G E NGY R Sbjct: 94 SAILGGYLFGEYNRMDNNVRIWTMNPGIEALGSRWDAHLNGYFVMDNRSKVVGTDLEFVR 153 Query: 218 ASGWKKSPDIEDYQERPANGWDIRAEGYL-PAWPQ-----------------LGASLMYE 259 G ++ D + NG D++ L P P LG ++ E Sbjct: 154 FRGHSAVYNLFDVTQNVGNGGDVKLGYQLFPKTPLKAFVGSYFFSPAETKNILGGAVGLE 213 Query: 260 QYYGDEVGLFGKDKRQK 276 + V +F K Sbjct: 214 YWANRNVKVFASYTYDK 230 >UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=27 Tax=Cyanobacteria RepID=Q0IAR8_SYNS3 Length = 401 Score = 103 bits (257), Expect = 7e-21, Method: Composition-based stats. Identities = 37/294 (12%), Positives = 82/294 (27%), Gaps = 50/294 (17%) Query: 31 VLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSD 90 + L + V + A ++ + + +S S Sbjct: 5 LSLGLLASAISVASLPAIAQEDGGAALLRQQRDKLLEQIEQLKQRKEQLEAQIS---GSA 61 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 ++ + N ++ + + + + + P+ ++ F Sbjct: 62 QGKDDAFDLQEISLNDAVK------FNWGFQGALQGAGTPNQAGIGGFLPLSVGENSVWF 115 Query: 151 TQGAI-----HRTDDRTQSNIG-----------FGWRHFSGN-DWMAGVNTFIDHD---- 189 ++ + N G+R +G+ WM G+N D Sbjct: 116 LDALANANFSDYENNSSIINTDVAGTTISTSSRLGYRWLNGDRSWMYGLNAGYDSRPMNT 175 Query: 190 -------------LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 S ++ V AE + L+A I ++ + YQ N Sbjct: 176 GGTDTGINVSGTEKSAFFQQVVVNAEAVSNDWNLNAYALIPIGDTEQDLN-SFYQGGALN 234 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPH----AISAEVT 286 + + ++ P+L AS+ Y GD G + + ++A V Sbjct: 235 TYGLDVGYFI--TPELNASVGYYYQNGDLGSADGSGVLGRVAYEISNGLTAGVN 286 >UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6INS3_RHOCS Length = 922 Score = 103 bits (256), Expect = 9e-21, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 58/191 (30%), Gaps = 32/191 (16%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 G +A A+ + +++ + GT + S+ + P+ D+ F Sbjct: 2 TALGAGSAAADPALMDFVLRPGT-----------DGAEGSIAVAIPLADSDAARTFLDLR 50 Query: 155 IHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKL 210 D DR +NIG G R G + G + D DL ++ V + L L Sbjct: 51 GSIDDADRRVANIGIGHRFRLG-AVVLGGAVYYDRVRTDLESDFSQATVSLDLMTADLDL 109 Query: 211 SANGYIRA----------------SGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 AN Y SG I +E G+D L A Sbjct: 110 RANYYAPLDDEESVGTTVAGAPRLSGNHIVRSIFQPREVTLKGFDAEVGYRLGAIEGYDV 169 Query: 255 SLMYEQYYGDE 265 Y + Sbjct: 170 RAFAGGYRYTD 180 >UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PN48_9COXI Length = 607 Score = 102 bits (255), Expect = 1e-20, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 60/224 (26%), Gaps = 51/224 (22%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG-AIHRTDDRTQSN 165 + +E L +A V +++ + + L+ + TD + Sbjct: 28 QAREPLPPRFSAEAYTGV-----YTVGRADLMVSLDGDGQHNLYVDPQGGYGTDQEWYGD 82 Query: 166 IGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R S + + G F H + S G E N YI +G Sbjct: 83 VGLGYRWISNDAAIVGWYVFAGHSCVENSSGFWITNPGVEIMGSRWDARINAYIPVAGRS 142 Query: 223 K------------------------SPDIEDYQERPANGWDIRAEGYLPA---------- 248 S + ++ NG D R L + Sbjct: 143 DDLGGIESTTAGPSFFTGHSELRTVSFTAFNEVQQVGNGADARVGYQLFSGVPLKAVVGA 202 Query: 249 ----WPQL----GASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 P G + ++ D V +F + H+ Sbjct: 203 YFFEIPHAENVRGGGAGVDYWFDDYVRVFARYNYDNRQHSQVVG 246 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 39/144 (27%), Positives = 69/144 (47%), Gaps = 6/144 (4%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + ++ I+ Q FP+A++ TP + + A + +LS +NN Sbjct: 4 MNNTLLDKLRKKKIFSYFIIASQFSFPIALSLTPTIQSYAATVEENKLSTNT-----ENN 58 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + +A + GT LSS DA ++ A +K N+EI+ W +YG A++ L VDK Sbjct: 59 NGRWLAQQTSQLGTILSSDNTHDAASQYLINQANSKVNREIENWFNQYGKAQINLGVDKH 118 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFT 151 F+LK L+ L+ ++ T + + Sbjct: 119 FTLKTQKLKSLF-LFTKQTIIFYL 141 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 10/137 (7%) Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNV----DKDFSLKDSSLEMLYPIYDTP 145 DA +N + + N + ++ ++G +++V + S + + L P+ + Sbjct: 21 DAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKGDEASYSIITTQPLAPLSEDG 80 Query: 146 TNMLFTQGAI----HRTDDRTQSNIGFGWRHFSG-NDWMAGVNTFIDHDLSRSHTRIGVG 200 + + F QG++ D RT N+G G R + G+N+F D++ S H R+ +G Sbjct: 81 SRL-FWQGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIVGINSFYDYEFSAKHKRMSLG 139 Query: 201 AEYWRDYLKLSANGYIR 217 EY R +LS N Y Sbjct: 140 GEYKRSNAELSVNKYWG 156 >UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN12_PLALI Length = 2615 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 59/188 (31%), Gaps = 26/188 (13%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-NIGFGWR 171 G Y R + N + + + L P+ + ++ Q + TD N+G R Sbjct: 45 GTYFDVRNQSNSGVGYQHGFTQIGALTPLLNDGQFLIAPQARLLITDTSKIGVNVGLIGR 104 Query: 172 -HFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASG------- 220 + +G D + G N + D+D S +++IG G E L L AN Y+ Sbjct: 105 VYDAGRDRIWGANVYYDNDETTYSNRYSQIGFGFESLGQNLDLRANAYLPTGSSDKVIGP 164 Query: 221 ---------WKKSPDIEDYQ--ERPANGWDIRAEGYLPAWPQLG-ASLMYEQYYGDEVGL 268 + E G D +P + Y+ D Sbjct: 165 NGLSNTLFYTGNQLNFTGSYLSEEALRGADFELG--IPVTQNMSWLRAYGGGYFYDATQN 222 Query: 269 FGKDKRQK 276 R + Sbjct: 223 NVSGVRGR 230 >UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK3_9PLAN Length = 967 Score = 101 bits (251), Expect = 4e-20, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 55/201 (27%), Gaps = 36/201 (17%) Query: 98 GMATAKANQEIQEWLG---KYGTARVKLNVDKDFSLKD------SSLEMLYPI--YDTPT 146 G+ + N ++ E G +G R + SS + +P+ + Sbjct: 31 GVPQEEINGDVSELFGDSGWFGRYRPHFGYRYEAGDTIGRIGGLSSFDAFFPLLEGEDSD 90 Query: 147 NMLFTQGAIHRTDDR--TQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGA 201 + F + DD SN+G G R + G + D + S ++ G Sbjct: 91 WLTFIDARLLLGDDNHNLGSNVGVGARQYIPEYQRTIGAYIYYDTRDAGYASFDQVSGGI 150 Query: 202 EYWRDYLKLSANGYIRASGWKKSP--------------------DIEDYQERPANGWDIR 241 E D N Y+ + Y + G D+ Sbjct: 151 ETLGDIWDARLNWYVPTGQTRNQYATTHTSGGSYKFVGHYLTGGTFTRYYQAAMKGLDME 210 Query: 242 AEGYLPAWPQLGASLMYEQYY 262 A + + Y+ Sbjct: 211 AGAKFYSNESMDLRAYAGWYH 231 >UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YK40_ANASP Length = 1487 Score = 100 bits (249), Expect = 6e-20, Method: Composition-based stats. Identities = 33/186 (17%), Positives = 60/186 (32%), Gaps = 23/186 (12%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLK--DSSLEMLYPIYDTP-TNMLFTQGAIH 156 A+ + Q + T RV + + + SS E P+ P ++ F QG + Sbjct: 19 ASTVSAQTPASTTAQVFTPRVGVRYTTEGAGYESFSSFEGFLPVLQIPGNSLTFLQGKLL 78 Query: 157 RTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSA 212 +D + NI G R FS + + G + + ++G+G E Sbjct: 79 LDNDSNLATNILLGHRIFSEEANRVIGGYISYSTRDTGKSNFDQLGLGFETLG-VWDFRF 137 Query: 213 NGYIRASGWKKSPDIED---------------YQERPANGWDIRAEGYLPAWPQLGASLM 257 N Y+ +G + + + + E +G D L + Sbjct: 138 NAYLPLNGSENQVEQANLPFFQGDSLMVQRSRFLEVAMSGVDAEVGTRLASLGSGDLRGY 197 Query: 258 YEQYYG 263 YY Sbjct: 198 AGVYYY 203 >UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSX1_9GAMM Length = 808 Score = 100 bits (248), Expect = 8e-20, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 49/191 (25%), Gaps = 36/191 (18%) Query: 102 AKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-D 160 EW + D S L L P Y + + +D D Sbjct: 19 GSVQAADSEWKP---NTQAYFAAGDDRSYFG--LAGLIPFYQDGKRLGYADLRYSSSDVD 73 Query: 161 RTQSNIGFGWRHFSGND-WMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYI 216 + N+G G+R + N+ + G D S R + ++ GAE D +N Y Sbjct: 74 TDEINLGAGFRSLNENETAIYGFYGSYDLRKSATERDYRQLTFGAELLTDTWDYRSNFYF 133 Query: 217 RASGWKKSPDIEDYQ-------------------------ERPANGWDIRAEGYLPAWPQ 251 + E +G DI L + Sbjct: 134 PTGDDSYQVGNAEDDVTVESEFVGHDLVRTTTTVGGGTIFEEALSGADIEVG-RLLNFDN 192 Query: 252 LGASLMYEQYY 262 Y+ Sbjct: 193 FEMRGYLGAYH 203 >UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A8_9CHLA Length = 225 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 41/131 (31%), Gaps = 17/131 (12%) Query: 149 LFTQG-AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 +F D + ++ G G R + + G+NT+ D+ R ++GVG E Sbjct: 8 VFIDLDGYRFNDGKWGASTGIGIRKELSDGCVLGLNTYYDYLRGRGRFSFHQVGVGFEML 67 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDY-------------QERPANGWDIRAEGYLPAWPQ 251 D + NGY+ S S + E G D L + Sbjct: 68 SDCFDVRINGYLPVSEKVHSHQCLSFHYSGTDFHASRCKLEYAYGGLDAEIGKPLLTYYD 127 Query: 252 LGASLMYEQYY 262 YY Sbjct: 128 FDLYGAVGPYY 138 >UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQN0_PARL1 Length = 675 Score = 97.8 bits (242), Expect = 3e-19, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 56/180 (31%), Gaps = 28/180 (15%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR-TQSNIGFGW 170 G + A L+ ++D P++ + ++LF + T+ N G+ Sbjct: 31 WGPWIEAGGFLSTERD----RGEATAFMPLFQSGESLLFADVKGKLFSEGVTEGNFALGY 86 Query: 171 RHFSGNDWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 R + D G+ D S + + G E NG++ + K +P + Sbjct: 87 RRMTAWDVNLGLWGGYDIRESVSGNTFDQAAFGIEALAADYDFRLNGFVPLADGKAAPGM 146 Query: 228 EDYQ------------ERPANGWDIRAEGYLPAWPQLG-------ASLMYEQYYGDEVGL 268 + E G++ LP LG L Y D+ L Sbjct: 147 ARVELSGSQILLTGGRELVLGGFEGEVGWRLP-LEALGADRERHEFRLYAGGYRFDDSDL 205 >UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDV5_NEOSM Length = 696 Score = 96.3 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 67/244 (27%), Gaps = 18/244 (7%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 H + +A A + + + + N Sbjct: 95 TPHDSRGDSLQSAIQAGKSQGRVSELARNLPQAERSTLNAYRVNVFAPE-KVVTQSDLNN 153 Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQ-SN 165 + +G T + + ++ S L P+ N+++ D + + Sbjct: 154 TSRHTVGARFTVTNEFSDSNGGAVSMSEFGALLPLLSKVDNLIYIDLKSKLYDAKEGEVS 213 Query: 166 IGFGWRHFSGNDWMAGVNTFIDHDL-SRSHTRI-GVGAEYWRDYLKLSANGY--IRASGW 221 G +R G+N F D + R +G E + L+ N Y + + Sbjct: 214 TGIVFRRQMSPLLTGGINVFTDVRFLPEGNYRWYSLGGEIFFKSFSLNGNYYRSNKKTTI 273 Query: 222 KKSPDIEDY-----------QERPA-NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 E + ER A NG+D+ L + + S + + F Sbjct: 274 SSVKSFEFHDPDPGKAVIVLDERAAGNGYDLGLGLTLNKYINIHGSAFFFYSPYNTEEKF 333 Query: 270 GKDK 273 + Sbjct: 334 SGYR 337 >UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLP1_ACAM1 Length = 1022 Score = 96.3 bits (238), Expect = 1e-18, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 62/203 (30%), Gaps = 40/203 (19%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQG 153 + +A+ + ++G + N + + LE P++ P + F +G Sbjct: 24 IAEPQPSTQASDL--RFSPRFG---IGANSPSSGTNTTTRLETFVPVWQKPGRALTFFEG 78 Query: 154 AIHRTDDRT-QSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLK 209 + D NI FG+R +S + + G + D + + ++ +G E + Sbjct: 79 RLLLDDQGNPGGNILFGFRQYSDDLKRIFGGHLGFDIRNTDNNTFQQLSLGIESLGKDVD 138 Query: 210 LSANGYIRASGWKKSPDIEDYQ-----------------------------ERPANGWDI 240 L NGY ++ ++ E G D Sbjct: 139 LHLNGYWPVGSTRRQTRQRIFEVLQLNGDPRFTGNILLLDLLRRRLITRQFEEALAGVDF 198 Query: 241 RAEGYLPAWPQLG-ASLMYEQYY 262 L ++ G Y+ Sbjct: 199 EVGKQLLSFKNGGDLRAYLGPYF 221 >UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK4_9PLAN Length = 786 Score = 95.9 bits (237), Expect = 1e-18, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 50/177 (28%), Gaps = 28/177 (15%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYD--TPTNMLFTQGAIHRTDDRT--QSNIGF 168 +G R + SSL+ P+ + + F + D SN+GF Sbjct: 53 PHFGY-RYQAGDTIGRIGGLSSLDGFLPLLEAEDGNWLTFLDARLLLDDQNQNLGSNVGF 111 Query: 169 GWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP 225 G R + G + D + R+ +++ G E D N Y+ + Sbjct: 112 GARQYLPEWGRTIGGYVYYDTRDTGTRNFSQVSGGIETLGDLWDARLNWYVPTGSRRSLV 171 Query: 226 D--------------------IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYY 262 + Y + G D+ A + + Y+ Sbjct: 172 GTSHTVGGPSQFIGHYLYGGILTRYYQAAMTGVDMEAGRKILTSDSMDVRAFAGWYH 228 >UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8X2_9PLAN Length = 1606 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 56/174 (32%), Gaps = 22/174 (12%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAIHRTDDRTQS-NIGFGWR 171 + + S+L +L P P +MLF TD N+G GWR Sbjct: 123 PLFRLDKGIGGGIGYDDGYSNLGVLMPFTINPEQSMLFLDLRAMVTDQGAGGVNLGAGWR 182 Query: 172 HFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIR----------- 217 ++ N D + V + D+D + ++G+ E YL NGY Sbjct: 183 AYNDNLDKIFTVAGWYDYDDGHYQDYHQLGLSGEVIGQYLTTRVNGYFPINNNEIIISNN 242 Query: 218 ASGWKKSPDIEDYQERPAN------GWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 SG Y R G D G LP + G YY + Sbjct: 243 LSGSAYFQTDRIYLNRTRRSESSYGGVDAEVGGPLPVLGKFGIDGYVGGYYYNS 296 >UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZRN5_9PLAN Length = 792 Score = 94.7 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 62/210 (29%), Gaps = 29/210 (13%) Query: 84 SSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEML---YP 140 S+Q D + + + A+ G+Y R+ + + + D S P Sbjct: 24 SAQQAGDDIQPGLISGTSTFASPYANGQGGEYF-PRISVQHRTEGAGYDYSFTDFRAWVP 82 Query: 141 IYD--TPTNMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHT- 195 +Y+ ++ F GA +D+ N G R +S N G D+ + + T Sbjct: 83 LYESYDSKSLTFFDGAFLLANDQNVGMNAVVGQRFYSDNYGRTFGGYVGYDNRDTGNQTV 142 Query: 196 -RIGVGAEYWRDYLKLSANGYIRAS-----------------GWKKSPDIEDYQERPANG 237 ++ G E NGY + G+ E G Sbjct: 143 GQVVTGFESLGRI-DFRVNGYFPTTSDPTMTGQTGFFDPTYVGYNIQLSQLTQYEVAMKG 201 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 +D G LP Y G Sbjct: 202 FDAEIGGALPHVGDY-LRAYLGAYNFQGSG 230 >UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZV2_9CYAN Length = 1059 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 47/145 (32%), Gaps = 21/145 (14%) Query: 123 NVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAIHRTDDRT-QSNIGFGWRHFSG-NDWM 179 + + SLE PI P + + F +G + D T I G R ++ + + Sbjct: 57 SEGAGYQDPFFSLEGFVPITQNPGSTVTFLEGQLRLFTDSTMGGTILLGQRFYNSTQNRI 116 Query: 180 AGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPD----------- 226 G D + + +IG G E D L N Y+ + D Sbjct: 117 LGGYLSYDTRDTGNSLFHQIGAGFERLGDDWDLRVNAYLPVGERRPEVDESFSLRGFQEN 176 Query: 227 -----IEDYQERPANGWDIRAEGYL 246 E G+DI A G L Sbjct: 177 NLLLNHRQRFEAAMAGFDIEAGGRL 201 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 59/166 (35%), Gaps = 39/166 (23%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 T +N+G G+R S ++ GVNTF DH S+ + RI G EY ++ AN Y + Sbjct: 141 GTVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLN 200 Query: 220 GWKKSPD----IEDYQE----------------RPANGWDIRAEGYLPAWPQLGASLMYE 259 K P E Y E + +G+D+ A + Sbjct: 201 STKSEPYNVPLYEGYFEFLLDGGPAGYTVYKSQKALSGYDVSYARTFKNARWARAYVGAY 260 Query: 260 QYYGDEVGLFGKD-----------------KRQKDPHAISAEVTYT 288 + G V G+ Q PH +S +V YT Sbjct: 261 HWNGLGVKTHGEGPALALNVGKSHGWQAGTTLQLTPH-VSLDVGYT 305 >UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLE3_PLALI Length = 1304 Score = 90.9 bits (224), Expect = 5e-17, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 52/156 (33%), Gaps = 25/156 (16%) Query: 138 LYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS-- 193 L P MLF R++ DR +N+G G R++ N D + G N + D+D + Sbjct: 95 LMPYGFIENFMLFGDLRGFRSNSDRYGANVGGGARYYLENYDRIIGANAYFDYDETSGAP 154 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKK---------SPDIEDY-----QER----PA 235 +G G E Y N Y ++ S +D +ER Sbjct: 155 FRDVGFGIETLGRYWDARVNAYFPVGPTEQLLSQSVVTGSQRFQDTRILFDRERIVGLAP 214 Query: 236 NGWDIRAEGYL---PAWPQLGASLMYEQYYGDEVGL 268 G+D L + + Y+ L Sbjct: 215 KGFDAEFGMPLFFNSFFERHDLRAFGGFYHYQSENL 250 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 90.5 bits (223), Expect = 6e-17, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 55/126 (43%), Gaps = 18/126 (14%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ R ++ A ++ + N+ Sbjct: 138 LRKLNQFRTFVR---NVRPGDELDV---------------QAQVSEKNLTPPPGNSSGNL 179 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+DF Sbjct: 180 EQQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDF 239 Query: 129 SLKDSS 134 SLK+S Sbjct: 240 SLKNSR 245 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 83.6 bits (205), Expect = 8e-15, Method: Composition-based stats. Identities = 41/217 (18%), Positives = 69/217 (31%), Gaps = 43/217 (19%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML----FTQGAIHRTDD--RTQSNIGFG 169 G K++ D ++ +SS P Y + + D +IG G Sbjct: 127 GNGGEKISSDAYWNGGESSYIGDDPKYKAAARLAQQPSYLDKGETVQHDSLGVVGSIGAG 186 Query: 170 WRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI- 227 +R S N+ G+NTF D+ +R+G+G EY K+SAN Y S K P Sbjct: 187 YRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANVYHGLSEKKTKPYYF 246 Query: 228 -----------------EDY----------QERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 + Y E +G+++R + + Sbjct: 247 ENSLVIVPRADEFHYPEDGYPNGFTKIRYAYENVLDGYNVRYTRDYKNARWISTYVEGYH 306 Query: 261 YYGDE-----VGLFG-KDKRQKDPHAISAE--VTYTP 289 + V +F + K + + TP Sbjct: 307 WKTKSPSEHPVDMFYLNQHKWKSISGLKLGATLNITP 343 >UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K1T2_CYAP8 Length = 1873 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 59/191 (30%), Gaps = 13/191 (6%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLG--K 114 + + + + + T P+ T ++ + + Sbjct: 38 TEPEQLNELSPKLEGIETIEEAGWTEKPISPNGTNPSETPTNETDSQGTPSPETPQPAIR 97 Query: 115 YGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAIHRTDDRTQ---SNI 166 Y T RV + ++ + E +PI + FT+G + + + +N Sbjct: 98 YFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLASTHDAEDIRANF 157 Query: 167 GFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWKK 223 G R +S + + + G D + + GVG E D+ N YI ++ Sbjct: 158 LVGHRLYSQDHNRVYGAYIGYDLRDTKYNKFNQFGVGIETLGDFWDARFNAYIPLGTTQQ 217 Query: 224 SPDIEDYQERP 234 + P Sbjct: 218 QIGQTNTALNP 228 >UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QR03_CYAP0 Length = 1985 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 49/141 (34%), Gaps = 11/141 (7%) Query: 105 NQEIQEWLGKYGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAIHRTD 159 + E + +Y T RV + ++ + E +PI + FT+G + + Sbjct: 88 SPETSQPAIRYFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLAST 147 Query: 160 DRTQ---SNIGFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSAN 213 + +N G R +S + D + G D + + GVG E + N Sbjct: 148 HDAEDIRANFLVGHRLYSQDHDRVYGAYIGYDLRDTKYNKFNQFGVGLETLGSFWDARFN 207 Query: 214 GYIRASGWKKSPDIEDYQERP 234 YI ++ + P Sbjct: 208 AYIPLGTTQQQIGQTNTDLNP 228 >UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA61_9CHLA Length = 188 Score = 76.6 bits (187), Expect = 9e-13, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 39/116 (33%), Gaps = 12/116 (10%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLY------PIYDTPTNMLFTQGAIH-RTDDRTQSN 165 +Y + D S+ L P+ +F+ H T N Sbjct: 22 NEYFKTYLSYKGGNDGLGYHSNYASLDLMCFPLPL---EDITIFSDLKGHWLTRHHYAVN 78 Query: 166 IGFGWRHFSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRAS 219 G G+R + N F DH S + ++G+G E + + +L NG + Sbjct: 79 AGVGFRKIYAPQTIWDANLFYDHPKSSYDHYNQVGLGLELFHELWELRLNGAVALG 134 >UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E34 Length = 1016 Score = 76.3 bits (186), Expect = 1e-12, Method: Composition-based stats. Identities = 44/305 (14%), Positives = 79/305 (25%), Gaps = 89/305 (29%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 + S S A G AT + + GT L + Sbjct: 13 LKSLVGAVLALSLSGTGIQAGPPDAKGAATIEPSGHPM----YLGTVTAGLKTSD--AYT 66 Query: 132 DSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQSN-IGFGWRH----------- 172 D ++ P+Y T ++LF + + + ++ +G G+RH Sbjct: 67 DGHFSIVAPLYSTLGADATLEGSVLFIEPYVSYGEGGEIASSLGLGFRHLFGSQPLTALS 126 Query: 173 --------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 F G + F +D + + ++GVG E Y+++ N YI S Sbjct: 127 ANNTAQAGFLDEGVFVGSSVFVDMLDTEANNQFWQLGVGIEAGTRYVEVRGNYYIPLSDK 186 Query: 222 KKSPD----------------------------------------------------IED 229 + + + + Sbjct: 187 QLAEETRTRETIRNSRSRSTSYLTGVSDPYATGNTIAQDAAFTTRTTTTTYTTTIERLFR 246 Query: 230 YQERPANGWDIRAEGYLPAW-PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYT 288 E GWD +P L ++ Y D + + A + Sbjct: 247 RYEEGMEGWDAEVAVLVPGLDRYLDVRVIGGYYSFDNQPFGPQQGGTGNVEGWKAGLELR 306 Query: 289 PVPLT 293 PVP Sbjct: 307 PVPAV 311 >UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellular organisms RepID=B4D818_9BACT Length = 5429 Score = 75.9 bits (185), Expect = 2e-12, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 4/116 (3%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGND 177 RV ++ D SL+ L P+ +L+ + +D +IGFG+RH Sbjct: 74 RVTFGLEFYEHQIDESLDTLVPLATPQNGVLYFNPKLSLSDRLNPSVSIGFGYRHLLKAR 133 Query: 178 WMAGVNTFIDHDLSR-SHT--RIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY 230 + T + D + H + GVGAE ++ AN Y+ ++ + Sbjct: 134 RSSSGETSLRSDYTNFDHHVNQFGVGAEVMSRWVDFRANYYLPEQNRRRINTNQTT 189 >UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746965 Length = 1076 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 38/258 (14%), Positives = 68/258 (26%), Gaps = 83/258 (32%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + D + ++ P++ + +LF + + + ++G G+ Sbjct: 50 TVNAGVKSSDAYTDGNFSIVAPVWSSLGAEGTLSGGVLFLEPYTSYGEGGEIAASLGLGY 109 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 R+ F G N F +D + ++GVG E+ YL Sbjct: 110 RYLFGAQPISALTRKDAPQAGFFEEGVFVGTNVFIDMLDTEADNQFWQLGVGVEFGNRYL 169 Query: 209 KLSANGYIRASGWKKSPDIEDYQ------------------------------------- 231 + N YI S + + + + Sbjct: 170 EFRGNYYIPLSDKQVAEQFKTREVLQSSSTSRSQSVTPLNNPYATGYTIAQDALYTTRAT 229 Query: 232 ---------------ERPANGWDIRAEGYLPAWPQ-LGASLMYEQYYGDEVGLFGKDKRQ 275 E GWD A +P + L+ Y D + Sbjct: 230 TTTRTTTIDRLFSRYEEGMEGWDAEAAFLVPGLDKYFDLRLIGGYYSFDNQPFGPQTGGT 289 Query: 276 KDPHAISAEVTYTPVPLT 293 + A V PVP Sbjct: 290 GNVEGWKAGVEIRPVPAI 307 >UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C500_9PLAN Length = 1337 Score = 73.6 bits (179), Expect = 7e-12, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 45/142 (31%), Gaps = 24/142 (16%) Query: 144 TPTNMLFTQGAIHRTDDRTQSNIGFG-WRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGV 199 M+F + RT+ +R ++ + D + G + + D D S ++ + Sbjct: 145 DDAGMMFGNFRLWRTNRGNLGGGAGLGYRFYNYDTDRIFGTSFYYDRDDSTDKIFQQLAL 204 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ------------------ERPANGWDIR 241 E Y + N Y+ ++ ++E + G+D Sbjct: 205 NVETMGRYWDANGNFYLPIGNREQQLNLEFNDGSQRFSGFNVLYDQTRTIGKSMRGFDAE 264 Query: 242 AEGYLPAWPQLGASLMYEQYYG 263 +P W +L Y G Sbjct: 265 IG--VPIWGELAQQFQARAYAG 284 >UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174607D Length = 975 Score = 71.3 bits (173), Expect = 3e-11, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 68/258 (26%), Gaps = 83/258 (32%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + + ++ P++ T ++++ + + + ++G GW Sbjct: 91 TVTSGVKTSDVYTEGNFSIVAPVFSTLGADATLSGDVIYLEPYTSSGEGGEIAASLGLGW 150 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 RH F + G N F +D + + ++GVG E YL Sbjct: 151 RHLFGSQPVSALTRKDAPQASFLEEGFFVGANLFIDMLDTEANNQFWQLGVGIEAGTRYL 210 Query: 209 KLSANGYIRASGWKKS-------------------------------------------- 224 ++ N YI S + + Sbjct: 211 EVRGNYYIPLSDKQLAEQTRTREILRNSSSRDTTTVSALSDPYATGNTVSQDVSYRTQRT 270 Query: 225 --------PDIEDYQERPANGWDIRAEGYLPAWPQ-LGASLMYEQYYGDEVGLFGKDKRQ 275 + E GWD +P + L+ Y D + Sbjct: 271 TTTTTTTIERLFSRYEEGMEGWDTEVAVLVPGLDKYFDLRLIGGYYSFDNQPFGPQTGGT 330 Query: 276 KDPHAISAEVTYTPVPLT 293 + A V PVP Sbjct: 331 GNVEGWKAGVEVRPVPAV 348 >UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillonella dispar ATCC 17748 RepID=C4FS48_9FIRM Length = 421 Score = 67.4 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 44/146 (30%), Gaps = 30/146 (20%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 ++G G+R S N+ GVN F+D + ++ RI G EY ++ AN Y Sbjct: 168 GIIGSVGIGYRRLSRNEHAYVGVNAFVDRAFTGNYNRISGGVEYVNGLNEVYANVYRGLG 227 Query: 220 GWK----------------------------KSPDIEDYQ-ERPANGWDIRAEGYLPAWP 250 S + Y +G++I Sbjct: 228 DKDLVKGGGGNPYPKRLYPNGYPDTFPYNTIPSENYNTYVGGGVLDGYEIGIVRSFKNAR 287 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQK 276 A + ++ G+ + + Sbjct: 288 WARAYVNGYRWNGNGFSHKQEYNWGR 313 >UniRef50_A5GVG9 Uncharacterized conserved secreted protein n=15 Tax=Cyanobacteria RepID=A5GVG9_SYNR3 Length = 349 Score = 67.4 bits (163), Expect = 5e-10, Method: Composition-based stats. Identities = 24/219 (10%), Positives = 56/219 (25%), Gaps = 51/219 (23%) Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG----------AIHRTD----DRT 162 + + + + PI ++ + D D Sbjct: 43 RLGFQGQTQGAGTPNEVGVGGFLPIAVGDNSVFYADVEVNANLADFSGYSSIDNTQVDGV 102 Query: 163 QSNIG--FGWRHFSGND-WMAGVNTFIDHD------------------------------ 189 + G+R + + WM G+N D Sbjct: 103 TVSTSSRLGYRWLNDDRSWMFGINAGYDSRPMNTGDAKPWHPIKRRYQHLYLPSIKYAKN 162 Query: 190 -LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 S ++ E A + ++ + YQ + + + ++ Sbjct: 163 PRSVFFQQVAAEVEAVSPTWNFGAYALVPFGDTEQRLN-SHYQGGALDTYGLDVGYFI-- 219 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTY 287 P++ AS+ Y GD+ + + A++ V + Sbjct: 220 TPEINASVGYYYQQGDDSAGNSSGVKGRLAFAVAKGVEF 258 >UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6_SALET Length = 197 Score = 66.2 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 13/89 (14%), Positives = 27/89 (30%), Gaps = 4/89 (4%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + AR ++ P P+ + + ++ Sbjct: 66 QLKKLNGLRTFARGFDHLQAGDELDVPAV----PLTGGKGDNNRHDARGPFAADRENEDA 121 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFI 96 + + A+ AG+FL+S PD A + Sbjct: 122 QAQQMVGMASQAGSFLASHPDGQAAAGMV 150 >UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=1 Tax=Synechococcus sp. RS9916 RepID=Q05XC6_9SYNE Length = 404 Score = 62.8 bits (151), Expect = 1e-08, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 60/220 (27%), Gaps = 44/220 (20%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRT-----DDRTQSNI---G 167 V + + ++ PI T ++ F + N G Sbjct: 35 FRWNVFSKSQGAGTPNQAGGQVFIPISTTRKSIFFLDALATADFGDALSTSSIVNTPVEG 94 Query: 168 --------FGWRHFSGNDWM-AGVNTFIDHD-LSRS-------------------HTRIG 198 G+R + N + GVN D +S +I Sbjct: 95 TTFSTSSRIGYRWLNDNGDILFGVNAGYDSRPISTGIPSRYSWAPRSLLQPQDVFFQQIA 154 Query: 199 VGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMY 258 GAE + + + + + ++ Y + + I E L AS+ Y Sbjct: 155 FGAELVTNNIAIKPYALVPVGKTEDVLNL-FYSGGALDTYGIDIEHSFDEL--LTASIGY 211 Query: 259 EQYYGDEVGLFGKDKRQK---DPHA-ISAEVTYTPVPLTQ 294 GD G + +P S V YT P + Sbjct: 212 YYQQGDLTYANGSGLKSTIAINPAGSFSMGVEYTYDPAFE 251 >UniRef50_A5GVB4 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GVB4_SYNR3 Length = 394 Score = 56.6 bits (135), Expect = 9e-07, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 57/226 (25%), Gaps = 79/226 (34%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG--AIHRTD----------- 159 ++G + + ++ + +P+ + + F + +D Sbjct: 42 PRFG---FQGQTQGAGTPNEAGIGGFFPLSVSENGVFFVDALANANFSDFSGTSSIVDTA 98 Query: 160 -DRTQSNIG--FGWRHFSGN-DWMAGVNTFIDHDLSR----------------------- 192 T + G+R + N WM G+N D Sbjct: 99 VAGTTISTSTRLGYRWLNTNRSWMFGINGGYDSRPMNSGPTVSGIKVGRSTSPTNSATSA 158 Query: 193 ---------------------------SHTR------IGVGAEYWRDYLKLSANGYIRAS 219 + R + E + +A + Sbjct: 159 TGTVSSQESSISARTTNSGSTSIKKNVDNHRSVFYQQVAANIEAVSNSWNFNAYALVPIG 218 Query: 220 GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 ++ + Y + + I ++ P+L AS+ Y GDE Sbjct: 219 DTQQRLN-SHYDSAALDKYGIDVGYFI--TPELNASVGYYYQTGDE 261 >UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CT24_9RHOB Length = 771 Score = 48.9 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 38/138 (27%), Gaps = 17/138 (12%) Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNT 184 + + + +P + + R + Q +I R + W GV Sbjct: 28 YHEEGLSTGIALSFPFAIEENRATIARLSYGRDEGHNAQLSIEAMRRMTLAHGWTVGVGV 87 Query: 185 FIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-------------DIE 228 F D D+ +++G+ + R + + N Y+ + + Sbjct: 88 FADSSTDDIGNRFSQVGMSGDLQRGIFQANLNAYLPVGTKSHADARYDALAEMDGTIRFK 147 Query: 229 DYQERPANGWDIRAEGYL 246 + G D Sbjct: 148 GGRSLALRGLDAEVGARF 165 >UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MK14_SALAR Length = 110 Score = 44.3 bits (103), Expect = 0.004, Method: Composition-based stats. Identities = 14/29 (48%), Positives = 19/29 (65%), Gaps = 2/29 (6%) Query: 267 GLFGKD--KRQKDPHAISAEVTYTPVPLT 293 G+FG RQ++PHAI+ + Y PVPL Sbjct: 3 GIFGDGEADRQRNPHAIALGLNYPPVPLV 31 >UniRef50_Q28JV0 Putative uncharacterized protein n=1 Tax=Jannaschia sp. CCS1 RepID=Q28JV0_JANSC Length = 423 Score = 43.5 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 49/161 (30%), Gaps = 18/161 (11%) Query: 120 VKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHR--TD--DRTQSNI--GFGWRHF 173 V + + +L + L + +D +RT + G R Sbjct: 38 VGEGTEHSGAFSSGYFGVLN---QSADRALMFDVGLSMDWSDGWNRTSGTLYAGIIHRSM 94 Query: 174 SGNDWMAGVNTFIDHDL--SRSHTRIGVGAEY---WRDYLK--LSANGYIRASGWKKSPD 226 GN + G N ++D L S + G EY ++ + N Y + + Sbjct: 95 LGNGAVLGFNAYVDAGLLNSEIAGLVSAGVEYHPALGGDVELLFAGNLYHAFEDYTDTGA 154 Query: 227 IEDYQERPANGWD--IRAEGYLPAWPQLGASLMYEQYYGDE 265 P +G D + + L L + Y G + Sbjct: 155 FGAAATIPRSGADAFVTVDYTLGDGLSLTGTGGIFGYVGTD 195 >UniRef50_Q7V422 Prochlorococcus marinus MIT9313 complete genome n=4 Tax=cellular organisms RepID=Q7V422_PROMM Length = 742 Score = 43.1 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 50/164 (30%), Gaps = 17/164 (10%) Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 A ++ + L ++PD D + N I +G N L Sbjct: 139 AQISSKSEAALLNEPDPDILTLYPR------LNPVIGIGGTIWGN---NSNNSDFEGLIL 189 Query: 133 SSLEMLYPIYDTPTNMLFTQG--AIHRTDDRTQSNIGFGWRHF-SGNDWMAGVNTFIDH- 188 L P+ + + + D + FG+R F N GV H Sbjct: 190 GDLAYFQPLSQNSGSSVLYSLTSSSSNFDKAWGVSQEFGYRWFDPNNQRSNGVMAGYTHW 249 Query: 189 ----DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 S S +++ +G E R+ K +A G + + Sbjct: 250 QGQIKDSCSRSQLSLGVETARNRWKFAAAGGVPVDNCESQFSFA 293 >UniRef50_Q0IBW0 Putative uncharacterized protein n=1 Tax=Synechococcus sp. CC9311 RepID=Q0IBW0_SYNS3 Length = 221 Score = 43.1 bits (100), Expect = 0.011, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 41/148 (27%), Gaps = 30/148 (20%) Query: 168 FGWRHFSGND-WMAGVNTFIDHD----------------LSRSHTRIGVGAEYWRDYLKL 210 G+R + + M G+N D + ++ V E + Sbjct: 63 LGYRWLNRDRSTMYGINAGYDSRPIATGTTTNGIEVFNSQTPFFQQVAVNVELQSNQWGA 122 Query: 211 SANGYIRASGWKKSPD-----IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG-- 263 + G I + D + P + + ++M YY Sbjct: 123 NVYGLIPVGKYGYGSDNIATMNSSFAAEPLTTVGLDVNYNISNL----LAVMAGYYYQSC 178 Query: 264 -DEVGLFGKDKRQKDPHA-ISAEVTYTP 289 E +F D A + +++Y P Sbjct: 179 EKEPEIFENDAEGSGVKARLEYDISYQP 206 >UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Campylobacter RepID=Q4HGX9_CAMCO Length = 267 Score = 42.4 bits (98), Expect = 0.018, Method: Composition-based stats. Identities = 34/167 (20%), Positives = 60/167 (35%), Gaps = 15/167 (8%) Query: 52 QPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEW 111 + N N +K A T D + N I N Sbjct: 15 LNADELDNALKNNQNKWQKFNYQATQKAPTIKEENIDFKSALNGILSNVLENKN------ 68 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 G + N+D F ++ ++ L +Y+ N L Q + T D + G R Sbjct: 69 ----GIDKTDGNLD--FQNENVQIKNLNSLYEGENNSLLFQKEFYATQDSYNYSGGLINR 122 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEY-WRDYLKLSANGYIR 217 + +D++ G+N FID + ++ GAE + ++K +N Y+ Sbjct: 123 -YEKDDFLLGINGFIDGQKEQKESK-SFGAELGYYQFVKAYSNYYVP 167 >UniRef50_Q0I6I6 Unnamed protein product n=3 Tax=Synechococcus RepID=Q0I6I6_SYNS3 Length = 605 Score = 42.4 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 49/156 (31%), Gaps = 7/156 (4%) Query: 80 GTFLSSQPDSDATRNFITGMATAK--ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEM 137 G + S T + N I +G+ + + L Sbjct: 136 GEAIISDQGLQETDQNSPDILEQYPRLNPVIGFGSSAWGSNASGSTLGQAAGLILGEASF 195 Query: 138 LYPIYDTPTNMLFTQGAI--HRTDDRTQSNIGFGWRHFSGNDW-MAGVNTFIDHDLSRS- 193 P+ + + L + D ++ FG++ F+ N+ ++ + D + Sbjct: 196 FLPLRQSEGSKLLYNYSTASSNFDSSWGASTEFGYKWFNPNNRSISSLLVGYDAWETSQC 255 Query: 194 -HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 H+++ +G ++ + + G I + + Sbjct: 256 VHSQLALGGQWQKKRWQFGVTGGIPIDDCENNLGFA 291 >UniRef50_B3JG90 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JG90_9BACE Length = 207 Score = 41.2 bits (95), Expect = 0.049, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 47/164 (28%), Gaps = 35/164 (21%) Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 WL YG + +M+ P + D R ++GF Sbjct: 13 SGWLVTYGQTGRSSGLTIYGGKDMVGYDMVPP----SAGVPPFDAGKTYFDSR--FSLGF 66 Query: 169 GWRHFS---GNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY--LKLSANGYIRASGWKK 223 G+R + W F D ++ + R G Y + + N Sbjct: 67 GYRWRLKPIDSRW------FFDFAITAGYHRYKYGLSYLSPDEHITYAGNA--------- 111 Query: 224 SPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE--QYYGDE 265 + + +G YL + L S+ E Y+ D+ Sbjct: 112 --GVCNLLSVSLSG---SVGYYL--YKGLNLSVGVESAYYFYDK 148 >UniRef50_C4RKT9 Secreted protease n=4 Tax=Actinomycetales RepID=C4RKT9_9ACTO Length = 937 Score = 40.4 bits (93), Expect = 0.071, Method: Composition-based stats. Identities = 37/248 (14%), Positives = 70/248 (28%), Gaps = 32/248 (12%) Query: 51 VQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQE 110 R +G + +K + ++G + + A + + + Sbjct: 478 QTSRTPVGTAHGIKVDLPDKVITLATPHSGGNMWHSGADQNWADVKLSRAVSVPSAADAK 537 Query: 111 WLGK--------YGTARVKLNVDKDFSLKDSS-LEMLYPIYDTPTNMLFTQGAIHRTD-- 159 + + V+L+ D + + + + TP N + D Sbjct: 538 FWMWNNYVIEEDWDYGFVELSTDGGATWSEQKVYDAAGAVVTTPDN--YADPNGRMADFG 595 Query: 160 ---DRTQSNIGFGWRHFSGNDWMAGVNT------------FIDHD-LSRSHTRIGVGAEY 203 + G GWRH + T F++ + G G Sbjct: 596 GKKYGLTGSTG-GWRHDYVDLSAYAGQTVQLRLRQATDESFLERGWFADDFAVTGGGTTT 654 Query: 204 WRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG 263 W D ++ ANG+ G D + +G ++RA YL W +Y Sbjct: 655 WSDDVEGGANGWTATGG--SFTDTTGGGWKTNSGTEVRAHYYLAEWRNFDGFDKGLKYAY 712 Query: 264 DEVGLFGK 271 D V Sbjct: 713 DTVYSHDA 720 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P36943 Putative attaching and effacing protein homolog ... 356 5e-97 UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius st... 352 7e-96 UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepI... 340 3e-92 UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria R... 338 1e-91 UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Entero... 337 3e-91 UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 335 2e-90 UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersini... 334 3e-90 UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escheri... 332 8e-90 UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepI... 332 1e-89 UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB2... 331 2e-89 UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 329 8e-89 UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellula... 327 3e-88 UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia... 324 3e-87 UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax... 321 2e-86 UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersi... 318 9e-86 UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersini... 316 5e-85 UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC ... 308 2e-82 UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 3546... 308 2e-82 UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersini... 307 2e-82 UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodenti... 307 3e-82 UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersini... 305 1e-81 UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=... 305 1e-81 UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enter... 303 4e-81 UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR 301 2e-80 UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638... 300 3e-80 UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterob... 300 3e-80 UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Ta... 296 6e-79 UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax... 293 5e-78 UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia ... 290 4e-77 UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersini... 290 5e-77 UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=... 288 1e-76 UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersini... 287 3e-76 UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regula... 286 4e-76 UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=... 284 3e-75 UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS 282 9e-75 UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=IN... 280 3e-74 UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotic... 278 2e-73 UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI 277 3e-73 UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmone... 276 6e-73 UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersini... 276 6e-73 UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB2... 275 9e-73 UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX 275 1e-72 UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photo... 273 7e-72 UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_E... 273 7e-72 UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Provide... 270 5e-71 UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rett... 265 1e-69 UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae Re... 261 2e-68 UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax... 261 2e-68 UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersini... 258 2e-67 UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_S... 251 1e-65 UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular ... 245 1e-63 UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus ... 243 7e-63 UniRef50_B7LRE6 Putative invasin-like protein; putative exported... 241 2e-62 UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydroph... 240 5e-62 UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorh... 239 1e-61 UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae ... 238 1e-61 UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youn... 238 2e-61 UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 Rep... 234 3e-60 UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus... 234 3e-60 UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacte... 232 1e-59 UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enteric... 222 1e-56 UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM ... 214 2e-54 UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussi... 213 6e-54 UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchisepti... 205 2e-51 UniRef50_Q9APE8 Putative outer membrane ligand binding protein n... 203 9e-51 UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella a... 196 5e-49 UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter l... 191 2e-47 UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW... 179 1e-43 UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenteri... 174 4e-42 UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus mar... 156 7e-37 UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax... 154 3e-36 UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillon... 151 4e-35 UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synecho... 149 1e-34 UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 T... 149 1e-34 UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius st... 142 1e-32 UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 ... 139 1e-31 UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio ... 139 2e-31 UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultu... 137 3e-31 UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candida... 137 4e-31 UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillon... 136 1e-30 UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodoba... 135 1e-30 UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured b... 135 1e-30 UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candida... 131 3e-29 UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylo... 130 7e-29 UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 T... 129 2e-28 UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorob... 123 7e-27 UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microco... 121 2e-26 UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 119 8e-26 UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia ... 119 1e-25 UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microco... 119 2e-25 UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=... 118 2e-25 UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthin... 118 3e-25 UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magneto... 117 4e-25 UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachl... 115 2e-24 UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Plancto... 115 2e-24 UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuni... 115 2e-24 UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=... 115 2e-24 UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachl... 114 5e-24 UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candida... 113 7e-24 UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepI... 112 1e-23 UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legione... 111 2e-23 UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorick... 109 9e-23 UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root Re... 109 1e-22 UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Plancto... 108 2e-22 UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickett... 107 4e-22 UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus ... 105 1e-21 UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Plancto... 105 2e-21 UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma p... 103 9e-21 UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Plancto... 102 1e-20 UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodosp... 102 2e-20 UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=... 102 2e-20 UniRef50_A8PQI7 Putative outer membrane autotransporter barrel d... 101 3e-20 UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepI... 101 3e-20 UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickett... 101 3e-20 UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachl... 101 3e-20 UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultu... 100 6e-20 UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum ... 99 2e-19 UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus... 99 2e-19 UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryoc... 99 2e-19 UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastop... 97 7e-19 UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Plancto... 95 3e-18 UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillon... 95 3e-18 UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Plancto... 94 4e-18 UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=... 94 5e-18 UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microco... 94 6e-18 UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyano... 93 9e-18 UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillon... 93 1e-17 UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanoth... 92 3e-17 UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=... 91 4e-17 UniRef50_A5GVG9 Uncharacterized conserved secreted protein n=15 ... 91 4e-17 UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=... 90 9e-17 UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escher... 90 1e-16 UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachl... 79 1e-13 UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellu... 78 4e-13 UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillon... 76 2e-12 UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Plancto... 76 2e-12 UniRef50_A5GVB4 Uncharacterized conserved secreted protein n=1 T... 75 3e-12 UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicib... 72 3e-11 UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6... 66 1e-09 Sequences not found previously or not previously below threshold: UniRef50_Q0IBW0 Putative uncharacterized protein n=1 Tax=Synecho... 49 2e-04 UniRef50_Q7V422 Prochlorococcus marinus MIT9313 complete genome ... 46 0.002 UniRef50_B6R6H4 Putative uncharacterized protein n=1 Tax=Pseudov... 45 0.002 UniRef50_Q0I6I6 Unnamed protein product n=3 Tax=Synechococcus Re... 45 0.002 UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmone... 43 0.010 UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Ca... 41 0.041 >UniRef50_P36943 Putative attaching and effacing protein homolog n=48 Tax=Enterobacteriaceae RepID=EAEH_ECOLI Length = 295 Score = 356 bits (914), Expect = 5e-97, Method: Composition-based stats. Identities = 295/295 (100%), Positives = 295/295 (100%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT Sbjct: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV Sbjct: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA Sbjct: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 Query: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI Sbjct: 181 GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDI 240 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ Sbjct: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQQ 295 >UniRef50_Q2NVE8 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVE8_SODGM Length = 934 Score = 352 bits (904), Expect = 7e-96, Method: Composition-based stats. Identities = 134/291 (46%), Positives = 184/291 (63%), Gaps = 10/291 (3%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 H + ++ AR ++ PL A + + Sbjct: 78 HMSLEALRKLNQFRTFARGFDHLQPGDELDVPL---------APLPAVTWAEETPVPASA 128 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + ++ + +A A+ AG FL++ P DA + GMAT A+ E+Q+WL ++GTAR++L Sbjct: 129 SKEDLQAQKIAGIASQAGNFLANSPRGDAAASIARGMATGAASTEVQQWLSQFGTARLQL 188 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 +VD FSLK+S L++L P+Y+ P ++FTQG++HRTDDRTQ+N+G G R F + +M G Sbjct: 189 DVDNKFSLKNSQLDLLIPLYEQPDKLVFTQGSLHRTDDRTQTNLGMGMRWF-NDGYMLGG 247 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF+D+DLSR H R+G+G EYWRDYLK+ AN Y+R + W+ S D DYQERPANGWD+ Sbjct: 248 NTFLDYDLSRDHARMGMGVEYWRDYLKIGANNYLRLTNWRDSKDFADYQERPANGWDMSL 307 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG++PA PQLG +L YEQYYG EV LFGKD RQKDPHAI+ V YTP PL Sbjct: 308 EGWVPALPQLGGNLKYEQYYGKEVALFGKDNRQKDPHAITVGVNYTPFPLL 358 >UniRef50_D0FWP0 Putative invasin n=4 Tax=Erwinia pyrifoliae RepID=D0FWP0_ERWPY Length = 1270 Score = 340 bits (873), Expect = 3e-92, Method: Composition-based stats. Identities = 126/290 (43%), Positives = 176/290 (60%), Gaps = 6/290 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + A ++ P + + T Sbjct: 85 ITPKALRKLNVLRTFAHGFDNLQPGDELDVPAVMPDGKPDSPAKTGDE-----QAATPPL 139 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 D+ +A A+ AGT LS+ PD DA + G +A A+ ++Q+WL ++GTARV+L Sbjct: 140 KDDEGAMKMADMASRAGTLLSNSPDGDAALSMARGQISAVASGQVQQWLNQFGTARVQLE 199 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 D+ FSLK+S +++L P Y+ +LFTQG++HRTDDRTQ+N+GFG R+F+ +M G N Sbjct: 200 ADEHFSLKNSQVDLLIPFYEQNDELLFTQGSLHRTDDRTQANLGFGLRYFAP-SYMLGGN 258 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F D+DLS H+R G+G EYWRD+LKLSANGY+R S W+ SP++++YQERPANGWDIRA+ Sbjct: 259 IFGDYDLSHEHSRTGIGVEYWRDFLKLSANGYLRLSDWRDSPNMKEYQERPANGWDIRAQ 318 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LP+ PQLG L YEQYYG V LFGK+ Q++P AI+A V +TP PL Sbjct: 319 AWLPSLPQLGGKLTYEQYYGKGVALFGKENLQQNPRAITAGVNFTPFPLL 368 >UniRef50_Q8X8V7 Uncharacterized protein yeeJ n=47 Tax=Bacteria RepID=YEEJ_ECO57 Length = 2660 Score = 338 bits (867), Expect = 1e-91, Method: Composition-based stats. Identities = 129/291 (44%), Positives = 183/291 (62%), Gaps = 17/291 (5%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + ++ AR ++ P ++ Sbjct: 88 ISVAELRKLNQFRTFARGFDNVRQGDELDVPA---------------QVSENNLTPPPGN 132 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 + N+E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L Sbjct: 133 SSGNLEQQIASTSQQIGSLLAEDMNSEQAANMARGWASSQASGAMTDWLSRFGTARITLG 192 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 VD+DFSLK+S + L+P Y+TP N+ F+Q +HRTD+RTQ N G GWRHF+ WM+G+N Sbjct: 193 VDEDFSLKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTP-TWMSGIN 251 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRA 242 F DHDLSR H+R G+GAEYWRDYLKLS+NGY+R + W+ +P+++ DY+ RPANGWD+RA Sbjct: 252 FFFDHDLSRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWDVRA 311 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG+LPAWP LG L+YEQYYGDEV LF KD RQ +PHAI+A + YTP PL Sbjct: 312 EGWLPAWPHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPLM 362 >UniRef50_B7MMM3 Putative invasin/intimin protein n=10 Tax=Enterobacteriaceae RepID=B7MMM3_ECO45 Length = 1746 Score = 337 bits (864), Expect = 3e-91, Method: Composition-based stats. Identities = 132/291 (45%), Positives = 179/291 (61%), Gaps = 13/291 (4%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + ++ AR ++ P A Sbjct: 88 ISLEELRRLNQFRTFARGFDNVRQGEELDVPATTLQKSHEQQNAV-----------PPAN 136 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 +N +E +AS + GT LS +S+ G A+++A+ + +WL +GTA++ L Sbjct: 137 GENTLENQIASTSQRVGTLLSQDMNSEQASGMARGWASSEASGAMTDWLNNFGTAKISLG 196 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 VD+DFSLK+S + L+P YDTP +LF+Q +HRTDDRTQ N G GWRHF+ WM+G+N Sbjct: 197 VDEDFSLKNSQFDFLHPWYDTPDYLLFSQHTLHRTDDRTQINTGLGWRHFTP-SWMSGIN 255 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRA 242 F DHDLSR H+R G+GAEYWRDYLKLS+N YI +GW+ +P+++ DY+ RPANGWD+RA Sbjct: 256 LFFDHDLSRYHSRAGLGAEYWRDYLKLSSNAYIGLTGWRSAPELDNDYEARPANGWDLRA 315 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG+LPAWPQLG L+YEQYYGDEV LF K+ RQ +PHAI+A + YTP PL Sbjct: 316 EGWLPAWPQLGGKLVYEQYYGDEVALFDKNDRQSNPHAITAGLNYTPFPLL 366 >UniRef50_D0ZEM2 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZEM2_EDWTE Length = 750 Score = 335 bits (858), Expect = 2e-90, Method: Composition-based stats. Identities = 118/290 (40%), Positives = 160/290 (55%), Gaps = 7/290 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + Y + AR + ++ P+ ++ A +A Sbjct: 108 ISVAQLKQVNAYRIFARGFEHVGVGDEIDIPVDMSSLNTQAGQAPKLSSAMREPSRA--- 164 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 E + G LSS S+A MAT AN+EIQ+WL KYGTARV+LN Sbjct: 165 ---EKEAQAVGQLMSVGATLSSTRPSEAAAGMARSMATNAANEEIQQWLSKYGTARVQLN 221 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 +DK+FSL +S+L+ P++D+ FTQ D R N+G G R + WM GVN Sbjct: 222 LDKNFSLSESALDWFIPVWDSANLTAFTQLGARNKDRRNTINLGVGARTLL-DRWMLGVN 280 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F DHDL+ ++R+G+GAE W DYL+LS NGY+R S W +S D DY ER ANG+DIRA Sbjct: 281 MFYDHDLTGHNSRLGIGAEAWTDYLQLSTNGYMRLSNWHQSRDFADYDERAANGFDIRAN 340 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LPA PQLG L+YEQY G+ V LFGK+ Q++P+A++A V YTP PL Sbjct: 341 AWLPALPQLGGKLVYEQYIGENVALFGKENLQRNPYALTAGVNYTPFPLL 390 >UniRef50_C4SVZ0 Putative uncharacterized protein n=2 Tax=Yersinia RepID=C4SVZ0_YERFR Length = 830 Score = 334 bits (856), Expect = 3e-90, Method: Composition-based stats. Identities = 109/286 (38%), Positives = 156/286 (54%), Gaps = 12/286 (4%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + ++ ++ ++ P + + N + + Sbjct: 3 ELKEVNQFRSFSKPFIQLGSGDEIDIPRITPLP-----------EKITTAENAKTVSSSQ 51 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 ++ +A T L+ A + +A +AN Q WL ++GTARV+LN+D + Sbjct: 52 YKERLAHNLLKGATVLADDNTPLAAASMARSVAVGEANDAAQHWLSQFGTARVQLNLDNN 111 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 SLK S+ +ML P+YD ++LF+Q + D R NIG G R N WM G N F D Sbjct: 112 LSLKGSAFDMLLPLYDDQKSLLFSQFGLRNHDSRNTINIGAGVRTLQDN-WMYGANVFFD 170 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 D++ + RIG GAE W DYLKLSAN Y+R + W +S D DY ERPANG+D+R E YLP Sbjct: 171 RDITGKNNRIGFGAEAWTDYLKLSANSYLRLTDWHQSRDFADYNERPANGYDLRVEAYLP 230 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A+PQ+G +L YEQY G+EV LFGKD RQK+P+A +A + YTP+PL Sbjct: 231 AYPQIGTNLKYEQYKGNEVALFGKDDRQKNPYAFTAGINYTPIPLI 276 >UniRef50_B7NEX3 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B7NEX3_ECOLU Length = 3418 Score = 332 bits (851), Expect = 8e-90, Method: Composition-based stats. Identities = 141/291 (48%), Positives = 190/291 (65%), Gaps = 12/291 (4%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 87 ITVDELRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM- 143 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L Sbjct: 144 -------QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLG 196 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 VD+DFSLK SS E L+P Y+TP N++F+Q +HRTD+RTQ+N G GWR+F+ WM+GVN Sbjct: 197 VDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLHRTDNRTQTNHGIGWRYFTS-SWMSGVN 255 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRA 242 FIDHDL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RA Sbjct: 256 MFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRA 315 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG+LPAWPQLG ++YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 316 EGWLPAWPQLGGKVVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_B1LKY4 Putative invasin n=8 Tax=Enterobacteriaceae RepID=B1LKY4_ECOSM Length = 2933 Score = 332 bits (850), Expect = 1e-89, Method: Composition-based stats. Identities = 143/291 (49%), Positives = 190/291 (65%), Gaps = 12/291 (4%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + ++ AR ++ PL + +P AR A+Q + + Sbjct: 87 ITVDELRRLNQFRTFARGFDNVRQGDEIDVPLINSNSP--EARNLKAMQMERDGKDPQM- 143 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 VA A +GT L+ DS+ + G + A+ + +WL ++GTARV L Sbjct: 144 -------QVAEMAQQSGTLLARDMDSEQAASMARGWVASSASAQATDWLSRWGTARVSLG 196 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 VD+DFSLK SS E L+P Y+TP N++F+Q +HRTDDRTQ+N G GWR+F+ WM+GVN Sbjct: 197 VDEDFSLKSSSFEFLHPWYETPDNLVFSQHTLHRTDDRTQTNHGIGWRYFTS-SWMSGVN 255 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRA 242 FIDHDL+R HTR G+G EYWRDYLKLS NGY+R S W+ +P+++ DY+ RPANGWD+RA Sbjct: 256 MFIDHDLTRYHTRTGMGVEYWRDYLKLSGNGYLRLSNWRSAPELDNDYEARPANGWDLRA 315 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG+LPAWPQLG L+YEQYYGDEV LFGKD+RQ DPHAI+A ++YTPVPL Sbjct: 316 EGWLPAWPQLGGKLVYEQYYGDEVALFGKDERQNDPHAITAGLSYTPVPLI 366 >UniRef50_D0ZAL1 Putative invasin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZAL1_EDWTE Length = 2359 Score = 331 bits (848), Expect = 2e-89, Method: Composition-based stats. Identities = 121/290 (41%), Positives = 161/290 (55%), Gaps = 8/290 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + ++ A + ++ P + + P + + Sbjct: 114 VTVSQLKKINQFRKFAHGIDKIGAGDEIDIPHSGSSL-------TKPGSPAAATPLSPHA 166 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 + E VA G L+S S+A ATA AN EI +WL KYGTA+++LN Sbjct: 167 DTSERESRVAGQLMGVGRVLASPQSSNAASEMARSWATAAANDEIVKWLSKYGTAQLQLN 226 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 +DK+FSL S+L+ L P YDTPT FTQ D R NIG G R S N W+ GVN Sbjct: 227 IDKNFSLDGSALDWLLPFYDTPTTTTFTQLGFRNRDHRNTLNIGIGTRTLSNN-WLFGVN 285 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F DHDLS ++R+G+G+E W DYL+LS NGY+R S W +S D+ DY ERPANG+D+RA Sbjct: 286 AFYDHDLSGKNSRLGLGSEAWTDYLQLSLNGYLRLSDWHQSRDLADYNERPANGFDVRAN 345 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 ++P PQLG LMYEQY+GD VGLFGKD Q++P+A + V YTP PL Sbjct: 346 AWMPTLPQLGGKLMYEQYFGDAVGLFGKDNLQRNPYAFTVGVNYTPFPLL 395 >UniRef50_B2VKC8 Putative invasin n=5 Tax=Erwinia RepID=B2VKC8_ERWT9 Length = 1400 Score = 329 bits (843), Expect = 8e-89, Method: Composition-based stats. Identities = 128/309 (41%), Positives = 180/309 (58%), Gaps = 19/309 (6%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLA-VTFTPVMAARAQHAVQPRLSM---- 57 H + + ++ P + P + A + Sbjct: 83 HLTPEALRKLNQRRTFTYGFDNLQPGDKLNVPAIKLDDEPDVPAARLDNKANLPAARLDN 142 Query: 58 -------------GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA 104 + D+ + +A A+ AG FLS P+ DA + G TA+A Sbjct: 143 KPDVPAIIWGQEGSAASALGDDAGARKMADVASRAGAFLSDNPNGDAALSLARGEVTAEA 202 Query: 105 NQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS 164 + ++Q+WL ++GTARV+L+ D+ FS K+S ++L P+Y+ +++FTQG++HRTDDRTQ Sbjct: 203 SGQLQQWLNQFGTARVQLDADEHFSFKNSQFDLLAPLYEQKDSLIFTQGSLHRTDDRTQV 262 Query: 165 NIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 N+GFG R+F+ +M G N F D+DLSR+H+R G+G EYWRD+LKLSANGY+R S W S Sbjct: 263 NLGFGLRYFAP-SYMLGGNIFGDYDLSRAHSRTGIGMEYWRDFLKLSANGYLRLSDWNNS 321 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 D +DYQERPANGWDIRA+ +LP+ PQLG L YEQYYG V LFGK+ Q+DP AI+A Sbjct: 322 SDFKDYQERPANGWDIRAQAWLPSLPQLGGKLTYEQYYGRGVALFGKENLQQDPRAITAG 381 Query: 285 VTYTPVPLT 293 V +TP PL Sbjct: 382 VNFTPFPLL 390 >UniRef50_B1JHX5 Conserved repeat domain protein n=39 Tax=cellular organisms RepID=B1JHX5_YERPY Length = 5337 Score = 327 bits (838), Expect = 3e-88, Method: Composition-based stats. Identities = 122/291 (41%), Positives = 162/291 (55%), Gaps = 16/291 (5%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + + Y ++ A ++ P + N Sbjct: 107 NITVDELKKLNAYRTFSKPFASLTTGDEIEVPRKESSFF---------------SNNPNE 151 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +V+ +A A AG LS+ SDA N T + N Q+WL ++GTARV+L Sbjct: 152 NNKKDVDDLLARNAMGAGKLLSNDNTSDAASNMARSAVTNEINASSQQWLNQFGTARVQL 211 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD DF L +S+L++L P+ D+ +++LFTQ + D R NIG G R + G+ WM G Sbjct: 212 NVDSDFKLDNSALDLLVPLKDSESSLLFTQLGVRNKDSRNTVNIGAGIRQYQGD-WMYGA 270 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF D+DL+ + R+GVGAE DYLK SAN Y +GW +S D Y ERPA+G+DIR Sbjct: 271 NTFFDNDLTGKNRRVGVGAEVATDYLKFSANTYFGLTGWHQSRDFSSYDERPADGFDIRT 330 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 E YLPA+PQLG LMYE+Y GDEV LFGKD RQKDPHA++ V YTPVPL Sbjct: 331 EAYLPAYPQLGGKLMYEKYRGDEVALFGKDDRQKDPHAVTLGVNYTPVPLV 381 >UniRef50_D1P141 Putative LysM domain protein n=2 Tax=Providencia RepID=D1P141_9ENTR Length = 2373 Score = 324 bits (830), Expect = 3e-87, Method: Composition-based stats. Identities = 108/291 (37%), Positives = 170/291 (58%), Gaps = 10/291 (3%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + + ++ ++ ++ P+ A Sbjct: 83 NINLQQLRKLNQFRTFSQNFENLQPGDELDIPM---------APLPIVEWDDDKPEIVLP 133 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 ++ + E VA A+ AG F S+ PD + T+ F + T A+ Q+W ++G++++ L Sbjct: 134 SSASENEIRVAQLASQAGKFFSTNPDQEKTKAFARELLTTAASSYAQDWFNRFGSSQIHL 193 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 DK FSLK+S +++L P Y+T N++F+Q ++HR + R ++N+G G R + G M G Sbjct: 194 EADKKFSLKNSQIDLLMPWYETEDNLIFSQTSLHRKEGRIETNLGLGARWY-GEGQMIGG 252 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF D+D+SR H+R+G+G EY RD+LKLSAN Y R SGW+ S D+ D+ RP+NGWD+RA Sbjct: 253 NTFFDYDISRKHSRLGLGVEYRRDFLKLSANSYHRLSGWRSSRDLADHSARPSNGWDVRA 312 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 EG+LP++P +G L YEQYYGD V LFG Q++P++I+A + YTP+PL Sbjct: 313 EGWLPSYPHIGGKLTYEQYYGDSVALFGTKNLQQNPYSITAGLNYTPIPLV 363 >UniRef50_B1JSC0 Ig domain protein group 1 domain protein n=5 Tax=Yersinia RepID=B1JSC0_YERPY Length = 1976 Score = 321 bits (822), Expect = 2e-86, Method: Composition-based stats. Identities = 117/290 (40%), Positives = 157/290 (54%), Gaps = 17/290 (5%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + Y +R ++ P + V + Sbjct: 92 ITVDELKKINIYRTFSRPFTALTTGDEIDIPRKASPFSVDNNKDNRLSV----------- 140 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 E +A A T LS+ + + + A+ + N Q+WL ++GTARV+LN Sbjct: 141 -----ENTLAGHAVAGATALSNGDVAKSGERMVRSAASNEFNNSAQQWLSQFGTARVQLN 195 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 ++ DF L S+ ++L P+YD ++LFTQ D R N+G G R F GN WM G N Sbjct: 196 INDDFHLDGSAADVLIPLYDNEKSILFTQLGARNKDSRNTVNMGAGVRTFQGN-WMYGAN 254 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 TF D+DL+ + RIGVGAE W DYLKLSAN Y + W +S D DY ERPANG+D+RAE Sbjct: 255 TFFDNDLTGKNRRIGVGAEAWTDYLKLSANNYFGITDWHQSRDFIDYNERPANGYDLRAE 314 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 YLP++PQLG MYE+Y GD+V LFGKD RQK+PHAI+A V YTP+PL Sbjct: 315 AYLPSYPQLGGKAMYEKYRGDDVALFGKDNRQKNPHAITAGVNYTPIPLV 364 >UniRef50_C4RYB3 RTX toxin and Ca2+-binding protein n=1 Tax=Yersinia bercovieri ATCC 43970 RepID=C4RYB3_YERBE Length = 945 Score = 318 bits (816), Expect = 9e-86, Method: Composition-based stats. Identities = 126/291 (43%), Positives = 167/291 (57%), Gaps = 16/291 (5%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + + + ++ A ++ P Q L+ NT + Sbjct: 37 NLTLAQLKKINQLRTFSKPFAKLQAGDELEIP-------------QAQSNLGLAPENTAL 83 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 T E+N+A A + L+S A + G+A ANQ WL +GTAR++ Sbjct: 84 TDTQTTERNLAKTATTSAQMLNSGDK--AAARQLRGLAVGNANQAANSWLNNFGTARLQA 141 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD L S +ML P YDTP+ M FTQ I R D RT +N+G G RHF + WM G Sbjct: 142 NVDDRGDLDGSQFDMLMPFYDTPSQMAFTQFGIRRIDKRTTANLGIGIRHFIDD-WMVGY 200 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N F+D D++R HTR+G GAEY RDYLKL+ANGY+R S W+ SPD Y ERPA G+D+RA Sbjct: 201 NLFLDRDITRDHTRVGAGAEYARDYLKLAANGYLRLSDWRDSPDFSSYSERPATGFDLRA 260 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 E YLP+ PQLG LMYEQY+G++VGLFGKD RQ++P AI+A + YTP+PL Sbjct: 261 EAYLPSLPQLGGKLMYEQYFGNDVGLFGKDNRQQNPAAITAGINYTPIPLV 311 >UniRef50_C4SMR2 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SMR2_YERFR Length = 906 Score = 316 bits (810), Expect = 5e-85, Method: Composition-based stats. Identities = 118/286 (41%), Positives = 165/286 (57%), Gaps = 15/286 (5%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + Y ++ ++ P + + + + ++A Sbjct: 64 ELKRVNIYRTFSKPFTALTSGDEIDIPRKASPFSIDSEKNKNADVL-------------- 109 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 +E +AS T L++ + ++ I A + N Q+WL ++GTARV++NV+ D Sbjct: 110 LENKLASHVQTGATALATSNAAKSSERMIRSAANNEFNSSAQQWLSQFGTARVQMNVNDD 169 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 F L S++++L PIYD ++LFTQ D+R NIG G R F N WM GVNTF D Sbjct: 170 FKLDGSAVDVLVPIYDNQKSILFTQLGARNKDNRNTVNIGAGVRTFQNN-WMYGVNTFFD 228 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 +D++ + R+GVGAE W DYLKLSAN YI S W +S D DY ERPANG+D+RAE YLP Sbjct: 229 NDMTGKNRRVGVGAEAWTDYLKLSANSYIGTSDWHQSRDFADYNERPANGYDVRAEAYLP 288 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + PQLG LMYE+Y G+EV LFGKD RQK+PHA++A V YTP+PL Sbjct: 289 SHPQLGGKLMYEKYRGEEVALFGKDNRQKNPHAVTAGVNYTPIPLL 334 >UniRef50_C4UZB1 Invasin (Fragment) n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UZB1_YERRO Length = 717 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. Identities = 125/295 (42%), Positives = 183/295 (62%), Gaps = 20/295 (6%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + + ++ ++ PL + + + Sbjct: 75 LTVAELKKLNQLRKFSKPFEALTTGDEIDIPLIG---------------NNFTTQSLPHS 119 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN----QEIQEWLGKYGTAR 119 + + +A A+ G L + P+S+A + A + AN QEI +WL G R Sbjct: 120 TSSPNDSLLAQSASQVGNTLQNNPNSEALNDLARSSALSAANAKAGQEISDWLNGKGKVR 179 Query: 120 VKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM 179 VKL+ D+DFS+K+S L++L P++++ ++M+F+QG++HRTDDRTQSN+G G+R+F+ + + Sbjct: 180 VKLDADRDFSVKNSQLDLLVPLWESESHMIFSQGSVHRTDDRTQSNLGLGYRYFA-DSYA 238 Query: 180 AGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWD 239 G NTF DHD SRSH+R+G+GAEY R++ KL+ NGY+R S WK SPD ++Y+ERPANGWD Sbjct: 239 LGANTFYDHDWSRSHSRLGLGAEYQRNFFKLATNGYLRLSNWKDSPDFDNYEERPANGWD 298 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 IRAEGYLP++P LGA L YEQYYGD VGLFGKD +QK+PHAI+ Y+P PL + Sbjct: 299 IRAEGYLPSYPGLGAKLAYEQYYGDNVGLFGKDNQQKNPHAITFGGNYSPFPLLK 353 >UniRef50_B7LVE8 Intimin n=2 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LVE8_ESCF3 Length = 2104 Score = 308 bits (788), Expect = 2e-82, Method: Composition-based stats. Identities = 147/287 (51%), Positives = 185/287 (64%), Gaps = 11/287 (3%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 RF S L R VA I QVLFP+ A ++ + Sbjct: 1 MISARFHSSRLTRAVASLCIVTQVLFPV---------ASTAGHRVAAPQAAPAVLSEQDA 51 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 VA A L S +S G AT+ A QEWL ++GT RV L +D+D Sbjct: 52 TAAQVAGMTTQAAGMLQSGMNSRQAAEMARGYATSTAQSAFQEWLSQWGTVRVTLGLDED 111 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 F+LK S+ ++L P +DTP N+LFTQ + HRTDDR Q N G GWRHF+ + +MAGVN F D Sbjct: 112 FTLKGSAFDLLLPWHDTPENLLFTQHSFHRTDDRNQLNTGAGWRHFAPD-YMAGVNLFFD 170 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYL 246 HDL+R H+R+G+G EYWRD LKL ANGY+R SGW+ +P+++ DY+ RPANGWD+RAEGYL Sbjct: 171 HDLTRYHSRMGLGGEYWRDNLKLGANGYLRLSGWRDAPELDYDYEARPANGWDVRAEGYL 230 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PA+PQLGA+LMYEQYYGDEV LFGKDKRQ+DPHA +A ++YTPVPL Sbjct: 231 PAYPQLGATLMYEQYYGDEVALFGKDKRQQDPHAFTAGLSYTPVPLI 277 >UniRef50_C4UN28 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UN28_YERRU Length = 842 Score = 307 bits (787), Expect = 2e-82, Method: Composition-based stats. Identities = 116/291 (39%), Positives = 169/291 (58%), Gaps = 9/291 (3%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAAR-AQHAVQPRLSMGNTTV 62 + ++ + + ++ P+ P++A + A + N V Sbjct: 88 LTLAQLEQINQFRTFPQGFEQVSSGEEIDIPV-----PIIAEQGATKVSVVTPNEVNCPV 142 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +NN + L+S + + + ++ AN+EIQ+WLG+YGTA+V+L Sbjct: 143 GIENN--PQTKEYVKRVSALLASSDPTTVATDVVRSEVSSTANKEIQKWLGQYGTAQVRL 200 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 NVD FSL++SSL+ L+ YD+ + ++FTQ I D R +N+G G R GN W+ G Sbjct: 201 NVDDKFSLRESSLDWLFSFYDSSSAIIFTQLGIRNKDHRNTANLGLGGRISMGN-WILGA 259 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 NTF D+DL+ ++R+G GAE W DYL+LSAN Y+R + W +S D D+ ERPANG+DIR Sbjct: 260 NTFYDNDLTGINSRLGFGAEAWTDYLQLSANSYMRLNNWHQSRDFIDHDERPANGFDIRT 319 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +LP PQLG LMYEQY GD V LFGKDK QK+P+A++A +TYTP PL Sbjct: 320 NAWLPVLPQLGGKLMYEQYSGDSVALFGKDKLQKNPYAVTAGITYTPFPLL 370 >UniRef50_D2TS61 Intimin-like protein n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TS61_CITRO Length = 1424 Score = 307 bits (787), Expect = 3e-82, Method: Composition-based stats. Identities = 124/296 (41%), Positives = 175/296 (59%), Gaps = 11/296 (3%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 M +++ H + + R + +A I +Q+ P ++ + + +A + S Sbjct: 12 MLFFRSTHMRSKTR-----KLLACIQIVLQLAPPSSLIYLS--SVFNANAEEITSSAEKE 64 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARV 120 + +VA A AG+ LSS SDA + + T KA QEWL ++GTARV Sbjct: 65 QGNPSDQNASSVAQTAVQAGSLLSSDNASDALGSAVVSAVTGKAASSAQEWLSQFGTARV 124 Query: 121 KLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMA 180 ++ D+ F+L DS L++L P+Y+ N+LFTQ R DDR N GFG+RHF + WM Sbjct: 125 NISTDEHFTLSDSELDLLVPLYNENENLLFTQLGGRRHDDRNIVNGGFGYRHF-NDGWMW 183 Query: 181 GVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWD 239 G N F D +S + H R+G+ E DYL +SANGY+R S W S +DY ER A+G+D Sbjct: 184 GTNVFYDRQVSGNQHQRLGLDTELRWDYLNVSANGYLRLSDWMSSSSYQDYDERVADGFD 243 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 IRA GYLPA+PQLGA+++YEQY+GD VGLFG D RQKDP+A++ + YTPVPL Sbjct: 244 IRATGYLPAYPQLGANIIYEQYFGDSVGLFGDDEDDRQKDPYAVTVGLNYTPVPLV 299 >UniRef50_C4U8H6 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4U8H6_YERAL Length = 828 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 118/286 (41%), Positives = 155/286 (54%), Gaps = 19/286 (6%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + Y A+ + ++ P + V A Sbjct: 93 ELKRINIYRTFAKPFTALTVGDEIDVPRKKSPFTVDNNVTVPA----------------- 135 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 E VAS AA LS + + N + + Q+WLG++GTAR++ N + D Sbjct: 136 -ENGVASNAAAGAALLSHGDAAKSAENMARSAVNNEISSSAQQWLGQFGTARIQFNTNDD 194 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 F S++++L P+YD ++ FTQ D R NIG G R F N WM G NTF D Sbjct: 195 FEFDSSAIDVLIPLYDNQKSLFFTQLGGRNKDSRNTINIGAGVRAFLTN-WMYGANTFFD 253 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 +D++ ++ R+G+GAE W DYLKLSANGY + W +S D DY ERPANG+D+RAE YLP Sbjct: 254 NDITGNNRRVGIGAEAWTDYLKLSANGYFGTTDWHQSRDFADYNERPANGYDLRAETYLP 313 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A+PQLG LMYEQY GDEV LFGKDKRQKDPHAI+ + YTPV L Sbjct: 314 AYPQLGGKLMYEQYNGDEVALFGKDKRQKDPHAITVGINYTPVSLV 359 >UniRef50_UPI0001C33D72 hypothetical protein CATC2_03030 n=4 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D72 Length = 1538 Score = 305 bits (781), Expect = 1e-81, Method: Composition-based stats. Identities = 124/293 (42%), Positives = 172/293 (58%), Gaps = 16/293 (5%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 + F S+ ++ + W+ I +Q+LFPL F PV AA A + T Sbjct: 1 MMKSSIKNNNSFFLSLKSKLIIWSQIVLQILFPLFTVF-PVHAAPAT------TTKETTV 53 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 + +AS S+ +D ++ TGMAT+ A +Q+WL ++GTARV+ Sbjct: 54 AMPYSQELSTLAS---------STASGTDGAKSAATGMATSAAASSVQQWLSQFGTARVQ 104 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 LNVD + + DS++++L P+YD +LFTQ + D RT N+G G R F +WM G Sbjct: 105 LNVDDNGNWDDSAVDLLAPLYDNKKAVLFTQLGLRAPDGRTTGNLGMGVRTFYLENWMFG 164 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F D D + + R+G GAE W +YLKLSAN Y+ + W S D DY E+PA+G+DIR Sbjct: 165 GNVFFDDDFTGKNRRVGFGAEAWTNYLKLSANTYVGTTNWHSSRDFTDYNEKPADGYDIR 224 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 AEGYLPA+PQLGA LMYEQYYGD+V LF D Q +P A++ ++YTPVPL Q Sbjct: 225 AEGYLPAYPQLGAKLMYEQYYGDKVALFDTDHLQSNPSAVTTGISYTPVPLVQ 277 >UniRef50_UPI0001AF5B53 putative invasin n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5B53 Length = 1149 Score = 303 bits (776), Expect = 4e-81, Method: Composition-based stats. Identities = 133/290 (45%), Positives = 177/290 (61%), Gaps = 7/290 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + + A + V PL A+ A + + + Sbjct: 81 LTLNQLRELNQLRTFAHGLNGLQPGDDVDVPLMAAKDNKNASDAAAPGR------SASAE 134 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 N + VA +A+ AG+FL+S SDA + MAT +A Q+WL +GTARV+L+ Sbjct: 135 EGNEQAQKVAGYASQAGSFLASSAKSDAAASMARNMATVEAGGAFQQWLSHFGTARVQLD 194 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 DK+FSLK+S ++L P+YD N +FTQG++HRTD RTQ+++G GWRH S + +M G N Sbjct: 195 ADKNFSLKNSQFDLLLPLYDQGDNFVFTQGSLHRTDSRTQASLGAGWRH-STSTYMLGGN 253 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F D DLSR H R G G EYWR++LKL N Y+R SGWK SPD+EDYQERPANGWD+R + Sbjct: 254 LFGDFDLSRDHARAGAGLEYWRNFLKLGVNSYLRLSGWKDSPDLEDYQERPANGWDVRGQ 313 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 ++P+ PQLG L YEQYYG EV LFG D RQ++PHAI+ + YTPVPL Sbjct: 314 AWVPSLPQLGGKLTYEQYYGKEVALFGVDSRQRNPHAITVGINYTPVPLI 363 >UniRef50_C1MAD0 EaeH n=2 Tax=Enterobacteriaceae RepID=C1MAD0_9ENTR Length = 1180 Score = 301 bits (770), Expect = 2e-80, Method: Composition-based stats. Identities = 129/277 (46%), Positives = 185/277 (66%), Gaps = 12/277 (4%) Query: 17 VLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFA 76 + + VAW+ I++Q L+P ++FTP ++ ++ + A+ + ++S A Sbjct: 15 PVNKVVAWSTIALQALYPALLSFTPTISHA--------SAVKASQAAAEQQELRGLSSLA 66 Query: 77 ANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLE 136 A AG + + +F A+A +E+ EWL KYG AR++LNVD FSLKDS+ + Sbjct: 67 AQAGRSIENG----HAGSFAANTVPAQATKEVVEWLQKYGNARIQLNVDDAFSLKDSAFD 122 Query: 137 MLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTR 196 LYP D ++LF+Q ++HRTDDRTQ+NIG G+R+F+ ++ M G N F D+DLSR H R Sbjct: 123 FLYPWIDKKQHVLFSQTSLHRTDDRTQTNIGMGYRYFTADNSMLGANLFYDYDLSRHHAR 182 Query: 197 IGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASL 256 +G G EYWRDYL+ AN Y+R S WK S D++DYQERPA+GWDI +G+LP++PQLGASL Sbjct: 183 MGAGVEYWRDYLRAGANAYLRLSKWKDSHDLDDYQERPADGWDIYTQGWLPSYPQLGASL 242 Query: 257 MYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 YE+YYG VGLFG D Q++P+A + ++YTPVPL Sbjct: 243 KYEKYYGKNVGLFGSDHLQENPYAFTGGISYTPVPLV 279 >UniRef50_B2U5L0 Intimin type beta n=1 Tax=Escherichia coli 53638 RepID=B2U5L0_ECOLX Length = 1653 Score = 300 bits (769), Expect = 3e-80, Method: Composition-based stats. Identities = 110/291 (37%), Positives = 169/291 (58%), Gaps = 16/291 (5%) Query: 3 HYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTV 62 + + + + V + ++ P+ V+F P+ T Sbjct: 79 NIELSELERINQGRVFLNGIKNIKEGDEINVPV-VSFAPIKWG------------EEETK 125 Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + + +AS A + G LS+ S + + T K N IQ W +GTA ++L Sbjct: 126 EQGSGNLQQIASIATDVGNILSNDNISKN--SALLNKITNKVNSHIQSWFENFGTAHIQL 183 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 VDK+FSLK+S LE+L+P+++ + F+QG I DD+ SNIG G+R F N WM G Sbjct: 184 QVDKNFSLKNSQLELLFPVFEDDERLFFSQGGISYIDDKFISNIGIGYRAFYDN-WMLGG 242 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRA 242 N+FID+DL + H+R+G+G EYW+D LKL AN Y+R S W+ S +I DY+ERPANG D+ Sbjct: 243 NSFIDYDLRKEHSRLGLGIEYWQDNLKLGANSYLRLSNWRNSSNIVDYEERPANGLDLNI 302 Query: 243 EGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + +LP++PQ+G + YE+YYGD+V LFG++ RQ++PH+ + ++YTP PL Sbjct: 303 KSWLPSYPQIGGDIKYEKYYGDDVALFGENHRQRNPHSTTLGISYTPFPLM 353 >UniRef50_A7MHR4 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7MHR4_ENTS8 Length = 1027 Score = 300 bits (769), Expect = 3e-80, Method: Composition-based stats. Identities = 122/292 (41%), Positives = 168/292 (57%), Gaps = 17/292 (5%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 H ++ ++ + S+L + V WA I +Q+ FPL V P A+ A + +S +T Sbjct: 1 MHEQSIMEKNTLKISLLKKIVIWAQILLQIAFPLLV--LPAHASSGPGATETDMSDASTL 58 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVK 121 + + +DA +N T +AT A ++EWL +GTA+V Sbjct: 59 SASLASSAAQ---------------NGADAMKNTATHLATTHAASTVEEWLSHFGTAQVT 103 Query: 122 LNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAG 181 L+VD + + +S+ + L P+YD ++LFTQ I D RT NIG G R F DWM G Sbjct: 104 LDVDDNGNWDNSAFDFLAPLYDNKKSVLFTQLGIRAPDGRTTGNIGLGVRTFYVRDWMFG 163 Query: 182 VNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIR 241 N F D D + + RIG GAE W +YLKLSAN YI S W S D ++Y E+PA+G+D+R Sbjct: 164 GNVFFDDDFTGENRRIGFGAEAWTNYLKLSANTYIGTSQWHNSGDFDNYNEKPADGYDVR 223 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 AEGYLP++PQLGA LMYEQYYGD V LF KD Q +P A++ + YTPVPL Sbjct: 224 AEGYLPSFPQLGAKLMYEQYYGDNVALFDKDHLQSNPSAVTVGLNYTPVPLI 275 >UniRef50_B1JPU7 Ig domain protein group 1 domain protein n=24 Tax=Yersinia RepID=B1JPU7_YERPY Length = 1075 Score = 296 bits (758), Expect = 6e-79, Method: Composition-based stats. Identities = 124/290 (42%), Positives = 179/290 (61%), Gaps = 7/290 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 K +K + + +++ V WANI +Q +FPL++ FTP + A + + Sbjct: 12 AKQLNKNKQLNKTRISKSVVWANIVIQAIFPLSIAFTPAVMAAET------VGASDEKPR 65 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 + + E++ A+ A + L++ + + G A N+ +Q+W ++G+A+V+LN Sbjct: 66 SASQAEQSTANAATRLASILTNDDSAKQASSIARGTAANAGNEALQKWFNQFGSAKVQLN 125 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 +D+ SLK S L++L P+ D+P + FTQ DDR N+G G RHF M G N Sbjct: 126 LDEKLSLKGSQLDVLLPLTDSPDLLTFTQLGGRYIDDRVTLNVGLGQRHFFAQQ-MLGYN 184 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F+DHD S SHTRIGVGAEY RD++ L+ANGY SGWK SPD++ Y E+ ANG+D+R+E Sbjct: 185 LFVDHDASYSHTRIGVGAEYGRDFINLAANGYFGVSGWKNSPDLDKYDEKVANGFDLRSE 244 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 YLP PQLG L+YEQY+GDEVGLFG D RQK+P A++ V YTP+PL Sbjct: 245 AYLPTLPQLGGKLIYEQYFGDEVGLFGVDNRQKNPLAVTLGVNYTPIPLF 294 >UniRef50_C5B8S8 Ig domain protein group 1 domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B8S8_EDWI9 Length = 1764 Score = 293 bits (750), Expect = 5e-78, Method: Composition-based stats. Identities = 109/290 (37%), Positives = 152/290 (52%), Gaps = 17/290 (5%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 ++ + + ++ P T Sbjct: 84 IPLSKLYKLNQFRSFHKSFYDLSGGDEIDIPA---------------SNNYSFENRPLDT 128 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLN 123 +N E A+ A S S + MA++ AN IQ+WL ++GT +L+ Sbjct: 129 KVDNNENYSANKTKAAVNV-SESNKSPEALGVASSMASSAANNAIQKWLSQWGTVESQLS 187 Query: 124 VDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVN 183 D SLK+SSL+ L PIYDT N F Q D R N+G+G RH N WM G+N Sbjct: 188 FDSKASLKNSSLDWLIPIYDTDENTWFIQAGGRNKDSRNTVNLGWGVRHVY-NGWMYGLN 246 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 F D+D++ ++ R+G+G E DYL +++N Y+R + W +S D DY ERPANG+D+R Sbjct: 247 NFFDYDITGNNRRLGLGVEARTDYLSIASNAYLRMNNWHQSRDFYDYDERPANGFDMRVN 306 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G+LPA+PQ+G L+YEQYYGDEVGLFGKD RQKDP AI+A V++TP PL Sbjct: 307 GWLPAYPQIGGKLVYEQYYGDEVGLFGKDDRQKDPKAITAGVSWTPFPLL 356 >UniRef50_A7ZRD2 Bacterial Ig family protein n=1 Tax=Escherichia coli E24377A RepID=A7ZRD2_ECO24 Length = 1084 Score = 290 bits (742), Expect = 4e-77, Method: Composition-based stats. Identities = 127/286 (44%), Positives = 168/286 (58%), Gaps = 22/286 (7%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + S + R + +Q F + F V A Sbjct: 1 MVKTNPSSSQVRRVAVYGLAGLQFFFQVTPAFAGVFQA---------------------- 38 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 E++VA A AG L DA R +T A+ +A + +WL ++GTA+ +L+V D Sbjct: 39 DEQSVAQTAMEAGRVLQGSNSGDAARQMLTSQASGQAADAVTQWLNQFGTAKTQLSVVSD 98 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 FSLK SSL++L P Y+TP N+LFTQ + D R +N G G R+F+ N WM G N F D Sbjct: 99 FSLKGSSLDVLLPFYNTPKNVLFTQLGMRDNDGRFTTNAGLGHRYFTDNGWMLGYNVFYD 158 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 D ++ R G+G E WRDYLKLSANGY R S W++SP + DY ERPA+GWDIRAEG+LP Sbjct: 159 VDWRNTNRRYGIGVEAWRDYLKLSANGYKRLSDWRQSPTVTDYDERPADGWDIRAEGWLP 218 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A+PQLG L+YEQYYG+EV LFG+ +RQK+PHAI+A VT+TP L Sbjct: 219 AYPQLGGKLVYEQYYGNEVALFGESERQKNPHAITAGVTWTPFSLL 264 >UniRef50_C4SDT7 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SDT7_YERMO Length = 1424 Score = 290 bits (742), Expect = 5e-77, Method: Composition-based stats. Identities = 108/296 (36%), Positives = 148/296 (50%), Gaps = 7/296 (2%) Query: 4 YKTGHKQPRFRYSVLARCVAWANIS-----VQVLFPLAVTFTPVMAARAQHAVQPRLSMG 58 K R + ++ + +A A Sbjct: 12 AKKTALFKRLHTLTATDTLESVASGYGLSVDELWALNINLYNNRVAFDAIKYGAVVYVPN 71 Query: 59 NTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTA 118 VAS + G+ LSS+ +A G+ + + ++EWLG G A Sbjct: 72 REEEQKATQQASLVASHLSQIGSTLSSESRVEAFSRLAKGVLLSSTAKSVEEWLGHIGKA 131 Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 +VKL VD S L + P+Y+ P + F+Q R D R NIG G RH+ + W Sbjct: 132 QVKLQVDDKNDFSGSELHLFVPLYNQPERLAFSQFGFRRIDQRNIMNIGLGQRHYLSD-W 190 Query: 179 MAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 M G N F+D +S + H R+G+G E RDY+KLSAN Y R GWK S +EDY ER A+G Sbjct: 191 MLGYNVFLDQQISGNAHRRLGLGGELARDYVKLSANSYYRLGGWKNSTRLEDYDERAASG 250 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 +DIR E YLP +PQLG LMYEQY+G+EV LFG ++RQK+P A++A V+YTP PL Sbjct: 251 YDIRTEAYLPYYPQLGGKLMYEQYFGNEVALFGLNERQKNPSALTASVSYTPFPLV 306 >UniRef50_UPI0001C33E08 hypothetical protein CATC2_09202 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E08 Length = 1492 Score = 288 bits (738), Expect = 1e-76, Method: Composition-based stats. Identities = 111/265 (41%), Positives = 152/265 (57%), Gaps = 17/265 (6%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 +Q+LFP + + + V V S A GT ++ Sbjct: 1 MQLLFPFVTS---------------AYTYAASQPPVAVPVPTQVTSLLAAGGTE--TENG 43 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 S+ ++ T MAT A ++EWL +GTA V LN D++ + +SS++ L P+YD ++ Sbjct: 44 SNGLKSTATSMATGAAANSVEEWLSHFGTAEVNLNTDENGNWDNSSIDFLAPLYDNKKSV 103 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 LFTQ + D RT NIG G R F+ +WM G N F D D + + R+G+GAE W DYL Sbjct: 104 LFTQLGLRAPDGRTTGNIGMGVRSFNTENWMFGGNVFFDDDFTGKNRRVGIGAEAWTDYL 163 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 KL+AN YI + W S D DY E+PA+G+DIRAEGYLPA+PQLGA +MYEQYYG+ V L Sbjct: 164 KLAANSYIGTTEWHSSRDFADYNEKPADGFDIRAEGYLPAYPQLGAKVMYEQYYGENVAL 223 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 F KD Q DP A++ + YTP+ L Sbjct: 224 FDKDHLQNDPSAVTMGLNYTPISLV 248 >UniRef50_C4UDV3 Putative uncharacterized protein n=1 Tax=Yersinia aldovae ATCC 35236 RepID=C4UDV3_YERAL Length = 2487 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 110/295 (37%), Positives = 144/295 (48%), Gaps = 7/295 (2%) Query: 5 KTGHKQPRFRYSVLARCVAWANI-----SVQVLFPLAVTFTPVMAARAQHAVQPRLSMGN 59 K R + ++ + A A Sbjct: 52 KKTVLFKRLYTLTPTDTLESVASNYGLSVDELWALNINLYNNRSAFDAVKYGAVVYIPNQ 111 Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTAR 119 VAS + G LSS+ A GM + + ++EWLG G A+ Sbjct: 112 EEEQQATQQASMVASHLSQVGNSLSSEDRVGAFSRLAKGMLLSSTAKTVEEWLGHIGQAQ 171 Query: 120 VKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM 179 VKL D S +++ P+YD P + F+Q R D R NIG G RH+ + WM Sbjct: 172 VKLQADDKNDFSGSEVDLFIPLYDQPEKLAFSQFGFRRIDQRNIMNIGLGQRHYVSD-WM 230 Query: 180 AGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGW 238 G N F D +S + H R+G G E RDY+KLSAN Y R GWK S +EDY ER ANG+ Sbjct: 231 FGYNIFFDQQISGNAHRRVGFGGELARDYVKLSANSYHRLGGWKNSTRLEDYDERAANGY 290 Query: 239 DIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 DIR E YLP +PQLG LMYEQY+GDEV LFG ++RQK+P A++A V+YTP+PL Sbjct: 291 DIRTEAYLPHYPQLGGKLMYEQYFGDEVALFGINERQKNPSALTAGVSYTPIPLV 345 >UniRef50_C4T5G2 Soluble lytic murein transglycosylase and regulatory protein n=4 Tax=Yersinia RepID=C4T5G2_YERIN Length = 753 Score = 286 bits (733), Expect = 4e-76, Method: Composition-based stats. Identities = 122/299 (40%), Positives = 166/299 (55%), Gaps = 13/299 (4%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMA--ARAQHAVQPRLSMGNTT 61 + S+ R N +L P P+ +A + P L MGN Sbjct: 56 LDLRTLRKLNNGSLDKR--DELNAGESLLLPANSPLFPLDPLAGKAIASNLPELGMGNDP 113 Query: 62 VTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKA--------NQEIQEWLG 113 V ++ E+ A+ A G + SD +N A +A Q+ QE LG Sbjct: 114 VPLVSSGEQKTAAAAHAVGAQNWNNMTSDQMKNQAESWAKGQAKAQVVDPLRQQAQELLG 173 Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF 173 K+G A+V L VD + SL S+ + P Y+ + F+Q +HR D+R N+G G R Sbjct: 174 KFGKAQVNLAVDDNGSLSKSAFSLFSPWYENDAMVAFSQVGVHRQDNRMIGNLGAGVRFD 233 Query: 174 SGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQER 233 G+ W+ G NTF+D D+SR+H+R+G+G E+W D LKL++N Y SGWK S D +DY ER Sbjct: 234 QGD-WLFGANTFLDQDISRNHSRLGLGLEWWADNLKLASNYYHPLSGWKDSKDFDDYLER 292 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 PA G+D+ A+GYLPA+ QLGAS +YEQYYGDEV LFGKD QKDPHA++ V YTP PL Sbjct: 293 PARGFDVHAQGYLPAYQQLGASAVYEQYYGDEVALFGKDNLQKDPHAVTVGVDYTPFPL 351 >UniRef50_UPI0001C33D83 hypothetical protein CATC2_09062 n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33D83 Length = 1063 Score = 284 bits (726), Expect = 3e-75, Method: Composition-based stats. Identities = 113/274 (41%), Positives = 155/274 (56%), Gaps = 12/274 (4%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 + +A I +Q P+A++ + + A LS + DN A A Sbjct: 2 KSMAIMQILLQTALPVALSMSATVRAA-------ELSQNTHSADKDNINSPYSAQMTQAA 54 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 S MA+ A +++WL ++GTARV+LNVD + DS+++ L Sbjct: 55 TALSSGNAAGAGA-----SMASGYAGDSVEKWLSQFGTARVQLNVDDKGNWDDSAIDFLA 109 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ MLFTQ + DDR N G G R F ++WM G N F D D + + R+G Sbjct: 110 PLYDSQKAMLFTQLGLRAPDDRVTGNFGLGVRTFYTDNWMFGGNVFFDDDFTGDNRRVGF 169 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W + LKLSAN Y+ + W S D +DY E+PA+G+D+RAEGYLPA+PQLGA LMYE Sbjct: 170 GAEAWTNNLKLSANTYLGTTNWHSSRDFDDYYEKPADGFDVRAEGYLPAYPQLGAKLMYE 229 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 QYYGD+V LF KD Q +P A++ V+YTPVPL Sbjct: 230 QYYGDKVALFDKDDLQSNPSAVTVGVSYTPVPLI 263 >UniRef50_P11922 Invasin n=27 Tax=Yersinia RepID=INVA_YERPS Length = 985 Score = 282 bits (722), Expect = 9e-75, Method: Composition-based stats. Identities = 113/264 (42%), Positives = 153/264 (57%), Gaps = 10/264 (3%) Query: 36 AVTFTPVMAARAQHAVQPRLSMGNTTVTA------DNNVEKNVASFAANAGTFLSSQPDS 89 + F + + S +T A + E + + G L+ S Sbjct: 67 SSAFENLHPNNEMESSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLA---AS 123 Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNML 149 + M NQEI++WL ++GTA+V LN DK+FSLK+SSL+ L P YD+ + + Sbjct: 124 GRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDKNFSLKESSLDWLAPWYDSASFLF 183 Query: 150 FTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 F+Q I D R N+G G R N W+ G+NTF D+DL+ + RIG+GAE W DYL+ Sbjct: 184 FSQLGIRNKDSRNTLNLGVGIRTL-ENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDYLQ 242 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 L+ANGY R +GW S D DY+ERPA G D+RA YLPA PQLG LMYEQY G+ V LF Sbjct: 243 LAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALF 302 Query: 270 GKDKRQKDPHAISAEVTYTPVPLT 293 GKD Q++P+A++A + YTPVPL Sbjct: 303 GKDNLQRNPYAVTAGINYTPVPLL 326 >UniRef50_P19196 Invasin n=3 Tax=Yersinia enterocolitica RepID=INVA_YEREN Length = 835 Score = 280 bits (717), Expect = 3e-74, Method: Composition-based stats. Identities = 108/285 (37%), Positives = 149/285 (52%), Gaps = 16/285 (5%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 F + + ++ +S+ ++F + A++ N Sbjct: 1 MYSFFNTLTVTKIISRLILSIGLIFGIFTYGFSQQHYFNSEALEN--------PAEHNEA 52 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 + S + S N M ANQE++ WL ++GT +V +N DK F Sbjct: 53 FNKIISTGTSLA-------VSGNASNITRSMVNDAANQEVKHWLNRFGTTQVNVNFDKKF 105 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDH 188 SLK+SSL+ L P YD+ + + F+Q I D R NIG G R F WM G NT D+ Sbjct: 106 SLKESSLDWLLPWYDSASYVFFSQLGIRNKDSRNTLNIGAGVRTFQQ-SWMYGFNTSYDN 164 Query: 189 DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPA 248 D++ + RIGVGAE W DYL+LSANGY R +GW +S D DY ERPA+G DI + YLPA Sbjct: 165 DMTGHNHRIGVGAEAWTDYLQLSANGYFRLNGWHQSRDFADYNERPASGGDIHVKAYLPA 224 Query: 249 WPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PQLG L YEQY G+ V LFGKD Q +P+A++ + YTP+P Sbjct: 225 LPQLGGKLKYEQYRGERVALFGKDNLQSNPYAVTTGLIYTPIPFI 269 >UniRef50_C7BN31 Putative adhesin n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BN31_PHOAA Length = 1815 Score = 278 bits (710), Expect = 2e-73, Method: Composition-based stats. Identities = 94/249 (37%), Positives = 137/249 (55%), Gaps = 3/249 (1%) Query: 46 RAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN 105 + N + S L+ P +++I ++ Sbjct: 86 PKIPQLYENSHKDENHKEDSQNTPPLILSRGPEFLGLLNKDPK-KLAQDYIVNKLNSQIT 144 Query: 106 QEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRTQS 164 Q+WL ++GTA++ LNVD L +SS+++L P YD + ++++Q D R Sbjct: 145 SNTQKWLSQFGTAKINLNVDHRGRLDESSVDLLVPFYDDKDHWLIYSQYGYRHKDSRDTV 204 Query: 165 NIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 N+G G R F + WM G NTF D+DL+ +++R +G E W +YLK+SAN Y R S W S Sbjct: 205 NLGIGTRLFIND-WMYGANTFYDNDLTGNNSRFSLGGELWTNYLKMSANAYFRLSDWHNS 263 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 D+ +Y ERPANG+D+ A+ YLPA P LGA + YEQY+GD V LFG + RQKDP+A + Sbjct: 264 RDLTNYYERPANGYDLIADMYLPAMPSLGAKIKYEQYFGDNVALFGTNNRQKDPYAATIG 323 Query: 285 VTYTPVPLT 293 V YTP+PL Sbjct: 324 VNYTPIPLI 332 >UniRef50_C2LNI4 Intimin/invasin n=7 Tax=Proteus RepID=C2LNI4_PROMI Length = 2323 Score = 277 bits (709), Expect = 3e-73, Method: Composition-based stats. Identities = 110/236 (46%), Positives = 152/236 (64%), Gaps = 4/236 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 N + + +A + L+ D + ++K+NQ+I++WL ++G Sbjct: 84 NNQDEAIPSTEGEELAKIIVDNSFLLNKDIDV---TQYAISQISSKSNQKIEQWLNQFGH 140 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND 177 ARV L+ DK+ +LK+SS E+L P+Y+ ++F Q HR D R+Q N G G+R+F+ Sbjct: 141 ARVSLSADKNLTLKNSSAELLIPLYEQKEKLIFAQTNYHRKDLRSQFNYGIGYRYFT-EK 199 Query: 178 WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANG 237 +M G+N F DHDL+ H R+G+GAE WRDY KLS+N Y R S W+ S +I DY ERPANG Sbjct: 200 FMVGINGFYDHDLTHHHNRLGIGAEIWRDYFKLSSNHYHRLSSWRASNNILDYSERPANG 259 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 WDIR EGY PA+PQLG L++EQYYG EVGLFGKDKR K+PH + + YTP+PL Sbjct: 260 WDIRTEGYFPAYPQLGTKLIFEQYYGKEVGLFGKDKRDKNPHTYTLGINYTPIPLV 315 >UniRef50_A9MKL6 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MKL6_SALAR Length = 1812 Score = 276 bits (706), Expect = 6e-73, Method: Composition-based stats. Identities = 126/289 (43%), Positives = 174/289 (60%), Gaps = 12/289 (4%) Query: 16 SVLARCVAWANISVQVLFPLAVTF---TPVMAARAQHAVQPRLSMG----NTTVTADNNV 68 + R A+ + +QV+F +F P AA Q + + +T ++ Sbjct: 2 RIYLRLTAYFQLVIQVIFLFVNSFIFSFPAHAATNPDTNQKKPTTEITAQSTAKKEEDEA 61 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 KN+A+ ++ G+ LS +DA N +A +IQ+WL ++GTA+V L +DKD Sbjct: 62 GKNLAAILSSTGSMLSQDNKTDALINSAINNGSAYVTGQIQQWLQQFGTAKVNLGLDKDL 121 Query: 129 SLKD-SSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 SL + S +L D N+LFTQ R DDR N+G G+R+F+ + WM G+NTF D Sbjct: 122 SLDNASLDLLLPLYDDKKQNLLFTQWGGRRDDDRNIINVGMGYRYFA-DRWMWGINTFYD 180 Query: 188 HDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 +S + H R+G+G E +Y KLSANGY R SGWK S + EDYQER ANG+DIRAEGYL Sbjct: 181 RQISDNAHERLGIGGELGWNYFKLSANGYKRLSGWKDSSEYEDYQERVANGYDIRAEGYL 240 Query: 247 PAWPQLGASLMYEQYYGDEVGLFGK--DKRQKDPHAISAEVTYTPVPLT 293 PAWPQLGA L++EQYYGD+V LF D RQ++P+A++A V YTP PL Sbjct: 241 PAWPQLGAQLVWEQYYGDDVALFDDSEDDRQRNPYAVTAGVNYTPFPLV 289 >UniRef50_C4S9J0 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J0_YERMO Length = 686 Score = 276 bits (706), Expect = 6e-73, Method: Composition-based stats. Identities = 100/266 (37%), Positives = 138/266 (51%), Gaps = 1/266 (0%) Query: 29 VQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD 88 ++ + P + +D + L+ Sbjct: 1 MENEIGGTLINKPGHDMPKLPDMAIMAETSGAKPISDQQFADWGKNLGGQDWNTLNRDKA 60 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 T + + Q+ Q+ LG++G A+V L++D +L S+ + P YD+ + Sbjct: 61 QSKTTQWAKEKIISPLQQQAQDLLGRFGQAQVNLSMDNKGNLNRSTASLFTPWYDSEQYL 120 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 LF+Q IH D+R N G G R + + + G N FIDHD SR H R G+GAE DY Sbjct: 121 LFSQINIHHQDNRKIGNFGLGHRIELPSLNGLLGYNVFIDHDFSRGHNRAGIGAEARADY 180 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 LK SAN Y S WK SPD +DY ERPA G+D+R++GYLPA+PQLG S +YE Y+GDEV Sbjct: 181 LKFSANYYHPLSHWKDSPDFDDYLERPAKGYDLRSQGYLPAYPQLGVSAVYEHYFGDEVA 240 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LFGK RQKDP A++ + YTPVPL Sbjct: 241 LFGKSHRQKDPRALTLGIDYTPVPLV 266 >UniRef50_D0ZDP6 Putative adhesin n=1 Tax=Edwardsiella tarda EIB202 RepID=D0ZDP6_EDWTE Length = 839 Score = 275 bits (704), Expect = 9e-73, Method: Composition-based stats. Identities = 115/274 (41%), Positives = 157/274 (57%), Gaps = 6/274 (2%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 V AN V P+ + RA + G+ T D ++ Sbjct: 97 HNVEDANAGELVDSPINDAIAININ-RASQNNKNNAGAGSLTKEQDPMDSLSI----RGV 151 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 G+ L++ DA + MAT+ N +I +WL +YGTAR++LN D+DFSL +S+L+ L Sbjct: 152 GSALAASGRVDALHHMARTMATSAVNDQIGQWLNRYGTARIQLNTDRDFSLAESALDWLL 211 Query: 140 PIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGV 199 P+YD+ T LFTQ D R +NIG G R F ++WM G N F D+D + + R+G+ Sbjct: 212 PLYDSQTLTLFTQQGFRNKDRRNIANIGIGTR-FIHHEWMMGGNAFYDNDFTGDNKRVGL 270 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE W D +LSANGY R + W +S D DY ERPANG D+RA G+LPA P LG SL+YE Sbjct: 271 GAELWTDSFQLSANGYFRLTAWHQSRDRSDYNERPANGVDLRANGWLPAQPHLGGSLIYE 330 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 Y+GD V LFGKD Q++P+AI+ +YTP L Sbjct: 331 HYFGDNVALFGKDHLQRNPYAITLGGSYTPFSLL 364 >UniRef50_Q7X2C2 Aec1 n=50 Tax=Enterobacteriaceae RepID=Q7X2C2_ECOLX Length = 734 Score = 275 bits (704), Expect = 1e-72, Method: Composition-based stats. Identities = 100/265 (37%), Positives = 140/265 (52%), Gaps = 9/265 (3%) Query: 37 VTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASF--------AANAGTFLSSQPD 88 F Q+ P L N K++ A L+ + Sbjct: 16 AAFAAPEINVKQNESLPDLGSQAAQQDEQTNKGKSLKERGADYVINSATQGFENLTPEAL 75 Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM 148 R+++ T+ A I++ L YG R L++ + L SS++ P YD T + Sbjct: 76 KSQARSYLQSQITSTAQSYIEDTLSPYGKVRSNLSIGQGGDLDGSSIDYFVPWYDNQTTV 135 Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYL 208 F+Q + R +DRT NIG G R ++ + ++ G N F D+D +R H R+G+GAE W DYL Sbjct: 136 YFSQFSAQRKEDRTIGNIGLGVR-YNFDKYLLGGNIFYDYDFTRGHRRLGLGAEAWTDYL 194 Query: 209 KLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 K S N Y S WK S D + Y+ERPA GWDIRAE +LPA+PQLG +++EQYYG+EV L Sbjct: 195 KFSGNYYHPLSDWKDSEDFDFYEERPARGWDIRAEAWLPAYPQLGGKIVFEQYYGNEVAL 254 Query: 269 FGKDKRQKDPHAISAEVTYTPVPLT 293 FG D +KDP A++ V Y PVPL Sbjct: 255 FGTDSLEKDPFAVTLGVKYQPVPLI 279 >UniRef50_Q7N599 Similarities with putative adhesin n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N599_PHOLL Length = 1695 Score = 273 bits (697), Expect = 7e-72, Method: Composition-based stats. Identities = 95/237 (40%), Positives = 143/237 (60%), Gaps = 3/237 (1%) Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGT 117 ++ ++ + + S L+S P +++I ++ Q+WL ++GT Sbjct: 91 EDSHKDGNHPLPPLILSHGTKILGLLNSDPK-KLAQDYIVNKLNSQITSNTQKWLSQFGT 149 Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 A++ LNVD L +SS+++L P YD + ++++Q D R N+G G R F N Sbjct: 150 AKINLNVDHRGRLDESSVDLLVPFYDDKDHWLVYSQYGYRHKDSRDTVNLGIGTRLFINN 209 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 WM G NTF D+DL+ +++R +G E W +YLK+SAN Y R S W + D+ +Y ERPAN Sbjct: 210 -WMYGANTFYDNDLTGNNSRFSLGGELWTNYLKMSANAYFRLSDWHNARDLVNYYERPAN 268 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G+D+ A+ YLP+ P LGA + YEQY+GD V LFGK+KRQKDP+A + V YTP+PL Sbjct: 269 GYDLIADMYLPSMPSLGAKIKYEQYFGDNVALFGKNKRQKDPYAATIGVNYTPIPLI 325 >UniRef50_P19809 Intimin n=231 Tax=Enterobacteriaceae RepID=EAE_ECO27 Length = 939 Score = 273 bits (697), Expect = 7e-72, Method: Composition-based stats. Identities = 117/288 (40%), Positives = 157/288 (54%), Gaps = 19/288 (6%) Query: 22 VAWANISVQVLFPLA-----------VTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK 70 + A Q++ PL + P++AA +L+ + VT N + Sbjct: 101 MMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD 160 Query: 71 NV----ASFAANAGTFLSSQP-DSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVD 125 A AA+ G+ L S+ + D ++ G+A +A+ ++Q WL YGTA V L Sbjct: 161 KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSG 220 Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTF 185 +F SSL+ L P YD+ + F Q D R +N+G G R F + M G N F Sbjct: 221 NNFDG--SSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPEN-MLGYNVF 277 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 ID D S +TR+G+G EYWRDY K S NGY R SGW +S + +DY ERPANG+DIR GY Sbjct: 278 IDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGY 337 Query: 246 LPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LP++P LGA LMYEQYYGD V LF DK Q +P A + V YTP+PL Sbjct: 338 LPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLV 385 >UniRef50_B6XDB5 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XDB5_9ENTR Length = 2521 Score = 270 bits (689), Expect = 5e-71, Method: Composition-based stats. Identities = 109/264 (41%), Positives = 159/264 (60%), Gaps = 7/264 (2%) Query: 36 AVTFTPVMAARAQHAVQPRLSMGNT-TVTADNNVEKNVASFAANAGTFLSSQ----PDSD 90 + PV+ A A+ L + + +NN E A + GTFLS + S Sbjct: 24 SSAIMPVIPAYAKMLDNKELPSLGSDQIIDENNTEHLAAEYTKTVGTFLSQKKTMKDLSQ 83 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 +++ +++A +EI+ WL K G ++ ++ DK FS+K+S + L P YD +LF Sbjct: 84 IAQDYARNKVSSEATKEIEHWLSKAGNVKLNIDFDKKFSIKNSQFDWLIPWYDQEDILLF 143 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 TQ +HR D+R +N G G R+F + G+N FIDHDLS +HTR+G+G EYW+DYLKL Sbjct: 144 TQHTLHRYDERFHTNNGIGLRYFHEKSTI-GMNAFIDHDLSHAHTRVGLGVEYWQDYLKL 202 Query: 211 SANGYIRASGWKKSPDIE-DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 +AN Y + WK + ++ D+ +PA+GWDI+ EG+LP +P LG +L YEQYYGD V LF Sbjct: 203 NANSYFGLTSWKSASELNHDFNAKPAHGWDIQVEGWLPNYPHLGGNLRYEQYYGDSVALF 262 Query: 270 GKDKRQKDPHAISAEVTYTPVPLT 293 GK KRQK+P+A + +TP PL Sbjct: 263 GKTKRQKNPNAATIGANWTPFPLF 286 >UniRef50_UPI0001C34895 putative invasin n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI0001C34895 Length = 722 Score = 265 bits (678), Expect = 1e-69, Method: Composition-based stats. Identities = 115/303 (37%), Positives = 168/303 (55%), Gaps = 23/303 (7%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTT 61 + + +P+ S+L W+ + P++ + AQ L Sbjct: 1 MNPPSSKLKPKLPNSLLLSTAIWSTAIL-----------PMVPSYAQIVHLDDLPTLGGQ 49 Query: 62 VTA------DNNVEKNVASFAANAGTFLSSQ----PDSDATRNFITGMATAKANQEIQEW 111 +++ E+ +A + NA F S + +D +++ A A EI W Sbjct: 50 AIQFEGTQPEDSTERFLAEYGQNAANFASEEKNTKNLADMAQDYARHKAANMATDEITHW 109 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L K G AR+ +N+DK S+K S L+ L P Y+ +LF+Q +IHRTD R Q+N G G R Sbjct: 110 LSKAGNARLNINLDKKLSIKTSQLDWLVPWYEQQDLLLFSQHSIHRTDGRLQTNNGIGLR 169 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDY 230 HF N M GVN F DHDLS H+R+G G EY +DY+++SAN Y+ S W+ + ++ +DY Sbjct: 170 HFQQNS-MIGVNAFFDHDLSHYHSRLGFGVEYAQDYVRMSANSYLGLSTWRSASELADDY 228 Query: 231 QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPV 290 RPANGWDI+ EG+LP + LGA+L EQYYGD+V LFGK++RQKDP A + V ++P Sbjct: 229 NARPANGWDIQLEGWLPTYANLGANLKLEQYYGDDVALFGKNERQKDPMAATVGVNWSPF 288 Query: 291 PLT 293 PL Sbjct: 289 PLL 291 >UniRef50_D2U3C0 Intimin/invasin n=2 Tax=Arsenophonus nasoniae RepID=D2U3C0_9ENTR Length = 1459 Score = 261 bits (668), Expect = 2e-68, Method: Composition-based stats. Identities = 89/242 (36%), Positives = 137/242 (56%), Gaps = 11/242 (4%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG 116 + + EK A L++ ++A N+ NQ+I +WL +YG Sbjct: 98 KETSQAKQVESAEKQFVQGATQIAQGLANNNATEAAINYARNRGEGLLNQKISDWLNQYG 157 Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN 176 ARV+++ +K ++L P+ D P ++LF+Q I + R+ +N+G G+R + N Sbjct: 158 KARVQISSNKTGD-----ADLLLPLIDKPNSLLFSQIGIRANEQRSTTNLGLGYRQYQQN 212 Query: 177 DWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS--PDIEDYQERP 234 WM G+N+F D+D+S + R G+G E W YLKL+ NGY R + W +S ++ DY ERP Sbjct: 213 -WMWGINSFYDYDISGGNARFGLGGELWAYYLKLAVNGYFRLTDWHQSFLHEMRDYDERP 271 Query: 235 ANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK---DKRQKDPHAISAEVTYTPVP 291 ANG+D+RAEGYLP++P LGA YEQY+GD V L + +P A++ ++YTP P Sbjct: 272 ANGFDLRAEGYLPSYPHLGAYAKYEQYFGDGVSLSHNPTAKDLKDNPSAVTFGLSYTPFP 331 Query: 292 LT 293 L Sbjct: 332 LL 333 >UniRef50_C8QCN4 Ig domain protein group 1 domain protein n=1 Tax=Pantoea sp. At-9b RepID=C8QCN4_9ENTR Length = 845 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. Identities = 91/261 (34%), Positives = 136/261 (52%), Gaps = 16/261 (6%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRN 94 T + A A L+ V V+ V + R Sbjct: 84 ATAGQTIWIPAAKPAATTLPLAPATVQVAKPGKVDGKV-------------DDKTTNVRQ 130 Query: 95 FITGMATAKANQEIQEWLGKYG-TARVKLNVDKDFSLKDSSLEMLYPIYDT-PTNMLFTQ 152 F A+++ + WL +G ++RV ++ ++F+ + + ++L P++++ M+F+Q Sbjct: 131 FGQDQLNTLASEQAETWLNGFGGSSRVAISSTQNFAKYNYAGDVLLPLWNSREDFMIFSQ 190 Query: 153 GAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSA 212 + DDRT NIG G R+F G WM G N F D+D S S+ RIG+GAE D L+L+A Sbjct: 191 LGVRHADDRTTGNIGLGARYF-GEGWMLGNNVFFDNDFSGSNRRIGLGAELGTDALRLAA 249 Query: 213 NGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKD 272 NGY + +GW S I D+ ERPANGWDI +LP +PQLG + YEQYYGD V L + Sbjct: 250 NGYFKLTGWHDSKFIADHDERPANGWDIELSSWLPVYPQLGGKVKYEQYYGDNVALISRG 309 Query: 273 KRQKDPHAISAEVTYTPVPLT 293 + Q +P A + V +TP+PL Sbjct: 310 RLQHNPSAATLGVNWTPIPLV 330 >UniRef50_C4SU11 Leucyl aminopeptidase (Fragment) n=3 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SU11_YERFR Length = 1395 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 108/274 (39%), Positives = 149/274 (54%), Gaps = 18/274 (6%) Query: 32 LFPLAVTFTPVMAARAQHAVQPRLSMGNT------TVTADNNVEKNVASFAANAGTFLSS 85 PL + TP+ A L + E NVAS A + + Sbjct: 88 EAPLNGSTTPLFAPEETSKSITELPDLGSIQNDIDVNNKLPVTEDNVASAATQLWGIMGN 147 Query: 86 QPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP 145 S A + +TG+A A+Q +WLG+YG ARV+LN S + ++L P+ +T Sbjct: 148 DNSSRAAESAVTGVAAGLASQAAADWLGQYGNARVQLN-----SNSIGNADVLIPLTETQ 202 Query: 146 TNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 N+LF Q + +RT +N+G G R F+ + WM GVNTF D+DL+ ++R+GVG E W Sbjct: 203 NNLLFGQLGVRYNGERTTNNVGLGVRSFT-DSWMFGVNTFYDYDLTGKNSRLGVGGEAWT 261 Query: 206 DYLKLSANGYIRASGWKKSP--DIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG 263 D LK SANGY R + W +S D+EDY ERPANG+D+RAE YLP++PQLG LMYE+Y+G Sbjct: 262 DNLKFSANGYFRLTDWHQSVLADMEDYNERPANGFDVRAEAYLPSYPQLGGRLMYEKYFG 321 Query: 264 DEVGLFGK----DKRQKDPHAISAEVTYTPVPLT 293 V L D P A + + YTP+PL Sbjct: 322 KGVALNSGSTSPDDLGDSPSAFTVGLNYTPIPLF 355 >UniRef50_A8GFW2 Putative invasin n=2 Tax=Serratia RepID=A8GFW2_SERP5 Length = 497 Score = 251 bits (642), Expect = 1e-65, Method: Composition-based stats. Identities = 91/300 (30%), Positives = 138/300 (46%), Gaps = 13/300 (4%) Query: 1 MSHYKTGHKQPRFRYSVL---ARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSM 57 + H + ++ A +AW ++ P P + Q L Sbjct: 4 VIHKARYRLKKVVPFATGCLPAMGLAWLCGAL----PAYAESPPAPDSVVQQP-ANDLPE 58 Query: 58 GNTTVTADNNVEKNVASFAANAGTFLSSQPDSDA----TRNFITGMATAKANQEIQEWLG 113 + D EK A+ A G + S ++ G A++ Q+ QE L Sbjct: 59 LGGNASNDAEREKEWATMAKQLGERNLNNVSSQQVRTRAESYAVGQASSVLQQQAQELLS 118 Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF 173 G A++ L + SS ++ P+YD + ++Q + + + + N G G R Sbjct: 119 PLGNAKLSLVMSDQGDFSGSSGQLFSPLYDVNGLLTYSQLGLLQQTEGSLGNFGLGQRWV 178 Query: 174 SGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQER 233 +G+ W+ G NT +D D R H R +GAE W D+L+ SAN Y S + D + R Sbjct: 179 AGD-WLLGYNTVLDSDFERHHNRASLGAEAWGDFLRFSANYYYPLSALAQQRDNAQFLSR 237 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 PA+G+DI +GYLP + Q+G SL YEQY+G+ V LFG K+Q DP A+ V YTPVPL Sbjct: 238 PASGYDITTQGYLPFYRQIGGSLSYEQYWGENVDLFGSGKKQNDPRAMQLGVNYTPVPLV 297 >UniRef50_P39165 Uncharacterized protein ychO n=102 Tax=cellular organisms RepID=YCHO_ECOLI Length = 464 Score = 245 bits (625), Expect = 1e-63, Method: Composition-based stats. Identities = 75/286 (26%), Positives = 120/286 (41%), Gaps = 10/286 (3%) Query: 15 YSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVAS 74 + R + + + + T A +++ EK+ A Sbjct: 2 SRFVPRIIPFYLLLLVAGGTANAQSTFEQKAANPFDNNNDGLPDLGMAPENHDGEKHFAE 61 Query: 75 FAANAGTFLSSQPDSDATRNF-------ITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + G + D + + + NQ ++ WL +G A V + VD + Sbjct: 62 IVKDFGETSMNDNGLDTGEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNE 121 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 S P+ D + ++Q + + D+ SN+G G R GN W+ G NTF D Sbjct: 122 GHFTGSRGSWFVPLQDNDRYLTWSQLGLTQQDNGLVSNVGVGQRWARGN-WLVGYNTFYD 180 Query: 188 HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 + L + R G GAE W +YL+LSAN Y + W + ++R A G+D+ A +P Sbjct: 181 NLLDENLQRAGFGAEAWGEYLRLSANFYQPFAAWHEQT--ATQEQRMARGYDLTARMRMP 238 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 + L S+ EQY+GD V LF +P A+S + YTPVPL Sbjct: 239 FYQHLNTSVSLEQYFGDRVDLFNSGTGYHNPVALSLGLNYTPVPLV 284 >UniRef50_D2TXV3 Intimin/invasin (Fragment) n=1 Tax=Arsenophonus nasoniae RepID=D2TXV3_9ENTR Length = 539 Score = 243 bits (619), Expect = 7e-63, Method: Composition-based stats. Identities = 93/236 (39%), Positives = 134/236 (56%), Gaps = 11/236 (4%) Query: 63 TADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 +NN E+ AS G LSS D + N+ + NQ+I +WL +YG AR+ Sbjct: 100 PEENNNEEKFASSFTLMGDILSSDNFVDNSINYAKSIGQGLVNQQINDWLNQYGKARISF 159 Query: 123 NVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGV 182 + D K+ S + L P+ D P N+LFTQ + DR N+G G+R + N WM G+ Sbjct: 160 SSD-----KNISGDFLLPVIDEPNNLLFTQLGLRNNTDRNTINLGLGYRKYWRN-WMFGI 213 Query: 183 NTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP--DIEDYQERPANGWDI 240 NTF D+D + + R+GVG E W DYLKL+ NGY + W +S ++DY ERPA G+D+ Sbjct: 214 NTFYDYDYTGGNARLGVGGEAWIDYLKLAINGYFGLTDWHQSKISVMDDYDERPATGFDV 273 Query: 241 RAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDK---RQKDPHAISAEVTYTPVPLT 293 RAE YLP +PQLG+S+ YE+Y+G + L + D ++ + YTP+PL Sbjct: 274 RAEAYLPKYPQLGSSIKYEKYFGKGIHLGTGVNPEYLKDDAQSLIMGLNYTPIPLL 329 >UniRef50_B7LRE6 Putative invasin-like protein; putative exported protein n=3 Tax=Enterobacteriaceae RepID=B7LRE6_ESCF3 Length = 672 Score = 241 bits (615), Expect = 2e-62, Method: Composition-based stats. Identities = 89/291 (30%), Positives = 144/291 (49%), Gaps = 16/291 (5%) Query: 12 RFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN 71 + + LAR +AW + Q+L P A+ A+A R ++ D + Sbjct: 2 KLTPTPLARWLAWVLVGTQLLTPAAL-------AQAMLPEITRSGADSSVDKTDQPEAEW 54 Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKY---GTARVKLNVDKDF 128 +AS A++ G+ L SD +N I + AN I + + R + ++ Sbjct: 55 LASRASSLGSLLQEGNISDFAKNQIQALPQTIANDGITSGIKHWLPEAQFRGGITLEDAS 114 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVN 183 + + ++L P+Y + +++LF Q + D+ R N G GWR G+ W+ G+N Sbjct: 115 KYRSAEADLLIPLYQSTSSILFGQLGLRDHDNNSFNGRFFVNTGIGWRQDVGD-WLLGIN 173 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 +F+D D+ H R +G E +RD + L+ N Y S WK S + ERPA G D+R + Sbjct: 174 SFLDADVRYDHLRGSLGVELFRDSMSLAGNWYFPLSDWKASKVQPLHDERPATGIDVRLK 233 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 G LP+ P GA L +EQY+GD+V + G D +DP A + +T+ PVPL + Sbjct: 234 GALPSLPWFGAELAFEQYFGDKVDILGNDSLTRDPAAFTGAITWKPVPLVE 284 >UniRef50_A0KH56 Invasin family protein n=1 Tax=Aeromonas hydrophila subsp. hydrophila ATCC 7966 RepID=A0KH56_AERHH Length = 916 Score = 240 bits (612), Expect = 5e-62, Method: Composition-based stats. Identities = 93/222 (41%), Positives = 127/222 (57%), Gaps = 2/222 (0%) Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 + + S+ + R + +AN LG GTAR ++ +D DF++ + Sbjct: 166 EQVPTSASRYGSEQEVQYWRQQLATQFEEEANAYAASLLGAMGTARTRVTLDDDFNMVTA 225 Query: 134 SLEMLYPIYDTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSR 192 ++L P+ + +LFTQ + R DRT +N+G G RHF + WM G N F D+DL+ Sbjct: 226 EADLLLPLAEEQQTLLFTQFGLRRNGQDRTIANLGVGQRHFL-DRWMLGYNLFADYDLTN 284 Query: 193 SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQL 252 H R GVGAE WRDYLKL AN Y S W+ SP E +ER A G D+R E YLPA+PQ Sbjct: 285 RHWRAGVGAEAWRDYLKLGANFYTPLSSWRDSPRFEGMEERAARGMDVRLEAYLPAYPQW 344 Query: 253 GASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 ASL EQY G+ VGL D+ ++DPHAI+A + Y P PL + Sbjct: 345 SASLTAEQYLGERVGLLDADQLERDPHAITAGLHYNPFPLLK 386 >UniRef50_B6VL97 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VL97_PHOAA Length = 924 Score = 239 bits (609), Expect = 1e-61, Method: Composition-based stats. Identities = 85/246 (34%), Positives = 122/246 (49%), Gaps = 10/246 (4%) Query: 49 HAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEI 108 + + K N SD +++ I M A E Sbjct: 54 QHQTDDDATQGGDIPKSAMSGKRWLQHQTNDDVM----QGSDISKSGIADMGFAALQPET 109 Query: 109 QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGF 168 ++ G R L + D L S+++ YP+YD + + F Q R D R N+G Sbjct: 110 EKSA---GEVRANLPL-SDGKLTSGSIDLFYPLYDGDSRLFFGQVGARRFDGRNIVNLGI 165 Query: 169 GWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 G R+F G+ W G NTF D +S + H R+G G EYWRDYL LSANGY + W S + Sbjct: 166 GQRYFQGD-WALGYNTFYDIQISGNAHQRLGFGLEYWRDYLYLSANGYFGLTDWYSSSAL 224 Query: 228 EDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTY 287 + Y ER ANG+DIRA+G+ P +PQL L +EQY+GD++ L R K+P+A++ + Y Sbjct: 225 DGYAERAANGYDIRAQGWFPVYPQLSGKLKFEQYFGDDIALLNHQNRYKNPYALTMGLEY 284 Query: 288 TPVPLT 293 TP+ L Sbjct: 285 TPIQLI 290 >UniRef50_D2TL92 Intimin-like protein n=2 Tax=Enterobacteriaceae RepID=D2TL92_CITRO Length = 421 Score = 238 bits (608), Expect = 1e-61, Method: Composition-based stats. Identities = 72/240 (30%), Positives = 108/240 (45%), Gaps = 10/240 (4%) Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDATRN-------FITGMATAKANQEIQEWLG 113 + + EK A G + D + +A+ NQ ++ WL Sbjct: 1 MMPESHEGEKQFAEMVKAFGEASMTDNGLDTGEQAKQFAFDQVRDALSAQVNQHLESWLS 60 Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHF 173 +G A V + VD S P D + ++Q + R +D SN+G G R Sbjct: 61 PWGNASVNVQVDNQGKFNGSRGSWFIPWQDNLRYLTWSQLGLTRQEDGLVSNVGIGQRW- 119 Query: 174 SGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQER 233 + + W+ G NTF D+ L R G+GAE W +YL+LSAN Y + W ++R Sbjct: 120 ARDGWLLGYNTFYDNLLDEDLQRAGLGAEAWGEYLRLSANYYQPFASWH--ERSATQEQR 177 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 A G+D+ A+ +P + L + EQY+GD V LF K +P A+S + YTPVPL Sbjct: 178 MARGYDVSAQMRMPFYQHLDTRVSVEQYFGDSVDLFDSGKGYHNPLAVSLGLNYTPVPLV 237 >UniRef50_UPI0001C33E3F putative adhesin n=1 Tax=Citrobacter youngae ATCC 29220 RepID=UPI0001C33E3F Length = 684 Score = 238 bits (607), Expect = 2e-61, Method: Composition-based stats. Identities = 87/222 (39%), Positives = 126/222 (56%), Gaps = 6/222 (2%) Query: 75 FAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFS--LKD 132 + +A + L S P D + G + +Q I+ WL +YG AR+ LN D S L Sbjct: 18 YTKSAASLLKSGPAFD---QYAAGKISQLTSQAIEGWLKQYGNARITLNAQSDNSTALAG 74 Query: 133 SSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLS 191 SS ++L+ +++ + + + Q H D + N+G G R+F N M G N F D +++ Sbjct: 75 SSADLLFGLHNQDSRLDYIQFDTHYQDTEDMIFNVGLGQRYFMTNKTMLGYNVFYDRNIN 134 Query: 192 RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQ 251 +R GVG E WRDY K S NGY S W+ S +EDY E+ A+G+D++ E YLP + Q Sbjct: 135 SGVSRSGVGFELWRDYFKFSGNGYFALSDWQNSEQLEDYDEKAADGYDMQIEAYLPTYAQ 194 Query: 252 LGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 LG L YEQY+GD V LF + Q DP AI+ ++YTP+PL Sbjct: 195 LGGHLKYEQYFGDNVALFDTNHLQTDPSAITVGMSYTPIPLI 236 >UniRef50_B1EM37 Invasin n=1 Tax=Escherichia albertii TW07627 RepID=B1EM37_9ESCH Length = 237 Score = 234 bits (597), Expect = 3e-60, Method: Composition-based stats. Identities = 105/247 (42%), Positives = 150/247 (60%), Gaps = 11/247 (4%) Query: 7 GHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADN 66 + R + V W+ I+ Q+L P+ T P ++ + + A++ Sbjct: 2 TMVNKKLR-RKASCAVTWSVIATQILSPVTFTLIP------ANSFASSANTESAQTNAND 54 Query: 67 NVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDK 126 +AS AANAG L++ F +A+A +E+ +WL +YG AR+KLNVD+ Sbjct: 55 EYANELASLAANAGQSLANNTAG----RFAVDTLSAQATKEVVDWLQQYGNARIKLNVDE 110 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 F+LKD++ + LYP D+ +LF+Q ++HRTDDR Q+NIG G RHF+ ++ M G N F Sbjct: 111 SFTLKDAAFDFLYPWMDSKDYVLFSQTSLHRTDDRNQANIGLGLRHFTTDNAMLGANIFY 170 Query: 187 DHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYL 246 D+DLSR H+R G+G EYWRDY++ AN Y S WK S DI+DY ERPANGWD+ AEG+L Sbjct: 171 DYDLSRHHSRAGLGVEYWRDYMRFGANTYFGLSDWKDSRDIDDYFERPANGWDVSAEGWL 230 Query: 247 PAWPQLG 253 P +PQLG Sbjct: 231 PVYPQLG 237 >UniRef50_C4K752 Putative intimin-like protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K752_HAMD5 Length = 796 Score = 234 bits (596), Expect = 3e-60, Method: Composition-based stats. Identities = 79/203 (38%), Positives = 114/203 (56%), Gaps = 2/203 (0%) Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 N Q ++ + +G + L+VD S +L P Y +++LF Sbjct: 176 YIENTARNQLLNPFQQNVKTFFDHFGQTEINLSVDNKGRFNQSRFLLLTPWYKNNSHVLF 235 Query: 151 TQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDYLK 209 +Q ++++RT +IG G R + ++ G N FID+DL + H R+ +G E +Y K Sbjct: 236 SQLGF-QSEERTIGHIGIGQRFDDLHPFLNLGYNVFIDYDLDQQHKRMSIGTEAASNYFK 294 Query: 210 LSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 LS N Y + W+ S D+EDY ERPA G+DIR +GYLP +PQLG + YEQY+G EV LF Sbjct: 295 LSTNYYWPITKWRDSFDMEDYMERPAEGFDIRLQGYLPNYPQLGGKMKYEQYFGKEVALF 354 Query: 270 GKDKRQKDPHAISAEVTYTPVPL 292 K KRQK+P A+S + Y P PL Sbjct: 355 NKTKRQKNPKAVSIGIDYRPFPL 377 >UniRef50_C9XTU1 Uncharacterized protein ychO n=7 Tax=Enterobacteriaceae RepID=C9XTU1_CROTZ Length = 441 Score = 232 bits (591), Expect = 1e-59, Method: Composition-based stats. Identities = 73/257 (28%), Positives = 114/257 (44%), Gaps = 10/257 (3%) Query: 44 AARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPD---SDATRNFI---- 96 A+ +N EK+ A G + R+F Sbjct: 3 QAQNPFDENGDNLPDLGLAPENNAAEKHFAHVLKAFGEASQTDSALSPGQQARHFAFTRL 62 Query: 97 TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIH 156 ++ E + L +G A V L VD++ + SS + P D + ++Q + Sbjct: 63 RDAVSSSITSEAESLLSPWGNATVDLLVDEEGNFNGSSGSLFTPWQDNNRYLTWSQVGVS 122 Query: 157 RTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYI 216 + + N G G R +G+ W+ G NTF D +R G GAE W DYL+LSAN Y Sbjct: 123 QQNQGLVGNAGIGQRWTAGH-WLLGYNTFYDRLFDDDTSRAGFGAEAWGDYLRLSANYYQ 181 Query: 217 RASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQK 276 GW+ + ++R A G+D+ A+ YLP + + S+ +EQY+GD+V LF Sbjct: 182 PLGGWEHRAGLL--EQRMARGYDVTAQAYLPFYQHINTSVSFEQYFGDQVELFDSGSGYH 239 Query: 277 DPHAISAEVTYTPVPLT 293 +P A+ ++YTPVPL Sbjct: 240 NPVAVKVGLSYTPVPLV 256 >UniRef50_B5R4C3 Invasin-like protein n=40 Tax=Salmonella enterica RepID=B5R4C3_SALEP Length = 660 Score = 222 bits (565), Expect = 1e-56, Method: Composition-based stats. Identities = 85/291 (29%), Positives = 134/291 (46%), Gaps = 34/291 (11%) Query: 12 RFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKN 71 F + + + WA ++ Q+ P+ +DN ++ Sbjct: 2 VFSKKPITKYITWAIVTSQIPLPVIA-------------------------DSDNEIQSW 36 Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYG---TARVKLNVDKDF 128 +A A++ L D + I + AN + E + R +N++ Sbjct: 37 IAGTASSISPHLQEGTLEDYAKGKIKALPGQAANHLVNEGIKSAFPEIIFRGGVNLEDGA 96 Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-----RTQSNIGFGWRHFSGNDWMAGVN 183 + S +M P+ +T +++LF Q D+ RT N+G G+R N W+ GVN Sbjct: 97 KYRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGRTYVNVGMGYRQEV-NGWLLGVN 155 Query: 184 TFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAE 243 TF+D D+ SH R G+G E ++D L S N Y +GWK S E + ERPA G+D+R + Sbjct: 156 TFLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKTSAAHELHDERPAYGFDLRTK 215 Query: 244 GYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 G LP +P L YEQYYGD+V L G ++P A A++ + PVPL + Sbjct: 216 GTLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGADLVWNPVPLLE 266 >UniRef50_D2TBQ7 Putative invasin n=1 Tax=Erwinia pyrifoliae DSM 12163 RepID=D2TBQ7_ERWPY Length = 519 Score = 214 bits (546), Expect = 2e-54, Method: Composition-based stats. Identities = 82/260 (31%), Positives = 118/260 (45%), Gaps = 11/260 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFI----- 96 A A+ A + TV ++ + K +A A + G + + R Sbjct: 80 PFADPARFAKMQQQLPELGTVHDNDQLAKKIAEAAKSIGEASMNSDSDRSLREEAGIWVF 139 Query: 97 ---TGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG 153 A +A E ++ L YG A V L + D S SS +++ P D + + F+Q Sbjct: 140 NRFRDAAKQRAASEGEQLLSPYGRASVSLALSDDGSFNGSSAQLVTPWQDNYSYLTFSQL 199 Query: 154 AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSAN 213 I +++ + N G G R +G W G N F+D L R +GAE W YL+ SAN Sbjct: 200 GIEQSEYGSVGNAGLGQRWIAG-SWRVGYNAFVDSLLGPDRQRGSLGAEAWGKYLRFSAN 258 Query: 214 GYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDK 273 Y SG + + R A G+DI GYLP + QLG +L YEQY G+ V LF Sbjct: 259 YYQPLSGCRNHSNSA--LMRMARGYDITTRGYLPFYRQLGVTLSYEQYLGEGVDLFNSGN 316 Query: 274 RQKDPHAISAEVTYTPVPLT 293 +P A+S + YTPVPL Sbjct: 317 AVANPAAVSLGINYTPVPLF 336 >UniRef50_Q7W286 Putative adhesin n=1 Tax=Bordetella parapertussis RepID=Q7W286_BORPA Length = 1937 Score = 213 bits (542), Expect = 6e-54, Method: Composition-based stats. Identities = 65/302 (21%), Positives = 112/302 (37%), Gaps = 12/302 (3%) Query: 2 SHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTF-TPVMAA-RAQHAVQPRLSMGN 59 +H ++ +R A +++Q P+A P +A + A ++ Sbjct: 12 AHLPARGRRHWYRRHRAGAAGMSAVLAMQAAAPVAYGQGAPTFSATQVADAASNAVAQPG 71 Query: 60 TTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEW-------- 111 T + +A G + D F+ A A+AN +Q+ Sbjct: 72 AVETRVAQTIQALAQAREAGGARQDGRASLDG--QFLRSQAQAQANVLVQQGVQWANETG 129 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L ++ D + + ++ L Q H + R N G R Sbjct: 130 LPWLRRLEGNVSYDFSGRDVAVDVRTIDALHLDQDRALLLQLGGHNQNHRPTVNAGVVAR 189 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 +G+ + G N F+D+++ + H R +GAE L N Y SGWK + E + Sbjct: 190 SAAGSSLILGGNAFLDYEVGKRHLRGSLGAEAVAAQFTLYGNVYAPLSGWKAAKRAERRE 249 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA GWD+ A L + Y ++ G +V F + +++P + Y PVP Sbjct: 250 ERPAAGWDVGFTARPEAVQGLALNAQYFRWRGAQVDYFDDGRYRRNPSGFKYGIEYRPVP 309 Query: 292 LT 293 L Sbjct: 310 LI 311 >UniRef50_Q7WR47 Putative adhesin n=1 Tax=Bordetella bronchiseptica RepID=Q7WR47_BORBR Length = 969 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 63/271 (23%), Positives = 103/271 (38%), Gaps = 7/271 (2%) Query: 27 ISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQ 86 +++Q + P P +AR A R ++ + + +A AG+ S++ Sbjct: 34 LTLQTVAPAFAQGAPSFSAR--PAQADRQDAADSAMLRVAQTARQLAQR-QAAGSRASAR 90 Query: 87 PDSDATRNFITGMATAKANQEI----QEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIY 142 D D + A + + Q L + +N D L + ++ Sbjct: 91 VDGDLLKGQAEAQANELLQEGVRLANQTELPFLRRLQGGVNYDFSNKDLSLDLRTIDEVH 150 Query: 143 DTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAE 202 + + Q + H + R N G RH G N F+D++ ++H R +G E Sbjct: 151 RGERDRVLLQLSGHNRNHRPTVNGGVVLRHALNQHMAVGANAFLDYEFGKNHLRGSLGGE 210 Query: 203 YWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYY 262 L N Y SGWK + E +ERPA+GWD+ A P L Y ++ Sbjct: 211 VIAPQFTLYGNVYAPMSGWKAAKRAERREERPASGWDVGVRLQPEALPGLAIKGQYFRWS 270 Query: 263 GDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 G V F + Q++ V Y PVPL Sbjct: 271 GAAVDYFDNGRPQRNARGYKYGVEYRPVPLV 301 >UniRef50_Q9APE8 Putative outer membrane ligand binding protein n=3 Tax=Bordetella RepID=Q9APE8_BORBR Length = 1578 Score = 203 bits (515), Expect = 9e-51, Method: Composition-based stats. Identities = 57/268 (21%), Positives = 89/268 (33%), Gaps = 9/268 (3%) Query: 35 LAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQP-----DS 89 LA P+ A + D + +A+ A + + + D Sbjct: 54 LAQALLPLSALAQGAPTLRPARVAQEEAGQDAAWTRKLAAQAESLARRQAERQPGARVDG 113 Query: 90 DATRNFITGMATAK----ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP 145 D + N + L + L+ D + L + +Y Sbjct: 114 DYLKREAQAQVNDVLRDGVNLARESGLPFLRNLQGGLSHDFESGRTSLQLNTIDEVYRAG 173 Query: 146 TNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 N Q H +DR +N G +R + M G N F+D++ + H R VG E Sbjct: 174 RNTGLLQLGAHNQNDRPTANAGAVYRREVNDALMVGANGFLDYEFGKQHLRGSVGLEVIA 233 Query: 206 DYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 L N Y S WK + +E+PA+G D+ P L S + ++ G E Sbjct: 234 PEFSLYGNVYAPLSDWKGAKRNNRREEKPASGMDVGVGYRPAFAPGLSLSATHFRWNGAE 293 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLT 293 V F + Q V Y PV L Sbjct: 294 VDYFDNGRTQAGAKGFKVGVEYRPVSLV 321 >UniRef50_Q2KVY3 Putative adhesin (Fragment) n=1 Tax=Bordetella avium 197N RepID=Q2KVY3_BORA1 Length = 1654 Score = 196 bits (499), Expect = 5e-49, Method: Composition-based stats. Identities = 69/302 (22%), Positives = 110/302 (36%), Gaps = 18/302 (5%) Query: 1 MSHYKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNT 60 +SH K + R R A ++ +Q PLAV A+ + R G+ Sbjct: 25 VSHAKGSGRNRRRRAQRAASSAVCLSLGMQAAAPLAVL------AQGAPEMTNRPEAGDI 78 Query: 61 TVTADNNVEKNVASFAANAGTFLSSQPDSDAT-RNFITGMATAKANQEIQEW-------- 111 ++V VA A + + + + +++ A+ NQ +QE Sbjct: 79 VP---SDVLTQVAVRAQDLARRQADRREGAQVDADYLKQQGQAQFNQFLQEGVRAANESG 135 Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 L + L D D L + +Y N Q H ++R +N+G +R Sbjct: 136 LRFLRNLQGDLRHDFDNGRTSLELRTIDQVYRKGANTGLLQLGGHNQNNRPTANLGGVYR 195 Query: 172 HFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ 231 M G N F+D++ ++ H R +G E N Y SGW + + Sbjct: 196 RDINERLMLGANAFLDYEFAKQHLRGSLGVEAIAPEFSFYGNVYAPMSGWTGAKRDNRRE 255 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVP 291 ERPA+G D+ + P L Y ++ G V F + Q V Y PVP Sbjct: 256 ERPASGMDLGMKYSPGFAPGLSLKANYFRWNGAAVDYFDNGRTQDRATGFKYGVQYKPVP 315 Query: 292 LT 293 L Sbjct: 316 LL 317 >UniRef50_B9KGJ3 Putative adhesin/invasin n=1 Tax=Campylobacter lari RM2100 RepID=B9KGJ3_CAMLR Length = 1459 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 76/268 (28%), Positives = 112/268 (41%), Gaps = 22/268 (8%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 N D V AG S+ S + + MA++ N Sbjct: 281 KTQKALNDNKKDNNLSKEDQEFSNKVMKVIQTAGAIYDSED-SKSKEEIVKNMASSYLNT 339 Query: 107 EIQEWLGKY-GTARVKLNVDKDFSLK-----DSSLEMLYPIY--DTPTNMLFTQGAI-HR 157 E ++ + +N D F+ + + L PI D P F Q I Sbjct: 340 SANELAKEFIDSLNTSINTDFSFNYNERSGFSGNAKALLPIVSEDNPKISYFLQSGIGEF 399 Query: 158 TDDRTQSNIGFGWRHFSG-------NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKL 210 +DRT + G G R++ + M G+N+ DHD SR H R+ +GAE D L Sbjct: 400 ANDRTIGHFGGGIRYYPNATALNNSGNIMLGLNSVYDHDFSRGHKRMSLGAEAMVDTLAF 459 Query: 211 SANGYIRASGWKKSPDIE-DY-QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGL 268 +AN Y R S W S D + DY QERPANGWD + + P+ + Q+YG++VG+ Sbjct: 460 NANVYQRLSSWIDSYDFDKDYVQERPANGWDAKIKYAFPSLINVSFFAKMGQWYGNKVGI 519 Query: 269 FGK---DKRQKDPHAISAEVTYTPVPLT 293 FG D +K+P ++Y+P P Sbjct: 520 FGANSVDDLEKNPLIYEGGISYSPFPAL 547 >UniRef50_Q2KW90 Adhesin n=1 Tax=Bordetella avium 197N RepID=Q2KW90_BORA1 Length = 747 Score = 179 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 57/252 (22%), Positives = 88/252 (34%), Gaps = 6/252 (2%) Query: 43 MAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATA 102 M + A+ V + +E VA N T S A Sbjct: 6 MPSPARLLTLLLCPTLLPPVAYGSAIESEVA---RNLWTRAQHPDTSPGLAQSALDAGVA 62 Query: 103 K-ANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR 161 Q L L D D SL + + + L Q +H + R Sbjct: 63 AGLQASRQTGLPWLRHLDGGLRYDLDPGRLSFSLRTIDDLMVSERRALMLQAGLHNQNQR 122 Query: 162 TQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 +N G R + + G N F+D++ + H R +G E + L AN Y SGW Sbjct: 123 PTANTGIVLRQQASPGLIVGSNAFLDYEFGKQHVRGSLGLEAIAPHYSLYANYYAPLSGW 182 Query: 222 KKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAI 281 K + +ERPA G+D+ G L + L Y +++G + +F + Q++ Sbjct: 183 KGARRDSRREERPAAGYDL--GGQLSSDAGLSLQAAYFRWHGAGIDVFDSGRAQRNASGF 240 Query: 282 SAEVTYTPVPLT 293 V Y P L Sbjct: 241 RYGVAYQPGALF 252 >UniRef50_UPI000190CDC9 hypothetical protein Salmonentericaenterica_25197 n=6 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190CDC9 Length = 327 Score = 174 bits (441), Expect = 4e-42, Method: Composition-based stats. Identities = 54/146 (36%), Positives = 79/146 (54%), Gaps = 3/146 (2%) Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 M ++Q + + D SN+G G R + + W+ G NTF D+ L + R G GAE W +Y Sbjct: 1 MTWSQLGLTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWGEY 59 Query: 208 LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 L+LSAN Y + W+ ++R A G+DI A+ LP + + S+ EQY+GD V Sbjct: 60 LRLSANYYQPFADWQTHT--ATLEQRMARGYDINAQVRLPFYQHINTSVSLEQYFGDSVD 117 Query: 268 LFGKDKRQKDPHAISAEVTYTPVPLT 293 LF +P A+ + YTPVPL Sbjct: 118 LFDSGTGYHNPVALKLGLNYTPVPLL 143 >UniRef50_Q31A57 Adhesin-like protein n=4 Tax=Prochlorococcus marinus RepID=Q31A57_PROM9 Length = 372 Score = 156 bits (395), Expect = 7e-37, Method: Composition-based stats. Identities = 64/295 (21%), Positives = 106/295 (35%), Gaps = 40/295 (13%) Query: 28 SVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEK--------NVASFAANA 79 Q L + + F +++ A + +N K A+++ Sbjct: 3 ISQALTSITLVFGSILSVSANEYKFEEIKFNQIPNEQNNYEPKDKLDEYIIKGANYSTKF 62 Query: 80 GTFLSSQPDSDATRNFITG------------MATAKANQEIQEWLGKYGTARVKLN--VD 125 +++ D + A AKAN EIQ+ + + V ++ + Sbjct: 63 VPLMNNGAKGDEYTGIMADDLNRLLVDAGFDFANAKANGEIQK-IPFFAQTSVNISGGTE 121 Query: 126 KDFSLKDSSLEMLYPIYDTPTN----MLFTQGAIHRTDD--RTQSNIGFGWRHFSGNDWM 179 D S +SL L + + F+Q + + NIG G R+ + M Sbjct: 122 SDTSFSINSLMKLGELAKDDQGDLKTLAFSQARFATATNAEGSTINIGLGIRNRPDDISM 181 Query: 180 AGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI-EDYQERPA 235 G N F D+ D S +H+R+G+G EY+ + N Y+ + K DYQER Sbjct: 182 VGANAFWDYRMTDYSDAHSRLGLGGEYFWKDFEFRNNWYMAITNEKDVIIKGVDYQERVV 241 Query: 236 NGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPHAISAEVT 286 GWD+ LP P+L + + D GL G Q PH + Sbjct: 242 PGWDLEVGYRLPNNPELAFYIRGFNWDYKYTQDNSGLEGAVSWQATPH---VGLE 293 >UniRef50_UPI0000E87F3C hypothetical protein MB2181_03125 n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87F3C Length = 331 Score = 154 bits (389), Expect = 3e-36, Method: Composition-based stats. Identities = 55/239 (23%), Positives = 93/239 (38%), Gaps = 10/239 (4%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 A V L+ + +A + + + T L + +A +N K Sbjct: 11 APLIVAVSLTQADALKSALEMQDAQDKAEIMDLSTMLLAGD-VEALKNTAIDGVVEKGVG 69 Query: 107 EIQEWLGKYG-TARVKLNVDKDFSLKDSSLEMLYPIYDTPT--NMLFTQGAIHRTDDRTQ 163 + +L +Y T + S L ++ P+ D N FTQG++ D+RT Sbjct: 70 VTKSFLEQYFPTVELNFGAQ-GGSKPSGGLLVVAPLSDPDDIFNTYFTQGSVFYEDNRTT 128 Query: 164 SNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 N+G G+R S N + G+N F DH+ H R +G E +++AN Y + WK Sbjct: 129 LNLGLGYRKLSDNKMLLTGINAFYDHEFPYDHGRTSIGLEARTTVWEINANKYWATTKWK 188 Query: 223 KSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFG--KDKRQKDPH 279 + +ER +G+DI A LP + Q+ + G + Q + Sbjct: 189 TGKN--GLEERALDGYDIEAGVPLPYMNWATVFVKNFQWDSEISGSKDIKGNDLQLRAY 245 >UniRef50_C4FMD1 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FMD1_9FIRM Length = 338 Score = 151 bits (380), Expect = 4e-35, Method: Composition-based stats. Identities = 58/268 (21%), Positives = 90/268 (33%), Gaps = 14/268 (5%) Query: 30 QVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDS 89 Q LAV +P + ++ T V S+ Sbjct: 20 QREHTLAVATSPREEHIKAVSEDFHITPAATPDHVIGEGGPLVMDRQETKTVQYSN---V 76 Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPIYDTPTN- 147 DA I +A + + + GK R L+ K S+E + P+ + Sbjct: 77 DAVNRAINAVAMSNVSNAMYGAKGKPWMRRTTLSFQFQEGWKPLYSVETVQPLGHYDNSS 136 Query: 148 --MLFTQGAIHR-TDDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVGAEY 203 + FTQ I R +D T NIG G+R S + + G + F DH H R+ G EY Sbjct: 137 RDVWFTQQRISRASDTGTTLNIGVGYRRISKDDRRLYGAHLFYDHRFLNRHNRLSAGLEY 196 Query: 204 WRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYY- 262 + N Y AS + ER ANG+ + + + Sbjct: 197 MSGESEFRFNWYGSASDERVLDVNLHTLERVANGYTVEYGKTFKNARWARVYVEGYHWNQ 256 Query: 263 ---GDEVGLFGKDKRQKDPHAISAEVTY 287 D+ GL + Q P +S ++ Y Sbjct: 257 ERQADKNGLRVGSELQLTPR-VSVDMGY 283 >UniRef50_D0CKU8 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 8109 RepID=D0CKU8_9SYNE Length = 389 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 66/307 (21%), Positives = 117/307 (38%), Gaps = 37/307 (12%) Query: 9 KQPRFRYSVLARCVAWANISV---QVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTAD 65 + R + A+ + A ++ F L + + A + + + Sbjct: 1 MRTTNRLLLSAKHIKQAMSGSVSFKLAFSLIASGLTLQCLPASAESTQKNFTERGSHSLY 60 Query: 66 NNV------EKNVASFAAN-----AGTFLSSQPD---SDATRNFITGMATAKANQEIQEW 111 ++ E +AS + T L+++ S+ N +A+ K + + Sbjct: 61 SSHSKGIWHESPLASRVIDKLLIRNWTSLNNKNGIEWSNQISNLALNLASNKLSDYATKT 120 Query: 112 LGKYG---TARVKLNVDKDFSLKDSSLEMLYPIYD-------TPTNMLFTQGAIHRT-DD 160 + KY A V ++ + + + ++L+ I D + + F + + Sbjct: 121 IQKYPFVLGASVNFDIRTEGA-TNIGGDVLFKIADFGLKDDESRDGIAFLHTKYTGSLSN 179 Query: 161 RTQSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIR 217 + N G G RH G + +AGVN + D+ S SH+R G+G E + L L+ N YI Sbjct: 180 DSTWNAGLGLRHLIGEELLAGVNGYWDYRTTNYSTSHSRFGLGGELFWKTLSLTNNWYIA 239 Query: 218 ASGWKK-SPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKD 272 +G K S + DY ER GWD LP+ P + ++ D G GK Sbjct: 240 GTGTKTISTNNTDYYERVVPGWDFELGYRLPSNPNIAFFARGFRWDYRNRNDNTGFQGKV 299 Query: 273 KRQKDPH 279 Q PH Sbjct: 300 TYQMTPH 306 >UniRef50_A5GWU2 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GWU2_SYNR3 Length = 428 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 58/233 (24%), Positives = 97/233 (41%), Gaps = 23/233 (9%) Query: 70 KNVASFAANAGTFLSSQPDSD-------ATRNFITGMATAKANQEIQEWLGKYGTARVKL 122 + A++AA G + + D + + AN++I++ + + + L Sbjct: 109 QKGANYAALYGPSMVNSNGVDLGGLIQTELSRTLISSGVSYANKQIKK-IPFFAQTTLGL 167 Query: 123 N--VDKDFSLKDSSLEMLYPIY----DTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSG 175 + D + S L I P ++F Q + T + Q N+G G R G Sbjct: 168 DAATSSDLTGYLDSFMRLKTIGYDNEGDPMGLMFGQARVTLETSAQPQVNVGLGSRFRLG 227 Query: 176 NDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-DIEDYQ 231 ++ + G+N F D + S ++TR G+GAE + +L N YI S K + DY Sbjct: 228 DEAIVGLNGFWDLRTTNYSTAYTRWGIGAEGFWKSFELRNNWYINGSADKNITINNIDYV 287 Query: 232 ERPANGWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPHA 280 ER GWD+ +P++PQL + + D G+ G Q PHA Sbjct: 288 ERVVPGWDVEVGYRIPSYPQLAIFVRGFNWDYQDHSDNSGIEGSVNWQATPHA 340 >UniRef50_Q2NRT1 Putative invasin n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NRT1_SODGM Length = 276 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 53/210 (25%), Positives = 82/210 (39%), Gaps = 7/210 (3%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 P A T A + Q L + ++ EK +A+ A + Sbjct: 35 LPAAAWVTQPENDAALLSQQQALPNLGSASVNESGTEKKLATLARQMAEVNQDENTDQTW 94 Query: 93 RNF----ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TN 147 R++ + Q+ + L G V L+VD+ SS ++L P+ D Sbjct: 95 RSYLLGEAKDRVLDRLQQKSEALLSPLGYTTVTLDVDERGRFNGSSGQLLLPLVDQKTRG 154 Query: 148 MLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS-HTRIGVGAEYWRD 206 + ++Q + DD N+G R + W+ G N F D L++ R +GAE D Sbjct: 155 LTYSQLGLQGVDDGVVGNMGLRQRW-NAGRWLLGYNVFYDQYLNQDASRRGSIGAEARSD 213 Query: 207 YLKLSANGYIRASGWKKSPDIEDYQERPAN 236 YL LS+N Y SG + D ED R A Sbjct: 214 YLTLSSNYYYPLSGMHAANDDEDELLRMAR 243 >UniRef50_UPI000190D9BD hypothetical protein SentesTyp_07924 n=2 Tax=Salmonella enterica subsp. enterica serovar Typhi RepID=UPI000190D9BD Length = 239 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 42/171 (24%), Positives = 70/171 (40%), Gaps = 8/171 (4%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRN------- 94 + A+AQ + + EK+ A A D D Sbjct: 70 TIRAQAQDPFDQNRLPDLGMMPESHEGEKHFAEMAKAFSEASMKNNDLDTGEQARQFAFG 129 Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 + + + + NQ+++ WL +G+A V +NVD + S P+ D + ++Q Sbjct: 130 QVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSWFIPLQDKQRYLTWSQLG 189 Query: 155 IHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWR 205 + + D SN+G G R + + W+ G NTF D+ L + R G GAE W Sbjct: 190 LTQQTDGLVSNVGIGQRW-AQDGWLLGYNTFYDNLLDENLQRAGFGAEAWG 239 >UniRef50_A7MZV1 Putative uncharacterized protein n=3 Tax=Vibrio harveyi RepID=A7MZV1_VIBHB Length = 543 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 52/163 (31%), Positives = 71/163 (43%), Gaps = 7/163 (4%) Query: 136 EMLYPIYDTPTNMLFT--QGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS 193 + L+ + + + R +++G G+R + GVN F D+DLSR Sbjct: 9 DTLHELDQPLKKLAYVSNHWGPLLFHGRDFAHLGLGYRQL-DDSQFFGVNVFFDYDLSRQ 67 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPD----IEDYQERPANGWDIRAEGYLPAW 249 HTR+ VGAEY DY S N Y S WK SPD + E+ A GWD+ E YLP Sbjct: 68 HTRVSVGAEYGLDYGTFSTNAYFPLSNWKDSPDHYEGMNSLVEKAAKGWDLNLETYLPLD 127 Query: 250 PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPL 292 + L QY G V K+P+ S + P P Sbjct: 128 TRWKFGLTAGQYLGRYVEHSDGSLPSKNPYHFSLSTEFRPDPA 170 >UniRef50_A4GHH9 Putative uncharacterized protein n=2 Tax=uncultured marine bacterium EB0_35D03 RepID=A4GHH9_9BACT Length = 308 Score = 137 bits (346), Expect = 3e-31, Method: Composition-based stats. Identities = 55/198 (27%), Positives = 87/198 (43%), Gaps = 11/198 (5%) Query: 73 ASFAANAGTFLS-SQPDSDATRNFITGMATAKANQEIQEWL-----GKYGTARVKLNVDK 126 A + G LS S DS+ ++ + T+ A+ + + + T V N+ + Sbjct: 15 AVLTMSLGFSLSVSADDSEQIKSSLMSRMTSSASSFVSTGIGALLSPNFDTVEVSTNLKE 74 Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTF 185 S D + +L D P + LF Q ++R D RT N+GFG+R + ++ WM GVN F Sbjct: 75 GDSTVD--IGVLKAFGDNPNSFLFNQINLNRHDKRTTLNLGFGFRRLNADETWMGGVNAF 132 Query: 186 IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGY 245 DH+ H R GVG E L+ N Y +G D + +G D+ + Sbjct: 133 YDHEFPNDHKRNGVGFEVVSSVLESRVNSYNGTTG--YIKDKSGTDSKVLDGRDMGFKVA 190 Query: 246 LPAWPQLGASLMYEQYYG 263 LP P + + Q+ G Sbjct: 191 LPYLPGMMFGMNAVQWKG 208 >UniRef50_Q4FMH8 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter ubique RepID=Q4FMH8_PELUB Length = 291 Score = 137 bits (345), Expect = 4e-31, Method: Composition-based stats. Identities = 48/204 (23%), Positives = 88/204 (43%), Gaps = 13/204 (6%) Query: 89 SDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV-DKDFSLKDSSLEMLYPIYDTPTN 147 + + A K +++I + G V L+ D D + S+ + I T + Sbjct: 16 TTVANADVASQALNKVSEKISNLIPGEGITEVSLDYNDGDEDQLNFSILGVRDIETTDNS 75 Query: 148 MLFTQGAIHRTD----DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHTRIGVGAE 202 FTQ ++ + R NIG G+R S + ++M G NTF D DL+ R+G+G E Sbjct: 76 NFFTQFSLMNQEINSSGRIIGNIGLGYRKLSEDKNFMFGANTFYDRDLTEGQDRLGLGIE 135 Query: 203 YWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYY 262 L L+AN Y + S + +E+ +GWD +P P + ++ Sbjct: 136 AKGSILDLTANSY---TKISNSEVVNGDREQVLSGWDFNLTSQIPRAPWARINYNGYKWE 192 Query: 263 GDEVGLFGKDKRQKDPHAISAEVT 286 ++ G ++ + +++ +VT Sbjct: 193 TEK----GSADQKGNIYSLELDVT 212 >UniRef50_D1BQB6 Putative uncharacterized protein n=2 Tax=Veillonella parvula RepID=D1BQB6_VEIPT Length = 347 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 49/211 (23%), Positives = 81/211 (38%), Gaps = 12/211 (5%) Query: 87 PDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK-DSSLEMLYPIY--- 142 D+DA + + + + + K R L++ + K +E L P+ Sbjct: 84 SDTDAVNSALQAVVMTGVHSAMHGSKAKPWMQRTVLSLRFQKNWKPLYGVETLQPLGHYD 143 Query: 143 DTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFS-GNDWMAGVNTFIDHDLSRSHTRIGVG 200 +T ++ FTQ + D T +N+G G+R + +D G N F DH +H R+ VG Sbjct: 144 ETSRHVWFTQERLANAADTGTTANVGIGYRRIAENDDHYYGGNLFYDHRFRGNHGRMSVG 203 Query: 201 AEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 EY N Y SG + S D E +NG+ + + Sbjct: 204 LEYVSGIGAFRMNWYRGVSGER-SLDGATRMENVSNGYTAEYGTSFKNARWARVYMEAYR 262 Query: 261 Y----YGDEVGLFGKDKRQKDPHAISAEVTY 287 + D+ GL + Q P IS ++ Y Sbjct: 263 WQLRRSADKHGLRIGTELQLTPR-ISVDMGY 292 >UniRef50_Q0FCK2 Putative uncharacterized protein n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=Q0FCK2_9RHOB Length = 327 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 79/217 (36%), Gaps = 15/217 (6%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKAN---QEIQEWL- 112 + A+ N G L+ +A + + +A AN ++++ + Sbjct: 14 SALPLSAQEVAKSGKFATIVKNIGNALNIGQGEEAVESEVNTLAVDAANAGLDQVEDKVL 73 Query: 113 --GKYGTARVKLNVD-----KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSN 165 + + + D K+ S + +Y + +T LF Q + ++RT N Sbjct: 74 STSNFTHFELSVGSDTMGLDKNKSDTKTEAMTVYRLKETGNWFLFNQTSAVNFNNRTTIN 133 Query: 166 IGFGWRHFSG-NDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 GFG RH + N + G N F D++L H R+G G E + AN Y S Sbjct: 134 TGFGARHINDANTVITGYNIFYDYELQSKHERVGAGLELLSSIFEFRANAYQAVSKT--- 190 Query: 225 PDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQY 261 QE +G+D + LP + + Sbjct: 191 LTYNGIQETALDGYDAKLTANLPYFYSSNLYGKLSNW 227 >UniRef50_Q4JN04 Predicted invasin-like SivH n=1 Tax=uncultured bacterium BAC13K9BAC RepID=Q4JN04_9BACT Length = 301 Score = 135 bits (341), Expect = 1e-30, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 66/200 (33%), Gaps = 19/200 (9%) Query: 85 SQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNV---------DKDFSLKDSSL 135 + + ++ A + + I+ W AR L ++ + Sbjct: 22 ASKAVNQIKDSAINKAFSYGDSAIESW------ARDNLTSLRLIEIETRSREGAKPTFRA 75 Query: 136 EMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRSH 194 L+ I N + +Q + DD N G +R + + + G+N F DH + H Sbjct: 76 ISLFEIGGNDFNKILSQLSYSTFDDDETINAGLIYRMMNSDMTVIYGLNIFYDHQFNTGH 135 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 R G+G E ++ N Y + + E A G+D +P P Sbjct: 136 ARTGLGFEMKSSVYDVNINFYEAQTEIHHV---DGVPEVAAGGYDAEIGAQVPYLPWAKV 192 Query: 255 SLMYEQYYGDEVGLFGKDKR 274 Q+ + + + + Sbjct: 193 YYKAYQWNNETLNIKDGETL 212 >UniRef50_B6BQN0 Putative uncharacterized protein n=1 Tax=Candidatus Pelagibacter sp. HTCC7211 RepID=B6BQN0_9RICK Length = 251 Score = 131 bits (329), Expect = 3e-29, Method: Composition-based stats. Identities = 48/158 (30%), Positives = 69/158 (43%), Gaps = 7/158 (4%) Query: 114 KYGTARVKLNVDKDFSLKDSSLEMLYPIYD--TPTNMLFTQGAIHRTDD-RTQSNIGFGW 170 K+ TA + L+ + S L ++ PI D N++FTQ ++ +DD R N+GFG Sbjct: 8 KFPTAEIGLSTGVTNEVTGSVL-VVKPISDPSDNENIIFTQASLFLSDDSRETINLGFGN 66 Query: 171 RHFSGND-WMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIED 229 R +D + G N F DH+L H R +G E L AN Y SGWK + + Sbjct: 67 RKLINDDTLLVGYNLFYDHELDYDHQRASIGIEAISSVGSLRANQYYGLSGWKS--GLNN 124 Query: 230 YQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 E+ NG D+ LP P + G Sbjct: 125 INEKALNGSDVELGMPLPYLPWTNLYYRSFNWEGASGA 162 >UniRef50_C0N7C0 Putative uncharacterized protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N7C0_9GAMM Length = 546 Score = 130 bits (326), Expect = 7e-29, Method: Composition-based stats. Identities = 32/172 (18%), Positives = 53/172 (30%), Gaps = 20/172 (11%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRT-DDRTQSNIGFGWRHFS 174 R+ + ++L P++ ++LF DD + NIG RH Sbjct: 31 WNPRIDFEGKLGNDRSIAEADLLIPLWQNNDSLLFANIRGRLDNDDSYEGNIGLALRHML 90 Query: 175 GNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY- 230 N W G + D ++ +G E L AN YI + D D Sbjct: 91 DNGWNLGGYGYFDRRKSPYDNFFNQVTLGVEALSLNWDLRANTYIPVGESSYAEDSLDTV 150 Query: 231 ------------QERPANGWDIRAEGYLPAW-PQLG--ASLMYEQYYGDEVG 267 +ER G+D +P + P+ + Y + Sbjct: 151 DFSGTTITYRAGEERSMRGYDAEVGWRIPVFSPEADKQLRIYAGGYRFTDSK 202 >UniRef50_A5GRI1 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GRI1_SYNR3 Length = 436 Score = 129 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 64/288 (22%), Positives = 101/288 (35%), Gaps = 49/288 (17%) Query: 42 VMAARAQHAVQPRLSMGNTTVTADNNVEKNV----ASFAANAGTFLSSQPDSDAT----- 92 +A + R N+ + + AS+A L+S SD Sbjct: 55 AVAGALEAGQSVRCETLVDADNQSNSTVQKIFVTGASYATRIFPLLNSASLSDGIQKMLW 114 Query: 93 ---RNFITGMATAKANQEIQEWLGKYGTAR--VKLNVDKDFSLKDSSLEMLYPIYDTPTN 147 ++FI A N+ + + + V D D + +SL L + Sbjct: 115 MDSKSFIVSFAHDYLNEYVLKQIPFLSQTEFGVGFESDADMTYYLNSLISLAQLGSDDNG 174 Query: 148 ----MLFTQGAIHRTDDRTQS-NIGFGWRHFSGNDWMAGVNTFIDHDLSR---SHTRIGV 199 +LF QG+ + N+G G R ++ M G N F D+ + S++R G Sbjct: 175 YPLGLLFAQGSAKGAYSGSAVTNLGLGLRRRLRDNAMLGANAFWDYRFTNYSSSYSRWGA 234 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDI-----------------------EDYQERPAN 236 GAE W D KL+ N YI +G K+ + ER Sbjct: 235 GAELWWDDFKLTNNWYIAGTGIKRITTSGRAYTDTTSLAAGTYDETTLLGANTFDERVVP 294 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYG----DEVGLFGKDKRQKDPHA 280 GWD+ LP++PQL + ++ D G+ G Q PH Sbjct: 295 GWDVALNYRLPSYPQLSLGIRGFRWDYMRKSDNSGVEGSVNWQATPHT 342 >UniRef50_Q3B5D9 Putative uncharacterized protein n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B5D9_PELLD Length = 302 Score = 123 bits (309), Expect = 7e-27, Method: Composition-based stats. Identities = 33/147 (22%), Positives = 62/147 (42%), Gaps = 4/147 (2%) Query: 141 IYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRH-FSGNDWMAGVNTFIDHDLSRSHTRIGV 199 + + + +F +G D R + G+RH S N M G N H+ R+H RI Sbjct: 71 VSENQADNIFFEGGFDYQDARKTVDGALGYRHLMSDNKVMLGANVLYSHEFPRNHQRISY 130 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYE 259 GAE ++++N Y R + WK +++ +E+ G+D+ +P P + + Sbjct: 131 GAEIRTSVFEINSNYYHRLTDWK-LTGVDNNEEKARGGYDVELALAVPYVPSAHFRVKHF 189 Query: 260 QYYGDEVGLFGKDKRQKDPHAISAEVT 286 + G + + D + V+ Sbjct: 190 CWNG--IASNDSNNPIDDLKGNTFSVS 214 >UniRef50_B4VPY3 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VPY3_9CYAN Length = 1370 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 31/264 (11%), Positives = 76/264 (28%), Gaps = 56/264 (21%) Query: 20 RCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANA 79 +A N V++L+ ++ A P L + + ++ + + ++ Sbjct: 1 MAIACMNSLVRLLWTSFCFTPLLIPAAIAQTEIPSLPKADAVPESHPSLGSPLQAQTPDS 60 Query: 80 GTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLY 139 + + + ++G + + + L+ Sbjct: 61 PPSTTPDLTTLQIK-------------------PRWG---IGYSTSGAGYDGFTRLDSFL 98 Query: 140 PIYDTPTN-MLFTQGAIHRTDD-RTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--H 194 P+ P + + F +G + + N+ FG R ++ + + + G D + + Sbjct: 99 PLLQNPGSTLTFLEGRLQLDNSANVGGNLLFGHRFYNQSLNRIFGGYLGFDRRDTGNSTF 158 Query: 195 TRIGVGAEYWRDYLKLSANGYIRASGWKK-----------------------------SP 225 ++GVG E + + NGY + Sbjct: 159 HQLGVGVETLGEVWDVRLNGYFPLGDTRDLVDETAFDTGFQLTDRFFSDHFLVIQGKRQR 218 Query: 226 DIEDYQERPANGWDIRAEGYLPAW 249 + E G+D+ L W Sbjct: 219 GQVRHFEAAMTGFDLEVGARLAQW 242 >UniRef50_Q7VR49 Putative adhesin n=1 Tax=Candidatus Blochmannia floridanus RepID=Q7VR49_BLOFL Length = 680 Score = 119 bits (299), Expect = 8e-26, Method: Composition-based stats. Identities = 38/189 (20%), Positives = 65/189 (34%), Gaps = 19/189 (10%) Query: 121 KLNVDKDFSLKDSSLEML-----YPI-YDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFS 174 K N +K+ S++ YP + F Q + + G G R Sbjct: 95 KYNNQSQIQIKNDSIDFFHVLLEYPWNMQYKKILYFLQIGMKNFTENKMIVFGSGKRLVY 154 Query: 175 GNDWMAGVNTFIDHDLSRSHTR---IGVGAEYWRDYLKLSANGYIRASGWKKSP---DIE 228 + G N H +S ++ I +G EYW LK N Y + S Sbjct: 155 NKKHIIGYNACYHHPISTIQSQPYSINIGGEYWYRNLKFIFNNYYNINEIFYSYKNISNH 214 Query: 229 DYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD----EVGLFGKDKRQKDPHAISAE 284 Y + P G+ I A+ P + + +EQ D + + + + H + Sbjct: 215 HYYQYPKIGYQICAKSNFPYISEFIGQIKFEQCVYDKTRNNIRFWNANNKN---HILCVS 271 Query: 285 VTYTPVPLT 293 + Y P+P+ Sbjct: 272 LEYQPIPMF 280 >UniRef50_Q492T4 Putative adhesin n=1 Tax=Candidatus Blochmannia pennsylvanicus str. BPEN RepID=Q492T4_BLOPB Length = 669 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 48/240 (20%), Positives = 88/240 (36%), Gaps = 28/240 (11%) Query: 81 TFLSSQPDSDATRNFITGMATAK-----ANQEIQEWLGKYGTARVKL------------- 122 L+ S+ + N K + I L Y KL Sbjct: 23 NTLTEGIKSNVSNNIFQDDLYQKEMKLHTHDHIHHTLNFYPYTTNKLRVHAYNYRPPFSS 82 Query: 123 NVDKDFSLKDSSLEMLYPIYDT----PTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDW 178 L++ S++M + Y N+ F Q IH N G G RH + + + Sbjct: 83 TYKSKMQLQNDSIDMFHSFYTQRNKHKKNLSFMQLGIHNLLSEQIFNFGGGKRHLTNDKY 142 Query: 179 MAGVNTFIDHDLSRSHTR---IGVGAEYW-RDYLKLSANGYIRASGWKKSPDIEDYQ-ER 233 G NTF +S+ ++ I VG EYW + L + N Y + + ++ Sbjct: 143 AIGYNTFYHCPISKQSSQPYSINVGVEYWLHNTLFMLNNYYNLDNIFNPETSLQKCNIHY 202 Query: 234 PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYTPVPLT 293 P +G + + P + + + EQ+ ++ +K+ D + +S ++ Y P+P+ Sbjct: 203 PRSGHQLYIQTKFPRFFEFTGKIKLEQFIYEKKYKKIFNKKNSD-YYLSLDLNYQPIPML 261 >UniRef50_B4VI48 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VI48_9CYAN Length = 908 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 56/195 (28%), Gaps = 26/195 (13%) Query: 99 MATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAI-- 155 +A +A E + L + + LE P+ TP N+ F +G + Sbjct: 22 LAQTEAESETADTLRIKPRLGIGHTSSGGGFDGFTRLEGFVPLLQTPGKNLTFLEGRLFL 81 Query: 156 HRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSA 212 D N+ G+R +S N + G D+ + ++G+G E Sbjct: 82 DNDDANLGGNLILGYRTYSANSHRIWGGYMSYDNRHTGHNTFNQLGLGIESLGTVWDFRV 141 Query: 213 NGYIRASGWKKSPDIEDYQ-----------------ERPANGWDIRAEGYLPAW---PQL 252 NGY+ ++ + E GWD L L Sbjct: 142 NGYLPIGDTRQGVGDAGVRDIFFRRNFLILEQGQNKEAAMGGWDAEVGAKLARIGIDGDL 201 Query: 253 GASLMYEQYYGDEVG 267 Y + Sbjct: 202 RGYGGLYWYDAEGSS 216 >UniRef50_A4GJL9 Possible adhesin/invasin n=2 Tax=Bacteria RepID=A4GJL9_9BACT Length = 304 Score = 118 bits (296), Expect = 2e-25, Method: Composition-based stats. Identities = 41/198 (20%), Positives = 79/198 (39%), Gaps = 8/198 (4%) Query: 82 FLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTAR---VKLNVDKDFSLKDSSLEML 138 S+ + ++ G+A++ + LG+ + + L V + F S L + Sbjct: 29 ISSASSLENRVTSYFNGLASSLGTS-VSSLLGENSRVKYLDLNLGVQEHFKPTIS-LTNV 86 Query: 139 YPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGND-WMAGVNTFIDHDLSRSHTRI 197 I + + +F Q +++ ++ N+G G R +D + G+N F D+ SH R Sbjct: 87 NMISEYGNSAIFNQNSLNLHNNDQTINLGIGHRTLLNDDKVIFGLNLFFDYAFDDSHQRN 146 Query: 198 GVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLM 257 G G E L +N Y SG + D E +GWD+R + +LP + Sbjct: 147 GAGLEVLSSVFDLRSNIYDATSGIEAVSTSRD--EEAMDGWDMRLDYHLPIKTNARLFVG 204 Query: 258 YEQYYGDEVGLFGKDKRQ 275 ++ + ++ Sbjct: 205 LFEFENAAGSYEVEGEKY 222 >UniRef50_A6T1E3 Putative uncharacterized protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T1E3_JANMA Length = 553 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 66/223 (29%), Gaps = 38/223 (17%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD 159 A A A QE Y + L + P+ ++ F + Sbjct: 22 AGAYAQNAGQEKWSTY----LDLEGKVGSKRDIGEANLFIPVVQDARSLYFANVRARMAN 77 Query: 160 DRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGY 215 + ++G G RH W G F+D + S+ + +G E AN Y Sbjct: 78 GGDFEGSLGGGMRHMLETGWNLGAYGFVDRRRTTYNNSYDQATLGVEALGRQFDWRANVY 137 Query: 216 IRASGWKKSPDIEDY---------------QERPANGWDIRAEGYLPAW-----PQLGAS 255 + + +ER G+DI A LP + Q+ A Sbjct: 138 QPFGKKSTTLSSSNTGSVSGGSLFVTTTAQEERALPGFDIEAGWRLPVFDEEDTRQVRAY 197 Query: 256 LMYEQYYGDEVGLFGKDKRQKDPHA----------ISAEVTYT 288 L ++ D + + G R + A ++ Y Sbjct: 198 LAGYRFSDDGLKVQGTRVRAEYVMAEFSDTWKGAQLTIGAEYQ 240 >UniRef50_A4TV20 Putative uncharacterized protein n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4TV20_9PROT Length = 732 Score = 117 bits (294), Expect = 4e-25, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 50/172 (29%), Gaps = 20/172 (11%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWR 171 V ++ + + + + PI +N+LF + ++ + N G G+R Sbjct: 29 QPKWAPSVDVSGKAGETRRIGEVNLFLPIAQDDSNLLFLDLRTSFDNLEQREGNFGLGYR 88 Query: 172 HFSGNDWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 + W G F D S ++I G E N Y+ + Sbjct: 89 AMQDSGWNLGAYAFYDRRRSSEGHYFSQITTGLEALGQDFDARINAYLPIGRKSYEVEDS 148 Query: 229 DY-------------QERPANGWDIRAEGYLPAW---PQLGASLMYEQYYGD 264 ER +G D LP + + Y+ D Sbjct: 149 ARVDLSGGSIQILSGLERAYHGGDAELGWRLPVFATDQDSEIRVYGGGYWFD 200 >UniRef50_D1RA50 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA50_9CHLA Length = 531 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 29/186 (15%), Positives = 52/186 (27%), Gaps = 23/186 (12%) Query: 112 LGKYGTARVKLNVDKD---FSLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIG 167 ++G R + + + P+ F H + R +N+G Sbjct: 266 FSEFGYVRGAYTFGEGIGIRHNYSTLTALFAPLVPYDDYYPFLDLRAHYIKNKRWAANVG 325 Query: 168 FGWRHFS-GNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWKKS 224 G R ++ G N + D+ + + G G E++ + ++ N Y Sbjct: 326 GGLRWRDCMTGFIFGANLYYDYRNTTQTDFNQFGFGLEFFTNCFEMRLNAYFPVGDVTHC 385 Query: 225 PD--IEDY----------QERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKD 272 D DY E G D+ P YY +V Sbjct: 386 EDHVFSDYIGPYYAVCGLTEIAQKGVDLEVGHTFWKCPYFSVFGAIGGYYYTDV----CG 441 Query: 273 KRQKDP 278 R + Sbjct: 442 HRHHNH 447 >UniRef50_A6C087 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C087_9PLAN Length = 849 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 56/194 (28%), Gaps = 27/194 (13%) Query: 101 TAKANQEIQEWLGKYG-------TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG 153 + A + E+ ++ A + + P+ ++ F Sbjct: 17 FSYAQDPVPEYQPEWFQEEDYLYRAYFDFTGQAGGVNDNGQGLLFIPLAQDEESLFFADL 76 Query: 154 AIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLK 209 + DD + + N G +R + W+AG+ F D S + G E Sbjct: 77 RGNIFDDSSAEGNFGLAYRRMVNDQWIAGMYGFYDVRRSQYSNIFRQGSFGFELLSIEWD 136 Query: 210 LSANGYIRASGWKKSPDIEDYQ------------ERPANGWDIRAEGYLPAWPQLG---- 253 NGY+ + ++ + ER G D L ++P+ Sbjct: 137 FRVNGYVPSQKQQRVDSLNTAYLSGNNIVMRAGEERAYWGTDFEVGRLLKSFPESNLDAE 196 Query: 254 ASLMYEQYYGDEVG 267 YY D Sbjct: 197 LRGYVGGYYFDNSA 210 >UniRef50_Q2BR71 Putative uncharacterized protein n=1 Tax=Neptuniibacter caesariensis RepID=Q2BR71_9GAMM Length = 851 Score = 115 bits (288), Expect = 2e-24, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 53/187 (28%), Gaps = 19/187 (10%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 F + + + + + + V + D +L PIY T + +LFT+ Sbjct: 14 FALSITFTEHSLASSDKWDPWLESGVSIGTDNS---SRGEAALLLPIYQTDSGLLFTELR 70 Query: 155 IHRTDDRT-QSNIGFGWRHFSGNDWMAGVNTFID---HDLSRSHTRIGVGAEYWRDYLKL 210 D + + N+ G+R N W G+ D + + G E Sbjct: 71 GKLFDAGSKEGNLALGYRKMINNRWAIGMWVGRDIRTSEYGNRFHQEAWGLEALHPNWDF 130 Query: 211 SANGYIRASG------------WKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMY 258 N Y S I E P +G+D L Sbjct: 131 RINAYNALSSAQAYPQPVEAELIGNQLFITSAAEVPLSGYDFELGHRFSVLSDQDIWLYA 190 Query: 259 EQYYGDE 265 + D+ Sbjct: 191 GAFSFDD 197 >UniRef50_Q0IAR8 Possible Carbamoyl-phosphate synthase L chain n=27 Tax=Cyanobacteria RepID=Q0IAR8_SYNS3 Length = 401 Score = 115 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 37/302 (12%), Positives = 84/302 (27%), Gaps = 50/302 (16%) Query: 31 VLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSD 90 + L + V + A ++ + + +S S Sbjct: 5 LSLGLLASAISVASLPAIAQEDGGAALLRQQRDKLLEQIEQLKQRKEQLEAQIS---GSA 61 Query: 91 ATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLF 150 ++ + N ++ + + + + + P+ ++ F Sbjct: 62 QGKDDAFDLQEISLNDAVK------FNWGFQGALQGAGTPNQAGIGGFLPLSVGENSVWF 115 Query: 151 TQGAI-----HRTDDRTQSNIG-----------FGWRHFSGN-DWMAGVNTFIDHD---- 189 ++ + N G+R +G+ WM G+N D Sbjct: 116 LDALANANFSDYENNSSIINTDVAGTTISTSSRLGYRWLNGDRSWMYGLNAGYDSRPMNT 175 Query: 190 -------------LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPAN 236 S ++ V AE + L+A I ++ + YQ N Sbjct: 176 GGTDTGINVSGTEKSAFFQQVVVNAEAVSNDWNLNAYALIPIGDTEQDLN-SFYQGGALN 234 Query: 237 GWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQKDPH----AISAEVTYTPVPL 292 + + ++ P+L AS+ Y GD G + + ++A V + Sbjct: 235 TYGLDVGYFI--TPELNASVGYYYQNGDLGSADGSGVLGRVAYEISNGLTAGVNISYDEA 292 Query: 293 TQ 294 + Sbjct: 293 FE 294 >UniRef50_D1RBA5 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RBA5_9CHLA Length = 306 Score = 114 bits (284), Expect = 5e-24, Method: Composition-based stats. Identities = 40/226 (17%), Positives = 67/226 (29%), Gaps = 42/226 (18%) Query: 107 EIQEWLGKYGTARVKLNVDKDF---SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRT 162 + EW+ A ++ V K + S P+ D+ + F IH +R Sbjct: 44 QANEWVFPPTLAYLQGVVGKGIGEQNGYASFGIFTIPLLDSNGQLFF-DARIHNLRHERW 102 Query: 163 QSNIGFGWRHFSG-NDWMAGVNTFIDHDLSR-SHTRIGVGAEYWRDYLKLSANGYIRASG 220 +N+G G R + G+N F D+ +R + ++G G E NGY Sbjct: 103 AANVGVGTRIAIPCTNLFFGINFFYDYRRTRHDYHQLGPGLELIHPCWAFRINGYFPICD 162 Query: 221 ---WKKSPDIEDYQ---------ERPANGWDIRAEGYLPAWP---QLGASLMYEQYYG-- 263 K + + +G D+ E L W + Y+ Sbjct: 163 RSLRKHPKVFRFHDNLFAACTQIQNSLSGGDLELETSLRRWDPCLCFDVYIAPGGYFYHI 222 Query: 264 ------------------DEVGLFGKDKRQKDPHAISAEVTYTPVP 291 D +GL + V Y +P Sbjct: 223 RHHRDITGGRLRIGAVLFDYLGLEVRGSYDHYYKGTVQGVAYVEIP 268 >UniRef50_Q6M9Z6 Putative uncharacterized protein n=1 Tax=Candidatus Protochlamydia amoebophila UWE25 RepID=Q6M9Z6_PARUW Length = 361 Score = 113 bits (283), Expect = 7e-24, Method: Composition-based stats. Identities = 40/184 (21%), Positives = 69/184 (37%), Gaps = 26/184 (14%) Query: 120 VKLNVDKDFSL-----KDSSLEMLYPIYDTPTNM-LFTQGAIHRTDDR-TQSNIGFGWRH 172 + LN SL + M++P + + +F G D ++G G RH Sbjct: 80 LNLNYTFGKSLGCQKSYGTFGGMIFPFFSSCRPFQIFLDGKAFLFDHGKWGGSVGIGLRH 139 Query: 173 FSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWK-------- 222 FS N WM G+N + D+ ++G+G E D ++ NGY+ + + Sbjct: 140 FSYNGWMVGLNGYYDYRRFNGWDLNQLGLGVELLGDCVEFRVNGYLPVNKNRWDQCCLFN 199 Query: 223 -KSPDIEDYQER--PANGWDIRAEGYL--PAWPQ-LGASLMYEQYYGD---EVGLFGKDK 273 +ER +G D +L P+ Q +G + YY + F D+ Sbjct: 200 YSGSYFATLRERGYVWSGLDTEIGTWLVKPSCCQDIGLYVAAGPYYYRRSHDQDFFFHDQ 259 Query: 274 RQKD 277 + Sbjct: 260 KHHT 263 >UniRef50_A6FJE0 Putative invasin n=1 Tax=Moritella sp. PE36 RepID=A6FJE0_9GAMM Length = 322 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 41/149 (27%), Positives = 67/149 (44%), Gaps = 14/149 (9%) Query: 149 LFTQGAIHRTDDRTQSNIGFGWRHFSGNDWM-AGVNTFIDHDLSRSHTRIGVGAEYWRDY 207 L Q I ++ + G G + + GVN F D +++ + R+ +G++Y Sbjct: 124 LVWQANIDYKNEDILISNGIGI--LPEDSLIGVGVNAFWDVEMNSGNHRLSLGSKYDDPN 181 Query: 208 --LKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 LS+N Y SG D+ N DIRAEG + Q +SL E ++GD+ Sbjct: 182 YIFNLSSNIYFPLSGKGSEDDL-------VNSIDIRAEGAITPTVQFHSSL--EFFFGDD 232 Query: 266 VGLFGKDKRQKDPHAISAEVTYTPVPLTQ 294 + + + H +A + YTP+PL Q Sbjct: 233 IQINDDYDPTNNSHKFTAGLDYTPIPLLQ 261 >UniRef50_C6MZT8 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZT8_9GAMM Length = 785 Score = 111 bits (278), Expect = 2e-23, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 65/205 (31%), Gaps = 33/205 (16%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWR 171 G R LNV + L P+ ML+ GA+ T T +G G+R Sbjct: 35 WGGPWKPRQTLNVQ-GGHGMQDYYDALLPLSGNAERMLYANGALAATHHETGGELGLGYR 93 Query: 172 H-FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS-------- 219 H N+++ G + ++ +G E++ + A+ Y+ S Sbjct: 94 HIILNNEYVIGGFALMGRYQTNYHNMFNQLTLGTEFFGSIWEGRAHLYLPVSRRTKFVRS 153 Query: 220 --------GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGK 271 G K E G D+ +P P+L YY + +G K Sbjct: 154 RSEGLSFQGHKLFGIQTTTYEHAEGGADVEIGHVIPGIPKLRGFAG---YYNNGLGNEHK 210 Query: 272 D---------KRQKDPHAISAEVTY 287 + R + + +Y Sbjct: 211 NINGGYGRFEYRYNNHFTFTLGDSY 235 >UniRef50_Q2GDV5 Putative uncharacterized protein n=2 Tax=Neorickettsia RepID=Q2GDV5_NEOSM Length = 696 Score = 109 bits (273), Expect = 9e-23, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 67/244 (27%), Gaps = 18/244 (7%) Query: 47 AQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQ 106 H + +A A + + + + N Sbjct: 95 TPHDSRGDSLQSAIQAGKSQGRVSELARNLPQAERSTLNAYRVNVFAPE-KVVTQSDLNN 153 Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQ-SN 165 + +G T + + ++ S L P+ N+++ D + + Sbjct: 154 TSRHTVGARFTVTNEFSDSNGGAVSMSEFGALLPLLSKVDNLIYIDLKSKLYDAKEGEVS 213 Query: 166 IGFGWRHFSGNDWMAGVNTFIDHDL-SRSHTRI-GVGAEYWRDYLKLSANGY--IRASGW 221 G +R G+N F D + R +G E + L+ N Y + + Sbjct: 214 TGIVFRRQMSPLLTGGINVFTDVRFLPEGNYRWYSLGGEIFFKSFSLNGNYYRSNKKTTI 273 Query: 222 KKSPDIEDY-----------QERPA-NGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLF 269 E + ER A NG+D+ L + + S + + F Sbjct: 274 SSVKSFEFHDPDPGKAVIVLDERAAGNGYDLGLGLTLNKYINIHGSAFFFYSPYNTEEKF 333 Query: 270 GKDK 273 + Sbjct: 334 SGYR 337 >UniRef50_B0C4D7 Putative uncharacterized protein n=5 Tax=root RepID=B0C4D7_ACAM1 Length = 3597 Score = 109 bits (272), Expect = 1e-22, Method: Composition-based stats. Identities = 40/266 (15%), Positives = 75/266 (28%), Gaps = 42/266 (15%) Query: 33 FPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDAT 92 F + T A ++ G T + N T + SD + Sbjct: 148 FTASPPRTLAEAGWTTAPQVVAINKGTTPSNLPAATSHRLVQAEPNVPTDTKTGEKSDTS 207 Query: 93 RNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQ 152 + +A+ + + + + + F + L P + + F Sbjct: 208 NDT-----NTEADTSTNLGIPYFVDTEFRGSTRRQFGGINLRL----PFWQDDQSFAFAD 258 Query: 153 GAIHRTDDRT-QSNIGFGWRHFSG----NDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 + T N+G +R N W+ G + F D S + + + +GAE Sbjct: 259 VHFEGGSNETFLGNLGLAYRRILNTSNENPWILGTHAFYDSKRSENGFQYHQGSLGAELV 318 Query: 205 RDYLKLSANGYIRAS-----GWKKSPDIEDYQ--------------------ERPANGWD 239 + NGY+ S G + + Q ER G+D Sbjct: 319 NKKFEFRVNGYLPGSNPNVVGQRTINGVLGIQPRANGLGTNIVQQTLTLEARERALAGFD 378 Query: 240 IRAEGYLPAWPQLGASLMYEQYYGDE 265 A ++ L ++ D Sbjct: 379 FEAGHRHHFNDKVSLGLFGGYFFFDS 404 >UniRef50_C1ZFX4 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFX4_PLALI Length = 1567 Score = 108 bits (270), Expect = 2e-22, Method: Composition-based stats. Identities = 33/182 (18%), Positives = 57/182 (31%), Gaps = 20/182 (10%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-N 165 + E + + +S+ P + +++FT T+ N Sbjct: 82 SVDEIFNPIFRVDARGGQLYGYDEGYTSVGGFLPFFRDENSLIFTDIRGLMTNGGKGGAN 141 Query: 166 IGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R F D + GV+ + D D + GV E YL NGY+ + Sbjct: 142 VGVGYRQFVPELDRIFGVSGWYDFDNGHREAFNQFGVSFESIGRYLDWRVNGYLPVEDNE 201 Query: 223 KSPDI----EDYQER------------PANGWDIRAEGYLPAWPQLGASLMYEQYYGDEV 266 + + +Q G+D G P + G S YY Sbjct: 202 EISNQILGAAGFQNNFILLNRGRSVDSAYKGFDTEIGGPFPILGRYGMSGYVGMYYYANT 261 Query: 267 GL 268 + Sbjct: 262 DV 263 >UniRef50_A8PQA2 Putative uncharacterized protein n=2 Tax=Rickettsiella grylli RepID=A8PQA2_9COXI Length = 642 Score = 107 bits (268), Expect = 4e-22, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 61/201 (30%), Gaps = 44/201 (21%) Query: 129 SLKDSSLEMLYPIYDTPTNMLFTQGAIHR-TDDRTQSNIGFGWRHFSGNDWMAGVNTFID 187 + ++P+ + L+ A+ TD++ Q ++G G+R + + G F Sbjct: 43 DYTVGQADAMFPLSGDMSRNLYVDPALSYGTDNQNQFDVGLGYRWITNQAAIVGGYFFGG 102 Query: 188 HDLSRSHTRIGV---GAEYWRDYLKLSANGYIRASGWKKSPD------IEDYQE------ 232 + ++ R+ + G E + N YI + + E Sbjct: 103 YSRVDNNARLWIANPGIEAFGSRWDAHLNAYIPMGDRHYTAGTEIVHFFTGHSEFGRVFL 162 Query: 233 ---RPANGWDIRAEGYLPAWPQ--------------------LGASLMYEQYYGDEVGLF 269 +G DI+A L +P G + E + V L Sbjct: 163 MHQYAGSGADIKAGYQL--FPHSSLKGYLGSYYFSPAETNNVWGGAAGLEYWLTQGVKLI 220 Query: 270 GK---DKRQKDPHAISAEVTY 287 G D +A + + Sbjct: 221 GSYSYDNLHHSTYAFGIGLEW 241 >UniRef50_C1ZN12 Leishmanolysin n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZN12_PLALI Length = 2615 Score = 105 bits (263), Expect = 1e-21, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 59/188 (31%), Gaps = 26/188 (13%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQS-NIGFGWR 171 G Y R + N + + + L P+ + ++ Q + TD N+G R Sbjct: 45 GTYFDVRNQSNSGVGYQHGFTQIGALTPLLNDGQFLIAPQARLLITDTSKIGVNVGLIGR 104 Query: 172 -HFSGNDWMAGVNTFIDHD---LSRSHTRIGVGAEYWRDYLKLSANGYIRASG------- 220 + +G D + G N + D+D S +++IG G E L L AN Y+ Sbjct: 105 VYDAGRDRIWGANVYYDNDETTYSNRYSQIGFGFESLGQNLDLRANAYLPTGSSDKVIGP 164 Query: 221 ---------WKKSPDIEDYQ--ERPANGWDIRAEGYLPAWPQLG-ASLMYEQYYGDEVGL 268 + E G D +P + Y+ D Sbjct: 165 NGLSNTLFYTGNQLNFTGSYLSEEALRGADFELG--IPVTQNMSWLRAYGGGYFYDATQN 222 Query: 269 FGKDKRQK 276 R + Sbjct: 223 NVSGVRGR 230 >UniRef50_A6CCK3 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK3_9PLAN Length = 967 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 55/206 (26%), Gaps = 36/206 (17%) Query: 98 GMATAKANQEIQEWLG---KYGTARVKLNVDK------DFSLKDSSLEMLYPI--YDTPT 146 G+ + N ++ E G +G R SS + +P+ + Sbjct: 31 GVPQEEINGDVSELFGDSGWFGRYRPHFGYRYEAGDTIGRIGGLSSFDAFFPLLEGEDSD 90 Query: 147 NMLFTQGAIHRTDDR--TQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGA 201 + F + DD SN+G G R + G + D + S ++ G Sbjct: 91 WLTFIDARLLLGDDNHNLGSNVGVGARQYIPEYQRTIGAYIYYDTRDAGYASFDQVSGGI 150 Query: 202 EYWRDYLKLSANGYIRASGWKKSP--------------------DIEDYQERPANGWDIR 241 E D N Y+ + Y + G D+ Sbjct: 151 ETLGDIWDARLNWYVPTGQTRNQYATTHTSGGSYKFVGHYLTGGTFTRYYQAAMKGLDME 210 Query: 242 AEGYLPAWPQLGASLMYEQYYGDEVG 267 A + + Y+ G Sbjct: 211 AGAKFYSNESMDLRAYAGWYHFQAKG 236 >UniRef50_B5JSX1 Putative uncharacterized protein n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JSX1_9GAMM Length = 808 Score = 103 bits (256), Expect = 9e-21, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 49/192 (25%), Gaps = 36/192 (18%) Query: 102 AKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTD-D 160 EW + D S L L P Y + + +D D Sbjct: 19 GSVQAADSEWKP---NTQAYFAAGDDRSY--FGLAGLIPFYQDGKRLGYADLRYSSSDVD 73 Query: 161 RTQSNIGFGWRHFSGND-WMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYI 216 + N+G G+R + N+ + G D S R + ++ GAE D +N Y Sbjct: 74 TDEINLGAGFRSLNENETAIYGFYGSYDLRKSATERDYRQLTFGAELLTDTWDYRSNFYF 133 Query: 217 RASGWKKSPDIEDYQ-------------------------ERPANGWDIRAEGYLPAWPQ 251 + E +G DI L + Sbjct: 134 PTGDDSYQVGNAEDDVTVESEFVGHDLVRTTTTVGGGTIFEEALSGADIEVG-RLLNFDN 192 Query: 252 LGASLMYEQYYG 263 Y+ Sbjct: 193 FEMRGYLGAYHF 204 >UniRef50_A6CCK4 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCK4_9PLAN Length = 786 Score = 102 bits (254), Expect = 1e-20, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 49/179 (27%), Gaps = 28/179 (15%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPI--YDTPTNMLFTQGAIHRTDDRT--QSNIGF 168 +G R + SSL+ P+ + + F + D SN+GF Sbjct: 53 PHFGY-RYQAGDTIGRIGGLSSLDGFLPLLEAEDGNWLTFLDARLLLDDQNQNLGSNVGF 111 Query: 169 GWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP 225 G R + G + D + + +++ G E D N Y+ + Sbjct: 112 GARQYLPEWGRTIGGYVYYDTRDTGTRNFSQVSGGIETLGDLWDARLNWYVPTGSRRSLV 171 Query: 226 D--------------------IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGD 264 + Y + G D+ A + + Y+ Sbjct: 172 GTSHTVGGPSQFIGHYLYGGILTRYYQAAMTGVDMEAGRKILTSDSMDVRAFAGWYHFQ 230 >UniRef50_B6INS3 Putative uncharacterized protein n=1 Tax=Rhodospirillum centenum SW RepID=B6INS3_RHOCS Length = 922 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 37/191 (19%), Positives = 58/191 (30%), Gaps = 32/191 (16%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGA 154 G +A A+ + +++ + GT + S+ + P+ D+ F Sbjct: 2 TALGAGSAAADPALMDFVLRPGT-----------DGAEGSIAVAIPLADSDAARTFLDLR 50 Query: 155 IHRTD-DRTQSNIGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKL 210 D DR +NIG G R G + G + D DL ++ V + L L Sbjct: 51 GSIDDADRRVANIGIGHRFRLG-AVVLGGAVYYDRVRTDLESDFSQATVSLDLMTADLDL 109 Query: 211 SANGYIRA----------------SGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGA 254 AN Y SG I +E G+D L A Sbjct: 110 RANYYAPLDDEESVGTTVAGAPRLSGNHIVRSIFQPREVTLKGFDAEVGYRLGAIEGYDV 169 Query: 255 SLMYEQYYGDE 265 Y + Sbjct: 170 RAFAGGYRYTD 180 >UniRef50_UPI0001744E34 hypothetical protein VspiD_17850 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744E34 Length = 1016 Score = 102 bits (253), Expect = 2e-20, Method: Composition-based stats. Identities = 44/305 (14%), Positives = 78/305 (25%), Gaps = 89/305 (29%) Query: 72 VASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLK 131 + S S A G AT + + GT L ++ Sbjct: 13 LKSLVGAVLALSLSGTGIQAGPPDAKGAATIEPSGHPM----YLGTVTAGLKTSDAYTDG 68 Query: 132 DSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQSN-IGFGWRH----------- 172 S+ + P+Y T ++LF + + + ++ +G G+RH Sbjct: 69 HFSI--VAPLYSTLGADATLEGSVLFIEPYVSYGEGGEIASSLGLGFRHLFGSQPLTALS 126 Query: 173 --------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGW 221 F G + F +D + + ++GVG E Y+++ N YI S Sbjct: 127 ANNTAQAGFLDEGVFVGSSVFVDMLDTEANNQFWQLGVGIEAGTRYVEVRGNYYIPLSDK 186 Query: 222 K----------------------------------------------------KSPDIED 229 + + Sbjct: 187 QLAEETRTRETIRNSRSRSTSYLTGVSDPYATGNTIAQDAAFTTRTTTTTYTTTIERLFR 246 Query: 230 YQERPANGWDIRAEGYLPAW-PQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTYT 288 E GWD +P L ++ Y D + + A + Sbjct: 247 RYEEGMEGWDAEVAVLVPGLDRYLDVRVIGGYYSFDNQPFGPQQGGTGNVEGWKAGLELR 306 Query: 289 PVPLT 293 PVP Sbjct: 307 PVPAV 311 >UniRef50_A8PQI7 Putative outer membrane autotransporter barrel domain n=5 Tax=Rickettsiella grylli RepID=A8PQI7_9COXI Length = 1171 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 58/197 (29%), Gaps = 38/197 (19%) Query: 118 ARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGN 176 AR NV + + P+ + + A+ + ++G G+R Sbjct: 34 ARFSGNVYGSTKYVVGQADAMLPLVGDAQHNFYIDPALTSGSNWEGHGDLGLGYRWIQNG 93 Query: 177 DWMAGVNTFIDHDLSRSHTRIG---VGAEYWRDYLKLSANGYI----------------R 217 + G F +++ ++ RI G E NGY R Sbjct: 94 SAILGGYLFGEYNRMDNNVRIWTMNPGIEALGSRWDAHLNGYFVMDNRSKVVGTDLEFVR 153 Query: 218 ASGWKKSPDIEDYQERPANGWDIRAEGYL-PAWPQ-----------------LGASLMYE 259 G ++ D + NG D++ L P P LG ++ E Sbjct: 154 FRGHSAVYNLFDVTQNVGNGGDVKLGYQLFPKTPLKAFVGSYFFSPAETKNILGGAVGLE 213 Query: 260 QYYGDEVGLFGKDKRQK 276 + V +F K Sbjct: 214 YWANRNVKVFASYTYDK 230 >UniRef50_Q8YK40 All8078 protein n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YK40_ANASP Length = 1487 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 63/212 (29%), Gaps = 30/212 (14%) Query: 100 ATAKANQEIQEWLGKYGTARVKLNVDKDFSLK--DSSLEMLYPIYD-TPTNMLFTQGAIH 156 A+ + Q + T RV + + + SS E P+ ++ F QG + Sbjct: 19 ASTVSAQTPASTTAQVFTPRVGVRYTTEGAGYESFSSFEGFLPVLQIPGNSLTFLQGKLL 78 Query: 157 RTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSA 212 +D + NI G R FS + + G + + ++G+G E Sbjct: 79 LDNDSNLATNILLGHRIFSEEANRVIGGYISYSTRDTGKSNFDQLGLGFETLG-VWDFRF 137 Query: 213 NGYIRASGWKKSPDIED---------------YQERPANGWDIRAEGYLPAWPQLGASLM 257 N Y+ +G + + + + E +G D L + Sbjct: 138 NAYLPLNGSENQVEQANLPFFQGDSLMVQRSRFLEVAMSGVDAEVGTRLASLGSGDLRGY 197 Query: 258 YEQYYGDEVGLFGKDKRQKDPHAISAEVTYTP 289 YY ++ + P Sbjct: 198 AGVYYY-------SGGESREAFGWKTRIEARP 222 >UniRef50_A8PN48 Putative uncharacterized protein n=3 Tax=Rickettsiella grylli RepID=A8PN48_9COXI Length = 607 Score = 101 bits (252), Expect = 3e-20, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 60/224 (26%), Gaps = 51/224 (22%) Query: 107 EIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG-AIHRTDDRTQSN 165 + +E L +A V +++ + + L+ + TD + Sbjct: 28 QAREPLPPRFSAEAYTGV-----YTVGRADLMVSLDGDGQHNLYVDPQGGYGTDQEWYGD 82 Query: 166 IGFGWRHFSGNDWMAGVNTFIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWK 222 +G G+R S + + G F H + S G E N YI +G Sbjct: 83 VGLGYRWISNDAAIVGWYVFAGHSCVENSSGFWITNPGVEIMGSRWDARINAYIPVAGRS 142 Query: 223 K------------------------SPDIEDYQERPANGWDIRAEGYLPA---------- 248 S + ++ NG D R L + Sbjct: 143 DDLGGIESTTAGPSFFTGHSELRTVSFTAFNEVQQVGNGADARVGYQLFSGVPLKAVVGA 202 Query: 249 ----WPQL----GASLMYEQYYGDEVGLFGKDKRQKDPHAISAE 284 P G + ++ D V +F + H+ Sbjct: 203 YFFEIPHAENVRGGGAGVDYWFDDYVRVFARYNYDNRQHSQVVG 246 >UniRef50_D1R7A8 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A8_9CHLA Length = 225 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 41/132 (31%), Gaps = 17/132 (12%) Query: 149 LFTQG-AIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRS---HTRIGVGAEYW 204 +F D + ++ G G R + + G+NT+ D+ R ++GVG E Sbjct: 8 VFIDLDGYRFNDGKWGASTGIGIRKELSDGCVLGLNTYYDYLRGRGRFSFHQVGVGFEML 67 Query: 205 RDYLKLSANGYIRASGWKKSPDIEDY-------------QERPANGWDIRAEGYLPAWPQ 251 D + NGY+ S S + E G D L + Sbjct: 68 SDCFDVRINGYLPVSEKVHSHQCLSFHYSGTDFHASRCKLEYAYGGLDAEIGKPLLTYYD 127 Query: 252 LGASLMYEQYYG 263 YY Sbjct: 128 FDLYGAVGPYYF 139 >UniRef50_D1KD13 Putative uncharacterized protein n=1 Tax=uncultured SUP05 cluster bacterium RepID=D1KD13_9GAMM Length = 157 Score = 100 bits (249), Expect = 6e-20, Method: Composition-based stats. Identities = 34/137 (24%), Positives = 63/137 (45%), Gaps = 10/137 (7%) Query: 90 DATRNFITGMATAKANQEIQEWLGKYGTARVKLNV----DKDFSLKDSSLEMLYPIYDTP 145 DA +N + + N + ++ ++G +++V + S + + L P+ + Sbjct: 21 DAVKNKANDVVESVVNSSLNDFANQFGEGNTEISVRKVKGDEASYSIITTQPLAPLSEDG 80 Query: 146 TNMLFTQGAI----HRTDDRTQSNIGFGWRHFSG-NDWMAGVNTFIDHDLSRSHTRIGVG 200 + + F QG++ D RT N+G G R + G+N+F D++ S H R+ +G Sbjct: 81 SRL-FWQGSLGSYDQNGDRRTTLNLGLGNRWLIDGEKAIVGINSFYDYEFSAKHKRMSLG 139 Query: 201 AEYWRDYLKLSANGYIR 217 EY R +LS N Y Sbjct: 140 GEYKRSNAELSVNKYWG 156 >UniRef50_A7HQN0 Parallel beta-helix repeat n=2 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HQN0_PARL1 Length = 675 Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 56/180 (31%), Gaps = 28/180 (15%) Query: 112 LGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR-TQSNIGFGW 170 G + A L+ ++D P++ + ++LF + T+ N G+ Sbjct: 31 WGPWIEAGGFLSTERDR----GEATAFMPLFQSGESLLFADVKGKLFSEGVTEGNFALGY 86 Query: 171 RHFSGNDWMAGVNTFIDHDLS---RSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDI 227 R + D G+ D S + + G E NG++ + K +P + Sbjct: 87 RRMTAWDVNLGLWGGYDIRESVSGNTFDQAAFGIEALAADYDFRLNGFVPLADGKAAPGM 146 Query: 228 EDYQ------------ERPANGWDIRAEGYLPAWPQLG-------ASLMYEQYYGDEVGL 268 + E G++ LP LG L Y D+ L Sbjct: 147 ARVELSGSQILLTGGRELVLGGFEGEVGWRLP-LEALGADRERHEFRLYAGGYRFDDSDL 205 >UniRef50_C0B2E6 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0B2E6_9ENTR Length = 156 Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 39/144 (27%), Positives = 69/144 (47%), Gaps = 6/144 (4%) Query: 8 HKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNN 67 + ++ I+ Q FP+A++ TP + + A + +LS +NN Sbjct: 4 MNNTLLDKLRKKKIFSYFIIASQFSFPIALSLTPTIQSYAATVEENKLSTNT-----ENN 58 Query: 68 VEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKD 127 + +A + GT LSS DA ++ A +K N+EI+ W +YG A++ L VDK Sbjct: 59 NGRWLAQQTSQLGTILSSDNTHDAASQYLINQANSKVNREIENWFNQYGKAQINLGVDKH 118 Query: 128 FSLKDSSLEMLYPIYDTPTNMLFT 151 F+LK L+ L+ ++ T + + Sbjct: 119 FTLKTQKLKSLF-LFTKQTIIFYL 141 >UniRef50_A8ZLP1 Putative uncharacterized protein n=2 Tax=Acaryochloris marina MBIC11017 RepID=A8ZLP1_ACAM1 Length = 1022 Score = 98.6 bits (244), Expect = 2e-19, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 62/203 (30%), Gaps = 40/203 (19%) Query: 95 FITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTN-MLFTQG 153 + +A+ + ++G + N + + LE P++ P + F +G Sbjct: 24 IAEPQPSTQASDL--RFSPRFG---IGANSPSSGTNTTTRLETFVPVWQKPGRALTFFEG 78 Query: 154 AIHRTDDRT-QSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGVGAEYWRDYLK 209 + D NI FG+R +S + + G + D + + ++ +G E + Sbjct: 79 RLLLDDQGNPGGNILFGFRQYSDDLKRIFGGHLGFDIRNTDNNTFQQLSLGIESLGKDVD 138 Query: 210 LSANGYIRASGWKKSPDIEDYQ-----------------------------ERPANGWDI 240 L NGY ++ ++ E G D Sbjct: 139 LHLNGYWPVGSTRRQTRQRIFEVLQLNGDPRFTGNILLLDLLRRRLITRQFEEALAGVDF 198 Query: 241 RAEGYLPAWPQLG-ASLMYEQYY 262 L ++ G Y+ Sbjct: 199 EVGKQLLSFKNGGDLRAYLGPYF 221 >UniRef50_A3ZRN5 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZRN5_9PLAN Length = 792 Score = 97.1 bits (240), Expect = 7e-19, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 62/210 (29%), Gaps = 29/210 (13%) Query: 84 SSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEML---YP 140 S+Q D + + + A+ G+Y R+ + + + D S P Sbjct: 24 SAQQAGDDIQPGLISGTSTFASPYANGQGGEYF-PRISVQHRTEGAGYDYSFTDFRAWVP 82 Query: 141 IYD--TPTNMLFTQGAIHRTDDRTQS-NIGFGWRHFSGN-DWMAGVNTFIDHDLSRSHT- 195 +Y+ ++ F GA +D+ N G R +S N G D+ + + T Sbjct: 83 LYESYDSKSLTFFDGAFLLANDQNVGMNAVVGQRFYSDNYGRTFGGYVGYDNRDTGNQTV 142 Query: 196 -RIGVGAEYWRDYLKLSANGYIRAS-----------------GWKKSPDIEDYQERPANG 237 ++ G E NGY + G+ E G Sbjct: 143 GQVVTGFESLGRI-DFRVNGYFPTTSDPTMTGQTGFFDPTYVGYNIQLSQLTQYEVAMKG 201 Query: 238 WDIRAEGYLPAWPQLGASLMYEQYYGDEVG 267 +D G LP Y G Sbjct: 202 FDAEIGGALPHVGDY-LRAYLGAYNFQGSG 230 >UniRef50_C1ZLE3 Putative uncharacterized protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZLE3_PLALI Length = 1304 Score = 94.8 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 52/156 (33%), Gaps = 25/156 (16%) Query: 138 LYPIYDTPTNMLFTQGAIHRTD-DRTQSNIGFGWRHFSGN-DWMAGVNTFIDHDLSRS-- 193 L P MLF R++ DR +N+G G R++ N D + G N + D+D + Sbjct: 95 LMPYGFIENFMLFGDLRGFRSNSDRYGANVGGGARYYLENYDRIIGANAYFDYDETSGAP 154 Query: 194 HTRIGVGAEYWRDYLKLSANGYIRASGWKK---------SPDIEDY-----QER----PA 235 +G G E Y N Y ++ S +D +ER Sbjct: 155 FRDVGFGIETLGRYWDARVNAYFPVGPTEQLLSQSVVTGSQRFQDTRILFDRERIVGLAP 214 Query: 236 NGWDIRAEGYL---PAWPQLGASLMYEQYYGDEVGL 268 G+D L + + Y+ L Sbjct: 215 KGFDAEFGMPLFFNSFFERHDLRAFGGFYHYQSENL 250 >UniRef50_C4FSN7 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FSN7_9FIRM Length = 420 Score = 94.8 bits (234), Expect = 3e-18, Method: Composition-based stats. Identities = 42/217 (19%), Positives = 69/217 (31%), Gaps = 43/217 (19%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNM----LFTQGAIHRTDD--RTQSNIGFG 169 G K++ D ++ +SS P Y + + D +IG G Sbjct: 127 GNGGEKISSDAYWNGGESSYIGDDPKYKAAARLAQQPSYLDKGETVQHDSLGVVGSIGAG 186 Query: 170 WRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP--- 225 +R S N+ G+NTF D+ +R+G+G EY K+SAN Y S K P Sbjct: 187 YRRLSKNEHAYVGINTFYDYAFRDKLSRVGIGLEYVAGLNKISANVYHGLSEKKTKPYYF 246 Query: 226 ----------DIEDY---------------QERPANGWDIRAEGYLPAWPQLGASLMYEQ 260 D Y E +G+++R + + Sbjct: 247 ENSLVIVPRADEFHYPEDGYPNGFTKIRYAYENVLDGYNVRYTRDYKNARWISTYVEGYH 306 Query: 261 YYGDE-----VGLFG-KDKRQKDPHAISAE--VTYTP 289 + V +F + K + + TP Sbjct: 307 WKTKSPSEHPVDMFYLNQHKWKSISGLKLGATLNITP 343 >UniRef50_A6C8X2 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8X2_9PLAN Length = 1606 Score = 94.4 bits (233), Expect = 4e-18, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 57/178 (32%), Gaps = 22/178 (12%) Query: 110 EWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTP-TNMLFTQGAIHRTDDRTQS-NIG 167 + + + S+L +L P P +MLF TD N+G Sbjct: 119 DEFHPLFRLDKGIGGGIGYDDGYSNLGVLMPFTINPEQSMLFLDLRAMVTDQGAGGVNLG 178 Query: 168 FGWRHFSGN-DWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIR------- 217 GWR ++ N D + V + D+D + ++G+ E YL NGY Sbjct: 179 AGWRAYNDNLDKIFTVAGWYDYDDGHYQDYHQLGLSGEVIGQYLTTRVNGYFPINNNEII 238 Query: 218 ----ASGWKKSPDIEDYQERPAN------GWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 SG Y R G D G LP + G YY + Sbjct: 239 ISNNLSGSAYFQTDRIYLNRTRRSESSYGGVDAEVGGPLPVLGKFGIDGYVGGYYYNS 296 >UniRef50_UPI0001746965 hypothetical protein VspiD_26965 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746965 Length = 1076 Score = 94.4 bits (233), Expect = 5e-18, Method: Composition-based stats. Identities = 38/258 (14%), Positives = 67/258 (25%), Gaps = 83/258 (32%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + D + ++ P++ + +LF + + + ++G G+ Sbjct: 50 TVNAGVKSSDAYTDGNFSIVAPVWSSLGAEGTLSGGVLFLEPYTSYGEGGEIAASLGLGY 109 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 R+ F G N F +D + ++GVG E+ YL Sbjct: 110 RYLFGAQPISALTRKDAPQAGFFEEGVFVGTNVFIDMLDTEADNQFWQLGVGVEFGNRYL 169 Query: 209 KLSANGYIRASGWKKSPDIED--------------------------------------- 229 + N YI S + + + Sbjct: 170 EFRGNYYIPLSDKQVAEQFKTREVLQSSSTSRSQSVTPLNNPYATGYTIAQDALYTTRAT 229 Query: 230 -------------YQERPANGWDIRAEGYLPAWPQ-LGASLMYEQYYGDEVGLFGKDKRQ 275 E GWD A +P + L+ Y D + Sbjct: 230 TTTRTTTIDRLFSRYEEGMEGWDAEAAFLVPGLDKYFDLRLIGGYYSFDNQPFGPQTGGT 289 Query: 276 KDPHAISAEVTYTPVPLT 293 + A V PVP Sbjct: 290 GNVEGWKAGVEIRPVPAI 307 >UniRef50_B4VZV2 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VZV2_9CYAN Length = 1059 Score = 94.0 bits (232), Expect = 6e-18, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 58/209 (27%), Gaps = 24/209 (11%) Query: 71 NVASFAANAGTFLSSQPDSDATRNFITGMATA--KANQEIQEWLGKYGTARVKLNV-DKD 127 +++ A G +S S + + L + Sbjct: 2 KISAQALCVGLLVSGGLTSPVIAQTLESETDTPRSLTEGTATDLRVLPRIGGQFTSEGAG 61 Query: 128 FSLKDSSLEMLYPIYDTPTN-MLFTQGAIHRTDDRT-QSNIGFGWRHFSG-NDWMAGVNT 184 + SLE PI P + + F +G + D T I G R ++ + + G Sbjct: 62 YQDPFFSLEGFVPITQNPGSTVTFLEGQLRLFTDSTMGGTILLGQRFYNSTQNRILGGYL 121 Query: 185 FIDHDLSRS--HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPD---------------- 226 D + + +IG G E D L N Y+ + D Sbjct: 122 SYDTRDTGNSLFHQIGAGFERLGDDWDLRVNAYLPVGERRPEVDESFSLRGFQENNLLLN 181 Query: 227 IEDYQERPANGWDIRAEGYLPAWPQLGAS 255 E G+DI A G L Sbjct: 182 HRQRFEAAMAGFDIEAGGRLLRLGAGDLR 210 >UniRef50_B7K1T2 Parallel beta-helix repeat protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K1T2_CYAP8 Length = 1873 Score = 93.2 bits (230), Expect = 9e-18, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 59/191 (30%), Gaps = 13/191 (6%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLG--K 114 + + + + + T P+ T ++ + + Sbjct: 38 TEPEQLNELSPKLEGIETIEEAGWTEKPISPNGTNPSETPTNETDSQGTPSPETPQPAIR 97 Query: 115 YGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAIHRTDDRTQ---SNI 166 Y T RV + ++ + E +PI + FT+G + + + +N Sbjct: 98 YFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLASTHDAEDIRANF 157 Query: 167 GFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWKK 223 G R +S + + + G D + + GVG E D+ N YI ++ Sbjct: 158 LVGHRLYSQDHNRVYGAYIGYDLRDTKYNKFNQFGVGIETLGDFWDARFNAYIPLGTTQQ 217 Query: 224 SPDIEDYQERP 234 + P Sbjct: 218 QIGQTNTALNP 228 >UniRef50_C4FS47 Putative uncharacterized protein n=1 Tax=Veillonella dispar ATCC 17748 RepID=C4FS47_9FIRM Length = 373 Score = 93.2 bits (230), Expect = 1e-17, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 59/166 (35%), Gaps = 39/166 (23%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 T +N+G G+R S ++ GVNTF DH S+ + RI G EY ++ AN Y + Sbjct: 141 GTVANVGLGYRVLSKHEHAYVGVNTFYDHSFSKKYNRISGGLEYVSGLNEVRANIYKGLN 200 Query: 220 GWKKSPDI----EDYQE----------------RPANGWDIRAEGYLPAWPQLGASLMYE 259 K P E Y E + +G+D+ A + Sbjct: 201 STKSEPYNVPLYEGYFEFLLDGGPAGYTVYKSQKALSGYDVSYARTFKNARWARAYVGAY 260 Query: 260 QYYGDEVGLFGKD-----------------KRQKDPHAISAEVTYT 288 + G V G+ Q PH +S +V YT Sbjct: 261 HWNGLGVKTHGEGPALALNVGKSHGWQAGTTLQLTPH-VSLDVGYT 305 >UniRef50_C7QR03 Putative uncharacterized protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QR03_CYAP0 Length = 1985 Score = 91.7 bits (226), Expect = 3e-17, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 57/191 (29%), Gaps = 13/191 (6%) Query: 57 MGNTTVTADNNVEKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLG--K 114 + + + + + T P+ T + + + Sbjct: 38 TEPEQLNELSPKPEGIETLEEAGWTEEPISPNGTNPSETPTNETDSPGTPSPETSQPAIR 97 Query: 115 YGTARVKLNVDKD----FSLKDSSLEMLYPIYD-TPTNMLFTQGAIHRTDDRTQ---SNI 166 Y T RV + ++ + E +PI + FT+G + + + +N Sbjct: 98 YFTPRVGVKYTSGPEVGYNSSFFAFEAFFPILQIDENQLTFTEGRVLASTHDAEDIRANF 157 Query: 167 GFGWRHFSGN-DWMAGVNTFIDHDLS--RSHTRIGVGAEYWRDYLKLSANGYIRASGWKK 223 G R +S + D + G D + + GVG E + N YI ++ Sbjct: 158 LVGHRLYSQDHDRVYGAYIGYDLRDTKYNKFNQFGVGLETLGSFWDARFNAYIPLGTTQQ 217 Query: 224 SPDIEDYQERP 234 + P Sbjct: 218 QIGQTNTDLNP 228 >UniRef50_UPI000174607D hypothetical protein VspiD_10245 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI000174607D Length = 975 Score = 91.3 bits (225), Expect = 4e-17, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 67/258 (25%), Gaps = 83/258 (32%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDT-------PTNMLFTQGAIHRTDDRTQS-NIGFGW 170 V V + + ++ P++ T ++++ + + + ++G GW Sbjct: 91 TVTSGVKTSDVYTEGNFSIVAPVFSTLGADATLSGDVIYLEPYTSSGEGGEIAASLGLGW 150 Query: 171 RH-------------------FSGNDWMAGVNTF---IDHDLSRSHTRIGVGAEYWRDYL 208 RH F + G N F +D + + ++GVG E YL Sbjct: 151 RHLFGSQPVSALTRKDAPQASFLEEGFFVGANLFIDMLDTEANNQFWQLGVGIEAGTRYL 210 Query: 209 KLSANGYIRASGWK---------------------------------------------- 222 ++ N YI S + Sbjct: 211 EVRGNYYIPLSDKQLAEQTRTREILRNSSSRDTTTVSALSDPYATGNTVSQDVSYRTQRT 270 Query: 223 ------KSPDIEDYQERPANGWDIRAEGYLPAWPQ-LGASLMYEQYYGDEVGLFGKDKRQ 275 + E GWD +P + L+ Y D + Sbjct: 271 TTTTTTTIERLFSRYEEGMEGWDTEVAVLVPGLDKYFDLRLIGGYYSFDNQPFGPQTGGT 330 Query: 276 KDPHAISAEVTYTPVPLT 293 + A V PVP Sbjct: 331 GNVEGWKAGVEVRPVPAV 348 >UniRef50_A5GVG9 Uncharacterized conserved secreted protein n=15 Tax=Cyanobacteria RepID=A5GVG9_SYNR3 Length = 349 Score = 90.9 bits (224), Expect = 4e-17, Method: Composition-based stats. Identities = 24/220 (10%), Positives = 56/220 (25%), Gaps = 51/220 (23%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG----------AIHRTD----DR 161 + + + + PI ++ + D D Sbjct: 42 PRLGFQGQTQGAGTPNEVGVGGFLPIAVGDNSVFYADVEVNANLADFSGYSSIDNTQVDG 101 Query: 162 TQSNIG--FGWRHFSGN-DWMAGVNTFIDHD----------------------------- 189 + G+R + + WM G+N D Sbjct: 102 VTVSTSSRLGYRWLNDDRSWMFGINAGYDSRPMNTGDAKPWHPIKRRYQHLYLPSIKYAK 161 Query: 190 --LSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLP 247 S ++ E A + ++ + YQ + + + ++ Sbjct: 162 NPRSVFFQQVAAEVEAVSPTWNFGAYALVPFGDTEQRLN-SHYQGGALDTYGLDVGYFI- 219 Query: 248 AWPQLGASLMYEQYYGDEVGLFGKDKRQKDPHAISAEVTY 287 P++ AS+ Y GD+ + + A++ V + Sbjct: 220 -TPEINASVGYYYQQGDDSAGNSSGVKGRLAFAVAKGVEF 258 >UniRef50_Q05XC6 Possible Carbamoyl-phosphate synthase L chain n=1 Tax=Synechococcus sp. RS9916 RepID=Q05XC6_9SYNE Length = 404 Score = 90.1 bits (222), Expect = 9e-17, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 61/220 (27%), Gaps = 44/220 (20%) Query: 116 GTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG--AIHRTD------------DR 161 V + + ++ PI T ++ F D + Sbjct: 35 FRWNVFSKSQGAGTPNQAGGQVFIPISTTRKSIFFLDALATADFGDALSTSSIVNTPVEG 94 Query: 162 TQSNIG--FGWRHFSGNDWM-AGVNTFIDHD-LSRS-------------------HTRIG 198 T + G+R + N + GVN D +S +I Sbjct: 95 TTFSTSSRIGYRWLNDNGDILFGVNAGYDSRPISTGIPSRYSWAPRSLLQPQDVFFQQIA 154 Query: 199 VGAEYWRDYLKLSANGYIRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMY 258 GAE + + + + + ++ Y + + I E L AS+ Y Sbjct: 155 FGAELVTNNIAIKPYALVPVGKTEDVLNL-FYSGGALDTYGIDIEHSFDEL--LTASIGY 211 Query: 259 EQYYGDEVGLFGKDKRQK---DPHA-ISAEVTYTPVPLTQ 294 GD G + +P S V YT P + Sbjct: 212 YYQQGDLTYANGSGLKSTIAINPAGSFSMGVEYTYDPAFE 251 >UniRef50_Q1RPI2 Putative uncharacterized protein n=10 Tax=Escherichia RepID=Q1RPI2_ECOLX Length = 268 Score = 89.8 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 55/126 (43%), Gaps = 18/126 (14%) Query: 9 KQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVTADNNV 68 + ++ R ++ A ++ + N+ Sbjct: 138 LRKLNQFRTFVR---NVRPGDELDV---------------QAQVSEKNLTPPPGNSSGNL 179 Query: 69 EKNVASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDF 128 E+ +AS + G+ L+ +S+ N G A+++A+ + +WL ++GTAR+ L VD+DF Sbjct: 180 EQQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDF 239 Query: 129 SLKDSS 134 SLK+S Sbjct: 240 SLKNSR 245 >UniRef50_D1RA61 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RA61_9CHLA Length = 188 Score = 79.4 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 52/167 (31%), Gaps = 18/167 (10%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEML----YPIYDTPTNMLFTQGAIH-RTDDRTQSNIG 167 +Y + D S+ L +P+ +F+ H T N G Sbjct: 22 NEYFKTYLSYKGGNDGLGYHSNYASLDLMCFPL-PLEDITIFSDLKGHWLTRHHYAVNAG 80 Query: 168 FGWRHFSGNDWMAGVNTFIDHDLSR--SHTRIGVGAEYWRDYLKLSANGYIRAS-GWKKS 224 G+R + N F DH S + ++G+G E + + +L NG + K++ Sbjct: 81 VGFRKIYAPQTIWDANLFYDHPKSSYDHYNQVGLGLELFHELWELRLNGAVALGHTTKRN 140 Query: 225 PDI---------EDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYY 262 + E ++ + + + YY Sbjct: 141 KTFVYTDVFFVKRHFHEYAFLFVEVELGKKFIFFDNISPFMGIRTYY 187 >UniRef50_B4D818 Parallel beta-helix repeat protein n=2 Tax=cellular organisms RepID=B4D818_9BACT Length = 5429 Score = 77.8 bits (190), Expect = 4e-13, Method: Composition-based stats. Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 4/116 (3%) Query: 119 RVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDD-RTQSNIGFGWRHFSGND 177 RV ++ D SL+ L P+ +L+ + +D +IGFG+RH Sbjct: 74 RVTFGLEFYEHQIDESLDTLVPLATPQNGVLYFNPKLSLSDRLNPSVSIGFGYRHLLKAR 133 Query: 178 WMAGVNTFIDHDLSR-SHT--RIGVGAEYWRDYLKLSANGYIRASGWKKSPDIEDY 230 + T + D + H + GVGAE ++ AN Y+ ++ + Sbjct: 134 RSSSGETSLRSDYTNFDHHVNQFGVGAEVMSRWVDFRANYYLPEQNRRRINTNQTT 189 >UniRef50_C4FS48 Putative uncharacterized protein n=2 Tax=Veillonella dispar ATCC 17748 RepID=C4FS48_9FIRM Length = 421 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 44/146 (30%), Gaps = 30/146 (20%) Query: 161 RTQSNIGFGWRHFSGNDWMA-GVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGYIRAS 219 ++G G+R S N+ GVN F+D + ++ RI G EY ++ AN Y Sbjct: 168 GIIGSVGIGYRRLSRNEHAYVGVNAFVDRAFTGNYNRISGGVEYVNGLNEVYANVYRGLG 227 Query: 220 GWK----------------------------KSPDIEDYQER-PANGWDIRAEGYLPAWP 250 S + Y +G++I Sbjct: 228 DKDLVKGGGGNPYPKRLYPNGYPDTFPYNTIPSENYNTYVGGGVLDGYEIGIVRSFKNAR 287 Query: 251 QLGASLMYEQYYGDEVGLFGKDKRQK 276 A + ++ G+ + + Sbjct: 288 WARAYVNGYRWNGNGFSHKQEYNWGR 313 >UniRef50_A6C500 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C500_9PLAN Length = 1337 Score = 75.5 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 42/142 (29%), Gaps = 24/142 (16%) Query: 144 TPTNMLFTQGAIHRTDDRTQSNIGFG-WRHFSGN-DWMAGVNTFIDHDLSRS--HTRIGV 199 M+F + RT+ +R ++ + D + G + + D D S ++ + Sbjct: 145 DDAGMMFGNFRLWRTNRGNLGGGAGLGYRFYNYDTDRIFGTSFYYDRDDSTDKIFQQLAL 204 Query: 200 GAEYWRDYLKLSANGYIRASGWKKSPDIEDYQ------------------ERPANGWDIR 241 E Y + N Y+ ++ ++E + G+D Sbjct: 205 NVETMGRYWDANGNFYLPIGNREQQLNLEFNDGSQRFSGFNVLYDQTRTIGKSMRGFDAE 264 Query: 242 AEGYLPA--WPQLGASLMYEQY 261 + Q A Y Sbjct: 265 IGVPIWGELAQQFQARAYAGTY 286 >UniRef50_A5GVB4 Uncharacterized conserved secreted protein n=1 Tax=Synechococcus sp. RCC307 RepID=A5GVB4_SYNR3 Length = 394 Score = 75.1 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 26/226 (11%), Positives = 57/226 (25%), Gaps = 79/226 (34%) Query: 113 GKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQG--AIHRTD----------- 159 ++G + + ++ + +P+ + + F + +D Sbjct: 42 PRFG---FQGQTQGAGTPNEAGIGGFFPLSVSENGVFFVDALANANFSDFSGTSSIVDTA 98 Query: 160 -DRTQSNIG--FGWRHFSGN-DWMAGVNTFIDHDLSR----------------------- 192 T + G+R + N WM G+N D Sbjct: 99 VAGTTISTSTRLGYRWLNTNRSWMFGINGGYDSRPMNSGPTVSGIKVGRSTSPTNSATSA 158 Query: 193 ---------------------------SHTR------IGVGAEYWRDYLKLSANGYIRAS 219 + R + E + +A + Sbjct: 159 TGTVSSQESSISARTTNSGSTSIKKNVDNHRSVFYQQVAANIEAVSNSWNFNAYALVPIG 218 Query: 220 GWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDE 265 ++ + Y + + I ++ P+L AS+ Y GDE Sbjct: 219 DTQQRLN-SHYDSAALDKYGIDVGYFI--TPELNASVGYYYQTGDE 261 >UniRef50_C9CT24 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9CT24_9RHOB Length = 771 Score = 72.0 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 38/138 (27%), Gaps = 17/138 (12%) Query: 126 KDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRT-QSNIGFGWRHFSGNDWMAGVNT 184 + + + +P + + R + Q +I R + W GV Sbjct: 28 YHEEGLSTGIALSFPFAIEENRATIARLSYGRDEGHNAQLSIEAMRRMTLAHGWTVGVGV 87 Query: 185 FIDH---DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSP-------------DIE 228 F D D+ +++G+ + R + + N Y+ + + Sbjct: 88 FADSSTDDIGNRFSQVGMSGDLQRGIFQANLNAYLPVGTKSHADARYDALAEMDGTIRFK 147 Query: 229 DYQERPANGWDIRAEGYL 246 + G D Sbjct: 148 GGRSLALRGLDAEVGARF 165 >UniRef50_A9QNP6 Sch_V10 n=5 Tax=Salmonella enterica RepID=A9QNP6_SALET Length = 197 Score = 66.3 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 14/94 (14%), Positives = 28/94 (29%), Gaps = 4/94 (4%) Query: 4 YKTGHKQPRFRYSVLARCVAWANISVQVLFPLAVTFTPVMAARAQHAVQPRLSMGNTTVT 63 + AR ++ P P+ + + Sbjct: 62 LTVPQLKKLNGLRTFARGFDHLQAGDELDVPAV----PLTGGKGDNNRHDARGPFAADRE 117 Query: 64 ADNNVEKNVASFAANAGTFLSSQPDSDATRNFIT 97 ++ + + A+ AG+FL+S PD A +T Sbjct: 118 NEDAQAQQMVGMASQAGSFLASHPDGQAAAGMVT 151 >UniRef50_Q0IBW0 Putative uncharacterized protein n=1 Tax=Synechococcus sp. CC9311 RepID=Q0IBW0_SYNS3 Length = 221 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 41/148 (27%), Gaps = 30/148 (20%) Query: 168 FGWRHFSGN-DWMAGVNTFIDHD----------------LSRSHTRIGVGAEYWRDYLKL 210 G+R + + M G+N D + ++ V E + Sbjct: 63 LGYRWLNRDRSTMYGINAGYDSRPIATGTTTNGIEVFNSQTPFFQQVAVNVELQSNQWGA 122 Query: 211 SANGYIRASGWKKSPD-----IEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYG-- 263 + G I + D + P + + ++M YY Sbjct: 123 NVYGLIPVGKYGYGSDNIATMNSSFAAEPLTTVGLDVNYNISNL----LAVMAGYYYQSC 178 Query: 264 -DEVGLFGKDKRQKDPHA-ISAEVTYTP 289 E +F D A + +++Y P Sbjct: 179 EKEPEIFENDAEGSGVKARLEYDISYQP 206 >UniRef50_Q7V422 Prochlorococcus marinus MIT9313 complete genome n=4 Tax=cellular organisms RepID=Q7V422_PROMM Length = 742 Score = 45.8 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 50/164 (30%), Gaps = 17/164 (10%) Query: 73 ASFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKD 132 A ++ + L ++PD D + N I +G N L Sbjct: 139 AQISSKSEAALLNEPDPDILTLYPR------LNPVIGIGGTIWGN---NSNNSDFEGLIL 189 Query: 133 SSLEMLYPIYDTPTNMLFTQG--AIHRTDDRTQSNIGFGWRHF-SGNDWMAGVNTFIDH- 188 L P+ + + + D + FG+R F N GV H Sbjct: 190 GDLAYFQPLSQNSGSSVLYSLTSSSSNFDKAWGVSQEFGYRWFDPNNQRSNGVMAGYTHW 249 Query: 189 ----DLSRSHTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 S S +++ +G E R+ K +A G + + Sbjct: 250 QGQIKDSCSRSQLSLGVETARNRWKFAAAGGVPVDNCESQFSFA 293 >UniRef50_B6R6H4 Putative uncharacterized protein n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R6H4_9RHOB Length = 274 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 46/158 (29%), Gaps = 14/158 (8%) Query: 117 TARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDR-TQSNIGFGWRHFSG 175 R+ + D + + E++ P++ + F I + + + G G R Sbjct: 21 QPRLIGLFETDRNNTSDAAEVITPVFQDSQGIAFADVRIGNSSEAVYLLSAGGGLRQLVS 80 Query: 176 NDWMAGVNTFIDHDLSRSHTR-----IGVGAEYWRDYLKLSANGYIRASGWKKSP----D 226 + + F+++ + R I G E+ + S + Sbjct: 81 PNLIGSTYIFMNYK--EDYKRGPESAITTGIEFLAHGYEARFIAQFPTSKVRILEPKPGS 138 Query: 227 IEDYQERPANGWDIRAEGYLPAWPQLGA--SLMYEQYY 262 E + P LP LG + +Y Sbjct: 139 SEQRESIPLLNLSGEIGYDLPGTEDLGFGLRIYAGGFY 176 >UniRef50_Q0I6I6 Unnamed protein product n=3 Tax=Synechococcus RepID=Q0I6I6_SYNS3 Length = 605 Score = 45.5 bits (106), Expect = 0.002, Method: Composition-based stats. Identities = 19/160 (11%), Positives = 52/160 (32%), Gaps = 11/160 (6%) Query: 74 SFAANAGTFLSSQPDSDATRNFITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDS 133 + ++ G + Q D + N I +G+ + + L Sbjct: 138 AIISDQGLQETDQNSPDILEQYPR------LNPVIGFGSSAWGSNASGSTLGQAAGLILG 191 Query: 134 SLEMLYPIYDTPTNMLFTQGAI--HRTDDRTQSNIGFGWRHFSGNDW-MAGVNTFIDHDL 190 P+ + + L + D ++ FG++ F+ N+ ++ + D Sbjct: 192 EASFFLPLRQSEGSKLLYNYSTASSNFDSSWGASTEFGYKWFNPNNRSISSLLVGYDAWE 251 Query: 191 SRS--HTRIGVGAEYWRDYLKLSANGYIRASGWKKSPDIE 228 + H+++ +G ++ + + G I + + Sbjct: 252 TSQCVHSQLALGGQWQKKRWQFGVTGGIPIDDCENNLGFA 291 >UniRef50_A9MK14 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MK14_SALAR Length = 110 Score = 43.1 bits (100), Expect = 0.010, Method: Composition-based stats. Identities = 14/29 (48%), Positives = 19/29 (65%), Gaps = 2/29 (6%) Query: 267 GLFGKD--KRQKDPHAISAEVTYTPVPLT 293 G+FG RQ++PHAI+ + Y PVPL Sbjct: 3 GIFGDGEADRQRNPHAIALGLNYPPVPLV 31 >UniRef50_Q4HGX9 Probable periplasmic protein Cj1193c n=14 Tax=Campylobacter RepID=Q4HGX9_CAMCO Length = 267 Score = 41.2 bits (95), Expect = 0.041, Method: Composition-based stats. Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 3/92 (3%) Query: 127 DFSLKDSSLEMLYPIYDTPTNMLFTQGAIHRTDDRTQSNIGFGWRHFSGNDWMAGVNTFI 186 DF ++ ++ L +Y+ N L Q + T D + G R + +D++ G+N FI Sbjct: 78 DFQNENVQIKNLNSLYEGENNSLLFQKEFYATQDSYNYSGGLINR-YEKDDFLLGINGFI 136 Query: 187 DHDLSRSHTRIGVGAEY-WRDYLKLSANGYIR 217 D + ++ GAE + ++K +N Y+ Sbjct: 137 DGQKEQKESK-SFGAELGYYQFVKAYSNYYVP 167 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.132 0.347 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,440,720,804 Number of Sequences: 3077464 Number of extensions: 67328371 Number of successful extensions: 133657 Number of sequences better than 1.0e-01: 156 Number of HSP's better than 0.1 without gapping: 379 Number of HSP's successfully gapped in prelim test: 71 Number of HSP's that attempted gapping in prelim test: 132256 Number of HSP's gapped (non-prelim): 561 length of query: 295 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 167 effective length of database: 646,480,964 effective search space: 107962320988 effective search space used: 107962320988 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 92 (40.0 bits)