BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (195 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0C8Z8 Fimbrillin matB n=53 Tax=Gammaproteobacteria Rep... 380 e-104 UniRef50_A4SH96 MatB protein n=1 Tax=Aeromonas salmonicida subsp... 163 3e-39 UniRef50_C2LEK0 Fimbrillin MatB protein n=6 Tax=Enterobacteriace... 159 5e-38 UniRef50_C4S9J4 Autotransporter adhesin n=1 Tax=Yersinia mollare... 92 8e-18 UniRef50_UPI000182708C fimbrillin MatB n=1 Tax=Enterobacter canc... 92 1e-17 >UniRef50_P0C8Z8 Fimbrillin matB n=53 Tax=Gammaproteobacteria RepID=MATB1_ECOLX Length = 195 Score = 380 bits (975), Expect = e-104, Method: Compositional matrix adjust. Identities = 194/195 (99%), Positives = 194/195 (99%) Query: 1 MKKKVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG 60 MKKKVLAIALVTVFTG GVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG Sbjct: 1 MKKKVLAIALVTVFTGTGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG 60 Query: 61 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG 120 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG Sbjct: 61 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG 120 Query: 121 DTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEGI 180 DTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEGI Sbjct: 121 DTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEGI 180 Query: 181 WSGDVSVQFDATWTS 195 WSGDVSVQFDATWTS Sbjct: 181 WSGDVSVQFDATWTS 195 >UniRef50_A4SH96 MatB protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SH96_AERS4 Length = 186 Score = 163 bits (412), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 93/194 (47%), Positives = 134/194 (69%), Gaps = 10/194 (5%) Query: 1 MKKKVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG 60 MKK ++A+ L + +G+ A A + TA TW ATAKKD+TS L VT L SLAFQY++G Sbjct: 1 MKKALIAVILAS--SGLSFAHAVESTA----TWKATAKKDSTSALEVTALDSLAFQYSDG 54 Query: 61 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG 120 K FN+ GLFDV+I+GD++AT F L ++ ++ TL L + GST++VGV + G V + Sbjct: 55 SKAFNTVNGLFDVSIKGDASATDFTLKAKKLSGTLNHL-SGGSTVDVGVMWQGVPVSTSA 113 Query: 121 DTVMIDTANGVLGGNLSPLANGY-NASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEG 179 T +IDTA G GG+L+PLA G+ + + TTAQ F+F+I T NG A +++TLP+G Sbjct: 114 YTTLIDTAAGTTGGSLAPLAAGFKDDTQSTTAQGAFSFNIEKATVNGADA--EFNTLPDG 171 Query: 180 IWSGDVSVQFDATW 193 +WSG+V+V+F A+W Sbjct: 172 LWSGEVNVEFVASW 185 >UniRef50_C2LEK0 Fimbrillin MatB protein n=6 Tax=Enterobacteriaceae RepID=C2LEK0_PROMI Length = 207 Score = 159 bits (402), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 99/196 (50%), Positives = 121/196 (61%), Gaps = 6/196 (3%) Query: 1 MKKKVLAIALVTVFTGMGVAQAADVT--AQAVATWSATAKKDTTSKLVVTPLGSLAFQYA 58 + KK+ ++ L T+ A T AQA ATW ATA KDTTS LVVTPL SL F YA Sbjct: 2 LNKKIASVMLSTIIAASFATAANADTRVAQATATWQATAIKDTTSMLVVTPLKSLTFNYA 61 Query: 59 EGIKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEK 118 EG K FN Q G FD+AI+G S AT FKL+S+LI NTL + T STL VGV +NG + K Sbjct: 62 EGQKSFNQQNGAFDIAIQGQSGATDFKLSSKLIANTLART-TDDSTLTVGVKWNGEDLSK 120 Query: 119 TGDTVMIDTANGVLGGNLSPLANG-YNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLP 177 T D V+IDT+ G+ G + A+G YN S R T + FTF I S A D+ +L Sbjct: 121 TTDVVLIDTSKGLTSGLDNLAADGIYNGSERATDRGEFTFVIASAQA--AGATADFKSLA 178 Query: 178 EGIWSGDVSVQFDATW 193 +G+W GDV VQF ATW Sbjct: 179 DGVWDGDVKVQFTATW 194 >UniRef50_C4S9J4 Autotransporter adhesin n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J4_YERMO Length = 209 Score = 92.4 bits (228), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 70/204 (34%), Positives = 107/204 (52%), Gaps = 13/204 (6%) Query: 1 MKKK-VLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQY-- 57 MKK V A+ ++++F VA AA +T ++W TA K +++ L V P GSL Sbjct: 6 MKKNAVAAMVVMSIFAATNVAHAATLTQTDTSSWHVTANKVSSASLAVNPTGSLPVITFT 65 Query: 58 AEGI-----KGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTS--GSTLNVGVD 110 E + F++ F+V + GD++ATAF LT+ IT T+L ++ S L +GV Sbjct: 66 TENVGTTPTSTFSTADAPFEVVLGGDASATAFTLTANTITGQ-TRLASTLNSSYLEIGVA 124 Query: 111 YNGAAVEKTGDTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAV 170 YN + T + +NG G + LA+ + T A D F FS+ S + +GT + Sbjct: 125 YNSTPLTTASQTTIF--SNGTTSGAFAGLASTALNNGVTAAGDKFVFSVASFSADGTNVI 182 Query: 171 TDYSTLPEGIWSGDVSVQFDATWT 194 + + LP WSG++ VQF ATWT Sbjct: 183 ANPNALPNTTWSGNIGVQFTATWT 206 >UniRef50_UPI000182708C fimbrillin MatB n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI000182708C Length = 207 Score = 91.7 bits (226), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 79/208 (37%), Positives = 107/208 (51%), Gaps = 22/208 (10%) Query: 4 KVLAIALVTVFT--GMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEGI 61 K+ AI L ++ + + AA+ TA A TW A+A KD+ S LVVTP +L F Y+ Sbjct: 3 KLNAIVLGSLLSVSALSAVNAAETTASA--TWQASATKDSESDLVVTPTRALNFAYSANT 60 Query: 62 KGFNSQKGLFDVAIEGD-STATAFKLTSRL--ITNTLTQLDTSGSTLNVGVDYNGAAVEK 118 K FN+ GLFDVAI GD S+AT FKL + L NTL + + L VG + G + Sbjct: 61 KSFNTDTGLFDVAIRGDHSSATTFKLEAILDDSNNTLYSVGGEATKLKVGARWGGNDLGS 120 Query: 119 TGDTV---------MIDTAN--GVLGG--NLSPLANGYNASNRTTAQDGFTFSIISGTTN 165 G TV ++D+++ GV G NL+ A G A T QD F F + S + Sbjct: 121 VGGTVGAKSTAWTTLVDSSSNTGVSSGLWNLTTSA-GAAADTEITGQDKFVFYVDSA-QD 178 Query: 166 GTTAVTDYSTLPEGIWSGDVSVQFDATW 193 A ++ L +W G VSV F ATW Sbjct: 179 SAGAKKEFKDLTNSLWEGTVSVAFRATW 206 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0C8Z8 Fimbrillin matB n=53 Tax=Gammaproteobacteria Rep... 200 3e-50 UniRef50_C2LEK0 Fimbrillin MatB protein n=6 Tax=Enterobacteriace... 174 1e-42 UniRef50_A4SH96 MatB protein n=1 Tax=Aeromonas salmonicida subsp... 158 9e-38 UniRef50_UPI000182708C fimbrillin MatB n=1 Tax=Enterobacter canc... 124 1e-27 UniRef50_C4S9J4 Autotransporter adhesin n=1 Tax=Yersinia mollare... 121 1e-26 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P0C8Z8 Fimbrillin matB n=53 Tax=Gammaproteobacteria RepID=MATB1_ECOLX Length = 195 Score = 200 bits (507), Expect = 3e-50, Method: Composition-based stats. Identities = 194/195 (99%), Positives = 194/195 (99%) Query: 1 MKKKVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG 60 MKKKVLAIALVTVFTG GVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG Sbjct: 1 MKKKVLAIALVTVFTGTGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG 60 Query: 61 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG 120 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG Sbjct: 61 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG 120 Query: 121 DTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEGI 180 DTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEGI Sbjct: 121 DTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEGI 180 Query: 181 WSGDVSVQFDATWTS 195 WSGDVSVQFDATWTS Sbjct: 181 WSGDVSVQFDATWTS 195 >UniRef50_C2LEK0 Fimbrillin MatB protein n=6 Tax=Enterobacteriaceae RepID=C2LEK0_PROMI Length = 207 Score = 174 bits (441), Expect = 1e-42, Method: Composition-based stats. Identities = 99/196 (50%), Positives = 121/196 (61%), Gaps = 6/196 (3%) Query: 1 MKKKVLAIALVTVFTGMGVAQAADVT--AQAVATWSATAKKDTTSKLVVTPLGSLAFQYA 58 + KK+ ++ L T+ A T AQA ATW ATA KDTTS LVVTPL SL F YA Sbjct: 2 LNKKIASVMLSTIIAASFATAANADTRVAQATATWQATAIKDTTSMLVVTPLKSLTFNYA 61 Query: 59 EGIKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEK 118 EG K FN Q G FD+AI+G S AT FKL+S+LI NTL + T STL VGV +NG + K Sbjct: 62 EGQKSFNQQNGAFDIAIQGQSGATDFKLSSKLIANTLART-TDDSTLTVGVKWNGEDLSK 120 Query: 119 TGDTVMIDTANGVLGGNLSPLANG-YNASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLP 177 T D V+IDT+ G+ G + A+G YN S R T + FTF I S A D+ +L Sbjct: 121 TTDVVLIDTSKGLTSGLDNLAADGIYNGSERATDRGEFTFVIASAQA--AGATADFKSLA 178 Query: 178 EGIWSGDVSVQFDATW 193 +G+W GDV VQF ATW Sbjct: 179 DGVWDGDVKVQFTATW 194 >UniRef50_A4SH96 MatB protein n=1 Tax=Aeromonas salmonicida subsp. salmonicida A449 RepID=A4SH96_AERS4 Length = 186 Score = 158 bits (399), Expect = 9e-38, Method: Composition-based stats. Identities = 93/194 (47%), Positives = 134/194 (69%), Gaps = 10/194 (5%) Query: 1 MKKKVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEG 60 MKK ++A+ L + +G+ A A + TA TW ATAKKD+TS L VT L SLAFQY++G Sbjct: 1 MKKALIAVILAS--SGLSFAHAVESTA----TWKATAKKDSTSALEVTALDSLAFQYSDG 54 Query: 61 IKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTG 120 K FN+ GLFDV+I+GD++AT F L ++ ++ TL L + GST++VGV + G V + Sbjct: 55 SKAFNTVNGLFDVSIKGDASATDFTLKAKKLSGTLNHL-SGGSTVDVGVMWQGVPVSTSA 113 Query: 121 DTVMIDTANGVLGGNLSPLANGY-NASNRTTAQDGFTFSIISGTTNGTTAVTDYSTLPEG 179 T +IDTA G GG+L+PLA G+ + + TTAQ F+F+I T NG A +++TLP+G Sbjct: 114 YTTLIDTAAGTTGGSLAPLAAGFKDDTQSTTAQGAFSFNIEKATVNGADA--EFNTLPDG 171 Query: 180 IWSGDVSVQFDATW 193 +WSG+V+V+F A+W Sbjct: 172 LWSGEVNVEFVASW 185 >UniRef50_UPI000182708C fimbrillin MatB n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI000182708C Length = 207 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 73/205 (35%), Positives = 102/205 (49%), Gaps = 16/205 (7%) Query: 4 KVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEGIKG 63 K+ AI L ++ + ++ A ATW A+A KD+ S LVVTP +L F Y+ K Sbjct: 3 KLNAIVLGSLLSVSALSAVNAAETTASATWQASATKDSESDLVVTPTRALNFAYSANTKS 62 Query: 64 FNSQKGLFDVAIEGD-STATAFKLTSRL--ITNTLTQLDTSGSTLNVGVDYNGAAVEKTG 120 FN+ GLFDVAI GD S+AT FKL + L NTL + + L VG + G + G Sbjct: 63 FNTDTGLFDVAIRGDHSSATTFKLEAILDDSNNTLYSVGGEATKLKVGARWGGNDLGSVG 122 Query: 121 DTV---------MIDTAN--GVLGGNLS-PLANGYNASNRTTAQDGFTFSIISGTTNGTT 168 TV ++D+++ GV G + + G A T QD F F + S + Sbjct: 123 GTVGAKSTAWTTLVDSSSNTGVSSGLWNLTTSAGAAADTEITGQDKFVFYVDSAQ-DSAG 181 Query: 169 AVTDYSTLPEGIWSGDVSVQFDATW 193 A ++ L +W G VSV F ATW Sbjct: 182 AKKEFKDLTNSLWEGTVSVAFRATW 206 >UniRef50_C4S9J4 Autotransporter adhesin n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4S9J4_YERMO Length = 209 Score = 121 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 66/201 (32%), Positives = 100/201 (49%), Gaps = 10/201 (4%) Query: 2 KKKVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQY---- 57 K V A+ ++++F VA AA +T ++W TA K +++ L V P GSL Sbjct: 8 KNAVAAMVVMSIFAATNVAHAATLTQTDTSSWHVTANKVSSASLAVNPTGSLPVITFTTE 67 Query: 58 ---AEGIKGFNSQKGLFDVAIEGDSTATAFKLTSRLITNTLTQLDT-SGSTLNVGVDYNG 113 F++ F+V + GD++ATAF LT+ IT T + S L +GV YN Sbjct: 68 NVGTTPTSTFSTADAPFEVVLGGDASATAFTLTANTITGQTRLASTLNSSYLEIGVAYNS 127 Query: 114 AAVEKTGDTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSIISGTTNGTTAVTDY 173 + T + +NG G + LA+ + T A D F FS+ S + +GT + + Sbjct: 128 TPLTTASQTTIF--SNGTTSGAFAGLASTALNNGVTAAGDKFVFSVASFSADGTNVIANP 185 Query: 174 STLPEGIWSGDVSVQFDATWT 194 + LP WSG++ VQF ATWT Sbjct: 186 NALPNTTWSGNIGVQFTATWT 206 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.304 0.117 0.288 Lambda K H 0.267 0.0358 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 548,556,165 Number of Sequences: 3077464 Number of extensions: 17739299 Number of successful extensions: 64662 Number of sequences better than 1.0e-01: 124 Number of HSP's better than 0.1 without gapping: 15 Number of HSP's successfully gapped in prelim test: 137 Number of HSP's that attempted gapping in prelim test: 64164 Number of HSP's gapped (non-prelim): 568 length of query: 195 length of database: 1,040,396,356 effective HSP length: 122 effective length of query: 73 effective length of database: 664,945,748 effective search space: 48541039604 effective search space used: 48541039604 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.5 bits) S2: 89 (39.1 bits)