BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (273 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P76498 Uncharacterized protein yfcO n=17 Tax=Enterobact... 570 e-161 UniRef50_B7UFZ0 Predicted protein n=9 Tax=Enterobacteriaceae Rep... 252 1e-65 UniRef50_C5W769 Ybl105 protein n=5 Tax=Escherichia coli RepID=C5... 241 2e-62 UniRef50_C8U6Q6 Conserved predicted protein n=22 Tax=Enterobacte... 236 6e-61 UniRef50_B1EJE2 Putative fimbrial adhesin YfcO n=1 Tax=Escherich... 202 8e-51 UniRef50_Q32DK5 Putative uncharacterized protein n=1 Tax=Shigell... 197 3e-49 UniRef50_O87663 Uncharacterized protein yadU n=21 Tax=Salmonella... 193 5e-48 UniRef50_B4TK29 Putative uncharacterized protein n=4 Tax=Enterob... 184 2e-45 >UniRef50_P76498 Uncharacterized protein yfcO n=17 Tax=Enterobacteriaceae RepID=YFCO_ECOLI Length = 273 Score = 570 bits (1469), Expect = e-161, Method: Compositional matrix adjust. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MKILRWLFALVMLIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYVGEYKSS 60 MKILRWLFALVMLIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYVGEYKSS Sbjct: 1 MKILRWLFALVMLIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYVGEYKSS 60 Query: 61 QWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYYTFEVIVSAEIE 120 QWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYYTFEVIVSAEIE Sbjct: 61 QWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYYTFEVIVSAEIE 120 Query: 121 SYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQS 180 SYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQS Sbjct: 121 SYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQS 180 Query: 181 TGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPI 240 TGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPI Sbjct: 181 TGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPI 240 Query: 241 TDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 TDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL Sbjct: 241 TDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 >UniRef50_B7UFZ0 Predicted protein n=9 Tax=Enterobacteriaceae RepID=B7UFZ0_ECO27 Length = 287 Score = 252 bits (643), Expect = 1e-65, Method: Compositional matrix adjust. Identities = 145/294 (49%), Positives = 192/294 (65%), Gaps = 28/294 (9%) Query: 1 MKILRWLFALVMLIATTEAMAAGHSVDVYY---GYNGDSRNIATFNLKIMMPSAVYVGEY 57 MKI+RW+ L++ + + A+ A V Y GY + I T+ L ++ P +V G Y Sbjct: 1 MKIMRWILGLLISLFFSTAVQANVIVATMYTPIGYASYTTKI-TYYLDVLTPDSVNHGVY 59 Query: 58 KS--SQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLP--------SGWNCG 107 ++ + L+TG I SW+GP PAPS+K+I I+++SCPGL + W C Sbjct: 60 ETPNNTGLITGWIPLK-SWTGPGPAPSLKVISM-TAISQSSCPGLTEYDSRAQRTMWTC- 116 Query: 108 YYTFEVIVSAEIESYFSCPWLVIMN-DS----EASPGGVTYQGPDSHDTICPSVSVQPYD 162 Y + V E S CPWLV + DS + +PG TY G ++ CP + + PYD Sbjct: 117 -YEIRMEVWKEDSSVHGCPWLVSTHADSIDLIDPAPG--TYSGRTVSNSSCPPIPLGPYD 173 Query: 163 VSWNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNET---GGYCRWVAQMI 219 VSWNE+ V + K L LQSTGG++EKTLSTYLMKDGKLCD ++ +T G YCRWV+QM+ Sbjct: 174 VSWNESRVVRDKTLALQSTGGIIEKTLSTYLMKDGKLCDGSKFGDTDDRGAYCRWVSQML 233 Query: 220 TFTASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 TFT+SGCD A+V+VTPNRHPITDK+LHDMV+RVDT+S QPIDSTCRFQY+LN L Sbjct: 234 TFTSSGCDNAKVTVTPNRHPITDKELHDMVLRVDTTSRQPIDSTCRFQYVLNML 287 >UniRef50_C5W769 Ybl105 protein n=5 Tax=Escherichia coli RepID=C5W769_ECOBB Length = 274 Score = 241 bits (614), Expect = 2e-62, Method: Compositional matrix adjust. Identities = 128/284 (45%), Positives = 184/284 (64%), Gaps = 21/284 (7%) Query: 1 MKILRWLFALVMLIATTEAMAAG---HSVDVYYGYNGDSRNIATFNLKIMMP-SAVYVGE 56 MK + LF L++ ++ ++ A SVDV YG +G S F I+ P + Y + Sbjct: 1 MKTIWILFCLLITWLSSTSVQASVTFGSVDVLYGASGQSDPTMEFYFTIISPRNGAYTAK 60 Query: 57 Y-------KSSQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYY 109 Y +++ +L+T + WSGP AP ++++G+ N + CPGLPSG NC Y Sbjct: 61 YYDPHAVPRNNDYLVTDQ------WSGPGAAPIIQIVGF-GNAGASQCPGLPSGRNCQYL 113 Query: 110 TFEVIVSAEIESYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENY 169 TF + + A + F CPWL + S + G +Y+ P ++ T+CP+V V +DVSW+EN Sbjct: 114 TFSITIDAADD--FGCPWLASVY-SVVTDYGASYRAPTANSTVCPTVPVDTFDVSWSENR 170 Query: 170 VSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKA 229 V+ + ++L STGG +EKT+STYLM+ KLCDS+ M+ G YCR+V+ +ITF++ GCD A Sbjct: 171 VNHNLGISLHSTGGFIEKTVSTYLMESNKLCDSSVMDRRGDYCRFVSGLITFSSYGCDNA 230 Query: 230 EVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 +V+VTP +TDK+LHD+VVRVDTSSMQPIDS+CRFQYILNEL Sbjct: 231 KVTVTPIEQAVTDKKLHDIVVRVDTSSMQPIDSSCRFQYILNEL 274 >UniRef50_C8U6Q6 Conserved predicted protein n=22 Tax=Enterobacteriaceae RepID=C8U6Q6_ECO10 Length = 285 Score = 236 bits (602), Expect = 6e-61, Method: Compositional matrix adjust. Identities = 127/291 (43%), Positives = 183/291 (62%), Gaps = 24/291 (8%) Query: 1 MKILRWLFALVMLI-----------ATTEAMAAGH-SVDVYYGYNGDSRNIAT--FNLKI 46 MK +FA +ML+ A + M+A SV VYY + + +A+ FN+ + Sbjct: 1 MKKWTIIFASLMLLVLSVVGASKSYAGDKLMSASFDSVRVYYAMDKVTGAVASSVFNVTV 60 Query: 47 MMPSAVYVGEYKSSQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSG-WN 105 + P V G+Y S + G+ L+ +SWSG AP++ L + IN ++CPG+ + + Sbjct: 61 ITPKEVAYGKYDSFAY--KGDTLRVISWSGSGSAPTLVLTDF-DTINNSNCPGIDTKIFR 117 Query: 106 CGYYTFEVIVSAEIESYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSW 165 C Y TF++ V+++ + CPW+ PG +Y P H+TICP++ V YD+SW Sbjct: 118 CAYMTFKITVASDD---YGCPWIASFYSYTDLPGFGSYTAPTVHNTICPTIPVASYDISW 174 Query: 166 NENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNET---GGYCRWVAQMITFT 222 +ENYVS +K L +QSTG V TLSTYLM+ G+LCD + ++ G YCR V++++TFT Sbjct: 175 SENYVSHNKALRIQSTGSTVTTTLSTYLMEGGRLCDGSNFSDNDGRGAYCRAVSELLTFT 234 Query: 223 ASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 + GCDK+ V+VTP RHP+TDK LHD+VV V+TSS QPIDSTCRFQY+LNEL Sbjct: 235 SYGCDKSTVTVTPTRHPVTDKVLHDIVVNVNTSSGQPIDSTCRFQYVLNEL 285 >UniRef50_B1EJE2 Putative fimbrial adhesin YfcO n=1 Tax=Escherichia albertii TW07627 RepID=B1EJE2_9ESCH Length = 248 Score = 202 bits (515), Expect = 8e-51, Method: Compositional matrix adjust. Identities = 118/248 (47%), Positives = 150/248 (60%), Gaps = 19/248 (7%) Query: 41 TFNLKIMMPSAVYVGEYKS-SQWLMTGEILQNVSWS---GPPPAPSVKLIGYHQNINKAS 96 T L+++ P Y G Y+ SQ L TG I N+ WS G AP + + + Sbjct: 5 TLYLEVVTPHGTYYGTYQEPSQRLNTGNI-TNIYWSDTTGTIHAPRLNMGAASGAGSIGP 63 Query: 97 CPGL-----PSGWNCGYYTFEVIVSAEIESYFSCPWLV-----IMNDSEASPGGVTYQGP 146 CPG+ S W C V V + CPWL+ S + Y GP Sbjct: 64 CPGVHDAPNTSTWGCYSTNISVYVDQPVGG---CPWLISSYITTFYHSTFTGDQGPYTGP 120 Query: 147 DSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMN 206 +H++ CP V+V PYDVSW+ENYV+ +K + L GGVV +TLSTYLMKDG+LCD +Q + Sbjct: 121 KAHNSSCPPVAVAPYDVSWDENYVAHNKTVRLPGGGGVVTQTLSTYLMKDGQLCDGSQPD 180 Query: 207 ETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQP-IDSTCR 265 E G YCR VAQ++TFT+SGCD A VSVTP HPITDKQLHDMV++V+TS+ P I +TCR Sbjct: 181 ERGLYCRLVAQLMTFTSSGCDDARVSVTPTPHPITDKQLHDMVLQVNTSNNVPAIAATCR 240 Query: 266 FQYILNEL 273 FQYILNEL Sbjct: 241 FQYILNEL 248 >UniRef50_Q32DK5 Putative uncharacterized protein n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32DK5_SHIDS Length = 169 Score = 197 bits (502), Expect = 3e-49, Method: Compositional matrix adjust. Identities = 93/169 (55%), Positives = 124/169 (73%), Gaps = 7/169 (4%) Query: 106 CGYYTFEVIVSAEIESYFSCPWLV---IMNDSEASPGGVTYQGPDSHDTICPSVSVQPYD 162 C + V+ E+ CPWLV +++ S P TY GP + + CP+ S+ PYD Sbjct: 3 CDSLPVKFTVTGEVSG---CPWLVTVRVISHSATVPPE-TYIGPQTPVSTCPAQSLTPYD 58 Query: 163 VSWNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFT 222 +SW++NYV K+K++ LQSTGG++EKTL T+LMKDGKLCD Q ++ G YCR+V QM+TF+ Sbjct: 59 ISWDQNYVVKNKVIRLQSTGGMIEKTLPTFLMKDGKLCDGGQASDEGAYCRFVTQMLTFS 118 Query: 223 ASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILN 271 +SGCD +V+VTPNRHPITDK++HDMVV VDT+ QPIDSTCRF Y+LN Sbjct: 119 SSGCDNGKVTVTPNRHPITDKEVHDMVVHVDTTERQPIDSTCRFTYVLN 167 >UniRef50_O87663 Uncharacterized protein yadU n=21 Tax=Salmonella enterica RepID=YADU_SALTY Length = 279 Score = 193 bits (490), Expect = 5e-48, Method: Compositional matrix adjust. Identities = 109/291 (37%), Positives = 156/291 (53%), Gaps = 30/291 (10%) Query: 1 MKILRWLFALVM------LIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYV 54 MKI+R LF L++ ++A A S +YYG +S + I P VY Sbjct: 1 MKIIRTLFLLLIAVYGSSVVAKPMLKATFSSTTMYYGIGPNSDKSIVAEVTIATPEGVYY 60 Query: 55 GEYKSSQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWN-CGYYTFEV 113 G + S GE L SWSGP PAP V L + +++++C LPS W CG +T E+ Sbjct: 61 GSWNLSGH-RKGETLTADSWSGPEPAPKVVLKDFDNTVSRSACKNLPSNWRGCGSFTLEI 119 Query: 114 IVSAEIESYFSCPWLV---------IMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVS 164 V ++ + CPWL I N+ TY PD+ ++CP V V +D+S Sbjct: 120 TVQSD---DYGCPWLASSHIVATTFITNE--------TYSPPDTRSSVCPKVPVDTFDIS 168 Query: 165 WNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTAS 224 W+ N + L L +TGG V +TL TYLM+ GKLCD ++ + G YCR+V+ IT Sbjct: 169 WDANVSKQKTTLMLDATGGTVNRTLHTYLMEGGKLCDGSKFDNRGAYCRFVSSGITLNVL 228 Query: 225 GCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQP--IDSTCRFQYILNEL 273 GCD++ V+ + HPITD +LHD+ V V+T ++ STC FQYI++EL Sbjct: 229 GCDQSSVTTSAVDHPITDVELHDINVAVNTRNIGSGQFTSTCSFQYIIDEL 279 >UniRef50_B4TK29 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B4TK29_SALHS Length = 213 Score = 184 bits (468), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 87/206 (42%), Positives = 131/206 (63%), Gaps = 7/206 (3%) Query: 72 VSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWN-CGYYTFEVIVSAEIESYFSCPWLVI 130 +SW+GP PAP++ L + +I+K++C LPS WN CGYYT ++ V ++ + CPWL Sbjct: 11 LSWTGPDPAPTIVLRDFDNSISKSNCKNLPSSWNGCGYYTVDITVQSD---NYGCPWLAA 67 Query: 131 MNDS-EASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQSTGGVVEKTL 189 + + E G TY PD+ ++CP + V +D+SW+ N + L L +TGG V +TL Sbjct: 68 THSTAEDLVSGETYSAPDTRSSVCPKIPVDTFDISWDANVSKQKTTLMLDATGGTVNRTL 127 Query: 190 STYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPITDKQLHDMV 249 TYLM+ GKLCD ++ ++ G YCR+V+ IT GCD++ V+ + HPITD +LHD+ Sbjct: 128 HTYLMEGGKLCDGSKFDDRGAYCRFVSSGITLNVLGCDQSSVTTSAVDHPITDVELHDIN 187 Query: 250 VRVDTSSMQP--IDSTCRFQYILNEL 273 V V+T ++ STC FQYI++EL Sbjct: 188 VAVNTRNIGSGQFTSTCSFQYIIDEL 213 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76498 Uncharacterized protein yfcO n=17 Tax=Enterobact... 431 e-120 UniRef50_O87663 Uncharacterized protein yadU n=21 Tax=Salmonella... 356 6e-97 UniRef50_C5W769 Ybl105 protein n=5 Tax=Escherichia coli RepID=C5... 354 3e-96 UniRef50_B7UFZ0 Predicted protein n=9 Tax=Enterobacteriaceae Rep... 353 5e-96 UniRef50_C8U6Q6 Conserved predicted protein n=22 Tax=Enterobacte... 346 4e-94 UniRef50_B1EJE2 Putative fimbrial adhesin YfcO n=1 Tax=Escherich... 306 4e-82 UniRef50_B4TK29 Putative uncharacterized protein n=4 Tax=Enterob... 295 9e-79 UniRef50_Q32DK5 Putative uncharacterized protein n=1 Tax=Shigell... 248 1e-64 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P76498 Uncharacterized protein yfcO n=17 Tax=Enterobacteriaceae RepID=YFCO_ECOLI Length = 273 Score = 431 bits (1109), Expect = e-120, Method: Composition-based stats. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MKILRWLFALVMLIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYVGEYKSS 60 MKILRWLFALVMLIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYVGEYKSS Sbjct: 1 MKILRWLFALVMLIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYVGEYKSS 60 Query: 61 QWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYYTFEVIVSAEIE 120 QWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYYTFEVIVSAEIE Sbjct: 61 QWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYYTFEVIVSAEIE 120 Query: 121 SYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQS 180 SYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQS Sbjct: 121 SYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQS 180 Query: 181 TGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPI 240 TGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPI Sbjct: 181 TGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPI 240 Query: 241 TDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 TDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL Sbjct: 241 TDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 >UniRef50_O87663 Uncharacterized protein yadU n=21 Tax=Salmonella enterica RepID=YADU_SALTY Length = 279 Score = 356 bits (912), Expect = 6e-97, Method: Composition-based stats. Identities = 107/283 (37%), Positives = 155/283 (54%), Gaps = 14/283 (4%) Query: 1 MKILRWLFALVM------LIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYV 54 MKI+R LF L++ ++A A S +YYG +S + I P VY Sbjct: 1 MKIIRTLFLLLIAVYGSSVVAKPMLKATFSSTTMYYGIGPNSDKSIVAEVTIATPEGVYY 60 Query: 55 GEYKSSQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWN-CGYYTFEV 113 G + S GE L SWSGP PAP V L + +++++C LPS W CG +T E+ Sbjct: 61 GSWNLSGH-RKGETLTADSWSGPEPAPKVVLKDFDNTVSRSACKNLPSNWRGCGSFTLEI 119 Query: 114 IVSAEIESYFSCPWLVIMN-DSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSK 172 V ++ + CPWL + + TY PD+ ++CP V V +D+SW+ N + Sbjct: 120 TVQSDD---YGCPWLASSHIVATTFITNETYSPPDTRSSVCPKVPVDTFDISWDANVSKQ 176 Query: 173 SKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVS 232 L L +TGG V +TL TYLM+ GKLCD ++ + G YCR+V+ IT GCD++ V+ Sbjct: 177 KTTLMLDATGGTVNRTLHTYLMEGGKLCDGSKFDNRGAYCRFVSSGITLNVLGCDQSSVT 236 Query: 233 VTPNRHPITDKQLHDMVVRVDTSSMQP--IDSTCRFQYILNEL 273 + HPITD +LHD+ V V+T ++ STC FQYI++EL Sbjct: 237 TSAVDHPITDVELHDINVAVNTRNIGSGQFTSTCSFQYIIDEL 279 >UniRef50_C5W769 Ybl105 protein n=5 Tax=Escherichia coli RepID=C5W769_ECOBB Length = 274 Score = 354 bits (907), Expect = 3e-96, Method: Composition-based stats. Identities = 128/284 (45%), Positives = 184/284 (64%), Gaps = 21/284 (7%) Query: 1 MKILRWLFALVMLIATTEAMAA---GHSVDVYYGYNGDSRNIATFNLKIMMP-SAVYVGE 56 MK + LF L++ ++ ++ A SVDV YG +G S F I+ P + Y + Sbjct: 1 MKTIWILFCLLITWLSSTSVQASVTFGSVDVLYGASGQSDPTMEFYFTIISPRNGAYTAK 60 Query: 57 Y-------KSSQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWNCGYY 109 Y +++ +L+T + WSGP AP ++++G+ N + CPGLPSG NC Y Sbjct: 61 YYDPHAVPRNNDYLVTDQ------WSGPGAAPIIQIVGF-GNAGASQCPGLPSGRNCQYL 113 Query: 110 TFEVIVSAEIESYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENY 169 TF + + A + F CPWL + S + G +Y+ P ++ T+CP+V V +DVSW+EN Sbjct: 114 TFSITIDAADD--FGCPWLASVY-SVVTDYGASYRAPTANSTVCPTVPVDTFDVSWSENR 170 Query: 170 VSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKA 229 V+ + ++L STGG +EKT+STYLM+ KLCDS+ M+ G YCR+V+ +ITF++ GCD A Sbjct: 171 VNHNLGISLHSTGGFIEKTVSTYLMESNKLCDSSVMDRRGDYCRFVSGLITFSSYGCDNA 230 Query: 230 EVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 +V+VTP +TDK+LHD+VVRVDTSSMQPIDS+CRFQYILNEL Sbjct: 231 KVTVTPIEQAVTDKKLHDIVVRVDTSSMQPIDSSCRFQYILNEL 274 >UniRef50_B7UFZ0 Predicted protein n=9 Tax=Enterobacteriaceae RepID=B7UFZ0_ECO27 Length = 287 Score = 353 bits (905), Expect = 5e-96, Method: Composition-based stats. Identities = 143/292 (48%), Positives = 189/292 (64%), Gaps = 24/292 (8%) Query: 1 MKILRWLFALVMLIATTEAMAAGHSVDVYY---GYNGDSRNIATFNLKIMMPSAVYVGEY 57 MKI+RW+ L++ + + A+ A V Y GY + I T+ L ++ P +V G Y Sbjct: 1 MKIMRWILGLLISLFFSTAVQANVIVATMYTPIGYASYTTKI-TYYLDVLTPDSVNHGVY 59 Query: 58 K--SSQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLP--------SGWNCG 107 + ++ L+TG I SW+GP PAPS+K+I I+++SCPGL + W C Sbjct: 60 ETPNNTGLITGWIPLK-SWTGPGPAPSLKVISM-TAISQSSCPGLTEYDSRAQRTMWTC- 116 Query: 108 YYTFEVIVSAEIESYFSCPWLVIMN-DSEAS--PGGVTYQGPDSHDTICPSVSVQPYDVS 164 Y + V E S CPWLV + DS P TY G ++ CP + + PYDVS Sbjct: 117 -YEIRMEVWKEDSSVHGCPWLVSTHADSIDLIDPAPGTYSGRTVSNSSCPPIPLGPYDVS 175 Query: 165 WNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQM---NETGGYCRWVAQMITF 221 WNE+ V + K L LQSTGG++EKTLSTYLMKDGKLCD ++ ++ G YCRWV+QM+TF Sbjct: 176 WNESRVVRDKTLALQSTGGIIEKTLSTYLMKDGKLCDGSKFGDTDDRGAYCRWVSQMLTF 235 Query: 222 TASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 T+SGCD A+V+VTPNRHPITDK+LHDMV+RVDT+S QPIDSTCRFQY+LN L Sbjct: 236 TSSGCDNAKVTVTPNRHPITDKELHDMVLRVDTTSRQPIDSTCRFQYVLNML 287 >UniRef50_C8U6Q6 Conserved predicted protein n=22 Tax=Enterobacteriaceae RepID=C8U6Q6_ECO10 Length = 285 Score = 346 bits (888), Expect = 4e-94, Method: Composition-based stats. Identities = 127/291 (43%), Positives = 183/291 (62%), Gaps = 24/291 (8%) Query: 1 MKILRWLFALVMLI-----------ATTEAMAAGH-SVDVYYGYNGDSRNIAT--FNLKI 46 MK +FA +ML+ A + M+A SV VYY + + +A+ FN+ + Sbjct: 1 MKKWTIIFASLMLLVLSVVGASKSYAGDKLMSASFDSVRVYYAMDKVTGAVASSVFNVTV 60 Query: 47 MMPSAVYVGEYKSSQWLMTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSG-WN 105 + P V G+Y S + G+ L+ +SWSG AP++ L + IN ++CPG+ + + Sbjct: 61 ITPKEVAYGKYDSFAY--KGDTLRVISWSGSGSAPTLVLTDFD-TINNSNCPGIDTKIFR 117 Query: 106 CGYYTFEVIVSAEIESYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQPYDVSW 165 C Y TF++ V+++ + CPW+ PG +Y P H+TICP++ V YD+SW Sbjct: 118 CAYMTFKITVASDD---YGCPWIASFYSYTDLPGFGSYTAPTVHNTICPTIPVASYDISW 174 Query: 166 NENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNE---TGGYCRWVAQMITFT 222 +ENYVS +K L +QSTG V TLSTYLM+ G+LCD + ++ G YCR V++++TFT Sbjct: 175 SENYVSHNKALRIQSTGSTVTTTLSTYLMEGGRLCDGSNFSDNDGRGAYCRAVSELLTFT 234 Query: 223 ASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 + GCDK+ V+VTP RHP+TDK LHD+VV V+TSS QPIDSTCRFQY+LNEL Sbjct: 235 SYGCDKSTVTVTPTRHPVTDKVLHDIVVNVNTSSGQPIDSTCRFQYVLNEL 285 >UniRef50_B1EJE2 Putative fimbrial adhesin YfcO n=1 Tax=Escherichia albertii TW07627 RepID=B1EJE2_9ESCH Length = 248 Score = 306 bits (784), Expect = 4e-82, Method: Composition-based stats. Identities = 118/248 (47%), Positives = 150/248 (60%), Gaps = 19/248 (7%) Query: 41 TFNLKIMMPSAVYVGEYKS-SQWLMTGEILQNVSWS---GPPPAPSVKLIGYHQNINKAS 96 T L+++ P Y G Y+ SQ L TG I N+ WS G AP + + + Sbjct: 5 TLYLEVVTPHGTYYGTYQEPSQRLNTGNI-TNIYWSDTTGTIHAPRLNMGAASGAGSIGP 63 Query: 97 CPGL-----PSGWNCGYYTFEVIVSAEIESYFSCPWLVI-----MNDSEASPGGVTYQGP 146 CPG+ S W C V V + CPWL+ S + Y GP Sbjct: 64 CPGVHDAPNTSTWGCYSTNISVYVDQPVG---GCPWLISSYITTFYHSTFTGDQGPYTGP 120 Query: 147 DSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMN 206 +H++ CP V+V PYDVSW+ENYV+ +K + L GGVV +TLSTYLMKDG+LCD +Q + Sbjct: 121 KAHNSSCPPVAVAPYDVSWDENYVAHNKTVRLPGGGGVVTQTLSTYLMKDGQLCDGSQPD 180 Query: 207 ETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQP-IDSTCR 265 E G YCR VAQ++TFT+SGCD A VSVTP HPITDKQLHDMV++V+TS+ P I +TCR Sbjct: 181 ERGLYCRLVAQLMTFTSSGCDDARVSVTPTPHPITDKQLHDMVLQVNTSNNVPAIAATCR 240 Query: 266 FQYILNEL 273 FQYILNEL Sbjct: 241 FQYILNEL 248 >UniRef50_B4TK29 Putative uncharacterized protein n=4 Tax=Enterobacteriaceae RepID=B4TK29_SALHS Length = 213 Score = 295 bits (756), Expect = 9e-79, Method: Composition-based stats. Identities = 88/214 (41%), Positives = 132/214 (61%), Gaps = 7/214 (3%) Query: 64 MTGEILQNVSWSGPPPAPSVKLIGYHQNINKASCPGLPSGWN-CGYYTFEVIVSAEIESY 122 G +SW+GP PAP++ L + +I+K++C LPS WN CGYYT ++ V ++ Sbjct: 3 RKGATATLLSWTGPDPAPTIVLRDFDNSISKSNCKNLPSSWNGCGYYTVDITVQSD---N 59 Query: 123 FSCPWLVIMNDS-EASPGGVTYQGPDSHDTICPSVSVQPYDVSWNENYVSKSKLLTLQST 181 + CPWL + + E G TY PD+ ++CP + V +D+SW+ N + L L +T Sbjct: 60 YGCPWLAATHSTAEDLVSGETYSAPDTRSSVCPKIPVDTFDISWDANVSKQKTTLMLDAT 119 Query: 182 GGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPIT 241 GG V +TL TYLM+ GKLCD ++ ++ G YCR+V+ IT GCD++ V+ + HPIT Sbjct: 120 GGTVNRTLHTYLMEGGKLCDGSKFDDRGAYCRFVSSGITLNVLGCDQSSVTTSAVDHPIT 179 Query: 242 DKQLHDMVVRVDTSSMQP--IDSTCRFQYILNEL 273 D +LHD+ V V+T ++ STC FQYI++EL Sbjct: 180 DVELHDINVAVNTRNIGSGQFTSTCSFQYIIDEL 213 >UniRef50_Q32DK5 Putative uncharacterized protein n=1 Tax=Shigella dysenteriae Sd197 RepID=Q32DK5_SHIDS Length = 169 Score = 248 bits (634), Expect = 1e-64, Method: Composition-based stats. Identities = 93/172 (54%), Positives = 123/172 (71%), Gaps = 7/172 (4%) Query: 105 NCGYYTFEVIVSAEIESYFSCPWLVI---MNDSEASPGGVTYQGPDSHDTICPSVSVQPY 161 C + V+ E+ CPWLV ++ S P TY GP + + CP+ S+ PY Sbjct: 2 GCDSLPVKFTVTGEVS---GCPWLVTVRVISHSATVPPE-TYIGPQTPVSTCPAQSLTPY 57 Query: 162 DVSWNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITF 221 D+SW++NYV K+K++ LQSTGG++EKTL T+LMKDGKLCD Q ++ G YCR+V QM+TF Sbjct: 58 DISWDQNYVVKNKVIRLQSTGGMIEKTLPTFLMKDGKLCDGGQASDEGAYCRFVTQMLTF 117 Query: 222 TASGCDKAEVSVTPNRHPITDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL 273 ++SGCD +V+VTPNRHPITDK++HDMVV VDT+ QPIDSTCRF Y+LN Sbjct: 118 SSSGCDNGKVTVTPNRHPITDKEVHDMVVHVDTTERQPIDSTCRFTYVLNMF 169 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.125 0.361 Lambda K H 0.267 0.0382 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 989,232,832 Number of Sequences: 3077464 Number of extensions: 36022072 Number of successful extensions: 92884 Number of sequences better than 1.0e-01: 9 Number of HSP's better than 0.1 without gapping: 16 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 92801 Number of HSP's gapped (non-prelim): 19 length of query: 273 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 146 effective length of database: 649,558,428 effective search space: 94835530488 effective search space used: 94835530488 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 92 (40.1 bits)