BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (284 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P71296 Uncharacterized protein yagM n=3 Tax=Escherichia... 588 e-167 UniRef50_B2Q774 Putative uncharacterized protein n=1 Tax=Provide... 122 1e-26 UniRef50_C4SB24 Putative uncharacterized protein n=1 Tax=Yersini... 118 3e-25 UniRef50_B3Y1H1 Predicted protein n=1 Tax=Escherichia coli O111:... 85 2e-15 UniRef50_D2A7D3 Putative uncharacterized protein n=2 Tax=Shigell... 64 7e-09 UniRef50_Q707G9 Putative uncharacterized protein n=9 Tax=Enterob... 57 5e-07 >UniRef50_P71296 Uncharacterized protein yagM n=3 Tax=Escherichia coli RepID=YAGM_ECOLI Length = 284 Score = 588 bits (1516), Expect = e-167, Method: Compositional matrix adjust. Identities = 284/284 (100%), Positives = 284/284 (100%) Query: 1 MSNSVTNFEMSSVLPGKKPCQGKNNESQVVQTTPIKKHSVTFKNQSSLGVIDHYARLTNK 60 MSNSVTNFEMSSVLPGKKPCQGKNNESQVVQTTPIKKHSVTFKNQSSLGVIDHYARLTNK Sbjct: 1 MSNSVTNFEMSSVLPGKKPCQGKNNESQVVQTTPIKKHSVTFKNQSSLGVIDHYARLTNK 60 Query: 61 SHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKFSIKESINRSRGKTEVTLEEYCSL 120 SHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKFSIKESINRSRGKTEVTLEEYCSL Sbjct: 61 SHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKFSIKESINRSRGKTEVTLEEYCSL 120 Query: 121 IWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENYDMEKAIYIYNQRHF 180 IWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENYDMEKAIYIYNQRHF Sbjct: 121 IWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENYDMEKAIYIYNQRHF 180 Query: 181 DVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYPVV 240 DVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYPVV Sbjct: 181 DVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYPVV 240 Query: 241 CWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDDIIINPYSQESRT 284 CWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDDIIINPYSQESRT Sbjct: 241 CWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDDIIINPYSQESRT 284 >UniRef50_B2Q774 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q774_PROST Length = 245 Score = 122 bits (307), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 80/241 (33%), Positives = 126/241 (52%), Gaps = 4/241 (1%) Query: 39 SVTFKNQSSLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKFS 98 S+T K++SSL ID Y + SSVIA V+ A P+L + N+H +NE++N L F+ Sbjct: 6 SITIKDESSLKAIDEYCLKHKLTRSSVIASVLTSAYPLLTEINQHTQLVNELENKL--FN 63 Query: 99 IKESINRSRGKTEV-TLEEYCSLIWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRE 157 K+ S G T V +L EY IW+T+I+ I + + +G +EK I+E Sbjct: 64 PKKIPRTSWGDTPVFSLPEYLLDIWQTHIIEKGIIIDQHAYPHTYKNPKVGSEEKKNIQE 123 Query: 158 RLSSLRENYDMEKAIYIYNQRHFDVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTS 217 L++ + ++KAI+IY RH K G+SN ILI T + Y FD + + Sbjct: 124 TLNNYIDPPHIQKAIFIYTDRHVHYKLHKAGGFSNTILIRNTEYNNYLFDFNAIIHVPIK 183 Query: 218 QLIIFGINEVLRRKGIVMPYPVVCWIDIYHVNEMVVMLPVLRKTDVSNRVNV-PDDIIIN 276 ++ G ++ I++ +I IYH N V++ V+ KT V+ + + P+ IIIN Sbjct: 184 DIVFHGTEGAFKKNQIILKGTYAAFIPIYHTNNQCVLIGVIDKTQVNRKTDCSPNTIIIN 243 Query: 277 P 277 P Sbjct: 244 P 244 >UniRef50_C4SB24 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SB24_YERMO Length = 252 Score = 118 bits (295), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 70/245 (28%), Positives = 131/245 (53%), Gaps = 3/245 (1%) Query: 40 VTFKNQ--SSLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKF 97 +TF N +++G+ + R+ S SS I+ V+D P+LEK H+ NEI+ + + Sbjct: 5 ITFSNSNNTTVGIFKKFCRIKGISLSSGISFVMDATAPLLEKIISHHNDANEIEKIISQA 64 Query: 98 SIKESINRSRGKTEVTLEEYCSLIWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRE 157 + + R + EEY IW T+I + + ++ N+++ +G +EK IRE Sbjct: 65 YLLPTKPPIRSVPVRSQEEYYLAIWNTHIRYTIDSLDHNRYKHNSSNRRIGNNEKKSIRE 124 Query: 158 RLSSLRENYDMEKAIYIYNQRHFDVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTS 217 + E Y+ +K I+IY R ++ +G+SN+I+I T+++G +FD + +++ Sbjct: 125 SQGKIIEKYNAKKGIFIYIDRRISYAYKLSAGHSNLIMIKETSYDGVFFDFSKMIIVPIL 184 Query: 218 QLIIFGINEVLRRKGIVMPYPVVCWIDIYHVNEMVVMLPVLRKTDVSNRV-NVPDDIIIN 276 +LI GI+E + RK + +CWI +Y++N +++P++ + D S N II+ Sbjct: 185 ELITLGIDEAVSRKNDNVKIQCICWIPVYYINNKALIIPIVHEDDASALTKNGEKIIIVE 244 Query: 277 PYSQE 281 P+ E Sbjct: 245 PFKDE 249 >UniRef50_B3Y1H1 Predicted protein n=1 Tax=Escherichia coli O111:H- RepID=B3Y1H1_ECO11 Length = 178 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 57/176 (32%), Positives = 83/176 (47%), Gaps = 1/176 (0%) Query: 107 RGKTEVTLEEYCSLIWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENY 166 R K E+T E+ I+K++I + F + MG EK I+E + + E Y Sbjct: 3 RTKPEITKGEFFHSIYKSHIKYKYDVLDRKIFPHESTRNAMGVAEKKGIKENATLMLEYY 62 Query: 167 DMEKAIYIYNQRHFDVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINE 226 +EKAI IY R G+ ILI + F Y+FD ++ L +LI +G E Sbjct: 63 KVEKAICIYTNRKVSHTLNRAGGFYKTILIKTSVFGDYFFDFCNSVCLQIDELIEYGTKE 122 Query: 227 VLRRKGIVMPYPVVCWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDD-IIINPYSQE 281 +RR I I I+++N V++PVLR +VS D IIINP+ E Sbjct: 123 TVRRHQIRSTGFCTFHIPIFYINNKAVIVPVLRTEEVSQSSRTGGDVIIINPFEDE 178 >UniRef50_D2A7D3 Putative uncharacterized protein n=2 Tax=Shigella flexneri RepID=D2A7D3_SHIF2 Length = 304 Score = 63.5 bits (153), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 63/260 (24%), Positives = 113/260 (43%), Gaps = 43/260 (16%) Query: 40 VTFKNQS--SLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKF 97 +TF S +L ++D Y S S VI ++ P+L N + E+++ LL Sbjct: 5 ITFSMTSDETLRIVDEYCHTHKLSRSKVINALLSATAPVLNDINCYYQLAGELQSRLLNG 64 Query: 98 SIKESINRSRGKTEVTLEEYCSLIWKTNIMSPLKIPIADYFQLNA------------NDE 145 + + R R V+ E+YC IW+ + + I ++ N D+ Sbjct: 65 VYQRDLPRKRN--VVSAEKYCLEIWENKLFTRR---ILEFDSSNGVLYALKHKRHYRRDK 119 Query: 146 FMGKDEKTVIRE------RLSSLRENYD----MEKAIYIYNQRHFDVKHQSVSGYSNIIL 195 +G+ E I++ +LS + Y +E+ IY ++ + ++ G + I+L Sbjct: 120 MIGRVESRCIKDICEYQMQLSGEKTKYACFIYIERTIYNHDNPSDKIPVKAAVGNAVILL 179 Query: 196 IHRTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGI--VMPYP-VVCWIDIYHVNEMV 252 + Y+FD Q+ +S + L++ G KGI YP V CWI ++ +N V Sbjct: 180 AKDVIYNEYFFDLRQSFFVSVTDLMVSGA------KGIPETQTYPDVYCWIPLFSINSGV 233 Query: 253 VMLPV-----LRKTDVSNRV 267 ++ PV L+ V NR+ Sbjct: 234 LITPVYKIDPLKPVTVKNRI 253 >UniRef50_Q707G9 Putative uncharacterized protein n=9 Tax=Enterobacteriaceae RepID=Q707G9_ECOLX Length = 263 Score = 57.4 bits (137), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 67/263 (25%), Positives = 114/263 (43%), Gaps = 37/263 (14%) Query: 40 VTFKNQS--SLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKF 97 +TF S +L ++D Y S S VI ++ P+L N + ++++ LL Sbjct: 5 ITFNMTSDETLRIVDEYCHTHKLSRSKVIDALLSATAPVLNDINCYYQLAGKLQSRLLNG 64 Query: 98 SIKESINRSRGKTEVTLEEYCSLIWKTNIMSPLKIPIADY-----FQLN-----ANDEFM 147 + + R V+ E+YC IW+ + + +I DY + L D+ + Sbjct: 65 VYQRDLPHKRN--VVSAEKYCLEIWENKLFTK-RILEFDYSNGVLYALKHKRHYRRDKMI 121 Query: 148 GKDEKTVIRE------RLSSLRENYD----MEKAIYIYNQRHFDVKHQSVSGYSNIILIH 197 G+ E I++ +LS + Y +E+ IY ++ + +S G + I+L Sbjct: 122 GRVESRYIKDICEYQMQLSGEKTKYACFIYIERTIYNHDNPPDETPVKSAVGNAVILLAK 181 Query: 198 RTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGI--VMPYP-VVCWIDIYHVNEMVVM 254 + Y+FD ++ +S L+ G KGI YP V CWI ++ +N VV+ Sbjct: 182 DVIYNEYFFDLRKSFFVSVKDLMASGT------KGIPETQKYPDVYCWIPLFSINYGVVI 235 Query: 255 LPVLRKTDVSNRVNV--PDDIII 275 PV K D V V PD I + Sbjct: 236 TPVY-KIDPLKPVTVKKPDKITV 257 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P71296 Uncharacterized protein yagM n=3 Tax=Escherichia... 397 e-109 UniRef50_B2Q774 Putative uncharacterized protein n=1 Tax=Provide... 291 2e-77 UniRef50_C4SB24 Putative uncharacterized protein n=1 Tax=Yersini... 289 6e-77 UniRef50_D2A7D3 Putative uncharacterized protein n=2 Tax=Shigell... 252 9e-66 UniRef50_Q707G9 Putative uncharacterized protein n=9 Tax=Enterob... 251 2e-65 UniRef50_B3Y1H1 Predicted protein n=1 Tax=Escherichia coli O111:... 224 3e-57 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P71296 Uncharacterized protein yagM n=3 Tax=Escherichia coli RepID=YAGM_ECOLI Length = 284 Score = 397 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 284/284 (100%), Positives = 284/284 (100%) Query: 1 MSNSVTNFEMSSVLPGKKPCQGKNNESQVVQTTPIKKHSVTFKNQSSLGVIDHYARLTNK 60 MSNSVTNFEMSSVLPGKKPCQGKNNESQVVQTTPIKKHSVTFKNQSSLGVIDHYARLTNK Sbjct: 1 MSNSVTNFEMSSVLPGKKPCQGKNNESQVVQTTPIKKHSVTFKNQSSLGVIDHYARLTNK 60 Query: 61 SHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKFSIKESINRSRGKTEVTLEEYCSL 120 SHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKFSIKESINRSRGKTEVTLEEYCSL Sbjct: 61 SHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKFSIKESINRSRGKTEVTLEEYCSL 120 Query: 121 IWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENYDMEKAIYIYNQRHF 180 IWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENYDMEKAIYIYNQRHF Sbjct: 121 IWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENYDMEKAIYIYNQRHF 180 Query: 181 DVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYPVV 240 DVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYPVV Sbjct: 181 DVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYPVV 240 Query: 241 CWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDDIIINPYSQESRT 284 CWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDDIIINPYSQESRT Sbjct: 241 CWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDDIIINPYSQESRT 284 >UniRef50_B2Q774 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q774_PROST Length = 245 Score = 291 bits (745), Expect = 2e-77, Method: Composition-based stats. Identities = 80/244 (32%), Positives = 128/244 (52%), Gaps = 4/244 (1%) Query: 36 KKHSVTFKNQSSLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLL 95 ++ S+T K++SSL ID Y + SSVIA V+ A P+L + N+H +NE++N L Sbjct: 3 QRISITIKDESSLKAIDEYCLKHKLTRSSVIASVLTSAYPLLTEINQHTQLVNELENKL- 61 Query: 96 KFSIKESINRSRGKTEV-TLEEYCSLIWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTV 154 F+ K+ S G T V +L EY IW+T+I+ I + + +G +EK Sbjct: 62 -FNPKKIPRTSWGDTPVFSLPEYLLDIWQTHIIEKGIIIDQHAYPHTYKNPKVGSEEKKN 120 Query: 155 IRERLSSLRENYDMEKAIYIYNQRHFDVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLL 214 I+E L++ + ++KAI+IY RH K G+SN ILI T + Y FD + + Sbjct: 121 IQETLNNYIDPPHIQKAIFIYTDRHVHYKLHKAGGFSNTILIRNTEYNNYLFDFNAIIHV 180 Query: 215 STSQLIIFGINEVLRRKGIVMPYPVVCWIDIYHVNEMVVMLPVLRKTDVSNRVNV-PDDI 273 ++ G ++ I++ +I IYH N V++ V+ KT V+ + + P+ I Sbjct: 181 PIKDIVFHGTEGAFKKNQIILKGTYAAFIPIYHTNNQCVLIGVIDKTQVNRKTDCSPNTI 240 Query: 274 IINP 277 IINP Sbjct: 241 IINP 244 >UniRef50_C4SB24 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SB24_YERMO Length = 252 Score = 289 bits (740), Expect = 6e-77, Method: Composition-based stats. Identities = 70/245 (28%), Positives = 131/245 (53%), Gaps = 3/245 (1%) Query: 40 VTFKNQ--SSLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKF 97 +TF N +++G+ + R+ S SS I+ V+D P+LEK H+ NEI+ + + Sbjct: 5 ITFSNSNNTTVGIFKKFCRIKGISLSSGISFVMDATAPLLEKIISHHNDANEIEKIISQA 64 Query: 98 SIKESINRSRGKTEVTLEEYCSLIWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRE 157 + + R + EEY IW T+I + + ++ N+++ +G +EK IRE Sbjct: 65 YLLPTKPPIRSVPVRSQEEYYLAIWNTHIRYTIDSLDHNRYKHNSSNRRIGNNEKKSIRE 124 Query: 158 RLSSLRENYDMEKAIYIYNQRHFDVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTS 217 + E Y+ +K I+IY R ++ +G+SN+I+I T+++G +FD + +++ Sbjct: 125 SQGKIIEKYNAKKGIFIYIDRRISYAYKLSAGHSNLIMIKETSYDGVFFDFSKMIIVPIL 184 Query: 218 QLIIFGINEVLRRKGIVMPYPVVCWIDIYHVNEMVVMLPVLRKTDVSNRV-NVPDDIIIN 276 +LI GI+E + RK + +CWI +Y++N +++P++ + D S N II+ Sbjct: 185 ELITLGIDEAVSRKNDNVKIQCICWIPVYYINNKALIIPIVHEDDASALTKNGEKIIIVE 244 Query: 277 PYSQE 281 P+ E Sbjct: 245 PFKDE 249 >UniRef50_D2A7D3 Putative uncharacterized protein n=2 Tax=Shigella flexneri RepID=D2A7D3_SHIF2 Length = 304 Score = 252 bits (644), Expect = 9e-66, Method: Composition-based stats. Identities = 61/257 (23%), Positives = 112/257 (43%), Gaps = 31/257 (12%) Query: 40 VTFKNQS--SLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKF 97 +TF S +L ++D Y S S VI ++ P+L N + E+++ LL Sbjct: 5 ITFSMTSDETLRIVDEYCHTHKLSRSKVINALLSATAPVLNDINCYYQLAGELQSRLLNG 64 Query: 98 SIKESINRSRGKTEVTLEEYCSLIWKTNIMSPLKIPIAD----------YFQLNANDEFM 147 + + R R V+ E+YC IW+ + + +I D + + D+ + Sbjct: 65 VYQRDLPRKRN--VVSAEKYCLEIWENKLFTR-RILEFDSSNGVLYALKHKRHYRRDKMI 121 Query: 148 GKDEKTVIRE------RLSSLRENYD----MEKAIYIYNQRHFDVKHQSVSGYSNIILIH 197 G+ E I++ +LS + Y +E+ IY ++ + ++ G + I+L Sbjct: 122 GRVESRCIKDICEYQMQLSGEKTKYACFIYIERTIYNHDNPSDKIPVKAAVGNAVILLAK 181 Query: 198 RTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYP-VVCWIDIYHVNEMVVMLP 256 + Y+FD Q+ +S + L++ G + YP V CWI ++ +N V++ P Sbjct: 182 DVIYNEYFFDLRQSFFVSVTDLMVSGAKGIPE----TQTYPDVYCWIPLFSINSGVLITP 237 Query: 257 VLRKTDVSNRVNVPDDI 273 V K D V V + I Sbjct: 238 VY-KIDPLKPVTVKNRI 253 >UniRef50_Q707G9 Putative uncharacterized protein n=9 Tax=Enterobacteriaceae RepID=Q707G9_ECOLX Length = 263 Score = 251 bits (640), Expect = 2e-65, Method: Composition-based stats. Identities = 63/261 (24%), Positives = 111/261 (42%), Gaps = 33/261 (12%) Query: 40 VTFKNQS--SLGVIDHYARLTNKSHSSVIAEVVDLAIPILEKCNRHNWSINEIKNDLLKF 97 +TF S +L ++D Y S S VI ++ P+L N + ++++ LL Sbjct: 5 ITFNMTSDETLRIVDEYCHTHKLSRSKVIDALLSATAPVLNDINCYYQLAGKLQSRLLNG 64 Query: 98 SIKESINRSRGKTEVTLEEYCSLIWKTNIMSPLKIPIADYF----------QLNANDEFM 147 + + R V+ E+YC IW+ + + +I DY + D+ + Sbjct: 65 VYQRDLPHKRN--VVSAEKYCLEIWENKLFTK-RILEFDYSNGVLYALKHKRHYRRDKMI 121 Query: 148 GKDEKTVIRE------RLSSLRENYD----MEKAIYIYNQRHFDVKHQSVSGYSNIILIH 197 G+ E I++ +LS + Y +E+ IY ++ + +S G + I+L Sbjct: 122 GRVESRYIKDICEYQMQLSGEKTKYACFIYIERTIYNHDNPPDETPVKSAVGNAVILLAK 181 Query: 198 RTTFEGYYFDAGQALLLSTSQLIIFGINEVLRRKGIVMPYP-VVCWIDIYHVNEMVVMLP 256 + Y+FD ++ +S L+ G + YP V CWI ++ +N VV+ P Sbjct: 182 DVIYNEYFFDLRKSFFVSVKDLMASGTKGIPE----TQKYPDVYCWIPLFSINYGVVITP 237 Query: 257 VLRKTDVSNRVNV--PDDIII 275 V K D V V PD I + Sbjct: 238 VY-KIDPLKPVTVKKPDKITV 257 >UniRef50_B3Y1H1 Predicted protein n=1 Tax=Escherichia coli O111:H- RepID=B3Y1H1_ECO11 Length = 178 Score = 224 bits (570), Expect = 3e-57, Method: Composition-based stats. Identities = 57/176 (32%), Positives = 83/176 (47%), Gaps = 1/176 (0%) Query: 107 RGKTEVTLEEYCSLIWKTNIMSPLKIPIADYFQLNANDEFMGKDEKTVIRERLSSLRENY 166 R K E+T E+ I+K++I + F + MG EK I+E + + E Y Sbjct: 3 RTKPEITKGEFFHSIYKSHIKYKYDVLDRKIFPHESTRNAMGVAEKKGIKENATLMLEYY 62 Query: 167 DMEKAIYIYNQRHFDVKHQSVSGYSNIILIHRTTFEGYYFDAGQALLLSTSQLIIFGINE 226 +EKAI IY R G+ ILI + F Y+FD ++ L +LI +G E Sbjct: 63 KVEKAICIYTNRKVSHTLNRAGGFYKTILIKTSVFGDYFFDFCNSVCLQIDELIEYGTKE 122 Query: 227 VLRRKGIVMPYPVVCWIDIYHVNEMVVMLPVLRKTDVSNRVNVPDD-IIINPYSQE 281 +RR I I I+++N V++PVLR +VS D IIINP+ E Sbjct: 123 TVRRHQIRSTGFCTFHIPIFYINNKAVIVPVLRTEEVSQSSRTGGDVIIINPFEDE 178 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.129 0.324 Lambda K H 0.267 0.0397 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 918,708,483 Number of Sequences: 3077464 Number of extensions: 31974306 Number of successful extensions: 91045 Number of sequences better than 1.0e-01: 7 Number of HSP's better than 0.1 without gapping: 10 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 91015 Number of HSP's gapped (non-prelim): 14 length of query: 284 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 157 effective length of database: 649,558,428 effective search space: 101980673196 effective search space used: 101980673196 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 92 (40.1 bits)