BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (318 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P52123 Uncharacterized protein yfjH n=4 Tax=Escherichia... 664 0.0 UniRef50_C4USJ6 Putative uncharacterized protein n=1 Tax=Yersini... 91 8e-17 UniRef50_C4SB26 Putative uncharacterized protein n=1 Tax=Yersini... 64 7e-09 UniRef50_B2K8D0 Putative uncharacterized protein n=1 Tax=Yersini... 50 8e-05 UniRef50_B2Q776 Putative uncharacterized protein n=1 Tax=Provide... 43 0.014 >UniRef50_P52123 Uncharacterized protein yfjH n=4 Tax=Escherichia coli RepID=YFJH_ECOLI Length = 318 Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust. Identities = 318/318 (100%), Positives = 318/318 (100%) Query: 1 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV 60 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV Sbjct: 1 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV 60 Query: 61 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD 120 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD Sbjct: 61 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD 120 Query: 121 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA 180 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA Sbjct: 121 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA 180 Query: 181 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS 240 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS Sbjct: 181 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS 240 Query: 241 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN 300 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN Sbjct: 241 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN 300 Query: 301 EDVKQAFDSAVRRKKRSK 318 EDVKQAFDSAVRRKKRSK Sbjct: 301 EDVKQAFDSAVRRKKRSK 318 >UniRef50_C4USJ6 Putative uncharacterized protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4USJ6_YERRO Length = 246 Score = 90.5 bits (223), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 60/213 (28%), Positives = 112/213 (52%), Gaps = 15/213 (7%) Query: 110 EIKEE---YTELYDPAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWG 166 E+K+E + Y CWK L K T +PL+ + FK L A + WG Sbjct: 44 EVKDEVLKFRSRYIDHDACWKQLK---LKWTTIPLMHSAIGLKQTFKDELFAAGVFQLWG 100 Query: 167 KSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVY 226 + I ++ +A ++F A + ++C G FN + +KK+S RR + KGGK+KAE Y Sbjct: 101 RDIWEVDQEMAVESFLHANRILNECRGCVDFNQLIDREKKVSAGRRQSAIKGGKAKAEHY 160 Query: 227 HIIQLKLVELINDSVPND-GWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQ 285 ++ +++ L++ + P D GW+ + VA + L F++ + + +N+ +++ Sbjct: 161 IPVKQEVIRLLHKNAPPDGGWQKRTVAARAIELELMSFVE--KMKAHNEG-----LDLNE 213 Query: 286 DALVDTILNQWSLKNEDVKQAFDSAVRRKKRSK 318 D L T++ +W+ ++ +V+ AF++ VR K+ K Sbjct: 214 DELSATMV-RWAREDSEVRAAFEATVRVKRGKK 245 >UniRef50_C4SB26 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SB26_YERMO Length = 256 Score = 63.9 bits (154), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 44/171 (25%), Positives = 87/171 (50%), Gaps = 9/171 (5%) Query: 147 SINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCNQKK 206 +N+ F+ +L+ A I + W K+ + ++ ++ +A + A L +C G + +++ Sbjct: 91 GLNENFRLNLLLAGIEYQWAKAAWNYDKRLSVQALNVALMLLHECSGACMCCELIDGERE 150 Query: 207 LSEVRRSAGKKGGKSKAEVYHIIQLKLVELIND-SVPNDGWKNKVVAVNELIEPLWDFIQ 265 + V+ A KGG +KAEVY+ I + + L+ S + WK+ A + +PLW FI Sbjct: 151 IHAVKVKAAAKGGVTKAEVYNPIIKEAIRLLQSTSQTGEKWKHWTDAAKAIEQPLWSFI- 209 Query: 266 MSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVKQAFDSAVRRKKR 316 +E +++NQ + ++ LVD I W K+ ++ + V + K+ Sbjct: 210 -AEQKVDNQ-----TIDLKEENLVDKI-QLWGKKDIELGTTLNEKVLKPKK 253 >UniRef50_B2K8D0 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis PB1/+ RepID=B2K8D0_YERPB Length = 241 Score = 50.4 bits (119), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 38/176 (21%), Positives = 86/176 (48%), Gaps = 14/176 (7%) Query: 144 EPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCN 203 EP ++++ + A ++++ G +++ N++ A K E D C G+ + + Sbjct: 77 EPDTLSETMDELVFLADVNWYLGLGLLNTNKSEAIKCLLLGIEYLDYCRGLE--DQELWQ 134 Query: 204 QKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVVAVNELIEPLWDF 263 Q++ +++ +GG+SKA + ++ +++ L+ + P GW++K A+ + + + Sbjct: 135 QQQTTKL--DVPVQGGRSKAARFDPVKDRIISLLQEYCPPGGWESKQDAIRGIENGINEL 192 Query: 264 IQMSEFEINNQNKKYRVATMSQDALVDTILNQ---WSLKNEDVKQAFDSAVRRKKR 316 S + NN +K S D + + Q WS ++ ++ AFD V+ KK+ Sbjct: 193 EWPSARDRNNLSK-------SGDEIFAMKIRQVEVWSSRDPKIEAAFDCVVKPKKK 241 >UniRef50_B2Q776 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q776_PROST Length = 257 Score = 43.1 bits (100), Expect = 0.014, Method: Compositional matrix adjust. Identities = 37/146 (25%), Positives = 70/146 (47%), Gaps = 27/146 (18%) Query: 188 FDKCIGMTWFNISVCNQKKLSEV----------------RRSAGKKGGKSKAEVYHIIQL 231 FDK + +++C KL + R S+ KKGG AE+ + + Sbjct: 117 FDKKAAVEALIVAICRLAKLGSIEGYRELLKIERECRLKRISSSKKGGNVVAELAALFRN 176 Query: 232 KLVELINDSVPNDG--WKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALV 289 + + L++ + +DG WK+K AV + LW FI+ + K+ + Q++L Sbjct: 177 EAIRLLHRA-KSDGKEWKSKKKAVEAIENELWQFIE-------TKRKEGHKTLLKQESLN 228 Query: 290 DTILNQWSLKNEDVKQAFDSAVRRKK 315 D +L +W+ ++D++ A ++ V+ KK Sbjct: 229 DAVL-RWAKCHDDLRFALENVVKNKK 253 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52123 Uncharacterized protein yfjH n=4 Tax=Escherichia... 538 e-151 UniRef50_C4USJ6 Putative uncharacterized protein n=1 Tax=Yersini... 256 8e-67 UniRef50_B2K8D0 Putative uncharacterized protein n=1 Tax=Yersini... 206 7e-52 UniRef50_C4SB26 Putative uncharacterized protein n=1 Tax=Yersini... 198 2e-49 Sequences not found previously or not previously below threshold: UniRef50_B2Q776 Putative uncharacterized protein n=1 Tax=Provide... 85 4e-15 UniRef50_B7LDL5 Putative uncharacterized protein n=1 Tax=Escheri... 45 0.004 UniRef50_A4G676 Putative uncharacterized protein n=1 Tax=Hermini... 42 0.022 >UniRef50_P52123 Uncharacterized protein yfjH n=4 Tax=Escherichia coli RepID=YFJH_ECOLI Length = 318 Score = 538 bits (1386), Expect = e-151, Method: Composition-based stats. Identities = 318/318 (100%), Positives = 318/318 (100%) Query: 1 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV 60 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV Sbjct: 1 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV 60 Query: 61 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD 120 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD Sbjct: 61 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD 120 Query: 121 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA 180 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA Sbjct: 121 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA 180 Query: 181 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS 240 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS Sbjct: 181 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS 240 Query: 241 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN 300 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN Sbjct: 241 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN 300 Query: 301 EDVKQAFDSAVRRKKRSK 318 EDVKQAFDSAVRRKKRSK Sbjct: 301 EDVKQAFDSAVRRKKRSK 318 >UniRef50_C4USJ6 Putative uncharacterized protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4USJ6_YERRO Length = 246 Score = 256 bits (653), Expect = 8e-67, Method: Composition-based stats. Identities = 60/213 (28%), Positives = 111/213 (52%), Gaps = 15/213 (7%) Query: 110 EIKEE---YTELYDPAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWG 166 E+K+E + Y CWK L K T +PL+ + FK L A + WG Sbjct: 44 EVKDEVLKFRSRYIDHDACWKQLK---LKWTTIPLMHSAIGLKQTFKDELFAAGVFQLWG 100 Query: 167 KSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVY 226 + I ++ +A ++F A + ++C G FN + +KK+S RR + KGGK+KAE Y Sbjct: 101 RDIWEVDQEMAVESFLHANRILNECRGCVDFNQLIDREKKVSAGRRQSAIKGGKAKAEHY 160 Query: 227 HIIQLKLVELINDSVPND-GWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQ 285 ++ +++ L++ + P D GW+ + VA + L F++ + + +N+ +++ Sbjct: 161 IPVKQEVIRLLHKNAPPDGGWQKRTVAARAIELELMSFVE--KMKAHNEG-----LDLNE 213 Query: 286 DALVDTILNQWSLKNEDVKQAFDSAVRRKKRSK 318 D L T++ W+ ++ +V+ AF++ VR K+ K Sbjct: 214 DELSATMVR-WAREDSEVRAAFEATVRVKRGKK 245 >UniRef50_B2K8D0 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis PB1/+ RepID=B2K8D0_YERPB Length = 241 Score = 206 bits (525), Expect = 7e-52, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 86/176 (48%), Gaps = 14/176 (7%) Query: 144 EPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCN 203 EP ++++ + A ++++ G +++ N++ A K E D C G+ + + Sbjct: 77 EPDTLSETMDELVFLADVNWYLGLGLLNTNKSEAIKCLLLGIEYLDYCRGLE--DQELWQ 134 Query: 204 QKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVVAVNELIEPLWDF 263 Q++ +++ +GG+SKA + ++ +++ L+ + P GW++K A+ + + + Sbjct: 135 QQQTTKL--DVPVQGGRSKAARFDPVKDRIISLLQEYCPPGGWESKQDAIRGIENGINEL 192 Query: 264 IQMSEFEINNQNKKYRVATMSQDALVDTILNQ---WSLKNEDVKQAFDSAVRRKKR 316 S + NN +K S D + + Q WS ++ ++ AFD V+ KK+ Sbjct: 193 EWPSARDRNNLSK-------SGDEIFAMKIRQVEVWSSRDPKIEAAFDCVVKPKKK 241 >UniRef50_C4SB26 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SB26_YERMO Length = 256 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 44/171 (25%), Positives = 87/171 (50%), Gaps = 9/171 (5%) Query: 147 SINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCNQKK 206 +N+ F+ +L+ A I + W K+ + ++ ++ +A + A L +C G + +++ Sbjct: 91 GLNENFRLNLLLAGIEYQWAKAAWNYDKRLSVQALNVALMLLHECSGACMCCELIDGERE 150 Query: 207 LSEVRRSAGKKGGKSKAEVYHIIQLKLVELIND-SVPNDGWKNKVVAVNELIEPLWDFIQ 265 + V+ A KGG +KAEVY+ I + + L+ S + WK+ A + +PLW FI Sbjct: 151 IHAVKVKAAAKGGVTKAEVYNPIIKEAIRLLQSTSQTGEKWKHWTDAAKAIEQPLWSFI- 209 Query: 266 MSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVKQAFDSAVRRKKR 316 +E +++NQ + ++ LVD I W K+ ++ + V + K+ Sbjct: 210 -AEQKVDNQ-----TIDLKEENLVDKI-QLWGKKDIELGTTLNEKVLKPKK 253 >UniRef50_B2Q776 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q776_PROST Length = 257 Score = 84.8 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 84/191 (43%), Gaps = 9/191 (4%) Query: 129 LSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELF 188 +S +E + +N KA ++ I+ W + ++ A +A A Sbjct: 74 VSEIRVREYMMMFRSSGEGLNHYTKALFLWIGINKSWSEFCWDFDKKAAVEALIVAICRL 133 Query: 189 DKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDG-WK 247 K + + + +++ R S+ KKGG AE+ + + + + L++ + + WK Sbjct: 134 AKLGSIEGYRELLKIERECRLKRISSSKKGGNVVAELAALFRNEAIRLLHRAKSDGKEWK 193 Query: 248 NKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVKQAF 307 +K AV + LW FI+ E + + Q++L D +L W+ ++D++ A Sbjct: 194 SKKKAVEAIENELWQFIETKRKEGHK-------TLLKQESLNDAVLR-WAKCHDDLRFAL 245 Query: 308 DSAVRRKKRSK 318 ++ V+ KK + Sbjct: 246 ENVVKNKKTRR 256 >UniRef50_B7LDL5 Putative uncharacterized protein n=1 Tax=Escherichia coli 55989 RepID=B7LDL5_ECO55 Length = 293 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. Identities = 37/170 (21%), Positives = 66/170 (38%), Gaps = 20/170 (11%) Query: 147 SINDIFKAH-----LMFASISFFWGKSIMSE-NENVAFKAFHRAAELFDKC------IGM 194 + ++ AH L A F+W + S N+ + K A + C + Sbjct: 121 GLARLYDAHYEYIYLSMALEDFYWAQFQYSLGNKKESCKFLLDAYSYMNVCISRDDNRDL 180 Query: 195 TWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVVAVN 254 F K+ RS+ GGK+K I+ +V L+ D + WKNK Sbjct: 181 PSFYRLRSGFDKVCRRARSSA--GGKNKLAELEYIKDTVVILLEDIPSHSDWKNK----R 234 Query: 255 ELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVK 304 +L+E L +I ++ ++N ++ + LN+WS + V+ Sbjct: 235 QLLEYLMPYICNNDKKVNPDKWINDTGD--REETIRRRLNKWSTGDLKVE 282 >UniRef50_A4G676 Putative uncharacterized protein n=1 Tax=Herminiimonas arsenicoxydans RepID=A4G676_HERAR Length = 288 Score = 42.4 bits (98), Expect = 0.022, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 25/48 (52%) Query: 215 GKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVVAVNELIEPLWD 262 +GG KA Y +I+ + EL+ P +GW + A+ +++ L D Sbjct: 202 ASEGGNGKARRYELIKDNVAELLMKRAPVEGWASTSAAIKTIVDELID 249 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52123 Uncharacterized protein yfjH n=4 Tax=Escherichia... 523 e-147 UniRef50_C4USJ6 Putative uncharacterized protein n=1 Tax=Yersini... 242 2e-62 UniRef50_B2Q776 Putative uncharacterized protein n=1 Tax=Provide... 203 9e-51 UniRef50_B2K8D0 Putative uncharacterized protein n=1 Tax=Yersini... 196 1e-48 UniRef50_C4SB26 Putative uncharacterized protein n=1 Tax=Yersini... 190 6e-47 Sequences not found previously or not previously below threshold: UniRef50_B7LDL5 Putative uncharacterized protein n=1 Tax=Escheri... 46 0.002 UniRef50_A4G676 Putative uncharacterized protein n=1 Tax=Hermini... 43 0.016 UniRef50_B4EDK5 Putative uncharacterized protein n=1 Tax=Burkhol... 41 0.081 >UniRef50_P52123 Uncharacterized protein yfjH n=4 Tax=Escherichia coli RepID=YFJH_ECOLI Length = 318 Score = 523 bits (1348), Expect = e-147, Method: Composition-based stats. Identities = 318/318 (100%), Positives = 318/318 (100%) Query: 1 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV 60 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV Sbjct: 1 MGIFKAKNPCTKNTIFTTSNTLIYGGFMISLNDFYEQICRKRRDLAYHMSECEWAVDTDV 60 Query: 61 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD 120 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD Sbjct: 61 LEEDHPEIRIELGRMREQFWSSEKIGTRVRLYSCDVPWETRHHTVNGQLEIKEEYTELYD 120 Query: 121 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA 180 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA Sbjct: 121 PAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKA 180 Query: 181 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS 240 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS Sbjct: 181 FHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDS 240 Query: 241 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN 300 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN Sbjct: 241 VPNDGWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKN 300 Query: 301 EDVKQAFDSAVRRKKRSK 318 EDVKQAFDSAVRRKKRSK Sbjct: 301 EDVKQAFDSAVRRKKRSK 318 >UniRef50_C4USJ6 Putative uncharacterized protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4USJ6_YERRO Length = 246 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 60/213 (28%), Positives = 111/213 (52%), Gaps = 15/213 (7%) Query: 110 EIKEE---YTELYDPAQECWKNLSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWG 166 E+K+E + Y CWK L K T +PL+ + FK L A + WG Sbjct: 44 EVKDEVLKFRSRYIDHDACWKQLK---LKWTTIPLMHSAIGLKQTFKDELFAAGVFQLWG 100 Query: 167 KSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVY 226 + I ++ +A ++F A + ++C G FN + +KK+S RR + KGGK+KAE Y Sbjct: 101 RDIWEVDQEMAVESFLHANRILNECRGCVDFNQLIDREKKVSAGRRQSAIKGGKAKAEHY 160 Query: 227 HIIQLKLVELINDSVPND-GWKNKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQ 285 ++ +++ L++ + P D GW+ + VA + L F++ + + +N+ +++ Sbjct: 161 IPVKQEVIRLLHKNAPPDGGWQKRTVAARAIELELMSFVE--KMKAHNEG-----LDLNE 213 Query: 286 DALVDTILNQWSLKNEDVKQAFDSAVRRKKRSK 318 D L T++ W+ ++ +V+ AF++ VR K+ K Sbjct: 214 DELSATMVR-WAREDSEVRAAFEATVRVKRGKK 245 >UniRef50_B2Q776 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q776_PROST Length = 257 Score = 203 bits (515), Expect = 9e-51, Method: Composition-based stats. Identities = 40/191 (20%), Positives = 84/191 (43%), Gaps = 9/191 (4%) Query: 129 LSSNLTKETFLPLVIEPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELF 188 +S +E + +N KA ++ I+ W + ++ A +A A Sbjct: 74 VSEIRVREYMMMFRSSGEGLNHYTKALFLWIGINKSWSEFCWDFDKKAAVEALIVAICRL 133 Query: 189 DKCIGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDG-WK 247 K + + + +++ R S+ KKGG AE+ + + + + L++ + + WK Sbjct: 134 AKLGSIEGYRELLKIERECRLKRISSSKKGGNVVAELAALFRNEAIRLLHRAKSDGKEWK 193 Query: 248 NKVVAVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVKQAF 307 +K AV + LW FI+ E + + Q++L D +L W+ ++D++ A Sbjct: 194 SKKKAVEAIENELWQFIETKRKEGHK-------TLLKQESLNDAVLR-WAKCHDDLRFAL 245 Query: 308 DSAVRRKKRSK 318 ++ V+ KK + Sbjct: 246 ENVVKNKKTRR 256 >UniRef50_B2K8D0 Putative uncharacterized protein n=1 Tax=Yersinia pseudotuberculosis PB1/+ RepID=B2K8D0_YERPB Length = 241 Score = 196 bits (497), Expect = 1e-48, Method: Composition-based stats. Identities = 38/176 (21%), Positives = 86/176 (48%), Gaps = 14/176 (7%) Query: 144 EPFSINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCN 203 EP ++++ + A ++++ G +++ N++ A K E D C G+ + + Sbjct: 77 EPDTLSETMDELVFLADVNWYLGLGLLNTNKSEAIKCLLLGIEYLDYCRGLE--DQELWQ 134 Query: 204 QKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVVAVNELIEPLWDF 263 Q++ +++ +GG+SKA + ++ +++ L+ + P GW++K A+ + + + Sbjct: 135 QQQTTKL--DVPVQGGRSKAARFDPVKDRIISLLQEYCPPGGWESKQDAIRGIENGINEL 192 Query: 264 IQMSEFEINNQNKKYRVATMSQDALVDTILNQ---WSLKNEDVKQAFDSAVRRKKR 316 S + NN +K S D + + Q WS ++ ++ AFD V+ KK+ Sbjct: 193 EWPSARDRNNLSK-------SGDEIFAMKIRQVEVWSSRDPKIEAAFDCVVKPKKK 241 >UniRef50_C4SB26 Putative uncharacterized protein n=1 Tax=Yersinia mollaretii ATCC 43969 RepID=C4SB26_YERMO Length = 256 Score = 190 bits (482), Expect = 6e-47, Method: Composition-based stats. Identities = 44/171 (25%), Positives = 87/171 (50%), Gaps = 9/171 (5%) Query: 147 SINDIFKAHLMFASISFFWGKSIMSENENVAFKAFHRAAELFDKCIGMTWFNISVCNQKK 206 +N+ F+ +L+ A I + W K+ + ++ ++ +A + A L +C G + +++ Sbjct: 91 GLNENFRLNLLLAGIEYQWAKAAWNYDKRLSVQALNVALMLLHECSGACMCCELIDGERE 150 Query: 207 LSEVRRSAGKKGGKSKAEVYHIIQLKLVELIND-SVPNDGWKNKVVAVNELIEPLWDFIQ 265 + V+ A KGG +KAEVY+ I + + L+ S + WK+ A + +PLW FI Sbjct: 151 IHAVKVKAAAKGGVTKAEVYNPIIKEAIRLLQSTSQTGEKWKHWTDAAKAIEQPLWSFI- 209 Query: 266 MSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVKQAFDSAVRRKKR 316 +E +++NQ + ++ LVD I W K+ ++ + V + K+ Sbjct: 210 -AEQKVDNQ-----TIDLKEENLVDKI-QLWGKKDIELGTTLNEKVLKPKK 253 >UniRef50_B7LDL5 Putative uncharacterized protein n=1 Tax=Escherichia coli 55989 RepID=B7LDL5_ECO55 Length = 293 Score = 45.9 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 59/293 (20%), Positives = 107/293 (36%), Gaps = 38/293 (12%) Query: 31 LNDFYEQICRKRRDLAYHMSECEWAVDTDVLEEDHPEIRIELGRMR---EQFWSSEKIGT 87 +DFY Q+C D + E EW ++ + E + +R R E+F E Sbjct: 9 FDDFYLQVCTACED---KLLELEWKLEVEKKESNQDVLRTYRQRAYDKIEEFKIIEYSKI 65 Query: 88 RVRLY-----SCDVPWETRHHTVNG-QLEIKEEYTELYDPAQECWKNLSSNLTKETFLPL 141 + + + S ++ + V+G +E Y E Y + + Sbjct: 66 QCQYFVDMTNSDNIRYSQEFINVDGIMVEKPFIYKETY----------AEYRIEIRGKMD 115 Query: 142 VIEPFSINDIFKAH-----LMFASISFFWGKSIMSE-NENVAFKAFHRAAELFDKCIGMT 195 + + ++ AH L A F+W + S N+ + K A + CI Sbjct: 116 GYKLTGLARLYDAHYEYIYLSMALEDFYWAQFQYSLGNKKESCKFLLDAYSYMNVCISRD 175 Query: 196 WFNISVCNQKKLS----EVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVV 251 + S RR+ GGK+K I+ +V L+ D + WKNK Sbjct: 176 DNRDLPSFYRLRSGFDKVCRRARSSAGGKNKLAELEYIKDTVVILLEDIPSHSDWKNK-- 233 Query: 252 AVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVK 304 +L+E L +I ++ ++N ++ + LN+WS + V+ Sbjct: 234 --RQLLEYLMPYICNNDKKVNPDKWINDTGD--REETIRRRLNKWSTGDLKVE 282 >UniRef50_A4G676 Putative uncharacterized protein n=1 Tax=Herminiimonas arsenicoxydans RepID=A4G676_HERAR Length = 288 Score = 42.8 bits (99), Expect = 0.016, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 25/48 (52%) Query: 215 GKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVVAVNELIEPLWD 262 +GG KA Y +I+ + EL+ P +GW + A+ +++ L D Sbjct: 202 ASEGGNGKARRYELIKDNVAELLMKRAPVEGWASTSAAIKTIVDELID 249 >UniRef50_B4EDK5 Putative uncharacterized protein n=1 Tax=Burkholderia cenocepacia J2315 RepID=B4EDK5_BURCJ Length = 285 Score = 40.5 bits (93), Expect = 0.081, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 44/126 (34%), Gaps = 15/126 (11%) Query: 192 IGMTWFNISVCNQKKLSEVRRSAGKKGGKSKAEVYHIIQLKLVELINDSVPNDGWKNKVV 251 G+ W + AG GG + + ++ K+ EL+ P +GW + + Sbjct: 174 RGVYWSRDEMLIVDPTRRFTERAGT-GGTATGLLREPVKEKVAELLKSLAPEEGWGSTQI 232 Query: 252 AVNELIEPLWDFIQMSEFEINNQNKKYRVATMSQDALVDTILNQWSLKNEDVKQAFDSAV 311 A++ + L D N + + + L TI QW + F V Sbjct: 233 AIDTVASYLND----------NHSHDVESCHLKLENLPRTI-KQWLDDEPE---RFPHCV 278 Query: 312 RRKKRS 317 + ++ Sbjct: 279 KPRQSK 284 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.126 0.345 Lambda K H 0.267 0.0389 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,604,979,787 Number of Sequences: 3077464 Number of extensions: 56359449 Number of successful extensions: 159465 Number of sequences better than 1.0e-01: 8 Number of HSP's better than 0.1 without gapping: 15 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 159423 Number of HSP's gapped (non-prelim): 23 length of query: 318 length of database: 1,040,396,356 effective HSP length: 128 effective length of query: 190 effective length of database: 646,480,964 effective search space: 122831383160 effective search space used: 122831383160 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 93 (40.5 bits)