BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (352 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

UniRef50_P76505 Uncharacterized protein yfdF n=32 Tax=Enterobact...     722   0.0
UniRef50_B2TWW0 Transposase n=66 Tax=Enterobacteriaceae RepID=B2...     119   1e-25

>UniRef50_P76505 Uncharacterized protein yfdF n=32 Tax=Enterobacteriaceae RepID=YFDF_ECOLI
          Length = 352

 Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 352/352 (100%), Positives = 352/352 (100%) Query: 1 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV 60 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV Sbjct: 1 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV 60 Query: 61 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK 120 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK Sbjct: 61 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK 120 Query: 121 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK 180 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK Sbjct: 121 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK 180 Query: 181 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA 240 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA Sbjct: 181 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA 240 Query: 241 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL 300 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL Sbjct: 241 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL 300 Query: 301 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI 352 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI Sbjct: 301 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI 352 >UniRef50_B2TWW0 Transposase n=66 Tax=Enterobacteriaceae RepID=B2TWW0_SHIB3 Length = 137 Score = 119 bits (299), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 63/121 (52%), Positives = 85/121 (70%), Gaps = 5/121 (4%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVDDAKYKCD 291 S+ A+ +F++G+ N + L+ P+ ++ Y RAIRY+FQVDDAK++ D Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYRRAIRYNFQVDDAKFRRD 76 Query: 292 HLKEIVSTLVGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDD 351 ++KEI+STL NK++V H + YK FKDLE K+E+RLQNRQ EYQNEINQ SAP VNFDD Sbjct: 77 NVKEIISTLFANKVDVDHPENKYKDFKDLEDKVEKRLQNRQTEYQNEINQLSAPDVNFDD 136 Query: 352 I 352 I Sbjct: 137 I 137 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76505 Uncharacterized protein yfdF n=32 Tax=Enterobact... 677 0.0 UniRef50_B2TWW0 Transposase n=66 Tax=Enterobacteriaceae RepID=B2... 205 3e-51 Sequences not found previously or not previously below threshold: UniRef50_Q83JE7 Putative uncharacterized protein n=1 Tax=Shigell... 66 2e-09 UniRef50_D2A8R2 Putative alpha-mannosidase n=1 Tax=Shigella flex... 62 3e-08 UniRef50_D2AB32 IS600 ORF2 n=9 Tax=Enterobacteriaceae RepID=D2AB... 62 3e-08 UniRef50_D2AA76 Putative alpha-mannosidase n=1 Tax=Shigella flex... 61 7e-08 UniRef50_D2ADC7 Putative integrase encoded by prophage CP-933K n... 60 1e-07 UniRef50_B2TVU4 Galactose binding protein n=5 Tax=Shigella RepID... 59 2e-07 UniRef50_D2AEE4 IS600 ORF2 n=1 Tax=Shigella flexneri 2002017 Rep... 59 2e-07 UniRef50_Q0T0H3 Outer membrane fimbrial user protein n=206 Tax=E... 59 3e-07 UniRef50_B2TV45 Transposase n=5 Tax=Shigella RepID=B2TV45_SHIB3 59 3e-07 UniRef50_B2TUB7 Glycosy hydrolase, family 38 n=69 Tax=Enterobact... 59 3e-07 UniRef50_Q70W42 Transposase n=3 Tax=Enterobacteriaceae RepID=Q70... 59 3e-07 UniRef50_D2AIK9 FAD linked oxidase domain protein n=2 Tax=Shigel... 59 3e-07 UniRef50_B2TYZ1 Conserved domain protein n=2 Tax=Shigella RepID=... 57 1e-06 UniRef50_Q54ZT8 Putative uncharacterized protein n=1 Tax=Dictyos... 
42 0.038 >UniRef50_P76505 Uncharacterized protein yfdF n=32 Tax=Enterobacteriaceae RepID=YFDF_ECOLI Length = 352 Score = 677 bits (1747), Expect = 0.0, Method: Composition-based stats. Identities = 352/352 (100%), Positives = 352/352 (100%) Query: 1 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV 60 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV Sbjct: 1 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV 60 Query: 61 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK 120 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK Sbjct: 61 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK 120 Query: 121 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK 180 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK Sbjct: 121 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK 180 Query: 181 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA 240 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA Sbjct: 181 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA 240 Query: 241 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL 300 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL Sbjct: 241 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL 300 Query: 301 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI 352 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI Sbjct: 301 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI 352 >UniRef50_B2TWW0 Transposase n=66 Tax=Enterobacteriaceae RepID=B2TWW0_SHIB3 Length = 137 Score = 205 bits (521), Expect = 3e-51, Method: Composition-based stats. 
Identities = 63/121 (52%), Positives = 85/121 (70%), Gaps = 5/121 (4%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVDDAKYKCD 291 S+ A+ +F++G+ N + L+ P+ ++ Y RAIRY+FQVDDAK++ D Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYRRAIRYNFQVDDAKFRRD 76 Query: 292 HLKEIVSTLVGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDD 351 ++KEI+STL NK++V H + YK FKDLE K+E+RLQNRQ EYQNEINQ SAP VNFDD Sbjct: 77 NVKEIISTLFANKVDVDHPENKYKDFKDLEDKVEKRLQNRQTEYQNEINQLSAPDVNFDD 136 Query: 352 I 352 I Sbjct: 137 I 137 >UniRef50_Q83JE7 Putative uncharacterized protein n=1 Tax=Shigella flexneri RepID=Q83JE7_SHIFL Length = 142 Score = 65.8 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 11/52 (21%), Positives = 25/52 (48%), Gaps = 5/52 (9%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQV 283 S+ A+ +F++G+ N + L+ P+ ++ Y R I Y++ + Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSISWQHYLRLIPYNWTI 68 >UniRef50_D2A8R2 Putative alpha-mannosidase n=1 Tax=Shigella flexneri 2002017 RepID=D2A8R2_SHIF2 Length = 147 Score = 62.0 bits (149), Expect = 3e-08, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 27/66 (40%), Gaps = 10/66 (15%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVDDAKYKCD 291 S+ A+ +F++G+ N + L+ P+ ++ Y ++ D K Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYR-----GYKPPDRKEYRH 58 Query: 292 HLKEIV 297 H+ E V Sbjct: 59 HVPERV 64 >UniRef50_D2AB32 IS600 ORF2 n=9 Tax=Enterobacteriaceae RepID=D2AB32_SHIF2 Length = 317 Score = 62.0 bits (149), Expect = 3e-08, Method: Composition-based stats. 
Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 15/82 (18%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVDDAKYKCD 291 S+ A+ +F++G+ N + L+ P+ ++ Y + A Y+C Sbjct: 11 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQH------YPWSETKAAYRCF 64 Query: 292 HLKEIVSTLVGNKINVGHSQKI 313 ++ NKI H + I Sbjct: 65 SNDKVS----ANKIMTPHKENI 82 >UniRef50_D2AA76 Putative alpha-mannosidase n=1 Tax=Shigella flexneri 2002017 RepID=D2AA76_SHIF2 Length = 134 Score = 60.8 bits (146), Expect = 7e-08, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 32/81 (39%), Gaps = 12/81 (14%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY-------GRAIRYDFQVD 284 S+ A+ +F++G+ N + L+ P+ ++ Y GR R D Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKLVSWQHYLNLHGFVGRIRRVSVASD 63 Query: 285 DAKYKCDHLKEIVSTLVGNKI 305 K + ++ +S L ++ Sbjct: 64 INKAHFTNYRQWISPLYFRRL 84 >UniRef50_D2ADC7 Putative integrase encoded by prophage CP-933K n=2 Tax=Enterobacteriaceae RepID=D2ADC7_SHIF2 Length = 178 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 19/43 (44%), Gaps = 5/43 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYG 274 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYR 46 >UniRef50_B2TVU4 Galactose binding protein n=5 Tax=Shigella RepID=B2TVU4_SHIB3 Length = 137 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 11/52 (21%), Positives = 23/52 (44%), Gaps = 7/52 (13%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY--GRAIRYDF 281 S+ A+ +F++G+ N + L+ P+ ++ Y AI + F Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYHANEAINHGF 68 >UniRef50_D2AEE4 IS600 ORF2 n=1 Tax=Shigella flexneri 2002017 RepID=D2AEE4_SHIF2 Length = 103 Score = 58.9 bits (141), Expect = 2e-07, Method: Composition-based stats. 
Identities = 9/49 (18%), Positives = 22/49 (44%), Gaps = 5/49 (10%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYD 280 S+ A+ +F++G+ N + L+ P+ ++ Y +I + Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYPVSIGIN 52 >UniRef50_Q0T0H3 Outer membrane fimbrial user protein n=206 Tax=Enterobacteriaceae RepID=Q0T0H3_SHIF8 Length = 949 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 45 >UniRef50_B2TV45 Transposase n=5 Tax=Shigella RepID=B2TV45_SHIB3 Length = 68 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 58 >UniRef50_B2TUB7 Glycosy hydrolase, family 38 n=69 Tax=Enterobacteriaceae RepID=B2TUB7_SHIB3 Length = 416 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 58 >UniRef50_Q70W42 Transposase n=3 Tax=Enterobacteriaceae RepID=Q70W42_YEREN Length = 236 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 58 >UniRef50_D2AIK9 FAD linked oxidase domain protein n=2 Tax=Shigella flexneri RepID=D2AIK9_SHIF2 Length = 342 Score = 58.5 bits (140), Expect = 3e-07, Method: Composition-based stats. 
Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 58 >UniRef50_B2TYZ1 Conserved domain protein n=2 Tax=Shigella RepID=B2TYZ1_SHIB3 Length = 226 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 7/45 (15%), Positives = 20/45 (44%), Gaps = 5/45 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRA 276 ++ A+ +F++G+ N + L+ P+ ++ Y + Sbjct: 17 NAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYQNS 61 >UniRef50_Q54ZT8 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54ZT8_DICDI Length = 833 Score = 42.0 bits (97), Expect = 0.038, Method: Composition-based stats. Identities = 32/103 (31%), Positives = 58/103 (56%), Gaps = 7/103 (6%) Query: 16 ESINEN---NNDEVNGLVQEFK-NLFNGKEGI--STCIKHLLELIKNAIRVNDDPYRFNI 69 E+IN + +N+ N L++ +K +L + E + CI + +EL+K I +D Y+F + Sbjct: 129 ETINGDTIISNNFYNNLIKNYKRHLIHKSETLCEELCISNNIELLKIIINDSDFNYKF-L 187 Query: 70 NNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTI 112 NNSS+ + I + I + N E +E+ NY +KEL++ + Sbjct: 188 NNSSLIDLSIKNGTNLKILKFLFNNENLEISKNYSEKELLKNL 230 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76505 Uncharacterized protein yfdF n=32 Tax=Enterobact... 675 0.0 UniRef50_B2TWW0 Transposase n=66 Tax=Enterobacteriaceae RepID=B2... 197 6e-49 UniRef50_D2AB32 IS600 ORF2 n=9 Tax=Enterobacteriaceae RepID=D2AB... 90 1e-16 UniRef50_D2AA76 Putative alpha-mannosidase n=1 Tax=Shigella flex... 89 2e-16 UniRef50_D2A8R2 Putative alpha-mannosidase n=1 Tax=Shigella flex... 80 1e-13 UniRef50_Q83JE7 Putative uncharacterized protein n=1 Tax=Shigell... 76 3e-12 UniRef50_D2AEE4 IS600 ORF2 n=1 Tax=Shigella flexneri 2002017 Rep... 
68 6e-10 UniRef50_B2TVU4 Galactose binding protein n=5 Tax=Shigella RepID... 67 1e-09 UniRef50_B2TYZ1 Conserved domain protein n=2 Tax=Shigella RepID=... 65 4e-09 UniRef50_D2ADC7 Putative integrase encoded by prophage CP-933K n... 65 5e-09 UniRef50_B2TV45 Transposase n=5 Tax=Shigella RepID=B2TV45_SHIB3 64 7e-09 UniRef50_Q70W42 Transposase n=3 Tax=Enterobacteriaceae RepID=Q70... 64 7e-09 UniRef50_Q0T0H3 Outer membrane fimbrial user protein n=206 Tax=E... 64 8e-09 UniRef50_B2TUB7 Glycosy hydrolase, family 38 n=69 Tax=Enterobact... 64 8e-09 UniRef50_D2AIK9 FAD linked oxidase domain protein n=2 Tax=Shigel... 64 8e-09 Sequences not found previously or not previously below threshold: UniRef50_Q54ZT8 Putative uncharacterized protein n=1 Tax=Dictyos... 43 0.014 CONVERGED! >UniRef50_P76505 Uncharacterized protein yfdF n=32 Tax=Enterobacteriaceae RepID=YFDF_ECOLI Length = 352 Score = 675 bits (1742), Expect = 0.0, Method: Composition-based stats. Identities = 352/352 (100%), Positives = 352/352 (100%) Query: 1 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV 60 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV Sbjct: 1 MLPSISINNTSAAYPESINENNNDEVNGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRV 60 Query: 61 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK 120 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK Sbjct: 61 NDDPYRFNINNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEK 120 Query: 121 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK 180 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK Sbjct: 121 THDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEFDESGKSEPTDLFTWYGKDKK 180 Query: 181 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA 240 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA Sbjct: 181 GDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA 240 Query: 241 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL 300 IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL Sbjct: 241 
IMAESFATGSDHQVVNELNGERLREPNDVFKRYGRAIRYDFQVDDAKYKCDHLKEIVSTL 300 Query: 301 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI 352 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI Sbjct: 301 VGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDDI 352 >UniRef50_B2TWW0 Transposase n=66 Tax=Enterobacteriaceae RepID=B2TWW0_SHIB3 Length = 137 Score = 197 bits (500), Expect = 6e-49, Method: Composition-based stats. Identities = 63/121 (52%), Positives = 85/121 (70%), Gaps = 5/121 (4%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVDDAKYKCD 291 S+ A+ +F++G+ N + L+ P+ ++ Y RAIRY+FQVDDAK++ D Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYRRAIRYNFQVDDAKFRRD 76 Query: 292 HLKEIVSTLVGNKINVGHSQKIYKHFKDLEGKIEERLQNRQAEYQNEINQPSAPGVNFDD 351 ++KEI+STL NK++V H + YK FKDLE K+E+RLQNRQ EYQNEINQ SAP VNFDD Sbjct: 77 NVKEIISTLFANKVDVDHPENKYKDFKDLEDKVEKRLQNRQTEYQNEINQLSAPDVNFDD 136 Query: 352 I 352 I Sbjct: 137 I 137 >UniRef50_D2AB32 IS600 ORF2 n=9 Tax=Enterobacteriaceae RepID=D2AB32_SHIF2 Length = 317 Score = 89.8 bits (221), Expect = 1e-16, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 32/82 (39%), Gaps = 15/82 (18%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVDDAKYKCD 291 S+ A+ +F++G+ N + L+ P+ ++ Y + A Y+C Sbjct: 11 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY------PWSETKAAYRCF 64 Query: 292 HLKEIVSTLVGNKINVGHSQKI 313 ++ NKI H + I Sbjct: 65 SNDKVS----ANKIMTPHKENI 82 >UniRef50_D2AA76 Putative alpha-mannosidase n=1 Tax=Shigella flexneri 2002017 RepID=D2AA76_SHIF2 Length = 134 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. 
Identities = 15/81 (18%), Positives = 32/81 (39%), Gaps = 12/81 (14%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY-------GRAIRYDFQVD 284 S+ A+ +F++G+ N + L+ P+ ++ Y GR R D Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKLVSWQHYLNLHGFVGRIRRVSVASD 63 Query: 285 DAKYKCDHLKEIVSTLVGNKI 305 K + ++ +S L ++ Sbjct: 64 INKAHFTNYRQWISPLYFRRL 84 >UniRef50_D2A8R2 Putative alpha-mannosidase n=1 Tax=Shigella flexneri 2002017 RepID=D2A8R2_SHIF2 Length = 147 Score = 79.8 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 13/66 (19%), Positives = 27/66 (40%), Gaps = 10/66 (15%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVDDAKYKCD 291 S+ A+ +F++G+ N + L+ P+ ++ Y ++ D K Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYR-----GYKPPDRKEYRH 58 Query: 292 HLKEIV 297 H+ E V Sbjct: 59 HVPERV 64 >UniRef50_Q83JE7 Putative uncharacterized protein n=1 Tax=Shigella flexneri RepID=Q83JE7_SHIFL Length = 142 Score = 75.5 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 11/53 (20%), Positives = 25/53 (47%), Gaps = 5/53 (9%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYDFQVD 284 S+ A+ +F++G+ N + L+ P+ ++ Y R I Y++ + Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSISWQHYLRLIPYNWTIT 69 >UniRef50_D2AEE4 IS600 ORF2 n=1 Tax=Shigella flexneri 2002017 RepID=D2AEE4_SHIF2 Length = 103 Score = 67.8 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 9/49 (18%), Positives = 22/49 (44%), Gaps = 5/49 (10%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIRYD 280 S+ A+ +F++G+ N + L+ P+ ++ Y +I + Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYPVSIGIN 52 >UniRef50_B2TVU4 Galactose binding protein n=5 Tax=Shigella RepID=B2TVU4_SHIB3 Length = 137 Score = 67.0 bits (162), Expect = 1e-09, Method: Composition-based stats. 
Identities = 11/53 (20%), Positives = 23/53 (43%), Gaps = 7/53 (13%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY--GRAIRYDFQ 282 S+ A+ +F++G+ N + L+ P+ ++ Y AI + F Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYHANEAINHGFA 69 >UniRef50_B2TYZ1 Conserved domain protein n=2 Tax=Shigella RepID=B2TYZ1_SHIB3 Length = 226 Score = 64.7 bits (156), Expect = 4e-09, Method: Composition-based stats. Identities = 7/47 (14%), Positives = 21/47 (44%), Gaps = 5/47 (10%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYGRAIR 278 ++ A+ +F++G+ N + L+ P+ ++ Y + + Sbjct: 17 NAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYQNSRK 63 >UniRef50_D2ADC7 Putative integrase encoded by prophage CP-933K n=2 Tax=Enterobacteriaceae RepID=D2ADC7_SHIF2 Length = 178 Score = 64.7 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 19/43 (44%), Gaps = 5/43 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYG 274 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYR 46 >UniRef50_B2TV45 Transposase n=5 Tax=Shigella RepID=B2TV45_SHIB3 Length = 68 Score = 64.3 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 19/43 (44%), Gaps = 5/43 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYG 274 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYQ 59 >UniRef50_Q70W42 Transposase n=3 Tax=Enterobacteriaceae RepID=Q70W42_YEREN Length = 236 Score = 64.3 bits (155), Expect = 7e-09, Method: Composition-based stats. Identities = 8/43 (18%), Positives = 19/43 (44%), Gaps = 5/43 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRYG 274 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHYQ 59 >UniRef50_Q0T0H3 Outer membrane fimbrial user protein n=206 Tax=Enterobacteriaceae RepID=Q0T0H3_SHIF8 Length = 949 Score = 64.0 bits (154), Expect = 8e-09, Method: Composition-based stats. 
Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 4 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 45 >UniRef50_B2TUB7 Glycosy hydrolase, family 38 n=69 Tax=Enterobacteriaceae RepID=B2TUB7_SHIB3 Length = 416 Score = 64.0 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 58 >UniRef50_D2AIK9 FAD linked oxidase domain protein n=2 Tax=Shigella flexneri RepID=D2AIK9_SHIF2 Length = 342 Score = 64.0 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 8/42 (19%), Positives = 19/42 (45%), Gaps = 5/42 (11%) Query: 237 SSKAIMAESFATGSDHQV-VNELNGERLREPNDV----FKRY 273 S+ A+ +F++G+ N + L+ P+ ++ Y Sbjct: 17 SAFAVTHHAFSSGAGRTSDGNGSHASTLKSPSYTKSVSWQHY 58 >UniRef50_Q54ZT8 Putative uncharacterized protein n=1 Tax=Dictyostelium discoideum RepID=Q54ZT8_DICDI Length = 833 Score = 43.2 bits (100), Expect = 0.014, Method: Composition-based stats. 
Identities = 32/103 (31%), Positives = 58/103 (56%), Gaps = 7/103 (6%) Query: 16 ESINEN---NNDEVNGLVQEFK-NLFNGKEGI--STCIKHLLELIKNAIRVNDDPYRFNI 69 E+IN + +N+ N L++ +K +L + E + CI + +EL+K I +D Y+F + Sbjct: 129 ETINGDTIISNNFYNNLIKNYKRHLIHKSETLCEELCISNNIELLKIIINDSDFNYKF-L 187 Query: 70 NNSSVTYIDIDSNDTDHITIGIDNQEPIELPANYKDKELVRTI 112 NNSS+ + I + I + N E +E+ NY +KEL++ + Sbjct: 188 NNSSLIDLSIKNGTNLKILKFLFNNENLEISKNYSEKELLKNL 230

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.310    0.131    0.363

Lambda     K      H
   0.267   0.0409    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,193,892,148
Number of Sequences: 3077464
Number of extensions: 95062987
Number of successful extensions: 252918
Number of sequences better than 1.0e-01: 17
Number of HSP's better than 0.1 without gapping: 6
Number of HSP's successfully gapped in prelim test: 44
Number of HSP's that attempted gapping in prelim test: 252837
Number of HSP's gapped (non-prelim): 95
length of query: 352
length of database: 1,040,396,356
effective HSP length: 130
effective length of query: 222
effective length of database: 640,326,036
effective search space: 142152379992
effective search space used: 142152379992
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 93 (40.4 bits)
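
The effective lengths and search space reported in the statistics trailer can be re-derived from the other trailer values. The following is a minimal sketch, assuming the usual BLAST length correction in which the effective HSP length is subtracted once from the query and once per database sequence. The function and variable names are illustrative and do not come from BLAST itself.

```python
# Values copied from the statistics trailer of this report.
QUERY_LENGTH = 352
DB_LETTERS = 1_040_396_356
DB_SEQUENCES = 3_077_464
EFFECTIVE_HSP_LENGTH = 130


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the length correction is applied once to the query and
    once to every sequence in the database.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, search_space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query)     # effective length of query
print(eff_db)        # effective length of database
print(search_space)  # effective search space
```

Under that assumption the three printed values agree with the "effective length of query", "effective length of database", and "effective search space" lines in the trailer.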