BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (132 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_C8UB47 Conserved predicted protein n=2 Tax=Escherichia ... 172 2e-42 UniRef50_B1LRP3 Putative uncharacterized protein n=1 Tax=Escheri... 137 7e-32 UniRef50_Q1R1V4 Uncharacterized protein yubK n=4 Tax=Escherichia... 115 6e-25 UniRef50_Q27TK9 Putative uncharacterized protein n=3 Tax=Enterob... 93 3e-18 UniRef50_Q9Z4D5 YffB protein n=7 Tax=root RepID=Q9Z4D5_SALTY 43 0.003 UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escheri... 43 0.003 >UniRef50_C8UB47 Conserved predicted protein n=2 Tax=Escherichia coli RepID=C8UB47_ECO10 Length = 138 Score = 172 bits (437), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 105/132 (79%), Positives = 108/132 (81%), Gaps = 6/132 (4%) Query: 1 MCPECFFLMLFFCGYRACYCSSSFSSSSSSSSSSSFRSSPAYGFSGRPPGGAGCRERSQR 60 MCPECFFLMLFF GYRACYCSSSFSSSSS SS P YGFSGRPPGGAGCRERSQR Sbjct: 13 MCPECFFLMLFFYGYRACYCSSSFSSSSSFRSS------PVYGFSGRPPGGAGCRERSQR 66 Query: 61 SCLRPGGLPSLTRNPGLQRPFRSRRLCRAVACAPGIPAKGRRDVRGNAVSQTALHVVAAG 120 SCLRPGGLPSLTRNPGLQRPFRS RLCRAVACAPGIPAKGRRDVRGNAVS+T + Sbjct: 67 SCLRPGGLPSLTRNPGLQRPFRSHRLCRAVACAPGIPAKGRRDVRGNAVSRTRSPSLRRV 126 Query: 121 PCSLPAGCHTPV 132 P CH+PV Sbjct: 127 PARCCVRCHSPV 138 >UniRef50_B1LRP3 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRP3_ECOSM Length = 96 Score = 137 bits (346), Expect = 7e-32, Method: Compositional matrix adjust. Identities = 84/95 (88%), Positives = 84/95 (88%), Gaps = 6/95 (6%) Query: 6 FFLMLFFCGYRACYCSSSFSSSSSSSSSSSFRSSPAYGFSGRPPGGAGCRERSQRSCLRP 65 FFLMLFFCGYR CYCSSSFSSSSS SS P YGFSGRPPGGAGCRERSQRSCLRP Sbjct: 5 FFLMLFFCGYRVCYCSSSFSSSSSFRSS------PVYGFSGRPPGGAGCRERSQRSCLRP 58 Query: 66 GGLPSLTRNPGLQRPFRSRRLCRAVACAPGIPAKG 100 GGLPSLTRN GLQRPFRSRRLCRAVACAPGIPAKG Sbjct: 59 GGLPSLTRNHGLQRPFRSRRLCRAVACAPGIPAKG 93 >UniRef50_Q1R1V4 Uncharacterized protein yubK n=4 Tax=Escherichia coli RepID=YUBK_ECOUT Length = 119 Score = 115 bits (287), Expect = 6e-25, Method: Compositional matrix adjust. Identities = 84/144 (58%), Positives = 87/144 (60%), Gaps = 37/144 (25%) Query: 1 MCPECFFLMLFFCGYRACYCSSSFSSSSSSSSSSSFRSSPAYGFSGRPPGGAGCRERSQR 60 MCPECFFLML FCGY SSSSSSFRSSP YGF GRPPGGAGCRERSQR Sbjct: 1 MCPECFFLMLCFCGY-------------CSSSSSSFRSSPVYGFPGRPPGGAGCRERSQR 47 Query: 61 SCLRPGGLPSLTRNPGLQRPFRSRRLCRAVACAPGIPAKGRRDVRGNAVSQT-------- 112 SCLRPGGLPSLT+NP R A P P RR +R +T Sbjct: 48 SCLRPGGLPSLTQNP------------RLAATVPVAPPLSRRGLRSWHPGKTTPGCKAQR 95 Query: 113 ----ALHVVAAGPCSLPAGCHTPV 132 AL VVAAGPCSLPAGCHTPV Sbjct: 96 RFAHALSVVAAGPCSLPAGCHTPV 119 >UniRef50_Q27TK9 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=Q27TK9_ECOK1 Length = 82 Score = 92.8 bits (229), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 47/52 (90%), Positives = 48/52 (92%) Query: 73 RNPGLQRPFRSRRLCRAVACAPGIPAKGRRDVRGNAVSQTALHVVAAGPCSL 124 + PGLQRPFRSRRLCRAVACAPGIPAK RRDVR NAVSQTALHVVAAGPC L Sbjct: 2 KTPGLQRPFRSRRLCRAVACAPGIPAKRRRDVRRNAVSQTALHVVAAGPCPL 53 >UniRef50_Q9Z4D5 YffB protein n=7 Tax=root RepID=Q9Z4D5_SALTY Length = 115 Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/27 (74%), Positives = 21/27 (77%) Query: 71 LTRNPGLQRPFRSRRLCRAVACAPGIP 97 + PGLQRPFRSRRLCRAV CAP P Sbjct: 86 MREKPGLQRPFRSRRLCRAVTCAPCTP 112 >UniRef50_B9K6E8 Putative uncharacterized protein n=3 Tax=Escherichia coli RepID=B9K6E8_ECOLX Length = 184 Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust. Identities = 20/22 (90%), Positives = 20/22 (90%) Query: 103 DVRGNAVSQTALHVVAAGPCSL 124 D RGNAVSQTALHVVAAGPC L Sbjct: 136 DARGNAVSQTALHVVAAGPCPL 157 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C8UB47 Conserved predicted protein n=2 Tax=Escherichia ... 151 6e-36 UniRef50_Q1R1V4 Uncharacterized protein yubK n=4 Tax=Escherichia... 114 8e-25 UniRef50_B1LRP3 Putative uncharacterized protein n=1 Tax=Escheri... 110 1e-23 UniRef50_Q27TK9 Putative uncharacterized protein n=3 Tax=Enterob... 68 8e-11 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_C8UB47 Conserved predicted protein n=2 Tax=Escherichia coli RepID=C8UB47_ECO10 Length = 138 Score = 151 bits (381), Expect = 6e-36, Method: Composition-based stats. Identities = 107/132 (81%), Positives = 110/132 (83%), Gaps = 6/132 (4%) Query: 1 MCPECFFLMLFFCGYRACYCSSSFSSSSSSSSSSSFRSSPAYGFSGRPPGGAGCRERSQR 60 MCPECFFLMLFF GYRACYCSSSFSSS SSFRSSP YGFSGRPPGGAGCRERSQR Sbjct: 13 MCPECFFLMLFFYGYRACYCSSSFSSS------SSFRSSPVYGFSGRPPGGAGCRERSQR 66 Query: 61 SCLRPGGLPSLTRNPGLQRPFRSRRLCRAVACAPGIPAKGRRDVRGNAVSQTALHVVAAG 120 SCLRPGGLPSLTRNPGLQRPFRS RLCRAVACAPGIPAKGRRDVRGNAVS+T + Sbjct: 67 SCLRPGGLPSLTRNPGLQRPFRSHRLCRAVACAPGIPAKGRRDVRGNAVSRTRSPSLRRV 126 Query: 121 PCSLPAGCHTPV 132 P CH+PV Sbjct: 127 PARCCVRCHSPV 138 >UniRef50_Q1R1V4 Uncharacterized protein yubK n=4 Tax=Escherichia coli RepID=YUBK_ECOUT Length = 119 Score = 114 bits (285), Expect = 8e-25, Method: Composition-based stats. Identities = 84/144 (58%), Positives = 87/144 (60%), Gaps = 37/144 (25%) Query: 1 MCPECFFLMLFFCGYRACYCSSSFSSSSSSSSSSSFRSSPAYGFSGRPPGGAGCRERSQR 60 MCPECFFLML FCGY SSSSSSFRSSP YGF GRPPGGAGCRERSQR Sbjct: 1 MCPECFFLMLCFCGY-------------CSSSSSSFRSSPVYGFPGRPPGGAGCRERSQR 47 Query: 61 SCLRPGGLPSLTRNPGLQRPFRSRRLCRAVACAPGIPAKGRRDVRGNAVSQT-------- 112 SCLRPGGLPSLT+NP R A P P RR +R +T Sbjct: 48 SCLRPGGLPSLTQNP------------RLAATVPVAPPLSRRGLRSWHPGKTTPGCKAQR 95 Query: 113 ----ALHVVAAGPCSLPAGCHTPV 132 AL VVAAGPCSLPAGCHTPV Sbjct: 96 RFAHALSVVAAGPCSLPAGCHTPV 119 >UniRef50_B1LRP3 Putative uncharacterized protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRP3_ECOSM Length = 96 Score = 110 bits (274), Expect = 1e-23, Method: Composition-based stats. Identities = 86/95 (90%), Positives = 86/95 (90%), Gaps = 6/95 (6%) Query: 6 FFLMLFFCGYRACYCSSSFSSSSSSSSSSSFRSSPAYGFSGRPPGGAGCRERSQRSCLRP 65 FFLMLFFCGYR CYCSSSFSSS SSFRSSP YGFSGRPPGGAGCRERSQRSCLRP Sbjct: 5 FFLMLFFCGYRVCYCSSSFSSS------SSFRSSPVYGFSGRPPGGAGCRERSQRSCLRP 58 Query: 66 GGLPSLTRNPGLQRPFRSRRLCRAVACAPGIPAKG 100 GGLPSLTRN GLQRPFRSRRLCRAVACAPGIPAKG Sbjct: 59 GGLPSLTRNHGLQRPFRSRRLCRAVACAPGIPAKG 93 >UniRef50_Q27TK9 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=Q27TK9_ECOK1 Length = 82 Score = 68.3 bits (165), Expect = 8e-11, Method: Composition-based stats. Identities = 47/52 (90%), Positives = 48/52 (92%) Query: 73 RNPGLQRPFRSRRLCRAVACAPGIPAKGRRDVRGNAVSQTALHVVAAGPCSL 124 + PGLQRPFRSRRLCRAVACAPGIPAK RRDVR NAVSQTALHVVAAGPC L Sbjct: 2 KTPGLQRPFRSRRLCRAVACAPGIPAKRRRDVRRNAVSQTALHVVAAGPCPL 53 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.305 0.121 0.386 Lambda K H 0.267 0.0360 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 553,214,152 Number of Sequences: 3077464 Number of extensions: 22446088 Number of successful extensions: 267283 Number of sequences better than 1.0e-01: 50 Number of HSP's better than 0.1 without gapping: 17 Number of HSP's successfully gapped in prelim test: 45 Number of HSP's that attempted gapping in prelim test: 266938 Number of HSP's gapped (non-prelim): 301 length of query: 132 length of database: 1,040,396,356 effective HSP length: 97 effective length of query: 35 effective length of database: 741,882,348 effective search space: 25965882180 effective search space used: 25965882180 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.7 bits) S2: 86 (37.9 bits)