BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (67 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q2NWN7 Protein aaeX n=75 Tax=Enterobacteriaceae RepID=A... 104 8e-22 UniRef50_UPI0001A4504C hypothetical protein PcarcW_04569 n=1 Tax... 76 3e-13 UniRef50_Q2KZU0 Putative component of multidrug efflux system n=... 47 2e-04 UniRef50_D1T3C0 Putative uncharacterized protein n=1 Tax=Acidovo... 46 3e-04 UniRef50_A6V0L3 Conserved domain protein n=11 Tax=Proteobacteria... 45 7e-04 UniRef50_B6JAE0 Conserved domain protein n=2 Tax=Bacteria RepID=... 43 0.004 UniRef50_Q2SVA7 Membrane protein, putative n=49 Tax=Burkholderia... 38 0.091 UniRef50_C5CYW8 Putative uncharacterized protein n=1 Tax=Variovo... 38 0.096 >UniRef50_Q2NWN7 Protein aaeX n=75 Tax=Enterobacteriaceae RepID=AAEX_SODGM Length = 113 Score = 104 bits (260), Expect = 8e-22, Method: Compositional matrix adjust. Identities = 48/63 (76%), Positives = 59/63 (93%) Query: 1 MSLFPVIVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFY 60 MSL PV+V+FGLSFPP+ FE++LSLA+F+ +RR L+P+GIYDFVWHPALFNTALYCC+FY Sbjct: 1 MSLLPVMVIFGLSFPPVLFEMILSLALFFALRRFLLPSGIYDFVWHPALFNTALYCCVFY 60 Query: 61 LIS 63 LIS Sbjct: 61 LIS 63 >UniRef50_UPI0001A4504C hypothetical protein PcarcW_04569 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A4504C Length = 98 Score = 76.3 bits (186), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 35/49 (71%), Positives = 44/49 (89%) Query: 16 PIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISR 64 P+FF LL+SL +F++V R+L PTGIYDFVWHPALFN+AL+CCLFYL+ R Sbjct: 47 PVFFVLLVSLTLFFVVNRLLQPTGIYDFVWHPALFNSALFCCLFYLLFR 95 >UniRef50_Q2KZU0 Putative component of multidrug efflux system n=9 Tax=Proteobacteria RepID=Q2KZU0_BORA1 Length = 70 Score = 47.4 bits (111), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 23/51 (45%), Positives = 33/51 (64%) Query: 10 FGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFY 60 +G+ FP + L +L + W++RR+L G+Y FVWHPALF+ AL L Y Sbjct: 8 YGVYFPWLLVLALPALGLSWMLRRLLARAGLYRFVWHPALFDLALTVVLLY 58 >UniRef50_D1T3C0 Putative uncharacterized protein n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3C0_9BURK Length = 72 Score = 46.2 bits (108), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 21/56 (37%), Positives = 37/56 (66%) Query: 10 FGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISRL 65 +G+ P + +++ A+ W RR+L G+Y +VWH ALF+TA+Y + +L++RL Sbjct: 8 YGVYVPWLLLLAIVAWALCWGARRLLGSLGVYRWVWHAALFDTAIYVLILFLLTRL 63 >UniRef50_A6V0L3 Conserved domain protein n=11 Tax=Proteobacteria RepID=A6V0L3_PSEA7 Length = 66 Score = 45.1 bits (105), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 19/57 (33%), Positives = 33/57 (57%) Query: 7 IVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLIS 63 I + G+ P + +++ A+ W + R+ G+Y F WHPALF +L+ CL+ +S Sbjct: 5 IAIHGVYMPTLTLLFVIAAALTWGLDRIFASVGLYRFTWHPALFRVSLFACLYGGLS 61 >UniRef50_B6JAE0 Conserved domain protein n=2 Tax=Bacteria RepID=B6JAE0_OLICO Length = 65 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 22/56 (39%), Positives = 33/56 (58%) Query: 7 IVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLI 62 I + G+ PP+ ++ I+ ++R L T +Y F+WHP LFNTALY L +I Sbjct: 5 INIAGVLLPPLMGYAFAAMIIWLVLRFTLARTNLYRFIWHPPLFNTALYVILLSII 60 >UniRef50_Q2SVA7 Membrane protein, putative n=49 Tax=Burkholderia RepID=Q2SVA7_BURTA Length = 67 Score = 38.1 bits (87), Expect = 0.091, Method: Compositional matrix adjust. Identities = 15/30 (50%), Positives = 21/30 (70%) Query: 29 WLVRRVLVPTGIYDFVWHPALFNTALYCCL 58 WL+ R+L TG+Y VWHP+LF +L C+ Sbjct: 28 WLIDRLLAYTGLYRVVWHPSLFRASLLVCI 57 >UniRef50_C5CYW8 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CYW8_VARPS Length = 70 Score = 38.1 bits (87), Expect = 0.096, Method: Compositional matrix adjust. Identities = 18/46 (39%), Positives = 25/46 (54%) Query: 10 FGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALY 55 G+ P + +L W RR+L G Y ++WHPALF+ ALY Sbjct: 7 LGMQLPWLLLLAAGALPCSWATRRLLAIVGAYRWIWHPALFDLALY 52 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q2NWN7 Protein aaeX n=75 Tax=Enterobacteriaceae RepID=A... 78 6e-14 UniRef50_UPI0001A4504C hypothetical protein PcarcW_04569 n=1 Tax... 61 8e-09 UniRef50_A6V0L3 Conserved domain protein n=11 Tax=Proteobacteria... 57 1e-07 UniRef50_D1T3C0 Putative uncharacterized protein n=1 Tax=Acidovo... 54 1e-06 UniRef50_Q2KZU0 Putative component of multidrug efflux system n=... 47 2e-04 Sequences not found previously or not previously below threshold: UniRef50_Q88NE7 Conserved domain protein n=14 Tax=Proteobacteria... 43 0.002 UniRef50_C5CYW8 Putative uncharacterized protein n=1 Tax=Variovo... 41 0.015 UniRef50_B0RMD9 Putative membrane protein n=11 Tax=Proteobacteri... 39 0.030 UniRef50_Q2SVA7 Membrane protein, putative n=49 Tax=Burkholderia... 39 0.031 CONVERGED! >UniRef50_Q2NWN7 Protein aaeX n=75 Tax=Enterobacteriaceae RepID=AAEX_SODGM Length = 113 Score = 78.4 bits (192), Expect = 6e-14, Method: Composition-based stats. Identities = 48/63 (76%), Positives = 59/63 (93%) Query: 1 MSLFPVIVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFY 60 MSL PV+V+FGLSFPP+ FE++LSLA+F+ +RR L+P+GIYDFVWHPALFNTALYCC+FY Sbjct: 1 MSLLPVMVIFGLSFPPVLFEMILSLALFFALRRFLLPSGIYDFVWHPALFNTALYCCVFY 60 Query: 61 LIS 63 LIS Sbjct: 61 LIS 63 >UniRef50_UPI0001A4504C hypothetical protein PcarcW_04569 n=1 Tax=Pectobacterium carotovorum subsp. carotovorum WPP14 RepID=UPI0001A4504C Length = 98 Score = 61.4 bits (148), Expect = 8e-09, Method: Composition-based stats. Identities = 35/49 (71%), Positives = 44/49 (89%) Query: 16 PIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISR 64 P+FF LL+SL +F++V R+L PTGIYDFVWHPALFN+AL+CCLFYL+ R Sbjct: 47 PVFFVLLVSLTLFFVVNRLLQPTGIYDFVWHPALFNSALFCCLFYLLFR 95 >UniRef50_A6V0L3 Conserved domain protein n=11 Tax=Proteobacteria RepID=A6V0L3_PSEA7 Length = 66 Score = 57.2 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 33/57 (57%) Query: 7 IVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLIS 63 I + G+ P + +++ A+ W + R+ G+Y F WHPALF +L+ CL+ +S Sbjct: 5 IAIHGVYMPTLTLLFVIAAALTWGLDRIFASVGLYRFTWHPALFRVSLFACLYGGLS 61 >UniRef50_D1T3C0 Putative uncharacterized protein n=1 Tax=Acidovorax avenae subsp. avenae ATCC 19860 RepID=D1T3C0_9BURK Length = 72 Score = 54.1 bits (129), Expect = 1e-06, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 37/56 (66%) Query: 10 FGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISRL 65 +G+ P + +++ A+ W RR+L G+Y +VWH ALF+TA+Y + +L++RL Sbjct: 8 YGVYVPWLLLLAIVAWALCWGARRLLGSLGVYRWVWHAALFDTAIYVLILFLLTRL 63 >UniRef50_Q2KZU0 Putative component of multidrug efflux system n=9 Tax=Proteobacteria RepID=Q2KZU0_BORA1 Length = 70 Score = 46.8 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 23/54 (42%), Positives = 34/54 (62%) Query: 10 FGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLIS 63 +G+ FP + L +L + W++RR+L G+Y FVWHPALF+ AL L Y + Sbjct: 8 YGVYFPWLLVLALPALGLSWMLRRLLARAGLYRFVWHPALFDLALTVVLLYGVF 61 >UniRef50_Q88NE7 Conserved domain protein n=14 Tax=Proteobacteria RepID=Q88NE7_PSEPK Length = 66 Score = 43.3 bits (101), Expect = 0.002, Method: Composition-based stats. Identities = 18/57 (31%), Positives = 28/57 (49%) Query: 7 IVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLIS 63 I G+ P + L +L + W + R + Y F WHPAL +L+ CLF ++ Sbjct: 5 IAFHGVYMPTMTLMFLFALGLAWGLDRFIASHDGYRFFWHPALLRLSLFVCLFGAMA 61 >UniRef50_C5CYW8 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CYW8_VARPS Length = 70 Score = 40.6 bits (94), Expect = 0.015, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 29/55 (52%) Query: 11 GLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISRL 65 G+ P + +L W RR+L G Y ++WHPALF+ ALY + ++ L Sbjct: 8 GMQLPWLLLLAAGALPCSWATRRLLAIVGAYRWIWHPALFDLALYALVLCTLASL 62 >UniRef50_B0RMD9 Putative membrane protein n=11 Tax=Proteobacteria RepID=B0RMD9_XANCB Length = 89 Score = 39.5 bits (91), Expect = 0.030, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 29/57 (50%) Query: 9 VFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISRL 65 ++G+ P + + L+ + V +L G Y +WHPALFN ALY L + L Sbjct: 27 LYGVFVPTLLALMTLAYLLNSAVGALLTRAGAYRHIWHPALFNLALYIALLGGLFSL 83 >UniRef50_Q2SVA7 Membrane protein, putative n=49 Tax=Burkholderia RepID=Q2SVA7_BURTA Length = 67 Score = 39.5 bits (91), Expect = 0.031, Method: Composition-based stats. Identities = 17/57 (29%), Positives = 29/57 (50%) Query: 7 IVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLIS 63 + + P + ++ WL+ R+L TG+Y VWHP+LF +L C+ +S Sbjct: 6 LAILDAYVPAVVLMFVIGAFATWLIDRLLAYTGLYRVVWHPSLFRASLLVCICGGLS 62 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.331 0.154 0.503 Lambda K H 0.267 0.0472 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 308,980,104 Number of Sequences: 3077464 Number of extensions: 11503344 Number of successful extensions: 74904 Number of sequences better than 1.0e-01: 54 Number of HSP's better than 0.1 without gapping: 77 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 74823 Number of HSP's gapped (non-prelim): 83 length of query: 67 length of database: 1,040,396,356 effective HSP length: 39 effective length of query: 28 effective length of database: 920,375,260 effective search space: 25770507280 effective search space used: 25770507280 T: 11 A: 40 X1: 16 ( 7.6 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.3 bits) S2: 87 (37.9 bits)