BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (57 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B4T6X3 Conserved domain protein n=67 Tax=Gammaproteobac... 95 5e-19 UniRef50_B5YXI6 Conserved domain protein n=68 Tax=Bacteria RepID... 90 2e-17 UniRef50_P21361 Uncharacterized protein yciG n=68 Tax=Proteobact... 80 2e-14 UniRef50_C9Y566 Uncharacterized protein yciG n=16 Tax=Gammaprote... 77 2e-13 UniRef50_Q88HT4 Putative uncharacterized protein n=1 Tax=Pseudom... 71 1e-11 UniRef50_C1DH84 Putative uncharacterized protein n=2 Tax=Pseudom... 60 2e-08 UniRef50_Q9KX95 Putative uncharacterized protein ymdF n=1 Tax=En... 57 2e-07 UniRef50_Q5FP81 Putative uncharacterized protein n=1 Tax=Glucono... 56 3e-07 UniRef50_C6B8D0 Putative uncharacterized protein n=3 Tax=Rhizobi... 50 2e-05 UniRef50_A9HNT7 Putative uncharacterized protein n=2 Tax=Glucona... 50 2e-05 UniRef50_Q1QTW4 Putative uncharacterized protein n=1 Tax=Chromoh... 48 1e-04 UniRef50_A4XUQ0 Putative uncharacterized protein n=1 Tax=Pseudom... 47 2e-04 UniRef50_Q88FW4 Conserved domain protein n=3 Tax=Pseudomonas put... 43 0.002 UniRef50_B1J8T3 Putative uncharacterized protein n=4 Tax=Pseudom... 40 0.016 UniRef50_D2LEZ1 Putative uncharacterized protein n=1 Tax=Rhodomi... 39 0.045 >UniRef50_B4T6X3 Conserved domain protein n=67 Tax=Gammaproteobacteria RepID=B4T6X3_SALNS Length = 60 Score = 95.1 bits (235), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 51/59 (86%), Positives = 56/59 (94%), Gaps = 2/59 (3%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG--KSDN 57 MA HRGGSGNFAE+RE+ASEAG+KGGQHSGGNFKNDPQRASEAGKKGG++SHG KSDN Sbjct: 1 MAEHRGGSGNFAENREKASEAGRKGGQHSGGNFKNDPQRASEAGKKGGQNSHGGRKSDN 59 >UniRef50_B5YXI6 Conserved domain protein n=68 Tax=Bacteria RepID=B5YXI6_ECO5E Length = 60 Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 44/53 (83%), Positives = 51/53 (96%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 M HRGG+GNFAEDR++AS+AG+KGGQHSGGNFKNDPQRASEAGKKGG++SHG Sbjct: 1 MTEHRGGAGNFAEDRQKASDAGRKGGQHSGGNFKNDPQRASEAGKKGGQNSHG 53 >UniRef50_P21361 Uncharacterized protein yciG n=68 Tax=Proteobacteria RepID=YCIG_ECOLI Length = 59 Score = 80.1 bits (196), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 39/43 (90%), Positives = 42/43 (97%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEA 43 MA HRGGSGNFAEDRE+AS+AG+KGGQHSGGNFKNDPQRASEA Sbjct: 1 MAEHRGGSGNFAEDREKASDAGRKGGQHSGGNFKNDPQRASEA 43 >UniRef50_C9Y566 Uncharacterized protein yciG n=16 Tax=Gammaproteobacteria RepID=C9Y566_CROTZ Length = 79 Score = 76.6 bits (187), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 44/75 (58%), Positives = 50/75 (66%), Gaps = 22/75 (29%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQ---------------------- 38 M HRGGSGNFAEDR+RAS+AG+KGGQHS GNFKNDPQ Sbjct: 1 MTQHRGGSGNFAEDRQRASDAGQKGGQHSSGNFKNDPQRASEAGQKGGQQSGGNFKNDRE 60 Query: 39 RASEAGKKGGKSSHG 53 RASEAG+KGG++SHG Sbjct: 61 RASEAGRKGGENSHG 75 >UniRef50_Q88HT4 Putative uncharacterized protein n=1 Tax=Pseudomonas putida KT2440 RepID=Q88HT4_PSEPK Length = 85 Score = 70.9 bits (172), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 37/46 (80%), Positives = 40/46 (86%) Query: 8 SGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 SGNFA DRERAS AG KGGQ SGGNF ND QRASEAG+KGG++SHG Sbjct: 35 SGNFANDRERASRAGHKGGQASGGNFANDRQRASEAGRKGGRNSHG 80 >UniRef50_C1DH84 Putative uncharacterized protein n=2 Tax=Pseudomonadaceae RepID=C1DH84_AZOVD Length = 95 Score = 59.7 bits (143), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 32/45 (71%), Positives = 38/45 (84%) Query: 9 GNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 GNF D ERASEAG KGG++SGGNF ND +RA+EAG+KGG+ SHG Sbjct: 46 GNFRNDPERASEAGSKGGRNSGGNFANDRERAAEAGRKGGQHSHG 90 Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 29/40 (72%), Positives = 33/40 (82%) Query: 14 DRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 DRE E G KGGQHSGGNF+NDP+RASEAG KGG++S G Sbjct: 29 DREFYQEIGSKGGQHSGGNFRNDPERASEAGSKGGRNSGG 68 >UniRef50_Q9KX95 Putative uncharacterized protein ymdF n=1 Tax=Enterobacteria phage VT2-Sakai RepID=Q9KX95_BPVT2 Length = 57 Score = 57.0 bits (136), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 35/57 (61%), Positives = 35/57 (61%), Gaps = 22/57 (38%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHGKSDN 57 MANHRGGSGNFAEDRERASE AGKKGGKSSHGKSDN Sbjct: 23 MANHRGGSGNFAEDRERASE----------------------AGKKGGKSSHGKSDN 57 >UniRef50_Q5FP81 Putative uncharacterized protein n=1 Tax=Gluconobacter oxydans RepID=Q5FP81_GLUOX Length = 129 Score = 56.2 bits (134), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 33/67 (49%), Positives = 41/67 (61%), Gaps = 20/67 (29%) Query: 6 GGSGNFAEDRERASEAGKKGGQHSGGN--------------------FKNDPQRASEAGK 45 G GNF++DRERASEAG+KGG+HS G F DPQ+ASEAG+ Sbjct: 61 GNPGNFSQDRERASEAGRKGGEHSHGGHSTKETSKSSSSGGEHSTRGFAKDPQKASEAGR 120 Query: 46 KGGKSSH 52 KGG++SH Sbjct: 121 KGGQASH 127 >UniRef50_C6B8D0 Putative uncharacterized protein n=3 Tax=Rhizobiales RepID=C6B8D0_RHILS Length = 66 Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 26/40 (65%), Positives = 31/40 (77%) Query: 14 DRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 D ++ + KGGQ SGGNF NDP+RASEAGKKGG+SS G Sbjct: 23 DDDKQRDIASKGGQASGGNFANDPERASEAGKKGGQSSSG 62 >UniRef50_A9HNT7 Putative uncharacterized protein n=2 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HNT7_GLUDA Length = 101 Score = 50.1 bits (118), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 31/70 (44%), Positives = 39/70 (55%), Gaps = 23/70 (32%) Query: 9 GNFAEDRERASEAGKKGGQHS-----------------------GGNFKNDPQRASEAGK 45 GNFA + ++ASEAG KGGQHS GNF DP+RA+EAG+ Sbjct: 32 GNFANNPQKASEAGHKGGQHSSGSSARESDKSSETGHKGGTQHNAGNFAEDPERAAEAGR 91 Query: 46 KGGKSSHGKS 55 K G+ SHGK+ Sbjct: 92 KDGQHSHGKT 101 Score = 40.0 bits (92), Expect = 0.022, Method: Compositional matrix adjust. Identities = 23/47 (48%), Positives = 31/47 (65%) Query: 9 GNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHGKS 55 G A +A ++G G +H+ GNF N+PQ+ASEAG KGG+ S G S Sbjct: 10 GELAFVLAQADKSGGSGAKHNPGNFANNPQKASEAGHKGGQHSSGSS 56 >UniRef50_Q1QTW4 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QTW4_CHRSD Length = 105 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 26/46 (56%), Positives = 32/46 (69%) Query: 10 NFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHGKS 55 NF D RA+EAG+KGG+ S GNFK D RA EAG+KGG+ +S Sbjct: 59 NFKHDPGRAAEAGRKGGRQSPGNFKQDSDRAREAGRKGGRQRASRS 104 Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust. Identities = 29/46 (63%), Positives = 37/46 (80%) Query: 8 SGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 +GNFA D RA+EAG+KGG+ S NFK+DP RA+EAG+KGG+ S G Sbjct: 35 AGNFANDAARAAEAGRKGGRMSPTNFKHDPGRAAEAGRKGGRQSPG 80 >UniRef50_A4XUQ0 Putative uncharacterized protein n=1 Tax=Pseudomonas mendocina ymp RepID=A4XUQ0_PSEMY Length = 98 Score = 46.6 bits (109), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 31/45 (68%), Positives = 40/45 (88%) Query: 9 GNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 GN A DR++ASEAGKKGG+ SGG+F ND Q+A+EAG+KGG++SHG Sbjct: 50 GNLANDRQKASEAGKKGGRSSGGSFANDRQKATEAGRKGGRNSHG 94 >UniRef50_Q88FW4 Conserved domain protein n=3 Tax=Pseudomonas putida RepID=Q88FW4_PSEPK Length = 108 Score = 43.1 bits (100), Expect = 0.002, Method: Compositional matrix adjust. Identities = 26/49 (53%), Positives = 37/49 (75%), Gaps = 5/49 (10%) Query: 11 FAEDRERASEAGKKGG--QHSGGNFKNDPQRASEAGKKGGKSSHGKSDN 57 A D+E+A G++GG Q++ GNF NDPQRASEAG+KGG+ S G+ ++ Sbjct: 1 MANDQEKA---GQQGGTAQNNPGNFANDPQRASEAGQKGGQMSGGRPED 46 Score = 42.0 bits (97), Expect = 0.006, Method: Compositional matrix adjust. Identities = 27/58 (46%), Positives = 34/58 (58%), Gaps = 14/58 (24%) Query: 9 GNFAEDRERASEAGKKGGQHSGGNF--------------KNDPQRASEAGKKGGKSSH 52 GNFA D +RASEAG+KGGQ SGG ++DPQRA A ++GG+ H Sbjct: 20 GNFANDPQRASEAGQKGGQMSGGRPEDQQTGHGSGSNRQEDDPQRAGGASQRGGQGGH 77 >UniRef50_B1J8T3 Putative uncharacterized protein n=4 Tax=Pseudomonas RepID=B1J8T3_PSEPW Length = 72 Score = 40.4 bits (93), Expect = 0.016, Method: Compositional matrix adjust. Identities = 25/51 (49%), Positives = 33/51 (64%), Gaps = 9/51 (17%) Query: 9 GNFAEDRERASEAGKKGGQHSGGNFKNDPQ---------RASEAGKKGGKS 50 GNFA DR++ASEAG+KGGQ SGG D + + E+G+KGG+S Sbjct: 22 GNFANDRQKASEAGRKGGQSSGGGMAQDRESGRKDADMHKDRESGQKGGRS 72 >UniRef50_D2LEZ1 Putative uncharacterized protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LEZ1_RHOVA Length = 65 Score = 38.9 bits (89), Expect = 0.045, Method: Compositional matrix adjust. Identities = 18/18 (100%), Positives = 18/18 (100%) Query: 31 GNFKNDPQRASEAGKKGG 48 GNFKNDPQRASEAGKKGG Sbjct: 12 GNFKNDPQRASEAGKKGG 29 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B4T6X3 Conserved domain protein n=67 Tax=Gammaproteobac... 58 6e-08 UniRef50_B5YXI6 Conserved domain protein n=68 Tax=Bacteria RepID... 58 9e-08 UniRef50_Q88HT4 Putative uncharacterized protein n=1 Tax=Pseudom... 50 3e-05 UniRef50_P21361 Uncharacterized protein yciG n=68 Tax=Proteobact... 48 1e-04 UniRef50_C1DH84 Putative uncharacterized protein n=2 Tax=Pseudom... 47 2e-04 UniRef50_C9Y566 Uncharacterized protein yciG n=16 Tax=Gammaprote... 46 2e-04 Sequences not found previously or not previously below threshold: UniRef50_A4XUQ0 Putative uncharacterized protein n=1 Tax=Pseudom... 42 0.006 UniRef50_Q1QTW4 Putative uncharacterized protein n=1 Tax=Chromoh... 41 0.008 UniRef50_Q5FP81 Putative uncharacterized protein n=1 Tax=Glucono... 39 0.038 CONVERGED! >UniRef50_B4T6X3 Conserved domain protein n=67 Tax=Gammaproteobacteria RepID=B4T6X3_SALNS Length = 60 Score = 58.4 bits (139), Expect = 6e-08, Method: Composition-based stats. Identities = 47/53 (88%), Positives = 52/53 (98%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 MA HRGGSGNFAE+RE+ASEAG+KGGQHSGGNFKNDPQRASEAGKKGG++SHG Sbjct: 1 MAEHRGGSGNFAENREKASEAGRKGGQHSGGNFKNDPQRASEAGKKGGQNSHG 53 >UniRef50_B5YXI6 Conserved domain protein n=68 Tax=Bacteria RepID=B5YXI6_ECO5E Length = 60 Score = 58.0 bits (138), Expect = 9e-08, Method: Composition-based stats. Identities = 44/53 (83%), Positives = 51/53 (96%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 M HRGG+GNFAEDR++AS+AG+KGGQHSGGNFKNDPQRASEAGKKGG++SHG Sbjct: 1 MTEHRGGAGNFAEDRQKASDAGRKGGQHSGGNFKNDPQRASEAGKKGGQNSHG 53 >UniRef50_Q88HT4 Putative uncharacterized protein n=1 Tax=Pseudomonas putida KT2440 RepID=Q88HT4_PSEPK Length = 85 Score = 49.5 bits (116), Expect = 3e-05, Method: Composition-based stats. Identities = 38/51 (74%), Positives = 41/51 (80%) Query: 7 GSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHGKSDN 57 SGNFA DRERAS AG KGGQ SGGNF ND QRASEAG+KGG++SHG N Sbjct: 34 NSGNFANDRERASRAGHKGGQASGGNFANDRQRASEAGRKGGRNSHGAGRN 84 >UniRef50_P21361 Uncharacterized protein yciG n=68 Tax=Proteobacteria RepID=YCIG_ECOLI Length = 59 Score = 48.0 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 39/43 (90%), Positives = 42/43 (97%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEA 43 MA HRGGSGNFAEDRE+AS+AG+KGGQHSGGNFKNDPQRASEA Sbjct: 1 MAEHRGGSGNFAEDREKASDAGRKGGQHSGGNFKNDPQRASEA 43 >UniRef50_C1DH84 Putative uncharacterized protein n=2 Tax=Pseudomonadaceae RepID=C1DH84_AZOVD Length = 95 Score = 46.8 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 32/45 (71%), Positives = 38/45 (84%) Query: 9 GNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 GNF D ERASEAG KGG++SGGNF ND +RA+EAG+KGG+ SHG Sbjct: 46 GNFRNDPERASEAGSKGGRNSGGNFANDRERAAEAGRKGGQHSHG 90 >UniRef50_C9Y566 Uncharacterized protein yciG n=16 Tax=Gammaproteobacteria RepID=C9Y566_CROTZ Length = 79 Score = 46.5 bits (108), Expect = 2e-04, Method: Composition-based stats. Identities = 44/57 (77%), Positives = 49/57 (85%) Query: 1 MANHRGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHGKSDN 57 M HRGGSGNFAEDR+RAS+AG+KGGQHS GNFKNDPQRASEAG+KGG+ S G N Sbjct: 1 MTQHRGGSGNFAEDRQRASDAGQKGGQHSSGNFKNDPQRASEAGQKGGQQSGGNFKN 57 >UniRef50_A4XUQ0 Putative uncharacterized protein n=1 Tax=Pseudomonas mendocina ymp RepID=A4XUQ0_PSEMY Length = 98 Score = 41.8 bits (96), Expect = 0.006, Method: Composition-based stats. Identities = 31/47 (65%), Positives = 40/47 (85%) Query: 7 GSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHG 53 GN A DR++ASEAGKKGG+ SGG+F ND Q+A+EAG+KGG++SHG Sbjct: 48 NPGNLANDRQKASEAGKKGGRSSGGSFANDRQKATEAGRKGGRNSHG 94 >UniRef50_Q1QTW4 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QTW4_CHRSD Length = 105 Score = 41.4 bits (95), Expect = 0.008, Method: Composition-based stats. Identities = 27/51 (52%), Positives = 33/51 (64%) Query: 5 RGGSGNFAEDRERASEAGKKGGQHSGGNFKNDPQRASEAGKKGGKSSHGKS 55 R NF D RA+EAG+KGG+ S GNFK D RA EAG+KGG+ +S Sbjct: 54 RMSPTNFKHDPGRAAEAGRKGGRQSPGNFKQDSDRAREAGRKGGRQRASRS 104 >UniRef50_Q5FP81 Putative uncharacterized protein n=1 Tax=Gluconobacter oxydans RepID=Q5FP81_GLUOX Length = 129 Score = 39.1 bits (89), Expect = 0.038, Method: Composition-based stats. Identities = 33/67 (49%), Positives = 41/67 (61%), Gaps = 20/67 (29%) Query: 6 GGSGNFAEDRERASEAGKKGGQHSGGN--------------------FKNDPQRASEAGK 45 G GNF++DRERASEAG+KGG+HS G F DPQ+ASEAG+ Sbjct: 61 GNPGNFSQDRERASEAGRKGGEHSHGGHSTKETSKSSSSGGEHSTRGFAKDPQKASEAGR 120 Query: 46 KGGKSSH 52 KGG++SH Sbjct: 121 KGGQASH 127 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.290 0.113 0.287 Lambda K H 0.267 0.0349 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 192,600,630 Number of Sequences: 3077464 Number of extensions: 5503245 Number of successful extensions: 23089 Number of sequences better than 1.0e-01: 25 Number of HSP's better than 0.1 without gapping: 39 Number of HSP's successfully gapped in prelim test: 11 Number of HSP's that attempted gapping in prelim test: 22924 Number of HSP's gapped (non-prelim): 134 length of query: 57 length of database: 1,040,396,356 effective HSP length: 30 effective length of query: 27 effective length of database: 948,072,436 effective search space: 25597955772 effective search space used: 25597955772 T: 11 A: 40 X1: 16 ( 6.7 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 43 (21.1 bits) S2: 86 (38.0 bits)