BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (96 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P68662 Uncharacterized protein ybcO n=57 Tax=root RepID... 198 4e-50 UniRef50_B4ESM1 Phage protein n=32 Tax=root RepID=B4ESM1_PROMH 131 7e-30 UniRef50_D0FSB7 Conserved bacteriophage protein n=3 Tax=Enteroba... 115 4e-25 UniRef50_UPI0001AF4584 hypothetical protein Psyrpo1_28777 n=2 Ta... 78 1e-13 UniRef50_Q1GIK2 Putative uncharacterized protein n=1 Tax=Ruegeri... 75 8e-13 UniRef50_A2P365 Gp66 n=1 Tax=Vibrio cholerae 1587 RepID=A2P365_V... 67 2e-10 UniRef50_D0Z7P2 Putative uncharacterized protein n=3 Tax=Enterob... 65 8e-10 UniRef50_A6F0L5 Putative uncharacterized protein n=2 Tax=root Re... 53 4e-06 UniRef50_A4JWC7 Putative uncharacterized protein n=1 Tax=Burkhol... 45 5e-04 UniRef50_C5CJN8 Putative uncharacterized protein n=1 Tax=Variovo... 42 0.007 >UniRef50_P68662 Uncharacterized protein ybcO n=57 Tax=root RepID=YBCO_BP82 Length = 96 Score = 198 bits (504), Expect = 4e-50, Method: Compositional matrix adjust. Identities = 96/96 (100%), Positives = 96/96 (100%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE Sbjct: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 Query: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA 96 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA Sbjct: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA 96 >UniRef50_B4ESM1 Phage protein n=32 Tax=root RepID=B4ESM1_PROMH Length = 96 Score = 131 bits (329), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 59/95 (62%), Positives = 70/95 (73%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 M +LR A+GRECQ+RIP VCNGN ET VLAH R++GLCG G K DL ACSACHDE Sbjct: 1 MMNLRNEAKGRECQIRIPSVCNGNSETVVLAHYRMSGLCGAGIKSHDLFGAWACSACHDE 60 Query: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIK 95 +DRRT F D YAK+C LEG+ RTQ I ++EG + Sbjct: 61 VDRRTRFTDMEYAKQCHLEGVLRTQAILIQEGKLN 95 >UniRef50_D0FSB7 Conserved bacteriophage protein n=3 Tax=Enterobacteriaceae RepID=D0FSB7_ERWPY Length = 159 Score = 115 bits (288), Expect = 4e-25, Method: Compositional matrix adjust. Identities = 58/94 (61%), Positives = 65/94 (69%), Gaps = 2/94 (2%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 DLRK ARGRECQVRIPG CNGNPET VLAH R+ G CGTG KP D A IAC+ CHD ID Sbjct: 66 DLRKKARGRECQVRIPGYCNGNPETCVLAHYRMAGTCGTGYKPDDQQAAIACNGCHDAID 125 Query: 63 RRTHFVDAGYAKECAL--EGMARTQVIWLKEGVI 94 RT D + + + EG+ RTQ IW +EG I Sbjct: 126 GRTKTTDYTHDELRLMHAEGVLRTQAIWRREGFI 159 >UniRef50_UPI0001AF4584 hypothetical protein Psyrpo1_28777 n=2 Tax=Pseudomonas syringae group RepID=UPI0001AF4584 Length = 101 Score = 77.8 bits (190), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 45/95 (47%), Positives = 55/95 (57%), Gaps = 4/95 (4%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDR 63 L AAR RECQ+R PG C+ T+VLAH RL G CG G KP DL A AC+ CHD D Sbjct: 7 LTNAARDRECQIRYPG-CSSESSTTVLAHYRLAGTCGMGIKPNDLQAAWACAYCHDIADG 65 Query: 64 RTHFVDAGYAKECAL---EGMARTQVIWLKEGVIK 95 R +E L EG+ RTQ ++EG++K Sbjct: 66 RLRAPAVLSREEVRLFHAEGVMRTQDALIREGMVK 100 >UniRef50_Q1GIK2 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GIK2_SILST Length = 101 Score = 74.7 bits (182), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 35/64 (54%), Positives = 42/64 (65%), Gaps = 1/64 (1%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 M LR AA+G+ C +R+P CN NPET+ L HIR G GT KP D +A ACS CHD Sbjct: 8 MTKLRTAAKGQPCTLRLP-CCNNNPETTSLCHIRAFGWAGTSEKPMDFLAVFACSDCHDA 66 Query: 61 IDRR 64 +DRR Sbjct: 67 LDRR 70 >UniRef50_A2P365 Gp66 n=1 Tax=Vibrio cholerae 1587 RepID=A2P365_VIBCH Length = 95 Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 37/91 (40%), Positives = 55/91 (60%), Gaps = 6/91 (6%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDR 63 + ++ARG++C +R+ G+CN NPET+V AH+ + G G K D + ACSACH EID Sbjct: 8 IMQSARGKQCTLRLVGICNFNPETTVAAHVGVRR--GMGIKCGDNMVVYACSACHAEIDS 65 Query: 64 RTHFVDAGYAKECALEGMARTQVIWLKEGVI 94 + YA + L G+ TQ I ++EG+ Sbjct: 66 SSR---ESYAAD-KLRGIEETQEILVEEGLF 92 >UniRef50_D0Z7P2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=D0Z7P2_EDWTE Length = 100 Score = 64.7 bits (156), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 29/64 (45%), Positives = 42/64 (65%), Gaps = 1/64 (1%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDR 63 R++ARG+ C ++IPG+CNG+P+T VL H+ + + G G K D A CSACHD +D Sbjct: 12 WRESARGQGCTLQIPGICNGDPQTVVLCHLP-SPMHGMGYKSDDFWAVYGCSACHDALDG 70 Query: 64 RTHF 67 R + Sbjct: 71 RAPY 74 >UniRef50_A6F0L5 Putative uncharacterized protein n=2 Tax=root RepID=A6F0L5_9ALTE Length = 110 Score = 52.8 bits (125), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 29/64 (45%), Positives = 38/64 (59%), Gaps = 1/64 (1%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDR 63 +R AARG+ C ++I GVC+G+ T+VLAH+ G K DL A AC ACH ID Sbjct: 13 IRDAARGQPCTLQIVGVCSGDWSTTVLAHLPDESH-GIARKSDDLSACFACDACHSVIDG 71 Query: 64 RTHF 67 R + Sbjct: 72 RAKW 75 >UniRef50_A4JWC7 Putative uncharacterized protein n=1 Tax=Burkholderia vietnamiensis G4 RepID=A4JWC7_BURVG Length = 190 Score = 45.4 bits (106), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 27/65 (41%), Positives = 37/65 (56%), Gaps = 3/65 (4%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIR-LTGLCGTGTK--PPDLIATIACSAC 57 M+ + +ARG C +R+PGVCN +PET+V AH + G G K D I AC +C Sbjct: 1 MSKITDSARGETCALRLPGVCNRDPETTVWAHGNDVEGGKAKGKKLLRYDHIGCYACYSC 60 Query: 58 HDEID 62 H +D Sbjct: 61 HMVLD 65 >UniRef50_C5CJN8 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CJN8_VARPS Length = 164 Score = 41.6 bits (96), Expect = 0.007, Method: Compositional matrix adjust. Identities = 26/62 (41%), Positives = 33/62 (53%), Gaps = 3/62 (4%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGT--GTKPPDLIATIACSACHDEI 61 L ARG+ C +R+ VCN +P T V AH L G+ G K D ACSACH + Sbjct: 45 LLSMARGQSCVLRVEEVCNRDPATVVAAHSNL-GIHGKAGARKADDQYHVHACSACHQWL 103 Query: 62 DR 63 D+ Sbjct: 104 DQ 105 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B4ESM1 Phage protein n=32 Tax=root RepID=B4ESM1_PROMH 142 2e-33 UniRef50_P68662 Uncharacterized protein ybcO n=57 Tax=root RepID... 142 3e-33 UniRef50_D0FSB7 Conserved bacteriophage protein n=3 Tax=Enteroba... 137 9e-32 UniRef50_UPI0001AF4584 hypothetical protein Psyrpo1_28777 n=2 Ta... 119 4e-26 UniRef50_A2P365 Gp66 n=1 Tax=Vibrio cholerae 1587 RepID=A2P365_V... 106 2e-22 UniRef50_D0Z7P2 Putative uncharacterized protein n=3 Tax=Enterob... 100 2e-20 UniRef50_A6F0L5 Putative uncharacterized protein n=2 Tax=root Re... 98 1e-19 UniRef50_Q1GIK2 Putative uncharacterized protein n=1 Tax=Ruegeri... 95 6e-19 UniRef50_A4JWC7 Putative uncharacterized protein n=1 Tax=Burkhol... 92 7e-18 Sequences not found previously or not previously below threshold: UniRef50_C9D3I7 Putative uncharacterized protein n=1 Tax=Silicib... 65 8e-10 UniRef50_C5CJN8 Putative uncharacterized protein n=1 Tax=Variovo... 57 1e-07 UniRef50_Q7WLR3 Putative uncharacterized protein n=1 Tax=Bordete... 56 4e-07 UniRef50_D1DIA9 Predicted protein n=16 Tax=Neisseria gonorrhoeae... 48 8e-05 UniRef50_Q7WKA2 Putative uncharacterized protein n=2 Tax=Bordete... 40 0.025 UniRef50_Q2T6G6 Gp74 n=13 Tax=root RepID=Q2T6G6_BURTA 39 0.049 >UniRef50_B4ESM1 Phage protein n=32 Tax=root RepID=B4ESM1_PROMH Length = 96 Score = 142 bits (359), Expect = 2e-33, Method: Composition-based stats. Identities = 59/95 (62%), Positives = 70/95 (73%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 M +LR A+GRECQ+RIP VCNGN ET VLAH R++GLCG G K DL ACSACHDE Sbjct: 1 MMNLRNEAKGRECQIRIPSVCNGNSETVVLAHYRMSGLCGAGIKSHDLFGAWACSACHDE 60 Query: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIK 95 +DRRT F D YAK+C LEG+ RTQ I ++EG + Sbjct: 61 VDRRTRFTDMEYAKQCHLEGVLRTQAILIQEGKLN 95 >UniRef50_P68662 Uncharacterized protein ybcO n=57 Tax=root RepID=YBCO_BP82 Length = 96 Score = 142 bits (358), Expect = 3e-33, Method: Composition-based stats. Identities = 96/96 (100%), Positives = 96/96 (100%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE Sbjct: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 Query: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA 96 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA Sbjct: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA 96 >UniRef50_D0FSB7 Conserved bacteriophage protein n=3 Tax=Enterobacteriaceae RepID=D0FSB7_ERWPY Length = 159 Score = 137 bits (346), Expect = 9e-32, Method: Composition-based stats. Identities = 58/94 (61%), Positives = 65/94 (69%), Gaps = 2/94 (2%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 DLRK ARGRECQVRIPG CNGNPET VLAH R+ G CGTG KP D A IAC+ CHD ID Sbjct: 66 DLRKKARGRECQVRIPGYCNGNPETCVLAHYRMAGTCGTGYKPDDQQAAIACNGCHDAID 125 Query: 63 RRTHFVDAGYAKECAL--EGMARTQVIWLKEGVI 94 RT D + + + EG+ RTQ IW +EG I Sbjct: 126 GRTKTTDYTHDELRLMHAEGVLRTQAIWRREGFI 159 >UniRef50_UPI0001AF4584 hypothetical protein Psyrpo1_28777 n=2 Tax=Pseudomonas syringae group RepID=UPI0001AF4584 Length = 101 Score = 119 bits (297), Expect = 4e-26, Method: Composition-based stats. Identities = 45/97 (46%), Positives = 55/97 (56%), Gaps = 4/97 (4%) Query: 2 ADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEI 61 L AAR RECQ+R PG C+ T+VLAH RL G CG G KP DL A AC+ CHD Sbjct: 5 TKLTNAARDRECQIRYPG-CSSESSTTVLAHYRLAGTCGMGIKPNDLQAAWACAYCHDIA 63 Query: 62 DRRTHFVDAGYAKECAL---EGMARTQVIWLKEGVIK 95 D R +E L EG+ RTQ ++EG++K Sbjct: 64 DGRLRAPAVLSREEVRLFHAEGVMRTQDALIREGMVK 100 >UniRef50_A2P365 Gp66 n=1 Tax=Vibrio cholerae 1587 RepID=A2P365_VIBCH Length = 95 Score = 106 bits (264), Expect = 2e-22, Method: Composition-based stats. Identities = 37/92 (40%), Positives = 55/92 (59%), Gaps = 6/92 (6%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 + ++ARG++C +R+ G+CN NPET+V AH+ + G G K D + ACSACH EID Sbjct: 7 KIMQSARGKQCTLRLVGICNFNPETTVAAHVGVRR--GMGIKCGDNMVVYACSACHAEID 64 Query: 63 RRTHFVDAGYAKECALEGMARTQVIWLKEGVI 94 + YA + L G+ TQ I ++EG+ Sbjct: 65 SSSR---ESYAAD-KLRGIEETQEILVEEGLF 92 >UniRef50_D0Z7P2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=D0Z7P2_EDWTE Length = 100 Score = 99.8 bits (247), Expect = 2e-20, Method: Composition-based stats. Identities = 29/64 (45%), Positives = 42/64 (65%), Gaps = 1/64 (1%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDR 63 R++ARG+ C ++IPG+CNG+P+T VL H+ + + G G K D A CSACHD +D Sbjct: 12 WRESARGQGCTLQIPGICNGDPQTVVLCHLP-SPMHGMGYKSDDFWAVYGCSACHDALDG 70 Query: 64 RTHF 67 R + Sbjct: 71 RAPY 74 >UniRef50_A6F0L5 Putative uncharacterized protein n=2 Tax=root RepID=A6F0L5_9ALTE Length = 110 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 31/95 (32%), Positives = 45/95 (47%), Gaps = 3/95 (3%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 +R AARG+ C ++I GVC+G+ T+VLAH+ G K DL A AC ACH ID Sbjct: 12 KIRDAARGQPCTLQIVGVCSGDWSTTVLAHLPDES-HGIARKSDDLSACFACDACHSVID 70 Query: 63 RRTHFVDA--GYAKECALEGMARTQVIWLKEGVIK 95 R + + + RT ++ + Sbjct: 71 GRAKWPAMEREHKEWYFRRAQIRTWRALFEKNIFS 105 >UniRef50_Q1GIK2 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GIK2_SILST Length = 101 Score = 95.2 bits (235), Expect = 6e-19, Method: Composition-based stats. Identities = 35/64 (54%), Positives = 42/64 (65%), Gaps = 1/64 (1%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 M LR AA+G+ C +R+P CN NPET+ L HIR G GT KP D +A ACS CHD Sbjct: 8 MTKLRTAAKGQPCTLRLP-CCNNNPETTSLCHIRAFGWAGTSEKPMDFLAVFACSDCHDA 66 Query: 61 IDRR 64 +DRR Sbjct: 67 LDRR 70 >UniRef50_A4JWC7 Putative uncharacterized protein n=1 Tax=Burkholderia vietnamiensis G4 RepID=A4JWC7_BURVG Length = 190 Score = 91.7 bits (226), Expect = 7e-18, Method: Composition-based stats. Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 3/80 (3%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIR-LTGLCGTGTK--PPDLIATIACSAC 57 M+ + +ARG C +R+PGVCN +PET+V AH + G G K D I AC +C Sbjct: 1 MSKITDSARGETCALRLPGVCNRDPETTVWAHGNDVEGGKAKGKKLLRYDHIGCYACYSC 60 Query: 58 HDEIDRRTHFVDAGYAKECA 77 H +D + ++ Sbjct: 61 HMVLDGQAKRPAHLALEQVR 80 >UniRef50_C9D3I7 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9D3I7_9RHOB Length = 125 Score = 64.7 bits (156), Expect = 8e-10, Method: Composition-based stats. Identities = 30/103 (29%), Positives = 47/103 (45%), Gaps = 14/103 (13%) Query: 4 LRKAARGRECQVRIPGV-----CNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACH 58 + +AA G C +RI C+G PET+V H+ + G G TK D+ C+ CH Sbjct: 17 IMRAAEGSPCTLRIASFIAGKKCSG-PETTVACHLPVWG-KGVSTKVTDMATAFGCATCH 74 Query: 59 DEIDR----RTHFVDAGYAK---ECALEGMARTQVIWLKEGVI 94 +D +++ Y E L G+ T + ++ GVI Sbjct: 75 AILDGIDQDARRYLEHHYKNAVLERMLHGLTETHALLIQRGVI 117 >UniRef50_C5CJN8 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CJN8_VARPS Length = 164 Score = 57.4 bits (137), Expect = 1e-07, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 32/61 (52%), Gaps = 3/61 (4%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGT--GTKPPDLIATIACSACHDEI 61 L ARG+ C +R+ VCN +P T V AH L G+ G K D ACSACH + Sbjct: 45 LLSMARGQSCVLRVEEVCNRDPATVVAAHSNL-GIHGKAGARKADDQYHVHACSACHQWL 103 Query: 62 D 62 D Sbjct: 104 D 104 >UniRef50_Q7WLR3 Putative uncharacterized protein n=1 Tax=Bordetella bronchiseptica RepID=Q7WLR3_BORBR Length = 164 Score = 55.9 bits (133), Expect = 4e-07, Method: Composition-based stats. Identities = 21/56 (37%), Positives = 26/56 (46%), Gaps = 1/56 (1%) Query: 8 ARGRECQVRIPGVCNGNPETSVLAHI-RLTGLCGTGTKPPDLIATIACSACHDEID 62 A G C +R+P C G +++V H RL G G K D CS CH ID Sbjct: 61 AEGENCLLRVPKYCQGGTDSTVACHSNRLRDGKGKGIKAHDWAIAFGCSGCHWFID 116 >UniRef50_D1DIA9 Predicted protein n=16 Tax=Neisseria gonorrhoeae RepID=D1DIA9_NEIGO Length = 116 Score = 48.2 bits (113), Expect = 8e-05, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 38/90 (42%), Gaps = 5/90 (5%) Query: 2 ADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEI 61 + +RKAA+G +C I GVCN NPET VL G G K L A C I Sbjct: 18 SAIRKAAKGGQCTPNIAGVCNDNPETVVLCRFPGE-THGAGLKSGGLGAGFGCGCRRGAI 76 Query: 62 DRR----THFVDAGYAKECALEGMARTQVI 87 D R + Y + L + R + + Sbjct: 77 DGRGAGLSREDKEFYMRRSQLRTIRRLEAL 106 >UniRef50_Q7WKA2 Putative uncharacterized protein n=2 Tax=Bordetella bronchiseptica RepID=Q7WKA2_BORBR Length = 138 Score = 40.1 bits (92), Expect = 0.025, Method: Composition-based stats. Identities = 23/60 (38%), Positives = 27/60 (45%), Gaps = 5/60 (8%) Query: 7 AARGRECQVRIPGVCN--GNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRR 64 A RG C + G C+ G+P T V AH GTG K PD AC CH D+ Sbjct: 45 ACRGERCYQQFAGCCSYKGDP-TVVPAHQNQ--GKGTGLKVPDRFTVPACYHCHTLYDQS 101 >UniRef50_Q2T6G6 Gp74 n=13 Tax=root RepID=Q2T6G6_BURTA Length = 151 Score = 38.9 bits (89), Expect = 0.049, Method: Composition-based stats. Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 4/65 (6%) Query: 2 ADLRKAARGRECQVRIPGVCNG---NPETSVLAHIRLTGL-CGTGTKPPDLIATIACSAC 57 + A RG EC +R+PGVC E+ V H + G GTK C C Sbjct: 53 SKYLAACRGEECYLRVPGVCCSIGWPHESVVDCHSNQSKHGKGAGTKAKHEYTVPGCGPC 112 Query: 58 HDEID 62 H +D Sbjct: 113 HYWLD 117 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B4ESM1 Phage protein n=32 Tax=root RepID=B4ESM1_PROMH 127 1e-28 UniRef50_P68662 Uncharacterized protein ybcO n=57 Tax=root RepID... 124 7e-28 UniRef50_D0FSB7 Conserved bacteriophage protein n=3 Tax=Enteroba... 121 6e-27 UniRef50_A6F0L5 Putative uncharacterized protein n=2 Tax=root Re... 109 2e-23 UniRef50_UPI0001AF4584 hypothetical protein Psyrpo1_28777 n=2 Ta... 105 3e-22 UniRef50_A2P365 Gp66 n=1 Tax=Vibrio cholerae 1587 RepID=A2P365_V... 97 2e-19 UniRef50_D0Z7P2 Putative uncharacterized protein n=3 Tax=Enterob... 94 2e-18 UniRef50_A4JWC7 Putative uncharacterized protein n=1 Tax=Burkhol... 90 2e-17 UniRef50_C9D3I7 Putative uncharacterized protein n=1 Tax=Silicib... 90 2e-17 UniRef50_Q1GIK2 Putative uncharacterized protein n=1 Tax=Ruegeri... 88 8e-17 UniRef50_Q7WLR3 Putative uncharacterized protein n=1 Tax=Bordete... 76 4e-13 UniRef50_D1DIA9 Predicted protein n=16 Tax=Neisseria gonorrhoeae... 74 1e-12 UniRef50_C5CJN8 Putative uncharacterized protein n=1 Tax=Variovo... 70 2e-11 Sequences not found previously or not previously below threshold: UniRef50_Q2T6G6 Gp74 n=13 Tax=root RepID=Q2T6G6_BURTA 45 6e-04 UniRef50_Q7WKA2 Putative uncharacterized protein n=2 Tax=Bordete... 40 0.022 >UniRef50_B4ESM1 Phage protein n=32 Tax=root RepID=B4ESM1_PROMH Length = 96 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 59/95 (62%), Positives = 70/95 (73%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 M +LR A+GRECQ+RIP VCNGN ET VLAH R++GLCG G K DL ACSACHDE Sbjct: 1 MMNLRNEAKGRECQIRIPSVCNGNSETVVLAHYRMSGLCGAGIKSHDLFGAWACSACHDE 60 Query: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIK 95 +DRRT F D YAK+C LEG+ RTQ I ++EG + Sbjct: 61 VDRRTRFTDMEYAKQCHLEGVLRTQAILIQEGKLN 95 >UniRef50_P68662 Uncharacterized protein ybcO n=57 Tax=root RepID=YBCO_BP82 Length = 96 Score = 124 bits (312), Expect = 7e-28, Method: Composition-based stats. Identities = 96/96 (100%), Positives = 96/96 (100%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE Sbjct: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 Query: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA 96 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA Sbjct: 61 IDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA 96 >UniRef50_D0FSB7 Conserved bacteriophage protein n=3 Tax=Enterobacteriaceae RepID=D0FSB7_ERWPY Length = 159 Score = 121 bits (304), Expect = 6e-27, Method: Composition-based stats. Identities = 58/94 (61%), Positives = 65/94 (69%), Gaps = 2/94 (2%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 DLRK ARGRECQVRIPG CNGNPET VLAH R+ G CGTG KP D A IAC+ CHD ID Sbjct: 66 DLRKKARGRECQVRIPGYCNGNPETCVLAHYRMAGTCGTGYKPDDQQAAIACNGCHDAID 125 Query: 63 RRTHFVDAGYAKECAL--EGMARTQVIWLKEGVI 94 RT D + + + EG+ RTQ IW +EG I Sbjct: 126 GRTKTTDYTHDELRLMHAEGVLRTQAIWRREGFI 159 >UniRef50_A6F0L5 Putative uncharacterized protein n=2 Tax=root RepID=A6F0L5_9ALTE Length = 110 Score = 109 bits (273), Expect = 2e-23, Method: Composition-based stats. Identities = 31/95 (32%), Positives = 45/95 (47%), Gaps = 3/95 (3%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 +R AARG+ C ++I GVC+G+ T+VLAH+ G K DL A AC ACH ID Sbjct: 12 KIRDAARGQPCTLQIVGVCSGDWSTTVLAHLPDES-HGIARKSDDLSACFACDACHSVID 70 Query: 63 RRTHFVDA--GYAKECALEGMARTQVIWLKEGVIK 95 R + + + RT ++ + Sbjct: 71 GRAKWPAMEREHKEWYFRRAQIRTWRALFEKNIFS 105 >UniRef50_UPI0001AF4584 hypothetical protein Psyrpo1_28777 n=2 Tax=Pseudomonas syringae group RepID=UPI0001AF4584 Length = 101 Score = 105 bits (263), Expect = 3e-22, Method: Composition-based stats. Identities = 45/97 (46%), Positives = 55/97 (56%), Gaps = 4/97 (4%) Query: 2 ADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEI 61 L AAR RECQ+R PG C+ T+VLAH RL G CG G KP DL A AC+ CHD Sbjct: 5 TKLTNAARDRECQIRYPG-CSSESSTTVLAHYRLAGTCGMGIKPNDLQAAWACAYCHDIA 63 Query: 62 DRRTHFVDAGYAKECAL---EGMARTQVIWLKEGVIK 95 D R +E L EG+ RTQ ++EG++K Sbjct: 64 DGRLRAPAVLSREEVRLFHAEGVMRTQDALIREGMVK 100 >UniRef50_A2P365 Gp66 n=1 Tax=Vibrio cholerae 1587 RepID=A2P365_VIBCH Length = 95 Score = 97.1 bits (240), Expect = 2e-19, Method: Composition-based stats. Identities = 37/93 (39%), Positives = 55/93 (59%), Gaps = 6/93 (6%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 + ++ARG++C +R+ G+CN NPET+V AH+ + G G K D + ACSACH EID Sbjct: 7 KIMQSARGKQCTLRLVGICNFNPETTVAAHVGVRR--GMGIKCGDNMVVYACSACHAEID 64 Query: 63 RRTHFVDAGYAKECALEGMARTQVIWLKEGVIK 95 + YA + L G+ TQ I ++EG+ Sbjct: 65 SSSR---ESYAAD-KLRGIEETQEILVEEGLFT 93 >UniRef50_D0Z7P2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=D0Z7P2_EDWTE Length = 100 Score = 93.6 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 29/66 (43%), Positives = 42/66 (63%), Gaps = 1/66 (1%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEID 62 R++ARG+ C ++IPG+CNG+P+T VL H+ + + G G K D A CSACHD +D Sbjct: 11 AWRESARGQGCTLQIPGICNGDPQTVVLCHLP-SPMHGMGYKSDDFWAVYGCSACHDALD 69 Query: 63 RRTHFV 68 R + Sbjct: 70 GRAPYD 75 >UniRef50_A4JWC7 Putative uncharacterized protein n=1 Tax=Burkholderia vietnamiensis G4 RepID=A4JWC7_BURVG Length = 190 Score = 90.1 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 3/80 (3%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIR-LTGLCGTGTK--PPDLIATIACSAC 57 M+ + +ARG C +R+PGVCN +PET+V AH + G G K D I AC +C Sbjct: 1 MSKITDSARGETCALRLPGVCNRDPETTVWAHGNDVEGGKAKGKKLLRYDHIGCYACYSC 60 Query: 58 HDEIDRRTHFVDAGYAKECA 77 H +D + ++ Sbjct: 61 HMVLDGQAKRPAHLALEQVR 80 >UniRef50_C9D3I7 Putative uncharacterized protein n=1 Tax=Silicibacter sp. TrichCH4B RepID=C9D3I7_9RHOB Length = 125 Score = 90.1 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 30/104 (28%), Positives = 47/104 (45%), Gaps = 14/104 (13%) Query: 3 DLRKAARGRECQVRIPGV-----CNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSAC 57 + +AA G C +RI C+G PET+V H+ + G G TK D+ C+ C Sbjct: 16 AIMRAAEGSPCTLRIASFIAGKKCSG-PETTVACHLPVWG-KGVSTKVTDMATAFGCATC 73 Query: 58 HDEIDR----RTHFVDAGYAK---ECALEGMARTQVIWLKEGVI 94 H +D +++ Y E L G+ T + ++ GVI Sbjct: 74 HAILDGIDQDARRYLEHHYKNAVLERMLHGLTETHALLIQRGVI 117 >UniRef50_Q1GIK2 Putative uncharacterized protein n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GIK2_SILST Length = 101 Score = 88.2 bits (217), Expect = 8e-17, Method: Composition-based stats. Identities = 35/65 (53%), Positives = 42/65 (64%), Gaps = 1/65 (1%) Query: 1 MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 M LR AA+G+ C +R+P CN NPET+ L HIR G GT KP D +A ACS CHD Sbjct: 8 MTKLRTAAKGQPCTLRLP-CCNNNPETTSLCHIRAFGWAGTSEKPMDFLAVFACSDCHDA 66 Query: 61 IDRRT 65 +DRR Sbjct: 67 LDRRR 71 >UniRef50_Q7WLR3 Putative uncharacterized protein n=1 Tax=Bordetella bronchiseptica RepID=Q7WLR3_BORBR Length = 164 Score = 75.9 bits (185), Expect = 4e-13, Method: Composition-based stats. Identities = 22/61 (36%), Positives = 27/61 (44%), Gaps = 1/61 (1%) Query: 3 DLRKAARGRECQVRIPGVCNGNPETSVLAHI-RLTGLCGTGTKPPDLIATIACSACHDEI 61 L A G C +R+P C G +++V H RL G G K D CS CH I Sbjct: 56 ALLALAEGENCLLRVPKYCQGGTDSTVACHSNRLRDGKGKGIKAHDWAIAFGCSGCHWFI 115 Query: 62 D 62 D Sbjct: 116 D 116 >UniRef50_D1DIA9 Predicted protein n=16 Tax=Neisseria gonorrhoeae RepID=D1DIA9_NEIGO Length = 116 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 29/90 (32%), Positives = 38/90 (42%), Gaps = 5/90 (5%) Query: 2 ADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEI 61 + +RKAA+G +C I GVCN NPET VL G G K L A C I Sbjct: 18 SAIRKAAKGGQCTPNIAGVCNDNPETVVLCRFPGE-THGAGLKSGGLGAGFGCGCRRGAI 76 Query: 62 DRR----THFVDAGYAKECALEGMARTQVI 87 D R + Y + L + R + + Sbjct: 77 DGRGAGLSREDKEFYMRRSQLRTIRRLEAL 106 >UniRef50_C5CJN8 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CJN8_VARPS Length = 164 Score = 70.1 bits (170), Expect = 2e-11, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 32/61 (52%), Gaps = 3/61 (4%) Query: 4 LRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGT--GTKPPDLIATIACSACHDEI 61 L ARG+ C +R+ VCN +P T V AH L G+ G K D ACSACH + Sbjct: 45 LLSMARGQSCVLRVEEVCNRDPATVVAAHSNL-GIHGKAGARKADDQYHVHACSACHQWL 103 Query: 62 D 62 D Sbjct: 104 D 104 >UniRef50_Q2T6G6 Gp74 n=13 Tax=root RepID=Q2T6G6_BURTA Length = 151 Score = 45.4 bits (106), Expect = 6e-04, Method: Composition-based stats. Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 4/65 (6%) Query: 2 ADLRKAARGRECQVRIPGVCNG---NPETSVLAHIRLTG-LCGTGTKPPDLIATIACSAC 57 + A RG EC +R+PGVC E+ V H + G GTK C C Sbjct: 53 SKYLAACRGEECYLRVPGVCCSIGWPHESVVDCHSNQSKHGKGAGTKAKHEYTVPGCGPC 112 Query: 58 HDEID 62 H +D Sbjct: 113 HYWLD 117 >UniRef50_Q7WKA2 Putative uncharacterized protein n=2 Tax=Bordetella bronchiseptica RepID=Q7WKA2_BORBR Length = 138 Score = 40.0 bits (92), Expect = 0.022, Method: Composition-based stats. Identities = 24/75 (32%), Positives = 29/75 (38%), Gaps = 5/75 (6%) Query: 3 DLRKAARGRECQVRIPGVCN--GNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDE 60 A RG C + G C+ G+P T V AH GTG K PD AC CH Sbjct: 41 KYLAACRGERCYQQFAGCCSYKGDP-TVVPAHQN--QGKGTGLKVPDRFTVPACYHCHTL 97 Query: 61 IDRRTHFVDAGYAKE 75 D+ + A Sbjct: 98 YDQSGLDREHKRATW 112 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.319 0.134 0.385 Lambda K H 0.267 0.0414 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 543,039,845 Number of Sequences: 3077464 Number of extensions: 18440487 Number of successful extensions: 42979 Number of sequences better than 1.0e-01: 15 Number of HSP's better than 0.1 without gapping: 34 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 42902 Number of HSP's gapped (non-prelim): 43 length of query: 96 length of database: 1,040,396,356 effective HSP length: 65 effective length of query: 31 effective length of database: 840,361,196 effective search space: 26051197076 effective search space used: 26051197076 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 87 (38.1 bits)