BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (135 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P08370 Uncharacterized protein ygdB n=116 Tax=Enterobac... 274 6e-73 UniRef50_C4WYP9 Putative uncharacterized protein n=5 Tax=Klebsie... 100 2e-20 UniRef50_A7MR34 Putative uncharacterized protein n=2 Tax=Cronoba... 70 2e-11 UniRef50_D0FVF4 Conserved uncharacterized protein n=3 Tax=Erwini... 64 2e-09 UniRef50_C6DAF2 Putative uncharacterized protein n=5 Tax=Pectoba... 52 4e-06 UniRef50_A4TLC2 Membrane protein n=31 Tax=Yersinia RepID=A4TLC2_... 48 9e-05 UniRef50_C0AT30 Putative uncharacterized protein n=1 Tax=Proteus... 44 0.001 UniRef50_B2PX29 Putative uncharacterized protein n=3 Tax=Provide... 44 0.002 UniRef50_UPI000197C60D hypothetical protein PretD1_07015 n=1 Tax... 42 0.007 UniRef50_C6CHQ7 Putative uncharacterized protein n=1 Tax=Dickeya... 
42 0.008 UniRef50_C4K851 Putative uncharacterized protein n=1 Tax=Candida... 40 0.032 UniRef50_B4F2G1 Putative membrane protein n=2 Tax=Proteus mirabi... 39 0.056 >UniRef50_P08370 Uncharacterized protein ygdB n=116 Tax=Enterobacteriaceae RepID=YGDB_ECOLI Length = 135 Score = 274 bits (701), Expect = 6e-73, Method: Compositional matrix adjust. Identities = 135/135 (100%), Positives = 135/135 (100%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM Sbjct: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 Query: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW Sbjct: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 Query: 121 SDFCPLKERALCQLP 135 SDFCPLKERALCQLP Sbjct: 121 SDFCPLKERALCQLP 135 >UniRef50_C4WYP9 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4WYP9_KLEPN Length = 135 Score = 100 bits (248), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 60/135 (44%), Positives = 88/135 (65%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 M R++G+SSL +VL+LL LG LLL+G++ Q R+ ++ + E+Q++R AI SAL WGK Sbjct: 1 MIRQRGMSSLLMVLLLLTLGCLLLEGLNLQQRALLAQTASETQAIRDTAIAHSALQWGKQ 60 Query: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 W Q A+ C + A + CLR+ D +L + V +W++GEV G + FS GW Sbjct: 61 QVWSAQVALACREQAPQGWRACLRIFGDGSLVLSSASGEVQVWQSGEVRGGQVRFSAHGW 120 Query: 121 SDFCPLKERALCQLP 135 SDFCPL+E +LCQ+P Sbjct: 121 SDFCPLREASLCQMP 135 >UniRef50_A7MR34 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=A7MR34_ENTS8 Length = 140 Score = 70.5 bits (171), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 54/140 (38%), Positives = 75/140 (53%), Gaps = 10/140 (7%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 REKG+S++A+VL LL+LGSL+L G+ QQ S R + ES +++ SALA + Sbjct: 4 REKGMSTIAMVLALLLLGSLMLGGLQQQLDSRFGRAANESAAIKAFNAALSALALSQSQR 63 Query: 63 WQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEG-------VSLWRTGEVIDGNIVF 115 W P QC E + C+R + + L+ EG ++LWR + + F Sbjct: 64 WMFTPQWQCQTLPEVKGRACVRQM---QTYLLVAAEGADENAVPLTLWRWAQPDGDRLRF 120 Query: 116 SPRGWSDFCPLKERALCQLP 135 P GWSDFCPL E CQLP Sbjct: 121 MPHGWSDFCPLTEAKQCQLP 140 >UniRef50_D0FVF4 Conserved uncharacterized protein n=3 Tax=Erwinia RepID=D0FVF4_ERWPY Length = 137 Score = 63.9 bits (154), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 10/139 (7%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 ++ ++G +L +V++LL++G+L+L +Q S V E L++ SALAWG+ Sbjct: 3 LDSQRGSGTLIMVVILLLMGTLMLNATRRQLSDAGSLVGDERIYLQQFTAATSALAWGQR 62 Query: 61 HCWQTQPAVQCSQYAETDAQVCLR-----LLADNEALLIAGYEGVSLWRTGEVIDGNIVF 115 W+ QC Q E + CL L AD+ + Y + R+GE+ Sbjct: 63 LSWKAADGWQCQQQGEYRWRACLHFARMLLRADSGPQTLVLYHWMKKSRSGELQP----- 117 Query: 116 SPRGWSDFCPLKERALCQL 134 P GW D+CPL ++ C + Sbjct: 118 RPHGWLDYCPLAKKGGCDV 136 >UniRef50_C6DAF2 Putative uncharacterized protein n=5 Tax=Pectobacterium RepID=C6DAF2_PECCP Length = 151 Score = 52.4 bits (124), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 44/142 (30%), Positives = 65/142 (45%), Gaps = 13/142 (9%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 +KG +LA+VL++ V+G LL+ G+ +Q S + E LR S+L WG W Sbjct: 6 QKGSGTLAMVLLIAVIGLLLMSGLQRQLESAIQVGNDERHYLRAFNQALSSLNWGIGLRW 65 Query: 64 Q-TQPAVQCSQYAETDAQVCLRLLAD-------NEALLIAGYEGVSLWRTGEVI---DGN 112 + + + QC Q + VCLR ++ E L A + L++ + G Sbjct: 66 RVSTESWQCQQLSAEQLVVCLRAASEGKQGVLRGEGTLPASTRTLKLYQRVSFLALSSGQ 125 Query: 113 IVFSP--RGWSDFCPLKERALC 132 I P GW DFCP KE C Sbjct: 126 IAIQPLANGWLDFCPDKEVTRC 147 >UniRef50_A4TLC2 Membrane protein n=31 Tax=Yersinia RepID=A4TLC2_YERPP Length = 156 Score = 48.1 bits (113), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 36/149 (24%), Positives = 62/149 (41%), Gaps = 20/149 (13%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMH-- 61 ++G S+LA V+ L LG L + +Q + E + LR +S+L WG Sbjct: 6 QRGSSTLAAVMTLFSLGLFWLSAIHRQLDNIQQITGEEQRYLRAYNQAESSLNWGVSQRW 65 Query: 62 ----CWQTQPAVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWR------ 104 W+ A C + E + C++ + E+L + + L++ Sbjct: 66 ALRIPWRVGSAWHCMAHQELGLKACVKRSSLAGFFILKGESLPLGSLPPLMLYQRVKLKA 125 Query: 105 -TGEVIDGNIVFSPRGWSDFCPLKERALC 132 TG + ++ +P GW DFCP K+ C Sbjct: 126 VTGSSGNYQLIDTPHGWLDFCPDKDAQFC 154 >UniRef50_C0AT30 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AT30_9ENTR Length = 153 Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 37/139 (26%), Positives = 60/139 (43%), Gaps = 13/139 (9%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 E+G +SL +V+ +V+ L + R V E Q+ + +SALAWG W Sbjct: 11 EQGFASLLVVIGFIVMSLSALGNFAYHYRQSQQIVMQELQARQAFLFAESALAWGIKRNW 70 Query: 64 QTQPAV----QCSQY-AETDAQVCLRLLADNEALL------IAGYEGVSLWRTGEVIDGN 112 + + QC + + CL L++ + LL + GY+ + D N Sbjct: 71 EILSSQLNKWQCRHFHVNPKIKSCLFLISLEKGLLQGQAESLNGYKIYHYQWVDFLKDKN 130 Query: 113 --IVFSPRGWSDFCPLKER 129 +V P GW D+CPL + Sbjct: 131 KSMVTHPNGWLDYCPLVHK 149 >UniRef50_B2PX29 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2PX29_PROST Length = 147 Score = 43.5 bits (101), Expect = 0.002, Method: Compositional matrix adjust. Identities = 32/140 (22%), Positives = 58/140 (41%), Gaps = 11/140 (7%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++G +L +V++L+ +G LLL+ + + E + +SALAWG W Sbjct: 7 QRGNIALVMVIVLMTMGLLLLKTLHFYQDNARDEFFREKNYIEAFNQAESALAWGLKQSW 66 Query: 64 QTQPAV----QCSQYAETDAQVCLRLLADNEALL-----IAGYEGVSLWRTGEVIDGNIV 114 + QC + C++ + +L G V+++R I Sbjct: 67 RLTTYRGANWQCQRPPTQTWVSCIKHYKSGQFVLSGKGLYKGRHYVTVYRWVAPISKTQK 126 Query: 115 FSPR--GWSDFCPLKERALC 132 PR GW D+CP+ ++ C Sbjct: 127 VRPRIKGWLDYCPVNQKGFC 146 >UniRef50_UPI000197C60D hypothetical protein PretD1_07015 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C60D Length = 146 Score = 41.6 bits (96), Expect = 0.007, Method: Compositional matrix adjust. 
Identities = 29/143 (20%), Positives = 58/143 (40%), Gaps = 11/143 (7%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 + +++G SL +V++ + +G+LL++ + + E + +SALAWG Sbjct: 4 IEQQQGNVSLLMVIIFITIGTLLIKSVHFFQERARDELHKEIKYFDAFNKAESALAWGGT 63 Query: 61 HCWQTQ----PAVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWRTGEVI 109 W +Q C + + CL+ +L+ L+ V W + Sbjct: 64 LHWDSQYRGLKRWVCQKEENQQWKSCLKHYKGADFVLSGQSHYLLGNDIKVYRWVVWDAN 123 Query: 110 DGNIVFSPRGWSDFCPLKERALC 132 +GW D+CP+ ++ C Sbjct: 124 KQQFFSRKKGWLDYCPVIKKGFC 146 >UniRef50_C6CHQ7 Putative uncharacterized protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CHQ7_DICZE Length = 148 Score = 41.6 bits (96), Expect = 0.008, Method: Compositional matrix adjust. Identities = 40/144 (27%), Positives = 66/144 (45%), Gaps = 15/144 (10%) Query: 2 NREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQ--SALAWGK 59 R++ S L +V++++++G L+L G+ Q+ + ++ Q QA Q S+L WG Sbjct: 3 GRQQQGSILVVVMLVMMIGLLMLGGL-QRQLDVQLQQGIDEQRFW-QAFNQGVSSLNWGI 60 Query: 60 MHCWQTQPAVQCSQYAETDAQVCLRLLADN-------EALLIAGYEGVSLWR--TGEV-- 108 WQT QC +VCLR+ ++N E +I + ++ + +V Sbjct: 61 SLQWQTIEGWQCQTQPSAQLRVCLRINSENRYGLLRAEGNVIGERQPLTFYHRVVADVAA 120 Query: 109 IDGNIVFSPRGWSDFCPLKERALC 132 G I GWSDFCP C Sbjct: 121 TGGRIQPVAGGWSDFCPETMEFAC 144 >UniRef50_C4K851 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K851_HAMD5 Length = 159 Score = 39.7 bits (91), Expect = 0.032, Method: Compositional matrix adjust. 
Identities = 35/152 (23%), Positives = 63/152 (41%), Gaps = 23/152 (15%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 +KG +L VL++ +G LLL + ++ + E L+ S+L+WG W Sbjct: 6 QKGTGTLISVLIIFTMGLLLLTALQRELEGAFNISGQERFYLKAYNQAASSLSWGVSKQW 65 Query: 64 QTQPAVQCSQ------YAETDA----QVCLRLLADNEALLIAG-------YEGVSLWRTG 106 + Q + Y ET+ + C++ + + LI G E + L+++ Sbjct: 66 PLRSLSQRTSRRHQGWYCETEQINQLKSCIKPMLEIGIFLIKGESILGNRSEKIILYQSA 125 Query: 107 E------VIDGNIVFSPRGWSDFCPLKERALC 132 + + +V GW DFCP+K C Sbjct: 126 QTNEMPNTTEQKLVPVTGGWLDFCPVKNEKFC 157 >UniRef50_B4F2G1 Putative membrane protein n=2 Tax=Proteus mirabilis RepID=B4F2G1_PROMH Length = 152 Score = 38.9 bits (89), Expect = 0.056, Method: Compositional matrix adjust. Identities = 36/142 (25%), Positives = 59/142 (41%), Gaps = 19/142 (13%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 + ++GV L + +V+ LL + R V E Q+ + SAL+WG Sbjct: 8 IKEQQGVIGLMAAVGFMVISLSLLTSFAYHYRQCQLMVMQELQAKQTFLFAGSALSWGMT 67 Query: 61 HCWQTQPAV----QCSQY-AETDAQVCLRLLADNEALL-----------IAGYEGVSLWR 104 W P+ QC + AE++ + C L ALL I+ Y+ ++L Sbjct: 68 LTWDLSPSRLYQWQCRTFTAESNMKSCFSLRGKTSALLLGEAKSHSGDKISHYQWINLVG 127 Query: 105 TGEVIDGNIVFSPRGWSDFCPL 126 ++ +I GW D+CPL Sbjct: 128 KEKL---HIEAQHNGWLDYCPL 146 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P08370 Uncharacterized protein ygdB n=116 Tax=Enterobac... 162 3e-39 UniRef50_B2PX29 Putative uncharacterized protein n=3 Tax=Provide... 155 4e-37 UniRef50_D0FVF4 Conserved uncharacterized protein n=3 Tax=Erwini... 144 6e-34 UniRef50_A4TLC2 Membrane protein n=31 Tax=Yersinia RepID=A4TLC2_... 141 1e-32 UniRef50_C6DAF2 Putative uncharacterized protein n=5 Tax=Pectoba... 133 2e-30 UniRef50_C0AT30 Putative uncharacterized protein n=1 Tax=Proteus... 132 2e-30 UniRef50_A7MR34 Putative uncharacterized protein n=2 Tax=Cronoba... 
127 1e-28 UniRef50_C4WYP9 Putative uncharacterized protein n=5 Tax=Klebsie... 122 5e-27 Sequences not found previously or not previously below threshold: UniRef50_UPI000197C60D hypothetical protein PretD1_07015 n=1 Tax... 110 2e-23 UniRef50_C4K851 Putative uncharacterized protein n=1 Tax=Candida... 98 7e-20 UniRef50_C6CHQ7 Putative uncharacterized protein n=1 Tax=Dickeya... 98 7e-20 UniRef50_D2BTT7 Putative uncharacterized protein n=1 Tax=Dickeya... 92 6e-18 UniRef50_C6CC26 Putative uncharacterized protein n=1 Tax=Dickeya... 91 7e-18 UniRef50_Q7N8T9 Similar to putative membrane protein n=2 Tax=Pho... 88 7e-17 UniRef50_C4UJ53 Putative uncharacterized protein n=1 Tax=Yersini... 88 1e-16 UniRef50_B4F2G1 Putative membrane protein n=2 Tax=Proteus mirabi... 86 3e-16 UniRef50_A8GIH4 Putative uncharacterized protein n=2 Tax=Serrati... 77 2e-13 UniRef50_D2U290 Putative uncharacterized protein n=1 Tax=Arsenop... 73 3e-12 UniRef50_C5BH89 Putative uncharacterized protein n=2 Tax=Edwards... 44 0.001 >UniRef50_P08370 Uncharacterized protein ygdB n=116 Tax=Enterobacteriaceae RepID=YGDB_ECOLI Length = 135 Score = 162 bits (410), Expect = 3e-39, Method: Composition-based stats. Identities = 135/135 (100%), Positives = 135/135 (100%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM Sbjct: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 Query: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW Sbjct: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 Query: 121 SDFCPLKERALCQLP 135 SDFCPLKERALCQLP Sbjct: 121 SDFCPLKERALCQLP 135 >UniRef50_B2PX29 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2PX29_PROST Length = 147 Score = 155 bits (392), Expect = 4e-37, Method: Composition-based stats. 
Identities = 32/140 (22%), Positives = 58/140 (41%), Gaps = 11/140 (7%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++G +L +V++L+ +G LLL+ + + E + +SALAWG W Sbjct: 7 QRGNIALVMVIVLMTMGLLLLKTLHFYQDNARDEFFREKNYIEAFNQAESALAWGLKQSW 66 Query: 64 QTQPA----VQCSQYAETDAQVCLRLLADNEAL-----LIAGYEGVSLWRTGEVIDGNIV 114 + QC + C++ + + L G V+++R I Sbjct: 67 RLTTYRGANWQCQRPPTQTWVSCIKHYKSGQFVLSGKGLYKGRHYVTVYRWVAPISKTQK 126 Query: 115 FSPR--GWSDFCPLKERALC 132 PR GW D+CP+ ++ C Sbjct: 127 VRPRIKGWLDYCPVNQKGFC 146 >UniRef50_D0FVF4 Conserved uncharacterized protein n=3 Tax=Erwinia RepID=D0FVF4_ERWPY Length = 137 Score = 144 bits (364), Expect = 6e-34, Method: Composition-based stats. Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 10/139 (7%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 ++ ++G +L +V++LL++G+L+L +Q S V E L++ SALAWG+ Sbjct: 3 LDSQRGSGTLIMVVILLLMGTLMLNATRRQLSDAGSLVGDERIYLQQFTAATSALAWGQR 62 Query: 61 HCWQTQPAVQCSQYAETDAQVCLR-----LLADNEALLIAGYEGVSLWRTGEVIDGNIVF 115 W+ QC Q E + CL L AD+ + Y + R+GE+ Sbjct: 63 LSWKAADGWQCQQQGEYRWRACLHFARMLLRADSGPQTLVLYHWMKKSRSGELQP----- 117 Query: 116 SPRGWSDFCPLKERALCQL 134 P GW D+CPL ++ C + Sbjct: 118 RPHGWLDYCPLAKKGGCDV 136 >UniRef50_A4TLC2 Membrane protein n=31 Tax=Yersinia RepID=A4TLC2_YERPP Length = 156 Score = 141 bits (354), Expect = 1e-32, Method: Composition-based stats. 
Identities = 36/149 (24%), Positives = 62/149 (41%), Gaps = 20/149 (13%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC- 62 ++G S+LA V+ L LG L + +Q + E + LR +S+L WG Sbjct: 6 QRGSSTLAAVMTLFSLGLFWLSAIHRQLDNIQQITGEEQRYLRAYNQAESSLNWGVSQRW 65 Query: 63 -----WQTQPAVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWR------ 104 W+ A C + E + C++ + E+L + + L++ Sbjct: 66 ALRIPWRVGSAWHCMAHQELGLKACVKRSSLAGFFILKGESLPLGSLPPLMLYQRVKLKA 125 Query: 105 -TGEVIDGNIVFSPRGWSDFCPLKERALC 132 TG + ++ +P GW DFCP K+ C Sbjct: 126 VTGSSGNYQLIDTPHGWLDFCPDKDAQFC 154 >UniRef50_C6DAF2 Putative uncharacterized protein n=5 Tax=Pectobacterium RepID=C6DAF2_PECCP Length = 151 Score = 133 bits (334), Expect = 2e-30, Method: Composition-based stats. Identities = 44/143 (30%), Positives = 64/143 (44%), Gaps = 13/143 (9%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 +KG +LA+VL++ V+G LL+ G+ +Q S + E LR S+L WG W Sbjct: 6 QKGSGTLAMVLLIAVIGLLLMSGLQRQLESAIQVGNDERHYLRAFNQALSSLNWGIGLRW 65 Query: 64 QT-QPAVQCSQYAETDAQVCLRLLAD-------NEALLIAGYEGVSLWRTGEVI---DGN 112 + + QC Q + VCLR ++ E L A + L++ + G Sbjct: 66 RVSTESWQCQQLSAEQLVVCLRAASEGKQGVLRGEGTLPASTRTLKLYQRVSFLALSSGQ 125 Query: 113 IVFSP--RGWSDFCPLKERALCQ 133 I P GW DFCP KE C Sbjct: 126 IAIQPLANGWLDFCPDKEVTRCD 148 >UniRef50_C0AT30 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AT30_9ENTR Length = 153 Score = 132 bits (333), Expect = 2e-30, Method: Composition-based stats. 
Identities = 37/140 (26%), Positives = 60/140 (42%), Gaps = 13/140 (9%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 E+G +SL +V+ +V+ L + R V E Q+ + +SALAWG Sbjct: 10 SEQGFASLLVVIGFIVMSLSALGNFAYHYRQSQQIVMQELQARQAFLFAESALAWGIKRN 69 Query: 63 WQTQPA----VQCSQ-YAETDAQVCLRLLADNEALL------IAGYEGVSLWRTGEVIDG 111 W+ + QC + + CL L++ + LL + GY+ + D Sbjct: 70 WEILSSQLNKWQCRHFHVNPKIKSCLFLISLEKGLLQGQAESLNGYKIYHYQWVDFLKDK 129 Query: 112 N--IVFSPRGWSDFCPLKER 129 N +V P GW D+CPL + Sbjct: 130 NKSMVTHPNGWLDYCPLVHK 149 >UniRef50_A7MR34 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=A7MR34_ENTS8 Length = 140 Score = 127 bits (318), Expect = 1e-28, Method: Composition-based stats. Identities = 54/140 (38%), Positives = 75/140 (53%), Gaps = 10/140 (7%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 REKG+S++A+VL LL+LGSL+L G+ QQ S R + ES +++ SALA + Sbjct: 4 REKGMSTIAMVLALLLLGSLMLGGLQQQLDSRFGRAANESAAIKAFNAALSALALSQSQR 63 Query: 63 WQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEG-------VSLWRTGEVIDGNIVF 115 W P QC E + C+R + + L+ EG ++LWR + + F Sbjct: 64 WMFTPQWQCQTLPEVKGRACVRQM---QTYLLVAAEGADENAVPLTLWRWAQPDGDRLRF 120 Query: 116 SPRGWSDFCPLKERALCQLP 135 P GWSDFCPL E CQLP Sbjct: 121 MPHGWSDFCPLTEAKQCQLP 140 >UniRef50_C4WYP9 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4WYP9_KLEPN Length = 135 Score = 122 bits (305), Expect = 5e-27, Method: Composition-based stats. 
Identities = 60/135 (44%), Positives = 88/135 (65%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 M R++G+SSL +VL+LL LG LLL+G++ Q R+ ++ + E+Q++R AI SAL WGK Sbjct: 1 MIRQRGMSSLLMVLLLLTLGCLLLEGLNLQQRALLAQTASETQAIRDTAIAHSALQWGKQ 60 Query: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 W Q A+ C + A + CLR+ D +L + V +W++GEV G + FS GW Sbjct: 61 QVWSAQVALACREQAPQGWRACLRIFGDGSLVLSSASGEVQVWQSGEVRGGQVRFSAHGW 120 Query: 121 SDFCPLKERALCQLP 135 SDFCPL+E +LCQ+P Sbjct: 121 SDFCPLREASLCQMP 135 >UniRef50_UPI000197C60D hypothetical protein PretD1_07015 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C60D Length = 146 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 58/143 (40%), Gaps = 11/143 (7%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 + +++G SL +V++ + +G+LL++ + + E + +SALAWG Sbjct: 4 IEQQQGNVSLLMVIIFITIGTLLIKSVHFFQERARDELHKEIKYFDAFNKAESALAWGGT 63 Query: 61 HCWQTQ----PAVQCSQYAETDAQVCLRLLADNEALLIAGYEGV-----SLWRTGEVIDG 111 W +Q C + + CL+ + +L + ++R Sbjct: 64 LHWDSQYRGLKRWVCQKEENQQWKSCLKHYKGADFVLSGQSHYLLGNDIKVYRWVVWDAN 123 Query: 112 NIVF--SPRGWSDFCPLKERALC 132 F +GW D+CP+ ++ C Sbjct: 124 KQQFFSRKKGWLDYCPVIKKGFC 146 >UniRef50_C4K851 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K851_HAMD5 Length = 159 Score = 98.3 bits (243), Expect = 7e-20, Method: Composition-based stats. 
Identities = 32/152 (21%), Positives = 56/152 (36%), Gaps = 23/152 (15%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 +KG +L VL++ +G LLL + ++ + E L+ S+L+WG W Sbjct: 6 QKGTGTLISVLIIFTMGLLLLTALQRELEGAFNISGQERFYLKAYNQAASSLSWGVSKQW 65 Query: 64 QTQP----------AVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWRTG 106 + C + C++ L E++L E + L+++ Sbjct: 66 PLRSLSQRTSRRHQGWYCETEQINQLKSCIKPMLEIGIFLIKGESILGNRSEKIILYQSA 125 Query: 107 E----VIDGNIVFSP--RGWSDFCPLKERALC 132 + P GW DFCP+K C Sbjct: 126 QTNEMPNTTEQKLVPVTGGWLDFCPVKNEKFC 157 >UniRef50_C6CHQ7 Putative uncharacterized protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CHQ7_DICZE Length = 148 Score = 98.3 bits (243), Expect = 7e-20, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 61/141 (43%), Gaps = 11/141 (7%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 R++ S L +V++++++G L+L G+ +Q + E + + S+L WG Sbjct: 4 RQQQGSILVVVMLVMMIGLLMLGGLQRQLDVQLQQGIDEQRFWQAFNQGVSSLNWGISLQ 63 Query: 63 WQTQPAVQCSQYAETDAQVCLRLLADN-------EALLIAGYEGVSLWRT----GEVIDG 111 WQT QC +VCLR+ ++N E +I + ++ + G Sbjct: 64 WQTIEGWQCQTQPSAQLRVCLRINSENRYGLLRAEGNVIGERQPLTFYHRVVADVAATGG 123 Query: 112 NIVFSPRGWSDFCPLKERALC 132 I GWSDFCP C Sbjct: 124 RIQPVAGGWSDFCPETMEFAC 144 >UniRef50_D2BTT7 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BTT7_DICD5 Length = 171 Score = 91.8 bits (226), Expect = 6e-18, Method: Composition-based stats. 
Identities = 32/141 (22%), Positives = 57/141 (40%), Gaps = 11/141 (7%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 R++ S L +V++++++G L+L G+ +Q + E + + S+L WG Sbjct: 27 RQQRGSILVVVMLVMMIGMLMLGGLQRQLDLQLQQGIEEQRFWQAFNQGLSSLNWGMSLR 86 Query: 63 WQTQPAVQCSQYAETDAQVCLRL-------LADNEALLIAGYEGVSLWRTGEVI----DG 111 W QC + CLR+ L E + + ++ + + +G Sbjct: 87 WPVSDEWQCQTLPSAQLRACLRINHDDRSGLLRGEGQVDGELQPLTFYHRVMIDVTTAEG 146 Query: 112 NIVFSPRGWSDFCPLKERALC 132 I GWSDFCP C Sbjct: 147 YIRPIMGGWSDFCPDAMEQAC 167 >UniRef50_C6CC26 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6CC26_DICDC Length = 149 Score = 91.4 bits (225), Expect = 7e-18, Method: Composition-based stats. Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 13/143 (9%) Query: 2 NREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMH 61 N ++G S++ + +++LV+G L+L G+ +Q + + E ++L + SAL WG Sbjct: 4 NPQRG-STVLMAMLMLVIGMLILSGLQRQLEAQMLQDRDEQRALEDFNLASSALKWGLTL 62 Query: 62 CWQT-QPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVS--------LWRTG-EVIDG 111 W+ QC + CLRL A L+ G V+ +R +++ Sbjct: 63 EWRIDDDGWQCQSAGVDALRACLRLNAGGRIGLLRGERLVTGVARPAAFYYRVVPALVNE 122 Query: 112 NIVFSP--RGWSDFCPLKERALC 132 + P GW D CP+ C Sbjct: 123 RPIIQPIAGGWLDVCPVAGAQNC 145 >UniRef50_Q7N8T9 Similar to putative membrane protein n=2 Tax=Photorhabdus RepID=Q7N8T9_PHOLL Length = 161 Score = 88.3 bits (217), Expect = 7e-17, Method: Composition-based stats. 
Identities = 33/150 (22%), Positives = 58/150 (38%), Gaps = 23/150 (15%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++G L VL+LL + ++L+ + + + E + L+ +S+LAWG W Sbjct: 13 QQGNVLLISVLILLTVSLMMLKALHHHLDNALIMMVDERRYLKAFQQAESSLAWGIAQSW 72 Query: 64 QT----------QPAVQCSQYAETDAQVCLRLLADNEALL-----------IAGYEGVSL 102 C Q E CL+ + + LL + Y+ + L Sbjct: 73 PLHSSQIEDSNKTEEWYCQQQQEFHLTSCLKRQSSHFFLLKGMADVAPGQSVGLYQWMKL 132 Query: 103 WRTGEVIDGNIVFSPRGWSDFCPLKERALC 132 +G D +++ GW DFCP + C Sbjct: 133 HSSG--GDDSLIPVESGWLDFCPEVDVGFC 160 >UniRef50_C4UJ53 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UJ53_YERRU Length = 149 Score = 87.5 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 47/147 (31%), Gaps = 24/147 (16%) Query: 10 LALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCWQTQP-- 67 + + + L LLLQ M Q E LR + S+L WG W Sbjct: 1 MVTITAIFALSLLLLQAMHQHLDKMLLITRNEQHYLRSYNLAASSLNWGLNQAWPVSQIN 60 Query: 68 --------AVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWRTGEVI--- 109 C + E C++ + E L + ++L++ + Sbjct: 61 GSLFTHSKQWFCQHHTEESLTACIKPGTLQNTFVLKGEGLASRYTQKMALYQRVRIDFES 120 Query: 110 ----DGNIVFSPRGWSDFCPLKERALC 132 + +GW DFCP K+ C Sbjct: 121 LTQAGIKVASLAQGWLDFCPEKDAGFC 147 >UniRef50_B4F2G1 Putative membrane protein n=2 Tax=Proteus mirabilis RepID=B4F2G1_PROMH Length = 152 Score = 86.4 bits (212), Expect = 3e-16, Method: Composition-based stats. 
Identities = 34/140 (24%), Positives = 57/140 (40%), Gaps = 13/140 (9%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++GV L + +V+ LL + R V E Q+ + SAL+WG W Sbjct: 11 QQGVIGLMAAVGFMVISLSLLTSFAYHYRQCQLMVMQELQAKQTFLFAGSALSWGMTLTW 70 Query: 64 QTQPA----VQCSQY-AETDAQVCLRLLADNEALLIAGYEG-----VSLWRTGEVIDGN- 112 P+ QC + AE++ + C L ALL+ + +S ++ ++ Sbjct: 71 DLSPSRLYQWQCRTFTAESNMKSCFSLRGKTSALLLGEAKSHSGDKISHYQWINLVGKEK 130 Query: 113 --IVFSPRGWSDFCPLKERA 130 I GW D+CPL + Sbjct: 131 LHIEAQHNGWLDYCPLVKEG 150 >UniRef50_A8GIH4 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GIH4_SERP5 Length = 148 Score = 76.7 bits (187), Expect = 2e-13, Method: Composition-based stats. Identities = 40/143 (27%), Positives = 61/143 (42%), Gaps = 16/143 (11%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++G S+LA V+MLLV+G +LL +Q S + E Q L+ SAL+WG W Sbjct: 6 QRGGSTLAAVMMLLVMGLMLLNAQHRQLDSALLLAADEKQYLQAYNQAASALSWGVAQSW 65 Query: 64 ----QTQPAVQCSQYAETDAQVCLRLLADNEALLIAG------YEGVSLWRTGEVIDG-- 111 C Q + C +L + +L+ G E + L++ Sbjct: 66 PRGRLAANGWWCQQ--TDQLRACAKLSSQAGIVLVRGDAQVSRGEPLRLYQRTRPDGSGG 123 Query: 112 --NIVFSPRGWSDFCPLKERALC 132 + GW DFCP K++A C Sbjct: 124 DIGLRAETGGWLDFCPEKKQADC 146 >UniRef50_D2U290 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2U290_9ENTR Length = 99 Score = 72.9 bits (177), Expect = 3e-12, Method: Composition-based stats. Identities = 20/87 (22%), Positives = 34/87 (39%), Gaps = 4/87 (4%) Query: 11 ALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCWQTQ---- 66 ++ML+ + L+L+ + + + E +SALAWGK W Sbjct: 1 MAIMMLITMSLLMLKALHHHQDNLLQMLESEQHYWLFFEQAESALAWGKYQPWVITKEKN 60 Query: 67 PAVQCSQYAETDAQVCLRLLADNEALL 93 + C Q + + CLR + LL Sbjct: 61 SSWSCQQPLGANFKSCLRHYKYDYFLL 87 >UniRef50_C5BH89 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BH89_EDWI9 Length = 147 Score = 44.0 bits (102), Expect = 0.001, Method: Composition-based stats. 
Identities = 27/108 (25%), Positives = 37/108 (34%), Gaps = 11/108 (10%) Query: 28 SQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCWQTQPAVQCSQYAETDAQVCLRLLA 87 Q + R + + + A LA + WQ P QC A C+R L Sbjct: 28 QAQTTALLQRSAAQQRYWYADAQADLWLARAQHLLWQAWPHWQCRALA-PSWHACIRALD 86 Query: 88 DNEALLIAGYEGVSL--------WRTGEVIDGNIVFS--PRGWSDFCP 125 ALL + L + GE + + P GWSDF P Sbjct: 87 AQRALLCVSGQPSPLTYPLVRVRYLWGEAHEREQRYRALPGGWSDFLP 134 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B2PX29 Putative uncharacterized protein n=3 Tax=Provide... 162 3e-39 UniRef50_P08370 Uncharacterized protein ygdB n=116 Tax=Enterobac... 156 2e-37 UniRef50_UPI000197C60D hypothetical protein PretD1_07015 n=1 Tax... 152 3e-36 UniRef50_C6CHQ7 Putative uncharacterized protein n=1 Tax=Dickeya... 148 6e-35 UniRef50_A4TLC2 Membrane protein n=31 Tax=Yersinia RepID=A4TLC2_... 144 9e-34 UniRef50_D0FVF4 Conserved uncharacterized protein n=3 Tax=Erwini... 144 9e-34 UniRef50_D2BTT7 Putative uncharacterized protein n=1 Tax=Dickeya... 141 5e-33 UniRef50_C4K851 Putative uncharacterized protein n=1 Tax=Candida... 140 9e-33 UniRef50_C6DAF2 Putative uncharacterized protein n=5 Tax=Pectoba... 139 3e-32 UniRef50_C6CC26 Putative uncharacterized protein n=1 Tax=Dickeya... 131 8e-30 UniRef50_A7MR34 Putative uncharacterized protein n=2 Tax=Cronoba... 127 8e-29 UniRef50_C4UJ53 Putative uncharacterized protein n=1 Tax=Yersini... 127 1e-28 UniRef50_Q7N8T9 Similar to putative membrane protein n=2 Tax=Pho... 127 2e-28 UniRef50_C0AT30 Putative uncharacterized protein n=1 Tax=Proteus... 124 8e-28 UniRef50_B4F2G1 Putative membrane protein n=2 Tax=Proteus mirabi... 124 1e-27 UniRef50_C4WYP9 Putative uncharacterized protein n=5 Tax=Klebsie... 117 2e-25 UniRef50_A8GIH4 Putative uncharacterized protein n=2 Tax=Serrati... 112 5e-24 UniRef50_D2U290 Putative uncharacterized protein n=1 Tax=Arsenop... 
103 2e-21 UniRef50_C5BH89 Putative uncharacterized protein n=2 Tax=Edwards... 90 2e-17 Sequences not found previously or not previously below threshold: UniRef50_D1B6L2 Aminotransferase class I and II n=1 Tax=Thermana... 39 0.039 CONVERGED! >UniRef50_B2PX29 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2PX29_PROST Length = 147 Score = 162 bits (410), Expect = 3e-39, Method: Composition-based stats. Identities = 32/140 (22%), Positives = 58/140 (41%), Gaps = 11/140 (7%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++G +L +V++L+ +G LLL+ + + E + +SALAWG W Sbjct: 7 QRGNIALVMVIVLMTMGLLLLKTLHFYQDNARDEFFREKNYIEAFNQAESALAWGLKQSW 66 Query: 64 QTQPA----VQCSQYAETDAQVCLRLLADNEALLIA-----GYEGVSLWRTGEVIDGNIV 114 + QC + C++ + +L G V+++R I Sbjct: 67 RLTTYRGANWQCQRPPTQTWVSCIKHYKSGQFVLSGKGLYKGRHYVTVYRWVAPISKTQK 126 Query: 115 FSPR--GWSDFCPLKERALC 132 PR GW D+CP+ ++ C Sbjct: 127 VRPRIKGWLDYCPVNQKGFC 146 >UniRef50_P08370 Uncharacterized protein ygdB n=116 Tax=Enterobacteriaceae RepID=YGDB_ECOLI Length = 135 Score = 156 bits (395), Expect = 2e-37, Method: Composition-based stats. Identities = 135/135 (100%), Positives = 135/135 (100%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM Sbjct: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 Query: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW Sbjct: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 Query: 121 SDFCPLKERALCQLP 135 SDFCPLKERALCQLP Sbjct: 121 SDFCPLKERALCQLP 135 >UniRef50_UPI000197C60D hypothetical protein PretD1_07015 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197C60D Length = 146 Score = 152 bits (385), Expect = 3e-36, Method: Composition-based stats. 
Identities = 28/143 (19%), Positives = 58/143 (40%), Gaps = 11/143 (7%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 + +++G SL +V++ + +G+LL++ + + E + +SALAWG Sbjct: 4 IEQQQGNVSLLMVIIFITIGTLLIKSVHFFQERARDELHKEIKYFDAFNKAESALAWGGT 63 Query: 61 HCWQTQ----PAVQCSQYAETDAQVCLRLLADNEALLIAGYEGV-----SLWRTGEVIDG 111 W +Q C + + CL+ + +L + ++R Sbjct: 64 LHWDSQYRGLKRWVCQKEENQQWKSCLKHYKGADFVLSGQSHYLLGNDIKVYRWVVWDAN 123 Query: 112 NIVF--SPRGWSDFCPLKERALC 132 F +GW D+CP+ ++ C Sbjct: 124 KQQFFSRKKGWLDYCPVIKKGFC 146 >UniRef50_C6CHQ7 Putative uncharacterized protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CHQ7_DICZE Length = 148 Score = 148 bits (373), Expect = 6e-35, Method: Composition-based stats. Identities = 36/141 (25%), Positives = 61/141 (43%), Gaps = 11/141 (7%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 R++ S L +V++++++G L+L G+ +Q + E + + S+L WG Sbjct: 4 RQQQGSILVVVMLVMMIGLLMLGGLQRQLDVQLQQGIDEQRFWQAFNQGVSSLNWGISLQ 63 Query: 63 WQTQPAVQCSQYAETDAQVCLRLLADN-------EALLIAGYEGVSLWRTGEVI----DG 111 WQT QC +VCLR+ ++N E +I + ++ + G Sbjct: 64 WQTIEGWQCQTQPSAQLRVCLRINSENRYGLLRAEGNVIGERQPLTFYHRVVADVAATGG 123 Query: 112 NIVFSPRGWSDFCPLKERALC 132 I GWSDFCP C Sbjct: 124 RIQPVAGGWSDFCPETMEFAC 144 >UniRef50_A4TLC2 Membrane protein n=31 Tax=Yersinia RepID=A4TLC2_YERPP Length = 156 Score = 144 bits (363), Expect = 9e-34, Method: Composition-based stats. 
Identities = 34/149 (22%), Positives = 62/149 (41%), Gaps = 20/149 (13%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC- 62 ++G S+LA V+ L LG L + +Q + E + LR +S+L WG Sbjct: 6 QRGSSTLAAVMTLFSLGLFWLSAIHRQLDNIQQITGEEQRYLRAYNQAESSLNWGVSQRW 65 Query: 63 -----WQTQPAVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWRTGEV-- 108 W+ A C + E + C++ + E+L + + L++ ++ Sbjct: 66 ALRIPWRVGSAWHCMAHQELGLKACVKRSSLAGFFILKGESLPLGSLPPLMLYQRVKLKA 125 Query: 109 -----IDGNIVFSPRGWSDFCPLKERALC 132 + ++ +P GW DFCP K+ C Sbjct: 126 VTGSSGNYQLIDTPHGWLDFCPDKDAQFC 154 >UniRef50_D0FVF4 Conserved uncharacterized protein n=3 Tax=Erwinia RepID=D0FVF4_ERWPY Length = 137 Score = 144 bits (363), Expect = 9e-34, Method: Composition-based stats. Identities = 41/139 (29%), Positives = 68/139 (48%), Gaps = 10/139 (7%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 ++ ++G +L +V++LL++G+L+L +Q S V E L++ SALAWG+ Sbjct: 3 LDSQRGSGTLIMVVILLLMGTLMLNATRRQLSDAGSLVGDERIYLQQFTAATSALAWGQR 62 Query: 61 HCWQTQPAVQCSQYAETDAQVCLR-----LLADNEALLIAGYEGVSLWRTGEVIDGNIVF 115 W+ QC Q E + CL L AD+ + Y + R+GE + Sbjct: 63 LSWKAADGWQCQQQGEYRWRACLHFARMLLRADSGPQTLVLYHWMKKSRSGE-----LQP 117 Query: 116 SPRGWSDFCPLKERALCQL 134 P GW D+CPL ++ C + Sbjct: 118 RPHGWLDYCPLAKKGGCDV 136 >UniRef50_D2BTT7 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech586 RepID=D2BTT7_DICD5 Length = 171 Score = 141 bits (356), Expect = 5e-33, Method: Composition-based stats. 
Identities = 32/141 (22%), Positives = 57/141 (40%), Gaps = 11/141 (7%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 R++ S L +V++++++G L+L G+ +Q + E + + S+L WG Sbjct: 27 RQQRGSILVVVMLVMMIGMLMLGGLQRQLDLQLQQGIEEQRFWQAFNQGLSSLNWGMSLR 86 Query: 63 WQTQPAVQCSQYAETDAQVCLRL-------LADNEALLIAGYEGVSLWRTGEVI----DG 111 W QC + CLR+ L E + + ++ + + +G Sbjct: 87 WPVSDEWQCQTLPSAQLRACLRINHDDRSGLLRGEGQVDGELQPLTFYHRVMIDVTTAEG 146 Query: 112 NIVFSPRGWSDFCPLKERALC 132 I GWSDFCP C Sbjct: 147 YIRPIMGGWSDFCPDAMEQAC 167 >UniRef50_C4K851 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K851_HAMD5 Length = 159 Score = 140 bits (354), Expect = 9e-33, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 58/152 (38%), Gaps = 23/152 (15%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 +KG +L VL++ +G LLL + ++ + E L+ S+L+WG W Sbjct: 6 QKGTGTLISVLIIFTMGLLLLTALQRELEGAFNISGQERFYLKAYNQAASSLSWGVSKQW 65 Query: 64 QTQP----------AVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWRTG 106 + C + C++ L E++L E + L+++ Sbjct: 66 PLRSLSQRTSRRHQGWYCETEQINQLKSCIKPMLEIGIFLIKGESILGNRSEKIILYQSA 125 Query: 107 EVI------DGNIVFSPRGWSDFCPLKERALC 132 + + +V GW DFCP+K C Sbjct: 126 QTNEMPNTTEQKLVPVTGGWLDFCPVKNEKFC 157 >UniRef50_C6DAF2 Putative uncharacterized protein n=5 Tax=Pectobacterium RepID=C6DAF2_PECCP Length = 151 Score = 139 bits (350), Expect = 3e-32, Method: Composition-based stats. 
Identities = 44/143 (30%), Positives = 64/143 (44%), Gaps = 13/143 (9%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 +KG +LA+VL++ V+G LL+ G+ +Q S + E LR S+L WG W Sbjct: 6 QKGSGTLAMVLLIAVIGLLLMSGLQRQLESAIQVGNDERHYLRAFNQALSSLNWGIGLRW 65 Query: 64 QT-QPAVQCSQYAETDAQVCLRLLAD-------NEALLIAGYEGVSLWRTG---EVIDGN 112 + + QC Q + VCLR ++ E L A + L++ + G Sbjct: 66 RVSTESWQCQQLSAEQLVVCLRAASEGKQGVLRGEGTLPASTRTLKLYQRVSFLALSSGQ 125 Query: 113 IVFSP--RGWSDFCPLKERALCQ 133 I P GW DFCP KE C Sbjct: 126 IAIQPLANGWLDFCPDKEVTRCD 148 >UniRef50_C6CC26 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6CC26_DICDC Length = 149 Score = 131 bits (329), Expect = 8e-30, Method: Composition-based stats. Identities = 36/143 (25%), Positives = 63/143 (44%), Gaps = 13/143 (9%) Query: 2 NREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMH 61 N ++G S++ + +++LV+G L+L G+ +Q + + E ++L + SAL WG Sbjct: 4 NPQRG-STVLMAMLMLVIGMLILSGLQRQLEAQMLQDRDEQRALEDFNLASSALKWGLTL 62 Query: 62 CWQT-QPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVS--------LWRTG-EVIDG 111 W+ QC + CLRL A L+ G V+ +R +++ Sbjct: 63 EWRIDDDGWQCQSAGVDALRACLRLNAGGRIGLLRGERLVTGVARPAAFYYRVVPALVNE 122 Query: 112 NIVFSP--RGWSDFCPLKERALC 132 + P GW D CP+ C Sbjct: 123 RPIIQPIAGGWLDVCPVAGAQNC 145 >UniRef50_A7MR34 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=A7MR34_ENTS8 Length = 140 Score = 127 bits (320), Expect = 8e-29, Method: Composition-based stats. 
Identities = 53/140 (37%), Positives = 74/140 (52%), Gaps = 10/140 (7%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 REKG+S++A+VL LL+LGSL+L G+ QQ S R + ES +++ SALA + Sbjct: 4 REKGMSTIAMVLALLLLGSLMLGGLQQQLDSRFGRAANESAAIKAFNAALSALALSQSQR 63 Query: 63 WQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYE-------GVSLWRTGEVIDGNIVF 115 W P QC E + C+R + + L+ E ++LWR + + F Sbjct: 64 WMFTPQWQCQTLPEVKGRACVRQM---QTYLLVAAEGADENAVPLTLWRWAQPDGDRLRF 120 Query: 116 SPRGWSDFCPLKERALCQLP 135 P GWSDFCPL E CQLP Sbjct: 121 MPHGWSDFCPLTEAKQCQLP 140 >UniRef50_C4UJ53 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UJ53_YERRU Length = 149 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 47/147 (31%), Gaps = 24/147 (16%) Query: 10 LALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCWQTQP-- 67 + + + L LLLQ M Q E LR + S+L WG W Sbjct: 1 MVTITAIFALSLLLLQAMHQHLDKMLLITRNEQHYLRSYNLAASSLNWGLNQAWPVSQIN 60 Query: 68 --------AVQCSQYAETDAQVCLR-------LLADNEALLIAGYEGVSLWRTGEVI--- 109 C + E C++ + E L + ++L++ + Sbjct: 61 GSLFTHSKQWFCQHHTEESLTACIKPGTLQNTFVLKGEGLASRYTQKMALYQRVRIDFES 120 Query: 110 ----DGNIVFSPRGWSDFCPLKERALC 132 + +GW DFCP K+ C Sbjct: 121 LTQAGIKVASLAQGWLDFCPEKDAGFC 147 >UniRef50_Q7N8T9 Similar to putative membrane protein n=2 Tax=Photorhabdus RepID=Q7N8T9_PHOLL Length = 161 Score = 127 bits (318), Expect = 2e-28, Method: Composition-based stats. 
Identities = 33/148 (22%), Positives = 60/148 (40%), Gaps = 19/148 (12%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++G L VL+LL + ++L+ + + + E + L+ +S+LAWG W Sbjct: 13 QQGNVLLISVLILLTVSLMMLKALHHHLDNALIMMVDERRYLKAFQQAESSLAWGIAQSW 72 Query: 64 QT----------QPAVQCSQYAETDAQVCLRLLADNEALL-----IAGYEGVSLWRTGEV 108 C Q E CL+ + + LL +A + V L++ ++ Sbjct: 73 PLHSSQIEDSNKTEEWYCQQQQEFHLTSCLKRQSSHFFLLKGMADVAPGQSVGLYQWMKL 132 Query: 109 I----DGNIVFSPRGWSDFCPLKERALC 132 D +++ GW DFCP + C Sbjct: 133 HSSGGDDSLIPVESGWLDFCPEVDVGFC 160 >UniRef50_C0AT30 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AT30_9ENTR Length = 153 Score = 124 bits (312), Expect = 8e-28, Method: Composition-based stats. Identities = 34/140 (24%), Positives = 61/140 (43%), Gaps = 13/140 (9%) Query: 3 REKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHC 62 E+G +SL +V+ +V+ L + R V E Q+ + +SALAWG Sbjct: 10 SEQGFASLLVVIGFIVMSLSALGNFAYHYRQSQQIVMQELQARQAFLFAESALAWGIKRN 69 Query: 63 WQTQPA----VQCSQ-YAETDAQVCLRLLADNEALLIAGYEGVS-----LWRTGEV---I 109 W+ + QC + + CL L++ + LL E ++ ++ + Sbjct: 70 WEILSSQLNKWQCRHFHVNPKIKSCLFLISLEKGLLQGQAESLNGYKIYHYQWVDFLKDK 129 Query: 110 DGNIVFSPRGWSDFCPLKER 129 + ++V P GW D+CPL + Sbjct: 130 NKSMVTHPNGWLDYCPLVHK 149 >UniRef50_B4F2G1 Putative membrane protein n=2 Tax=Proteus mirabilis RepID=B4F2G1_PROMH Length = 152 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. 
Identities = 34/143 (23%), Positives = 58/143 (40%), Gaps = 13/143 (9%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 + ++GV L + +V+ LL + R V E Q+ + SAL+WG Sbjct: 8 IKEQQGVIGLMAAVGFMVISLSLLTSFAYHYRQCQLMVMQELQAKQTFLFAGSALSWGMT 67 Query: 61 HCWQTQPA----VQCSQY-AETDAQVCLRLLADNEALLIAGYE-----GVSLWRTGEVID 110 W P+ QC + AE++ + C L ALL+ + +S ++ ++ Sbjct: 68 LTWDLSPSRLYQWQCRTFTAESNMKSCFSLRGKTSALLLGEAKSHSGDKISHYQWINLVG 127 Query: 111 GN---IVFSPRGWSDFCPLKERA 130 I GW D+CPL + Sbjct: 128 KEKLHIEAQHNGWLDYCPLVKEG 150 >UniRef50_C4WYP9 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4WYP9_KLEPN Length = 135 Score = 117 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 60/135 (44%), Positives = 88/135 (65%) Query: 1 MNREKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKM 60 M R++G+SSL +VL+LL LG LLL+G++ Q R+ ++ + E+Q++R AI SAL WGK Sbjct: 1 MIRQRGMSSLLMVLLLLTLGCLLLEGLNLQQRALLAQTASETQAIRDTAIAHSALQWGKQ 60 Query: 61 HCWQTQPAVQCSQYAETDAQVCLRLLADNEALLIAGYEGVSLWRTGEVIDGNIVFSPRGW 120 W Q A+ C + A + CLR+ D +L + V +W++GEV G + FS GW Sbjct: 61 QVWSAQVALACREQAPQGWRACLRIFGDGSLVLSSASGEVQVWQSGEVRGGQVRFSAHGW 120 Query: 121 SDFCPLKERALCQLP 135 SDFCPL+E +LCQ+P Sbjct: 121 SDFCPLREASLCQMP 135 >UniRef50_A8GIH4 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GIH4_SERP5 Length = 148 Score = 112 bits (279), Expect = 5e-24, Method: Composition-based stats. 
Identities = 40/143 (27%), Positives = 61/143 (42%), Gaps = 16/143 (11%) Query: 4 EKGVSSLALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCW 63 ++G S+LA V+MLLV+G +LL +Q S + E Q L+ SAL+WG W Sbjct: 6 QRGGSTLAAVMMLLVMGLMLLNAQHRQLDSALLLAADEKQYLQAYNQAASALSWGVAQSW 65 Query: 64 Q----TQPAVQCSQYAETDAQVCLRLLADNEALLIAG------YEGVSLWRTGEVIDG-- 111 C Q + C +L + +L+ G E + L++ Sbjct: 66 PRGRLAANGWWCQQ--TDQLRACAKLSSQAGIVLVRGDAQVSRGEPLRLYQRTRPDGSGG 123 Query: 112 --NIVFSPRGWSDFCPLKERALC 132 + GW DFCP K++A C Sbjct: 124 DIGLRAETGGWLDFCPEKKQADC 146 >UniRef50_D2U290 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2U290_9ENTR Length = 99 Score = 103 bits (256), Expect = 2e-21, Method: Composition-based stats. Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 4/89 (4%) Query: 11 ALVLMLLVLGSLLLQGMSQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCWQTQ---- 66 ++ML+ + L+L+ + + + E +SALAWGK W Sbjct: 1 MAIMMLITMSLLMLKALHHHQDNLLQMLESEQHYWLFFEQAESALAWGKYQPWVITKEKN 60 Query: 67 PAVQCSQYAETDAQVCLRLLADNEALLIA 95 + C Q + + CLR + LL Sbjct: 61 SSWSCQQPLGANFKSCLRHYKYDYFLLAG 89 >UniRef50_C5BH89 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BH89_EDWI9 Length = 147 Score = 90.1 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 27/108 (25%), Positives = 37/108 (34%), Gaps = 11/108 (10%) Query: 28 SQQDRSFASRVSMESQSLRRQAIVQSALAWGKMHCWQTQPAVQCSQYAETDAQVCLRLLA 87 Q + R + + + A LA + WQ P QC A C+R L Sbjct: 28 QAQTTALLQRSAAQQRYWYADAQADLWLARAQHLLWQAWPHWQCRALA-PSWHACIRALD 86 Query: 88 DNEALLIAGYEGVSL--------WRTGEVIDGNIVFS--PRGWSDFCP 125 ALL + L + GE + + P GWSDF P Sbjct: 87 AQRALLCVSGQPSPLTYPLVRVRYLWGEAHEREQRYRALPGGWSDFLP 134 >UniRef50_D1B6L2 Aminotransferase class I and II n=1 Tax=Thermanaerovibrio acidaminovorans DSM 6589 RepID=D1B6L2_THEAS Length = 399 Score = 39.3 bits (90), Expect = 0.039, Method: Composition-based stats. 
Identities = 9/58 (15%), Positives = 17/58 (29%), Gaps = 1/58 (1%) Query: 57 WGKMHCWQTQPAVQCSQYAETDAQV-CLRLLADNEALLIAGYEGVSLWRTGEVIDGNI 113 WG+ H W+ P C + +RL + +R + + Sbjct: 78 WGRRHRWEVDPGWICHSHGVMGGVALAIRLFTREGDRVAVQTPIYPPFRWVVENNRRV 135

Database: uniref50.fasta
Posted date: Mar 8, 2010 10:38 AM
Number of letters in database: 1,040,396,356
Number of sequences in database: 3,077,464

Lambda     K        H
0.309      0.135    0.444

Lambda     K        H
0.267      0.0419   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 695,801,359
Number of Sequences: 3077464
Number of extensions: 19677421
Number of successful extensions: 80839
Number of sequences better than 1.0e-01: 23
Number of HSP's better than 0.1 without gapping: 44
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 80713
Number of HSP's gapped (non-prelim): 59
length of query: 135
length of database: 1,040,396,356
effective HSP length: 100
effective length of query: 35
effective length of database: 732,649,956
effective search space: 25642748460
effective search space used: 25642748460
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 87 (38.1 bits)
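
The figures in the statistics footer above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the BLAST output itself. It assumes that the first Lambda/K pair is the ungapped set and the second is the gapped set (the footer does not label them), that effective lengths are obtained by subtracting the effective HSP length once per sequence, and that bit values follow the usual Karlin-Altschul conversion. The function names are hypothetical and chosen only for illustration.

```python
import math

# Values copied from the statistics footer above.
QUERY_LEN = 135
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 100  # "effective HSP length"

# Assumption: first Lambda/K pair = ungapped, second = gapped.
UNGAPPED = (0.309, 0.135)
GAPPED = (0.267, 0.0419)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X-dropoff value to bits: lambda * X / ln 2."""
    return lam * raw / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN))
    print("S1:", round(bit_score(41, *UNGAPPED), 1))
    print("S2:", round(bit_score(87, *GAPPED), 1))
    print("X1:", round(dropoff_bits(16, UNGAPPED[0]), 1))
    print("X2:", round(dropoff_bits(38, GAPPED[0]), 1))
    print("X3:", round(dropoff_bits(64, GAPPED[0]), 1))
```

Under these assumptions the recomputed effective lengths, search space, and the bit values for X1, X2, X3, S1 and S2 agree with the footer. Per-hit scores and E-values in the alignments may differ slightly from such a recomputation, since Lambda and K are printed rounded and composition-based statistics adjust them per subject sequence.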