BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (140 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_P76485 Uncharacterized protein yfbO n=5 Tax=Escherichia...   285   3e-76
UniRef50_C5UQB4 Putative uncharacterized protein n=1 Tax=Clostri...    79   4e-14
UniRef50_A3ZYW7 Putative uncharacterized protein n=1 Tax=Blastop...    62   4e-09
UniRef50_B9XDG8 Putative uncharacterized protein n=1 Tax=bacteri...    59   5e-08
UniRef50_C6PRL0 Putative uncharacterized protein n=1 Tax=Clostri...    57   1e-07
UniRef50_Q07KR5 Putative uncharacterized protein n=1 Tax=Rhodops...    49   7e-05
UniRef50_A1RID1 Putative uncharacterized protein n=3 Tax=Shewane...    44   0.001
UniRef50_B0CTT0 Glycosyltransferase family 39 protein n=6 Tax=Ba...
39 0.048 >UniRef50_P76485 Uncharacterized protein yfbO n=5 Tax=Escherichia coli RepID=YFBO_ECOLI Length = 140 Score = 285 bits (729), Expect = 3e-76, Method: Compositional matrix adjust. Identities = 140/140 (100%), Positives = 140/140 (100%) Query: 1 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL 60 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL Sbjct: 1 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL 60 Query: 61 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED Sbjct: 61 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 Query: 121 TEHGWVEVPVGMHPVTCWWD 140 TEHGWVEVPVGMHPVTCWWD Sbjct: 121 TEHGWVEVPVGMHPVTCWWD 140 >UniRef50_C5UQB4 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UQB4_CLOBO Length = 136 Score = 79.3 bits (194), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 46/132 (34%), Positives = 68/132 (51%), Gaps = 2/132 (1%) Query: 10 LVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIRERNNVS 69 L I + P++SLE FF N+ GSI CN++ + IY +IR ++NV Sbjct: 6 LATIYAQEEENENQLPIVSLELFFEGNDDIGSIGCNILEHPGTEKIYSILKEIRNKSNVQ 65 Query: 70 DVLVEITMFDDPD-WPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTEHGWVEV 128 DVL+EI +D+ + WPFSE + + T A +EV W VEE+ E EG+ ++ Sbjct: 66 DVLIEILEYDEDEVWPFSERVYIFTDAEEDEVMEW-VEELDISEISEGYIYGQSKAAPKL 124 Query: 129 PVGMHPVTCWWD 140 G + WWD Sbjct: 125 SDGYKVYSLWWD 136 >UniRef50_A3ZYW7 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYW7_9PLAN Length = 141 Score = 62.4 bits (150), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 32/82 (39%), Positives = 46/82 (56%), Gaps = 3/82 (3%) Query: 22 TPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIRERNNVSDVLVEITMFDDP 81 P P+ L FF N+ +GSI CN+ PQ Y I +R++V DVLV+I ++ Sbjct: 23 APAPVAPLHLFFDGNDDYGSIGCNLTDHPGPQGFYETLRSIHQRSDVVDVLVQIYEIEED 82 Query: 82 D---WPFSESILVITTASPEEV 100 D WPFSE + ++TTA E + Sbjct: 83 DVTMWPFSERVFIVTTADRETI 104 >UniRef50_B9XDG8 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XDG8_9BACT Length = 141 Score = 58.9 bits (141), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 41/140 (29%), Positives = 65/140 (46%), Gaps = 8/140 (5%) Query: 6 RITQLVNINGDVNNPDT--PRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIR 63 + QL+ I D PR L++ ++FF N+ GSI CN+I F ++ Sbjct: 5 KRKQLIEIAKSRGYTDMAPPRVLVTRQEFFDGNDDDGSIGCNLIEHPGIATFDAAFRQVE 64 Query: 64 ERNNVSDVLVEITMFD---DPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 E + V+ V + IT D D WPF+++ L++T P V + PDE S++ Sbjct: 65 EMDRVAGVYLAITEIDETYDGIWPFTDTALIVTRL-PAAVFEPLFRPLQPDEI--ASSDE 121 Query: 121 TEHGWVEVPVGMHPVTCWWD 140 + E+P G + WWD Sbjct: 122 SFANPPEIPAGYQLIRAWWD 141 >UniRef50_C6PRL0 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PRL0_9CLOT Length = 138 Score = 57.4 bits (137), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 44/141 (31%), Positives = 69/141 (48%), Gaps = 12/141 (8%) Query: 4 LERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLK-I 62 LE+I + G P + L+S+EDFF NN GS N E ++ LK I Sbjct: 6 LEKIEK----QGGFKPPYSMDILVSIEDFFEGNNAAGSFAANACTENLDVHEFYEILKGI 61 Query: 63 RERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTE 122 + + NV +V + I D+ +WP+S++I + TTA+ +++ WF P E W+ + E Sbjct: 62 KLKENVFEVWILICDIDE-EWPYSDTIYISTTANEDDIYKWFGYGF-PSEVWK--VDCKE 117 Query: 123 HGWV---EVPVGMHPVTCWWD 140 G + E+ G WWD Sbjct: 118 CGLIQTLELKNGFRVFGVWWD 138 >UniRef50_Q07KR5 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisA53 RepID=Q07KR5_RHOP5 Length = 143 Score = 48.5 bits (114), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 40/139 (28%), Positives = 60/139 (43%), Gaps = 20/139 (14%) Query: 6 RITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQ-AIYHHFLK-IR 63 R L + V P++ L+D+F+ N SI N I + P A H L+ IR Sbjct: 2 REKLLEKLQSLVEQSSAAPPVVELDDYFVGNAQEDSIAPNQIGDGRPSLADLHAALRAIR 61 Query: 64 ERNNVSDVLVEI------TMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGW 117 +R +V VLV I ++ D WP ++++ + +A V+ W V A D +GW Sbjct: 62 DRPDVQAVLVGIHGDWVESLKCDDVWPAADNVHIYASAGRSTVEGW-VAGFAHDGVLKGW 120 Query: 118 SEDTEHGWVEVPVGMHPVT 136 P GMHP Sbjct: 121 -----------PYGMHPAA 128 >UniRef50_A1RID1 Putative uncharacterized protein n=3 Tax=Shewanella RepID=A1RID1_SHESW Length = 145 Score = 44.3 bits (103), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 38/125 (30%), Positives = 59/125 (47%), Gaps = 11/125 (8%) Query: 25 PLLSLEDFFIDNNIHGSICCNVIPEQSP--QAIYHHFLKIRERNNVSDVLVEI------T 76 P+++L++FF+ N SI N P + IY +I R +V V V + Sbjct: 23 PVVTLDEFFLANEDEESIAPNNWGYGRPSIKEIYRSLKEIEHRPDVQGVFVGMHDEWSEA 82 Query: 77 MFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTEHGWVEVPV-GMHPV 135 + DD WP +E+I +++ A EV++W + IA D GW EH +P G Sbjct: 83 LEDDELWPAAENIHILSCAPEVEVEAW-IAGIAADGLGTGWPY-GEHKLSPMPQDGYLVY 140 Query: 136 TCWWD 140 T +WD Sbjct: 141 TVYWD 145 >UniRef50_B0CTT0 Glycosyltransferase family 39 protein n=6 Tax=Basidiomycota RepID=B0CTT0_LACBS Length = 801 Score = 38.9 bits (89), Expect = 0.048, Method: Composition-based stats. Identities = 17/53 (32%), Positives = 29/53 (54%) Query: 81 PDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTEHGWVEVPVGMH 133 P+W F + + E+ +WFV+EI DE + ++ TEH V++P M+ Sbjct: 521 PEWAFKQQEINGNKNPTEKTATWFVDEIVADEDGDEVTDRTEHAAVKIPKSMN 573 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76485 Uncharacterized protein yfbO n=5 Tax=Escherichia... 190 1e-47 UniRef50_C5UQB4 Putative uncharacterized protein n=1 Tax=Clostri... 149 2e-35 UniRef50_B9XDG8 Putative uncharacterized protein n=1 Tax=bacteri... 145 4e-34 UniRef50_Q07KR5 Putative uncharacterized protein n=1 Tax=Rhodops... 140 1e-32 UniRef50_C6PRL0 Putative uncharacterized protein n=1 Tax=Clostri... 137 8e-32 UniRef50_A1RID1 Putative uncharacterized protein n=3 Tax=Shewane... 123 1e-27 UniRef50_A3ZYW7 Putative uncharacterized protein n=1 Tax=Blastop... 118 5e-26 Sequences not found previously or not previously below threshold: UniRef50_C2BV39 Putative uncharacterized protein n=1 Tax=Mobilun... 64 2e-09 UniRef50_Q11XY4 Putative uncharacterized protein n=1 Tax=Cytopha... 64 2e-09 UniRef50_B9XQA3 Putative uncharacterized protein n=1 Tax=bacteri... 
46 3e-04 >UniRef50_P76485 Uncharacterized protein yfbO n=5 Tax=Escherichia coli RepID=YFBO_ECOLI Length = 140 Score = 190 bits (482), Expect = 1e-47, Method: Composition-based stats. Identities = 140/140 (100%), Positives = 140/140 (100%) Query: 1 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL 60 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL Sbjct: 1 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL 60 Query: 61 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED Sbjct: 61 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 Query: 121 TEHGWVEVPVGMHPVTCWWD 140 TEHGWVEVPVGMHPVTCWWD Sbjct: 121 TEHGWVEVPVGMHPVTCWWD 140 >UniRef50_C5UQB4 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UQB4_CLOBO Length = 136 Score = 149 bits (377), Expect = 2e-35, Method: Composition-based stats. Identities = 46/136 (33%), Positives = 69/136 (50%), Gaps = 2/136 (1%) Query: 6 RITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIRER 65 + L I + P++SLE FF N+ GSI CN++ + IY +IR + Sbjct: 2 KENLLATIYAQEEENENQLPIVSLELFFEGNDDIGSIGCNILEHPGTEKIYSILKEIRNK 61 Query: 66 NNVSDVLVEITMFDDPD-WPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTEHG 124 +NV DVL+EI +D+ + WPFSE + + T A +EV W VEE+ E EG+ Sbjct: 62 SNVQDVLIEILEYDEDEVWPFSERVYIFTDAEEDEVMEW-VEELDISEISEGYIYGQSKA 120 Query: 125 WVEVPVGMHPVTCWWD 140 ++ G + WWD Sbjct: 121 APKLSDGYKVYSLWWD 136 >UniRef50_B9XDG8 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XDG8_9BACT Length = 141 Score = 145 bits (366), Expect = 4e-34, Method: Composition-based stats. 
Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 8/140 (5%) Query: 6 RITQLVNINGDVNNPDTPRP--LLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIR 63 + QL+ I D P L++ ++FF N+ GSI CN+I F ++ Sbjct: 5 KRKQLIEIAKSRGYTDMAPPRVLVTRQEFFDGNDDDGSIGCNLIEHPGIATFDAAFRQVE 64 Query: 64 ERNNVSDVLVEITMFD---DPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 E + V+ V + IT D D WPF+++ L++T P V + PDE S++ Sbjct: 65 EMDRVAGVYLAITEIDETYDGIWPFTDTALIVTRL-PAAVFEPLFRPLQPDEI--ASSDE 121 Query: 121 TEHGWVEVPVGMHPVTCWWD 140 + E+P G + WWD Sbjct: 122 SFANPPEIPAGYQLIRAWWD 141 >UniRef50_Q07KR5 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisA53 RepID=Q07KR5_RHOP5 Length = 143 Score = 140 bits (353), Expect = 1e-32, Method: Composition-based stats. Identities = 37/143 (25%), Positives = 59/143 (41%), Gaps = 9/143 (6%) Query: 6 RITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSP--QAIYHHFLKIR 63 R L + V P++ L+D+F+ N SI N I + P ++ IR Sbjct: 2 REKLLEKLQSLVEQSSAAPPVVELDDYFVGNAQEDSIAPNQIGDGRPSLADLHAALRAIR 61 Query: 64 ERNNVSDVLVEI------TMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGW 117 +R +V VLV I ++ D WP ++++ + +A V+ W V A D +GW Sbjct: 62 DRPDVQAVLVGIHGDWVESLKCDDVWPAADNVHIYASAGRSTVEGW-VAGFAHDGVLKGW 120 Query: 118 SEDTEHGWVEVPVGMHPVTCWWD 140 + G H T WD Sbjct: 121 PYGMHPAAPKPQRGYHVYTICWD 143 >UniRef50_C6PRL0 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PRL0_9CLOT Length = 138 Score = 137 bits (346), Expect = 8e-32, Method: Composition-based stats. 
Identities = 44/141 (31%), Positives = 69/141 (48%), Gaps = 12/141 (8%) Query: 4 LERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLK-I 62 LE+I + G P + L+S+EDFF NN GS N E ++ LK I Sbjct: 6 LEKIEK----QGGFKPPYSMDILVSIEDFFEGNNAAGSFAANACTENLDVHEFYEILKGI 61 Query: 63 RERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTE 122 + + NV +V + I D+ +WP+S++I + TTA+ +++ WF P E W+ + E Sbjct: 62 KLKENVFEVWILICDIDE-EWPYSDTIYISTTANEDDIYKWFGYGF-PSEVWK--VDCKE 117 Query: 123 HGWV---EVPVGMHPVTCWWD 140 G + E+ G WWD Sbjct: 118 CGLIQTLELKNGFRVFGVWWD 138 >UniRef50_A1RID1 Putative uncharacterized protein n=3 Tax=Shewanella RepID=A1RID1_SHESW Length = 145 Score = 123 bits (309), Expect = 1e-27, Method: Composition-based stats. Identities = 40/144 (27%), Positives = 64/144 (44%), Gaps = 11/144 (7%) Query: 6 RITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSP--QAIYHHFLKIR 63 R L ++ + P+++L++FF+ N SI N P + IY +I Sbjct: 4 RTELLNKLSALLVADPENSPVVTLDEFFLANEDEESIAPNNWGYGRPSIKEIYRSLKEIE 63 Query: 64 ERNNVSDVLVEI------TMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGW 117 R +V V V + + DD WP +E+I +++ A EV++W + IA D GW Sbjct: 64 HRPDVQGVFVGMHDEWSEALEDDELWPAAENIHILSCAPEVEVEAW-IAGIAADGLGTGW 122 Query: 118 SEDTEHGWVEVP-VGMHPVTCWWD 140 EH +P G T +WD Sbjct: 123 PY-GEHKLSPMPQDGYLVYTVYWD 145 >UniRef50_A3ZYW7 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYW7_9PLAN Length = 141 Score = 118 bits (296), Expect = 5e-26, Method: Composition-based stats. 
Identities = 39/142 (27%), Positives = 59/142 (41%), Gaps = 8/142 (5%) Query: 4 LERITQLVNINGDVNN--PDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLK 61 L + L+ + P P+ L FF N+ +GSI CN+ PQ Y Sbjct: 3 LSKRKALIEHIQSIGGLGMSAPAPVAPLHLFFDGNDDYGSIGCNLTDHPGPQGFYETLRS 62 Query: 62 IRERNNVSDVLVEITMFDDPD---WPFSESILVITTASPEEVQSWFVEEIAPDECWEGWS 118 I +R++V DVLV+I ++ D WPFSE + ++TTA E + + P + + Sbjct: 63 IHQRSDVVDVLVQIYEIEEDDVTMWPFSERVFIVTTADRETIAELLTR-LQPTDIESEYP 121 Query: 119 EDTEHGWVEVPVGMHPVTCWWD 140 P WWD Sbjct: 122 LPPHAEQP--PEDCVVYAVWWD 141 >UniRef50_C2BV39 Putative uncharacterized protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BV39_9ACTO Length = 147 Score = 63.6 bits (153), Expect = 2e-09, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 54/148 (36%), Gaps = 17/148 (11%) Query: 8 TQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSP--QAIYHHFLKIRER 65 T+L + + PLL+L++FF N SI N P I + KI Sbjct: 2 TELNTFKKILEENEERLPLLTLDEFFDGNTEEDSIAPNQWGFGRPTLSEIRNMLQKIELM 61 Query: 66 NNVSDVLVEITMFDD----------PDWPFSESILVITTASPEEVQSWF-VEEIAPDEC- 113 +++ V + + DD ++I++ TT P E++ E + D Sbjct: 62 PDIA--WVRVALHDDTGIVENNGKEELVLAGDTIVICTTILPAELEKLVNCEWLCSDGVI 119 Query: 114 -WEGWSEDTEHGWVEVPVGMHPVTCWWD 140 E + +T +P + WD Sbjct: 120 TIEAFELNTYSCVPPIPDNFDCLEIVWD 147 >UniRef50_Q11XY4 Putative uncharacterized protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11XY4_CYTH3 Length = 137 Score = 63.6 bits (153), Expect = 2e-09, Method: Composition-based stats. 
Identities = 29/117 (24%), Positives = 58/117 (49%), Gaps = 4/117 (3%) Query: 26 LLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIRERNNVSD-VLVEITMFDDP-DW 83 +L+ E+FF N SI NV P + P + + + V+D V + I +DP +W Sbjct: 23 ILTFEEFFEGNTYETSIAVNV-PYKPPVVEFRGTFEKMLKEGVADNVWIRIVDIEDPEEW 81 Query: 84 PFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTEHGWVEVPVGMHPVTCWWD 140 F++++ VI + ++++ + ++++ D+ +EGW E + T +WD Sbjct: 82 IFTDTVYVIGDLTIQQLKEY-IKQLHADDIYEGWMYGEPVNAGEYDRSKNVYTIFWD 137 >UniRef50_B9XQA3 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XQA3_9BACT Length = 154 Score = 46.3 bits (108), Expect = 3e-04, Method: Composition-based stats. Identities = 16/80 (20%), Positives = 33/80 (41%), Gaps = 3/80 (3%) Query: 27 LSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIRERNNVSDVLVEITMFDD---PDW 83 LS+EDFF + + N+ P ++ E VS ++ + ++D Sbjct: 41 LSIEDFFAKCSSDYCLAPNLEPHPGNANFKAALTRLSEEAGVSCCVIPVGEYEDADSELE 100 Query: 84 PFSESILVITTASPEEVQSW 103 P+S+ + V + + +W Sbjct: 101 PYSDRVFVAGSVDEAVIANW 120 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76485 Uncharacterized protein yfbO n=5 Tax=Escherichia... 170 1e-41 UniRef50_Q07KR5 Putative uncharacterized protein n=1 Tax=Rhodops... 154 1e-36 UniRef50_C5UQB4 Putative uncharacterized protein n=1 Tax=Clostri... 145 5e-34 UniRef50_A3ZYW7 Putative uncharacterized protein n=1 Tax=Blastop... 140 1e-32 UniRef50_B9XDG8 Putative uncharacterized protein n=1 Tax=bacteri... 138 6e-32 UniRef50_A1RID1 Putative uncharacterized protein n=3 Tax=Shewane... 136 2e-31 UniRef50_C6PRL0 Putative uncharacterized protein n=1 Tax=Clostri... 123 2e-27 UniRef50_C2BV39 Putative uncharacterized protein n=1 Tax=Mobilun... 121 7e-27 UniRef50_Q11XY4 Putative uncharacterized protein n=1 Tax=Cytopha... 115 4e-25 UniRef50_B9XQA3 Putative uncharacterized protein n=1 Tax=bacteri... 
90 2e-17 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P76485 Uncharacterized protein yfbO n=5 Tax=Escherichia coli RepID=YFBO_ECOLI Length = 140 Score = 170 bits (431), Expect = 1e-41, Method: Composition-based stats. Identities = 140/140 (100%), Positives = 140/140 (100%) Query: 1 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL 60 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL Sbjct: 1 MTPLERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFL 60 Query: 61 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED Sbjct: 61 KIRERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 Query: 121 TEHGWVEVPVGMHPVTCWWD 140 TEHGWVEVPVGMHPVTCWWD Sbjct: 121 TEHGWVEVPVGMHPVTCWWD 140 >UniRef50_Q07KR5 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisA53 RepID=Q07KR5_RHOP5 Length = 143 Score = 154 bits (388), Expect = 1e-36, Method: Composition-based stats. Identities = 37/143 (25%), Positives = 59/143 (41%), Gaps = 9/143 (6%) Query: 6 RITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSP--QAIYHHFLKIR 63 R L + V P++ L+D+F+ N SI N I + P ++ IR Sbjct: 2 REKLLEKLQSLVEQSSAAPPVVELDDYFVGNAQEDSIAPNQIGDGRPSLADLHAALRAIR 61 Query: 64 ERNNVSDVLVEI------TMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGW 117 +R +V VLV I ++ D WP ++++ + +A V+ W V A D +GW Sbjct: 62 DRPDVQAVLVGIHGDWVESLKCDDVWPAADNVHIYASAGRSTVEGW-VAGFAHDGVLKGW 120 Query: 118 SEDTEHGWVEVPVGMHPVTCWWD 140 + G H T WD Sbjct: 121 PYGMHPAAPKPQRGYHVYTICWD 143 >UniRef50_C5UQB4 Putative uncharacterized protein n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UQB4_CLOBO Length = 136 Score = 145 bits (365), Expect = 5e-34, Method: Composition-based stats. 
Identities = 47/136 (34%), Positives = 68/136 (50%), Gaps = 2/136 (1%) Query: 6 RITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIRER 65 + L I + P++SLE FF N+ GSI CN++ + IY +IR + Sbjct: 2 KENLLATIYAQEEENENQLPIVSLELFFEGNDDIGSIGCNILEHPGTEKIYSILKEIRNK 61 Query: 66 NNVSDVLVEITMFD-DPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTEHG 124 +NV DVL+EI +D D WPFSE + + T A +EV W VEE+ E EG+ Sbjct: 62 SNVQDVLIEILEYDEDEVWPFSERVYIFTDAEEDEVMEW-VEELDISEISEGYIYGQSKA 120 Query: 125 WVEVPVGMHPVTCWWD 140 ++ G + WWD Sbjct: 121 APKLSDGYKVYSLWWD 136 >UniRef50_A3ZYW7 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYW7_9PLAN Length = 141 Score = 140 bits (353), Expect = 1e-32, Method: Composition-based stats. Identities = 39/142 (27%), Positives = 59/142 (41%), Gaps = 8/142 (5%) Query: 4 LERITQLVNINGDVNN--PDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLK 61 L + L+ + P P+ L FF N+ +GSI CN+ PQ Y Sbjct: 3 LSKRKALIEHIQSIGGLGMSAPAPVAPLHLFFDGNDDYGSIGCNLTDHPGPQGFYETLRS 62 Query: 62 IRERNNVSDVLVEITMFDDPD---WPFSESILVITTASPEEVQSWFVEEIAPDECWEGWS 118 I +R++V DVLV+I ++ D WPFSE + ++TTA E + + P + + Sbjct: 63 IHQRSDVVDVLVQIYEIEEDDVTMWPFSERVFIVTTADRETIAELLTR-LQPTDIESEYP 121 Query: 119 EDTEHGWVEVPVGMHPVTCWWD 140 P WWD Sbjct: 122 LPPHAEQP--PEDCVVYAVWWD 141 >UniRef50_B9XDG8 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XDG8_9BACT Length = 141 Score = 138 bits (347), Expect = 6e-32, Method: Composition-based stats. 
Identities = 40/140 (28%), Positives = 64/140 (45%), Gaps = 8/140 (5%) Query: 6 RITQLVNINGDVNNPDTPRP--LLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIR 63 + QL+ I D P L++ ++FF N+ GSI CN+I F ++ Sbjct: 5 KRKQLIEIAKSRGYTDMAPPRVLVTRQEFFDGNDDDGSIGCNLIEHPGIATFDAAFRQVE 64 Query: 64 ERNNVSDVLVEITMFD---DPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSED 120 E + V+ V + IT D D WPF+++ L++T P V + PDE S++ Sbjct: 65 EMDRVAGVYLAITEIDETYDGIWPFTDTALIVTRL-PAAVFEPLFRPLQPDEI--ASSDE 121 Query: 121 TEHGWVEVPVGMHPVTCWWD 140 + E+P G + WWD Sbjct: 122 SFANPPEIPAGYQLIRAWWD 141 >UniRef50_A1RID1 Putative uncharacterized protein n=3 Tax=Shewanella RepID=A1RID1_SHESW Length = 145 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 37/144 (25%), Positives = 59/144 (40%), Gaps = 9/144 (6%) Query: 5 ERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQ--AIYHHFLKI 62 R L ++ + P+++L++FF+ N SI N P IY +I Sbjct: 3 TRTELLNKLSALLVADPENSPVVTLDEFFLANEDEESIAPNNWGYGRPSIKEIYRSLKEI 62 Query: 63 RERNNVSDVLVEI------TMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEG 116 R +V V V + + DD WP +E+I +++ A EV++W + IA D G Sbjct: 63 EHRPDVQGVFVGMHDEWSEALEDDELWPAAENIHILSCAPEVEVEAW-IAGIAADGLGTG 121 Query: 117 WSEDTEHGWVEVPVGMHPVTCWWD 140 W G T +WD Sbjct: 122 WPYGEHKLSPMPQDGYLVYTVYWD 145 >UniRef50_C6PRL0 Putative uncharacterized protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6PRL0_9CLOT Length = 138 Score = 123 bits (308), Expect = 2e-27, Method: Composition-based stats. 
Identities = 43/141 (30%), Positives = 65/141 (46%), Gaps = 12/141 (8%) Query: 4 LERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQA-IYHHFLKI 62 LE+I + G P + L+S+EDFF NN GS N E Y I Sbjct: 6 LEKIEK----QGGFKPPYSMDILVSIEDFFEGNNAAGSFAANACTENLDVHEFYEILKGI 61 Query: 63 RERNNVSDVLVEITMFDDPDWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTE 122 + + NV +V + I D+ WP+S++I + TTA+ +++ WF P E W+ + E Sbjct: 62 KLKENVFEVWILICDIDEE-WPYSDTIYISTTANEDDIYKWFGYGF-PSEVWK--VDCKE 117 Query: 123 HGWV---EVPVGMHPVTCWWD 140 G + E+ G WWD Sbjct: 118 CGLIQTLELKNGFRVFGVWWD 138 >UniRef50_C2BV39 Putative uncharacterized protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BV39_9ACTO Length = 147 Score = 121 bits (303), Expect = 7e-27, Method: Composition-based stats. Identities = 31/148 (20%), Positives = 54/148 (36%), Gaps = 17/148 (11%) Query: 8 TQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSP--QAIYHHFLKIRER 65 T+L + + PLL+L++FF N SI N P I + KI Sbjct: 2 TELNTFKKILEENEERLPLLTLDEFFDGNTEEDSIAPNQWGFGRPTLSEIRNMLQKIELM 61 Query: 66 NNVSDVLVEITMFDD----------PDWPFSESILVITTASPEEVQSWF-VEEIAPDEC- 113 +++ V + + DD ++I++ TT P E++ E + D Sbjct: 62 PDIA--WVRVALHDDTGIVENNGKEELVLAGDTIVICTTILPAELEKLVNCEWLCSDGVI 119 Query: 114 -WEGWSEDTEHGWVEVPVGMHPVTCWWD 140 E + +T +P + WD Sbjct: 120 TIEAFELNTYSCVPPIPDNFDCLEIVWD 147 >UniRef50_Q11XY4 Putative uncharacterized protein n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11XY4_CYTH3 Length = 137 Score = 115 bits (288), Expect = 4e-25, Method: Composition-based stats. 
Identities = 31/139 (22%), Positives = 63/139 (45%), Gaps = 6/139 (4%) Query: 4 LERITQLVNINGDVNNPDTPRPLLSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIR 63 +I + + V +L+ E+FF N SI NV P + P + + Sbjct: 3 FNKIELIQKLQ--VLGFPQNELILTFEEFFEGNTYETSIAVNV-PYKPPVVEFRGTFEKM 59 Query: 64 ERNNVSD-VLVEITMFDDP-DWPFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDT 121 + V+D V + I +DP +W F++++ VI + ++++ + ++++ D+ +EGW Sbjct: 60 LKEGVADNVWIRIVDIEDPEEWIFTDTVYVIGDLTIQQLKEY-IKQLHADDIYEGWMYGE 118 Query: 122 EHGWVEVPVGMHPVTCWWD 140 E + T +WD Sbjct: 119 PVNAGEYDRSKNVYTIFWD 137 >UniRef50_B9XQA3 Putative uncharacterized protein n=1 Tax=bacterium Ellin514 RepID=B9XQA3_9BACT Length = 154 Score = 90.2 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 40/109 (36%), Gaps = 10/109 (9%) Query: 27 LSLEDFFIDNNIHGSICCNVIPEQSPQAIYHHFLKIRERNNVSDVLVEITMFDD---PDW 83 LS+EDFF + + N+ P ++ E VS ++ + ++D Sbjct: 41 LSIEDFFAKCSSDYCLAPNLEPHPGNANFKAALTRLSEEAGVSCCVIPVGEYEDADSELE 100 Query: 84 PFSESILVITTASPEEVQSWFVEEIAPDECWEGWSEDTEHGWVEVPVGM 132 P+S+ + V + + +W E + + G E+P Sbjct: 101 PYSDRVFVAGSVDEAVIANW------ARELRSEFYREERPGA-EMPREY 142

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.308    0.128    0.330

Lambda     K      H
   0.267   0.0387    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 705,925,186
Number of Sequences: 3077464
Number of extensions: 22790021
Number of successful extensions: 58244
Number of sequences better than 1.0e-01: 10
Number of HSP's better than 0.1 without gapping: 23
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 58174
Number of HSP's gapped (non-prelim): 28
length of query: 140
length of database: 1,040,396,356
effective HSP length: 104
effective length of query: 36
effective length of database: 720,340,100
effective search space: 25932243600
effective search space used: 25932243600
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 87 (38.2 bits)
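
The effective lengths in the statistics footer can be re-derived from the other numbers reported there. The sketch below is only an illustration, not part of the BLAST output: the function names are hypothetical, and it assumes the usual length correction (subtract the effective HSP length from the query and from every database sequence) together with the standard Karlin-Altschul relation E = search space x 2^(-bit score). Because the report prints rounded bit scores, and because compositional adjustment is applied per alignment, the recomputed E-values should only be expected to agree with the reported ones to within roughly the same order of magnitude.

```python
def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Length-corrected search space (assumed correction, see lead-in)."""
    eff_query = query_len - hsp_len              # 140 - 104 = 36
    eff_db = db_letters - db_seqs * hsp_len      # 720,340,100 for this run
    return eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Approximate E-value from a bit score: E = N * 2^(-S')."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the footer of this report.
space = effective_search_space(
    query_len=140,
    db_letters=1_040_396_356,
    db_seqs=3_077_464,
    hsp_len=104,
)
print(space)  # 25932243600, matching "effective search space"

# Round-1 hit UniRef50_C5UQB4 is reported with 79.3 bits and Expect = 4e-14.
print(f"{expect_from_bits(79.3, space):.1e}")
```

Running the same function on other reported bit scores (for example 62.4 bits for UniRef50_A3ZYW7 in round 1) gives a quick sanity check that a parsed score and E-value belong to the same search.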