BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (219 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P39293 Uncharacterized protein yjfK n=93 Tax=Enterobact... 451 e-125 UniRef50_Q15WJ0 Putative uncharacterized protein n=5 Tax=Gammapr... 98 2e-19 UniRef50_Q3BQS7 Putative uncharacterized protein n=9 Tax=Xanthom... 97 5e-19 UniRef50_Q7MZH5 Similar to unknown protein n=4 Tax=Gammaproteoba... 74 4e-12 UniRef50_A0NRQ9 Putative uncharacterized protein n=2 Tax=Rhodoba... 67 5e-10 UniRef50_B0SZP2 Putative uncharacterized protein n=2 Tax=Cauloba... 55 2e-06 UniRef50_Q3K622 Putative uncharacterized protein n=1 Tax=Pseudom... 55 2e-06 UniRef50_B6A291 Putative uncharacterized protein n=10 Tax=Rhizob... 53 6e-06 UniRef50_Q48FU6 Putative uncharacterized protein n=8 Tax=Pseudom... 41 0.032 >UniRef50_P39293 Uncharacterized protein yjfK n=93 Tax=Enterobacteriaceae RepID=YJFK_ECOLI Length = 219 Score = 451 bits (1159), Expect = e-125, Method: Compositional matrix adjust. Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH Sbjct: 1 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 Query: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK 120 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK Sbjct: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK 120 Query: 121 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT Sbjct: 121 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 Query: 181 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG Sbjct: 181 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 >UniRef50_Q15WJ0 Putative uncharacterized protein n=5 Tax=Gammaproteobacteria RepID=Q15WJ0_PSEA6 Length = 215 Score = 97.8 bits (242), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 57/192 (29%), Positives = 98/192 (51%), Gaps = 6/192 (3%) Query: 4 FFQRLFGKDNKPAIARGP--LGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHI 61 F +LFGK + + P +GL+L F +D L +L+E L+I + AV + Sbjct: 1 MFSKLFGKKDTSKTPKAPEVMGLYLGGSFQIDPLKLKLIEPSLIIESAASSHIIQAVGEV 60 Query: 62 DLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAKA 121 DL G +I R+YT D FLQ+ GG + I D+KL+ + E+ + ++ W++ + Sbjct: 61 DLDSGGKILRFYTDDDAFLQVVLDGGVTENHITDVKLWYFYETKTVGTDTQWQQLLKNDI 120 Query: 122 MGAMTLNWQEKRWQRFFNSEEPGNIEP-VYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 A + + +QR +++ G++ P V M EK ++ E F M Y+R++ + Sbjct: 121 SQAQ-YSLEGNSYQRVWDA--VGDVSPAVAMTEKTYEEDGDVSETDQFMMLYERELDDSN 177 Query: 181 YEYLLLNGEESF 192 E LL++GEE Sbjct: 178 IEALLVSGEEKL 189 >UniRef50_Q3BQS7 Putative uncharacterized protein n=9 Tax=Xanthomonas RepID=Q3BQS7_XANC5 Length = 243 Score = 96.7 bits (239), Expect = 5e-19, Method: Compositional matrix adjust. Identities = 64/228 (28%), Positives = 109/228 (47%), Gaps = 22/228 (9%) Query: 4 FFQRLFGKDNKPAIARG---------PLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFT 54 FF +LFG+ P + PLGL + +DT +R+ + + LPG Sbjct: 23 FFNKLFGQPQPPPLPTSGSGAIGHALPLGLRVGGQVEIDTTLYRMAPEAMTAELPGGHQG 82 Query: 55 VAAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWR 114 + H++LG G + R+Y D FLQ+ T GG D++ +K FVY E+ + ++ Sbjct: 83 IPCYGHVNLGDGYALHRFYLDDDAFLQVTTVGG----DLEAMKAFVYCETVNPPSKQAFQ 138 Query: 115 E-AINAKAMGAMTLNWQEKRWQRFFNS-EEPGNIEPVYMLEKVENQNHAKW--EVHNFTM 170 E + +GA + + K+WQR S ++ I P+ E + + ++ ++ M Sbjct: 139 EFVMQHPHLGAAQIEYAGKQWQRATQSTDDASRIPPIAYDEVLYRYQPPRRDGDLTHYAM 198 Query: 171 GYQRQVTE-DTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHI 217 Y R V E E+LL+ GE+S G E+ + A+G+D+ + L I Sbjct: 199 LYSRDVPELQREEFLLVTGEDS----GPNEFCVTYAVGIDVTVADLDI 242 >UniRef50_Q7MZH5 Similar to unknown protein n=4 Tax=Gammaproteobacteria RepID=Q7MZH5_PHOLL Length = 220 Score = 73.6 bits (179), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 56/197 (28%), Positives = 91/197 (46%), Gaps = 12/197 (6%) Query: 1 MSGFFQRLFG-KDNKPAIARGP--LGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAA 57 ++ F+ FG K PA+ + P LGL + +D L +L+E +L I + + Sbjct: 5 IADAFKSAFGGKTVAPAVPKVPEVLGLRIGGALEIDPLMLKLIESDLTIENAASTQLICS 64 Query: 58 VSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAI 117 V +DLG ++ RYYT + +LQ+ G D D + ++ L+ + E+ I ++ W I Sbjct: 65 VGVVDLGDNVRLVRYYTDDEGYLQVLQEGEGD-DGVKEVSLWYFYETKPIDSQAQWDALI 123 Query: 118 NAKAMGAMTLNWQEKRWQRFFNSEEP--GNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQ 175 G +T +R+ P NI+PV + EK ++ E F M Y RQ Sbjct: 124 EN---GIVT---PSRRYDLDGTQFSPLWDNIKPVAVTEKTYSKEGHITETDQFVMVYTRQ 177 Query: 176 VTEDTYEYLLLNGEESF 192 V + E L + GEE Sbjct: 178 VAHNRTEELQVVGEEKV 194 >UniRef50_A0NRQ9 Putative uncharacterized protein n=2 Tax=Rhodobacteraceae RepID=A0NRQ9_9RHOB Length = 215 Score = 66.6 bits (161), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 52/194 (26%), Positives = 89/194 (45%), Gaps = 5/194 (2%) Query: 5 FQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHIDLG 64 F LF K +K LGL + G + D +A RLL + LI P + A H DLG Sbjct: 2 FGSLFNKKDKGWQPPEILGLTIGRGISFDPIALRLLPADSLIERPDTTLMITAQGHCDLG 61 Query: 65 GGSQIFRYYTSGDEFL-QINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAKAMG 123 S + R+Y D FL Q G++ + +D+I L+ + + S ++ W +A Sbjct: 62 EQSHMHRFYPDDDRFLIQFQGGDGKEDERVDEIMLWYFYDVQYPSGDAEWNRVKSAIRQT 121 Query: 124 AMTLNWQEK--RWQRFFNSEEPGNIEPVYMLEKVENQNH--AKWEVHNFTMGYQRQVTED 179 + +L +E R++R + +P+ E+V + + + ++ M + R + + Sbjct: 122 SFSLPGEEGDFRFERAWFDTSTSPEDPMTYWEEVCDDRNGGGRRKIFQTAMLFARSLKDG 181 Query: 180 TYEYLLLNGEESFN 193 E LL+N EE N Sbjct: 182 RDEMLLVNMEEPEN 195 >UniRef50_B0SZP2 Putative uncharacterized protein n=2 Tax=Caulobacteraceae RepID=B0SZP2_CAUSK Length = 217 Score = 55.1 bits (131), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 60/213 (28%), Positives = 95/213 (44%), Gaps = 13/213 (6%) Query: 4 FFQRLFG-KDNKPAIARGPL-GLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHI 61 F +LFG KD PA A + + L LDTLA+R L D+L AL + + A + Sbjct: 1 MFSKLFGRKDTAPASALPAIRNVTLGRTVWLDTLAWRRLGDDLKFALDTDTLEITAQGLV 60 Query: 62 DLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFV-YEESY--GISKESHWREAIN 118 +L G + R+YT + Q T E + DI +F+ ++ +Y G + E W + + Sbjct: 61 ELREGGFVHRFYTDDNVMFQAVTDDREG-QRVTDITVFIPWDSAYPGGRADEEAWAKRLR 119 Query: 119 AKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNF--TMGYQRQV 176 A+ L ++R + +E + EPV E V + + F M + R + Sbjct: 120 ARTFTGPNL----PEYRRDWFGDEADSQEPVSFWEDVHDDRDGIPDRRIFQTCMLFSRDL 175 Query: 177 TEDTYEYLLLNGEESFN-DLGEPEWLFSRALGV 208 D E LL +E+ N D + E F +GV Sbjct: 176 PGDGRELLLAIQQENENEDTRQREVSFEIMIGV 208 >UniRef50_Q3K622 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3K622_PSEPF Length = 216 Score = 54.7 bits (130), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 63/225 (28%), Positives = 104/225 (46%), Gaps = 19/225 (8%) Query: 3 GFFQRLFGKDN----KPAI--ARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVA 56 G+F+ L G N PA A GPLGL DT LL+ + +P + + Sbjct: 2 GWFKDLLGTSNWQSAAPASESAGGPLGLAQGKAIRFDTTLGLLLDGSTSVRVPDAQ-AIW 60 Query: 57 AVSHIDLGGGSQIFRYYTSGDEF-LQINTTGGEDIDDIDDIKLFVYEESYGISKESHW-R 114 + IDLG +++ RYY + +EF +QI+ TG D I+ + LF Y ++ ++ R Sbjct: 61 SAGWIDLGQSNKLHRYYLNDEEFWVQIHVTGD---DQIESVTLFNYVSYVTVNSDAELQR 117 Query: 115 EAINAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQR 174 A +G T + + R + +E G E V + E+V N + + + +++ +M Y R Sbjct: 118 LAGPNSQIGLPTYRHEGVEYTREWGTER-GQTELVPLTEQVINPDES-YTINHHSMLYAR 175 Query: 175 QV-TEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHII 218 + D E LL + E+ E S +LG+ + T L I Sbjct: 176 ETGLTDRRELLLFSVEQD----EEGTVSLSTSLGISLYTTDLSTI 216 >UniRef50_B6A291 Putative uncharacterized protein n=10 Tax=Rhizobiales RepID=B6A291_RHILW Length = 219 Score = 53.1 bits (126), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 54/196 (27%), Positives = 86/196 (43%), Gaps = 11/196 (5%) Query: 9 FGKD-NKPAIAR--GPLGLHLNSGFTLDTLAFRL--LEDELLIALPGE-EFTVAAVSHID 62 FG+D N+ + R GPL + +D L+ L E + LP F +A Sbjct: 5 FGRDKNEKPLPRELGPLSAAIGGALEIDFLSLEAETLGGEPAMPLPRSGPFIIAGYGESS 64 Query: 63 LGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHW-REAINAKA 121 L + + RYY +Q+ + G+ D +DDI + +S + + W R A Sbjct: 65 LDAATVLSRYYDEDHRMIQVMSASGQPGDAVDDISFYQPWDSVVPAGQGEWNRWTGPAGL 124 Query: 122 MGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDTY 181 +G + + + RF+ E P + V +EKVE+ A+ +H M Y R + T Sbjct: 125 VGQPSYDADGILYSRFWG-EGPERAQLVEFVEKVED-GEAQRSIHQTCMLYYRPLGS-TR 181 Query: 182 EYLLLNGEESFNDLGE 197 E LL+N E DLG+ Sbjct: 182 EMLLINVERDL-DLGQ 196 >UniRef50_Q48FU6 Putative uncharacterized protein n=8 Tax=Pseudomonas RepID=Q48FU6_PSE14 Length = 221 Score = 40.8 bits (94), Expect = 0.032, Method: Compositional matrix adjust. Identities = 46/176 (26%), Positives = 85/176 (48%), Gaps = 11/176 (6%) Query: 46 IALPGEEFTVAAVSHIDLGGGSQIFRYYTSG-DEFLQINTTGGEDIDDIDDIKLFVYEES 104 + + G+E V AV +DLG + R+Y D FLQ+ + G + +D+ DI LF Y + Sbjct: 54 VVVSGDE-KVWAVGRVDLGQSMALHRFYLDNEDYFLQV-VSNGLNPEDVQDIILFGYYSA 111 Query: 105 YGI-SKESHWREAINAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKW 163 I SK+ R + +G T + ++R + + PG E + E + + + A + Sbjct: 112 EPITSKDELLRLTGPSSKIGLPTYEHDGEVFERQWGT-SPGQTELTPLDEDIVSPD-AAY 169 Query: 164 EVHNFTMGYQRQV-TEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHII 218 V + +M Y R+ + E+LL + EE E + A+G+ + T ++++ Sbjct: 170 RVKHLSMLYARETGLINRREFLLFSVEED----EEGSITLTTAVGITLQSTDINVL 221 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39293 Uncharacterized protein yjfK n=93 Tax=Enterobact... 292 4e-78 UniRef50_Q3BQS7 Putative uncharacterized protein n=9 Tax=Xanthom... 234 2e-60 UniRef50_A0NRQ9 Putative uncharacterized protein n=2 Tax=Rhodoba... 228 1e-58 UniRef50_Q15WJ0 Putative uncharacterized protein n=5 Tax=Gammapr... 219 5e-56 UniRef50_B0SZP2 Putative uncharacterized protein n=2 Tax=Cauloba... 209 5e-53 UniRef50_Q7MZH5 Similar to unknown protein n=4 Tax=Gammaproteoba... 206 6e-52 UniRef50_Q3K622 Putative uncharacterized protein n=1 Tax=Pseudom... 201 2e-50 UniRef50_B6A291 Putative uncharacterized protein n=10 Tax=Rhizob... 187 3e-46 Sequences not found previously or not previously below threshold: UniRef50_B8GZC5 Putative uncharacterized protein n=3 Tax=Cauloba... 155 1e-36 UniRef50_D0XST4 Putative uncharacterized protein n=1 Tax=Brevund... 150 3e-35 UniRef50_Q48FU6 Putative uncharacterized protein n=8 Tax=Pseudom... 130 3e-29 UniRef50_C4XQI6 Putative uncharacterized protein n=1 Tax=Desulfo... 73 8e-12 UniRef50_C5S9X0 Putative uncharacterized protein n=1 Tax=Allochr... 64 3e-09 UniRef50_C6BPG4 Putative uncharacterized protein n=2 Tax=Burkhol... 48 2e-04 >UniRef50_P39293 Uncharacterized protein yjfK n=93 Tax=Enterobacteriaceae RepID=YJFK_ECOLI Length = 219 Score = 292 bits (748), Expect = 4e-78, Method: Composition-based stats. Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH Sbjct: 1 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 Query: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK 120 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK Sbjct: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK 120 Query: 121 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT Sbjct: 121 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 Query: 181 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG Sbjct: 181 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 >UniRef50_Q3BQS7 Putative uncharacterized protein n=9 Tax=Xanthomonas RepID=Q3BQS7_XANC5 Length = 243 Score = 234 bits (596), Expect = 2e-60, Method: Composition-based stats. Identities = 64/228 (28%), Positives = 108/228 (47%), Gaps = 22/228 (9%) Query: 4 FFQRLFGKDNKPAIARG---------PLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFT 54 FF +LFG+ P + PLGL + +DT +R+ + + LPG Sbjct: 23 FFNKLFGQPQPPPLPTSGSGAIGHALPLGLRVGGQVEIDTTLYRMAPEAMTAELPGGHQG 82 Query: 55 VAAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWR 114 + H++LG G + R+Y D FLQ+ T GG D++ +K FVY E+ + ++ Sbjct: 83 IPCYGHVNLGDGYALHRFYLDDDAFLQVTTVGG----DLEAMKAFVYCETVNPPSKQAFQ 138 Query: 115 E-AINAKAMGAMTLNWQEKRWQRFFNS-EEPGNIEPVYMLEKVENQNHAKW--EVHNFTM 170 E + +GA + + K+WQR S ++ I P+ E + + ++ ++ M Sbjct: 139 EFVMQHPHLGAAQIEYAGKQWQRATQSTDDASRIPPIAYDEVLYRYQPPRRDGDLTHYAM 198 Query: 171 GYQRQVTE-DTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHI 217 Y R V E E+LL+ GE D G E+ + A+G+D+ + L I Sbjct: 199 LYSRDVPELQREEFLLVTGE----DSGPNEFCVTYAVGIDVTVADLDI 242 >UniRef50_A0NRQ9 Putative uncharacterized protein n=2 Tax=Rhodobacteraceae RepID=A0NRQ9_9RHOB Length = 215 Score = 228 bits (580), Expect = 1e-58, Method: Composition-based stats. Identities = 55/219 (25%), Positives = 95/219 (43%), Gaps = 9/219 (4%) Query: 4 FFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHIDL 63 F LF K +K LGL + G + D +A RLL + LI P + A H DL Sbjct: 1 MFGSLFNKKDKGWQPPEILGLTIGRGISFDPIALRLLPADSLIERPDTTLMITAQGHCDL 60 Query: 64 GGGSQIFRYYTSGDEFL-QINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAKAM 122 G S + R+Y D FL Q G++ + +D+I L+ + + S ++ W +A Sbjct: 61 GEQSHMHRFYPDDDRFLIQFQGGDGKEDERVDEIMLWYFYDVQYPSGDAEWNRVKSAIRQ 120 Query: 123 GAMTLNWQEK--RWQRFFNSEEPGNIEPVYMLEKVENQN--HAKWEVHNFTMGYQRQVTE 178 + +L +E R++R + +P+ E+V + + ++ M + R + + Sbjct: 121 TSFSLPGEEGDFRFERAWFDTSTSPEDPMTYWEEVCDDRNGGGRRKIFQTAMLFARSLKD 180 Query: 179 DTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHI 217 E LL+N EE N E + LG + + + Sbjct: 181 GRDEMLLVNMEEPEN----AERSVAFMLGFPLERHNFSV 215 >UniRef50_Q15WJ0 Putative uncharacterized protein n=5 Tax=Gammaproteobacteria RepID=Q15WJ0_PSEA6 Length = 215 Score = 219 bits (558), Expect = 5e-56, Method: Composition-based stats. Identities = 62/220 (28%), Positives = 108/220 (49%), Gaps = 9/220 (4%) Query: 4 FFQRLFGKDNKPAIARGP--LGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHI 61 F +LFGK + + P +GL+L F +D L +L+E L+I + AV + Sbjct: 1 MFSKLFGKKDTSKTPKAPEVMGLYLGGSFQIDPLKLKLIEPSLIIESAASSHIIQAVGEV 60 Query: 62 DLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAKA 121 DL G +I R+YT D FLQ+ GG + I D+KL+ + E+ + ++ W++ + Sbjct: 61 DLDSGGKILRFYTDDDAFLQVVLDGGVTENHITDVKLWYFYETKTVGTDTQWQQLLKNDI 120 Query: 122 MGAMTLNWQEKRWQRFFNSEEPGNIEP-VYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 A + + +QR + + G++ P V M EK ++ E F M Y+R++ + Sbjct: 121 SQA-QYSLEGNSYQRVW--DAVGDVSPAVAMTEKTYEEDGDVSETDQFMMLYERELDDSN 177 Query: 181 YEYLLLNGEESFNDLGEP-EWLFSRALGVDIPLTSLHIIG 219 E LL++GEE +G+ + + G +I + I G Sbjct: 178 IEALLVSGEEKL--VGQNFDRCLVISSGFNIEQADITING 215 >UniRef50_B0SZP2 Putative uncharacterized protein n=2 Tax=Caulobacteraceae RepID=B0SZP2_CAUSK Length = 217 Score = 209 bits (532), Expect = 5e-53, Method: Composition-based stats. Identities = 58/220 (26%), Positives = 93/220 (42%), Gaps = 13/220 (5%) Query: 4 FFQRLFG-KDNKPAIARGPL-GLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHI 61 F +LFG KD PA A + + L LDTLA+R L D+L AL + + A + Sbjct: 1 MFSKLFGRKDTAPASALPAIRNVTLGRTVWLDTLAWRRLGDDLKFALDTDTLEITAQGLV 60 Query: 62 DLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISK---ESHWREAIN 118 +L G + R+YT + Q T E + DI +F+ +S E W + + Sbjct: 61 ELREGGFVHRFYTDDNVMFQAVTDDRE-GQRVTDITVFIPWDSAYPGGRADEEAWAKRLR 119 Query: 119 AKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWE--VHNFTMGYQRQV 176 A+ L ++R + +E + EPV E V + + + M + R + Sbjct: 120 ARTFTGPNLP----EYRRDWFGDEADSQEPVSFWEDVHDDRDGIPDRRIFQTCMLFSRDL 175 Query: 177 TEDTYEYLLLNGEESFN-DLGEPEWLFSRALGVDIPLTSL 215 D E LL +E+ N D + E F +GV + + Sbjct: 176 PGDGRELLLAIQQENENEDTRQREVSFEIMIGVALGVGEF 215 >UniRef50_Q7MZH5 Similar to unknown protein n=4 Tax=Gammaproteobacteria RepID=Q7MZH5_PHOLL Length = 220 Score = 206 bits (523), Expect = 6e-52, Method: Composition-based stats. Identities = 56/224 (25%), Positives = 97/224 (43%), Gaps = 13/224 (5%) Query: 1 MSGFFQRLFG-KDNKPAIARGP--LGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAA 57 ++ F+ FG K PA+ + P LGL + +D L +L+E +L I + + Sbjct: 5 IADAFKSAFGGKTVAPAVPKVPEVLGLRIGGALEIDPLMLKLIESDLTIENAASTQLICS 64 Query: 58 VSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAI 117 V +DLG ++ RYYT + +LQ+ G E D + ++ L+ + E+ I ++ W I Sbjct: 65 VGVVDLGDNVRLVRYYTDDEGYLQVLQEG-EGDDGVKEVSLWYFYETKPIDSQAQWDALI 123 Query: 118 NAKAMGAMTLNWQEKRWQRFFNSEEP--GNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQ 175 + +R+ P NI+PV + EK ++ E F M Y RQ Sbjct: 124 ENGIVT------PSRRYDLDGTQFSPLWDNIKPVAVTEKTYSKEGHITETDQFVMVYTRQ 177 Query: 176 VTEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 V + E L + GEE + + G+ + T ++ Sbjct: 178 VAHNRTEELQVVGEEKVVGSHLDRLMV-LSTGIQLNQTDFKVVA 220 >UniRef50_Q3K622 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3K622_PSEPF Length = 216 Score = 201 bits (510), Expect = 2e-50, Method: Composition-based stats. Identities = 60/225 (26%), Positives = 101/225 (44%), Gaps = 19/225 (8%) Query: 3 GFFQRLFGKDN------KPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVA 56 G+F+ L G N A GPLGL DT LL+ + +P + Sbjct: 2 GWFKDLLGTSNWQSAAPASESAGGPLGLAQGKAIRFDTTLGLLLDGSTSVRVPDA-QAIW 60 Query: 57 AVSHIDLGGGSQIFRYYTSGDEF-LQINTTGGEDIDDIDDIKLFVYEESYGISKESHWRE 115 + IDLG +++ RYY + +EF +QI+ TG D I+ + LF Y ++ ++ + Sbjct: 61 SAGWIDLGQSNKLHRYYLNDEEFWVQIHVTG---DDQIESVTLFNYVSYVTVNSDAELQR 117 Query: 116 -AINAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQR 174 A +G T + + R + +E G E V + E+V N + + + +++ +M Y R Sbjct: 118 LAGPNSQIGLPTYRHEGVEYTREWGTER-GQTELVPLTEQVINPDES-YTINHHSMLYAR 175 Query: 175 QVT-EDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHII 218 + D E LL + E+ E S +LG+ + T L I Sbjct: 176 ETGLTDRRELLLFSVEQDE----EGTVSLSTSLGISLYTTDLSTI 216 >UniRef50_B6A291 Putative uncharacterized protein n=10 Tax=Rhizobiales RepID=B6A291_RHILW Length = 219 Score = 187 bits (474), Expect = 3e-46, Method: Composition-based stats. Identities = 56/223 (25%), Positives = 92/223 (41%), Gaps = 15/223 (6%) Query: 1 MSGFFQRLFGKDNKPAIAR--GPLGLHLNSGFTLDTLAFRL--LEDELLIALPGE-EFTV 55 M G+F R N+ + R GPL + +D L+ L E + LP F + Sbjct: 1 MIGWFGR---DKNEKPLPRELGPLSAAIGGALEIDFLSLEAETLGGEPAMPLPRSGPFII 57 Query: 56 AAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHW-R 114 A L + + RYY +Q+ + G+ D +DDI + +S + + W R Sbjct: 58 AGYGESSLDAATVLSRYYDEDHRMIQVMSASGQPGDAVDDISFYQPWDSVVPAGQGEWNR 117 Query: 115 EAINAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQR 174 A +G + + + RF+ E P + V +EKVE+ A+ +H M Y R Sbjct: 118 WTGPAGLVGQPSYDADGILYSRFWG-EGPERAQLVEFVEKVED-GEAQRSIHQTCMLYYR 175 Query: 175 QVTEDTYEYLLLNGEESFNDLGEPEW--LFSRALGVDIPLTSL 215 + T E LL+N E DLG+ + +G + + Sbjct: 176 PLGS-TREMLLINVERDL-DLGQSQAGSSVEFLIGYGLAPADV 216 >UniRef50_B8GZC5 Putative uncharacterized protein n=3 Tax=Caulobacter RepID=B8GZC5_CAUCN Length = 257 Score = 155 bits (391), Expect = 1e-36, Method: Composition-based stats. Identities = 44/221 (19%), Positives = 85/221 (38%), Gaps = 17/221 (7%) Query: 4 FFQRLFGKDNKPAIARGPL--GLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHI 61 F +LFG+ ++P+ P+ + + LD LA+R L E AL + + A I Sbjct: 43 MFGKLFGRKDQPSGPALPIIRNVTIGRTVVLDPLAWRRLGAETKFALDRDTLEITAQGLI 102 Query: 62 DLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESH---WREAIN 118 L G+ + R+YT + Q+ + E +D +FV S + + W + + Sbjct: 103 QLNDGAFVHRFYTEDEILFQVVSDDRE-GQKANDFTVFVPWASEYPADRTDHELWSQRLR 161 Query: 119 AKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVE---NQNHAKWEVHNFTMGYQRQ 175 ++ L + R + +E +PV + E V + + M + R Sbjct: 162 SRTFQPEGLPA----YTRLWFGDEAEQQDPVTLWEDVYYARDAQTPDRRLFQTVMLFHRD 217 Query: 176 VTE-DTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSL 215 + + D E LL E + + +G+ + + Sbjct: 218 LLDGDGRELLLALTLEPEDS---KDVSHETMIGLPLSVGEF 255 >UniRef50_D0XST4 Putative uncharacterized protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XST4_9CAUL Length = 231 Score = 150 bits (379), Expect = 3e-35, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 79/221 (35%), Gaps = 12/221 (5%) Query: 4 FFQRLFGKDNKP-AIARGPL--GLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 F+RLFG P + R + + + +LD LA+R L+ E + L + + A Sbjct: 12 MFKRLFGGPTTPEPVNRLAVVRNITVGRTVSLDPLAWRRLQPETVFNLETDALEITAQGT 71 Query: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESH---WREAI 117 I L G + R+YT LQ + + D LF+ S E+ WR+ + Sbjct: 72 IALDSGQHVHRFYTDDHVMLQAMSDDPSGAEAY-DFSLFIPWTSAYPPGETERRIWRDRL 130 Query: 118 NAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKW---EVHNFTMGYQR 174 + + + RF+ SE PV + E + + A + M Y R Sbjct: 131 SEPVFDGA--PEELPAYPRFWFSESDARQPPVTLWETIWDDRTATTPFSRIFQTCMLYAR 188 Query: 175 QVTEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSL 215 + L L E + +G+ + + Sbjct: 189 DLAGGRELMLALEMEPEKATGKSADISHEIMVGIPLEMAEF 229 >UniRef50_Q48FU6 Putative uncharacterized protein n=8 Tax=Pseudomonas RepID=Q48FU6_PSE14 Length = 221 Score = 130 bits (327), Expect = 3e-29, Method: Composition-based stats. Identities = 53/226 (23%), Positives = 99/226 (43%), Gaps = 18/226 (7%) Query: 4 FFQRLFGKDNKPAIAR---------GPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFT 54 +F+R G + A R PLGL LD+ LL+ + + G+E Sbjct: 3 WFKRAMGLEAPKASGRDGVQSVNTVSPLGLASGRMLCLDSSLKLLLDGHSQVVVSGDEK- 61 Query: 55 VAAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESH-W 113 V AV +DLG + R+Y +++ + G + +D+ DI LF Y + I+ + Sbjct: 62 VWAVGRVDLGQSMALHRFYLDNEDYFLQVVSNGLNPEDVQDIILFGYYSAEPITSKDELL 121 Query: 114 REAINAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQ 173 R + +G T + ++R + + PG E + E + + + A V + +M Y Sbjct: 122 RLTGPSSKIGLPTYEHDGEVFERQWGT-SPGQTELTPLDEDIVSPDAAYR-VKHLSMLYA 179 Query: 174 RQVT-EDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHII 218 R+ + E+LL + EE E + A+G+ + T ++++ Sbjct: 180 RETGLINRREFLLFSVEEDE----EGSITLTTAVGITLQSTDINVL 221 >UniRef50_C4XQI6 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XQI6_DESMR Length = 211 Score = 72.8 bits (177), Expect = 8e-12, Method: Composition-based stats. Identities = 46/215 (21%), Positives = 74/215 (34%), Gaps = 19/215 (8%) Query: 4 FFQRLFGKDNKPAIARGPLGLHLNSGFTLDTL-AFRLLEDELLIALPGEEFTVAAVSHID 62 FF + DN PA PL L + + +D A R L IA P E V A+S Sbjct: 2 FFSKRPKADNHPAYPNFPLELRIGAILAVDVAEALRFEGLGLTIAPPQGELLVEALSSTS 61 Query: 63 LGGGSQIFRYYTSGDE---FLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWR-EAIN 118 L G ++ R Y E Q N G + D+ F + + + W Sbjct: 62 L-FGLKLVRAYAKQGEATYLFQFNQDGAG---ALLDVSFFRLLQEIRPATAADWGLWLDA 117 Query: 119 AKAMGAMTLNWQ-EKRWQRFFNSEEPGNIEPVYMLEKVE-NQNHAKWEVHNFTMGYQRQV 176 +G LN + + R + + PV E + + + + Y R+V Sbjct: 118 GGLIGGKDLNAPNGQTYLRQWG--DGDYAPPVEAEELLFTDPKDPPRCLAHQMHLYTREV 175 Query: 177 TEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIP 211 ++ L+ D L +G+D+ Sbjct: 176 GDENENMLV------SADTEPEAALVRAWIGLDLT 204 >UniRef50_C5S9X0 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S9X0_CHRVI Length = 511 Score = 64.4 bits (155), Expect = 3e-09, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 72/207 (34%), Gaps = 24/207 (11%) Query: 25 HLNSGFTLDTLAFRLLEDELLIALPGEEFT-----VAAVSHIDLGGGSQIFRYYTS-GDE 78 + +D F L E + P V V + G G +R Y S GD Sbjct: 314 RVGMTLPVDPSLFILAEPLTKLQAPRSASGSGLVSVERVGEVR-GEGVTWYRLYVSGGDG 372 Query: 79 FLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAI--NAKAMGAMTLN-WQEKRWQ 135 F Q++ D + + F + + W + + +G + + Sbjct: 373 FFQVHLDAPGQPD---ECRYFSRLDVVEPADADEWGVWLDRDEGLIGWPEFQTQDGQLYA 429 Query: 136 RFFNSEEPGNIEPVYMLEKVENQNH-AKWEVHNFTMGYQRQVTED----TYEYLLLNGEE 190 R + S EP + E +E + M Y R + EYLL+ Sbjct: 430 RLW-SPGQTRREPYSLRETLEAADGTDIEPCRQQAMLYTRATGAQPPMPSTEYLLVAA-- 486 Query: 191 SFNDLGEPEWLFSRALGVDIPLTSLHI 217 N+ G+ W S +G+DIP+ SL++ Sbjct: 487 --NEQGDGAW-VSLHVGIDIPVASLNL 510 >UniRef50_C6BPG4 Putative uncharacterized protein n=2 Tax=Burkholderiaceae RepID=C6BPG4_RALP1 Length = 265 Score = 48.2 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 38/227 (16%), Positives = 71/227 (31%), Gaps = 40/227 (17%) Query: 21 PLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHIDLGGGS--QIFRYYTS--- 75 P G + S + LL LL + + A S + L G +FR YT Sbjct: 34 PFGARIGSLLEVPRTQIALLTGSLLTLPKSAQMPIVAASRVRLDGADDIALFRLYTDTGL 93 Query: 76 -----GDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHW----------------- 113 G +LQ+ G ++D+I D+ + + + + Sbjct: 94 DRSGAGASYLQVLCAQG-NVDEIRDLAYYQFLDRTFPITDEEQAPFRGEGFGLGQTDFEM 152 Query: 114 -REAINAKAMGAMTLN-----WQEKRWQRFFNSEEPGNIEPVYMLEKVENQ--NHAKWEV 165 E + A L R+ R + ++P E + + Sbjct: 153 GDEQLANIPQVAPQLAALLGGADSLRFVRD--TPGGDYVKPFQAEETRMDDPIGEEGMQK 210 Query: 166 HNFTMGYQRQVTEDTYEYLLLNGEESFNDLGEPEWL--FSRALGVDI 210 M Y R + + E LL++ + + G+P +G+ + Sbjct: 211 RQSFMPYVRALADGKQERLLISFDNVLSMDGKPTRAAYVDYLVGLAL 257 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39293 Uncharacterized protein yjfK n=93 Tax=Enterobact... 258 1e-67 UniRef50_A0NRQ9 Putative uncharacterized protein n=2 Tax=Rhodoba... 223 5e-57 UniRef50_Q15WJ0 Putative uncharacterized protein n=5 Tax=Gammapr... 218 1e-55 UniRef50_D0XST4 Putative uncharacterized protein n=1 Tax=Brevund... 210 3e-53 UniRef50_Q7MZH5 Similar to unknown protein n=4 Tax=Gammaproteoba... 208 1e-52 UniRef50_B0SZP2 Putative uncharacterized protein n=2 Tax=Cauloba... 207 2e-52 UniRef50_Q3BQS7 Putative uncharacterized protein n=9 Tax=Xanthom... 207 2e-52 UniRef50_B8GZC5 Putative uncharacterized protein n=3 Tax=Cauloba... 199 8e-50 UniRef50_B6A291 Putative uncharacterized protein n=10 Tax=Rhizob... 187 2e-46 UniRef50_Q3K622 Putative uncharacterized protein n=1 Tax=Pseudom... 185 1e-45 UniRef50_Q48FU6 Putative uncharacterized protein n=8 Tax=Pseudom... 175 1e-42 UniRef50_C4XQI6 Putative uncharacterized protein n=1 Tax=Desulfo... 160 2e-38 UniRef50_C5S9X0 Putative uncharacterized protein n=1 Tax=Allochr... 138 1e-31 UniRef50_C6BPG4 Putative uncharacterized protein n=2 Tax=Burkhol... 134 2e-30 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P39293 Uncharacterized protein yjfK n=93 Tax=Enterobacteriaceae RepID=YJFK_ECOLI Length = 219 Score = 258 bits (658), Expect = 1e-67, Method: Composition-based stats. Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH Sbjct: 1 MSGFFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 Query: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK 120 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK Sbjct: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAK 120 Query: 121 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT Sbjct: 121 AMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 Query: 181 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG Sbjct: 181 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 >UniRef50_A0NRQ9 Putative uncharacterized protein n=2 Tax=Rhodobacteraceae RepID=A0NRQ9_9RHOB Length = 215 Score = 223 bits (567), Expect = 5e-57, Method: Composition-based stats. Identities = 55/219 (25%), Positives = 95/219 (43%), Gaps = 9/219 (4%) Query: 4 FFQRLFGKDNKPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHIDL 63 F LF K +K LGL + G + D +A RLL + LI P + A H DL Sbjct: 1 MFGSLFNKKDKGWQPPEILGLTIGRGISFDPIALRLLPADSLIERPDTTLMITAQGHCDL 60 Query: 64 GGGSQIFRYYTSGDEFL-QINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAKAM 122 G S + R+Y D FL Q G++ + +D+I L+ + + S ++ W +A Sbjct: 61 GEQSHMHRFYPDDDRFLIQFQGGDGKEDERVDEIMLWYFYDVQYPSGDAEWNRVKSAIRQ 120 Query: 123 GAMTLNWQEK--RWQRFFNSEEPGNIEPVYMLEKVENQN--HAKWEVHNFTMGYQRQVTE 178 + +L +E R++R + +P+ E+V + + ++ M + R + + Sbjct: 121 TSFSLPGEEGDFRFERAWFDTSTSPEDPMTYWEEVCDDRNGGGRRKIFQTAMLFARSLKD 180 Query: 179 DTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHI 217 E LL+N EE N E + LG + + + Sbjct: 181 GRDEMLLVNMEEPEN----AERSVAFMLGFPLERHNFSV 215 >UniRef50_Q15WJ0 Putative uncharacterized protein n=5 Tax=Gammaproteobacteria RepID=Q15WJ0_PSEA6 Length = 215 Score = 218 bits (554), Expect = 1e-55, Method: Composition-based stats. Identities = 61/219 (27%), Positives = 105/219 (47%), Gaps = 7/219 (3%) Query: 4 FFQRLFGKDNKPAIARGP--LGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHI 61 F +LFGK + + P +GL+L F +D L +L+E L+I + AV + Sbjct: 1 MFSKLFGKKDTSKTPKAPEVMGLYLGGSFQIDPLKLKLIEPSLIIESAASSHIIQAVGEV 60 Query: 62 DLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAINAKA 121 DL G +I R+YT D FLQ+ GG + I D+KL+ + E+ + ++ W++ + Sbjct: 61 DLDSGGKILRFYTDDDAFLQVVLDGGVTENHITDVKLWYFYETKTVGTDTQWQQLLKNDI 120 Query: 122 MGAMTLNWQEKRWQRFFNSEEPGNIEP-VYMLEKVENQNHAKWEVHNFTMGYQRQVTEDT 180 A + + +QR + + G++ P V M EK ++ E F M Y+R++ + Sbjct: 121 SQA-QYSLEGNSYQRVW--DAVGDVSPAVAMTEKTYEEDGDVSETDQFMMLYERELDDSN 177 Query: 181 YEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 E LL++GEE + + G +I + I G Sbjct: 178 IEALLVSGEEKLVGQNF-DRCLVISSGFNIEQADITING 215 >UniRef50_D0XST4 Putative uncharacterized protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XST4_9CAUL Length = 231 Score = 210 bits (534), Expect = 3e-53, Method: Composition-based stats. Identities = 46/222 (20%), Positives = 79/222 (35%), Gaps = 12/222 (5%) Query: 4 FFQRLFGKDNKP-AIARGPL--GLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSH 60 F+RLFG P + R + + + +LD LA+R L+ E + L + + A Sbjct: 12 MFKRLFGGPTTPEPVNRLAVVRNITVGRTVSLDPLAWRRLQPETVFNLETDALEITAQGT 71 Query: 61 IDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESH---WREAI 117 I L G + R+YT LQ + + D LF+ S E+ WR+ + Sbjct: 72 IALDSGQHVHRFYTDDHVMLQAMSDDPSGAEAY-DFSLFIPWTSAYPPGETERRIWRDRL 130 Query: 118 NAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKW---EVHNFTMGYQR 174 + + + RF+ SE PV + E + + A + M Y R Sbjct: 131 SEPVFDGA--PEELPAYPRFWFSESDARQPPVTLWETIWDDRTATTPFSRIFQTCMLYAR 188 Query: 175 QVTEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLH 216 + L L E + +G+ + + Sbjct: 189 DLAGGRELMLALEMEPEKATGKSADISHEIMVGIPLEMAEFT 230 >UniRef50_Q7MZH5 Similar to unknown protein n=4 Tax=Gammaproteobacteria RepID=Q7MZH5_PHOLL Length = 220 Score = 208 bits (528), Expect = 1e-52, Method: Composition-based stats. Identities = 53/223 (23%), Positives = 96/223 (43%), Gaps = 11/223 (4%) Query: 1 MSGFFQRLFGKDN-KPAIARGP--LGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAA 57 ++ F+ FG PA+ + P LGL + +D L +L+E +L I + + Sbjct: 5 IADAFKSAFGGKTVAPAVPKVPEVLGLRIGGALEIDPLMLKLIESDLTIENAASTQLICS 64 Query: 58 VSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAI 117 V +DLG ++ RYYT + +LQ+ GE D + ++ L+ + E+ I ++ W I Sbjct: 65 VGVVDLGDNVRLVRYYTDDEGYLQVLQE-GEGDDGVKEVSLWYFYETKPIDSQAQWDALI 123 Query: 118 NAKAMGAM-TLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQRQV 176 + + ++ + NI+PV + EK ++ E F M Y RQV Sbjct: 124 ENGIVTPSRRYDLDGTQFSPLW-----DNIKPVAVTEKTYSKEGHITETDQFVMVYTRQV 178 Query: 177 TEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHIIG 219 + E L + GEE + + G+ + T ++ Sbjct: 179 AHNRTEELQVVGEEKVVGSHLDRLMV-LSTGIQLNQTDFKVVA 220 >UniRef50_B0SZP2 Putative uncharacterized protein n=2 Tax=Caulobacteraceae RepID=B0SZP2_CAUSK Length = 217 Score = 207 bits (527), Expect = 2e-52, Method: Composition-based stats. Identities = 53/220 (24%), Positives = 90/220 (40%), Gaps = 13/220 (5%) Query: 4 FFQRLFGKDNKPAIARGPL--GLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHI 61 F +LFG+ + + P + L LDTLA+R L D+L AL + + A + Sbjct: 1 MFSKLFGRKDTAPASALPAIRNVTLGRTVWLDTLAWRRLGDDLKFALDTDTLEITAQGLV 60 Query: 62 DLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESH---WREAIN 118 +L G + R+YT + Q T E + DI +F+ +S + W + + Sbjct: 61 ELREGGFVHRFYTDDNVMFQAVTDDRE-GQRVTDITVFIPWDSAYPGGRADEEAWAKRLR 119 Query: 119 AKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNH--AKWEVHNFTMGYQRQV 176 A+ L ++R + +E + EPV E V + + M + R + Sbjct: 120 ARTFTGPNLP----EYRRDWFGDEADSQEPVSFWEDVHDDRDGIPDRRIFQTCMLFSRDL 175 Query: 177 TEDTYEYLLLNGEESFN-DLGEPEWLFSRALGVDIPLTSL 215 D E LL +E+ N D + E F +GV + + Sbjct: 176 PGDGRELLLAIQQENENEDTRQREVSFEIMIGVALGVGEF 215 >UniRef50_Q3BQS7 Putative uncharacterized protein n=9 Tax=Xanthomonas RepID=Q3BQS7_XANC5 Length = 243 Score = 207 bits (526), Expect = 2e-52, Method: Composition-based stats. Identities = 64/228 (28%), Positives = 108/228 (47%), Gaps = 22/228 (9%) Query: 4 FFQRLFGKDNKPAIARG---------PLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFT 54 FF +LFG+ P + PLGL + +DT +R+ + + LPG Sbjct: 23 FFNKLFGQPQPPPLPTSGSGAIGHALPLGLRVGGQVEIDTTLYRMAPEAMTAELPGGHQG 82 Query: 55 VAAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWR 114 + H++LG G + R+Y D FLQ+ T GG D++ +K FVY E+ + ++ Sbjct: 83 IPCYGHVNLGDGYALHRFYLDDDAFLQVTTVGG----DLEAMKAFVYCETVNPPSKQAFQ 138 Query: 115 E-AINAKAMGAMTLNWQEKRWQRFFNS-EEPGNIEPVYMLEKVENQNHAKW--EVHNFTM 170 E + +GA + + K+WQR S ++ I P+ E + + ++ ++ M Sbjct: 139 EFVMQHPHLGAAQIEYAGKQWQRATQSTDDASRIPPIAYDEVLYRYQPPRRDGDLTHYAM 198 Query: 171 GYQRQVTE-DTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHI 217 Y R V E E+LL+ GE D G E+ + A+G+D+ + L I Sbjct: 199 LYSRDVPELQREEFLLVTGE----DSGPNEFCVTYAVGIDVTVADLDI 242 >UniRef50_B8GZC5 Putative uncharacterized protein n=3 Tax=Caulobacter RepID=B8GZC5_CAUCN Length = 257 Score = 199 bits (505), Expect = 8e-50, Method: Composition-based stats. Identities = 44/223 (19%), Positives = 85/223 (38%), Gaps = 17/223 (7%) Query: 2 SGFFQRLFGKDNKPAIARGPL--GLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVS 59 F +LFG+ ++P+ P+ + + LD LA+R L E AL + + A Sbjct: 41 IAMFGKLFGRKDQPSGPALPIIRNVTIGRTVVLDPLAWRRLGAETKFALDRDTLEITAQG 100 Query: 60 HIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESH---WREA 116 I L G+ + R+YT + Q+ + E +D +FV S + + W + Sbjct: 101 LIQLNDGAFVHRFYTEDEILFQVVSDDRE-GQKANDFTVFVPWASEYPADRTDHELWSQR 159 Query: 117 INAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVE---NQNHAKWEVHNFTMGYQ 173 + ++ L + R + +E +PV + E V + + M + Sbjct: 160 LRSRTFQPEGLPA----YTRLWFGDEAEQQDPVTLWEDVYYARDAQTPDRRLFQTVMLFH 215 Query: 174 RQVTE-DTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSL 215 R + + D E LL E + + +G+ + + Sbjct: 216 RDLLDGDGRELLLALTLEPEDS---KDVSHETMIGLPLSVGEF 255 >UniRef50_B6A291 Putative uncharacterized protein n=10 Tax=Rhizobiales RepID=B6A291_RHILW Length = 219 Score = 187 bits (475), Expect = 2e-46, Method: Composition-based stats. Identities = 53/222 (23%), Positives = 89/222 (40%), Gaps = 13/222 (5%) Query: 1 MSGFFQRLFGKDNKPAIAR--GPLGLHLNSGFTLDTLAFRL--LEDELLIALPGE-EFTV 55 M G+F R N+ + R GPL + +D L+ L E + LP F + Sbjct: 1 MIGWFGR---DKNEKPLPRELGPLSAAIGGALEIDFLSLEAETLGGEPAMPLPRSGPFII 57 Query: 56 AAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHW-R 114 A L + + RYY +Q+ + G+ D +DDI + +S + + W R Sbjct: 58 AGYGESSLDAATVLSRYYDEDHRMIQVMSASGQPGDAVDDISFYQPWDSVVPAGQGEWNR 117 Query: 115 EAINAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQR 174 A +G + + + RF+ E P + V +EKVE+ A+ +H M Y R Sbjct: 118 WTGPAGLVGQPSYDADGILYSRFWG-EGPERAQLVEFVEKVED-GEAQRSIHQTCMLYYR 175 Query: 175 QVTEDTYEYLLLNGEESFN-DLGEPEWLFSRALGVDIPLTSL 215 + T E LL+N E + + +G + + Sbjct: 176 PLG-STREMLLINVERDLDLGQSQAGSSVEFLIGYGLAPADV 216 >UniRef50_Q3K622 Putative uncharacterized protein n=1 Tax=Pseudomonas fluorescens Pf0-1 RepID=Q3K622_PSEPF Length = 216 Score = 185 bits (469), Expect = 1e-45, Method: Composition-based stats. Identities = 59/225 (26%), Positives = 100/225 (44%), Gaps = 19/225 (8%) Query: 3 GFFQRLFGKDN------KPAIARGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVA 56 G+F+ L G N A GPLGL DT LL+ + +P + + Sbjct: 2 GWFKDLLGTSNWQSAAPASESAGGPLGLAQGKAIRFDTTLGLLLDGSTSVRVP-DAQAIW 60 Query: 57 AVSHIDLGGGSQIFRYYTSGDEF-LQINTTGGEDIDDIDDIKLFVYEESYGISKESHWRE 115 + IDLG +++ RYY + +EF +QI+ TG D I+ + LF Y ++ ++ + Sbjct: 61 SAGWIDLGQSNKLHRYYLNDEEFWVQIHVTG---DDQIESVTLFNYVSYVTVNSDAELQR 117 Query: 116 A-INAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQR 174 +G T + + R + +E G E V + E+V N + + +++ +M Y R Sbjct: 118 LAGPNSQIGLPTYRHEGVEYTREWGTER-GQTELVPLTEQVINPDESYT-INHHSMLYAR 175 Query: 175 QVT-EDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHII 218 + D E LL + E+ E S +LG+ + T L I Sbjct: 176 ETGLTDRRELLLFSVEQDE----EGTVSLSTSLGISLYTTDLSTI 216 >UniRef50_Q48FU6 Putative uncharacterized protein n=8 Tax=Pseudomonas RepID=Q48FU6_PSE14 Length = 221 Score = 175 bits (443), Expect = 1e-42, Method: Composition-based stats. Identities = 53/226 (23%), Positives = 99/226 (43%), Gaps = 18/226 (7%) Query: 4 FFQRLFGKDNKPAIAR---------GPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFT 54 +F+R G + A R PLGL LD+ LL+ + + G+E Sbjct: 3 WFKRAMGLEAPKASGRDGVQSVNTVSPLGLASGRMLCLDSSLKLLLDGHSQVVVSGDEK- 61 Query: 55 VAAVSHIDLGGGSQIFRYYTSGDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESH-W 113 V AV +DLG + R+Y +++ + G + +D+ DI LF Y + I+ + Sbjct: 62 VWAVGRVDLGQSMALHRFYLDNEDYFLQVVSNGLNPEDVQDIILFGYYSAEPITSKDELL 121 Query: 114 REAINAKAMGAMTLNWQEKRWQRFFNSEEPGNIEPVYMLEKVENQNHAKWEVHNFTMGYQ 173 R + +G T + ++R + + PG E + E + + + A V + +M Y Sbjct: 122 RLTGPSSKIGLPTYEHDGEVFERQWGT-SPGQTELTPLDEDIVSPDAAYR-VKHLSMLYA 179 Query: 174 RQVT-EDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHII 218 R+ + E+LL + EE E + A+G+ + T ++++ Sbjct: 180 RETGLINRREFLLFSVEEDE----EGSITLTTAVGITLQSTDINVL 221 >UniRef50_C4XQI6 Putative uncharacterized protein n=1 Tax=Desulfovibrio magneticus RS-1 RepID=C4XQI6_DESMR Length = 211 Score = 160 bits (405), Expect = 2e-38, Method: Composition-based stats. Identities = 46/221 (20%), Positives = 76/221 (34%), Gaps = 19/221 (8%) Query: 4 FFQRLFGKDNKPAIARGPLGLHLNSGFTLDTL-AFRLLEDELLIALPGEEFTVAAVSHID 62 FF + DN PA PL L + + +D A R L IA P E V A+S Sbjct: 2 FFSKRPKADNHPAYPNFPLELRIGAILAVDVAEALRFEGLGLTIAPPQGELLVEALSSTS 61 Query: 63 LGGGSQIFRYYTSGDE---FLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWR-EAIN 118 L G ++ R Y E Q N G + D+ F + + + W Sbjct: 62 L-FGLKLVRAYAKQGEATYLFQFNQDGAG---ALLDVSFFRLLQEIRPATAADWGLWLDA 117 Query: 119 AKAMGAMTLNWQ-EKRWQRFFNSEEPGNIEPVYMLEKVE-NQNHAKWEVHNFTMGYQRQV 176 +G LN + + R + + PV E + + + + Y R+V Sbjct: 118 GGLIGGKDLNAPNGQTYLRQWG--DGDYAPPVEAEELLFTDPKDPPRCLAHQMHLYTREV 175 Query: 177 TEDTYEYLLLNGEESFNDLGEPEWLFSRALGVDIPLTSLHI 217 ++ L+ D L +G+D+ + + Sbjct: 176 GDENENMLV------SADTEPEAALVRAWIGLDLTPYGVKV 210 >UniRef50_C5S9X0 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5S9X0_CHRVI Length = 511 Score = 138 bits (347), Expect = 1e-31, Method: Composition-based stats. Identities = 43/207 (20%), Positives = 72/207 (34%), Gaps = 24/207 (11%) Query: 25 HLNSGFTLDTLAFRLLEDELLIALPGEEFT-----VAAVSHIDLGGGSQIFRYYTSG-DE 78 + +D F L E + P V V + G G +R Y SG D Sbjct: 314 RVGMTLPVDPSLFILAEPLTKLQAPRSASGSGLVSVERVGEVR-GEGVTWYRLYVSGGDG 372 Query: 79 FLQINTTGGEDIDDIDDIKLFVYEESYGISKESHWREAIN--AKAMGAMTLN-WQEKRWQ 135 F Q++ D + + F + + W ++ +G + + Sbjct: 373 FFQVHLDAPGQPD---ECRYFSRLDVVEPADADEWGVWLDRDEGLIGWPEFQTQDGQLYA 429 Query: 136 RFFNSEEPGNIEPVYMLEKVENQNH-AKWEVHNFTMGYQRQVTED----TYEYLLLNGEE 190 R + S EP + E +E + M Y R + EYLL+ Sbjct: 430 RLW-SPGQTRREPYSLRETLEAADGTDIEPCRQQAMLYTRATGAQPPMPSTEYLLVAA-- 486 Query: 191 SFNDLGEPEWLFSRALGVDIPLTSLHI 217 N+ G+ W S +G+DIP+ SL++ Sbjct: 487 --NEQGDGAW-VSLHVGIDIPVASLNL 510 >UniRef50_C6BPG4 Putative uncharacterized protein n=2 Tax=Burkholderiaceae RepID=C6BPG4_RALP1 Length = 265 Score = 134 bits (338), Expect = 2e-30, Method: Composition-based stats. Identities = 38/236 (16%), Positives = 74/236 (31%), Gaps = 40/236 (16%) Query: 19 RGPLGLHLNSGFTLDTLAFRLLEDELLIALPGEEFTVAAVSHIDLGGGS--QIFRYYTS- 75 P G + S + LL LL + + A S + L G +FR YT Sbjct: 32 GLPFGARIGSLLEVPRTQIALLTGSLLTLPKSAQMPIVAASRVRLDGADDIALFRLYTDT 91 Query: 76 -------GDEFLQINTTGGEDIDDIDDIKLFVYEESYGISKESHW--------------- 113 G +LQ+ G ++D+I D+ + + + + Sbjct: 92 GLDRSGAGASYLQVLCAQG-NVDEIRDLAYYQFLDRTFPITDEEQAPFRGEGFGLGQTDF 150 Query: 114 ---REAINAKAMGAMTLN-----WQEKRWQRFFNSEEPGNIEPVYMLEKVENQ--NHAKW 163 E + A L R+ R + ++P E + Sbjct: 151 EMGDEQLANIPQVAPQLAALLGGADSLRFVRD--TPGGDYVKPFQAEETRMDDPIGEEGM 208 Query: 164 EVHNFTMGYQRQVTEDTYEYLLLNGEESFNDLGEPEWL--FSRALGVDIPLTSLHI 217 + M Y R + + E LL++ + + G+P +G+ + + + + Sbjct: 209 QKRQSFMPYVRALADGKQERLLISFDNVLSMDGKPTRAAYVDYLVGLALDRSKVKV 264 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.126 0.311 Lambda K H 0.267 0.0387 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,055,023,493 Number of Sequences: 3077464 Number of extensions: 36148554 Number of successful extensions: 101827 Number of sequences better than 1.0e-01: 14 Number of HSP's better than 0.1 without gapping: 29 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 101702 Number of HSP's gapped (non-prelim): 37 length of query: 219 length of database: 1,040,396,356 effective HSP length: 124 effective length of query: 95 effective length of database: 658,790,820 effective search space: 62585127900 effective search space used: 62585127900 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 90 (39.4 bits)