BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (149 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P64552 Uncharacterized protein ygaW n=178 Tax=Gammaprot... 304 5e-82 UniRef50_B7LVR8 Putative inner membrane protein n=23 Tax=Gammapr... 248 4e-65 UniRef50_A8G9R7 Putative uncharacterized protein n=3 Tax=Gammapr... 191 8e-48 UniRef50_Q2NVW1 Putative uncharacterized protein n=1 Tax=Sodalis... 103 2e-21 UniRef50_Q3IW22 Putative membrane protein n=1 Tax=Rhodobacter sp... 89 5e-17 UniRef50_C8Q393 Putative uncharacterized protein n=1 Tax=Pantoea... 81 1e-14 UniRef50_A4EHU8 Putative uncharacterized protein n=1 Tax=Roseoba... 67 2e-10 >UniRef50_P64552 Uncharacterized protein ygaW n=178 Tax=Gammaproteobacteria RepID=YGAW_ECO57 Length = 149 Score = 304 bits (779), Expect = 5e-82, Method: Compositional matrix adjust. Identities = 149/149 (100%), Positives = 149/149 (100%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG Sbjct: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 Query: 61 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV 120 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV Sbjct: 61 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV 120 Query: 121 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA 149 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA Sbjct: 121 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA 149 >UniRef50_B7LVR8 Putative inner membrane protein n=23 Tax=Gammaproteobacteria RepID=B7LVR8_ESCF3 Length = 179 Score = 248 bits (634), Expect = 4e-65, Method: Compositional matrix adjust. Identities = 118/149 (79%), Positives = 133/149 (89%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 M S QSR R+A+ADTFAMVVYC+VVNM IE+FLSGMSFEQS SRLVAIPVNILIA PYG Sbjct: 31 MLSAQSRWRNAMADTFAMVVYCTVVNMLIEIFLSGMSFEQSLSSRLVAIPVNILIACPYG 90 Query: 61 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV 120 +YRD FMR ARK++ GW K +ADILAYVTFQSPVY AIL V+GADWHQI+AAVSSN+V+ Sbjct: 91 IYRDFFMRQARKMNNKGWTKTIADILAYVTFQSPVYAAILWVIGADWHQIVAAVSSNMVI 150 Query: 121 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA 149 SM+MGAVYGYFLDYCRRLFKV+ YQQ+KA Sbjct: 151 SMMMGAVYGYFLDYCRRLFKVTSYQQLKA 179 >UniRef50_A8G9R7 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=A8G9R7_SERP5 Length = 146 Score = 191 bits (484), Expect = 8e-48, Method: Compositional matrix adjust. Identities = 94/145 (64%), Positives = 114/145 (78%), Gaps = 1/145 (0%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 M SP + R A ADTFA+VVYC + M IE+ +SGMSF+QS SRL++IPVNILIAWPYG Sbjct: 1 MLSPVAHWRSAAADTFALVVYCFIAGMAIEILVSGMSFQQSLSSRLLSIPVNILIAWPYG 60 Query: 61 MYRDLFMRAARKVSPSGW-IKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIV 119 YRDLF+R AR+ + ++NLAD+LAYV+FQSPVY AIL VGAD QI+ AV+SN V Sbjct: 61 RYRDLFIRVARRFQRQPFLLRNLADLLAYVSFQSPVYAAILWSVGADGQQIITAVTSNAV 120 Query: 120 VSMLMGAVYGYFLDYCRRLFKVSRY 144 VSM MG VYGYFL+YCRRLF+V+ Y Sbjct: 121 VSMAMGVVYGYFLEYCRRLFRVAGY 145 >UniRef50_Q2NVW1 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVW1_SODGM Length = 126 Score = 103 bits (256), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 51/93 (54%), Positives = 63/93 (67%), Gaps = 1/93 (1%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 M S SRLR A ADTFA+VVYC M IE+ LSGMS +QS SR++AIPVN++IAWP+G Sbjct: 8 MSSVTSRLRSATADTFALVVYCFFTGMAIEILLSGMSLQQSLSSRVLAIPVNVIIAWPFG 67 Query: 61 MYRDLFMRAARKVSPSG-WIKNLADILAYVTFQ 92 YRD + A + P W +NLAD+LA F Sbjct: 68 QYRDAVVSLASRHGPKQFWTRNLADLLAMSVFN 100 >UniRef50_Q3IW22 Putative membrane protein n=1 Tax=Rhodobacter sphaeroides 2.4.1 RepID=Q3IW22_RHOS4 Length = 135 Score = 88.6 bits (218), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 46/135 (34%), Positives = 79/135 (58%), Gaps = 2/135 (1%) Query: 8 LRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRDLFM 67 +R + DT A V++ + V E+ ++GM+ + +RL+ +PV +L PY +RD + Sbjct: 1 MRLFIIDTVATVIFFTAVATFSELLIAGMAPSEVLATRLLMVPVMVLTGRPYTRWRDWLV 60 Query: 68 RAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLMGAV 127 R R + W L DILA+++FQ+PVY A LL+ GA + + A+ S I++ +L+ Sbjct: 61 R--RTAPRNRWSAFLTDILAFLSFQAPVYGATLLIAGASFAEAGTAIGSAIILMILLARP 118 Query: 128 YGYFLDYCRRLFKVS 142 +G F+++ R LF V Sbjct: 119 FGLFVEWTRSLFGVE 133 >UniRef50_C8Q393 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q393_9ENTR Length = 167 Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 42/137 (30%), Positives = 79/137 (57%), Gaps = 3/137 (2%) Query: 5 QSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRD 64 + R +ADT A++++ + + E ++GMS++Q ++RL+ + I +A PYG++RD Sbjct: 22 KKRGTEFLADTVALILFFTTTGIINERMIAGMSWDQVLHARLIGAALMIPVARPYGIWRD 81 Query: 65 LFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLM 124 M ++ +PS + L D +A V+FQ P+Y AI+ GA ++ ++ + + Sbjct: 82 WLM---QRANPSRGSQLLWDSMALVSFQVPIYAAIIAFSGATGGGLVRGTLGAALMMLFL 138 Query: 125 GAVYGYFLDYCRRLFKV 141 G YG FL++ R+LF + Sbjct: 139 GRPYGAFLNWVRKLFGL 155 >UniRef50_A4EHU8 Putative uncharacterized protein n=1 Tax=Roseobacter sp. CCS2 RepID=A4EHU8_9RHOB Length = 146 Score = 66.6 bits (161), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 37/137 (27%), Positives = 71/137 (51%) Query: 9 RHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRDLFMR 68 R +AD A++ + ++ + E F++GM +++ +RL+ P+ IL A PYG++RD + Sbjct: 10 RVWLADNIALLSFFTLTGVLNERFIAGMEWDEVLVARLIGAPLMILTARPYGIWRDWVLL 69 Query: 69 AARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLMGAVY 128 + + D LA +FQ P+Y I+ + GA +++ + ++ ++ G Y Sbjct: 70 KSNALQSGRAKLFFFDTLALFSFQVPIYAMIIWLGGAAGATLVSGIIGAAIIMLICGRPY 129 Query: 129 GYFLDYCRRLFKVSRYQ 145 G +LD R VS + Sbjct: 130 GLWLDTVRIWMGVSTVE 146 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7LVR8 Putative inner membrane protein n=23 Tax=Gammapr... 209 2e-53 UniRef50_P64552 Uncharacterized protein ygaW n=178 Tax=Gammaprot... 199 3e-50 UniRef50_A8G9R7 Putative uncharacterized protein n=3 Tax=Gammapr... 177 8e-44 UniRef50_C8Q393 Putative uncharacterized protein n=1 Tax=Pantoea... 167 1e-40 UniRef50_A4EHU8 Putative uncharacterized protein n=1 Tax=Roseoba... 162 2e-39 UniRef50_Q3IW22 Putative membrane protein n=1 Tax=Rhodobacter sp... 143 2e-33 UniRef50_Q2NVW1 Putative uncharacterized protein n=1 Tax=Sodalis... 108 7e-23 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_B7LVR8 Putative inner membrane protein n=23 Tax=Gammaproteobacteria RepID=B7LVR8_ESCF3 Length = 179 Score = 209 bits (533), Expect = 2e-53, Method: Composition-based stats. Identities = 118/149 (79%), Positives = 133/149 (89%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 M S QSR R+A+ADTFAMVVYC+VVNM IE+FLSGMSFEQS SRLVAIPVNILIA PYG Sbjct: 31 MLSAQSRWRNAMADTFAMVVYCTVVNMLIEIFLSGMSFEQSLSSRLVAIPVNILIACPYG 90 Query: 61 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV 120 +YRD FMR ARK++ GW K +ADILAYVTFQSPVY AIL V+GADWHQI+AAVSSN+V+ Sbjct: 91 IYRDFFMRQARKMNNKGWTKTIADILAYVTFQSPVYAAILWVIGADWHQIVAAVSSNMVI 150 Query: 121 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA 149 SM+MGAVYGYFLDYCRRLFKV+ YQQ+KA Sbjct: 151 SMMMGAVYGYFLDYCRRLFKVTSYQQLKA 179 >UniRef50_P64552 Uncharacterized protein ygaW n=178 Tax=Gammaproteobacteria RepID=YGAW_ECO57 Length = 149 Score = 199 bits (505), Expect = 3e-50, Method: Composition-based stats. Identities = 149/149 (100%), Positives = 149/149 (100%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG Sbjct: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 Query: 61 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV 120 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV Sbjct: 61 MYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVV 120 Query: 121 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA 149 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA Sbjct: 121 SMLMGAVYGYFLDYCRRLFKVSRYQQVKA 149 >UniRef50_A8G9R7 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=A8G9R7_SERP5 Length = 146 Score = 177 bits (449), Expect = 8e-44, Method: Composition-based stats. Identities = 94/145 (64%), Positives = 114/145 (78%), Gaps = 1/145 (0%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 M SP + R A ADTFA+VVYC + M IE+ +SGMSF+QS SRL++IPVNILIAWPYG Sbjct: 1 MLSPVAHWRSAAADTFALVVYCFIAGMAIEILVSGMSFQQSLSSRLLSIPVNILIAWPYG 60 Query: 61 MYRDLFMRAARKVSPSGW-IKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIV 119 YRDLF+R AR+ + ++NLAD+LAYV+FQSPVY AIL VGAD QI+ AV+SN V Sbjct: 61 RYRDLFIRVARRFQRQPFLLRNLADLLAYVSFQSPVYAAILWSVGADGQQIITAVTSNAV 120 Query: 120 VSMLMGAVYGYFLDYCRRLFKVSRY 144 VSM MG VYGYFL+YCRRLF+V+ Y Sbjct: 121 VSMAMGVVYGYFLEYCRRLFRVAGY 145 >UniRef50_C8Q393 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8Q393_9ENTR Length = 167 Score = 167 bits (423), Expect = 1e-40, Method: Composition-based stats. Identities = 42/137 (30%), Positives = 79/137 (57%), Gaps = 3/137 (2%) Query: 5 QSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRD 64 + R +ADT A++++ + + E ++GMS++Q ++RL+ + I +A PYG++RD Sbjct: 22 KKRGTEFLADTVALILFFTTTGIINERMIAGMSWDQVLHARLIGAALMIPVARPYGIWRD 81 Query: 65 LFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLM 124 M ++ +PS + L D +A V+FQ P+Y AI+ GA ++ ++ + + Sbjct: 82 WLM---QRANPSRGSQLLWDSMALVSFQVPIYAAIIAFSGATGGGLVRGTLGAALMMLFL 138 Query: 125 GAVYGYFLDYCRRLFKV 141 G YG FL++ R+LF + Sbjct: 139 GRPYGAFLNWVRKLFGL 155 >UniRef50_A4EHU8 Putative uncharacterized protein n=1 Tax=Roseobacter sp. CCS2 RepID=A4EHU8_9RHOB Length = 146 Score = 162 bits (411), Expect = 2e-39, Method: Composition-based stats. Identities = 37/137 (27%), Positives = 71/137 (51%) Query: 9 RHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRDLFMR 68 R +AD A++ + ++ + E F++GM +++ +RL+ P+ IL A PYG++RD + Sbjct: 10 RVWLADNIALLSFFTLTGVLNERFIAGMEWDEVLVARLIGAPLMILTARPYGIWRDWVLL 69 Query: 69 AARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLMGAVY 128 + + D LA +FQ P+Y I+ + GA +++ + ++ ++ G Y Sbjct: 70 KSNALQSGRAKLFFFDTLALFSFQVPIYAMIIWLGGAAGATLVSGIIGAAIIMLICGRPY 129 Query: 129 GYFLDYCRRLFKVSRYQ 145 G +LD R VS + Sbjct: 130 GLWLDTVRIWMGVSTVE 146 >UniRef50_Q3IW22 Putative membrane protein n=1 Tax=Rhodobacter sphaeroides 2.4.1 RepID=Q3IW22_RHOS4 Length = 135 Score = 143 bits (361), Expect = 2e-33, Method: Composition-based stats. Identities = 46/135 (34%), Positives = 79/135 (58%), Gaps = 2/135 (1%) Query: 8 LRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRDLFM 67 +R + DT A V++ + V E+ ++GM+ + +RL+ +PV +L PY +RD + Sbjct: 1 MRLFIIDTVATVIFFTAVATFSELLIAGMAPSEVLATRLLMVPVMVLTGRPYTRWRDWLV 60 Query: 68 RAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLMGAV 127 R R + W L DILA+++FQ+PVY A LL+ GA + + A+ S I++ +L+ Sbjct: 61 R--RTAPRNRWSAFLTDILAFLSFQAPVYGATLLIAGASFAEAGTAIGSAIILMILLARP 118 Query: 128 YGYFLDYCRRLFKVS 142 +G F+++ R LF V Sbjct: 119 FGLFVEWTRSLFGVE 133 >UniRef50_Q2NVW1 Putative uncharacterized protein n=1 Tax=Sodalis glossinidius str. 'morsitans' RepID=Q2NVW1_SODGM Length = 126 Score = 108 bits (269), Expect = 7e-23, Method: Composition-based stats. Identities = 51/93 (54%), Positives = 63/93 (67%), Gaps = 1/93 (1%) Query: 1 MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYG 60 M S SRLR A ADTFA+VVYC M IE+ LSGMS +QS SR++AIPVN++IAWP+G Sbjct: 8 MSSVTSRLRSATADTFALVVYCFFTGMAIEILLSGMSLQQSLSSRVLAIPVNVIIAWPFG 67 Query: 61 MYRDLFMRAARKVSPSG-WIKNLADILAYVTFQ 92 YRD + A + P W +NLAD+LA F Sbjct: 68 QYRDAVVSLASRHGPKQFWTRNLADLLAMSVFN 100 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.320 0.139 0.384 Lambda K H 0.267 0.0424 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 547,524,488 Number of Sequences: 3077464 Number of extensions: 19430093 Number of successful extensions: 76371 Number of sequences better than 1.0e-01: 8 Number of HSP's better than 0.1 without gapping: 14 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 76331 Number of HSP's gapped (non-prelim): 27 length of query: 149 length of database: 1,040,396,356 effective HSP length: 112 effective length of query: 37 effective length of database: 695,720,388 effective search space: 25741654356 effective search space used: 25741654356 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.3 bits) S2: 87 (38.1 bits)