BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (171 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P75818 Uncharacterized lipoprotein ybjP n=94 Tax=Entero... 355 3e-97 UniRef50_A4W8N2 Lipoprotein n=14 Tax=Enterobacteriaceae RepID=A4... 291 4e-78 UniRef50_A4TN56 Lipoprotein n=36 Tax=Enterobacteriaceae RepID=A4... 120 2e-26 UniRef50_A7HVC0 Putative uncharacterized protein n=1 Tax=Parviba... 43 0.004 >UniRef50_P75818 Uncharacterized lipoprotein ybjP n=94 Tax=Enterobacteriaceae RepID=YBJP_ECOLI Length = 171 Score = 355 bits (911), Expect = 3e-97, Method: Compositional matrix adjust. Identities = 171/171 (100%), Positives = 171/171 (100%) Query: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI 60 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI Sbjct: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI 60 Query: 61 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP 120 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP Sbjct: 61 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP 120 Query: 121 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR Sbjct: 121 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 >UniRef50_A4W8N2 Lipoprotein n=14 Tax=Enterobacteriaceae RepID=A4W8N2_ENT38 Length = 172 Score = 291 bits (746), Expect = 4e-78, Method: Compositional matrix adjust. Identities = 142/172 (82%), Positives = 152/172 (88%), Gaps = 1/172 (0%) Query: 1 MRYSKLTMLIPCALLLSACTT-VTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSND 59 MRYS LT+L+PCALLLSACTT VTPA+KD GTRSGPC++GGPD VAQQFYDYRI HR+ND Sbjct: 1 MRYSALTLLVPCALLLSACTTPVTPAFKDIGTRSGPCIDGGPDVVAQQFYDYRIQHRNND 60 Query: 60 ITALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNI 119 ITALRPYLSD LA LLSDA+RD H LL +DPFSSRTT PDSA VASASTIPN DARNI Sbjct: 61 ITALRPYLSDNLAKLLSDATRDPQHNALLQSDPFSSRTTPPDSAKVASASTIPNTDARNI 120 Query: 120 PLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 PLRV L QGDQ WQDEVLMI+EGQCW IDDVRY+GGSVHA AGTLRQSIENR Sbjct: 121 PLRVKLTQGDQSWQDEVLMIREGQCWAIDDVRYIGGSVHAPAGTLRQSIENR 172 >UniRef50_A4TN56 Lipoprotein n=36 Tax=Enterobacteriaceae RepID=A4TN56_YERPP Length = 191 Score = 120 bits (300), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 79/170 (46%), Positives = 101/170 (59%), Gaps = 23/170 (13%) Query: 22 VTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRI--------LHRSNDITALRPYLSDKLAT 73 V P ++ +RS PC+EGGPD VAQ+FYD RI L N RPYLS L Sbjct: 23 VNPVFEATSSRSSPCIEGGPDTVAQKFYDLRIQQIGGQQGLPDDNLSAQFRPYLSQSLYN 82 Query: 74 LLSDASRDNNHR--------ELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDL 125 + A + ++R ++++ D F+S SA VASASTIPN DARNIPLRV+L Sbjct: 83 DIQAARKQASNRTPAQVNKTQMISGDIFTSLREGSTSASVASASTIPNTDARNIPLRVNL 142 Query: 126 K-QGDQG----WQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIEN 170 Q G WQDEVLMI+EG CWV+DD+R++G V A A +LRQ + N Sbjct: 143 SHQMADGKAVMWQDEVLMIREGTCWVVDDIRFMG--VSAPASSLRQLLGN 190 >UniRef50_A7HVC0 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVC0_PARL1 Length = 181 Score = 43.1 bits (100), Expect = 0.004, Method: Compositional matrix adjust. Identities = 44/186 (23%), Positives = 73/186 (39%), Gaps = 31/186 (16%) Query: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI 60 MR + +L +L+AC P E A FYD + RS+ + Sbjct: 1 MRTLRPLLLFALGFMLAACGEAKP-------------EADALRAAAVFYDIVLSARSSGV 47 Query: 61 ------TALRPYLSDKLATLLSDASR-DNNHRELLTND--PFSSRTTLPDSAHVASASTI 111 LRP +S L +LLS A+ + H E + N P+ A+A I Sbjct: 48 PDADMRARLRPVISSDLDSLLSQAAEAERRHTERVNNSEPPYLQGDIFSSLFEGATAYEI 107 Query: 112 PNRDARNIPLRVDLKQGDQG-----WQDEVLMIQEG----QCWVIDDVRYLGGSVHATAG 162 D ++ + + W D ++++ G + W++DD+ Y G A+ G Sbjct: 108 GTCDGDERRMQCEAMLAHEAEEPVQWTDRLVLVANGGPEDRRWLVDDILYGGDWDFASKG 167 Query: 163 TLRQSI 168 TL+ S+ Sbjct: 168 TLKSSL 173 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75818 Uncharacterized lipoprotein ybjP n=94 Tax=Entero... 314 8e-85 UniRef50_A4W8N2 Lipoprotein n=14 Tax=Enterobacteriaceae RepID=A4... 303 1e-81 UniRef50_A4TN56 Lipoprotein n=36 Tax=Enterobacteriaceae RepID=A4... 205 5e-52 Sequences not found previously or not previously below threshold: UniRef50_A7HVC0 Putative uncharacterized protein n=1 Tax=Parviba... 55 8e-07 UniRef50_B7LL23 Putative uncharacterized protein n=5 Tax=Escheri... 47 3e-04 UniRef50_Q1H0J5 Putative uncharacterized protein n=1 Tax=Methylo... 45 0.001 >UniRef50_P75818 Uncharacterized lipoprotein ybjP n=94 Tax=Enterobacteriaceae RepID=YBJP_ECOLI Length = 171 Score = 314 bits (804), Expect = 8e-85, Method: Composition-based stats. Identities = 171/171 (100%), Positives = 171/171 (100%) Query: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI 60 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI Sbjct: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI 60 Query: 61 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP 120 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP Sbjct: 61 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP 120 Query: 121 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR Sbjct: 121 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 >UniRef50_A4W8N2 Lipoprotein n=14 Tax=Enterobacteriaceae RepID=A4W8N2_ENT38 Length = 172 Score = 303 bits (777), Expect = 1e-81, Method: Composition-based stats. Identities = 142/172 (82%), Positives = 152/172 (88%), Gaps = 1/172 (0%) Query: 1 MRYSKLTMLIPCALLLSACTT-VTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSND 59 MRYS LT+L+PCALLLSACTT VTPA+KD GTRSGPC++GGPD VAQQFYDYRI HR+ND Sbjct: 1 MRYSALTLLVPCALLLSACTTPVTPAFKDIGTRSGPCIDGGPDVVAQQFYDYRIQHRNND 60 Query: 60 ITALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNI 119 ITALRPYLSD LA LLSDA+RD H LL +DPFSSRTT PDSA VASASTIPN DARNI Sbjct: 61 ITALRPYLSDNLAKLLSDATRDPQHNALLQSDPFSSRTTPPDSAKVASASTIPNTDARNI 120 Query: 120 PLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 PLRV L QGDQ WQDEVLMI+EGQCW IDDVRY+GGSVHA AGTLRQSIENR Sbjct: 121 PLRVKLTQGDQSWQDEVLMIREGQCWAIDDVRYIGGSVHAPAGTLRQSIENR 172 >UniRef50_A4TN56 Lipoprotein n=36 Tax=Enterobacteriaceae RepID=A4TN56_YERPP Length = 191 Score = 205 bits (522), Expect = 5e-52, Method: Composition-based stats. Identities = 79/170 (46%), Positives = 101/170 (59%), Gaps = 23/170 (13%) Query: 22 VTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRI--------LHRSNDITALRPYLSDKLAT 73 V P ++ +RS PC+EGGPD VAQ+FYD RI L N RPYLS L Sbjct: 23 VNPVFEATSSRSSPCIEGGPDTVAQKFYDLRIQQIGGQQGLPDDNLSAQFRPYLSQSLYN 82 Query: 74 LLSDASRDNNHR--------ELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDL 125 + A + ++R ++++ D F+S SA VASASTIPN DARNIPLRV+L Sbjct: 83 DIQAARKQASNRTPAQVNKTQMISGDIFTSLREGSTSASVASASTIPNTDARNIPLRVNL 142 Query: 126 K-QGDQG----WQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIEN 170 Q G WQDEVLMI+EG CWV+DD+R++G V A A +LRQ + N Sbjct: 143 SHQMADGKAVMWQDEVLMIREGTCWVVDDIRFMG--VSAPASSLRQLLGN 190 >UniRef50_A7HVC0 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVC0_PARL1 Length = 181 Score = 55.5 bits (132), Expect = 8e-07, Method: Composition-based stats. Identities = 43/190 (22%), Positives = 75/190 (39%), Gaps = 39/190 (20%) Query: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRI------L 54 MR + +L +L+AC P + + R+ A FYD + + Sbjct: 1 MRTLRPLLLFALGFMLAACGEAKP--EADALRA-----------AAVFYDIVLSARSSGV 47 Query: 55 HRSNDITALRPYLSDKLATLLSDA---------SRDNNHRELLTNDPFSSRTTLPDSAHV 105 ++ LRP +S L +LLS A +N+ L D FSS + + Sbjct: 48 PDADMRARLRPVISSDLDSLLSQAAEAERRHTERVNNSEPPYLQGDIFSSLFEGATAYEI 107 Query: 106 ASASTIPNRDARNIPLRVDLKQGDQ---GWQDEVLMIQEG----QCWVIDDVRYLGGSVH 158 + + D R + L + W D ++++ G + W++DD+ Y G Sbjct: 108 GTC----DGDERRMQCEAMLAHEAEEPVQWTDRLVLVANGGPEDRRWLVDDILYGGDWDF 163 Query: 159 ATAGTLRQSI 168 A+ GTL+ S+ Sbjct: 164 ASKGTLKSSL 173 >UniRef50_B7LL23 Putative uncharacterized protein n=5 Tax=Escherichia RepID=B7LL23_ESCF3 Length = 206 Score = 47.0 bits (110), Expect = 3e-04, Method: Composition-based stats. Identities = 33/144 (22%), Positives = 51/144 (35%), Gaps = 26/144 (18%) Query: 33 SGPCVEGGPDNVAQQFY------DYRILHRSNDITALRPYLSDKLATLLSDASRDNNHRE 86 +G C PD VA +FY D + + + YL+ L L+ A N Sbjct: 41 TGACPAQTPDGVADKFYSTYVFSDTSFKSEKDQLAFFQKYLTPSLYQLIVGAYDRNKRDY 100 Query: 87 LL---------TNDPFSSRTTLPDSAH------VASASTIPNRDARNIPLRVDLKQGD-Q 130 + F+S DS + +S P + + L D + Sbjct: 101 AIDPTAKPTFGDGIIFTSY--PSDSYYEKFDGVTLPSSYKPGDNNVTVALHFHFTVDDKK 158 Query: 131 GWQDEVLMIQEGQ--CWVIDDVRY 152 WQDE LM + CW ID++ + Sbjct: 159 AWQDEALMARSADDGCWRIDNIIF 182 >UniRef50_Q1H0J5 Putative uncharacterized protein n=1 Tax=Methylobacillus flagellatus KT RepID=Q1H0J5_METFK Length = 181 Score = 44.7 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 45/121 (37%), Gaps = 7/121 (5%) Query: 53 ILHRSNDITALRPYLSDKLATLLSDASRDNNHRELL---TNDPFSSRTTLPDSAHVASAS 109 + P L L + R + + + D F+S S + Sbjct: 56 MQQLDAVSRHFIPKLHRIFYISLREQHRCRTRNKPIPWSSGDLFTSNDAGYTSFTIEP-- 113 Query: 110 TIPNRDARNIPLRVDLKQGD--QGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQS 167 +IPN+ R + L+Q Q W DEV++ +E W+I D+ Y AG QS Sbjct: 114 SIPNQFGRQSTVHFSLEQNGKVQRWSDEVILHKENSQWMIYDIEYHAPFAGQKAGKSLQS 173 Query: 168 I 168 I Sbjct: 174 I 174 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P75818 Uncharacterized lipoprotein ybjP n=94 Tax=Entero... 259 3e-68 UniRef50_A4W8N2 Lipoprotein n=14 Tax=Enterobacteriaceae RepID=A4... 252 5e-66 UniRef50_A4TN56 Lipoprotein n=36 Tax=Enterobacteriaceae RepID=A4... 172 3e-42 UniRef50_A7HVC0 Putative uncharacterized protein n=1 Tax=Parviba... 164 8e-40 UniRef50_Q1H0J5 Putative uncharacterized protein n=1 Tax=Methylo... 142 5e-33 UniRef50_B7LL23 Putative uncharacterized protein n=5 Tax=Escheri... 136 3e-31 Sequences not found previously or not previously below threshold: UniRef50_C1AAU1 Putative uncharacterized protein n=1 Tax=Gemmati... 58 1e-07 UniRef50_C2FTK6 Putative uncharacterized protein n=2 Tax=Sphingo... 56 6e-07 UniRef50_C0YS39 Putative uncharacterized protein n=1 Tax=Chryseo... 52 7e-06 UniRef50_B2FK98 Putative uncharacterized protein n=2 Tax=Stenotr... 47 2e-04 UniRef50_UPI00017455D6 hypothetical protein VspiD_12630 n=1 Tax=... 45 0.001 UniRef50_B4SR97 Putative uncharacterized protein n=1 Tax=Stenotr... 44 0.002 >UniRef50_P75818 Uncharacterized lipoprotein ybjP n=94 Tax=Enterobacteriaceae RepID=YBJP_ECOLI Length = 171 Score = 259 bits (661), Expect = 3e-68, Method: Composition-based stats. Identities = 171/171 (100%), Positives = 171/171 (100%) Query: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI 60 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI Sbjct: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDI 60 Query: 61 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP 120 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP Sbjct: 61 TALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIP 120 Query: 121 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR Sbjct: 121 LRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 >UniRef50_A4W8N2 Lipoprotein n=14 Tax=Enterobacteriaceae RepID=A4W8N2_ENT38 Length = 172 Score = 252 bits (642), Expect = 5e-66, Method: Composition-based stats. Identities = 142/172 (82%), Positives = 152/172 (88%), Gaps = 1/172 (0%) Query: 1 MRYSKLTMLIPCALLLSACTT-VTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSND 59 MRYS LT+L+PCALLLSACTT VTPA+KD GTRSGPC++GGPD VAQQFYDYRI HR+ND Sbjct: 1 MRYSALTLLVPCALLLSACTTPVTPAFKDIGTRSGPCIDGGPDVVAQQFYDYRIQHRNND 60 Query: 60 ITALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNI 119 ITALRPYLSD LA LLSDA+RD H LL +DPFSSRTT PDSA VASASTIPN DARNI Sbjct: 61 ITALRPYLSDNLAKLLSDATRDPQHNALLQSDPFSSRTTPPDSAKVASASTIPNTDARNI 120 Query: 120 PLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR 171 PLRV L QGDQ WQDEVLMI+EGQCW IDDVRY+GGSVHA AGTLRQSIENR Sbjct: 121 PLRVKLTQGDQSWQDEVLMIREGQCWAIDDVRYIGGSVHAPAGTLRQSIENR 172 >UniRef50_A4TN56 Lipoprotein n=36 Tax=Enterobacteriaceae RepID=A4TN56_YERPP Length = 191 Score = 172 bits (436), Expect = 3e-42, Method: Composition-based stats. Identities = 76/170 (44%), Positives = 99/170 (58%), Gaps = 23/170 (13%) Query: 22 VTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRIL--------HRSNDITALRPYLSDKLAT 73 V P ++ +RS PC+EGGPD VAQ+FYD RI N RPYLS L Sbjct: 23 VNPVFEATSSRSSPCIEGGPDTVAQKFYDLRIQQIGGQQGLPDDNLSAQFRPYLSQSLYN 82 Query: 74 LLSDASRDNNHR--------ELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDL 125 + A + ++R ++++ D F+S SA VASASTIPN DARNIPLRV+L Sbjct: 83 DIQAARKQASNRTPAQVNKTQMISGDIFTSLREGSTSASVASASTIPNTDARNIPLRVNL 142 Query: 126 KQ---GDQG--WQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIEN 170 + WQDEVLMI+EG CWV+DD+R++G V A A +LRQ + N Sbjct: 143 SHQMADGKAVMWQDEVLMIREGTCWVVDDIRFMG--VSAPASSLRQLLGN 190 >UniRef50_A7HVC0 Putative uncharacterized protein n=1 Tax=Parvibaculum lavamentivorans DS-1 RepID=A7HVC0_PARL1 Length = 181 Score = 164 bits (416), Expect = 8e-40, Method: Composition-based stats. Identities = 43/190 (22%), Positives = 75/190 (39%), Gaps = 39/190 (20%) Query: 1 MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRI------L 54 MR + +L +L+AC P + + R+ A FYD + + Sbjct: 1 MRTLRPLLLFALGFMLAACGEAKP--EADALRA-----------AAVFYDIVLSARSSGV 47 Query: 55 HRSNDITALRPYLSDKLATLLSDA---------SRDNNHRELLTNDPFSSRTTLPDSAHV 105 ++ LRP +S L +LLS A +N+ L D FSS + + Sbjct: 48 PDADMRARLRPVISSDLDSLLSQAAEAERRHTERVNNSEPPYLQGDIFSSLFEGATAYEI 107 Query: 106 ASASTIPNRDARNIPLRVDLKQGDQ---GWQDEVLMIQEG----QCWVIDDVRYLGGSVH 158 + + D R + L + W D ++++ G + W++DD+ Y G Sbjct: 108 GTC----DGDERRMQCEAMLAHEAEEPVQWTDRLVLVANGGPEDRRWLVDDILYGGDWDF 163 Query: 159 ATAGTLRQSI 168 A+ GTL+ S+ Sbjct: 164 ASKGTLKSSL 173 >UniRef50_Q1H0J5 Putative uncharacterized protein n=1 Tax=Methylobacillus flagellatus KT RepID=Q1H0J5_METFK Length = 181 Score = 142 bits (357), Expect = 5e-33, Method: Composition-based stats. Identities = 29/121 (23%), Positives = 45/121 (37%), Gaps = 7/121 (5%) Query: 53 ILHRSNDITALRPYLSDKLATLLSDASRDNNHRELL---TNDPFSSRTTLPDSAHVASAS 109 + P L L + R + + + D F+S S + Sbjct: 56 MQQLDAVSRHFIPKLHRIFYISLREQHRCRTRNKPIPWSSGDLFTSNDAGYTSFTIEP-- 113 Query: 110 TIPNRDARNIPLRVDLKQGD--QGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQS 167 +IPN+ R + L+Q Q W DEV++ +E W+I D+ Y AG QS Sbjct: 114 SIPNQFGRQSTVHFSLEQNGKVQRWSDEVILHKENSQWMIYDIEYHAPFAGQKAGKSLQS 173 Query: 168 I 168 I Sbjct: 174 I 174 >UniRef50_B7LL23 Putative uncharacterized protein n=5 Tax=Escherichia RepID=B7LL23_ESCF3 Length = 206 Score = 136 bits (342), Expect = 3e-31, Method: Composition-based stats. Identities = 33/151 (21%), Positives = 53/151 (35%), Gaps = 26/151 (17%) Query: 33 SGPCVEGGPDNVAQQFY------DYRILHRSNDITALRPYLSDKLATLLSDASRDNNHRE 86 +G C PD VA +FY D + + + YL+ L L+ A N Sbjct: 41 TGACPAQTPDGVADKFYSTYVFSDTSFKSEKDQLAFFQKYLTPSLYQLIVGAYDRNKRDY 100 Query: 87 LL---------TNDPFSSRTTLPDSAH------VASASTIPNRDARNIPLRVDLKQGD-Q 130 + F+S DS + +S P + + L D + Sbjct: 101 AIDPTAKPTFGDGIIFTSY--PSDSYYEKFDGVTLPSSYKPGDNNVTVALHFHFTVDDKK 158 Query: 131 GWQDEVLMIQEGQ--CWVIDDVRYLGGSVHA 159 WQDE LM + CW ID++ + ++ Sbjct: 159 AWQDEALMARSADDGCWRIDNIIFHEDEDNS 189 >UniRef50_C1AAU1 Putative uncharacterized protein n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1AAU1_GEMAT Length = 160 Score = 57.9 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 31/150 (20%), Positives = 51/150 (34%), Gaps = 15/150 (10%) Query: 34 GPCVEGGPDNVAQQFY------DYRILHRSNDITALRPYLSDKLATLLSDASRDNN---- 83 P P A Q Y R + + + ALRP+L+D LA L+ A + Sbjct: 7 APKGSTTPSQTAMQLYAMLDALGVRGVPDEDALRALRPFLTDSLADALAHADAERRVAVQ 66 Query: 84 -----HRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDLKQGDQGWQDEVLM 138 DPFSS A + + + ++ W+D +++ Sbjct: 67 EAPDDKPPFADGDPFSSLFEGRTEARPDTVVMRGDTALVVMAFSNKTQRPAVNWRDTIVV 126 Query: 139 IQEGQCWVIDDVRYLGGSVHATAGTLRQSI 168 V+ D+RY G G L + + Sbjct: 127 TPFNGRLVVADIRYGAGWEFGFTGRLLEVL 156 >UniRef50_C2FTK6 Putative uncharacterized protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FTK6_9SPHI Length = 167 Score = 55.6 bits (132), Expect = 6e-07, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 46/120 (38%), Gaps = 18/120 (15%) Query: 64 RPYLSDKLATLLSDASRDNNHRE--------------LLTNDPFSSRTTLPDSAHVASAS 109 R +S LA L+ A ++ D F+S D+ + + Sbjct: 47 RQLISPDLALLIDKAISREKDDAEKVAKSDHPGDKPLMIEGDIFTSLYEGQDTFQIDTIK 106 Query: 110 TIPNRDARNIPLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSV-HATAGTLRQSI 168 D+ + ++ + W+DEV++I++ W ID+V + +T L+Q I Sbjct: 107 VK--GDSAFVVVQFANTGYKESWKDEVVLIKKET-WRIDNVYFGEEKDLKSTKDVLKQLI 163 >UniRef50_C0YS39 Putative uncharacterized protein n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YS39_9FLAO Length = 181 Score = 52.1 bits (123), Expect = 7e-06, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 48/164 (29%), Gaps = 23/164 (14%) Query: 5 KLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDITALR 64 KL +L LL ++C K N + + + Y Sbjct: 3 KLFLLSGIFLLFASC-------KKNEAHCSLSSDAAINAKINELYTTYEKSNEAIYNQPI 55 Query: 65 P--YLSDKLATLLSDA-------------SRDNNHRELL-TNDPFSSRTTLPDSAHVASA 108 P SD L +L +A S + + L+ FSS SA Sbjct: 56 PEDLFSDDLKKVLEEAINTSKADIEKVKNSDHPDEKPLIFEGALFSSLYEGFTDYKTKSA 115 Query: 109 STIPNRDARNIPLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRY 152 + +L W D + + G+ W ID++ + Sbjct: 116 RIKNTTAEVPVAFEYNLASPKVAWTDTIHLTNTGKEWKIDNITF 159 >UniRef50_B2FK98 Putative uncharacterized protein n=2 Tax=Stenotrophomonas RepID=B2FK98_STRMK Length = 204 Score = 47.1 bits (110), Expect = 2e-04, Method: Composition-based stats. Identities = 34/164 (20%), Positives = 59/164 (35%), Gaps = 30/164 (18%) Query: 2 RYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDIT 61 R S+ T L + V P + P +V + YD SN+ T Sbjct: 41 RMSRPTALFVLLI-------VAPLAAAASS--------DPRDVVTRLYDAIQAPTSNE-T 84 Query: 62 ALRPYLSDKLATLLSDASRDNN----------HRELLTNDPFSSRTTLPDSAHVASASTI 111 A+ P L + L + ++ +L P+ P++ V + Sbjct: 85 AISPLLGEALRSAIAGQRAYEQACTALAAPDEKPHMLDQSPYLLAPDRPETISVG----M 140 Query: 112 PNRDARNIPLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGG 155 P + VD+ GD W D VL+ ++ + W + D+R+ G Sbjct: 141 PGSSGDATWIHVDMAVGDYRWTDRVLLQRQDRDWKVMDIRWGQG 184 >UniRef50_UPI00017455D6 hypothetical protein VspiD_12630 n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017455D6 Length = 185 Score = 45.2 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 51/140 (36%), Gaps = 26/140 (18%) Query: 49 YDYRILHRSNDITALRPYLSDKLATLLSDA--------------SRDNNHRELL------ 88 ++ R + + +I L P L ++L + A +++ H +L Sbjct: 36 WEIRGVPDAREINLLSPLLGEELIQIFKKAESQKAREQAGIKRKYKNDPHPRVLKSAWSK 95 Query: 89 TNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDLKQGDQGWQDEVLMIQEGQCWVID 148 D F+S D+ V + L + + W +++ + Q WV+ Sbjct: 96 EGDLFASIAESFDTFAVGFPILKSGTVKVPVHLEYLVSKPAYRWTIILVLDRSDQDWVVS 155 Query: 149 DVRYLGGSVHATAGTLRQSI 168 D+ V+ +LR+S+ Sbjct: 156 DI------VNQDGESLRKSL 169 >UniRef50_B4SR97 Putative uncharacterized protein n=1 Tax=Stenotrophomonas maltophilia R551-3 RepID=B4SR97_STRM5 Length = 162 Score = 44.4 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 31/142 (21%), Positives = 54/142 (38%), Gaps = 17/142 (11%) Query: 35 PCVEGGPDNVAQQFYDYRILHRSNDITALRPYLSDKLATLLSDASR----------DNNH 84 P +V + Y S++ +RP L D L+ ++ + Sbjct: 17 AAAASDPRDVVTRLYATVQAPASSEEV-IRPLLGDALSAAINAQRAYERACTVLAAHDEK 75 Query: 85 RELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDLKQGDQGWQDEVLMIQEGQC 144 +L P+ P++ V P ++VD+ GD W D VL+ ++GQ Sbjct: 76 PHMLDQSPYLMAPDRPETVRVG----RPGPIGSATWVQVDMAVGDYRWTDRVLLQRQGQD 131 Query: 145 WVIDDVRYLGGSVHATAGTLRQ 166 W + D+R+ G G L+Q Sbjct: 132 WKVMDIRWGQG--GNLIGRLKQ 151 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.121 0.312 Lambda K H 0.267 0.0372 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 854,302,351 Number of Sequences: 3077464 Number of extensions: 28600010 Number of successful extensions: 85237 Number of sequences better than 1.0e-01: 12 Number of HSP's better than 0.1 without gapping: 16 Number of HSP's successfully gapped in prelim test: 7 Number of HSP's that attempted gapping in prelim test: 85191 Number of HSP's gapped (non-prelim): 23 length of query: 171 length of database: 1,040,396,356 effective HSP length: 119 effective length of query: 52 effective length of database: 674,178,140 effective search space: 35057263280 effective search space used: 35057263280 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 88 (38.6 bits)