BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (168 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q9S4X5 Uncharacterized protein yubA n=65 Tax=root RepID... 347 1e-94 UniRef50_C8QE75 Putative uncharacterized protein n=1 Tax=Pantoea... 130 1e-29 UniRef50_Q7N7T0 Similar to unknown protein n=2 Tax=Photorhabdus ... 123 2e-27 UniRef50_B6VNI3 Putative uncharacterized protein n=1 Tax=Photorh... 120 2e-26 UniRef50_B7MZL7 Putative uncharacterized protein n=3 Tax=Enterob... 111 1e-23 UniRef50_A8GJ61 Putative uncharacterized protein n=3 Tax=Serrati... 99 4e-20 UniRef50_D0FNQ8 Conserved uncharacterized protein n=6 Tax=Entero... 97 2e-19 UniRef50_C2LLG0 Plasmid-related protein n=2 Tax=Proteus mirabili... 79 5e-14 >UniRef50_Q9S4X5 Uncharacterized protein yubA n=65 Tax=root RepID=YUBA_ECOLI Length = 168 Score = 347 bits (889), Expect = 1e-94, Method: Compositional matrix adjust. Identities = 168/168 (100%), Positives = 168/168 (100%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF Sbjct: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP Sbjct: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 Query: 121 DDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPEKPCRLTC 168 DDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPEKPCRLTC Sbjct: 121 DDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPEKPCRLTC 168 >UniRef50_C8QE75 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QE75_9ENTR Length = 319 Score = 130 bits (328), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 73/173 (42%), Positives = 96/173 (55%), Gaps = 14/173 (8%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 MPNWC NR+ SG I +++ L +G P Y RA EG+QLFLAG AG+L+ + + Sbjct: 1 MPNWCCNRLRVSGRSEDIGQVRALFAGGGYPSYARAAAEGVQLFLAGCAGILRPADGEHY 60 Query: 61 EPCPGLTAA------------GRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQ 108 P P LT A G VS N AFT+WL L++G +L + C LH LWL+ Sbjct: 61 APYPALTGAKGYPSCAEEGRDGHHDVS-GNQAFTQWLCCLREGTVLTDAACDDLHALWLE 119 Query: 109 SGTGRRRWEELPDDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPE 161 SG RRW +L +E I A+F +R DWC +S V WW+ +CD LPE Sbjct: 120 SGLRDRRWNDLTPARQECIVAVFAGQRHDWCGSFSATTVQTWWDTVCDG-LPE 171 >UniRef50_Q7N7T0 Similar to unknown protein n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N7T0_PHOLL Length = 307 Score = 123 bits (309), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 66/169 (39%), Positives = 97/169 (57%), Gaps = 3/169 (1%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M W +NR++ +G+P Q+ +++ G P Y +A ++ I+LF+AG AGLLQ TE + + Sbjct: 1 MSKWYANRLHITGQPDQLDALRQWGLGDKIPYYGQAIHQSIKLFVAGCAGLLQPTETIDY 60 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P L A G G +SPEN AF +WL L++ V LDE + L+ QSG R+WE L Sbjct: 61 PLYPALVAGGVGEMSPENWAFEQWLALLKENVALDEVTSQQIDRLYQQSGLAERKWETLL 120 Query: 121 DDARESITALFTPKRGDWCDI--WSNE-DVSVWWNRLCDNVLPEKPCRL 166 A+++I ALF K DW + W+ E D + W+ L D + PC L Sbjct: 121 PTAQKTIMALFNQKSCDWFGLVDWNGERDTEMLWSHLDDRLTETVPCDL 169 >UniRef50_B6VNI3 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VNI3_PHOAA Length = 346 Score = 120 bits (300), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 66/169 (39%), Positives = 96/169 (56%), Gaps = 3/169 (1%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M W +NR+ G+P Q+ +++ G P Y +A ++ I+LF+AG AGLLQ TE + + Sbjct: 41 MSKWYANRLNIIGQPDQLDALQQWGLGDKIPYYGQAIHQSIKLFVAGCAGLLQPTETIDY 100 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P L A+G G +SPEN AF +WL L++ V LDE + L+ QSG R+WE L Sbjct: 101 PLYPALVASGVGEMSPENWAFEQWLALLKENVALDEVTSQQIDRLYQQSGLAERKWETLL 160 Query: 121 DDARESITALFTPKRGDWCDI--WSNE-DVSVWWNRLCDNVLPEKPCRL 166 A+++I ALF K DW + W+ E D + W+ L D + PC L Sbjct: 161 PAAQKTIMALFNQKSCDWFGLVDWNGERDTEMLWSHLDDRLTETVPCDL 209 >UniRef50_B7MZL7 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B7MZL7_ECO81 Length = 326 Score = 111 bits (277), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 62/169 (36%), Positives = 85/169 (50%), Gaps = 3/169 (1%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M WC NR +G+ + + + G TP YR A + IQLFL G AGL++ T ++ Sbjct: 12 MAEWCRNRFEITGKSVCLDVLTQWIEGCETPRYRHAIQQSIQLFLIGCAGLVKPTRTTQY 71 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P P L G G+ SP N AF +WL L +LDE+ + L+ QS G RW LP Sbjct: 72 PPYPALVRHGTGMSSPANQAFEQWLGLLVKDAVLDEETIKTIDRLYHQSCIGAVRWGNLP 131 Query: 121 DDARESITALFTPKRGDW---CDIWSNEDVSVWWNRLCDNVLPEKPCRL 166 D+ARE IT L + DW + + D W+RL D +PC + Sbjct: 132 DNAREIITTLMHCQYSDWFGLVGLSEHIDAEACWSRLSDYPEQAQPCDM 180 >UniRef50_A8GJ61 Putative uncharacterized protein n=3 Tax=Serratia proteamaculans 568 RepID=A8GJ61_SERP5 Length = 313 Score = 99.4 bits (246), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 56/173 (32%), Positives = 93/173 (53%), Gaps = 10/173 (5%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQ--TTEDV 58 M NWC NR+ +G+ + E+++ +G V P YR A + +LFLAG AG+L+ TT+ Sbjct: 7 MSNWCKNRLVITGQSVFVDELQQWVNGHVVPDYRHAIQQSCRLFLAGCAGILKPATTKPG 66 Query: 59 RFEPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEE 118 + P P L G+ SP+N+AF W L+ + L +N ++ L+ QSG +WE Sbjct: 67 VYVPYPDLLTH-PGIASPQNLAFEHWFGLLKADIPLTAENVRLIERLYRQSGIDAVKWEN 125 Query: 119 LPDDARESITALFTPKRGDW---CDIWSNEDVSVWWNRLCDNVLPE--KPCRL 166 +P+ A+E + + + + DW + + D + W RL ++PE PC + Sbjct: 126 IPNVAKERMADVLSRQYADWFGLVGVSPDIDAGLCWERL--GMMPEYTAPCDM 176 >UniRef50_D0FNQ8 Conserved uncharacterized protein n=6 Tax=Enterobacteriaceae RepID=D0FNQ8_ERWPY Length = 313 Score = 96.7 bits (239), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 56/167 (33%), Positives = 84/167 (50%), Gaps = 5/167 (2%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M +WC NR+ +G+ I ++ G TP YR A + I+LFLAG AG+L+ + + Sbjct: 1 MFSWCHNRLDITGKSVCIDVMQSWIVGTETPRYRHAIRQAIKLFLAGCAGILKPVKATTY 60 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P LTA+G G + N AF +L L+ LD + ++WLQSG +WE +P Sbjct: 61 PAYPALTASGLGAQTSANHAFQHFLELLEKDAWLDGATLSRMEKIWLQSGIDGLKWENIP 120 Query: 121 DDARESITALFTPKRGDWCDIWSNE---DVSVWWNRLCDNVLPEKPC 164 AR+ I+ L DW + S D W+ L +++PE C Sbjct: 121 LAARQIISQLVAVHYADWFGVASGAGQFDPQERWDSL--SIMPETTC 165 >UniRef50_C2LLG0 Plasmid-related protein n=2 Tax=Proteus mirabilis RepID=C2LLG0_PROMI Length = 333 Score = 79.3 bits (194), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 49/154 (31%), Positives = 75/154 (48%), Gaps = 6/154 (3%) Query: 1 MPNWCSNRMYFS-GEPAQIAEIKR-LASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDV 58 MPNWCSNR+ + A + +K + + P ++ A + + L LAG AG+L+ + Sbjct: 1 MPNWCSNRLDITLHNAADMPALKHWIYADDGIPAWQTAIAQSLHLLLAGCAGILKPVRPL 60 Query: 59 RFEPCPGLTAAG-RGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLH---ELWLQSGTGRR 114 F P P LT+ G G VSP N AFT W+ L L C +H ++WL G Sbjct: 61 SFPPLPELTSYGATGPVSPGNTAFTHWVDLLMTAPDLTPSCCQQIHQWYQMWLSEGGAYH 120 Query: 115 RWEELPDDARESITALFTPKRGDWCDIWSNEDVS 148 W+ L + ++ L + DW + ++ ED S Sbjct: 121 SWDSLTATQKARLSPLLSGSSFDWLNRFTGEDES 154 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7MZL7 Putative uncharacterized protein n=3 Tax=Enterob... 250 1e-65 UniRef50_Q9S4X5 Uncharacterized protein yubA n=65 Tax=root RepID... 248 7e-65 UniRef50_Q7N7T0 Similar to unknown protein n=2 Tax=Photorhabdus ... 241 5e-63 UniRef50_B6VNI3 Putative uncharacterized protein n=1 Tax=Photorh... 236 2e-61 UniRef50_D0FNQ8 Conserved uncharacterized protein n=6 Tax=Entero... 236 2e-61 UniRef50_A8GJ61 Putative uncharacterized protein n=3 Tax=Serrati... 234 1e-60 UniRef50_C8QE75 Putative uncharacterized protein n=1 Tax=Pantoea... 218 9e-56 UniRef50_C2LLG0 Plasmid-related protein n=2 Tax=Proteus mirabili... 194 1e-48 Sequences not found previously or not previously below threshold: UniRef50_A8LZS4 Amino acid adenylation domain n=2 Tax=Salinispor... 38 0.100 CONVERGED! >UniRef50_B7MZL7 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B7MZL7_ECO81 Length = 326 Score = 250 bits (638), Expect = 1e-65, Method: Composition-based stats. Identities = 62/169 (36%), Positives = 85/169 (50%), Gaps = 3/169 (1%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M WC NR +G+ + + + G TP YR A + IQLFL G AGL++ T ++ Sbjct: 12 MAEWCRNRFEITGKSVCLDVLTQWIEGCETPRYRHAIQQSIQLFLIGCAGLVKPTRTTQY 71 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P P L G G+ SP N AF +WL L +LDE+ + L+ QS G RW LP Sbjct: 72 PPYPALVRHGTGMSSPANQAFEQWLGLLVKDAVLDEETIKTIDRLYHQSCIGAVRWGNLP 131 Query: 121 DDARESITALFTPKRGDWC---DIWSNEDVSVWWNRLCDNVLPEKPCRL 166 D+ARE IT L + DW + + D W+RL D +PC + Sbjct: 132 DNAREIITTLMHCQYSDWFGLVGLSEHIDAEACWSRLSDYPEQAQPCDM 180 >UniRef50_Q9S4X5 Uncharacterized protein yubA n=65 Tax=root RepID=YUBA_ECOLI Length = 168 Score = 248 bits (632), Expect = 7e-65, Method: Composition-based stats. Identities = 168/168 (100%), Positives = 168/168 (100%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF Sbjct: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP Sbjct: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 Query: 121 DDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPEKPCRLTC 168 DDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPEKPCRLTC Sbjct: 121 DDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPEKPCRLTC 168 >UniRef50_Q7N7T0 Similar to unknown protein n=2 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N7T0_PHOLL Length = 307 Score = 241 bits (616), Expect = 5e-63, Method: Composition-based stats. Identities = 66/169 (39%), Positives = 97/169 (57%), Gaps = 3/169 (1%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M W +NR++ +G+P Q+ +++ G P Y +A ++ I+LF+AG AGLLQ TE + + Sbjct: 1 MSKWYANRLHITGQPDQLDALRQWGLGDKIPYYGQAIHQSIKLFVAGCAGLLQPTETIDY 60 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P L A G G +SPEN AF +WL L++ V LDE + L+ QSG R+WE L Sbjct: 61 PLYPALVAGGVGEMSPENWAFEQWLALLKENVALDEVTSQQIDRLYQQSGLAERKWETLL 120 Query: 121 DDARESITALFTPKRGDWCDI--WSNE-DVSVWWNRLCDNVLPEKPCRL 166 A+++I ALF K DW + W+ E D + W+ L D + PC L Sbjct: 121 PTAQKTIMALFNQKSCDWFGLVDWNGERDTEMLWSHLDDRLTETVPCDL 169 >UniRef50_B6VNI3 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=B6VNI3_PHOAA Length = 346 Score = 236 bits (602), Expect = 2e-61, Method: Composition-based stats. Identities = 66/169 (39%), Positives = 96/169 (56%), Gaps = 3/169 (1%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M W +NR+ G+P Q+ +++ G P Y +A ++ I+LF+AG AGLLQ TE + + Sbjct: 41 MSKWYANRLNIIGQPDQLDALQQWGLGDKIPYYGQAIHQSIKLFVAGCAGLLQPTETIDY 100 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P L A+G G +SPEN AF +WL L++ V LDE + L+ QSG R+WE L Sbjct: 101 PLYPALVASGVGEMSPENWAFEQWLALLKENVALDEVTSQQIDRLYQQSGLAERKWETLL 160 Query: 121 DDARESITALFTPKRGDWCDI--WSNE-DVSVWWNRLCDNVLPEKPCRL 166 A+++I ALF K DW + W+ E D + W+ L D + PC L Sbjct: 161 PAAQKTIMALFNQKSCDWFGLVDWNGERDTEMLWSHLDDRLTETVPCDL 209 >UniRef50_D0FNQ8 Conserved uncharacterized protein n=6 Tax=Enterobacteriaceae RepID=D0FNQ8_ERWPY Length = 313 Score = 236 bits (602), Expect = 2e-61, Method: Composition-based stats. Identities = 55/169 (32%), Positives = 81/169 (47%), Gaps = 3/169 (1%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 M +WC NR+ +G+ I ++ G TP YR A + I+LFLAG AG+L+ + + Sbjct: 1 MFSWCHNRLDITGKSVCIDVMQSWIVGTETPRYRHAIRQAIKLFLAGCAGILKPVKATTY 60 Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEELP 120 P LTA+G G + N AF +L L+ LD + ++WLQSG +WE +P Sbjct: 61 PAYPALTASGLGAQTSANHAFQHFLELLEKDAWLDGATLSRMEKIWLQSGIDGLKWENIP 120 Query: 121 DDARESITALFTPKRGDWCDIWSNE---DVSVWWNRLCDNVLPEKPCRL 166 AR+ I+ L DW + S D W+ L PC + Sbjct: 121 LAARQIISQLVAVHYADWFGVASGAGQFDPQERWDSLSIMPETTCPCDM 169 >UniRef50_A8GJ61 Putative uncharacterized protein n=3 Tax=Serratia proteamaculans 568 RepID=A8GJ61_SERP5 Length = 313 Score = 234 bits (596), Expect = 1e-60, Method: Composition-based stats. Identities = 53/171 (30%), Positives = 88/171 (51%), Gaps = 6/171 (3%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQT--TEDV 58 M NWC NR+ +G+ + E+++ +G V P YR A + +LFLAG AG+L+ T+ Sbjct: 7 MSNWCKNRLVITGQSVFVDELQQWVNGHVVPDYRHAIQQSCRLFLAGCAGILKPATTKPG 66 Query: 59 RFEPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEE 118 + P P L G+ SP+N+AF W L+ + L +N ++ L+ QSG +WE Sbjct: 67 VYVPYPDLLTH-PGIASPQNLAFEHWFGLLKADIPLTAENVRLIERLYRQSGIDAVKWEN 125 Query: 119 LPDDARESITALFTPKRGDWC---DIWSNEDVSVWWNRLCDNVLPEKPCRL 166 +P+ A+E + + + + DW + + D + W RL PC + Sbjct: 126 IPNVAKERMADVLSRQYADWFGLVGVSPDIDAGLCWERLGMMPEYTAPCDM 176 >UniRef50_C8QE75 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QE75_9ENTR Length = 319 Score = 218 bits (554), Expect = 9e-56, Method: Composition-based stats. Identities = 73/180 (40%), Positives = 98/180 (54%), Gaps = 15/180 (8%) Query: 1 MPNWCSNRMYFSGEPAQIAEIKRLASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDVRF 60 MPNWC NR+ SG I +++ L +G P Y RA EG+QLFLAG AG+L+ + + Sbjct: 1 MPNWCCNRLRVSGRSEDIGQVRALFAGGGYPSYARAAAEGVQLFLAGCAGILRPADGEHY 60 Query: 61 EPCPGLTAA------------GRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQ 108 P P LT A G VS N AFT+WL L++G +L + C LH LWL+ Sbjct: 61 APYPALTGAKGYPSCAEEGRDGHHDVS-GNQAFTQWLCCLREGTVLTDAACDDLHALWLE 119 Query: 109 SGTGRRRWEELPDDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNV--LPEKPCRL 166 SG RRW +L +E I A+F +R DWC +S V WW+ +CD + L +P L Sbjct: 120 SGLRDRRWNDLTPARQECIVAVFAGQRHDWCGSFSATTVQTWWDTVCDGLPELKAQPLDL 179 >UniRef50_C2LLG0 Plasmid-related protein n=2 Tax=Proteus mirabilis RepID=C2LLG0_PROMI Length = 333 Score = 194 bits (493), Expect = 1e-48, Method: Composition-based stats. Identities = 49/154 (31%), Positives = 74/154 (48%), Gaps = 6/154 (3%) Query: 1 MPNWCSNRMYFS-GEPAQIAEIKRL-ASGAVTPLYRRATNEGIQLFLAGSAGLLQTTEDV 58 MPNWCSNR+ + A + +K + P ++ A + + L LAG AG+L+ + Sbjct: 1 MPNWCSNRLDITLHNAADMPALKHWIYADDGIPAWQTAIAQSLHLLLAGCAGILKPVRPL 60 Query: 59 RFEPCPGLTAAG-RGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLH---ELWLQSGTGRR 114 F P P LT+ G G VSP N AFT W+ L L C +H ++WL G Sbjct: 61 SFPPLPELTSYGATGPVSPGNTAFTHWVDLLMTAPDLTPSCCQQIHQWYQMWLSEGGAYH 120 Query: 115 RWEELPDDARESITALFTPKRGDWCDIWSNEDVS 148 W+ L + ++ L + DW + ++ ED S Sbjct: 121 SWDSLTATQKARLSPLLSGSSFDWLNRFTGEDES 154 >UniRef50_A8LZS4 Amino acid adenylation domain n=2 Tax=Salinispora RepID=A8LZS4_SALAI Length = 1326 Score = 38.1 bits (87), Expect = 0.100, Method: Composition-based stats. Identities = 24/110 (21%), Positives = 40/110 (36%), Gaps = 8/110 (7%) Query: 61 EPCPGLTAAGRGVVSPENIAFTRWLTHLQDGVLLDEQNCLMLHELWLQSGTGRRRWEEL- 119 P P + +G G+ A WL GV Q L H +W + G R W+ + Sbjct: 449 PPYPVVFTSGVGLAGDGTAAPASWLGAEVFGVSQTPQVLLD-HIVWDEDGVLRIAWDGVV 507 Query: 120 ---PDDARESITALFTPKRGDWCDIWSNEDVSVWWNRLCDNVLPEKPCRL 166 PD S+ + + + +D + W+ LP +P + Sbjct: 508 DAFPDGYLRSMLDAYVRLLHRLTEATAWKDPRLAWDPFA---LPVEPLDV 554 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.132 0.401 Lambda K H 0.267 0.0405 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 702,883,650 Number of Sequences: 3077464 Number of extensions: 27204431 Number of successful extensions: 67279 Number of sequences better than 1.0e-01: 9 Number of HSP's better than 0.1 without gapping: 15 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 67248 Number of HSP's gapped (non-prelim): 17 length of query: 168 length of database: 1,040,396,356 effective HSP length: 119 effective length of query: 49 effective length of database: 674,178,140 effective search space: 33034728860 effective search space used: 33034728860 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 88 (38.5 bits)