BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (136 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q46835 Uncharacterized lipoprotein yghG n=39 Tax=Escher... 276 1e-73 UniRef50_C4K3F8 Putative lipoprotein n=1 Tax=Candidatus Hamilton... 69 6e-11 UniRef50_D0IBE7 Putative uncharacterized protein n=1 Tax=Grimont... 53 3e-06 UniRef50_Q2BX31 Putative uncharacterized protein n=3 Tax=Photoba... 49 7e-05 UniRef50_A1EN47 Methyl-accepting chemotaxis protein (Fragment) n... 45 6e-04 UniRef50_B7VMM1 Putative uncharacterized protein n=11 Tax=Vibrio... 39 0.035 UniRef50_Q6LP85 Putative uncharacterized protein n=2 Tax=Photoba... 38 0.094 >UniRef50_Q46835 Uncharacterized lipoprotein yghG n=39 Tax=Escherichia RepID=YGHG_ECOLI Length = 136 Score = 276 bits (707), Expect = 1e-73, Method: Compositional matrix adjust. 
Identities = 136/136 (100%), Positives = 136/136 (100%) Query: 1 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ 60 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ Sbjct: 1 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ 60 Query: 61 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN 120 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN Sbjct: 61 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN 120 Query: 121 QYQRKLDRTTCGIVKA 136 QYQRKLDRTTCGIVKA Sbjct: 121 QYQRKLDRTTCGIVKA 136 >UniRef50_C4K3F8 Putative lipoprotein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K3F8_HAMD5 Length = 134 Score = 68.6 bits (166), Expect = 6e-11, Method: Compositional matrix adjust. Identities = 29/82 (35%), Positives = 47/82 (57%) Query: 32 ASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTTQTPDAFLTSYQR 91 AS LAKKQ I LP+KS Y L++A + M + E G + T++PD FL Y++ Sbjct: 28 ASTLAKKQVDKIRPYLPVKSEHYVLMMAHHQANKINMIFMQEQGAEFTKSPDQFLEEYKK 87 Query: 92 QMCADPTVKLMITEGINYSITI 113 Q+CA + ++ + + Y ++I Sbjct: 88 QLCASNEILNVLQQNVQYDMSI 109 >UniRef50_D0IBE7 Putative uncharacterized protein n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0IBE7_VIBHO Length = 133 Score = 53.1 bits (126), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 36/127 (28%), Positives = 66/127 (51%), Gaps = 6/127 (4%) Query: 8 GRVLISLLLSVTGLLSGCASHN--ENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTT 65 GRV +L+ +T LLS C+S + E A A ++A +S+ +P+ GY LV A+++ T Sbjct: 8 GRV--ALIGVITLLLSACSSTSDLEIAESFALQRADMLSKIVPVPMNGYNLVRAKANSTQ 65 Query: 66 VKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRK 125 +++T++ AG+ P A ++ C D + ++ +G+NY + D R +R Sbjct: 66 IELTLLY-AGSGDI-APAALAERLEKTYCQDTEIASLMEKGVNYKLLFRDARGRPVLERV 123 Query: 126 LDRTTCG 132 + C Sbjct: 124 ITHKECA 130 >UniRef50_Q2BX31 Putative uncharacterized protein n=3 Tax=Photobacterium RepID=Q2BX31_9GAMM Length = 133 Score = 48.5 bits (114), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 35/110 (31%), Positives = 51/110 (46%), Gaps = 5/110 (4%) Query: 25 CASHNEN--ASLLAKKQAQNISQNLPI-KSAGYTLVLAQSSGTTVKMTIISEAGTQTTQT 81 CASH + A LAK +A I P K Y + A +SG TVK+ II G ++ + Sbjct: 22 CASHEKQQIAETLAKSRASEIDSKAPYPKIDAYQITQAYASGNTVKINIIY--GGKSHVS 79 Query: 82 PDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 P + C + +K + G+NY ITI D Q+ + + TC Sbjct: 80 PSNAAKTAAINYCNNKEIKPLFNSGVNYLITIKDISGRTMAQQAVSKGTC 129 >UniRef50_A1EN47 Methyl-accepting chemotaxis protein (Fragment) n=40 Tax=Vibrio RepID=A1EN47_VIBCH Length = 126 Score = 45.1 bits (105), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 30/108 (27%), Positives = 53/108 (49%), Gaps = 1/108 (0%) Query: 10 VLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMT 69 V+ S LL V G SG A N LLA +A +S LP++ ++ A + G+T+++ Sbjct: 3 VIASTLLLV-GCSSGSAEKQRNLELLAGNRASLLSTELPLEFGPLNILRATAKGSTIELM 61 Query: 70 IISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTR 117 ++ + + L S CA+ ++ + GI+Y I + +TR Sbjct: 62 MVYNTDANNAKPTEQVLQSAVSSFCANKDIRSNLDVGISYRIQMRNTR 109 >UniRef50_B7VMM1 Putative uncharacterized protein n=11 Tax=Vibrionales RepID=B7VMM1_VIBSL Length = 142 Score = 39.3 bits (90), Expect = 0.035, Method: Compositional matrix adjust. 
Identities = 29/121 (23%), Positives = 57/121 (47%), Gaps = 13/121 (10%) Query: 20 GLLSGCASHNENA-----SLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTII--- 71 LL+GC+S +E ++A +A +S LPI+ +++ + T V++ +I Sbjct: 22 ALLAGCSSSSEQDKQRQLEMMAHHRAGVLSAGLPIEYGPLSIMRVLAKNTVVEIMMIYNQ 81 Query: 72 -SEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTT 130 ++ Q D + SY C + V+ + G+ Y+I I +TR ++ + + T Sbjct: 82 DAKGAKPLNQVVDMSVNSY----CTNSEVRANLDMGLAYNIKIRNTRGQLMVEKLISKET 137 Query: 131 C 131 C Sbjct: 138 C 138 >UniRef50_Q6LP85 Putative uncharacterized protein n=2 Tax=Photobacterium profundum RepID=Q6LP85_PHOPR Length = 128 Score = 38.1 bits (87), Expect = 0.094, Method: Compositional matrix adjust. Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 4/96 (4%) Query: 24 GCASHNENASL-LAKKQAQNISQNLPI-KSAGYTLVLAQSSGTTVKMTIISEAGTQTTQT 81 GCAS ++A + LAK +A I+ P K Y ++ A + V++TI+ G++ Sbjct: 16 GCASTEDDAVIALAKSRATTINNKAPYDKVDQYQIMKAHARDKIVEITILYGGGSKIA-- 73 Query: 82 PDAFLTSYQRQMCADPTVKLMITEGINYSITINDTR 117 P + C+ + + EG+ Y+I I D R Sbjct: 74 PTQAAAFAAKNYCSSNELSPLFNEGMGYNIIIMDMR 109 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46835 Uncharacterized lipoprotein yghG n=39 Tax=Escher... 176 1e-43 UniRef50_D0IBE7 Putative uncharacterized protein n=1 Tax=Grimont... 139 3e-32 UniRef50_A1EN47 Methyl-accepting chemotaxis protein (Fragment) n... 133 2e-30 UniRef50_C4K3F8 Putative lipoprotein n=1 Tax=Candidatus Hamilton... 120 1e-26 UniRef50_Q2BX31 Putative uncharacterized protein n=3 Tax=Photoba... 114 8e-25 Sequences not found previously or not previously below threshold: UniRef50_B7VMM1 Putative uncharacterized protein n=11 Tax=Vibrio... 92 4e-18 UniRef50_Q7MM63 Putative uncharacterized protein VV1210 n=3 Tax=... 85 6e-16 UniRef50_B5FDB1 Lipoprotein, putative n=3 Tax=Aliivibrio RepID=B... 
83 3e-15 UniRef50_Q6LP85 Putative uncharacterized protein n=2 Tax=Photoba... 82 5e-15 UniRef50_A3UPE9 Putative uncharacterized protein n=1 Tax=Vibrio ... 57 2e-07 UniRef50_Q6LSN6 Putative uncharacterized protein n=2 Tax=Photoba... 50 3e-05 UniRef50_B0X7Z4 Ankyrin repeat domain-containing protein 29 n=1 ... 39 0.041 >UniRef50_Q46835 Uncharacterized lipoprotein yghG n=39 Tax=Escherichia RepID=YGHG_ECOLI Length = 136 Score = 176 bits (447), Expect = 1e-43, Method: Composition-based stats. Identities = 136/136 (100%), Positives = 136/136 (100%) Query: 1 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ 60 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ Sbjct: 1 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ 60 Query: 61 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN 120 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN Sbjct: 61 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN 120 Query: 121 QYQRKLDRTTCGIVKA 136 QYQRKLDRTTCGIVKA Sbjct: 121 QYQRKLDRTTCGIVKA 136 >UniRef50_D0IBE7 Putative uncharacterized protein n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0IBE7_VIBHO Length = 133 Score = 139 bits (349), Expect = 3e-32, Method: Composition-based stats. Identities = 36/127 (28%), Positives = 66/127 (51%), Gaps = 6/127 (4%) Query: 8 GRVLISLLLSVTGLLSGCASHN--ENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTT 65 GRV +L+ +T LLS C+S + E A A ++A +S+ +P+ GY LV A+++ T Sbjct: 8 GRV--ALIGVITLLLSACSSTSDLEIAESFALQRADMLSKIVPVPMNGYNLVRAKANSTQ 65 Query: 66 VKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRK 125 +++T++ AG+ P A ++ C D + ++ +G+NY + D R +R Sbjct: 66 IELTLLY-AGSGDI-APAALAERLEKTYCQDTEIASLMEKGVNYKLLFRDARGRPVLERV 123 Query: 126 LDRTTCG 132 + C Sbjct: 124 ITHKECA 130 >UniRef50_A1EN47 Methyl-accepting chemotaxis protein (Fragment) n=40 Tax=Vibrio RepID=A1EN47_VIBCH Length = 126 Score = 133 bits (334), Expect = 2e-30, Method: Composition-based stats. 
Identities = 31/122 (25%), Positives = 58/122 (47%), Gaps = 1/122 (0%) Query: 10 VLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMT 69 V+ S LL V G SG A N LLA +A +S LP++ ++ A + G+T+++ Sbjct: 3 VIASTLLLV-GCSSGSAEKQRNLELLAGNRASLLSTELPLEFGPLNILRATAKGSTIELM 61 Query: 70 IISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRT 129 ++ + + L S CA+ ++ + GI+Y I + +TR + + + Sbjct: 62 MVYNTDANNAKPTEQVLQSAVSSFCANKDIRSNLDVGISYRIQMRNTRGQLMADQLVTKE 121 Query: 130 TC 131 +C Sbjct: 122 SC 123 >UniRef50_C4K3F8 Putative lipoprotein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K3F8_HAMD5 Length = 134 Score = 120 bits (301), Expect = 1e-26, Method: Composition-based stats. Identities = 31/100 (31%), Positives = 52/100 (52%) Query: 32 ASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTTQTPDAFLTSYQR 91 AS LAKKQ I LP+KS Y L++A + M + E G + T++PD FL Y++ Sbjct: 28 ASTLAKKQVDKIRPYLPVKSEHYVLMMAHHQANKINMIFMQEQGAEFTKSPDQFLEEYKK 87 Query: 92 QMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 Q+CA + ++ + + Y ++I+ + + L C Sbjct: 88 QLCASNEILNVLQQNVQYDMSISGSWNKPLARMSLSYKDC 127 >UniRef50_Q2BX31 Putative uncharacterized protein n=3 Tax=Photobacterium RepID=Q2BX31_9GAMM Length = 133 Score = 114 bits (285), Expect = 8e-25, Method: Composition-based stats. Identities = 35/110 (31%), Positives = 51/110 (46%), Gaps = 5/110 (4%) Query: 25 CASHNEN--ASLLAKKQAQNISQNLPI-KSAGYTLVLAQSSGTTVKMTIISEAGTQTTQT 81 CASH + A LAK +A I P K Y + A +SG TVK+ II G ++ + Sbjct: 22 CASHEKQQIAETLAKSRASEIDSKAPYPKIDAYQITQAYASGNTVKINIIY--GGKSHVS 79 Query: 82 PDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 P + C + +K + G+NY ITI D Q+ + + TC Sbjct: 80 PSNAAKTAAINYCNNKEIKPLFNSGVNYLITIKDISGRTMAQQAVSKGTC 129 >UniRef50_B7VMM1 Putative uncharacterized protein n=11 Tax=Vibrionales RepID=B7VMM1_VIBSL Length = 142 Score = 92.2 bits (227), Expect = 4e-18, Method: Composition-based stats. 
Identities = 21/118 (17%), Positives = 49/118 (41%), Gaps = 1/118 (0%) Query: 15 LLSVTGLLSGCAS-HNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISE 73 + + G S ++A +A +S LPI+ +++ + T V++ +I Sbjct: 21 IALLAGCSSSSEQDKQRQLEMMAHHRAGVLSAGLPIEYGPLSIMRVLAKNTVVEIMMIYN 80 Query: 74 AGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 + + + + C + V+ + G+ Y+I I +TR ++ + + TC Sbjct: 81 QDAKGAKPLNQVVDMSVNSYCTNSEVRANLDMGLAYNIKIRNTRGQLMVEKLISKETC 138 >UniRef50_Q7MM63 Putative uncharacterized protein VV1210 n=3 Tax=Vibrio RepID=Q7MM63_VIBVY Length = 130 Score = 85.3 bits (209), Expect = 6e-16, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 44/112 (39%), Gaps = 1/112 (0%) Query: 20 GLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTT 79 G S L+A +A +S LPI+ ++ SS V++ +I Sbjct: 17 GCSSN-GEKERQLELMASNRAGVLSAGLPIEYGPLKVMRISSSKNIVEIMMIYNTDATGA 75 Query: 80 QTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 + L++ + C D TV+ + G+ Y I I ++R + +C Sbjct: 76 KPTQELLSTSVSKYCEDATVRNQLDVGLMYRIKIRNSRGQLIIDEMVTAASC 127 >UniRef50_B5FDB1 Lipoprotein, putative n=3 Tax=Aliivibrio RepID=B5FDB1_VIBFM Length = 132 Score = 82.6 bits (202), Expect = 3e-15, Method: Composition-based stats. Identities = 26/113 (23%), Positives = 54/113 (47%), Gaps = 4/113 (3%) Query: 21 LLSGCASHNEN--ASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQT 78 +LSGCAS+ + +LA +A ++ LP++ TL+ A++ V+M + G Sbjct: 13 ILSGCASNEQQQEIEMLADYRASVLASVLPLEMGQLTLLQAKAKQGVVEMMFL--DGGNG 70 Query: 79 TQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 + + + + C D V+ ++ +G+ Y I ++R Q ++ +C Sbjct: 71 EISTNQIIENSITTFCKDKEVRPVLDKGVTYRYIIRNSRGQKQNDIIVNEQSC 123 >UniRef50_Q6LP85 Putative uncharacterized protein n=2 Tax=Photobacterium profundum RepID=Q6LP85_PHOPR Length = 128 Score = 82.2 bits (201), Expect = 5e-15, Method: Composition-based stats. 
Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 4/111 (3%) Query: 24 GCASHNENASL-LAKKQAQNISQNLPI-KSAGYTLVLAQSSGTTVKMTIISEAGTQTTQT 81 GCAS ++A + LAK +A I+ P K Y ++ A + V++TI+ G + Sbjct: 16 GCASTEDDAVIALAKSRATTINNKAPYDKVDQYQIMKAHARDKIVEITILY--GGGSKIA 73 Query: 82 PDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTCG 132 P + C+ + + EG+ Y+I I D R ++ + C Sbjct: 74 PTQAAAFAAKNYCSSNELSPLFNEGMGYNIIIMDMRGRKIVEQPVYAAYCA 124 >UniRef50_A3UPE9 Putative uncharacterized protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UPE9_VIBSP Length = 109 Score = 56.7 bits (135), Expect = 2e-07, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 45/112 (40%), Gaps = 4/112 (3%) Query: 21 LLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTTQ 80 +L+GC ++ LA +A IS LP + TLV AQS G +V + I + T Sbjct: 1 MLNGCQISPIKSNGLADYRATVISSQLPQELGSITLVNAQSEGNSVTLVFIK----KKTL 56 Query: 81 TPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTCG 132 D + C + ++ ++ GI+Y I + + C Sbjct: 57 NMDRLVEKVAVSFCDNIEIRPLLESGISYRIITLGKNEKVESLNVISLAKCS 108 >UniRef50_Q6LSN6 Putative uncharacterized protein n=2 Tax=Photobacterium profundum RepID=Q6LSN6_PHOPR Length = 118 Score = 49.8 bits (117), Expect = 3e-05, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 56/122 (45%), Gaps = 17/122 (13%) Query: 15 LLSVTGLLSGCASHNEN----ASLLAKKQAQNISQNLPIKS-AGYTLVLAQSSGTTVKMT 69 ++ V+ LL+GC+S +N + LAK+ AQ I+ + G +V A + G TV +T Sbjct: 9 IIVVSILLAGCSSSKQNDLMAMNRLAKQHAQTINDAKHVAMKEGVEIVEAYAIGPTVVVT 68 Query: 70 IISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRT 129 + G D + + C P +K ++ +G+ Y D + +D+T Sbjct: 69 L---QGNDL----DGLKSWIRPTFCQQPKIKPLLEQGLQY--VFRDQNGE---EYLVDKT 116 Query: 130 TC 131 +C Sbjct: 117 SC 118 >UniRef50_B0X7Z4 Ankyrin repeat domain-containing protein 29 n=1 Tax=Culex quinquefasciatus RepID=B0X7Z4_CULQU Length = 435 Score = 39.0 bits (89), Expect = 0.041, Method: Composition-based stats. 
Identities = 19/108 (17%), Positives = 40/108 (37%) Query: 16 LSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAG 75 S+ +L+ A H + L + A+ +Q K +LV + V++ + A Sbjct: 159 GSIPLILAATAGHEKVVETLLRHGAKMEAQYERTKDTPLSLVCSGGRYEVVELLLDMNAN 218 Query: 76 TQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQ 123 + PD L S + L+++ G+ + + + Q Sbjct: 219 RENRNVPDYTLLSLAASGGYVKIITLLLSHGVEINSQMGSNGSRPVGQ 266 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46835 Uncharacterized lipoprotein yghG n=39 Tax=Escher... 157 1e-37 UniRef50_A1EN47 Methyl-accepting chemotaxis protein (Fragment) n... 136 3e-31 UniRef50_B7VMM1 Putative uncharacterized protein n=11 Tax=Vibrio... 130 1e-29 UniRef50_D0IBE7 Putative uncharacterized protein n=1 Tax=Grimont... 124 7e-28 UniRef50_B5FDB1 Lipoprotein, putative n=3 Tax=Aliivibrio RepID=B... 121 5e-27 UniRef50_Q7MM63 Putative uncharacterized protein VV1210 n=3 Tax=... 120 1e-26 UniRef50_C4K3F8 Putative lipoprotein n=1 Tax=Candidatus Hamilton... 117 1e-25 UniRef50_Q6LP85 Putative uncharacterized protein n=2 Tax=Photoba... 110 2e-23 UniRef50_Q2BX31 Putative uncharacterized protein n=3 Tax=Photoba... 106 3e-22 UniRef50_A3UPE9 Putative uncharacterized protein n=1 Tax=Vibrio ... 105 5e-22 UniRef50_Q6LSN6 Putative uncharacterized protein n=2 Tax=Photoba... 97 2e-19 Sequences not found previously or not previously below threshold: UniRef50_B9ZPA7 Putative uncharacterized protein n=1 Tax=Thioalk... 41 0.012 UniRef50_A1R2M1 Putative uncharacterized protein n=1 Tax=Arthrob... 41 0.017 UniRef50_D0S5V0 Predicted protein n=1 Tax=Acinetobacter calcoace... 39 0.045 CONVERGED! >UniRef50_Q46835 Uncharacterized lipoprotein yghG n=39 Tax=Escherichia RepID=YGHG_ECOLI Length = 136 Score = 157 bits (397), Expect = 1e-37, Method: Composition-based stats. 
Identities = 136/136 (100%), Positives = 136/136 (100%) Query: 1 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ 60 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ Sbjct: 1 MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQ 60 Query: 61 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN 120 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN Sbjct: 61 SSGTTVKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGN 120 Query: 121 QYQRKLDRTTCGIVKA 136 QYQRKLDRTTCGIVKA Sbjct: 121 QYQRKLDRTTCGIVKA 136 >UniRef50_A1EN47 Methyl-accepting chemotaxis protein (Fragment) n=40 Tax=Vibrio RepID=A1EN47_VIBCH Length = 126 Score = 136 bits (341), Expect = 3e-31, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 58/122 (47%), Gaps = 1/122 (0%) Query: 10 VLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMT 69 V+ S LL V G SG A N LLA +A +S LP++ ++ A + G+T+++ Sbjct: 3 VIASTLLLV-GCSSGSAEKQRNLELLAGNRASLLSTELPLEFGPLNILRATAKGSTIELM 61 Query: 70 IISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRT 129 ++ + + L S CA+ ++ + GI+Y I + +TR + + + Sbjct: 62 MVYNTDANNAKPTEQVLQSAVSSFCANKDIRSNLDVGISYRIQMRNTRGQLMADQLVTKE 121 Query: 130 TC 131 +C Sbjct: 122 SC 123 >UniRef50_B7VMM1 Putative uncharacterized protein n=11 Tax=Vibrionales RepID=B7VMM1_VIBSL Length = 142 Score = 130 bits (327), Expect = 1e-29, Method: Composition-based stats. 
Identities = 21/118 (17%), Positives = 49/118 (41%), Gaps = 1/118 (0%) Query: 15 LLSVTGLLSGCAS-HNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISE 73 + + G S ++A +A +S LPI+ +++ + T V++ +I Sbjct: 21 IALLAGCSSSSEQDKQRQLEMMAHHRAGVLSAGLPIEYGPLSIMRVLAKNTVVEIMMIYN 80 Query: 74 AGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 + + + + C + V+ + G+ Y+I I +TR ++ + + TC Sbjct: 81 QDAKGAKPLNQVVDMSVNSYCTNSEVRANLDMGLAYNIKIRNTRGQLMVEKLISKETC 138 >UniRef50_D0IBE7 Putative uncharacterized protein n=1 Tax=Grimontia hollisae CIP 101886 RepID=D0IBE7_VIBHO Length = 133 Score = 124 bits (312), Expect = 7e-28, Method: Composition-based stats. Identities = 34/127 (26%), Positives = 63/127 (49%), Gaps = 6/127 (4%) Query: 8 GRVLISLLLSVTGLLSGCASHN--ENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTT 65 GRV +L+ +T LLS C+S + E A A ++A +S+ +P+ GY LV A+++ T Sbjct: 8 GRV--ALIGVITLLLSACSSTSDLEIAESFALQRADMLSKIVPVPMNGYNLVRAKANSTQ 65 Query: 66 VKMTIISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRK 125 +++T++ P A ++ C D + ++ +G+NY + D R +R Sbjct: 66 IELTLLYA--GSGDIAPAALAERLEKTYCQDTEIASLMEKGVNYKLLFRDARGRPVLERV 123 Query: 126 LDRTTCG 132 + C Sbjct: 124 ITHKECA 130 >UniRef50_B5FDB1 Lipoprotein, putative n=3 Tax=Aliivibrio RepID=B5FDB1_VIBFM Length = 132 Score = 121 bits (304), Expect = 5e-27, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 55/120 (45%), Gaps = 4/120 (3%) Query: 14 LLLSVTGLLSGCASHNEN--ASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTII 71 L +LSGCAS+ + +LA +A ++ LP++ TL+ A++ V+M + Sbjct: 6 LTAISAIILSGCASNEQQQEIEMLADYRASVLASVLPLEMGQLTLLQAKAKQGVVEMMFL 65 Query: 72 SEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 G + + + + C D V+ ++ +G+ Y I ++R Q ++ +C Sbjct: 66 --DGGNGEISTNQIIENSITTFCKDKEVRPVLDKGVTYRYIIRNSRGQKQNDIIVNEQSC 123 >UniRef50_Q7MM63 Putative uncharacterized protein VV1210 n=3 Tax=Vibrio RepID=Q7MM63_VIBVY Length = 130 Score = 120 bits (301), Expect = 1e-26, Method: Composition-based stats. 
Identities = 24/112 (21%), Positives = 44/112 (39%), Gaps = 1/112 (0%) Query: 20 GLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTT 79 G S L+A +A +S LPI+ ++ SS V++ +I Sbjct: 17 GCSSN-GEKERQLELMASNRAGVLSAGLPIEYGPLKVMRISSSKNIVEIMMIYNTDATGA 75 Query: 80 QTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 + L++ + C D TV+ + G+ Y I I ++R + +C Sbjct: 76 KPTQELLSTSVSKYCEDATVRNQLDVGLMYRIKIRNSRGQLIIDEMVTAASC 127 >UniRef50_C4K3F8 Putative lipoprotein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K3F8_HAMD5 Length = 134 Score = 117 bits (292), Expect = 1e-25, Method: Composition-based stats. Identities = 31/100 (31%), Positives = 52/100 (52%) Query: 32 ASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTTQTPDAFLTSYQR 91 AS LAKKQ I LP+KS Y L++A + M + E G + T++PD FL Y++ Sbjct: 28 ASTLAKKQVDKIRPYLPVKSEHYVLMMAHHQANKINMIFMQEQGAEFTKSPDQFLEEYKK 87 Query: 92 QMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 Q+CA + ++ + + Y ++I+ + + L C Sbjct: 88 QLCASNEILNVLQQNVQYDMSISGSWNKPLARMSLSYKDC 127 >UniRef50_Q6LP85 Putative uncharacterized protein n=2 Tax=Photobacterium profundum RepID=Q6LP85_PHOPR Length = 128 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 4/111 (3%) Query: 24 GCASHNENASL-LAKKQAQNISQNLPI-KSAGYTLVLAQSSGTTVKMTIISEAGTQTTQT 81 GCAS ++A + LAK +A I+ P K Y ++ A + V++TI+ G + Sbjct: 16 GCASTEDDAVIALAKSRATTINNKAPYDKVDQYQIMKAHARDKIVEITILY--GGGSKIA 73 Query: 82 PDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTCG 132 P + C+ + + EG+ Y+I I D R ++ + C Sbjct: 74 PTQAAAFAAKNYCSSNELSPLFNEGMGYNIIIMDMRGRKIVEQPVYAAYCA 124 >UniRef50_Q2BX31 Putative uncharacterized protein n=3 Tax=Photobacterium RepID=Q2BX31_9GAMM Length = 133 Score = 106 bits (264), Expect = 3e-22, Method: Composition-based stats. 
Identities = 35/110 (31%), Positives = 51/110 (46%), Gaps = 5/110 (4%) Query: 25 CASHNEN--ASLLAKKQAQNISQNLPI-KSAGYTLVLAQSSGTTVKMTIISEAGTQTTQT 81 CASH + A LAK +A I P K Y + A +SG TVK+ II G ++ + Sbjct: 22 CASHEKQQIAETLAKSRASEIDSKAPYPKIDAYQITQAYASGNTVKINIIY--GGKSHVS 79 Query: 82 PDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 P + C + +K + G+NY ITI D Q+ + + TC Sbjct: 80 PSNAAKTAAINYCNNKEIKPLFNSGVNYLITIKDISGRTMAQQAVSKGTC 129 >UniRef50_A3UPE9 Putative uncharacterized protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UPE9_VIBSP Length = 109 Score = 105 bits (262), Expect = 5e-22, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 45/112 (40%), Gaps = 4/112 (3%) Query: 21 LLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTTQ 80 +L+GC ++ LA +A IS LP + TLV AQS G +V + I + T Sbjct: 1 MLNGCQISPIKSNGLADYRATVISSQLPQELGSITLVNAQSEGNSVTLVFIK----KKTL 56 Query: 81 TPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTCG 132 D + C + ++ ++ GI+Y I + + C Sbjct: 57 NMDRLVEKVAVSFCDNIEIRPLLESGISYRIITLGKNEKVESLNVISLAKCS 108 >UniRef50_Q6LSN6 Putative uncharacterized protein n=2 Tax=Photobacterium profundum RepID=Q6LSN6_PHOPR Length = 118 Score = 96.8 bits (239), Expect = 2e-19, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 56/122 (45%), Gaps = 17/122 (13%) Query: 15 LLSVTGLLSGCASHNEN----ASLLAKKQAQNISQNLPIKS-AGYTLVLAQSSGTTVKMT 69 ++ V+ LL+GC+S +N + LAK+ AQ I+ + G +V A + G TV +T Sbjct: 9 IIVVSILLAGCSSSKQNDLMAMNRLAKQHAQTINDAKHVAMKEGVEIVEAYAIGPTVVVT 68 Query: 70 IISEAGTQTTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRT 129 + G D + + C P +K ++ +G+ Y D + +D+T Sbjct: 69 L---QGND----LDGLKSWIRPTFCQQPKIKPLLEQGLQY--VFRDQNGE---EYLVDKT 116 Query: 130 TC 131 +C Sbjct: 117 SC 118 >UniRef50_B9ZPA7 Putative uncharacterized protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZPA7_9GAMM Length = 148 Score = 40.9 bits (94), Expect = 0.012, Method: Composition-based stats. 
Identities = 14/96 (14%), Positives = 33/96 (34%), Gaps = 7/96 (7%) Query: 43 ISQNLPIKSAGYTLVLAQSSGTTVKMTIISEA---GTQTTQTPDA----FLTSYQRQMCA 95 I++++PI + + S ++ +T + P R+ C Sbjct: 47 INRSVPIVTEPGVRLDGASIDPSIGITYRYTLTGLDGGQVENPQGTRQTMREYSIRESCG 106 Query: 96 DPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTC 131 P + + G+ + T D++ +D+ C Sbjct: 107 HPDIAPALRAGVTFIHTYRDSKGRRFTSITVDQNAC 142 >UniRef50_A1R2M1 Putative uncharacterized protein n=1 Tax=Arthrobacter aurescens TC1 RepID=A1R2M1_ARTAT Length = 285 Score = 40.5 bits (93), Expect = 0.017, Method: Composition-based stats. Identities = 17/125 (13%), Positives = 52/125 (41%), Gaps = 2/125 (1%) Query: 10 VLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYT-LVLAQSSGTTVKM 68 +++ +L+++G +G + E+ ++A+++ LP++ T L ++ G ++ Sbjct: 159 LMLGAVLALSGAFNGSSPSVESTVKSVVEKARDLQGELPMQVDSVTSLTGIEAEGDAIRY 218 Query: 69 TIISEAGTQ-TTQTPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLD 127 ++ + +T + ++ +CA + K ++ GI + + Sbjct: 219 DLVLSSSVDPSTLSAESVRGMVLPTVCAATSTKEILDAGIKMKYVYTFEGSAKSVDFTVT 278 Query: 128 RTTCG 132 + C Sbjct: 279 KAACA 283 >UniRef50_D0S5V0 Predicted protein n=1 Tax=Acinetobacter calcoaceticus RUH2202 RepID=D0S5V0_ACICA Length = 274 Score = 39.0 bits (89), Expect = 0.045, Method: Composition-based stats. 
Identities = 16/102 (15%), Positives = 36/102 (35%), Gaps = 8/102 (7%) Query: 40 AQNISQNLPIKSA---GYTLVLAQSSGTTVKMTIISEAGTQTTQTPDAFLT-----SYQR 91 A+ I++NLP++ V AQ T+ +I + + Sbjct: 173 AKMINKNLPMQVDRETNLEHVEAQGKELTLNYRLIHITSADMPAEWVQAMEVNGVPEIVK 232 Query: 92 QMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTCGI 133 +C + +K ++ +G ++ + K+ + C I Sbjct: 233 GVCDNIKMKELLNKGAQLNLKYLYKDNVPLTKIKMTKQDCNI 274 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.126 0.301 Lambda K H 0.267 0.0395 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 493,353,056 Number of Sequences: 3077464 Number of extensions: 11751764 Number of successful extensions: 45566 Number of sequences better than 1.0e-01: 15 Number of HSP's better than 0.1 without gapping: 25 Number of HSP's successfully gapped in prelim test: 10 Number of HSP's that attempted gapping in prelim test: 45518 Number of HSP's gapped (non-prelim): 40 length of query: 136 length of database: 1,040,396,356 effective HSP length: 101 effective length of query: 35 effective length of database: 729,572,492 effective search space: 25535037220 effective search space used: 25535037220 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 87 (38.2 bits)
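
Note on the statistics footer: the sketch below re-derives the effective lengths, the effective search space, and the bit scores printed for S1 and S2 from the other numbers in the footer. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, E = search space * 2^-bits). Pairing the first Lambda/K row (0.308, 0.126) with S1 and the second row (0.267, 0.0395) with S2 is an inference from matching the printed bit values; the report itself does not label the rows. All variable and function names are illustrative.

```python
import math

# Values copied from the report footer.
query_len = 136
db_letters = 1_040_396_356
db_seqs = 3_077_464
hsp_len = 101  # "effective HSP length"

# Effective lengths: subtract the HSP length correction once from the query
# and once per database sequence.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, space):
    """Expected number of chance hits with at least this bit score."""
    return space * 2.0 ** (-bits)


# Assumed pairing: first Lambda/K row with S1, second row with S2.
s1_bits = bit_score(41, 0.308, 0.126)
s2_bits = bit_score(87, 0.267, 0.0395)
e_at_s2 = expect(s2_bits, search_space)

print(eff_query, eff_db, search_space)
print(round(s1_bits, 1), round(s2_bits, 1))
print(e_at_s2)
```

Under these assumptions the computed effective lengths and search space agree with the footer, and the E-value at the S2 cutoff falls just under the 1.0e-01 reporting threshold named in "Number of sequences better than 1.0e-01".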