BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (292 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P39267 Uncharacterized protein yjcZ n=62 Tax=Enterobact... 597 e-169 UniRef50_A7MPJ2 Putative uncharacterized protein n=1 Tax=Cronoba... 194 2e-48 UniRef50_B7UP35 Predicted protein n=54 Tax=Enterobacteriaceae Re... 125 3e-27 UniRef50_A4JJZ8 Putative uncharacterized protein n=1 Tax=Burkhol... 80 1e-13 UniRef50_D0SVD9 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 69 2e-10 >UniRef50_P39267 Uncharacterized protein yjcZ n=62 Tax=Enterobacteriaceae RepID=YJCZ_ECOLI Length = 292 Score = 597 bits (1538), Expect = e-169, Method: Compositional matrix adjust. Identities = 292/292 (100%), Positives = 292/292 (100%) Query: 1 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG 60 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG Sbjct: 1 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG 60 Query: 61 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY 120 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY Sbjct: 61 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY 120 Query: 121 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF Sbjct: 121 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 Query: 181 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL 240 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL Sbjct: 181 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL 240 Query: 241 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA 292 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA Sbjct: 241 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA 292 >UniRef50_A7MPJ2 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MPJ2_ENTS8 Length = 265 Score = 194 bits (494), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 116/273 (42%), Positives = 160/273 (58%), Gaps = 16/273 (5%) Query: 21 LVDLAQGDDA-RLPQAH--QQQFRERLMQEL-LSRVQLQTWTNGGMLNAPLSLRLTLVEK 76 +VDLA+G+D R ++H Q RERL + L ++R + + P L L+E Sbjct: 1 MVDLAKGEDEERYGRSHPQHQNMRERLQKSLAVTRAE-------PIAATPEVAALHLLES 53 Query: 77 LASMLDPGHLALTQIAQHLALL-QKMDHRQHSA-FPELPQQIAALYEWFSARCRWKEKAL 134 L+ PGHLAL+ AQ LA Q++ + A F + + A Y+ +AR E L Sbjct: 54 LSRQSPPGHLALSLTAQALARRSQRLAEKGVCAPFAQALEVTAGQYQHSAARL---EAQL 110 Query: 135 TQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLGSPQAVAL 194 Q LL A + ++ RW+ G +N WS GRC++ L+EL WGAFGDA RLG PQ Sbjct: 111 RQGDLLAAAQRHASEVMARWQRGEFNGWSPVGRCYVALQELHWGAFGDALRLGEPQYKTA 170 Query: 195 LLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFLSWLGKWTTADKQPV 254 LL + + + LA S+NA+P TRH+YHQW + P EH D L WLG +++QPV Sbjct: 171 LLPPVYDDTARLLAHSVNASPDTRHFYHQWLHTPPQPGLLEHKDILCWLGAMHDSERQPV 230 Query: 255 CWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIF 287 WSVTQ WQ+V+LGMPR+CSA+RL A++EEIF Sbjct: 231 SWSVTQTWQSVSLGMPRICSARRLVNALVEEIF 263 >UniRef50_B7UP35 Predicted protein n=54 Tax=Enterobacteriaceae RepID=B7UP35_ECO27 Length = 301 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 85/281 (30%), Positives = 135/281 (48%), Gaps = 4/281 (1%) Query: 11 RVLESVYPRFLVDLAQGDDA---RLPQAHQQQFRERLMQELLSRVQLQTWTNGGMLNAPL 67 +VL + +F+VD A G D + A ++ F RL + L + L + Sbjct: 16 QVLTCLPEKFVVDFANGIDVAQEHIRTAGERTFFRRLKEGLTGEGAARQNAINASLAQGV 75 Query: 68 SLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALYEWFSARC 127 L + ++ + L + A+T++ ++ L R + +Q+ L + + Sbjct: 76 EASLRWLTEMTTSLATTNYAITRVNDRVSSLVSDTARLAHYSADTREQLLTLADQVHHKL 135 Query: 128 RWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLG 187 E+ L + + +A EQIF+ W AG Y ++S GRC++ LEELRWGAFGD R G Sbjct: 136 NHLEEKLHRVDQVQRAQLHLEQIFSWWSAGRYASFSPAGRCYVALEELRWGAFGDVIRQG 195 Query: 188 SPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFAS-STVPTGGEHADFLSWLGKW 246 V LL L KA +A+ + T R W E D ++WLG W Sbjct: 196 ETGQVNQLLDILRHKALTQMAQESGGSATVRLNTLDWLGGQGREQADNEWHDAINWLGDW 255 Query: 247 TTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIF 287 + ++ PV WS TQ + + + MPRLCSA+RL+ +M++EIF Sbjct: 256 CSEEQHPVIWSTTQAAEHLPVRMPRLCSAERLSESMVDEIF 296 >UniRef50_A4JJZ8 Putative uncharacterized protein n=1 Tax=Burkholderia vietnamiensis G4 RepID=A4JJZ8_BURVG Length = 300 Score = 79.7 bits (195), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 46/140 (32%), Positives = 69/140 (49%) Query: 148 EQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLGSPQAVALLLGDLLEKATQHL 207 +Q+F RWRAG + S GR + LE L WGAFGD CR + D + + T L Sbjct: 155 DQVFNRWRAGRFAGLSPAGRAYAALECLWWGAFGDYCRYQEGPQRHAFVEDAMNRVTAQL 214 Query: 208 AESINAAPTTRHYYHQWFASSTVPTGGEHADFLSWLGKWTTADKQPVCWSVTQRWQTVAL 267 A+ A + R W A+ + + A LS+LG + PV +SV+Q + L Sbjct: 215 ADDARCATSERIDTSWWLAAGSTAEQRDGAAALSYLGATLNPARAPVAFSVSQTPTRLPL 274 Query: 268 GMPRLCSAQRLAGAMLEEIF 287 MP + +R+ A++ E+F Sbjct: 275 EMPLRATPRRIGEALVRELF 294 >UniRef50_D0SVD9 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SVD9_ACILW Length = 295 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 58/225 (25%), Positives = 104/225 (46%), Gaps = 20/225 (8%) Query: 77 LASMLDPGHLALTQIAQHLALLQKMDHRQHSA--FPELPQQIAALYEWFSARCRWKEKAL 134 L S L G AL + Q KM Q+S + Q L E + + L Sbjct: 77 LTSELSVGFSALKTVNQ------KMTEVQNSVTDLAKFSLQTQDLLEKLTENVNYHISDL 130 Query: 135 TQRGLLVQAGDQSEQ----IFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRL--GS 188 QR LV+A ++ + +F +W+A + S R + VLE L WG FG+ R + Sbjct: 131 DQRLSLVEAESRAHRQINLLFKQWKANHFQILSPLERLYTVLERLYWGDFGEYYRNYHSN 190 Query: 189 PQAVALLLGDLLEKATQHLAESI--NAAPTTRHYYH--QWFASSTVPTGGEHADFLSWLG 244 P+A+ + + DL + E + + + Y H +W + ST P + +++G Sbjct: 191 PKAI-VTVEDLKQTIYFEAIECLQNDMQIESDEYLHPSKW-SKSTRPLSVSIKESYAFMG 248 Query: 245 KWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSV 289 WT D+ P+ + +Q+ + +++ +PR+ +A +L+ L E+F + Sbjct: 249 DWTKDDQMPISYYASQQPEQLSIYVPRILTATKLSQYALNEMFGI 293 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39267 Uncharacterized protein yjcZ n=62 Tax=Enterobact... 414 e-114 UniRef50_B7UP35 Predicted protein n=54 Tax=Enterobacteriaceae Re... 359 5e-98 UniRef50_A7MPJ2 Putative uncharacterized protein n=1 Tax=Cronoba... 308 1e-82 UniRef50_D0SVD9 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 275 1e-72 UniRef50_A4JJZ8 Putative uncharacterized protein n=1 Tax=Burkhol... 267 2e-70 Sequences not found previously or not previously below threshold: UniRef50_B8HLV1 Putative uncharacterized protein n=2 Tax=Cyanoba... 52 2e-05 >UniRef50_P39267 Uncharacterized protein yjcZ n=62 Tax=Enterobacteriaceae RepID=YJCZ_ECOLI Length = 292 Score = 414 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 292/292 (100%), Positives = 292/292 (100%) Query: 1 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG 60 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG Sbjct: 1 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG 60 Query: 61 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY 120 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY Sbjct: 61 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY 120 Query: 121 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF Sbjct: 121 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 Query: 181 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL 240 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL Sbjct: 181 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL 240 Query: 241 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA 292 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA Sbjct: 241 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA 292 >UniRef50_B7UP35 Predicted protein n=54 Tax=Enterobacteriaceae RepID=B7UP35_ECO27 Length = 301 Score = 359 bits (922), Expect = 5e-98, Method: Composition-based stats. Identities = 85/281 (30%), Positives = 135/281 (48%), Gaps = 4/281 (1%) Query: 11 RVLESVYPRFLVDLAQGDD---ARLPQAHQQQFRERLMQELLSRVQLQTWTNGGMLNAPL 67 +VL + +F+VD A G D + A ++ F RL + L + L + Sbjct: 16 QVLTCLPEKFVVDFANGIDVAQEHIRTAGERTFFRRLKEGLTGEGAARQNAINASLAQGV 75 Query: 68 SLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALYEWFSARC 127 L + ++ + L + A+T++ ++ L R + +Q+ L + + Sbjct: 76 EASLRWLTEMTTSLATTNYAITRVNDRVSSLVSDTARLAHYSADTREQLLTLADQVHHKL 135 Query: 128 RWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLG 187 E+ L + + +A EQIF+ W AG Y ++S GRC++ LEELRWGAFGD R G Sbjct: 136 NHLEEKLHRVDQVQRAQLHLEQIFSWWSAGRYASFSPAGRCYVALEELRWGAFGDVIRQG 195 Query: 188 SPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFAS-STVPTGGEHADFLSWLGKW 246 V LL L KA +A+ + T R W E D ++WLG W Sbjct: 196 ETGQVNQLLDILRHKALTQMAQESGGSATVRLNTLDWLGGQGREQADNEWHDAINWLGDW 255 Query: 247 TTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIF 287 + ++ PV WS TQ + + + MPRLCSA+RL+ +M++EIF Sbjct: 256 CSEEQHPVIWSTTQAAEHLPVRMPRLCSAERLSESMVDEIF 296 >UniRef50_A7MPJ2 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MPJ2_ENTS8 Length = 265 Score = 308 bits (789), Expect = 1e-82, Method: Composition-based stats. Identities = 116/273 (42%), Positives = 160/273 (58%), Gaps = 16/273 (5%) Query: 21 LVDLAQGDDA-RLPQAH--QQQFRERLMQEL-LSRVQLQTWTNGGMLNAPLSLRLTLVEK 76 +VDLA+G+D R ++H Q RERL + L ++R + + P L L+E Sbjct: 1 MVDLAKGEDEERYGRSHPQHQNMRERLQKSLAVTRAE-------PIAATPEVAALHLLES 53 Query: 77 LASMLDPGHLALTQIAQHLALL-QKMDHRQHSA-FPELPQQIAALYEWFSARCRWKEKAL 134 L+ PGHLAL+ AQ LA Q++ + A F + + A Y+ +AR E L Sbjct: 54 LSRQSPPGHLALSLTAQALARRSQRLAEKGVCAPFAQALEVTAGQYQHSAARL---EAQL 110 Query: 135 TQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLGSPQAVAL 194 Q LL A + ++ RW+ G +N WS GRC++ L+EL WGAFGDA RLG PQ Sbjct: 111 RQGDLLAAAQRHASEVMARWQRGEFNGWSPVGRCYVALQELHWGAFGDALRLGEPQYKTA 170 Query: 195 LLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFLSWLGKWTTADKQPV 254 LL + + + LA S+NA+P TRH+YHQW + P EH D L WLG +++QPV Sbjct: 171 LLPPVYDDTARLLAHSVNASPDTRHFYHQWLHTPPQPGLLEHKDILCWLGAMHDSERQPV 230 Query: 255 CWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIF 287 WSVTQ WQ+V+LGMPR+CSA+RL A++EEIF Sbjct: 231 SWSVTQTWQSVSLGMPRICSARRLVNALVEEIF 263 >UniRef50_D0SVD9 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SVD9_ACILW Length = 295 Score = 275 bits (703), Expect = 1e-72, Method: Composition-based stats. Identities = 65/295 (22%), Positives = 126/295 (42%), Gaps = 24/295 (8%) Query: 11 RVLESVYPRFLVDLAQGDDA---RLP-QAHQQQFRERLMQELLSRVQLQTWTNGGMLNAP 66 ++ + +F++++A + Q+ + RL+ ++ L Sbjct: 7 EIMPCIADKFVIEIANSIHVSRDHIRVQSTRLNKMSRLVDSFTGASSIRQNQVNDNLAQG 66 Query: 67 LSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSA--FPELPQQIAALYEWFS 124 L + L S L G AL + Q KM Q+S + Q L E + Sbjct: 67 LEGAFEWLNALTSELSVGFSALKTVNQ------KMTEVQNSVTDLAKFSLQTQDLLEKLT 120 Query: 125 ARCRWKEKALTQRGLLVQAGDQSEQ----IFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 + L QR LV+A ++ + +F +W+A + S R + VLE L WG F Sbjct: 121 ENVNYHISDLDQRLSLVEAESRAHRQINLLFKQWKANHFQILSPLERLYTVLERLYWGDF 180 Query: 181 GDACRL--GSPQAVALLLGDLLEKATQHLAESI--NAAPTTRHYYH--QWFASSTVPTGG 234 G+ R +P+A+ + + DL + E + + + Y H +W + ST P Sbjct: 181 GEYYRNYHSNPKAI-VTVEDLKQTIYFEAIECLQNDMQIESDEYLHPSKW-SKSTRPLSV 238 Query: 235 EHADFLSWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSV 289 + +++G WT D+ P+ + +Q+ + +++ +PR+ +A +L+ L E+F + Sbjct: 239 SIKESYAFMGDWTKDDQMPISYYASQQPEQLSIYVPRILTATKLSQYALNEMFGI 293 >UniRef50_A4JJZ8 Putative uncharacterized protein n=1 Tax=Burkholderia vietnamiensis G4 RepID=A4JJZ8_BURVG Length = 300 Score = 267 bits (683), Expect = 2e-70, Method: Composition-based stats. Identities = 62/282 (21%), Positives = 107/282 (37%), Gaps = 4/282 (1%) Query: 11 RVLESVYPRFLVDLAQGDDARLPQAHQQ----QFRERLMQELLSRVQLQTWTNGGMLNAP 66 + L + + +VD A + Q F R + R + Sbjct: 14 KTLTCLPEKLVVDFANCVAESKGRLAGQFSRNGFFSRCWEGFTGRSAERQGALNQNFAEG 73 Query: 67 LSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALYEWFSAR 126 L + L + +L + +LA+ + + + + ++ A R Sbjct: 74 LEVTLQWLTELTESVTQSNLAVGLVHDTVNRMMLDVADVADFSMSVKNELLASTLRLEKR 133 Query: 127 CRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRL 186 L + + + +Q+F RWRAG + S GR + LE L WGAFGD CR Sbjct: 134 LDALADDLARVDFIQRVQLNLDQVFNRWRAGRFAGLSPAGRAYAALECLWWGAFGDYCRY 193 Query: 187 GSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFLSWLGKW 246 + D + + T LA+ A + R W A+ + + A LS+LG Sbjct: 194 QEGPQRHAFVEDAMNRVTAQLADDARCATSERIDTSWWLAAGSTAEQRDGAAALSYLGAT 253 Query: 247 TTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFS 288 + PV +SV+Q + L MP + +R+ A++ E+F Sbjct: 254 LNPARAPVAFSVSQTPTRLPLEMPLRATPRRIGEALVRELFG 295 >UniRef50_B8HLV1 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=B8HLV1_CYAP4 Length = 177 Score = 52.1 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 34/151 (22%), Positives = 52/151 (34%), Gaps = 10/151 (6%) Query: 16 VYPRFLVDLAQGDDAR---LPQAHQQQFRERLMQELLSRVQLQTWTNGGMLNAPLSLRLT 72 V + L+DL G L + F R+ +L + + G L A Sbjct: 19 VGDKALIDLVNGLLVSKDILQYRKNRGFFGRIFDKLDGSNEKRRLILDGNLIAGQEALHD 78 Query: 73 LVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAF-------PELPQQIAALYEWFSA 125 V +L + L +ALT+ + L + Q F L Q+ L F Sbjct: 79 WVLELCNSLQISQVALTETQRSLLEARNAIRIQAQRFNSHEALLQSLGVQLEDLARNFGD 138 Query: 126 RCRWKEKALTQRGLLVQAGDQSEQIFTRWRA 156 R E + Q V A + ++IF W A Sbjct: 139 RIDKLESRIHQLETRVAASEDIDRIFVAWEA 169 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P39267 Uncharacterized protein yjcZ n=62 Tax=Enterobact... 389 e-107 UniRef50_B7UP35 Predicted protein n=54 Tax=Enterobacteriaceae Re... 348 1e-94 UniRef50_A4JJZ8 Putative uncharacterized protein n=1 Tax=Burkhol... 342 7e-93 UniRef50_D0SVD9 Predicted protein n=1 Tax=Acinetobacter lwoffii ... 299 1e-79 UniRef50_A7MPJ2 Putative uncharacterized protein n=1 Tax=Cronoba... 296 6e-79 UniRef50_B8HLV1 Putative uncharacterized protein n=2 Tax=Cyanoba... 148 2e-34 Sequences not found previously or not previously below threshold: UniRef50_A3ZZ56 Phosphate ABC transporter, permease protein n=1 ... 44 0.006 CONVERGED! >UniRef50_P39267 Uncharacterized protein yjcZ n=62 Tax=Enterobacteriaceae RepID=YJCZ_ECOLI Length = 292 Score = 389 bits (1000), Expect = e-107, Method: Composition-based stats. Identities = 292/292 (100%), Positives = 292/292 (100%) Query: 1 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG 60 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG Sbjct: 1 MTKTLLDGPGRVLESVYPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNG 60 Query: 61 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY 120 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY Sbjct: 61 GMLNAPLSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALY 120 Query: 121 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF Sbjct: 121 EWFSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 Query: 181 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL 240 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL Sbjct: 181 GDACRLGSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFL 240 Query: 241 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA 292 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA Sbjct: 241 SWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSVNLA 292 >UniRef50_B7UP35 Predicted protein n=54 Tax=Enterobacteriaceae RepID=B7UP35_ECO27 Length = 301 Score = 348 bits (893), Expect = 1e-94, Method: Composition-based stats. Identities = 85/281 (30%), Positives = 135/281 (48%), Gaps = 4/281 (1%) Query: 11 RVLESVYPRFLVDLAQGDD---ARLPQAHQQQFRERLMQELLSRVQLQTWTNGGMLNAPL 67 +VL + +F+VD A G D + A ++ F RL + L + L + Sbjct: 16 QVLTCLPEKFVVDFANGIDVAQEHIRTAGERTFFRRLKEGLTGEGAARQNAINASLAQGV 75 Query: 68 SLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALYEWFSARC 127 L + ++ + L + A+T++ ++ L R + +Q+ L + + Sbjct: 76 EASLRWLTEMTTSLATTNYAITRVNDRVSSLVSDTARLAHYSADTREQLLTLADQVHHKL 135 Query: 128 RWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLG 187 E+ L + + +A EQIF+ W AG Y ++S GRC++ LEELRWGAFGD R G Sbjct: 136 NHLEEKLHRVDQVQRAQLHLEQIFSWWSAGRYASFSPAGRCYVALEELRWGAFGDVIRQG 195 Query: 188 SPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFAS-STVPTGGEHADFLSWLGKW 246 V LL L KA +A+ + T R W E D ++WLG W Sbjct: 196 ETGQVNQLLDILRHKALTQMAQESGGSATVRLNTLDWLGGQGREQADNEWHDAINWLGDW 255 Query: 247 TTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIF 287 + ++ PV WS TQ + + + MPRLCSA+RL+ +M++EIF Sbjct: 256 CSEEQHPVIWSTTQAAEHLPVRMPRLCSAERLSESMVDEIF 296 >UniRef50_A4JJZ8 Putative uncharacterized protein n=1 Tax=Burkholderia vietnamiensis G4 RepID=A4JJZ8_BURVG Length = 300 Score = 342 bits (878), Expect = 7e-93, Method: Composition-based stats. Identities = 62/282 (21%), Positives = 107/282 (37%), Gaps = 4/282 (1%) Query: 11 RVLESVYPRFLVDLAQGDDARLPQAHQQ----QFRERLMQELLSRVQLQTWTNGGMLNAP 66 + L + + +VD A + Q F R + R + Sbjct: 14 KTLTCLPEKLVVDFANCVAESKGRLAGQFSRNGFFSRCWEGFTGRSAERQGALNQNFAEG 73 Query: 67 LSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALYEWFSAR 126 L + L + +L + +LA+ + + + + ++ A R Sbjct: 74 LEVTLQWLTELTESVTQSNLAVGLVHDTVNRMMLDVADVADFSMSVKNELLASTLRLEKR 133 Query: 127 CRWKEKALTQRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRL 186 L + + + +Q+F RWRAG + S GR + LE L WGAFGD CR Sbjct: 134 LDALADDLARVDFIQRVQLNLDQVFNRWRAGRFAGLSPAGRAYAALECLWWGAFGDYCRY 193 Query: 187 GSPQAVALLLGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFLSWLGKW 246 + D + + T LA+ A + R W A+ + + A LS+LG Sbjct: 194 QEGPQRHAFVEDAMNRVTAQLADDARCATSERIDTSWWLAAGSTAEQRDGAAALSYLGAT 253 Query: 247 TTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFS 288 + PV +SV+Q + L MP + +R+ A++ E+F Sbjct: 254 LNPARAPVAFSVSQTPTRLPLEMPLRATPRRIGEALVRELFG 295 >UniRef50_D0SVD9 Predicted protein n=1 Tax=Acinetobacter lwoffii SH145 RepID=D0SVD9_ACILW Length = 295 Score = 299 bits (764), Expect = 1e-79, Method: Composition-based stats. Identities = 62/293 (21%), Positives = 124/293 (42%), Gaps = 20/293 (6%) Query: 11 RVLESVYPRFLVDLAQGDDA---RLP-QAHQQQFRERLMQELLSRVQLQTWTNGGMLNAP 66 ++ + +F++++A + Q+ + RL+ ++ L Sbjct: 7 EIMPCIADKFVIEIANSIHVSRDHIRVQSTRLNKMSRLVDSFTGASSIRQNQVNDNLAQG 66 Query: 67 LSLRLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALYEWFSAR 126 L + L S L G AL + Q + +Q + Q L E + Sbjct: 67 LEGAFEWLNALTSELSVGFSALKTVNQKMTEVQNSVTD----LAKFSLQTQDLLEKLTEN 122 Query: 127 CRWKEKALTQRGLLVQAGDQSEQ----IFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGD 182 + L QR LV+A ++ + +F +W+A + S R + VLE L WG FG+ Sbjct: 123 VNYHISDLDQRLSLVEAESRAHRQINLLFKQWKANHFQILSPLERLYTVLERLYWGDFGE 182 Query: 183 ACRL--GSPQAVALLLGDLLEKATQHLAESI--NAAPTTRHYYH--QWFASSTVPTGGEH 236 R +P+A+ + + DL + E + + + Y H +W + ST P Sbjct: 183 YYRNYHSNPKAI-VTVEDLKQTIYFEAIECLQNDMQIESDEYLHPSKW-SKSTRPLSVSI 240 Query: 237 ADFLSWLGKWTTADKQPVCWSVTQRWQTVALGMPRLCSAQRLAGAMLEEIFSV 289 + +++G WT D+ P+ + +Q+ + +++ +PR+ +A +L+ L E+F + Sbjct: 241 KESYAFMGDWTKDDQMPISYYASQQPEQLSIYVPRILTATKLSQYALNEMFGI 293 >UniRef50_A7MPJ2 Putative uncharacterized protein n=1 Tax=Cronobacter sakazakii ATCC BAA-894 RepID=A7MPJ2_ENTS8 Length = 265 Score = 296 bits (758), Expect = 6e-79, Method: Composition-based stats. Identities = 115/272 (42%), Positives = 156/272 (57%), Gaps = 14/272 (5%) Query: 21 LVDLAQGDDA-RLPQAH--QQQFRERLMQELLSRVQLQTWTNGGMLNAPLSLRLTLVEKL 77 +VDLA+G+D R ++H Q RERL + L + P L L+E L Sbjct: 1 MVDLAKGEDEERYGRSHPQHQNMRERLQKSLAVTRAE------PIAATPEVAALHLLESL 54 Query: 78 ASMLDPGHLALTQIAQHLALL-QKMDHRQHSA-FPELPQQIAALYEWFSARCRWKEKALT 135 + PGHLAL+ AQ LA Q++ + A F + + A Y+ +AR E L Sbjct: 55 SRQSPPGHLALSLTAQALARRSQRLAEKGVCAPFAQALEVTAGQYQHSAARL---EAQLR 111 Query: 136 QRGLLVQAGDQSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAFGDACRLGSPQAVALL 195 Q LL A + ++ RW+ G +N WS GRC++ L+EL WGAFGDA RLG PQ L Sbjct: 112 QGDLLAAAQRHASEVMARWQRGEFNGWSPVGRCYVALQELHWGAFGDALRLGEPQYKTAL 171 Query: 196 LGDLLEKATQHLAESINAAPTTRHYYHQWFASSTVPTGGEHADFLSWLGKWTTADKQPVC 255 L + + + LA S+NA+P TRH+YHQW + P EH D L WLG +++QPV Sbjct: 172 LPPVYDDTARLLAHSVNASPDTRHFYHQWLHTPPQPGLLEHKDILCWLGAMHDSERQPVS 231 Query: 256 WSVTQRWQTVALGMPRLCSAQRLAGAMLEEIF 287 WSVTQ WQ+V+LGMPR+CSA+RL A++EEIF Sbjct: 232 WSVTQTWQSVSLGMPRICSARRLVNALVEEIF 263 >UniRef50_B8HLV1 Putative uncharacterized protein n=2 Tax=Cyanobacteria RepID=B8HLV1_CYAP4 Length = 177 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 34/154 (22%), Positives = 53/154 (34%), Gaps = 10/154 (6%) Query: 13 LESVYPRFLVDLAQGDDAR---LPQAHQQQFRERLMQELLSRVQLQTWTNGGMLNAPLSL 69 + V + L+DL G L + F R+ +L + + G L A Sbjct: 16 IPVVGDKALIDLVNGLLVSKDILQYRKNRGFFGRIFDKLDGSNEKRRLILDGNLIAGQEA 75 Query: 70 RLTLVEKLASMLDPGHLALTQIAQHLALLQKMDHRQHSAF-------PELPQQIAALYEW 122 V +L + L +ALT+ + L + Q F L Q+ L Sbjct: 76 LHDWVLELCNSLQISQVALTETQRSLLEARNAIRIQAQRFNSHEALLQSLGVQLEDLARN 135 Query: 123 FSARCRWKEKALTQRGLLVQAGDQSEQIFTRWRA 156 F R E + Q V A + ++IF W A Sbjct: 136 FGDRIDKLESRIHQLETRVAASEDIDRIFVAWEA 169 >UniRef50_A3ZZ56 Phosphate ABC transporter, permease protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ56_9PLAN Length = 846 Score = 44.0 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 51/170 (30%), Gaps = 11/170 (6%) Query: 17 YPRFLVDLAQGDDARLPQAHQQQFRERLMQELLSRVQLQTWTNGGMLNAPLSLRLTLVEK 76 F+V+LA Q Q + +V+ + + + + L Sbjct: 339 PGVFIVELANELARLQGQVAVLDAVGEKNQAISRQVEKRYGADSPLTKFTQEVTQGLASD 398 Query: 77 LASMLDPGHLALTQIAQHLALLQKMDHRQHSAFPELPQQIAALYEWFSARCRWKEKALTQ 136 LA + +I Q +A + K +F + Q+ R R + Sbjct: 399 LAEQQTQPKKRIAEIKQLVAAMPKSTQTAVESFLRIRQESTERTVKLQDRIRQLNDENAR 458 Query: 137 -RGLLVQAGD-----QSEQIFTRWRAGAYNAWSLPGRCFIVLEELRWGAF 180 + A + +I + A + G+ + + RWG F Sbjct: 459 YVLHMQTAQEIDKDIPLSEIVRAFPANR---LGIAGKLGVYMS--RWGEF 503 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.122 0.306 Lambda K H 0.267 0.0374 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,308,380,716 Number of Sequences: 3077464 Number of extensions: 42184903 Number of successful extensions: 122518 Number of sequences better than 1.0e-01: 8 Number of HSP's better than 0.1 without gapping: 15 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 122479 Number of HSP's gapped (non-prelim): 21 length of query: 292 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 165 effective length of database: 649,558,428 effective search space: 107177140620 effective search space used: 107177140620 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 92 (40.2 bits)