BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (335 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P46142 Uncharacterized protein yggM n=91 Tax=Bacteria R... 460 e-128 UniRef50_B7LPS6 Putative uncharacterized protein n=1 Tax=Escheri... 458 e-127 UniRef50_B4TEF4 Putative periplasmic protein n=8 Tax=Salmonella ... 366 e-100 UniRef50_UPI000194CDF9 PREDICTED: similar to WD repeat domain 65... 45 0.003 UniRef50_A8N8S8 Putative uncharacterized protein n=2 Tax=Agarica... 44 0.010 UniRef50_UPI0001758695 PREDICTED: similar to AGAP007865-PA n=1 T... 42 0.036 UniRef50_UPI000069F0EA Angiomotin-like protein 2 (Leman coiled-c... 41 0.058 UniRef50_UPI00015B58FD PREDICTED: similar to rho/rac-interacting... 41 0.073 UniRef50_A8BUP1 Coiled-coil protein n=2 Tax=Giardia intestinalis... 41 0.074 UniRef50_C0CPE0 Putative uncharacterized protein n=1 Tax=Blautia... 41 0.077 UniRef50_A6UUX2 SMC domain protein n=1 Tax=Methanococcus aeolicu... 41 0.086 >UniRef50_P46142 Uncharacterized protein yggM n=91 Tax=Bacteria RepID=YGGM_ECOLI Length = 335 Score = 460 bits (1184), Expect = e-128, Method: Composition-based stats. 
Identities = 335/335 (100%), Positives = 335/335 (100%) Query: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA Sbjct: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 Query: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF Sbjct: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 Query: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM Sbjct: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 Query: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA Sbjct: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 Query: 241 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI 300 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI Sbjct: 241 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI 300 Query: 301 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK 335 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK Sbjct: 301 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK 335 >UniRef50_B7LPS6 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LPS6_ESCF3 Length = 334 Score = 458 bits (1177), Expect = e-127, Method: Composition-based stats. 
Identities = 131/335 (39%), Positives = 202/335 (60%), Gaps = 1/335 (0%) Query: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 MK+QWIVGT L +LM+G WA+ PP E LK QF + GI++LD I+LK + ++GNQ+ Sbjct: 1 MKRQWIVGTTLFVLMSGYVWAEAVPPDEKTLKTQFNDDFAGIMRLDQISLKPVSSEGNQS 60 Query: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 TWSAEGD+S+++DLY G ADY +E+TW KD+ VKFSAM+ +KGTP SGW+ F+S Sbjct: 61 TWSAEGDMSATEDLYVMAGMAADYRFMEKTWVKDQSVKFSAMVKAKGTPDSGWTTEFFSM 120 Query: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 Q AA + G + KYLI +F + +++E+ K+ I +++ K L KQ Sbjct: 121 QTAAKNMGYPLPKPDEKIKYLITTDSNFYAQLAKVEAGYGEMKDKIERSKEQEKELQKQY 180 Query: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 + +WGKDA G+Q TR + +++ Q+ E ++QND F Y + VY PA+AA Sbjct: 181 DDVGEKIKTFWGKDAKGEQRTRYNVQQEMLQKMYEADRQNDPLKFENNYYETVYSPALAA 240 Query: 241 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI 300 C + +C P++ +RD + E++R Q Q + K++ D L++ PL+ K + + Sbjct: 241 CQAKP-DCDAAPLRAERDAVVAEKKRYYVTQHQLMLEKIKTDMAELDEKVKPLSDKQTAL 299 Query: 301 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK 335 N +++ + +D+ +RW D +LRR GVIK Sbjct: 300 NHERIKLAYASEDLQAEYDRWNNDITELRRRGVIK 334 >UniRef50_B4TEF4 Putative periplasmic protein n=8 Tax=Salmonella enterica RepID=B4TEF4_SALHS Length = 333 Score = 366 bits (938), Expect = e-100, Method: Composition-based stats. 
Identities = 96/318 (30%), Positives = 169/318 (53%), Gaps = 26/318 (8%) Query: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 MK +++ G + +L++G D PTE +LK+QF Q+HG L LD+I +K GN+ Sbjct: 1 MKGKFLCGLLVSLLISG--CGDDNTPTEKVLKEQFSNQFHGRLILDSIDIKETSVDGNKR 58 Query: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 T++A+G +S+ DLYT V L DY +++++W K K +KFSA L S G +GW F S Sbjct: 59 TYAADGLLSTGYDLYTPVASLTDYIVVQKSWDKGKDIKFSATLNSLGNKDTGWKTIFSSL 118 Query: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 Q + + +G + +++T+ KY+I++ F+ + + ++ +K + L ++ + + Sbjct: 119 QMSETPKGNPIPNVETDGKYIIMDGAGFDDKINAIKDEYARKKLKLNELNNDIAKVKTNI 178 Query: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 + K D YWG+ +GK +R + ++++ + FNK+N F KY+ EV+ PA+ A Sbjct: 179 LVINKEIDEYWGRGEDGKTQSRYFVQRDLNKELELFNKENAPYYFEKKYNAEVFDPAMKA 238 Query: 241 CHKQSEECYEVPIQQKRDF-DINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSE 299 ++ + Y + DF DI ++R LEK + ++K +E Sbjct: 239 -RREKLKNYRLS-----DFDDIRAEKR-----------------AVLEKHKEEYSVKYNE 275 Query: 300 INSKKVAILMKIDDINQA 317 IN K A + +DD Q Sbjct: 276 INEKIKAKMKVLDDGLQE 293 >UniRef50_UPI000194CDF9 PREDICTED: similar to WD repeat domain 65 n=2 Tax=Neognathae RepID=UPI000194CDF9 Length = 980 Score = 45.2 bits (105), Expect = 0.003, Method: Composition-based stats. 
Identities = 37/239 (15%), Positives = 91/239 (38%), Gaps = 21/239 (8%) Query: 114 SVNFYSFQAAASDRGRVVDDIKTNNKYLIVNS--EDFNYRFSQLESALNTQKNSIPALE- 170 +F +QA A +V ++ K + N ++ + ++E Q + + + Sbjct: 611 KKDFSEYQAHAGAVTKVAQKSDSSQKLSMENEKYQELQVKSQKMEEEYEKQLHKLEESKI 670 Query: 171 KEVKALDKQMV-----------AAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQ 219 + V L + A+++ ++ ED K+I + +D++ KQ Sbjct: 671 RAVAELKEHYEKKLKHKSVLLEEAKESIRKQIEAHEEMEKQIYEDGDKEILELKDKYEKQ 730 Query: 220 -------NDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDINEQRRQTFLQS 272 N + + ++ + E+ ++Q++ DI + + L Sbjct: 731 LLEEKESNMQLKGEIGVMNKRLNSLQKELKDRNRDIEEMRLEQQKLQDIIKSLEKDILAL 790 Query: 273 QKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQANERWKKDTEQLRRN 331 + ++ + + EK Y L MK E+ + K + +I++ + E + D E +++ Sbjct: 791 KTDVKERTETILEKEKHVYDLKMKNQELENFKFVLSYRIEEFKKQIESRENDIETMKKQ 849 >UniRef50_A8N8S8 Putative uncharacterized protein n=2 Tax=Agaricales RepID=A8N8S8_COPC7 Length = 1682 Score = 43.6 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 71/201 (35%), Gaps = 9/201 (4%) Query: 125 SDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQMVAAQ 184 D G + + ++ + + N + +LE + + I LEKE+ + + Sbjct: 789 KDLGGALKEANERVTQVMGDLRNANSQIKELEEEVVRADHRIDELEKELAEDKDVIANLE 848 Query: 185 KAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQ 244 + + + E + + N+ N ++ + + ++ IA + Sbjct: 849 EEV----ASQTDALEKEHEKVKTLENSLTELENELNATKEYVNELEE---GATIAVEQIE 901 Query: 245 SEECYEVPIQQKRDFDINEQRR--QTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEINS 302 E Q+ D + + ++ +++ QD LE+ + K SE Sbjct: 902 KLEHDLAQAQETIDAMTAAEENTAKMIKDLEEENQRAQDLHSQLEEALHQAESKASEDAD 961 Query: 303 KKVAILMKIDDINQANERWKK 323 + K+ + + +RWK+ Sbjct: 962 TISELESKVSSLERERDRWKQ 982 >UniRef50_UPI0001758695 PREDICTED: similar to AGAP007865-PA n=1 Tax=Tribolium castaneum RepID=UPI0001758695 Length = 1757 Score = 41.7 bits (96), Expect = 0.036, Method: Composition-based stats. 
Identities = 25/180 (13%), Positives = 71/180 (39%), Gaps = 13/180 (7%) Query: 153 SQLESALNTQKNSIPALEKEVKALDKQMVAAQKAADAYWGKDANGKQMTREDAFKKIHQQ 212 + L + + + LE+E + L K++ + NG + + + + Sbjct: 30 ASLRTRYSELLSEKQRLEQENQRLRKELNEVHRQQQDVVLLAENGNSDSLYISHSQALSK 89 Query: 213 RDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDINEQRRQTFLQS 272 + +N+ A + ++ Q I + +Q + Y+ + ++ ++ E+ Sbjct: 90 LELAKDENNRLAKQFEQERNNAQREIDSLKQQVLKYYKAHQKVQQQYEEAEK-------- 141 Query: 273 QKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQANERWKKDTEQLRRNG 332 Q ++ K++ + K LT + + ++ ++ + D +++ E+ D Q + G Sbjct: 142 QIVAIKIKAN-----KEVKRLTDERNATLAEYTLVMSERDQVHKEMEKLSDDLAQALKKG 196 >UniRef50_UPI000069F0EA Angiomotin-like protein 2 (Leman coiled-coil protein) (LCCP). n=2 Tax=Tetrapoda RepID=UPI000069F0EA Length = 734 Score = 40.9 bits (94), Expect = 0.058, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 70/190 (36%), Gaps = 11/190 (5%) Query: 149 NYRFSQLESALNTQKNSIPALEKEVKALDKQMVAA------QKAADAYWGKDANGKQMTR 202 N + + + + + I LE E++ + + + ++A + G+ Sbjct: 303 NEKLRRELESYSEKAVKIQKLETEIQRISEAYESLMKASSKREALENAMRTRMEGEIRRM 362 Query: 203 EDAFKKIHQQRDEFNKQNDSEAFAVKYDKE-VYQPAIAACHKQSEECYEVPIQQKRDFDI 261 +D + + ++ D NKQ +++ + D + V IA C +Q ++ ++ + Sbjct: 363 QDFNRDLRERLDSANKQLAAKSVEQREDSQGVVSKLIAQCREQQQDREKLEREVTLLRSA 422 Query: 262 NEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQANERW 321 NE +R+ ++ Q E K K + + + A E+ Sbjct: 423 NEDQRRRAELLEQALSNAQSRAARAEDELR----KKRAYVEKVERLQQALSQLQAACEKR 478 Query: 322 KKDTEQLRRN 331 ++ +LR Sbjct: 479 EQLELRLRTR 488 >UniRef50_UPI00015B58FD PREDICTED: similar to rho/rac-interacting citron kinase n=1 Tax=Nasonia vitripennis RepID=UPI00015B58FD Length = 1545 Score = 40.5 bits (93), Expect = 0.073, Method: Composition-based stats. 
Identities = 26/188 (13%), Positives = 72/188 (38%), Gaps = 26/188 (13%) Query: 143 VNSEDFNYRFSQLESALNTQKNS-IPALEKEVKALDKQMVAAQKAADAYWGKDANGKQMT 201 ++ + + R + E+ +K + ++ + + ++ K+ G + + Sbjct: 472 LSDANLDKRIATREAKTAEEKVKSLQEEKQRLAERLNTKIREEEEKSKKVAKELEGVKNS 531 Query: 202 REDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDI 261 D+ K + + + + + A K + + ++++ D Sbjct: 532 LADSTKDASRNKLQADSAQRALTQANK-------------QIEELQNSSAALRRELD--- 575 Query: 262 NEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQANERW 321 ++K R QD +L+ + L++KVS++N +K + K++ I Q + Sbjct: 576 ---------STRKQLRGSQDRMDSLQTEKERLSLKVSKLNEEKNELESKLEKIQQEANSY 626 Query: 322 KKDTEQLR 329 + + E L+ Sbjct: 627 QVNIELLK 634 >UniRef50_A8BUP1 Coiled-coil protein n=2 Tax=Giardia intestinalis RepID=A8BUP1_GIALA Length = 1080 Score = 40.5 bits (93), Expect = 0.074, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 74/204 (36%), Gaps = 9/204 (4%) Query: 137 NNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQMVAAQKAADAYWGKDAN 196 +N+ IV N R QLES + + I L++E+ ++ K+ A + + A Sbjct: 739 DNQQRIVTVAALNDRVQQLESDKSELRQQIAELKEELSSVRKENAAITTEKEHLINQVAI 798 Query: 197 GKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQK 256 K+ Q++ + +N Y K A + ++ + + ++ + Sbjct: 799 AKEKISYHELLYTDQKKMLEDTENQRIDLHNTYLKTQSTIAELKLNYETSKLRQDSLEAE 858 Query: 257 RDFDINE------QRRQTFLQSQKLSRKLQDDWVTLEKGQYP---LTMKVSEINSKKVAI 307 I + +++ L D L K Q L+ K + Sbjct: 859 NAKLIKSSEVLSGELDYLRKENENLRLDKGTDGDALRKLQQEFDHLSAKYTNQTQDVEDK 918 Query: 308 LMKIDDINQANERWKKDTEQLRRN 331 ++I ++ N+R K E+++ N Sbjct: 919 QLRIQELIFQNDRMMKFIEEVKDN 942 >UniRef50_C0CPE0 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CPE0_9FIRM Length = 1199 Score = 40.5 bits (93), Expect = 0.077, Method: Composition-based stats. 
Identities = 29/167 (17%), Positives = 59/167 (35%), Gaps = 15/167 (8%) Query: 151 RFSQLESALNTQKNSIPALEKEVKALDKQMVAAQKAADAYWGKDANGKQMTREDAFKKIH 210 + + AL+ + + +K+++ Q+ AAQ+ DA + G+ E K + Sbjct: 483 ELEKQKPALDAAEAQLADGKKQLEDAQAQLDAAQEKIDAGKKELEQGEAQIEEAVQKLLS 542 Query: 211 QQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDINEQRRQTFL 270 Q+ Q+ + + ++ +E + + + + D NEQ+ Sbjct: 543 TQQTLKASQSQISDSERQ---------LEDGQREIDENEQKLKEAQEEIDENEQKLIEAE 593 Query: 271 QSQKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQA 317 Q L+D L G+ E + K K+ D Q Sbjct: 594 QD------LKDGESELADGEKEYEDGKKEADEKIADAKRKLKDAEQE 634 >UniRef50_A6UUX2 SMC domain protein n=1 Tax=Methanococcus aeolicus Nankai-3 RepID=A6UUX2_META3 Length = 994 Score = 40.5 bits (93), Expect = 0.086, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 74/205 (36%), Gaps = 15/205 (7%) Query: 134 IKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM---VAAQKAADAY 190 + NK L + + N + LN + + L ++ ++ M K D Sbjct: 633 VDNKNKLLDIIGNNTNKSILNTKKELNNKVGELNELLNLIRNKEQNMKKLNVVNKEIDNL 692 Query: 191 WGKDANGKQMT--REDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEEC 248 K A Q+ + K++ + +D +N+ NDS A Y + Y +I + Sbjct: 693 KDKVATNSQLEIEKNSNEKELIKYKDGYNQYNDSYAVLKNY-ADKYAVSIEEIRNK---- 747 Query: 249 YEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPL---TMKVSEINSKKV 305 V K++ + + KLQ + E + KV+EIN + Sbjct: 748 --VNELLKKENNNLNNLNNEIQNIKLSIEKLQYNIEEFEDIKKDYNKINNKVNEINEDII 805 Query: 306 AILMKIDDINQANERWKKDTEQLRR 330 + I + N+ ++ +QL + Sbjct: 806 KLDTTIKNENKLLSEFENKLKQLEK 830 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P46142 Uncharacterized protein yggM n=91 Tax=Bacteria R... 459 e-128 UniRef50_B7LPS6 Putative uncharacterized protein n=1 Tax=Escheri... 456 e-127 UniRef50_B4TEF4 Putative periplasmic protein n=8 Tax=Salmonella ... 
369 e-101 Sequences not found previously or not previously below threshold: UniRef50_UPI000194CDF9 PREDICTED: similar to WD repeat domain 65... 45 0.004 UniRef50_A8N8S8 Putative uncharacterized protein n=2 Tax=Agarica... 43 0.014 UniRef50_UPI0001758695 PREDICTED: similar to AGAP007865-PA n=1 T... 41 0.047 UniRef50_A8BUP1 Coiled-coil protein n=2 Tax=Giardia intestinalis... 41 0.053 UniRef50_C0CPE0 Putative uncharacterized protein n=1 Tax=Blautia... 41 0.058 UniRef50_UPI00015B58FD PREDICTED: similar to rho/rac-interacting... 41 0.085 CONVERGED! >UniRef50_P46142 Uncharacterized protein yggM n=91 Tax=Bacteria RepID=YGGM_ECOLI Length = 335 Score = 459 bits (1181), Expect = e-128, Method: Composition-based stats. Identities = 335/335 (100%), Positives = 335/335 (100%) Query: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA Sbjct: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 Query: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF Sbjct: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 Query: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM Sbjct: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 Query: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA Sbjct: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 Query: 241 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI 300 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI Sbjct: 241 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI 300 Query: 301 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK 335 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK Sbjct: 301 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK 335 >UniRef50_B7LPS6 Putative 
uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LPS6_ESCF3 Length = 334 Score = 456 bits (1174), Expect = e-127, Method: Composition-based stats. Identities = 131/335 (39%), Positives = 202/335 (60%), Gaps = 1/335 (0%) Query: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 MK+QWIVGT L +LM+G WA+ PP E LK QF + GI++LD I+LK + ++GNQ+ Sbjct: 1 MKRQWIVGTTLFVLMSGYVWAEAVPPDEKTLKTQFNDDFAGIMRLDQISLKPVSSEGNQS 60 Query: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 TWSAEGD+S+++DLY G ADY +E+TW KD+ VKFSAM+ +KGTP SGW+ F+S Sbjct: 61 TWSAEGDMSATEDLYVMAGMAADYRFMEKTWVKDQSVKFSAMVKAKGTPDSGWTTEFFSM 120 Query: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 Q AA + G + KYLI +F + +++E+ K+ I +++ K L KQ Sbjct: 121 QTAAKNMGYPLPKPDEKIKYLITTDSNFYAQLAKVEAGYGEMKDKIERSKEQEKELQKQY 180 Query: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 + +WGKDA G+Q TR + +++ Q+ E ++QND F Y + VY PA+AA Sbjct: 181 DDVGEKIKTFWGKDAKGEQRTRYNVQQEMLQKMYEADRQNDPLKFENNYYETVYSPALAA 240 Query: 241 CHKQSEECYEVPIQQKRDFDINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEI 300 C + +C P++ +RD + E++R Q Q + K++ D L++ PL+ K + + Sbjct: 241 CQAKP-DCDAAPLRAERDAVVAEKKRYYVTQHQLMLEKIKTDMAELDEKVKPLSDKQTAL 299 Query: 301 NSKKVAILMKIDDINQANERWKKDTEQLRRNGVIK 335 N +++ + +D+ +RW D +LRR GVIK Sbjct: 300 NHERIKLAYASEDLQAEYDRWNNDITELRRRGVIK 334 >UniRef50_B4TEF4 Putative periplasmic protein n=8 Tax=Salmonella enterica RepID=B4TEF4_SALHS Length = 333 Score = 369 bits (947), Expect = e-101, Method: Composition-based stats. 
Identities = 96/318 (30%), Positives = 169/318 (53%), Gaps = 26/318 (8%) Query: 1 MKKQWIVGTALLMLMTGNAWADGEPPTENILKDQFKKQYHGILKLDAITLKNLDAKGNQA 60 MK +++ G + +L++G D PTE +LK+QF Q+HG L LD+I +K GN+ Sbjct: 1 MKGKFLCGLLVSLLISG--CGDDNTPTEKVLKEQFSNQFHGRLILDSIDIKETSVDGNKR 58 Query: 61 TWSAEGDVSSSDDLYTWVGQLADYELLEQTWTKDKPVKFSAMLTSKGTPASGWSVNFYSF 120 T++A+G +S+ DLYT V L DY +++++W K K +KFSA L S G +GW F S Sbjct: 59 TYAADGLLSTGYDLYTPVASLTDYIVVQKSWDKGKDIKFSATLNSLGNKDTGWKTIFSSL 118 Query: 121 QAAASDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQM 180 Q + + +G + +++T+ KY+I++ F+ + + ++ +K + L ++ + + Sbjct: 119 QMSETPKGNPIPNVETDGKYIIMDGAGFDDKINAIKDEYARKKLKLNELNNDIAKVKTNI 178 Query: 181 VAAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAA 240 + K D YWG+ +GK +R + ++++ + FNK+N F KY+ EV+ PA+ A Sbjct: 179 LVINKEIDEYWGRGEDGKTQSRYFVQRDLNKELELFNKENAPYYFEKKYNAEVFDPAMKA 238 Query: 241 CHKQSEECYEVPIQQKRDF-DINEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSE 299 ++ + Y + DF DI ++R LEK + ++K +E Sbjct: 239 -RREKLKNYRLS-----DFDDIRAEKR-----------------AVLEKHKEEYSVKYNE 275 Query: 300 INSKKVAILMKIDDINQA 317 IN K A + +DD Q Sbjct: 276 INEKIKAKMKVLDDGLQE 293 >UniRef50_UPI000194CDF9 PREDICTED: similar to WD repeat domain 65 n=2 Tax=Neognathae RepID=UPI000194CDF9 Length = 980 Score = 44.8 bits (104), Expect = 0.004, Method: Composition-based stats. 
Identities = 37/239 (15%), Positives = 91/239 (38%), Gaps = 21/239 (8%) Query: 114 SVNFYSFQAAASDRGRVVDDIKTNNKYLIVNS--EDFNYRFSQLESALNTQKNSIPALE- 170 +F +QA A +V ++ K + N ++ + ++E Q + + + Sbjct: 611 KKDFSEYQAHAGAVTKVAQKSDSSQKLSMENEKYQELQVKSQKMEEEYEKQLHKLEESKI 670 Query: 171 KEVKALDKQMV-----------AAQKAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQ 219 + V L + A+++ ++ ED K+I + +D++ KQ Sbjct: 671 RAVAELKEHYEKKLKHKSVLLEEAKESIRKQIEAHEEMEKQIYEDGDKEILELKDKYEKQ 730 Query: 220 -------NDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDINEQRRQTFLQS 272 N + + ++ + E+ ++Q++ DI + + L Sbjct: 731 LLEEKESNMQLKGEIGVMNKRLNSLQKELKDRNRDIEEMRLEQQKLQDIIKSLEKDILAL 790 Query: 273 QKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQANERWKKDTEQLRRN 331 + ++ + + EK Y L MK E+ + K + +I++ + E + D E +++ Sbjct: 791 KTDVKERTETILEKEKHVYDLKMKNQELENFKFVLSYRIEEFKKQIESRENDIETMKKQ 849 >UniRef50_A8N8S8 Putative uncharacterized protein n=2 Tax=Agaricales RepID=A8N8S8_COPC7 Length = 1682 Score = 43.2 bits (100), Expect = 0.014, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 71/201 (35%), Gaps = 9/201 (4%) Query: 125 SDRGRVVDDIKTNNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQMVAAQ 184 D G + + ++ + + N + +LE + + I LEKE+ + + Sbjct: 789 KDLGGALKEANERVTQVMGDLRNANSQIKELEEEVVRADHRIDELEKELAEDKDVIANLE 848 Query: 185 KAADAYWGKDANGKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQ 244 + + + E + + N+ N ++ + + ++ IA + Sbjct: 849 EEV----ASQTDALEKEHEKVKTLENSLTELENELNATKEYVNELEE---GATIAVEQIE 901 Query: 245 SEECYEVPIQQKRDFDINEQRR--QTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEINS 302 E Q+ D + + ++ +++ QD LE+ + K SE Sbjct: 902 KLEHDLAQAQETIDAMTAAEENTAKMIKDLEEENQRAQDLHSQLEEALHQAESKASEDAD 961 Query: 303 KKVAILMKIDDINQANERWKK 323 + K+ + + +RWK+ Sbjct: 962 TISELESKVSSLERERDRWKQ 982 >UniRef50_UPI0001758695 PREDICTED: similar to AGAP007865-PA n=1 Tax=Tribolium castaneum RepID=UPI0001758695 Length = 1757 Score = 41.3 bits (95), Expect = 0.047, Method: Composition-based stats. 
Identities = 25/180 (13%), Positives = 71/180 (39%), Gaps = 13/180 (7%) Query: 153 SQLESALNTQKNSIPALEKEVKALDKQMVAAQKAADAYWGKDANGKQMTREDAFKKIHQQ 212 + L + + + LE+E + L K++ + NG + + + + Sbjct: 30 ASLRTRYSELLSEKQRLEQENQRLRKELNEVHRQQQDVVLLAENGNSDSLYISHSQALSK 89 Query: 213 RDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDINEQRRQTFLQS 272 + +N+ A + ++ Q I + +Q + Y+ + ++ ++ E+ Sbjct: 90 LELAKDENNRLAKQFEQERNNAQREIDSLKQQVLKYYKAHQKVQQQYEEAEK-------- 141 Query: 273 QKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQANERWKKDTEQLRRNG 332 Q ++ K++ + K LT + + ++ ++ + D +++ E+ D Q + G Sbjct: 142 QIVAIKIKAN-----KEVKRLTDERNATLAEYTLVMSERDQVHKEMEKLSDDLAQALKKG 196 >UniRef50_A8BUP1 Coiled-coil protein n=2 Tax=Giardia intestinalis RepID=A8BUP1_GIALA Length = 1080 Score = 41.3 bits (95), Expect = 0.053, Method: Composition-based stats. Identities = 35/204 (17%), Positives = 74/204 (36%), Gaps = 9/204 (4%) Query: 137 NNKYLIVNSEDFNYRFSQLESALNTQKNSIPALEKEVKALDKQMVAAQKAADAYWGKDAN 196 +N+ IV N R QLES + + I L++E+ ++ K+ A + + A Sbjct: 739 DNQQRIVTVAALNDRVQQLESDKSELRQQIAELKEELSSVRKENAAITTEKEHLINQVAI 798 Query: 197 GKQMTREDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQK 256 K+ Q++ + +N Y K A + ++ + + ++ + Sbjct: 799 AKEKISYHELLYTDQKKMLEDTENQRIDLHNTYLKTQSTIAELKLNYETSKLRQDSLEAE 858 Query: 257 RDFDINE------QRRQTFLQSQKLSRKLQDDWVTLEKGQYP---LTMKVSEINSKKVAI 307 I + +++ L D L K Q L+ K + Sbjct: 859 NAKLIKSSEVLSGELDYLRKENENLRLDKGTDGDALRKLQQEFDHLSAKYTNQTQDVEDK 918 Query: 308 LMKIDDINQANERWKKDTEQLRRN 331 ++I ++ N+R K E+++ N Sbjct: 919 QLRIQELIFQNDRMMKFIEEVKDN 942 >UniRef50_C0CPE0 Putative uncharacterized protein n=1 Tax=Blautia hydrogenotrophica DSM 10507 RepID=C0CPE0_9FIRM Length = 1199 Score = 40.9 bits (94), Expect = 0.058, Method: Composition-based stats. 
Identities = 29/167 (17%), Positives = 59/167 (35%), Gaps = 15/167 (8%) Query: 151 RFSQLESALNTQKNSIPALEKEVKALDKQMVAAQKAADAYWGKDANGKQMTREDAFKKIH 210 + + AL+ + + +K+++ Q+ AAQ+ DA + G+ E K + Sbjct: 483 ELEKQKPALDAAEAQLADGKKQLEDAQAQLDAAQEKIDAGKKELEQGEAQIEEAVQKLLS 542 Query: 211 QQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDINEQRRQTFL 270 Q+ Q+ + + ++ +E + + + + D NEQ+ Sbjct: 543 TQQTLKASQSQISDSERQ---------LEDGQREIDENEQKLKEAQEEIDENEQKLIEAE 593 Query: 271 QSQKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQA 317 Q L+D L G+ E + K K+ D Q Sbjct: 594 QD------LKDGESELADGEKEYEDGKKEADEKIADAKRKLKDAEQE 634 >UniRef50_UPI00015B58FD PREDICTED: similar to rho/rac-interacting citron kinase n=1 Tax=Nasonia vitripennis RepID=UPI00015B58FD Length = 1545 Score = 40.5 bits (93), Expect = 0.085, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 72/188 (38%), Gaps = 26/188 (13%) Query: 143 VNSEDFNYRFSQLESALNTQKNS-IPALEKEVKALDKQMVAAQKAADAYWGKDANGKQMT 201 ++ + + R + E+ +K + ++ + + ++ K+ G + + Sbjct: 472 LSDANLDKRIATREAKTAEEKVKSLQEEKQRLAERLNTKIREEEEKSKKVAKELEGVKNS 531 Query: 202 REDAFKKIHQQRDEFNKQNDSEAFAVKYDKEVYQPAIAACHKQSEECYEVPIQQKRDFDI 261 D+ K + + + + + A K + + ++++ D Sbjct: 532 LADSTKDASRNKLQADSAQRALTQANK-------------QIEELQNSSAALRRELD--- 575 Query: 262 NEQRRQTFLQSQKLSRKLQDDWVTLEKGQYPLTMKVSEINSKKVAILMKIDDINQANERW 321 ++K R QD +L+ + L++KVS++N +K + K++ I Q + Sbjct: 576 ---------STRKQLRGSQDRMDSLQTEKERLSLKVSKLNEEKNELESKLEKIQQEANSY 626 Query: 322 KKDTEQLR 329 + + E L+ Sbjct: 627 QVNIELLK 634 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.124 0.319 Lambda K H 0.267 0.0371 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 870,684,637 Number of Sequences: 3077464 Number of extensions: 23973938 Number of successful extensions: 92383 Number of sequences better than 1.0e-01: 175 Number of HSP's better than 0.1 without gapping: 
8
Number of HSP's successfully gapped in prelim test: 357
Number of HSP's that attempted gapping in prelim test: 91621
Number of HSP's gapped (non-prelim): 1136
length of query: 335
length of database: 1,040,396,356
effective HSP length: 129
effective length of query: 206
effective length of database: 643,403,500
effective search space: 132541121000
effective search space used: 132541121000
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 93 (40.6 bits)
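
Note on the effective lengths reported above: the figures are consistent with the usual length-adjustment relation, in which the effective HSP length is subtracted once from the query and once per database sequence, and the search space is the product of the two effective lengths. The sketch below only re-derives the reported numbers from the values printed in this report. The function name and variable names are illustrative and are not part of BLAST, and edge cases (such as clamping very short sequences) are not modelled.

```python
# Values copied from the statistics footer of this report.
QUERY_LENGTH = 335
DB_LETTERS = 1_040_396_356
DB_SEQUENCES = 3_077_464
EFFECTIVE_HSP_LENGTH = 129


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Hypothetical helper: assumes the plain length-adjustment relation,
    with no special handling for sequences shorter than hsp_len.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print("effective length of query:", eff_query)
print("effective length of database:", f"{eff_db:,}")
print("effective search space:", space)
```

The three printed values can be compared directly with the "effective length of query", "effective length of database", and "effective search space" entries in the footer.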