BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (101 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                  Score     E
Sequences producing significant alignments:                       (bits)  Value

UniRef50_P76172 Uncharacterized protein ynfD n=110 Tax=Enterobac...   205   4e-52
UniRef50_D2TBB0 Uncharacterized protein ynfD n=5 Tax=Enterobacte...    91   9e-18
UniRef50_Q7N4T6 Similar to unknown protein YnfD of Escherichia c...    91   1e-17
UniRef50_C7BK36 Putative uncharacterized protein n=1 Tax=Photorh...    89   5e-17
UniRef50_C5BA41 Putative uncharacterized protein n=2 Tax=Edwards...    86   3e-16
UniRef50_B6XCY0 Putative uncharacterized protein n=1 Tax=Provide...    72   7e-12
UniRef50_Q48QF9 Putative uncharacterized protein n=3 Tax=Pseudom...    54   2e-06
UniRef50_C9YE47 Putative uncharacterized protein n=1 Tax=Curviba...    48   1e-04
UniRef50_A2SLW8 Putative uncharacterized protein n=1 Tax=Methyli...    47   2e-04
UniRef50_C1DG54 Putative uncharacterized protein n=8 Tax=Pseudom...
46 4e-04 UniRef50_Q48QG2 Putative uncharacterized protein n=33 Tax=Pseudo... 45 7e-04 UniRef50_Q1QVN3 Putative uncharacterized protein n=1 Tax=Chromoh... 43 0.004 >UniRef50_P76172 Uncharacterized protein ynfD n=110 Tax=Enterobacteriaceae RepID=YNFD_ECOLI Length = 101 Score = 205 bits (521), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP Sbjct: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 Query: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ 101 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ Sbjct: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ 101 >UniRef50_D2TBB0 Uncharacterized protein ynfD n=5 Tax=Enterobacteriaceae RepID=D2TBB0_ERWPY Length = 98 Score = 91.3 bits (225), Expect = 9e-18, Method: Compositional matrix adjust. Identities = 43/78 (55%), Positives = 56/78 (71%), Gaps = 3/78 (3%) Query: 20 LAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTR 79 LAA SCE +++DI+Q+II+NGVP S F+L IVPNDQVDQ QVVGHC +DTHKI+Y Sbjct: 18 LAATASCESVKADINQKIISNGVPASGFSLDIVPNDQVDQAGGQVVGHCESDTHKIVYKH 77 Query: 80 TTSG---NVSAPAQSSQD 94 + + SA +S+D Sbjct: 78 VSGAAENDASASTGTSRD 95 >UniRef50_Q7N4T6 Similar to unknown protein YnfD of Escherichia coli n=19 Tax=Enterobacteriaceae RepID=Q7N4T6_PHOLL Length = 87 Score = 90.5 bits (223), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 42/82 (51%), Positives = 55/82 (67%), Gaps = 1/82 (1%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK + +ALL L P LAA SCE ++ I+Q+IINNGVPES FTL IV ND ++Q Sbjct: 3 MKKALLFSALLFTL-PPFALAAQASCESVKEQIAQKIINNGVPESGFTLEIVANDHIEQA 61 Query: 61 DSQVVGHCANDTHKILYTRTTS 82 ++VG+C N+T KI+Y R S Sbjct: 62 GGKIVGYCENNTKKIMYIRQGS 83 >UniRef50_C7BK36 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. 
asymbiotica ATCC 43949 RepID=C7BK36_PHOAA Length = 100 Score = 89.0 bits (219), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 42/84 (50%), Positives = 58/84 (69%), Gaps = 3/84 (3%) Query: 1 MKLSTCCAALLLALASPAVLAAP--GSCERIQSDISQRIINNGVPESSFTLSIVPNDQVD 58 M+ + +AL+ A+ SP +A SCE ++ I+Q+II+NGVPES F L IVPNDQV+ Sbjct: 11 MRKALLSSALIFAV-SPFAFSASVQTSCESVKEQIAQKIIHNGVPESGFKLEIVPNDQVE 69 Query: 59 QPDSQVVGHCANDTHKILYTRTTS 82 Q ++VGHC N+T KI+YTR S Sbjct: 70 QASGKIVGHCENNTKKIVYTRQVS 93 >UniRef50_C5BA41 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BA41_EDWI9 Length = 90 Score = 86.3 bits (212), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 39/61 (63%), Positives = 45/61 (73%) Query: 20 LAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTR 79 AA SCE I+ +ISQ+I+ NGVP +FTL IVPNDQV Q QVVGHC NDT KI+Y R Sbjct: 18 YAAAASCESIRDEISQKIVANGVPSDAFTLEIVPNDQVQQAGGQVVGHCGNDTQKIIYIR 77 Query: 80 T 80 T Sbjct: 78 T 78 >UniRef50_B6XCY0 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XCY0_9ENTR Length = 77 Score = 71.6 bits (174), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 31/71 (43%), Positives = 49/71 (69%), Gaps = 2/71 (2%) Query: 9 ALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHC 68 A+L+ L P + A SCE + +I+Q+IINNGVP SFT+++V +++ VVG+C Sbjct: 8 AILMTLFIPVI--AGASCESVVEEITQKIINNGVPSDSFTITVVSSEEAASQQGTVVGNC 65 Query: 69 ANDTHKILYTR 79 +N+T KI+YT+ Sbjct: 66 SNETQKIIYTK 76 >UniRef50_Q48QF9 Putative uncharacterized protein n=3 Tax=Pseudomonas syringae group RepID=Q48QF9_PSE14 Length = 77 Score = 53.5 bits (127), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 29/69 (42%), Positives = 42/69 (60%), Gaps = 3/69 (4%) Query: 9 ALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHC 68 A+ L + + LAAP SCE ++ +I +I N V +S+TL IV N++ P S +VG C Sbjct: 7 AVTCTLLATSALAAPKSCEELKDEIEAKIQANNV--TSYTLEIVSNEEASDP-SMIVGSC 63 Query: 69 ANDTHKILY 77 N T KI+Y Sbjct: 64 DNGTKKIIY 72 >UniRef50_C9YE47 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YE47_9BURK Length = 76 Score = 47.8 bits (112), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 29/79 (36%), Positives = 42/79 (53%), Gaps = 4/79 (5%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK S C + L L L S AA CE +++++ +I ++G +FTL +V D D Sbjct: 1 MKKSVCFSVLALGLLSANAWAAK-DCEVLKTELGAKIESHGAK--NFTLDVVDRD-ADSG 56 Query: 61 DSQVVGHCANDTHKILYTR 79 +VVG C KI+YTR Sbjct: 57 KKRVVGTCEAGKKKIVYTR 75 >UniRef50_A2SLW8 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SLW8_METPP Length = 122 Score = 47.0 bits (110), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 27/87 (31%), Positives = 44/87 (50%), Gaps = 2/87 (2%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 + S C A +L V A SCE+++ I ++ +G + +T++ V D Sbjct: 6 LSFSLCIGAAVLLCGVTGVAGAAQSCEQLRERIDAKVRASGA--THYTVTTVDADAAVGA 63 Query: 61 DSQVVGHCANDTHKILYTRTTSGNVSA 87 ++VVG C T KI+YTR G+V+A Sbjct: 64 SAKVVGSCELGTKKIVYTRGEEGSVAA 90 >UniRef50_C1DG54 Putative uncharacterized protein n=8 Tax=Pseudomonadaceae RepID=C1DG54_AZOVD Length = 99 Score = 45.8 bits (107), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 30/81 (37%), Positives = 42/81 (51%), Gaps = 3/81 (3%) Query: 19 VLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYT 78 +LAA CE ++ +I +I GV +S+TL IVPN +V + VVG C T KI+Y Sbjct: 17 LLAAIKPCEELKEEIEVKIQAAGV--TSYTLEIVPNAEVTD-RNLVVGSCDGGTRKIIYQ 73 Query: 79 RTTSGNVSAPAQSSQDGAPAE 99 R G+ + AP E Sbjct: 74 RNDGGSRRGEPSPTPAPAPEE 94 >UniRef50_Q48QG2 Putative uncharacterized protein n=33 Tax=Pseudomonas RepID=Q48QG2_PSE14 Length = 98 Score = 45.1 bits (105), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 27/69 (39%), Positives = 41/69 (59%), Gaps = 5/69 (7%) Query: 11 LLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCAN 70 LL++A A LAA CE ++S+I R+ GV +S+TL +V ++ D QVVG C Sbjct: 34 LLSIAGTA-LAAGKPCEELKSEIDARLQAKGV--TSYTLEVV--EKGSASDKQVVGTCEG 88 Query: 71 DTHKILYTR 79 T +++Y R Sbjct: 89 GTKEVVYQR 97 >UniRef50_Q1QVN3 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QVN3_CHRSD Length = 140 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 34/114 (29%), Positives = 52/114 (45%), Gaps = 30/114 (26%) Query: 8 AALLLALASPAV--------------LAAPG--SCERIQSDISQRIINNGVPESSFTLSI 51 A+L+ LA PA+ L PG C+ +Q +I +I NGV E F L + Sbjct: 9 GAVLMVLALPAIAVAQTTSHRDDDGALGEPGVMDCDVLQDEIEAKIRANGVDE--FQLDL 66 Query: 52 VPNDQVDQ---------PDSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGA 96 + + +VD+ +VVG C + K++Y R G+ A SS+D A Sbjct: 67 IASARVDEDGVREGDPLAGGEVVGSCDGGSRKVIYRRGAQGS---GAMSSEDAA 117 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76172 Uncharacterized protein ynfD n=110 Tax=Enterobac... 138 4e-32 UniRef50_Q7N4T6 Similar to unknown protein YnfD of Escherichia c... 120 1e-26 UniRef50_C7BK36 Putative uncharacterized protein n=1 Tax=Photorh... 108 5e-23 UniRef50_D2TBB0 Uncharacterized protein ynfD n=5 Tax=Enterobacte... 
101 6e-21 UniRef50_C1DG54 Putative uncharacterized protein n=8 Tax=Pseudom... 101 9e-21 UniRef50_C5BA41 Putative uncharacterized protein n=2 Tax=Edwards... 100 1e-20 UniRef50_A2SLW8 Putative uncharacterized protein n=1 Tax=Methyli... 98 1e-19 UniRef50_C9YE47 Putative uncharacterized protein n=1 Tax=Curviba... 96 3e-19 UniRef50_Q48QF9 Putative uncharacterized protein n=3 Tax=Pseudom... 88 1e-16 UniRef50_B6XCY0 Putative uncharacterized protein n=1 Tax=Provide... 87 2e-16 UniRef50_Q48QG2 Putative uncharacterized protein n=33 Tax=Pseudo... 85 6e-16 Sequences not found previously or not previously below threshold: UniRef50_Q1QVN3 Putative uncharacterized protein n=1 Tax=Chromoh... 61 1e-08 UniRef50_C6NZB5 Putative uncharacterized protein n=1 Tax=Siderox... 59 4e-08 UniRef50_C5CVT9 Putative uncharacterized protein n=1 Tax=Variovo... 55 6e-07 >UniRef50_P76172 Uncharacterized protein ynfD n=110 Tax=Enterobacteriaceae RepID=YNFD_ECOLI Length = 101 Score = 138 bits (348), Expect = 4e-32, Method: Composition-based stats. Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP Sbjct: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 Query: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ 101 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ Sbjct: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ 101 >UniRef50_Q7N4T6 Similar to unknown protein YnfD of Escherichia coli n=19 Tax=Enterobacteriaceae RepID=Q7N4T6_PHOLL Length = 87 Score = 120 bits (301), Expect = 1e-26, Method: Composition-based stats. 
Identities = 42/84 (50%), Positives = 56/84 (66%), Gaps = 1/84 (1%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK + +ALL L P LAA SCE ++ I+Q+IINNGVPES FTL IV ND ++Q Sbjct: 3 MKKALLFSALLFTL-PPFALAAQASCESVKEQIAQKIINNGVPESGFTLEIVANDHIEQA 61 Query: 61 DSQVVGHCANDTHKILYTRTTSGN 84 ++VG+C N+T KI+Y R S + Sbjct: 62 GGKIVGYCENNTKKIMYIRQGSQS 85 >UniRef50_C7BK36 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BK36_PHOAA Length = 100 Score = 108 bits (269), Expect = 5e-23, Method: Composition-based stats. Identities = 42/86 (48%), Positives = 59/86 (68%), Gaps = 3/86 (3%) Query: 1 MKLSTCCAALLLALASPAVLAAP--GSCERIQSDISQRIINNGVPESSFTLSIVPNDQVD 58 M+ + +AL+ A+ SP +A SCE ++ I+Q+II+NGVPES F L IVPNDQV+ Sbjct: 11 MRKALLSSALIFAV-SPFAFSASVQTSCESVKEQIAQKIIHNGVPESGFKLEIVPNDQVE 69 Query: 59 QPDSQVVGHCANDTHKILYTRTTSGN 84 Q ++VGHC N+T KI+YTR S + Sbjct: 70 QASGKIVGHCENNTKKIVYTRQVSQS 95 >UniRef50_D2TBB0 Uncharacterized protein ynfD n=5 Tax=Enterobacteriaceae RepID=D2TBB0_ERWPY Length = 98 Score = 101 bits (252), Expect = 6e-21, Method: Composition-based stats. Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 3/93 (3%) Query: 7 CAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVG 66 L AL LAA SCE +++DI+Q+II+NGVP S F+L IVPNDQVDQ QVVG Sbjct: 5 FKWALSALLVVTPLAATASCESVKADINQKIISNGVPASGFSLDIVPNDQVDQAGGQVVG 64 Query: 67 HCANDTHKILYTRTTSG---NVSAPAQSSQDGA 96 HC +DTHKI+Y + + SA +S+D + Sbjct: 65 HCESDTHKIVYKHVSGAAENDASASTGTSRDSS 97 >UniRef50_C1DG54 Putative uncharacterized protein n=8 Tax=Pseudomonadaceae RepID=C1DG54_AZOVD Length = 99 Score = 101 bits (250), Expect = 9e-21, Method: Composition-based stats. 
Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 3/83 (3%) Query: 17 PAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKIL 76 +LAA CE ++ +I +I GV +S+TL IVPN +V + VVG C T KI+ Sbjct: 15 TPLLAAIKPCEELKEEIEVKIQAAGV--TSYTLEIVPNAEVTD-RNLVVGSCDGGTRKII 71 Query: 77 YTRTTSGNVSAPAQSSQDGAPAE 99 Y R G+ + AP E Sbjct: 72 YQRNDGGSRRGEPSPTPAPAPEE 94 >UniRef50_C5BA41 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BA41_EDWI9 Length = 90 Score = 100 bits (249), Expect = 1e-20, Method: Composition-based stats. Identities = 41/72 (56%), Positives = 47/72 (65%) Query: 20 LAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTR 79 AA SCE I+ +ISQ+I+ NGVP +FTL IVPNDQV Q QVVGHC NDT KI+Y R Sbjct: 18 YAAAASCESIRDEISQKIVANGVPSDAFTLEIVPNDQVQQAGGQVVGHCGNDTQKIIYIR 77 Query: 80 TTSGNVSAPAQS 91 T A A Sbjct: 78 TDGAEPGAYANP 89 >UniRef50_A2SLW8 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SLW8_METPP Length = 122 Score = 97.6 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 2/99 (2%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 + S C A +L V A SCE+++ I ++ +G + +T++ V D Sbjct: 6 LSFSLCIGAAVLLCGVTGVAGAAQSCEQLRERIDAKVRASGA--THYTVTTVDADAAVGA 63 Query: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAE 99 ++VVG C T KI+YTR G+V+A + + P + Sbjct: 64 SAKVVGSCELGTKKIVYTRGEEGSVAAASAPTVTARPRD 102 >UniRef50_C9YE47 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YE47_9BURK Length = 76 Score = 96.0 bits (237), Expect = 3e-19, Method: Composition-based stats. 
Identities = 29/80 (36%), Positives = 42/80 (52%), Gaps = 4/80 (5%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK S C + L L L S AA CE +++++ +I ++G +FTL +V D D Sbjct: 1 MKKSVCFSVLALGLLSANAWAA-KDCEVLKTELGAKIESHGAK--NFTLDVVDRD-ADSG 56 Query: 61 DSQVVGHCANDTHKILYTRT 80 +VVG C KI+YTR Sbjct: 57 KKRVVGTCEAGKKKIVYTRN 76 >UniRef50_Q48QF9 Putative uncharacterized protein n=3 Tax=Pseudomonas syringae group RepID=Q48QF9_PSE14 Length = 77 Score = 87.6 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 28/73 (38%), Positives = 41/73 (56%), Gaps = 3/73 (4%) Query: 9 ALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHC 68 A+ L + + LAAP SCE ++ +I +I N V +S+TL IV N++ P +VG C Sbjct: 7 AVTCTLLATSALAAPKSCEELKDEIEAKIQANNV--TSYTLEIVSNEEASDPS-MIVGSC 63 Query: 69 ANDTHKILYTRTT 81 N T KI+Y Sbjct: 64 DNGTKKIIYQLNG 76 >UniRef50_B6XCY0 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XCY0_9ENTR Length = 77 Score = 86.8 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 31/73 (42%), Positives = 49/73 (67%), Gaps = 2/73 (2%) Query: 7 CAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVG 66 A+L+ L P + A SCE + +I+Q+IINNGVP SFT+++V +++ VVG Sbjct: 6 ITAILMTLFIPVI--AGASCESVVEEITQKIINNGVPSDSFTITVVSSEEAASQQGTVVG 63 Query: 67 HCANDTHKILYTR 79 +C+N+T KI+YT+ Sbjct: 64 NCSNETQKIIYTK 76 >UniRef50_Q48QG2 Putative uncharacterized protein n=33 Tax=Pseudomonas RepID=Q48QG2_PSE14 Length = 98 Score = 85.3 bits (209), Expect = 6e-16, Method: Composition-based stats. 
Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 5/78 (6%) Query: 3 LSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDS 62 A LL++A LAA CE ++S+I R+ GV +S+TL +V ++ D Sbjct: 26 KKFLLAVGLLSIAGT-ALAAGKPCEELKSEIDARLQAKGV--TSYTLEVV--EKGSASDK 80 Query: 63 QVVGHCANDTHKILYTRT 80 QVVG C T +++Y R Sbjct: 81 QVVGTCEGGTKEVVYQRG 98 >UniRef50_Q1QVN3 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QVN3_CHRSD Length = 140 Score = 60.6 bits (145), Expect = 1e-08, Method: Composition-based stats. Identities = 30/123 (24%), Positives = 52/123 (42%), Gaps = 30/123 (24%) Query: 6 CCAALLLALASPAVLAAPG----------------SCERIQSDISQRIINNGVPESSFTL 49 A+L+ LA PA+ A C+ +Q +I +I NGV E F L Sbjct: 7 MYGAVLMVLALPAIAVAQTTSHRDDDGALGEPGVMDCDVLQDEIEAKIRANGVDE--FQL 64 Query: 50 SIVPNDQVDQ---------PDSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDG---AP 97 ++ + +VD+ +VVG C + K++Y R G+ + ++ + A Sbjct: 65 DLIASARVDEDGVREGDPLAGGEVVGSCDGGSRKVIYRRGAQGSGAMSSEDAALPPREAS 124 Query: 98 AEP 100 +EP Sbjct: 125 SEP 127 >UniRef50_C6NZB5 Putative uncharacterized protein n=1 Tax=Sideroxydans lithotrophicus ES-1 RepID=C6NZB5_9PROT Length = 89 Score = 59.1 bits (141), Expect = 4e-08, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 40/94 (42%), Gaps = 22/94 (23%) Query: 3 LSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVD---- 58 + A L +L+ P A SC+ +++ I ++ GV S+TL IVP Q Sbjct: 2 RNLLLATCLFSLSVP----AMASCDDLKAQIDAKLQAKGVK--SYTLDIVPVAQAAAAPV 55 Query: 59 ------------QPDSQVVGHCANDTHKILYTRT 80 + +VVG C DT +I+Y R Sbjct: 56 AASGAAAATPAKETAGKVVGTCEGDTKQIIYKRN 89 >UniRef50_C5CVT9 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CVT9_VARPS Length = 110 Score = 55.2 bits (131), Expect = 6e-07, Method: Composition-based stats. 
Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 4/88 (4%) Query: 12 LALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCAND 71 +ALA +CE +++ I +I GV +++ D + QVVG C Sbjct: 8 MALALAGTAHGAENCEALRTQIEAKIAAAGVTR----FAVITVDANAEAPGQVVGSCDLG 63 Query: 72 THKILYTRTTSGNVSAPAQSSQDGAPAE 99 + KI+Y R + A G E Sbjct: 64 SKKIVYQREDAPAAGAAPARPSAGPAGE 91 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P76172 Uncharacterized protein ynfD n=110 Tax=Enterobac... 124 5e-28 UniRef50_Q7N4T6 Similar to unknown protein YnfD of Escherichia c... 117 1e-25 UniRef50_C7BK36 Putative uncharacterized protein n=1 Tax=Photorh... 107 1e-22 UniRef50_C5BA41 Putative uncharacterized protein n=2 Tax=Edwards... 102 3e-21 UniRef50_D2TBB0 Uncharacterized protein ynfD n=5 Tax=Enterobacte... 101 5e-21 UniRef50_A2SLW8 Putative uncharacterized protein n=1 Tax=Methyli... 99 4e-20 UniRef50_C1DG54 Putative uncharacterized protein n=8 Tax=Pseudom... 97 1e-19 UniRef50_C9YE47 Putative uncharacterized protein n=1 Tax=Curviba... 90 2e-17 UniRef50_Q48QF9 Putative uncharacterized protein n=3 Tax=Pseudom... 88 1e-16 UniRef50_B6XCY0 Putative uncharacterized protein n=1 Tax=Provide... 87 2e-16 UniRef50_Q48QG2 Putative uncharacterized protein n=33 Tax=Pseudo... 87 2e-16 UniRef50_Q1QVN3 Putative uncharacterized protein n=1 Tax=Chromoh... 83 3e-15 UniRef50_C5CVT9 Putative uncharacterized protein n=1 Tax=Variovo... 78 6e-14 UniRef50_C6NZB5 Putative uncharacterized protein n=1 Tax=Siderox... 72 5e-12 Sequences not found previously or not previously below threshold: CONVERGED! >UniRef50_P76172 Uncharacterized protein ynfD n=110 Tax=Enterobacteriaceae RepID=YNFD_ECOLI Length = 101 Score = 124 bits (312), Expect = 5e-28, Method: Composition-based stats. 
Identities = 101/101 (100%), Positives = 101/101 (100%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP Sbjct: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 Query: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ 101 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ Sbjct: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ 101 >UniRef50_Q7N4T6 Similar to unknown protein YnfD of Escherichia coli n=19 Tax=Enterobacteriaceae RepID=Q7N4T6_PHOLL Length = 87 Score = 117 bits (293), Expect = 1e-25, Method: Composition-based stats. Identities = 42/84 (50%), Positives = 56/84 (66%), Gaps = 1/84 (1%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK + +ALL L P LAA SCE ++ I+Q+IINNGVPES FTL IV ND ++Q Sbjct: 3 MKKALLFSALLFTL-PPFALAAQASCESVKEQIAQKIINNGVPESGFTLEIVANDHIEQA 61 Query: 61 DSQVVGHCANDTHKILYTRTTSGN 84 ++VG+C N+T KI+Y R S + Sbjct: 62 GGKIVGYCENNTKKIMYIRQGSQS 85 >UniRef50_C7BK36 Putative uncharacterized protein n=1 Tax=Photorhabdus asymbiotica subsp. asymbiotica ATCC 43949 RepID=C7BK36_PHOAA Length = 100 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 39/86 (45%), Positives = 57/86 (66%), Gaps = 1/86 (1%) Query: 1 MKLSTCCAALLLALAS-PAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQ 59 M+ + +AL+ A++ + SCE ++ I+Q+II+NGVPES F L IVPNDQV+Q Sbjct: 11 MRKALLSSALIFAVSPFAFSASVQTSCESVKEQIAQKIIHNGVPESGFKLEIVPNDQVEQ 70 Query: 60 PDSQVVGHCANDTHKILYTRTTSGNV 85 ++VGHC N+T KI+YTR S + Sbjct: 71 ASGKIVGHCENNTKKIVYTRQVSQSP 96 >UniRef50_C5BA41 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=C5BA41_EDWI9 Length = 90 Score = 102 bits (255), Expect = 3e-21, Method: Composition-based stats. 
Identities = 41/72 (56%), Positives = 47/72 (65%) Query: 20 LAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTR 79 AA SCE I+ +ISQ+I+ NGVP +FTL IVPNDQV Q QVVGHC NDT KI+Y R Sbjct: 18 YAAAASCESIRDEISQKIVANGVPSDAFTLEIVPNDQVQQAGGQVVGHCGNDTQKIIYIR 77 Query: 80 TTSGNVSAPAQS 91 T A A Sbjct: 78 TDGAEPGAYANP 89 >UniRef50_D2TBB0 Uncharacterized protein ynfD n=5 Tax=Enterobacteriaceae RepID=D2TBB0_ERWPY Length = 98 Score = 101 bits (252), Expect = 5e-21, Method: Composition-based stats. Identities = 46/93 (49%), Positives = 60/93 (64%), Gaps = 3/93 (3%) Query: 7 CAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVG 66 L AL LAA SCE +++DI+Q+II+NGVP S F+L IVPNDQVDQ QVVG Sbjct: 5 FKWALSALLVVTPLAATASCESVKADINQKIISNGVPASGFSLDIVPNDQVDQAGGQVVG 64 Query: 67 HCANDTHKILYTRTTSG---NVSAPAQSSQDGA 96 HC +DTHKI+Y + + SA +S+D + Sbjct: 65 HCESDTHKIVYKHVSGAAENDASASTGTSRDSS 97 >UniRef50_A2SLW8 Putative uncharacterized protein n=1 Tax=Methylibium petroleiphilum PM1 RepID=A2SLW8_METPP Length = 122 Score = 98.8 bits (244), Expect = 4e-20, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 48/99 (48%), Gaps = 2/99 (2%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 + S C A +L V A SCE+++ I ++ +G + +T++ V D Sbjct: 6 LSFSLCIGAAVLLCGVTGVAGAAQSCEQLRERIDAKVRASGA--THYTVTTVDADAAVGA 63 Query: 61 DSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAE 99 ++VVG C T KI+YTR G+V+A + + P + Sbjct: 64 SAKVVGSCELGTKKIVYTRGEEGSVAAASAPTVTARPRD 102 >UniRef50_C1DG54 Putative uncharacterized protein n=8 Tax=Pseudomonadaceae RepID=C1DG54_AZOVD Length = 99 Score = 97.2 bits (240), Expect = 1e-19, Method: Composition-based stats. 
Identities = 30/83 (36%), Positives = 42/83 (50%), Gaps = 3/83 (3%) Query: 17 PAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKIL 76 +LAA CE ++ +I +I GV +S+TL IVPN +V + VVG C T KI+ Sbjct: 15 TPLLAAIKPCEELKEEIEVKIQAAGV--TSYTLEIVPNAEVTD-RNLVVGSCDGGTRKII 71 Query: 77 YTRTTSGNVSAPAQSSQDGAPAE 99 Y R G+ + AP E Sbjct: 72 YQRNDGGSRRGEPSPTPAPAPEE 94 >UniRef50_C9YE47 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YE47_9BURK Length = 76 Score = 90.3 bits (222), Expect = 2e-17, Method: Composition-based stats. Identities = 29/80 (36%), Positives = 42/80 (52%), Gaps = 4/80 (5%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK S C + L L L S AA CE +++++ +I ++G +FTL +V D D Sbjct: 1 MKKSVCFSVLALGLLSANAWAA-KDCEVLKTELGAKIESHGAK--NFTLDVVDRD-ADSG 56 Query: 61 DSQVVGHCANDTHKILYTRT 80 +VVG C KI+YTR Sbjct: 57 KKRVVGTCEAGKKKIVYTRN 76 >UniRef50_Q48QF9 Putative uncharacterized protein n=3 Tax=Pseudomonas syringae group RepID=Q48QF9_PSE14 Length = 77 Score = 87.6 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 30/81 (37%), Positives = 43/81 (53%), Gaps = 5/81 (6%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK A+ L + + LAAP SCE ++ +I +I N V +S+TL IV N++ P Sbjct: 1 MKRFIL--AVTCTLLATSALAAPKSCEELKDEIEAKIQANNV--TSYTLEIVSNEEASDP 56 Query: 61 DSQVVGHCANDTHKILYTRTT 81 +VG C N T KI+Y Sbjct: 57 S-MIVGSCDNGTKKIIYQLNG 76 >UniRef50_B6XCY0 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XCY0_9ENTR Length = 77 Score = 86.8 bits (213), Expect = 2e-16, Method: Composition-based stats. 
Identities = 33/79 (41%), Positives = 51/79 (64%), Gaps = 3/79 (3%) Query: 1 MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQP 60 MK A+L+ L P + A SCE + +I+Q+IINNGVP SFT+++V +++ Sbjct: 1 MKRKI-ITAILMTLFIPVI--AGASCESVVEEITQKIINNGVPSDSFTITVVSSEEAASQ 57 Query: 61 DSQVVGHCANDTHKILYTR 79 VVG+C+N+T KI+YT+ Sbjct: 58 QGTVVGNCSNETQKIIYTK 76 >UniRef50_Q48QG2 Putative uncharacterized protein n=33 Tax=Pseudomonas RepID=Q48QG2_PSE14 Length = 98 Score = 86.8 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 27/78 (34%), Positives = 41/78 (52%), Gaps = 5/78 (6%) Query: 3 LSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDS 62 A LL++A LAA CE ++S+I R+ GV +S+TL +V ++ D Sbjct: 26 KKFLLAVGLLSIAGT-ALAAGKPCEELKSEIDARLQAKGV--TSYTLEVV--EKGSASDK 80 Query: 63 QVVGHCANDTHKILYTRT 80 QVVG C T +++Y R Sbjct: 81 QVVGTCEGGTKEVVYQRG 98 >UniRef50_Q1QVN3 Putative uncharacterized protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1QVN3_CHRSD Length = 140 Score = 83.0 bits (203), Expect = 3e-15, Method: Composition-based stats. Identities = 29/123 (23%), Positives = 51/123 (41%), Gaps = 30/123 (24%) Query: 6 CCAALLLALASPAVLAAPG----------------SCERIQSDISQRIINNGVPESSFTL 49 A+L+ LA PA+ A C+ +Q +I +I NGV E F L Sbjct: 7 MYGAVLMVLALPAIAVAQTTSHRDDDGALGEPGVMDCDVLQDEIEAKIRANGVDE--FQL 64 Query: 50 SIVPNDQVDQ---------PDSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGA---P 97 ++ + +VD+ +VVG C + K++Y R G+ + ++ + Sbjct: 65 DLIASARVDEDGVREGDPLAGGEVVGSCDGGSRKVIYRRGAQGSGAMSSEDAALPPREAS 124 Query: 98 AEP 100 +EP Sbjct: 125 SEP 127 >UniRef50_C5CVT9 Putative uncharacterized protein n=1 Tax=Variovorax paradoxus S110 RepID=C5CVT9_VARPS Length = 110 Score = 78.3 bits (191), Expect = 6e-14, Method: Composition-based stats. 
Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 4/88 (4%) Query: 12 LALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCAND 71 +ALA +CE +++ I +I GV +++ D + QVVG C Sbjct: 8 MALALAGTAHGAENCEALRTQIEAKIAAAGVTR----FAVITVDANAEAPGQVVGSCDLG 63 Query: 72 THKILYTRTTSGNVSAPAQSSQDGAPAE 99 + KI+Y R + A G E Sbjct: 64 SKKIVYQREDAPAAGAAPARPSAGPAGE 91 >UniRef50_C6NZB5 Putative uncharacterized protein n=1 Tax=Sideroxydans lithotrophicus ES-1 RepID=C6NZB5_9PROT Length = 89 Score = 72.2 bits (175), Expect = 5e-12, Method: Composition-based stats. Identities = 26/94 (27%), Positives = 40/94 (42%), Gaps = 22/94 (23%) Query: 3 LSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVD---- 58 + A L +L+ P A SC+ +++ I ++ GV S+TL IVP Q Sbjct: 2 RNLLLATCLFSLSVP----AMASCDDLKAQIDAKLQAKGVK--SYTLDIVPVAQAAAAPV 55 Query: 59 ------------QPDSQVVGHCANDTHKILYTRT 80 + +VVG C DT +I+Y R Sbjct: 56 AASGAAAATPAKETAGKVVGTCEGDTKQIIYKRN 89 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.300 0.120 0.281 Lambda K H 0.267 0.0365 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 385,452,254 Number of Sequences: 3077464 Number of extensions: 10528227 Number of successful extensions: 40070 Number of sequences better than 1.0e-01: 14 Number of HSP's better than 0.1 without gapping: 32 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 40007 Number of HSP's gapped (non-prelim): 42 length of query: 101 length of database: 1,040,396,356 effective HSP length: 70 effective length of query: 31 effective length of database: 824,973,876 effective search space: 25574190156 effective search space used: 25574190156 T: 11 A: 40 X1: 16 ( 6.9 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.2 bits) S2: 86 (37.9 bits)