BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (94 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0AD34 UPF0381 protein yfcZ n=137 Tax=Enterobacteriacea... 191 5e-48 UniRef50_A8ADP0 Putative uncharacterized protein n=2 Tax=Enterob... 145 3e-34 UniRef50_C6AKB2 DNA mismatch repair protein n=10 Tax=Gammaproteo... 98 8e-20 UniRef50_P44686 UPF0381 protein HI0400 n=31 Tax=Gammaproteobacte... 90 2e-17 UniRef50_B8F341 Possible DNA mismatch repair protein n=2 Tax=Hae... 82 5e-15 UniRef50_B8F3Q3 tRNA-dihydrouridine synthase A n=2 Tax=Haemophil... 77 1e-13 UniRef50_P44027 UPF0381 protein HI0636 n=26 Tax=Pasteurellaceae ... 76 3e-13 UniRef50_Q7VKK6 Putative uncharacterized protein n=5 Tax=Pasteur... 75 8e-13 UniRef50_D0Z807 Putative uncharacterized protein n=2 Tax=Edwards... 74 1e-12 UniRef50_P32162 UPF0381 protein yiiS n=61 Tax=Enterobacteriaceae... 72 5e-12 UniRef50_C4X908 Putative uncharacterized protein n=5 Tax=Klebsie... 60 3e-08 UniRef50_B0UTX1 Putative uncharacterized protein n=9 Tax=Pasteur... 51 1e-05 >UniRef50_P0AD34 UPF0381 protein yfcZ n=137 Tax=Enterobacteriaceae RepID=YFCZ_ECOL6 Length = 94 Score = 191 bits (486), Expect = 5e-48, Method: Compositional matrix adjust. Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK 60 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK Sbjct: 1 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK 60 Query: 61 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR Sbjct: 61 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 >UniRef50_A8ADP0 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8ADP0_CITK8 Length = 80 Score = 145 bits (367), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 70/80 (87%), Positives = 75/80 (93%) Query: 15 MDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDGVRL 74 MDVGTIMDNSDCTASYSRVF NRA+AE+TLAALTEKAR VESEPC+IT TFTEE++GVRL Sbjct: 1 MDVGTIMDNSDCTASYSRVFENRAQAEETLAALTEKARGVESEPCQITSTFTEEAEGVRL 60 Query: 75 DIDFTFACEAEMLIFQLGLR 94 DIDF FACEAE LIFQLGLR Sbjct: 61 DIDFVFACEAETLIFQLGLR 80 >UniRef50_C6AKB2 DNA mismatch repair protein n=10 Tax=Gammaproteobacteria RepID=C6AKB2_AGGAN Length = 96 Score = 98.2 bits (243), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 44/92 (47%), Positives = 62/92 (67%) Query: 3 KCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKIT 62 KC A+E VCCC+DVGT++D SDCT + +V+A A++ L LTEKA++ ES+PC IT Sbjct: 5 KCQAEEAKVCCCVDVGTVIDGSDCTIDFEQVYATEELAKEALNYLTEKAKAAESDPCHIT 64 Query: 63 PTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 + +G +L F F+C+AE +IFQL R Sbjct: 65 SDISAVENGYKLKAQFEFSCQAESMIFQLSTR 96 >UniRef50_P44686 UPF0381 protein HI0400 n=31 Tax=Gammaproteobacteria RepID=Y400_HAEIN Length = 95 Score = 90.1 bits (222), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 41/92 (44%), Positives = 61/92 (66%) Query: 3 KCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKIT 62 KC A+E+ C C+DVGTI+D SDC+ + ++ A+A L LT+KAR+ ES+PC+I Sbjct: 4 KCKAEESLTCSCVDVGTIIDGSDCSVEVHQFYSTEADANAVLERLTKKARNTESDPCEIK 63 Query: 63 PTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 +GV+L+ FTF+C+AE +IF+L R Sbjct: 64 SEIVAVENGVQLNASFTFSCQAEAMIFELANR 95 >UniRef50_B8F341 Possible DNA mismatch repair protein n=2 Tax=Haemophilus parasuis RepID=B8F341_HAEPS Length = 91 Score = 82.0 bits (201), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 37/80 (46%), Positives = 56/80 (70%) Query: 15 MDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDGVRL 74 +DVGTI+DNSDC ++++++ +AEAE LA LT KAR+ ESEPC+I+ +G +L Sbjct: 12 IDVGTIIDNSDCAVNFTQLYTTKAEAEDALAFLTSKARATESEPCEISSEIISTENGFQL 71 Query: 75 DIDFTFACEAEMLIFQLGLR 94 F F+ + E +IF+LG+R Sbjct: 72 TACFKFSYQVESMIFELGIR 91 >UniRef50_B8F3Q3 tRNA-dihydrouridine synthase A n=2 Tax=Haemophilus parasuis RepID=B8F3Q3_HAEPS Length = 97 Score = 77.4 bits (189), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 42/97 (43%), Positives = 58/97 (59%), Gaps = 3/97 (3%) Query: 1 MSKCSADETPVC---CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESE 57 MSK +D+ C C ++ DN+D T + V+ +A+AEQ LA LT KAR VESE Sbjct: 1 MSKTFSDQADCCGGICKPSSTSMFDNADSTIELALVYPTQADAEQGLATLTAKAREVESE 60 Query: 58 PCKITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 PC I+ ++ DG L F F+C+AE +IFQ+ LR Sbjct: 61 PCNISSQISQIEDGFLLQAKFLFSCQAEAVIFQMKLR 97 >UniRef50_P44027 UPF0381 protein HI0636 n=26 Tax=Pasteurellaceae RepID=Y636_HAEIN Length = 96 Score = 76.3 bits (186), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 36/90 (40%), Positives = 55/90 (61%), Gaps = 4/90 (4%) Query: 8 ETPV----CCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITP 63 E P+ C D+ ++ DN DC+ ++ + +A++ LA T+KAR VESEPC+I Sbjct: 6 EKPIECVGCNTFDMKSLFDNRDCSQVIEYIYDSEGQAQEALAFFTQKARDVESEPCEIQS 65 Query: 64 TFTEESDGVRLDIDFTFACEAEMLIFQLGL 93 T+ DG L DFTF C+AE++IFQ+ + Sbjct: 66 EITKVDDGYLLKADFTFCCQAELVIFQMRI 95 >UniRef50_Q7VKK6 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=Q7VKK6_HAEDU Length = 138 Score = 74.7 bits (182), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 35/83 (42%), Positives = 53/83 (63%) Query: 12 CCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDG 71 C DVG+I+DNS+ A + +V+A + +AE LA L +KAR E++ C+I+ G Sbjct: 54 CNTFDVGSIIDNSERDAKFEKVYATQEQAEAVLAKLVQKARDTETDSCEISSNILPVEAG 113 Query: 72 VRLDIDFTFACEAEMLIFQLGLR 94 L +F F+C+AE++IFQLG R Sbjct: 114 FLLTANFDFSCQAEVVIFQLGTR 136 >UniRef50_D0Z807 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=D0Z807_EDWTE Length = 98 Score = 74.3 bits (181), Expect = 1e-12, Method: Compositional matrix adjust. Identities = 35/84 (41%), Positives = 55/84 (65%), Gaps = 2/84 (2%) Query: 13 CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESE--PCKITPTFTEESD 70 C +D+G+++DN +CT ++ +A+R EAE+ LA LT KA+ V + PC IT +E+ Sbjct: 13 CAIDIGSVLDNDNCTTVITQAYASRDEAEKQLAQLTAKAQRVAQQEYPCIITHDISEKEG 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 L+ F F+C+AE +IF+L LR Sbjct: 73 EFVLNAHFNFSCQAETVIFELSLR 96 >UniRef50_P32162 UPF0381 protein yiiS n=61 Tax=Enterobacteriaceae RepID=YIIS_ECOLI Length = 99 Score = 72.0 bits (175), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 36/84 (42%), Positives = 51/84 (60%), Gaps = 2/84 (2%) Query: 13 CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVES--EPCKITPTFTEESD 70 C +D+GT++DN +CT+ +SR FA R EAE + L E A + S E + + Sbjct: 13 CAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKELAAATSSADEGASVAYKIKDLEG 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 V LD FTF+C+AEM+IF+L LR Sbjct: 73 QVELDAAFTFSCQAEMIIFELSLR 96 >UniRef50_C4X908 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X908_KLEPN Length = 99 Score = 59.7 bits (143), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 35/84 (41%), Positives = 51/84 (60%), Gaps = 2/84 (2%) Query: 13 CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPC--KITPTFTEESD 70 C +DVGTI+DN DC +VF +R EAE T+AA+ E+A + ++ T D Sbjct: 13 CAIDVGTIIDNEDCVYRAEKVFPSREEAESTVAAVRERAAAAAPASEPPQVDYTIVAAGD 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 V+LD+ F+C+AE +IF+L LR Sbjct: 73 AVKLDLSIAFSCQAEKIIFELSLR 96 >UniRef50_B0UTX1 Putative uncharacterized protein n=9 Tax=Pasteurellaceae RepID=B0UTX1_HAES2 Length = 98 Score = 50.8 bits (120), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 24/87 (27%), Positives = 44/87 (50%) Query: 8 ETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTE 67 E C + +++DNS + + +A +KAR +E++P I T+ Sbjct: 12 EAHNMCRIKGDSMLDNSAREVVFEAEYDTEKQALNARDYFIQKARDIENDPANIESQITD 71 Query: 68 ESDGVRLDIDFTFACEAEMLIFQLGLR 94 S+G RL + F C+AE+++FQ+ +R Sbjct: 72 ISNGFRLKMRIVFGCQAEVVLFQMAIR 98 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AD34 UPF0381 protein yfcZ n=137 Tax=Enterobacteriacea... 120 9e-27 UniRef50_C6AKB2 DNA mismatch repair protein n=10 Tax=Gammaproteo... 114 9e-25 UniRef50_Q7VKK6 Putative uncharacterized protein n=5 Tax=Pasteur... 113 1e-24 UniRef50_P44686 UPF0381 protein HI0400 n=31 Tax=Gammaproteobacte... 110 2e-23 UniRef50_B8F3Q3 tRNA-dihydrouridine synthase A n=2 Tax=Haemophil... 107 1e-22 UniRef50_B0UTX1 Putative uncharacterized protein n=9 Tax=Pasteur... 103 2e-21 UniRef50_P44027 UPF0381 protein HI0636 n=26 Tax=Pasteurellaceae ... 101 7e-21 UniRef50_B8F341 Possible DNA mismatch repair protein n=2 Tax=Hae... 101 7e-21 UniRef50_A8ADP0 Putative uncharacterized protein n=2 Tax=Enterob... 100 1e-20 UniRef50_D0Z807 Putative uncharacterized protein n=2 Tax=Edwards... 95 5e-19 UniRef50_P32162 UPF0381 protein yiiS n=61 Tax=Enterobacteriaceae... 87 1e-16 UniRef50_C4X908 Putative uncharacterized protein n=5 Tax=Klebsie... 75 1e-12 Sequences not found previously or not previously below threshold: UniRef50_C4L7R4 Putative uncharacterized protein n=1 Tax=Tolumon... 47 2e-04 UniRef50_A0KF70 Putative uncharacterized protein n=2 Tax=Aeromon... 40 0.022 >UniRef50_P0AD34 UPF0381 protein yfcZ n=137 Tax=Enterobacteriaceae RepID=YFCZ_ECOL6 Length = 94 Score = 120 bits (302), Expect = 9e-27, Method: Composition-based stats. Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK 60 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK Sbjct: 1 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK 60 Query: 61 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR Sbjct: 61 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 >UniRef50_C6AKB2 DNA mismatch repair protein n=10 Tax=Gammaproteobacteria RepID=C6AKB2_AGGAN Length = 96 Score = 114 bits (286), Expect = 9e-25, Method: Composition-based stats. Identities = 44/92 (47%), Positives = 62/92 (67%) Query: 3 KCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKIT 62 KC A+E VCCC+DVGT++D SDCT + +V+A A++ L LTEKA++ ES+PC IT Sbjct: 5 KCQAEEAKVCCCVDVGTVIDGSDCTIDFEQVYATEELAKEALNYLTEKAKAAESDPCHIT 64 Query: 63 PTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 + +G +L F F+C+AE +IFQL R Sbjct: 65 SDISAVENGYKLKAQFEFSCQAESMIFQLSTR 96 >UniRef50_Q7VKK6 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=Q7VKK6_HAEDU Length = 138 Score = 113 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 35/83 (42%), Positives = 53/83 (63%) Query: 12 CCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDG 71 C DVG+I+DNS+ A + +V+A + +AE LA L +KAR E++ C+I+ G Sbjct: 54 CNTFDVGSIIDNSERDAKFEKVYATQEQAEAVLAKLVQKARDTETDSCEISSNILPVEAG 113 Query: 72 VRLDIDFTFACEAEMLIFQLGLR 94 L +F F+C+AE++IFQLG R Sbjct: 114 FLLTANFDFSCQAEVVIFQLGTR 136 >UniRef50_P44686 UPF0381 protein HI0400 n=31 Tax=Gammaproteobacteria RepID=Y400_HAEIN Length = 95 Score = 110 bits (274), Expect = 2e-23, Method: Composition-based stats. Identities = 41/92 (44%), Positives = 61/92 (66%) Query: 3 KCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKIT 62 KC A+E+ C C+DVGTI+D SDC+ + ++ A+A L LT+KAR+ ES+PC+I Sbjct: 4 KCKAEESLTCSCVDVGTIIDGSDCSVEVHQFYSTEADANAVLERLTKKARNTESDPCEIK 63 Query: 63 PTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 +GV+L+ FTF+C+AE +IF+L R Sbjct: 64 SEIVAVENGVQLNASFTFSCQAEAMIFELANR 95 >UniRef50_B8F3Q3 tRNA-dihydrouridine synthase A n=2 Tax=Haemophilus parasuis RepID=B8F3Q3_HAEPS Length = 97 Score = 107 bits (266), Expect = 1e-22, Method: Composition-based stats. Identities = 42/97 (43%), Positives = 58/97 (59%), Gaps = 3/97 (3%) Query: 1 MSKCSADETPVC---CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESE 57 MSK +D+ C C ++ DN+D T + V+ +A+AEQ LA LT KAR VESE Sbjct: 1 MSKTFSDQADCCGGICKPSSTSMFDNADSTIELALVYPTQADAEQGLATLTAKAREVESE 60 Query: 58 PCKITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 PC I+ ++ DG L F F+C+AE +IFQ+ LR Sbjct: 61 PCNISSQISQIEDGFLLQAKFLFSCQAEAVIFQMKLR 97 >UniRef50_B0UTX1 Putative uncharacterized protein n=9 Tax=Pasteurellaceae RepID=B0UTX1_HAES2 Length = 98 Score = 103 bits (257), Expect = 2e-21, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 44/87 (50%) Query: 8 ETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTE 67 E C + +++DNS + + +A +KAR +E++P I T+ Sbjct: 12 EAHNMCRIKGDSMLDNSAREVVFEAEYDTEKQALNARDYFIQKARDIENDPANIESQITD 71 Query: 68 ESDGVRLDIDFTFACEAEMLIFQLGLR 94 S+G RL + F C+AE+++FQ+ +R Sbjct: 72 ISNGFRLKMRIVFGCQAEVVLFQMAIR 98 >UniRef50_P44027 UPF0381 protein HI0636 n=26 Tax=Pasteurellaceae RepID=Y636_HAEIN Length = 96 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. Identities = 34/82 (41%), Positives = 52/82 (63%) Query: 12 CCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDG 71 C D+ ++ DN DC+ ++ + +A++ LA T+KAR VESEPC+I T+ DG Sbjct: 14 CNTFDMKSLFDNRDCSQVIEYIYDSEGQAQEALAFFTQKARDVESEPCEIQSEITKVDDG 73 Query: 72 VRLDIDFTFACEAEMLIFQLGL 93 L DFTF C+AE++IFQ+ + Sbjct: 74 YLLKADFTFCCQAELVIFQMRI 95 >UniRef50_B8F341 Possible DNA mismatch repair protein n=2 Tax=Haemophilus parasuis RepID=B8F341_HAEPS Length = 91 Score = 101 bits (252), Expect = 7e-21, Method: Composition-based stats. Identities = 37/80 (46%), Positives = 56/80 (70%) Query: 15 MDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDGVRL 74 +DVGTI+DNSDC ++++++ +AEAE LA LT KAR+ ESEPC+I+ +G +L Sbjct: 12 IDVGTIIDNSDCAVNFTQLYTTKAEAEDALAFLTSKARATESEPCEISSEIISTENGFQL 71 Query: 75 DIDFTFACEAEMLIFQLGLR 94 F F+ + E +IF+LG+R Sbjct: 72 TACFKFSYQVESMIFELGIR 91 >UniRef50_A8ADP0 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8ADP0_CITK8 Length = 80 Score = 100 bits (250), Expect = 1e-20, Method: Composition-based stats. Identities = 70/80 (87%), Positives = 75/80 (93%) Query: 15 MDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDGVRL 74 MDVGTIMDNSDCTASYSRVF NRA+AE+TLAALTEKAR VESEPC+IT TFTEE++GVRL Sbjct: 1 MDVGTIMDNSDCTASYSRVFENRAQAEETLAALTEKARGVESEPCQITSTFTEEAEGVRL 60 Query: 75 DIDFTFACEAEMLIFQLGLR 94 DIDF FACEAE LIFQLGLR Sbjct: 61 DIDFVFACEAETLIFQLGLR 80 >UniRef50_D0Z807 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=D0Z807_EDWTE Length = 98 Score = 95.5 bits (236), Expect = 5e-19, Method: Composition-based stats. Identities = 35/84 (41%), Positives = 55/84 (65%), Gaps = 2/84 (2%) Query: 13 CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESE--PCKITPTFTEESD 70 C +D+G+++DN +CT ++ +A+R EAE+ LA LT KA+ V + PC IT +E+ Sbjct: 13 CAIDIGSVLDNDNCTTVITQAYASRDEAEKQLAQLTAKAQRVAQQEYPCIITHDISEKEG 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 L+ F F+C+AE +IF+L LR Sbjct: 73 EFVLNAHFNFSCQAETVIFELSLR 96 >UniRef50_P32162 UPF0381 protein yiiS n=61 Tax=Enterobacteriaceae RepID=YIIS_ECOLI Length = 99 Score = 87.4 bits (215), Expect = 1e-16, Method: Composition-based stats. Identities = 36/84 (42%), Positives = 51/84 (60%), Gaps = 2/84 (2%) Query: 13 CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVES--EPCKITPTFTEESD 70 C +D+GT++DN +CT+ +SR FA R EAE + L E A + S E + + Sbjct: 13 CAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKELAAATSSADEGASVAYKIKDLEG 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 V LD FTF+C+AEM+IF+L LR Sbjct: 73 QVELDAAFTFSCQAEMIIFELSLR 96 >UniRef50_C4X908 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X908_KLEPN Length = 99 Score = 74.7 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 35/84 (41%), Positives = 51/84 (60%), Gaps = 2/84 (2%) Query: 13 CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPC--KITPTFTEESD 70 C +DVGTI+DN DC +VF +R EAE T+AA+ E+A + ++ T D Sbjct: 13 CAIDVGTIIDNEDCVYRAEKVFPSREEAESTVAAVRERAAAAAPASEPPQVDYTIVAAGD 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 V+LD+ F+C+AE +IF+L LR Sbjct: 73 AVKLDLSIAFSCQAEKIIFELSLR 96 >UniRef50_C4L7R4 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4L7R4_TOLAT Length = 99 Score = 46.6 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 1/84 (1%) Query: 11 VCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESD 70 C +VGTI+ + D N +A A+ + ++ D Sbjct: 14 CGCVAEVGTIIRDGDDLVMIPVEGDNEEDARARQERYIAVAKEACPD-VLVSSELQTVED 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 GV L+ F C AE +IF++ R Sbjct: 73 GVVLNTTLQFTCTAEKMIFEMRAR 96 >UniRef50_A0KF70 Putative uncharacterized protein n=2 Tax=Aeromonas RepID=A0KF70_AERHH Length = 100 Score = 40.0 bits (92), Expect = 0.022, Method: Composition-based stats. Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 1/84 (1%) Query: 11 VCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESD 70 C ++GTI+ D A A+A LAA A+ V +E + ++ ++ Sbjct: 16 CGCFAEIGTIIGEGDDVMELVIEAAAEADARAKLAAYEALAKQVTAEATS-SSELSQGAN 74 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 GV L F C AE LIF++ R Sbjct: 75 GVILKARLQFTCTAEKLIFEMRAR 98 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P0AD34 UPF0381 protein yfcZ n=137 Tax=Enterobacteriacea... 117 1e-25 UniRef50_C6AKB2 DNA mismatch repair protein n=10 Tax=Gammaproteo... 113 2e-24 UniRef50_Q7VKK6 Putative uncharacterized protein n=5 Tax=Pasteur... 112 3e-24 UniRef50_P44686 UPF0381 protein HI0400 n=31 Tax=Gammaproteobacte... 110 1e-23 UniRef50_B8F3Q3 tRNA-dihydrouridine synthase A n=2 Tax=Haemophil... 107 1e-22 UniRef50_B0UTX1 Putative uncharacterized protein n=9 Tax=Pasteur... 102 5e-21 UniRef50_B8F341 Possible DNA mismatch repair protein n=2 Tax=Hae... 100 2e-20 UniRef50_P44027 UPF0381 protein HI0636 n=26 Tax=Pasteurellaceae ... 100 2e-20 UniRef50_A8ADP0 Putative uncharacterized protein n=2 Tax=Enterob... 97 1e-19 UniRef50_D0Z807 Putative uncharacterized protein n=2 Tax=Edwards... 95 8e-19 UniRef50_P32162 UPF0381 protein yiiS n=61 Tax=Enterobacteriaceae... 86 4e-16 UniRef50_C4L7R4 Putative uncharacterized protein n=1 Tax=Tolumon... 82 5e-15 UniRef50_C4X908 Putative uncharacterized protein n=5 Tax=Klebsie... 74 1e-12 Sequences not found previously or not previously below threshold: UniRef50_A0KF70 Putative uncharacterized protein n=2 Tax=Aeromon... 53 4e-06 UniRef50_Q7MFA3 Uncharacterized protein conserved in bacteria n=... 44 0.002 UniRef50_A1STA8 Putative uncharacterized protein n=2 Tax=Psychro... 43 0.003 UniRef50_A6FGT2 Putative uncharacterized protein n=1 Tax=Moritel... 40 0.021 UniRef50_Q2C8E1 Putative uncharacterized protein n=2 Tax=Vibrion... 38 0.081 >UniRef50_P0AD34 UPF0381 protein yfcZ n=137 Tax=Enterobacteriaceae RepID=YFCZ_ECOL6 Length = 94 Score = 117 bits (292), Expect = 1e-25, Method: Composition-based stats. Identities = 94/94 (100%), Positives = 94/94 (100%) Query: 1 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK 60 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK Sbjct: 1 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK 60 Query: 61 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR Sbjct: 61 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 >UniRef50_C6AKB2 DNA mismatch repair protein n=10 Tax=Gammaproteobacteria RepID=C6AKB2_AGGAN Length = 96 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 44/92 (47%), Positives = 62/92 (67%) Query: 3 KCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKIT 62 KC A+E VCCC+DVGT++D SDCT + +V+A A++ L LTEKA++ ES+PC IT Sbjct: 5 KCQAEEAKVCCCVDVGTVIDGSDCTIDFEQVYATEELAKEALNYLTEKAKAAESDPCHIT 64 Query: 63 PTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 + +G +L F F+C+AE +IFQL R Sbjct: 65 SDISAVENGYKLKAQFEFSCQAESMIFQLSTR 96 >UniRef50_Q7VKK6 Putative uncharacterized protein n=5 Tax=Pasteurellaceae RepID=Q7VKK6_HAEDU Length = 138 Score = 112 bits (280), Expect = 3e-24, Method: Composition-based stats. Identities = 37/94 (39%), Positives = 55/94 (58%) Query: 1 MSKCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCK 60 M E C DVG+I+DNS+ A + +V+A + +AE LA L +KAR E++ C+ Sbjct: 43 MKSDKPIECVGCNTFDVGSIIDNSERDAKFEKVYATQEQAEAVLAKLVQKARDTETDSCE 102 Query: 61 ITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 I+ G L +F F+C+AE++IFQLG R Sbjct: 103 ISSNILPVEAGFLLTANFDFSCQAEVVIFQLGTR 136 >UniRef50_P44686 UPF0381 protein HI0400 n=31 Tax=Gammaproteobacteria RepID=Y400_HAEIN Length = 95 Score = 110 bits (275), Expect = 1e-23, Method: Composition-based stats. Identities = 41/92 (44%), Positives = 61/92 (66%) Query: 3 KCSADETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKIT 62 KC A+E+ C C+DVGTI+D SDC+ + ++ A+A L LT+KAR+ ES+PC+I Sbjct: 4 KCKAEESLTCSCVDVGTIIDGSDCSVEVHQFYSTEADANAVLERLTKKARNTESDPCEIK 63 Query: 63 PTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 +GV+L+ FTF+C+AE +IF+L R Sbjct: 64 SEIVAVENGVQLNASFTFSCQAEAMIFELANR 95 >UniRef50_B8F3Q3 tRNA-dihydrouridine synthase A n=2 Tax=Haemophilus parasuis RepID=B8F3Q3_HAEPS Length = 97 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 42/97 (43%), Positives = 58/97 (59%), Gaps = 3/97 (3%) Query: 1 MSKCSADETPVC---CCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESE 57 MSK +D+ C C ++ DN+D T + V+ +A+AEQ LA LT KAR VESE Sbjct: 1 MSKTFSDQADCCGGICKPSSTSMFDNADSTIELALVYPTQADAEQGLATLTAKAREVESE 60 Query: 58 PCKITPTFTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 PC I+ ++ DG L F F+C+AE +IFQ+ LR Sbjct: 61 PCNISSQISQIEDGFLLQAKFLFSCQAEAVIFQMKLR 97 >UniRef50_B0UTX1 Putative uncharacterized protein n=9 Tax=Pasteurellaceae RepID=B0UTX1_HAES2 Length = 98 Score = 102 bits (253), Expect = 5e-21, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 44/87 (50%) Query: 8 ETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTE 67 E C + +++DNS + + +A +KAR +E++P I T+ Sbjct: 12 EAHNMCRIKGDSMLDNSAREVVFEAEYDTEKQALNARDYFIQKARDIENDPANIESQITD 71 Query: 68 ESDGVRLDIDFTFACEAEMLIFQLGLR 94 S+G RL + F C+AE+++FQ+ +R Sbjct: 72 ISNGFRLKMRIVFGCQAEVVLFQMAIR 98 >UniRef50_B8F341 Possible DNA mismatch repair protein n=2 Tax=Haemophilus parasuis RepID=B8F341_HAEPS Length = 91 Score = 100 bits (249), Expect = 2e-20, Method: Composition-based stats. Identities = 37/80 (46%), Positives = 56/80 (70%) Query: 15 MDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDGVRL 74 +DVGTI+DNSDC ++++++ +AEAE LA LT KAR+ ESEPC+I+ +G +L Sbjct: 12 IDVGTIIDNSDCAVNFTQLYTTKAEAEDALAFLTSKARATESEPCEISSEIISTENGFQL 71 Query: 75 DIDFTFACEAEMLIFQLGLR 94 F F+ + E +IF+LG+R Sbjct: 72 TACFKFSYQVESMIFELGIR 91 >UniRef50_P44027 UPF0381 protein HI0636 n=26 Tax=Pasteurellaceae RepID=Y636_HAEIN Length = 96 Score = 100 bits (248), Expect = 2e-20, Method: Composition-based stats. Identities = 35/86 (40%), Positives = 53/86 (61%) Query: 8 ETPVCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTE 67 E C D+ ++ DN DC+ ++ + +A++ LA T+KAR VESEPC+I T+ Sbjct: 10 ECVGCNTFDMKSLFDNRDCSQVIEYIYDSEGQAQEALAFFTQKARDVESEPCEIQSEITK 69 Query: 68 ESDGVRLDIDFTFACEAEMLIFQLGL 93 DG L DFTF C+AE++IFQ+ + Sbjct: 70 VDDGYLLKADFTFCCQAELVIFQMRI 95 >UniRef50_A8ADP0 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A8ADP0_CITK8 Length = 80 Score = 97.4 bits (241), Expect = 1e-19, Method: Composition-based stats. Identities = 70/80 (87%), Positives = 75/80 (93%) Query: 15 MDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESDGVRL 74 MDVGTIMDNSDCTASYSRVF NRA+AE+TLAALTEKAR VESEPC+IT TFTEE++GVRL Sbjct: 1 MDVGTIMDNSDCTASYSRVFENRAQAEETLAALTEKARGVESEPCQITSTFTEEAEGVRL 60 Query: 75 DIDFTFACEAEMLIFQLGLR 94 DIDF FACEAE LIFQLGLR Sbjct: 61 DIDFVFACEAETLIFQLGLR 80 >UniRef50_D0Z807 Putative uncharacterized protein n=2 Tax=Edwardsiella RepID=D0Z807_EDWTE Length = 98 Score = 94.7 bits (234), Expect = 8e-19, Method: Composition-based stats. Identities = 35/85 (41%), Positives = 55/85 (64%), Gaps = 2/85 (2%) Query: 12 CCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESE--PCKITPTFTEES 69 C +D+G+++DN +CT ++ +A+R EAE+ LA LT KA+ V + PC IT +E+ Sbjct: 12 GCAIDIGSVLDNDNCTTVITQAYASRDEAEKQLAQLTAKAQRVAQQEYPCIITHDISEKE 71 Query: 70 DGVRLDIDFTFACEAEMLIFQLGLR 94 L+ F F+C+AE +IF+L LR Sbjct: 72 GEFVLNAHFNFSCQAETVIFELSLR 96 >UniRef50_P32162 UPF0381 protein yiiS n=61 Tax=Enterobacteriaceae RepID=YIIS_ECOLI Length = 99 Score = 85.9 bits (211), Expect = 4e-16, Method: Composition-based stats. Identities = 36/85 (42%), Positives = 51/85 (60%), Gaps = 2/85 (2%) Query: 12 CCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVES--EPCKITPTFTEES 69 C +D+GT++DN +CT+ +SR FA R EAE + L E A + S E + + Sbjct: 12 GCAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKELAAATSSADEGASVAYKIKDLE 71 Query: 70 DGVRLDIDFTFACEAEMLIFQLGLR 94 V LD FTF+C+AEM+IF+L LR Sbjct: 72 GQVELDAAFTFSCQAEMIIFELSLR 96 >UniRef50_C4L7R4 Putative uncharacterized protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4L7R4_TOLAT Length = 99 Score = 82.0 bits (201), Expect = 5e-15, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 1/84 (1%) Query: 11 VCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESD 70 C +VGTI+ + D N +A A+ + ++ D Sbjct: 14 CGCVAEVGTIIRDGDDLVMIPVEGDNEEDARARQERYIAVAKEACPD-VLVSSELQTVED 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 GV L+ F C AE +IF++ R Sbjct: 73 GVVLNTTLQFTCTAEKMIFEMRAR 96 >UniRef50_C4X908 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X908_KLEPN Length = 99 Score = 74.3 bits (181), Expect = 1e-12, Method: Composition-based stats. Identities = 35/85 (41%), Positives = 51/85 (60%), Gaps = 2/85 (2%) Query: 12 CCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPC--KITPTFTEES 69 C +DVGTI+DN DC +VF +R EAE T+AA+ E+A + ++ T Sbjct: 12 GCAIDVGTIIDNEDCVYRAEKVFPSREEAESTVAAVRERAAAAAPASEPPQVDYTIVAAG 71 Query: 70 DGVRLDIDFTFACEAEMLIFQLGLR 94 D V+LD+ F+C+AE +IF+L LR Sbjct: 72 DAVKLDLSIAFSCQAEKIIFELSLR 96 >UniRef50_A0KF70 Putative uncharacterized protein n=2 Tax=Aeromonas RepID=A0KF70_AERHH Length = 100 Score = 52.7 bits (125), Expect = 4e-06, Method: Composition-based stats. Identities = 25/84 (29%), Positives = 38/84 (45%), Gaps = 1/84 (1%) Query: 11 VCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESD 70 C ++GTI+ D A A+A LAA A+ V +E + ++ ++ Sbjct: 16 CGCFAEIGTIIGEGDDVMELVIEAAAEADARAKLAAYEALAKQVTAEATS-SSELSQGAN 74 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 GV L F C AE LIF++ R Sbjct: 75 GVILKARLQFTCTAEKLIFEMRAR 98 >UniRef50_Q7MFA3 Uncharacterized protein conserved in bacteria n=57 Tax=Vibrionales RepID=Q7MFA3_VIBVY Length = 99 Score = 43.9 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 22/84 (26%), Positives = 33/84 (39%), Gaps = 2/84 (2%) Query: 11 VCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESD 70 C ++G I+ D A + ++ + A E A++V T T+ D Sbjct: 14 CGCAGEIGFIIKEGDDVADVTIYAGSKELLQAEFAKYLELAKAVNENVEYNTTDLTD--D 71 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 L F F AE LIF+L R Sbjct: 72 STELTARFKFEVSAEKLIFELKSR 95 >UniRef50_A1STA8 Putative uncharacterized protein n=2 Tax=Psychromonas RepID=A1STA8_PSYIN Length = 98 Score = 42.7 bits (99), Expect = 0.003, Method: Composition-based stats. Identities = 20/83 (24%), Positives = 34/83 (40%), Gaps = 1/83 (1%) Query: 11 VCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESD 70 +++G+++ + D + A LA A + +T D Sbjct: 14 CGSFVELGSVISDDDTVLVLP-FTGKESVAVTALAIKYIDAAKTRFDSVVVTEQEITADD 72 Query: 71 GVRLDIDFTFACEAEMLIFQLGL 93 V L++ F F C AE LIF++GL Sbjct: 73 LVTLEVTFNFDCTAEKLIFEMGL 95 >UniRef50_A6FGT2 Putative uncharacterized protein n=1 Tax=Moritella sp. PE36 RepID=A6FGT2_9GAMM Length = 97 Score = 40.0 bits (92), Expect = 0.021, Method: Composition-based stats. Identities = 20/90 (22%), Positives = 35/90 (38%), Gaps = 4/90 (4%) Query: 6 ADETPVCCCM-DVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPT 64 D C ++G+I+ D S+ F ++ ++ + L E I T Sbjct: 7 TDIKDCCGAFAEIGSIIHPDDTELSFPISFDSQLAVDEKRSELEAYVEEQFDEAVTINFT 66 Query: 65 FTEESDGVRLDIDFTFACEAEMLIFQLGLR 94 D + +F C AE +IF++ LR Sbjct: 67 ---AQDEHTYLVTLSFTCTAEKMIFEMNLR 93 >UniRef50_Q2C8E1 Putative uncharacterized protein n=2 Tax=Vibrionaceae RepID=Q2C8E1_9GAMM Length = 99 Score = 38.1 bits (87), Expect = 0.081, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 30/84 (35%), Gaps = 2/84 (2%) Query: 11 VCCCMDVGTIMDNSDCTASYSRVFANRAEAEQTLAALTEKARSVESEPCKITPTFTEESD 70 ++G I+ +D TA + E L+ A+ V + T D Sbjct: 14 CGVSAEMGFIIKENDDTADVKVFANGKDALEIELSNYLALAKEVNANYKVETTPVNA--D 71 Query: 71 GVRLDIDFTFACEAEMLIFQLGLR 94 L F C AE LIF+L R Sbjct: 72 STELTARIQFECSAEKLIFELRSR 95 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.135 0.370 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 438,372,957 Number of Sequences: 3077464 Number of extensions: 11628540 Number of successful extensions: 38832 Number of sequences better than 1.0e-01: 21 Number of HSP's better than 0.1 without gapping: 42 Number of HSP's successfully gapped in prelim test: 9 Number of HSP's that attempted gapping in prelim test: 38774 Number of HSP's gapped (non-prelim): 51 length of query: 94 length of database: 1,040,396,356 effective HSP length: 63 effective length of query: 31 effective length of database: 846,516,124 effective search space: 26241999844 effective search space used: 26241999844 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.4 bits) S2: 87 (38.1 bits)