BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (146 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P45751 Uncharacterized protein yrfB n=94 Tax=Enterobact... 295 4e-79 UniRef50_B7LS94 Putative uncharacterized protein n=1 Tax=Escheri... 132 4e-30 UniRef50_D2TLP5 Putative fimbrial protein HofO n=1 Tax=Citrobact... 119 3e-26 UniRef50_C1MD68 Putative uncharacterized protein n=1 Tax=Citroba... 112 5e-24 UniRef50_A9MMD5 Putative uncharacterized protein n=1 Tax=Salmone... 102 5e-21 UniRef50_A4WFI1 Putative uncharacterized protein n=1 Tax=Enterob... 66 3e-10 UniRef50_UPI000182682E hypothetical protein EcanA3_15612 n=1 Tax... 66 4e-10 UniRef50_A7MGC2 Putative uncharacterized protein n=2 Tax=Cronoba... 57 1e-07 UniRef50_C4X1Q4 Putative uncharacterized protein n=5 Tax=Klebsie... 52 6e-06 >UniRef50_P45751 Uncharacterized protein yrfB n=94 Tax=Enterobacteriaceae RepID=YRFB_ECOLI Length = 146 Score = 295 bits (754), Expect = 4e-79, Method: Compositional matrix adjust. Identities = 146/146 (100%), Positives = 146/146 (100%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR Sbjct: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 Query: 61 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV 120 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV Sbjct: 61 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV 120 Query: 121 SVSRFSLSVEGDDLLFTLQLETPHEG 146 SVSRFSLSVEGDDLLFTLQLETPHEG Sbjct: 121 SVSRFSLSVEGDDLLFTLQLETPHEG 146 >UniRef50_B7LS94 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LS94_ESCF3 Length = 202 Score = 132 bits (331), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 70/152 (46%), Positives = 95/152 (62%), Gaps = 6/152 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTL--IFLSSTHHEERDALIRLRASHHQQWAAL 58 MN FFD+W ATSPRLR +CW W++ L L I LS RDA RL ++ QW L Sbjct: 51 MNTFFDFWCATSPRLRLICWGLWVIALCGLSRIALSPPVENGRDAQSRLHQANLLQWRTL 110 Query: 59 YRLVDTAP----FSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTR 114 +++ +A S++ LPFSP DFQ + QLVSW PSA+GGELALK W A+P F + Sbjct: 111 HKIAGSADEQRLVSDDNALPFSPFDFQQAEMQLVSWQPSAEGGELALKGPWSAIPPVFRQ 170 Query: 115 LAERNVSVSRFSLSVEGDDLLFTLQLETPHEG 146 LAER + +++F+L + + L TL LE ++G Sbjct: 171 LAERQMRITQFTLDPDENGLRITLLLERLNDG 202 >UniRef50_D2TLP5 Putative fimbrial protein HofO n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TLP5_CITRO Length = 151 Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust. Identities = 71/151 (47%), Positives = 88/151 (58%), Gaps = 6/151 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLI--FLSSTHHEERDALIRLRASHHQQWAAL 58 M D W A +PR R CW W ++L L+ L S+ ++L RLR + Q W AL Sbjct: 1 MAALIDLWCALTPRTRVFCWTGWTVLLSALMAWCLFSSPTANSESLNRLRMENRQLWPAL 60 Query: 59 YRLVDTAP--FSEEKT--LPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTR 114 YRL T S E + PFSP L G QL W PS QGGELAL+ WEAVPS F Sbjct: 61 YRLAATEDERLSAEASPDAPFSPFAISLPGLQLRRWQPSEQGGELALRGGWEAVPSLFAY 120 Query: 115 LAERNVSVSRFSLSVEGDDLLFTLQLETPHE 145 LAER VSV+ FSL++E ++LL TL LE+ H+ Sbjct: 121 LAERGVSVNGFSLAMEKNELLVTLTLESLHD 151 >UniRef50_C1MD68 Putative uncharacterized protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MD68_9ENTR Length = 151 Score = 112 bits (279), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 61/135 (45%), Positives = 82/135 (60%), Gaps = 5/135 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFW---LLMLVTLIF--LSSTHHEERDALIRLRASHHQQW 55 M D W A SPR R +CW W LL+LVTL ++ +R+ L++ RA+ QW Sbjct: 1 MITLCDVWLAQSPRFRLICWCGWMLGLLILVTLCLYPVARERTMQRETLVQQRAAIQMQW 60 Query: 56 AALYRLVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRL 115 +Y L + EEK FSPL FQ S +L+ W PSAQGGE+AL + W+A+P F +L Sbjct: 61 RDVYLLAASTAEPEEKLTLFSPLLFQTSLTRLIHWQPSAQGGEMALISAWDAIPQTFVQL 120 Query: 116 AERNVSVSRFSLSVE 130 A++ +SVSRFSLS E Sbjct: 121 ADQGMSVSRFSLSAE 135 >UniRef50_A9MMD5 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MMD5_SALAR Length = 113 Score = 102 bits (253), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 58/98 (59%), Positives = 65/98 (66%), Gaps = 4/98 (4%) Query: 48 RASHHQQWAALYRL-VDTAPF---SEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKT 103 R + QQW L RL V P + E T PFSPLDFQ +L+ W PSAQGGE+ALKT Sbjct: 10 REAARQQWRNLRRLSVAAEPLFGRTTENTRPFSPLDFQTPHQRLLHWLPSAQGGEMALKT 69 Query: 104 LWEAVPSAFTRLAERNVSVSRFSLSVEGDDLLFTLQLE 141 WE VPS F RLAE +SVSRFSL EG +LL TLQLE Sbjct: 70 SWEGVPSLFVRLAESEMSVSRFSLRREGAELLMTLQLE 107 >UniRef50_A4WFI1 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WFI1_ENT38 Length = 153 Score = 66.2 bits (160), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 46/151 (30%), Positives = 71/151 (47%), Gaps = 7/151 (4%) Query: 1 MNMFFDWWFATSPRLRQLCWAFW----LLMLVTLIFLSSTHHEERDALIR--LRASHHQQ 54 M+ F+ W+ R L W LL ++ + H + + R + Sbjct: 1 MHSLFERWYVFGSSTRILSWGLISGSVLLTAYLMLARPAIRHVSQLEMKREHIATVVKAL 60 Query: 55 WAA-LYRLVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFT 113 WA+ +R +D FSPLDFQ G +LV W PSA+GGEL+L+ W VP+ F Sbjct: 61 WASEAHRGLDEGQPDSSARQAFSPLDFQTHGVRLVRWLPSAKGGELSLEADWIQVPALFD 120 Query: 114 RLAERNVSVSRFSLSVEGDDLLFTLQLETPH 144 +A+R + V+ FS+ L +Q+E+ H Sbjct: 121 AIAQRGMGVTAFSVEPGESRLKLIVQVESLH 151 >UniRef50_UPI000182682E hypothetical protein EcanA3_15612 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI000182682E Length = 124 Score = 65.9 bits (159), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 33/88 (37%), Positives = 49/88 (55%), Gaps = 1/88 (1%) Query: 55 WAALYRL-VDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFT 113 WAA+ ++ +T PFSPLDFQ +G +LV W P + GGEL L W A+P F Sbjct: 34 WAAVRKIPTETEEPVAPAVRPFSPLDFQANGTRLVHWKPLSHGGELELDADWAAIPGIFA 93 Query: 114 RLAERNVSVSRFSLSVEGDDLLFTLQLE 141 +L + NV + F ++ E L ++L+ Sbjct: 94 QLEQCNVRIGTFVITAESGRLRLRMELQ 121 >UniRef50_A7MGC2 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=A7MGC2_ENTS8 Length = 158 Score = 57.4 bits (137), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 52/159 (32%), Positives = 66/159 (41%), Gaps = 16/159 (10%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 M + D W + R C W+ +LV + T RD +L+A Q+ A L Sbjct: 1 MRLLIDRWLSGPSSRRAACAGVWIALLVAAGYAGLTPLT-RD-FRQLQAQLEQRRAELRT 58 Query: 61 LVD-TAPFSEEKTL-----------PFSPLDFQLSG-AQLVSWHPSAQGGELALKTLWEA 107 L A S E+ PFSPL L+ W P GEL L W Sbjct: 59 LKQRQAAQSGERDAVAALAAQLAMKPFSPLAVDPGPEGHLIHWQPQGDTGELELALAWPH 118 Query: 108 VPSAFTRLAERNVSVSRFSLSVEGDD-LLFTLQLETPHE 145 VP F LA+ ++ F+LS EGD L TLQLE HE Sbjct: 119 VPDIFEALAQSDMLAHGFTLSREGDQRLRLTLQLERAHE 157 >UniRef50_C4X1Q4 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X1Q4_KLEPN Length = 142 Score = 52.0 bits (123), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 3/85 (3%) Query: 49 ASHHQQWAALYRL---VDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLW 105 A+ QW + L + P + T FSP+ LSGA+L++W P GGE+ L W Sbjct: 42 AALQTQWRRIMALRIPGEENPGARIDTQSFSPIAIPLSGAKLIAWRPQGSGGEMELTLSW 101 Query: 106 EAVPSAFTRLAERNVSVSRFSLSVE 130 VP+ F+ LA ++ FSL E Sbjct: 102 STVPALFSWLARCGMNPRSFSLGRE 126 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P45751 Uncharacterized protein yrfB n=94 Tax=Enterobact... 195 3e-49 UniRef50_B7LS94 Putative uncharacterized protein n=1 Tax=Escheri... 186 2e-46 UniRef50_D2TLP5 Putative fimbrial protein HofO n=1 Tax=Citrobact... 164 6e-40 UniRef50_A4WFI1 Putative uncharacterized protein n=1 Tax=Enterob... 158 7e-38 UniRef50_C1MD68 Putative uncharacterized protein n=1 Tax=Citroba... 156 3e-37 UniRef50_A7MGC2 Putative uncharacterized protein n=2 Tax=Cronoba... 139 2e-32 UniRef50_A9MMD5 Putative uncharacterized protein n=1 Tax=Salmone... 120 1e-26 UniRef50_UPI000182682E hypothetical protein EcanA3_15612 n=1 Tax... 112 3e-24 UniRef50_C4X1Q4 Putative uncharacterized protein n=5 Tax=Klebsie... 97 2e-19 Sequences not found previously or not previously below threshold: UniRef50_D2T9A6 Conserved membrane protein hofO n=2 Tax=Erwinia ... 47 2e-04 UniRef50_B2VJX1 Putative uncharacterized protein n=1 Tax=Erwinia... 45 0.001 UniRef50_C8QE34 Putative uncharacterized protein n=1 Tax=Pantoea... 39 0.050 >UniRef50_P45751 Uncharacterized protein yrfB n=94 Tax=Enterobacteriaceae RepID=YRFB_ECOLI Length = 146 Score = 195 bits (496), Expect = 3e-49, Method: Composition-based stats. Identities = 146/146 (100%), Positives = 146/146 (100%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR Sbjct: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 Query: 61 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV 120 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV Sbjct: 61 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV 120 Query: 121 SVSRFSLSVEGDDLLFTLQLETPHEG 146 SVSRFSLSVEGDDLLFTLQLETPHEG Sbjct: 121 SVSRFSLSVEGDDLLFTLQLETPHEG 146 >UniRef50_B7LS94 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LS94_ESCF3 Length = 202 Score = 186 bits (472), Expect = 2e-46, Method: Composition-based stats. Identities = 70/152 (46%), Positives = 95/152 (62%), Gaps = 6/152 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTL--IFLSSTHHEERDALIRLRASHHQQWAAL 58 MN FFD+W ATSPRLR +CW W++ L L I LS RDA RL ++ QW L Sbjct: 51 MNTFFDFWCATSPRLRLICWGLWVIALCGLSRIALSPPVENGRDAQSRLHQANLLQWRTL 110 Query: 59 YRLVDTAPF----SEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTR 114 +++ +A S++ LPFSP DFQ + QLVSW PSA+GGELALK W A+P F + Sbjct: 111 HKIAGSADEQRLVSDDNALPFSPFDFQQAEMQLVSWQPSAEGGELALKGPWSAIPPVFRQ 170 Query: 115 LAERNVSVSRFSLSVEGDDLLFTLQLETPHEG 146 LAER + +++F+L + + L TL LE ++G Sbjct: 171 LAERQMRITQFTLDPDENGLRITLLLERLNDG 202 >UniRef50_D2TLP5 Putative fimbrial protein HofO n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TLP5_CITRO Length = 151 Score = 164 bits (416), Expect = 6e-40, Method: Composition-based stats. Identities = 69/151 (45%), Positives = 85/151 (56%), Gaps = 6/151 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLI--FLSSTHHEERDALIRLRASHHQQWAAL 58 M D W A +PR R CW W ++L L+ L S+ ++L RLR + Q W AL Sbjct: 1 MAALIDLWCALTPRTRVFCWTGWTVLLSALMAWCLFSSPTANSESLNRLRMENRQLWPAL 60 Query: 59 YRLVDTAPF----SEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTR 114 YRL T PFSP L G QL W PS QGGELAL+ WEAVPS F Sbjct: 61 YRLAATEDERLSAEASPDAPFSPFAISLPGLQLRRWQPSEQGGELALRGGWEAVPSLFAY 120 Query: 115 LAERNVSVSRFSLSVEGDDLLFTLQLETPHE 145 LAER VSV+ FSL++E ++LL TL LE+ H+ Sbjct: 121 LAERGVSVNGFSLAMEKNELLVTLTLESLHD 151 >UniRef50_A4WFI1 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WFI1_ENT38 Length = 153 Score = 158 bits (398), Expect = 7e-38, Method: Composition-based stats. Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 7/151 (4%) Query: 1 MNMFFDWWFATSPRLRQLCWAFW----LLMLVTLIFLSSTHHEERDALIR--LRASHHQQ 54 M+ F+ W+ R L W LL ++ + H + + R + Sbjct: 1 MHSLFERWYVFGSSTRILSWGLISGSVLLTAYLMLARPAIRHVSQLEMKREHIATVVKAL 60 Query: 55 WA-ALYRLVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFT 113 WA +R +D FSPLDFQ G +LV W PSA+GGEL+L+ W VP+ F Sbjct: 61 WASEAHRGLDEGQPDSSARQAFSPLDFQTHGVRLVRWLPSAKGGELSLEADWIQVPALFD 120 Query: 114 RLAERNVSVSRFSLSVEGDDLLFTLQLETPH 144 +A+R + V+ FS+ L +Q+E+ H Sbjct: 121 AIAQRGMGVTAFSVEPGESRLKLIVQVESLH 151 >UniRef50_C1MD68 Putative uncharacterized protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MD68_9ENTR Length = 151 Score = 156 bits (393), Expect = 3e-37, Method: Composition-based stats. Identities = 69/151 (45%), Positives = 93/151 (61%), Gaps = 5/151 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFW---LLMLVTLIF--LSSTHHEERDALIRLRASHHQQW 55 M D W A SPR R +CW W LL+LVTL ++ +R+ L++ RA+ QW Sbjct: 1 MITLCDVWLAQSPRFRLICWCGWMLGLLILVTLCLYPVARERTMQRETLVQQRAAIQMQW 60 Query: 56 AALYRLVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRL 115 +Y L + EEK FSPL FQ S +L+ W PSAQGGE+AL + W+A+P F +L Sbjct: 61 RDVYLLAASTAEPEEKLTLFSPLLFQTSLTRLIHWQPSAQGGEMALISAWDAIPQTFVQL 120 Query: 116 AERNVSVSRFSLSVEGDDLLFTLQLETPHEG 146 A++ +SVSRFSLS E +LL TLQLE ++G Sbjct: 121 ADQGMSVSRFSLSAEDTELLLTLQLERLNDG 151 >UniRef50_A7MGC2 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=A7MGC2_ENTS8 Length = 158 Score = 139 bits (350), Expect = 2e-32, Method: Composition-based stats. Identities = 50/159 (31%), Positives = 64/159 (40%), Gaps = 16/159 (10%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 M + D W + R C W+ +LV + T +L+A Q+ A L Sbjct: 1 MRLLIDRWLSGPSSRRAACAGVWIALLVAAGYAGLTPLTR--DFRQLQAQLEQRRAELRT 58 Query: 61 LVD-TAPFSEEKT-----------LPFSPLDFQLSG-AQLVSWHPSAQGGELALKTLWEA 107 L A S E+ PFSPL L+ W P GEL L W Sbjct: 59 LKQRQAAQSGERDAVAALAAQLAMKPFSPLAVDPGPEGHLIHWQPQGDTGELELALAWPH 118 Query: 108 VPSAFTRLAERNVSVSRFSLSVEGD-DLLFTLQLETPHE 145 VP F LA+ ++ F+LS EGD L TLQLE HE Sbjct: 119 VPDIFEALAQSDMLAHGFTLSREGDQRLRLTLQLERAHE 157 >UniRef50_A9MMD5 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MMD5_SALAR Length = 113 Score = 120 bits (301), Expect = 1e-26, Method: Composition-based stats. Identities = 58/109 (53%), Positives = 67/109 (61%), Gaps = 4/109 (3%) Query: 39 EERDALIRLRASHHQQWAALYRLVDTAPF----SEEKTLPFSPLDFQLSGAQLVSWHPSA 94 + L + R + QQW L RL A + E T PFSPLDFQ +L+ W PSA Sbjct: 1 MQHTRLSQQREAARQQWRNLRRLSVAAEPLFGRTTENTRPFSPLDFQTPHQRLLHWLPSA 60 Query: 95 QGGELALKTLWEAVPSAFTRLAERNVSVSRFSLSVEGDDLLFTLQLETP 143 QGGE+ALKT WE VPS F RLAE +SVSRFSL EG +LL TLQLE Sbjct: 61 QGGEMALKTSWEGVPSLFVRLAESEMSVSRFSLRREGAELLMTLQLERL 109 >UniRef50_UPI000182682E hypothetical protein EcanA3_15612 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI000182682E Length = 124 Score = 112 bits (280), Expect = 3e-24, Method: Composition-based stats. Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%) Query: 51 HHQQWAALYRL-VDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVP 109 + WAA+ ++ +T PFSPLDFQ +G +LV W P + GGEL L W A+P Sbjct: 30 NASLWAAVRKIPTETEEPVAPAVRPFSPLDFQANGTRLVHWKPLSHGGELELDADWAAIP 89 Query: 110 SAFTRLAERNVSVSRFSLSVEGDDLLFTLQLE 141 F +L + NV + F ++ E L ++L+ Sbjct: 90 GIFAQLEQCNVRIGTFVITAESGRLRLRMELQ 121 >UniRef50_C4X1Q4 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X1Q4_KLEPN Length = 142 Score = 97.1 bits (240), Expect = 2e-19, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 3/85 (3%) Query: 49 ASHHQQWAALYRL---VDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLW 105 A+ QW + L + P + T FSP+ LSGA+L++W P GGE+ L W Sbjct: 42 AALQTQWRRIMALRIPGEENPGARIDTQSFSPIAIPLSGAKLIAWRPQGSGGEMELTLSW 101 Query: 106 EAVPSAFTRLAERNVSVSRFSLSVE 130 VP+ F+ LA ++ FSL E Sbjct: 102 STVPALFSWLARCGMNPRSFSLGRE 126 >UniRef50_D2T9A6 Conserved membrane protein hofO n=2 Tax=Erwinia pyrifoliae RepID=D2T9A6_ERWPY Length = 170 Score = 46.7 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 2/66 (3%) Query: 77 PLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLA--ERNVSVSRFSLSVEGDDL 134 P+ SGA+L WHP+ +GGELA W L+ E ++ FSL+ E L Sbjct: 101 PVLLNTSGAELEHWHPATEGGELAATLQWAQFAGLLNYLSALEPAPALPAFSLTREQAKL 160 Query: 135 LFTLQL 140 ++L Sbjct: 161 RLVMEL 166 >UniRef50_B2VJX1 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VJX1_ERWT9 Length = 166 Score = 44.8 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 2/66 (3%) Query: 77 PLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNVSVS--RFSLSVEGDDL 134 P SGA+L WHP+ GG+LA+ W L+E + FSL E L Sbjct: 97 PALLLTSGAELEYWHPTEHGGKLAVTLQWPQFAGLLNYLSELRPVPAFPAFSLKREEAKL 156 Query: 135 LFTLQL 140 ++L Sbjct: 157 RLMMEL 162 >UniRef50_C8QE34 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QE34_9ENTR Length = 164 Score = 39.0 bits (89), Expect = 0.050, Method: Composition-based stats. Identities = 30/109 (27%), Positives = 43/109 (39%), Gaps = 3/109 (2%) Query: 38 HEERDALIRLRASHHQQWAALYRLVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGG 97 ++R + LR S + + L + + L S L G QL +W P +Q Sbjct: 57 EKQRQQQLALRPSIAGLLSEINALQQPTASPQPRLLLESLLA--ERGTQLDAWQPESQPQ 114 Query: 98 ELALKTLWEAVPSAFTRLAERNVSV-SRFSLSVEGDDLLFTLQLETPHE 145 + LK W+ F LA + V RF L E L+ L LE E Sbjct: 115 HIVLKLNWQQFTPLFAELAHTALPVPERFQLQAEQGTLMTQLWLENRDE 163 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P45751 Uncharacterized protein yrfB n=94 Tax=Enterobact... 187 7e-47 UniRef50_B7LS94 Putative uncharacterized protein n=1 Tax=Escheri... 181 5e-45 UniRef50_D2TLP5 Putative fimbrial protein HofO n=1 Tax=Citrobact... 162 3e-39 UniRef50_A4WFI1 Putative uncharacterized protein n=1 Tax=Enterob... 154 9e-37 UniRef50_C1MD68 Putative uncharacterized protein n=1 Tax=Citroba... 153 2e-36 UniRef50_A7MGC2 Putative uncharacterized protein n=2 Tax=Cronoba... 136 2e-31 UniRef50_A9MMD5 Putative uncharacterized protein n=1 Tax=Salmone... 121 1e-26 UniRef50_UPI000182682E hypothetical protein EcanA3_15612 n=1 Tax... 111 7e-24 UniRef50_C4X1Q4 Putative uncharacterized protein n=5 Tax=Klebsie... 94 2e-18 UniRef50_B2VJX1 Putative uncharacterized protein n=1 Tax=Erwinia... 74 2e-12 UniRef50_D2T9A6 Conserved membrane protein hofO n=2 Tax=Erwinia ... 73 2e-12 Sequences not found previously or not previously below threshold: UniRef50_C8QE34 Putative uncharacterized protein n=1 Tax=Pantoea... 42 0.006 CONVERGED! >UniRef50_P45751 Uncharacterized protein yrfB n=94 Tax=Enterobacteriaceae RepID=YRFB_ECOLI Length = 146 Score = 187 bits (475), Expect = 7e-47, Method: Composition-based stats. Identities = 146/146 (100%), Positives = 146/146 (100%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR Sbjct: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 Query: 61 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV 120 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV Sbjct: 61 LVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNV 120 Query: 121 SVSRFSLSVEGDDLLFTLQLETPHEG 146 SVSRFSLSVEGDDLLFTLQLETPHEG Sbjct: 121 SVSRFSLSVEGDDLLFTLQLETPHEG 146 >UniRef50_B7LS94 Putative uncharacterized protein n=1 Tax=Escherichia fergusonii ATCC 35469 RepID=B7LS94_ESCF3 Length = 202 Score = 181 bits (460), Expect = 5e-45, Method: Composition-based stats. Identities = 70/152 (46%), Positives = 95/152 (62%), Gaps = 6/152 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTL--IFLSSTHHEERDALIRLRASHHQQWAAL 58 MN FFD+W ATSPRLR +CW W++ L L I LS RDA RL ++ QW L Sbjct: 51 MNTFFDFWCATSPRLRLICWGLWVIALCGLSRIALSPPVENGRDAQSRLHQANLLQWRTL 110 Query: 59 YRLVDTAPF----SEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTR 114 +++ +A S++ LPFSP DFQ + QLVSW PSA+GGELALK W A+P F + Sbjct: 111 HKIAGSADEQRLVSDDNALPFSPFDFQQAEMQLVSWQPSAEGGELALKGPWSAIPPVFRQ 170 Query: 115 LAERNVSVSRFSLSVEGDDLLFTLQLETPHEG 146 LAER + +++F+L + + L TL LE ++G Sbjct: 171 LAERQMRITQFTLDPDENGLRITLLLERLNDG 202 >UniRef50_D2TLP5 Putative fimbrial protein HofO n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TLP5_CITRO Length = 151 Score = 162 bits (410), Expect = 3e-39, Method: Composition-based stats. Identities = 69/151 (45%), Positives = 85/151 (56%), Gaps = 6/151 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLI--FLSSTHHEERDALIRLRASHHQQWAAL 58 M D W A +PR R CW W ++L L+ L S+ ++L RLR + Q W AL Sbjct: 1 MAALIDLWCALTPRTRVFCWTGWTVLLSALMAWCLFSSPTANSESLNRLRMENRQLWPAL 60 Query: 59 YRLVDTAPF----SEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTR 114 YRL T PFSP L G QL W PS QGGELAL+ WEAVPS F Sbjct: 61 YRLAATEDERLSAEASPDAPFSPFAISLPGLQLRRWQPSEQGGELALRGGWEAVPSLFAY 120 Query: 115 LAERNVSVSRFSLSVEGDDLLFTLQLETPHE 145 LAER VSV+ FSL++E ++LL TL LE+ H+ Sbjct: 121 LAERGVSVNGFSLAMEKNELLVTLTLESLHD 151 >UniRef50_A4WFI1 Putative uncharacterized protein n=1 Tax=Enterobacter sp. 638 RepID=A4WFI1_ENT38 Length = 153 Score = 154 bits (389), Expect = 9e-37, Method: Composition-based stats. Identities = 46/151 (30%), Positives = 70/151 (46%), Gaps = 7/151 (4%) Query: 1 MNMFFDWWFATSPRLRQLCWAFW----LLMLVTLIFLSSTHHEERDALIR--LRASHHQQ 54 M+ F+ W+ R L W LL ++ + H + + R + Sbjct: 1 MHSLFERWYVFGSSTRILSWGLISGSVLLTAYLMLARPAIRHVSQLEMKREHIATVVKAL 60 Query: 55 WA-ALYRLVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFT 113 WA +R +D FSPLDFQ G +LV W PSA+GGEL+L+ W VP+ F Sbjct: 61 WASEAHRGLDEGQPDSSARQAFSPLDFQTHGVRLVRWLPSAKGGELSLEADWIQVPALFD 120 Query: 114 RLAERNVSVSRFSLSVEGDDLLFTLQLETPH 144 +A+R + V+ FS+ L +Q+E+ H Sbjct: 121 AIAQRGMGVTAFSVEPGESRLKLIVQVESLH 151 >UniRef50_C1MD68 Putative uncharacterized protein n=1 Tax=Citrobacter sp. 30_2 RepID=C1MD68_9ENTR Length = 151 Score = 153 bits (385), Expect = 2e-36, Method: Composition-based stats. Identities = 69/151 (45%), Positives = 93/151 (61%), Gaps = 5/151 (3%) Query: 1 MNMFFDWWFATSPRLRQLCWAFW---LLMLVTLIF--LSSTHHEERDALIRLRASHHQQW 55 M D W A SPR R +CW W LL+LVTL ++ +R+ L++ RA+ QW Sbjct: 1 MITLCDVWLAQSPRFRLICWCGWMLGLLILVTLCLYPVARERTMQRETLVQQRAAIQMQW 60 Query: 56 AALYRLVDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRL 115 +Y L + EEK FSPL FQ S +L+ W PSAQGGE+AL + W+A+P F +L Sbjct: 61 RDVYLLAASTAEPEEKLTLFSPLLFQTSLTRLIHWQPSAQGGEMALISAWDAIPQTFVQL 120 Query: 116 AERNVSVSRFSLSVEGDDLLFTLQLETPHEG 146 A++ +SVSRFSLS E +LL TLQLE ++G Sbjct: 121 ADQGMSVSRFSLSAEDTELLLTLQLERLNDG 151 >UniRef50_A7MGC2 Putative uncharacterized protein n=2 Tax=Cronobacter RepID=A7MGC2_ENTS8 Length = 158 Score = 136 bits (343), Expect = 2e-31, Method: Composition-based stats. Identities = 50/159 (31%), Positives = 64/159 (40%), Gaps = 16/159 (10%) Query: 1 MNMFFDWWFATSPRLRQLCWAFWLLMLVTLIFLSSTHHEERDALIRLRASHHQQWAALYR 60 M + D W + R C W+ +LV + T +L+A Q+ A L Sbjct: 1 MRLLIDRWLSGPSSRRAACAGVWIALLVAAGYAGLTPLTR--DFRQLQAQLEQRRAELRT 58 Query: 61 LVD-TAPFSEEKT-----------LPFSPLDFQLSG-AQLVSWHPSAQGGELALKTLWEA 107 L A S E+ PFSPL L+ W P GEL L W Sbjct: 59 LKQRQAAQSGERDAVAALAAQLAMKPFSPLAVDPGPEGHLIHWQPQGDTGELELALAWPH 118 Query: 108 VPSAFTRLAERNVSVSRFSLSVEGD-DLLFTLQLETPHE 145 VP F LA+ ++ F+LS EGD L TLQLE HE Sbjct: 119 VPDIFEALAQSDMLAHGFTLSREGDQRLRLTLQLERAHE 157 >UniRef50_A9MMD5 Putative uncharacterized protein n=1 Tax=Salmonella enterica subsp. arizonae serovar 62:z4,z23:-- RepID=A9MMD5_SALAR Length = 113 Score = 121 bits (302), Expect = 1e-26, Method: Composition-based stats. Identities = 58/109 (53%), Positives = 67/109 (61%), Gaps = 4/109 (3%) Query: 39 EERDALIRLRASHHQQWAALYRLVDTAPF----SEEKTLPFSPLDFQLSGAQLVSWHPSA 94 + L + R + QQW L RL A + E T PFSPLDFQ +L+ W PSA Sbjct: 1 MQHTRLSQQREAARQQWRNLRRLSVAAEPLFGRTTENTRPFSPLDFQTPHQRLLHWLPSA 60 Query: 95 QGGELALKTLWEAVPSAFTRLAERNVSVSRFSLSVEGDDLLFTLQLETP 143 QGGE+ALKT WE VPS F RLAE +SVSRFSL EG +LL TLQLE Sbjct: 61 QGGEMALKTSWEGVPSLFVRLAESEMSVSRFSLRREGAELLMTLQLERL 109 >UniRef50_UPI000182682E hypothetical protein EcanA3_15612 n=1 Tax=Enterobacter cancerogenus ATCC 35316 RepID=UPI000182682E Length = 124 Score = 111 bits (277), Expect = 7e-24, Method: Composition-based stats. Identities = 33/92 (35%), Positives = 50/92 (54%), Gaps = 1/92 (1%) Query: 51 HHQQWAALYRL-VDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLWEAVP 109 + WAA+ ++ +T PFSPLDFQ +G +LV W P + GGEL L W A+P Sbjct: 30 NASLWAAVRKIPTETEEPVAPAVRPFSPLDFQANGTRLVHWKPLSHGGELELDADWAAIP 89 Query: 110 SAFTRLAERNVSVSRFSLSVEGDDLLFTLQLE 141 F +L + NV + F ++ E L ++L+ Sbjct: 90 GIFAQLEQCNVRIGTFVITAESGRLRLRMELQ 121 >UniRef50_C4X1Q4 Putative uncharacterized protein n=5 Tax=Klebsiella RepID=C4X1Q4_KLEPN Length = 142 Score = 93.7 bits (231), Expect = 2e-18, Method: Composition-based stats. Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 3/85 (3%) Query: 49 ASHHQQWAALYRL---VDTAPFSEEKTLPFSPLDFQLSGAQLVSWHPSAQGGELALKTLW 105 A+ QW + L + P + T FSP+ LSGA+L++W P GGE+ L W Sbjct: 42 AALQTQWRRIMALRIPGEENPGARIDTQSFSPIAIPLSGAKLIAWRPQGSGGEMELTLSW 101 Query: 106 EAVPSAFTRLAERNVSVSRFSLSVE 130 VP+ F+ LA ++ FSL E Sbjct: 102 STVPALFSWLARCGMNPRSFSLGRE 126 >UniRef50_B2VJX1 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VJX1_ERWT9 Length = 166 Score = 73.7 bits (179), Expect = 2e-12, Method: Composition-based stats. Identities = 21/66 (31%), Positives = 28/66 (42%), Gaps = 2/66 (3%) Query: 77 PLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLAERNVSV--SRFSLSVEGDDL 134 P SGA+L WHP+ GG+LA+ W L+E FSL E L Sbjct: 97 PALLLTSGAELEYWHPTEHGGKLAVTLQWPQFAGLLNYLSELRPVPAFPAFSLKREEAKL 156 Query: 135 LFTLQL 140 ++L Sbjct: 157 RLMMEL 162 >UniRef50_D2T9A6 Conserved membrane protein hofO n=2 Tax=Erwinia pyrifoliae RepID=D2T9A6_ERWPY Length = 170 Score = 73.3 bits (178), Expect = 2e-12, Method: Composition-based stats. Identities = 22/66 (33%), Positives = 32/66 (48%), Gaps = 2/66 (3%) Query: 77 PLDFQLSGAQLVSWHPSAQGGELALKTLWEAVPSAFTRLA--ERNVSVSRFSLSVEGDDL 134 P+ SGA+L WHP+ +GGELA W L+ E ++ FSL+ E L Sbjct: 101 PVLLNTSGAELEHWHPATEGGELAATLQWAQFAGLLNYLSALEPAPALPAFSLTREQAKL 160 Query: 135 LFTLQL 140 ++L Sbjct: 161 RLVMEL 166 >UniRef50_C8QE34 Putative uncharacterized protein n=1 Tax=Pantoea sp. At-9b RepID=C8QE34_9ENTR Length = 164 Score = 42.1 bits (97), Expect = 0.006, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 40/112 (35%), Gaps = 6/112 (5%) Query: 40 ERDALIRLRASHHQQWAALYRLVDTAPFSEEKTLPFSPLDFQLS-----GAQLVSWHPSA 94 ++ L + R ++ L+ ++ T P S G QL +W P + Sbjct: 52 QQAQLEKQRQQQLALRPSIAGLLSEINALQQPTASPQPRLLLESLLAERGTQLDAWQPES 111 Query: 95 QGGELALKTLWEAVPSAFTRLAERN-VSVSRFSLSVEGDDLLFTLQLETPHE 145 Q + LK W+ F LA RF L E L+ L LE E Sbjct: 112 QPQHIVLKLNWQQFTPLFAELAHTALPVPERFQLQAEQGTLMTQLWLENRDE 163 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.306 0.125 0.356 Lambda K H 0.267 0.0380 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 780,594,308 Number of Sequences: 3077464 Number of extensions: 24112220 Number of successful extensions: 104057 Number of sequences better than 1.0e-01: 14 Number of HSP's better than 0.1 without gapping: 32 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 103994 Number of HSP's gapped (non-prelim): 38 length of query: 146 length of database: 1,040,396,356 effective HSP length: 109 effective length of query: 37 effective length of database: 704,952,780 effective search space: 26083252860 effective search space used: 26083252860 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.7 bits) S2: 87 (38.2 bits)