BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (236 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B7M9T8 UPF0257 lipoprotein ynfC n=108 Tax=Enterobacteri... 483 e-135 UniRef50_C4RZQ9 Putative uncharacterized protein n=4 Tax=Yersini... 132 6e-30 UniRef50_C8TA52 Putative uncharacterized protein n=1 Tax=Klebsie... 125 9e-28 UniRef50_C6CDD8 Lipoprotein YnfC n=3 Tax=Dickeya RepID=C6CDD8_DICDC 106 6e-22 UniRef50_B6XI45 Putative uncharacterized protein n=1 Tax=Provide... 59 1e-07 UniRef50_C6C587 Putative uncharacterized protein n=1 Tax=Dickeya... 55 1e-06 UniRef50_C4UQ43 Putative uncharacterized protein n=1 Tax=Yersini... 46 0.001 UniRef50_C9XVD0 Putative uncharacterized protein n=1 Tax=Cronoba... 45 0.002 >UniRef50_B7M9T8 UPF0257 lipoprotein ynfC n=108 Tax=Enterobacteriaceae RepID=YNFC_ECO45 Length = 236 Score = 483 bits (1242), Expect = e-135, Method: Compositional matrix adjust. Identities = 233/236 (98%), Positives = 235/236 (99%) Query: 1 MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 MKYKLLPCLLAI LTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV Sbjct: 1 MKYKLLPCLLAILLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 Query: 61 TKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPS 120 TKRVSGTLSEEGCFDSLELLDLENNT+VALVLDANYYRDAETLEKRVRLQGKCQLAELPS Sbjct: 61 TKRVSGTLSEEGCFDSLELLDLENNTLVALVLDANYYRDAETLEKRVRLQGKCQLAELPS 120 Query: 121 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD 180 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD Sbjct: 121 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD 180 Query: 181 YTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 YTAVTLLNNQRVGNVKQSCEYD+HANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY Sbjct: 181 YTAVTLLNNQRVGNVKQSCEYDNHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 >UniRef50_C4RZQ9 Putative uncharacterized protein n=4 Tax=Yersinia RepID=C4RZQ9_YERBE Length = 237 Score = 132 bits (333), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 77/234 (32%), Positives = 120/234 (51%), Gaps = 9/234 (3%) Query: 8 CLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGT 67 CLLA+ L+ CDR FT +AS+SN F FDP++GPVK TQ ++D +G+ V Sbjct: 8 CLLALALSACDRGHAPFIFTANVASYSNIFGFDPIQGPVKSLTQKMLDAKGDTYSDVHAE 67 Query: 68 LSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWET 127 ++E+GCF +L + + + + V + +Y D +T EK++ L KC + + S V Sbjct: 68 INEDGCFTALRIHTPDQDVDLDYVKEGSYLIDNKTKEKQLVLNDKCNITQTASGNVKVII 127 Query: 128 DDNGFV--IKAS-SKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAV 184 +D GFV +K S S + Y YD G+P+ + N K + +Y Sbjct: 128 NDKGFVTDVKMSESGATKKHYDYDASGFPVVDISYDNGKIFKIVTELDAKGQSPFNYKTK 187 Query: 185 TLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKP--AVERVYTIKNTIDYY 236 N++ V + K+ CE DSH NP C++ E ++P V VYT + +YY Sbjct: 188 IYENDKLVLSTKRVCETDSHGNPTSCKI----ESMEPEGKVIEVYTATYSTEYY 237 >UniRef50_C8TA52 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8TA52_KLEPR Length = 138 Score = 125 bits (315), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 62/98 (63%), Positives = 71/98 (72%), Gaps = 2/98 (2%) Query: 3 YKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTK 62 Y L+P A+ LT CDR +FTPEMASFSNEF+FDPLRGPVKDF+QTL+DE V K Sbjct: 15 YWLIPA--ALLLTACDRKSAPDAFTPEMASFSNEFEFDPLRGPVKDFSQTLLDEHDVVVK 72 Query: 63 RVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDA 100 +VS LS EGCFD L L D+EN T L+LDANYY D Sbjct: 73 KVSAQLSREGCFDLLTLEDVENKTGATLLLDANYYVDG 110 >UniRef50_C6CDD8 Lipoprotein YnfC n=3 Tax=Dickeya RepID=C6CDD8_DICDC Length = 242 Score = 106 bits (265), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 74/233 (31%), Positives = 111/233 (47%), Gaps = 9/233 (3%) Query: 5 LLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRV 64 L +LA+ GCD + +P MA FSN F FD LRG +K FTQT DEQG+V V Sbjct: 8 LTAAMLAV--AGCDDKQRLEPVSPMMAGFSNIFGFDALRGKIKRFTQTQTDEQGKVVAYV 65 Query: 65 SGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVS 124 + TL++ GC D+L+ N + LV + D + +L C L + + Sbjct: 66 AVTLNQNGCVDTLKSYQPTMNLDLDLVREGQLLVDRNDRKPAFQLGKDCLLEKSVDGRLI 125 Query: 125 WETDDNGFVIKASSKQMQ---MEYRYDDQGYP--LGKTTKSNDKTLSVSATPSTDPIKKL 179 + DD GF+ Y+YDD G+P + T+ + K VS P K+L Sbjct: 126 YRHDDKGFITDVFYNGHTTPFATYQYDDDGFPSDMTFTSPESGKVTLVSLRNDAAPGKRL 185 Query: 180 DYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNT 232 D T N ++ + +C YD NPV C++++ + K A + + T+ NT Sbjct: 186 DSTMSVTENGEQTSITRTTCRYDVRFNPVVCRVLMTNG--KGAGQTLSTLTNT 236 >UniRef50_B6XI45 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XI45_9ENTR Length = 239 Score = 58.9 bits (141), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 58/220 (26%), Positives = 100/220 (45%), Gaps = 17/220 (7%) Query: 6 LPCLLAIFLTGCDRTEV---TLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTK 62 L L+ +F++ +V L++ M ++S EF FDPL G VK+ TQT+ D++G + K Sbjct: 6 LHGLMFLFMSPLALAQVDNELLNYDKRMVNYSTEFQFDPLYGKVKELTQTMNDDKG-LRK 64 Query: 63 RVSGTLSEEGCFDSLELLDLENNTVVALVLDAN--YYRDAETLEKRVRLQGKCQL--AEL 118 +VS + + GC + L + ++ T +V + Y+ + K R+ C L EL Sbjct: 65 KVSVSFNPHGCLNELTYYNKDSETEFTIVRKSKQLYFLIDKQKLKGYRIDKMCNLQTGEL 124 Query: 119 PSAGVSWETDDNGFVIKASSKQMQME-YRYDDQGYPL-GKTTKSNDKTLSVSATPSTDP- 175 + + NG V K + + + Y + P+ N+ T + + TD Sbjct: 125 LNWKYHYR---NGLVNKITQGDKDIATFVYGRELLPIETHYFNENEVTNTKNEYFFTDGL 181 Query: 176 IKKLDYTAVTLLNNQRVGNVKQSC-EYDSHANPVDCQLII 214 + K+ + A N Q + Q C YD H NP C ++ Sbjct: 182 VTKILFHATK--NQQPYYEIVQKCTHYDDHKNPTSCTSVM 219 >UniRef50_C6C587 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C587_DICDC Length = 248 Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 51/220 (23%), Positives = 95/220 (43%), Gaps = 9/220 (4%) Query: 25 SFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLEN 84 +F P +++ S FDF+P RG ++ T + ++ G + + + +GC + ++ + Sbjct: 24 AFNPVISNMSVLFDFNPARGDIQMLTSRIYNDDGSLDYEMHLLMDRQGCVTAFGAKNVSD 83 Query: 85 NTVVALVLDANYYRDA-ETLEKRVRLQGKCQ-LAELPSAGVS-WETDDNGFVIKASSK-- 139 L + + E E + L KC+ + + + G + + +NG + SS Sbjct: 84 QKFTELFRKDHALKGKDERGEVTLTLDDKCRFITKTDATGTAKFTYGENGLIKSVSSAKT 143 Query: 140 -QMQMEYRYDDQGYPLGKTTKSNDKT-LSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQ 197 Q M Y+YD G G T N KT L K D ++T + V +K Sbjct: 144 GQTVMTYQYDTFGMLAGVETFMNGKTFLQNRLQCDVAGDKPFDCLSMTRVQGITVATIKM 203 Query: 198 SCEYDSHANPVDCQLI-IVDEGVKPAVERVYTIKNTIDYY 236 +C+YD + C ++ ++ G K + R + T+ YY Sbjct: 204 NCDYDDNGLAYQCNMLSVLGRGEKQKL-RKQRVSTTVTYY 242 >UniRef50_C4UQ43 Putative uncharacterized protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UQ43_YERRO Length = 236 Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust. Identities = 55/222 (24%), Positives = 91/222 (40%), Gaps = 18/222 (8%) Query: 1 MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 MK ++L LL FL F P + + + F+ D G VK Q + D G++ Sbjct: 1 MKARVLLSLL--FLAISFNVAALTQFKPAVLNAALLFEHDATTGNVKHSIQWIRDVHGKL 58 Query: 61 TKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGK----CQLA 116 + GCF ++ ++D N+ LV A T K +R+ GK C++ Sbjct: 59 QAMTEVRYDQSGCFTNINMVDKANDREFHLVNKDG----ALTSFKGLRITGKINELCEIT 114 Query: 117 ELPSAGVSWETDDN--GF---VIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATP 171 EL + + N G ++ + ++ Y Y + +P+ + D+ + P Sbjct: 115 ELENEKGKYVLSYNVRGLLETIVDKDTGEVVERYEYHNNQFPV-RVRNYKDQKDTRILYP 173 Query: 172 STDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLI 213 S + LD +V+ V VKQSC Y + N C LI Sbjct: 174 SGSA-QFLDLESVSKRGELTV-RVKQSCAYTADGNADKCSLI 213 >UniRef50_C9XVD0 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9XVD0_CROTZ Length = 239 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 53/219 (24%), Positives = 90/219 (41%), Gaps = 12/219 (5%) Query: 26 FTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENN 85 + P + +FS +DF+P+RG VK T T+ E + ++ TLS +GC + E Sbjct: 25 YIPSLYNFSVMYDFNPVRGHVKTLTTTVRSETENFS--ITLTLSPQGCIEQFERSGDRAY 82 Query: 86 TVVALVLDANYYRDA-ETLEKRVRLQGKCQLAELPSA-GV-SWETDDNGFVIKASSKQMQ 142 +AL N + +C L + GV +++ ++ G + + + + Sbjct: 83 GNIALKRQGNDLTGTFDNSPVSYVFDNQCNLVSMTDKYGVKTFKLNNAGLIEQTMANSEK 142 Query: 143 ME-YRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKK-LDYTAVTLLNNQRVGNVKQSCE 200 YRY G NDK + S +D K LD+ T+ + + +C Sbjct: 143 FSAYRYIAGDSFAGSEYYLNDKVATYSDVTYSDINNKPLDFKMKTVFGQEYTVYGESTCL 202 Query: 201 YDSHANPVDCQLI---IVDEGVKPAVERVYTIKNTIDYY 236 YD P +C I + D+ K A E Y K + +Y Sbjct: 203 YDDRKVPRECTAITKKVRDD--KVAQENHYFSKTAVSWY 239 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7M9T8 UPF0257 lipoprotein ynfC n=108 Tax=Enterobacteri... 305 6e-82 UniRef50_C4RZQ9 Putative uncharacterized protein n=4 Tax=Yersini... 241 2e-62 UniRef50_C6CDD8 Lipoprotein YnfC n=3 Tax=Dickeya RepID=C6CDD8_DICDC 224 2e-57 UniRef50_C6C587 Putative uncharacterized protein n=1 Tax=Dickeya... 211 1e-53 UniRef50_C4UQ43 Putative uncharacterized protein n=1 Tax=Yersini... 192 1e-47 UniRef50_C9XVD0 Putative uncharacterized protein n=1 Tax=Cronoba... 190 2e-47 UniRef50_B6XI45 Putative uncharacterized protein n=1 Tax=Provide... 175 9e-43 UniRef50_C8TA52 Putative uncharacterized protein n=1 Tax=Klebsie... 127 4e-28 Sequences not found previously or not previously below threshold: UniRef50_B2Q2N1 Putative uncharacterized protein n=5 Tax=Provide... 97 5e-19 UniRef50_B2PZM3 Putative uncharacterized protein n=1 Tax=Provide... 72 2e-11 UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora vir... 41 0.029 >UniRef50_B7M9T8 UPF0257 lipoprotein ynfC n=108 Tax=Enterobacteriaceae RepID=YNFC_ECO45 Length = 236 Score = 305 bits (782), Expect = 6e-82, Method: Composition-based stats. Identities = 233/236 (98%), Positives = 235/236 (99%) Query: 1 MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 MKYKLLPCLLAI LTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV Sbjct: 1 MKYKLLPCLLAILLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 Query: 61 TKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPS 120 TKRVSGTLSEEGCFDSLELLDLENNT+VALVLDANYYRDAETLEKRVRLQGKCQLAELPS Sbjct: 61 TKRVSGTLSEEGCFDSLELLDLENNTLVALVLDANYYRDAETLEKRVRLQGKCQLAELPS 120 Query: 121 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD 180 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD Sbjct: 121 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD 180 Query: 181 YTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 YTAVTLLNNQRVGNVKQSCEYD+HANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY Sbjct: 181 YTAVTLLNNQRVGNVKQSCEYDNHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 >UniRef50_C4RZQ9 Putative uncharacterized protein n=4 Tax=Yersinia RepID=C4RZQ9_YERBE Length = 237 Score = 241 bits (614), Expect = 2e-62, Method: Composition-based stats. Identities = 74/235 (31%), Positives = 117/235 (49%), Gaps = 5/235 (2%) Query: 5 LLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRV 64 L CLLA+ L+ CDR FT +AS+SN F FDP++GPVK TQ ++D +G+ V Sbjct: 5 LSICLLALALSACDRGHAPFIFTANVASYSNIFGFDPIQGPVKSLTQKMLDAKGDTYSDV 64 Query: 65 SGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVS 124 ++E+GCF +L + + + + V + +Y D +T EK++ L KC + + S V Sbjct: 65 HAEINEDGCFTALRIHTPDQDVDLDYVKEGSYLIDNKTKEKQLVLNDKCNITQTASGNVK 124 Query: 125 WETDDNGFVIKA---SSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDY 181 +D GFV S + Y YD G+P+ + N K + +Y Sbjct: 125 VIINDKGFVTDVKMSESGATKKHYDYDASGFPVVDISYDNGKIFKIVTELDAKGQSPFNY 184 Query: 182 TAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 N++ V + K+ CE DSH NP C++ ++ + V VYT + +YY Sbjct: 185 KTKIYENDKLVLSTKRVCETDSHGNPTSCKIESME--PEGKVIEVYTATYSTEYY 237 >UniRef50_C6CDD8 Lipoprotein YnfC n=3 Tax=Dickeya RepID=C6CDD8_DICDC Length = 242 Score = 224 bits (570), Expect = 2e-57, Method: Composition-based stats. Identities = 74/234 (31%), Positives = 111/234 (47%), Gaps = 9/234 (3%) Query: 5 LLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRV 64 L +LA+ GCD + +P MA FSN F FD LRG +K FTQT DEQG+V V Sbjct: 8 LTAAMLAV--AGCDDKQRLEPVSPMMAGFSNIFGFDALRGKIKRFTQTQTDEQGKVVAYV 65 Query: 65 SGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVS 124 + TL++ GC D+L+ N + LV + D + +L C L + + Sbjct: 66 AVTLNQNGCVDTLKSYQPTMNLDLDLVREGQLLVDRNDRKPAFQLGKDCLLEKSVDGRLI 125 Query: 125 WETDDNGFVIKASSKQMQ---MEYRYDDQGYP--LGKTTKSNDKTLSVSATPSTDPIKKL 179 + DD GF+ Y+YDD G+P + T+ + K VS P K+L Sbjct: 126 YRHDDKGFITDVFYNGHTTPFATYQYDDDGFPSDMTFTSPESGKVTLVSLRNDAAPGKRL 185 Query: 180 DYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTI 233 D T N ++ + +C YD NPV C++++ + K A + + T+ NT Sbjct: 186 DSTMSVTENGEQTSITRTTCRYDVRFNPVVCRVLMTNG--KGAGQTLSTLTNTT 237 >UniRef50_C6C587 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C587_DICDC Length = 248 Score = 211 bits (537), Expect = 1e-53, Method: Composition-based stats. Identities = 53/235 (22%), Positives = 98/235 (41%), Gaps = 11/235 (4%) Query: 10 LAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLS 69 LA ++ C +F P +++ S FDF+P RG ++ T + ++ G + + + Sbjct: 11 LAAIISPCAN--ALDAFNPVISNMSVLFDFNPARGDIQMLTSRIYNDDGSLDYEMHLLMD 68 Query: 70 EEGCFDSLELLDLENNTVVALVLDANYYRD-AETLEKRVRLQGKCQ-LAELPSAGV-SWE 126 +GC + ++ + L + + E E + L KC+ + + + G + Sbjct: 69 RQGCVTAFGAKNVSDQKFTELFRKDHALKGKDERGEVTLTLDDKCRFITKTDATGTAKFT 128 Query: 127 TDDNGFVIKASS---KQMQMEYRYDDQGYPLGKTTKSNDKTL-SVSATPSTDPIKKLDYT 182 +NG + SS Q M Y+YD G G T N KT K D Sbjct: 129 YGENGLIKSVSSAKTGQTVMTYQYDTFGMLAGVETFMNGKTFLQNRLQCDVAGDKPFDCL 188 Query: 183 AVTLLNNQRVGNVKQSCEYDSHANPVDCQLI-IVDEGVKPAVERVYTIKNTIDYY 236 ++T + V +K +C+YD + C ++ ++ G K + R + T+ YY Sbjct: 189 SMTRVQGITVATIKMNCDYDDNGLAYQCNMLSVLGRGEKQKL-RKQRVSTTVTYY 242 >UniRef50_C4UQ43 Putative uncharacterized protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UQ43_YERRO Length = 236 Score = 192 bits (487), Expect = 1e-47, Method: Composition-based stats. Identities = 55/245 (22%), Positives = 96/245 (39%), Gaps = 18/245 (7%) Query: 1 MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 MK ++L LL FL F P + + + F+ D G VK Q + D G++ Sbjct: 1 MKARVLLSLL--FLAISFNVAALTQFKPAVLNAALLFEHDATTGNVKHSIQWIRDVHGKL 58 Query: 61 TKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGK----CQLA 116 + GCF ++ ++D N+ LV T K +R+ GK C++ Sbjct: 59 QAMTEVRYDQSGCFTNINMVDKANDREFHLVNKDGAL----TSFKGLRITGKINELCEIT 114 Query: 117 ELPSAGVSWETDDN--GF---VIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATP 171 EL + + N G ++ + ++ Y Y + +P+ + D+ + P Sbjct: 115 ELENEKGKYVLSYNVRGLLETIVDKDTGEVVERYEYHNNQFPV-RVRNYKDQKDTRILYP 173 Query: 172 STDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKN 231 S + LD +V+ V VKQSC Y + N C LI ++ + Sbjct: 174 SGSA-QFLDLESVSKRGELTV-RVKQSCAYTADGNADKCSLIASTNDDYRGSILIWISNH 231 Query: 232 TIDYY 236 +Y+ Sbjct: 232 ETEYF 236 >UniRef50_C9XVD0 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9XVD0_CROTZ Length = 239 Score = 190 bits (483), Expect = 2e-47, Method: Composition-based stats. Identities = 50/224 (22%), Positives = 85/224 (37%), Gaps = 8/224 (3%) Query: 19 RTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLE 78 + P + +FS +DF+P+RG VK T T+ E + ++ TLS +GC + E Sbjct: 18 NALAAEKYIPSLYNFSVMYDFNPVRGHVKTLTTTVRSETENFS--ITLTLSPQGCIEQFE 75 Query: 79 LLDLENNTVVALVLDANYYRDA-ETLEKRVRLQGKCQLAELPSAGV--SWETDDNGFVIK 135 +AL N + +C L + +++ ++ G + + Sbjct: 76 RSGDRAYGNIALKRQGNDLTGTFDNSPVSYVFDNQCNLVSMTDKYGVKTFKLNNAGLIEQ 135 Query: 136 ASSKQMQME-YRYDDQGYPLGKTTKSNDKTLSVSATPSTDP-IKKLDYTAVTLLNNQRVG 193 + + YRY G NDK + S +D K LD+ T+ + Sbjct: 136 TMANSEKFSAYRYIAGDSFAGSEYYLNDKVATYSDVTYSDINNKPLDFKMKTVFGQEYTV 195 Query: 194 NVKQSCEYDSHANPVDCQLIIVD-EGVKPAVERVYTIKNTIDYY 236 + +C YD P +C I K A E Y K + +Y Sbjct: 196 YGESTCLYDDRKVPRECTAITKKVRDDKVAQENHYFSKTAVSWY 239 >UniRef50_B6XI45 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XI45_9ENTR Length = 239 Score = 175 bits (444), Expect = 9e-43, Method: Composition-based stats. Identities = 61/242 (25%), Positives = 105/242 (43%), Gaps = 20/242 (8%) Query: 6 LPCLLAIFLTGCDRTEV---TLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTK 62 L L+ +F++ +V L++ M ++S EF FDPL G VK+ TQT+ D++G + K Sbjct: 6 LHGLMFLFMSPLALAQVDNELLNYDKRMVNYSTEFQFDPLYGKVKELTQTMNDDKG-LRK 64 Query: 63 RVSGTLSEEGCFDSLELLDLENNTVVALVLDAN--YYRDAETLEKRVRLQGKCQL--AEL 118 +VS + + GC + L + ++ T +V + Y+ + K R+ C L EL Sbjct: 65 KVSVSFNPHGCLNELTYYNKDSETEFTIVRKSKQLYFLIDKQKLKGYRIDKMCNLQTGEL 124 Query: 119 PSAGVSWETDDNGFVIKASSKQMQM-EYRYDDQGYPL-GKTTKSNDKTLSVSATPSTDP- 175 + + NG V K + + + Y + P+ N+ T + + TD Sbjct: 125 LNWKYHYR---NGLVNKITQGDKDIATFVYGRELLPIETHYFNENEVTNTKNEYFFTDGL 181 Query: 176 IKKLDYTAVTLLNNQRVGNVKQSC-EYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTID 234 + K+ + A N Q + Q C YD H NP C ++ + +T Sbjct: 182 VTKILFHATK--NQQPYYEIVQKCTHYDDHKNPTSCTSVMTYSDGRV---DKFTYDYKTT 236 Query: 235 YY 236 YY Sbjct: 237 YY 238 >UniRef50_C8TA52 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8TA52_KLEPR Length = 138 Score = 127 bits (318), Expect = 4e-28, Method: Composition-based stats. Identities = 62/99 (62%), Positives = 71/99 (71%), Gaps = 2/99 (2%) Query: 3 YKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTK 62 Y L+P A+ LT CDR +FTPEMASFSNEF+FDPLRGPVKDF+QTL+DE V K Sbjct: 15 YWLIPA--ALLLTACDRKSAPDAFTPEMASFSNEFEFDPLRGPVKDFSQTLLDEHDVVVK 72 Query: 63 RVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAE 101 +VS LS EGCFD L L D+EN T L+LDANYY D Sbjct: 73 KVSAQLSREGCFDLLTLEDVENKTGATLLLDANYYVDGR 111 >UniRef50_B2Q2N1 Putative uncharacterized protein n=5 Tax=Providencia RepID=B2Q2N1_PROST Length = 252 Score = 96.8 bits (239), Expect = 5e-19, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 83/241 (34%), Gaps = 13/241 (5%) Query: 9 LLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTL 68 L+ F+ + P + + + +DF+P++G VK+ + ++ + + Sbjct: 12 LITFFIASESLAFQKQQYNPVVFNMAQLYDFNPVKGNVKEIRSVVYNKDKSINYESLLKI 71 Query: 69 SEEGCFDSLELLDLENN------TVVALVLDANYYRDAE-TLEKRVRLQGKCQLA--ELP 119 +GC DS L ++ + + N + + + C + + Sbjct: 72 GRDGCIDSFALNQKKDEYLSGVHNYLFVERVKNKLVGRDNNGPVEMEIGNNCTILSRKDN 131 Query: 120 SAGVSWETDDNGFVIKASSKQMQMEY---RYDDQGYPLGKTTKSNDKTLSVSATPST-DP 175 + + + + G +I + + ++ Y++ P ++K +S + D Sbjct: 132 NGKLIYRYNKEGIIIGSVLADNKTKFSENNYNEFKLPTTIKYYKDNKVISETIITYGKDI 191 Query: 176 IKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDY 235 K D + Q + V C+Y+S C + ++ + + + Sbjct: 192 TKPFDLLMKIRVLGQTILQVDSKCDYNSQNIAHKCHFELTINENGQDIKLIKDSTTEVSF 251 Query: 236 Y 236 Y Sbjct: 252 Y 252 >UniRef50_B2PZM3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PZM3_PROST Length = 246 Score = 72.2 bits (175), Expect = 2e-11, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 75/195 (38%), Gaps = 7/195 (3%) Query: 26 FTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENN 85 + P + + F ++GPVK+ T E VT+ + L ++ C +S E + Sbjct: 28 YLPAVQNIYVLFSGVLVKGPVKEIKATFYTEDKTVTQDIQLKLDQKSCIESFESISHGIQ 87 Query: 86 TVVALVLDANYYRD-AETLEKRVRLQGKCQ-LAELPSAGVSW-ETDDNGFVIKASS---K 139 + L D N + + E + L +C+ +++ S G ++ E D+ ++ S Sbjct: 88 QHIQLKRDNNQLKGSSNEGEIIIDLDEQCRFISQKDSYGTTYFEYDERNYIKSIHSKEDN 147 Query: 140 QMQMEYRYDDQGYPLGKTTKSNDKTLSVSAT-PSTDPIKKLDYTAVTLLNNQRVGNVKQS 198 + Y + G ++ + P D + D + ++ Sbjct: 148 GTVDKVVYSNLGELQEMNYYDKNQLYMQTLFKPLADINRYADIQIEQKMLGDLGYISERK 207 Query: 199 CEYDSHANPVDCQLI 213 C+Y+ P +CQ++ Sbjct: 208 CQYNHFGAPTNCQIL 222 >UniRef50_C7MXD3 Rhs family protein n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MXD3_SACVD Length = 1485 Score = 41.3 bits (95), Expect = 0.029, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 47/156 (30%), Gaps = 15/156 (9%) Query: 36 EFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELL-------DLENNTVV 88 F PV+ T + D G R GC ++ D + + + Sbjct: 454 ILHFAAGDNPVRRLTA-VTDRNGN---RFDLDRDAVGCVTAVRHSGGYHIEVDTDRDRIT 509 Query: 89 ALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVS---WETDDNG-FVIKASSKQMQME 144 L L A T R R LAE+ ++ + DD V Sbjct: 510 ELRLRHRAEATATTRLVRYRYDDAGLLAEVVNSSGRSLRFSYDDQSRLVRWTDRNGHWYH 569 Query: 145 YRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD 180 Y YDD G + + S + + S P + D Sbjct: 570 YFYDDAGRCVANHSSSKYLSGTFSYDPDNRITRFTD 605 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7M9T8 UPF0257 lipoprotein ynfC n=108 Tax=Enterobacteri... 284 1e-75 UniRef50_C4RZQ9 Putative uncharacterized protein n=4 Tax=Yersini... 237 2e-61 UniRef50_C6CDD8 Lipoprotein YnfC n=3 Tax=Dickeya RepID=C6CDD8_DICDC 220 3e-56 UniRef50_C6C587 Putative uncharacterized protein n=1 Tax=Dickeya... 215 8e-55 UniRef50_C4UQ43 Putative uncharacterized protein n=1 Tax=Yersini... 204 2e-51 UniRef50_B2Q2N1 Putative uncharacterized protein n=5 Tax=Provide... 198 9e-50 UniRef50_C9XVD0 Putative uncharacterized protein n=1 Tax=Cronoba... 194 2e-48 UniRef50_B6XI45 Putative uncharacterized protein n=1 Tax=Provide... 173 6e-42 UniRef50_B2PZM3 Putative uncharacterized protein n=1 Tax=Provide... 171 1e-41 UniRef50_C8TA52 Putative uncharacterized protein n=1 Tax=Klebsie... 122 1e-26 Sequences not found previously or not previously below threshold: UniRef50_C7N879 RHS repeat protein n=1 Tax=Slackia heliotrinired... 43 0.011 UniRef50_Q5ZWY6 2-oxoglutarate ferredoxin oxidoreductase beta su... 42 0.015 UniRef50_A7BPL6 Protein containing RHS repeats n=2 Tax=Beggiatoa... 41 0.027 UniRef50_B3PHE6 RHS Repeat family n=5 Tax=cellular organisms Rep... 41 0.035 CONVERGED! >UniRef50_B7M9T8 UPF0257 lipoprotein ynfC n=108 Tax=Enterobacteriaceae RepID=YNFC_ECO45 Length = 236 Score = 284 bits (727), Expect = 1e-75, Method: Composition-based stats. Identities = 233/236 (98%), Positives = 235/236 (99%) Query: 1 MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 MKYKLLPCLLAI LTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV Sbjct: 1 MKYKLLPCLLAILLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 Query: 61 TKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPS 120 TKRVSGTLSEEGCFDSLELLDLENNT+VALVLDANYYRDAETLEKRVRLQGKCQLAELPS Sbjct: 61 TKRVSGTLSEEGCFDSLELLDLENNTLVALVLDANYYRDAETLEKRVRLQGKCQLAELPS 120 Query: 121 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD 180 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD Sbjct: 121 AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLD 180 Query: 181 YTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 YTAVTLLNNQRVGNVKQSCEYD+HANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY Sbjct: 181 YTAVTLLNNQRVGNVKQSCEYDNHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 >UniRef50_C4RZQ9 Putative uncharacterized protein n=4 Tax=Yersinia RepID=C4RZQ9_YERBE Length = 237 Score = 237 bits (605), Expect = 2e-61, Method: Composition-based stats. Identities = 74/235 (31%), Positives = 116/235 (49%), Gaps = 5/235 (2%) Query: 5 LLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRV 64 L CLLA+ L+ CDR FT +AS+SN F FDP++GPVK TQ ++D +G+ V Sbjct: 5 LSICLLALALSACDRGHAPFIFTANVASYSNIFGFDPIQGPVKSLTQKMLDAKGDTYSDV 64 Query: 65 SGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVS 124 ++E+GCF +L + + + + V + +Y D +T EK++ L KC + + S V Sbjct: 65 HAEINEDGCFTALRIHTPDQDVDLDYVKEGSYLIDNKTKEKQLVLNDKCNITQTASGNVK 124 Query: 125 WETDDNGFVIKA---SSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDY 181 +D GFV S + Y YD G+P+ + N K + +Y Sbjct: 125 VIINDKGFVTDVKMSESGATKKHYDYDASGFPVVDISYDNGKIFKIVTELDAKGQSPFNY 184 Query: 182 TAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 N++ V + K+ CE DSH NP C++ + + V VYT + +YY Sbjct: 185 KTKIYENDKLVLSTKRVCETDSHGNPTSCKIESM--EPEGKVIEVYTATYSTEYY 237 >UniRef50_C6CDD8 Lipoprotein YnfC n=3 Tax=Dickeya RepID=C6CDD8_DICDC Length = 242 Score = 220 bits (560), Expect = 3e-56, Method: Composition-based stats. Identities = 71/236 (30%), Positives = 107/236 (45%), Gaps = 7/236 (2%) Query: 5 LLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRV 64 L +LA+ GCD + +P MA FSN F FD LRG +K FTQT DEQG+V V Sbjct: 8 LTAAMLAV--AGCDDKQRLEPVSPMMAGFSNIFGFDALRGKIKRFTQTQTDEQGKVVAYV 65 Query: 65 SGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVS 124 + TL++ GC D+L+ N + LV + D + +L C L + + Sbjct: 66 AVTLNQNGCVDTLKSYQPTMNLDLDLVREGQLLVDRNDRKPAFQLGKDCLLEKSVDGRLI 125 Query: 125 WETDDNGFVIKASSKQMQM---EYRYDDQGYP--LGKTTKSNDKTLSVSATPSTDPIKKL 179 + DD GF+ Y+YDD G+P + T+ + K VS P K+L Sbjct: 126 YRHDDKGFITDVFYNGHTTPFATYQYDDDGFPSDMTFTSPESGKVTLVSLRNDAAPGKRL 185 Query: 180 DYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDY 235 D T N ++ + +C YD NPV C++++ + T ++Y Sbjct: 186 DSTMSVTENGEQTSITRTTCRYDVRFNPVVCRVLMTNGKGAGQTLSTLTNTTKVEY 241 >UniRef50_C6C587 Putative uncharacterized protein n=1 Tax=Dickeya dadantii Ech703 RepID=C6C587_DICDC Length = 248 Score = 215 bits (548), Expect = 8e-55, Method: Composition-based stats. Identities = 52/234 (22%), Positives = 96/234 (41%), Gaps = 9/234 (3%) Query: 10 LAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLS 69 LA ++ C +F P +++ S FDF+P RG ++ T + ++ G + + + Sbjct: 11 LAAIISPCAN--ALDAFNPVISNMSVLFDFNPARGDIQMLTSRIYNDDGSLDYEMHLLMD 68 Query: 70 EEGCFDSLELLDLENNTVVALVLDANYYRD-AETLEKRVRLQGKCQ-LAELPSAGV-SWE 126 +GC + ++ + L + + E E + L KC+ + + + G + Sbjct: 69 RQGCVTAFGAKNVSDQKFTELFRKDHALKGKDERGEVTLTLDDKCRFITKTDATGTAKFT 128 Query: 127 TDDNGFVIKASS---KQMQMEYRYDDQGYPLGKTTKSNDKTLSVS-ATPSTDPIKKLDYT 182 +NG + SS Q M Y+YD G G T N KT + K D Sbjct: 129 YGENGLIKSVSSAKTGQTVMTYQYDTFGMLAGVETFMNGKTFLQNRLQCDVAGDKPFDCL 188 Query: 183 AVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY 236 ++T + V +K +C+YD + C ++ V + R + T+ YY Sbjct: 189 SMTRVQGITVATIKMNCDYDDNGLAYQCNMLSVLGRGEKQKLRKQRVSTTVTYY 242 >UniRef50_C4UQ43 Putative uncharacterized protein n=1 Tax=Yersinia rohdei ATCC 43380 RepID=C4UQ43_YERRO Length = 236 Score = 204 bits (519), Expect = 2e-51, Method: Composition-based stats. Identities = 50/241 (20%), Positives = 93/241 (38%), Gaps = 10/241 (4%) Query: 1 MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEV 60 MK ++L LL FL F P + + + F+ D G VK Q + D G++ Sbjct: 1 MKARVLLSLL--FLAISFNVAALTQFKPAVLNAALLFEHDATTGNVKHSIQWIRDVHGKL 58 Query: 61 TKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPS 120 + GCF ++ ++D N+ LV + L ++ C++ EL + Sbjct: 59 QAMTEVRYDQSGCFTNINMVDKANDREFHLVNKDGALTSFKGLRITGKINELCEITELEN 118 Query: 121 AGVSWETDDN--GF---VIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDP 175 + N G ++ + ++ Y Y + +P+ + D+ + PS Sbjct: 119 EKGKYVLSYNVRGLLETIVDKDTGEVVERYEYHNNQFPV-RVRNYKDQKDTRILYPSGSA 177 Query: 176 IKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDY 235 + LD +V+ + VKQSC Y + N C LI ++ + +Y Sbjct: 178 -QFLDLESVSKR-GELTVRVKQSCAYTADGNADKCSLIASTNDDYRGSILIWISNHETEY 235 Query: 236 Y 236 + Sbjct: 236 F 236 >UniRef50_B2Q2N1 Putative uncharacterized protein n=5 Tax=Providencia RepID=B2Q2N1_PROST Length = 252 Score = 198 bits (504), Expect = 9e-50, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 82/241 (34%), Gaps = 13/241 (5%) Query: 9 LLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTL 68 L+ F+ + P + + + +DF+P++G VK+ + ++ + + Sbjct: 12 LITFFIASESLAFQKQQYNPVVFNMAQLYDFNPVKGNVKEIRSVVYNKDKSINYESLLKI 71 Query: 69 SEEGCFDSLELLDLENN------TVVALVLDANYYRD-AETLEKRVRLQGKCQLA--ELP 119 +GC DS L ++ + + N + + C + + Sbjct: 72 GRDGCIDSFALNQKKDEYLSGVHNYLFVERVKNKLVGRDNNGPVEMEIGNNCTILSRKDN 131 Query: 120 SAGVSWETDDNGFVIKASSKQMQMEY---RYDDQGYPLGKTTKSNDKTLSVSATPST-DP 175 + + + + G +I + + ++ Y++ P ++K +S + D Sbjct: 132 NGKLIYRYNKEGIIIGSVLADNKTKFSENNYNEFKLPTTIKYYKDNKVISETIITYGKDI 191 Query: 176 IKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDY 235 K D + Q + V C+Y+S C + ++ + + + Sbjct: 192 TKPFDLLMKIRVLGQTILQVDSKCDYNSQNIAHKCHFELTINENGQDIKLIKDSTTEVSF 251 Query: 236 Y 236 Y Sbjct: 252 Y 252 >UniRef50_C9XVD0 Putative uncharacterized protein n=1 Tax=Cronobacter turicensis RepID=C9XVD0_CROTZ Length = 239 Score = 194 bits (493), Expect = 2e-48, Method: Composition-based stats. Identities = 50/224 (22%), Positives = 85/224 (37%), Gaps = 8/224 (3%) Query: 19 RTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLE 78 + P + +FS +DF+P+RG VK T T+ E + ++ TLS +GC + E Sbjct: 18 NALAAEKYIPSLYNFSVMYDFNPVRGHVKTLTTTVRSETENFS--ITLTLSPQGCIEQFE 75 Query: 79 LLDLENNTVVALVLDANYYRDA-ETLEKRVRLQGKCQLAELPSAGV--SWETDDNGFVIK 135 +AL N + +C L + +++ ++ G + + Sbjct: 76 RSGDRAYGNIALKRQGNDLTGTFDNSPVSYVFDNQCNLVSMTDKYGVKTFKLNNAGLIEQ 135 Query: 136 ASSKQMQME-YRYDDQGYPLGKTTKSNDKTLSVSATPSTDP-IKKLDYTAVTLLNNQRVG 193 + + YRY G NDK + S +D K LD+ T+ + Sbjct: 136 TMANSEKFSAYRYIAGDSFAGSEYYLNDKVATYSDVTYSDINNKPLDFKMKTVFGQEYTV 195 Query: 194 NVKQSCEYDSHANPVDCQLIIVD-EGVKPAVERVYTIKNTIDYY 236 + +C YD P +C I K A E Y K + +Y Sbjct: 196 YGESTCLYDDRKVPRECTAITKKVRDDKVAQENHYFSKTAVSWY 239 >UniRef50_B6XI45 Putative uncharacterized protein n=1 Tax=Providencia alcalifaciens DSM 30120 RepID=B6XI45_9ENTR Length = 239 Score = 173 bits (437), Expect = 6e-42, Method: Composition-based stats. Identities = 61/242 (25%), Positives = 105/242 (43%), Gaps = 20/242 (8%) Query: 6 LPCLLAIFLTGCDRTEV---TLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTK 62 L L+ +F++ +V L++ M ++S EF FDPL G VK+ TQT+ D++G + K Sbjct: 6 LHGLMFLFMSPLALAQVDNELLNYDKRMVNYSTEFQFDPLYGKVKELTQTMNDDKG-LRK 64 Query: 63 RVSGTLSEEGCFDSLELLDLENNTVVALVLDAN--YYRDAETLEKRVRLQGKCQL--AEL 118 +VS + + GC + L + ++ T +V + Y+ + K R+ C L EL Sbjct: 65 KVSVSFNPHGCLNELTYYNKDSETEFTIVRKSKQLYFLIDKQKLKGYRIDKMCNLQTGEL 124 Query: 119 PSAGVSWETDDNGFVIKASSKQMQM-EYRYDDQGYPL-GKTTKSNDKTLSVSATPSTDP- 175 + + NG V K + + + Y + P+ N+ T + + TD Sbjct: 125 LNWKYHYR---NGLVNKITQGDKDIATFVYGRELLPIETHYFNENEVTNTKNEYFFTDGL 181 Query: 176 IKKLDYTAVTLLNNQRVGNVKQSC-EYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTID 234 + K+ + A N Q + Q C YD H NP C ++ + +T Sbjct: 182 VTKILFHATK--NQQPYYEIVQKCTHYDDHKNPTSCTSVMTYSDGRV---DKFTYDYKTT 236 Query: 235 YY 236 YY Sbjct: 237 YY 238 >UniRef50_B2PZM3 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2PZM3_PROST Length = 246 Score = 171 bits (434), Expect = 1e-41, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 81/223 (36%), Gaps = 7/223 (3%) Query: 20 TEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLEL 79 + + P + + F ++GPVK+ T E VT+ + L ++ C +S E Sbjct: 22 ASIAKGYLPAVQNIYVLFSGVLVKGPVKEIKATFYTEDKTVTQDIQLKLDQKSCIESFES 81 Query: 80 LDLENNTVVALVLDANYYRD-AETLEKRVRLQGKCQ-LAELPSAGVSW-ETDDNGFVIKA 136 + + L D N + + E + L +C+ +++ S G ++ E D+ ++ Sbjct: 82 ISHGIQQHIQLKRDNNQLKGSSNEGEIIIDLDEQCRFISQKDSYGTTYFEYDERNYIKSI 141 Query: 137 SS---KQMQMEYRYDDQGYPLGKTTKSNDKTLSVSAT-PSTDPIKKLDYTAVTLLNNQRV 192 S + Y + G ++ + P D + D + Sbjct: 142 HSKEDNGTVDKVVYSNLGELQEMNYYDKNQLYMQTLFKPLADINRYADIQIEQKMLGDLG 201 Query: 193 GNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDY 235 ++ C+Y+ P +CQ++ + + ++ + Sbjct: 202 YISERKCQYNHFGAPTNCQILTKYDVDGEMQTQRQSVVIDTQF 244 >UniRef50_C8TA52 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8TA52_KLEPR Length = 138 Score = 122 bits (305), Expect = 1e-26, Method: Composition-based stats. Identities = 62/99 (62%), Positives = 71/99 (71%), Gaps = 2/99 (2%) Query: 3 YKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTK 62 Y L+P A+ LT CDR +FTPEMASFSNEF+FDPLRGPVKDF+QTL+DE V K Sbjct: 15 YWLIPA--ALLLTACDRKSAPDAFTPEMASFSNEFEFDPLRGPVKDFSQTLLDEHDVVVK 72 Query: 63 RVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAE 101 +VS LS EGCFD L L D+EN T L+LDANYY D Sbjct: 73 KVSAQLSREGCFDLLTLEDVENKTGATLLLDANYYVDGR 111 >UniRef50_C7N879 RHS repeat protein n=1 Tax=Slackia heliotrinireducens DSM 20476 RepID=C7N879_SLAHD Length = 516 Score = 42.5 bits (98), Expect = 0.011, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 56/186 (30%), Gaps = 23/186 (12%) Query: 48 DFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNT-------VVALVLDANYY--- 97 T+ + G T S +L E G + + N Sbjct: 229 HVTKVVSSSGGTSTYEYSLSLDEAGVATGYAESTVGVDDTSEQAVYEFNYDESGNIIGIW 288 Query: 98 -RDAETLEKRVRLQGKCQLAELPS--AGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPL 154 +AE+++ + S+ DD G + S + YD+ G Sbjct: 289 DVEAESMDVEFVYDDAGNVIRRTGVPTEYSYTYDDQGRLESFVSTGTVLTLAYDEAGRLA 348 Query: 155 GKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLII 214 T+ + + + + VT +++ ++ + +YD + C I+ Sbjct: 349 SYTSSFGGRAVEYTYAYDAEGR-------VTGVSDSSDRPMEYTVDYDDNGA---CSSIL 398 Query: 215 VDEGVK 220 ++ V Sbjct: 399 IEGAVG 404 >UniRef50_Q5ZWY6 2-oxoglutarate ferredoxin oxidoreductase beta subunit n=5 Tax=Legionella RepID=Q5ZWY6_LEGPH Length = 330 Score = 42.1 bits (97), Expect = 0.015, Method: Composition-based stats. Identities = 15/98 (15%), Positives = 32/98 (32%), Gaps = 5/98 (5%) Query: 71 EGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDN 130 G FD + + + L D EK ++L + + ++ D+ Sbjct: 215 HGAFDDFAVKNNRAENTIILE-DGQPLVFGAQKEKALQLDDEQFVQISIDEKTLYKHDEK 273 Query: 131 GFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVS 168 + S + D PLG + + + ++S Sbjct: 274 NLI----SAMRLARLTFPDYPVPLGVYYQKDRERFTLS 307 >UniRef50_A7BPL6 Protein containing RHS repeats n=2 Tax=Beggiatoa sp. PS RepID=A7BPL6_9GAMM Length = 2594 Score = 41.4 bits (95), Expect = 0.027, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 54/177 (30%), Gaps = 8/177 (4%) Query: 35 NEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDA 94 + R + TQ + +G T V G G ++ + + Sbjct: 2163 VLYSNQYTRDKLGRITQKVETLEGVNTTDVYG-YDPAGRLVTVTQNGVVTEQY-TYDANG 2220 Query: 95 NYYRDAET--LEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGY 152 N T E R + +L E G ++ NG ++ + EY YD G Sbjct: 2221 NRLTANTTTYGEVNGRYDEQDRLLEY--GGNTYTYTANGELLTKNESGAITEYEYDVLGN 2278 Query: 153 PLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVD 209 + + V + KK++ V Q N E DS+ N V Sbjct: 2279 LRSVQLPDSTQIEYVIDGRNRRIGKKINGQLVQGFLYQGALN--PIAELDSNGNVVS 2333 >UniRef50_B3PHE6 RHS Repeat family n=5 Tax=cellular organisms RepID=B3PHE6_CELJU Length = 3998 Score = 41.0 bits (94), Expect = 0.035, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 51/165 (30%), Gaps = 18/165 (10%) Query: 45 PVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLE 104 + T+T + + ++ T + + ++ + A + + Sbjct: 3236 NITHLTKTYAENGADTSRYTLVTENNS----NTSTTQVDGRAGIHSFTSAEERSNTTYSD 3291 Query: 105 KRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKT 164 + L + Q++ L +E D+ G + K + + Y YD G T+ KT Sbjct: 3292 PQTLLTQRTQVSGLHD--THYEYDNRGRLTKTRTNTRAIHYAYDAFGNLSEITSADGRKT 3349 Query: 165 LSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVD 209 D G+ Q+ YD++ N Sbjct: 3350 HYE-----------YDLLGRVTRITYPDGHSTQT-RYDANGNATK 3382 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.308 0.119 0.286 Lambda K H 0.267 0.0373 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 904,270,426 Number of Sequences: 3077464 Number of extensions: 27832032 Number of successful extensions: 89952 Number of sequences better than 1.0e-01: 40 Number of HSP's better than 0.1 without gapping: 26 Number of HSP's successfully gapped in prelim test: 40 Number of HSP's that attempted gapping in prelim test: 89769 Number of HSP's gapped (non-prelim): 184 length of query: 236 length of database: 1,040,396,356 effective HSP length: 125 effective length of query: 111 effective length of database: 655,713,356 effective search space: 72784182516 effective search space used: 72784182516 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 90 (39.4 bits)