BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (255 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q47138 Putative uncharacterized protein ydfE (Fragment)... 531 e-150 UniRef50_A7ZM35 Exonuclease family protein n=57 Tax=Enterobacter... 500 e-140 UniRef50_D0FSA4 Similarities with exodeoxyribonuclease VIII n=2 ... 124 2e-27 UniRef50_C4SVJ7 Transcription termination factor n=2 Tax=Enterob... 67 4e-10 UniRef50_C4SVT6 Putative uncharacterized protein n=1 Tax=Yersini... 64 4e-09 UniRef50_B4T220 Putative uncharacterized protein n=3 Tax=Bacteri... 58 3e-07 UniRef50_B6XKP2 Putative uncharacterized protein n=3 Tax=Enterob... 55 2e-06 UniRef50_C8T873 Putative uncharacterized protein n=1 Tax=Klebsie... 46 0.001 UniRef50_Q7N1J2 Similarities with exodeoxyribonuclease VIII n=1 ... 42 0.017 >UniRef50_Q47138 Putative uncharacterized protein ydfE (Fragment) n=58 Tax=Enterobacteriaceae RepID=YDFE_ECOLI Length = 255 Score = 531 bits (1368), Expect = e-150, Method: Compositional matrix adjust. Identities = 255/255 (100%), Positives = 255/255 (100%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV Sbjct: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK Sbjct: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI Sbjct: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 Query: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH Sbjct: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 Query: 241 REYKQTWKTLDDELA 255 REYKQTWKTLDDELA Sbjct: 241 REYKQTWKTLDDELA 255 >UniRef50_A7ZM35 Exonuclease family protein n=57 Tax=Enterobacteriaceae RepID=A7ZM35_ECO24 Length = 825 Score = 500 bits (1287), Expect = e-140, Method: Compositional matrix adjust. Identities = 249/257 (96%), Positives = 251/257 (97%), Gaps = 2/257 (0%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEA--GDERRARAKFHWQFLEHYPAAQDCAYKF 58 MSKVFICAAIPDELATREEGAVAVA A GDERRARAKFHWQFLEHYPAAQDCAYKF Sbjct: 1 MSKVFICAAIPDELATREEGAVAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKF 60 Query: 59 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVM 118 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVET+SDPMNVTFDKLAPEVQNAVM Sbjct: 61 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETKSDPMNVTFDKLAPEVQNAVM 120 Query: 119 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK 178 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK Sbjct: 121 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK 180 Query: 179 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAA 238 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARA+ANQQYTENSTGKISPVIAA Sbjct: 181 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARAKANQQYTENSTGKISPVIAA 240 Query: 239 IHREYKQTWKTLDDELA 255 IHREYKQTWKTLDDELA Sbjct: 241 IHREYKQTWKTLDDELA 257 >UniRef50_D0FSA4 Similarities with exodeoxyribonuclease VIII n=2 Tax=Erwinia pyrifoliae RepID=D0FSA4_ERWPY Length = 747 Score = 124 bits (312), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 85/255 (33%), Positives = 131/255 (51%), Gaps = 27/255 (10%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 MSK C P E A E GA+ +A EA +E++ARAK F+E +P A D ++ + Sbjct: 1 MSKFTGCW-FPHEQAIEENGALRIAHHFEAANEKQARAKATLLFMEEFPDADDSLFELRL 59 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 P +PRPA + WD ++ ++ W++E PV E P V F KL+ ++ AV++K Sbjct: 60 YPHAPDVPRPATE-WDETFLFDHEWNDELGH--PVLKEQ-PKPVEFHKLSQSLRVAVLLK 115 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 F T E + ++ +++D + F+ HI++A+ K P V +MYPE L AI ++++ C Sbjct: 116 FHTTEITHTQLSVAMA--IEQDGSGFERHIIDAISKTPAVISMYPERILDAIDYIRNNCA 173 Query: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 KWPEI+A + W K+ E RK +D +Y I Sbjct: 174 TTKKWPEIKAVLAGWVKQCEQTRKGGAGGEDSID--------EY------------EVIP 213 Query: 241 REYKQTWKTLDDELA 255 R Y+ T KTLD E+A Sbjct: 214 RPYEHTHKTLDIEIA 228 >UniRef50_C4SVJ7 Transcription termination factor n=2 Tax=Enterobacteriaceae RepID=C4SVJ7_YERFR Length = 732 Score = 67.4 bits (163), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 71/268 (26%), Positives = 112/268 (41%), Gaps = 37/268 (13%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 VFIC P + A + GA+ + A+EA +++ A A + E YPAA D +CED Sbjct: 3 VFICLFEPKKAALKN-GAIPLVIALEAINKKMASALAIGKLWEAYPAAGDNFADPKICED 61 Query: 64 KPGIPRPALDSWDAEYMQENRWDEESASFVPVETES--------------DPMNVTFDKL 109 G PRPA+ +D ++ Q+N +D + P S D V + KL Sbjct: 62 TVGQPRPAVGDFDEQFAQDNTFD--GKVWTPNTVASPEDEDDNDGDEQGDDSGLVNYAKL 119 Query: 110 APEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFD-GHIVEALMKMPEVNAMYPELK 168 +V+ ++ + +I + +L+ +D I+ AL +P V AMY E Sbjct: 120 GIDVKVGKILMYG-LRDIGTHELSLVYDLINDDEGDAGLRSIITALAGIPAVGAMYQESV 178 Query: 169 LHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENS 228 + + K ++PE+Q + W VD R T+ Sbjct: 179 KELVDAINVKFPKTPQFPEVQKFAQKW-----------------VDEPNKRDELTGTKKV 221 Query: 229 TGKISPVI-AAIHREYKQTWKTLDDELA 255 T +P A I R + T+KTLD E+A Sbjct: 222 TRINTPTPDAPIKRGFDHTYKTLDLEVA 249 >UniRef50_C4SVT6 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SVT6_YERFR Length = 701 Score = 64.3 bits (155), Expect = 4e-09, Method: Compositional matrix adjust. Identities = 59/248 (23%), Positives = 110/248 (44%), Gaps = 29/248 (11%) Query: 20 GAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPAL--DSWD- 76 GAVA+A ++A +++ A K +P + + VC D+ G PRP + DS D Sbjct: 4 GAVAMAITVDAKNQKMATMKSTMLLEGEFPGSSGNFFAPKVCADRVGSPRPPVHDDSDDN 63 Query: 77 ----AEYMQENRWDEESASF--VPVETESDPMNVT-FDKLAPEVQNAVMVKFDT-CENIT 128 E+M+ N+W++ES F +P++ + N+ L EV+ A ++ + E + Sbjct: 64 AIFSTEWMEHNQWNDESKEFEPIPIQNNDEIENIKDIFNLPIEVRVAYILLYGVEPETVD 123 Query: 129 VDMVISAQELLQEDMA-TFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPE 187 ++ +A +L+ +D + I++ L ++P++ M I ++ P WP+ Sbjct: 124 SHLLSNAYDLINDDESEPLYRAIIDGLPRLPQIKHMLITSLAQLIDDIQAHTPPFKSWPD 183 Query: 188 IQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTW 247 + W R ER+ ++AN+ + TG R YK + Sbjct: 184 VNKFAEKWIMSRPDEREH----------PTSQANEPTNSSPTGN-------HERGYKHDY 226 Query: 248 KTLDDELA 255 +LD E+A Sbjct: 227 ASLDVEVA 234 >UniRef50_B4T220 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B4T220_SALNS Length = 754 Score = 57.8 bits (138), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 76/275 (27%), Positives = 124/275 (45%), Gaps = 46/275 (16%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 ++ C P + A ++ GAVA+A +IEA +++ A + + EHYPA D +K + ED Sbjct: 3 IYNCLFEPKKSAIKD-GAVALAISIEAPNKKVAESIVIGKLWEHYPANGDNYFKPKIWED 61 Query: 64 KPGIPRPALDSWDAEYMQENRWDEES------ASFVPVETESDPMNVTFDKLAPEVQNAV 117 PRP + +D ++ Q N +D E + VP SD + + KL + A Sbjct: 62 SEDQPRPEVGKFDEQFAQANTFDGEKWIANKPDTSVPGLPSSDEI-IDLMKLPARERFAA 120 Query: 118 MVKFDTCENITVDMVISAQEL-----LQEDMATFD------GHIVEALMKMPEVNAMYPE 166 ++ F N +D V+ +Q L L+ + FD +I+ AL V M+ E Sbjct: 121 VLMF---SNSPIDGVLYSQVLDYLDNLENNTEPFDDDDRINSNILHALHNNEPVRHMHVE 177 Query: 167 LKLHAIGWVKHKC---IPGAKWPEIQAEMRIWKKRREGERKET--GKYTSVVDLARARAN 221 + I + K PG I ++ W + G+R+E K +S+ Sbjct: 178 GLNNLIQAILAKFEDQTPGK--AAISQFIKRWLE-NPGKREEMVPSKTSSL--------- 225 Query: 222 QQYTENSTGKI-SPVIAAIHREYKQTWKTLDDELA 255 N++ I SP +A + R YK T+ TLD E+A Sbjct: 226 -----NTSDNIESPKVAPL-RGYKHTYATLDQEIA 254 >UniRef50_B6XKP2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B6XKP2_9ENTR Length = 668 Score = 55.1 bits (131), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 38/121 (31%), Positives = 60/121 (49%), Gaps = 11/121 (9%) Query: 3 KVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDC--AYKFIV 60 K FIC P A R GAV + A+ A +E+ A A + E YP + D + ++ Sbjct: 2 KTFICVFEPTSEA-RTNGAVPLTIALNATNEKLASASAMIKLSEAYPESMDSFNTDEPLI 60 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 CED G PRPA+ +D + +N ++ S+ P+ + F+KLAP V+ A + Sbjct: 61 CEDTLGSPRPAIGEFDEAFASQNEFN--GTSWQPI------VYKDFNKLAPMVRIAATLL 112 Query: 121 F 121 + Sbjct: 113 Y 113 >UniRef50_C8T873 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T873_KLEPR Length = 727 Score = 45.8 bits (107), Expect = 0.001, Method: Compositional matrix adjust. Identities = 26/78 (33%), Positives = 44/78 (56%) Query: 11 PDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRP 70 P + A ++ GAV +A A+EA +++ A + + E YPA D +K + E G P P Sbjct: 10 PKKSAIKDCGAVQLAIAVEAPNKKVAESIMTGKLWESYPANGDNYFKPKLWEHIEGQPLP 69 Query: 71 ALDSWDAEYMQENRWDEE 88 A+ +D ++ Q+N +D E Sbjct: 70 AVGQFDEQFAQKNTFDGE 87 >UniRef50_Q7N1J2 Similarities with exodeoxyribonuclease VIII n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N1J2_PHOLL Length = 586 Score = 42.4 bits (98), Expect = 0.017, Method: Compositional matrix adjust. Identities = 32/108 (29%), Positives = 53/108 (49%), Gaps = 7/108 (6%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCA----YKFI 59 +IC P + A ++ GAV +A A+EA E+ A+A E Y A D + K I Sbjct: 3 TYICVFEPTKKA-QKNGAVPLAIALEANSEKMAKAMAIVYLGEEY--ADDMSNFNTSKPI 59 Query: 60 VCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFD 107 +C+D G PRP + +D ++ E ++ + +E + +P T D Sbjct: 60 ICKDIQGFPRPPIGKFDEKFATEYEFNGTTWQARTLEPKKNPSITTRD 107 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q47138 Putative uncharacterized protein ydfE (Fragment)... 364 2e-99 UniRef50_A7ZM35 Exonuclease family protein n=57 Tax=Enterobacter... 339 6e-92 UniRef50_D0FSA4 Similarities with exodeoxyribonuclease VIII n=2 ... 276 5e-73 UniRef50_C4SVJ7 Transcription termination factor n=2 Tax=Enterob... 266 4e-70 UniRef50_C4SVT6 Putative uncharacterized protein n=1 Tax=Yersini... 255 1e-66 UniRef50_B4T220 Putative uncharacterized protein n=3 Tax=Bacteri... 237 4e-61 UniRef50_C8T873 Putative uncharacterized protein n=1 Tax=Klebsie... 211 2e-53 UniRef50_B6XKP2 Putative uncharacterized protein n=3 Tax=Enterob... 148 2e-34 Sequences not found previously or not previously below threshold: UniRef50_B6I891 Putative uncharacterized protein n=2 Tax=Escheri... 163 7e-39 UniRef50_Q7N1J2 Similarities with exodeoxyribonuclease VIII n=1 ... 95 2e-18 UniRef50_A4WB56 Exonuclease, RNase T and DNA polymerase III n=27... 43 0.008 UniRef50_B4T1L2 Gifsy-1 prophage RecE n=21 Tax=Enterobacteriacea... 43 0.010 >UniRef50_Q47138 Putative uncharacterized protein ydfE (Fragment) n=58 Tax=Enterobacteriaceae RepID=YDFE_ECOLI Length = 255 Score = 364 bits (933), Expect = 2e-99, Method: Composition-based stats. Identities = 255/255 (100%), Positives = 255/255 (100%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV Sbjct: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK Sbjct: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI Sbjct: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 Query: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH Sbjct: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 Query: 241 REYKQTWKTLDDELA 255 REYKQTWKTLDDELA Sbjct: 241 REYKQTWKTLDDELA 255 >UniRef50_A7ZM35 Exonuclease family protein n=57 Tax=Enterobacteriaceae RepID=A7ZM35_ECO24 Length = 825 Score = 339 bits (868), Expect = 6e-92, Method: Composition-based stats. Identities = 249/257 (96%), Positives = 251/257 (97%), Gaps = 2/257 (0%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEA--GDERRARAKFHWQFLEHYPAAQDCAYKF 58 MSKVFICAAIPDELATREEGAVAVA A GDERRARAKFHWQFLEHYPAAQDCAYKF Sbjct: 1 MSKVFICAAIPDELATREEGAVAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKF 60 Query: 59 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVM 118 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVET+SDPMNVTFDKLAPEVQNAVM Sbjct: 61 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETKSDPMNVTFDKLAPEVQNAVM 120 Query: 119 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK 178 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK Sbjct: 121 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK 180 Query: 179 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAA 238 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARA+ANQQYTENSTGKISPVIAA Sbjct: 181 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARAKANQQYTENSTGKISPVIAA 240 Query: 239 IHREYKQTWKTLDDELA 255 IHREYKQTWKTLDDELA Sbjct: 241 IHREYKQTWKTLDDELA 257 >UniRef50_D0FSA4 Similarities with exodeoxyribonuclease VIII n=2 Tax=Erwinia pyrifoliae RepID=D0FSA4_ERWPY Length = 747 Score = 276 bits (706), Expect = 5e-73, Method: Composition-based stats. Identities = 84/255 (32%), Positives = 129/255 (50%), Gaps = 27/255 (10%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 MSK C P E A E GA+ +A EA +E++ARAK F+E +P A D ++ + Sbjct: 1 MSKFTGCWF-PHEQAIEENGALRIAHHFEAANEKQARAKATLLFMEEFPDADDSLFELRL 59 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 P +PRPA + WD ++ ++ W++E PV E P V F KL+ ++ AV++K Sbjct: 60 YPHAPDVPRPATE-WDETFLFDHEWNDELGH--PVLKEQ-PKPVEFHKLSQSLRVAVLLK 115 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 F T E + ++ +++D + F+ HI++A+ K P V +MYPE L AI ++++ C Sbjct: 116 FHTTEITHTQLSVAMA--IEQDGSGFERHIIDAISKTPAVISMYPERILDAIDYIRNNCA 173 Query: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 KWPEI+A + W K+ E RK +D I Sbjct: 174 TTKKWPEIKAVLAGWVKQCEQTRKGGAGGEDSIDE--------------------YEVIP 213 Query: 241 REYKQTWKTLDDELA 255 R Y+ T KTLD E+A Sbjct: 214 RPYEHTHKTLDIEIA 228 >UniRef50_C4SVJ7 Transcription termination factor n=2 Tax=Enterobacteriaceae RepID=C4SVJ7_YERFR Length = 732 Score = 266 bits (680), Expect = 4e-70, Method: Composition-based stats. Identities = 71/268 (26%), Positives = 112/268 (41%), Gaps = 37/268 (13%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 VFIC P + A + GA+ + A+EA +++ A A + E YPAA D +CED Sbjct: 3 VFICLFEPKKAALK-NGAIPLVIALEAINKKMASALAIGKLWEAYPAAGDNFADPKICED 61 Query: 64 KPGIPRPALDSWDAEYMQENRWDEESASFVPVETES--------------DPMNVTFDKL 109 G PRPA+ +D ++ Q+N +D + P S D V + KL Sbjct: 62 TVGQPRPAVGDFDEQFAQDNTFD--GKVWTPNTVASPEDEDDNDGDEQGDDSGLVNYAKL 119 Query: 110 APEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFD-GHIVEALMKMPEVNAMYPELK 168 +V+ ++ + +I + +L+ +D I+ AL +P V AMY E Sbjct: 120 GIDVKVGKILMYG-LRDIGTHELSLVYDLINDDEGDAGLRSIITALAGIPAVGAMYQESV 178 Query: 169 LHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENS 228 + + K ++PE+Q + W VD R T+ Sbjct: 179 KELVDAINVKFPKTPQFPEVQKFAQKW-----------------VDEPNKRDELTGTKKV 221 Query: 229 TGKISPVI-AAIHREYKQTWKTLDDELA 255 T +P A I R + T+KTLD E+A Sbjct: 222 TRINTPTPDAPIKRGFDHTYKTLDLEVA 249 >UniRef50_C4SVT6 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SVT6_YERFR Length = 701 Score = 255 bits (650), Expect = 1e-66, Method: Composition-based stats. Identities = 56/249 (22%), Positives = 108/249 (43%), Gaps = 29/249 (11%) Query: 19 EGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDS---- 74 GAVA+A ++A +++ A K +P + + VC D+ G PRP + Sbjct: 3 NGAVAMAITVDAKNQKMATMKSTMLLEGEFPGSSGNFFAPKVCADRVGSPRPPVHDDSDD 62 Query: 75 ---WDAEYMQENRWDEESASF--VPVETESDPMNV-TFDKLAPEVQNAVMVKFDT-CENI 127 + E+M+ N+W++ES F +P++ + N+ L EV+ A ++ + E + Sbjct: 63 NAIFSTEWMEHNQWNDESKEFEPIPIQNNDEIENIKDIFNLPIEVRVAYILLYGVEPETV 122 Query: 128 TVDMVISAQELLQEDMA-TFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWP 186 ++ +A +L+ +D + I++ L ++P++ M I ++ P WP Sbjct: 123 DSHLLSNAYDLINDDESEPLYRAIIDGLPRLPQIKHMLITSLAQLIDDIQAHTPPFKSWP 182 Query: 187 EIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQT 246 ++ W R ER+ ++AN+ + TG R YK Sbjct: 183 DVNKFAEKWIMSRPDEREH----------PTSQANEPTNSSPTGN-------HERGYKHD 225 Query: 247 WKTLDDELA 255 + +LD E+A Sbjct: 226 YASLDVEVA 234 >UniRef50_B4T220 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B4T220_SALNS Length = 754 Score = 237 bits (603), Expect = 4e-61, Method: Composition-based stats. Identities = 74/273 (27%), Positives = 123/273 (45%), Gaps = 42/273 (15%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 ++ C P + A ++ GAVA+A +IEA +++ A + + EHYPA D +K + ED Sbjct: 3 IYNCLFEPKKSAIKD-GAVALAISIEAPNKKVAESIVIGKLWEHYPANGDNYFKPKIWED 61 Query: 64 KPGIPRPALDSWDAEYMQENRWDEES------ASFVPVETESDPMNVTFDKLAPEVQNAV 117 PRP + +D ++ Q N +D E + VP SD + + KL + A Sbjct: 62 SEDQPRPEVGKFDEQFAQANTFDGEKWIANKPDTSVPGLPSSDEI-IDLMKLPARERFAA 120 Query: 118 MVKFDTCENITVDMVISAQEL-----LQEDMATFD------GHIVEALMKMPEVNAMYPE 166 ++ F N +D V+ +Q L L+ + FD +I+ AL V M+ E Sbjct: 121 VLMF---SNSPIDGVLYSQVLDYLDNLENNTEPFDDDDRINSNILHALHNNEPVRHMHVE 177 Query: 167 LKLHAIGWVKHKC---IPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQ 223 + I + K PG I ++ W + G+R+E ++ Sbjct: 178 GLNNLIQAILAKFEDQTPGK--AAISQFIKRWLE-NPGKREEMV------------PSKT 222 Query: 224 YTENSTGKI-SPVIAAIHREYKQTWKTLDDELA 255 + N++ I SP +A + R YK T+ TLD E+A Sbjct: 223 SSLNTSDNIESPKVAPL-RGYKHTYATLDQEIA 254 >UniRef50_C8T873 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T873_KLEPR Length = 727 Score = 211 bits (536), Expect = 2e-53, Method: Composition-based stats. Identities = 60/270 (22%), Positives = 107/270 (39%), Gaps = 38/270 (14%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 ++ P + A ++ GAV +A A+EA +++ A + + E YPA D +K + E Sbjct: 3 IYNGLFEPKKSAIKDCGAVQLAIAVEAPNKKVAESIMTGKLWESYPANGDNYFKPKLWEH 62 Query: 64 KPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDP-------MNVTFDKLAPEVQNA 116 G P PA+ +D ++ Q+N +D E +V D + ++P + A Sbjct: 63 IEGQPLPAVGQFDEQFAQKNTFDGEK--WVANSQNGDVTELPAGDEVIDLMTVSPRERFA 120 Query: 117 VMVKFDTCENIT---VDMVISAQELLQEDMA-----TFDGHIVEALMKMPEVNAMYPELK 168 ++ F E +V +L D + F+ +++ AL V M+ E Sbjct: 121 AVLLFSKLEINGQLYSQVVDYLDDLDNHDESIEEDDRFNFNVLCALHNNEPVKHMHVEGL 180 Query: 169 LHAIGWVKHKC---IPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYT 225 + I + +PG I ++ W + G+R+E Sbjct: 181 NNLIQGIFSHFENQMPGK--AAISQFVKRWLE-NPGKREEMVPGQ--------------N 223 Query: 226 ENSTGKISPVIAAIHREYKQTWKTLDDELA 255 + + IA R YK T+ TLD E+A Sbjct: 224 YAPSTDNNVKIAP-KRGYKHTYATLDQEIA 252 >UniRef50_B6I891 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B6I891_ECOSE Length = 738 Score = 163 bits (411), Expect = 7e-39, Method: Composition-based stats. Identities = 52/271 (19%), Positives = 103/271 (38%), Gaps = 43/271 (15%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 +F + A + A+A AI+A +++ A + E YP D +K + ED Sbjct: 3 IFNGLLEAKKGALKNGAIPALAIAIDAPNKKVAENIIIGKLWEAYPDHGDNYFKPKIWED 62 Query: 64 KPGIPRPALDSWDAEYMQENRWDEES-ASFVPVETESDPMNV----TFDKLAPEVQNAVM 118 PG PRP + +D + E+ +D E PV++ ++ KL + A + Sbjct: 63 APGQPRPGVGEFDETFATEHSFDGERWVVNTPVDSGCSSADISQVSDLMKLPARERFAAV 122 Query: 119 VKF-DTCENITVDMVISAQELLQ----------EDMATFDGHIVEALMKMPEVNAMYPEL 167 + F + +++ +E L+ +++ F+ +++A++ + M+ Sbjct: 123 LLFSHDANEVDSELLAQTREYLEMLDNSDTDSEDEVDAFNRIVLDAMVACKPIEYMHIAG 182 Query: 168 KLHAIGWVKHKCIPGAKWPE---IQAEMRIWKKRREGERKETGKYTSVVDLARARANQQY 224 + + + C + P I ++ W + G+R E L + Sbjct: 183 LNNLVHAILASCDTQEQNPTSWTISKFIKKWVE-NPGKRDEM--------LPEVKPETAS 233 Query: 225 TENSTGKISPVIAAIHREYKQTWKTLDDELA 255 R Y+QT TLD E+A Sbjct: 234 A---------------RPYQQTHATLDREIA 249 >UniRef50_B6XKP2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B6XKP2_9ENTR Length = 668 Score = 148 bits (373), Expect = 2e-34, Method: Composition-based stats. Identities = 45/175 (25%), Positives = 77/175 (44%), Gaps = 11/175 (6%) Query: 3 KVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCA--YKFIV 60 K FIC P A R GAV + A+ A +E+ A A + E YP + D + ++ Sbjct: 2 KTFICVFEPTSEA-RTNGAVPLTIALNATNEKLASASAMIKLSEAYPESMDSFNTDEPLI 60 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 CED G PRPA+ +D + +N ++ S+ P+ + F+KLAP V+ A + Sbjct: 61 CEDTLGSPRPAIGEFDEAFASQNEFN--GTSWQPI------VYKDFNKLAPMVRIAATLL 112 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWV 175 + + + + + + + +I + L K+ V M E + V Sbjct: 113 YGKTQFTDAEYSQAIKFVHEGSDQPKIRNIGKGLAKISAVCTMDAEQTIEIAQAV 167 >UniRef50_Q7N1J2 Similarities with exodeoxyribonuclease VIII n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N1J2_PHOLL Length = 586 Score = 95.0 bits (234), Expect = 2e-18, Method: Composition-based stats. Identities = 37/149 (24%), Positives = 65/149 (43%), Gaps = 9/149 (6%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCA--YKFIVC 61 +IC P + A + GAV +A A+EA E+ A+A E Y K I+C Sbjct: 3 TYICVFEPTKKAQK-NGAVPLAIALEANSEKMAKAMAIVYLGEEYADDMSNFNTSKPIIC 61 Query: 62 EDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPM------NVTFDKLAPEVQN 115 +D G PRP + +D ++ E ++ + +E + +P N T+ L E+ Sbjct: 62 KDIQGFPRPPIGKFDEKFATEYEFNGTTWQARTLEPKKNPSITTRDFNHTYATLDLEIAL 121 Query: 116 AVMVKFDTCENITVDMVISAQELLQEDMA 144 A++ C I + A++L+ + Sbjct: 122 ALLDGDFHCWAIMSHSMKEAKQLITNNNE 150 >UniRef50_A4WB56 Exonuclease, RNase T and DNA polymerase III n=27 Tax=root RepID=A4WB56_ENT38 Length = 834 Score = 43.4 bits (100), Expect = 0.008, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 46/150 (30%), Gaps = 2/150 (1%) Query: 108 KLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPEL 167 +L E + F + +T + A + + A + +I+ A + ++ ++PE Sbjct: 267 QLTFEKRVFGTWLFGLFDELTPEQSSEASRIELDMDAHYAQNIMLASRNVRQLAHVFPET 326 Query: 168 KLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTEN 227 K K P + + + + + V + Sbjct: 327 IADLFSATKSVWPVNGKTPLLNDVITFFSEWIDTHSSGKASRDDVTAKWHKKFKSSMVRT 386 Query: 228 STGKISPVIAAIHRE--YKQTWKTLDDELA 255 +G + R T TLD E+A Sbjct: 387 DSGTNADGGEVTDRGEGAHHTLDTLDLEIA 416 >UniRef50_B4T1L2 Gifsy-1 prophage RecE n=21 Tax=Enterobacteriaceae RepID=B4T1L2_SALNS Length = 975 Score = 43.0 bits (99), Expect = 0.010, Method: Composition-based stats. Identities = 42/253 (16%), Positives = 80/253 (31%), Gaps = 27/253 (10%) Query: 23 AVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPA----------- 71 AVAT +E + F ++Y A+D PG+ P+ Sbjct: 54 AVATNFPVVNELPPEGELSLTFCDYYQLAKDNM----TWTQIPGVTLPSSEAAAAARQHI 109 Query: 72 ---LDSWDAEYMQENRWDEESASFVPVE-TESDPMNVTFDKLAPEVQNAVMVKFDTCENI 127 +D+ E ++++ + + S P + T P + + + Sbjct: 110 VDGVDTETGEVLEDHTENFGNESNSPAQATAPAPELTVVATMPLRHRVLAQYIGEGEYLY 169 Query: 128 TVDMVISAQEL-LQEDMATFD-GHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKW 185 VD + L L+ D +++ A + + VK K Sbjct: 170 HVDASQKKEILRLEMDTDNSYVQNLLLAAENVEAFKKAIEHDIHKIVNAVKKIFPVDGKT 229 Query: 186 PEIQ---AEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHRE 242 PE+ ++ W + +R K + + A + N+ G + Sbjct: 230 PELATVIQFLKTWFETEHIDRGLLVKEWAKGNRVSAIQRTESGANAGGGNKTDRNP---D 286 Query: 243 YKQTWKTLDDELA 255 Y+ T TLD E+A Sbjct: 287 YEYTLDTLDVEIA 299 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q47138 Putative uncharacterized protein ydfE (Fragment)... 344 1e-93 UniRef50_A7ZM35 Exonuclease family protein n=57 Tax=Enterobacter... 319 4e-86 UniRef50_C8T873 Putative uncharacterized protein n=1 Tax=Klebsie... 277 2e-73 UniRef50_D0FSA4 Similarities with exodeoxyribonuclease VIII n=2 ... 268 1e-70 UniRef50_C4SVJ7 Transcription termination factor n=2 Tax=Enterob... 268 1e-70 UniRef50_B4T220 Putative uncharacterized protein n=3 Tax=Bacteri... 267 3e-70 UniRef50_B6I891 Putative uncharacterized protein n=2 Tax=Escheri... 254 2e-66 UniRef50_C4SVT6 Putative uncharacterized protein n=1 Tax=Yersini... 252 7e-66 UniRef50_B6XKP2 Putative uncharacterized protein n=3 Tax=Enterob... 193 3e-48 UniRef50_Q7N1J2 Similarities with exodeoxyribonuclease VIII n=1 ... 170 5e-41 Sequences not found previously or not previously below threshold: UniRef50_B4T1L2 Gifsy-1 prophage RecE n=21 Tax=Enterobacteriacea... 46 9e-04 >UniRef50_Q47138 Putative uncharacterized protein ydfE (Fragment) n=58 Tax=Enterobacteriaceae RepID=YDFE_ECOLI Length = 255 Score = 344 bits (883), Expect = 1e-93, Method: Composition-based stats. Identities = 255/255 (100%), Positives = 255/255 (100%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV Sbjct: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK Sbjct: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI Sbjct: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 Query: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH Sbjct: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 Query: 241 REYKQTWKTLDDELA 255 REYKQTWKTLDDELA Sbjct: 241 REYKQTWKTLDDELA 255 >UniRef50_A7ZM35 Exonuclease family protein n=57 Tax=Enterobacteriaceae RepID=A7ZM35_ECO24 Length = 825 Score = 319 bits (818), Expect = 4e-86, Method: Composition-based stats. Identities = 249/257 (96%), Positives = 251/257 (97%), Gaps = 2/257 (0%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEA--GDERRARAKFHWQFLEHYPAAQDCAYKF 58 MSKVFICAAIPDELATREEGAVAVA A GDERRARAKFHWQFLEHYPAAQDCAYKF Sbjct: 1 MSKVFICAAIPDELATREEGAVAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKF 60 Query: 59 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVM 118 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVET+SDPMNVTFDKLAPEVQNAVM Sbjct: 61 IVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETKSDPMNVTFDKLAPEVQNAVM 120 Query: 119 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK 178 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK Sbjct: 121 VKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHK 180 Query: 179 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAA 238 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARA+ANQQYTENSTGKISPVIAA Sbjct: 181 CIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARAKANQQYTENSTGKISPVIAA 240 Query: 239 IHREYKQTWKTLDDELA 255 IHREYKQTWKTLDDELA Sbjct: 241 IHREYKQTWKTLDDELA 257 >UniRef50_C8T873 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8T873_KLEPR Length = 727 Score = 277 bits (709), Expect = 2e-73, Method: Composition-based stats. Identities = 60/270 (22%), Positives = 107/270 (39%), Gaps = 38/270 (14%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 ++ P + A ++ GAV +A A+EA +++ A + + E YPA D +K + E Sbjct: 3 IYNGLFEPKKSAIKDCGAVQLAIAVEAPNKKVAESIMTGKLWESYPANGDNYFKPKLWEH 62 Query: 64 KPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDP-------MNVTFDKLAPEVQNA 116 G P PA+ +D ++ Q+N +D E +V D + ++P + A Sbjct: 63 IEGQPLPAVGQFDEQFAQKNTFDGEK--WVANSQNGDVTELPAGDEVIDLMTVSPRERFA 120 Query: 117 VMVKFDTCENIT---VDMVISAQELLQEDMA-----TFDGHIVEALMKMPEVNAMYPELK 168 ++ F E +V +L D + F+ +++ AL V M+ E Sbjct: 121 AVLLFSKLEINGQLYSQVVDYLDDLDNHDESIEEDDRFNFNVLCALHNNEPVKHMHVEGL 180 Query: 169 LHAIGWVKHKC---IPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYT 225 + I + +PG I ++ W + G+R+E Sbjct: 181 NNLIQGIFSHFENQMPGK--AAISQFVKRWLE-NPGKREEMVPGQ--------------N 223 Query: 226 ENSTGKISPVIAAIHREYKQTWKTLDDELA 255 + + IA R YK T+ TLD E+A Sbjct: 224 YAPSTDNNVKIAP-KRGYKHTYATLDQEIA 252 >UniRef50_D0FSA4 Similarities with exodeoxyribonuclease VIII n=2 Tax=Erwinia pyrifoliae RepID=D0FSA4_ERWPY Length = 747 Score = 268 bits (685), Expect = 1e-70, Method: Composition-based stats. Identities = 84/255 (32%), Positives = 128/255 (50%), Gaps = 27/255 (10%) Query: 1 MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIV 60 MSK C P E A E GA+ +A EA +E++ARAK F+E +P A D ++ + Sbjct: 1 MSKFTGCWF-PHEQAIEENGALRIAHHFEAANEKQARAKATLLFMEEFPDADDSLFELRL 59 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 P +PRPA WD ++ ++ W++E PV E P V F KL+ ++ AV++K Sbjct: 60 YPHAPDVPRPAT-EWDETFLFDHEWNDELGH--PVLKEQ-PKPVEFHKLSQSLRVAVLLK 115 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 F T E + ++ +++D + F+ HI++A+ K P V +MYPE L AI ++++ C Sbjct: 116 FHTTEITHTQLSVAMA--IEQDGSGFERHIIDAISKTPAVISMYPERILDAIDYIRNNCA 173 Query: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 KWPEI+A + W K+ E RK +D I Sbjct: 174 TTKKWPEIKAVLAGWVKQCEQTRKGGAGGEDSIDE--------------------YEVIP 213 Query: 241 REYKQTWKTLDDELA 255 R Y+ T KTLD E+A Sbjct: 214 RPYEHTHKTLDIEIA 228 >UniRef50_C4SVJ7 Transcription termination factor n=2 Tax=Enterobacteriaceae RepID=C4SVJ7_YERFR Length = 732 Score = 268 bits (685), Expect = 1e-70, Method: Composition-based stats. Identities = 70/268 (26%), Positives = 111/268 (41%), Gaps = 37/268 (13%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 VFIC P + A + GA+ + A+EA +++ A A + E YPAA D +CED Sbjct: 3 VFICLFEPKKAALK-NGAIPLVIALEAINKKMASALAIGKLWEAYPAAGDNFADPKICED 61 Query: 64 KPGIPRPALDSWDAEYMQENRWDEESASFVPVE--------------TESDPMNVTFDKL 109 G PRPA+ +D ++ Q+N +D + P D V + KL Sbjct: 62 TVGQPRPAVGDFDEQFAQDNTFDG--KVWTPNTVASPEDEDDNDGDEQGDDSGLVNYAKL 119 Query: 110 APEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFD-GHIVEALMKMPEVNAMYPELK 168 +V+ ++ + +I + +L+ +D I+ AL +P V AMY E Sbjct: 120 GIDVKVGKILMYG-LRDIGTHELSLVYDLINDDEGDAGLRSIITALAGIPAVGAMYQESV 178 Query: 169 LHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENS 228 + + K ++PE+Q + W VD R T+ Sbjct: 179 KELVDAINVKFPKTPQFPEVQKFAQKW-----------------VDEPNKRDELTGTKKV 221 Query: 229 TGKISPVI-AAIHREYKQTWKTLDDELA 255 T +P A I R + T+KTLD E+A Sbjct: 222 TRINTPTPDAPIKRGFDHTYKTLDLEVA 249 >UniRef50_B4T220 Putative uncharacterized protein n=3 Tax=Bacteria RepID=B4T220_SALNS Length = 754 Score = 267 bits (681), Expect = 3e-70, Method: Composition-based stats. Identities = 72/271 (26%), Positives = 121/271 (44%), Gaps = 38/271 (14%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 ++ C P + A ++ GAVA+A +IEA +++ A + + EHYPA D +K + ED Sbjct: 3 IYNCLFEPKKSAIKD-GAVALAISIEAPNKKVAESIVIGKLWEHYPANGDNYFKPKIWED 61 Query: 64 KPGIPRPALDSWDAEYMQENRWDEESA------SFVPVETESDPMNVTFDKLAPEVQNAV 117 PRP + +D ++ Q N +D E + VP SD + + KL + A Sbjct: 62 SEDQPRPEVGKFDEQFAQANTFDGEKWIANKPDTSVPGLPSSDEI-IDLMKLPARERFAA 120 Query: 118 MVKFDTCENITVDMVISAQEL-----LQEDMATFD------GHIVEALMKMPEVNAMYPE 166 ++ F N +D V+ +Q L L+ + FD +I+ AL V M+ E Sbjct: 121 VLMF---SNSPIDGVLYSQVLDYLDNLENNTEPFDDDDRINSNILHALHNNEPVRHMHVE 177 Query: 167 LKLHAIGWVKHKC-IPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYT 225 + I + K I ++ W + G+R+E ++ + Sbjct: 178 GLNNLIQAILAKFEDQTPGKAAISQFIKRWLE-NPGKREEMV------------PSKTSS 224 Query: 226 ENSTGKI-SPVIAAIHREYKQTWKTLDDELA 255 N++ I SP +A + R YK T+ TLD E+A Sbjct: 225 LNTSDNIESPKVAPL-RGYKHTYATLDQEIA 254 >UniRef50_B6I891 Putative uncharacterized protein n=2 Tax=Escherichia coli RepID=B6I891_ECOSE Length = 738 Score = 254 bits (648), Expect = 2e-66, Method: Composition-based stats. Identities = 52/271 (19%), Positives = 103/271 (38%), Gaps = 43/271 (15%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCED 63 +F + A + A+A AI+A +++ A + E YP D +K + ED Sbjct: 3 IFNGLLEAKKGALKNGAIPALAIAIDAPNKKVAENIIIGKLWEAYPDHGDNYFKPKIWED 62 Query: 64 KPGIPRPALDSWDAEYMQENRWDEESA-SFVPVETESDPMNV----TFDKLAPEVQNAVM 118 PG PRP + +D + E+ +D E PV++ ++ KL + A + Sbjct: 63 APGQPRPGVGEFDETFATEHSFDGERWVVNTPVDSGCSSADISQVSDLMKLPARERFAAV 122 Query: 119 VKF-DTCENITVDMVISAQELL----------QEDMATFDGHIVEALMKMPEVNAMYPEL 167 + F + +++ +E L ++++ F+ +++A++ + M+ Sbjct: 123 LLFSHDANEVDSELLAQTREYLEMLDNSDTDSEDEVDAFNRIVLDAMVACKPIEYMHIAG 182 Query: 168 KLHAIGWVKHKCIPGAKWP---EIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQY 224 + + + C + P I ++ W + G+R E L + Sbjct: 183 LNNLVHAILASCDTQEQNPTSWTISKFIKKWVE-NPGKRDEM--------LPEVKPETAS 233 Query: 225 TENSTGKISPVIAAIHREYKQTWKTLDDELA 255 R Y+QT TLD E+A Sbjct: 234 A---------------RPYQQTHATLDREIA 249 >UniRef50_C4SVT6 Putative uncharacterized protein n=1 Tax=Yersinia frederiksenii ATCC 33641 RepID=C4SVT6_YERFR Length = 701 Score = 252 bits (644), Expect = 7e-66, Method: Composition-based stats. Identities = 56/249 (22%), Positives = 108/249 (43%), Gaps = 29/249 (11%) Query: 19 EGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDS---- 74 GAVA+A ++A +++ A K +P + + VC D+ G PRP + Sbjct: 3 NGAVAMAITVDAKNQKMATMKSTMLLEGEFPGSSGNFFAPKVCADRVGSPRPPVHDDSDD 62 Query: 75 ---WDAEYMQENRWDEESASF--VPVETESDPMNV-TFDKLAPEVQNAVMVKFDT-CENI 127 + E+M+ N+W++ES F +P++ + N+ L EV+ A ++ + E + Sbjct: 63 NAIFSTEWMEHNQWNDESKEFEPIPIQNNDEIENIKDIFNLPIEVRVAYILLYGVEPETV 122 Query: 128 TVDMVISAQELLQEDMA-TFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWP 186 ++ +A +L+ +D + I++ L ++P++ M I ++ P WP Sbjct: 123 DSHLLSNAYDLINDDESEPLYRAIIDGLPRLPQIKHMLITSLAQLIDDIQAHTPPFKSWP 182 Query: 187 EIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQT 246 ++ W R ER+ ++AN+ + TG R YK Sbjct: 183 DVNKFAEKWIMSRPDEREH----------PTSQANEPTNSSPTGN-------HERGYKHD 225 Query: 247 WKTLDDELA 255 + +LD E+A Sbjct: 226 YASLDVEVA 234 >UniRef50_B6XKP2 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=B6XKP2_9ENTR Length = 668 Score = 193 bits (491), Expect = 3e-48, Method: Composition-based stats. Identities = 54/255 (21%), Positives = 95/255 (37%), Gaps = 37/255 (14%) Query: 3 KVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCA--YKFIV 60 K FIC P A R GAV + A+ A +E+ A A + E YP + D + ++ Sbjct: 2 KTFICVFEPTSEA-RTNGAVPLTIALNATNEKLASASAMIKLSEAYPESMDSFNTDEPLI 60 Query: 61 CEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVK 120 CED G PRPA+ +D + +N ++ S + P+ + F+KLAP V+ A + Sbjct: 61 CEDTLGSPRPAIGEFDEAFASQNEFNGTS--WQPI------VYKDFNKLAPMVRIAATLL 112 Query: 121 FDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCI 180 + + + + + + + +I + L K+ V M E + V Sbjct: 113 YGKTQFTDAEYSQAIKFVHEGSDQPKIRNIGKGLAKISAVCTMDAEQTIEIAQAVSEFAD 172 Query: 181 PGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIH 240 + A + + K + S S + Sbjct: 173 DEITIEDATALGKSYLKEEP------------------KPEVISATKSENDESIL----- 209 Query: 241 REYKQTWKTLDDELA 255 K+ + T+D E+A Sbjct: 210 ---KRDYATIDMEVA 221 >UniRef50_Q7N1J2 Similarities with exodeoxyribonuclease VIII n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7N1J2_PHOLL Length = 586 Score = 170 bits (429), Expect = 5e-41, Method: Composition-based stats. Identities = 37/150 (24%), Positives = 65/150 (43%), Gaps = 9/150 (6%) Query: 4 VFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCA--YKFIVC 61 +IC P + A + GAV +A A+EA E+ A+A E Y K I+C Sbjct: 3 TYICVFEPTKKAQK-NGAVPLAIALEANSEKMAKAMAIVYLGEEYADDMSNFNTSKPIIC 61 Query: 62 EDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPM------NVTFDKLAPEVQN 115 +D G PRP + +D ++ E ++ + +E + +P N T+ L E+ Sbjct: 62 KDIQGFPRPPIGKFDEKFATEYEFNGTTWQARTLEPKKNPSITTRDFNHTYATLDLEIAL 121 Query: 116 AVMVKFDTCENITVDMVISAQELLQEDMAT 145 A++ C I + A++L+ + Sbjct: 122 ALLDGDFHCWAIMSHSMKEAKQLITNNNEA 151 >UniRef50_B4T1L2 Gifsy-1 prophage RecE n=21 Tax=Enterobacteriaceae RepID=B4T1L2_SALNS Length = 975 Score = 46.4 bits (108), Expect = 9e-04, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 80/253 (31%), Gaps = 27/253 (10%) Query: 23 AVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRP------------ 70 AVAT +E + F ++Y A+D + PG+ P Sbjct: 54 AVATNFPVVNELPPEGELSLTFCDYYQLAKDN----MTWTQIPGVTLPSSEAAAAARQHI 109 Query: 71 --ALDSWDAEYMQENRWDEESASFVPVE-TESDPMNVTFDKLAPEVQN-AVMVKFDT-CE 125 +D+ E ++++ + + S P + T P + + A + Sbjct: 110 VDGVDTETGEVLEDHTENFGNESNSPAQATAPAPELTVVATMPLRHRVLAQYIGEGEYLY 169 Query: 126 NITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKW 185 ++ L + ++ +++ A + + VK K Sbjct: 170 HVDASQKKEILRLEMDTDNSYVQNLLLAAENVEAFKKAIEHDIHKIVNAVKKIFPVDGKT 229 Query: 186 PEIQ---AEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHRE 242 PE+ ++ W + +R K + + A + N+ G + Sbjct: 230 PELATVIQFLKTWFETEHIDRGLLVKEWAKGNRVSAIQRTESGANAGGGNKTDRNP---D 286 Query: 243 YKQTWKTLDDELA 255 Y+ T TLD E+A Sbjct: 287 YEYTLDTLDVEIA 299 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.302 0.115 0.289 Lambda K H 0.267 0.0354 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,074,284,890 Number of Sequences: 3077464 Number of extensions: 34091180 Number of successful extensions: 98211 Number of sequences better than 1.0e-01: 11 Number of HSP's better than 0.1 without gapping: 26 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 98107 Number of HSP's gapped (non-prelim): 36 length of query: 255 length of database: 1,040,396,356 effective HSP length: 126 effective length of query: 129 effective length of database: 652,635,892 effective search space: 84190030068 effective search space used: 84190030068 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 91 (39.9 bits)