BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (76 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P0ABT0 DNA polymerase III subunit theta n=92 Tax=Entero... 155 4e-37 UniRef50_B5F3L5 Conserved domain protein n=37 Tax=Enterobacteria... 141 7e-33 UniRef50_A8GD54 DNA polymerase II beta subunit n=14 Tax=Enteroba... 110 1e-23 UniRef50_A4TKC0 DNA polymerase III, theta subunit n=24 Tax=Enter... 97 1e-19 UniRef50_A8GDF3 DNA polymerase II beta subunit n=9 Tax=root RepI... 76 4e-13 UniRef50_Q8D2Y6 HolE protein n=1 Tax=Wigglesworthia glossinidia ... 73 4e-12 UniRef50_A8GGS1 Putative uncharacterized protein n=2 Tax=Serrati... 43 0.004 >UniRef50_P0ABT0 DNA polymerase III subunit theta n=92 Tax=Enterobacteriaceae RepID=HOLE_ECO57 Length = 76 Score = 155 bits (392), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 76/76 (100%), Positives = 76/76 (100%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR Sbjct: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 Query: 61 LASVNLSRLPYEPKLK 76 LASVNLSRLPYEPKLK Sbjct: 61 LASVNLSRLPYEPKLK 76 >UniRef50_B5F3L5 Conserved domain protein n=37 Tax=Enterobacteriaceae RepID=B5F3L5_SALA4 Length = 76 Score = 141 bits (355), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 67/76 (88%), Positives = 73/76 (96%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NLA+L+Q EMDKVNVDLAAAGVAFKERYNMPV+AEAVEREQPEHLR+WFRERLIAHR Sbjct: 1 MKTNLAQLEQVEMDKVNVDLAAAGVAFKERYNMPVVAEAVEREQPEHLRAWFRERLIAHR 60 Query: 61 LASVNLSRLPYEPKLK 76 LASV+LSRLPYEPK+K Sbjct: 61 LASVSLSRLPYEPKVK 76 >UniRef50_A8GD54 DNA polymerase II beta subunit n=14 Tax=Enterobacteriaceae RepID=A8GD54_SERP5 Length = 76 Score = 110 bits (276), Expect = 1e-23, Method: Compositional matrix adjust. Identities = 54/76 (71%), Positives = 61/76 (80%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NLA+L + EMDKVNVDLAA+GVAFKERYNMPVI E VEREQP HLR +FRER+ +R Sbjct: 1 MGYNLAELSKEEMDKVNVDLAASGVAFKERYNMPVIPEMVEREQPAHLRDYFRERVAHYR 60 Query: 61 LASVNLSRLPYEPKLK 76 + S SRLPYEPK K Sbjct: 61 VESHKFSRLPYEPKSK 76 >UniRef50_A4TKC0 DNA polymerase III, theta subunit n=24 Tax=Enterobacteriaceae RepID=A4TKC0_YERPP Length = 76 Score = 97.4 bits (241), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 49/76 (64%), Positives = 57/76 (75%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NL +L E K+NVDLAA+GVAFKERYNMPVI E V REQPE LR +F +RL +R Sbjct: 1 MGYNLVELSDEETAKMNVDLAASGVAFKERYNMPVIPEMVAREQPEALREYFLQRLAHYR 60 Query: 61 LASVNLSRLPYEPKLK 76 + S LSRLPYEPK+K Sbjct: 61 IESKKLSRLPYEPKVK 76 >UniRef50_A8GDF3 DNA polymerase II beta subunit n=9 Tax=root RepID=A8GDF3_SERP5 Length = 86 Score = 75.9 bits (185), Expect = 4e-13, Method: Compositional matrix adjust. Identities = 39/67 (58%), Positives = 47/67 (70%), Gaps = 3/67 (4%) Query: 4 NLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHRLAS 63 NLA L + +MDK NVDLAA+GVA+KER N PVIA+ VE QPEHLR + ER+ +R S Sbjct: 6 NLANLSKEDMDKTNVDLAASGVAYKERMNQPVIADQVELVQPEHLRGYICERVAHYRDVS 65 Query: 64 VNLSRLP 70 RLP Sbjct: 66 ---KRLP 69 >UniRef50_Q8D2Y6 HolE protein n=1 Tax=Wigglesworthia glossinidia endosymbiont of Glossina brevipalpis RepID=Q8D2Y6_WIGBR Length = 76 Score = 72.8 bits (177), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 31/58 (53%), Positives = 44/58 (75%) Query: 4 NLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHRL 61 NL K+ + +K+ VDLA++GV F+ERYNMPV E +E QPE+L S+FR+RLI +R+ Sbjct: 7 NLFKISKKNREKIMVDLASSGVVFRERYNMPVSIEQIENNQPEYLLSYFRKRLIYYRI 64 >UniRef50_A8GGS1 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GGS1_SERP5 Length = 63 Score = 42.7 bits (99), Expect = 0.004, Method: Compositional matrix adjust. Identities = 25/63 (39%), Positives = 36/63 (57%), Gaps = 1/63 (1%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPV-IAEAVEREQPEHLRSWFRERLIAH 59 M + L L Q E +K+ VDL A V + ERY + V A E++ P++LR +F RL + Sbjct: 1 MGRKLDSLPQAEREKIEVDLLALSVIYNERYGITVNDASNAEQQVPDYLRPYFHLRLNYY 60 Query: 60 RLA 62 R A Sbjct: 61 RGA 63 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A8GD54 DNA polymerase II beta subunit n=14 Tax=Enteroba... 118 5e-26 UniRef50_P0ABT0 DNA polymerase III subunit theta n=92 Tax=Entero... 117 8e-26 UniRef50_B5F3L5 Conserved domain protein n=37 Tax=Enterobacteria... 115 5e-25 UniRef50_A4TKC0 DNA polymerase III, theta subunit n=24 Tax=Enter... 111 6e-24 UniRef50_A8GDF3 DNA polymerase II beta subunit n=9 Tax=root RepI... 98 7e-20 UniRef50_Q8D2Y6 HolE protein n=1 Tax=Wigglesworthia glossinidia ... 87 2e-16 Sequences not found previously or not previously below threshold: UniRef50_A8GGS1 Putative uncharacterized protein n=2 Tax=Serrati... 47 2e-04 >UniRef50_A8GD54 DNA polymerase II beta subunit n=14 Tax=Enterobacteriaceae RepID=A8GD54_SERP5 Length = 76 Score = 118 bits (296), Expect = 5e-26, Method: Composition-based stats. Identities = 54/76 (71%), Positives = 61/76 (80%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NLA+L + EMDKVNVDLAA+GVAFKERYNMPVI E VEREQP HLR +FRER+ +R Sbjct: 1 MGYNLAELSKEEMDKVNVDLAASGVAFKERYNMPVIPEMVEREQPAHLRDYFRERVAHYR 60 Query: 61 LASVNLSRLPYEPKLK 76 + S SRLPYEPK K Sbjct: 61 VESHKFSRLPYEPKSK 76 >UniRef50_P0ABT0 DNA polymerase III subunit theta n=92 Tax=Enterobacteriaceae RepID=HOLE_ECO57 Length = 76 Score = 117 bits (294), Expect = 8e-26, Method: Composition-based stats. Identities = 76/76 (100%), Positives = 76/76 (100%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR Sbjct: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 Query: 61 LASVNLSRLPYEPKLK 76 LASVNLSRLPYEPKLK Sbjct: 61 LASVNLSRLPYEPKLK 76 >UniRef50_B5F3L5 Conserved domain protein n=37 Tax=Enterobacteriaceae RepID=B5F3L5_SALA4 Length = 76 Score = 115 bits (287), Expect = 5e-25, Method: Composition-based stats. Identities = 67/76 (88%), Positives = 73/76 (96%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NLA+L+Q EMDKVNVDLAAAGVAFKERYNMPV+AEAVEREQPEHLR+WFRERLIAHR Sbjct: 1 MKTNLAQLEQVEMDKVNVDLAAAGVAFKERYNMPVVAEAVEREQPEHLRAWFRERLIAHR 60 Query: 61 LASVNLSRLPYEPKLK 76 LASV+LSRLPYEPK+K Sbjct: 61 LASVSLSRLPYEPKVK 76 >UniRef50_A4TKC0 DNA polymerase III, theta subunit n=24 Tax=Enterobacteriaceae RepID=A4TKC0_YERPP Length = 76 Score = 111 bits (278), Expect = 6e-24, Method: Composition-based stats. Identities = 49/76 (64%), Positives = 57/76 (75%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NL +L E K+NVDLAA+GVAFKERYNMPVI E V REQPE LR +F +RL +R Sbjct: 1 MGYNLVELSDEETAKMNVDLAASGVAFKERYNMPVIPEMVAREQPEALREYFLQRLAHYR 60 Query: 61 LASVNLSRLPYEPKLK 76 + S LSRLPYEPK+K Sbjct: 61 IESKKLSRLPYEPKVK 76 >UniRef50_A8GDF3 DNA polymerase II beta subunit n=9 Tax=root RepID=A8GDF3_SERP5 Length = 86 Score = 98.3 bits (243), Expect = 7e-20, Method: Composition-based stats. Identities = 37/64 (57%), Positives = 45/64 (70%) Query: 3 KNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHRLA 62 NLA L + +MDK NVDLAA+GVA+KER N PVIA+ VE QPEHLR + ER+ +R Sbjct: 5 FNLANLSKEDMDKTNVDLAASGVAYKERMNQPVIADQVELVQPEHLRGYICERVAHYRDV 64 Query: 63 SVNL 66 S L Sbjct: 65 SKRL 68 >UniRef50_Q8D2Y6 HolE protein n=1 Tax=Wigglesworthia glossinidia endosymbiont of Glossina brevipalpis RepID=Q8D2Y6_WIGBR Length = 76 Score = 86.7 bits (213), Expect = 2e-16, Method: Composition-based stats. Identities = 31/58 (53%), Positives = 44/58 (75%) Query: 4 NLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHRL 61 NL K+ + +K+ VDLA++GV F+ERYNMPV E +E QPE+L S+FR+RLI +R+ Sbjct: 7 NLFKISKKNREKIMVDLASSGVVFRERYNMPVSIEQIENNQPEYLLSYFRKRLIYYRI 64 >UniRef50_A8GGS1 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GGS1_SERP5 Length = 63 Score = 46.6 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 1/61 (1%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVI-AEAVEREQPEHLRSWFRERLIAH 59 M + L L Q E +K+ VDL A V + ERY + V A E++ P++LR +F RL + Sbjct: 1 MGRKLDSLPQAEREKIEVDLLALSVIYNERYGITVNDASNAEQQVPDYLRPYFHLRLNYY 60 Query: 60 R 60 R Sbjct: 61 R 61 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A8GD54 DNA polymerase II beta subunit n=14 Tax=Enteroba... 116 2e-25 UniRef50_P0ABT0 DNA polymerase III subunit theta n=92 Tax=Entero... 116 2e-25 UniRef50_B5F3L5 Conserved domain protein n=37 Tax=Enterobacteria... 114 1e-24 UniRef50_A4TKC0 DNA polymerase III, theta subunit n=24 Tax=Enter... 110 1e-23 UniRef50_A8GDF3 DNA polymerase II beta subunit n=9 Tax=root RepI... 98 1e-19 UniRef50_Q8D2Y6 HolE protein n=1 Tax=Wigglesworthia glossinidia ... 85 6e-16 UniRef50_A8GGS1 Putative uncharacterized protein n=2 Tax=Serrati... 79 3e-14 Sequences not found previously or not previously below threshold: UniRef50_C0AWV9 Putative uncharacterized protein n=1 Tax=Proteus... 38 0.080 CONVERGED! >UniRef50_A8GD54 DNA polymerase II beta subunit n=14 Tax=Enterobacteriaceae RepID=A8GD54_SERP5 Length = 76 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 54/76 (71%), Positives = 61/76 (80%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NLA+L + EMDKVNVDLAA+GVAFKERYNMPVI E VEREQP HLR +FRER+ +R Sbjct: 1 MGYNLAELSKEEMDKVNVDLAASGVAFKERYNMPVIPEMVEREQPAHLRDYFRERVAHYR 60 Query: 61 LASVNLSRLPYEPKLK 76 + S SRLPYEPK K Sbjct: 61 VESHKFSRLPYEPKSK 76 >UniRef50_P0ABT0 DNA polymerase III subunit theta n=92 Tax=Enterobacteriaceae RepID=HOLE_ECO57 Length = 76 Score = 116 bits (291), Expect = 2e-25, Method: Composition-based stats. Identities = 76/76 (100%), Positives = 76/76 (100%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR Sbjct: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 Query: 61 LASVNLSRLPYEPKLK 76 LASVNLSRLPYEPKLK Sbjct: 61 LASVNLSRLPYEPKLK 76 >UniRef50_B5F3L5 Conserved domain protein n=37 Tax=Enterobacteriaceae RepID=B5F3L5_SALA4 Length = 76 Score = 114 bits (284), Expect = 1e-24, Method: Composition-based stats. Identities = 67/76 (88%), Positives = 73/76 (96%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NLA+L+Q EMDKVNVDLAAAGVAFKERYNMPV+AEAVEREQPEHLR+WFRERLIAHR Sbjct: 1 MKTNLAQLEQVEMDKVNVDLAAAGVAFKERYNMPVVAEAVEREQPEHLRAWFRERLIAHR 60 Query: 61 LASVNLSRLPYEPKLK 76 LASV+LSRLPYEPK+K Sbjct: 61 LASVSLSRLPYEPKVK 76 >UniRef50_A4TKC0 DNA polymerase III, theta subunit n=24 Tax=Enterobacteriaceae RepID=A4TKC0_YERPP Length = 76 Score = 110 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 49/76 (64%), Positives = 57/76 (75%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHR 60 M NL +L E K+NVDLAA+GVAFKERYNMPVI E V REQPE LR +F +RL +R Sbjct: 1 MGYNLVELSDEETAKMNVDLAASGVAFKERYNMPVIPEMVAREQPEALREYFLQRLAHYR 60 Query: 61 LASVNLSRLPYEPKLK 76 + S LSRLPYEPK+K Sbjct: 61 IESKKLSRLPYEPKVK 76 >UniRef50_A8GDF3 DNA polymerase II beta subunit n=9 Tax=root RepID=A8GDF3_SERP5 Length = 86 Score = 97.9 bits (242), Expect = 1e-19, Method: Composition-based stats. Identities = 37/64 (57%), Positives = 45/64 (70%) Query: 3 KNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHRLA 62 NLA L + +MDK NVDLAA+GVA+KER N PVIA+ VE QPEHLR + ER+ +R Sbjct: 5 FNLANLSKEDMDKTNVDLAASGVAYKERMNQPVIADQVELVQPEHLRGYICERVAHYRDV 64 Query: 63 SVNL 66 S L Sbjct: 65 SKRL 68 >UniRef50_Q8D2Y6 HolE protein n=1 Tax=Wigglesworthia glossinidia endosymbiont of Glossina brevipalpis RepID=Q8D2Y6_WIGBR Length = 76 Score = 85.2 bits (209), Expect = 6e-16, Method: Composition-based stats. Identities = 31/58 (53%), Positives = 44/58 (75%) Query: 4 NLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHRL 61 NL K+ + +K+ VDLA++GV F+ERYNMPV E +E QPE+L S+FR+RLI +R+ Sbjct: 7 NLFKISKKNREKIMVDLASSGVVFRERYNMPVSIEQIENNQPEYLLSYFRKRLIYYRI 64 >UniRef50_A8GGS1 Putative uncharacterized protein n=2 Tax=Serratia RepID=A8GGS1_SERP5 Length = 63 Score = 79.4 bits (194), Expect = 3e-14, Method: Composition-based stats. Identities = 24/61 (39%), Positives = 35/61 (57%), Gaps = 1/61 (1%) Query: 1 MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVI-AEAVEREQPEHLRSWFRERLIAH 59 M + L L Q E +K+ VDL A V + ERY + V A E++ P++LR +F RL + Sbjct: 1 MGRKLDSLPQAEREKIEVDLLALSVIYNERYGITVNDASNAEQQVPDYLRPYFHLRLNYY 60 Query: 60 R 60 R Sbjct: 61 R 61 >UniRef50_C0AWV9 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AWV9_9ENTR Length = 46 Score = 38.2 bits (87), Expect = 0.080, Method: Composition-based stats. Identities = 18/34 (52%), Positives = 22/34 (64%) Query: 33 MPVIAEAVEREQPEHLRSWFRERLIAHRLASVNL 66 M IA VER+QP HLR++F ERL +R S L Sbjct: 1 MLAIAVEVERQQPAHLRAYFNERLAFYRERSKKL 34 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.305 0.130 0.346 Lambda K H 0.267 0.0390 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 381,723,346 Number of Sequences: 3077464 Number of extensions: 10543144 Number of successful extensions: 30889 Number of sequences better than 1.0e-01: 9 Number of HSP's better than 0.1 without gapping: 22 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 30865 Number of HSP's gapped (non-prelim): 24 length of query: 76 length of database: 1,040,396,356 effective HSP length: 47 effective length of query: 29 effective length of database: 895,755,548 effective search space: 25976910892 effective search space used: 25976910892 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 87 (38.2 bits)