BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (273 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 563 e-159 UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 385 e-105 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 257 3e-67 UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo... 231 1e-59 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 223 4e-57 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 205 1e-51 UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o... 204 3e-51 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 193 5e-48 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 185 1e-45 UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo... 155 1e-36 UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide... 149 1e-34 UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu... 142 1e-32 UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ... 78 3e-13 UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or... 58 3e-07 UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ... 52 2e-05 UniRef50_A9V0Z1 Predicted protein n=1 Tax=Monosiga brevicollis R... 42 0.023 >UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI Length = 273 Score = 563 bits (1452), Expect = e-159, Method: Compositional matrix adjust. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE Sbjct: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 Query: 61 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF Sbjct: 61 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120 Query: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ Sbjct: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 Query: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG Sbjct: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 Query: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ Sbjct: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 >UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI Length = 273 Score = 385 bits (988), Expect = e-105, Method: Compositional matrix adjust. Identities = 177/267 (66%), Positives = 220/267 (82%), Gaps = 1/267 (0%) Query: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 Query: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 QPFFACQ+RVRD RRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+F++V + +AM+ + L Q Sbjct: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 Query: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 Query: 242 RTHTRAVRGIDGDVKLNRALWVMAETL 268 T TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 257 bits (657), Expect = 3e-67, Method: Compositional matrix adjust. Identities = 130/270 (48%), Positives = 181/270 (67%), Gaps = 5/270 (1%) Query: 4 LASRFG-AANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 LA+RFG ++ I PL E L+R VPS+F+ + H+SRSERY Y+PTI +++ L+REG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 PFFA Q+ RD R H KHMLRLRRE + + E I++NSHDGTS++Q+ GM R Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 VC N ++ GE F EVRVPHKG++ +IEG Y V F R+ + + M+ + L Q+ Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 183 LAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG--RNAKG 240 L + +L R+GED P+T QI+ PRR++D + LWTT+ IQEN+I+GGL G RNA+G Sbjct: 186 LGEVSLVARYGEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAEG 245 Query: 241 --GRTHTRAVRGIDGDVKLNRALWVMAETL 268 R+ +R + GID +V LNRALW +AE + Sbjct: 246 RIRRSRSRPINGIDQNVTLNRALWTLAEGM 275 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 231 bits (590), Expect = 1e-59, Method: Compositional matrix adjust. Identities = 125/276 (45%), Positives = 171/276 (61%), Gaps = 13/276 (4%) Query: 6 SRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQP 64 +RFG+ A ++R + L L P+VF+EDKH SRS++YTYIPT+ +L L REGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 65 FFACQTRVRDPRRREHTKHMLRLRREGQI---TGKQVPEIILLNSHDGTSSYQMLPGMFR 121 RD +R +TKH+LRLRR G G E++LLNSHDGTSSYQ++ G+FR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 122 AVCQNGLVCGESFGEV-RVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 +C NGLVC + ++ ++PHKGD+V QVI+GAY ++ E V+ M+ + L P Q Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 181 QALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+AA R+ + Q PV QI +PRR +D N LW + R QE LI+GG+ R Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 237 NAKGGRT----HTRAVRGIDGDVKLNRALWVMAETL 268 N + GR TR V+G+DG+ LNRALWV+A + Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRM 289 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 223 bits (569), Expect = 4e-57, Method: Compositional matrix adjust. Identities = 119/266 (44%), Positives = 166/266 (62%), Gaps = 9/266 (3%) Query: 12 NLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTR 71 ++ R PLT EL VPS+F+ + HESRS R+ +PTI++LD L+ EGF+PFFA Q R Sbjct: 44 SIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQAR 103 Query: 72 VRDPRRREHTKHMLRLRREGQIT-GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVC 130 R + E TKHMLRLR G + + EI+L+N++DGTS+YQM+PG FR VC NGL+ Sbjct: 104 TRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLMA 163 Query: 131 GESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTY 190 GE+F EV+V H G+ + +VIEGAY VL RV ++ +S+ L ++ LA+AA + Sbjct: 164 GETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHSL 223 Query: 191 RFGEDHQ----PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR-NAKGG---R 242 RF + P+ +L PRR +D + DLWT + +QEN ++GG+ GR G R Sbjct: 224 RFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIRR 283 Query: 243 THTRAVRGIDGDVKLNRALWVMAETL 268 R V GID LNRALW++ E + Sbjct: 284 QTVREVTGIDQSRALNRALWMLTERM 309 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 205 bits (522), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 113/269 (42%), Positives = 158/269 (58%), Gaps = 17/269 (6%) Query: 15 RRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRD 74 R +PLT E+L R+ PS+F+E KHESRS+RYTYIPTI ++ L+ EGF P A Q R Sbjct: 11 RHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEGFFPVMARQGNSRI 70 Query: 75 PRRREHTKHMLRLRREG-----QITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLV 129 P + E+TKH++R R + G PE+ LLNSHDGTS+Y+++ M R C+NG+V Sbjct: 71 PGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKIIAAMMRLACENGMV 130 Query: 130 CGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAAL 188 ++ E+ VPHKG V +VIEG+Y VL + E L Q+ A+A Sbjct: 131 VQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTLTERQQKGFAEAVH 190 Query: 189 TYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS-------GRNAK 239 ++G+D + P T L RR D+ DLW R+QE+ I+GG++ GRN K Sbjct: 191 IAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGMTGFRWDEDGRNRK 250 Query: 240 GGRTHTRAVRGIDGDVKLNRALWVMAETL 268 R R V+ IDGD+KLN+A+W +A+ L Sbjct: 251 --RVTARPVKSIDGDIKLNKAVWHLAQML 277 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 204 bits (519), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 98/220 (44%), Positives = 148/220 (67%), Gaps = 4/220 (1%) Query: 53 LLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRR---EGQITGKQVPEIILLNSHDG 109 ++++L+REG+ P A ++RVR P R+ +KH+LR RR E + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 110 TSSYQMLPGMFRAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRD 168 + +YQ+ G+FR VC NG++ +S G+V+ H GDVV +VIEG YE++ R+ + + Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 169 AMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENL 228 ++L L Q+ A++AL R+ E P +L PRR +D+ NDLW TYQR+QEN+ Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWREGEAPCMPQALLRPRRHEDQGNDLWATYQRVQENM 180 Query: 229 IKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 +KGG+ GR+A G + TRAV+ +DG+VKLN+ALW + E + Sbjct: 181 LKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQM 220 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 193 bits (491), Expect = 5e-48, Method: Compositional matrix adjust. Identities = 111/264 (42%), Positives = 156/264 (59%), Gaps = 13/264 (4%) Query: 18 RPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRR 77 R +T E+++V PS+F+ HESRS+R+ IPTI +L L EGF P A Q+ R + Sbjct: 121 RTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEGFVPVGAKQSASRTEGK 180 Query: 78 REHTKHMLRLRR--EGQI--TGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGE- 132 + TKH++RLRR +G+ G V EI+L N++DGTS+Y++L G+FR C N LV Sbjct: 181 ADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLAGLFRIRCMNSLVTQTG 240 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRF 192 + ++V H GDV ++VIEG Y VL ER + L QQ +A+AA RF Sbjct: 241 TIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLNRDEQQIMAEAAHVLRF 300 Query: 193 ----GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRN----AKGGRTH 244 GE P+ Q+L PRR D ++DLWT + QEN+I+GGL G + R Sbjct: 301 GDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGGLRGIGREDLGRPRRVK 360 Query: 245 TRAVRGIDGDVKLNRALWVMAETL 268 +RAV GID D+KLN+ALW++ E + Sbjct: 361 SRAVNGIDQDIKLNKALWLIGEKM 384 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 185 bits (470), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 102/257 (39%), Positives = 151/257 (58%), Gaps = 7/257 (2%) Query: 20 LTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRRE 79 LT E+L++ PS+F+ SERY I T ++D L +EGF P A Q+ R ++ Sbjct: 16 LTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKATQSASRSEEKKV 75 Query: 80 HTKHMLRLR-REGQITGKQV-PEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEV 137 +KH++R R R+ G + PE++L+NSHDG SSY+++ G++R VC NGLV G+S+ EV Sbjct: 76 FSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTNGLVAGKSYDEV 135 Query: 138 RVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQ 197 RV H+GDV+ VIEG Y V+ +++ + + M LP + A RF ED Sbjct: 136 RVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQAHALRFSEDAN 195 Query: 198 PVTE-SQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG----RNAKGGRTHTRAVRGID 252 V E +L PRR +D DL++ + +QENLIKGG+ G + + R +R + ID Sbjct: 196 LVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWRRARSRKITSID 255 Query: 253 GDVKLNRALWVMAETLL 269 +VK+NR LW +AE L Sbjct: 256 QNVKINRDLWTIAENTL 272 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 155 bits (392), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 86/263 (32%), Positives = 143/263 (54%), Gaps = 19/263 (7%) Query: 17 DRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPR 76 + PLT E+L ++ PS+F+++ + S++Y +I TI +++ ++ + P + VRD + Sbjct: 5 NEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDEK 64 Query: 77 RREHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-ES 133 + KH +R R G+ V E++L NSHD + + + G+FR VC NGLV E Sbjct: 65 KEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDEV 124 Query: 134 FGEVRVPHKGD-------VVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKA 186 F ++ H GD ++++ + Y++L K + L + + AKA Sbjct: 125 FESYQIKHLGDKENDVSIAINKIAKAKYDILN-------KIKLFSKIPLTQDDKASFAKA 177 Query: 187 ALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHT- 245 A+ RF E H V +L P R +DE +DL+TT+ IQE+LI+G +SG NA+ R T Sbjct: 178 AIPLRF-EKHLKVDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTS 236 Query: 246 RAVRGIDGDVKLNRALWVMAETL 268 R ++ I D +N+ LW MAE++ Sbjct: 237 RIIKSISTDTDINKKLWNMAESI 259 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 149 bits (375), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 81/109 (74%), Positives = 95/109 (87%) Query: 160 FERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWT 219 F+ V EKR+ MQSLLLPPP QQALA+AALTYRFGE+HQP+TE Q+L PRRW+D+ +DLWT Sbjct: 4 FDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEEHQPITEEQVLQPRRWEDKKDDLWT 63 Query: 220 TYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 YQR+QENLIKGGLSGRNAKG R TR+V GIDGD+KLN+ALWVM E + Sbjct: 64 VYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKM 112 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 142 bits (358), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 97/271 (35%), Positives = 143/271 (52%), Gaps = 23/271 (8%) Query: 18 RPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRR 77 R L+ ++L RV PSVF+E S RYT++ T ++D L+ EG++P A Q RVR R Sbjct: 14 RALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQQRVRLENR 73 Query: 78 REHTKHMLRLRREGQI------TGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG 131 + H LR R + G PE+IL N+HDGT +Y++ G++R VC+NGL Sbjct: 74 QGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLVCRNGLTVA 133 Query: 132 ES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTY 190 ++ F V + H + A V RV E Q++ L P + + A A+ Sbjct: 134 DADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHSFAARAMAL 193 Query: 191 RFGEDHQPVTE----SQILSPRRWQDESNDLWTTYQRIQENLIKGGL--SGR--NAKGG- 241 R+ + QPVT Q+L+P R+ D++ DLWTT+ +QE L +GGL +G A+G Sbjct: 194 RW-DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGHIPAAEGAV 252 Query: 242 ------RTHTRAVRGIDGDVKLNRALWVMAE 266 R TR V G+ +LN+ALW +AE Sbjct: 253 FPTHYLRNTTRPVGGLTEGQRLNKALWNLAE 283 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 77.8 bits (190), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 38/130 (29%), Positives = 70/130 (53%), Gaps = 3/130 (2%) Query: 18 RPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRR 77 +PL+ EL R+ PS+F+ + + S++Y +I TI +++ ++ + P + VR+ + Sbjct: 6 QPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNEDK 65 Query: 78 REHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-ESF 134 + +H +R R + E++L NSHD + + + G+FR VC NGLV E F Sbjct: 66 EGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADEVF 125 Query: 135 GEVRVPHKGD 144 ++ H G+ Sbjct: 126 ESYQIKHIGE 135 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 58.2 bits (139), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 27/52 (51%), Positives = 40/52 (76%), Gaps = 1/52 (1%) Query: 4 LASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLL 54 LASRF + + +R D PL+ +++ RV PS+F++ HESRSERY+YIPT ++L Sbjct: 36 LASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 52.0 bits (123), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 28/95 (29%), Positives = 49/95 (51%), Gaps = 5/95 (5%) Query: 96 KQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDV-VSQVIEGAY 154 K + ++++NS+DG+ ++Q+ G FR VC NG++ GE F + V H G + QV Sbjct: 139 KVILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDVRHTGTMNFGQVTRQVT 198 Query: 155 EVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALT 189 + FE + + D L+ P+ + A +T Sbjct: 199 TAVSSFENMGQYWDT----LINSPLNRKDADKIIT 229 >UniRef50_A9V0Z1 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V0Z1_MONBE Length = 981 Score = 42.0 bits (97), Expect = 0.023, Method: Compositional matrix adjust. Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 12/83 (14%) Query: 27 RVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHML- 85 RV+P +FSE + + R + + I QPFFA R D RE KH+L Sbjct: 181 RVLPKLFSESREQDRHDVAKHKAQI-----------QPFFAMYERHFDESVREEPKHILA 229 Query: 86 RLRREGQITGKQVPEIILLNSHD 108 RL ++ + + V I +L+ HD Sbjct: 230 RLEQQPAVENRSVTHIFILHCHD 252 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 416 e-115 UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 404 e-111 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 356 6e-97 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 332 6e-90 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 331 2e-89 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 330 4e-89 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 328 2e-88 UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo... 326 4e-88 UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo... 319 8e-86 UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o... 299 6e-80 UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu... 278 2e-73 UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ... 202 1e-50 UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide... 145 2e-33 UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ... 104 3e-21 UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or... 74 7e-12 Sequences not found previously or not previously below threshold: UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostri... 56 1e-06 UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfo... 55 3e-06 UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophu... 53 9e-06 UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavoba... 51 4e-05 UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 48 4e-04 UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victiva... 47 6e-04 UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfo... 47 9e-04 UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 47 0.001 UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=... 44 0.005 UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax... 44 0.005 UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwen... 43 0.012 UniRef50_C2LEJ7 Putative uncharacterized protein n=1 Tax=Proteus... 42 0.016 UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=... 42 0.019 UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax... 42 0.026 >UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI Length = 273 Score = 416 bits (1069), Expect = e-115, Method: Composition-based stats. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE Sbjct: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 Query: 61 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF Sbjct: 61 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120 Query: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ Sbjct: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 Query: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG Sbjct: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 Query: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ Sbjct: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 >UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI Length = 273 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 177/267 (66%), Positives = 220/267 (82%), Gaps = 1/267 (0%) Query: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 Query: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 QPFFACQ+RVRD RRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+F++V + +AM+ + L Q Sbjct: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 Query: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 Query: 242 RTHTRAVRGIDGDVKLNRALWVMAETL 268 T TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 356 bits (913), Expect = 6e-97, Method: Composition-based stats. Identities = 130/270 (48%), Positives = 181/270 (67%), Gaps = 5/270 (1%) Query: 4 LASRFG-AANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 LA+RFG ++ I PL E L+R VPS+F+ + H+SRSERY Y+PTI +++ L+REG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 PFFA Q+ RD R H KHMLRLRRE + + E I++NSHDGTS++Q+ GM R Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 VC N ++ GE F EVRVPHKG++ +IEG Y V F R+ + + M+ + L Q+ Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 183 LAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG--RNAKG 240 L + +L R+GED P+T QI+ PRR++D + LWTT+ IQEN+I+GGL G RNA+G Sbjct: 186 LGEVSLVARYGEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAEG 245 Query: 241 --GRTHTRAVRGIDGDVKLNRALWVMAETL 268 R+ +R + GID +V LNRALW +AE + Sbjct: 246 RIRRSRSRPINGIDQNVTLNRALWTLAEGM 275 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 332 bits (852), Expect = 6e-90, Method: Composition-based stats. Identities = 110/268 (41%), Positives = 153/268 (57%), Gaps = 13/268 (4%) Query: 14 IRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVR 73 R +T E+++V PS+F+ HESRS+R+ IPTI +L L EGF P A Q+ R Sbjct: 117 FDTARTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEGFVPVGAKQSASR 176 Query: 74 DPRRREHTKHMLRLRREGQ----ITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLV 129 + + TKH++RLRR G V EI+L N++DGTS+Y++L G+FR C N LV Sbjct: 177 TEGKADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLAGLFRIRCMNSLV 236 Query: 130 CGE-SFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAAL 188 + ++V H GDV ++VIEG Y VL ER + L QQ +A+AA Sbjct: 237 TQTGTIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLNRDEQQIMAEAAH 296 Query: 189 TYRF----GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRN----AKG 240 RF GE P+ Q+L PRR D ++DLWT + QEN+I+GGL G + Sbjct: 297 VLRFGDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGGLRGIGREDLGRP 356 Query: 241 GRTHTRAVRGIDGDVKLNRALWVMAETL 268 R +RAV GID D+KLN+ALW++ E + Sbjct: 357 RRVKSRAVNGIDQDIKLNKALWLIGEKM 384 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 118/266 (44%), Positives = 166/266 (62%), Gaps = 9/266 (3%) Query: 12 NLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTR 71 ++ R PLT EL VPS+F+ + HESRS R+ +PTI++LD L+ EGF+PFFA Q R Sbjct: 44 SIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQAR 103 Query: 72 VRDPRRREHTKHMLRLRREGQIT-GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVC 130 R + E TKHMLRLR G + + EI+L+N++DGTS+YQM+PG FR VC NGL+ Sbjct: 104 TRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLMA 163 Query: 131 GESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTY 190 GE+F EV+V H G+ + +VIEGAY VL RV ++ +S+ L ++ LA+AA + Sbjct: 164 GETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHSL 223 Query: 191 RFGEDHQ----PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR----NAKGGR 242 RF + P+ +L PRR +D + DLWT + +QEN ++GG+ GR + R Sbjct: 224 RFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIRR 283 Query: 243 THTRAVRGIDGDVKLNRALWVMAETL 268 R V GID LNRALW++ E + Sbjct: 284 QTVREVTGIDQSRALNRALWMLTERM 309 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 330 bits (845), Expect = 4e-89, Method: Composition-based stats. Identities = 111/276 (40%), Positives = 156/276 (56%), Gaps = 13/276 (4%) Query: 6 SRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPF 65 S + R +PLT E+L R+ PS+F+E KHESRS+RYTYIPTI ++ L+ EGF P Sbjct: 2 SFLSRVSAHRHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEGFFPV 61 Query: 66 FACQTRVRDPRRREHTKHMLRLRREGQ-----ITGKQVPEIILLNSHDGTSSYQMLPGMF 120 A Q R P + E+TKH++R R G PE+ LLNSHDGTS+Y+++ M Sbjct: 62 MARQGNSRIPGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKIIAAMM 121 Query: 121 RAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPV 179 R C+NG+V ++ E+ VPHKG V +VIEG+Y VL + E L Sbjct: 122 RLACENGMVVQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTLTERQ 181 Query: 180 QQALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG-- 235 Q+ A+A ++G+D + P T L RR D+ DLW R+QE+ I+GG++G Sbjct: 182 QKGFAEAVHIAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGMTGFR 241 Query: 236 ---RNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 R R V+ IDGD+KLN+A+W +A+ L Sbjct: 242 WDEDGRNRKRVTARPVKSIDGDIKLNKAVWHLAQML 277 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 328 bits (840), Expect = 2e-88, Method: Composition-based stats. Identities = 103/268 (38%), Positives = 150/268 (55%), Gaps = 11/268 (4%) Query: 13 LIRRDRP----LTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFAC 68 LI P LT E+L++ PS+F+ SERY I T ++D L +EGF P A Sbjct: 5 LIESGEPAMNVLTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKAT 64 Query: 69 QTRVRDPRRREHTKHMLRLRREG-QITGKQ-VPEIILLNSHDGTSSYQMLPGMFRAVCQN 126 Q+ R ++ +KH++R R G PE++L+NSHDG SSY+++ G++R VC N Sbjct: 65 QSASRSEEKKVFSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTN 124 Query: 127 GLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKA 186 GLV G+S+ EVRV H+GDV+ VIEG Y V+ +++ + + M LP + Sbjct: 125 GLVAGKSYDEVRVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQ 184 Query: 187 ALTYRFGEDHQPV-TESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG----RNAKGG 241 A RF ED V +L PRR +D DL++ + +QENLIKGG+ G + + Sbjct: 185 AHALRFSEDANLVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWR 244 Query: 242 RTHTRAVRGIDGDVKLNRALWVMAETLL 269 R +R + ID +VK+NR LW +AE L Sbjct: 245 RARSRKITSIDQNVKINRDLWTIAENTL 272 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 326 bits (836), Expect = 4e-88, Method: Composition-based stats. Identities = 125/276 (45%), Positives = 171/276 (61%), Gaps = 13/276 (4%) Query: 6 SRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQP 64 +RFG+ A ++R + L L P+VF+EDKH SRS++YTYIPT+ +L L REGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 65 FFACQTRVRDPRRREHTKHMLRLRREGQI---TGKQVPEIILLNSHDGTSSYQMLPGMFR 121 RD +R +TKH+LRLRR G G E++LLNSHDGTSSYQ++ G+FR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 122 AVCQNGLVCGESFGEV-RVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 +C NGLVC + ++ ++PHKGD+V QVI+GAY ++ E V+ M+ + L P Q Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 181 QALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+AA R+ + Q PV QI +PRR +D N LW + R QE LI+GG+ R Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 237 NAKGG----RTHTRAVRGIDGDVKLNRALWVMAETL 268 N + G R TR V+G+DG+ LNRALWV+A + Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRM 289 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 319 bits (817), Expect = 8e-86, Method: Composition-based stats. Identities = 85/257 (33%), Positives = 139/257 (54%), Gaps = 5/257 (1%) Query: 16 RDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDP 75 + PLT E+L ++ PS+F+++ + S++Y +I TI +++ ++ + P + VRD Sbjct: 4 SNEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDE 63 Query: 76 RRREHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-E 132 ++ KH +R R G+ V E++L NSHD + + + G+FR VC NGLV E Sbjct: 64 KKEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDE 123 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRF 192 F ++ H GD + V ++ + K + L + + AKAA+ RF Sbjct: 124 VFESYQIKHLGDKENDVSIAINKIAKAKYDILNKIKLFSKIPLTQDDKASFAKAAIPLRF 183 Query: 193 GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHT-RAVRGI 251 E H V +L P R +DE +DL+TT+ IQE+LI+G +SG NA+ R T R ++ I Sbjct: 184 -EKHLKVDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTSRIIKSI 242 Query: 252 DGDVKLNRALWVMAETL 268 D +N+ LW MAE++ Sbjct: 243 STDTDINKKLWNMAESI 259 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 299 bits (766), Expect = 6e-80, Method: Composition-based stats. Identities = 97/220 (44%), Positives = 147/220 (66%), Gaps = 4/220 (1%) Query: 53 LLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREG---QITGKQVPEIILLNSHDG 109 ++++L+REG+ P A ++RVR P R+ +KH+LR RR + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 110 TSSYQMLPGMFRAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRD 168 + +YQ+ G+FR VC NG++ +S G+V+ H GDVV +VIEG YE++ R+ + + Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 169 AMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENL 228 ++L L Q+ A++AL R+ E P +L PRR +D+ NDLW TYQR+QEN+ Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWREGEAPCMPQALLRPRRHEDQGNDLWATYQRVQENM 180 Query: 229 IKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 +KGG+ GR+A G + TRAV+ +DG+VKLN+ALW + E + Sbjct: 181 LKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQM 220 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 278 bits (710), Expect = 2e-73, Method: Composition-based stats. Identities = 99/281 (35%), Positives = 144/281 (51%), Gaps = 23/281 (8%) Query: 10 AANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQ 69 A R R L+ ++L RV PSVF+E S RYT++ T ++D L+ EG++P A Q Sbjct: 6 TAPSSRVFRALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPVKANQ 65 Query: 70 TRVRDPRRREHTKHMLRLRREGQI------TGKQVPEIILLNSHDGTSSYQMLPGMFRAV 123 RVR R+ H LR R + G PE+IL N+HDGT +Y++ G++R V Sbjct: 66 QRVRLENRQGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGLYRLV 125 Query: 124 CQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 C+NGL ++ F V + H + A V RV E Q++ L P + + Sbjct: 126 CRNGLTVADADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPLARHS 185 Query: 183 LAKAALTYRFGEDHQPVT----ESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+ R+ + QPVT Q+L+P R+ D++ DLWTT+ +QE L +GGL G Sbjct: 186 FAARAMALRW-DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLRYAGH 244 Query: 237 --NAKGG-------RTHTRAVRGIDGDVKLNRALWVMAETL 268 A+G R TR V G+ +LN+ALW +AE Sbjct: 245 IPAAEGAVFPTHYLRNTTRPVGGLTEGQRLNKALWNLAEEF 285 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 42/172 (24%), Positives = 84/172 (48%), Gaps = 3/172 (1%) Query: 16 RDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDP 75 +PL+ EL R+ PS+F+ + + S++Y +I TI +++ ++ + P + VR+ Sbjct: 4 STQPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNE 63 Query: 76 RRREHTKHMLRLRREGQIT--GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-E 132 + + +H +R R + E++L NSHD + + + G+FR VC NGLV E Sbjct: 64 DKEGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADE 123 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALA 184 F ++ H G+ + V ++ +++ +K + L + + A Sbjct: 124 VFESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITLTEQDKISFA 175 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 81/112 (72%), Positives = 96/112 (85%) Query: 157 LGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESND 216 + F+ V EKR+ MQSLLLPPP QQALA+AALTYRFGE+HQP+TE Q+L PRRW+D+ +D Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEEHQPITEEQVLQPRRWEDKKDD 60 Query: 217 LWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 LWT YQR+QENLIKGGLSGRNAKG R TR+V GIDGD+KLN+ALWVM E + Sbjct: 61 LWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKM 112 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 104 bits (259), Expect = 3e-21, Method: Composition-based stats. Identities = 35/148 (23%), Positives = 62/148 (41%), Gaps = 13/148 (8%) Query: 80 HTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRV 139 HM+++ K + ++++NS+DG+ ++Q+ G FR VC NG++ GE F + V Sbjct: 127 FPAHMVQI----GSGDKVILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDV 182 Query: 140 PHKGDV-VSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQP 198 H G + QV + FE + + D L+ P+ + A +T + Sbjct: 183 RHTGTMNFGQVTRQVTTAVSSFENMGQYWDT----LINSPLNRKDADKIITDMSTVGREL 238 Query: 199 VTESQILSPRRWQDESNDL----WTTYQ 222 + R + D L W Y Sbjct: 239 NMNKFDMFDRLYTDHKKTLGENHWAMYN 266 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 73.5 bits (179), Expect = 7e-12, Method: Composition-based stats. Identities = 27/53 (50%), Positives = 41/53 (77%), Gaps = 1/53 (1%) Query: 3 RLASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLL 54 +LASRF + + +R D PL+ +++ RV PS+F++ HESRSERY+YIPT ++L Sbjct: 35 QLASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A8RIH4_9CLOT Length = 312 Score = 56.2 bits (134), Expect = 1e-06, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 86/245 (35%), Gaps = 33/245 (13%) Query: 42 SERYTYIPTISLL---DSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV 98 ++RY + D L EG RR + +L + I+G ++ Sbjct: 76 TDRYKVVQNEDAFAFTDQLLGEG---VTYETAGSLQNGRRTWL--LAKLPQRYIISGDEI 130 Query: 99 -PEIILLNSHDGTSSYQMLPGMFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 P ++ +N+HDGT + ++ R VC N L + H GD+ ++ + Y Sbjct: 131 TPYMVFMNTHDGTGAIRVAMTPVRVVCMNTLNLALSTAKRSWSTNHTGDIAGKMEDARYT 190 Query: 156 VLGIFERVEE---KRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQD 212 +L + E D M+ L L A P + Q R +D Sbjct: 191 LLYADRYMSELGKAIDHMKRLRLSERQVMEYIDALFPLY----DNPTPQQQKNLNRMKED 246 Query: 213 ESNDLWTTYQRIQENLIKGGLSGRNAKGG-RTHTRAV------------RGIDGDVKLNR 259 + +++ K G NA TH R + + ++G+ ++R Sbjct: 247 MKTRYFDAPDL--KHVGKNGYRFINAVSDFATHARPLRESANHKENLFAKTVEGNALIDR 304 Query: 260 ALWVM 264 A ++ Sbjct: 305 AFAML 309 >UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZS75_DESOH Length = 318 Score = 54.6 bits (130), Expect = 3e-06, Method: Composition-based stats. Identities = 35/157 (22%), Positives = 61/157 (38%), Gaps = 18/157 (11%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRL----RREGQITG-- 95 +ERY + + +L L R GF P Q + D ++R+ R G G Sbjct: 108 TERYKPLDNMDVLSQLLRHGFDPDTQVQYAIDDG------MFLVRIPEYARAFGVNPGYG 161 Query: 96 ---KQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGD-VVSQVIE 151 + VP + NS G ++ + +R VC NGL+ S R H + + E Sbjct: 162 KLDEIVPGVSFANSEVGLLAFSIEAFFYRLVCTNGLISKTSSTFSRFKHISNRGLENFPE 221 Query: 152 GAYEVLGIFERVEEKRDAMQSLLLPPPVQ--QALAKA 186 V+ R +E+ + + P++ + A+ Sbjct: 222 TIAGVIEDSVRKQEQFKLSRQSPVENPIRSIETFARQ 258 >UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LV02_SYNAS Length = 264 Score = 53.1 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 49/271 (18%), Positives = 92/271 (33%), Gaps = 57/271 (21%) Query: 15 RRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDS--------LQREGFQPFF 66 R +T+++L + ++ Y + L D L+ Sbjct: 8 RGGELVTKDQLDLI--------PLPEPTDSYMPVSHYDLADKFLMISQDILRDYKL--VG 57 Query: 67 ACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQN 126 R + +L+ +RE G I NS+D + + + G VC N Sbjct: 58 ENYGIAR-QGNQFFA--VLKFQRERSEIG---LSIAFRNSYDRSMAIGLAIGASVFVCDN 111 Query: 127 GLVCGESFGEVRVPHKGDV----VSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 + GE V H +V + I Y+ ++++ DA +S LP A Sbjct: 112 LALSGEIV--VMKKHTKNVWSELEEKAIATIYKSQNNYDQLIGDVDAFKS--LPVDDNGA 167 Query: 183 LAKAALTYRFGEDHQPVTESQI-------LSPRRWQDESNDLWTTYQRIQENLIKGGLSG 235 A+ FG + ++ Q+ L P + E +LW+ Y E+L Sbjct: 168 F--QAMGLLFGNN--IISPRQLTVLKEEWLKPSHEEFEPRNLWSFYNAATESL------- 216 Query: 236 RNAKGGRTHTRAVRGIDGDVKLNRALWVMAE 266 + V ++ ++L+ AL + + Sbjct: 217 -------KSSPPVTIMEKHIRLHEALTYLGK 240 >UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GXR9_FLAPJ Length = 285 Score = 51.2 bits (121), Expect = 4e-05, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 64/187 (34%), Gaps = 32/187 (17%) Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVI--- 150 K P + NS+DG+ G FR VC NGL + + H+G++ V+ Sbjct: 109 LDKIRPMLRFTNSYDGSCKTSGTFGFFREVCSNGLHTASTDIGFSLKHRGNINELVLPAI 168 Query: 151 -EGAYEVLG----IFERVEEKRDAMQSLLLPPPVQQALAKAALTYRF-GEDHQP---VTE 201 + Y L R E + VQ A+ ++F D P + Sbjct: 169 GKTIYNFLDNEFYELRRKFEVLADFKIADPSEIVQHI-AQQTKLFKFESSDKNPAPSLNA 227 Query: 202 SQILSPRRWQ----DESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKL 257 ++ + E ++W Y E L+ G + + D K+ Sbjct: 228 RLVIETIENETLILKEDANMWMVYNAFNE-LLHGKIK--------------KTFDQQKKI 272 Query: 258 NRALWVM 264 ++ ++ + Sbjct: 273 DKEIFNL 279 >UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Z8_9PLAN Length = 327 Score = 48.1 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 77/209 (36%), Gaps = 38/209 (18%) Query: 85 LRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGE---VRVPH 141 +R++ + K ++L N+HDG+S+ ++ R VCQN L ++ + + H Sbjct: 128 IRVKNSDDLVDKF---LLLSNAHDGSSALRVYFTPIRVVCQNTLNLADNRSTGQGISILH 184 Query: 142 KGDVVSQVIEGAYEVLGIFERVEE----KRDAMQSLLLPPPVQQALAKAALTYRFGEDHQ 197 KG++ ++ I A VLG+ E + D + S +A ++ + G D+ Sbjct: 185 KGNLHTK-IREAQRVLGLAEEFYDEAEGIIDILASHHPSSVQVEAFFQSVIPDPIGADNA 243 Query: 198 PVTESQILSPRRWQDE---------SNDL-------WTTYQRIQENLIKGGLSGRNAKGG 241 R+ +D D+ W Y + E + R+ Sbjct: 244 --------RARKVRDRLTCLFETGIGQDMPEIKGTSWAAYNAVTE-FVDHHRPTRSTDPL 294 Query: 242 RTHTRAVRG--IDGDVKLNRALWVMAETL 268 +R + +L W +A + Sbjct: 295 ERASRRLDSSWFGSGARLKAKAWNLAFDM 323 >UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N225_9BACT Length = 241 Score = 47.3 bits (111), Expect = 6e-04, Method: Composition-based stats. Identities = 33/202 (16%), Positives = 69/202 (34%), Gaps = 16/202 (7%) Query: 37 KHESRSERYTYIPTISLLDSL----QREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQ 92 + + + +P ++D++ + +Q R+ +R M + R + Sbjct: 20 PTPAATASWKPVPHSEVIDAVTDVVRAHNWQILDEQYGLARNGQR------MFGVIRINR 73 Query: 93 ITGKQVPEII-LLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVV--SQV 149 + + I + NSHD T + + G+ VC N + G + ++ H + V Sbjct: 74 TSSSEWSRCIGICNSHDRTIAVGLAAGLNVQVCANLMFGGSTV--LKRRHTSRIELNGLV 131 Query: 150 IEGAYEVLGIFERVEEKRDAMQSLLLPPP-VQQALAKAALTYRFGEDHQPVTESQILSPR 208 +E + F +E + ++ + + A+ KAA + PR Sbjct: 132 VEAIDALEDDFLTLETVAEDLKIQFVRDDTARAAIVKAAEAGAVNSCDIVPIFREFKEPR 191 Query: 209 RWQDESNDLWTTYQRIQENLIK 230 + W EN K Sbjct: 192 YEEFAEPTRWALLNAFTENAKK 213 >UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZYJ5_DESOH Length = 308 Score = 46.5 bits (109), Expect = 9e-04, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 44/125 (35%), Gaps = 15/125 (12%) Query: 44 RYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV-PEII 102 +YT + +L+ L G+ P Q + + + R+ I G + P I Sbjct: 107 KYTPVDNFEILERLDSLGYGPDTKVQCSL---DAEFLSLSIPDGRKAFDINGDRFKPGIS 163 Query: 103 LLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFER 162 + NS G +S + + R VC NGL+ H + +L F + Sbjct: 164 ISNSEVGLASLTISAFVLRLVCTNGLIARTGI-SASYRHV----------STRILKEFPQ 212 Query: 163 VEEKR 167 E Sbjct: 213 TIETV 217 >UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q5L2_CATAD Length = 329 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 41/103 (39%), Gaps = 12/103 (11%) Query: 75 PRRREHTKHMLRLRREGQITGKQVPEIIL--LNSHDGTSSYQMLPGMFRAVCQN--GLVC 130 R+ +RL + G ++ + LNSHDGT +Y+++ R VC N L Sbjct: 127 EGRQVFVT--MRLPETMTVAGTDRLDLYISGLNSHDGTGAYKLIVTPIRIVCANTQSLAL 184 Query: 131 GESFGEVRVPHKGDVVSQVIEG------AYEVLGIFERVEEKR 167 + + H ++ E ++ + FE+ E+ Sbjct: 185 DRARSSFSIRHTESAKKKIAEARKALGLMFKYVEEFEKAAERM 227 >UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=Cyanobacteria RepID=B4VVD2_9CYAN Length = 336 Score = 44.2 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 30/144 (20%), Positives = 55/144 (38%), Gaps = 17/144 (11%) Query: 97 QVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESF--------GEVRVPHKGDVVSQ 148 P ++L NSHDG+++ + R VC N L F + +PH + Q Sbjct: 133 VRPYLLLHNSHDGSTAVWLQFTPVRVVCWNTLNGAARFRFGDLWQKKAICIPHSLSLTEQ 192 Query: 149 VIEGAYEVLGIFERVEEK-RDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTES---QI 204 + E + +L + ++ + + Q++ + LA R QP Q+ Sbjct: 193 L-EHIHNILDLTQKEFQYSVEEYQAMAHKELTTELLAD--YIGRVLGTTQPTLHPAWSQL 249 Query: 205 LS--PRRWQDESNDLWTTYQRIQE 226 ++ ++ LW Y I E Sbjct: 250 VANFESGRGNQGQTLWDAYNSITE 273 >UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax=Bacteroidetes RepID=C6W397_DYAFD Length = 350 Score = 44.2 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 7/95 (7%) Query: 101 IILLNSHDGTSSYQMLPGMFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIEGAYEVLG 158 + L SHDG+ S R VC N L V++ H + V ++ A++V+G Sbjct: 152 LFLTTSHDGSGSITAAFTPVRIVCANTLNAAMKNITNVVKIRHTSNAVERL-RTAHKVMG 210 Query: 159 IFER----VEEKRDAMQSLLLPPPVQQALAKAALT 189 I + VEE + + P + L + A+ Sbjct: 211 IANKFSHEVEEIFNHWAKKPITDPQLKKLIEIAMA 245 >UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKH6_9FLAO Length = 312 Score = 42.7 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 57/153 (37%), Gaps = 22/153 (14%) Query: 98 VPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVL 157 +P + NS+DG+ G +R VC NGL + E + H + ++ + Sbjct: 141 LPMLRFKNSYDGSEKTSGHFGFYREVCSNGLHVSLAEIEFSIKHSKNNTHLIMPRLNNLF 200 Query: 158 -----GIFERVEEKRDAMQSLLLPPPVQQALAKAAL----TYRFGEDHQPVTE----SQI 204 F + +K D M+ + Q KA L +R+ + ++ Sbjct: 201 DKFLDNEFYTITKKFDKMKEFKIID--TQEFVKAILDRTKLFRYECSDKNSDPSKKSREV 258 Query: 205 LSPRRWQ----DESNDLWT---TYQRIQENLIK 230 + ++ +E +LW + + N++K Sbjct: 259 IEILNYEALLLNEEPNLWLGYNAFNSVLHNVLK 291 >UniRef50_C2LEJ7 Putative uncharacterized protein n=1 Tax=Proteus mirabilis ATCC 29906 RepID=C2LEJ7_PROMI Length = 39 Score = 42.3 bits (98), Expect = 0.016, Method: Composition-based stats. Identities = 18/35 (51%), Positives = 21/35 (60%) Query: 239 KGGRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 K T T +V GID D KLN+ALWVM E +Q Sbjct: 2 KSKHTRTCSVNGIDSDSKLNKALWVMTEKCTNIIQ 36 >UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=Actinomycetales RepID=C4DCZ5_9ACTO Length = 395 Score = 42.3 bits (98), Expect = 0.019, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 42/113 (37%), Gaps = 6/113 (5%) Query: 43 ERYTYIPTISLLDSLQR--EGFQPFFACQTRVRDPRRREHTKHMLRLRREG--QITGKQV 98 +Y + + L+ E + + VR RR + + I Sbjct: 151 SKYHTVQNRECFEFLRNLVESYDVVWESAGAVRGGRRTFVSMRLPDTVTVDAAGINDTIT 210 Query: 99 PEIILLNSHDGTSSYQMLPGMFRAVCQNG--LVCGESFGEVRVPHKGDVVSQV 149 P +++ NSHDG+SS + +R VC N L ++ + H + Q+ Sbjct: 211 PFVVVFNSHDGSSSITAVVTPYRPVCANTERLALDNAYTSWSIRHTESAMHQM 263 >UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF46A9 Length = 348 Score = 41.9 bits (97), Expect = 0.026, Method: Composition-based stats. Identities = 31/178 (17%), Positives = 58/178 (32%), Gaps = 31/178 (17%) Query: 101 IILLNSHDGTSSYQMLPGMFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIEGAYEVLG 158 + LNSHDG+++++ L R VC N + + H G + + E + Sbjct: 163 LAALNSHDGSAAFRFLLSPIRIVCANTQSAAIRSAKSSFSIRHTGGARASIAEARNALKL 222 Query: 159 IFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRW-QDESNDL 217 + +E +L P + + A T V + + RR ++ +N + Sbjct: 223 SWRYIEAFEAEAAALYAAPMDTEEMRSFANTL------LEVDSAGTTATRRHRRERANSI 276 Query: 218 -----------------WTTYQRIQENL-----IKGGLSGRNAKGGRTHTRAVRGIDG 253 W Y + E L ++G + +A R G Sbjct: 277 VKLWTSSETIAPIAGTRWAAYNAVTEYLDHVVPVRGAKTATDASAARALRNITTAASG 334 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YF... 373 e-102 UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_E... 364 2e-99 UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID... 322 7e-87 UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legione... 302 1e-80 UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Glucona... 300 3e-80 UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Prote... 300 4e-80 UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodoba... 299 6e-80 UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylo... 298 1e-79 UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylo... 288 2e-76 UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus o... 276 7e-73 UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutu... 256 6e-67 UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I ... 196 6e-49 UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophu... 164 3e-39 UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostri... 163 6e-39 UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victiva... 151 2e-35 UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio ... 134 4e-30 UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Provide... 129 1e-28 UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 127 4e-28 UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavoba... 121 2e-26 UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfo... 116 8e-25 UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfo... 94 5e-18 UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 79 1e-13 UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular or... 69 2e-10 Sequences not found previously or not previously below threshold: UniRef50_B9E574 Putative uncharacterized protein n=5 Tax=Clostri... 101 3e-20 UniRef50_B8F9V3 Putative uncharacterized protein n=4 Tax=Deltapr... 80 8e-14 UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwen... 80 9e-14 UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=... 78 3e-13 UniRef50_Q024R3 Putative uncharacterized protein n=1 Tax=Candida... 77 6e-13 UniRef50_A1SIX8 Putative uncharacterized protein n=2 Tax=Nocardi... 76 2e-12 UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=... 75 3e-12 UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax... 75 3e-12 UniRef50_UPI00017465AE hypothetical protein VspiD_04485 n=2 Tax=... 72 2e-11 UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobac... 71 5e-11 UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax... 65 2e-09 UniRef50_B4CXI2 Putative uncharacterized protein n=1 Tax=Chthoni... 63 9e-09 UniRef50_Q5LU35 Putative uncharacterized protein n=1 Tax=Ruegeri... 62 1e-08 UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betapro... 62 2e-08 UniRef50_Q0RM54 Putative uncharacterized protein n=1 Tax=Frankia... 62 2e-08 UniRef50_A1WP45 Putative uncharacterized protein n=2 Tax=Comamon... 59 2e-07 UniRef50_UPI00016C3597 hypothetical protein GobsU_16407 n=1 Tax=... 57 5e-07 UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synecho... 57 5e-07 UniRef50_A8ZKZ6 Putative uncharacterized protein n=3 Tax=Cyanoba... 56 1e-06 UniRef50_Q19YQ9 Gp96 n=7 Tax=unclassified Siphoviridae RepID=Q19... 56 2e-06 UniRef50_B7I5L8 Phage/plasmid-related protein n=5 Tax=Moraxellac... 54 5e-06 UniRef50_Q5Y1B4 Putative uncharacterized protein n=1 Tax=uncultu... 53 9e-06 UniRef50_C5CKG6 Phage/plasmid-related protein TIGR03299 n=10 Tax... 53 1e-05 UniRef50_C4ZMQ9 Phage/plasmid-related protein TIGR03299 n=1 Tax=... 52 2e-05 UniRef50_C6RKU8 Phage/plasmid-related protein n=12 Tax=Acinetoba... 52 3e-05 UniRef50_C0VFU1 Putative uncharacterized protein n=4 Tax=Acineto... 51 5e-05 UniRef50_A6WZ56 Putative uncharacterized protein n=1 Tax=Ochroba... 50 8e-05 UniRef50_C4V5A4 Putative uncharacterized protein n=1 Tax=Selenom... 49 1e-04 UniRef50_Q2IFF9 Putative uncharacterized protein n=3 Tax=Anaerom... 49 2e-04 UniRef50_A8ZPY1 Putative uncharacterized protein n=5 Tax=Bacteri... 49 2e-04 UniRef50_B3VM79 Gp52 n=2 Tax=unclassified Siphoviridae RepID=B3V... 47 5e-04 UniRef50_C8X3A3 Putative uncharacterized protein n=1 Tax=Desulfo... 47 7e-04 UniRef50_A6SWN5 Uncharacterized conserved protein n=39 Tax=Prote... 46 0.001 UniRef50_Q18F79 Putative uncharacterized protein n=1 Tax=Haloqua... 46 0.001 UniRef50_A8L7W9 Putative uncharacterized protein n=4 Tax=Actinom... 44 0.006 UniRef50_B5LJ78 Gp67 n=1 Tax=Mycobacterium phage Myrna RepID=B5L... 44 0.008 UniRef50_B8KMK8 Putative uncharacterized protein n=1 Tax=gamma p... 43 0.012 UniRef50_C1D7A8 Putative uncharacterized protein n=1 Tax=Laribac... 41 0.048 >UniRef50_P52132 UPF0380 protein yfjQ n=153 Tax=Bacteria RepID=YFJQ_ECOLI Length = 273 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 273/273 (100%), Positives = 273/273 (100%) Query: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE Sbjct: 1 MTRLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQRE 60 Query: 61 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF Sbjct: 61 GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMF 120 Query: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ Sbjct: 121 RAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 Query: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG Sbjct: 181 QALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKG 240 Query: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ Sbjct: 241 GRTHTRAVRGIDGDVKLNRALWVMAETLLTQLQ 273 >UniRef50_P18005 UPF0380 protein yubP n=179 Tax=root RepID=YUBP_ECOLI Length = 273 Score = 364 bits (933), Expect = 2e-99, Method: Composition-based stats. Identities = 177/267 (66%), Positives = 220/267 (82%), Gaps = 1/267 (0%) Query: 3 RLASRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 RLASRFG N I R+RPLT +EL + VPSVFS DKHESRSERYTYIPTI++++ L+ EGF Sbjct: 2 RLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTINIINKLRDEGF 61 Query: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 QPFFACQ+RVRD RRE++KHMLRLRREG I G++VPEIILLNSHDG+SSYQM+PG+FR Sbjct: 62 QPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEIILLNSHDGSSSYQMIPGIFRF 121 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 VC NGLVCG +FGE+RVPHKGD+V QVIEGAYEVLG+F++V + +AM+ + L Q Sbjct: 122 VCTNGLVCGNNFGEIRVPHKGDIVGQVIEGAYEVLGVFDKVTDNMEAMKEIHLNSDEQHL 181 Query: 183 LAKAALTYRF-GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGG 241 +AAL R+ E+ PVT QI++PRRW+D+ NDLWTT+QR+QEN+IKGGLSGR+A G Sbjct: 182 FGRAALMVRYEDENKTPVTPEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGK 241 Query: 242 RTHTRAVRGIDGDVKLNRALWVMAETL 268 T TRA+ GIDGD+++N+ALWV+AE Sbjct: 242 NTRTRAITGIDGDIRINKALWVIAEQF 268 >UniRef50_Q1ND23 CP4-6 prophage n=4 Tax=Alphaproteobacteria RepID=Q1ND23_9SPHN Length = 281 Score = 322 bits (825), Expect = 7e-87, Method: Composition-based stats. Identities = 130/270 (48%), Positives = 181/270 (67%), Gaps = 5/270 (1%) Query: 4 LASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGF 62 LA+RFG ++ I PL E L+R VPS+F+ + H+SRSERY Y+PTI +++ L+REG+ Sbjct: 6 LATRFGRNSHQIGGYEPLDNEALYRHVPSIFAREAHDSRSERYVYVPTIDIVEGLRREGW 65 Query: 63 QPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 PFFA Q+ RD R H KHMLRLRRE + + E I++NSHDGTS++Q+ GM R Sbjct: 66 FPFFAVQSVPRDGNRHGHAKHMLRLRREDGVGKSEAAEAIIVNSHDGTSAFQLFAGMLRF 125 Query: 123 VCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQA 182 VC N ++ GE F EVRVPHKG++ +IEG Y V F R+ + + M+ + L Q+ Sbjct: 126 VCTNSMIAGERFEEVRVPHKGNIEHDIIEGVYTVAEDFPRLIDASETMKGVRLSEDEQRL 185 Query: 183 LAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG--RNAKG 240 L + +L R+GED P+T QI+ PRR++D + LWTT+ IQEN+I+GGL G RNA+G Sbjct: 186 LGEVSLVARYGEDESPLTPEQIIEPRRYEDRGDSLWTTFNVIQENVIRGGLHGRKRNAEG 245 Query: 241 --GRTHTRAVRGIDGDVKLNRALWVMAETL 268 R+ +R + GID +V LNRALW +AE + Sbjct: 246 RIRRSRSRPINGIDQNVTLNRALWTLAEGM 275 >UniRef50_C6N6D1 Putative uncharacterized protein n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6D1_9GAMM Length = 275 Score = 302 bits (772), Expect = 1e-80, Method: Composition-based stats. Identities = 103/268 (38%), Positives = 150/268 (55%), Gaps = 11/268 (4%) Query: 13 LIRRDRP----LTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFAC 68 LI P LT E+L++ PS+F+ SERY I T ++D L +EGF P A Sbjct: 5 LIESGEPAMNVLTIEQLYKAAPSLFTRGAAVHTSERYQPIATSDVIDRLLQEGFYPTKAT 64 Query: 69 QTRVRDPRRREHTKHMLRLRREG-QITGK-QVPEIILLNSHDGTSSYQMLPGMFRAVCQN 126 Q+ R ++ +KH++R R G PE++L+NSHDG SSY+++ G++R VC N Sbjct: 65 QSASRSEEKKVFSKHLVRFRHRDYHNPGNGLFPELVLINSHDGLSSYRLMAGLYRQVCTN 124 Query: 127 GLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKA 186 GLV G+S+ EVRV H+GDV+ VIEG Y V+ +++ + + M LP + Sbjct: 125 GLVAGKSYDEVRVKHQGDVIGNVIEGTYRVIESSQKMLQVVEQMGDCALPDEKLLEFSAQ 184 Query: 187 ALTYRFGEDHQPV-TESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG----RNAKGG 241 A RF ED V +L PRR +D DL++ + +QENLIKGG+ G + + Sbjct: 185 AHALRFSEDANLVIEPKNLLVPRRREDMKRDLFSVFNVVQENLIKGGVLGYRLNEHGRWR 244 Query: 242 RTHTRAVRGIDGDVKLNRALWVMAETLL 269 R +R + ID +VK+NR LW +AE L Sbjct: 245 RARSRKITSIDQNVKINRDLWTIAENTL 272 >UniRef50_A9HST1 Putative uncharacterized protein n=1 Tax=Gluconacetobacter diazotrophicus PAl 5 RepID=A9HST1_GLUDA Length = 282 Score = 300 bits (768), Expect = 3e-80, Method: Composition-based stats. Identities = 111/276 (40%), Positives = 156/276 (56%), Gaps = 13/276 (4%) Query: 6 SRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPF 65 S + R +PLT E+L R+ PS+F+E KHESRS+RYTYIPTI ++ L+ EGF P Sbjct: 2 SFLSRVSAHRHAQPLTDEQLQRLAPSIFAEAKHESRSDRYTYIPTIEVVRGLRSEGFFPV 61 Query: 66 FACQTRVRDPRRREHTKHMLRLRREGQ-----ITGKQVPEIILLNSHDGTSSYQMLPGMF 120 A Q R P + E+TKH++R R G PE+ LLNSHDGTS+Y+++ M Sbjct: 62 MARQGNSRIPGKAEYTKHLIRFRHMDHGPMYENLGDLYPEVALLNSHDGTSAYKIIAAMM 121 Query: 121 RAVCQNGLVCGESF-GEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPV 179 R C+NG+V ++ E+ VPHKG V +VIEG+Y VL + E L Sbjct: 122 RLACENGMVVQDARLAEISVPHKGTVTDKVIEGSYTVLDESRKALEIAGEWSGKTLTERQ 181 Query: 180 QQALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSG-- 235 Q+ A+A ++G+D + P T L RR D+ DLW R+QE+ I+GG++G Sbjct: 182 QKGFAEAVHIAKYGDDAERMPFTPESYLRTRRAADQGADLWRVANRVQESAIRGGMTGFR 241 Query: 236 ---RNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 R R V+ IDGD+KLN+A+W +A+ L Sbjct: 242 WDEDGRNRKRVTARPVKSIDGDIKLNKAVWHLAQML 277 >UniRef50_B9JPN2 Phosphoribosylamine-glycine ligase n=2 Tax=Proteobacteria RepID=B9JPN2_AGRRK Length = 391 Score = 300 bits (768), Expect = 4e-80, Method: Composition-based stats. Identities = 109/276 (39%), Positives = 156/276 (56%), Gaps = 13/276 (4%) Query: 6 SRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPF 65 + + R +T E+++V PS+F+ HESRS+R+ IPTI +L L EGF P Sbjct: 109 TIYTETARFDTARTMTETEMWKVAPSIFATTAHESRSDRFKPIPTIEVLRGLMAEGFVPV 168 Query: 66 FACQTRVRDPRRREHTKHMLRLRREGQ----ITGKQVPEIILLNSHDGTSSYQMLPGMFR 121 A Q+ R + + TKH++RLRR G V EI+L N++DGTS+Y++L G+FR Sbjct: 169 GAKQSASRTEGKADFTKHLIRLRRVDDGKTYRVGDTVCEILLKNANDGTSAYELLAGLFR 228 Query: 122 AVCQNGLVCGE-SFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 C N LV + ++V H GDV ++VIEG Y VL ER + L Q Sbjct: 229 IRCMNSLVTQTGTIDAIKVRHSGDVSAKVIEGTYRVLNEAERTLVAPQDWATHKLNRDEQ 288 Query: 181 QALAKAALTYRFGEDH----QPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR 236 Q +A+AA RFG++ P+ Q+L PRR D ++DLWT + QEN+I+GGL G Sbjct: 289 QIMAEAAHVLRFGDNDGETKTPIKPEQLLLPRRHDDRADDLWTVWNVTQENVIRGGLRGI 348 Query: 237 N----AKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 + R +RAV GID D+KLN+ALW++ E + Sbjct: 349 GREDLGRPRRVKSRAVNGIDQDIKLNKALWLIGEKM 384 >UniRef50_A4X0R7 Putative uncharacterized protein n=2 Tax=Rhodobacter sphaeroides RepID=A4X0R7_RHOS5 Length = 316 Score = 299 bits (766), Expect = 6e-80, Method: Composition-based stats. Identities = 118/266 (44%), Positives = 166/266 (62%), Gaps = 9/266 (3%) Query: 12 NLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTR 71 ++ R PLT EL VPS+F+ + HESRS R+ +PTI++LD L+ EGF+PFFA Q R Sbjct: 44 SIFSRGEPLTNAELHARVPSIFATEAHESRSARFAPVPTITVLDGLRAEGFEPFFAQQAR 103 Query: 72 VRDPRRREHTKHMLRLRREGQIT-GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVC 130 R + E TKHMLRLR G + + EI+L+N++DGTS+YQM+PG FR VC NGL+ Sbjct: 104 TRIEGKAEFTKHMLRLRHRGIVNEAGEAFEIVLVNANDGTSAYQMIPGFFRFVCANGLMA 163 Query: 131 GESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTY 190 GE+F EV+V H G+ + +VIEGAY VL RV ++ +S+ L ++ LA+AA + Sbjct: 164 GETFEEVKVRHSGNAIGEVIEGAYRVLEDAPRVADQVQRFKSIRLQDREREILAEAAHSL 223 Query: 191 RFGEDHQ----PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR----NAKGGR 242 RF + P+ +L PRR +D + DLWT + +QEN ++GG+ GR + R Sbjct: 224 RFPATAEGKAAPIDPPALLRPRRSEDRATDLWTAFNVVQENTLRGGMRGRIETDSGFIRR 283 Query: 243 THTRAVRGIDGDVKLNRALWVMAETL 268 R V GID LNRALW++ E + Sbjct: 284 QTVREVTGIDQSRALNRALWMLTERM 309 >UniRef50_C6RFJ3 Putative uncharacterized protein n=1 Tax=Campylobacter showae RM3277 RepID=C6RFJ3_9PROT Length = 271 Score = 298 bits (762), Expect = 1e-79, Method: Composition-based stats. Identities = 84/257 (32%), Positives = 139/257 (54%), Gaps = 5/257 (1%) Query: 16 RDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDP 75 + PLT E+L ++ PS+F+++ + S++Y +I TI +++ ++ + P + VRD Sbjct: 4 SNEPLTNEQLEQLAPSLFADEPYFEASDKYHFISTIDVINEIRDYAWYPVGVSEASVRDE 63 Query: 76 RRREHTKHMLRLRREGQ--ITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-E 132 ++ KH +R R G+ V E++L NSHD + + + G+FR VC NGLV E Sbjct: 64 KKEGFQKHYVRFRHLDDFLNPGENVVELLLFNSHDRSKCFSISAGVFRFVCANGLVVSDE 123 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRF 192 F ++ H GD + V ++ + K + L + + AKAA+ RF Sbjct: 124 VFESYQIKHLGDKENDVSIAINKIAKAKYDILNKIKLFSKIPLTQDDKASFAKAAIPLRF 183 Query: 193 GEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRT-HTRAVRGI 251 E H V +L P R +DE +DL+TT+ IQE+LI+G +SG NA+ R +R ++ I Sbjct: 184 -EKHLKVDYRDLLVPHRIEDEKDDLYTTFNTIQEHLIRGNISGINAETNRRFTSRIIKSI 242 Query: 252 DGDVKLNRALWVMAETL 268 D +N+ LW MAE++ Sbjct: 243 STDTDINKKLWNMAESI 259 >UniRef50_B8IVF8 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVF8_METNO Length = 295 Score = 288 bits (736), Expect = 2e-76, Method: Composition-based stats. Identities = 125/276 (45%), Positives = 171/276 (61%), Gaps = 13/276 (4%) Query: 6 SRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQP 64 +RFG+ A ++R + L L P+VF+EDKH SRS++YTYIPT+ +L L REGF P Sbjct: 14 TRFGSGAVVVRNNGGLDEAALRSAAPTVFAEDKHSSRSDKYTYIPTVEVLRGLGREGFLP 73 Query: 65 FFACQTRVRDPRRREHTKHMLRLRREGQIT---GKQVPEIILLNSHDGTSSYQMLPGMFR 121 RD +R +TKH+LRLRR G G E++LLNSHDGTSSYQ++ G+FR Sbjct: 74 VEVRVGGTRDEEKRGYTKHLLRLRRMGDAPTRVGDSSRELVLLNSHDGTSSYQLMSGLFR 133 Query: 122 AVCQNGLVCGESFGEV-RVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 +C NGLVC + ++ ++PHKGD+V QVI+GAY ++ E V+ M+ + L P Q Sbjct: 134 LICSNGLVCADGDAQILKIPHKGDIVQQVIDGAYRIVDASEEVDRIAAEMKQIELRPAEQ 193 Query: 181 QALAKAALTYRFGEDHQ--PVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS--GR 236 A A+AA R+ + Q PV QI +PRR +D N LW + R QE LI+GG+ R Sbjct: 194 DAFAEAAAELRWNGEGQRVPVEPRQIHAPRRREDVGNSLWLAFNRTQEGLIRGGIDYQQR 253 Query: 237 NAKGG----RTHTRAVRGIDGDVKLNRALWVMAETL 268 N + G R TR V+G+DG+ LNRALWV+A + Sbjct: 254 NPETGRLIARRQTRPVQGVDGNTALNRALWVLANRM 289 >UniRef50_B6C6K7 Conserved domain protein n=2 Tax=Nitrosococcus oceani RepID=B6C6K7_9GAMM Length = 226 Score = 276 bits (705), Expect = 7e-73, Method: Composition-based stats. Identities = 97/220 (44%), Positives = 147/220 (66%), Gaps = 4/220 (1%) Query: 53 LLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREG---QITGKQVPEIILLNSHDG 109 ++++L+REG+ P A ++RVR P R+ +KH+LR RR + G PEI+L+NSHDG Sbjct: 1 MIEALEREGWSPVHAEESRVRIPDRKGFSKHLLRFRRFDNELPMVGDSFPEIVLVNSHDG 60 Query: 110 TSSYQMLPGMFRAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRD 168 + +YQ+ G+FR VC NG++ +S G+V+ H GDVV +VIEG YE++ R+ + + Sbjct: 61 SCAYQLHAGLFRLVCSNGMIVADSNMGQVKRRHTGDVVREVIEGTYEIVEELPRIAARVE 120 Query: 169 AMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENL 228 ++L L Q+ A++AL R+ E P +L PRR +D+ NDLW TYQR+QEN+ Sbjct: 121 DFKTLELSLQEQEIFAESALRVRWREGEAPCMPQALLRPRRHEDQGNDLWATYQRVQENM 180 Query: 229 IKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 +KGG+ GR+A G + TRAV+ +DG+VKLN+ALW + E + Sbjct: 181 LKGGIRGRSAVGRQITTRAVKSVDGNVKLNKALWFLTEQM 220 >UniRef50_B1ZQ12 Putative uncharacterized protein n=3 Tax=Opitutus terrae PB90-1 RepID=B1ZQ12_OPITP Length = 288 Score = 256 bits (654), Expect = 6e-67, Method: Composition-based stats. Identities = 96/285 (33%), Positives = 141/285 (49%), Gaps = 23/285 (8%) Query: 6 SRFGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPF 65 + A R R L+ ++L RV PSVF+E S RYT++ T ++D L+ EG++P Sbjct: 2 TTIETAPSSRVFRALSLDDLRRVAPSVFAEQARPGVSSRYTFVSTAQVVDLLRGEGWEPV 61 Query: 66 FACQTRVRDPRRREHTKHMLRLRREGQI------TGKQVPEIILLNSHDGTSSYQMLPGM 119 A Q RVR R+ H LR R + G PE+IL N+HDGT +Y++ G+ Sbjct: 62 KANQQRVRLENRQGFQMHELRFARRADLENASFAIGDVRPELILQNAHDGTRAYRIDAGL 121 Query: 120 FRAVCQNGLVCGES-FGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPP 178 +R VC+NGL ++ F V + H + A V RV E Q++ L P Sbjct: 122 YRLVCRNGLTVADADFAHVAIRHVDVSAEKFAAAAQAVAENTPRVMEVIARWQAVALTPL 181 Query: 179 VQQALAKAALTYRFGEDHQPVT----ESQILSPRRWQDESNDLWTTYQRIQENLIKGGLS 234 + + A A+ R+ + QPVT Q+L+P R+ D++ DLWTT+ +QE L +GGL Sbjct: 182 ARHSFAARAMALRW-DSAQPVTRLLRPDQLLAPARYGDQATDLWTTFNVVQERLCRGGLR 240 Query: 235 GRNAKGG-----------RTHTRAVRGIDGDVKLNRALWVMAETL 268 R TR V G+ +LN+ALW +AE Sbjct: 241 YAGHIPAAEGAVFPTHYLRNTTRPVGGLTEGQRLNKALWNLAEEF 285 >UniRef50_Q17W97 Putative uncharacterized protein Hac prophage I orf7 n=1 Tax=Helicobacter acinonychis str. Sheeba RepID=Q17W97_HELAH Length = 176 Score = 196 bits (498), Expect = 6e-49, Method: Composition-based stats. Identities = 42/172 (24%), Positives = 84/172 (48%), Gaps = 3/172 (1%) Query: 16 RDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDP 75 +PL+ EL R+ PS+F+ + + S++Y +I TI +++ ++ + P + VR+ Sbjct: 4 STQPLSNNELKRLAPSLFTAEPYYEASDKYHFISTIDIIEEIRFHAWYPVAVSEASVRNE 63 Query: 76 RRREHTKHMLRLRREGQI--TGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-E 132 + + +H +R R + E++L NSHD + + + G+FR VC NGLV E Sbjct: 64 DKEGYQQHYVRFRYLDDFLRPSENCVELLLFNSHDRSKCFTISAGVFRFVCANGLVVADE 123 Query: 133 SFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALA 184 F ++ H G+ + V ++ +++ +K + L + + A Sbjct: 124 VFESYQIKHIGEKANGVAVAIPSIVQAKDKIMDKISTFSQITLTEQDKISFA 175 >UniRef50_Q2LV02 Hypothetical cytosolic protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LV02_SYNAS Length = 264 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 49/274 (17%), Positives = 93/274 (33%), Gaps = 57/274 (20%) Query: 13 LIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDS--------LQREGFQP 64 + R +T+++L + ++ Y + L D L+ Sbjct: 6 MHRGGELVTKDQLDLI--------PLPEPTDSYMPVSHYDLADKFLMISQDILRDYKL-- 55 Query: 65 FFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAVC 124 R + +L+ +RE G I NS+D + + + G VC Sbjct: 56 VGENYGIAR-QGNQFFA--VLKFQRERSEIG---LSIAFRNSYDRSMAIGLAIGASVFVC 109 Query: 125 QNGLVCGESFGEVRVPHKGDV----VSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQ 180 N + GE V H +V + I Y+ ++++ DA +S LP Sbjct: 110 DNLALSGEIV--VMKKHTKNVWSELEEKAIATIYKSQNNYDQLIGDVDAFKS--LPVDDN 165 Query: 181 QALAKAALTYRFGEDHQPVTESQI-------LSPRRWQDESNDLWTTYQRIQENLIKGGL 233 A A+ FG + ++ Q+ L P + E +LW+ Y E+L Sbjct: 166 GAF--QAMGLLFG--NNIISPRQLTVLKEEWLKPSHEEFEPRNLWSFYNAATESL----- 216 Query: 234 SGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAET 267 + V ++ ++L+ AL + + Sbjct: 217 ---------KSSPPVTIMEKHIRLHEALTYLGKE 241 >UniRef50_A8RIH4 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A8RIH4_9CLOT Length = 312 Score = 163 bits (412), Expect = 6e-39, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 86/245 (35%), Gaps = 33/245 (13%) Query: 42 SERYTYIPTISLL---DSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV 98 ++RY + D L EG RR + +L + I+G ++ Sbjct: 76 TDRYKVVQNEDAFAFTDQLLGEG---VTYETAGSLQNGRRTWL--LAKLPQRYIISGDEI 130 Query: 99 -PEIILLNSHDGTSSYQMLPGMFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 P ++ +N+HDGT + ++ R VC N L + H GD+ ++ + Y Sbjct: 131 TPYMVFMNTHDGTGAIRVAMTPVRVVCMNTLNLALSTAKRSWSTNHTGDIAGKMEDARYT 190 Query: 156 VLGIFERVEE---KRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQD 212 +L + E D M+ L L A P + Q R +D Sbjct: 191 LLYADRYMSELGKAIDHMKRLRLSERQVMEYIDALFPLY----DNPTPQQQKNLNRMKED 246 Query: 213 ESNDLWTTYQRIQENLIKGGLSGRNAKGG-RTHTRAV------------RGIDGDVKLNR 259 + +++ K G NA TH R + + ++G+ ++R Sbjct: 247 MKTRYFDAPDL--KHVGKNGYRFINAVSDFATHARPLRESANHKENLFAKTVEGNALIDR 304 Query: 260 ALWVM 264 A ++ Sbjct: 305 AFAML 309 >UniRef50_D1N225 Putative uncharacterized protein n=1 Tax=Victivallis vadensis ATCC BAA-548 RepID=D1N225_9BACT Length = 241 Score = 151 bits (381), Expect = 2e-35, Method: Composition-based stats. Identities = 40/229 (17%), Positives = 80/229 (34%), Gaps = 18/229 (7%) Query: 34 SEDKHESRSERYTYIPTISLLDSL----QREGFQPFFACQTRVRDPRRREHTKHMLRLRR 89 + + + + +P ++D++ + +Q R+ +R M + R Sbjct: 17 AMVPTPAATASWKPVPHSEVIDAVTDVVRAHNWQILDEQYGLARNGQR------MFGVIR 70 Query: 90 EGQITGKQVPEII-LLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVV-- 146 + + + I + NSHD T + + G+ VC N + G + ++ H + Sbjct: 71 INRTSSSEWSRCIGICNSHDRTIAVGLAAGLNVQVCANLMFGGSTV--LKRRHTSRIELN 128 Query: 147 SQVIEGAYEVLGIFERVEEKRDAMQSLLLPPP-VQQALAKAALTYRFGEDHQPVTESQIL 205 V+E + F +E + ++ + + A+ KAA + Sbjct: 129 GLVVEAIDALEDDFLTLETVAEDLKIQFVRDDTARAAIVKAAEAGAVNSCDIVPIFREFK 188 Query: 206 SPRRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGD 254 PR + W EN K R + R TR + G+DG Sbjct: 189 EPRYEEFAEPTRWALLNAFTENAKKYS-PARADQCYRGLTR-LFGLDGQ 235 >UniRef50_B5EW31 Putative uncharacterized protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5EW31_VIBFM Length = 318 Score = 134 bits (336), Expect = 4e-30, Method: Composition-based stats. Identities = 45/241 (18%), Positives = 84/241 (34%), Gaps = 29/241 (12%) Query: 43 ERYTYIPTISLLDSL--------------QREGFQPFFACQTRVRDPRRREHTKHMLRLR 88 RYT + DS+ F + R+ HM+++ Sbjct: 76 SRYTLLKNSDAFDSVNAAVNTLAENGVLNMDGAFIKDAVVNKGGKVIRQYFFPAHMVQI- 134 Query: 89 REGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVV-S 147 K + ++++NS+DG+ ++Q+ G FR VC NG++ GE F + V H G + Sbjct: 135 ---GSGDKVILRLVVVNSYDGSCNFQVQAGGFRIVCTNGMITGEKFLSLDVRHTGTMNFG 191 Query: 148 QVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSP 207 QV + FE + + D + + P+ + A +T + + Sbjct: 192 QVTRQVTTAVSSFENMGQYWDTL----INSPLNRKDADKIITDMSTVGRELNMNKFDMFD 247 Query: 208 RRWQDESNDL----WTTYQRIQENLIKGGLSGRN-AKGGRTHTRAVRGIDGDVKLNRALW 262 R + D L W Y + ++ N + + I + A+W Sbjct: 248 RLYTDHKKTLGENHWAMYNSLTAWATHYKVNESNISNIDNVRLEREKSI-QHLMRKPAIW 306 Query: 263 V 263 Sbjct: 307 N 307 >UniRef50_B2Q5G8 Putative uncharacterized protein n=3 Tax=Providencia RepID=B2Q5G8_PROST Length = 122 Score = 129 bits (323), Expect = 1e-28, Method: Composition-based stats. Identities = 81/112 (72%), Positives = 96/112 (85%) Query: 157 LGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESND 216 + F+ V EKR+ MQSLLLPPP QQALA+AALTYRFGE+HQP+TE Q+L PRRW+D+ +D Sbjct: 1 METFDTVAEKREQMQSLLLPPPAQQALAQAALTYRFGEEHQPITEEQVLQPRRWEDKKDD 60 Query: 217 LWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAETL 268 LWT YQR+QENLIKGGLSGRNAKG R TR+V GIDGD+KLN+ALWVM E + Sbjct: 61 LWTVYQRLQENLIKGGLSGRNAKGKRARTRSVNGIDGDIKLNKALWVMTEKM 112 >UniRef50_D2R5Z8 Phage/plasmid-related protein TIGR03299 n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5Z8_9PLAN Length = 327 Score = 127 bits (319), Expect = 4e-28, Method: Composition-based stats. Identities = 43/256 (16%), Positives = 85/256 (33%), Gaps = 47/256 (18%) Query: 45 YTYIPTISLL---DSLQREGFQPFFACQTRVRDPRRREHTK----HMLRLRREGQITGKQ 97 Y + D++ +G R +R++ + K Sbjct: 83 YVPVQNRQAFGFLDAVVADG--SLRYHTAGALGKGERIWLLAKLPSQIRVKNSDDLVDKF 140 Query: 98 VPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGE---VRVPHKGDVVSQVIEGAY 154 ++L N+HDG+S+ ++ R VCQN L ++ + + HKG++ ++ I A Sbjct: 141 ---LLLSNAHDGSSALRVYFTPIRVVCQNTLNLADNRSTGQGISILHKGNLHTK-IREAQ 196 Query: 155 EVLGIFERVEE----KRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRW 210 VLG+ E + D + S +A ++ + G D+ R+ Sbjct: 197 RVLGLAEEFYDEAEGIIDILASHHPSSVQVEAFFQSVIPDPIGADNA--------RARKV 248 Query: 211 QDE---------SNDL-------WTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRG--ID 252 +D D+ W Y + E + R+ +R + Sbjct: 249 RDRLTCLFETGIGQDMPEIKGTSWAAYNAVTE-FVDHHRPTRSTDPLERASRRLDSSWFG 307 Query: 253 GDVKLNRALWVMAETL 268 +L W +A + Sbjct: 308 SGARLKAKAWNLAFDM 323 >UniRef50_A6GXR9 Putative uncharacterized protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GXR9_FLAPJ Length = 285 Score = 121 bits (304), Expect = 2e-26, Method: Composition-based stats. Identities = 41/244 (16%), Positives = 78/244 (31%), Gaps = 37/244 (15%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQIT-----GK 96 S+ Y ++P ++ TR + R + + K Sbjct: 52 SKSYGHLPNEDFFYKVEEMLINSDINYITRSINRDNRSFAVDYILNDDNFSVNIKNGLDK 111 Query: 97 QVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVI----EG 152 P + NS+DG+ G FR VC NGL + + H+G++ V+ + Sbjct: 112 IRPMLRFTNSYDGSCKTSGTFGFFREVCSNGLHTASTDIGFSLKHRGNINELVLPAIGKT 171 Query: 153 AYEVLG----IFERVEEKRDAMQSLLLPPPVQQALAKAALTYRF-GEDHQP---VTESQI 204 Y L R E + VQ A+ ++F D P + + Sbjct: 172 IYNFLDNEFYELRRKFEVLADFKIADPSEIVQHI-AQQTKLFKFESSDKNPAPSLNARLV 230 Query: 205 LSPRRWQ----DESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRA 260 + + E ++W Y E L+ G + + D K+++ Sbjct: 231 IETIENETLILKEDANMWMVYNAFNE-LLHGKIK--------------KTFDQQKKIDKE 275 Query: 261 LWVM 264 ++ + Sbjct: 276 IFNL 279 >UniRef50_A8ZS75 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZS75_DESOH Length = 318 Score = 116 bits (290), Expect = 8e-25, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 65/206 (31%), Gaps = 26/206 (12%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRRE---------GQ 92 +ERY + + +L L R GF P Q + D ++R+ Sbjct: 108 TERYKPLDNMDVLSQLLRHGFDPDTQVQYAIDDG------MFLVRIPEYARAFGVNPGYG 161 Query: 93 ITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGD-VVSQVIE 151 + VP + NS G ++ + +R VC NGL+ S R H + + E Sbjct: 162 KLDEIVPGVSFANSEVGLLAFSIEAFFYRLVCTNGLISKTSSTFSRFKHISNRGLENFPE 221 Query: 152 GAYEVLGIFERVEEKRDAMQSLLLPPPVQ--QALAKAALTYRFGEDHQPVTESQILSPRR 209 V+ R +E+ + + P++ + A+ E + Sbjct: 222 TIAGVIEDSVRKQEQFKLSRQSPVENPIRSIETFARQ-FGLAHLETEVVCKAYLL----- 275 Query: 210 WQDESNDLWTTYQRIQENLIKGGLSG 235 ++ ++ L Sbjct: 276 --EQGATMFHIINAFTRAAQDKHLDT 299 >UniRef50_B9E574 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=B9E574_CLOK1 Length = 325 Score = 101 bits (251), Expect = 3e-20, Method: Composition-based stats. Identities = 42/241 (17%), Positives = 75/241 (31%), Gaps = 33/241 (13%) Query: 42 SERYTYIPTISLL---DSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV 98 ++RY + DSL EG R+ + +L + +I +V Sbjct: 88 TDRYKIVQNKEAFSFTDSLIGEG---CKYETAGSLQNGRKVWL--LAKLPDKYKILDDEV 142 Query: 99 -PEIILLNSHDGTSSYQMLPGMFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 P ++ NSHDGT + ++ R VC N L + H G++ S++ E Sbjct: 143 TPYMVFSNSHDGTGAIKVAMTPIRVVCNNTLNLALSNAKRIWSTIHTGNISSKLNEAMKT 202 Query: 156 VL--GIFERVEEKRDA-MQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQD 212 +L + + + + + L + D Sbjct: 203 LLLAESYMENLDYEAHYLSRKTISDEKVLEFIELLLPLPDNASKT----QEKNINLLRDD 258 Query: 213 ESNDLWTTYQRIQENLIKGGLSGRNAKGG-RTHTRAV------------RGIDGDVKLNR 259 + I +L K NA TH + + IDG+ ++R Sbjct: 259 MKLRYFDAPDLI--DLPKTSWRFVNAVSDFATHINPLRKTKNYKENLFSKTIDGNPLIDR 316 Query: 260 A 260 A Sbjct: 317 A 317 >UniRef50_A8ZYJ5 Putative uncharacterized protein n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZYJ5_DESOH Length = 308 Score = 94.1 bits (232), Expect = 5e-18, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 44/128 (34%), Gaps = 15/128 (11%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV-PE 100 + +YT + +L+ L G+ P Q + + + R+ I G + P Sbjct: 105 TPKYTPVDNFEILERLDSLGYGPDTKVQCSL---DAEFLSLSIPDGRKAFDINGDRFKPG 161 Query: 101 IILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIF 160 I + NS G +S + + R VC NGL+ H +L F Sbjct: 162 ISISNSEVGLASLTISAFVLRLVCTNGLIARTGI-SASYRHVST----------RILKEF 210 Query: 161 ERVEEKRD 168 + E Sbjct: 211 PQTIETVS 218 >UniRef50_B8F9V3 Putative uncharacterized protein n=4 Tax=Deltaproteobacteria RepID=B8F9V3_DESAA Length = 311 Score = 80.2 bits (196), Expect = 8e-14, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 64/215 (29%), Gaps = 15/215 (6%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREG--QITGKQVP 99 + RY + I +++ L++ GF Q + + + + K P Sbjct: 105 TPRYQPVDNIRVMERLEQMGFGHDMEIQLAL---DAEFFSLSIPDHEKTFAVGNDDKLTP 161 Query: 100 EIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKG-DVVSQVIEGAYEVLG 158 I + NS G ++ + + R VC NGL+ + H V+ E +V G Sbjct: 162 GITVCNSEVGRAALSIAAFVLRLVCTNGLIAKTAV-SASYRHISAKVMEVFPETLQQVAG 220 Query: 159 IFERVEEKRDAMQSLLLPPPVQ--QALAKAALTYRFGEDHQPVTESQILSPRRWQDESND 216 + + + + P + + + Q + Sbjct: 221 ELDVQQTRFRLSMESQVENPSNTIHSFNRQFMLAEPEVQAVDWAYPQEME------LPAT 274 Query: 217 LWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGI 251 ++ + GL + A+ G+ Sbjct: 275 MFNVVNTYTKASQAPGLPAESCHRLGRVGGAILGM 309 >UniRef50_A3XKH6 Putative uncharacterized protein n=2 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XKH6_9FLAO Length = 312 Score = 79.8 bits (195), Expect = 9e-14, Method: Composition-based stats. Identities = 34/242 (14%), Positives = 79/242 (32%), Gaps = 38/242 (15%) Query: 42 SERYTYIPT---ISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGK-- 96 S Y +IP + + + F + R T ++ + + + Sbjct: 81 SNSYGHIPNQLFFKKAEEMLTDAQLNFHKRT--INKNDRSFITDFIIDDKSQFTVKNDKD 138 Query: 97 -QVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 +P + NS+DG+ G +R VC NGL + E + H + ++ Sbjct: 139 LILPMLRFKNSYDGSEKTSGHFGFYREVCSNGLHVSLAEIEFSIKHSKNNTHLIMPRLNN 198 Query: 156 VLG-----IFERVEEKRDAMQSLLL--PPPVQQALAKAALTYRFGEDHQPVTE----SQI 204 + F + +K D M+ + +A+ +R+ + ++ Sbjct: 199 LFDKFLDNEFYTITKKFDKMKEFKIIDTQEFVKAILDRTKLFRYECSDKNSDPSKKSREV 258 Query: 205 LSPRRWQ----DESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRA 260 + ++ +E +LW Y +++ L + +L++ Sbjct: 259 IEILNYEALLLNEEPNLWLGYNAFN-SVLHNVLK--------------KSFGQQERLDKK 303 Query: 261 LW 262 L+ Sbjct: 304 LF 305 >UniRef50_C7Q5L2 Phage/plasmid-related protein TIGR03299 n=1 Tax=Catenulispora acidiphila DSM 44928 RepID=C7Q5L2_CATAD Length = 329 Score = 79.4 bits (194), Expect = 1e-13, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 17/136 (12%) Query: 45 YTYIPTIS---LLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEI 101 YT + ++++L F R+ +RL + G ++ Sbjct: 96 YTPVQNEENCQIMNTLVDASGAHF--ETAGSLREGRQVFVT--MRLPETMTVAGTDRLDL 151 Query: 102 IL--LNSHDGTSSYQMLPGMFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEG----- 152 + LNSHDGT +Y+++ R VC N L + + H ++ E Sbjct: 152 YISGLNSHDGTGAYKLIVTPIRIVCANTQSLALDRARSSFSIRHTESAKKKIAEARKALG 211 Query: 153 -AYEVLGIFERVEEKR 167 ++ + FE+ E+ Sbjct: 212 LMFKYVEEFEKAAERM 227 >UniRef50_C4DCZ5 Phage/plasmid-related protein TIGR03299 n=3 Tax=Actinomycetales RepID=C4DCZ5_9ACTO Length = 395 Score = 78.3 bits (191), Expect = 3e-13, Method: Composition-based stats. Identities = 35/257 (13%), Positives = 75/257 (29%), Gaps = 29/257 (11%) Query: 22 REELFRVVPSVF------SEDKHESRSERYTYIPTISLLDSLQR--EGFQPFFACQTRVR 73 ++L P F + +Y + + L+ E + + VR Sbjct: 125 DDQLHTH-PDKFHTLRSDTAAPLGVVGSKYHTVQNRECFEFLRNLVESYDVVWESAGAVR 183 Query: 74 DPRRREHTKHMLRLRREG--QITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNG--LV 129 RR + + I P +++ NSHDG+SS + +R VC N L Sbjct: 184 GGRRTFVSMRLPDTVTVDAAGINDTITPFVVVFNSHDGSSSITAVVTPYRPVCANTERLA 243 Query: 130 CGESFGEVRVPHKGDVVSQVIEGAYEV---LGIFERVEEKRDAMQSLLLPPPVQQALAKA 186 ++ + H + Q+ + + + ++ ++ + + +AL Sbjct: 244 LDNAYTSWSIRHTESAMHQMRQARRTLKMSVKYYDEFAAQQTTLAHHDMVIDEFRALIDE 303 Query: 187 ALTYRFGEDH----------QPVTESQILSPRRWQDESNDLWTTYQRIQENLIKGGLSGR 236 + + RR + L+ + I + Sbjct: 304 LWPLEPNATKRGKTNAKNRREALMAQWDTESRRC---GSTLYAAERAITGYIDHDKPRML 360 Query: 237 NAKGGRTHTRAVRGIDG 253 RA ++G Sbjct: 361 GKFSTLNAARATAIVEG 377 >UniRef50_Q024R3 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q024R3_SOLUE Length = 237 Score = 77.1 bits (188), Expect = 6e-13, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 72/227 (31%), Gaps = 24/227 (10%) Query: 8 FGAANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSL----QREGFQ 63 A LI LTR +L ++ + + +P + ++++L Sbjct: 1 MSEATLIASTAKLTRLQL--------ADVPTPLGTATHRPVPHVEVVEALVETLSFRHIG 52 Query: 64 PFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAV 123 +D M + I + NSHD + + G+ V Sbjct: 53 VVTEEYAVSKDG------MKMFGVLDLDTGMPGCRFSIGIRNSHDRSMRLAAVVGVRVLV 106 Query: 124 CQNGLVCGESFGEVRVPHKGD--VVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQ 181 C+N G+ F V H + + + + G ++ F+ + ++ DA + L V + Sbjct: 107 CENMAFSGD-FQPVLAKHSKNFSLQNALSIGVDQMQRNFDGMRKQVDAWRESQLSDTVAK 165 Query: 182 ALAKAALT---YRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQ 225 + A + SP+ + + +W+ Sbjct: 166 MIIYRAFIESDLEVPKHLARPVHDLYFSPKHEEFQPRTMWSLSNAFT 212 >UniRef50_A1SIX8 Putative uncharacterized protein n=2 Tax=Nocardioides sp. JS614 RepID=A1SIX8_NOCSJ Length = 334 Score = 75.6 bits (184), Expect = 2e-12, Method: Composition-based stats. Identities = 45/257 (17%), Positives = 82/257 (31%), Gaps = 24/257 (9%) Query: 27 RVVPSVFSEDKHESRSERYTYIPTISLLD--SLQREGFQPFFACQTRVRDPRRREHTKHM 84 R P + + YT + + +L E F +R R+ Sbjct: 77 RTNPFTGAPEALGVVGGGYTPLQNEDHAEFLNLLAEESGAIFDTAGSLR-GGRQVFIT-- 133 Query: 85 LRLRREGQITGKQVPEIIL--LNSHDGTSSYQMLPGMFRAVCQN--GLVCGESFGEVRVP 140 ++L + G ++ + LNSHDG+S++++L R VC N + Sbjct: 134 MQLPDSLTVGGTDRVDLNIAALNSHDGSSAFRILVTPVRVVCANTQSAALRNHESSFSIR 193 Query: 141 HKGDVVSQVIEGAYEVLGIF---ERVEEKRDAMQSLLLPPPVQQALAKAAL--TYRFGED 195 H + + V + F + + + + + + AL A G Sbjct: 194 HTRNAKAAVQAARDALGLTFTYVDAFQVEAERLIQQTMTDAAFDALIDATFGKAEANGTK 253 Query: 196 HQPVTESQILSPRRWQDESNDL--------WTTYQRIQENLIK-GGLSGRNAKGGRTHTR 246 TE + S W D W YQ + E + + + + TR Sbjct: 254 RVRETERRRRSRLHWLFADADTQAGIRATAWAGYQAVAEYVDHYAPVRTKGDEHAARATR 313 Query: 247 AVRGIDGDVKLNRALWV 263 + D D ++ R W Sbjct: 314 VLTSDDPD-RIKRRAWT 329 >UniRef50_B4VVD2 Phage/plasmid-related protein TIGR03299 n=2 Tax=Cyanobacteria RepID=B4VVD2_9CYAN Length = 336 Score = 74.8 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 41/200 (20%), Positives = 63/200 (31%), Gaps = 19/200 (9%) Query: 45 YTYIPTISLL---DSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV-PE 100 YT + D L G +R L I+G V P Sbjct: 79 YTPLQNEEAFRWFDPLLSRG--GVQLEAAGSLKGGKRIWILAKLINTEAEIISGDIVRPY 136 Query: 101 IILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFG--------EVRVPHKGDVVSQVIEG 152 ++L NSHDG+++ + R VC N L F + +PH + Q +E Sbjct: 137 LLLHNSHDGSTAVWLQFTPVRVVCWNTLNGAARFRFGDLWQKKAICIPHSLSLTEQ-LEH 195 Query: 153 AYEVLG----IFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPR 208 + +L F+ E+ AM L + L H ++ Sbjct: 196 IHNILDLTQKEFQYSVEEYQAMAHKELTTELLADYIGRVLGTTQPTLHPAWSQLVANFES 255 Query: 209 RWQDESNDLWTTYQRIQENL 228 ++ LW Y I E L Sbjct: 256 GRGNQGQTLWDAYNSITEWL 275 >UniRef50_UPI0001AF46A9 hypothetical protein MkanA1_07449 n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF46A9 Length = 348 Score = 74.8 bits (182), Expect = 3e-12, Method: Composition-based stats. Identities = 43/251 (17%), Positives = 76/251 (30%), Gaps = 22/251 (8%) Query: 43 ERYTYIPTI---SLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLR---LRREGQITGK 96 +Y + LLD+L + F +R R T + + + Sbjct: 99 SKYEPLQNEASCDLLDALVDQSGGAHFETAGALRGGRETFVTMKLPSSMVFDGKDGSKDR 158 Query: 97 QVPEIILLNSHDGTSSYQMLPGMFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEGAY 154 + LNSHDG+++++ L R VC N + + H G + + E Sbjct: 159 TDFYLAALNSHDGSAAFRFLLSPIRIVCANTQSAAIRSAKSSFSIRHTGGARASIAEARN 218 Query: 155 EVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPR------ 208 + + +E +L P + + A T + + R Sbjct: 219 ALKLSWRYIEAFEAEAAALYAAPMDTEEMRSFANTLLEVDSAGTTATRRHRRERANSIVK 278 Query: 209 RWQDESN------DLWTTYQRIQENLIK-GGLSGRNAKGGRTHTRAVRGIDGDVKLNRAL 261 W W Y + E L + G + RA+R I ++L Sbjct: 279 LWTSSETIAPIAGTRWAAYNAVTEYLDHVVPVRGAKTATDASAARALRNITTAAS-GQSL 337 Query: 262 WVMAETLLTQL 272 A +L L Sbjct: 338 KAQAFRMLQTL 348 >UniRef50_UPI00017465AE hypothetical protein VspiD_04485 n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017465AE Length = 256 Score = 72.5 bits (176), Expect = 2e-11, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 64/207 (30%), Gaps = 17/207 (8%) Query: 38 HESRSERYTYIPTISLLDS----LQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQI 93 + + IP L+++ L+ + + R +L G Sbjct: 35 TPRSTSSWCPIPHNRLIETVQKTLKSTNLRIGTQAHSLSHKGHRYFGLMEIL-----GPK 89 Query: 94 TGKQVPEII-LLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVI-- 150 ++ L NSHD T ++ G VC N GE H +V + Sbjct: 90 NDDDYCWVLGLRNSHDKTFPAGIVAGASVFVCDNLSFSGEVK--FARKHTRFIVRDLPGI 147 Query: 151 --EGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDH-QPVTESQILSP 207 +++ + +++ A + + + L A + P + P Sbjct: 148 TERAIGQLMSKWHHQDKRIGAYKEADIEDSIAHDLIIRATDVGVCSNRLIPSVLKEWREP 207 Query: 208 RRWQDESNDLWTTYQRIQENLIKGGLS 234 R E +W+ + E L G LS Sbjct: 208 RYQVFEDRSVWSLFNAFTEALKDGSLS 234 >UniRef50_A1UPG4 Putative uncharacterized protein n=1 Tax=Mycobacterium sp. KMS RepID=A1UPG4_MYCSK Length = 344 Score = 70.6 bits (171), Expect = 5e-11, Method: Composition-based stats. Identities = 44/253 (17%), Positives = 77/253 (30%), Gaps = 30/253 (11%) Query: 43 ERYTYIPTI---SLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLR---LRREGQITGK 96 +Y + LLD+L E + +R R T + + Sbjct: 99 NKYEPMQNEASCDLLDALTGES-GAVYETAGALRGGRETFVTMRLPESMVFDGIDGTKDR 157 Query: 97 QVPEIILLNSHDGTSSYQMLPGMFRAVCQN--GLVCGESFGEVRVPHKGDVVSQVIEG-- 152 + LNSHDG+S ++ L R VC N + + H G + E Sbjct: 158 TDFYLAALNSHDGSSKFRFLVTPVRIVCANTQSAAIARAAASFGISHTGGAAVALQEARR 217 Query: 153 ----AYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSP- 207 ++ + FE ++ A+ + + + A + E + + Sbjct: 218 ALKLSWRYVEAFE---QEAAALYAAPMDLDQMRRFAGELVDVDGAESKTTARNRRDTANA 274 Query: 208 --RRWQDESN------DLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNR 259 + W W Y + E + S A G RA+R + G + Sbjct: 275 IVKLWVSSPTVAPIAGTRWAAYNAVTEYV--DHYSKVRAAGDPQSVRALRAVTGGSTA-Q 331 Query: 260 ALWVMAETLLTQL 272 L A +L L Sbjct: 332 TLKTNAFRMLQTL 344 >UniRef50_B9PA18 Predicted protein (Fragment) n=2 Tax=cellular organisms RepID=B9PA18_POPTR Length = 87 Score = 69.0 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 27/53 (50%), Positives = 41/53 (77%), Gaps = 1/53 (1%) Query: 3 RLASRFGA-ANLIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLL 54 +LASRF + + +R D PL+ +++ RV PS+F++ HESRSERY+YIPT ++L Sbjct: 35 QLASRFASHSPALRSDSPLSDDQIRRVAPSIFADAPHESRSERYSYIPTAAVL 87 >UniRef50_C6W397 Phage/plasmid-related protein TIGR03299 n=12 Tax=Bacteroidetes RepID=C6W397_DYAFD Length = 350 Score = 65.2 bits (157), Expect = 2e-09, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 46/152 (30%), Gaps = 8/152 (5%) Query: 45 YTYIPTISLLDSLQR-EGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIIL 103 Y + G R L + + L Sbjct: 95 YQIVQNRDAFTFFDSIVGNDGILYETAGALGKGERIFITAKLPGYIQVGSNDLIEKYLFL 154 Query: 104 LNSHDGTSSYQMLPGMFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIEGAYEVLGIFE 161 SHDG+ S R VC N L V++ H + V + + A++V+GI Sbjct: 155 TTSHDGSGSITAAFTPVRIVCANTLNAAMKNITNVVKIRHTSNAVER-LRTAHKVMGIAN 213 Query: 162 R----VEEKRDAMQSLLLPPPVQQALAKAALT 189 + VEE + + P + L + A+ Sbjct: 214 KFSHEVEEIFNHWAKKPITDPQLKKLIEIAMA 245 >UniRef50_B4CXI2 Putative uncharacterized protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CXI2_9BACT Length = 320 Score = 63.3 bits (152), Expect = 9e-09, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 61/212 (28%), Gaps = 41/212 (19%) Query: 42 SERYTYIPTISLLDSLQREGFQPF------FACQTRVRDPRRREHTKHMLRLRREGQIT- 94 S RY + F P + R M R+ ++ Sbjct: 76 SRRYRPLQNSEAFKF-----FDPIVGDRKAYFETAGALGEGERIWV--MARMPEVMEVVR 128 Query: 95 -GKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQVIE 151 ++L N+H+G S + R VCQN L+ + RV H + ++ E Sbjct: 129 GDDCFKYLLLSNTHNGEGSVIVKFTTVRVVCQNTLMLAMEDGQKAYRVRHSKQMQFKLDE 188 Query: 152 GAYEVL---GIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPR 208 A + +F+ E+ + ++ + + A V + + P Sbjct: 189 LADFLAITQQVFQEAEQTFRRLAAVKMTSERLEQYFDAVFP------RTDVQKKRHEKPP 242 Query: 209 RWQDESN---------------DLWTTYQRIQ 225 RW LW Y I Sbjct: 243 RWGFLQEMFDSQPDLQLPGVQGTLWGAYNAIT 274 >UniRef50_Q5LU35 Putative uncharacterized protein n=1 Tax=Ruegeria pomeroyi RepID=Q5LU35_SILPO Length = 275 Score = 62.5 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 48/252 (19%), Positives = 76/252 (30%), Gaps = 37/252 (14%) Query: 13 LIRRDRPLTREELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQ-REGFQ--PFFACQ 69 L PL + L ++ + + + IP L+D ++ GF Sbjct: 32 LHAGASPLDYDGLRQL--------ETPEATSTHVPIPHHRLVDVVRLTLGFYGHTVEEEH 83 Query: 70 TRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLV 129 V R LR G + L NSHD T + G VC N Sbjct: 84 HGVTPDGMRYFGVLSLR-----STYGDYTDTVGLRNSHDKTFPIGISFGSRVFVCDNLAF 138 Query: 130 CGESFGEVRVPHKG----DVVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAK 185 + VR H D+ V + + E ++ L Q+L Sbjct: 139 IADHV--VRRKHTAQAKRDLPGLVGDLIEPLADQREAQHRVISRYRAANLS----QSLVD 192 Query: 186 AALTYRFGEDHQPVTESQILSPRRWQDESNDL-----WTTYQRIQENLIKGGLSGRNAKG 240 A+ + + VT RW++ +D W + + L GR A+ Sbjct: 193 HAVLELYRAEVITVT-RIAAVMERWENPPHDWGVKTAWRLFNCVT-----HALEGRIAEQ 246 Query: 241 GRTHTRAVRGID 252 +R ID Sbjct: 247 PALTSRLHDVID 258 >UniRef50_Q47CX4 Putative uncharacterized protein n=4 Tax=Betaproteobacteria RepID=Q47CX4_DECAR Length = 354 Score = 62.1 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 33/153 (21%), Positives = 55/153 (35%), Gaps = 16/153 (10%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHM------LRLRREGQITG 95 S+RY + L +S+ P VR M RL+ E Sbjct: 115 SDRYRRLDNFDLAESVL-----PILQQLPEVRFESVELTETKMYLKCITPRLKYEMAPGD 169 Query: 96 KQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 +++ NS G + + P +FR VC NGL+ + +R H G + E Sbjct: 170 VVQAGVVISNSEVGQGTLSVQPLLFRLVCSNGLIVPD--RSLRKMHVGRALGGEDERIQV 227 Query: 156 VLGIFERVEEKRDAMQSLLLPPPVQQALAKAAL 188 R ++K ++ + VQ A++ A Sbjct: 228 YQDDTLRADDKAFFLK---VRDVVQAAVSDATF 257 >UniRef50_Q0RM54 Putative uncharacterized protein n=1 Tax=Frankia alni ACN14a RepID=Q0RM54_FRAAA Length = 360 Score = 62.1 bits (149), Expect = 2e-08, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 52/168 (30%), Gaps = 14/168 (8%) Query: 40 SRSERYTYIPTISLLDSLQR-EGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQ- 97 + +T I + + ++ G + V + R +++L + G Sbjct: 116 HPRDTWTLIDHAEMGEIVEAFLGMENVQYETGGVLEKGRAVWA--LIKLDEPIALPGDNS 173 Query: 98 --VPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGE-----VRVPHKG---DVVS 147 +P +L N HDG S + R VC N E E H D + Sbjct: 174 LTLPYFLLRNRHDGNGSCSVSHTPVRVVCANTWKVSEMTDEANGTVFSFRHNEKWRDRLE 233 Query: 148 QVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGED 195 + + V F +E + + + + QQ + G Sbjct: 234 EAKQAIKGVRKQFTLYQEIAERLLDMTVTEKQQQMFVNDFIPTPTGAT 281 >UniRef50_A1WP45 Putative uncharacterized protein n=2 Tax=Comamonadaceae RepID=A1WP45_VEREI Length = 312 Score = 59.0 bits (141), Expect = 2e-07, Method: Composition-based stats. Identities = 41/230 (17%), Positives = 73/230 (31%), Gaps = 31/230 (13%) Query: 37 KHESRSERYTYIPTISLLD----SLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQ 92 S RY + +L+ ++REGF RR + R E Sbjct: 64 PLSVVSPRYKIVQPKKMLEFYRSLVEREGFAI---ETIGSLKGGRRIWA--LARTHIEND 118 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGMFRAVCQNG--LVCGESFGEVRVPHKGDVVSQ 148 + G ++L+ S DG+ + R VC N + ES +V+V H Sbjct: 119 VLGSDRLKAYVLLITSCDGSLATTAKFTCVRVVCWNTQAIALNESGKQVKVRHNTAFNPD 178 Query: 149 VIEGAYEVLG--IFERVEEKRDAMQSLLLPPPVQQALAKAALTY----RFGEDHQPVTES 202 ++G ++G F+ K ++ + L P Q + L R G +H+ + ++ Sbjct: 179 AVKGEMGLMGAKAFDAFLGKMRSLTRVKLTEPDAQGIVACLLASPMDERKGVEHKGIEQT 238 Query: 203 QILSPRRWQDESN-----------DLWTTYQRIQENLIKGGLSGRNAKGG 241 + W + E RN + Sbjct: 239 KGFQKIMALFNGAAQGAHLPGVQGTAWGLLNAVTEYA-DHHARARNPENR 287 >UniRef50_UPI00016C3597 hypothetical protein GobsU_16407 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3597 Length = 235 Score = 57.5 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 54/218 (24%), Gaps = 30/218 (13%) Query: 70 TRVRDPRRREHTKHMLRLRREGQITGK-QVPEIILLNSHDGTSSYQMLPGMFRAVCQNGL 128 +R + + G +L N+HD + + + R VC N L Sbjct: 24 AGSLKEGKRIWVLARINGAEAEVVDGDPVRGYFLLSNAHDASQAVRAQFTSIRVVCANTL 83 Query: 129 VCGESFGE------VRVPHKGDVVS---QVIEGAYEVLGIFERVEEKRDAMQSLLLPPPV 179 + E VRV H + + V F M S LP Sbjct: 84 NAADRRAERGFEDCVRVRHTTGLETSLVLVQHTIDMAAKTFSASLADYQRMVSRRLPVDG 143 Query: 180 QQALAKAALTYRFGEDHQPVTESQILSPRRWQDESN----------DLWTTYQRIQENLI 229 + L +W + W Y I + + Sbjct: 144 FRKYVIDVLEVPESVQRMGKMPKAW-DTLQWAYHAAPGARINGVFGTYWGAYNAITDWV- 201 Query: 230 KGGLSGRNAKGGRTHTRAVRG--IDGDVKLNRALWVMA 265 + +G + + +L + + +A Sbjct: 202 ------DHTRGVKDADSRLDSAWFGSGARLKQRAFELA 233 >UniRef50_B4WVT0 Putative uncharacterized protein n=2 Tax=Synechococcus sp. PCC 7335 RepID=B4WVT0_9SYNE Length = 352 Score = 57.5 bits (137), Expect = 5e-07, Method: Composition-based stats. Identities = 26/132 (19%), Positives = 47/132 (35%), Gaps = 1/132 (0%) Query: 22 REELFRVVPSVFSEDKHESRSERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHT 81 +E+ + + D S RY + L D++ + A R + Sbjct: 91 QEQPEQRMIRTMGTDARAFLSRRYRRLDNFDLADAVLPTLLEMQGARVVSCELTETRMYL 150 Query: 82 KHMLRLRREGQITGKQV-PEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVP 140 K + + G V + + NS G S ++ P ++R VC NG+V + R Sbjct: 151 KVVTDRIQADVKVGDAVQAGVCISNSEIGMGSLRVEPLIYRLVCTNGMVSPDRSARNRFT 210 Query: 141 HKGDVVSQVIEG 152 H G + + Sbjct: 211 HLGRAAADTPDA 222 >UniRef50_A8ZKZ6 Putative uncharacterized protein n=3 Tax=Cyanobacteria RepID=A8ZKZ6_ACAM1 Length = 351 Score = 55.9 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 24/137 (17%), Positives = 49/137 (35%), Gaps = 14/137 (10%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQT------RVRDPRRREHTKHMLRLRREGQITG 95 S+RY + + +++ P A V R + K + + G Sbjct: 111 SDRYRRVDNFEIAETVL-----PVLAEFGQGLKIMSVGLTDSRLYIKAVNERVQLDVRKG 165 Query: 96 KQV-PEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAY 154 V +++ NS G S ++ P ++R VC NGL+ + + H G V + Sbjct: 166 DAVQAGVVISNSEIGLGSIRIEPLVYRLVCLNGLISQD--HSFKKYHVGRQVGESDAAVE 223 Query: 155 EVLGIFERVEEKRDAMQ 171 +++ ++ Sbjct: 224 LFSDETREADDRALLLK 240 >UniRef50_Q19YQ9 Gp96 n=7 Tax=unclassified Siphoviridae RepID=Q19YQ9_9CAUD Length = 400 Score = 55.5 bits (132), Expect = 2e-06, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 69/211 (32%), Gaps = 23/211 (10%) Query: 68 CQTRVRDPRRREHTK-----HMLRLRREGQITGKQVPEIILLNSHDGTSSYQMLPGMFRA 122 D RR HM + + + N HDG S R Sbjct: 175 ETGGSLDGGRRTFVTMKMPDHMELVSPITGKRDVTDLYLSIFNHHDGGGSLVANISPVRV 234 Query: 123 VCQNG--LVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEK----RDAMQSLLLP 176 VC N + + V + H G+ + +E +LG+ + ++ + M + + Sbjct: 235 VCANTQRMAERAAVSRVSIRHTGEAQVR-LEEVRRILGLTWKYQDTYVAEVEEMAKIEMS 293 Query: 177 PPVQQALAKAAL-TYRFGEDHQPVTESQILSPRRWQ---------DESNDLWTTYQRIQE 226 A+ ++ + + + ++ ++ ++ D + Y + E Sbjct: 294 NVETFAIMRSVFEVDKVDPESRSASQRTQMATEAFEIYRSSATVDDFRGVAFGGYNAVTE 353 Query: 227 NLIKG-GLSGRNAKGGRTHTRAVRGIDGDVK 256 + + G++ + R + G G++K Sbjct: 354 WVDHYMPVRGKDNVDVKRALRTINGGGGEIK 384 >UniRef50_B7I5L8 Phage/plasmid-related protein n=5 Tax=Moraxellaceae RepID=B7I5L8_ACIB5 Length = 342 Score = 54.0 bits (128), Expect = 5e-06, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 54/164 (32%), Gaps = 11/164 (6%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQREGFQP-FFACQTRVRDPRRREHTKHMLRLRREGQ 92 + S+RY + +L+ + Q F V R+ + R + Sbjct: 67 THAPLSVVSQRYQEVQPKEILEFYRDLTEQSGFELETAGVLKGGRKFWA--LARTGQSAA 124 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-----ESFGEVRVPHKGDV 145 + K V I+L + DGT + R VC N L S G V+VPH Sbjct: 125 LKSKDVSNGYILLATACDGTLATTAQFTSIRVVCSNTLAIALRGQNSSVGVVKVPHSTKF 184 Query: 146 VSQVIEGAYEV-LGIFERVEEKRDAMQSLLLPPPVQQALAKAAL 188 ++ I+ + + ++ + + + A A Sbjct: 185 DAEKIKQQLGISVRAWDEHMYEMKQLSQRKVTQQEAAAYFDAVF 228 >UniRef50_Q5Y1B4 Putative uncharacterized protein n=1 Tax=uncultured organism BAC21E04 RepID=Q5Y1B4_9ZZZZ Length = 315 Score = 53.2 bits (126), Expect = 9e-06, Method: Composition-based stats. Identities = 40/249 (16%), Positives = 78/249 (31%), Gaps = 35/249 (14%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV-PE 100 S++Y + SLL + + C + D + T + R + G V Sbjct: 81 SKQYEIVQNDSLLRMAEFIREEVDMDCVIVLSDGAKVCFTATL-RGAETDIVPGDTVKRR 139 Query: 101 IILLNSHDGTSSYQMLPGMFRAVCQNGLVCG---ESFGEVRVPHKGDVVSQVIEGAYEVL 157 I+ HDG + R VCQN L + HK Sbjct: 140 IVGYLGHDGKTGCGAKFTNIRVVCQNTLTAALGEAGGAHSSITHKNGAN----------- 188 Query: 158 GIFERVEEKRDAMQSLLLPP-PVQQALAKAALTY----RFGEDHQPVTESQILSPRRW-- 210 F+ + D + + + + ++A++ F ++ + E Q+ R Sbjct: 189 NNFDTLINSIDVARQDFVTECELMREFSRASMGVSQFNEFVDEVYNIDEGQVFRKREKLE 248 Query: 211 ---------QDESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRAL 261 + +W+ I E + + AKG R ++++ Sbjct: 249 RAFTRGFGFRFAPASVWSAVNAITE-VETSTRNTTAAKGRAQFAR--GTFGVGAQISKRA 305 Query: 262 WVMAETLLT 270 + +A L+T Sbjct: 306 FALARDLVT 314 >UniRef50_C5CKG6 Phage/plasmid-related protein TIGR03299 n=10 Tax=Proteobacteria RepID=C5CKG6_VARPS Length = 342 Score = 52.8 bits (125), Expect = 1e-05, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 53/161 (32%), Gaps = 8/161 (4%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQREG-FQPFFACQTRVRDPRRREHTKHMLRLRREGQ 92 + S RY + +L+ + + V R+ + R ++ Sbjct: 83 TRAPLSVVSSRYQVVQPREVLEFYRDLTEIGGYEMETAGVLKGGRKVWA--LARTGQQAV 140 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG--ESFGEVRVPHKGDVVSQ 148 + G + ++L S DGT + + P R VC N L + +RVPH Sbjct: 141 LKGNDIVNGYLLLATSCDGTLATSVTPTTVRVVCSNTLAVALDATSNVIRVPHSTSFDPD 200 Query: 149 VIEGAYEV-LGIFERVEEKRDAMQSLLLPPPVQQALAKAAL 188 ++ + +G ++ + + + + L Sbjct: 201 AVKRQLGIAIGQWDEFMYRMKTLSQRKVKTKEALQYIERVL 241 >UniRef50_C4ZMQ9 Phage/plasmid-related protein TIGR03299 n=1 Tax=Thauera sp. MZ1T RepID=C4ZMQ9_THASP Length = 334 Score = 52.5 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 59/178 (33%), Gaps = 18/178 (10%) Query: 45 YTYIPT---ISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPE- 100 + + + D+L +G +P + RL + Q+ K V E Sbjct: 84 FGPLQNRAGAEMFDALLGQG-RPI-YHTGGYLKNGEVVWL--LARLPGDIQVQEKDVIET 139 Query: 101 -IILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGI 159 ++ NSHDG+S+ + R VCQN L V G V + +G Y VL Sbjct: 140 YLLFSNSHDGSSAIDIRLTTVRVVCQNTLSLALDNTSV-----GKVFRRAHDGRYRVLKE 194 Query: 160 FERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDL 217 R + S+ Q + A + + P+R +L Sbjct: 195 EARAFFEF----SVKRSEEAQALFGRLANAECDDRAFEDFLAQLLPDPKRPVTAGQNL 248 >UniRef50_C6RKU8 Phage/plasmid-related protein n=12 Tax=Acinetobacter RepID=C6RKU8_ACIRA Length = 347 Score = 51.7 bits (122), Expect = 3e-05, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 53/163 (32%), Gaps = 10/163 (6%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQREGFQP-FFACQTRVRDPRRREHTKHMLRLRREGQ 92 + S+RY + +L+ + Q F V ++ + R + Sbjct: 74 THAPLSVVSQRYQEVQPKQILEFYRDLTEQSGFELETAGVLKGGKKFWA--LARTGQSAA 131 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG----ESFGEVRVPHKGDV- 145 + GK V I+L + DGT + R VC N L S G V+VPH Sbjct: 132 LKGKDVSNAYILLATACDGTLATTAQFTSIRVVCNNTLAIALKGQSSAGVVKVPHSTRFD 191 Query: 146 VSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAAL 188 ++ + + ++ + + + A A Sbjct: 192 AGKIKQQLGISVRQWDEHMYEMKQLSQRKVTQTEAAAYFDAVF 234 >UniRef50_C0VFU1 Putative uncharacterized protein n=4 Tax=Acinetobacter RepID=C0VFU1_9GAMM Length = 357 Score = 50.5 bits (119), Expect = 5e-05, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 53/167 (31%), Gaps = 14/167 (8%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQREGFQP-FFACQTRVRDPRRREHTKHMLRLRREGQ 92 + + S+R+ + +L+ + Q F V ++ + + + Sbjct: 74 THEPLSVVSQRFQEVQPKEILEFYRDLTEQSGFELETAGVLKGGKKFWA--LAKTGQTSA 131 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG--------ESFGEVRVPHK 142 + GK V I+L + DGT + R VC N L + G V+VPH Sbjct: 132 LKGKDVSNGYILLATACDGTLATTAQFTSIRVVCNNTLAIALKAQNAGSNNTGVVKVPHS 191 Query: 143 GDV-VSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAAL 188 +V + ++ + + + A A Sbjct: 192 TRFDAEKVKHQLGISVRAWDEHMYEMKQLSQRKVTQQEAAAYFDAVF 238 >UniRef50_A6WZ56 Putative uncharacterized protein n=1 Tax=Ochrobactrum anthropi ATCC 49188 RepID=A6WZ56_OCHA4 Length = 402 Score = 50.2 bits (118), Expect = 8e-05, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 55/161 (34%), Gaps = 10/161 (6%) Query: 94 TGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG-ESFGEVRVPHKGDVVSQVIEG 152 + NS G+S+ ++ RAVC N L+ G E F E+ + H S+ IE Sbjct: 241 PDLVFRGFYITNSEVGSSALKVAAFYLRAVCCNRLMWGVEGFQEISMRHSKYAPSRFIEE 300 Query: 153 AYEVLGIF-----ERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSP 207 A L F +R+ + ++ + + + + +F + + Sbjct: 301 ARPALEGFADGSTQRLLDGVKKARATKVASND-EEMIEFLRGRKFSQKQAITILEYV--E 357 Query: 208 RRWQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAV 248 + + +W Q I + + + R + Sbjct: 358 KEEGAPARTIWDVAQGISASA-RNIPHTDDRVEFEREARRL 397 >UniRef50_C4V5A4 Putative uncharacterized protein n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V5A4_9FIRM Length = 365 Score = 49.4 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 28/135 (20%), Positives = 52/135 (38%), Gaps = 14/135 (10%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHML------RLRREGQITG 95 S+RY + + L ++ P H+ +L+ E + Sbjct: 114 SDRYRRLDNLELCTAVL-----PVIQEMKDAAIMSCEVTESHLYLKVVNKKLKAEVGVGD 168 Query: 96 KQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYE 155 ++ NS G S ++ P ++R VC+NGL+ + F + + H G V+ + AYE Sbjct: 169 VVQAGFVVSNSEVGLGSLKVEPLIYRLVCKNGLIVKD-FAQKKY-HVGRQVAAEDDTAYE 226 Query: 156 VLGIFERVEEKRDAM 170 L E + + Sbjct: 227 -LYSDETLAQDDKTF 240 >UniRef50_Q2IFF9 Putative uncharacterized protein n=3 Tax=Anaeromyxobacter RepID=Q2IFF9_ANADE Length = 325 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 62/222 (27%), Gaps = 18/222 (8%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLR-LRREGQITGKQVP- 99 S+ Y + + +L E A T +L + ++ G P Sbjct: 74 SKSYEVVQFSEVARTLV-EAAGDVKAVFTTAGTLGPVGIKGWLLGEIPNPIKVKGDPSPI 132 Query: 100 --EIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG---ESFGEVRVPHKGDVVSQVIEG-- 152 ++ HDG ++ + R VC N L R+ H + ++ E Sbjct: 133 RKYVLGTTGHDGVTAVVLKNVATRVVCANTLGVALGERGGATWRIQHTANAKMRLDEAGK 192 Query: 153 AYE-VLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYRFGE-DHQPVTESQILSPRR- 209 A+ ++ +ER+ E + + +A + + DH + R Sbjct: 193 AFRQLVESYERLGELANVLAVTPFTTRQMKATIDRLMPVPKDDRDHTKPEAERGKVIRLF 252 Query: 210 -----WQDESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTR 246 + W Q E + R Sbjct: 253 DTAAAIERVRGTAWAALQGWTEYADHHRQVRDTGREDPRRAR 294 >UniRef50_A8ZPY1 Putative uncharacterized protein n=5 Tax=Bacteria RepID=A8ZPY1_ACAM1 Length = 209 Score = 49.0 bits (115), Expect = 2e-04, Method: Composition-based stats. Identities = 20/98 (20%), Positives = 39/98 (39%), Gaps = 12/98 (12%) Query: 42 SERYTYIPTISLLDSLQREGFQPFFACQT------RVRDPRRREHTKHMLRLRREGQITG 95 S+RY + + +++ P A V R + K + + G Sbjct: 111 SDRYRRVDNFEIAETVL-----PVLAEFGPGLKIMSVGLTDSRLYIKAVNERVQLDVRKG 165 Query: 96 KQV-PEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGE 132 V +++ NS G S ++ P ++R VC NG++ + Sbjct: 166 DAVQAGVVISNSEIGLGSIRIEPLVYRLVCLNGMISQD 203 >UniRef50_B3VM79 Gp52 n=2 Tax=unclassified Siphoviridae RepID=B3VM79_9CAUD Length = 403 Score = 47.5 bits (111), Expect = 5e-04, Method: Composition-based stats. Identities = 27/236 (11%), Positives = 68/236 (28%), Gaps = 29/236 (12%) Query: 51 ISLLDSLQREGFQPFFACQTRVRDPRRREHTK-HMLRLRREGQITGKQVPEIILLNSHDG 109 I++ L + G A + R + + + + + P +++ S DG Sbjct: 146 INMTSELLQ-GSDNIGATGAGLLKWGRVCYMEVSIPQTMHNDRAGFDYRPNLLIYTSFDG 204 Query: 110 TSSYQMLPGMFRAVCQNGLVCGESFG-----EVRVPHKGDVVSQVIEGAYE---VLGIFE 161 + + + VC N L S + + H ++ E + + Sbjct: 205 SLKTTLARTITATVCDNTLQIAASEAKRAGTALTIGHTRLSSDRMPEARQVLGIIEQESD 264 Query: 162 RVEEKRDAMQSLLLPPPVQQALAKAALTY----------RFGEDHQPVTESQILSPRRWQ 211 D + + +A L + + + + + + Sbjct: 265 DFNTLLDEWAATPVSTKQFEAWLDEVLPVPEVKVIDGKAKTNSQTIVLNKREAIGDLYYT 324 Query: 212 DESNDLWT--------TYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNR 259 DE W + + + G + + G +T R + +K+++ Sbjct: 325 DERAATWVGTKLGVRQAWNTAHHHKFRSG-NAKQFDGNKTLARVESNMMRSLKMDK 379 >UniRef50_C8X3A3 Putative uncharacterized protein n=1 Tax=Desulfohalobium retbaense DSM 5692 RepID=C8X3A3_DESRD Length = 243 Score = 47.1 bits (110), Expect = 7e-04, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 71/219 (32%), Gaps = 28/219 (12%) Query: 37 KHESRSERYTYIPTISLLD----SLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQ 92 + + +P ++D ++ R+G +D + + + R Sbjct: 15 PVVPGTATWNPVPHNQVIDTVETAISRQGLGIVRKRFELTQDGANVFASYRLDQSR---- 70 Query: 93 ITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEG 152 G EI NS + + G F VC N + G+ F E R H + + Sbjct: 71 -NGSSW-EIGFRNSVAKKFAVGITAGTFTIVCSNLVFTGD-FLEFR-RHTKGLDLDELRA 126 Query: 153 AYE-----VLGIFERVEEKRDAMQSLLLPPPVQQALA-KAALTYRFGEDHQPVTESQILS 206 + + +E+ ++ +++ LP Q L +A F Sbjct: 127 IANRALLGTISRLQSLEQWQEGLKAKPLPRRDMQCLTYEALQRGAFPGG------RFSRF 180 Query: 207 PRRWQDE----SNDLWTTYQRIQENLIKGGLSGRNAKGG 241 ++DE L++ + + + + L+ + + Sbjct: 181 VEAYEDEASRHGQSLYSFHGALTQTIRDQSLNQISHRSR 219 >UniRef50_A6SWN5 Uncharacterized conserved protein n=39 Tax=Proteobacteria RepID=A6SWN5_JANMA Length = 318 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 52/167 (31%), Gaps = 14/167 (8%) Query: 34 SEDKHESRSERYTYIPTISLLDSLQREGFQP-FFACQTRVRDPRRREHTKHMLRLRREGQ 92 ++ S RY + +L+ + + F V R+ + + + Sbjct: 74 TKAALSVVSNRYQVVQPDEILEFYRDLTTRSGFELETAGVMKGGRKLWA--LAKTGQSFS 131 Query: 93 ITGKQVP--EIILLNSHDGTSSYQMLPGMFRAVCQNGLVCG--ESFGEVRVPHKG----D 144 I K ++L + DG+ + R VC N L V+VPH D Sbjct: 132 IKDKDRINGYLLLATACDGSLATTAQFTSVRVVCNNTLAIALSGGKDVVKVPHSTTFEPD 191 Query: 145 VVSQVIEGAYEVLGIFERVEEKRDAMQSLLLPPPVQQALAKAALTYR 191 +V + + ++ F K + L A + + Sbjct: 192 LVKKELGISFSAWDNFRYRMTKLAERK---LKDQEADAFLRTLFSIP 235 >UniRef50_Q18F79 Putative uncharacterized protein n=1 Tax=Haloquadratum walsbyi DSM 16790 RepID=Q18F79_HALWD Length = 351 Score = 46.3 bits (108), Expect = 0.001, Method: Composition-based stats. Identities = 46/222 (20%), Positives = 81/222 (36%), Gaps = 13/222 (5%) Query: 43 ERYTYIPTISLLDSLQRE----GFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQV 98 + Y+ I +L+++ RE G +P+ + R E +M + +I Sbjct: 84 DFYSVIQYGDVLEAVHREMGDQGVEPYGTV-SLSGSAHRMEAPVYMSGDQARVEIGEGDR 142 Query: 99 PEIILLNS--HDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVV-SQVIEGAYE 155 + + S H G G R VC+NG+ S + H E Sbjct: 143 LNMGVKVSAGHSGHMGVHYNLGAERLVCRNGMTRFVSDLHLDQSHGERFQPGLAYEAVRG 202 Query: 156 VLGIFERVEEKRDAM-QSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDES 214 VLG +RVEE+ + + LL + L R E+ + + + + ES Sbjct: 203 VLGSTDRVEERLERARKRELLNLDEARLLLHDIGVDRVAENSEADIMNALFEEVESR-ES 261 Query: 215 NDLWTTYQ---RIQENLIKGGLSGRNAKGGRTHTRAVRGIDG 253 L+ YQ R+ ++ G G + R + + +DG Sbjct: 262 PSLYEVYQAGTRVVDHYADSGSPGHFQETVRDNVARLLDVDG 303 >UniRef50_A8L7W9 Putative uncharacterized protein n=4 Tax=Actinomycetales RepID=A8L7W9_FRASN Length = 380 Score = 44.0 bits (102), Expect = 0.006, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 40/120 (33%), Gaps = 20/120 (16%) Query: 42 SERYTYIPTISLL----DSLQREGFQP--------FFACQTRVRDPRRREHTKHMLRLRR 89 S++Y + + +L ++ G Q RVR R +LR R Sbjct: 136 SDKYKPVDNLDVLVAALAGIRDAGAQTTVDGCDLTDRRMHVRVRSDSVRALAPTLLRDYR 195 Query: 90 ------EGQITGKQVPEIILLNSHDGTSSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKG 143 G L NS G ++ + P + VC NGL + +R H G Sbjct: 196 SPFTGQRGADNPVVWAGFELTNSEVGCGAFTITPRLVVQVCSNGLTI--TRDALREVHLG 253 >UniRef50_B5LJ78 Gp67 n=1 Tax=Mycobacterium phage Myrna RepID=B5LJ78_9CAUD Length = 418 Score = 43.6 bits (101), Expect = 0.008, Method: Composition-based stats. Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 6/79 (7%) Query: 69 QTRVRDPRRREHTKHMLRLRREGQITGKQVP----EIILLNSHDGTSSYQMLPGMFRAVC 124 V P + H + +++ K P ++ NS G ++Q+LP VC Sbjct: 211 YLSVDVPEIKIHAQDLVKNYHFYDQDSKDNPFMSAGLVFTNSEVGRGAFQILPRAVVQVC 270 Query: 125 QNGLVCGESFGEVRVPHKG 143 +NG+ R H G Sbjct: 271 KNGM--RRDVDGFRKVHLG 287 >UniRef50_B8KMK8 Putative uncharacterized protein n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KMK8_9GAMM Length = 348 Score = 42.8 bits (99), Expect = 0.012, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 67/198 (33%), Gaps = 17/198 (8%) Query: 51 ISLLDSLQREGFQPFFACQTRVRDPRRREHTKHMLRLRREGQITGKQVPEIILLNSHDGT 110 L+ SL+R G P E + +G G + + LN +G Sbjct: 134 EQLVKSLRRLGILPRSKVFKTPFGEVVEEFST-----PGQGGQVGLRCRAVYGLN--NGY 186 Query: 111 SSYQMLPGMFRAVCQNGLVCGESFGEVRVPHKGDVVSQVIEGAYEVLGIFERVEEKRDAM 170 SSY+++ G +C NGL ES G R H DV V V + R+ + Sbjct: 187 SSYRIIWGRVVLICSNGLTAFESVGRDRWIHNSDVDVDVFVE-ESVTEAYSRLAVTEKQI 245 Query: 171 QSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILSPRRWQDESNDLWTTYQRIQENLIK 230 + +L +T + + L + D ++ W+ Q + Sbjct: 246 ADAR-SRAINYSLLDQFMTRLALANASKERVRKRLI-HEFSDTGHNEWSVSQALT----- 298 Query: 231 GGLSGRNAKGGRTHTRAV 248 +G + K +R + Sbjct: 299 --YAGEHEKPIPIGSREI 314 >UniRef50_C1D7A8 Putative uncharacterized protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D7A8_LARHH Length = 192 Score = 40.9 bits (94), Expect = 0.048, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 42/178 (23%), Gaps = 18/178 (10%) Query: 106 SHDGTSSYQMLPGMFRAVCQNG--LVCGESFGEVRVPHKGDVVSQVIEGAYEV-LGIFER 162 S DG+ R VC N + G VRVPH ++ + L ++ Sbjct: 20 SCDGSLCTTAQFTSVRVVCNNTLQMAVAGRSGAVRVPHSTVFDPVAVKTELGLGLSGWDA 79 Query: 163 VEEKRDAMQSLLLPPPVQQALAKAALTYRFGEDHQPVTESQILS----------PRRWQD 212 A+ + P + L +D + Sbjct: 80 FIGHIKALSQRPVSPEEARQFFAGVLDEPVADDPDAPVSKALQQLSALYGGLGMGALLGS 139 Query: 213 ESNDLWTTYQRIQENLIKGGLSGRNAKGGRTHTRAVRGIDGDVKLNRALWVMAETLLT 270 W E + R+ +L + A LLT Sbjct: 140 SRGTAWGLVNAATE-FVDHHRRARSQDYRLDSAW----FGQGAQLKQQALQRAGALLT 192 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.307 0.126 0.318 Lambda K H 0.267 0.0388 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,434,301,864 Number of Sequences: 3077464 Number of extensions: 52860876 Number of successful extensions: 147837 Number of sequences better than 1.0e-01: 61 Number of HSP's better than 0.1 without gapping: 67 Number of HSP's successfully gapped in prelim test: 41 Number of HSP's that attempted gapping in prelim test: 147638 Number of HSP's gapped (non-prelim): 109 length of query: 273 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 146 effective length of database: 649,558,428 effective search space: 94835530488 effective search space used: 94835530488 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 92 (40.1 bits)