BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (235 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_Q46793 Putative uncharacterized protein ygeN n=4 Tax=Es... 477 e-133 UniRef50_C8TGA5 Conserved predicted protein n=32 Tax=Enterobacte... 281 8e-75 UniRef50_P58654 Oxygen-regulated invasion protein orgB n=30 Tax=... 56 1e-06 UniRef50_Q6R8A1 OrgAb n=2 Tax=Sodalis glossinidius RepID=Q6R8A1_... 44 0.003 >UniRef50_Q46793 Putative uncharacterized protein ygeN n=4 Tax=Escherichia coli RepID=YGEN_ECOLI Length = 235 Score = 477 bits (1228), Expect = e-133, Method: Compositional matrix adjust. Identities = 235/235 (100%), Positives = 235/235 (100%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY Sbjct: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT Sbjct: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ Sbjct: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 Query: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL 235 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL Sbjct: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL 235 >UniRef50_C8TGA5 Conserved predicted protein n=32 Tax=Enterobacteriaceae RepID=C8TGA5_ECO26 Length = 143 Score = 281 bits (720), Expect = 8e-75, Method: Compositional matrix adjust. Identities = 136/143 (95%), Positives = 138/143 (96%) Query: 93 MINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKS 152 MINDLKSILLK SEEVDVFIKIFESW KLP+ISGPVNL+IPT FKDK LEVESYFVDKS Sbjct: 1 MINDLKSILLKSSEEVDVFIKIFESWRKKLPAISGPVNLYIPTRFKDKYLEVESYFVDKS 60 Query: 153 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL 212 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL Sbjct: 61 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL 120 Query: 213 VEKMFETHSLDMNNSVLASPEDL 235 VEKMFETHSLDMNNSVLASPEDL Sbjct: 121 VEKMFETHSLDMNNSVLASPEDL 143 >UniRef50_P58654 Oxygen-regulated invasion protein orgB n=30 Tax=Salmonella enterica RepID=ORGB_SALTY Length = 226 Score = 55.8 bits (133), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 40/190 (21%), Positives = 84/190 (44%), Gaps = 8/190 (4%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M K I + SP G++IKRK + I L+++ ++ EE+ + + Y Sbjct: 1 MLKNIPIPSPLSPVEGILIKRKTLERYFSIERLEQQAHQRAKRILREAEEEAKTLRMYAY 60 Query: 61 YDGYTKGIIDEMDNFIPLIS----LLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFE 116 +GY +G+ID + ++ + +EK +I + + P + + + + Sbjct: 61 QEGYEQGMIDALQQVAAYLTDNQTMAWKWMEKIQIYARELFSAAVDHP----ETLLTVLD 116 Query: 117 SWVTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAE 176 W+ G + L +P + K ++ ++ ++ +H ++RF+ IAE Sbjct: 117 EWLRDFDKPEGQLFLTLPVNAKKDHQKLMVLLMENWPGTFNLKYHQEQRFIMSCGDQIAE 176 Query: 177 FSPQEFVDNC 186 FSP++FV+ Sbjct: 177 FSPEQFVETA 186 >UniRef50_Q6R8A1 OrgAb n=2 Tax=Sodalis glossinidius RepID=Q6R8A1_SODGL Length = 215 Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust. Identities = 44/211 (20%), Positives = 95/211 (45%), Gaps = 23/211 (10%) Query: 18 VIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIP 77 +IKR+ + + L ++ E ++ E+ E+ Q +GY +G++ +D Sbjct: 1 MIKREHLQRQRCAMDLVKQARREAVKCLKQATEEAEQLRNQARNEGYQQGVLAAVDT--- 57 Query: 78 LISLLCSELEKKRINMINDL----KSILLKPSEEVDVFIKIFESWVTK--LPSISGPVNL 131 ++ +E +K ++ ++ +++L+ D+ + + E W+ + PSI P+ L Sbjct: 58 -VAGFFAERQKLIFSLQREVEEHARTLLMTALSHTDLLLLLLEDWLAQQPAPSIPAPLEL 116 Query: 132 HIPTSFKDKSLEVESYFVDKSIWN--VHITFHDDKRFVF-FTDQFIAEFSPQEFVDNCEQ 188 +P + +L + ++W+ I HD + F+ + DQ +AEF F+D + Sbjct: 117 WVPADRRAAALRLNRQI--GALWSGKYDIITHDGESFIMKYGDQ-VAEFDAGAFIDAATR 173 Query: 189 YLINNHCFSPDK---VNEICEQARHYLVEKM 216 L + PD + EQ LV+++ Sbjct: 174 QLTSR----PDYETLARHLSEQGLQALVKRL 200 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46793 Putative uncharacterized protein ygeN n=4 Tax=Es... 376 e-103 UniRef50_P58654 Oxygen-regulated invasion protein orgB n=30 Tax=... 269 5e-71 UniRef50_C8TGA5 Conserved predicted protein n=32 Tax=Enterobacte... 247 2e-64 Sequences not found previously or not previously below threshold: UniRef50_Q7NVC5 Probable oxygen-regulated invasion protein-cell ... 110 5e-23 UniRef50_Q6R8A1 OrgAb n=2 Tax=Sodalis glossinidius RepID=Q6R8A1_... 86 1e-15 UniRef50_C4K5N0 Oxygen-regulated invasion protein n=1 Tax=Candid... 76 1e-12 UniRef50_Q3YTN9 MxiN n=9 Tax=Shigella RepID=Q3YTN9_SHISS 63 7e-09 UniRef50_A2S1M7 Type III secretion apparatus protein, HrpE/YscL ... 61 3e-08 UniRef50_UPI00016A643C type III secretion apparatus protein, Hrp... 52 2e-05 UniRef50_B1FB92 Type III secretion apparatus protein, HrpE/YscL ... 50 7e-05 UniRef50_B6XHF3 Putative uncharacterized protein n=2 Tax=Provide... 49 1e-04 UniRef50_D2UDE1 Probable oxygen-regulated invasion protein orgb ... 44 0.005 UniRef50_C2LLW0 Type III secretion system protein (Oxygen-regula... 43 0.009 UniRef50_D2TZU3 Putative uncharacterized protein n=1 Tax=Arsenop... 41 0.037 >UniRef50_Q46793 Putative uncharacterized protein ygeN n=4 Tax=Escherichia coli RepID=YGEN_ECOLI Length = 235 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 235/235 (100%), Positives = 235/235 (100%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY Sbjct: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT Sbjct: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ Sbjct: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 Query: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL 235 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL Sbjct: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL 235 >UniRef50_P58654 Oxygen-regulated invasion protein orgB n=30 Tax=Salmonella enterica RepID=ORGB_SALTY Length = 226 Score = 269 bits (688), Expect = 5e-71, Method: Composition-based stats. Identities = 44/219 (20%), Positives = 96/219 (43%), Gaps = 8/219 (3%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M K I + SP G++IKRK + I L+++ ++ EE+ + + Y Sbjct: 1 MLKNIPIPSPLSPVEGILIKRKTLERYFSIERLEQQAHQRAKRILREAEEEAKTLRMYAY 60 Query: 61 YDGYTKGIIDEMDNFIPLIS----LLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFE 116 +GY +G+ID + ++ + +EK +I + + P + + + + Sbjct: 61 QEGYEQGMIDALQQVAAYLTDNQTMAWKWMEKIQIYARELFSAAVDHP----ETLLTVLD 116 Query: 117 SWVTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAE 176 W+ G + L +P + K ++ ++ ++ +H ++RF+ IAE Sbjct: 117 EWLRDFDKPEGQLFLTLPVNAKKDHQKLMVLLMENWPGTFNLKYHQEQRFIMSCGDQIAE 176 Query: 177 FSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEK 215 FSP++FV+ + ++ P I + A + L+++ Sbjct: 177 FSPEQFVETAVGVIKHHLDELPQDCRTISDNAINALIDE 215 >UniRef50_C8TGA5 Conserved predicted protein n=32 Tax=Enterobacteriaceae RepID=C8TGA5_ECO26 Length = 143 Score = 247 bits (630), Expect = 2e-64, Method: Composition-based stats. Identities = 136/143 (95%), Positives = 138/143 (96%) Query: 93 MINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKS 152 MINDLKSILLK SEEVDVFIKIFESW KLP+ISGPVNL+IPT FKDK LEVESYFVDKS Sbjct: 1 MINDLKSILLKSSEEVDVFIKIFESWRKKLPAISGPVNLYIPTRFKDKYLEVESYFVDKS 60 Query: 153 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL 212 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL Sbjct: 61 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL 120 Query: 213 VEKMFETHSLDMNNSVLASPEDL 235 VEKMFETHSLDMNNSVLASPEDL Sbjct: 121 VEKMFETHSLDMNNSVLASPEDL 143 >UniRef50_Q7NVC5 Probable oxygen-regulated invasion protein-cell invasion protein n=1 Tax=Chromobacterium violaceum RepID=Q7NVC5_CHRVO Length = 217 Score = 110 bits (274), Expect = 5e-23, Method: Composition-based stats. Identities = 33/176 (18%), Positives = 68/176 (38%), Gaps = 10/176 (5%) Query: 17 VVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFI 76 +++ R+ + VSL+ + ++ E++ E + DGY G+I + Sbjct: 1 MLVCREQLLRHDKAVSLEREARRHAARIVRQAEQEAESLRGRALLDGYGDGMIQALGQLA 60 Query: 77 PLIS----LLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLH 132 ++ LL E+ L + + P D + + W+ + LH Sbjct: 61 RHLANGDALLRHCRERLEAESRAMLSAAVDHP----DALLLALDEWLRERRREDESRTLH 116 Query: 133 I--PTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNC 186 + P + E+ + + + + +H D RFV + AEFSP+ +V+ Sbjct: 117 LLLPQQARASQAELMALLAESWGGRLSLDYHADSRFVMCCGEQAAEFSPELYVEPA 172 >UniRef50_Q6R8A1 OrgAb n=2 Tax=Sodalis glossinidius RepID=Q6R8A1_SODGL Length = 215 Score = 85.5 bits (210), Expect = 1e-15, Method: Composition-based stats. Identities = 38/204 (18%), Positives = 82/204 (40%), Gaps = 9/204 (4%) Query: 18 VIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIP 77 +IKR+ + + L ++ E ++ E+ E+ Q +GY +G++ +D Sbjct: 1 MIKREHLQRQRCAMDLVKQARREAVKCLKQATEEAEQLRNQARNEGYQQGVLAAVDTVAG 60 Query: 78 LISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSIS--GPVNLHIPT 135 + + + + +++L+ D+ + + E W+ + P+ S P+ L +P Sbjct: 61 FFAERQKLIFSLQREVEEHARTLLMTALSHTDLLLLLLEDWLAQQPAPSIPAPLELWVPA 120 Query: 136 SFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHC 195 + +L + I HD + F+ +AEF F+D + L + Sbjct: 121 DRRAAALRLNRQIGALWSGKYDIITHDGESFIMKYGDQVAEFDAGAFIDAATRQLTSR-- 178 Query: 196 FSPDK---VNEICEQARHYLVEKM 216 PD + EQ LV+++ Sbjct: 179 --PDYETLARHLSEQGLQALVKRL 200 >UniRef50_C4K5N0 Oxygen-regulated invasion protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N0_HAMD5 Length = 155 Score = 75.9 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 26/128 (20%), Positives = 51/128 (39%), Gaps = 8/128 (6%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M KKI S GV++ K ++ + I+ EE E+ Q Y Sbjct: 1 MSKKIPNLSPLSAREGVLLTYAQIQQHKRAKNILNEAYRNAKKIIRQAEEDAEKIQKQAY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRI----NMINDLKSILLKPSEEVDVFIKIFE 116 DGY +G++ + + + ++ + + + + + + L P D+ I + + Sbjct: 61 MDGYQQGVMSAIKHIVGYMNNTKNLMHQLNLDLEDHARQIFSTALDHP----DILIVLLD 116 Query: 117 SWVTKLPS 124 W+ LPS Sbjct: 117 EWLNTLPS 124 >UniRef50_Q3YTN9 MxiN n=9 Tax=Shigella RepID=Q3YTN9_SHISS Length = 231 Score = 63.2 bits (152), Expect = 7e-09, Method: Composition-based stats. Identities = 37/173 (21%), Positives = 71/173 (41%), Gaps = 7/173 (4%) Query: 15 NGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDN 74 +GVVIKR + K I + + I+ +K E I DGY GI ++ Sbjct: 20 DGVVIKRIEKELCKTIKDRDTESKKKAICVIKDATKKAESLRIDAVCDGYQIGIQTAFEH 79 Query: 75 FIPLISLLCSELEKKRINMIND---LKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNL 131 I I C K+ N N + S+L + + + + E W++ L + + + Sbjct: 80 IIDYI---CEWKLKQNENRRNIEDYITSLLSENLHDERIISTLLEQWLSSLRNTVTELKV 136 Query: 132 HIPTSFKDKSLEVESYFVD-KSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFV 183 +P ++E +S + + + + ++F + + EFSPQ+ + Sbjct: 137 VLPKCNLALRKKLELDLHKYRSDVKIILKYSEGNNYIFCSGNQVVEFSPQDVI 189 >UniRef50_A2S1M7 Type III secretion apparatus protein, HrpE/YscL family n=56 Tax=pseudomallei group RepID=A2S1M7_BURM9 Length = 264 Score = 61.2 bits (147), Expect = 3e-08, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 64/176 (36%), Gaps = 2/176 (1%) Query: 6 EMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYT 65 + +P GV++KR L ++ ++ E + + GY Sbjct: 8 QFPDTLAPLGGVLLKRAPLSQAARAERLLDEARRRAQRLVRDAEREADACRAHAATAGYE 67 Query: 66 KGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWV-TKLPS 124 G + + + ++ +++D++ L ++ D+ ++I + + + Sbjct: 68 AGFARAIAELAAGVERIDAQRATLLERVVDDVRRSLEHLLDDPDLLLRIVNALASRRACA 127 Query: 125 ISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 P+ + +P K + + D + + D + FV + + I EF P+ Sbjct: 128 TDRPLRVSVPPHAKRIAPAIRERLNDAYP-SAQVVVADTRTFVVESGEDILEFDPR 182 >UniRef50_UPI00016A643C type III secretion apparatus protein, HrpE/YscL family n=2 Tax=pseudomallei group RepID=UPI00016A643C Length = 203 Score = 52.0 bits (123), Expect = 2e-05, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 60/165 (36%), Gaps = 2/165 (1%) Query: 17 VVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFI 76 +++KR L ++ ++ E + + GY G + Sbjct: 1 MLLKRAPLSQAARADRLLDEARRRAQRLVRDAESEADACRAHAATAGYEAGFARAIAEVA 60 Query: 77 PLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWV-TKLPSISGPVNLHIPT 135 I + + ++++++ L ++ D+ ++I + + P+ + +PT Sbjct: 61 ACIERIDMQRATLLERVVDNVRCSLEHLLDDPDLLLRIVNALASRHACAADRPLRVSVPT 120 Query: 136 SFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 K + + D + + D + FV +D+ I EF P+ Sbjct: 121 HAKRIAPAIRERLNDAYP-SAQVVVADTRTFVVESDEDILEFDPR 164 >UniRef50_B1FB92 Type III secretion apparatus protein, HrpE/YscL family n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FB92_9BURK Length = 228 Score = 50.1 bits (118), Expect = 7e-05, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 75/222 (33%), Gaps = 9/222 (4%) Query: 14 ANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMD 73 + V+ R K + +L + + I E Y DGY G+ Sbjct: 15 VDQAVVARSSLVRAKRVANLLDDAQRQAKHIISRANTGAEACRAHAYADGYEAGLAQAAV 74 Query: 74 NFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISG-PVNLH 132 + S K ++N ++ + +E + +++ ++ + + P+ + Sbjct: 75 IAARYFAQCESLQVKLYDQVVNAVQQTMALHLDEPEWLLRVTAAFARQRETYRPLPMRIA 134 Query: 133 IPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLIN 192 +P K + + +++ + I + D RF+ I EF Q N Sbjct: 135 LPPDAKGAAPALRRQL-EQAGVSADIAYGDTPRFIIEWGNEIVEFDAQAVARTISDAAFN 193 Query: 193 NHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPED 234 P + R +L + H LD V+A P++ Sbjct: 194 EADRPPTPALSV---TRSFLESVL---HELDA-TPVVAQPKE 228 >UniRef50_B6XHF3 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XHF3_9ENTR Length = 214 Score = 48.9 bits (115), Expect = 1e-04, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 70/192 (36%), Gaps = 9/192 (4%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M + + ++ ++ V+IK + + S+ + + IQ ++ E + Y Sbjct: 1 MSENTQPNIQQTIIEDVLIKHHALNSNQYAQSVVKYAQRKAKEHIQHAQQAEETLYTAAY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 GY G+ + +FI ++ + + L +L ++ +I + T Sbjct: 61 QHGYQDGLQQLLQDFIAVLEVSEKHYQYTVNKTEERLTKVLTHLFADLR-LQEIIAEYFT 119 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 +L + LH+PT + K + K I + I + I FSP Sbjct: 120 QLQPETAKTQLHLPTELQTK--------LGKKINGIKILNSQNNHISLEVGNKITHFSPD 171 Query: 181 EFVDNCEQYLIN 192 N + +++ Sbjct: 172 VAAKNIQPQILS 183 >UniRef50_D2UDE1 Probable oxygen-regulated invasion protein orgb n=1 Tax=Xanthomonas albilineans RepID=D2UDE1_XANAL Length = 220 Score = 43.9 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 39/213 (18%), Positives = 74/213 (34%), Gaps = 4/213 (1%) Query: 2 RKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYY 61 +K+ S A VV + +L++ I+ E ++ Q + Sbjct: 1 MQKVYNKAPDSEAQ-VVYRHAALHRANRRHALEQAAKKSARKIIEDAEHYAHTQYEQARF 59 Query: 62 DGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTK 121 +GY GI +D I + L + + + K E+ + E ++ Sbjct: 60 EGYRDGIRLFVDTLIDETTHLSQTYALQLEQERGAIAHNVRKLFEDSKTATVLIEEYLDT 119 Query: 122 LPSISGP-VNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSP- 179 + S V ++IP K L ++ + + + +T + FV QF F+P Sbjct: 120 QDADSDRSVTVYIPKWCKLSQLSID-HLQTNNGRRITLTTSPNDHFVISNGQFSVSFAPF 178 Query: 180 QEFVDNCEQYLINNHCFSPDKVNEICEQARHYL 212 D C + + + PD I E Y+ Sbjct: 179 SASTDICSRAVQRHQQRVPDTTANIVEALLSYV 211 >UniRef50_C2LLW0 Type III secretion system protein (Oxygen-regulated invasion protein) n=2 Tax=Proteus mirabilis RepID=C2LLW0_PROMI Length = 230 Score = 42.7 bits (99), Expect = 0.009, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 73/185 (39%), Gaps = 7/185 (3%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M + + +L+K GV+I+ L I + K ++ + Sbjct: 1 MLRLLPDNLLKYTCEGVLIRAHYIRQLDNIHKTTLATKQAAKKMLHHFNHKLDKLRNKIA 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 + Y KG+ + + I + + L + + + + ++ +K+ + ++ Sbjct: 61 NEAYAKGLQVLLADIIRFSIEYQEKFVQYEFQQREQLVATIGEFLDSPEIQVKLTQYLMS 120 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 +P + V L IPT+ + + E E +D S N+ + H++K T I F P Sbjct: 121 SVP-LEQKVTLDIPTTLQ-RYFESE---LDNS--NIKLNCHNNKTIAIHTGDQITFFDPA 173 Query: 181 EFVDN 185 F+++ Sbjct: 174 IFLND 178 >UniRef50_D2TZU3 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TZU3_9ENTR Length = 224 Score = 40.8 bits (94), Expect = 0.037, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 76/195 (38%), Gaps = 13/195 (6%) Query: 14 ANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMD 73 + V+I+ K L + + I + + ++E+ F Q GY +G I++ Sbjct: 5 IDNVLIRAKFKQSHDYHERLINQAKKQAKKIIIAAQTEKEKIFNQAKIHGYCQG-INQTS 63 Query: 74 NFIPLISLLCSELEKKRIN-MINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLH 132 + L +L N +I ++ L++ V ++ + + K+ + ++ Sbjct: 64 TALALFIQAYQDLNHTLYNDIIKQIEQTLIQFLLTEPVIEQLLLNLMAKID-LQQQSKIY 122 Query: 133 IPTSFKDKSLEVESYFVDKS---IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQY 189 +P ++ + F ++ N+ +T H + V I FSP++ V + Sbjct: 123 LP-------EKLATAFSERWSDKFSNLKLTSHKENGIVLEMGNEILYFSPEKSVTEMMKN 175 Query: 190 LINNHCFSPDKVNEI 204 +I + + +I Sbjct: 176 IIADQQIAASNQQQI 190 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46793 Putative uncharacterized protein ygeN n=4 Tax=Es... 302 5e-81 UniRef50_P58654 Oxygen-regulated invasion protein orgB n=30 Tax=... 244 2e-63 UniRef50_B1FB92 Type III secretion apparatus protein, HrpE/YscL ... 210 3e-53 UniRef50_C8TGA5 Conserved predicted protein n=32 Tax=Enterobacte... 203 5e-51 UniRef50_Q6R8A1 OrgAb n=2 Tax=Sodalis glossinidius RepID=Q6R8A1_... 190 2e-47 UniRef50_Q3YTN9 MxiN n=9 Tax=Shigella RepID=Q3YTN9_SHISS 174 3e-42 UniRef50_Q7NVC5 Probable oxygen-regulated invasion protein-cell ... 173 4e-42 UniRef50_A2S1M7 Type III secretion apparatus protein, HrpE/YscL ... 167 3e-40 UniRef50_B6XHF3 Putative uncharacterized protein n=2 Tax=Provide... 166 5e-40 UniRef50_UPI00016A643C type III secretion apparatus protein, Hrp... 148 1e-34 UniRef50_C4K5N0 Oxygen-regulated invasion protein n=1 Tax=Candid... 142 1e-32 Sequences not found previously or not previously below threshold: UniRef50_UPI000197BD3F hypothetical protein PretD1_16275 n=1 Tax... 96 1e-18 UniRef50_Q9KKI5 Orf4 n=5 Tax=Yersinia RepID=Q9KKI5_YEREN 94 3e-18 UniRef50_B2Q5I0 Putative uncharacterized protein n=1 Tax=Provide... 93 9e-18 UniRef50_D2UDE1 Probable oxygen-regulated invasion protein orgb ... 83 7e-15 UniRef50_D2TZU3 Putative uncharacterized protein n=1 Tax=Arsenop... 76 1e-12 UniRef50_C2LLW0 Type III secretion system protein (Oxygen-regula... 71 3e-11 UniRef50_C0AR80 Putative uncharacterized protein n=1 Tax=Proteus... 58 3e-07 UniRef50_C4UMF0 Putative uncharacterized protein n=1 Tax=Yersini... 56 8e-07 UniRef50_D1U7K9 H+transporting two-sector ATPase E subunit n=1 T... 49 9e-05 UniRef50_B6QWW9 Type III secretion apparatus protein, HrpE/YscL ... 48 2e-04 UniRef50_A8RWD9 Putative uncharacterized protein n=3 Tax=Clostri... 47 4e-04 UniRef50_C7E4S1 Putative uncharacterized protein psa2 n=1 Tax=Pa... 47 4e-04 UniRef50_B2VID5 Putative uncharacterized protein n=1 Tax=Erwinia... 47 4e-04 UniRef50_Q11M64 Putative uncharacterized protein n=1 Tax=Chelati... 47 5e-04 UniRef50_B3RAD0 SctL: non flagellar T3S system conserved protein... 45 0.002 UniRef50_C0QV74 V-type ATP synthase subunit E n=2 Tax=Brachyspir... 45 0.002 UniRef50_A4XIX7 H+-transporting two-sector ATPase, E subunit n=2... 45 0.002 UniRef50_C4K5M9 Putative uncharacterized protein n=1 Tax=Candida... 44 0.003 UniRef50_Q3BYK1 HrcL protein n=12 Tax=Xanthomonas RepID=Q3BYK1_X... 43 0.007 UniRef50_Q3ADE2 Flagellar protein n=1 Tax=Carboxydothermus hydro... 43 0.012 UniRef50_Q1D9L2 FliH family protein n=2 Tax=Cystobacterineae Rep... 42 0.018 UniRef50_Q2SH65 Flagellar biosynthesis/type III secretory pathwa... 41 0.025 UniRef50_Q2B7F5 Flagellar assembly protein H n=1 Tax=Bacillus sp... 41 0.034 UniRef50_C2EDB0 Flagellar biosynthesis/type III secretory pathwa... 40 0.064 UniRef50_Q1MQ18 Flagellar biosynthesis/type III secretory pathwa... 40 0.066 UniRef50_B7ATQ9 Putative uncharacterized protein n=1 Tax=Bactero... 40 0.075 UniRef50_A6CES2 Flagellar assembly protein fliH, putative n=1 Ta... 40 0.076 UniRef50_D1RAU8 V-type ATP synthase subunit E n=1 Tax=Parachlamy... 39 0.085 >UniRef50_Q46793 Putative uncharacterized protein ygeN n=4 Tax=Escherichia coli RepID=YGEN_ECOLI Length = 235 Score = 302 bits (774), Expect = 5e-81, Method: Composition-based stats. Identities = 235/235 (100%), Positives = 235/235 (100%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY Sbjct: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT Sbjct: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ Sbjct: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 Query: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL 235 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL Sbjct: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPEDL 235 >UniRef50_P58654 Oxygen-regulated invasion protein orgB n=30 Tax=Salmonella enterica RepID=ORGB_SALTY Length = 226 Score = 244 bits (623), Expect = 2e-63, Method: Composition-based stats. Identities = 44/219 (20%), Positives = 96/219 (43%), Gaps = 8/219 (3%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M K I + SP G++IKRK + I L+++ ++ EE+ + + Y Sbjct: 1 MLKNIPIPSPLSPVEGILIKRKTLERYFSIERLEQQAHQRAKRILREAEEEAKTLRMYAY 60 Query: 61 YDGYTKGIIDEMDNFIPLIS----LLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFE 116 +GY +G+ID + ++ + +EK +I + + P + + + + Sbjct: 61 QEGYEQGMIDALQQVAAYLTDNQTMAWKWMEKIQIYARELFSAAVDHP----ETLLTVLD 116 Query: 117 SWVTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAE 176 W+ G + L +P + K ++ ++ ++ +H ++RF+ IAE Sbjct: 117 EWLRDFDKPEGQLFLTLPVNAKKDHQKLMVLLMENWPGTFNLKYHQEQRFIMSCGDQIAE 176 Query: 177 FSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEK 215 FSP++FV+ + ++ P I + A + L+++ Sbjct: 177 FSPEQFVETAVGVIKHHLDELPQDCRTISDNAINALIDE 215 >UniRef50_B1FB92 Type III secretion apparatus protein, HrpE/YscL family n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FB92_9BURK Length = 228 Score = 210 bits (534), Expect = 3e-53, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 75/225 (33%), Gaps = 9/225 (4%) Query: 11 KSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIID 70 + V+ R K + +L + + I E Y DGY G+ Sbjct: 12 LPLVDQAVVARSSLVRAKRVANLLDDAQRQAKHIISRANTGAEACRAHAYADGYEAGLAQ 71 Query: 71 EMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISG-PV 129 + S K ++N ++ + +E + +++ ++ + + P+ Sbjct: 72 AAVIAARYFAQCESLQVKLYDQVVNAVQQTMALHLDEPEWLLRVTAAFARQRETYRPLPM 131 Query: 130 NLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQY 189 + +P K + + +++ + I + D RF+ I EF Q Sbjct: 132 RIALPPDAKGAAPALRRQL-EQAGVSADIAYGDTPRFIIEWGNEIVEFDAQAVARTISDA 190 Query: 190 LINNHCFSPDKVNEICEQARHYLVEKMFETHSLDMNNSVLASPED 234 N P + R +L + H LD V+A P++ Sbjct: 191 AFNEADRPPTPALSV---TRSFLESVL---HELDA-TPVVAQPKE 228 >UniRef50_C8TGA5 Conserved predicted protein n=32 Tax=Enterobacteriaceae RepID=C8TGA5_ECO26 Length = 143 Score = 203 bits (515), Expect = 5e-51, Method: Composition-based stats. Identities = 136/143 (95%), Positives = 138/143 (96%) Query: 93 MINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKS 152 MINDLKSILLK SEEVDVFIKIFESW KLP+ISGPVNL+IPT FKDK LEVESYFVDKS Sbjct: 1 MINDLKSILLKSSEEVDVFIKIFESWRKKLPAISGPVNLYIPTRFKDKYLEVESYFVDKS 60 Query: 153 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL 212 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL Sbjct: 61 IWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQARHYL 120 Query: 213 VEKMFETHSLDMNNSVLASPEDL 235 VEKMFETHSLDMNNSVLASPEDL Sbjct: 121 VEKMFETHSLDMNNSVLASPEDL 143 >UniRef50_Q6R8A1 OrgAb n=2 Tax=Sodalis glossinidius RepID=Q6R8A1_SODGL Length = 215 Score = 190 bits (483), Expect = 2e-47, Method: Composition-based stats. Identities = 36/201 (17%), Positives = 81/201 (40%), Gaps = 3/201 (1%) Query: 18 VIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIP 77 +IKR+ + + L ++ E ++ E+ E+ Q +GY +G++ +D Sbjct: 1 MIKREHLQRQRCAMDLVKQARREAVKCLKQATEEAEQLRNQARNEGYQQGVLAAVDTVAG 60 Query: 78 LISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSIS--GPVNLHIPT 135 + + + + +++L+ D+ + + E W+ + P+ S P+ L +P Sbjct: 61 FFAERQKLIFSLQREVEEHARTLLMTALSHTDLLLLLLEDWLAQQPAPSIPAPLELWVPA 120 Query: 136 SFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHC 195 + +L + I HD + F+ +AEF F+D + L + Sbjct: 121 DRRAAALRLNRQIGALWSGKYDIITHDGESFIMKYGDQVAEFDAGAFIDAATRQLTSRPD 180 Query: 196 FSPDKVNEICEQARHYLVEKM 216 + + EQ LV+++ Sbjct: 181 YE-TLARHLSEQGLQALVKRL 200 >UniRef50_Q3YTN9 MxiN n=9 Tax=Shigella RepID=Q3YTN9_SHISS Length = 231 Score = 174 bits (440), Expect = 3e-42, Method: Composition-based stats. Identities = 39/187 (20%), Positives = 74/187 (39%), Gaps = 7/187 (3%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M+K +GVVIKR + K I + + I+ +K E I Sbjct: 6 MQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVIKDATKKAESLRIDAV 65 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMIND---LKSILLKPSEEVDVFIKIFES 117 DGY GI ++ I I C K+ N N + S+L + + + + E Sbjct: 66 CDGYQIGIQTAFEHIIDYI---CEWKLKQNENRRNIEDYITSLLSENLHDERIISTLLEQ 122 Query: 118 WVTKLPSISGPVNLHIPTSFKDKSLEVESYFVD-KSIWNVHITFHDDKRFVFFTDQFIAE 176 W++ L + + + +P ++E +S + + + + ++F + + E Sbjct: 123 WLSSLRNTVTELKVVLPKCNLALRKKLELDLHKYRSDVKIILKYSEGNNYIFCSGNQVVE 182 Query: 177 FSPQEFV 183 FSPQ+ + Sbjct: 183 FSPQDVI 189 >UniRef50_Q7NVC5 Probable oxygen-regulated invasion protein-cell invasion protein n=1 Tax=Chromobacterium violaceum RepID=Q7NVC5_CHRVO Length = 217 Score = 173 bits (439), Expect = 4e-42, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 79/202 (39%), Gaps = 2/202 (0%) Query: 17 VVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFI 76 +++ R+ + VSL+ + ++ E++ E + DGY G+I + Sbjct: 1 MLVCREQLLRHDKAVSLEREARRHAARIVRQAEQEAESLRGRALLDGYGDGMIQALGQLA 60 Query: 77 PLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLH--IP 134 ++ + L R + + +++L + D + + W+ + LH +P Sbjct: 61 RHLANGDALLRHCRERLEAESRAMLSAAVDHPDALLLALDEWLRERRREDESRTLHLLLP 120 Query: 135 TSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNH 194 + E+ + + + + +H D RFV + AEFSP+ +V+ + Sbjct: 121 QQARASQAELMALLAESWGGRLSLDYHADSRFVMCCGEQAAEFSPELYVEPASSQAMQAL 180 Query: 195 CFSPDKVNEICEQARHYLVEKM 216 P + + A L +++ Sbjct: 181 GDLPSRCRGLSAAALSALRDEL 202 >UniRef50_A2S1M7 Type III secretion apparatus protein, HrpE/YscL family n=56 Tax=pseudomallei group RepID=A2S1M7_BURM9 Length = 264 Score = 167 bits (423), Expect = 3e-40, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 64/181 (35%), Gaps = 2/181 (1%) Query: 6 EMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYT 65 + +P GV++KR L ++ ++ E + + GY Sbjct: 8 QFPDTLAPLGGVLLKRAPLSQAARAERLLDEARRRAQRLVRDAEREADACRAHAATAGYE 67 Query: 66 KGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWV-TKLPS 124 G + + + ++ +++D++ L ++ D+ ++I + + + Sbjct: 68 AGFARAIAELAAGVERIDAQRATLLERVVDDVRRSLEHLLDDPDLLLRIVNALASRRACA 127 Query: 125 ISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVD 184 P+ + +P K + + D + + D + FV + + I EF P+ Sbjct: 128 TDRPLRVSVPPHAKRIAPAIRERLNDAYP-SAQVVVADTRTFVVESGEDILEFDPRAVAR 186 Query: 185 N 185 Sbjct: 187 A 187 >UniRef50_B6XHF3 Putative uncharacterized protein n=2 Tax=Providencia RepID=B6XHF3_9ENTR Length = 214 Score = 166 bits (420), Expect = 5e-40, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 70/192 (36%), Gaps = 9/192 (4%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M + + ++ ++ V+IK + + S+ + + IQ ++ E + Y Sbjct: 1 MSENTQPNIQQTIIEDVLIKHHALNSNQYAQSVVKYAQRKAKEHIQHAQQAEETLYTAAY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 GY G+ + +FI ++ + + L +L ++ +I + T Sbjct: 61 QHGYQDGLQQLLQDFIAVLEVSEKHYQYTVNKTEERLTKVLTHLFADLR-LQEIIAEYFT 119 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 +L + LH+PT + K + K I + I + I FSP Sbjct: 120 QLQPETAKTQLHLPTELQTK--------LGKKINGIKILNSQNNHISLEVGNKITHFSPD 171 Query: 181 EFVDNCEQYLIN 192 N + +++ Sbjct: 172 VAAKNIQPQILS 183 >UniRef50_UPI00016A643C type III secretion apparatus protein, HrpE/YscL family n=2 Tax=pseudomallei group RepID=UPI00016A643C Length = 203 Score = 148 bits (374), Expect = 1e-34, Method: Composition-based stats. Identities = 24/173 (13%), Positives = 60/173 (34%), Gaps = 2/173 (1%) Query: 17 VVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFI 76 +++KR L ++ ++ E + + GY G + Sbjct: 1 MLLKRAPLSQAARADRLLDEARRRAQRLVRDAESEADACRAHAATAGYEAGFARAIAEVA 60 Query: 77 PLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWV-TKLPSISGPVNLHIPT 135 I + + ++++++ L ++ D+ ++I + + P+ + +PT Sbjct: 61 ACIERIDMQRATLLERVVDNVRCSLEHLLDDPDLLLRIVNALASRHACAADRPLRVSVPT 120 Query: 136 SFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQ 188 K + + D + + D + FV +D+ I EF P+ Sbjct: 121 HAKRIAPAIRERLNDAYP-SAQVVVADTRTFVVESDEDILEFDPRVIAHALGD 172 >UniRef50_C4K5N0 Oxygen-regulated invasion protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N0_HAMD5 Length = 155 Score = 142 bits (357), Expect = 1e-32, Method: Composition-based stats. Identities = 25/127 (19%), Positives = 54/127 (42%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M KKI S GV++ K ++ + I+ EE E+ Q Y Sbjct: 1 MSKKIPNLSPLSAREGVLLTYAQIQQHKRAKNILNEAYRNAKKIIRQAEEDAEKIQKQAY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 DGY +G++ + + + ++ + + + +++ + + I + D+ I + + W+ Sbjct: 61 MDGYQQGVMSAIKHIVGYMNNTKNLMHQLNLDLEDHARQIFSTALDHPDILIVLLDEWLN 120 Query: 121 KLPSISG 127 LPS + Sbjct: 121 TLPSQNE 127 >UniRef50_UPI000197BD3F hypothetical protein PretD1_16275 n=1 Tax=Providencia rettgeri DSM 1131 RepID=UPI000197BD3F Length = 216 Score = 95.7 bits (236), Expect = 1e-18, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 70/194 (36%), Gaps = 13/194 (6%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M +K ++ ++ + V+IK + + + ++ + + + ++++E + Y Sbjct: 1 MSEKTPPNVKQTMSEKVLIKYQALSADRYHTIITQQAQKKASELYKQAKQQQETIYQTAY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 GY GI + +FI + + ++ L IL + + + ++ Sbjct: 61 QQGYNDGIKQLLTDFIHAVETSEIQYQENISQSKEQLMKILNDIFGD-NHLQETVATYFE 119 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFH--DDKRFVFFTDQFIAEFS 178 + + + LH+P + + +I + F D D I FS Sbjct: 120 RQCAKTTNTTLHLPEKMQTRI----------TIGGSDMKFSTTADNTIALEVDNRITYFS 169 Query: 179 PQEFVDNCEQYLIN 192 P N ++ + Sbjct: 170 PVIASKNIFPHVFS 183 >UniRef50_Q9KKI5 Orf4 n=5 Tax=Yersinia RepID=Q9KKI5_YEREN Length = 231 Score = 94.1 bits (232), Expect = 3e-18, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 84/218 (38%), Gaps = 12/218 (5%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M ++ E + ++ V+IK ++ + +L + + + Q +++ EE Y Sbjct: 1 MSQQSEPDIRQTALEKVLIKHQVMKASRYEKTLTAQAKKQASEHCQQAQQEAEEIQRIAY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 GY G+ + + + + + ++ + L+++L + + ++ + + ++ Sbjct: 61 QQGYQDGLQKLLADLLLGLESSQRQYQQTLASSEARLQTLLEEMFSDPRMYEIVSDHFIR 120 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 + S S + +H+P S K + ++ + ++ + I FSP+ Sbjct: 121 Q-HSESLQIQIHLPPSLLKKLKPTLAEMQ-----HITVLGGAEESIALEVNNEILHFSPE 174 Query: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYLVEKMFE 218 ++++ Q+R L K+ E Sbjct: 175 SAAQRTLPHILS------LPARCTILQSRKALYSKLSE 206 >UniRef50_B2Q5I0 Putative uncharacterized protein n=1 Tax=Providencia stuartii ATCC 25827 RepID=B2Q5I0_PROST Length = 216 Score = 92.6 bits (228), Expect = 9e-18, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 69/192 (35%), Gaps = 10/192 (5%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M +KI+ + ++ V+IK + + + ++++ + T + Q +++ E Y Sbjct: 1 MSEKIQPDIQQTACEKVLIKHHATRAFRYVKQIEQQAQQQATERYQQAQQQVEHIHKIAY 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 GY+ G+ + I I + ++ L L + +I ++ Sbjct: 61 QQGYSDGLKQLLSKLIESIEISEKCYQQAVNQSQETLLQQLSALFHDPH-LQEIIAKYLI 119 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 + P+ L++P ++ +E+ + I + + FSP Sbjct: 120 QQQ-ERRPITLYLPDRLQNMFNHIEA--------GLKILPSVEGNIALEIGNEVIHFSPT 170 Query: 181 EFVDNCEQYLIN 192 + +++ Sbjct: 171 IAAEQTLPQILS 182 >UniRef50_D2UDE1 Probable oxygen-regulated invasion protein orgb n=1 Tax=Xanthomonas albilineans RepID=D2UDE1_XANAL Length = 220 Score = 83.0 bits (203), Expect = 7e-15, Method: Composition-based stats. Identities = 35/201 (17%), Positives = 69/201 (34%), Gaps = 3/201 (1%) Query: 16 GVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNF 75 VV + +L++ I+ E ++ Q ++GY GI +D Sbjct: 14 QVVYRHAALHRANRRHALEQAAKKSARKIIEDAEHYAHTQYEQARFEGYRDGIRLFVDTL 73 Query: 76 IPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPS-ISGPVNLHIP 134 I + L + + + K E+ + E ++ + V ++IP Sbjct: 74 IDETTHLSQTYALQLEQERGAIAHNVRKLFEDSKTATVLIEEYLDTQDADSDRSVTVYIP 133 Query: 135 TSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSP-QEFVDNCEQYLINN 193 K L ++ + + + +T + FV QF F+P D C + + + Sbjct: 134 KWCKLSQLSID-HLQTNNGRRITLTTSPNDHFVISNGQFSVSFAPFSASTDICSRAVQRH 192 Query: 194 HCFSPDKVNEICEQARHYLVE 214 PD I E Y+ + Sbjct: 193 QQRVPDTTANIVEALLSYVQQ 213 >UniRef50_D2TZU3 Putative uncharacterized protein n=1 Tax=Arsenophonus nasoniae RepID=D2TZU3_9ENTR Length = 224 Score = 76.0 bits (185), Expect = 1e-12, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 66/179 (36%), Gaps = 5/179 (2%) Query: 13 PANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEM 72 + V+I+ K L + + I + + ++E+ F Q GY +GI Sbjct: 4 AIDNVLIRAKFKQSHDYHERLINQAKKQAKKIIIAAQTEKEKIFNQAKIHGYCQGINQTS 63 Query: 73 DNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLH 132 I ++I ++ L++ V ++ + + K+ + ++ Sbjct: 64 TALALFIQAYQDLNHTLYNDIIKQIEQTLIQFLLTEPVIEQLLLNLMAKID-LQQQSKIY 122 Query: 133 IPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLI 191 +P + + + DK N+ +T H + V I FSP++ V + +I Sbjct: 123 LP---EKLATAFSERWSDKFS-NLKLTSHKENGIVLEMGNEILYFSPEKSVTEMMKNII 177 >UniRef50_C2LLW0 Type III secretion system protein (Oxygen-regulated invasion protein) n=2 Tax=Proteus mirabilis RepID=C2LLW0_PROMI Length = 230 Score = 71.0 bits (172), Expect = 3e-11, Method: Composition-based stats. Identities = 33/232 (14%), Positives = 77/232 (33%), Gaps = 12/232 (5%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M + + +L+K GV+I+ L I + K ++ + Sbjct: 1 MLRLLPDNLLKYTCEGVLIRAHYIRQLDNIHKTTLATKQAAKKMLHHFNHKLDKLRNKIA 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 + Y KG+ + + I + + L + + + + ++ +K+ ++ Sbjct: 61 NEAYAKGLQVLLADIIRFSIEYQEKFVQYEFQQREQLVATIGEFLDSPEIQVKLT-QYLM 119 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQ 180 + V L IPT+ + + + N+ + H++K T I F P Sbjct: 120 SSVPLEQKVTLDIPTTLQ------RYFESELDNSNIKLNCHNNKTIAIHTGDQITFFDPA 173 Query: 181 EFVDNCEQYLINNHCFSPDKVNEICEQARHYL---VEKMFETHSLDMNNSVL 229 F+++ + FS + + L + + L +L Sbjct: 174 IFLNDLRTQF--HRPFSETYQPIFEQNIKQLLLNFINTFTPSDDLSSRKPIL 223 >UniRef50_C0AR80 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AR80_9ENTR Length = 171 Score = 57.6 bits (137), Expect = 3e-07, Method: Composition-based stats. Identities = 23/127 (18%), Positives = 49/127 (38%), Gaps = 7/127 (5%) Query: 63 GYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKL 122 GY++G+ + + + L S + L I + + + + D+ I++ + + Sbjct: 5 GYSEGLKALLGDILTLFSQYQTHLFNYEIKQRELITRTVAQYFQSPDMQIELTKQLIAAS 64 Query: 123 PSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEF 182 P + L+IP K +E D+ N+ + HD+ I F P F Sbjct: 65 P-FDHKLTLNIP---KTLQPYLEETLSDQ---NIELIPHDNTTISVSAGSQILFFDPPLF 117 Query: 183 VDNCEQY 189 + + + Sbjct: 118 MQDLKSQ 124 >UniRef50_C4UMF0 Putative uncharacterized protein n=1 Tax=Yersinia ruckeri ATCC 29473 RepID=C4UMF0_YERRU Length = 180 Score = 56.4 bits (134), Expect = 8e-07, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 49/147 (33%), Gaps = 6/147 (4%) Query: 46 QSIEEKREEKFIQGYYDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPS 105 Q E Q GY GI + + I +S + LK L++ Sbjct: 4 QQACNDGEHIKNQSQQQGYEAGIRLFLKDLISSLSHYQHAYHQAVGETEKRLKHKLVEIF 63 Query: 106 EEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKR 165 + ++I + + + +++P+ + +V+S F D + I + Sbjct: 64 SDPRT-VEIVAEHFANMNTHHSALEIYVPSKY---YEKVKSTFKDNE--EIKIEKSHRQH 117 Query: 166 FVFFTDQFIAEFSPQEFVDNCEQYLIN 192 + T I F+P + + + Sbjct: 118 LLLKTGDKIINFNPDSSPNQVLSSIFS 144 >UniRef50_D1U7K9 H+transporting two-sector ATPase E subunit n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U7K9_9DELT Length = 251 Score = 49.5 bits (116), Expect = 9e-05, Method: Composition-based stats. Identities = 14/161 (8%), Positives = 52/161 (32%), Gaps = 18/161 (11%) Query: 28 KEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIPLISLLCSELE 87 + + ++E+ + + E + E ++GY +G+ + + +E E Sbjct: 50 EYLNRVQERAREKAKEIMLFAELEAEALRATARHEGYAEGLAQAQADVEQHTRTISAEAE 109 Query: 88 KKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSLEVESY 147 K + + +I +++ I++ ++ + + + S K + Sbjct: 110 KLFARIGSQGSNIFEARRKDIMDLIRL---------AVEKTLRIEMSQSRKASLEALMRQ 160 Query: 148 FVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQ 188 +++ + + + F+ ++ Sbjct: 161 ALERIESRHQLV-------IRCAGEDAEGLD--AFLKTIQE 192 >UniRef50_B6QWW9 Type III secretion apparatus protein, HrpE/YscL family n=1 Tax=Pseudovibrio sp. JE062 RepID=B6QWW9_9RHOB Length = 215 Score = 48.3 bits (113), Expect = 2e-04, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 50/136 (36%), Gaps = 4/136 (2%) Query: 18 VIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIP 77 +IK ++ + +Q EE+ QGY DG K + + I Sbjct: 20 IIKEAEFASVETADQIIATAQRRAEEVLQDATRVYEEQKKQGYEDGVRKAQSEACERIIA 79 Query: 78 LISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKI---FESWVTKLPSISGPVNLHIP 134 +LL + L + +++ ++L + +E D ++ S + L + V LH+ Sbjct: 80 EQALLDTRLREVEHDVVTVTITLLRRLLDEFDDAERVRLMARSMLKSLRAEKR-VRLHVA 138 Query: 135 TSFKDKSLEVESYFVD 150 K+ + D Sbjct: 139 PEMYAKATAMTRAITD 154 >UniRef50_A8RWD9 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A8RWD9_9CLOT Length = 268 Score = 47.2 bits (110), Expect = 4e-04, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 69/196 (35%), Gaps = 24/196 (12%) Query: 18 VIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIP 77 +++ + + E+ + + + + Q +GY +G+ +++ Sbjct: 69 ILEHARKQAEQSAARILEEAYAQRDNIVNTARGEAGRIHDQAKTEGYNQGLNQALEDISQ 128 Query: 78 LISLLC----------SELEKKRINMINDLKSILLKPS-------EEVDVFIKIFESWVT 120 ++ + E +++ + L ++ + +E ++ + + ++ Sbjct: 129 DMAGIQAAVDRLGQGLEEFKRQMNERVAGLAFMMAEKILRKKVEYDEAELADMVAGAVLS 188 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFV---DKSIWNVHITF-HDDKRFV-FFTDQFIA 175 + + + +HIP +E DK + I FV T++ I Sbjct: 189 ERDKEN--ITVHIPEDAVGLVEALEKRLEPLRDKMGGVLRIKTESRPPGFVQVETEEGIV 246 Query: 176 EFSPQEFVDNCEQYLI 191 + S +DN ++ L+ Sbjct: 247 DASLDVQLDNLKKQLM 262 >UniRef50_C7E4S1 Putative uncharacterized protein psa2 n=1 Tax=Pantoea stewartii subsp. stewartii DC283 RepID=C7E4S1_ERWST Length = 230 Score = 47.2 bits (110), Expect = 4e-04, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 61/206 (29%), Gaps = 19/206 (9%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 M +L + N VV+K + + E+ L + E + + ++ Sbjct: 1 MLNITTDTLPEDIDNQVVVKSGERTRHQRMTVALERTLSRCSEI--HAEYELQTIRLRES 58 Query: 61 YD--GYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESW 118 + GY G D + LI + + + L + + +I Sbjct: 59 CEKQGYEAGFRLFFDQLVALIDDYQRLQRLHQTRFREQVGNALKSALHDTIIVNRIIHH- 117 Query: 119 VTKLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFS 178 + + + + IP++ K + + + + F DD D F Sbjct: 118 LQEKCGHQKVLRIIIPSAVK---------LPEGADAS-NYQFTDDNHITVQNDMDAIRFP 167 Query: 179 PQEFVDNCEQYLINNHCFSPDKVNEI 204 ++ + +N+ + Sbjct: 168 S----ESLCRQWLNHADEQVAPTEAV 189 >UniRef50_B2VID5 Putative uncharacterized protein n=1 Tax=Erwinia tasmaniensis RepID=B2VID5_ERWT9 Length = 214 Score = 47.2 bits (110), Expect = 4e-04, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 59/177 (33%), Gaps = 11/177 (6%) Query: 1 MRKKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGY 60 MRK E L + ++IK + + + S E+ L+ E +++E + Sbjct: 1 MRKITENELPTDSYSNILIKSSLVSRYQRLSSALERTLIHCNQIHLEYESRKDELQERYQ 60 Query: 61 YDGYTKGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVT 120 +GYT G+ ++ + + + + + + ++ + +I + Sbjct: 61 KEGYTAGLQLIFSQLTMMLDDYEQQHSTRIEKLKSLINDAVRTSFDDPVIVERIIYH-IK 119 Query: 121 KLPSISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEF 177 ++ + +P + + ++ D F D D+ F Sbjct: 120 RICKQQNIRKIIVP---RTVQFKDDADLSD-------YIFTDGSDITLQGDKEAVRF 166 >UniRef50_Q11M64 Putative uncharacterized protein n=1 Tax=Chelativorans sp. BNC1 RepID=Q11M64_MESSB Length = 202 Score = 47.2 bits (110), Expect = 5e-04, Method: Composition-based stats. Identities = 16/135 (11%), Positives = 45/135 (33%), Gaps = 8/135 (5%) Query: 13 PANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEM 72 P +GV+ K + L++ ++ + ++ + + GY + + Sbjct: 10 PPDGVL-KAEDLARLRKTEEIEIAARRKAAQYLRRARQGSGHIRRKAREAGY----AEAL 64 Query: 73 DNFIPLISLLCSELEKKRINMINDLKSILLKP---SEEVDVFIKIFESWVTKLPSISGPV 129 +F I L E + + + + L+ L + + + + + +L V Sbjct: 65 FSFSEAIRRLDEERGRLQADQESRLRHCLQQIVGRMPKEEWLSHVLHEVLGELQGRPEIV 124 Query: 130 NLHIPTSFKDKSLEV 144 + P + + Sbjct: 125 IMVHPDHLDAITTAI 139 >UniRef50_B3RAD0 SctL: non flagellar T3S system conserved protein n=1 Tax=Cupriavidus taiwanensis RepID=B3RAD0_CUPTR Length = 202 Score = 45.2 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 19/147 (12%), Positives = 49/147 (33%), Gaps = 4/147 (2%) Query: 18 VIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIP 77 V+K L + ++ E E + + EE+ GY +G+ + D + Sbjct: 20 VLKEAEYAALLDAAAVIESARSEAVRIRSEAQREAEERRQHGYREGWARAHADHAAAQVS 79 Query: 78 LISLLCSELEKKRINMINDLKSILLKPSE--EVDVFIKIFESWVTKLPSISGPVNLHIPT 135 + +L R M + + + + + D+ +++ V L + + IP Sbjct: 80 TAAEAERQLGAMRRTMAEIVMKAVQQIAGELDPDIILEVALQRVEALVRAEPFITIQIPL 139 Query: 136 SFKDKSLEVESYFVD--KSIWNVHITF 160 + + + + ++ Sbjct: 140 GREQAVRSLLDRLAETQEWPRRANVVV 166 >UniRef50_C0QV74 V-type ATP synthase subunit E n=2 Tax=Brachyspira RepID=C0QV74_BRAHW Length = 204 Score = 45.2 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 57/180 (31%), Gaps = 11/180 (6%) Query: 26 GLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIPLISLLCSE 85 K+ + E I+ E K EE + D + + + Sbjct: 22 SNKKADEIISNAKSEADRIIKEAEAKSEEIIKEAERKSEELKKNTITDVRMAGEQSISAL 81 Query: 86 LEKKRINM-----INDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDK 140 ++ + + LK S D+ +++ + W + S V ++ P S K Sbjct: 82 KQRIKDLVTAKVLEEGLKGAFADTSFLKDLILEVVKKW--DITSSDADVTVYFPESKKAD 139 Query: 141 SLEVESYFVDKSIWNVHITF----HDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCF 196 + +I N I F + + V + +F+ ++FV+ Y+ Sbjct: 140 IDSSFEKSIKSAIKNATINFDKKLSNGFKIVPEGGNYQLQFTDEDFVEFFSDYIKAKTEE 199 >UniRef50_A4XIX7 H+-transporting two-sector ATPase, E subunit n=2 Tax=Clostridia RepID=A4XIX7_CALS8 Length = 251 Score = 44.8 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 45/134 (33%), Gaps = 15/134 (11%) Query: 28 KEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIPLISLLCSELE 87 KE + EK E I+ E + E+ + + G+ G+ ++ +E + Sbjct: 60 KEAEEVLEKAKEEAKRIIKEAETQAEQIKKEAFEKGFNDGLSQ-------GLAAAETEYQ 112 Query: 88 KKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSLEVESY 147 K+ + +L E + + +P I V + + E Sbjct: 113 KRLQEIEMLKMQVLA---ERERILKDAQNELMILVPRIVEKV-----VENEARDKEFLKD 164 Query: 148 FVDKSIWNVHITFH 161 F+ +I + I + Sbjct: 165 FIKNAISQLSIKYG 178 >UniRef50_C4K5M9 Putative uncharacterized protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5M9_HAMD5 Length = 86 Score = 44.5 bits (103), Expect = 0.003, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 32/73 (43%), Gaps = 2/73 (2%) Query: 148 FVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLINNHCFSPDKVNEICEQ 207 +K + I +H RFV IAEF P+EF++ + L+ ++ + Sbjct: 1 MNEKWNHPIKIEYHPASRFVIKYQDQIAEFIPEEFMELAVKRLLLTQDTM-KACRQLSQG 59 Query: 208 ARHYLVEKMFETH 220 + L + F+TH Sbjct: 60 SLEQLFD-YFKTH 71 >UniRef50_Q3BYK1 HrcL protein n=12 Tax=Xanthomonas RepID=Q3BYK1_XANC5 Length = 233 Score = 43.3 bits (100), Expect = 0.007, Method: Composition-based stats. Identities = 16/178 (8%), Positives = 50/178 (28%), Gaps = 12/178 (6%) Query: 21 RKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNF----I 76 + +S +L ++ + A + +K E + GY G+ ++D + + Sbjct: 43 QALSQAQTRAQTLVDEAQQQAEAILHDARQKAE----RSARLGYAAGLRRQLDEWNESGL 98 Query: 77 PLISLLCSELEKKRINMINDLKSILLK--PSEEVDVFIKIFESWVTKLPSISGPVNLHIP 134 + ++ R + + + + + + + + Sbjct: 99 RHAFAAEAAAQRSRERLAEIVARTCEHIILGHDPAALYARAAQALEGALDEAKALRVSVY 158 Query: 135 TSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLIN 192 D + ++ W + + D + E+ F + L + Sbjct: 159 PDAVDAARRAFDAAATEAGWTLQVELCGDAD--LAVGACVCEWDTGVFETDLRDQLSS 214 >UniRef50_Q3ADE2 Flagellar protein n=1 Tax=Carboxydothermus hydrogenoformans Z-2901 RepID=Q3ADE2_CARHZ Length = 238 Score = 42.5 bits (98), Expect = 0.012, Method: Composition-based stats. Identities = 17/106 (16%), Positives = 34/106 (32%), Gaps = 4/106 (3%) Query: 13 PANGVVIKRKISDGLKEIVS-LKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDE 71 P +VI + + E S L + ++ I + + E + GY +G + Sbjct: 25 PIKKIVILPEEVEKNNEESSALLNEAKIKAQEIINAARREAEIIREEAKAKGYQQGYTEG 84 Query: 72 MDNFIPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFES 117 E + + +L+ + E + IK E Sbjct: 85 Q---AKARQEFEKLQETLKEEYEKKIAEKVLEINREREKIIKGVEQ 127 >UniRef50_Q1D9L2 FliH family protein n=2 Tax=Cystobacterineae RepID=Q1D9L2_MYXXD Length = 223 Score = 41.8 bits (96), Expect = 0.018, Method: Composition-based stats. Identities = 17/160 (10%), Positives = 54/160 (33%), Gaps = 9/160 (5%) Query: 6 EMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYT 65 E +++ P GV+ ++ + + ++ E+ E + + +RE+ + G Sbjct: 20 ERPVLRPPRAGVM-NAEVFEARQGASAILEEAQREKERILAEAQREREDLLAKTREQGRQ 78 Query: 66 KGIIDEMDNFIPLISLLCSELEKKRINMINDLKSILLK-----PSEEVDVFIKIFESWVT 120 +G+ + + L ++I + K ++ ++ +++ + + Sbjct: 79 EGLAQATEIILRAKMQAGEVLTGHEQDVIALSLKVAEKIIGRSLEKDPELMVELCAAAIE 138 Query: 121 KLPSISGPVNLHIPTSF---KDKSLEVESYFVDKSIWNVH 157 L S + P + + K + + Sbjct: 139 NLRSARSMILRVHPKTAAVLRAKKPVLMELIGRAVDLAIK 178 >UniRef50_Q2SH65 Flagellar biosynthesis/type III secretory pathway protein n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SH65_HAHCH Length = 209 Score = 41.4 bits (95), Expect = 0.025, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 50/167 (29%), Gaps = 10/167 (5%) Query: 5 IEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGY 64 I K +IK + L + E A + + + + GY Sbjct: 8 IRNIQEKVKVRNSLIKASDYQSWYDADRLLDAARAEADAILADAKRQAQA----ALESGY 63 Query: 65 TKGIIDEMDNFIPLI---SLLCSELEKKRI-NMINDLKSILLKPSEEVDVFIKIFESWVT 120 G+ + + S+ ++ + ++I +K + K ++ +KI T Sbjct: 64 QDGLAKAKEESADRLITASMRAHQMLQMAESDLIELVKLSVEKLFGDIGDAVKIASLIQT 123 Query: 121 KLPS--ISGPVNLHIPTSFKDKSLEVESYFVDKSIWNVHITFHDDKR 165 L S V +H+ + + + + D R Sbjct: 124 GLASLREDYRVTVHVAPEMEAQVKAALPDVLSHFPGIEYFDITADPR 170 >UniRef50_Q2B7F5 Flagellar assembly protein H n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B7F5_9BACI Length = 256 Score = 41.0 bits (94), Expect = 0.034, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 67/204 (32%), Gaps = 20/204 (9%) Query: 33 LKEKILLETTAKIQSIEEKR------------------EEKFIQGYYDGYTKGIIDEMDN 74 L ++ E + IQ E+ EE +DG+ +G+ + + Sbjct: 48 LIDEARREAESIIQQANEEARQLLEHIEQERVRFEQEKEEARESARHDGFQEGLAEGRQS 107 Query: 75 FIPLISLLCSELEKKRINMINDLKSILLKPSEE-VDVFIKIFESWVTKLPSISGPVNLHI 133 + S ++ + D ++ + +D+ IK+ E ++ + L + Sbjct: 108 GLMEFSETIGFAKEIVESSKADYQNTIESSEGTILDIGIKVAERILSSVLEEDSSRFLPV 167 Query: 134 PTSFKDKSLEVESYFVDKSIWNVH-ITFHDDKRFVFFTDQFIAEFSPQEFVDNCEQYLIN 192 +S E + + H D+ + + P E ++ C + + Sbjct: 168 VKRALKESREYREVQLHVHPGRYSFLLEHKDELLAIYPRETNLYIYPDEELEECSCVIES 227 Query: 193 NHCFSPDKVNEICEQARHYLVEKM 216 V+ E+ + L+E + Sbjct: 228 PGGRINAGVDSQLEEMKRKLIELL 251 >UniRef50_C2EDB0 Flagellar biosynthesis/type III secretory pathway protein n=1 Tax=Lactobacillus ruminis ATCC 25644 RepID=C2EDB0_9LACO Length = 252 Score = 40.2 bits (92), Expect = 0.064, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 45/160 (28%), Gaps = 24/160 (15%) Query: 3 KKIEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQG--- 59 +K E ++I R KE ++K+ I + E+ Sbjct: 30 EKKEPEDNLPNEEQLLISRTERKLAKEREEHRKKLEQMEQEIIAKANAEAEQIREDAQKK 89 Query: 60 ---------YYDGYTKGI---IDEMDNFIPLISLLCSELEKKRINMINDLKSILLKPSE- 106 Y DGY G+ ++ ++ + +K DL+ +K +E Sbjct: 90 GYDDGQQSGYQDGYQAGLKIAQKVIEQEKGNLNQANEDCQKFAKEKEEDLRKFAIKLAEI 149 Query: 107 --------EVDVFIKIFESWVTKLPSISGPVNLHIPTSFK 138 + I +L + + +K Sbjct: 150 LIKKQLDVDTSTITSILAPVFMELEKPDEMIIVRANACYK 189 >UniRef50_Q1MQ18 Flagellar biosynthesis/type III secretory pathway protein n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Q1MQ18_LAWIP Length = 481 Score = 39.8 bits (91), Expect = 0.066, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 64/191 (33%), Gaps = 27/191 (14%) Query: 5 IEMSLIKSPANGVVIKRKISDGLKEIVSLKEKILLETTAKIQSIE---EKREEKFIQGYY 61 ++ L K+ AN I++ ++ +K+ + ++ E E+ + + Sbjct: 67 VQAMLDKARANAETIRKTAKQWAEKTKKESDKLHKQAQQALKEAERIREEADHIRELAHE 126 Query: 62 DGYTKGIIDEMDNF-----------IPLISLLCSELEKKRINMINDLKSILLKPSE-EVD 109 +GY GI + ++ + + N +DL +L +E D Sbjct: 127 EGYRLGIEQAQEEIKEQIKSIHMTAASILKTIERQYTVIFDNWRSDLVKLLHTATEVATD 186 Query: 110 VFIK-----IFESWVT---KLPSISGPVNLHIPTSFK----DKSLEVESYFVDKSIWNVH 157 + I +S + + V + + + K D +++ F + W + Sbjct: 187 WILSKEHTAILDSVLNKAVQQLEERQRVTIRVNPNNKNTIIDLITNIKNQFPELKNWEIK 246 Query: 158 ITFHDDKRFVF 168 I + V Sbjct: 247 IDVTMGENDVI 257 >UniRef50_B7ATQ9 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7ATQ9_9BACE Length = 285 Score = 39.8 bits (91), Expect = 0.075, Method: Composition-based stats. Identities = 12/101 (11%), Positives = 34/101 (33%), Gaps = 6/101 (5%) Query: 28 KEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNFIPLISLLCSELE 87 + ++ E + ++ + E+ + DG+ G + + + L+ Sbjct: 91 AQADAILEDARNQADRILEDARSQAEQIRQDAHDDGFDAGTAEAQNKY----EQDKKLLQ 146 Query: 88 KKRINMINDLKSILLKPSE--EVDVFIKIFESWVTKLPSIS 126 + L+ + E ++ I E + + +IS Sbjct: 147 SDYDSRKAALEKEYNELKAKMEPELVNVILEVFKNAIYAIS 187 >UniRef50_A6CES2 Flagellar assembly protein fliH, putative n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CES2_9PLAN Length = 239 Score = 39.8 bits (91), Expect = 0.076, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 49/170 (28%), Gaps = 10/170 (5%) Query: 16 GVVIKRKISDGLKEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKGIIDEMDNF 75 G I +D K + +T I + + E+ + Y G +G+ + + Sbjct: 19 GSKIAYNFNDIEKRCEDYISNVRNQTRQMIIDAQAEAEQIKQEAYQAGKQRGLEEALREV 78 Query: 76 IPLISLLCSELEKKRINMINDLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIP- 134 + + + L ++ V + +W + + + L I Sbjct: 79 EAQVQARSEAQASQM--VEEKLGTVFPAMQAAVAGLQQEQVNWRQTWDAAAVKMCLVIAE 136 Query: 135 -------TSFKDKSLEVESYFVDKSIWNVHITFHDDKRFVFFTDQFIAEF 177 + D + S + + HI F + V + F Sbjct: 137 KMVRHEIKTRPDTVKPMMSEALKLASGTQHIRFQMNPTDVVHLGKNAQSF 186 >UniRef50_D1RAU8 V-type ATP synthase subunit E n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1RAU8_9CHLA Length = 214 Score = 39.5 bits (90), Expect = 0.085, Method: Composition-based stats. Identities = 15/127 (11%), Positives = 41/127 (32%), Gaps = 5/127 (3%) Query: 28 KEIVSLKEKILLETTAKIQSIEEKREEKFIQGYYDGYTKG--IIDEMDNFIPLISLLCSE 85 +E + + + I+ E++ Q + ++ L S Sbjct: 31 QEAQEIIAEAHAKAAKIIKEAEQQAVTLHAQARKSIEQERNVFQSSLEQAAR--QGLESL 88 Query: 86 LEKKRINMIN-DLKSILLKPSEEVDVFIKIFESWVTKLPSISGPVNLHIPTSFKDKSLEV 144 + + N +L+ +L K + + + + + V + NL S + +V Sbjct: 89 RQSIEHKLFNEELEGLLEKQTADPKLIANVVNAIVEAVQKEGISSNLSAVISKRVSPEQV 148 Query: 145 ESYFVDK 151 + ++ Sbjct: 149 NALLLEN 155 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.301 0.119 0.268 Lambda K H 0.267 0.0373 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,105,088,571 Number of Sequences: 3077464 Number of extensions: 38049045 Number of successful extensions: 138509 Number of sequences better than 1.0e-01: 67 Number of HSP's better than 0.1 without gapping: 48 Number of HSP's successfully gapped in prelim test: 45 Number of HSP's that attempted gapping in prelim test: 138388 Number of HSP's gapped (non-prelim): 111 length of query: 235 length of database: 1,040,396,356 effective HSP length: 125 effective length of query: 110 effective length of database: 655,713,356 effective search space: 72128469160 effective search space used: 72128469160 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (20.9 bits) S2: 90 (39.4 bits)