BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (132 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                 Score     E
Sequences producing significant alignments:                      (bits)  Value

UniRef50_Q46795 Putative uncharacterized protein ygeO (Fragment)...   275   4e-73
UniRef50_Q7NVC4 Probable oxygen-regulated invasion protein; cell...    61   1e-08
UniRef50_Q93RD2 EorA protein n=20 Tax=Escherichia coli RepID=Q93...    59   5e-08
UniRef50_Q6R8A2 OrgA n=2 Tax=Sodalis glossinidius RepID=Q6R8A2_S...    50   3e-05
UniRef50_P58653 Oxygen-regulated invasion protein orgA n=34 Tax=...    48   1e-04
UniRef50_C4I6M4 Type III secretion apparatus protein OrgA/MxiK n...    43   0.003
UniRef50_C0AR78 Putative uncharacterized protein n=1 Tax=Proteus...    39   0.046

>UniRef50_Q46795 Putative uncharacterized protein ygeO (Fragment)
           n=8 Tax=Escherichia RepID=YGEO_ECOLI
          Length = 132

 Score = 275 bits (702), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 132/132 (100%), Positives = 132/132 (100%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY Sbjct: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD 120 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD Sbjct: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD 120 Query: 121 ELDKISCKWCCD 132 ELDKISCKWCCD Sbjct: 121 ELDKISCKWCCD 132 >UniRef50_Q7NVC4 Probable oxygen-regulated invasion protein; cell invasion protein n=1 Tax=Chromobacterium violaceum RepID=Q7NVC4_CHRVO Length = 192 Score = 61.2 bits (147), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 37/119 (31%), Positives = 56/119 (47%), Gaps = 2/119 (1%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W+ LP L GC R R A RG ++P R + +A+ + A + Sbjct: 66 LAEWHRLPQAAYLMGCQLLRARLAARGGLLRLPGWARGF-AAMQIGAEWPAAPLDEAPGH 124 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITF-YAKKN 118 I+ GF LL + + P A+ QR LLFP +VD +P +TLL + YAK++ Sbjct: 125 DAILLAGFGQLLSWRERMPEALAQRLPLLFPAWVDEASPCIPAQNTLLLTLALQYAKRH 183 >UniRef50_Q93RD2 EorA protein n=20 Tax=Escherichia coli RepID=Q93RD2_ECOLX Length = 96 Score = 58.9 bits (141), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 25/31 (80%), Positives = 26/31 (83%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYK 31 INNWNL PL CL SG HFYRERFAERGFF + Sbjct: 62 INNWNLFPLFCLFSGYHFYRERFAERGFFIR 92 >UniRef50_Q6R8A2 OrgA n=2 Tax=Sodalis glossinidius RepID=Q6R8A2_SODGL Length = 199 Score = 49.7 bits (117), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 31/93 (33%), Positives = 47/93 (50%), Gaps = 6/93 (6%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPG--IA 58 + W+LLP I LL C +R A+ G+ +PD LR + +A+ L +R PG I Sbjct: 64 VEAWHLLPQIALLMACQRHRASLAKGGYGASLPDWLRQF-AALSL---VSSRSSPGETIP 119 Query: 59 NYHNIITCGFSTLLPYIRQQPLAMQQRFNLLFP 91 ++ G LL + P ++QR +LLFP Sbjct: 120 PPAQLLAWGKYELLAFAGSLPRGLRQRLDLLFP 152 >UniRef50_P58653 Oxygen-regulated invasion protein orgA n=34 Tax=Salmonella enterica RepID=ORGA_SALTY Length = 199 Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 33/130 (25%), Positives = 58/130 (44%), Gaps = 9/130 (6%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLS---AIPLEINEKARYKPGI 57 + W LP + L GCH R A +G +PD + +L+ L + KA Sbjct: 76 LRQWRRLPQVAYLLGCHKLRADLARQGALLGLPDWAQAFLAMHQGTSLSVCNKA------ 129 Query: 58 ANYHNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKK 117 N+ +++ G++ L P ++ QRF LLFP F++ + ++L YA+K Sbjct: 130 PNHRFLLSVGYAQLNALNEFLPESLAQRFPLLFPPFIEEALKQDAVEMSILLLALQYAQK 189 Query: 118 NRDELDKISC 127 + + +C Sbjct: 190 YPNTVPAFAC 199 >UniRef50_C4I6M4 Type III secretion apparatus protein OrgA/MxiK n=29 Tax=pseudomallei group RepID=C4I6M4_BURPS Length = 200 Score = 43.1 bits (100), Expect = 0.003, Method: Compositional matrix adjust. Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 2/91 (2%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 +++W LP I L G R ERG + ++ + +L +PL KA G+ + Sbjct: 70 VDHWARLPRIAYLIGVQRLRAALVERGQYVRLDASSQRFLC-MPLAAVPKAACA-GMPDD 127 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFP 91 I+ G + L + P A++QR LLFP Sbjct: 128 DAIVAAGTACLTAALHDAPRALRQRLPLLFP 158 >UniRef50_C0AR78 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AR78_9ENTR Length = 188 Score = 38.9 bits (89), Expect = 0.046, Method: Compositional matrix adjust. 
Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 6/92 (6%) Query: 2 NNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIP--LEINEKARYKPGIAN 59 W+LLP L G + + G +Y++P L+ +LS P L E IA Sbjct: 71 TQWSLLPQCALFLGYFYSPQYILHSGDYYQLPSSLQAFLSLRPVILINEENKNDNNEIAP 130 Query: 60 YHNIITCGFSTLLPYIRQQPLAMQQRFNLLFP 91 IT G+ L +I +A+ QRF L FP Sbjct: 131 ----ITIGYQLLFTFIHSISVALAQRFKLCFP 158 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46795 Putative uncharacterized protein ygeO (Fragment)... 201 4e-51 UniRef50_P58653 Oxygen-regulated invasion protein orgA n=34 Tax=... 148 5e-35 UniRef50_Q7NVC4 Probable oxygen-regulated invasion protein; cell... 144 6e-34 UniRef50_Q6R8A2 OrgA n=2 Tax=Sodalis glossinidius RepID=Q6R8A2_S... 107 1e-22 UniRef50_Q93RD2 EorA protein n=20 Tax=Escherichia coli RepID=Q93... 55 9e-07 Sequences not found previously or not previously below threshold: UniRef50_C4K5N1 Oxygen-regulated invasion protein n=1 Tax=Candid... 67 2e-10 UniRef50_C4I6M4 Type III secretion apparatus protein OrgA/MxiK n... 63 2e-09 UniRef50_C7E4S2 Oxygen-regulated invasion protein n=1 Tax=Pantoe... 48 8e-05 UniRef50_C0AR78 Putative uncharacterized protein n=1 Tax=Proteus... 47 2e-04 UniRef50_Q6R8E4 Orf5 n=2 Tax=Sodalis glossinidius RepID=Q6R8E4_S... 45 0.001 UniRef50_C2LLV9 Type III secretion system protein (Oxygen-regula... 44 0.002 UniRef50_D2TZ25 Type III secretion apparatus protein OrgA,MxiK n... 43 0.003 UniRef50_B2VID4 Oxygen-regulated invasion protein n=1 Tax=Erwini... 41 0.012 UniRef50_D0FT12 Oxygen-regulated invasion protein n=2 Tax=Erwini... 39 0.042 UniRef50_D2TZU4 Type III secretion system protein (Oxygen-regula... 38 0.080 >UniRef50_Q46795 Putative uncharacterized protein ygeO (Fragment) n=8 Tax=Escherichia RepID=YGEO_ECOLI Length = 132 Score = 201 bits (512), Expect = 4e-51, Method: Composition-based stats. 
Identities = 132/132 (100%), Positives = 132/132 (100%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY Sbjct: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD 120 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD Sbjct: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD 120 Query: 121 ELDKISCKWCCD 132 ELDKISCKWCCD Sbjct: 121 ELDKISCKWCCD 132 >UniRef50_P58653 Oxygen-regulated invasion protein orgA n=34 Tax=Salmonella enterica RepID=ORGA_SALTY Length = 199 Score = 148 bits (373), Expect = 5e-35, Method: Composition-based stats. Identities = 33/130 (25%), Positives = 58/130 (44%), Gaps = 9/130 (6%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLS---AIPLEINEKARYKPGI 57 + W LP + L GCH R A +G +PD + +L+ L + KA Sbjct: 76 LRQWRRLPQVAYLLGCHKLRADLARQGALLGLPDWAQAFLAMHQGTSLSVCNKA------ 129 Query: 58 ANYHNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKK 117 N+ +++ G++ L P ++ QRF LLFP F++ + ++L YA+K Sbjct: 130 PNHRFLLSVGYAQLNALNEFLPESLAQRFPLLFPPFIEEALKQDAVEMSILLLALQYAQK 189 Query: 118 NRDELDKISC 127 + + +C Sbjct: 190 YPNTVPAFAC 199 >UniRef50_Q7NVC4 Probable oxygen-regulated invasion protein; cell invasion protein n=1 Tax=Chromobacterium violaceum RepID=Q7NVC4_CHRVO Length = 192 Score = 144 bits (364), Expect = 6e-34, Method: Composition-based stats. 
Identities = 37/125 (29%), Positives = 56/125 (44%), Gaps = 2/125 (1%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W+ LP L GC R R A RG ++P R + +A+ + A + Sbjct: 66 LAEWHRLPQAAYLMGCQLLRARLAARGGLLRLPGWARGF-AAMQIGAEWPAAPLDEAPGH 124 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERIT-FYAKKNR 119 I+ GF LL + + P A+ QR LLFP +VD +P +TLL + YAK++ Sbjct: 125 DAILLAGFGQLLSWRERMPEALAQRLPLLFPAWVDEASPCIPAQNTLLLTLALQYAKRHP 184 Query: 120 DELDK 124 Sbjct: 185 HFPPS 189 >UniRef50_Q6R8A2 OrgA n=2 Tax=Sodalis glossinidius RepID=Q6R8A2_SODGL Length = 199 Score = 107 bits (267), Expect = 1e-22, Method: Composition-based stats. Identities = 37/127 (29%), Positives = 62/127 (48%), Gaps = 6/127 (4%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W+LLP I LL C +R A+ G+ +PD LR + +A+ L ++ ++ I Sbjct: 64 VEAWHLLPQIALLMACQRHRASLAKGGYGASLPDWLRQF-AALSL-VSSRSSPGETIPPP 121 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFV---DHIQSPLPLASTLLERITF-YAK 116 ++ G LL + P ++QR +LLFP D + LP LL ++ F +AK Sbjct: 122 AQLLAWGKYELLAFAGSLPRGLRQRLDLLFPPEEGRDDLSRLSLPPPCRLLLKLAFQHAK 181 Query: 117 KNRDELD 123 ++ D Sbjct: 182 RHPATPD 188 >UniRef50_C4K5N1 Oxygen-regulated invasion protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N1_HAMD5 Length = 192 Score = 66.6 bits (161), Expect = 2e-10, Method: Composition-based stats. 
Identities = 24/125 (19%), Positives = 52/125 (41%), Gaps = 5/125 (4%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + +W LP I + C +R ++G ++P ++ + + + + ++ + N+ Sbjct: 65 LTHWFYLPQIAFIMACQRHRTILLKKGMLLRLPLWVQQF-AKLEMVEALPYQFNQKM-NF 122 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLE---RITFYAKK 117 ++ G L + ++ P A+ QR LFP ++ + S L + KK Sbjct: 123 LQLLAYGAKELSLWEKELPGAISQRMPFLFPFSMNAVLDFDRKISPNLLFINLAIQHVKK 182 Query: 118 NRDEL 122 N L Sbjct: 183 NPQSL 187 >UniRef50_C4I6M4 Type III secretion apparatus protein OrgA/MxiK n=29 Tax=pseudomallei group RepID=C4I6M4_BURPS Length = 200 Score = 63.2 bits (152), Expect = 2e-09, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 2/91 (2%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 +++W LP I L G R ERG + ++ + +L +PL KA G+ + Sbjct: 70 VDHWARLPRIAYLIGVQRLRAALVERGQYVRLDASSQRFLC-MPLAAVPKAACA-GMPDD 127 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFP 91 I+ G + L + P A++QR LLFP Sbjct: 128 DAIVAAGTACLTAALHDAPRALRQRLPLLFP 158 >UniRef50_Q93RD2 EorA protein n=20 Tax=Escherichia coli RepID=Q93RD2_ECOLX Length = 96 Score = 54.7 bits (130), Expect = 9e-07, Method: Composition-based stats. Identities = 25/32 (78%), Positives = 26/32 (81%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKV 32 INNWNL PL CL SG HFYRERFAERGFF + Sbjct: 62 INNWNLFPLFCLFSGYHFYRERFAERGFFIRF 93 >UniRef50_C7E4S2 Oxygen-regulated invasion protein n=1 Tax=Pantoea stewartii subsp. stewartii DC283 RepID=C7E4S2_ERWST Length = 185 Score = 48.2 bits (113), Expect = 8e-05, Method: Composition-based stats. 
Identities = 32/123 (26%), Positives = 47/123 (38%), Gaps = 10/123 (8%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 ++ W+L+P L G + R +R L + I L + + PG N Sbjct: 67 LSCWHLIPETAHLIGGYLMRSHLLKRAALLMSDPRLLAF---ISLPMPHRIALDPGDQNS 123 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLAST-----LLERITFYA 115 +CG + +L P A++QR L FP + PLP T LL YA Sbjct: 124 MATTSCGLAFILSQFPGLPQALRQRLLLSFPAGM--SIEPLPAGKTLNHINLLRMALTYA 181 Query: 116 KKN 118 K Sbjct: 182 KHY 184 >UniRef50_C0AR78 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AR78_9ENTR Length = 188 Score = 46.6 bits (109), Expect = 2e-04, Method: Composition-based stats. Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 6/91 (6%) Query: 3 NWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIP--LEINEKARYKPGIANY 60 W+LLP L G + + G +Y++P L+ +LS P L E IA Sbjct: 72 QWSLLPQCALFLGYFYSPQYILHSGDYYQLPSSLQAFLSLRPVILINEENKNDNNEIAP- 130 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFP 91 IT G+ L +I +A+ QRF L FP Sbjct: 131 ---ITIGYQLLFTFIHSISVALAQRFKLCFP 158 >UniRef50_Q6R8E4 Orf5 n=2 Tax=Sodalis glossinidius RepID=Q6R8E4_SODGL Length = 195 Score = 44.7 bits (104), Expect = 0.001, Method: Composition-based stats. Identities = 28/128 (21%), Positives = 46/128 (35%), Gaps = 15/128 (11%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPL---------EINEKA 51 ++NW L LL G H R R G F + ++ +L+ E E A Sbjct: 67 LDNWQRLGDAALLIGAHLLRNRLLSEGRFILLSAPVQSFLALPLPPLTPGALPQEQEEPA 126 Query: 52 RYKPGIANYHNIITCGFSTLLPYIRQQPLAMQQRFNLLFP---DFVDHIQSPLPLASTLL 108 + G LL +++ A+ +R LLFP S P + L+ Sbjct: 127 TTLLQDPDPA---AWGAYCLLSMLKRLSPAVYRRACLLFPVEQPLPQQATSLSPSSINLI 183 Query: 109 ERITFYAK 116 + +A+ Sbjct: 184 KMALHHAQ 191 >UniRef50_C2LLV9 Type III secretion system protein (Oxygen-regulated invasion protein) n=2 Tax=Proteus mirabilis RepID=C2LLV9_PROMI Length = 190 Score = 43.5 bits (101), Expect = 0.002, Method: Composition-based stats. 
Identities = 27/116 (23%), Positives = 43/116 (37%), Gaps = 5/116 (4%) Query: 4 WNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANYHNI 63 W+ LP L G + R + +++ P L+ +L+ P+ Sbjct: 72 WHTLPRCALFLGYFYSRHTLLAQNYYHLEPA-LKAFLALYPMLNRTVDSKILLSLKDIAP 130 Query: 64 ITCGFSTLLPYIRQQPLAMQQRFNLLFPD----FVDHIQSPLPLASTLLERITFYA 115 I G+ L +I+ A+ QRF LLF L L+ TL + YA Sbjct: 131 IDIGYQLLFNFIKLISNALAQRFKLLFSPKSLSINIEFPKSLVLSPTLFLLVLNYA 186 >UniRef50_D2TZ25 Type III secretion apparatus protein OrgA,MxiK n=1 Tax=Arsenophonus nasoniae RepID=D2TZ25_9ENTR Length = 182 Score = 43.1 bits (100), Expect = 0.003, Method: Composition-based stats. Identities = 24/123 (19%), Positives = 47/123 (38%), Gaps = 8/123 (6%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 I NWN++ + + C YR G K+ + +R++ ++ N+ Sbjct: 64 IKNWNVILEVSFMLACLRYRSLLFRSGKIVKLDESIRNFCELNIIDDFVFVAKN----NF 119 Query: 61 HNIITC--GFSTLLPYIRQQPLAMQQRFNLLFP--DFVDHIQSPLPLASTLLERITFYAK 116 +I + Y + +R +LFP D D+ + + LL+ YAK Sbjct: 120 SDIDLWLLARKEIYIYQDYLSDTVMKRLAILFPRIDNDDYNFAKINPQFNLLKLAVQYAK 179 Query: 117 KNR 119 + + Sbjct: 180 RYQ 182 >UniRef50_B2VID4 Oxygen-regulated invasion protein n=1 Tax=Erwinia tasmaniensis RepID=B2VID4_ERWT9 Length = 178 Score = 40.8 bits (94), Expect = 0.012, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 50/123 (40%), Gaps = 7/123 (5%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 ++ W L+ L G + R R ++ + L ++S +P+ + + Sbjct: 60 LDKWELINDAACLIGGYLLRSRILKQCTEIILNPRLSSFIS-LPIPHHVSLTTTGE---H 115 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFV---DHIQSPLPLASTLLERITFYAKK 117 N ++ G + +L I PLA+++R L P + D + P LL YAK Sbjct: 116 SNTLSLGLAFILSQIPHFPLALKERVLLFLPAEIDLPDTFIARNPNHLNLLTMALNYAKN 175 Query: 118 NRD 120 R+ Sbjct: 176 YRE 178 >UniRef50_D0FT12 Oxygen-regulated invasion protein n=2 Tax=Erwinia pyrifoliae RepID=D0FT12_ERWPY Length = 184 Score = 39.3 bits (90), Expect = 0.042, Method: Composition-based stats. 
Identities = 25/107 (23%), Positives = 41/107 (38%), Gaps = 6/107 (5%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W +P+ L G + R R + L ++S +PL + Sbjct: 67 LTQWEHMPVTAHLVGGYLLRARLLSQCAVLMSDSRLLAFIS-LPL---IPHISSLTLPRS 122 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTL 107 + I G + +L IR PLA+ +R + FP D L + TL Sbjct: 123 VDTIALGVAFILSQIRPLPLALMRRLLMSFP--TDIKLPQLHIERTL 167 >UniRef50_D2TZU4 Type III secretion system protein (Oxygen-regulated invasion protein) n=1 Tax=Arsenophonus nasoniae RepID=D2TZU4_9ENTR Length = 185 Score = 38.1 bits (87), Expect = 0.080, Method: Composition-based stats. Identities = 32/120 (26%), Positives = 53/120 (44%), Gaps = 6/120 (5%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLS-AIPLEINEKARYKPGIA- 58 IN W+ LP+ L G + + ++P+ +LS A L +K + P IA Sbjct: 61 INYWSRLPIAALYLGGLYQSPQLMIANQLTQLPEQFSQFLSWARLLLPPQKKQIVPLIAT 120 Query: 59 --NYHNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVD--HIQSPLPLASTLLERITFY 114 + + CG ++P I + LA+ RF+LLF + LP+ +LL+ Y Sbjct: 121 SYSQDELTKCGMWAIIPLIEKCSLALTARFSLLFNKNYPQCQANTVLPVNYSLLKATLNY 180 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q46795 Putative uncharacterized protein ygeO (Fragment)... 168 7e-41 UniRef50_Q7NVC4 Probable oxygen-regulated invasion protein; cell... 128 6e-29 UniRef50_P58653 Oxygen-regulated invasion protein orgA n=34 Tax=... 125 4e-28 UniRef50_C4K5N1 Oxygen-regulated invasion protein n=1 Tax=Candid... 114 8e-25 UniRef50_C2LLV9 Type III secretion system protein (Oxygen-regula... 111 1e-23 UniRef50_Q6R8A2 OrgA n=2 Tax=Sodalis glossinidius RepID=Q6R8A2_S... 110 1e-23 UniRef50_C7E4S2 Oxygen-regulated invasion protein n=1 Tax=Pantoe... 99 4e-20 UniRef50_C4I6M4 Type III secretion apparatus protein OrgA/MxiK n... 93 3e-18 UniRef50_Q6R8E4 Orf5 n=2 Tax=Sodalis glossinidius RepID=Q6R8E4_S... 
92 6e-18 UniRef50_C0AR78 Putative uncharacterized protein n=1 Tax=Proteus... 88 7e-17 UniRef50_Q93RD2 EorA protein n=20 Tax=Escherichia coli RepID=Q93... 48 1e-04 Sequences not found previously or not previously below threshold: UniRef50_B2VID4 Oxygen-regulated invasion protein n=1 Tax=Erwini... 64 1e-09 UniRef50_D0FT12 Oxygen-regulated invasion protein n=2 Tax=Erwini... 64 1e-09 UniRef50_D2UDE2 Probable oxygen-regulated invasion protein orga ... 54 1e-06 UniRef50_D2TZ25 Type III secretion apparatus protein OrgA,MxiK n... 53 2e-06 UniRef50_D2TZU4 Type III secretion system protein (Oxygen-regula... 47 2e-04 UniRef50_B1FB91 Type III secretion apparatus protein OrgA/MxiK n... 38 0.094 >UniRef50_Q46795 Putative uncharacterized protein ygeO (Fragment) n=8 Tax=Escherichia RepID=YGEO_ECOLI Length = 132 Score = 168 bits (424), Expect = 7e-41, Method: Composition-based stats. Identities = 132/132 (100%), Positives = 132/132 (100%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY Sbjct: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD 120 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD Sbjct: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD 120 Query: 121 ELDKISCKWCCD 132 ELDKISCKWCCD Sbjct: 121 ELDKISCKWCCD 132 >UniRef50_Q7NVC4 Probable oxygen-regulated invasion protein; cell invasion protein n=1 Tax=Chromobacterium violaceum RepID=Q7NVC4_CHRVO Length = 192 Score = 128 bits (321), Expect = 6e-29, Method: Composition-based stats. 
Identities = 37/125 (29%), Positives = 55/125 (44%), Gaps = 2/125 (1%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W+ LP L GC R R A RG ++P R + +A+ + A + Sbjct: 66 LAEWHRLPQAAYLMGCQLLRARLAARGGLLRLPGWARGF-AAMQIGAEWPAAPLDEAPGH 124 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLE-RITFYAKKNR 119 I+ GF LL + + P A+ QR LLFP +VD +P +TLL YAK++ Sbjct: 125 DAILLAGFGQLLSWRERMPEALAQRLPLLFPAWVDEASPCIPAQNTLLLTLALQYAKRHP 184 Query: 120 DELDK 124 Sbjct: 185 HFPPS 189 >UniRef50_P58653 Oxygen-regulated invasion protein orgA n=34 Tax=Salmonella enterica RepID=ORGA_SALTY Length = 199 Score = 125 bits (314), Expect = 4e-28, Method: Composition-based stats. Identities = 30/127 (23%), Positives = 55/127 (43%), Gaps = 3/127 (2%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W LP + L GCH R A +G +PD + +L+ + N+ Sbjct: 76 LRQWRRLPQVAYLLGCHKLRADLARQGALLGLPDWAQAFLA---MHQGTSLSVCNKAPNH 132 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLERITFYAKKNRD 120 +++ G++ L P ++ QRF LLFP F++ + ++L YA+K + Sbjct: 133 RFLLSVGYAQLNALNEFLPESLAQRFPLLFPPFIEEALKQDAVEMSILLLALQYAQKYPN 192 Query: 121 ELDKISC 127 + +C Sbjct: 193 TVPAFAC 199 >UniRef50_C4K5N1 Oxygen-regulated invasion protein n=1 Tax=Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum) RepID=C4K5N1_HAMD5 Length = 192 Score = 114 bits (285), Expect = 8e-25, Method: Composition-based stats. 
Identities = 24/125 (19%), Positives = 52/125 (41%), Gaps = 5/125 (4%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + +W LP I + C +R ++G ++P ++ + + + + ++ + N+ Sbjct: 65 LTHWFYLPQIAFIMACQRHRTILLKKGMLLRLPLWVQQF-AKLEMVEALPYQFNQKM-NF 122 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLASTLLE---RITFYAKK 117 ++ G L + ++ P A+ QR LFP ++ + S L + KK Sbjct: 123 LQLLAYGAKELSLWEKELPGAISQRMPFLFPFSMNAVLDFDRKISPNLLFINLAIQHVKK 182 Query: 118 NRDEL 122 N L Sbjct: 183 NPQSL 187 >UniRef50_C2LLV9 Type III secretion system protein (Oxygen-regulated invasion protein) n=2 Tax=Proteus mirabilis RepID=C2LLV9_PROMI Length = 190 Score = 111 bits (276), Expect = 1e-23, Method: Composition-based stats. Identities = 27/118 (22%), Positives = 44/118 (37%), Gaps = 5/118 (4%) Query: 2 NNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANYH 61 +W+ LP L G + R + +++ P L+ +L+ P+ Sbjct: 70 THWHTLPRCALFLGYFYSRHTLLAQNYYHLEPA-LKAFLALYPMLNRTVDSKILLSLKDI 128 Query: 62 NIITCGFSTLLPYIRQQPLAMQQRFNLLFPD----FVDHIQSPLPLASTLLERITFYA 115 I G+ L +I+ A+ QRF LLF L L+ TL + YA Sbjct: 129 APIDIGYQLLFNFIKLISNALAQRFKLLFSPKSLSINIEFPKSLVLSPTLFLLVLNYA 186 >UniRef50_Q6R8A2 OrgA n=2 Tax=Sodalis glossinidius RepID=Q6R8A2_SODGL Length = 199 Score = 110 bits (275), Expect = 1e-23, Method: Composition-based stats. Identities = 34/128 (26%), Positives = 56/128 (43%), Gaps = 6/128 (4%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W+LLP I LL C +R A+ G+ +PD LR + + ++ ++ I Sbjct: 64 VEAWHLLPQIALLMACQRHRASLAKGGYGASLPDWLRQFAALS--LVSSRSSPGETIPPP 121 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFV---DHIQSPLPLASTLLE-RITFYAK 116 ++ G LL + P ++QR +LLFP D + LP LL +AK Sbjct: 122 AQLLAWGKYELLAFAGSLPRGLRQRLDLLFPPEEGRDDLSRLSLPPPCRLLLKLAFQHAK 181 Query: 117 KNRDELDK 124 ++ D Sbjct: 182 RHPATPDT 189 >UniRef50_C7E4S2 Oxygen-regulated invasion protein n=1 Tax=Pantoea stewartii subsp. 
stewartii DC283 RepID=C7E4S2_ERWST Length = 185 Score = 99.1 bits (245), Expect = 4e-20, Method: Composition-based stats. Identities = 32/123 (26%), Positives = 47/123 (38%), Gaps = 10/123 (8%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 ++ W+L+P L G + R +R L + I L + + PG N Sbjct: 67 LSCWHLIPETAHLIGGYLMRSHLLKRAALLMSDPRLLAF---ISLPMPHRIALDPGDQNS 123 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPLAST-----LLERITFYA 115 +CG + +L P A++QR L FP + PLP T LL YA Sbjct: 124 MATTSCGLAFILSQFPGLPQALRQRLLLSFPAGM--SIEPLPAGKTLNHINLLRMALTYA 181 Query: 116 KKN 118 K Sbjct: 182 KHY 184 >UniRef50_C4I6M4 Type III secretion apparatus protein OrgA/MxiK n=29 Tax=pseudomallei group RepID=C4I6M4_BURPS Length = 200 Score = 92.9 bits (229), Expect = 3e-18, Method: Composition-based stats. Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 2/91 (2%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 +++W LP I L G R ERG + ++ + +L +PL KA G+ + Sbjct: 70 VDHWARLPRIAYLIGVQRLRAALVERGQYVRLDASSQRFLC-MPLAAVPKAACA-GMPDD 127 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFP 91 I+ G + L + P A++QR LLFP Sbjct: 128 DAIVAAGTACLTAALHDAPRALRQRLPLLFP 158 >UniRef50_Q6R8E4 Orf5 n=2 Tax=Sodalis glossinidius RepID=Q6R8E4_SODGL Length = 195 Score = 91.8 bits (226), Expect = 6e-18, Method: Composition-based stats. 
Identities = 28/128 (21%), Positives = 46/128 (35%), Gaps = 15/128 (11%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPL---------EINEKA 51 ++NW L LL G H R R G F + ++ +L+ E E A Sbjct: 67 LDNWQRLGDAALLIGAHLLRNRLLSEGRFILLSAPVQSFLALPLPPLTPGALPQEQEEPA 126 Query: 52 RYKPGIANYHNIITCGFSTLLPYIRQQPLAMQQRFNLLFP---DFVDHIQSPLPLASTLL 108 + G LL +++ A+ +R LLFP S P + L+ Sbjct: 127 TTLLQDPDPA---AWGAYCLLSMLKRLSPAVYRRACLLFPVEQPLPQQATSLSPSSINLI 183 Query: 109 ERITFYAK 116 + +A+ Sbjct: 184 KMALHHAQ 191 >UniRef50_C0AR78 Putative uncharacterized protein n=1 Tax=Proteus penneri ATCC 35198 RepID=C0AR78_9ENTR Length = 188 Score = 88.3 bits (217), Expect = 7e-17, Method: Composition-based stats. Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 2/90 (2%) Query: 2 NNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANYH 61 W+LLP L G + + G +Y++P L+ +LS P+ + + Sbjct: 71 TQWSLLPQCALFLGYFYSPQYILHSGDYYQLPSSLQAFLSLRPVILINEENKNDNN--EI 128 Query: 62 NIITCGFSTLLPYIRQQPLAMQQRFNLLFP 91 IT G+ L +I +A+ QRF L FP Sbjct: 129 APITIGYQLLFTFIHSISVALAQRFKLCFP 158 >UniRef50_B2VID4 Oxygen-regulated invasion protein n=1 Tax=Erwinia tasmaniensis RepID=B2VID4_ERWT9 Length = 178 Score = 64.4 bits (155), Expect = 1e-09, Method: Composition-based stats. Identities = 26/123 (21%), Positives = 45/123 (36%), Gaps = 7/123 (5%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 ++ W L+ L G + R R ++ + L ++S + Sbjct: 60 LDKWELINDAACLIGGYLLRSRILKQCTEIILNPRLSSFISLPIPHHVSLTTTGE----H 115 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPD---FVDHIQSPLPLASTLLERITFYAKK 117 N ++ G + +L I PLA+++R L P D + P LL YAK Sbjct: 116 SNTLSLGLAFILSQIPHFPLALKERVLLFLPAEIDLPDTFIARNPNHLNLLTMALNYAKN 175 Query: 118 NRD 120 R+ Sbjct: 176 YRE 178 >UniRef50_D0FT12 Oxygen-regulated invasion protein n=2 Tax=Erwinia pyrifoliae RepID=D0FT12_ERWPY Length = 184 Score = 64.0 bits (154), Expect = 1e-09, Method: Composition-based stats. 
Identities = 23/121 (19%), Positives = 39/121 (32%), Gaps = 7/121 (5%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W +P+ L G + R R + L ++S + + Sbjct: 67 LTQWEHMPVTAHLVGGYLLRARLLSQCAVLMSDSRLLAFISLPLI----PHISSLTLPRS 122 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPD---FVDHIQSPLPLASTLLERITFYAKK 117 + I G + +L IR PLA+ +R + FP LL+ YA Sbjct: 123 VDTIALGVAFILSQIRPLPLALMRRLLMSFPTDIKLPQLHIERTLGHHNLLKMAINYAIH 182 Query: 118 N 118 Sbjct: 183 F 183 >UniRef50_D2UDE2 Probable oxygen-regulated invasion protein orga n=1 Tax=Xanthomonas albilineans RepID=D2UDE2_XANAL Length = 198 Score = 54.0 bits (128), Expect = 1e-06, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 44/130 (33%), Gaps = 17/130 (13%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 + W ++P +C L G R + + ++ + ++ + A + Sbjct: 66 LRKWYVIPDLCFLLGLFSQRGEWLSSNRYRQLDGRCKRFIELPLPHLAVPACVLDENVDP 125 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSPLPL--------------AST 106 + CG + L I P + +R LLF ++ + L L + T Sbjct: 126 ---VVCGLAALRFLIPAMPPVIDERLRLLFSSRIEAETAALSLQVGRASPEDCGRWISIT 182 Query: 107 LLERITFYAK 116 L YA+ Sbjct: 183 LFSMAAHYAQ 192 >UniRef50_D2TZ25 Type III secretion apparatus protein OrgA,MxiK n=1 Tax=Arsenophonus nasoniae RepID=D2TZ25_9ENTR Length = 182 Score = 53.2 bits (126), Expect = 2e-06, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 42/120 (35%), Gaps = 4/120 (3%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANY 60 I NWN++ + + C YR G K+ + +R++ ++ Sbjct: 64 IKNWNVILEVSFMLACLRYRSLLFRSGKIVKLDESIRNFCELNIIDDFVFVAKNNFSDID 123 Query: 61 HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVDHIQSP--LPLASTLLERITFYAKKN 118 ++ + Y + +R +LFP + + + LL+ YAK+ Sbjct: 124 LWLLA--RKEIYIYQDYLSDTVMKRLAILFPRIDNDDYNFAKINPQFNLLKLAVQYAKRY 181 >UniRef50_Q93RD2 EorA protein n=20 Tax=Escherichia coli RepID=Q93RD2_ECOLX Length = 96 Score = 47.8 bits (112), Expect = 1e-04, Method: Composition-based stats. 
Identities = 25/32 (78%), Positives = 26/32 (81%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKV 32 INNWNL PL CL SG HFYRERFAERGFF + Sbjct: 62 INNWNLFPLFCLFSGYHFYRERFAERGFFIRF 93 >UniRef50_D2TZU4 Type III secretion system protein (Oxygen-regulated invasion protein) n=1 Tax=Arsenophonus nasoniae RepID=D2TZU4_9ENTR Length = 185 Score = 47.5 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 32/121 (26%), Positives = 52/121 (42%), Gaps = 6/121 (4%) Query: 1 INNWNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLS-AIPLEINEKARYKPGIAN 59 IN W+ LP+ L G + + ++P+ +LS A L +K + P IA Sbjct: 61 INYWSRLPIAALYLGGLYQSPQLMIANQLTQLPEQFSQFLSWARLLLPPQKKQIVPLIAT 120 Query: 60 Y---HNIITCGFSTLLPYIRQQPLAMQQRFNLLFPDFVD--HIQSPLPLASTLLERITFY 114 + CG ++P I + LA+ RF+LLF + LP+ +LL+ Y Sbjct: 121 SYSQDELTKCGMWAIIPLIEKCSLALTARFSLLFNKNYPQCQANTVLPVNYSLLKATLNY 180 Query: 115 A 115 Sbjct: 181 V 181 >UniRef50_B1FB91 Type III secretion apparatus protein OrgA/MxiK n=1 Tax=Burkholderia ambifaria IOP40-10 RepID=B1FB91_9BURK Length = 192 Score = 37.8 bits (86), Expect = 0.094, Method: Composition-based stats. 
 Identities = 23/89 (25%), Positives = 34/89 (38%), Gaps = 2/89 (2%)

Query: 4 WNLLPLICLLSGCHFYRERFAERGFFYKVPDVLRDYLSAIPLEINEKARYKPGIANYHNI 63 W L IC L G R E+ ++ K+ + R +L A P +I Sbjct: 67 WAHLRRICYLIGVLRLRSPIIEQAWYPKLDALARQFLQVPAP--ALPAAESPLRLEQADI 124 Query: 64 ITCGFSTLLPYIRQQPLAMQQRFNLLFPD 92 G + R+ P A++ R LLFP Sbjct: 125 YAAGSAGFAAVFRRMPAALRLRLPLLFPA 153

  Database: uniref50.fasta
    Posted date:  Mar 8, 2010 10:38 AM
  Number of letters in database: 1,040,396,356
  Number of sequences in database:  3,077,464

Lambda     K      H
   0.311    0.126    0.344

Lambda     K      H
   0.267   0.0387    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 714,887,735
Number of Sequences: 3077464
Number of extensions: 23360457
Number of successful extensions: 95573
Number of sequences better than 1.0e-01: 18
Number of HSP's better than 0.1 without gapping: 23
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 95511
Number of HSP's gapped (non-prelim): 43
length of query: 132
length of database: 1,040,396,356
effective HSP length: 97
effective length of query: 35
effective length of database: 741,882,348
effective search space: 25965882180
effective search space used: 25965882180
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (20.9 bits)
S2: 87 (38.2 bits)
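
The statistics in the footer above can be cross-checked with a few lines of Python. The sketch below is not part of the BLAST output and is not how BLAST itself is implemented; it only applies the standard Karlin-Altschul relations to the reported numbers. It assumes that the first Lambda/K row holds the ungapped parameters (paired with S1) and the second row the gapped parameters (paired with S2), and that the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence. The function names are hypothetical helpers chosen for illustration.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length.

    Simplified: real BLAST also guards against non-positive lengths,
    which does not matter for the values in this report.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`: E = N * 2^(-S')."""
    return search_space * 2.0 ** (-bits)


# Values copied from the footer of the report.
space = effective_search_space(
    query_len=132, db_letters=1_040_396_356, db_seqs=3_077_464, hsp_len=97
)
s1_bits = bit_score(40, lam=0.311, k=0.126)   # assumed ungapped row, S1
s2_bits = bit_score(87, lam=0.267, k=0.0387)  # assumed gapped row, S2

print("effective search space:", space)
print("S1 bits:", round(s1_bits, 1))
print("S2 bits:", round(s2_bits, 1))
print("E-value at S2:", expect_value(s2_bits, space))
```

Under these assumptions the computed search space matches the "effective search space" line, the two bit scores agree with the S1 and S2 lines after rounding, and the E-value at the S2 cutoff falls just under the 1.0e-01 reporting threshold. Per-hit E-values in rounds 2 and 3 may differ slightly from this calculation because composition-based statistics rescale the scoring parameters for each subject sequence.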