Miyakogusa Predicted Gene

Lj1g3v0395870.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v0395870.1 Non Chatacterized Hit- tr|I1KCE9|I1KCE9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.48216
PE,77.18,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.25647.1
         (333 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G62960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   357   5e-99
AT1G10660.1 | Symbols:  | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT1G10660.4 | Symbols:  | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT1G10660.3 | Symbols:  | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT1G10660.2 | Symbols:  | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT3G27770.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   251   5e-67
AT3G27770.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   221   6e-58
AT1G70505.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   201   8e-52
AT2G47115.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   181   9e-46

>AT5G62960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G10660.1); Has 35333 Blast hits to 34131
           proteins in 2444 species: Archae - 798; Bacteria -
           22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
           - 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
           chr5:25269444-25271533 FORWARD LENGTH=347
          Length = 347

 Score =  357 bits (917), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 170/336 (50%), Positives = 228/336 (67%), Gaps = 18/336 (5%)

Query: 2   TAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREI---QEETSAT 58
           TA T   SYW NWR ++  I + ++   ++ LI+K+E  RR  R+   E+   ++E S  
Sbjct: 26  TANTTESSYWFNWRVMICCIWMAIATVITAFLIFKYEGFRR-KRSDVGEVDGGEKEWSGN 84

Query: 59  LYEDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIY 118
           +YEDETWRPCL+ IHPAWL+A                   DG +IF+YYTQWT   +T+Y
Sbjct: 85  VYEDETWRPCLRNIHPAWLLAFRVVAFFVLLVMLIVIGLVDGPTIFFYYTQWTFGLITLY 144

Query: 119 FGLGSLLSMHGCYQHHKKARGDKVDNVDG-DAEKGMHNAPAIPPGSNASDQEKNLKAPEK 177
           FGLGSLLS+HGCYQ++K+A GD+VD+++  D+E+           S  +D          
Sbjct: 145 FGLGSLLSLHGCYQYNKRAAGDRVDSIEAIDSERAR---------SKGADNTIQQSQYSS 195

Query: 178 DPLRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAV 237
           +P    AG WGY+FQIIFQ+NAGAV+LTDCVFWFIIVPFL I DY+LN  +++MH+ NA+
Sbjct: 196 NP----AGFWGYVFQIIFQMNAGAVLLTDCVFWFIIVPFLEIHDYSLNVLVINMHSLNAI 251

Query: 238 FLISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYF 297
           FL+ D ALN L FP FRI YF  WT+ YVIFQW +H+ +++WWPYPFLDLSS Y+PLWYF
Sbjct: 252 FLLGDAALNSLSFPCFRIAYFFFWTIAYVIFQWALHSLVHIWWPYPFLDLSSHYAPLWYF 311

Query: 298 AVALLHIPCYSIFPLVMKLKHDVFSTWYPDSYQCVR 333
           +VA++H+PCY  F L++KLKH +   W+P+SYQ  R
Sbjct: 312 SVAVMHLPCYGAFALLVKLKHRLLQRWFPESYQSPR 347


>AT1G10660.1 | Symbols:  | unknown protein; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 15 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G62960.1);
           Has 155 Blast hits to 154 proteins in 22 species: Archae
           - 0; Bacteria - 0; Metazoa - 10; Fungi - 4; Plants -
           139; Viruses - 0; Other Eukaryotes - 2 (source: NCBI
           BLink). | chr1:3533009-3534781 FORWARD LENGTH=320
          Length = 320

 Score =  302 bits (773), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)

Query: 1   MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
           M A T   SYWLNWR LL +++++  I  ++VLIWK+E  RR  R   RE+      TL+
Sbjct: 1   MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56

Query: 61  EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
           +DE W  C K IHP WL+A                   DG  IFY+YTQWT T VT+YFG
Sbjct: 57  QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116

Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
             S+LS++GC  ++K+A G+       GD E+G +  P    G   + +  N   P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174

Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
            R+ AG W YIFQI+FQ  AGAV+LTD VFW II PF   K Y L+F  V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232

Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
           + DT+LN LRFP FRI YF LW+  +V +QWI+HA  NLWWPY FLDLSS Y+PLWY  V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292

Query: 300 ALLHIPCYSIFPLVMKLKH 318
           A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311


>AT1G10660.4 | Symbols:  | unknown protein; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 15 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G62960.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr1:3533009-3534781 FORWARD
           LENGTH=320
          Length = 320

 Score =  302 bits (773), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)

Query: 1   MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
           M A T   SYWLNWR LL +++++  I  ++VLIWK+E  RR  R   RE+      TL+
Sbjct: 1   MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56

Query: 61  EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
           +DE W  C K IHP WL+A                   DG  IFY+YTQWT T VT+YFG
Sbjct: 57  QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116

Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
             S+LS++GC  ++K+A G+       GD E+G +  P    G   + +  N   P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174

Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
            R+ AG W YIFQI+FQ  AGAV+LTD VFW II PF   K Y L+F  V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232

Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
           + DT+LN LRFP FRI YF LW+  +V +QWI+HA  NLWWPY FLDLSS Y+PLWY  V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292

Query: 300 ALLHIPCYSIFPLVMKLKH 318
           A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311


>AT1G10660.3 | Symbols:  | unknown protein; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 15 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G62960.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr1:3533009-3534781 FORWARD
           LENGTH=320
          Length = 320

 Score =  302 bits (773), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)

Query: 1   MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
           M A T   SYWLNWR LL +++++  I  ++VLIWK+E  RR  R   RE+      TL+
Sbjct: 1   MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56

Query: 61  EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
           +DE W  C K IHP WL+A                   DG  IFY+YTQWT T VT+YFG
Sbjct: 57  QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116

Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
             S+LS++GC  ++K+A G+       GD E+G +  P    G   + +  N   P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174

Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
            R+ AG W YIFQI+FQ  AGAV+LTD VFW II PF   K Y L+F  V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232

Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
           + DT+LN LRFP FRI YF LW+  +V +QWI+HA  NLWWPY FLDLSS Y+PLWY  V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292

Query: 300 ALLHIPCYSIFPLVMKLKH 318
           A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311


>AT1G10660.2 | Symbols:  | unknown protein; LOCATED IN: endomembrane
           system; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 15 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G62960.1);
           Has 35333 Blast hits to 34131 proteins in 2444 species:
           Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
           991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
           (source: NCBI BLink). | chr1:3533009-3534781 FORWARD
           LENGTH=320
          Length = 320

 Score =  302 bits (773), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)

Query: 1   MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
           M A T   SYWLNWR LL +++++  I  ++VLIWK+E  RR  R   RE+      TL+
Sbjct: 1   MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56

Query: 61  EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
           +DE W  C K IHP WL+A                   DG  IFY+YTQWT T VT+YFG
Sbjct: 57  QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116

Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
             S+LS++GC  ++K+A G+       GD E+G +  P    G   + +  N   P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174

Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
            R+ AG W YIFQI+FQ  AGAV+LTD VFW II PF   K Y L+F  V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232

Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
           + DT+LN LRFP FRI YF LW+  +V +QWI+HA  NLWWPY FLDLSS Y+PLWY  V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292

Query: 300 ALLHIPCYSIFPLVMKLKH 318
           A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311


>AT3G27770.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; EXPRESSED IN: 22 plant structures; EXPRESSED
           DURING: 13 growth stages; BEST Arabidopsis thaliana
           protein match is: unknown protein (TAIR:AT5G62960.1);
           Has 158 Blast hits to 157 proteins in 21 species: Archae
           - 0; Bacteria - 0; Metazoa - 13; Fungi - 0; Plants -
           141; Viruses - 0; Other Eukaryotes - 4 (source: NCBI
           BLink). | chr3:10285818-10287474 REVERSE LENGTH=315
          Length = 315

 Score =  251 bits (641), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 134/322 (41%), Positives = 193/322 (59%), Gaps = 17/322 (5%)

Query: 4   QTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSA--TLYE 61
           +  +  YW NWR LL +I V+V +  S +++WK+E       + S + Q   +    L  
Sbjct: 8   EFTSFDYWFNWRVLLCAIWVIVPMIVSLLVLWKYE-------DSSVQTQPSLNGNDVLCI 60

Query: 62  DETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFGL 121
           D+ WRPC + IHP WL+                     G  I+YYYTQWT T + IYFG+
Sbjct: 61  DDVWRPCFERIHPGWLLGFRVLGFCFLLANNIARFANRGWRIYYYYTQWTFTLIAIYFGM 120

Query: 122 GSLLSMHGCYQHHKKAR-GDKVDNVDGDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDPL 180
           GSLLS++GC Q+ K+   G   D V  DAE G   +P I  G N    EK  K    + L
Sbjct: 121 GSLLSIYGCLQYKKQGNTGLIADQVGIDAENGFR-SPLID-GDNMVSFEKR-KTSGSEAL 177

Query: 181 RQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFLI 240
           +    ++ ++FQII+Q+ AGA +LTD ++W +I PFL+++DY ++F  V++HT N V L+
Sbjct: 178 K----SYVHLFQIIYQMGAGAAVLTDSIYWTVIFPFLSLQDYEMSFMTVNLHTSNLVLLL 233

Query: 241 SDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAVA 300
            DT LN L+FP FR  YF LWT  +V+FQWI+H  I++ WPYPFL+LS   +P+WY  VA
Sbjct: 234 IDTFLNRLKFPLFRFSYFILWTGCFVLFQWILHMFISVGWPYPFLNLSLDMAPVWYLLVA 293

Query: 301 LLHIPCYSIFPLVMKLKHDVFS 322
           LLH+P Y +F L++K+K+ + S
Sbjct: 294 LLHLPSYGLFALIVKIKYKLIS 315


>AT3G27770.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT5G62960.1); Has 150 Blast hits to 150 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 13;
           Fungi - 0; Plants - 133; Viruses - 0; Other Eukaryotes -
           4 (source: NCBI BLink). | chr3:10285818-10287461 REVERSE
           LENGTH=272
          Length = 272

 Score =  221 bits (563), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 114/259 (44%), Positives = 161/259 (62%), Gaps = 8/259 (3%)

Query: 65  WRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFGLGSL 124
           + PC + IHP WL+                     G  I+YYYTQWT T + IYFG+GSL
Sbjct: 21  YGPCFERIHPGWLLGFRVLGFCFLLANNIARFANRGWRIYYYYTQWTFTLIAIYFGMGSL 80

Query: 125 LSMHGCYQHHKKAR-GDKVDNVDGDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDPLRQL 183
           LS++GC Q+ K+   G   D V  DAE G   +P I  G N    EK  K    + L+  
Sbjct: 81  LSIYGCLQYKKQGNTGLIADQVGIDAENGFR-SPLID-GDNMVSFEKR-KTSGSEALK-- 135

Query: 184 AGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFLISDT 243
             ++ ++FQII+Q+ AGA +LTD ++W +I PFL+++DY ++F  V++HT N V L+ DT
Sbjct: 136 --SYVHLFQIIYQMGAGAAVLTDSIYWTVIFPFLSLQDYEMSFMTVNLHTSNLVLLLIDT 193

Query: 244 ALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAVALLH 303
            LN L+FP FR  YF LWT  +V+FQWI+H  I++ WPYPFL+LS   +P+WY  VALLH
Sbjct: 194 FLNRLKFPLFRFSYFILWTGCFVLFQWILHMFISVGWPYPFLNLSLDMAPVWYLLVALLH 253

Query: 304 IPCYSIFPLVMKLKHDVFS 322
           +P Y +F L++K+K+ + S
Sbjct: 254 LPSYGLFALIVKIKYKLIS 272


>AT1G70505.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G10660.1); Has 141 Blast hits to 140 proteins
           in 16 species: Archae - 0; Bacteria - 0; Metazoa - 4;
           Fungi - 0; Plants - 135; Viruses - 0; Other Eukaryotes -
           2 (source: NCBI BLink). | chr1:26570647-26572463 FORWARD
           LENGTH=358
          Length = 358

 Score =  201 bits (510), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 111/252 (44%), Positives = 149/252 (59%), Gaps = 7/252 (2%)

Query: 1   MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHER-LRRIARNGSREIQ-EETSAT 58
           MT++TN  SYWLNWRF + +I V+ S+  SS LIW++E  ++R  R   + ++ E+ +  
Sbjct: 27  MTSETNIPSYWLNWRFFVCAIFVLTSLFLSSYLIWRYEGPIKRKKRGDDQSLELEQLTGV 86

Query: 59  LYEDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIY 118
           +Y+DE+W   +K IHP WL+                    DG  IF +YTQWT T VTIY
Sbjct: 87  VYDDESWNTSVKEIHPNWLLGFRVFGFVVLLGLISGNAIADGTGIFIFYTQWTFTLVTIY 146

Query: 119 FGLGSLLSMHGCYQHHKKARGDKVDNVDGDAEKGMHNAPAIPPGSNASDQEKNLKAPEKD 178
           FGLGSL+S+   Y+      G+   ++  D E+G +  P     SN           E  
Sbjct: 147 FGLGSLVSI---YRFRSPDNGENRVSIV-DEEQGTYRPPGNAENSNVFKSSSG-HDRENM 201

Query: 179 PLRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVF 238
             RQ+A T GYI QI+FQ  AGAV+LTD VFWFII PFLT KD+NL+FFIV MH+ NA+F
Sbjct: 202 STRQVATTLGYIHQILFQTCAGAVLLTDGVFWFIIYPFLTAKDFNLDFFIVIMHSVNAIF 261

Query: 239 LISDTALNCLRF 250
           L+ +T LN L F
Sbjct: 262 LLGETFLNSLGF 273


>AT2G47115.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: endomembrane
           system; BEST Arabidopsis thaliana protein match is:
           unknown protein (TAIR:AT1G10660.1); Has 35333 Blast hits
           to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr2:19345376-19346864 REVERSE LENGTH=300
          Length = 300

 Score =  181 bits (458), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 107/293 (36%), Positives = 146/293 (49%), Gaps = 36/293 (12%)

Query: 36  KHERLRRIARNGSREIQEETSATLYEDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXX 95
            H+ L  + R+ +R      SA L     W  C   +HP WL+                 
Sbjct: 41  SHDSLLPLYRSNNRG-----SARL-----WASCWTRLHPGWLLFTRSTSFLSMAALLAWD 90

Query: 96  XXXDGGSIFYYYTQWTLTSVTIYFGLGSLLSMHGCYQHHKKARGDKVDNVDGDAEKGMHN 155
                 SIF YYT+WT   V IYF +G + S++GC  H K+                   
Sbjct: 91  VIKWDASIFVYYTEWTFMLVIIYFAMGIVASVYGCLIHLKEL------------------ 132

Query: 156 APAIPPGSNASDQEKNLKAPEKDPLRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVP 215
                        E  +     D  R+     G   Q IFQ +AGAV+LTD VFW +IVP
Sbjct: 133 --------TLETDEDVVVEKVGDEFRRRLEVCGCFMQTIFQTSAGAVVLTDIVFWLVIVP 184

Query: 216 FLTIKDYNLNFFIVSMHTFNAVFLISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHAS 275
           FL+   + LN   + MHT NA FL+ +T LN L FPWFR+GYF LW+  YVIFQWI+HA 
Sbjct: 185 FLSTTRFGLNTLTICMHTANAGFLLLETLLNSLPFPWFRMGYFVLWSCLYVIFQWIIHAC 244

Query: 276 INLWWPYPFLDLSSSYSPLWYFAVALLHIPCYSIFPLVMKLKHDVFSTWYPDS 328
              WWPYPFL+L   ++P+WY  +A++HIPCY  +  ++K K+  F   +P++
Sbjct: 245 GFTWWPYPFLELDKPWAPIWYLCMAIVHIPCYGAYAAIVKAKNSCFPYLFPNA 297