Miyakogusa Predicted Gene
- Lj1g3v0395870.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v0395870.1 Non Characterized Hit- tr|I1KCE9|I1KCE9_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.48216
PE,77.18,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.25647.1
(333 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
AT5G62960.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   357   5e-99
AT1G10660.1 | Symbols: | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT1G10660.4 | Symbols: | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT1G10660.3 | Symbols: | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT1G10660.2 | Symbols: | unknown protein; LOCATED IN: endomembr...   302   2e-82
AT3G27770.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   251   5e-67
AT3G27770.2 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   221   6e-58
AT1G70505.1 | Symbols: | unknown protein; BEST Arabidopsis thal...   201   8e-52
AT2G47115.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul...   181   9e-46
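The hit summary above lists, for each subject sequence, a truncated description followed by the bit score and the E-value as the last two whitespace-separated fields. A minimal sketch of reading those lines in Python follows. The function name `parse_hit_line` is hypothetical and not part of BLAST or any library, and the handling of E-values printed without a leading digit (such as `e-105`) is an assumption about the legacy output format rather than something that occurs in this report.

```python
def parse_hit_line(line):
    """Split one hit-summary line into (subject_id, bit_score, e_value).

    Assumes the last two whitespace-separated fields are the bit score
    and the E-value, and the first field is the subject identifier.
    """
    head, bits, evalue = line.rsplit(None, 2)
    # Assumption: very small E-values may be printed as "e-105";
    # prefix a "1" so that float() accepts them.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return head.split()[0], float(bits), float(evalue)


# Three lines copied from the summary table of this report.
summary = [
    "AT5G62960.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 357 5e-99",
    "AT1G10660.1 | Symbols: | unknown protein; LOCATED IN: endomembr... 302 2e-82",
    "AT2G47115.1 | Symbols: | unknown protein; FUNCTIONS IN: molecul... 181 9e-46",
]

hits = [parse_hit_line(line) for line in summary]
for subject_id, bit_score, e_value in hits:
    print(subject_id, bit_score, e_value)
```

Splitting from the right keeps the parser independent of the description text, which may itself contain spaces, pipes and digits.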
>AT5G62960.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G10660.1); Has 35333 Blast hits to 34131
proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr5:25269444-25271533 FORWARD LENGTH=347
Length = 347
Score = 357 bits (917), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 170/336 (50%), Positives = 228/336 (67%), Gaps = 18/336 (5%)
Query: 2 TAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREI---QEETSAT 58
TA T SYW NWR ++ I + ++ ++ LI+K+E RR R+ E+ ++E S
Sbjct: 26 TANTTESSYWFNWRVMICCIWMAIATVITAFLIFKYEGFRR-KRSDVGEVDGGEKEWSGN 84
Query: 59 LYEDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIY 118
+YEDETWRPCL+ IHPAWL+A DG +IF+YYTQWT +T+Y
Sbjct: 85 VYEDETWRPCLRNIHPAWLLAFRVVAFFVLLVMLIVIGLVDGPTIFFYYTQWTFGLITLY 144
Query: 119 FGLGSLLSMHGCYQHHKKARGDKVDNVDG-DAEKGMHNAPAIPPGSNASDQEKNLKAPEK 177
FGLGSLLS+HGCYQ++K+A GD+VD+++ D+E+ S +D
Sbjct: 145 FGLGSLLSLHGCYQYNKRAAGDRVDSIEAIDSERAR---------SKGADNTIQQSQYSS 195
Query: 178 DPLRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAV 237
+P AG WGY+FQIIFQ+NAGAV+LTDCVFWFIIVPFL I DY+LN +++MH+ NA+
Sbjct: 196 NP----AGFWGYVFQIIFQMNAGAVLLTDCVFWFIIVPFLEIHDYSLNVLVINMHSLNAI 251
Query: 238 FLISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYF 297
FL+ D ALN L FP FRI YF WT+ YVIFQW +H+ +++WWPYPFLDLSS Y+PLWYF
Sbjct: 252 FLLGDAALNSLSFPCFRIAYFFFWTIAYVIFQWALHSLVHIWWPYPFLDLSSHYAPLWYF 311
Query: 298 AVALLHIPCYSIFPLVMKLKHDVFSTWYPDSYQCVR 333
+VA++H+PCY F L++KLKH + W+P+SYQ R
Sbjct: 312 SVAVMHLPCYGAFALLVKLKHRLLQRWFPESYQSPR 347
>AT1G10660.1 | Symbols: | unknown protein; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G62960.1);
Has 155 Blast hits to 154 proteins in 22 species: Archae
- 0; Bacteria - 0; Metazoa - 10; Fungi - 4; Plants -
139; Viruses - 0; Other Eukaryotes - 2 (source: NCBI
BLink). | chr1:3533009-3534781 FORWARD LENGTH=320
Length = 320
Score = 302 bits (773), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)
Query: 1 MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
M A T SYWLNWR LL +++++ I ++VLIWK+E RR R RE+ TL+
Sbjct: 1 MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56
Query: 61 EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
+DE W C K IHP WL+A DG IFY+YTQWT T VT+YFG
Sbjct: 57 QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116
Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
S+LS++GC ++K+A G+ GD E+G + P G + + N P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174
Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
R+ AG W YIFQI+FQ AGAV+LTD VFW II PF K Y L+F V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232
Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
+ DT+LN LRFP FRI YF LW+ +V +QWI+HA NLWWPY FLDLSS Y+PLWY V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292
Query: 300 ALLHIPCYSIFPLVMKLKH 318
A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311
>AT1G10660.4 | Symbols: | unknown protein; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G62960.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr1:3533009-3534781 FORWARD
LENGTH=320
Length = 320
Score = 302 bits (773), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)
Query: 1 MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
M A T SYWLNWR LL +++++ I ++VLIWK+E RR R RE+ TL+
Sbjct: 1 MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56
Query: 61 EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
+DE W C K IHP WL+A DG IFY+YTQWT T VT+YFG
Sbjct: 57 QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116
Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
S+LS++GC ++K+A G+ GD E+G + P G + + N P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174
Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
R+ AG W YIFQI+FQ AGAV+LTD VFW II PF K Y L+F V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232
Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
+ DT+LN LRFP FRI YF LW+ +V +QWI+HA NLWWPY FLDLSS Y+PLWY V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292
Query: 300 ALLHIPCYSIFPLVMKLKH 318
A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311
>AT1G10660.3 | Symbols: | unknown protein; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G62960.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr1:3533009-3534781 FORWARD
LENGTH=320
Length = 320
Score = 302 bits (773), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)
Query: 1 MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
M A T SYWLNWR LL +++++ I ++VLIWK+E RR R RE+ TL+
Sbjct: 1 MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56
Query: 61 EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
+DE W C K IHP WL+A DG IFY+YTQWT T VT+YFG
Sbjct: 57 QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116
Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
S+LS++GC ++K+A G+ GD E+G + P G + + N P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174
Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
R+ AG W YIFQI+FQ AGAV+LTD VFW II PF K Y L+F V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232
Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
+ DT+LN LRFP FRI YF LW+ +V +QWI+HA NLWWPY FLDLSS Y+PLWY V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292
Query: 300 ALLHIPCYSIFPLVMKLKH 318
A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311
>AT1G10660.2 | Symbols: | unknown protein; LOCATED IN: endomembrane
system; EXPRESSED IN: 24 plant structures; EXPRESSED
DURING: 15 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G62960.1);
Has 35333 Blast hits to 34131 proteins in 2444 species:
Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi -
991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610
(source: NCBI BLink). | chr1:3533009-3534781 FORWARD
LENGTH=320
Length = 320
Score = 302 bits (773), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 202/319 (63%), Gaps = 9/319 (2%)
Query: 1 MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSATLY 60
M A T SYWLNWR LL +++++ I ++VLIWK+E RR R RE+ TL+
Sbjct: 1 MAADTTASSYWLNWRVLLCALILLAPIVLAAVLIWKYEGKRRRQRESQREL----PGTLF 56
Query: 61 EDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFG 120
+DE W C K IHP WL+A DG IFY+YTQWT T VT+YFG
Sbjct: 57 QDEAWTTCFKRIHPLWLLAFRVFSFVAMLTLLISNVVRDGAGIFYFYTQWTFTLVTLYFG 116
Query: 121 LGSLLSMHGCYQHHKKARGDKVDNVD-GDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDP 179
S+LS++GC ++K+A G+ GD E+G + P G + + N P + P
Sbjct: 117 YASVLSVYGCCIYNKEASGNMESYTSIGDTEQGTYRPPIALDGEGNTSKASN--RPSEAP 174
Query: 180 LRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFL 239
R+ AG W YIFQI+FQ AGAV+LTD VFW II PF K Y L+F V MH+ NAVFL
Sbjct: 175 ARKTAGFWVYIFQILFQTCAGAVVLTDIVFWAIIYPF--TKGYKLSFLDVCMHSLNAVFL 232
Query: 240 ISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAV 299
+ DT+LN LRFP FRI YF LW+ +V +QWI+HA NLWWPY FLDLSS Y+PLWY V
Sbjct: 233 LGDTSLNSLRFPLFRIAYFVLWSCIFVAYQWIIHAVKNLWWPYQFLDLSSPYAPLWYLGV 292
Query: 300 ALLHIPCYSIFPLVMKLKH 318
A++HIPC+++F LV+KLK+
Sbjct: 293 AVMHIPCFAVFALVIKLKN 311
>AT3G27770.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; EXPRESSED IN: 22 plant structures; EXPRESSED
DURING: 13 growth stages; BEST Arabidopsis thaliana
protein match is: unknown protein (TAIR:AT5G62960.1);
Has 158 Blast hits to 157 proteins in 21 species: Archae
- 0; Bacteria - 0; Metazoa - 13; Fungi - 0; Plants -
141; Viruses - 0; Other Eukaryotes - 4 (source: NCBI
BLink). | chr3:10285818-10287474 REVERSE LENGTH=315
Length = 315
Score = 251 bits (641), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 193/322 (59%), Gaps = 17/322 (5%)
Query: 4 QTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHERLRRIARNGSREIQEETSA--TLYE 61
+ + YW NWR LL +I V+V + S +++WK+E + S + Q + L
Sbjct: 8 EFTSFDYWFNWRVLLCAIWVIVPMIVSLLVLWKYE-------DSSVQTQPSLNGNDVLCI 60
Query: 62 DETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFGL 121
D+ WRPC + IHP WL+ G I+YYYTQWT T + IYFG+
Sbjct: 61 DDVWRPCFERIHPGWLLGFRVLGFCFLLANNIARFANRGWRIYYYYTQWTFTLIAIYFGM 120
Query: 122 GSLLSMHGCYQHHKKAR-GDKVDNVDGDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDPL 180
GSLLS++GC Q+ K+ G D V DAE G +P I G N EK K + L
Sbjct: 121 GSLLSIYGCLQYKKQGNTGLIADQVGIDAENGFR-SPLID-GDNMVSFEKR-KTSGSEAL 177
Query: 181 RQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFLI 240
+ ++ ++FQII+Q+ AGA +LTD ++W +I PFL+++DY ++F V++HT N V L+
Sbjct: 178 K----SYVHLFQIIYQMGAGAAVLTDSIYWTVIFPFLSLQDYEMSFMTVNLHTSNLVLLL 233
Query: 241 SDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAVA 300
DT LN L+FP FR YF LWT +V+FQWI+H I++ WPYPFL+LS +P+WY VA
Sbjct: 234 IDTFLNRLKFPLFRFSYFILWTGCFVLFQWILHMFISVGWPYPFLNLSLDMAPVWYLLVA 293
Query: 301 LLHIPCYSIFPLVMKLKHDVFS 322
LLH+P Y +F L++K+K+ + S
Sbjct: 294 LLHLPSYGLFALIVKIKYKLIS 315
>AT3G27770.2 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT5G62960.1); Has 150 Blast hits to 150 proteins
in 21 species: Archae - 0; Bacteria - 0; Metazoa - 13;
Fungi - 0; Plants - 133; Viruses - 0; Other Eukaryotes -
4 (source: NCBI BLink). | chr3:10285818-10287461 REVERSE
LENGTH=272
Length = 272
Score = 221 bits (563), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 114/259 (44%), Positives = 161/259 (62%), Gaps = 8/259 (3%)
Query: 65 WRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIYFGLGSL 124
+ PC + IHP WL+ G I+YYYTQWT T + IYFG+GSL
Sbjct: 21 YGPCFERIHPGWLLGFRVLGFCFLLANNIARFANRGWRIYYYYTQWTFTLIAIYFGMGSL 80
Query: 125 LSMHGCYQHHKKAR-GDKVDNVDGDAEKGMHNAPAIPPGSNASDQEKNLKAPEKDPLRQL 183
LS++GC Q+ K+ G D V DAE G +P I G N EK K + L+
Sbjct: 81 LSIYGCLQYKKQGNTGLIADQVGIDAENGFR-SPLID-GDNMVSFEKR-KTSGSEALK-- 135
Query: 184 AGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVFLISDT 243
++ ++FQII+Q+ AGA +LTD ++W +I PFL+++DY ++F V++HT N V L+ DT
Sbjct: 136 --SYVHLFQIIYQMGAGAAVLTDSIYWTVIFPFLSLQDYEMSFMTVNLHTSNLVLLLIDT 193
Query: 244 ALNCLRFPWFRIGYFCLWTVTYVIFQWIVHASINLWWPYPFLDLSSSYSPLWYFAVALLH 303
LN L+FP FR YF LWT +V+FQWI+H I++ WPYPFL+LS +P+WY VALLH
Sbjct: 194 FLNRLKFPLFRFSYFILWTGCFVLFQWILHMFISVGWPYPFLNLSLDMAPVWYLLVALLH 253
Query: 304 IPCYSIFPLVMKLKHDVFS 322
+P Y +F L++K+K+ + S
Sbjct: 254 LPSYGLFALIVKIKYKLIS 272
>AT1G70505.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G10660.1); Has 141 Blast hits to 140 proteins
in 16 species: Archae - 0; Bacteria - 0; Metazoa - 4;
Fungi - 0; Plants - 135; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:26570647-26572463 FORWARD
LENGTH=358
Length = 358
Score = 201 bits (510), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 111/252 (44%), Positives = 149/252 (59%), Gaps = 7/252 (2%)
Query: 1 MTAQTNNHSYWLNWRFLLSSILVMVSITFSSVLIWKHER-LRRIARNGSREIQ-EETSAT 58
MT++TN SYWLNWRF + +I V+ S+ SS LIW++E ++R R + ++ E+ +
Sbjct: 27 MTSETNIPSYWLNWRFFVCAIFVLTSLFLSSYLIWRYEGPIKRKKRGDDQSLELEQLTGV 86
Query: 59 LYEDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXXXXXDGGSIFYYYTQWTLTSVTIY 118
+Y+DE+W +K IHP WL+ DG IF +YTQWT T VTIY
Sbjct: 87 VYDDESWNTSVKEIHPNWLLGFRVFGFVVLLGLISGNAIADGTGIFIFYTQWTFTLVTIY 146
Query: 119 FGLGSLLSMHGCYQHHKKARGDKVDNVDGDAEKGMHNAPAIPPGSNASDQEKNLKAPEKD 178
FGLGSL+S+ Y+ G+ ++ D E+G + P SN E
Sbjct: 147 FGLGSLVSI---YRFRSPDNGENRVSIV-DEEQGTYRPPGNAENSNVFKSSSG-HDRENM 201
Query: 179 PLRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVPFLTIKDYNLNFFIVSMHTFNAVF 238
RQ+A T GYI QI+FQ AGAV+LTD VFWFII PFLT KD+NL+FFIV MH+ NA+F
Sbjct: 202 STRQVATTLGYIHQILFQTCAGAVLLTDGVFWFIIYPFLTAKDFNLDFFIVIMHSVNAIF 261
Query: 239 LISDTALNCLRF 250
L+ +T LN L F
Sbjct: 262 LLGETFLNSLGF 273
>AT2G47115.1 | Symbols: | unknown protein; FUNCTIONS IN:
molecular_function unknown; INVOLVED IN:
biological_process unknown; LOCATED IN: endomembrane
system; BEST Arabidopsis thaliana protein match is:
unknown protein (TAIR:AT1G10660.1); Has 35333 Blast hits
to 34131 proteins in 2444 species: Archae - 798;
Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
BLink). | chr2:19345376-19346864 REVERSE LENGTH=300
Length = 300
Score = 181 bits (458), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 107/293 (36%), Positives = 146/293 (49%), Gaps = 36/293 (12%)
Query: 36 KHERLRRIARNGSREIQEETSATLYEDETWRPCLKGIHPAWLMAXXXXXXXXXXXXXXXX 95
H+ L + R+ +R SA L W C +HP WL+
Sbjct: 41 SHDSLLPLYRSNNRG-----SARL-----WASCWTRLHPGWLLFTRSTSFLSMAALLAWD 90
Query: 96 XXXDGGSIFYYYTQWTLTSVTIYFGLGSLLSMHGCYQHHKKARGDKVDNVDGDAEKGMHN 155
SIF YYT+WT V IYF +G + S++GC H K+
Sbjct: 91 VIKWDASIFVYYTEWTFMLVIIYFAMGIVASVYGCLIHLKEL------------------ 132
Query: 156 APAIPPGSNASDQEKNLKAPEKDPLRQLAGTWGYIFQIIFQINAGAVMLTDCVFWFIIVP 215
E + D R+ G Q IFQ +AGAV+LTD VFW +IVP
Sbjct: 133 --------TLETDEDVVVEKVGDEFRRRLEVCGCFMQTIFQTSAGAVVLTDIVFWLVIVP 184
Query: 216 FLTIKDYNLNFFIVSMHTFNAVFLISDTALNCLRFPWFRIGYFCLWTVTYVIFQWIVHAS 275
FL+ + LN + MHT NA FL+ +T LN L FPWFR+GYF LW+ YVIFQWI+HA
Sbjct: 185 FLSTTRFGLNTLTICMHTANAGFLLLETLLNSLPFPWFRMGYFVLWSCLYVIFQWIIHAC 244
Query: 276 INLWWPYPFLDLSSSYSPLWYFAVALLHIPCYSIFPLVMKLKHDVFSTWYPDS 328
WWPYPFL+L ++P+WY +A++HIPCY + ++K K+ F +P++
Sbjct: 245 GFTWWPYPFLELDKPWAPIWYLCMAIVHIPCYGAYAAIVKAKNSCFPYLFPNA 297