Miyakogusa Predicted Gene
- Lj1g3v4289080.4
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4289080.4 Non Chatacterized Hit- tr|A2Q3E1|A2Q3E1_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,82.93,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.32219.4
(413 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G42320.2 | Symbols: | nucleolar protein gar2-related | chr2:... 457 e-129
AT2G42320.1 | Symbols: | nucleolar protein gar2-related | chr2:... 457 e-129
AT3G57780.1 | Symbols: | BEST Arabidopsis thaliana protein matc... 411 e-115
AT3G01810.3 | Symbols: | FUNCTIONS IN: molecular_function unkno... 315 3e-86
AT3G01810.1 | Symbols: | FUNCTIONS IN: molecular_function unkno... 315 3e-86
AT5G06930.1 | Symbols: | LOCATED IN: chloroplast; EXPRESSED IN:... 309 2e-84
AT3G01810.2 | Symbols: | FUNCTIONS IN: molecular_function unkno... 307 8e-84
AT5G43230.1 | Symbols: | unknown protein; BEST Arabidopsis thal... 286 2e-77
>AT2G42320.2 | Symbols: | nucleolar protein gar2-related |
chr2:17628102-17630657 FORWARD LENGTH=669
Length = 669
Score = 457 bits (1175), Expect = e-129, Method: Compositional matrix adjust.
Identities = 228/414 (55%), Positives = 284/414 (68%), Gaps = 37/414 (8%)
Query: 2 RLAESNGAGN---GKSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSR 58
+++E N +GN GK +L+WK GF Q +EDWQET TFT+ALE++E W+FSR
Sbjct: 288 QISEPNESGNSDSGKKTNLRWKN--------GFQQLLEDWQETETFTTALEKIEFWVFSR 339
Query: 59 LVESVWWQALTPYMQSPAGDFSSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLC 118
+VESVWWQ TP+MQSP D S++KS G+++GP+LGD NQG FSI+LW+ AF DA QR+C
Sbjct: 340 IVESVWWQVFTPHMQSPEDDSSASKSNGKLMGPSLGDQNQGTFSISLWKNAFRDALQRIC 399
Query: 119 PLRAGGHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPI 178
P+R GHECGCLPVLARMVM++CI R DVAMFNAILRES +IPTDP+SDPILDSKVLPI
Sbjct: 400 PMRGAGHECGCLPVLARMVMDKCIGRFDVAMFNAILRESEHQIPTDPVSDPILDSKVLPI 459
Query: 179 PAGDLSFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVX 238
PAGDLSFGSGAQLKN++GNWSR LT+MFGM+ +D + + + E+D E K FV
Sbjct: 460 PAGDLSFGSGAQLKNAIGNWSRCLTEMFGMNSDDSSAKEKRNSEDDH-----VESKAFVL 514
Query: 239 XXXXXXXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALN- 297
PKDML++ +REE+CPSI+L LI R+LCNFTPDEFCPD VPG VLE LN
Sbjct: 515 LNELSDLLMLPKDMLMEISIREEICPSISLPLIKRILCNFTPDEFCPDQVPGAVLEELNA 574
Query: 298 AETIAERRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYT 357
AE+I +R+LS SF ++AEKVAEA K L+RNVS +QR+GYT
Sbjct: 575 AESIGDRKLSEA---SFPYAASSVSYMPPSTMDIAEKVAEASAK--LSRNVSMIQRKGYT 629
Query: 358 XXXXXXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
PLTSI+DK + + T+NARY+LLR+VW
Sbjct: 630 SDEELEELDSPLTSIVDKAS---------------DFTGSATSNARYKLLRQVW 668
>AT2G42320.1 | Symbols: | nucleolar protein gar2-related |
chr2:17628102-17630657 FORWARD LENGTH=669
Length = 669
Score = 457 bits (1175), Expect = e-129, Method: Compositional matrix adjust.
Identities = 228/414 (55%), Positives = 284/414 (68%), Gaps = 37/414 (8%)
Query: 2 RLAESNGAGN---GKSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSR 58
+++E N +GN GK +L+WK GF Q +EDWQET TFT+ALE++E W+FSR
Sbjct: 288 QISEPNESGNSDSGKKTNLRWKN--------GFQQLLEDWQETETFTTALEKIEFWVFSR 339
Query: 59 LVESVWWQALTPYMQSPAGDFSSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLC 118
+VESVWWQ TP+MQSP D S++KS G+++GP+LGD NQG FSI+LW+ AF DA QR+C
Sbjct: 340 IVESVWWQVFTPHMQSPEDDSSASKSNGKLMGPSLGDQNQGTFSISLWKNAFRDALQRIC 399
Query: 119 PLRAGGHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPI 178
P+R GHECGCLPVLARMVM++CI R DVAMFNAILRES +IPTDP+SDPILDSKVLPI
Sbjct: 400 PMRGAGHECGCLPVLARMVMDKCIGRFDVAMFNAILRESEHQIPTDPVSDPILDSKVLPI 459
Query: 179 PAGDLSFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVX 238
PAGDLSFGSGAQLKN++GNWSR LT+MFGM+ +D + + + E+D E K FV
Sbjct: 460 PAGDLSFGSGAQLKNAIGNWSRCLTEMFGMNSDDSSAKEKRNSEDDH-----VESKAFVL 514
Query: 239 XXXXXXXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALN- 297
PKDML++ +REE+CPSI+L LI R+LCNFTPDEFCPD VPG VLE LN
Sbjct: 515 LNELSDLLMLPKDMLMEISIREEICPSISLPLIKRILCNFTPDEFCPDQVPGAVLEELNA 574
Query: 298 AETIAERRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYT 357
AE+I +R+LS SF ++AEKVAEA K L+RNVS +QR+GYT
Sbjct: 575 AESIGDRKLSEA---SFPYAASSVSYMPPSTMDIAEKVAEASAK--LSRNVSMIQRKGYT 629
Query: 358 XXXXXXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
PLTSI+DK + + T+NARY+LLR+VW
Sbjct: 630 SDEELEELDSPLTSIVDKAS---------------DFTGSATSNARYKLLRQVW 668
>AT3G57780.1 | Symbols: | BEST Arabidopsis thaliana protein match
is: nucleolar protein gar2-related (TAIR:AT2G42320.2);
Has 3163 Blast hits to 2460 proteins in 357 species:
Archae - 16; Bacteria - 291; Metazoa - 841; Fungi - 335;
Plants - 248; Viruses - 72; Other Eukaryotes - 1360
(source: NCBI BLink). | chr3:21399766-21402329 REVERSE
LENGTH=671
Length = 671
Score = 411 bits (1056), Expect = e-115, Method: Compositional matrix adjust.
Identities = 216/409 (52%), Positives = 269/409 (65%), Gaps = 26/409 (6%)
Query: 6 SNGAGNGKSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWW 65
SNG+ + ++ K K +GF Q EDWQE+ TFT+ALE+VE WIFSR+VESVWW
Sbjct: 288 SNGSEHNVLGKVRRKKNQWTKQSNGFKQVFEDWQESQTFTAALEKVEFWIFSRIVESVWW 347
Query: 66 QALTPYMQSPAGDFSSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGH 125
Q TP+MQSP ++ G+ LGD QG+FSI+LW+ AF+ RLCP+R H
Sbjct: 348 QVFTPHMQSP-------ENGGKTKEHILGDIEQGSFSISLWKNAFKVTLSRLCPMRGARH 400
Query: 126 ECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDLSF 185
ECGCLP+LA+MVME+CIAR+DVAMFNAILRES +IPTDP+SDPILDSKVLPI +G+LSF
Sbjct: 401 ECGCLPILAKMVMEKCIARIDVAMFNAILRESEHQIPTDPVSDPILDSKVLPILSGNLSF 460
Query: 186 GSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXXXX 245
GSGAQLKN++GNWSR L +MF ++ D V+ END + K F
Sbjct: 461 GSGAQLKNAIGNWSRCLAEMFSINTRDSVE------ENDPIES----EKSFSLLNELSDL 510
Query: 246 XXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAERR 305
PKDML+DR REEVCPSI+L+LI R+LCNFTPDEFCPD VPG VLE LN E+I+E++
Sbjct: 511 LMLPKDMLMDRSTREEVCPSISLALIKRILCNFTPDEFCPDDVPGAVLEELNNESISEQK 570
Query: 306 LSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYTXXXXXXXX 365
LS SF N VAE G S ++RNVS +QR+GYT
Sbjct: 571 LSG---VSFPYAASPVSYTPPSSTN----VAEVGDISRMSRNVSMIQRKGYTSDDELEEL 623
Query: 366 XXPLTSIIDKLPLSPTVSANGQD-NQKEHKSYTTTTNARYQLLREVWSM 413
PLTSII+ + LSP +SA G++ Q+ K T +RY+LLREVWSM
Sbjct: 624 DSPLTSIIENVSLSP-ISAQGRNVKQEAEKIGPGVTISRYELLREVWSM 671
>AT3G01810.3 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; EXPRESSED IN:
21 plant structures; EXPRESSED DURING: 13 growth stages;
BEST Arabidopsis thaliana protein match is: nucleolar
protein gar2-related (TAIR:AT2G42320.2). |
chr3:289218-292557 FORWARD LENGTH=921
Length = 921
Score = 315 bits (807), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 179/410 (43%), Positives = 235/410 (57%), Gaps = 38/410 (9%)
Query: 13 KSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYM 72
K +SLKWK P K + W + TF +ALE+VE+WIFSR+VES+WWQ LTP M
Sbjct: 535 KRSSLKWKDSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRM 591
Query: 73 QSPAGDF---------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAG 123
QS A +S K+FGR P+ + G+FS+ LW+ AF +A +RLCPLR
Sbjct: 592 QSSAASTREFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGS 649
Query: 124 GHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDL 183
GHECGCLP+ AR++MEQC+ARLDVAMFNAILR+S PTDP+SDPI D +VLPIP+
Sbjct: 650 GHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVSDPIADLRVLPIPSRTS 709
Query: 184 SFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXX 243
SFGSGAQLKNS+GNWSRWLTD+FG+D ED ++S + K F
Sbjct: 710 SFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENS-------YVEKSFKTFNLLKALS 762
Query: 244 XXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAE 303
PKDML++ VR+EVCP LI RVL NF PDEFCPDPVP VL++L +E AE
Sbjct: 763 DLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDPVPDAVLKSLESEEEAE 822
Query: 304 RRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKS--HLARNVSAVQRRGYTXXXX 361
+ + + S+ +++ + G L+R S++ R+ YT
Sbjct: 823 KSI----ITSYPCTAPSPVYCPPSRTSISTIIGNFGQPQAPQLSRIRSSITRKAYTSDDE 878
Query: 362 XXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
PL ++ + S ++ NG ++ RYQLLRE W
Sbjct: 879 LDELSSPLAVVVLQQAGSKKIN-NGDADE----------TIRYQLLRECW 917
>AT3G01810.1 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
plasma membrane; EXPRESSED IN: 21 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: nucleolar protein
gar2-related (TAIR:AT2G42320.2); Has 1327 Blast hits to
470 proteins in 132 species: Archae - 2; Bacteria - 131;
Metazoa - 139; Fungi - 114; Plants - 114; Viruses - 0;
Other Eukaryotes - 827 (source: NCBI BLink). |
chr3:289218-292557 FORWARD LENGTH=921
Length = 921
Score = 315 bits (807), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 179/410 (43%), Positives = 235/410 (57%), Gaps = 38/410 (9%)
Query: 13 KSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYM 72
K +SLKWK P K + W + TF +ALE+VE+WIFSR+VES+WWQ LTP M
Sbjct: 535 KRSSLKWKDSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRM 591
Query: 73 QSPAGDF---------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAG 123
QS A +S K+FGR P+ + G+FS+ LW+ AF +A +RLCPLR
Sbjct: 592 QSSAASTREFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGS 649
Query: 124 GHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDL 183
GHECGCLP+ AR++MEQC+ARLDVAMFNAILR+S PTDP+SDPI D +VLPIP+
Sbjct: 650 GHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVSDPIADLRVLPIPSRTS 709
Query: 184 SFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXX 243
SFGSGAQLKNS+GNWSRWLTD+FG+D ED ++S + K F
Sbjct: 710 SFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENS-------YVEKSFKTFNLLKALS 762
Query: 244 XXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAE 303
PKDML++ VR+EVCP LI RVL NF PDEFCPDPVP VL++L +E AE
Sbjct: 763 DLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDPVPDAVLKSLESEEEAE 822
Query: 304 RRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKS--HLARNVSAVQRRGYTXXXX 361
+ + + S+ +++ + G L+R S++ R+ YT
Sbjct: 823 KSI----ITSYPCTAPSPVYCPPSRTSISTIIGNFGQPQAPQLSRIRSSITRKAYTSDDE 878
Query: 362 XXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
PL ++ + S ++ NG ++ RYQLLRE W
Sbjct: 879 LDELSSPLAVVVLQQAGSKKIN-NGDADE----------TIRYQLLRECW 917
>AT5G06930.1 | Symbols: | LOCATED IN: chloroplast; EXPRESSED IN: 15
plant structures; EXPRESSED DURING: 7 growth stages;
BEST Arabidopsis thaliana protein match is: nucleolar
protein gar2-related (TAIR:AT2G42320.2); Has 3369 Blast
hits to 1526 proteins in 313 species: Archae - 2;
Bacteria - 910; Metazoa - 754; Fungi - 336; Plants -
137; Viruses - 11; Other Eukaryotes - 1219 (source: NCBI
BLink). | chr5:2145139-2147849 FORWARD LENGTH=723
Length = 723
Score = 309 bits (792), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 175/382 (45%), Positives = 216/382 (56%), Gaps = 29/382 (7%)
Query: 29 SGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAGDFSSNKSFGRV 88
+G EDW + T +AL RVES F++ VES+W Q + +M D + + G
Sbjct: 369 NGLNSLKEDWGDVRTLIAALRRVESCFFTQAVESIWSQVMMVHMIPQGVDSTMGEMIGNF 428
Query: 89 LGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMVMEQCIARLDVA 148
PA D Q +FS+NLW+ AFE+A QRLCP++A +CGCL VL RMVMEQCI RLDVA
Sbjct: 429 SEPATCDRLQESFSVNLWKEAFEEALQRLCPVQATRRQCGCLHVLTRMVMEQCIVRLDVA 488
Query: 149 MFNAILRESALEIPTDPISDPILDSKVLPIPAGDLSFGSGAQLKNSVGNWSRWLTDMFGM 208
MFNAILRESA IPTD SDPI DS+VLPIPAG LSF SG +LKN+V WSR LTD+FG+
Sbjct: 489 MFNAILRESAHHIPTDSASDPIADSRVLPIPAGVLSFESGVKLKNTVSYWSRLLTDIFGI 548
Query: 209 DVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXXXXXXXPKDMLIDRHVREEVCPSITL 268
DVE +Q GD KPF PK+M +D R+EVCPSI L
Sbjct: 549 DVEQKMQR------------GDETFKPFHLLNELSDLLMLPKEMFVDSSTRDEVCPSIGL 596
Query: 269 SLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAERR-LSAESVRSFXXXXXXXXXXXXX 327
SLI R++CNFTPDEFCP PVPGTVLE LNA++I E R LS ++ R F
Sbjct: 597 SLIKRIVCNFTPDEFCPYPVPGTVLEELNAQSILENRSLSRDTARGFPRQVNPVSYSPPS 656
Query: 328 XXNVAEKVAEAGGKSHLARNVSAVQRRGYTXXXXXXXXXXPLTSIIDKLPLSPTVSANGQ 387
++ + VAE K L S + GY+ P + K A +
Sbjct: 657 CSHLTDIVAEFSVKLKL----SMTHKNGYSSNEKVETPRSPPYYNVIK-------GAVAK 705
Query: 388 DNQKEHKSYTTTTNARYQLLRE 409
DN + TN RY+LL E
Sbjct: 706 DNLN-----LSETNERYRLLGE 722
>AT3G01810.2 | Symbols: | FUNCTIONS IN: molecular_function unknown;
INVOLVED IN: biological_process unknown; LOCATED IN:
plasma membrane; EXPRESSED IN: 21 plant structures;
EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
thaliana protein match is: nucleolar protein
gar2-related (TAIR:AT2G42320.2); Has 1232 Blast hits to
443 proteins in 120 species: Archae - 2; Bacteria - 119;
Metazoa - 136; Fungi - 117; Plants - 114; Viruses - 0;
Other Eukaryotes - 744 (source: NCBI BLink). |
chr3:289218-292375 FORWARD LENGTH=859
Length = 859
Score = 307 bits (787), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 159/311 (51%), Positives = 203/311 (65%), Gaps = 23/311 (7%)
Query: 13 KSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYM 72
K +SLKWK P K + W + TF +ALE+VE+WIFSR+VES+WWQ LTP M
Sbjct: 535 KRSSLKWKDSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRM 591
Query: 73 QSPAGDF---------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAG 123
QS A +S K+FGR P+ + G+FS+ LW+ AF +A +RLCPLR
Sbjct: 592 QSSAASTREFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGS 649
Query: 124 GHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDL 183
GHECGCLP+ AR++MEQC+ARLDVAMFNAILR+S PTDP+SDPI D +VLPIP+
Sbjct: 650 GHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVSDPIADLRVLPIPSRTS 709
Query: 184 SFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXX 243
SFGSGAQLKNS+GNWSRWLTD+FG+D ED ++S + K F
Sbjct: 710 SFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENS-------YVEKSFKTFNLLKALS 762
Query: 244 XXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAE 303
PKDML++ VR+EVCP LI RVL NF PDEFCPDPVP VL++L +E +
Sbjct: 763 DLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDPVPDAVLKSLESEKL-- 820
Query: 304 RRLSAESVRSF 314
RR+S++++ +
Sbjct: 821 RRVSSQAIHAL 831
>AT5G43230.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G01810.3); Has 30201 Blast hits to 17322
proteins in 780 species: Archae - 12; Bacteria - 1396;
Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
0; Other Eukaryotes - 2996 (source: NCBI BLink). |
chr5:17349125-17352747 FORWARD LENGTH=848
Length = 848
Score = 286 bits (731), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 167/390 (42%), Positives = 217/390 (55%), Gaps = 27/390 (6%)
Query: 26 KAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPA--GDFSSNK 83
KAGS ++W++ F +ALE+ ESWIFSR+V+SVWWQ++TP+MQSPA G +
Sbjct: 476 KAGS------DEWEDPRAFLAALEKFESWIFSRVVKSVWWQSMTPHMQSPAVKGSIARKV 529
Query: 84 SFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMVMEQCIA 143
S R LG NQG ++I LW+ AF A +RLCPLR ECGCLP+LA++VMEQ I+
Sbjct: 530 SGKR----RLGHRNQGLYAIELWKNAFRAACERLCPLRGSRQECGCLPMLAKLVMEQLIS 585
Query: 144 RLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDLSFGSGAQLKNSVGNWSRWLT 203
RLDVAMFNAILRESA E+PTDP+SDPI D VLPIPAG SFG+GAQLKN++G WSRWL
Sbjct: 586 RLDVAMFNAILRESAGEMPTDPVSDPISDINVLPIPAGKASFGAGAQLKNAIGTWSRWLE 645
Query: 204 DMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXXXXXXXPKDMLIDRHVREEVC 263
D F ED +D ND+ + + F P ML D+ R+EVC
Sbjct: 646 DQFEQK-EDKSGRNKDEDNNDKEKPECEHFRLFHLLNSLGDLMMLPFKMLADKSTRKEVC 704
Query: 264 PSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAERRLSAESVRSFXXXXXXXXX 323
P++ +I RVL NF PDEF P +P + + LN+E + E +V F
Sbjct: 705 PTLGPPIIKRVLRNFVPDEFNPHRIPRRLFDVLNSEGLTEEDNGCITV--FPSAASPTVY 762
Query: 324 XXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYTXXXXXXXXXXPLTSIIDKLPLSPTVS 383
++ + E S ++ S+V ++ YT + SI S
Sbjct: 763 LMPSTDSIKRFIGELNNPS-ISETGSSVFKKQYTSDDELDDLDTSINSIF---------S 812
Query: 384 ANGQDNQKE--HKSYTTTTNARYQLLREVW 411
A G N E K Y RYQLLRE+W
Sbjct: 813 APGTTNSSEWMPKGYGRRKTVRYQLLREIW 842