Miyakogusa Predicted Gene

Lj1g3v4289080.4

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4289080.4 Non Characterized Hit- tr|A2Q3E1|A2Q3E1_MEDTR
Putative uncharacterized protein OS=Medicago
truncatul,82.93,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT
NAMED,NULL,CUFF.32219.4
         (413 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT2G42320.2 | Symbols:  | nucleolar protein gar2-related | chr2:...   457   e-129
AT2G42320.1 | Symbols:  | nucleolar protein gar2-related | chr2:...   457   e-129
AT3G57780.1 | Symbols:  | BEST Arabidopsis thaliana protein matc...   411   e-115
AT3G01810.3 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   315   3e-86
AT3G01810.1 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   315   3e-86
AT5G06930.1 | Symbols:  | LOCATED IN: chloroplast; EXPRESSED IN:...   309   2e-84
AT3G01810.2 | Symbols:  | FUNCTIONS IN: molecular_function unkno...   307   8e-84
AT5G43230.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   286   2e-77

>AT2G42320.2 | Symbols:  | nucleolar protein gar2-related |
           chr2:17628102-17630657 FORWARD LENGTH=669
          Length = 669

 Score =  457 bits (1175), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 228/414 (55%), Positives = 284/414 (68%), Gaps = 37/414 (8%)

Query: 2   RLAESNGAGN---GKSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSR 58
           +++E N +GN   GK  +L+WK         GF Q +EDWQET TFT+ALE++E W+FSR
Sbjct: 288 QISEPNESGNSDSGKKTNLRWKN--------GFQQLLEDWQETETFTTALEKIEFWVFSR 339

Query: 59  LVESVWWQALTPYMQSPAGDFSSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLC 118
           +VESVWWQ  TP+MQSP  D S++KS G+++GP+LGD NQG FSI+LW+ AF DA QR+C
Sbjct: 340 IVESVWWQVFTPHMQSPEDDSSASKSNGKLMGPSLGDQNQGTFSISLWKNAFRDALQRIC 399

Query: 119 PLRAGGHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPI 178
           P+R  GHECGCLPVLARMVM++CI R DVAMFNAILRES  +IPTDP+SDPILDSKVLPI
Sbjct: 400 PMRGAGHECGCLPVLARMVMDKCIGRFDVAMFNAILRESEHQIPTDPVSDPILDSKVLPI 459

Query: 179 PAGDLSFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVX 238
           PAGDLSFGSGAQLKN++GNWSR LT+MFGM+ +D   + + + E+D       E K FV 
Sbjct: 460 PAGDLSFGSGAQLKNAIGNWSRCLTEMFGMNSDDSSAKEKRNSEDDH-----VESKAFVL 514

Query: 239 XXXXXXXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALN- 297
                     PKDML++  +REE+CPSI+L LI R+LCNFTPDEFCPD VPG VLE LN 
Sbjct: 515 LNELSDLLMLPKDMLMEISIREEICPSISLPLIKRILCNFTPDEFCPDQVPGAVLEELNA 574

Query: 298 AETIAERRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYT 357
           AE+I +R+LS     SF               ++AEKVAEA  K  L+RNVS +QR+GYT
Sbjct: 575 AESIGDRKLSEA---SFPYAASSVSYMPPSTMDIAEKVAEASAK--LSRNVSMIQRKGYT 629

Query: 358 XXXXXXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
                     PLTSI+DK                   + + T+NARY+LLR+VW
Sbjct: 630 SDEELEELDSPLTSIVDKAS---------------DFTGSATSNARYKLLRQVW 668


>AT2G42320.1 | Symbols:  | nucleolar protein gar2-related |
           chr2:17628102-17630657 FORWARD LENGTH=669
          Length = 669

 Score =  457 bits (1175), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 228/414 (55%), Positives = 284/414 (68%), Gaps = 37/414 (8%)

Query: 2   RLAESNGAGN---GKSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSR 58
           +++E N +GN   GK  +L+WK         GF Q +EDWQET TFT+ALE++E W+FSR
Sbjct: 288 QISEPNESGNSDSGKKTNLRWKN--------GFQQLLEDWQETETFTTALEKIEFWVFSR 339

Query: 59  LVESVWWQALTPYMQSPAGDFSSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLC 118
           +VESVWWQ  TP+MQSP  D S++KS G+++GP+LGD NQG FSI+LW+ AF DA QR+C
Sbjct: 340 IVESVWWQVFTPHMQSPEDDSSASKSNGKLMGPSLGDQNQGTFSISLWKNAFRDALQRIC 399

Query: 119 PLRAGGHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPI 178
           P+R  GHECGCLPVLARMVM++CI R DVAMFNAILRES  +IPTDP+SDPILDSKVLPI
Sbjct: 400 PMRGAGHECGCLPVLARMVMDKCIGRFDVAMFNAILRESEHQIPTDPVSDPILDSKVLPI 459

Query: 179 PAGDLSFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVX 238
           PAGDLSFGSGAQLKN++GNWSR LT+MFGM+ +D   + + + E+D       E K FV 
Sbjct: 460 PAGDLSFGSGAQLKNAIGNWSRCLTEMFGMNSDDSSAKEKRNSEDDH-----VESKAFVL 514

Query: 239 XXXXXXXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALN- 297
                     PKDML++  +REE+CPSI+L LI R+LCNFTPDEFCPD VPG VLE LN 
Sbjct: 515 LNELSDLLMLPKDMLMEISIREEICPSISLPLIKRILCNFTPDEFCPDQVPGAVLEELNA 574

Query: 298 AETIAERRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYT 357
           AE+I +R+LS     SF               ++AEKVAEA  K  L+RNVS +QR+GYT
Sbjct: 575 AESIGDRKLSEA---SFPYAASSVSYMPPSTMDIAEKVAEASAK--LSRNVSMIQRKGYT 629

Query: 358 XXXXXXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
                     PLTSI+DK                   + + T+NARY+LLR+VW
Sbjct: 630 SDEELEELDSPLTSIVDKAS---------------DFTGSATSNARYKLLRQVW 668


>AT3G57780.1 | Symbols:  | BEST Arabidopsis thaliana protein match
           is: nucleolar protein gar2-related (TAIR:AT2G42320.2);
           Has 3163 Blast hits to 2460 proteins in 357 species:
           Archae - 16; Bacteria - 291; Metazoa - 841; Fungi - 335;
           Plants - 248; Viruses - 72; Other Eukaryotes - 1360
           (source: NCBI BLink). | chr3:21399766-21402329 REVERSE
           LENGTH=671
          Length = 671

 Score =  411 bits (1056), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 216/409 (52%), Positives = 269/409 (65%), Gaps = 26/409 (6%)

Query: 6   SNGAGNGKSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWW 65
           SNG+ +     ++ K     K  +GF Q  EDWQE+ TFT+ALE+VE WIFSR+VESVWW
Sbjct: 288 SNGSEHNVLGKVRRKKNQWTKQSNGFKQVFEDWQESQTFTAALEKVEFWIFSRIVESVWW 347

Query: 66  QALTPYMQSPAGDFSSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGH 125
           Q  TP+MQSP       ++ G+     LGD  QG+FSI+LW+ AF+    RLCP+R   H
Sbjct: 348 QVFTPHMQSP-------ENGGKTKEHILGDIEQGSFSISLWKNAFKVTLSRLCPMRGARH 400

Query: 126 ECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDLSF 185
           ECGCLP+LA+MVME+CIAR+DVAMFNAILRES  +IPTDP+SDPILDSKVLPI +G+LSF
Sbjct: 401 ECGCLPILAKMVMEKCIARIDVAMFNAILRESEHQIPTDPVSDPILDSKVLPILSGNLSF 460

Query: 186 GSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXXXX 245
           GSGAQLKN++GNWSR L +MF ++  D V+      END  +      K F         
Sbjct: 461 GSGAQLKNAIGNWSRCLAEMFSINTRDSVE------ENDPIES----EKSFSLLNELSDL 510

Query: 246 XXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAERR 305
              PKDML+DR  REEVCPSI+L+LI R+LCNFTPDEFCPD VPG VLE LN E+I+E++
Sbjct: 511 LMLPKDMLMDRSTREEVCPSISLALIKRILCNFTPDEFCPDDVPGAVLEELNNESISEQK 570

Query: 306 LSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYTXXXXXXXX 365
           LS     SF               N    VAE G  S ++RNVS +QR+GYT        
Sbjct: 571 LSG---VSFPYAASPVSYTPPSSTN----VAEVGDISRMSRNVSMIQRKGYTSDDELEEL 623

Query: 366 XXPLTSIIDKLPLSPTVSANGQD-NQKEHKSYTTTTNARYQLLREVWSM 413
             PLTSII+ + LSP +SA G++  Q+  K     T +RY+LLREVWSM
Sbjct: 624 DSPLTSIIENVSLSP-ISAQGRNVKQEAEKIGPGVTISRYELLREVWSM 671


>AT3G01810.3 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; EXPRESSED IN:
           21 plant structures; EXPRESSED DURING: 13 growth stages;
           BEST Arabidopsis thaliana protein match is: nucleolar
           protein gar2-related (TAIR:AT2G42320.2). |
           chr3:289218-292557 FORWARD LENGTH=921
          Length = 921

 Score =  315 bits (807), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 179/410 (43%), Positives = 235/410 (57%), Gaps = 38/410 (9%)

Query: 13  KSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYM 72
           K +SLKWK  P  K      +    W +  TF +ALE+VE+WIFSR+VES+WWQ LTP M
Sbjct: 535 KRSSLKWKDSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRM 591

Query: 73  QSPAGDF---------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAG 123
           QS A            +S K+FGR   P+  +   G+FS+ LW+ AF +A +RLCPLR  
Sbjct: 592 QSSAASTREFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGS 649

Query: 124 GHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDL 183
           GHECGCLP+ AR++MEQC+ARLDVAMFNAILR+S    PTDP+SDPI D +VLPIP+   
Sbjct: 650 GHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVSDPIADLRVLPIPSRTS 709

Query: 184 SFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXX 243
           SFGSGAQLKNS+GNWSRWLTD+FG+D ED     ++S         +   K F       
Sbjct: 710 SFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENS-------YVEKSFKTFNLLKALS 762

Query: 244 XXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAE 303
                PKDML++  VR+EVCP     LI RVL NF PDEFCPDPVP  VL++L +E  AE
Sbjct: 763 DLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDPVPDAVLKSLESEEEAE 822

Query: 304 RRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKS--HLARNVSAVQRRGYTXXXX 361
           + +    + S+               +++  +   G      L+R  S++ R+ YT    
Sbjct: 823 KSI----ITSYPCTAPSPVYCPPSRTSISTIIGNFGQPQAPQLSRIRSSITRKAYTSDDE 878

Query: 362 XXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
                 PL  ++ +   S  ++ NG  ++            RYQLLRE W
Sbjct: 879 LDELSSPLAVVVLQQAGSKKIN-NGDADE----------TIRYQLLRECW 917


>AT3G01810.1 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           plasma membrane; EXPRESSED IN: 21 plant structures;
           EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
           thaliana protein match is: nucleolar protein
           gar2-related (TAIR:AT2G42320.2); Has 1327 Blast hits to
           470 proteins in 132 species: Archae - 2; Bacteria - 131;
           Metazoa - 139; Fungi - 114; Plants - 114; Viruses - 0;
           Other Eukaryotes - 827 (source: NCBI BLink). |
           chr3:289218-292557 FORWARD LENGTH=921
          Length = 921

 Score =  315 bits (807), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 179/410 (43%), Positives = 235/410 (57%), Gaps = 38/410 (9%)

Query: 13  KSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYM 72
           K +SLKWK  P  K      +    W +  TF +ALE+VE+WIFSR+VES+WWQ LTP M
Sbjct: 535 KRSSLKWKDSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRM 591

Query: 73  QSPAGDF---------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAG 123
           QS A            +S K+FGR   P+  +   G+FS+ LW+ AF +A +RLCPLR  
Sbjct: 592 QSSAASTREFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGS 649

Query: 124 GHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDL 183
           GHECGCLP+ AR++MEQC+ARLDVAMFNAILR+S    PTDP+SDPI D +VLPIP+   
Sbjct: 650 GHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVSDPIADLRVLPIPSRTS 709

Query: 184 SFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXX 243
           SFGSGAQLKNS+GNWSRWLTD+FG+D ED     ++S         +   K F       
Sbjct: 710 SFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENS-------YVEKSFKTFNLLKALS 762

Query: 244 XXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAE 303
                PKDML++  VR+EVCP     LI RVL NF PDEFCPDPVP  VL++L +E  AE
Sbjct: 763 DLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDPVPDAVLKSLESEEEAE 822

Query: 304 RRLSAESVRSFXXXXXXXXXXXXXXXNVAEKVAEAGGKS--HLARNVSAVQRRGYTXXXX 361
           + +    + S+               +++  +   G      L+R  S++ R+ YT    
Sbjct: 823 KSI----ITSYPCTAPSPVYCPPSRTSISTIIGNFGQPQAPQLSRIRSSITRKAYTSDDE 878

Query: 362 XXXXXXPLTSIIDKLPLSPTVSANGQDNQKEHKSYTTTTNARYQLLREVW 411
                 PL  ++ +   S  ++ NG  ++            RYQLLRE W
Sbjct: 879 LDELSSPLAVVVLQQAGSKKIN-NGDADE----------TIRYQLLRECW 917


>AT5G06930.1 | Symbols:  | LOCATED IN: chloroplast; EXPRESSED IN: 15
           plant structures; EXPRESSED DURING: 7 growth stages;
           BEST Arabidopsis thaliana protein match is: nucleolar
           protein gar2-related (TAIR:AT2G42320.2); Has 3369 Blast
           hits to 1526 proteins in 313 species: Archae - 2;
           Bacteria - 910; Metazoa - 754; Fungi - 336; Plants -
           137; Viruses - 11; Other Eukaryotes - 1219 (source: NCBI
           BLink). | chr5:2145139-2147849 FORWARD LENGTH=723
          Length = 723

 Score =  309 bits (792), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 175/382 (45%), Positives = 216/382 (56%), Gaps = 29/382 (7%)

Query: 29  SGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPAGDFSSNKSFGRV 88
           +G     EDW +  T  +AL RVES  F++ VES+W Q +  +M     D +  +  G  
Sbjct: 369 NGLNSLKEDWGDVRTLIAALRRVESCFFTQAVESIWSQVMMVHMIPQGVDSTMGEMIGNF 428

Query: 89  LGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMVMEQCIARLDVA 148
             PA  D  Q +FS+NLW+ AFE+A QRLCP++A   +CGCL VL RMVMEQCI RLDVA
Sbjct: 429 SEPATCDRLQESFSVNLWKEAFEEALQRLCPVQATRRQCGCLHVLTRMVMEQCIVRLDVA 488

Query: 149 MFNAILRESALEIPTDPISDPILDSKVLPIPAGDLSFGSGAQLKNSVGNWSRWLTDMFGM 208
           MFNAILRESA  IPTD  SDPI DS+VLPIPAG LSF SG +LKN+V  WSR LTD+FG+
Sbjct: 489 MFNAILRESAHHIPTDSASDPIADSRVLPIPAGVLSFESGVKLKNTVSYWSRLLTDIFGI 548

Query: 209 DVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXXXXXXXPKDMLIDRHVREEVCPSITL 268
           DVE  +Q             GD   KPF            PK+M +D   R+EVCPSI L
Sbjct: 549 DVEQKMQR------------GDETFKPFHLLNELSDLLMLPKEMFVDSSTRDEVCPSIGL 596

Query: 269 SLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAERR-LSAESVRSFXXXXXXXXXXXXX 327
           SLI R++CNFTPDEFCP PVPGTVLE LNA++I E R LS ++ R F             
Sbjct: 597 SLIKRIVCNFTPDEFCPYPVPGTVLEELNAQSILENRSLSRDTARGFPRQVNPVSYSPPS 656

Query: 328 XXNVAEKVAEAGGKSHLARNVSAVQRRGYTXXXXXXXXXXPLTSIIDKLPLSPTVSANGQ 387
             ++ + VAE   K  L    S   + GY+          P    + K        A  +
Sbjct: 657 CSHLTDIVAEFSVKLKL----SMTHKNGYSSNEKVETPRSPPYYNVIK-------GAVAK 705

Query: 388 DNQKEHKSYTTTTNARYQLLRE 409
           DN        + TN RY+LL E
Sbjct: 706 DNLN-----LSETNERYRLLGE 722


>AT3G01810.2 | Symbols:  | FUNCTIONS IN: molecular_function unknown;
           INVOLVED IN: biological_process unknown; LOCATED IN:
           plasma membrane; EXPRESSED IN: 21 plant structures;
           EXPRESSED DURING: 13 growth stages; BEST Arabidopsis
           thaliana protein match is: nucleolar protein
           gar2-related (TAIR:AT2G42320.2); Has 1232 Blast hits to
           443 proteins in 120 species: Archae - 2; Bacteria - 119;
           Metazoa - 136; Fungi - 117; Plants - 114; Viruses - 0;
           Other Eukaryotes - 744 (source: NCBI BLink). |
           chr3:289218-292375 FORWARD LENGTH=859
          Length = 859

 Score =  307 bits (787), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 159/311 (51%), Positives = 203/311 (65%), Gaps = 23/311 (7%)

Query: 13  KSASLKWKGFPNGKAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYM 72
           K +SLKWK  P  K      +    W +  TF +ALE+VE+WIFSR+VES+WWQ LTP M
Sbjct: 535 KRSSLKWKDSPLSKKD---IKSFGAWDDPVTFITALEKVEAWIFSRVVESIWWQTLTPRM 591

Query: 73  QSPAGDF---------SSNKSFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAG 123
           QS A            +S K+FGR   P+  +   G+FS+ LW+ AF +A +RLCPLR  
Sbjct: 592 QSSAASTREFDKGNGSASKKTFGRT--PSSTNQELGDFSLELWKKAFREAHERLCPLRGS 649

Query: 124 GHECGCLPVLARMVMEQCIARLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDL 183
           GHECGCLP+ AR++MEQC+ARLDVAMFNAILR+S    PTDP+SDPI D +VLPIP+   
Sbjct: 650 GHECGCLPIPARLIMEQCVARLDVAMFNAILRDSDDNFPTDPVSDPIADLRVLPIPSRTS 709

Query: 184 SFGSGAQLKNSVGNWSRWLTDMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXX 243
           SFGSGAQLKNS+GNWSRWLTD+FG+D ED     ++S         +   K F       
Sbjct: 710 SFGSGAQLKNSIGNWSRWLTDLFGIDDEDDDSSDENS-------YVEKSFKTFNLLKALS 762

Query: 244 XXXXXPKDMLIDRHVREEVCPSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAE 303
                PKDML++  VR+EVCP     LI RVL NF PDEFCPDPVP  VL++L +E +  
Sbjct: 763 DLMMLPKDMLLNSSVRKEVCPMFGAPLIKRVLNNFVPDEFCPDPVPDAVLKSLESEKL-- 820

Query: 304 RRLSAESVRSF 314
           RR+S++++ + 
Sbjct: 821 RRVSSQAIHAL 831


>AT5G43230.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G01810.3); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr5:17349125-17352747 FORWARD LENGTH=848
          Length = 848

 Score =  286 bits (731), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 167/390 (42%), Positives = 217/390 (55%), Gaps = 27/390 (6%)

Query: 26  KAGSGFTQFVEDWQETGTFTSALERVESWIFSRLVESVWWQALTPYMQSPA--GDFSSNK 83
           KAGS      ++W++   F +ALE+ ESWIFSR+V+SVWWQ++TP+MQSPA  G  +   
Sbjct: 476 KAGS------DEWEDPRAFLAALEKFESWIFSRVVKSVWWQSMTPHMQSPAVKGSIARKV 529

Query: 84  SFGRVLGPALGDHNQGNFSINLWRYAFEDAFQRLCPLRAGGHECGCLPVLARMVMEQCIA 143
           S  R     LG  NQG ++I LW+ AF  A +RLCPLR    ECGCLP+LA++VMEQ I+
Sbjct: 530 SGKR----RLGHRNQGLYAIELWKNAFRAACERLCPLRGSRQECGCLPMLAKLVMEQLIS 585

Query: 144 RLDVAMFNAILRESALEIPTDPISDPILDSKVLPIPAGDLSFGSGAQLKNSVGNWSRWLT 203
           RLDVAMFNAILRESA E+PTDP+SDPI D  VLPIPAG  SFG+GAQLKN++G WSRWL 
Sbjct: 586 RLDVAMFNAILRESAGEMPTDPVSDPISDINVLPIPAGKASFGAGAQLKNAIGTWSRWLE 645

Query: 204 DMFGMDVEDCVQEYQDSGENDERQGGDGEPKPFVXXXXXXXXXXXPKDMLIDRHVREEVC 263
           D F    ED     +D   ND+ +      + F            P  ML D+  R+EVC
Sbjct: 646 DQFEQK-EDKSGRNKDEDNNDKEKPECEHFRLFHLLNSLGDLMMLPFKMLADKSTRKEVC 704

Query: 264 PSITLSLIIRVLCNFTPDEFCPDPVPGTVLEALNAETIAERRLSAESVRSFXXXXXXXXX 323
           P++   +I RVL NF PDEF P  +P  + + LN+E + E      +V  F         
Sbjct: 705 PTLGPPIIKRVLRNFVPDEFNPHRIPRRLFDVLNSEGLTEEDNGCITV--FPSAASPTVY 762

Query: 324 XXXXXXNVAEKVAEAGGKSHLARNVSAVQRRGYTXXXXXXXXXXPLTSIIDKLPLSPTVS 383
                 ++   + E    S ++   S+V ++ YT           + SI          S
Sbjct: 763 LMPSTDSIKRFIGELNNPS-ISETGSSVFKKQYTSDDELDDLDTSINSIF---------S 812

Query: 384 ANGQDNQKE--HKSYTTTTNARYQLLREVW 411
           A G  N  E   K Y      RYQLLRE+W
Sbjct: 813 APGTTNSSEWMPKGYGRRKTVRYQLLREIW 842