Miyakogusa Predicted Gene

Lj1g3v2162770.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v2162770.1 Non Chatacterized Hit- tr|F6HGJ5|F6HGJ5_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,42.73,6e-18,Myb_DNA-bind_3,Myb/SANT-like domain,CUFF.28668.1
         (295 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    78   8e-15
AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    78   8e-15
AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    76   2e-14
AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    76   2e-14
AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    69   3e-12
AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    67   1e-11
AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    65   7e-11
AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    62   7e-10
AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    62   7e-10

>AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
           in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
           - 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
           LENGTH=449
          Length = 449

 Score = 77.8 bits (190), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 124/289 (42%), Gaps = 19/289 (6%)

Query: 5   SSNQPKQERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNF 64
           SS+ P+   ++  W+ S  K+F DL+V+    GNRP+  F+K+ W  I    N+ T L +
Sbjct: 160 SSSNPQ---TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGY 216

Query: 65  NNNQLRKHLDVLRTRFHNLKSAYDQNNGFVIDDSCCIGF--ELWE-DTGAQPRPEIVKVK 121
              QL+ H D  R  +         ++     +S   G   E W       PR    + K
Sbjct: 217 TRPQLKNHWDCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHK 276

Query: 122 DCPIYEQLCTIFSDSAAVPDGKYAQSSHYEVDATSLISCSEAGVSNHQYPSSSKPILTAE 181
           + P  +QL  IF +    P   Y   S        L + SE+       P S   +  AE
Sbjct: 277 EVPHADQLAIIF-NGVIEPGETYTPPSRSRKKL--LHNRSESPQWRDTTPLSKMHVDEAE 333

Query: 182 NVTKNSLDRKKKRPSEMQTTSLDQDSCDAMAEALLEMVGAYRLRTIVSTVGDDK--FSVT 239
              +N         +E Q   +D ++   + +  L      +   +   +   K  +S+ 
Sbjct: 334 TSRQNGCY------AESQEDRIDSENAQPLDDMKLMNDVMLQESPVFVEIESAKPMYSIG 387

Query: 240 NCIRALDEVDGINE--QLYFSALELFEDPSLREIFISLKCDKIRLAWLQ 286
            CI++L+ ++ + +  +LY  AL+LF     REIF+ LK   +R+AWLQ
Sbjct: 388 ECIKSLNAIEEVEQGSELYMFALDLFLKREYREIFLELKKPSLRIAWLQ 436



 Score = 50.4 bits (119), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 35/68 (51%)

Query: 13 RSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNFNNNQLRKH 72
          R +  W     ++F DL V+   LGN+P   F K+ W +I   F +QT   ++  QL+ H
Sbjct: 2  RPKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNH 61

Query: 73 LDVLRTRF 80
           D +  ++
Sbjct: 62 WDTMSRQW 69


>AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1743234-1744751
           REVERSE LENGTH=449
          Length = 449

 Score = 77.8 bits (190), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 75/289 (25%), Positives = 124/289 (42%), Gaps = 19/289 (6%)

Query: 5   SSNQPKQERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNF 64
           SS+ P+   ++  W+ S  K+F DL+V+    GNRP+  F+K+ W  I    N+ T L +
Sbjct: 160 SSSNPQ---TKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGY 216

Query: 65  NNNQLRKHLDVLRTRFHNLKSAYDQNNGFVIDDSCCIGF--ELWE-DTGAQPRPEIVKVK 121
              QL+ H D  R  +         ++     +S   G   E W       PR    + K
Sbjct: 217 TRPQLKNHWDCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHK 276

Query: 122 DCPIYEQLCTIFSDSAAVPDGKYAQSSHYEVDATSLISCSEAGVSNHQYPSSSKPILTAE 181
           + P  +QL  IF +    P   Y   S        L + SE+       P S   +  AE
Sbjct: 277 EVPHADQLAIIF-NGVIEPGETYTPPSRSRKKL--LHNRSESPQWRDTTPLSKMHVDEAE 333

Query: 182 NVTKNSLDRKKKRPSEMQTTSLDQDSCDAMAEALLEMVGAYRLRTIVSTVGDDK--FSVT 239
              +N         +E Q   +D ++   + +  L      +   +   +   K  +S+ 
Sbjct: 334 TSRQNGCY------AESQEDRIDSENAQPLDDMKLMNDVMLQESPVFVEIESAKPMYSIG 387

Query: 240 NCIRALDEVDGINE--QLYFSALELFEDPSLREIFISLKCDKIRLAWLQ 286
            CI++L+ ++ + +  +LY  AL+LF     REIF+ LK   +R+AWLQ
Sbjct: 388 ECIKSLNAIEEVEQGSELYMFALDLFLKREYREIFLELKKPSLRIAWLQ 436



 Score = 50.4 bits (119), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 21/68 (30%), Positives = 35/68 (51%)

Query: 13 RSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNFNNNQLRKH 72
          R +  W     ++F DL V+   LGN+P   F K+ W +I   F +QT   ++  QL+ H
Sbjct: 2  RPKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNH 61

Query: 73 LDVLRTRF 80
           D +  ++
Sbjct: 62 WDTMSRQW 69


>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score = 76.3 bits (186), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 20/155 (12%)

Query: 6   SNQPK-QERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNF 64
           SNQ    +R+RT WT ++++ F DL+++H+  GNR    F+K+ WN +   FN +    +
Sbjct: 2   SNQTTCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQY 61

Query: 65  NNNQLRKHLDVLRTRFHNLKSAYD------QNNGFVIDDS--CCIGFE-LWE-DTGAQPR 114
           +        DVL++R+ NL   Y+       + GFV D +    IG + LW     A P 
Sbjct: 62  DK-------DVLKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPE 114

Query: 115 PEIVKVKDCPIYEQLCTIFSDSAAVPDGKYAQSSH 149
             + K K    +  LC I+     V DG+Y+ SSH
Sbjct: 115 ARVYKTKPVLNFSDLCLIY--GYTVADGRYSMSSH 147



 Score = 65.1 bits (157), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 47/172 (27%), Positives = 80/172 (46%), Gaps = 10/172 (5%)

Query: 2   EIESSNQPKQERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTD 61
           E ++S +   +R+R  WT  +D    DL+V+ +  GNR    F    WN +   FN +  
Sbjct: 311 ETKASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFG 370

Query: 62  LNFNNNQLRKHLDVLRTRFHNLKSAYDQNNGFVID---DSCCIGFELWEDTGAQPRPEI- 117
              N + L+     LR  ++++K   +Q NGF  D   D      ++W +T  Q  PE  
Sbjct: 371 SQHNKDVLKNRYKHLRRLYNDIKFLLEQ-NGFSWDARRDMVIADDDIW-NTYIQAHPEAR 428

Query: 118 -VKVKDCPIYEQLCTIFSDSAAVPDGKYAQ-SSHYEVDATSLISCSEAGVSN 167
             +VK  P Y  LC IF    +  DG+Y + +  ++      +  +E+G ++
Sbjct: 429 SYRVKTIPSYPNLCFIFGKETS--DGRYTRLAQAFDPSPAETVRMNESGSTD 478



 Score = 49.3 bits (116), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 67/147 (45%), Gaps = 10/147 (6%)

Query: 11  QERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNFNNNQLR 70
           +E S+T WT  +D+ F +++V  I  GN+  + F K+ W  +   FN +    +    LR
Sbjct: 165 KESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQYGKRVLR 224

Query: 71  KHLDVLRTRFHNLKSAYDQNNGFVIDDSCCI---GFELWED-TGAQPRPEIVKVKDCPIY 126
              + L   + ++++   + +GF  D++  +      +W+      P     ++K  P Y
Sbjct: 225 HRYNKLLKYYKDMEAIL-KEDGFSWDETRLMISADDAVWDSYIKDHPLARTYRMKSLPSY 283

Query: 127 EQLCTIFSDSAAVP-----DGKYAQSS 148
             L TIF+  A        DG  AQ+S
Sbjct: 284 NDLDTIFACQAEQGTDHRDDGSAAQTS 310


>AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
           in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
           - 50 (source: NCBI BLink). | chr2:10617263-10620034
           FORWARD LENGTH=797
          Length = 797

 Score = 76.3 bits (186), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 20/155 (12%)

Query: 6   SNQPK-QERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNF 64
           SNQ    +R+RT WT ++++ F DL+++H+  GNR    F+K+ WN +   FN +    +
Sbjct: 2   SNQTTCNDRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQY 61

Query: 65  NNNQLRKHLDVLRTRFHNLKSAYD------QNNGFVIDDS--CCIGFE-LWE-DTGAQPR 114
           +        DVL++R+ NL   Y+       + GFV D +    IG + LW     A P 
Sbjct: 62  DK-------DVLKSRYTNLWKQYNDVKCLLDHGGFVWDQTHQTVIGDDSLWSLYLKAHPE 114

Query: 115 PEIVKVKDCPIYEQLCTIFSDSAAVPDGKYAQSSH 149
             + K K    +  LC I+     V DG+Y+ SSH
Sbjct: 115 ARVYKTKPVLNFSDLCLIY--GYTVADGRYSMSSH 147



 Score = 57.4 bits (137), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/194 (24%), Positives = 80/194 (41%), Gaps = 31/194 (15%)

Query: 2   EIESSNQPKQERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTD 61
           E ++S +   +R+R  WT  +D    DL+V+ +  GNR    F    WN +   FN +  
Sbjct: 311 ETKASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFG 370

Query: 62  LNFNNNQLRKHLDVLRTRFHNLKSAYDQNNGF---------VIDD--------SCCIGFE 104
              N + L+     LR  ++++K   +Q NGF         + DD        +C I F 
Sbjct: 371 SQHNKDVLKNRYKHLRRLYNDIKFLLEQ-NGFSWDARRDMVIADDDIWNTYIQACHILFL 429

Query: 105 L----------WEDTGAQPRPEIVKVKDCPIYEQLCTIFSDSAAVPDGKYAQ-SSHYEVD 153
                       +   A P     +VK  P Y  LC IF    +  DG+Y + +  ++  
Sbjct: 430 FKISVICLCLQMKHVQAHPEARSYRVKTIPSYPNLCFIFGKETS--DGRYTRLAQAFDPS 487

Query: 154 ATSLISCSEAGVSN 167
               +  +E+G ++
Sbjct: 488 PAETVRMNESGSTD 501



 Score = 49.3 bits (116), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 69/154 (44%), Gaps = 10/154 (6%)

Query: 4   ESSNQPKQERSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLN 63
           ES     +E S+T WT  +D+ F +++V  I  GN+  + F K+ W  +   FN +    
Sbjct: 158 ESVVLSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQ 217

Query: 64  FNNNQLRKHLDVLRTRFHNLKSAYDQNNGFVIDDSCCI---GFELWED-TGAQPRPEIVK 119
           +    LR   + L   + ++++   + +GF  D++  +      +W+      P     +
Sbjct: 218 YGKRVLRHRYNKLLKYYKDMEAIL-KEDGFSWDETRLMISADDAVWDSYIKDHPLARTYR 276

Query: 120 VKDCPIYEQLCTIFSDSAAVP-----DGKYAQSS 148
           +K  P Y  L TIF+  A        DG  AQ+S
Sbjct: 277 MKSLPSYNDLDTIFACQAEQGTDHRDDGSAAQTS 310


>AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
           in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
           LENGTH=460
          Length = 460

 Score = 69.3 bits (168), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 76/297 (25%), Positives = 130/297 (43%), Gaps = 35/297 (11%)

Query: 13  RSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNFNNNQLRKH 72
           +S+  W+ S  ++F DL+ +    GNRP+  + K+TW  I +  N+ T  +F   QL+ H
Sbjct: 163 QSKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFTRPQLKNH 222

Query: 73  LDVLRTRFHNLKSAYDQNNGFVID--DSCCIGF----ELWEDTGAQ-PRPEIVKVKDCPI 125
            D  R  +      + Q  G  +   D+    F    E W++   +  R    + K  P 
Sbjct: 223 WDCTRKSW----KIWCQVIGAPVMKWDATSRTFGATDEDWKNYLKENHRAAPFRRKQLPH 278

Query: 126 YEQLCTIFSDSAAVPDGKYAQSSHYEVDATSLISCSEAGVSNHQYPSSSKPILTAENVTK 185
            ++L TIF      P   Y +S    V    L   SE+   +   P S+  + T E V+ 
Sbjct: 279 ADKLATIFK-GLIEPGKAYFRSYRRRV----LDHHSESPQLHDPTPLST--LYTNEPVSG 331

Query: 186 NSLDRKKKRPSEMQTTSLDQD---SCDAMAEALLEMV-----GAYRLRT-------IVST 230
           +     +    +       Q    +    AE+ L+ V     G +RL+T       + ++
Sbjct: 332 SEGGADEDDNDDDDEQPTPQHRRFNSVGFAESRLQDVEIVTPGCHRLKTELMKESPVSAS 391

Query: 231 VGDDKFSVTNCIRALDEVDGINE--QLYFSALELFEDPSLREIFISLKCDKIRLAWL 285
           V   ++++  CI  LD ++ + +   LY  AL+LF     REIF+ LK   +R++WL
Sbjct: 392 VRQYEYTIGECIECLDSMEEVEQGSDLYLFALDLFVKKEYREIFLLLKNSSLRMSWL 448


>AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
           - 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
           LENGTH=439
          Length = 439

 Score = 67.0 bits (162), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/290 (24%), Positives = 120/290 (41%), Gaps = 34/290 (11%)

Query: 13  RSRTRWTASLDKIFADLVVKHIQLGNRP---NDVFDKKTWNHIRDDFNKQTDLNFNNNQL 69
           R   +W+ S   I  D   +    G RP   N +F K++W  I +  N+ T L + + QL
Sbjct: 160 RRHYKWSPSSHAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQL 219

Query: 70  RKHLDVLRTRFHNLKSA-------YDQNN---GFVIDDSCCIGFELWED-TGAQPRPEIV 118
             H    RT + +           +D N    G   +D        W+       R  + 
Sbjct: 220 ENHFTRTRTSWKHWCETIASPIMKWDANTRKFGATEED--------WDKYLMINKRARVF 271

Query: 119 KVKDCPIYEQLCTIFSDSAAVPDGKYAQSSHYEVDATSLISCSEAGVSNHQYPSSSKPIL 178
           K +  P  ++L TIF         K  +     +D  S        + +HQ P+ S  ++
Sbjct: 272 KRRHIPHADKLATIFKGRIEPGKTKTRRYRKRVIDHHS----ESPQLHDHQ-PTPSSVVV 326

Query: 179 TAENVTKNSLDRKKKRPSEMQTTSLDQDSCDAMAEALL-EMVGAYRLRTIVSTVGDDKFS 237
                 K S DR +     ++ TSL +   + +AE +  EM+   ++    S    +KF+
Sbjct: 327 NTNEPVKGSDDRAED--GNVEPTSLIRSDSEDVAETVTPEMME--KIPVNASVKKKEKFT 382

Query: 238 VTNCIRALDEVDGINE--QLYFSALELFEDPSLREIFISLKCDKIRLAWL 285
              C+  LD ++ + +   LY  AL+LF+    R +F+ L+   +R+AWL
Sbjct: 383 FEECVECLDAIEEVEKGGDLYMFALDLFKTKDYRYLFLMLQKSSLRMAWL 432


>AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
           in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
           LENGTH=539
          Length = 539

 Score = 64.7 bits (156), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 76/368 (20%), Positives = 139/368 (37%), Gaps = 96/368 (26%)

Query: 13  RSRTRWTASLDKIFADLVVKHIQLGNRP-----NDVFDKKTWNHIRDDFNKQTDLNFNNN 67
           R +  W++S  +IF DL+       NRP     N  + K+TWN + + FN++T L +   
Sbjct: 171 RYKAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLRYTRK 230

Query: 68  QLRKHLDVLRTRFHNLKSA-------YDQNNGFVIDDSCCIGFELWEDTGAQ-PRPEIVK 119
           QL+ H ++ R  +     A       +D N       S     E WE+   +  R E  +
Sbjct: 231 QLKNHWNITRDAWRRWCQAVGSPLLKWDANTKTFGATS-----EDWENYSKENKRAEQFR 285

Query: 120 VKDCPIYEQLCTIF------------------------------------------SDSA 137
           +K  P  ++L  IF                                          S+  
Sbjct: 286 LKHIPHADKLAIIFKGHVEPGKTALRPYRKRVNHHSEAPQHPAPSSALNINESVPGSEGG 345

Query: 138 AVPDGKYAQSSHY----------EVDATSLISCSEAGVSNHQY--------------PSS 173
           A  D       H+          E+D    ++ SE G  +  Y              PSS
Sbjct: 346 ADDDHHIVMDHHFESPHDPASSSEIDLNEPVTGSEGGADDDHYIVLNHLVESPHDRAPSS 405

Query: 174 ----SKPILTAENVTKNSLDRKKKRPSEMQTTSL---DQDSCDAMAEALLEMVGAYRLRT 226
               +KP+   E +  ++ D  +  P       +   +    + +  A  E +    +  
Sbjct: 406 ELDINKPVAGIEGIADDN-DNHEPTPHHWAFNGVGVEESQDVETVTPAPCERINIELVEK 464

Query: 227 IVST--VGDDKFSVTNCIRALDEVDGINE--QLYFSALELFEDPSLREIFISLKCDKIRL 282
           I S   V + ++++  C++ L+ ++ + +  +LY  AL+LF +   RE+F+ L+   +R+
Sbjct: 465 ITSNALVKEYEYTIGECMKCLNAMEEVEKGSELYMLALDLFMNKECREMFLLLETSTLRM 524

Query: 283 AWLQGKCS 290
           +WL  + S
Sbjct: 525 SWLLRRLS 532


>AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 61.6 bits (148), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 116/282 (41%), Gaps = 35/282 (12%)

Query: 13  RSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNFNNNQLRKH 72
           R RT W   +D+ F DL++   + GN+   VF K+ W  + + FN + + NF+ + L+  
Sbjct: 181 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 240

Query: 73  LDVLRTRFHNLKSAYDQNNGFVIDDS---CCIGFELWED-TGAQPRPEIVKVKDCPIYEQ 128
              LR +F+ +KS   +++GF  D+          +W+D   A         +  P Y+ 
Sbjct: 241 YKSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKD 299

Query: 129 LCTIFSDSAAVPDGKYAQSSHYEVDATSLISCSEAGVSNHQYPSSSKPILTAENVTKNSL 188
           LC +  DS    +  +     ++          E      +   ++   ++AE    NSL
Sbjct: 300 LCVLCGDSGIEENECFVAMDWFD---------PETEFQEFKSSGTTDLSISAEEEDSNSL 350

Query: 189 --DRKKKRPSEMQTTSLDQDSCDAMAEALLEMVGAYRLRTIVSTVGDDKFSVTNCIRALD 246
             D K KR              D +A      +   + R   +       S+ + + A+ 
Sbjct: 351 LFDPKNKR--------------DQLANTDTSPINPKKPRVDETQT----MSIEDTVEAIQ 392

Query: 247 EVDGINEQLYFSALELFEDPSLREIFISLKCDKIRLAWLQGK 288
            +  ++++L   A +L ED    + F++L   K+R  WL  K
Sbjct: 393 ALPDMDDELILDACDLLEDKLKAKTFLALDV-KLRKKWLLRK 433



 Score = 60.1 bits (144), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 77/162 (47%), Gaps = 17/162 (10%)

Query: 6   SNQPKQERSRTRWTASLDKIFADLVVKHIQLGNRPND-VFDKKTWNHIRDDFNKQTDLNF 64
           S++   ER RT WT  +D+ F +L+V+ ++ GNR  D +F K+ W  +   F  +    +
Sbjct: 2   SSRNGNERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLY 61

Query: 65  NNNQLRKHLDVLRTRFHNLKS-------AYDQNNGFVIDDSCCIGFELWED-TGAQPRPE 116
             + L+     LR  F ++ +       ++D     V+ D+C     +W++     P   
Sbjct: 62  GKDVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNC-----VWDEYLKIHPDSR 116

Query: 117 IVKVKDCPIYEQLCTIFSDSAAVPDGKYAQSSHYEVDATSLI 158
             ++K  P Y+ LC ++SD  +      A+ S  E ++ +LI
Sbjct: 117 SFRIKSIPCYKDLCLVYSDGMS---EHKAEESISEGESKTLI 155


>AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
           - 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 61.6 bits (148), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 116/282 (41%), Gaps = 35/282 (12%)

Query: 13  RSRTRWTASLDKIFADLVVKHIQLGNRPNDVFDKKTWNHIRDDFNKQTDLNFNNNQLRKH 72
           R RT W   +D+ F DL++   + GN+   VF K+ W  + + FN + + NF+ + L+  
Sbjct: 181 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 240

Query: 73  LDVLRTRFHNLKSAYDQNNGFVIDDS---CCIGFELWED-TGAQPRPEIVKVKDCPIYEQ 128
              LR +F+ +KS   +++GF  D+          +W+D   A         +  P Y+ 
Sbjct: 241 YKSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKD 299

Query: 129 LCTIFSDSAAVPDGKYAQSSHYEVDATSLISCSEAGVSNHQYPSSSKPILTAENVTKNSL 188
           LC +  DS    +  +     ++          E      +   ++   ++AE    NSL
Sbjct: 300 LCVLCGDSGIEENECFVAMDWFD---------PETEFQEFKSSGTTDLSISAEEEDSNSL 350

Query: 189 --DRKKKRPSEMQTTSLDQDSCDAMAEALLEMVGAYRLRTIVSTVGDDKFSVTNCIRALD 246
             D K KR              D +A      +   + R   +       S+ + + A+ 
Sbjct: 351 LFDPKNKR--------------DQLANTDTSPINPKKPRVDETQT----MSIEDTVEAIQ 392

Query: 247 EVDGINEQLYFSALELFEDPSLREIFISLKCDKIRLAWLQGK 288
            +  ++++L   A +L ED    + F++L   K+R  WL  K
Sbjct: 393 ALPDMDDELILDACDLLEDKLKAKTFLALDV-KLRKKWLLRK 433



 Score = 60.1 bits (144), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 77/162 (47%), Gaps = 17/162 (10%)

Query: 6   SNQPKQERSRTRWTASLDKIFADLVVKHIQLGNRPND-VFDKKTWNHIRDDFNKQTDLNF 64
           S++   ER RT WT  +D+ F +L+V+ ++ GNR  D +F K+ W  +   F  +    +
Sbjct: 2   SSRNGNERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLY 61

Query: 65  NNNQLRKHLDVLRTRFHNLKS-------AYDQNNGFVIDDSCCIGFELWED-TGAQPRPE 116
             + L+     LR  F ++ +       ++D     V+ D+C     +W++     P   
Sbjct: 62  GKDVLKNRHKTLRNLFKSVNNLLIEDGFSWDDTRQMVVADNC-----VWDEYLKIHPDSR 116

Query: 117 IVKVKDCPIYEQLCTIFSDSAAVPDGKYAQSSHYEVDATSLI 158
             ++K  P Y+ LC ++SD  +      A+ S  E ++ +LI
Sbjct: 117 SFRIKSIPCYKDLCLVYSDGMS---EHKAEESISEGESKTLI 155