Miyakogusa Predicted Gene

Lj5g3v0175600.2

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj5g3v0175600.2 Non Characterized Hit- tr|F6HGJ5|F6HGJ5_VITVI
Putative uncharacterized protein OS=Vitis vinifera
GN=,48.15,2e-18,seg,NULL; Myb_DNA-bind_3,Myb/SANT-like domain;
coiled-coil,NULL,CUFF.52630.2
         (300 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    83   2e-16
AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    83   2e-16
AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    79   3e-15
AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    79   3e-15
AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    72   4e-13
AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...    72   5e-13
AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    72   5e-13
AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    63   2e-10
AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    57   1e-08

>AT5G05800.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 24 plant
           structures; EXPRESSED DURING: 15 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 881 Blast hits to 512 proteins
           in 30 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 38; Plants - 833; Viruses - 0; Other Eukaryotes
           - 8 (source: NCBI BLink). | chr5:1743234-1744751 REVERSE
           LENGTH=449
          Length = 449

 Score = 83.2 bits (204), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/280 (26%), Positives = 119/280 (42%), Gaps = 15/280 (5%)

Query: 13  RSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRKH 72
           +++  W+ S  K+F DL+V++   GNRP+  F+K+ W  I    N  T L +   QL+ H
Sbjct: 165 QTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNH 224

Query: 73  LDVLRMRYNNLKSAYDHNNGFLLDDSICIGF--EQWE-DIGAQPRNETVRVKDCPIYEQL 129
            D  R  +         ++     +S   G   E+W   I   PR    R K+ P  +QL
Sbjct: 225 WDCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQL 284

Query: 130 CTIFVDSAADGKYAQSSHYEELDKSIGIDASGLYENPSSSKAISGNILTVDKETKNSLDR 189
             IF     +G       Y    +S         E+P        + + VD E + S  R
Sbjct: 285 AIIF-----NGVIEPGETYTPPSRSRKKLLHNRSESPQWRDTTPLSKMHVD-EAETS--R 336

Query: 190 KRKRHHETQTTTLDQGTCDAM--AGALFEMIXXXXXXXXXXXXXDDKFSITNCIRALDEI 247
           +   + E+Q   +D      +     + +++                +SI  CI++L+ I
Sbjct: 337 QNGCYAESQEDRIDSENAQPLDDMKLMNDVMLQESPVFVEIESAKPMYSIGECIKSLNAI 396

Query: 248 QDIDQ--LLYFSALDLFEDPRLRETFISLKSVKIRLTWLQ 285
           ++++Q   LY  ALDLF     RE F+ LK   +R+ WLQ
Sbjct: 397 EEVEQGSELYMFALDLFLKREYREIFLELKKPSLRIAWLQ 436



 Score = 51.6 bits (122), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 35/68 (51%)

Query: 13 RSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRKH 72
          R +  W     ++F DL V+Q  LGN+P   F K+ W +I   F  QT   ++  QL+ H
Sbjct: 2  RPKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNH 61

Query: 73 LDVLRMRY 80
           D +  ++
Sbjct: 62 WDTMSRQW 69


>AT5G05800.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 1807 Blast hits to 1807 proteins
           in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736;
           Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes
           - 339 (source: NCBI BLink). | chr5:1743234-1744751
           REVERSE LENGTH=449
          Length = 449

 Score = 83.2 bits (204), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 73/280 (26%), Positives = 119/280 (42%), Gaps = 15/280 (5%)

Query: 13  RSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRKH 72
           +++  W+ S  K+F DL+V++   GNRP+  F+K+ W  I    N  T L +   QL+ H
Sbjct: 165 QTKGYWSPSTHKLFLDLLVQETLKGNRPDTHFNKEGWKTILGTINENTGLGYTRPQLKNH 224

Query: 73  LDVLRMRYNNLKSAYDHNNGFLLDDSICIGF--EQWE-DIGAQPRNETVRVKDCPIYEQL 129
            D  R  +         ++     +S   G   E+W   I   PR    R K+ P  +QL
Sbjct: 225 WDCTRKAWKIWCQLVGASSMKWDPESRSFGATEEEWRIYIRENPRAGQFRHKEVPHADQL 284

Query: 130 CTIFVDSAADGKYAQSSHYEELDKSIGIDASGLYENPSSSKAISGNILTVDKETKNSLDR 189
             IF     +G       Y    +S         E+P        + + VD E + S  R
Sbjct: 285 AIIF-----NGVIEPGETYTPPSRSRKKLLHNRSESPQWRDTTPLSKMHVD-EAETS--R 336

Query: 190 KRKRHHETQTTTLDQGTCDAM--AGALFEMIXXXXXXXXXXXXXDDKFSITNCIRALDEI 247
           +   + E+Q   +D      +     + +++                +SI  CI++L+ I
Sbjct: 337 QNGCYAESQEDRIDSENAQPLDDMKLMNDVMLQESPVFVEIESAKPMYSIGECIKSLNAI 396

Query: 248 QDIDQ--LLYFSALDLFEDPRLRETFISLKSVKIRLTWLQ 285
           ++++Q   LY  ALDLF     RE F+ LK   +R+ WLQ
Sbjct: 397 EEVEQGSELYMFALDLFLKREYREIFLELKKPSLRIAWLQ 436



 Score = 51.6 bits (122), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 22/68 (32%), Positives = 35/68 (51%)

Query: 13 RSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRKH 72
          R +  W     ++F DL V+Q  LGN+P   F K+ W +I   F  QT   ++  QL+ H
Sbjct: 2  RPKAVWEPEYHRVFVDLCVEQTMLGNKPGTHFSKEGWRNILISFQEQTGAMYDRMQLKNH 61

Query: 73 LDVLRMRY 80
           D +  ++
Sbjct: 62 WDTMSRQW 69


>AT2G24960.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 1453 Blast hits to 509 proteins
           in 26 species: Archae - 0; Bacteria - 0; Metazoa - 1;
           Fungi - 39; Plants - 1363; Viruses - 0; Other Eukaryotes
           - 50 (source: NCBI BLink). | chr2:10617263-10620034
           FORWARD LENGTH=797
          Length = 797

 Score = 79.3 bits (194), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 5/161 (3%)

Query: 12  ERSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRK 71
           +R+RT WT ++++ F DL+++ +  GNR    F+K+ WN +   FN +    ++ + L+ 
Sbjct: 9   DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68

Query: 72  HLDVLRMRYNNLKSAYDHNNGFLLDDS--ICIGFEQ-WE-DIGAQPRNETVRVKDCPIYE 127
               L  +YN++K   DH  GF+ D +    IG +  W   + A P     + K    + 
Sbjct: 69  RYTNLWKQYNDVKCLLDH-GGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFS 127

Query: 128 QLCTIFVDSAADGKYAQSSHYEELDKSIGIDASGLYENPSS 168
            LC I+  + ADG+Y+ SSH  E++  I  ++  L    SS
Sbjct: 128 DLCLIYGYTVADGRYSMSSHDLEIEDEINGESVVLSGKESS 168



 Score = 66.6 bits (161), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 67/169 (39%), Gaps = 26/169 (15%)

Query: 2   ELEPSIQPKQERSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTD 61
           E + S +   +R+R  WT  +D    DL+V+Q+  GNR    F    WN +   FN +  
Sbjct: 311 ETKASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFG 370

Query: 62  LNFNNNQLRKHLDVLRMRYNNLKSAYDHNNG--------------------------FLL 95
              N + L+     LR  YN++K   + N                            FL 
Sbjct: 371 SQHNKDVLKNRYKHLRRLYNDIKFLLEQNGFSWDARRDMVIADDDIWNTYIQACHILFLF 430

Query: 96  DDSICIGFEQWEDIGAQPRNETVRVKDCPIYEQLCTIFVDSAADGKYAQ 144
             S+     Q + + A P   + RVK  P Y  LC IF    +DG+Y +
Sbjct: 431 KISVICLCLQMKHVQAHPEARSYRVKTIPSYPNLCFIFGKETSDGRYTR 479



 Score = 58.5 bits (140), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 24/160 (15%)

Query: 4   EPSIQPKQERSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLN 63
           E  +   +E S+T WT  +D+ F +++V QI  GN+  + F K+ W  +   FN +    
Sbjct: 158 ESVVLSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQ 217

Query: 64  FNNNQLRKHLDVLRMRYNNLKSAYD------HNNGFLLDDS---ICIGFEQWED-IGAQP 113
           +          VLR RYN L   Y         +GF  D++   I      W+  I   P
Sbjct: 218 YGKR-------VLRHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHP 270

Query: 114 RNETVRVKDCPIYEQLCTIFV-------DSAADGKYAQSS 146
              T R+K  P Y  L TIF        D   DG  AQ+S
Sbjct: 271 LARTYRMKSLPSYNDLDTIFACQAEQGTDHRDDGSAAQTS 310


>AT2G24960.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 21 plant
           structures; EXPRESSED DURING: 12 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G02210.2); Has 30201 Blast hits to 17322
           proteins in 780 species: Archae - 12; Bacteria - 1396;
           Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses -
           0; Other Eukaryotes - 2996 (source: NCBI BLink). |
           chr2:10617263-10620034 FORWARD LENGTH=774
          Length = 774

 Score = 79.3 bits (194), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 83/161 (51%), Gaps = 5/161 (3%)

Query: 12  ERSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRK 71
           +R+RT WT ++++ F DL+++ +  GNR    F+K+ WN +   FN +    ++ + L+ 
Sbjct: 9   DRTRTYWTPTMERFFIDLMLEHLHRGNRTGHTFNKQAWNEMLTVFNSKFGSQYDKDVLKS 68

Query: 72  HLDVLRMRYNNLKSAYDHNNGFLLDDS--ICIGFEQ-WE-DIGAQPRNETVRVKDCPIYE 127
               L  +YN++K   DH  GF+ D +    IG +  W   + A P     + K    + 
Sbjct: 69  RYTNLWKQYNDVKCLLDH-GGFVWDQTHQTVIGDDSLWSLYLKAHPEARVYKTKPVLNFS 127

Query: 128 QLCTIFVDSAADGKYAQSSHYEELDKSIGIDASGLYENPSS 168
            LC I+  + ADG+Y+ SSH  E++  I  ++  L    SS
Sbjct: 128 DLCLIYGYTVADGRYSMSSHDLEIEDEINGESVVLSGKESS 168



 Score = 73.2 bits (178), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 47/147 (31%), Positives = 68/147 (46%), Gaps = 5/147 (3%)

Query: 2   ELEPSIQPKQERSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTD 61
           E + S +   +R+R  WT  +D    DL+V+Q+  GNR    F    WN +   FN +  
Sbjct: 311 ETKASQEQNSDRTRIFWTPPMDYHLIDLLVEQVNNGNRVGQTFITSAWNEMVTAFNAKFG 370

Query: 62  LNFNNNQLRKHLDVLRMRYNNLKSAYDHNNGFLLD---DSICIGFEQWED-IGAQPRNET 117
              N + L+     LR  YN++K   +  NGF  D   D +    + W   I A P   +
Sbjct: 371 SQHNKDVLKNRYKHLRRLYNDIKFLLEQ-NGFSWDARRDMVIADDDIWNTYIQAHPEARS 429

Query: 118 VRVKDCPIYEQLCTIFVDSAADGKYAQ 144
            RVK  P Y  LC IF    +DG+Y +
Sbjct: 430 YRVKTIPSYPNLCFIFGKETSDGRYTR 456



 Score = 58.5 bits (140), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 68/160 (42%), Gaps = 24/160 (15%)

Query: 4   EPSIQPKQERSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLN 63
           E  +   +E S+T WT  +D+ F +++V QI  GN+  + F K+ W  +   FN +    
Sbjct: 158 ESVVLSGKESSKTEWTLEMDQYFVEIMVDQIGRGNKTGNAFSKQAWIDMLVLFNARFSGQ 217

Query: 64  FNNNQLRKHLDVLRMRYNNLKSAYD------HNNGFLLDDS---ICIGFEQWED-IGAQP 113
           +          VLR RYN L   Y         +GF  D++   I      W+  I   P
Sbjct: 218 YGKR-------VLRHRYNKLLKYYKDMEAILKEDGFSWDETRLMISADDAVWDSYIKDHP 270

Query: 114 RNETVRVKDCPIYEQLCTIFV-------DSAADGKYAQSS 146
              T R+K  P Y  L TIF        D   DG  AQ+S
Sbjct: 271 LARTYRMKSLPSYNDLDTIFACQAEQGTDHRDDGSAAQTS 310


>AT3G11290.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11310.1); Has 720 Blast hits to 435 proteins
           in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 32; Plants - 682; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3535766-3537295 REVERSE
           LENGTH=460
          Length = 460

 Score = 72.4 bits (176), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 71/294 (24%), Positives = 117/294 (39%), Gaps = 30/294 (10%)

Query: 13  RSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRKH 72
           +S+  W+ S  ++F DL+ ++   GNRP+  + K+TW  I +  N+ T  +F   QL+ H
Sbjct: 163 QSKGYWSPSSHELFVDLLFQEALKGNRPDSHYPKETWKMILETINQNTGKSFTRPQLKNH 222

Query: 73  LDVLRMRYNNLKSAYDHNNGFLLDDSICIGF----EQWED-IGAQPRNETVRVKDCPIYE 127
            D  R  +             +  D+    F    E W++ +    R    R K  P  +
Sbjct: 223 WDCTRKSWKIWCQVI--GAPVMKWDATSRTFGATDEDWKNYLKENHRAAPFRRKQLPHAD 280

Query: 128 QLCTIFVDSAADGK-YAQS-------SHYEELDKSIGIDASGLYENPSSSKAISGNILTV 179
           +L TIF      GK Y +S        H E          S LY N   S +  G     
Sbjct: 281 KLATIFKGLIEPGKAYFRSYRRRVLDHHSESPQLHDPTPLSTLYTNEPVSGSEGGADEDD 340

Query: 180 DKETKNSLDRKRKRHHET-------QTTTLDQGTCDAMAGALFEMIXXXXXXXXXXXXXD 232
           + +       + +R +         Q   +    C  +   L +                
Sbjct: 341 NDDDDEQPTPQHRRFNSVGFAESRLQDVEIVTPGCHRLKTELMK------ESPVSASVRQ 394

Query: 233 DKFSITNCIRALDEIQDIDQL--LYFSALDLFEDPRLRETFISLKSVKIRLTWL 284
            +++I  CI  LD +++++Q   LY  ALDLF     RE F+ LK+  +R++WL
Sbjct: 395 YEYTIGECIECLDSMEEVEQGSDLYLFALDLFVKKEYREIFLLLKNSSLRMSWL 448


>AT4G02210.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN:
           cellular_component unknown; EXPRESSED IN: 22 plant
           structures; EXPRESSED DURING: 13 growth stages; BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 72.0 bits (175), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 120/292 (41%), Gaps = 52/292 (17%)

Query: 13  RSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRKH 72
           R RT W   +D+ F DL++ Q + GN+   +F K+ W  + + FN + + NF+ + L+  
Sbjct: 181 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 240

Query: 73  LDVLRMRYNNLKSAYDHNNGFLLDDS---ICIGFEQWED-IGAQPRNETVRVKDCPIYEQ 128
              LR ++N +KS    ++GF  D+    +      W+D I A         +  P Y+ 
Sbjct: 241 YKSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKD 299

Query: 129 LCTIFVDSAADGK--YAQSSHYEELDKSIGIDASGLYENPSSSKAISGNILTVDKETK-- 184
           LC +  DS  +    +     ++   +     +SG  +   S++    N L  D + K  
Sbjct: 300 LCVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEEDSNSLLFDPKNKRD 359

Query: 185 -------NSLDRKRKRHHETQTTTLDQGTCDAMAGALFEMIXXXXXXXXXXXXXDDKFSI 237
                  + ++ K+ R  ETQT                                    SI
Sbjct: 360 QLANTDTSPINPKKPRVDETQT-----------------------------------MSI 384

Query: 238 TNCIRALDEIQDIDQLLYFSALDLFEDPRLRETFISLKSVKIRLTWLQGKAK 289
            + + A+  + D+D  L   A DL ED    +TF++L  VK+R  WL  K +
Sbjct: 385 EDTVEAIQALPDMDDELILDACDLLEDKLKAKTFLAL-DVKLRKKWLLRKLR 435



 Score = 63.9 bits (154), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 71/150 (47%), Gaps = 28/150 (18%)

Query: 12  ERSRTRWTASLDKIFADLVVKQIQLGNRPND-IFDKKTWNHIRDEFNRQTDLNFNNNQLR 70
           ER RT WT  +D+ F +L+V+Q++ GNR  D +F K+ W  +   F  +    +      
Sbjct: 8   ERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGK---- 63

Query: 71  KHLDVLRMRYNNLKSAYDHNNGFLLDDSICIGFEQWED---------------IGAQPRN 115
              DVL+ R+  L++ +   N  L++D    GF  W+D               +   P +
Sbjct: 64  ---DVLKNRHKTLRNLFKSVNNLLIED----GF-SWDDTRQMVVADNCVWDEYLKIHPDS 115

Query: 116 ETVRVKDCPIYEQLCTIFVDSAADGKYAQS 145
            + R+K  P Y+ LC ++ D  ++ K  +S
Sbjct: 116 RSFRIKSIPCYKDLCLVYSDGMSEHKAEES 145


>AT4G02210.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins
           in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes
           - 26 (source: NCBI BLink). | chr4:974320-975917 REVERSE
           LENGTH=439
          Length = 439

 Score = 72.0 bits (175), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 120/292 (41%), Gaps = 52/292 (17%)

Query: 13  RSRTRWTASLDKIFADLVVKQIQLGNRPNDIFDKKTWNHIRDEFNRQTDLNFNNNQLRKH 72
           R RT W   +D+ F DL++ Q + GN+   +F K+ W  + + FN + + NF+ + L+  
Sbjct: 181 RCRTTWHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNR 240

Query: 73  LDVLRMRYNNLKSAYDHNNGFLLDDS---ICIGFEQWED-IGAQPRNETVRVKDCPIYEQ 128
              LR ++N +KS    ++GF  D+    +      W+D I A         +  P Y+ 
Sbjct: 241 YKSLRRQFNAIKSIL-RSDGFAWDNERQMVTADNNVWQDYIKAHRDARQFMTRPIPYYKD 299

Query: 129 LCTIFVDSAADGK--YAQSSHYEELDKSIGIDASGLYENPSSSKAISGNILTVDKETK-- 184
           LC +  DS  +    +     ++   +     +SG  +   S++    N L  D + K  
Sbjct: 300 LCVLCGDSGIEENECFVAMDWFDPETEFQEFKSSGTTDLSISAEEEDSNSLLFDPKNKRD 359

Query: 185 -------NSLDRKRKRHHETQTTTLDQGTCDAMAGALFEMIXXXXXXXXXXXXXDDKFSI 237
                  + ++ K+ R  ETQT                                    SI
Sbjct: 360 QLANTDTSPINPKKPRVDETQT-----------------------------------MSI 384

Query: 238 TNCIRALDEIQDIDQLLYFSALDLFEDPRLRETFISLKSVKIRLTWLQGKAK 289
            + + A+  + D+D  L   A DL ED    +TF++L  VK+R  WL  K +
Sbjct: 385 EDTVEAIQALPDMDDELILDACDLLEDKLKAKTFLAL-DVKLRKKWLLRKLR 435



 Score = 63.9 bits (154), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 71/150 (47%), Gaps = 28/150 (18%)

Query: 12  ERSRTRWTASLDKIFADLVVKQIQLGNRPND-IFDKKTWNHIRDEFNRQTDLNFNNNQLR 70
           ER RT WT  +D+ F +L+V+Q++ GNR  D +F K+ W  +   F  +    +      
Sbjct: 8   ERLRTVWTPEMDQYFIELMVEQVRKGNRFEDHLFSKRAWKFMSCSFTAKFKFLYGK---- 63

Query: 71  KHLDVLRMRYNNLKSAYDHNNGFLLDDSICIGFEQWED---------------IGAQPRN 115
              DVL+ R+  L++ +   N  L++D    GF  W+D               +   P +
Sbjct: 64  ---DVLKNRHKTLRNLFKSVNNLLIED----GF-SWDDTRQMVVADNCVWDEYLKIHPDS 115

Query: 116 ETVRVKDCPIYEQLCTIFVDSAADGKYAQS 145
            + R+K  P Y+ LC ++ D  ++ K  +S
Sbjct: 116 RSFRIKSIPCYKDLCLVYSDGMSEHKAEES 145


>AT2G19220.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 443 Blast hits to 267 proteins
           in 21 species: Archae - 0; Bacteria - 0; Metazoa - 0;
           Fungi - 17; Plants - 426; Viruses - 0; Other Eukaryotes
           - 0 (source: NCBI BLink). | chr2:8340678-8342161 REVERSE
           LENGTH=439
          Length = 439

 Score = 63.2 bits (152), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 118/282 (41%), Gaps = 19/282 (6%)

Query: 13  RSRTRWTASLDKIFADLVVKQIQLGNRP---NDIFDKKTWNHIRDEFNRQTDLNFNNNQL 69
           R   +W+ S   I  D   ++   G RP   N +F K++W  I ++ NR T L + + QL
Sbjct: 160 RRHYKWSPSSHAIVVDTCFQESLKGIRPIKRNHLFTKESWKMILEKINRITGLGYTHKQL 219

Query: 70  RKHLDVLRMRYNNLKSAYDHNNGFLLDDSICIGF----EQWED-IGAQPRNETVRVKDCP 124
             H    R  + +        +  +  D+    F    E W+  +    R    + +  P
Sbjct: 220 ENHFTRTRTSWKHWCETI--ASPIMKWDANTRKFGATEEDWDKYLMINKRARVFKRRHIP 277

Query: 125 IYEQLCTIFVDSAADGKYAQSSHYEELDKSIGIDASGLYENPSSSKAISGNILTVDKETK 184
             ++L TIF      GK  ++  Y +       ++  L+++  +  ++   ++  ++  K
Sbjct: 278 HADKLATIFKGRIEPGK-TKTRRYRKRVIDHHSESPQLHDHQPTPSSV---VVNTNEPVK 333

Query: 185 NSLDRKRKRHHETQTTTLDQGTCDAMAGALFEMIXXXXXXXXXXXXXDDKFSITNCIRAL 244
            S DR    + E  T+ +   + D       EM+              +KF+   C+  L
Sbjct: 334 GSDDRAEDGNVEP-TSLIRSDSEDVAETVTPEMMEKIPVNASVKKK--EKFTFEECVECL 390

Query: 245 DEIQDIDQL--LYFSALDLFEDPRLRETFISLKSVKIRLTWL 284
           D I+++++   LY  ALDLF+    R  F+ L+   +R+ WL
Sbjct: 391 DAIEEVEKGGDLYMFALDLFKTKDYRYLFLMLQKSSLRMAWL 432


>AT3G11310.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G11290.1); Has 575 Blast hits to 342 proteins
           in 22 species: Archae - 0; Bacteria - 2; Metazoa - 0;
           Fungi - 10; Plants - 559; Viruses - 0; Other Eukaryotes
           - 4 (source: NCBI BLink). | chr3:3542536-3544333 REVERSE
           LENGTH=539
          Length = 539

 Score = 57.4 bits (137), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 79/173 (45%), Gaps = 17/173 (9%)

Query: 13  RSRTRWTASLDKIFADLVVKQIQLGNRP-----NDIFDKKTWNHIRDEFNRQTDLNFNNN 67
           R +  W++S  +IF DL+  +    NRP     N  + K+TWN + + FN++T L +   
Sbjct: 171 RYKAYWSSSSHEIFVDLLFTESLKENRPKPARRNGYYAKETWNMMVESFNQKTGLRYTRK 230

Query: 68  QLRKHLDVLRMRYNNLKSAYDHNNGFLLDDSICIGF----EQWEDIGAQ-PRNETVRVKD 122
           QL+ H ++ R  +     A    +  L  D+    F    E WE+   +  R E  R+K 
Sbjct: 231 QLKNHWNITRDAWRRWCQAV--GSPLLKWDANTKTFGATSEDWENYSKENKRAEQFRLKH 288

Query: 123 CPIYEQLCTIFVDSAADGKYAQSSHYEELDKSIGIDASGLYENPSSSKAISGN 175
            P  ++L  IF      GK A   + + ++       S   ++P+ S A++ N
Sbjct: 289 IPHADKLAIIFKGHVEPGKTALRPYRKRVNHH-----SEAPQHPAPSSALNIN 336