Miyakogusa Predicted Gene

Lj1g3v4779510.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4779510.1 Non Characterized Hit- tr|I1NBS4|I1NBS4_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.34652
PE,64.2,0,seg,NULL; SUBFAMILY NOT NAMED,NULL; PHOSPHATIDYLINOSITOL
N-ACETYLGLUCOSAMINYLTRANSFERASE SUBUNIT P (,CUFF.33257.1
         (849 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT5G26910.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   256   4e-68
AT5G26910.3 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   256   4e-68
AT3G58650.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   160   4e-39
AT3G05750.1 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   155   2e-37
AT5G26910.2 | Symbols:  | unknown protein; BEST Arabidopsis thal...   150   3e-36
AT3G05750.2 | Symbols:  | unknown protein; FUNCTIONS IN: molecul...   135   1e-31
AT3G53540.1 | Symbols:  | unknown protein; LOCATED IN: plasma me...    52   1e-06
AT2G39435.1 | Symbols:  | Phosphatidylinositol N-acetyglucosamin...    52   2e-06

>AT5G26910.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G58650.1); Has 1322 Blast hits to 684 proteins
           in 162 species: Archae - 4; Bacteria - 497; Metazoa -
           157; Fungi - 101; Plants - 155; Viruses - 0; Other
           Eukaryotes - 408 (source: NCBI BLink). |
           chr5:9466169-9469523 REVERSE LENGTH=853
          Length = 853

 Score =  256 bits (654), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 254/863 (29%), Positives = 401/863 (46%), Gaps = 135/863 (15%)

Query: 11  SICSDEGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSSSHGVSHCNEVALPMDEFC 70
           S+ SD+G G++AP +VARLMGL+SLP     E     LN      +   ++     D + 
Sbjct: 77  SVTSDDGQGTRAPSVVARLMGLESLPVPNVQE---PRLNPDLDPFLLRPSQNTNRWDAYE 133

Query: 71  PRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAKPIPVTHNKLLSPIKS 130
              Y+N+    +  S D ++ R     N+P++RFQ+E  PP+SAKPI VT+N+ LSPI+S
Sbjct: 134 NLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAKPICVTNNRHLSPIRS 193

Query: 131 PGFLPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRILDLKERLEAAQCAFT 189
           PGF+P +N  ++MEAA+++IE SP+   R R  PS   SSVP+RI DL+E+LEAAQ    
Sbjct: 194 PGFVPSRNPIYVMEAASRMIEPSPRMVARTRFSPSNSPSSVPMRIQDLREKLEAAQ---- 249

Query: 190 PEKLVGPSNANPANGILYERSSNSHK---------CTSAFKGSRDSEKNSSSHSATRRRS 240
             K+    N+N    + Y    ++ K          TS F G     K+S+     + + 
Sbjct: 250 --KVSSRQNSNDTFNLKYPSGKHNEKRITTSLTTPSTSKFMG-----KSSTDGLKGKVKP 302

Query: 241 DSLALQAKPNVQNRDTLNSNGNRKYVKQKEQKEIKSNQLSRSQKPSSDRDVHQRTCTSRN 300
             ++ QAK          ++ N+K     ++  +KS    R                S  
Sbjct: 303 SYVSAQAKAGTTPLSVTRNSANQKEKADAKKCVVKSQNALRG------------APISMG 350

Query: 301 SNVLGQNNQKQNC------MTTTSKPISKIDSNKATARASSSESSIGTR--KTTGRGAKN 352
            N+  QNNQKQNC      MT+     S   +NK   +      SI  +   +T    KN
Sbjct: 351 KNMFKQNNQKQNCRDNQPSMTSVLNQKSSKVNNKVVNKVPVESGSISKQLGLSTASAEKN 410

Query: 353 VNVQPKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHEARSPDHAVNNFQSKSIKCNF 412
            ++   R   + T  R + LP+       +K  IS      RS +          IKCN 
Sbjct: 411 TSLSLSR---KKTLPRSKKLPN-----GMQKSGISDDKRTKRSENM---------IKCNI 453

Query: 413 TTDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDSPSSTEQAMEIRNSVGVNSPGHNDNS 472
           T DG +++   + K+  DVISFTF SP++    DS SST+         G+     +  S
Sbjct: 454 TIDGGLNKGKDDRKKEMDVISFTFSSPIKGLSSDSLSSTQ---------GIGQDTDSAVS 504

Query: 473 YHRNLSLSPPGLNMIDSDAXXXXXXXXXXXXTSRLNLPQCTLATEXXXXXXXXXXQDKVP 532
           ++            I  D+            TS+L    C+L  E           D++ 
Sbjct: 505 FN------------IGGDSLNALLEQKLRELTSKLESSSCSLTQE---EPSYSIPMDEMN 549

Query: 533 SMVSITSKEQDKSFYPDQFSDKLDCMHNYHCSSGDPVLNLNQ---QIQTSEVREDPRCSS 589
            M+S +S+      Y     + L  + +   S  D     ++   QIQ  E       + 
Sbjct: 550 GMISFSSE------YEKSTQNGLRKVLSESESVSDCTSFYDKQKFQIQAEEHEVSSISTV 603

Query: 590 KDANDLGFQHPNAVTVLETSFASESYLDSEDSTYGSTVYSSMQDEEVSDYSQTHESVSLA 649
            +A+DL             S  S+ + D   +    T+ SS  D+E++ +   +ES    
Sbjct: 604 TEADDL------------RSSCSKGFSDCRQTAEYGTIQSS-SDQELT-WVSLNESHQAQ 649

Query: 650 NEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELE--YIQDILENADFMSEEFV 707
           +E + SE                  +  L   E    ++ E  YI +IL +   M +E+ 
Sbjct: 650 DESELSES-----------------VVTLSYSEAEERLDWEFEYISEILGSDQLMVKEYA 692

Query: 708 MGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVSECLELRFTQAFVGRCK 767
           +G A  V+  +LFD +E +G           +K++RK LFD V++CL LR  Q F+G C+
Sbjct: 693 LGMATDVLPASLFDEMEGRGEVTA-------AKIKRKTLFDFVNKCLALRCEQMFMGSCR 745

Query: 768 S-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMSTGCGKWLDFDIEAFEE 826
               +     +++ WLAEEL +E+ G + M E+M+DELV K+MS+  G+WLDF+ E +EE
Sbjct: 746 GLLGKGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYEE 805

Query: 827 GSEVEQDILASLINELVSDLLLG 849
           G ++E +I+++L+++LV+DL+ G
Sbjct: 806 GIDIEGEIVSTLVDDLVNDLVSG 828


>AT5G26910.3 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: mitochondrion;
           EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT3G58650.1). |
           chr5:9466169-9469523 REVERSE LENGTH=852
          Length = 852

 Score =  256 bits (654), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 255/864 (29%), Positives = 402/864 (46%), Gaps = 137/864 (15%)

Query: 11  SICSDEGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSSSHGVSHCNEVALPMDEFC 70
           S+ SD+G G++AP +VARLMGL+SLP     E     LN      +   ++     D + 
Sbjct: 76  SVTSDDGQGTRAPSVVARLMGLESLPVPNVQE---PRLNPDLDPFLLRPSQNTNRWDAYE 132

Query: 71  PRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAKPIPVTHNKLLSPIKS 130
              Y+N+    +  S D ++ R     N+P++RFQ+E  PP+SAKPI VT+N+ LSPI+S
Sbjct: 133 NLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAKPICVTNNRHLSPIRS 192

Query: 131 PGFLPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRILDLKERLEAAQCAFT 189
           PGF+P +N  ++MEAA+++IE SP+   R R  PS   SSVP+RI DL+E+LEAAQ    
Sbjct: 193 PGFVPSRNPIYVMEAASRMIEPSPRMVARTRFSPSNSPSSVPMRIQDLREKLEAAQ---- 248

Query: 190 PEKLVGPSNANPANGILYERSSNSHK---------CTSAFKGSRDSEKNSSSHSATRRRS 240
             K+    N+N    + Y    ++ K          TS F G     K+S+     + + 
Sbjct: 249 --KVSSRQNSNDTFNLKYPSGKHNEKRITTSLTTPSTSKFMG-----KSSTDGLKGKVKP 301

Query: 241 DSLALQAKPNVQNRDTLNSNGNRKYVKQKEQKEIKSNQLSRSQKPSSDRDVHQRTCTSRN 300
             ++ QAK          ++ N+K     ++  +KS    R                S  
Sbjct: 302 SYVSAQAKAGTTPLSVTRNSANQKEKADAKKCVVKSQNALRG------------APISMG 349

Query: 301 SNVLGQNNQKQNC------MTTTSKPISKIDSNKATARASSSESSIGTR--KTTGRGAKN 352
            N+  QNNQKQNC      MT+     S   +NK   +      SI  +   +T    KN
Sbjct: 350 KNMFKQNNQKQNCRDNQPSMTSVLNQKSSKVNNKVVNKVPVESGSISKQLGLSTASAEKN 409

Query: 353 VNVQPKRSSLRATDNRKEFLP-SKTESISQKKKFISRSSHEARSPDHAVNNFQSKSIKCN 411
            ++         + +RK+ LP SK      +K  IS      RS +          IKCN
Sbjct: 410 TSL---------SLSRKKTLPRSKKLPNGMQKSGISDDKRTKRSENM---------IKCN 451

Query: 412 FTTDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDSPSSTEQAMEIRNSVGVNSPGHNDN 471
            T DG +++   + K+  DVISFTF SP++    DS SST+         G+     +  
Sbjct: 452 ITIDGGLNKGKDDRKKEMDVISFTFSSPIKGLSSDSLSSTQ---------GIGQDTDSAV 502

Query: 472 SYHRNLSLSPPGLNMIDSDAXXXXXXXXXXXXTSRLNLPQCTLATEXXXXXXXXXXQDKV 531
           S++            I  D+            TS+L    C+L  E           D++
Sbjct: 503 SFN------------IGGDSLNALLEQKLRELTSKLESSSCSLTQE---EPSYSIPMDEM 547

Query: 532 PSMVSITSKEQDKSFYPDQFSDKLDCMHNYHCSSGDPVLNLNQ---QIQTSEVREDPRCS 588
             M+S +S+      Y     + L  + +   S  D     ++   QIQ  E       +
Sbjct: 548 NGMISFSSE------YEKSTQNGLRKVLSESESVSDCTSFYDKQKFQIQAEEHEVSSIST 601

Query: 589 SKDANDLGFQHPNAVTVLETSFASESYLDSEDSTYGSTVYSSMQDEEVSDYSQTHESVSL 648
             +A+DL             S  S+ + D   +    T+ SS  D+E++ +   +ES   
Sbjct: 602 VTEADDL------------RSSCSKGFSDCRQTAEYGTIQSS-SDQELT-WVSLNESHQA 647

Query: 649 ANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELE--YIQDILENADFMSEEF 706
            +E + SE                  +  L   E    ++ E  YI +IL +   M +E+
Sbjct: 648 QDESELSES-----------------VVTLSYSEAEERLDWEFEYISEILGSDQLMVKEY 690

Query: 707 VMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVSECLELRFTQAFVGRC 766
            +G A  V+  +LFD +E +G           +K++RK LFD V++CL LR  Q F+G C
Sbjct: 691 ALGMATDVLPASLFDEMEGRGEVTA-------AKIKRKTLFDFVNKCLALRCEQMFMGSC 743

Query: 767 KS-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMVDELVSKDMSTGCGKWLDFDIEAFE 825
           +    +     +++ WLAEEL +E+ G + M E+M+DELV K+MS+  G+WLDF+ E +E
Sbjct: 744 RGLLGKGGFLFEQRDWLAEELNREIHGLKKMREMMMDELVDKEMSSFEGRWLDFERETYE 803

Query: 826 EGSEVEQDILASLINELVSDLLLG 849
           EG ++E +I+++L+++LV+DL+ G
Sbjct: 804 EGIDIEGEIVSTLVDDLVNDLVSG 827


>AT3G58650.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: plasma membrane;
           EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G26910.1); Has 2350 Blast
           hits to 1412 proteins in 248 species: Archae - 0;
           Bacteria - 487; Metazoa - 577; Fungi - 236; Plants -
           184; Viruses - 4; Other Eukaryotes - 862 (source: NCBI
           BLink). | chr3:21696349-21699219 REVERSE LENGTH=820
          Length = 820

 Score =  160 bits (404), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 159/445 (35%), Positives = 215/445 (48%), Gaps = 64/445 (14%)

Query: 11  SICSDEGCGSKAPGLVARLMGLDSLPA------SANTELSCTSLNGSSSHGVSHCNEVAL 64
           S+ SD+G   +A  +VARLMGL+ LP         N +L    L  S        N    
Sbjct: 80  SVTSDDGNVVRA-SVVARLMGLEGLPLPNVLEPRVNPDLDPYFLRSSRQANTWDAN---- 134

Query: 65  PMDEFCPRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAKPIPVTHNKL 124
            +D     D ++  H   + S  +   R R +E     RFQTE LPP+SAKPI VTHNKL
Sbjct: 135 -VDRQSDFDGVSWDH---LDSRTSKGPRKRMIE-----RFQTETLPPRSAKPISVTHNKL 185

Query: 125 LSPIKSPGFLPPKNAAHLMEAAAKIIEASPQHYTRDRMPSVRSSS--VPLRILDLKERLE 182
           LSPI++PGF+P +N A++MEAA+++IE SP+   R RM S   SS  VPLRI DLKE+LE
Sbjct: 186 LSPIRNPGFVPSRNPAYVMEAASRMIEQSPRMIARTRMVSSSDSSSPVPLRIRDLKEKLE 245

Query: 183 AAQCAFTPEKLVGPSNANPANGILYERSSNSHKCTSAF-KGSRDSEKNSSSHSATRRRSD 241
           AAQ A T      P  +N      Y R   + K T+   K S D+ K          +  
Sbjct: 246 AAQKASTSV----PQISNDTRNSRYLRGDQNEKKTTVLGKNSYDALKGGEV------KPP 295

Query: 242 SLALQAKPNV-QNRDTL--NSNGNRKYVK-QKEQKEIKSNQLSRSQKPSSDRDVHQRTCT 297
           S A QAK +  Q +D+L  +S+GN++    QKE+ E K N+  +SQ  S      + +  
Sbjct: 296 SFAAQAKVSSNQKQDSLSMSSSGNKRMSSGQKEKVEAK-NRAVKSQNSS------KGSSL 348

Query: 298 SRNSNVLGQNNQKQNCM-TTTSKPISKIDSNKATARASSSESSIGTRKTTGRGAKNVNVQ 356
           S   NVL QNNQKQNC     S+ +     NK    + S   S G   ++     ++ + 
Sbjct: 349 STGKNVLRQNNQKQNCRDNQQSRRVMNKVVNKVLVESGSISKSSGFTMSSAEKPTSLPLS 408

Query: 357 PKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHEARSPDHAVNNFQSKSIKCNFTTDG 416
            K+S  R+   R        ES   + K I R                 KSIKCN + DG
Sbjct: 409 RKKSLPRSKKPRNGV----QESGIYEDKRIKRG---------------EKSIKCNISIDG 449

Query: 417 RIHQDAFNMKEGKDVISFTFMSPLR 441
                  + K   DVISFTF S ++
Sbjct: 450 DSSTSKDDQKRDMDVISFTFSSSIK 474



 Score =  120 bits (302), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 61/165 (36%), Positives = 102/165 (61%), Gaps = 9/165 (5%)

Query: 686 NMELEYIQDILENADFMSEEFVMGQA--DTVIMPNLFDLLENQGSSGTENYGDEYSKLER 743
           + ELEYI +IL +   M ++F  G    ++++  +LFD +E    + T        K ER
Sbjct: 652 DWELEYITEILNSGQLMFQDFASGTTTNESLLPSSLFDEMERSRGAATS------MKTER 705

Query: 744 KVLFDCVSECLELRFTQAFVGRCKSWP-RWVTSVQRKRWLAEELYKEMFGFRNMEEVMVD 802
           K LFDCV++CL ++F +  +G CK         ++ +  LAEE+ +E+ G + M E+M+D
Sbjct: 706 KALFDCVNQCLAVKFERMLIGSCKGMMMSGGILLEHRDLLAEEVNREVKGLKKMREMMID 765

Query: 803 ELVSKDMSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLL 847
           ELV  DMS   G+W+ ++ E FEEG ++E +I+++L+++LVSD+L
Sbjct: 766 ELVDHDMSCFEGRWIGYEREMFEEGIDMEGEIVSALVDDLVSDIL 810


>AT3G05750.1 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: membrane;
           EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G26910.1); Has 2317 Blast
           hits to 1467 proteins in 247 species: Archae - 4;
           Bacteria - 750; Metazoa - 557; Fungi - 182; Plants -
           180; Viruses - 0; Other Eukaryotes - 644 (source: NCBI
           BLink). | chr3:1704677-1707546 FORWARD LENGTH=801
          Length = 801

 Score =  155 bits (391), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 150/452 (33%), Positives = 217/452 (48%), Gaps = 68/452 (15%)

Query: 16  EGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSSSHGVSHCNEVALPMDEFCPRDYM 75
           +G GSKAP +VARLMGL+S+P     E      N          +  A   D +    Y+
Sbjct: 90  DGQGSKAPSVVARLMGLESIPVPNALE---PRRNPDFDPYFLRSSRKASTWDAYENLGYV 146

Query: 76  NMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAKPIPVTHNKLLSPIKSPGFLP 135
           N+    +  S D ++ R  K  NRP+ RFQTE LPP+SAKPIPVTHN+LLSPI+SPGF+ 
Sbjct: 147 NLRSDYDGISWDHLDSRMNKECNRPIDRFQTETLPPRSAKPIPVTHNRLLSPIRSPGFVQ 206

Query: 136 PKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRILDLKERLEAAQCAFTPEKLV 194
            +N A +ME A+++IE SP+   + R   S  SSS+P++I DLKE+LEA+Q   +P+   
Sbjct: 207 SRNPASVMEEASRMIEPSPRVVAKTRFSSSDSSSSLPMKIRDLKEKLEASQKGQSPQISN 266

Query: 195 GPSNANPANGILYERSSNSHKCTSAFKGSRDSEKNSSSHSATRRRSD------------- 241
           G  N               +KC   F+G +D EK ++    T+ R++             
Sbjct: 267 GTCN---------------NKC---FRGKQD-EKRTTLPLKTQERNNLLGESRFGGSKGK 307

Query: 242 ----SLALQAKPN-VQNRD-TLNSNGNRKYVKQKEQKEIKSNQLSRSQKPSSDRDVHQRT 295
               S++  AK N +  RD ++ SNG   Y  QK++ E K+  +    K SS        
Sbjct: 308 VKPPSVSAHAKANTIHKRDSSMLSNG---YRDQKKKVETKNRIVKSGLKESS-------- 356

Query: 296 CTSRNSNVLGQNNQKQNCMTTTSKPISKIDSNKATARASSSESSIGTRKTTGRGAKNVNV 355
             S    V   NNQKQN    TS  +S     K   + +      GT  TT +       
Sbjct: 357 -ASTRKTVDKPNNQKQNQFAETS--VSNQRGRKVMKKVNKVLVENGT--TTKKPGFTATS 411

Query: 356 QPKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHEARSPDHAVNNFQSKSIKCNFTTD 415
             K +S   +         + +++S+ KK  +       + D  +   + K IKCN T D
Sbjct: 412 AKKSTSSSLS---------RKKNLSRSKKPANGVQEAGVNSDKRIKKGE-KVIKCNITVD 461

Query: 416 GRIHQDAFNMKEGKDVISFTFMSPLRKSMHDS 447
           G +     + K+  DVISFTF SP++    DS
Sbjct: 462 GGLKTGDDDRKKDMDVISFTFSSPIKGLSSDS 493



 Score =  105 bits (262), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 130/227 (57%), Gaps = 12/227 (5%)

Query: 623 YGSTVYSSMQDEEVSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCE 682
           Y   ++ +  DEEV+ +S T E++ ++    +S   +      N+   +++    L   E
Sbjct: 584 YKKKIFQAEDDEEVNSFS-TAENLQISCSTSFSSSRNDYHH--NIEETELSESVALSEAE 640

Query: 683 VSSNMELEYIQDILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLE 742
              + ELEYI +I+ +   M +EF +G A  ++  +LFD         TE   D   K+E
Sbjct: 641 EGHDWELEYITEIIASGQLMIKEFSLGMATDILPLSLFD--------ETEGKRDARGKIE 692

Query: 743 RKVLFDCVSECLELRFTQAFVGRCKS-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMV 801
           RK LFD V++ L L+  Q F+G CK    +    ++R+  LA+++ KE  G + M E+M+
Sbjct: 693 RKTLFDLVNQWLTLKCEQMFMGTCKGVLGKQDIFLERREILADQVLKEAQGLKKMREMMM 752

Query: 802 DELVSKDMSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLLL 848
           DELV  DMS+  GKWLD+  E +EEG E+E++I++ L+++L++DL++
Sbjct: 753 DELVDNDMSSCEGKWLDYMRETYEEGIEIEEEIVSELVDDLINDLIM 799


>AT5G26910.2 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G58650.1); Has 990 Blast hits to 447 proteins
           in 125 species: Archae - 0; Bacteria - 525; Metazoa -
           80; Fungi - 59; Plants - 91; Viruses - 0; Other
           Eukaryotes - 235 (source: NCBI BLink). |
           chr5:9466804-9469523 REVERSE LENGTH=638
          Length = 638

 Score =  150 bits (380), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 152/461 (32%), Positives = 223/461 (48%), Gaps = 63/461 (13%)

Query: 11  SICSDEGCGSKAPGLVARLMGLDSLPASANTELSCTSLNGSSSHGVSHCNEVALPMDEFC 70
           S+ SD+G G++AP +VARLMGL+SLP     E     LN      +   ++     D + 
Sbjct: 77  SVTSDDGQGTRAPSVVARLMGLESLPVPNVQE---PRLNPDLDPFLLRPSQNTNRWDAYE 133

Query: 71  PRDYMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAKPIPVTHNKLLSPIKS 130
              Y+N+    +  S D ++ R     N+P++RFQ+E  PP+SAKPI VT+N+ LSPI+S
Sbjct: 134 NLGYVNLRSDYDGISWDHLDSRTNNGRNQPIERFQSETFPPRSAKPICVTNNRHLSPIRS 193

Query: 131 PGFLPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRILDLKERLEAAQCAFT 189
           PGF+P +N  ++MEAA+++IE SP+   R R  PS   SSVP+RI DL+E+LEAAQ    
Sbjct: 194 PGFVPSRNPIYVMEAASRMIEPSPRMVARTRFSPSNSPSSVPMRIQDLREKLEAAQ---- 249

Query: 190 PEKLVGPSNANPANGILYERSSNSHK---------CTSAFKGSRDSEKNSSSHSATRRRS 240
             K+    N+N    + Y    ++ K          TS F G     K+S+     + + 
Sbjct: 250 --KVSSRQNSNDTFNLKYPSGKHNEKRITTSLTTPSTSKFMG-----KSSTDGLKGKVKP 302

Query: 241 DSLALQAKPNVQNRDTLNSNGNRKYVKQKEQKEIKSNQLSRSQKPSSDRDVHQRTCTSRN 300
             ++ QAK          ++ N+K     ++  +KS    R                S  
Sbjct: 303 SYVSAQAKAGTTPLSVTRNSANQKEKADAKKCVVKSQNALRG------------APISMG 350

Query: 301 SNVLGQNNQKQNC------MTTTSKPISKIDSNKATARASSSESSIGTR--KTTGRGAKN 352
            N+  QNNQKQNC      MT+     S   +NK   +      SI  +   +T    KN
Sbjct: 351 KNMFKQNNQKQNCRDNQPSMTSVLNQKSSKVNNKVVNKVPVESGSISKQLGLSTASAEKN 410

Query: 353 VNVQPKRSSLRATDNRKEFLP-SKTESISQKKKFISRSSHEARSPDHAVNNFQSKSIKCN 411
            ++     SL    +RK+ LP SK      +K  IS      RS +          IKCN
Sbjct: 411 TSL-----SL----SRKKTLPRSKKLPNGMQKSGISDDKRTKRSENM---------IKCN 452

Query: 412 FTTDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDSPSSTE 452
            T DG +++   + K+  DVISFTF SP++    DS SST+
Sbjct: 453 ITIDGGLNKGKDDRKKEMDVISFTFSSPIKGLSSDSLSSTQ 493


>AT3G05750.2 | Symbols:  | unknown protein; FUNCTIONS IN:
           molecular_function unknown; INVOLVED IN:
           biological_process unknown; LOCATED IN: membrane;
           EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10
           growth stages; BEST Arabidopsis thaliana protein match
           is: unknown protein (TAIR:AT5G26910.3); Has 35333 Blast
           hits to 34131 proteins in 2444 species: Archae - 798;
           Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
           531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
           BLink). | chr3:1705300-1707546 FORWARD LENGTH=698
          Length = 698

 Score =  135 bits (340), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 132/394 (33%), Positives = 192/394 (48%), Gaps = 65/394 (16%)

Query: 74  YMNMTHKLEMSSSDAMELRARKMENRPMKRFQTEMLPPKSAKPIPVTHNKLLSPIKSPGF 133
           Y+N+    +  S D ++ R  K  NRP+ RFQTE LPP+SAKPIPVTHN+LLSPI+SPGF
Sbjct: 42  YVNLRSDYDGISWDHLDSRMNKECNRPIDRFQTETLPPRSAKPIPVTHNRLLSPIRSPGF 101

Query: 134 LPPKNAAHLMEAAAKIIEASPQHYTRDRM-PSVRSSSVPLRILDLKERLEAAQCAFTPEK 192
           +  +N A +ME A+++IE SP+   + R   S  SSS+P++I DLKE+LEA+Q   +P+ 
Sbjct: 102 VQSRNPASVMEEASRMIEPSPRVVAKTRFSSSDSSSSLPMKIRDLKEKLEASQKGQSPQI 161

Query: 193 LVGPSNANPANGILYERSSNSHKCTSAFKGSRDSEKNSSSHSATRRRSD----------- 241
             G  N               +KC   F+G +D EK ++    T+ R++           
Sbjct: 162 SNGTCN---------------NKC---FRGKQD-EKRTTLPLKTQERNNLLGESRFGGSK 202

Query: 242 ------SLALQAKPN-VQNRD-TLNSNGNRKYVKQKEQKEIKSNQLSRSQKPSSDRDVHQ 293
                 S++  AK N +  RD ++ SNG   Y  QK++ E K+  +    K SS      
Sbjct: 203 GKVKPPSVSAHAKANTIHKRDSSMLSNG---YRDQKKKVETKNRIVKSGLKESS------ 253

Query: 294 RTCTSRNSNVLGQNNQKQNCMTTTSKPISKIDSNKATARASSSESSIGTRKTTGRGAKNV 353
               S    V   NNQKQN    TS  +S     K   + +      GT  TT +     
Sbjct: 254 ---ASTRKTVDKPNNQKQNQFAETS--VSNQRGRKVMKKVNKVLVENGT--TTKKPGFTA 306

Query: 354 NVQPKRSSLRATDNRKEFLPSKTESISQKKKFISRSSHEARSPDHAVNNFQSKSIKCNFT 413
               K +S            S+ +++S+ KK  +       + D  +   + K IKCN T
Sbjct: 307 TSAKKSTSSSL---------SRKKNLSRSKKPANGVQEAGVNSDKRIKKGE-KVIKCNIT 356

Query: 414 TDGRIHQDAFNMKEGKDVISFTFMSPLRKSMHDS 447
            DG +     + K+  DVISFTF SP++    DS
Sbjct: 357 VDGGLKTGDDDRKKDMDVISFTFSSPIKGLSSDS 390



 Score =  105 bits (263), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 130/227 (57%), Gaps = 12/227 (5%)

Query: 623 YGSTVYSSMQDEEVSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCE 682
           Y   ++ +  DEEV+ +S T E++ ++    +S   +      N+   +++    L   E
Sbjct: 481 YKKKIFQAEDDEEVNSFS-TAENLQISCSTSFSSSRNDYHH--NIEETELSESVALSEAE 537

Query: 683 VSSNMELEYIQDILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLE 742
              + ELEYI +I+ +   M +EF +G A  ++  +LFD         TE   D   K+E
Sbjct: 538 EGHDWELEYITEIIASGQLMIKEFSLGMATDILPLSLFD--------ETEGKRDARGKIE 589

Query: 743 RKVLFDCVSECLELRFTQAFVGRCKS-WPRWVTSVQRKRWLAEELYKEMFGFRNMEEVMV 801
           RK LFD V++ L L+  Q F+G CK    +    ++R+  LA+++ KE  G + M E+M+
Sbjct: 590 RKTLFDLVNQWLTLKCEQMFMGTCKGVLGKQDIFLERREILADQVLKEAQGLKKMREMMM 649

Query: 802 DELVSKDMSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVSDLLL 848
           DELV  DMS+  GKWLD+  E +EEG E+E++I++ L+++L++DL++
Sbjct: 650 DELVDNDMSSCEGKWLDYMRETYEEGIEIEEEIVSELVDDLINDLIM 696


>AT3G53540.1 | Symbols:  | unknown protein; LOCATED IN: plasma
           membrane; EXPRESSED IN: 24 plant structures; EXPRESSED
           DURING: 14 growth stages; CONTAINS InterPro DOMAIN/s:
           Protein of unknown function DUF3741
           (InterPro:IPR022212); BEST Arabidopsis thaliana protein
           match is: Protein of unknown function (DUF3741)
           (TAIR:AT4G28760.2); Has 1710 Blast hits to 868 proteins
           in 206 species: Archae - 2; Bacteria - 409; Metazoa -
           304; Fungi - 204; Plants - 304; Viruses - 2; Other
           Eukaryotes - 485 (source: NCBI BLink). |
           chr3:19846805-19850670 REVERSE LENGTH=924
          Length = 924

 Score = 52.4 bits (124), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/262 (25%), Positives = 117/262 (44%), Gaps = 67/262 (25%)

Query: 585 PRCSSKDANDLGFQHPNAVTVLETSFASESYLDSEDSTYGSTVYSS---------MQDEE 635
           PR SSK+ +      P+ V+VLE SF        +D + GS  + S         MQ + 
Sbjct: 684 PRESSKEGD-----QPSPVSVLEASF-------DDDVSSGSECFESVSADLRGLRMQLQL 731

Query: 636 VSDYSQTHE--SVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELEYIQ 693
           +   S T++   + ++++    ++ SST T   M  K++             + +  Y+ 
Sbjct: 732 LKLESATYKEGGMLVSSDEDTDQEESSTITDEAMITKELRE----------EDWKSSYLV 781

Query: 694 DILENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVS-E 752
           D+L N+ F   +  +  A T + P+LF+ LE + SS   +     ++LERK+LFD +S E
Sbjct: 782 DLLANSSFSDSDHNIVMATTPVEPSLFEDLEKKYSSVKTS-----TRLERKLLFDQISRE 836

Query: 753 CL----ELRFTQAFVGRCKSWPRWVTS---------VQRK--------------RWLAEE 785
            L    +L     +V   K  P+W  +         V RK              +WL+ E
Sbjct: 837 VLHMLKQLSDPHPWVKSTKVCPKWDANKIQETLRDLVTRKDEKPSKYDVEEKELQWLSLE 896

Query: 786 LYKEMFGFRNMEEVMVDELVSK 807
              E+ G R +E ++ DEL+++
Sbjct: 897 DDIEIIG-REIEVMLTDELITE 917


>AT2G39435.1 | Symbols:  | Phosphatidylinositol
           N-acetyglucosaminlytransferase subunit P-related |
           chr2:16464806-16466492 REVERSE LENGTH=464
          Length = 464

 Score = 52.4 bits (124), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 70/276 (25%), Positives = 123/276 (44%), Gaps = 40/276 (14%)

Query: 585 PRC--SSKDANDLGFQHPNAVTVLETSFA------SESYLD-SEDSTYGSTVYSSMQDEE 635
           P C  +S+DA+      P+ V+VLE  F       SE  LD SED  Y + +    Q E 
Sbjct: 211 PECQTNSEDAH-----QPSPVSVLEPMFYEDNLDDSEDILDDSEDLPYPNFLSLENQLET 265

Query: 636 VSDYSQTHESVSLANEGKWSEQNSSTFTGGNMAVKQITRISDLGGCEVSSNMELEYIQDI 695
           +   S+++      ++G   E +S   +  + A+K+      +G  +   + +  YI DI
Sbjct: 266 LKSESESY------SDGSGMEVSSDEESALDSAIKESKESEPIGFLDTQESRDSSYIDDI 319

Query: 696 LENADFMSEEFVMGQADTVIMPNLFDLLENQGSSGTENYGDEYSKLERKVLFDCVSECLE 755
           L       +  V G+ D VI P +F+ LE +  + T      + + +RK+LFD V+  L 
Sbjct: 320 LAEVLLGDKNCVPGKRDLVITPKIFEKLEKKYYTET-----SWKRSDRKILFDRVNSSL- 373

Query: 756 LRFTQAFVGRCKSWPRWVTSVQRKR-------WLAEELYKEMFGFRNMEEVMVDELVSKD 808
           +   ++F       P W   V R+         L +EL+K +      E+    + ++K 
Sbjct: 374 VEILESFSAT----PTWKKPVSRRLGTALSTCGLKQELWKVL---SRQEKRSKKKSLAKV 426

Query: 809 MSTGCGKWLDFDIEAFEEGSEVEQDILASLINELVS 844
                 +WL+ + +      E+E  I+  L++E+VS
Sbjct: 427 PVIDIDEWLELEADDESVVCELESMIVDELLSEVVS 462