Miyakogusa Predicted Gene

Lj0g3v0328149.1
Show Alignment: 

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0328149.1 Non Chatacterized Hit- tr|J3LWY5|J3LWY5_ORYBR
Uncharacterized protein OS=Oryza brachyantha
GN=OB04G1,27.4,7e-18,Kinesin-related,Kinesin-related conserved domain;
coiled-coil,NULL; seg,NULL; SUBFAMILY NOT NAMED,NU,CUFF.22338.1
         (497 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT3G20150.1 | Symbols:  | Kinesin motor family protein | chr3:70...   147   2e-35
AT5G55520.1 | Symbols:  | CONTAINS InterPro DOMAIN/s: Kinesin-re...   134   2e-31
AT5G55520.2 | Symbols:  | CONTAINS InterPro DOMAIN/s: Kinesin-re...   132   4e-31
AT4G26660.1 | Symbols:  | INVOLVED IN: biological_process unknow...   130   2e-30
AT3G23670.1 | Symbols: PAKRP1L, KINESIN-12B | phragmoplast-assoc...    96   6e-20
AT4G14150.1 | Symbols: PAKRP1, KINESIN-12A | phragmoplast-associ...    90   5e-18

>AT3G20150.1 | Symbols:  | Kinesin motor family protein |
           chr3:7031412-7036499 FORWARD LENGTH=1114
          Length = 1114

 Score =  147 bits (370), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 149/467 (31%), Positives = 214/467 (45%), Gaps = 65/467 (13%)

Query: 22  RFDNEIDEEVNVGEEDIRQLRQQIDELYMSCEGNPKDISVSEDCVPYYSVEESCDTDMTS 81
           + DN+ +EE+ V E+D ++L  QI  L  S     K   V+ D V    V    ++++  
Sbjct: 506 KIDND-EEEITVDEDDFKELHLQIKSLRGSFNQKLKKFPVNRDSVNSSFVTAFGESELMD 564

Query: 82  CDEIEKEEVCSKET--SSDLCHKDSAAP----EDTSRAIKSTFRESISVSSCCRSPILEE 135
            DEI  EEV  +E      L   DSAA      + SR  +     SIS+S C +S IL+E
Sbjct: 565 DDEICSEEVEVEENDFGESLEEHDSAATVCKSSEKSRIEEFVSENSISISPCRQSLILQE 624

Query: 136 PLLSESPKISNIKRKSVAFSSSCLGSWNNVAEENSSSSIGQSFKKDELMQSSLRSSKVFP 195
           P+ SESPK  +  RKS+A SSSCL + N++A+   S+     F + + ++SSLR SK+F 
Sbjct: 625 PIQSESPKFRDSLRKSIALSSSCLRNQNSLAKSIKST----CFAESQHIRSSLRGSKIFT 680

Query: 196 GPTESLAASLQRGLQIIDHHQRNSALNKSLASFSFEHLTLTPCPEIDKGDSCDSTIQQKK 255
           G TESLAASL+RGL IID+   N A N+   S S ++LT+ P  +               
Sbjct: 681 GSTESLAASLRRGLDIIDNPM-NPASNRCSVSLSSDNLTMQPPTD--------------- 724

Query: 256 YSIDERTATLLCESCRIKICDKEDSIKVKDSLKSLSDTAEAGNQDGLTANVPNDLEMAKS 315
              D    + LC +CR  IC  +           L    E    DG          M   
Sbjct: 725 ---DRLPLSPLCPTCR--ICSSK-----------LPSVVEG---DG--------YHMEGV 757

Query: 316 IIREKELERLCKEQAARIEELNQLVEKLKGEKEINSIVVYGQEXXXXXXXXXXXXHLPDN 375
           + +++ELE+LC EQAA+IE+L +LV + K + E  +  + G               L  +
Sbjct: 758 LEKQQELEKLCSEQAAKIEQLTRLVGQHKLQTEDETEKLMGASNGERLPSANENQLL--S 815

Query: 376 XXXXXXXXXXXXXFDQRNCSFDATEKEELLKEIQNLRSKLQLYNDAPAKMSTDXXXXXXX 435
                         D +   FD  EKE LLKEI++L+ KLQ     P  MST+       
Sbjct: 816 CITETYDVKQISDDDSKKTDFDIGEKEALLKEIEDLKKKLQ----TPVTMSTN-----EL 866

Query: 436 XXXXXXXXXXVFSNHNAXXXXXXXXXXXXXXSDWICLTDELRVDLES 482
                     + S +                S+WI LTDE RV++E+
Sbjct: 867 RSSLLARSFQLRSKNAEKDIEEERLRCTEMESEWISLTDEFRVEIET 913


>AT5G55520.1 | Symbols:  | CONTAINS InterPro DOMAIN/s:
           Kinesin-related protein (InterPro:IPR010544); BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G26660.1); Has 32425 Blast hits to 20462
           proteins in 1550 species: Archae - 335; Bacteria - 3392;
           Metazoa - 16996; Fungi - 2645; Plants - 1561; Viruses -
           54; Other Eukaryotes - 7442 (source: NCBI BLink). |
           chr5:22488205-22491187 REVERSE LENGTH=805
          Length = 805

 Score =  134 bits (336), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 145/457 (31%), Positives = 205/457 (44%), Gaps = 87/457 (19%)

Query: 35  EEDIRQLRQQIDELYMSCEGNPKDISVSEDCVPYYSVEESCDTDMTSCDEIEKEEVCSKE 94
           ++D+ +L + I++ + +C+ +     + +     ++    C+ D  S DEI     CS +
Sbjct: 190 DDDVVELSKHINKFHTNCDSD----DLRDSIQSSFASASGCEADSMSGDEI-----CSVD 240

Query: 95  TSSDLCHKDSAAPEDTSRAIKSTFRESISVSSCCRSPILEEPLLSESPKISNIKRKSVAF 154
              D  HKD A  +    A+ +     IS+S   +S ILEEP LSESPKI N  RKSVA 
Sbjct: 241 KHKD--HKDCALADSGPSAVGN----GISISLPHQSRILEEPPLSESPKIRNF-RKSVAA 293

Query: 155 SSSCLGSWNNVAEENSSSSIGQSFKKDELMQSSLRSSKVFPGPTESLAASLQRGLQIIDH 214
           S+    S  NV E   SSS G               ++    PT+SLAASLQRGL IID 
Sbjct: 294 STKFQASPRNVTE---SSSTG---------------NRKPLSPTDSLAASLQRGLNIIDC 335

Query: 215 HQRNSALNKSLASFSFEHLTLTPCPEIDKGDSCDSTIQQKKYSIDERTATLLCESCRIKI 274
           HQR+S  N+S  SFSF HL+L PC E D   S    + QK    +  ++ LLC SCR K+
Sbjct: 336 HQRSSLSNRSSVSFSFGHLSLKPCDEADDNLSASVKLLQKDRPKEGGSSILLCLSCRQKL 395

Query: 275 CDKEDSIKVKDSLKSLSDTAEAGNQDGLTANVPNDLEMAKSIIREKELERLCKEQAARIE 334
            D+E                    Q G  A       + ++ + EK L+ +C EQA +IE
Sbjct: 396 -DQE-------------------AQGGYKA-------IEEACVDEKHLKNMCVEQATKIE 428

Query: 335 ELNQLVEKLKGEKEINSIVVYGQEXXXXXXXXXXXXHLPDNXXXXXXXXXXXXXFDQRNC 394
           +L   +++ K      S  V  Q                D+              +QR+ 
Sbjct: 429 QLTYQLDEYKKNALQESSKVTQQLMKS------------DDGEDETEVVKETYETNQRSE 476

Query: 395 SF-----DATEKEELLKEIQNLRSKLQLYNDAPAKMSTDXXXXXXXXXXXXXXXXXVFSN 449
            F     D +EKE LLKEI  L+SKLQ     P K STD                  F+ 
Sbjct: 477 EFGKVRIDLSEKEALLKEIAELKSKLQ-----PTK-STDNVRSSLLLRSFQMRKSIDFTK 530

Query: 450 ---HNAXXXXXXXXXXXXXXSDWICLTDELRVDLESN 483
              +N+              S+WI LTD+LR+D++S+
Sbjct: 531 NTENNSEALEEERERWTEMESEWISLTDDLRMDIDSH 567


>AT5G55520.2 | Symbols:  | CONTAINS InterPro DOMAIN/s:
           Kinesin-related protein (InterPro:IPR010544); BEST
           Arabidopsis thaliana protein match is: unknown protein
           (TAIR:AT4G26660.1); Has 31032 Blast hits to 19733
           proteins in 1535 species: Archae - 330; Bacteria - 3150;
           Metazoa - 16413; Fungi - 2511; Plants - 1475; Viruses -
           48; Other Eukaryotes - 7105 (source: NCBI BLink). |
           chr5:22488205-22491187 REVERSE LENGTH=801
          Length = 801

 Score =  132 bits (333), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 146/458 (31%), Positives = 207/458 (45%), Gaps = 93/458 (20%)

Query: 35  EEDIRQLRQQIDELYMSCEGNPKDISVSEDCVPYYSVEESCDTDMTSCDEIEKEEVCSKE 94
           ++D+ +L + I++ + +C+ +     + +     ++    C+ D  S DEI     CS +
Sbjct: 190 DDDVVELSKHINKFHTNCDSD----DLRDSIQSSFASASGCEADSMSGDEI-----CSVD 240

Query: 95  TSSDLCHKDSAAPEDTSRAIKSTFRESISVSSCCRSPILEEPLLSESPKISNIKRKSVAF 154
              D  HKD A  +    A+ +     IS+S   +S ILEEP LSESPKI N  RKSVA 
Sbjct: 241 KHKD--HKDCALADSGPSAVGN----GISISLPHQSRILEEPPLSESPKIRNF-RKSVAA 293

Query: 155 SSSCLGSWNNVAEENSSSSIGQSFKKDELMQSSLRSSKVFPGPTESLAASLQRGLQIIDH 214
           S+    S  NV E   SSS G               ++    PT+SLAASLQRGL IID 
Sbjct: 294 STKFQASPRNVTE---SSSTG---------------NRKPLSPTDSLAASLQRGLNIIDC 335

Query: 215 HQRNSALNKSLASFSFEHLTLTPCPEIDKGDSCDSTIQQKKYSIDERTATLLCESCRIKI 274
           HQR+S  N+S  SFSF HL+L PC E D   S    + QK    +  ++ LLC SCR K+
Sbjct: 336 HQRSSLSNRSSVSFSFGHLSLKPCDEADDNLSASVKLLQKDRPKEGGSSILLCLSCRQKL 395

Query: 275 CDKEDSIKVKDSLKSLSDTAEAGNQDGLTANVPNDLEMAKSIIREKELERLCKEQAARIE 334
            D+E                    Q G  A       + ++ + EK L+ +C EQA +IE
Sbjct: 396 -DQE-------------------AQGGYKA-------IEEACVDEKHLKNMCVEQATKIE 428

Query: 335 ELN-QLVEKLKGEKEINSIVVYGQEXXXXXXXXXXXXHLPDNXXXXXXXXXXXXXFDQRN 393
           +L  QL E  K   + +S ++                   D+              +QR+
Sbjct: 429 QLTYQLDEYKKNALQESSKLMKS-----------------DDGEDETEVVKETYETNQRS 471

Query: 394 CSF-----DATEKEELLKEIQNLRSKLQLYNDAPAKMSTDXXXXXXXXXXXXXXXXXVFS 448
             F     D +EKE LLKEI  L+SKLQ     P K STD                  F+
Sbjct: 472 EEFGKVRIDLSEKEALLKEIAELKSKLQ-----PTK-STDNVRSSLLLRSFQMRKSIDFT 525

Query: 449 N---HNAXXXXXXXXXXXXXXSDWICLTDELRVDLESN 483
               +N+              S+WI LTD+LR+D++S+
Sbjct: 526 KNTENNSEALEEERERWTEMESEWISLTDDLRMDIDSH 563


>AT4G26660.1 | Symbols:  | INVOLVED IN: biological_process unknown;
           LOCATED IN: chloroplast; EXPRESSED IN: 13 plant
           structures; EXPRESSED DURING: 7 growth stages; CONTAINS
           InterPro DOMAIN/s: Kinesin-related protein
           (InterPro:IPR010544); BEST Arabidopsis thaliana protein
           match is: unknown protein (TAIR:AT5G55520.2); Has 1807
           Blast hits to 1807 proteins in 277 species: Archae - 0;
           Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
           Viruses - 0; Other Eukaryotes - 339 (source: NCBI
           BLink). | chr4:13448754-13451814 FORWARD LENGTH=806
          Length = 806

 Score =  130 bits (328), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 153/489 (31%), Positives = 223/489 (45%), Gaps = 93/489 (19%)

Query: 3   VAQMGVKMDTSKYVMSGKKRFDNEIDEEVNVGEEDIRQLRQQIDELYMSCEGNPKDISVS 62
           ++Q+ V ++ S  +MS  KR ++E  +EV V  ED+ +L + I++L+ S           
Sbjct: 170 LSQLRVSINKS-LLMSCPKRDESE-GKEVIVDGEDVLELNKHIEKLHGSY---------- 217

Query: 63  EDCVPYYSVEESC-DTDMTSCDEIEKEEVCSKETSSDLC--HKD----SAAPEDTSRAIK 115
            D V       SC + D  S D+   E+VCS++    +   HKD       P        
Sbjct: 218 -DSVHSSFASASCYEADSMSGDD---EDVCSEDLEKPMHGNHKDVDFVDNDPSQLDNVEF 273

Query: 116 STFRESISVSSCCRSPILEEPLLSESPKISNIKRKSVAFSSSCLGSWNNVAEENSSSSIG 175
            T    IS+ S   S +LEEP+ SESPK  N++ KSVA S+    +  NV+E   SS+IG
Sbjct: 274 DTTGSGISIRSQLPSCVLEEPIFSESPKFKNVQ-KSVAASTKFSANPRNVSE---SSNIG 329

Query: 176 QSFKKDELMQSSLRSSKVFPGPTESLAASLQRGLQIIDHHQRNSALNKSLASFSFEHLTL 235
                 ++ Q S   SK   GPT+SLAASLQRGLQIID+HQ +S    S  SFSF H+ L
Sbjct: 330 DM----KVNQISPCMSKKVSGPTDSLAASLQRGLQIIDYHQGSSLSKSSSVSFSFGHMAL 385

Query: 236 TPCPEIDKGDSCDSTIQQKKYSIDERTATLLCESCRIKICDKEDSIKVKDSLKSLSDTAE 295
            PC E +  ++   + ++ K S    ++ LLC SCR K+ D+E  +           T E
Sbjct: 386 KPCAEGENLNASVQSFRKDKASEGGLSSILLCLSCRKKV-DQEAEV-----------TEE 433

Query: 296 AGNQDGLTANVPNDLEMAKSIIREKELERLCKEQAARIEELNQLVEKLKGEKEINSIVVY 355
           AG+                    EK L+ +C EQAA+IEEL  L+ K    ++    +  
Sbjct: 434 AGSN-------------------EKHLKNMCMEQAAKIEELTLLLRKSDDGEDGTEFIKE 474

Query: 356 GQEXXXXXXXXXXXXHLPDNXXXXXXXXXXXXXFDQRNCSFDATEKEELLKEIQNLRSKL 415
             E                              F + N  F+ +EKE LLKEI +L+SKL
Sbjct: 475 TYE-----------------------TKQISEEFGKTN--FEVSEKEALLKEIADLKSKL 509

Query: 416 QLYNDAPAKMSTDXXXXXXXXXXXXXXXXXVFSN-HNAXXXXXXXXXXXXXXSDWICLTD 474
           Q     P K + +                 V  N  N+              S+WI LTD
Sbjct: 510 Q-----PTKSTDNLRSSLLLRSIQMRKSIDVSRNGENSDDLAKEREMWTEMESEWISLTD 564

Query: 475 ELRVDLESN 483
           +LR+D++++
Sbjct: 565 DLRMDIDNH 573


>AT3G23670.1 | Symbols: PAKRP1L, KINESIN-12B |
           phragmoplast-associated kinesin-related protein,
           putative | chr3:8519290-8525055 FORWARD LENGTH=1313
          Length = 1313

 Score = 95.9 bits (237), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 122/242 (50%), Gaps = 13/242 (5%)

Query: 122 ISVSSCCRSPILEEPLLSESPKISNIKRKSVAFSSSCLGSWNNVAEENSSS------SIG 175
           +SV+    SP+L  P  S SPKI N  RKS+  +S    S  ++   N  +      S  
Sbjct: 669 LSVAPVSVSPVLIPPTESASPKIRN-SRKSLRTTSMSTASQKDIERANQLTPEVVEPSPA 727

Query: 176 QSFKKDELMQS-SLRSSKVFPGPTESLAASLQRGLQIIDHHQRNSALNKSLASFSFEHLT 234
            S +   L  + S + S+ FP PT  LAASL RG++++D +++++AL +S    S++ L 
Sbjct: 728 MSTEVLNLYSALSTKKSEAFPVPTRQLAASLHRGMKLLDSYRQSTALRRSTFRLSYKALE 787

Query: 235 LTPCPEIDKGDSCDSTIQQKKYSIDERTATLLCESCRIKI-CDKEDSIKVKDSLKSLSDT 293
             P   + K D    T  Q     ++ +  +LC  C+ +  CD ++     +      D 
Sbjct: 788 CKPSTVLSKADVGVQTYPQADEIAEDNSKEVLCSRCKCRAECDAQEISDTSNLQLVPIDN 847

Query: 294 AEAGNQDGLTANVPNDLE--MAKSIIREKELERLCKEQAARIEELNQLVEKLKGEKEINS 351
           +E   +      VP  +E  +A SI RE  +E  C +QA+ I +LN+LV++ K E+E N+
Sbjct: 848 SEGSEKSNF--QVPKAVEKVLAGSIRREMAMEEFCTKQASEISQLNRLVQQYKHERECNA 905

Query: 352 IV 353
           I+
Sbjct: 906 II 907


>AT4G14150.1 | Symbols: PAKRP1, KINESIN-12A |
           phragmoplast-associated kinesin-related protein 1 |
           chr4:8158645-8165008 REVERSE LENGTH=1292
          Length = 1292

 Score = 89.7 bits (221), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 145/288 (50%), Gaps = 25/288 (8%)

Query: 78  DMTSCDEIEKEEVCSKETSSDLCHKDSAAPEDTSRAIKSTFRESISVSSCCRSPILEEPL 137
           D++SC ++  ++V    TS+++   D     D    + ++   S+ +     +P+L+ P 
Sbjct: 623 DVSSCPDLVPQDV----TSANVLIADGV---DDPEHLVNSASPSLCIDPVGATPVLKSPT 675

Query: 138 LSESPKISNIKRKSVAFSSSCLGSWN-----NVAEENSSSSIGQSFKKDELMQS-SLRSS 191
           LS SP I N  RKS+  S     S       N+  E +  S   S K +    + S + S
Sbjct: 676 LSVSPTIRN-SRKSLKTSELSTASQKDSEGENLVTEAADPSPATSKKMNNCSSALSTQKS 734

Query: 192 KVFPGPTESLAASLQRGLQIIDHHQRNSALNKSLASFSFEHLTLTPCPEIDKGDSCDSTI 251
           KVFP  TE LA+SL +G+++++ + +++A  +S   FSF+     P   I K D+   TI
Sbjct: 735 KVFPVRTERLASSLHKGIKLLESYCQSTAQRRSTYRFSFKAPDSEPSTSISKADAGVQTI 794

Query: 252 QQKKYSIDERTATLLCESCRIKICDKEDSIKVKDSLKSLS----DTAEAGNQDGLTANVP 307
                  +E T   LC  C+ K  ++ D+ ++ D + +L     D +E   +      VP
Sbjct: 795 PGADAISEENTKEFLC--CKCKCREQFDAQQMGD-MPNLQLVPVDNSEVAEKS--KNQVP 849

Query: 308 NDLE--MAKSIIREKELERLCKEQAARIEELNQLVEKLKGEKEINSIV 353
             +E  +A SI RE  LE  C +QA+ I +LN+LV++ K E+E N+I+
Sbjct: 850 KAVEKVLAGSIRREMALEEFCTKQASEITQLNRLVQQYKHERECNAII 897