Miyakogusa Predicted Gene
- Lj6g3v0802330.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v0802330.1 Non Characterized Hit- tr|Q6K765|Q6K765_ORYSJ
Putative phragmoplast-associated kinesin-related
prote,27.53,1e-18,Kinesin-related,Kinesin-related conserved domain;
seg,NULL; SUBFAMILY NOT NAMED,NULL; FAMILY NOT NAM,CUFF.58321.1
(366 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value
AT3G23670.1 | Symbols: PAKRP1L, KINESIN-12B | phragmoplast-assoc...   154   1e-37
AT5G55520.2 | Symbols: | CONTAINS InterPro DOMAIN/s: Kinesin-re...    151   8e-37
AT3G20150.1 | Symbols: | Kinesin motor family protein | chr3:70...    150   1e-36
AT5G55520.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Kinesin-re...    150   1e-36
AT4G14150.1 | Symbols: PAKRP1, KINESIN-12A | phragmoplast-associ...   146   2e-35
AT4G26660.1 | Symbols: | INVOLVED IN: biological_process unknow...    140   1e-33
AT3G23670.2 | Symbols: PAKRP1L, KINESIN-12B | phragmoplast-assoc...    74   2e-13
>AT3G23670.1 | Symbols: PAKRP1L, KINESIN-12B | phragmoplast-associated
kinesin-related protein, putative | chr3:8519290-8525055
FORWARD LENGTH=1313
Length = 1313
Score = 154 bits (389), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 188/385 (48%), Gaps = 38/385 (9%)
Query: 13 LAVNLQRGIEIIDYHQQNSALNRSSTSFSFKHLTLTPEIDKVESYDQTIQQKPTSDKV-- 70
LA +L RG++++D ++Q++AL RS+ S+K L P + D +Q P +D++
Sbjct: 754 LAASLHRGMKLLDSYRQSTALRRSTFRLSYKALECKPST-VLSKADVGVQTYPQADEIAE 812
Query: 71 --TAAFICAYCRTKVSNPNQDSTEVQDSFKSSFETVGQKGNPEGLTDKVPKHLNKLMAKD 128
+ +C+ C+ + D+ E+ D+ + E +VPK + K++A
Sbjct: 813 DNSKEVLCSRCKCRAEC---DAQEISDTSNLQLVPIDNSEGSEKSNFQVPKAVEKVLAGS 869
Query: 129 IMREKELENVCKMQAARIEQLNQLVEKLKEGKELNSITVYSQ------------------ 170
I RE +E C QA+ I QLN+LV++ K +E N+I ++
Sbjct: 870 IRREMAMEEFCTKQASEISQLNRLVQQYKHERECNAIIGQTREDKIVRLESLMDGVLSKD 929
Query: 171 ---CKEYNSMKDENKLLRSTSSNGHLPSIIEEKSEMKEVQEAL-AQRDVSFDSAEKESLL 226
+E+ S+ E+KLL+ N P +++ + E+K VQE L + ++ D E+E LL
Sbjct: 930 DFLDEEFASLMHEHKLLKDMYENH--PEVLQTRIELKRVQEELESFKNFYGDMGEREVLL 987
Query: 227 DEIRNPRSKLQLYSD-----APAKKYTDKXXXXXXXXXXXXXNSGVFSHDSSSE-DLENE 280
+EI + +++LQ Y+D A + K N+ S D E LE E
Sbjct: 988 EEIHDLKAQLQCYTDSSLTSARRRGSLLKLTYACDPNQAPQLNTIPESVDEGPEKTLEQE 1047
Query: 281 RQRWTEMESEWICLTDELRADLESYRRRAXXXXXXXXXXXXXXXXXDDVLKRAVIGHGKM 340
R RWTE ES WI L +ELR +L++ R + ++ A+ GH +M
Sbjct: 1048 RLRWTEAESNWISLAEELRTELDTNRLLMEKQKRELDTEKRCAEELTEAMQMAMQGHARM 1107
Query: 341 VEHYADLQEKYDDLVAKHEAIMEGI 365
+E YADL+EK+ L+A+H I EGI
Sbjct: 1108 IEQYADLEEKHIQLLARHRRIREGI 1132
>AT5G55520.2 | Symbols: | CONTAINS InterPro DOMAIN/s:
Kinesin-related protein (InterPro:IPR010544); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G26660.1); Has 31032 Blast hits to 19733
proteins in 1535 species: Archae - 330; Bacteria - 3150;
Metazoa - 16413; Fungi - 2511; Plants - 1475; Viruses -
48; Other Eukaryotes - 7105 (source: NCBI BLink). |
chr5:22488205-22491187 REVERSE LENGTH=801
Length = 801
Score = 151 bits (381), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 175/364 (48%), Gaps = 61/364 (16%)
Query: 8 SRAESLAVNLQRGIEIIDYHQQNSALNRSSTSFSFKHLTLTP---EIDKVESYDQTIQQK 64
S +SLA +LQRG+ IID HQ++S NRSS SFSF HL+L P D + + + +Q+
Sbjct: 317 SPTDSLAASLQRGLNIIDCHQRSSLSNRSSVSFSFGHLSLKPCDEADDNLSASVKLLQKD 376
Query: 65 PTSDKVTAAFICAYCRTKVSNPNQDSTEVQDSFKSSFETVGQKGNPEGLTDKVPKHLNKL 124
+ ++ +C CR K+ Q G K E D
Sbjct: 377 RPKEGGSSILLCLSCRQKLDQEAQG---------------GYKAIEEACVD--------- 412
Query: 125 MAKDIMREKELENVCKMQAARIEQLNQLVEKLKEGKELNSITVYSQCKEYNSMKDENKLL 184
EK L+N+C QA +IEQL +++ K+ N++++ +KL+
Sbjct: 413 -------EKHLKNMCVEQATKIEQLTYQLDEYKK----------------NALQESSKLM 449
Query: 185 RSTSSNGHLPSIIEEKSEMKEVQEALAQRDVSFDSAEKESLLDEIRNPRSKLQLYSDAPA 244
+S +++E E + E + V D +EKE+LL EI +SKLQ P
Sbjct: 450 KSDDGEDET-EVVKETYETNQRSEEFGK--VRIDLSEKEALLKEIAELKSKLQ-----PT 501
Query: 245 KKYTDKXXXXXXXXXXXXXNSGVFSHDS--SSEDLENERQRWTEMESEWICLTDELRADL 302
K TD S F+ ++ +SE LE ER+RWTEMESEWI LTD+LR D+
Sbjct: 502 KS-TDNVRSSLLLRSFQMRKSIDFTKNTENNSEALEEERERWTEMESEWISLTDDLRMDI 560
Query: 303 ESYRRRAXXXXXXXXXXXXXXXXXDDVLKRAVIGHGKMVEHYADLQEKYDDLVAKHEAIM 362
+S+RR A +D L RA++GH + +E Y +LQEKYD+L +H M
Sbjct: 561 DSHRRHAEDLEIELKKEKMATEELNDALSRAMLGHSRFIEQYTELQEKYDELDERHSVTM 620
Query: 363 EGIA 366
GI
Sbjct: 621 AGIV 624
>AT3G20150.1 | Symbols: | Kinesin motor family protein |
chr3:7031412-7036499 FORWARD LENGTH=1114
Length = 1114
Score = 150 bits (379), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/372 (33%), Positives = 180/372 (48%), Gaps = 73/372 (19%)
Query: 1 MRSSLQSSR-----AESLAVNLQRGIEIIDYHQQNSALNRSSTSFSFKHLTLTPEIDKVE 55
+RSSL+ S+ ESLA +L+RG++IID + N A NR S S S +LT+ P
Sbjct: 669 IRSSLRGSKIFTGSTESLAASLRRGLDIID-NPMNPASNRCSVSLSSDNLTMQP------ 721
Query: 56 SYDQTIQQKPTSDKVTAAFICAYCRTKVSNPNQDSTEVQDSFKSSFETVGQKGNPEGLTD 115
PT D++ + +C CR S S E G + EG+ +
Sbjct: 722 ---------PTDDRLPLSPLCPTCRICSSK-----------LPSVVE--GDGYHMEGVLE 759
Query: 116 KVPKHLNKLMAKDIMREKELENVCKMQAARIEQLNQLV--EKLKEGKELNSITVYSQCKE 173
K ++ELE +C QAA+IEQL +LV KL+ E + S +
Sbjct: 760 K---------------QQELEKLCSEQAAKIEQLTRLVGQHKLQTEDETEKLMGASNGER 804
Query: 174 YNSMKDENKLLRSTSSNGHLPSIIEEKSEMKEVQEALAQRDVSFDSAEKESLLDEIRNPR 233
S +EN+LL S I E ++K++ + +++ FD EKE+LL EI + +
Sbjct: 805 LPSA-NENQLL----------SCITETYDVKQISDDDSKK-TDFDIGEKEALLKEIEDLK 852
Query: 234 SKLQLYSDAPAKKYTDKXXXXXXXXXXXXXNSGVFSHDSSSEDLENERQRWTEMESEWIC 293
KLQ P T++ S ++ +D+E ER R TEMESEWI
Sbjct: 853 KKLQ----TPVTMSTNELRSSLLA------RSFQLRSKNAEKDIEEERLRCTEMESEWIS 902
Query: 294 LTDELRADLESYRRRAXXXXXXXXXXXXXXXXXDDVLKRAVIGHGKMVEHYADLQEKYDD 353
LTDE R ++E+ R RA +D L+RAV+GH + VEHY +LQEKY+D
Sbjct: 903 LTDEFRVEIETQRTRAEKAEAQLKQEKLSSEELEDALRRAVLGHARFVEHYTELQEKYND 962
Query: 354 LVAKHEAIMEGI 365
L +KH+A +E I
Sbjct: 963 LCSKHKATVEWI 974
>AT5G55520.1 | Symbols: | CONTAINS InterPro DOMAIN/s:
Kinesin-related protein (InterPro:IPR010544); BEST
Arabidopsis thaliana protein match is: unknown protein
(TAIR:AT4G26660.1); Has 32425 Blast hits to 20462
proteins in 1550 species: Archae - 335; Bacteria - 3392;
Metazoa - 16996; Fungi - 2645; Plants - 1561; Viruses -
54; Other Eukaryotes - 7442 (source: NCBI BLink). |
chr5:22488205-22491187 REVERSE LENGTH=805
Length = 805
Score = 150 bits (379), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 57/364 (15%)
Query: 8 SRAESLAVNLQRGIEIIDYHQQNSALNRSSTSFSFKHLTLTP---EIDKVESYDQTIQQK 64
S +SLA +LQRG+ IID HQ++S NRSS SFSF HL+L P D + + + +Q+
Sbjct: 317 SPTDSLAASLQRGLNIIDCHQRSSLSNRSSVSFSFGHLSLKPCDEADDNLSASVKLLQKD 376
Query: 65 PTSDKVTAAFICAYCRTKVSNPNQDSTEVQDSFKSSFETVGQKGNPEGLTDKVPKHLNKL 124
+ ++ +C CR K+ Q G K E D
Sbjct: 377 RPKEGGSSILLCLSCRQKLDQEAQG---------------GYKAIEEACVD--------- 412
Query: 125 MAKDIMREKELENVCKMQAARIEQLNQLVEKLKEGKELNSITVYSQCKEYNSMKDENKLL 184
EK L+N+C QA +IEQL +++ K+ S V Q + + +DE +
Sbjct: 413 -------EKHLKNMCVEQATKIEQLTYQLDEYKKNALQESSKVTQQLMKSDDGEDETE-- 463
Query: 185 RSTSSNGHLPSIIEEKSEMKEVQEALAQRDVSFDSAEKESLLDEIRNPRSKLQLYSDAPA 244
+++E E + E + V D +EKE+LL EI +SKLQ P
Sbjct: 464 -----------VVKETYETNQRSEEFGK--VRIDLSEKEALLKEIAELKSKLQ-----PT 505
Query: 245 KKYTDKXXXXXXXXXXXXXNSGVFSHDS--SSEDLENERQRWTEMESEWICLTDELRADL 302
K TD S F+ ++ +SE LE ER+RWTEMESEWI LTD+LR D+
Sbjct: 506 KS-TDNVRSSLLLRSFQMRKSIDFTKNTENNSEALEEERERWTEMESEWISLTDDLRMDI 564
Query: 303 ESYRRRAXXXXXXXXXXXXXXXXXDDVLKRAVIGHGKMVEHYADLQEKYDDLVAKHEAIM 362
+S+RR A +D L RA++GH + +E Y +LQEKYD+L +H M
Sbjct: 565 DSHRRHAEDLEIELKKEKMATEELNDALSRAMLGHSRFIEQYTELQEKYDELDERHSVTM 624
Query: 363 EGIA 366
GI
Sbjct: 625 AGIV 628
>AT4G14150.1 | Symbols: PAKRP1, KINESIN-12A | phragmoplast-associated
kinesin-related protein 1 | chr4:8158645-8165008 REVERSE
LENGTH=1292
Length = 1292
Score = 146 bits (369), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 184/384 (47%), Gaps = 33/384 (8%)
Query: 9 RAESLAVNLQRGIEIIDYHQQNSALNRSSTSFSFKHLTLTPEIDKVESYDQTIQQKPTSD 68
R E LA +L +GI++++ + Q++A RS+ FSFK P + D +Q P +D
Sbjct: 740 RTERLASSLHKGIKLLESYCQSTAQRRSTYRFSFKAPDSEPSTS-ISKADAGVQTIPGAD 798
Query: 69 KV----TAAFICAYCRTKVSNPNQDSTEVQDSFKSSFETVGQKGNPEGLTDKVPKHLNKL 124
+ T F+C C+ + D+ ++ D V E ++VPK + K+
Sbjct: 799 AISEENTKEFLCCKCKCR---EQFDAQQMGDMPNLQLVPVDNSEVAEKSKNQVPKAVEKV 855
Query: 125 MAKDIMREKELENVCKMQAARIEQLNQLVEKLKEGKELNSITVYSQ-------------- 170
+A I RE LE C QA+ I QLN+LV++ K +E N+I ++
Sbjct: 856 LAGSIRREMALEEFCTKQASEITQLNRLVQQYKHERECNAIIGQTREDKIIRLESLMDGV 915
Query: 171 -------CKEYNSMKDENKLLRSTSSNGHLPSIIEEKSEMKEVQEALAQ-RDVSFDSAEK 222
+E+ S+ E+KLL+ N P +++ K E++ QE + ++ D E+
Sbjct: 916 LSKEDFLDEEFASLLHEHKLLKDMYQNH--PEVLKTKIELERTQEEVENFKNFYGDMGER 973
Query: 223 ESLLDEIRNPRSKLQLYSDAPAKKYTDKXXXXXXXXXXXXXNSGVFSHDSSSE-DLENER 281
E LL+EI++ + +LQ Y D K N+ S D S E LE ER
Sbjct: 974 EVLLEEIQDLKLQLQCYIDPSLKSALKTCTLLKLSYQAPPVNAIPESQDESLEKTLEQER 1033
Query: 282 QRWTEMESEWICLTDELRADLESYRRRAXXXXXXXXXXXXXXXXXDDVLKRAVIGHGKMV 341
WTE E++WI L++ELR +LE+ + + ++ A+ GH +M+
Sbjct: 1034 LCWTEAETKWISLSEELRTELEASKALINKQKHELEIEKRCGEELKEAMQMAMEGHARML 1093
Query: 342 EHYADLQEKYDDLVAKHEAIMEGI 365
E YADL+EK+ L+A+H I +GI
Sbjct: 1094 EQYADLEEKHMQLLARHRRIQDGI 1117
>AT4G26660.1 | Symbols: | INVOLVED IN: biological_process unknown;
LOCATED IN: chloroplast; EXPRESSED IN: 13 plant
structures; EXPRESSED DURING: 7 growth stages; CONTAINS
InterPro DOMAIN/s: Kinesin-related protein
(InterPro:IPR010544); BEST Arabidopsis thaliana protein
match is: unknown protein (TAIR:AT5G55520.2); Has 1807
Blast hits to 1807 proteins in 277 species: Archae - 0;
Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385;
Viruses - 0; Other Eukaryotes - 339 (source: NCBI
BLink). | chr4:13448754-13451814 FORWARD LENGTH=806
Length = 806
Score = 140 bits (353), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 167/362 (46%), Gaps = 76/362 (20%)
Query: 7 SSRAESLAVNLQRGIEIIDYHQQNSALNRSSTSFSFKHLTLTP--EIDKVESYDQTIQQ- 63
S +SLA +LQRG++IIDYHQ +S SS SFSF H+ L P E + + + Q+ ++
Sbjct: 345 SGPTDSLAASLQRGLQIIDYHQGSSLSKSSSVSFSFGHMALKPCAEGENLNASVQSFRKD 404
Query: 64 KPTSDKVTAAFICAYCRTKVSNPNQDSTEVQDSFKSSFETVGQKGNPEGLTDKVPKHLNK 123
K + +++ +C CR KV EV + S+
Sbjct: 405 KASEGGLSSILLCLSCRKKVDQ----EAEVTEEAGSN----------------------- 437
Query: 124 LMAKDIMREKELENVCKMQAARIEQLNQLVEKLKEGKELNSITVYSQCKEYNSMKDENKL 183
EK L+N+C QAA+IE+L L+ K +G++
Sbjct: 438 --------EKHLKNMCMEQAAKIEELTLLLRKSDDGEDGTEF------------------ 471
Query: 184 LRSTSSNGHLPSIIEEKSEMKEVQEALAQRDVSFDSAEKESLLDEIRNPRSKLQLYSDAP 243
I+E E K++ E + +F+ +EKE+LL EI + +SKLQ P
Sbjct: 472 -------------IKETYETKQISEEFGK--TNFEVSEKEALLKEIADLKSKLQ-----P 511
Query: 244 AKKYTDKXXXXXXXXXXXXXNSGVFSHDSSSEDLENERQRWTEMESEWICLTDELRADLE 303
K + + V + +S+DL ER+ WTEMESEWI LTD+LR D++
Sbjct: 512 TKSTDNLRSSLLLRSIQMRKSIDVSRNGENSDDLAKEREMWTEMESEWISLTDDLRMDID 571
Query: 304 SYRRRAXXXXXXXXXXXXXXXXXDDVLKRAVIGHGKMVEHYADLQEKYDDLVAKHEAIME 363
++R RA +D L RAV+GH + +E Y +LQE Y++L KH +M
Sbjct: 572 NHRSRAENLEFELKQEKLATEELNDALTRAVLGHSRFIEQYTELQETYNELGEKHSVMMA 631
Query: 364 GI 365
GI
Sbjct: 632 GI 633
>AT3G23670.2 | Symbols: PAKRP1L, KINESIN-12B |
phragmoplast-associated kinesin-related protein,
putative | chr3:8519290-8525055 FORWARD LENGTH=971
Length = 971
Score = 73.6 bits (179), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 50/89 (56%)
Query: 277 LENERQRWTEMESEWICLTDELRADLESYRRRAXXXXXXXXXXXXXXXXXDDVLKRAVIG 336
LE ER RWTE ES WI L +ELR +L++ R + ++ A+ G
Sbjct: 702 LEQERLRWTEAESNWISLAEELRTELDTNRLLMEKQKRELDTEKRCAEELTEAMQMAMQG 761
Query: 337 HGKMVEHYADLQEKYDDLVAKHEAIMEGI 365
H +M+E YADL+EK+ L+A+H I EGI
Sbjct: 762 HARMIEQYADLEEKHIQLLARHRRIREGI 790