Miyakogusa Predicted Gene
- Lj0g3v0198949.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0198949.1 Non Chatacterized Hit- tr|B9R6Q7|B9R6Q7_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,62.47,0,seg,NULL; Symplekin_C,Symplekin C-terminal; DUF3453,Protein
of unknown function DUF3453; no
descrip,NODE_56710_length_4705_cov_39.415730.path2.1
(1339 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G27595.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Symplekin ... 739 0.0
AT5G01400.1 | Symbols: ESP4 | HEAT repeat-containing protein | c... 530 e-150
AT1G27590.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein of... 221 4e-57
AT1G27570.1 | Symbols: | phosphatidylinositol 3- and 4-kinase f... 207 5e-53
AT1G27570.2 | Symbols: | phosphatidylinositol 3- and 4-kinase f... 128 3e-29
>AT1G27595.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Symplekin tight
junction protein C-terminal (InterPro:IPR022075); BEST
Arabidopsis thaliana protein match is: HEAT
repeat-containing protein (TAIR:AT5G01400.1); Has 593
Blast hits to 297 proteins in 111 species: Archae - 0;
Bacteria - 3; Metazoa - 139; Fungi - 49; Plants - 57;
Viruses - 0; Other Eukaryotes - 345 (source: NCBI BLink).
| chr1:9596475-9602917 FORWARD LENGTH=961
Length = 961
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 409/724 (56%), Positives = 511/724 (70%), Gaps = 26/724 (3%)
Query: 629 KTEVVERPGEVHQNMEENAFLDPSIPLSDLRD-EDISTEKLS--DDTGTNGPD------S 679
+ +++ P E +E + LD +PLS RD E I K S D + PD S
Sbjct: 251 REDLLTVPNECSYPSKEISSLD--VPLSPCRDDEGIRETKYSVPDLDMLSVPDFDQHSPS 308
Query: 680 PSMLEFDEFSPDVQVASTSEDTCLELPQLPPYVQQSKEQESKVKHMAIMHIIESYKHLHG 739
S+ +FD+ P + E++ EL +P YV+ + EQ V +AI IIES +H+ G
Sbjct: 309 ASVPDFDQDPPAASDITAPEESYRELDPVPSYVELTTEQSKTVGKLAIERIIESNRHVFG 368
Query: 740 TDCQQFCMPLLARLVAQIDDDNEIIVMLQKHILEDQWQ-KGXXXXXXXXXXXXXXXXXDS 798
DC + M L+ARL+A+ID +++ +L++ I D + KG D+
Sbjct: 369 FDCNKIRMALIARLIARIDAGSDVATILRELISVDHREFKGHDLVLHVLYHLHSMAILDT 428
Query: 799 DGKSSSSAVLYENFLLGVAKALLEYFPASDKSFSRLLGEAPFLPESTLKILNDLCYSDVI 858
D +SS A +YENFL+ VA++ L+ PASDKSFSRL GEAP LP+S + +L++LC +
Sbjct: 429 D-ESSFYATVYENFLISVARSFLDALPASDKSFSRLFGEAPHLPDSAINLLDELCSTR-- 485
Query: 859 DHD--GRISRDIERVTQGLGAIWGLILGRPQNRQACLEIALKCAVHPQDEIRAKAIRLVT 916
HD G+ D ERVTQGLGA+W LIL RP R+ACL IALKC+VH ++E+RAKAIRLVT
Sbjct: 486 -HDPVGKEVCDSERVTQGLGAVWSLILVRPNERKACLAIALKCSVHSEEEVRAKAIRLVT 544
Query: 917 NKLFQLSYISEDVEKFATKMLLSAVDREV-LDAVQSRPAEPRAEAQVESHEVSSTSQVSE 975
NKL+ L+YI+E VE+FAT MLL+AV+ E L S + EA+ +++ T++
Sbjct: 545 NKLYHLTYIAEHVEQFATDMLLTAVNSETNLSQTASTAEGIKMEAK---SQITLTTESLG 601
Query: 976 PTISENDSARVAKPMIQSPSISFSEAQRFISLFFALCTKKPILLQIVFDVYGKAPRTVKQ 1035
S+ S + + + IS SEAQR ISLFFALC KKP LL++VF+VYG+AP+ V Q
Sbjct: 602 SGNSDIPSQQDLQTSREVSVISISEAQRLISLFFALCKKKPSLLRLVFEVYGRAPKMVNQ 661
Query: 1036 AFHRHIPILVRALGQSYSELLPIISDPPEGSENLLTLVLQILTQDTTPSSDLISTVKHLY 1095
AFHRHIPIL+R LG SY+ELL IISDPP+GSENLLT VLQILTQ+ PS DLI+TVKHLY
Sbjct: 662 AFHRHIPILIRELGSSYTELLQIISDPPKGSENLLTYVLQILTQELAPSLDLIATVKHLY 721
Query: 1096 ETKFKDVTILVPLLSSLSKKEVLPIFPRLVDLPLEKFQRALAHILQGSAHTGPALTPVEV 1155
ETK KDV+IL+PLLSSL+K EVLPIFP L++LP EKFQ ALAHILQGSAHTGPALTP EV
Sbjct: 722 ETKLKDVSILIPLLSSLTKDEVLPIFPPLLNLPPEKFQLALAHILQGSAHTGPALTPAEV 781
Query: 1156 LVAIHGIVPEKDGLALKKITDACSACFEQRTVFTQQVLAKALNQMVDQTPLPLLFMRTVI 1215
L+AIH IVPEKDG LKKITDACSACFEQRTVFTQQVLAKAL QMVD+TPLPLLFMRTVI
Sbjct: 782 LIAIHDIVPEKDGPPLKKITDACSACFEQRTVFTQQVLAKALGQMVDRTPLPLLFMRTVI 841
Query: 1216 QSIDAFPALVDFVMEILSKLVTKQVWRMPKLWVGFLKCVYQTQPRSFHVLLQLPPQQLES 1275
Q+IDAFP LVDFVMEILSKLV KQ+WR+PKLW GFLKCV QT+P SF VLL+LP QLES
Sbjct: 842 QAIDAFPTLVDFVMEILSKLVRKQIWRLPKLWPGFLKCVSQTKPHSFPVLLELPVPQLES 901
Query: 1276 ALNKHANLRAPLASYASQPTVKSSLTRSTLAVLGLANETHVQQHLSTSLHPSEASSSVSG 1335
+ K +LR L +YA+QPT++SSL S L+VLGL N Q + +HPS+A+SS+ G
Sbjct: 902 IMKKFPDLRPSLTAYANQPTIRSSLPNSALSVLGLDN----GQDSRSQMHPSDATSSIHG 957
Query: 1336 ATLT 1339
A LT
Sbjct: 958 AALT 961
Score = 103 bits (258), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 58/112 (51%), Positives = 71/112 (63%), Gaps = 1/112 (0%)
Query: 364 VPQDNEQLATGHEAISKRIRSGPDPCLTLSAQINESGKDLXXXXXXXXXXXXLDSDLNAV 423
+PQ + G E KR+R + LT Q NES + DS+L V
Sbjct: 1 MPQGEDDTING-EVAPKRVRHNTNMHLTQQVQTNESLQGPVSINGISSGNHLSDSELTPV 59
Query: 424 EQMIAVIGALLAEGERGAESLEILISNIHPDLLADIVITNMKHLPKTPPPLA 475
EQM+++IGALLAEG+RGA SLEILIS +HPD+LADIVIT+MKHLP TPP LA
Sbjct: 60 EQMVSMIGALLAEGDRGAASLEILISKLHPDMLADIVITSMKHLPSTPPTLA 111
>AT5G01400.1 | Symbols: ESP4 | HEAT repeat-containing protein |
chr5:162803-171072 REVERSE LENGTH=1467
Length = 1467
Score = 530 bits (1365), Expect = e-150, Method: Compositional matrix adjust.
Identities = 313/658 (47%), Positives = 422/658 (64%), Gaps = 46/658 (6%)
Query: 680 PSML--EFDEFSPDVQVASTSEDTCLELPQL-PPYVQQSKEQESKVKHMAIMHIIESYKH 736
PSM + +EFSP V + + + LP++ P V S E++ ++ + + I+E+YK
Sbjct: 620 PSMSVDKLEEFSPKA-VGTVASASQFVLPKISAPVVDLSDEEKDSLQKLVFLRIVEAYKQ 678
Query: 737 LHGTDCQQFCMPLLARLVAQIDDDNEIIVMLQKHILEDQW-QKGXXXXXXXXXXXXXXXX 795
+ + Q LLA L + + + +LQ+H+L D +G
Sbjct: 679 ISMSGGSQLRFSLLAHLGVEFPSELDPWKILQEHVLSDYLNHEGHELTVRVLYRLYGEAE 738
Query: 796 XDSDGKSSSSAV-LYENFLLGVAKALLEYFPASDKSFSRLLGEAPFLPESTLKILNDLCY 854
+ D SS++A YE+FLL VA+AL + FP SDKS S+LLG++P LP+S L +L C
Sbjct: 739 AEQDFFSSTTAASAYESFLLTVAEALRDSFPPSDKSLSKLLGDSPHLPKSVLMLLESFC- 797
Query: 855 SDVIDHDGRISRDIE---RVTQGLGAIWGLILGRPQNRQACLEIALKCAVHPQDEIRAKA 911
G + +D++ RVTQGL A+W LIL RP R CL IAL+ AVH +EIR KA
Sbjct: 798 ---CPGSGEVEKDLQHGDRVTQGLSAVWSLILMRPGIRNDCLNIALQSAVHHLEEIRMKA 854
Query: 912 IRLVTNKLFQLSYISEDVEKFATKMLLSAV----DREVLDAVQSRPAEPRAEAQVESHEV 967
IRLV NKL+ LS+I+E +E+FA L S V D+ LD ++S P +P+ H +
Sbjct: 855 IRLVANKLYSLSFITEQIEEFAKDRLFSVVSDDCDKMDLD-LKSPPNKPQ-------HSI 906
Query: 968 SSTSQVSEPTISENDSARVAKPMIQSPSISFSEAQRFISLFFALCTKK-------PILLQ 1020
S +S T SE S+ S S +EAQR +SL+FALCTK ++
Sbjct: 907 SG---MSMETPSEATSS----------STSVTEAQRCLSLYFALCTKVLRIFTILRLMTN 953
Query: 1021 IVFDVYGKAPRTVKQAFHRHIPILVRALGQSYSELLPIISDPPEGSENLLTLVLQILTQD 1080
+VF++Y A VKQA H IPILVR +G S SELL II+DPP GS+NLL VLQ LT+
Sbjct: 954 LVFNIYKNASDPVKQAIHLQIPILVRTMGSS-SELLKIIADPPSGSDNLLIQVLQTLTEG 1012
Query: 1081 TTPSSDLISTVKHLYETKFKDVTILVPLLSSLSKKEVLPIFPRLVDLPLEKFQRALAHIL 1140
TPSS+LI T++ L++T+ KDV IL P+L L + +VL IFP +V+LP+EKFQ AL+ +L
Sbjct: 1013 PTPSSELILTIRKLFDTRIKDVEILFPILPFLPRDDVLRIFPHMVNLPMEKFQVALSRVL 1072
Query: 1141 QGSAHTGPALTPVEVLVAIHGIVPEKDGLALKKITDACSACFEQRTVFTQQVLAKALNQM 1200
QGS+ +GP L+P E L+AIH I P +DG+ LK++TDAC+ CF QR FTQQVLA LNQ+
Sbjct: 1073 QGSSQSGPVLSPSEALIAIHSIDPARDGIPLKQVTDACNTCFAQRQTFTQQVLAGVLNQL 1132
Query: 1201 VDQTPLPLLFMRTVIQSIDAFPALVDFVMEILSKLVTKQVWRMPKLWVGFLKCVYQTQPR 1260
V Q PLP+LFMRTV+Q+I AFPAL DF++EILS+LV+KQ+W+ PKLWVGFLKC TQP+
Sbjct: 1133 VQQIPLPMLFMRTVLQAIGAFPALSDFILEILSRLVSKQIWKYPKLWVGFLKCTQTTQPQ 1192
Query: 1261 SFHVLLQLPPQQLESALNKHANLRAPLASYASQPTVKSSLTRSTLAVLGLANETHVQQ 1318
S+ VLLQLPP QL +AL K LRAPL ++ASQP ++SSL RSTLAVLGL ++ Q
Sbjct: 1193 SYKVLLQLPPLQLGNALTKIPALRAPLTAHASQPEIQSSLPRSTLAVLGLVPDSQGTQ 1250
Score = 191 bits (486), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/417 (32%), Positives = 208/417 (49%), Gaps = 28/417 (6%)
Query: 64 VRKLLIQIIEEIGFKAAEDSPKLISVLLTFLRDSDVEVVKQSIVSGSNIFCSGFEELVVQ 123
VRK + +I+ EIG K E P+++ +L+ L D V +Q I G+++F S E + VQ
Sbjct: 62 VRKFVAEILGEIGLKYVELIPEIVPLLIKSLEDETPAVARQVIACGADLFRSTLERVAVQ 121
Query: 124 FQQCGKVERWLEEIWMWMHRFKDAVFGIALEPGAVGIKLLALKFLETFILLFTSDNSNPD 183
++ LE W W+ +FKD + +A + G G+KL A+KF+E ILL+T
Sbjct: 122 GLHSSELNDLLESSWTWLIKFKDEICSVAFKQGNSGVKLCAMKFVEALILLYTPH----- 176
Query: 184 KPATEGVRKAVHISWLVGGHPVLDPVVLMSEANRTLGTLLNLLQ--SAGSLPGCLTITVV 241
EG+ +IS L GGHPVL L EA++ LG LL+ L+ +A SL I ++
Sbjct: 177 ----EGIEADFNISILRGGHPVLKIGDLSIEASQKLGLLLDQLRHPAAKSLNSSTIIVLI 232
Query: 242 NCLAAIARKRPQHYDTILSAMLDFNPNFQTVKGCHVASIQYSLRTTFLGFLRCTY---SP 298
N L+++A+KRP + IL +L +P +KG + A+ +L+T FL L+CT+ +P
Sbjct: 233 NSLSSVAKKRPAYCGRILPVLLSLDP-LSFLKGVYAAATNLALKTVFLSCLKCTHPAAAP 291
Query: 299 XXXXXXXXXXXXXGMNAGDAADQVIRQVDKMIKNGDRSTRDARASKDEVPSTQSPVSGE- 357
G A A + + + I++ D S D + S +E P S E
Sbjct: 292 DRLTSALKEIEGGGQAA--KAKDLFYKTNGSIQDKD-SVEDTKVSVEENPLCASSDVAES 348
Query: 358 -LSRKRPVPQDNEQLATGHEAISKRIRSGPDPCLTLSAQINESGKDLXXXXXXXXXXXXL 416
LSRKR + N L G + KR R P + +N G D
Sbjct: 349 NLSRKRSGSEYNIDL-NGDASDGKRARITPSVSEESTDGLN--GNDGVSLPRVASTSTGP 405
Query: 417 D-----SDLNAVEQMIAVIGALLAEGERGAESLEILISNIHPDLLADIVITNMKHLP 468
SD +Q++ + G L+++GE+ SLEILIS+I DLL D+V+ NM ++P
Sbjct: 406 SDSRGVSDSGPAQQLVGLFGTLVSQGEKAIGSLEILISSISADLLTDVVMANMHNIP 462
>AT1G27590.1 | Symbols: | CONTAINS InterPro DOMAIN/s: Protein of
unknown function DUF3453 (InterPro:IPR021850); BEST
Arabidopsis thaliana protein match is:
phosphatidylinositol 3- and 4-kinase family protein
(TAIR:AT1G27570.1); Has 109 Blast hits to 109 proteins
in 38 species: Archae - 0; Bacteria - 0; Metazoa - 65;
Fungi - 4; Plants - 38; Viruses - 0; Other Eukaryotes -
2 (source: NCBI BLink). | chr1:9592293-9593912 FORWARD
LENGTH=264
Length = 264
Score = 221 bits (562), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/224 (50%), Positives = 145/224 (64%)
Query: 19 NNHGVLSVKTSSLKQAKXXXXXXXXXXXXXXXXXXXXXXXXXXXXVRKLLIQIIEEIGFK 78
NNHG L+VK SSL+Q K VRK LI+IIEE+G +
Sbjct: 21 NNHGDLAVKLSSLRQVKEILLSLEPSLSAEIFPYLAELHLSREILVRKSLIEIIEEVGLR 80
Query: 79 AAEDSPKLISVLLTFLRDSDVEVVKQSIVSGSNIFCSGFEELVVQFQQCGKVERWLEEIW 138
+ S L+SVLL RD D V K+SI G+ FC+ EE+ +QF GKV+RW E+W
Sbjct: 81 MLDHSYVLVSVLLVLSRDEDPTVAKKSISVGTTFFCTILEEMAMQFHHRGKVDRWCGELW 140
Query: 139 MWMHRFKDAVFGIALEPGAVGIKLLALKFLETFILLFTSDNSNPDKPATEGVRKAVHISW 198
WM +FKD VF ALEPG VG+K+LALKF+ETFILLFT D S+P+K ++EG R +ISW
Sbjct: 141 TWMVKFKDTVFATALEPGCVGVKVLALKFMETFILLFTPDASDPEKASSEGSRHMFNISW 200
Query: 199 LVGGHPVLDPVVLMSEANRTLGTLLNLLQSAGSLPGCLTITVVN 242
L GGHP+L+P LMSEANRT G L++ +QSA LPG LTI+V++
Sbjct: 201 LAGGHPILNPATLMSEANRTFGILVDFIQSANRLPGALTISVIS 244
>AT1G27570.1 | Symbols: | phosphatidylinositol 3- and 4-kinase
family protein | chr1:9575538-9582376 FORWARD LENGTH=649
Length = 649
Score = 207 bits (527), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 138/356 (38%), Positives = 191/356 (53%), Gaps = 39/356 (10%)
Query: 1 MAAPPPP--KDQFLSFLSSANNHGVLSVKTSSLKQAKXXXXXXXXXXXXXXXXXXXXXXX 58
MAAP K Q L+ L++A NHG L+VK SSLK+ K
Sbjct: 1 MAAPADATTKGQALALLAAAKNHGDLAVKLSSLKEVKEILLSLEPSLSAEIFPYLRELCL 60
Query: 59 XXXXXVRKLLIQIIEEIGFKAAEDSPKLISVLLTFLRDSDVEVVKQSIVSGSNIFCSGFE 118
VR+ LI+IIEE+G + E S L+SVL+ + D+D V ++SI +G+ F S E
Sbjct: 61 SPEVLVRRSLIEIIEEVGLRMLEHSYVLVSVLIHLVGDNDPTVAEKSISTGTTFFRSILE 120
Query: 119 ELVVQFQQCGKVERWLEEIWMWMHRFKDAVFGIA--LEPG-AVGIKLLALKFLETFILLF 175
++ QF GKV+RW +W M FKDAVF IA LEPG VG+K+LALKF+ETFILL
Sbjct: 121 KMETQFHHRGKVDRWCVNLWTLMLMFKDAVFNIALDLEPGRVVGVKVLALKFMETFILLI 180
Query: 176 TSDNSNPDKPAT--EGVRKAVHISWLVGGHPVLDPVVLMSEANRTLGTLLNLLQSAGSLP 233
T S+P+K +T EG R+ ++IS L G P+L+ LMSE N+TL L + LQ+ +
Sbjct: 181 TPHASDPEKVSTSSEGSRQMINISSLAAGLPMLNLTGLMSEVNQTLVRLGSFLQAPTLIQ 240
Query: 234 GCLTITVVNC-----LAAIARKRPQHYDTILSAMLDFNPNFQTVKGCHVASIQYSLRTTF 288
L I V++C LA +ARKRP HYDT+LS
Sbjct: 241 DALPIAVIDCSLSFSLAVVARKRPVHYDTVLSV--------------------------- 273
Query: 289 LGFLRCTYSPXXXXXXXXXXXXXGMNAGDAADQVIRQVDKMIKNGDRSTRDARASK 344
LGFL+CT SP M+ D +DQV+R+VD++ + + + + R+S+
Sbjct: 274 LGFLKCTSSPIVESRDLLFRAFPAMDPADISDQVVREVDELFRVNEHAANENRSSQ 329
>AT1G27570.2 | Symbols: | phosphatidylinositol 3- and 4-kinase
family protein | chr1:9575538-9582376 FORWARD LENGTH=640
Length = 640
Score = 128 bits (322), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 162/363 (44%), Gaps = 62/363 (17%)
Query: 1 MAAPP--PPKDQFLSFLSSANNHGVLSVKTSSLKQAKXXXXXXXXXXXXXXXXXXXXXXX 58
MAAP K Q L+ L++A NHG L+VK SSLK+ K
Sbjct: 1 MAAPADATTKGQALALLAAAKNHGDLAVKLSSLKEVKEILLSLEPSLSAEIFPYLRELCL 60
Query: 59 XXXXXVRKLLIQIIEEIGFKAAEDSPKLISVLLTFLRDSDVEVVKQSIVSGSNIFCSGFE 118
VR+ LI+IIEE+ K IS TF R S +E ++ + + S + G+
Sbjct: 61 SPEVLVRRSLIEIIEEV--------EKSISTGTTFFR-SILEKMETQVHTVSQV---GY- 107
Query: 119 ELVVQFQQCGKVERWLEEIWMWMHRFKDAVFGIALEPGAVGIKLLALKFLETFILLFTSD 178
+ F GKV+RW +W M FKDAVF IAL+ L+ L F L
Sbjct: 108 ---LVFHHRGKVDRWCVNLWTLMLMFKDAVFNIALDLERRYYVLIMQFLLLLFRLYCILS 164
Query: 179 NSNPD------------KPATEGVRKAVHISWLVGGHPVLDPVVLMSEANRTLGTLLNLL 226
+ P +T G R+ ++IS L G P+L+ LMSE N+TL L + L
Sbjct: 165 GTAPYMLCSFCCARVNFSNSTAGSRQMINISSLAAGLPMLNLTGLMSEVNQTLVRLGSFL 224
Query: 227 QSAGSLPGCLTITVVNC-----LAAIARKRPQHYDTILSAMLDFNPNFQTVKGCHVASIQ 281
Q+ + L I V++C LA +ARKRP HYDT+LS
Sbjct: 225 QAPTLIQDALPIAVIDCSLSFSLAVVARKRPVHYDTVLSV-------------------- 264
Query: 282 YSLRTTFLGFLRCTYSPXXXXXXXXXXXXXGMNAGDAADQVIRQVDKMIKNGDRSTRDAR 341
LGFL+CT SP M+ D +DQV+R+VD++ + + + + R
Sbjct: 265 -------LGFLKCTSSPIVESRDLLFRAFPAMDPADISDQVVREVDELFRVNEHAANENR 317
Query: 342 ASK 344
+S+
Sbjct: 318 SSQ 320