Miyakogusa Predicted Gene
- Lj0g3v0198949.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0198949.1 Non Characterized Hit- tr|B9R6Q7|B9R6Q7_RICCO
Putative uncharacterized protein OS=Ricinus communis
G,62.47,0,seg,NULL; Symplekin_C,Symplekin C-terminal; DUF3453,Protein
of unknown function DUF3453; no
descrip,NODE_56710_length_4705_cov_39.415730.path2.1
(1339 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr3g111090.1 | symplekin tight junction protein carboxy-termi... 1942 0.0
Medtr4g015850.1 | symplekin tight junction protein carboxy-termi... 573 e-163
>Medtr3g111090.1 | symplekin tight junction protein carboxy-terminal
protein | HC | chr3:51920629-51901963 | 20130731
Length = 1338
Score = 1942 bits (5030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 983/1342 (73%), Positives = 1068/1342 (79%), Gaps = 7/1342 (0%)
Query: 1 MAAPPPPKDQFLSFLSSANNHGVLSVKTSSLKQAKXXXXXXXXXXXXXXXXXXXXXXXXX 60
MAAPPP KDQ LS L++ANNHG +SVKT+SLKQAK
Sbjct: 1 MAAPPPTKDQVLSLLAAANNHGDISVKTTSLKQAKDLLLSIDPSLAADLFPYLLELQSSH 60
Query: 61 XXXVRKLLIQIIEEIGFKAAEDSPKLISVLLTFLRDSDVEVVKQSIVSGSNIFCSGFEEL 120
VRKLLIQIIEEIGF+A + SP LIS LLTFLRD+D VVKQSI+SG+NIFC+ FEEL
Sbjct: 61 QPLVRKLLIQIIEEIGFRAVQHSPTLISSLLTFLRDTDATVVKQSIISGTNIFCACFEEL 120
Query: 121 VVQFQQCGKVERWLEEIWMWMHRFKDAVFGIALEPGAVGIKLLALKFLETFILLFTSDNS 180
++QFQQCGKVERWLEEIWMWM +FK+AVF IALE G+VGIKLLALKFLE F+LLFTSD S
Sbjct: 121 ILQFQQCGKVERWLEEIWMWMFKFKEAVFEIALEGGSVGIKLLALKFLEIFVLLFTSDIS 180
Query: 181 NPDKPATEGVRKAVHISWLVGGHPVLDPVVLMSEANRTLGTLLNLLQSAGSLPGCLTITV 240
+ +K ATEGVR+AV+ISWLVG HPVLDP+VLM+EANRT+G LL LLQ AG+ PGCLTITV
Sbjct: 181 DSEKSATEGVRQAVNISWLVGSHPVLDPMVLMTEANRTIGILLKLLQCAGNTPGCLTITV 240
Query: 241 VNCLAAIARKRPQHYDTILSAMLDFNPNFQTVKGCHVASIQYSLRTTFLGFLRCTYSPXX 300
VNCLAAIARKR QHYDTILSA+LDF+PN QTVKGCHV SIQYSLRT FLGFLRCTYSP
Sbjct: 241 VNCLAAIARKRSQHYDTILSALLDFDPNVQTVKGCHVPSIQYSLRTAFLGFLRCTYSPII 300
Query: 301 XXXXXXXXXXXGMNAGDAADQVIRQVDKMIKNGDRSTRDARASKDEVPSTQSPVSGELSR 360
MNAGDAADQVIRQVDKMIK+ DR RDAR +KD+ PS Q PVSGE SR
Sbjct: 301 ESRERLIRSLRAMNAGDAADQVIRQVDKMIKSADRFIRDARVNKDDQPSNQLPVSGESSR 360
Query: 361 KRPVPQDNEQLATGHEAISKRIRSGPDPCLTLSAQINESGKDLXXXXXXXXXXXXLDSDL 420
KRPVP DNEQLA GHEAI+KRIRSGPD TL AQ+N+SG+D L+S+L
Sbjct: 361 KRPVPHDNEQLANGHEAIAKRIRSGPDSDFTLPAQVNDSGRDHSSVNGVSPNVPVLESEL 420
Query: 421 NAVEQMIAVIGALLAEGERGAESLEILISNIHPDLLADIVITNMKHLPKTPPPLARLGNX 480
AVEQMIAVIGAL+AEGERGA+SLEILIS IHPDLLADIVI NMKHLPK PPPLARL +
Sbjct: 421 TAVEQMIAVIGALIAEGERGAKSLEILISQIHPDLLADIVIANMKHLPKAPPPLARLESP 480
Query: 481 XXXXXXXXXXXXXXXXXXXXXXXXXXXLAVTDQXXXXXXXXXXXXXXXX--XXXXXXNQP 538
L V+ Q N P
Sbjct: 481 SVTRPVGSLVSQSHVITTSASMSSVQSLTVSAQAQVSSTTAISAATTSSPSDTSNFSNLP 540
Query: 539 ADSKXXXXXXXXXXXXXXGAISPGGVAASTTDDTGATKLEVDDPVSSNTPVSHPVASTND 598
ADSK GAI+PGG A S TDDT AT LE +DPVS P SH VAST+D
Sbjct: 541 ADSKRDPRRDPRRLDPRRGAITPGGAAVSVTDDTAATHLESEDPVSFIKPASHHVASTDD 600
Query: 599 NTPSDLTMKIKNDDMVSEGPPVSAPDGITPKTEVVERPGEVHQNMEENAFLDPSIPLSDL 658
+ S+LT+KI+NDDM+SEGPPV PD ++PKTE +E PG+ HQ ME NA +DP + +D
Sbjct: 601 DIQSNLTIKIENDDMISEGPPVPGPDRVSPKTETLEGPGD-HQIMEANASMDPEVYSTDS 659
Query: 659 RDEDISTEKLSDDTGTNGPDSPSMLEFDEFSPDVQVASTSEDTCLELPQLPPYVQQSKEQ 718
+DE++ST L DD TNG DS S+LEFD+FS DVQV+ TSEDTCLELPQLPPY+Q S+EQ
Sbjct: 660 KDENLSTANLLDDNETNGIDSSSILEFDQFSVDVQVSPTSEDTCLELPQLPPYIQLSQEQ 719
Query: 719 ESKVKHMAIMHIIESYKHLHGTDCQQFCMPLLARLVAQIDDDNEIIVMLQKHILEDQWQK 778
ESKVKHMAI HIIESY HL G DCQQFCMPLLARLVAQID+DN II MLQKHILED W+K
Sbjct: 720 ESKVKHMAISHIIESYNHLQGADCQQFCMPLLARLVAQIDNDNVIITMLQKHILEDHWRK 779
Query: 779 GXXXXXXXXXXXXXXXXXDSDGKSSSSAVLYENFLLGVAKALLEYFPASDKSFSRLLGEA 838
G DS G +SSSAVLY+NFLLGVAK +L+ FPASDKSFS+LLGE
Sbjct: 780 GHELVLHVLYHLHSLMILDSAGNTSSSAVLYDNFLLGVAKTVLDSFPASDKSFSKLLGEV 839
Query: 839 PFLPESTLKILNDLCYSDVIDHDGRISRDIERVTQGLGAIWGLILGRPQNRQACLEIALK 898
PFLPES LKIL+DLCYSDV+DHDG+I RDIERVTQGLGAIW LILGRPQNRQ CL IALK
Sbjct: 840 PFLPESALKILDDLCYSDVVDHDGKIIRDIERVTQGLGAIWSLILGRPQNRQGCLGIALK 899
Query: 899 CAVHPQDEIRAKAIRLVTNKLFQLSYISEDVEKFATKMLLSAVDREVLDAVQSRPAEPRA 958
CAVHPQDEIRAKAIRLVTNKLFQLSYI+EDVEKFATKMLLSAVD EV +A QS P E R
Sbjct: 900 CAVHPQDEIRAKAIRLVTNKLFQLSYIAEDVEKFATKMLLSAVDHEVSNAAQSGPTEQRT 959
Query: 959 EAQVESHEVSSTSQVSEPTISENDSARVAKPMIQSPSISFS-EAQRFISLFFALCTKKPI 1017
EAQVES E+S TSQV EP +ND A VAKP QS S EAQR ISLFFALCTKKP
Sbjct: 960 EAQVESLEISGTSQVKEP---KNDPAGVAKPSSQSVSSISFSEAQRLISLFFALCTKKPS 1016
Query: 1018 LLQIVFDVYGKAPRTVKQAFHRHIPILVRALGQSYSELLPIISDPPEGSENLLTLVLQIL 1077
LLQIVFDVYG+A RTVKQAFHRHIP LVRALGQSYSELL IISDPP+GSENLLTLVLQIL
Sbjct: 1017 LLQIVFDVYGQASRTVKQAFHRHIPNLVRALGQSYSELLHIISDPPKGSENLLTLVLQIL 1076
Query: 1078 TQDTTPSSDLISTVKHLYETKFKDVTILVPLLSSLSKKEVLPIFPRLVDLPLEKFQRALA 1137
TQDTTPSSDLISTVKHLYETKF+DVTILVPLLSSLSK EVLPIFPRLVDLPLEKFQRALA
Sbjct: 1077 TQDTTPSSDLISTVKHLYETKFRDVTILVPLLSSLSKTEVLPIFPRLVDLPLEKFQRALA 1136
Query: 1138 HILQGSAHTGPALTPVEVLVAIHGIVPEKDGLALKKITDACSACFEQRTVFTQQVLAKAL 1197
HILQGSAHTGPALTPVEVLVAIHGIVPEKDGLALKKITDACSACFEQRTVFTQQVLAKAL
Sbjct: 1137 HILQGSAHTGPALTPVEVLVAIHGIVPEKDGLALKKITDACSACFEQRTVFTQQVLAKAL 1196
Query: 1198 NQMVDQTPLPLLFMRTVIQSIDAFPALVDFVMEILSKLVTKQVWRMPKLWVGFLKCVYQT 1257
NQMVDQTPLPLLFMRT+IQ++DAFPALVDFVMEILSKLVTKQVWRMPKLWVGFLKCVYQT
Sbjct: 1197 NQMVDQTPLPLLFMRTIIQAVDAFPALVDFVMEILSKLVTKQVWRMPKLWVGFLKCVYQT 1256
Query: 1258 QPRSFHVLLQLPPQQLESALNKHANLRAPLASYASQPTVKSSLTRSTLAVLGLANETHVQ 1317
QPRSFHVLLQLPPQQLESALNKHANLR PLASYA+QPTVKSSL+RSTL VLGLANETHVQ
Sbjct: 1257 QPRSFHVLLQLPPQQLESALNKHANLRGPLASYANQPTVKSSLSRSTLVVLGLANETHVQ 1316
Query: 1318 QHLSTSLHPSEASSSVSGATLT 1339
QHL TSLH SE S+SVSGATLT
Sbjct: 1317 QHLPTSLHHSETSTSVSGATLT 1338
>Medtr4g015850.1 | symplekin tight junction protein carboxy-terminal
protein | HC | chr4:4819906-4805018 | 20130731
Length = 1338
Score = 573 bits (1478), Expect = e-163, Method: Compositional matrix adjust.
Identities = 335/710 (47%), Positives = 453/710 (63%), Gaps = 25/710 (3%)
Query: 651 PSIPLSDLRDEDISTEK---LSDDTGTNGPDSPSMLEFDEFSP-----DVQVASTSEDTC 702
PS+ +D++ ED S E+ L + + +N S S + +E SP DV +S T
Sbjct: 623 PSLASTDMQIEDASQEQATSLDNRSPSNLVPSMSADKSEELSPKAVAPDVNSLVSSTATS 682
Query: 703 LELPQ-------LPPYVQQSKEQESKVKHMAIMHIIESYKHLHGTDCQQFCMPLLARLVA 755
LP + P V + EQ ++ M II++YK + + +LA L
Sbjct: 683 AVLPSRLVLPKMIAPVVDLADEQRDHLQTSCFMRIIDAYKQISVAGGSKVRFSILAYLGV 742
Query: 756 QIDDDNEIIVMLQKHILED-QWQKGXXXXXXXXXXXXXXXXXDSDGKSSSSAV-LYENFL 813
+ + + +LQKHIL D +G + D SS++A +YE FL
Sbjct: 743 EFPLELDPWKLLQKHILIDYSGHEGHELTLRVLYRLFGEAEVEPDFFSSTTAASVYETFL 802
Query: 814 LGVAKALLEYFPASDKSFSRLLGEAPFLPESTLKILNDLCY---SDVIDHDGRISRDIER 870
L VA+AL + FP SDKS S+LLGE+P+LP+S LKIL ++C D I+ + + +++R
Sbjct: 803 LTVAEALRDSFPPSDKSLSKLLGESPYLPKSVLKILENMCSPGNGDKIEKESH-TLNVDR 861
Query: 871 VTQGLGAIWGLILGRPQNRQACLEIALKCAVHPQDEIRAKAIRLVTNKLFQLSYISEDVE 930
VTQGL A+W LIL RP R+ CL+IAL+ AVH +E+R KAIRLV NKL+ LS IS+++E
Sbjct: 862 VTQGLSAVWSLILLRPPIRETCLKIALQSAVHHVEEVRMKAIRLVANKLYPLSSISKEIE 921
Query: 931 KFATKMLLSAVDREVLDAV-QSRPAEPRAEAQVESHEVSSTSQVSEPTISENDSARVAKP 989
FA +M+ S + + +A + A+ + VE ++ +S T +D+ + +
Sbjct: 922 DFAKEMVFSVMSSDASEATDEGSVADSQKGPDVE-KVINEPLSLSGSTKDVSDNRQSSTS 980
Query: 990 MIQSPSISFSEAQRFISLFFALCTKKPILLQIVFDVYGKAPRTVKQAFHRHIPILVRALG 1049
SP +S SEAQR +SLFFALCTKK L + +F +Y + KQA HR IPILVR LG
Sbjct: 981 EGTSP-LSVSEAQRGMSLFFALCTKKHSLFREIFVIYRSTSKAAKQAIHRQIPILVRTLG 1039
Query: 1050 QSYSELLPIISDPPEGSENLLTLVLQILTQDTTPSSDLISTVKHLYETKFKDVTILVPLL 1109
S S+LL IISDPP GSENLL VL LT T PS DLI VK L+++K KD IL+P+L
Sbjct: 1040 SS-SDLLEIISDPPNGSENLLMQVLHTLTDGTIPSKDLIVAVKRLHDSKLKDAEILIPIL 1098
Query: 1110 SSLSKKEVLPIFPRLVDLPLEKFQRALAHILQGSAHTGPALTPVEVLVAIHGIVPEKDGL 1169
L K EV+ +FP +V+LP EKFQ AL ILQGS+ +GP LTP EVL+AIHGI PE+DG+
Sbjct: 1099 PFLLKDEVMAVFPNIVNLPPEKFQGALGRILQGSSQSGPVLTPAEVLIAIHGIDPERDGI 1158
Query: 1170 ALKKITDACSACFEQRTVFTQQVLAKALNQMVDQTPLPLLFMRTVIQSIDAFPALVDFVM 1229
LKK+TDAC+ACFEQR FTQ+VLAK LNQ+V+Q PLPLLFMRTV+Q+I AFP LVDF+M
Sbjct: 1159 PLKKVTDACNACFEQRQTFTQEVLAKVLNQLVEQIPLPLLFMRTVLQAIGAFPTLVDFIM 1218
Query: 1230 EILSKLVTKQVWRMPKLWVGFLKCVYQTQPRSFHVLLQLPPQQLESALNKHANLRAPLAS 1289
IL++LV KQVW+ PKLWVGFLKC+ T+P+SF VLLQLPP QLE+ALN+ A L+APL +
Sbjct: 1219 GILARLVKKQVWKYPKLWVGFLKCLQLTKPQSFGVLLQLPPPQLEAALNRIAALKAPLIA 1278
Query: 1290 YASQPTVKSSLTRSTLAVLGLANETHVQQHLSTSLHPSEASSSVSGATLT 1339
+ASQP ++SSL RS L VLG+ +++ V TS + +S+ T+T
Sbjct: 1279 HASQPDIQSSLPRSVLVVLGIVSDSQVPSQPQTSQTQTGETSNSDKDTMT 1328
Score = 211 bits (536), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 141/417 (33%), Positives = 213/417 (51%), Gaps = 8/417 (1%)
Query: 64 VRKLLIQIIEEIGFKAAEDSPKLISVLLTFLRDSDVEVVKQSIVSGSNIFCSGFEELVVQ 123
VRK + ++I EIG K E P ++ VL+ L D VV+Q+I+ G +F S E++ +Q
Sbjct: 67 VRKFVTEMIGEIGLKNTEFLPDIVPVLIDVLDDDTPAVVRQAILCGIRLFRSTLEKIAIQ 126
Query: 124 FQQCGKVERWLEEIWMWMHRFKDAVFGIALEPGAVGIKLLALKFLETFILLFTSDNSNPD 183
++ +E W WM +FKD V+ IA + G G KLLALKF+E I L+T D +
Sbjct: 127 GLFSSDLDSAVESAWEWMVKFKDKVYSIAFQNGRGGAKLLALKFVEAVIRLYTLDPNGSA 186
Query: 184 KPAT-EGVRKAVHISWLVGGHPVLDPVVLMSEANRTLGTLLNLLQ--SAGSLPGCLTITV 240
+P + +G R +ISWL HPVL+ + EA+ +LG LL+ L+ + SL + I +
Sbjct: 187 EPNSHQGKRPVFNISWLRRDHPVLNIGDMSMEASNSLGLLLDQLRFPTVKSLGNSVIIVL 246
Query: 241 VNCLAAIARKRPQHYDTILSAMLDFNPNFQTVKGCHVASIQYSLRTTFLGFLRCTYSPXX 300
+ L+AIA +RP Y IL +L P+ V G V++ +L+ FL CT+
Sbjct: 247 IKSLSAIAIERPAFYGRILPVLLSLEPSTSVVNGVCVSAAHLALKKAFLTCTECTHPSAA 306
Query: 301 XXXXXXXXXXXGMNAGDAADQVIRQVDKMIKNGDRSTRDARAS--KDEVPSTQSPVSG-- 356
M + ADQV + ++ + D + K+E + S SG
Sbjct: 307 PWRDRLAGALKEMQSEGKADQVFHPISASNESILQREEDYQPVIIKEEDTAGSSFDSGHV 366
Query: 357 ELSRKRPVPQDNEQLATGHEAISKRIRSGPDPCLTLSAQINESGKDLXXXX-XXXXXXXX 415
L RKR Q+ LA + KR+R+ D T +++E +
Sbjct: 367 NLGRKRSGSQNGSDLAEDADVPGKRVRTTTDGLKTPKMELDECTANTQNDTPSTVPAFSK 426
Query: 416 LDSDLNAVEQMIAVIGALLAEGERGAESLEILISNIHPDLLADIVITNMKHLPKTPP 472
D D V+Q++A+ GAL+A+GE SLEILISNI DLLA++V+ NM++LP P
Sbjct: 427 GDVDNGPVQQLVAMFGALVAQGEMAVASLEILISNISADLLAEVVMANMRYLPPNCP 483