Miyakogusa Predicted Gene
- Lj0g3v0248759.3
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0248759.3 Non Characterized Hit- tr|I1JLE4|I1JLE4_SOYBN
Uncharacterized protein OS=Glycine max PE=4
SV=1,78.84,0,DUF3453,Protein of unknown function DUF3453;
SYMPLEKIN,NULL; FAMILY NOT NAMED,NULL; ARM repeat,Armad,CUFF.16295.3
(768 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr4g015850.1 | symplekin tight junction protein carboxy-termi... 1041 0.0
Medtr3g111090.1 | symplekin tight junction protein carboxy-termi... 171 2e-42
>Medtr4g015850.1 | symplekin tight junction protein carboxy-terminal
protein | HC | chr4:4819906-4805018 | 20130731
Length = 1338
Score = 1041 bits (2693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 536/768 (69%), Positives = 601/768 (78%), Gaps = 7/768 (0%)
Query: 1 MVGKPMATATSRERLASLVNSANMAIDIPSKLESLRQLRLELPQEDPVXXXXXXXXXXXX 60
MVGK MA TSRE+LASLV+SA +A DIPSK+ESLR+L++ELPQEDPV
Sbjct: 1 MVGKAMAV-TSREKLASLVHSAKLASDIPSKIESLRRLKIELPQEDPVLLTEFLPPIFDF 59
Query: 61 XSDHFSPVRKFVTEMVGEIGLKNTEFLSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRST 120
SD FSPVRKFVTEM+GEIGLKNTEFL RQAILCGI LFRST
Sbjct: 60 LSDQFSPVRKFVTEMIGEIGLKNTEFLPDIVPVLIDVLDDDTPAVVRQAILCGIRLFRST 119
Query: 121 LQKLAIQGIYSSDLDGALESAWSWMVRFKDKVYTLAFQHGSSGAKLLALKFVEAVIHLYT 180
L+K+AIQG++SSDLD A+ESAW WMV+FKDKVY++AFQ+G GAKLLALKFVEAVI LYT
Sbjct: 120 LEKIAIQGLFSSDLDSAVESAWEWMVKFKDKVYSIAFQNGRGGAKLLALKFVEAVIRLYT 179
Query: 181 PDPNGSSEPTPHQGRNMEFNISWLRRGHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLS 240
DPNGS+EP HQG+ FNISWLRR HP+LN+GD+S+EAS+SLGLLLDQLRFPTVKSL
Sbjct: 180 LDPNGSAEPNSHQGKRPVFNISWLRRDHPVLNIGDMSMEASNSLGLLLDQLRFPTVKSLG 239
Query: 241 NAVIIVVIKSLSAIAINRPAFYGRIXXXXXXXXXXXXXXNGVCVTAVHLALKNAFLSCSK 300
N+VIIV+IKSLSAIAI RPAFYGRI NGVCV+A HLALK AFL+C++
Sbjct: 240 NSVIIVLIKSLSAIAIERPAFYGRILPVLLSLEPSTSVVNGVCVSAAHLALKKAFLTCTE 299
Query: 301 CTHPSAAPWRDRLAGALKEMQSEGKADRVFHLISPSNGSI-QREKDDQPV-IKEEEPAVN 358
CTHPSAAPWRDRLAGALKEMQSEGKAD+VFH IS SN SI QRE+D QPV IKEE+ A +
Sbjct: 300 CTHPSAAPWRDRLAGALKEMQSEGKADQVFHPISASNESILQREEDYQPVIIKEEDTAGS 359
Query: 359 SSDSVHIKLARKRSGSQDRGDLSVDEDVPGKRVRTIV-GLTEPKKELDDCAVNSQDDAHS 417
S DS H+ L RKRSGSQ+ DL+ D DVPGKRVRT GL PK ELD+C N+Q+D S
Sbjct: 360 SFDSGHVNLGRKRSGSQNGSDLAEDADVPGKRVRTTTDGLKTPKMELDECTANTQNDTPS 419
Query: 418 T-PTSSKGDVENGPVQQLVAMFGALVAQGEKAVXXXXXXXXXXXXXXXXXVVMANMRYLP 476
T P SKGDV+NGPVQQLVAMFGALVAQGE AV VVMANMRYLP
Sbjct: 420 TVPAFSKGDVDNGPVQQLVAMFGALVAQGEMAVASLEILISNISADLLAEVVMANMRYLP 479
Query: 477 PNYPHAEGNDEQLHEICIFGSDDKAKYPPSFVAGVMSLSTTFPPVASLLETHRSVSNDVV 536
PN P+A+G DEQLH+I IFGS DKAKYP SFVAGVMSLS+TFPPVASLL+ H+SVSND+V
Sbjct: 480 PNCPNADG-DEQLHDISIFGSQDKAKYPQSFVAGVMSLSSTFPPVASLLDAHQSVSNDLV 538
Query: 537 -SHEEEEISATGVGSAVVHSGMNLSAVNVSSPPDFPSSDACIPGVENGSTAMASDIHDVG 595
S +EEIS+TGV S+V+HSGM LS+ N SP DFPSSD CIPGVEN ST++ DI DVG
Sbjct: 539 KSQGDEEISSTGVDSSVIHSGMILSSQNAPSPTDFPSSDTCIPGVENVSTSLPPDIDDVG 598
Query: 596 NLESGIPGLDSCGRSDXXXXXXXXXXXXXXXVQLEDADQEQVTSLDQRSSLNLVPSISTD 655
NLESGIPGLDS GR+D +Q+EDA QEQ TSLD RS NLVPS+S D
Sbjct: 599 NLESGIPGLDSFGRNDALSETLAAPSLASTDMQIEDASQEQATSLDNRSPSNLVPSMSAD 658
Query: 656 RSEELSPKGVVTDANSIVSSTATSAVLPSRLVLPKMIAPVVDLADEQKDDLQMSCFMRII 715
+SEELSPK V D NS+VSSTATSAVLPSRLVLPKMIAPVVDLADEQ+D LQ SCFMRII
Sbjct: 659 KSEELSPKAVAPDVNSLVSSTATSAVLPSRLVLPKMIAPVVDLADEQRDHLQTSCFMRII 718
Query: 716 DALKQIAEAGGSKVRFSILAYLGVEFPLDLDPWKLLQKHILIDYTSHE 763
DA KQI+ AGGSKVRFSILAYLGVEFPL+LDPWKLLQKHILIDY+ HE
Sbjct: 719 DAYKQISVAGGSKVRFSILAYLGVEFPLELDPWKLLQKHILIDYSGHE 766
>Medtr3g111090.1 | symplekin tight junction protein carboxy-terminal
protein | HC | chr3:51920629-51901963 | 20130731
Length = 1338
Score = 171 bits (434), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 132/476 (27%), Positives = 217/476 (45%), Gaps = 16/476 (3%)
Query: 11 SRERLASLVNSANMAIDIPSKLESLRQLRLELPQEDPVXXXXXXXXXXXXXSDHFSPVRK 70
+++++ SL+ +AN DI K SL+Q + L DP S H VRK
Sbjct: 7 TKDQVLSLLAAANNHGDISVKTTSLKQAKDLLLSIDPSLAADLFPYLLELQSSHQPLVRK 66
Query: 71 FVTEMVGEIGLKNTEFLSXXXXXXXXXXXXXXXXXXRQAILCGIDLFRSTLQKLAIQGIY 130
+ +++ EIG + + +Q+I+ G ++F + ++L +Q
Sbjct: 67 LLIQIIEEIGFRAVQHSPTLISSLLTFLRDTDATVVKQSIISGTNIFCACFEELILQFQQ 126
Query: 131 SSDLDGALESAWSWMVRFKDKVYTLAFQHGSSGAKLLALKFVEAVIHLYTPDPNGSSEPT 190
++ LE W WM +FK+ V+ +A + GS G KLLALKF+E + L+T D + SE +
Sbjct: 127 CGKVERWLEEIWMWMFKFKEAVFEIALEGGSVGIKLLALKFLEIFVLLFTSDIS-DSEKS 185
Query: 191 PHQGRNMEFNISWLRRGHPILNVGDLSIEASHSLGLLLDQLRFPTVKSLSNAVIIVVIKS 250
+G NISWL HP+L+ L EA+ ++G+LL L+ + + I V+
Sbjct: 186 ATEGVRQAVNISWLVGSHPVLDPMVLMTEANRTIGILLKLLQ--CAGNTPGCLTITVVNC 243
Query: 251 LSAIAINRPAFYGRIXXXXXXXXXXXXXXNGVCVTAVHLALKNAFLSCSKCTHPSAAPWR 310
L+AIA R Y I G V ++ +L+ AFL +CT+ R
Sbjct: 244 LAAIARKRSQHYDTILSALLDFDPNVQTVKGCHVPSIQYSLRTAFLGFLRCTYSPIIESR 303
Query: 311 DRLAGALKEMQSEGKADRVFHLISPSNGSIQREKDDQPVIKEEEPAVNSSDSVHIKLARK 370
+RL +L+ M + AD+V + S R D V K+++P+ + V + +RK
Sbjct: 304 ERLIRSLRAMNAGDAADQVIRQVDKMIKSADRFIRDARVNKDDQPS--NQLPVSGESSRK 361
Query: 371 RSGSQDRGDLSVDEDVPGKRVRTIVGLTEPKKELDDCA-VNSQDDAHS-----TPTSSKG 424
R D L+ + KR+R + P + A VN HS +P
Sbjct: 362 RPVPHDNEQLANGHEAIAKRIR-----SGPDSDFTLPAQVNDSGRDHSSVNGVSPNVPVL 416
Query: 425 DVENGPVQQLVAMFGALVAQGEKAVXXXXXXXXXXXXXXXXXVVMANMRYLPPNYP 480
+ E V+Q++A+ GAL+A+GE+ +V+ANM++LP P
Sbjct: 417 ESELTAVEQMIAVIGALIAEGERGAKSLEILISQIHPDLLADIVIANMKHLPKAPP 472