Miyakogusa Predicted Gene - Lj5g3v1264010.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj5g3v1264010.1 Non Characterized Hit- tr|I1NII1|I1NII1_SOYBN
Uncharacterized protein OS=Glycine max PE=4 SV=1,74.5,0,seg,NULL;
SUBFAMILY NOT NAMED,NULL; PHOSPHATIDYLCHOLINE TRANSFER PROTEIN,NULL;
no description,START-,CUFF.55122.1
(466 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
Medtr1g078030.1 | mammalian STARD2 lipid-binding START domain pr...   575   e-164
Medtr3g092640.1 | membrane-related protein CP5, putative | HC | ...   447   e-125
Medtr5g007030.1 | mammalian STARD2 lipid-binding START domain pr...   339   3e-93
Medtr5g007030.2 | mammalian STARD2 lipid-binding START domain pr...   312   4e-85
Medtr4g005140.1 | homeobox/lipid-binding domain protein | HC | c...   137   3e-32
>Medtr1g078030.1 | mammalian STARD2 lipid-binding START domain
protein | HC | chr1:34888325-34884479 | 20130731
Length = 395
Score = 575 bits (1483), Expect = e-164, Method: Compositional matrix adjust.
Identities = 286/391 (73%), Positives = 318/391 (81%), Gaps = 6/391 (1%)
Query: 78 VLNFLVDLAMFIAPLWIAVLVGVVLGWAWKPKWAAAVDSEHAWTNFFHFRSFPWFPNSDL 137
V++ L +L FIAPLWIAV+ GVV+GWAWKPKWA ++ TN F FR PWF S+L
Sbjct: 9 VMDILGNLVTFIAPLWIAVIFGVVVGWAWKPKWAIEPNNYSWSTNLFKFR-IPWFNYSEL 67
Query: 138 HNGPEPDPDFSYSTSSGAPSLKGVSSLVTEQDLRQLSKLVEEKDGGLAWIQMMDRSTQTM 197
N +P ++S ++SSG S KG+ S+VTE DL+ L KLVEEKDGG AWIQMMDRST M
Sbjct: 68 QN--QPGLEYSATSSSGE-SEKGLRSIVTEHDLQNLCKLVEEKDGGPAWIQMMDRSTPNM 124
Query: 198 SYQAWRRDPESGPPQYRSRTVFEDASAELVRDFFWDDEYRLKWDDMLIHASIIEDCAVTG 257
+YQAWRRD E+GPPQYRSRTVFEDAS ELVRDFFWDDE+R +WDDMLIHAS I++C VTG
Sbjct: 125 TYQAWRRDQENGPPQYRSRTVFEDASPELVRDFFWDDEFRSRWDDMLIHASTIQECEVTG 184
Query: 258 AMMVHWVRKFPFFCSDREYIIGRRIWNAGNAYYCVTKGVPCSSMPRQNKPKRVDLYYSSF 317
MMV WVRKFPFFCSDREYIIGRRIW+A YYCVTKGVPCSS+PRQ+KP+RVDLYYSSF
Sbjct: 185 TMMVQWVRKFPFFCSDREYIIGRRIWDAERTYYCVTKGVPCSSIPRQSKPRRVDLYYSSF 244
Query: 318 CIRPVKSRKDGQLTACEVLLFHYEDMGIPWEIAKLGVRQGMWGAVKKFDPALRIYEKERA 377
IR VKSRKDGQLT+CEVL FHYEDMGIPWEIAKLGVRQGMWGAVKKFDP LR Y+KER
Sbjct: 245 FIRAVKSRKDGQLTSCEVLFFHYEDMGIPWEIAKLGVRQGMWGAVKKFDPGLRTYKKERD 304
Query: 378 SGA-LSRCANAAKINTKVTPDYLRCXXXXXXXXXXXXXQDS-SVKPIGRNIPKLLVVGGA 435
SG LS CAN AKINTKVT DY+RC QDS KPIGR+IPKLLVVGGA
Sbjct: 305 SGVPLSPCANNAKINTKVTADYVRCLEDSTSNLLETENQDSFDDKPIGRSIPKLLVVGGA 364
Query: 436 IALACTLDQGLVTKAVIFGVARRFAKIGRRL 466
IALACTLDQGLVTKAV+FG+ARRF K GRRL
Sbjct: 365 IALACTLDQGLVTKAVVFGIARRFGKFGRRL 395
>Medtr3g092640.1 | membrane-related protein CP5, putative | HC |
chr3:42338841-42335197 | 20130731
Length = 424
Score = 447 bits (1150), Expect = e-125, Method: Compositional matrix adjust.
Identities = 225/427 (52%), Positives = 291/427 (68%), Gaps = 37/427 (8%)
Query: 73 FQKPAVLNFLVDLAMFIAPLWIAVLVGVVLGWAWKPKW----------AAAVDSEHAWTN 122
+ PA+ F MF++P+ + +G+++GW WKPKW A+ + ++
Sbjct: 1 MESPAIWGFYA--TMFMSPMLLVFFLGIIVGWLWKPKWISSLAKSFDLASPISDSPIFSP 58
Query: 123 FFHFRSF-PWFPNSDLHNGPEPDP------------------DFSYSTSSGAPSLKGVSS 163
+ S P +S P PD +F STSS S + S+
Sbjct: 59 LKFYSSLSPCVNSSITMQTPNPDSLCINKEINKKGSSSSSPTNFDSSTSSNK-SGEDTSN 117
Query: 164 LVTEQDLRQLSKLVEEKDGGLAWIQMMDRSTQTMSYQAWRRDPESGPPQYRSRTVFEDAS 223
VT DL L KLVEEKDGGL WIQMMD+ST TMSYQAWRR+P+ GPPQYRS T+FEDA+
Sbjct: 118 GVTIDDLHHLYKLVEEKDGGLPWIQMMDKSTPTMSYQAWRREPKDGPPQYRSSTIFEDAT 177
Query: 224 AELVRDFFWDDEYRLKWDDMLIHASIIEDCAVTGAMMVHWVRKFPFFCSDREYIIGRRIW 283
E+VRD FWDD++R KWDDML++++ +E+C TG M V W+RKFPFFC DREYIIGRRIW
Sbjct: 178 PEMVRDLFWDDQFRPKWDDMLVNSTTLEECPTTGTMKVQWIRKFPFFCKDREYIIGRRIW 237
Query: 284 NAGNAYYCVTKGVPCSSMPRQNKPKRVDLYYSSFCIRPVKSRKD-GQLTACEVLLFHYED 342
G +YYC+TKGV C S+PRQ KP+RVD+YYSS+CIR V+S++D GQLTACE+LLFH+E+
Sbjct: 238 ECGRSYYCITKGVDCPSIPRQEKPRRVDVYYSSWCIRAVESKRDNGQLTACEILLFHHEE 297
Query: 343 MGIPWEIAKLGVRQGMWGAVKKFDPALRIYEKERASGA-LSRCANAAKINTKVTPDYLRC 401
MGIPWEIAKLGVR+GMWG V+K +P LR Y++ +ASGA LSR A A +NTK++P+YL+
Sbjct: 298 MGIPWEIAKLGVRKGMWGMVQKIEPGLRAYQEAKASGAPLSRSAFMAGVNTKISPEYLQS 357
Query: 402 XXXXXXXXXXXXXQ-DSSVKPIGRNIPKLLVVGGAIALACTLDQGLVTKAVIFGVARR-- 458
S KP G +PK+LV+GGA+ALAC+LD+GLVTKAVIFGVA+R
Sbjct: 358 IGSSDDESLQTESAITSDDKPKGMTVPKMLVIGGAVALACSLDKGLVTKAVIFGVAKRFG 417
Query: 459 FAKIGRR 465
FA +G+R
Sbjct: 418 FANMGKR 424
>Medtr5g007030.1 | mammalian STARD2 lipid-binding START domain
protein | HC | chr5:1173758-1179345 | 20130731
Length = 436
Score = 339 bits (870), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 180/428 (42%), Positives = 253/428 (59%), Gaps = 35/428 (8%)
Query: 69 LLGIFQKPAVLNFLVDLAMFIAPLWIAVLVGVVLGWAWKPKWAAAV-------------- 114
++ +KP++ VD+ + P+W+AV++G+V+GW+W+P+W +
Sbjct: 8 VMEFMKKPSITETFVDILLCAVPIWLAVMIGLVIGWSWRPRWTGLLFIGLRSKFRFLWTV 67
Query: 115 ----DSEHAWTNFFHFRSFP-----W--FPNSDLHNGPEPDP-------DFSYSTSSGAP 156
+ W F +F W F N + P + ++ SG
Sbjct: 68 PPGFGARRLWLAFTALSAFSICRRYWSNFKNKEKVLDPSSNSCSDDATDATKHAARSGDK 127
Query: 157 SLKGVSSLVTEQDLRQLSKLVEEKDGGLAWIQMMDRSTQTMSYQAWRRDPESGPPQYRSR 216
+ + V E DL L L+E KDG + W M+RST M Y+AWR D E+G YRSR
Sbjct: 128 ADERDKDTVREADLEHLLHLLEGKDGEIDWQSFMERSTPNMQYKAWRYDSETGATVYRSR 187
Query: 217 TVFEDASAELVRDFFWDDEYRLKWDDMLIHASIIEDCAVTGAMMVHWVRKFPFFCSDREY 276
TVFEDA+ ELVRDFFWDD++R KWD ML H ++++C G +VHW++KFPFFCSDREY
Sbjct: 188 TVFEDATPELVRDFFWDDDFRPKWDPMLAHCKVLKECPHNGTSIVHWIKKFPFFCSDREY 247
Query: 277 IIGRRIWNAGNAYYCVTKGVPCSSMPRQNKPKRVDLYYSSFCIRPVKSRK-DGQLTACEV 335
II RRIW AGNAYYCVTKGVP S+P+++KP+RVDLY+SS+ I+PV+SRK DGQL+ACEV
Sbjct: 248 IIARRIWQAGNAYYCVTKGVPYPSLPKRDKPRRVDLYFSSWVIKPVESRKGDGQLSACEV 307
Query: 336 LLFHYEDMGIPWEIAKLGVRQGMWGAVKKFDPALRIYEKERASGA-LSRCANAAKINTKV 394
L H+EDMGIP ++AKLGVR GMWGAVKK +R Y+ R + A LSRCA A T++
Sbjct: 308 TLLHHEDMGIPKDVAKLGVRHGMWGAVKKLHSGMRAYQNARKTDASLSRCALMASKTTRL 367
Query: 395 TPDYLRCXXXXXXXXXXXXXQDSSVKPIGRNIP-KLLVVGGAIALACTLDQGLVTKAVIF 453
+ + ++ + IG + K + +GG +A+ + G V +A++
Sbjct: 368 SSNGNLHSLEDASLMEEREQAINNARQIGHGLDWKWVALGGTVAVVLGIHSGAVGRALLL 427
Query: 454 GVARRFAK 461
G RFA+
Sbjct: 428 GAGHRFAR 435
>Medtr5g007030.2 | mammalian STARD2 lipid-binding START domain
protein | HC | chr5:1173861-1179345 | 20130731
Length = 326
Score = 312 bits (800), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 160/323 (49%), Positives = 212/323 (65%), Gaps = 6/323 (1%)
Query: 145 PDFSYSTS---SGAPSLKGVSSLVTEQDLRQLSKLVEEKDGGLAWIQMMDRSTQTMSYQA 201
P+ +Y SG + + V E DL L L+E KDG + W M+RST M Y+A
Sbjct: 3 PNINYQIDVPMSGDKADERDKDTVREADLEHLLHLLEGKDGEIDWQSFMERSTPNMQYKA 62
Query: 202 WRRDPESGPPQYRSRTVFEDASAELVRDFFWDDEYRLKWDDMLIHASIIEDCAVTGAMMV 261
WR D E+G YRSRTVFEDA+ ELVRDFFWDD++R KWD ML H ++++C G +V
Sbjct: 63 WRYDSETGATVYRSRTVFEDATPELVRDFFWDDDFRPKWDPMLAHCKVLKECPHNGTSIV 122
Query: 262 HWVRKFPFFCSDREYIIGRRIWNAGNAYYCVTKGVPCSSMPRQNKPKRVDLYYSSFCIRP 321
HW++KFPFFCSDREYII RRIW AGNAYYCVTKGVP S+P+++KP+RVDLY+SS+ I+P
Sbjct: 123 HWIKKFPFFCSDREYIIARRIWQAGNAYYCVTKGVPYPSLPKRDKPRRVDLYFSSWVIKP 182
Query: 322 VKSRK-DGQLTACEVLLFHYEDMGIPWEIAKLGVRQGMWGAVKKFDPALRIYEKERASGA 380
V+SRK DGQL+ACEV L H+EDMGIP ++AKLGVR GMWGAVKK +R Y+ R + A
Sbjct: 183 VESRKGDGQLSACEVTLLHHEDMGIPKDVAKLGVRHGMWGAVKKLHSGMRAYQNARKTDA 242
Query: 381 -LSRCANAAKINTKVTPDYLRCXXXXXXXXXXXXXQDSSVKPIGRNIP-KLLVVGGAIAL 438
LSRCA A T+++ + ++ + IG + K + +GG +A+
Sbjct: 243 SLSRCALMASKTTRLSSNGNLHSLEDASLMEEREQAINNARQIGHGLDWKWVALGGTVAV 302
Query: 439 ACTLDQGLVTKAVIFGVARRFAK 461
+ G V +A++ G RFA+
Sbjct: 303 VLGIHSGAVGRALLLGAGHRFAR 325
>Medtr4g005140.1 | homeobox/lipid-binding domain protein | HC |
chr4:85012-80984 | 20130731
Length = 381
Score = 137 bits (344), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 95/295 (32%), Positives = 146/295 (49%), Gaps = 27/295 (9%)
Query: 161 VSSLVTEQDLRQLSKLVEEKDGGL----AWIQMMDRSTQTMSYQAWRRDPESGPPQYRSR 216
S +VT +DL+ L + DG L W ++D+ + Y A P++GP +Y S
Sbjct: 68 TSKIVTNEDLKFLMMIF---DGNLNENAKWEDVIDKRNDHLCYNAKSCKPKNGPLRYLSV 124
Query: 217 TVFEDASAELVRDFFWDDEYRLKWDDMLIHASIIEDCAVTGAMMVHWVRKFPFFCSDREY 276
TVF + SAE++R+F+ D++YR +WD ++ + ++ G+ + V+KFP REY
Sbjct: 125 TVFNNISAEMLRNFYMDNDYRKQWDKTVVEHNQLQVDKSDGSEVGRTVKKFPLL-KPREY 183
Query: 277 IIGRRIWNAGN-AYYCVTKGVPCSSMPRQNKPKRVDLYYSSFCIRPVKSRKDGQLTACEV 335
++ ++W + +YC K + PRQNK RV+ + S + IR V R ACE+
Sbjct: 184 VLTWKLWEGRDKTFYCYIKECEHTLAPRQNKYVRVEFFRSGWRIRQVPGR-----NACEI 238
Query: 336 LLFHYEDMGIPWEIAKLGVRQGMWGAVKKFDPALRIYEKERASGALSRCANAAKINTKVT 395
+FH ED G+ E+AKL +G+W V K D ALR Y ASG LS ++ +
Sbjct: 239 TMFHQEDAGLNVEMAKLAFSKGIWSYVCKMDNALRRYSA--ASGHLSSSVTSSVNLMQKV 296
Query: 396 PDYLRCXXXXXXXXXXXXXQD-----SSVKPIGRNIPK------LLVVGGAIALA 439
P L D S V+ I R + +L+VGGAI L+
Sbjct: 297 PACLESSTSYASSSHPTIIHDQTTHESQVRVISRRPSRKFLANSVLLVGGAICLS 351
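The per-alignment statistics lines in this report ("Score = ... Expect = ..." followed by "Identities = ...") can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself, and it assumes the two lines are passed in exactly as printed above. The function names parse_evalue and parse_hsp_header and the returned dictionary keys are hypothetical, chosen for illustration. The sketch also assumes that a legacy E-value printed without a mantissa, such as "e-164", stands for 1e-164.

```python
import re

# Patterns for the two statistics lines that precede each alignment block.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+) bits \((\d+)\),\s*Expect\s*=\s*([^,\s]+)"
)
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \(\d+%\), "
    r"Positives = (\d+)/\d+ \(\d+%\)"
    r"(?:, Gaps = (\d+)/\d+ \(\d+%\))?"
)


def parse_evalue(text):
    """Convert a legacy E-value string to float (assumes 'e-164' means 1e-164)."""
    text = text.strip()
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hsp_header(score_line, ident_line):
    """Extract scores and match counts from one alignment's statistics lines."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(ident_line)
    if score is None or ident is None:
        raise ValueError("lines do not look like legacy BLAST statistics lines")
    identities = int(ident.group(1))
    align_len = int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": parse_evalue(score.group(3)),
        "identities": identities,
        "align_len": align_len,
        "positives": int(ident.group(3)),
        # The Gaps field is omitted by BLAST when an alignment has no gaps.
        "gaps": int(ident.group(4)) if ident.group(4) else 0,
        "identity_fraction": identities / align_len,
    }


if __name__ == "__main__":
    top_hit = parse_hsp_header(
        "Score = 575 bits (1483), Expect = e-164, "
        "Method: Compositional matrix adjust.",
        "Identities = 286/391 (73%), Positives = 318/391 (81%), "
        "Gaps = 6/391 (1%)",
    )
    print(top_hit)
```

Computing identity_fraction from the raw counts (286/391 for the top hit) keeps more precision than the whole-number percentage printed in the report.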