Miyakogusa Predicted Gene
- Lj0g3v0192989.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0192989.1 CUFF.12209.1
(428 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E value
Medtr4g005350.1 | DUF1666 family protein | HC | chr4:219435-2154... 670 0.0
Medtr6g084440.2 | DUF1666 family protein | HC | chr6:31613376-31... 167 1e-41
Medtr6g084440.1 | DUF1666 family protein | HC | chr6:31613376-31... 166 3e-41
Medtr8g104270.1 | DUF1666 family protein | HC | chr8:43907131-43... 152 5e-37
Medtr7g010160.1 | DUF1666 family protein | HC | chr7:2428673-242... 149 5e-36
Medtr5g040600.1 | transmembrane protein, putative | HC | chr5:17... 127 2e-29
Medtr8g467380.1 | transmembrane protein, putative | HC | chr8:24... 88 1e-17
Medtr8g467390.1 | DUF1666 family protein | LC | chr8:24215397-24... 79 1e-14
>Medtr4g005350.1 | DUF1666 family protein | HC | chr4:219435-215465
| 20130731
Length = 425
Score = 670 bits (1729), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 335/428 (78%), Positives = 352/428 (82%), Gaps = 3/428 (0%)
Query: 1 MDFLKIKKFRKSQKGNGEKDLTDKAVPEPEEPKTNTGGPDQCKSENXXXXXXXXXXXXFI 60
MDFLKIKKFRKSQK KDL DKAV EPEEPK +TG PDQCKSEN FI
Sbjct: 1 MDFLKIKKFRKSQKAGVVKDLVDKAVSEPEEPKPSTGDPDQCKSENADSCADAEDDDDFI 60
Query: 61 TNEVKRRLKELRRNSFMVLIXXXXXXXXXXXXXXXXXXTSSNEWRDVEAEGQQWWRGFDA 120
TNEVKRRLKELRRNSFMVLI T NEWRDVEAEGQQWWRGFDA
Sbjct: 61 TNEVKRRLKELRRNSFMVLIPEEDSCLEEGEDEEEEGETVPNEWRDVEAEGQQWWRGFDA 120
Query: 121 VFEKYCEKMLFFDRMSMQQLNEIGKGSLQTSTQSPRSASKKLTSPLRCLSLKKFEEPDED 180
VFEKYCE+MLFFDRM++Q L EIGKGS TST SPRS SKKL SPLRCLSLKKFE PD++
Sbjct: 121 VFEKYCERMLFFDRMNVQHLGEIGKGSQNTSTPSPRSTSKKLASPLRCLSLKKFEGPDDE 180
Query: 181 TEHLQQPGNDPYLDIEMAYVGQICLTWEALHCQYSHTSQKISWQPENPTCYNHSAQEFQQ 240
TEHLQ+P N PYLDIE AYVGQICLTWEALHCQYSH S KISWQ ENPTCY+ SAQEFQQ
Sbjct: 181 TEHLQEPENIPYLDIETAYVGQICLTWEALHCQYSHMSYKISWQHENPTCYSRSAQEFQQ 240
Query: 241 FQVLLQRFIENEPFEQGPRAESYARTRKALPKLLQVPNIRGSDHELTDDSDMRVLAPDLI 300
FQVLLQRFIENEPFEQGPR E YAR+R LPKLLQVPNIRGSDHE+TD+SD+RVLAPDLI
Sbjct: 241 FQVLLQRFIENEPFEQGPRPEIYARSRNTLPKLLQVPNIRGSDHEITDESDIRVLAPDLI 300
Query: 301 RIIENSILTFHLFLKRDKKKSSGVINLFGNQNQLATPLQQVQSTLDXXXXXXXXXXXXXX 360
RIIENSILTF LFLKRDKKKSS VINLFGNQNQLATPLQQVQSTL+
Sbjct: 301 RIIENSILTFRLFLKRDKKKSS-VINLFGNQNQLATPLQQVQSTLE--KKVVKLKELRKK 357
Query: 361 GWKKNSWPQKHEDIQLLLGLIDAKIISRVLRMTRMSREQLFWCEEKMKKLDLSNGRLERD 420
GW+KNSWPQKHED+QLLLGLIDAKI+SRVLRMTRM+REQLFWCEEKMKKLDLSN RLERD
Sbjct: 358 GWRKNSWPQKHEDVQLLLGLIDAKILSRVLRMTRMTREQLFWCEEKMKKLDLSNNRLERD 417
Query: 421 PCPILFPC 428
PCPILFPC
Sbjct: 418 PCPILFPC 425
>Medtr6g084440.2 | DUF1666 family protein | HC |
chr6:31613376-31617656 | 20130731
Length = 602
Score = 167 bits (424), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 179/317 (56%), Gaps = 20/317 (6%)
Query: 115 WRGFDAVFEKYCEKMLFFDRMSMQQLNEIGKGSLQTSTQSPRSASKKLTSPLRCLSLKKF 174
W + AVF+KY E+M F +R+S Q+L+E SL++ +PRS S ++ L ++ K
Sbjct: 300 WESY-AVFQKYDEEMSFLERISAQKLHETE--SLRSIKVAPRSISGRIVYKLSSMNKKP- 355
Query: 175 EEPDEDTEHLQQPGNDPYLDIEMAYVGQICLTWEALHCQYSHTSQKISWQPENPTCYNHS 234
ED H +PY ++E AYV QICLTWEAL+ Y + K + + C
Sbjct: 356 ----EDISH------NPYCELEGAYVAQICLTWEALNWNYKNFQTKRASNV-DVGCPATI 404
Query: 235 AQEFQQFQVLLQRFIENEPFEQGPRAESYARTRKALPKLLQVPNIRGSDHELTDDSDMRV 294
AQ+FQQFQVLLQR++ENEP+E G R E YAR R PKLL VP R D + + ++
Sbjct: 405 AQQFQQFQVLLQRYVENEPYEFGRRPEIYARMRHMAPKLLLVPEYRDDDQKENIGFNTKI 464
Query: 295 LAPDLIRIIENSILTFHLFLKRDKKKSSGVINLFGNQNQLA----TPLQQVQSTLDXXXX 350
+ + I+E+ I TF FLK DK+K ++ + +NQ T ++ ++
Sbjct: 465 SSASFLVIMEDGIRTFMNFLKADKEKPCQILASYFRRNQRGLVDPTLIRLLKKVNQKKKI 524
Query: 351 XXXXXXXXXXGWKKNSWPQKHEDIQLLLGLIDAKIISRVLRMTRMSREQLFWCEEKMKKL 410
+K + ++ E++++L+ LID K++SRVLRM+ M+ QL WCEEK K+
Sbjct: 525 KIKDLRRSHKCLRKRNL-KEEEEMEILMALIDLKLVSRVLRMSDMNENQLHWCEEKNSKV 583
Query: 411 DLSNGRLERDPCPILFP 427
+ +G+L+RD P+ FP
Sbjct: 584 RVIDGKLQRDSTPLFFP 600
>Medtr6g084440.1 | DUF1666 family protein | HC |
chr6:31613376-31617663 | 20130731
Length = 604
Score = 166 bits (421), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 182/319 (57%), Gaps = 22/319 (6%)
Query: 115 WRGFDAVFEKYCEKMLFFDRMSMQQLNEIGKGSLQTSTQSPRSASKKLTSPLRCLSLKKF 174
W + AVF+KY E+M F +R+S Q+L+E SL++ +PRS S ++ L ++ K
Sbjct: 300 WESY-AVFQKYDEEMSFLERISAQKLHETE--SLRSIKVAPRSISGRIVYKLSSMNKKP- 355
Query: 175 EEPDEDTEHLQQPGNDPYLDIEMAYVGQICLTWEALHCQYSHTSQKISWQPENPTCYNHS 234
ED H +PY ++E AYV QICLTWEAL+ Y + K + + C
Sbjct: 356 ----EDISH------NPYCELEGAYVAQICLTWEALNWNYKNFQTKRASNV-DVGCPATI 404
Query: 235 AQEFQQFQVLLQRFIENEPFEQGPRAESYARTRKALPKLLQVPNIRGSDHELTDDS--DM 292
AQ+FQQFQVLLQR++ENEP+E G R E YAR R PKLL VP R SD + ++ +
Sbjct: 405 AQQFQQFQVLLQRYVENEPYEFGRRPEIYARMRHMAPKLLLVPEYRESDDDQKENIGFNT 464
Query: 293 RVLAPDLIRIIENSILTFHLFLKRDKKKSSGVINLFGNQNQLA----TPLQQVQSTLDXX 348
++ + + I+E+ I TF FLK DK+K ++ + +NQ T ++ ++
Sbjct: 465 KISSASFLVIMEDGIRTFMNFLKADKEKPCQILASYFRRNQRGLVDPTLIRLLKKVNQKK 524
Query: 349 XXXXXXXXXXXXGWKKNSWPQKHEDIQLLLGLIDAKIISRVLRMTRMSREQLFWCEEKMK 408
+K + ++ E++++L+ LID K++SRVLRM+ M+ QL WCEEK
Sbjct: 525 KIKIKDLRRSHKCLRKRNL-KEEEEMEILMALIDLKLVSRVLRMSDMNENQLHWCEEKNS 583
Query: 409 KLDLSNGRLERDPCPILFP 427
K+ + +G+L+RD P+ FP
Sbjct: 584 KVRVIDGKLQRDSTPLFFP 602
>Medtr8g104270.1 | DUF1666 family protein | HC |
chr8:43907131-43910645 | 20130731
Length = 563
Score = 152 bits (385), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 119/325 (36%), Positives = 179/325 (55%), Gaps = 29/325 (8%)
Query: 115 WRGFDAVFEKY-CEKMLFFDRMSMQQLNEIGKGSLQTSTQSPRSASKKLTSPLRCLSLKK 173
W + +F++Y E F +R+S + + SL++ SPRS S+++ + L ++ K
Sbjct: 254 WESY-TLFQRYD-EDNAFLERISARNKRHETE-SLRSIQMSPRSISERIANKLSSINKK- 309
Query: 174 FEEPDEDTEHLQQPGNDPYLDIEMAYVGQICLTWEAL-----HCQYSHTSQKISWQPENP 228
P + G++PY ++E AYV QICLTWEAL + +Y H SQ S +
Sbjct: 310 ---PTD-------VGHNPYSELEAAYVAQICLTWEALSWNYTNFRYKHASQ--SRHDFDI 357
Query: 229 TCYNHSAQEFQQFQVLLQRFIENEPFEQGPRAESYARTRKALPKLLQVPNIRGSDHELTD 288
C AQ+FQQFQVLLQR++ENEP+E G R E YAR R PKLL VP S+ + D
Sbjct: 358 GCPATIAQQFQQFQVLLQRYVENEPYEHGRRPEIYARMRLLAPKLLLVPEYHDSEEDQMD 417
Query: 289 -DSDMRVLAPDLIRIIENSILTFHLFLKRDKKKSSGVINLFGNQNQLA----TPLQQVQS 343
D ++ + ++I+E I TF FLK DK+KS ++ + +N+ T L+ ++
Sbjct: 418 SDFHSKISSASFLKIMEGGIRTFMNFLKTDKEKSCQILTYYFRRNKRGMVDPTLLKLMKK 477
Query: 344 TLDXXXXXXXXXXXXXXGWKKNSWPQKHEDIQLLLGLIDAKIISRVLRMTRMSREQLFWC 403
G +K + E+I++L+GLID K++SRVLRM +S +QL WC
Sbjct: 478 VNQKKRVKVKDLSHLGKGLRKRKL-KVEEEIEILMGLIDLKVVSRVLRMKELSEQQLHWC 536
Query: 404 EEKMKKLDLSNGRLERD-PCPILFP 427
E+KM K+ + G+L RD P+ FP
Sbjct: 537 EKKMSKVRVVEGKLCRDYSTPLFFP 561
>Medtr7g010160.1 | DUF1666 family protein | HC |
chr7:2428673-2422982 | 20130731
Length = 745
Score = 149 bits (376), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 167/329 (50%), Gaps = 29/329 (8%)
Query: 122 FEKYCEKMLFFDRMSMQQLNEIGKGSLQTSTQSPRSAS--KKLTSPLRCL---SLKKF-- 174
+ Y E+M FD ++ Q++ +G L S +S S KK +S + C+ + F
Sbjct: 422 YRSYRERMRKFDILNYQKMYALG---LMKSKDPLKSFSIHKKSSSTITCILPRGINSFFR 478
Query: 175 EEPDEDTEHLQQPGNDPYLDIEMAYVGQICLTWEALHCQYSHTSQKISWQPENPTC--YN 232
+ D + +++ + Y D+EM YVG +CL+WE LH +Y + KI W+ + +N
Sbjct: 479 RNRNIDADPMKKFIRELYSDLEMVYVGHLCLSWEFLHWEY-EKALKI-WESDQYGLRRFN 536
Query: 233 HSAQEFQQFQVLLQRFIENEPFEQGPRAESYARTRKALPKLLQVPNIRGSDHELTDDSDM 292
A EFQQFQVLLQRFIENEPF QGPR E+YAR R A+ KLLQVP I+ +
Sbjct: 537 EVAGEFQQFQVLLQRFIENEPF-QGPRVENYARNRCAMKKLLQVPVIKEDKGKDKKKYRK 595
Query: 293 RVLAPD------LIRIIENSILTFHLFLKRDKKKSSGVINLFGNQN-QLATPLQQ---VQ 342
R + D L+ I+E SI T F++ D+ S+ I Q+ +L P V+
Sbjct: 596 REVDNDAITSDMLVEILEESIRTIWRFIRGDEDASNLTIKCLKEQHVELQDPADSQLLVE 655
Query: 343 STLDXXXXXXXXXXXXXXGWKKNSWPQKHED----IQLLLGLIDAKIISRVLRMTRMSRE 398
D G +KHED + +D K++ RVL M+R++ +
Sbjct: 656 ILTDLQKKEKRLREVLRSGSCILKKFKKHEDETDPVLYFFSQVDLKLVCRVLNMSRITTD 715
Query: 399 QLFWCEEKMKKLDLSNGRLERDPCPILFP 427
QL WC K+ K++ N R+ +P +LFP
Sbjct: 716 QLAWCRSKLNKINFVNRRIHVEPSFLLFP 744
>Medtr5g040600.1 | transmembrane protein, putative | HC |
chr5:17838417-17832271 | 20130731
Length = 909
Score = 127 bits (320), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 158/332 (47%), Gaps = 43/332 (12%)
Query: 121 VFEKYCEKMLFFDRMSMQQLNEIGKGSLQTSTQSPRSASKKLTSPLRCLSLKKFEEPDED 180
V++ Y EKM D ++ Q ++ +G LQ L PL+ +S+ K +
Sbjct: 597 VYKSYAEKMRKLDILNYQTMHALGL--LQ------------LKDPLKLISIPKSTISNGI 642
Query: 181 TEHLQQP------GNDPYL--------DIEMAYVGQICLTWEALHCQYSHTSQKISWQPE 226
P +DP+L D+E+ YVGQICL+WE L + + + +
Sbjct: 643 ISQNLWPRKSTKITSDPFLKLVHQLHRDLELVYVGQICLSWEILCWLHMKAIELQQYDSQ 702
Query: 227 NPTCYNHSAQEFQQFQVLLQRFIENEPFEQGPRAESYARTRKALPKLLQVPNIRGSDHEL 286
YNH A EFQ FQVL+QRFIENEPF+ GPR ++Y + R + LL VP I+ ++
Sbjct: 703 RSHRYNHVAGEFQLFQVLMQRFIENEPFQGGPRIQNYVKNRCVIRNLLHVPAIKD---DI 759
Query: 287 TDDSDMRVLAPDLIRIIENSILTFHLFLKRDKKKSSGVINLFGNQ--NQLATPLQQ---V 341
+ + + L II+ S+ F F++ D K +G +N+ Q + L P V
Sbjct: 760 KGGEEDPIASGRLQDIIKESMRVFWEFVRTD--KDNGNVNVISKQIGSDLKDPAIANLLV 817
Query: 342 QSTLDXXXXXXXXXXXXXXGWKKNSWPQKHEDIQL----LLGLIDAKIISRVLRMTRMSR 397
+ G QKH + QL L+ + ++ISRV+ M+++ +
Sbjct: 818 DIRIQLQKKDKKLKDIVRTGNCIVKKFQKHHEDQLDHEQLVAQVGLRLISRVINMSQLRK 877
Query: 398 EQLFWCEEKMKKLD-LSNGRLERDPCPILFPC 428
EQ+ WC EK+ ++ LS + +P +LFPC
Sbjct: 878 EQVLWCSEKLNRIKFLSRKIVHVEPSFLLFPC 909
>Medtr8g467380.1 | transmembrane protein, putative | HC |
chr8:24203821-24210537 | 20130731
Length = 829
Score = 88.2 bits (217), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 112/244 (45%), Gaps = 19/244 (7%)
Query: 189 NDPYLDIEMAYVGQICLTWEALHCQYSHTSQKISWQPENPTCYNHSAQEFQQFQVLLQRF 248
N+ + D+E+ YVGQICL+WE L Q+ + + P YN A EFQ FQVL+QRF
Sbjct: 582 NELHRDLEIVYVGQICLSWEILCWQHEKIKELKKYDSPRPRRYNLIAGEFQLFQVLMQRF 641
Query: 249 IENEPFEQGPRAESYARTRKALPKLLQVPNIRGSDHELTDD----SDMRVLAPDLIRIIE 304
+E+EPF Q R ++Y + R + LLQVP I+ + + + + L +II+
Sbjct: 642 LEDEPFRQDHRVQNYVKNRCVIRNLLQVPIIKDDSTKDKKKIKWGEEDGIASERLEQIIK 701
Query: 305 NSILTFHLFLKRDKKKSSGVINLFGNQ-------NQLATPLQQVQSTLDXXXXXXXXXXX 357
S+ F F++ DK + +F + +++ L+ +Q L+
Sbjct: 702 KSMQVFWKFVRADKDDDNVFHKVFHHHKENEVKDTEISELLRDIQIQLNKKERKLKERLR 761
Query: 358 XXXGWKKNSWPQKHEDIQL----LLGLIDAKIISRVLRMT----RMSREQLFWCEEKMKK 409
+ + IQL L + ++IS+V R+ R Q W ++ +
Sbjct: 762 SGNCIVRKFQKHNEDQIQLDHEQFLAQVGLRLISKVERLALNKCRSDHSQRLWRRQRCRS 821
Query: 410 LDLS 413
D+S
Sbjct: 822 RDMS 825
>Medtr8g467390.1 | DUF1666 family protein | LC |
chr8:24215397-24217284 | 20130731
Length = 363
Score = 78.6 bits (192), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 50/84 (59%)
Query: 194 DIEMAYVGQICLTWEALHCQYSHTSQKISWQPENPTCYNHSAQEFQQFQVLLQRFIENEP 253
D+E+ YVGQICL+WE L Q+ + + YN A EF FQ L+QRF+E +P
Sbjct: 206 DLELVYVGQICLSWEMLCWQHEKIKELKQYDLPWLRSYNQVAAEFLHFQALIQRFLEEDP 265
Query: 254 FEQGPRAESYARTRKALPKLLQVP 277
+QG R ++Y + R + LLQVP
Sbjct: 266 IQQGHRIQNYVKNRSLVRNLLQVP 289
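
Note on the percentages: each "Identities = ..." line above gives a count, the alignment length, and a whole-number percentage. In this report the printed percentages appear to be truncated rather than rounded (for example, Gaps = 22/319 is about 6.9% but is shown as 6%). The following is a minimal Python sketch that parses such a line and recomputes the percentages by integer division. The names `parse_stats` and `floor_percent` are hypothetical helpers written for this illustration and are not part of BLAST; the truncation rule is an assumption inferred from the values in this report only.

```python
import re

# Matches the statistics line of a legacy BLASTP alignment block.
# The Positives and Gaps fields are optional: an ungapped alignment
# (like the last hit in this report) prints no Gaps field.
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\)"
    r"(?:, Positives = (\d+)/(\d+) \((\d+)%\))?"
    r"(?:, Gaps = (\d+)/(\d+) \((\d+)%\))?"
)


def parse_stats(line):
    """Return {field: (count, alignment_length, printed_percent)}."""
    match = STATS.search(line)
    if match is None:
        raise ValueError("not a BLASTP statistics line: %r" % line)
    fields = {}
    for name, start in (("identities", 1), ("positives", 4), ("gaps", 7)):
        if match.group(start) is not None:
            fields[name] = tuple(
                int(match.group(start + offset)) for offset in range(3)
            )
    return fields


def floor_percent(count, length):
    """Whole-number percentage, truncated (assumed rule, see lead-in)."""
    return 100 * count // length


if __name__ == "__main__":
    line = ("Identities = 119/319 (37%), Positives = 182/319 (57%), "
            "Gaps = 22/319 (6%)")
    for name, (count, length, printed) in parse_stats(line).items():
        print(name, count, length, printed, floor_percent(count, length))
```

Applied to the statistics lines in this report, the recomputed value matches the printed one for every field, which is what suggests truncation. Other BLAST versions or output formats may behave differently.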