Miyakogusa Predicted Gene
- Lj0g3v0153869.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj0g3v0153869.1 CUFF.9551.1
(554 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                        (bits) Value
Medtr8g028395.1 | group 1 family glycosyltransferase | HC | chr8...   919  0.0
Medtr1g090860.1 | UDP-glycosyltransferase family protein | HC | ...   431  e-121
>Medtr8g028395.1 | group 1 family glycosyltransferase | HC |
chr8:10728448-10719900 | 20130731
Length = 1023
Score = 919 bits (2375), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 429/536 (80%), Positives = 471/536 (87%)
Query: 1   MLEVISKGKISPLARNIASVGRSTAKNLMVSEAIDGYAALLRNILRLPSEVTPPKAVSEI 60
           MLEVISKGKISPLARNIAS+GR TAKNLMVSEAIDGYA+LL+NILRLPSEV PPKAVSEI
Sbjct: 472 MLEVISKGKISPLARNIASMGRRTAKNLMVSEAIDGYASLLQNILRLPSEVAPPKAVSEI 531
Query: 61  SPGVKGQWQWHLFEAVPSLTYQNRTFRSNAFLDEYENRWNHSQKNRSTTMVSANDSFVYS 120
           SP VK +WQWHLFEAVP+ TYQNR  RSN FLD+YE+RWNHS+K++S+T ++ NDSFVY+
Sbjct: 532 SPNVKEKWQWHLFEAVPNSTYQNRALRSNTFLDKYEDRWNHSRKDKSSTTIADNDSFVYT 591
Query: 121 IWEEERNIQMAIAKKRREDEELKDRTEQSHGTWEEVYRNAKKADRLKNDLHERDEGELER 180
           IWEEE+ IQ AI KKR EDEELKDRTEQSHGTWEEVYRNAKKADRLKNDLHERD+GELER
Sbjct: 592 IWEEEKYIQKAITKKRIEDEELKDRTEQSHGTWEEVYRNAKKADRLKNDLHERDDGELER 651
Query: 181 TGQPLCIYEPYIGEGSWPFLHHKPLYRGVSLSSKGRRPGRDDFDAPSRLPLLNSAYYRDI 240
           TGQPL IYEPY GEG+W FLHH+ LYRGVSLSSKGRRPGRDDFDAPSRLPLLN+AYYRD+
Sbjct: 652 TGQPLSIYEPYFGEGAWAFLHHRSLYRGVSLSSKGRRPGRDDFDAPSRLPLLNNAYYRDV 711
Query: 241 LGEYGAFFAIANRVDRLHKNAWIGFQSWRATAQKASLSRTAENALLDAIQTKRHGDALYF 300
           LGE+GAFFAIANR+DRLHKNAWIGFQSWRATA+KASLSR AENALLDA+Q+K++GD LYF
Sbjct: 712 LGEFGAFFAIANRIDRLHKNAWIGFQSWRATARKASLSRAAENALLDAVQSKKNGDTLYF 771
Query: 301 WVRMDMDQRNPSQMNFWSFCDAINAGGCQFAFSEAMKKMYGLKNDTVSFPPMPIDGDTWS 360
           WVRMD D RNPSQ +FWSFCD+INAGGC+ AFSEAM++MYG++ D  S PPMP+DGDTWS
Sbjct: 772 WVRMDTDPRNPSQKDFWSFCDSINAGGCKPAFSEAMRRMYGVQADANSLPPMPVDGDTWS 831
Query: 361 VMLSWALPTRSFLEFVMFSRMFVDALDAQMYNEHHSTGRCPLSLSTDKHCYSRVLELLVN 420
           VMLSWALPTRSFLEFVMFSRMFVDALDAQMY+EHHSTG CPLSLS DKHCYSRVLELLVN
Sbjct: 832 VMLSWALPTRSFLEFVMFSRMFVDALDAQMYDEHHSTGHCPLSLSKDKHCYSRVLELLVN 891
Query: 421 VWAYHSARRMVFVNPETGLMQEQHKFKSRRGQMWIQWFSYNTLKSMXXXXXXXXXXXXPN 480
           VWAYHSARRMVFVNPETG MQEQHKFK+RRG+MWI+WFSY+TLK+M            PN
Sbjct: 892 VWAYHSARRMVFVNPETGAMQEQHKFKNRRGKMWIKWFSYSTLKNMDEDLAELSDSEDPN 951
Query: 481 RHWLWPSTGEVFWQGLYXXXXXXXXXXXXXXXXXXXXXLYRMRKRHRQQVIGKYVK 536
           +HWLWPSTGEVFWQGLY                     L RMRKRHRQQVIGKYVK
Sbjct: 952 KHWLWPSTGEVFWQGLYERERSLRHKEKEKRKQKSLEKLNRMRKRHRQQVIGKYVK 1007
>Medtr1g090860.1 | UDP-glycosyltransferase family protein | HC |
chr1:40921678-40911754 | 20130731
Length = 1038
Score = 431 bits (1108), Expect = e-121, Method: Compositional matrix adjust.
Identities = 234/540 (43%), Positives = 314/540 (58%), Gaps = 27/540 (5%)
Query: 13  LARNIASVGRSTAKNLMVSEAIDGYAALLRNILRLPSEVTPPKAVSEISPGVKGQWQWHL 72
            A+ I S GR  AKN +  + I GYA LL N+L  PS+   P  VS+I    +  W W
Sbjct: 505 FAQAIGSSGRQFAKNGLALDCIIGYARLLENVLSFPSDSLLPGPVSQIQ---QVAWGWSF 561
Query: 73  F----EAVPSLTYQNRTFRSNAFLDEYENRWNHSQKNRST------TMVSANDSFVYSIW 122
           F    E    L   +  F +      +      +  N ST      T V   D      W
Sbjct: 562 FQNEIELDIDLLKMDDDFSNGKATVVHAVEKELASLNYSTNFLENGTDVPIQDELTKLDW 621
Query: 123 EEERNIQMAIAKKRREDEELKDRTEQSHGTWEEVYRNAKKADRLKNDLHERDEGELERTG 182
           +  R I+++   +  E E++++R E+  G W+E+YRNA+K+++LK + +ERDEGELERTG
Sbjct: 622 DILREIEISEESEMLEIEQVEERLEKDVGVWDEIYRNARKSEKLKFEANERDEGELERTG 681
Query: 183 QPLCIYEPYIGEGSWPFLHHKPLYRGVSLSSKGRRPGRDDFDAPSRLPLLNSAYYRDILG 242
           QP+CIYE Y G G WPFLHH  LYRG+SLS + +R   DD DA  RLPLLN  YYRDIL
Sbjct: 682 QPVCIYEIYSGAGVWPFLHHGSLYRGLSLSRRSQRQSSDDVDAVGRLPLLNDTYYRDILC 741
Query: 243 EYGAFFAIANRVDRLHKNAWIGFQSWRATAQKASLSRTAENALLDAIQTKRHGDALYFWV 302
           E G  FAIANRVD +H+  WIGFQSWRA  +K +LS  AE+ L + +     GD +YFW
Sbjct: 742 EMGGMFAIANRVDSIHRRPWIGFQSWRAAGRKVALSVEAESVLEETMHENARGDVIYFWG 801
Query: 303 RMDMDQ---RNPSQMNFWSFCDAINAGGCQFAFSEAMKKMYGLKNDTVSFPPMPIDGDTW 359
           R+D+D     + + + FWS CD +N G C+  F ++ ++MY L     + PPMP DG  W
Sbjct: 802 RLDLDGGAIGSNNALTFWSMCDILNGGNCRNVFQDSFRQMYSLPPHAEALPPMPEDGGYW 861
Query: 360 SVMLSWALPTRSFLEFVMFSRMFVDALDAQMYNEHHSTGR---CPLSLS--TDKHCYSRV 414
           S + SW +PT SFLEFVMFSRMFVD++DA     H  +G+   C L  S   +KHCY R+
Sbjct: 862 SALHSWVMPTPSFLEFVMFSRMFVDSIDAF----HRDSGKYSMCLLGSSEIEEKHCYCRM 917
Query: 415 LELLVNVWAYHSARRMVFVNPETGLMQEQHKFKSRRGQMWIQWFSYNTLKSM-XXXXXXX 473
           LELL+NVWAYHS+R+MV++NP TG +QEQH  + R+  MW ++F+++ LKSM
Sbjct: 918 LELLINVWAYHSSRKMVYINPNTGSLQEQHLVEQRKSFMWAKYFNFSLLKSMDEDLAEAA 977
Query: 474 XXXXXPNRHWLWPSTGEVFWQGLYXXXXXXXXXXXXXXXXXXXXXLY-RMRKRHRQQVIG 532
                P   WLWP TGEV WQG+Y                     LY RM+  ++Q+ +G
Sbjct: 978 DDGDDPRDKWLWPMTGEVHWQGIYEREREERYRIKMDKKRKTKEKLYERMKYGYKQKSLG 1037
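The Identities, Positives and Gaps fields reported for each hit above are simple ratios over the aligned columns, so they can be recomputed directly from the counts. The following is a minimal sketch in Python and is not part of the BLAST output; the names STATS, FIELD, parse_stats and recomputed_percent are hypothetical, and the only inputs are the two statistics lines copied from this report.

```python
import re

# Statistics lines copied verbatim from the two hits in this report.
STATS = {
    "Medtr8g028395.1": "Identities = 429/536 (80%), Positives = 471/536 (87%)",
    "Medtr1g090860.1": "Identities = 234/540 (43%), Positives = 314/540 (58%), "
                       "Gaps = 27/540 (5%)",
}

# Matches one "Name = count/total (NN%)" field of a statistics line.
FIELD = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, aligned_columns, reported_percent)} for one line."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in FIELD.findall(line)
    }


def recomputed_percent(count, total):
    """Percentage as a float; no rounding rule is assumed here."""
    return 100.0 * count / total


if __name__ == "__main__":
    for hit, line in STATS.items():
        for name, (count, total, reported) in parse_stats(line).items():
            value = recomputed_percent(count, total)
            print(f"{hit} {name}: {count}/{total} = {value:.2f}% "
                  f"(reported {reported}%)")
```

For the five fields in this report, the integer printed by BLAST equals the recomputed value rounded down (for example 471/536 is about 87.87% and is reported as 87%). This is an observation about these numbers only, not a statement about how BLAST formats percentages in general.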