Miyakogusa Predicted Gene - chr3.CM0127.280.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr3.CM0127.280.nc + phase: 0
(837 letters)
Database: Medicago_aa2.0
38,834 sequences; 10,231,785 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

IMGA|AC174318_26.4 Retrovirus capsid, N-terminal core; Prefoldin...   887   0.0
IMGA|AC126792_12.4 Protein kinase PKN/PRK1, effector chr05_pseud...   771   0.0
IMGA|CT030028_22.5 hypothetical protein chr03_pseudomolecule_IMG...    75   1e-13
IMGA|AC149032_5.5 Prefoldin chr02_pseudomolecule_IMGAG_V2 728244...    55   1e-07
IMGA|AC124963_7.5 t-snare chr07_pseudomolecule_IMGAG_V2 25847301...    50   5e-06
IMGA|AC146751_7.4 I/LWEQ; Prefoldin chr03_pseudomolecule_IMGAG_V...    44   2e-04
IMGA|AC151424_2.5 Protein of unknown function DUF827, plant chr0...    44   3e-04
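
The summary lines above share one layout: a subject identifier, a truncated description, a bit score, and an E-value. As a minimal sketch (not part of the BLAST output), the following Python splits one such line into fields. The helper name `parse_summary_line` and the regular expression are assumptions made for illustration, and the handling of E-values written without a leading digit (for example `e-105`) is an assumption about how some legacy BLAST versions print very small values.

```python
import re

# Hypothetical pattern: identifier, free-text description, bit score, E-value.
# The description is matched lazily so that the last two whitespace-separated
# tokens on the line are taken as the score and the E-value.
SUMMARY_RE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)

def parse_summary_line(line):
    """Return a dict for one hit-summary line, or None if it does not match."""
    m = SUMMARY_RE.match(line.strip())
    if m is None:
        return None
    evalue_text = m.group("evalue")
    # Assumption: a value such as "e-105" means "1e-105".
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": float(evalue_text),
    }

if __name__ == "__main__":
    example = "IMGA|CT030028_22.5 hypothetical protein chr03_pseudomolecule_IMG... 75 1e-13"
    print(parse_summary_line(example))
```

The description field keeps the trailing `...` exactly as printed, since the full text only appears in the per-hit header further down the report.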
>IMGA|AC174318_26.4 Retrovirus capsid, N-terminal core; Prefoldin
chr04_pseudomolecule_IMGAG_V2 21058583-21062716 E
EGN_Mt071002 20080227
Length = 858
Score = 887 bits (2291), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 517/820 (63%), Positives = 581/820 (70%), Gaps = 13/820 (1%)
Query: 1 MASKS--RSNLSENPNKASLATPXXXXXXXXXXXXQSDSPSPLQXXXXXXXXXXXXXXXK 58
MASKS RS S+N NKA ATP S+SPSPLQ K
Sbjct: 1 MASKSSTRSKPSDNSNKAPPATPKVSKVSKPVTKSASESPSPLQNSRLSVEKSPRSVNSK 60
Query: 59 PTVERKSPRPPSTPPDKQAPKAEKGXXXXXXXXXXXXDLKKAKEQLLQAEKEKVKATDEL 118
P VERKS + +TPPDKQAP+A KG DLKKAKEQ+LQAEKEKVKA DEL
Sbjct: 61 PAVERKSAKATATPPDKQAPRAAKGSELQNQLNVAQEDLKKAKEQILQAEKEKVKAIDEL 120
Query: 119 KEAQRVVEEANEKHREALVAQKRAEENSEIEKFRAVELEQAGIETVKKKEDEWQKELESV 178
KEAQRV EEANEK +EALVAQK+A+E SEIEKFRAVELEQAGIETV KKE+EWQKELESV
Sbjct: 121 KEAQRVAEEANEKLQEALVAQKQAKEESEIEKFRAVELEQAGIETVNKKEEEWQKELESV 180
Query: 179 RNQHALDVAALLSTTQELQQVKQELATTCDAKNQALNHADDATKIAETHAEKADLLSAEV 238
RNQHALDVA+L STT+EL++VKQEL CDAKNQALNHADDA K+AE HAEK ++ +AE+
Sbjct: 181 RNQHALDVASLASTTEELERVKQELTMMCDAKNQALNHADDAAKVAEVHAEKVEIYAAEL 240
Query: 239 TRLKALLDSKMETEASENEVILKLKTEIEALKQELEQARVYDEKLTEKETSIEQLNVELE 298
T+LKALLDS ET+AS+N +ILKLK EIEALK+EL++ YDE+LTEKET IEQLNVELE
Sbjct: 241 TQLKALLDSTQETKASDNNLILKLKAEIEALKKELDKGMSYDERLTEKETKIEQLNVELE 300
Query: 299 AAKMAESYAHSXXXXXXXXXXXXXMRVEETNKLERSASESLESVMKQLEGSNDLLDDAES 358
++MAESYA+S MRVEE+NKLERSASESLESVM QLE SN LL DAES
Sbjct: 301 TSRMAESYANSLLEEWKKKVEELEMRVEESNKLERSASESLESVMNQLEESNYLLHDAES 360
Query: 359 EISTLKEKVGLLEMTTGRQRTELEDSQHRLLMAKEENLELTKKVESLKSELETVKEERDQ 418
++LKEKVGLLEMT RQ+ +LEDS+ RLLMAKEENLE +KKVE+L+SELETVKEE+DQ
Sbjct: 361 VAASLKEKVGLLEMTIVRQKADLEDSERRLLMAKEENLEKSKKVEALESELETVKEEKDQ 420
Query: 419 ALNNEHLAASSVQTXXXXXXXXXXXXXXSRDEEEKSKKAMESLASALHEVSAEARETKEN 478
ALNNE LAAS VQT SR+EEEKSKKAMESLASALHEVSAE+RE KEN
Sbjct: 421 ALNNEQLAASHVQTLLEEKNKLINELDNSREEEEKSKKAMESLASALHEVSAESREAKEN 480
Query: 479 ILSSQAEKDSYENQIEDLKLVLKGTNEKYESMLDDARHEIDVLICSIENSKNAYENSKAE 538
LS+QAE++SYENQIEDLKLVLKGTNEKYESMLD+A+HEIDVLI SIENSKN +ENSKAE
Sbjct: 481 FLSTQAERESYENQIEDLKLVLKGTNEKYESMLDEAQHEIDVLIDSIENSKNVFENSKAE 540
Query: 539 WEQREFHLVSSLKKNEEENVFLEKEINRLVHLLKXXXXXXXXXXXXXXQLKENLKEVEGE 598
W+QRE HLVSSLKK EEEN EKEINRLV+LLK QLKENLKEVE E
Sbjct: 541 WDQRELHLVSSLKKTEEENAAAEKEINRLVYLLKQTEEESNANREEEAQLKENLKEVETE 600
Query: 599 AIQLQEALKEVTAENMKLKENLLDKENEMQSIFQENDELRFREAXXXXXXXXXXXXXXXX 658
AI LQEALKEVT+EN+KLKEN+LDKENEMQ++FQENDELR REA
Sbjct: 601 AIHLQEALKEVTSENVKLKENILDKENEMQNLFQENDELRAREAESIKKVEELSKLLDEA 660
Query: 659 XXXNHTD-ENGDLTDSEKDYDLLPKVVEFSEENGH--GGEDISK-VELPANXXXXXXXX- 713
NHT+ ENGDL+DSEKDYDLLPKVVEFSEENGH GGE+I K VEL
Sbjct: 661 TTRNHTEHENGDLSDSEKDYDLLPKVVEFSEENGHGYGGEEIPKVVELSLKQEEFKHNVL 720
Query: 714 XXXXXXNDK-NEEIEFPKLEDVNGXXXXXXXXXXXXXXXXXFKMWESCKIXXXXXXXXXX 772
NDK +E+IE PK +NG FKMWESC I
Sbjct: 721 EESMILNDKADEKIESPKPVKMNGKPKEDESKEKDDPEEVEFKMWESCTIEKKEFSFSPE 780
Query: 773 XXXXXXXXXXXXXIEE-----GSESFDKINGTSVTENIDD 807
+ E FDKINGT+V ENID+
Sbjct: 781 RELPEAKSLEEETESKTEEGGDGEGFDKINGTTVIENIDN 820
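
Each hit record in this report carries a statistics line of the form `Identities = a/n (p%), Positives = b/n (q%), Gaps = g/n (r%)`. The counts in this report are consistent with percentages that are truncated rather than rounded (for example, 581/820 is about 70.9% but is printed as 70%). The sketch below, with the hypothetical helpers `parse_stats` and `truncated_percent`, extracts the counts and recomputes the percentages under that assumption.

```python
import re

# Matches each "Name = num/den (pct%)" group on a statistics line.
STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_stats(line):
    """Return {name: (count, alignment_length, printed_percent)}."""
    stats = {}
    for name, num, den, pct in STATS_RE.findall(line):
        stats[name.lower()] = (int(num), int(den), int(pct))
    return stats

def truncated_percent(num, den):
    """Integer percentage with the fractional part dropped (assumed convention)."""
    return (100 * num) // den

if __name__ == "__main__":
    line = "Identities = 517/820 (63%), Positives = 581/820 (70%), Gaps = 13/820 (1%)"
    for name, (num, den, printed) in parse_stats(line).items():
        print(name, num, den, printed, truncated_percent(num, den))
```

Working from the raw counts instead of the printed percentages avoids losing precision when hits are compared or filtered.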
>IMGA|AC126792_12.4 Protein kinase PKN/PRK1, effector
chr05_pseudomolecule_IMGAG_V2 1085332-1089778 E
EGN_Mt071002 20080227
Length = 887
Score = 771 bits (1992), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 471/862 (54%), Positives = 546/862 (63%), Gaps = 67/862 (7%)
Query: 1 MASKSRSNLSEN---------------------------PNKASLATPXXXXXXXXXXXX 33
MASKSRS LSE PNK S ATP
Sbjct: 1 MASKSRSGLSETLPNKASSPATPNKATPASTLSKVPPATPNKTSPATPRVSKLGRGVSKP 60
Query: 34 QSDSPSPLQXXXXXXXXXXXXX-XXKPTVERKSPRPPSTPPDKQAPKA-EKGXXXXXXXX 91
+S+SPSPLQ KP ERKSPRP +TP DK P+A K
Sbjct: 61 ESESPSPLQTSRLSAEKASPRSLNSKPIAERKSPRP-TTPADKHTPRAVAKSSELQTQLN 119
Query: 92 XXXXDLKKAKEQLLQAEKEKVKATDELKEAQRVVEEANEKHREALVAQKRAEENSEIEKF 151
DLKKAKEQL+QAEKEK KA +ELKEAQR+ EEANEK REA+VAQKRAE++SEIEKF
Sbjct: 120 VAQEDLKKAKEQLIQAEKEKEKAINELKEAQRLSEEANEKLREAMVAQKRAEDDSEIEKF 179
Query: 152 RAVELEQAGIETVKKKEDEWQKELESVRNQHALDVAALLSTTQELQQVKQELATTCDAKN 211
RAVELEQAGIE +KKE+EWQ+ELESVRNQHALDV+ALL+TT ELQ+VKQEL TCDAKN
Sbjct: 180 RAVELEQAGIEAAQKKEEEWQRELESVRNQHALDVSALLATTNELQRVKQELVMTCDAKN 239
Query: 212 QALNHADDATKIAETHAEKADLLSAEVTRLKALLDSKMETEASENEVILKLKTEIEALKQ 271
QAL+HADDATKIAE H EK ++LSAE+ RLK LLDSK+ETEASEN +L+L+TEIEALK
Sbjct: 240 QALSHADDATKIAELHVEKVEILSAELIRLKGLLDSKLETEASENNTVLELQTEIEALKH 299
Query: 272 ELEQARVYDEKLTEKETSIEQLNVELEAAKMAESYAHSXXXXXXXXXXXXXMRVEETNKL 331
ELE+A+ YDEKL EKET IEQLNVE EAAKMAESYA S M+VEE N+L
Sbjct: 300 ELEKAKGYDEKLAEKETLIEQLNVESEAAKMAESYARSVLDECRKKVEELEMKVEEANQL 359
Query: 332 ERSASESLESVMKQLEGSNDLLDDAESEISTLKEKVGLLEMTTGRQRTELEDSQHRLLMA 391
ERSAS SLE+ KQLEG N+LL DAESEIS+LKEK+G+LEMT GRQR +LED++ LL A
Sbjct: 360 ERSASLSLETATKQLEGKNELLHDAESEISSLKEKLGMLEMTVGRQRGDLEDAERCLLAA 419
Query: 392 KEENLELTKKVESLKSELETVKEERDQALNNEHLAASSVQTXXXXXXXXXXXXXXSRDEE 451
KEEN+E++KK+ESL+SE+ETV +E+ QALNNE L+ASSVQT RDEE
Sbjct: 420 KEENIEMSKKIESLESEIETVSKEKAQALNNEKLSASSVQTLLEEKNKLINELEICRDEE 479
Query: 452 EKSKKAMESLASALHEVSAEARETKENILSSQAEKDSYENQ------------------- 492
EK+K AM+SLASALHEVSAEAR+TKE +L++QAE +SYE Q
Sbjct: 480 EKTKLAMDSLASALHEVSAEARDTKEKLLANQAEHESYETQIEDLKSDLEASKEKYESML 539
Query: 493 ------IEDLKLVLKGTNEKYESMLDDARHEIDVLICSIENSKNAYENSKAEWEQREFHL 546
IEDLK L+ + EKYESML+DA HEIDVL SIENSK NSKAEWEQ+E L
Sbjct: 540 NDAHHEIEDLKSDLEASKEKYESMLNDAHHEIDVLTSSIENSKMDILNSKAEWEQKEHDL 599
Query: 547 VSSLKKNEEENVFLEKEINRLVHLLKXXXXXXXXXXXXXXQLKENLKEVEGEAIQLQEAL 606
V +K+ EEEN L E+NRL+ LLK QLKEN+KEVE E I LQEAL
Sbjct: 600 VECIKRTEEENSSLGNEVNRLISLLKKTEEEANVKREEETQLKENMKEVEAEVIHLQEAL 659
Query: 607 KEVTAENMKLKENLLDKENEMQSIFQENDELRFREAXXXXXXXXXXXXXXXXXXXNHTDE 666
KE AE+MKLKE+LLDKENE Q+IFQEN++LR RE+ N +E
Sbjct: 660 KEAQAESMKLKESLLDKENEFQNIFQENEDLRSRESATIKKVEELSKSLEEATTRNTNEE 719
Query: 667 NGDLTDSEKDYDLLPKVVEFS-EENGHGGEDISKVELPANXXXXXXXXXXXXXXNDKNEE 725
NGDL+DSEKDYDLLPKVVEFS E G I K EL + +DK E+
Sbjct: 720 NGDLSDSEKDYDLLPKVVEFSEENGHGGEGGIFKEELSVS------AKEENIVLDDKFEK 773
Query: 726 IEFPKLEDVNGXXXXXXXXXXXXXXXXXFKMWESCKIXXXXXXXXXXXXXXXXXXXXXXX 785
E PK E+VNG KMWESCKI
Sbjct: 774 TESPKPENVNG-KLKEEDERKEKDDSVELKMWESCKIEKKEFSPEKGAEPEESFEEEVES 832
Query: 786 IEEGSESFDKINGTSVTENIDD 807
+G E+ NG SVTENI D
Sbjct: 833 KTDGGET----NGASVTENIGD 850
>IMGA|CT030028_22.5 hypothetical protein
chr03_pseudomolecule_IMGAG_V2 24922764-24924401 E
EGN_Mt071002 20080227
Length = 545
Score = 75.1 bits (183), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/182 (34%), Positives = 96/182 (52%), Gaps = 2/182 (1%)
Query: 451 EEKSKKAMESLASALHEVSAEARETKENILSSQAEKDSYENQIEDLKLVLKGTNEKYESM 510
EE SKKAM+ LA AL EV+ EA + K + SQ E + + E K +L+ T EKY+ +
Sbjct: 180 EENSKKAMDDLAFALKEVATEANQVKTKLTLSQVELEHTKGDAERWKTMLESTEEKYKEL 239
Query: 511 LDDARHEIDVLICSIENSKNAYENSKAEWEQREFHLVSSLKKNEEENVFLEKEINRLVHL 570
LD R E + + E + E S W +E V+ +K+ +EE + ++E +RL+ L
Sbjct: 240 LDATRKEAERFKNTAERLRLEAEESLLAWNGKETEFVTCIKRADEERLLAQQETSRLLDL 299
Query: 571 LKXXXXXXXXXXXXXXQLKENLKEVEGEAIQLQEALKEVTAENMKLKE--NLLDKENEMQ 628
L+ +L++ LK+ EA +EA + AEN +L++ +LL ENEM
Sbjct: 300 LQEAESKTKVSKEENQKLRDILKQALNEANVAKEASEIAKAENGRLQDSLSLLVHENEML 359
Query: 629 SI 630
I
Sbjct: 360 KI 361
>IMGA|AC149032_5.5 Prefoldin chr02_pseudomolecule_IMGAG_V2
7282447-7284738 H EGN_Mt071002 20080227
Length = 584
Score = 54.7 bits (130), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 82/154 (53%), Gaps = 13/154 (8%)
Query: 96 DLKKAKEQLLQAEKEKVKATDELKEAQRVVEEANEKHREALVAQKRAEENSEIEKFRAVE 155
+L K K++L AE K KA EL +A ++E +K +++ A E SE+ K +A E
Sbjct: 72 ELNKIKKKLESAESTKAKALIELDKANITLQELTKKLNTVRESKQSAMEESEVVKNQAKE 131
Query: 156 LEQA------GIETVKKKEDEWQKELESVRNQHALDVAALLSTTQELQQVKQELATTCDA 209
LE+A G E W++ELE R ++ V L ++ QEL +++Q+ +A
Sbjct: 132 LEKALSQKAIGYEA-------WKQELEHARKEYTTTVKELDASKQELNKIRQDFDAALEA 184
Query: 210 KNQALNHADDATKIAETHAEKADLLSAEVTRLKA 243
K A A +A + A+ ++EK + LS E+ +KA
Sbjct: 185 KLAAFQMAGEAQRSAKLNSEKINELSKEIATMKA 218
>IMGA|AC124963_7.5 t-snare chr07_pseudomolecule_IMGAG_V2
25847301-25851345 E EGN_Mt071002 20080227
Length = 610
Score = 49.7 bits (117), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 63/219 (28%), Positives = 111/219 (50%), Gaps = 36/219 (16%)
Query: 96 DLKKAKEQLLQAEKEKVKATDELKEAQRVVEEANEKHREALVAQKRAEEN-------SEI 148
DLK AK+QL +E K ++V EEA E ++ L K EE+ S
Sbjct: 89 DLKSAKDQLNSSESWK----------RKVQEEAEEAKKQILSLSKELEESRQQFSDLSAS 138
Query: 149 EKFRAVELEQAGIETVKKKEDEWQKELESVRNQHALDVAALLSTTQELQQVKQELATTCD 208
E+ R EL + + ++ WQ ELE+V+ QH++D +AL+S E+ ++K +L +
Sbjct: 139 EETRLQELSKIS----QDRDRAWQSELEAVQKQHSMDSSALVSAMNEIHKLKSQLERASE 194
Query: 209 AKNQALNHADDATKIAETHAEKADL---LSAEVTRLKALLDSKMETEASEN---EVILKL 262
+++ N+A HA+ DL LS ++ ++ L + + + SE+ EVI K+
Sbjct: 195 SESSQANNAKS------DHAQIQDLRMDLSEAISVMEKLRNEASDCKESESRALEVIGKM 248
Query: 263 KTEIEALKQELEQARVYDEKLTEKETSIEQLNVELEAAK 301
+ ++E + + +E R K TE + + L +ELE ++
Sbjct: 249 QMQLETVNKTVETLRSDGLKATE---AYKSLALELEQSR 284
>IMGA|AC146751_7.4 I/LWEQ; Prefoldin chr03_pseudomolecule_IMGAG_V2
12947283-12952487 E EGN_Mt071002 20080227
Length = 674
Score = 44.3 bits (103), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 51/183 (27%), Positives = 95/183 (51%), Gaps = 18/183 (9%)
Query: 96 DLKKAKEQLLQAEKEKVKATDELKEAQRVVEEANEK------HREALVAQKRAEENSEIE 149
+L K KEQ+ AE K +A EL+ AQR V++ +K RE+ V A ++ +
Sbjct: 98 ELNKLKEQVKNAETTKAQALVELERAQRTVDDLTQKLKLITESRESAVKATEAAKSQAKQ 157
Query: 150 KFRAVELEQAGIETVKKKEDEWQKELESVRNQHALDVAALLSTTQELQQVKQELATTCDA 209
K+ E G+ W++ELE+ ++A + L + Q+L++ +QE ++ DA
Sbjct: 158 KYG----ESDGVNGA------WKEELENAVQRYASIMTELDAAKQDLRKTRQEYDSSSDA 207
Query: 210 KNQALNHADDATKIAETHAEKADLLSAEVTRLK-ALLDSKMETEASENEVILKLKTEIEA 268
+ A+ ++A + + E+ LS E++ +K ++ +K+ S+ + L L TE +A
Sbjct: 208 RVSAVKRTEEAENAMKENTERVSELSKEISAVKESIEQTKLAYVESQQQQALVL-TEKDA 266
Query: 269 LKQ 271
L+Q
Sbjct: 267 LRQ 269
>IMGA|AC151424_2.5 Protein of unknown function DUF827, plant
chr01_pseudomolecule_IMGAG_V2 6841156-6837268 E
EGN_Mt071002 20080227
Length = 919
Score = 43.5 bits (101), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 61/117 (52%), Gaps = 7/117 (5%)
Query: 107 AEKEKVKATDELKEAQRVVEEANEKHREALVAQKRAEENSEIEKFRAVELEQAGIE--TV 164
+E+EKV+ EL A+R++EE A + +A ++SE+ K R E+EQ E +V
Sbjct: 331 SEQEKVQVLQELDSAKRLIEELKLSLERAQTEEHQARQDSELAKLRVEEMEQGIAEDSSV 390
Query: 165 KKKEDEWQKELESVRNQHALDVAALLSTTQELQQVKQELATTCDAKNQALNHADDAT 221
K +LE + ++ + L S EL ++ E A+ D K +A++ A+DA
Sbjct: 391 AAK-----AQLEVAKARYTSAITELTSVKHELDSLRVEYASLVDEKGEAIDKAEDAV 442
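
The E-values in this report can be related to the bit scores through the standard approximation E ≈ m · n · 2^(−S′), where S′ is the bit score, m the query length (837 letters here), and n the database size (10,231,785 letters here). The sketch below uses the raw lengths; BLAST itself applies effective (edge-corrected) lengths, so the printed E-values come out somewhat smaller than this estimate. The function name `approx_evalue` is hypothetical.

```python
def approx_evalue(bit_score, query_len, db_len):
    """Rough E-value from a bit score, ignoring effective-length corrections."""
    return query_len * db_len * 2.0 ** (-bit_score)

if __name__ == "__main__":
    QUERY_LEN = 837          # from "(837 letters)"
    DB_LEN = 10_231_785      # from "10,231,785 total letters"
    # Bit scores and printed E-values taken from the hit records above.
    for bits, reported in [(75.1, 1e-13), (54.7, 1e-07), (49.7, 5e-06)]:
        estimate = approx_evalue(bits, QUERY_LEN, DB_LEN)
        print(f"{bits:5.1f} bits: estimate {estimate:.1e}, reported {reported:.0e}")
```

For these hits the estimate lands within a small factor of the reported value, which is the expected effect of omitting the effective-length correction.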