Miyakogusa Predicted Gene
- chr3.CM0127.50.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr3.CM0127.50.nc + phase: 0
(1339 letters)
Database: Medicago_aa2.0
38,834 sequences; 10,231,785 total letters
Searching..................................................done
                                                                     Score  E
Sequences producing significant alignments:                         (bits)  Value
IMGA|CU302334_5.4 Aldehyde dehydrogenase; Zinc finger, FYVE/PHD-...  1040   0.0
IMGA|AC147775_11.4 Zinc finger, FYVE/PHD-type chr01_pseudomolecu...    89   9e-18
IMGA|CU182773_11.3 Zinc finger, FYVE/PHD-type chr03_pseudomolecu...    71   4e-12
IMGA|AC146664_14.4 Zinc finger, FYVE/PHD-type , related chr06_ps...    64   4e-10
IMGA|CU302337_3.4 Zinc finger, FYVE/PHD-type chr05_pseudomolecul...    64   6e-10
IMGA|AC143340_40.5 Zinc finger, FYVE/PHD-type chr03_pseudomole...      50   5e-06
IMGA|AC143340_38.5 Nuclear protein SET; Zinc finger, FYVE/PHD-ty...    43   0.001
>IMGA|CU302334_5.4 Aldehyde dehydrogenase; Zinc finger, FYVE/PHD-type
chr05_pseudomolecule_IMGAG_V2 1012576-1019588 E
EGN_Mt071002 20080227
Length = 1435
Score = 1040 bits (2689), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 635/1201 (52%), Positives = 755/1201 (62%), Gaps = 144/1201 (11%)
Query: 1 MRLEPGTCKVCSAPCSTCIHLNRVAMGSKAEEYSDENCRVGEAN-QYXXXXXXXXXXXXR 59
MRLE GTC VCSAPCS+C+H+N EE+SD+NCR GEAN Q R
Sbjct: 1 MRLESGTCNVCSAPCSSCMHVNHAP-----EEFSDDNCRSGEANVQNSMNEGNVHSLSSR 55
Query: 60 ACNRLKDAVTKTSNTPSDHSSHDFLSENAESKPTLSEKYQDSKCLEGLDDSISCNNRASK 119
AC L+ V++TSN S SSHD LSENAES+ L KYQD LEG DD+ SC +RAS
Sbjct: 56 ACENLQHGVSETSNMLSVSSSHDSLSENAESRQILLNKYQDPNHLEGHDDNTSCISRASD 115
Query: 120 ANLVSGSHQINSDGINISCSSASVSLLGKEGSRIGTSVDMSGLSDILSSKDAAIPENLSE 179
AN + IPE S+
Sbjct: 116 AN-------------------------------------------------SRIPEKNSK 126
Query: 180 CCIENADTSLTKERESIIVSGEKS-------LTVTAKVPLKIYPNSEADTDND-YCNAKD 231
C IEN +SLTKE + SGEK + T+ LK+ P S+AD DND C+AK
Sbjct: 127 CSIENCSSSLTKESAPVATSGEKCTANKDKLIEGTSNDSLKVCPKSQADPDNDKVCDAKV 186
Query: 232 INHRYSAHDILHENAEEPVKSPGVPVPQXXXXXXXXXIVEHDVKVCDICGDSGREDLLAI 291
+ + SAHD HE AEE VKSP Q +VEHDVKVCDICGD+GREDLLAI
Sbjct: 187 EDCKCSAHDGHHEKAEELVKSPRKQESQSENESDESDVVEHDVKVCDICGDAGREDLLAI 246
Query: 292 CCRCSDGAEHTYCMREMLEKVPEGDWLCEECKHAEGSANLRLDAEVNKNRKVSSSSQISG 351
CCRC+DGAEHTYCMREMLEK+PEGDWLCEEC+ A + N RLD E KN K +S+SQ+SG
Sbjct: 247 CCRCTDGAEHTYCMREMLEKLPEGDWLCEECQDAVEAENKRLDIEGKKNIKTTSTSQVSG 306
Query: 352 KRPSESVEVA-IAAKRQALESSTGSPKASNPKKTVSLSRESSFKSLDNGKVKPGQQIPIR 410
KR +++EVA AAKRQALE S GSPK S+PKK V LSRESSFKS D K K G +P R
Sbjct: 307 KRRPDNIEVAPPAAKRQALELSKGSPKVSSPKKLVPLSRESSFKSSDKLKGKSGLLMPPR 366
Query: 411 NHHGGDDIALARSLSTGPRSQAARSTLLKXXXXXXXXXKPRAKLIDEVVPQKQKGGGQYI 470
NH GGDD ARS S G R Q ++S LLK KP+ K+ DEV P + KGG +
Sbjct: 367 NHSGGDDAQTARSPSVGLRGQISKSMLLKSNSSNNLNSKPKVKIGDEVFPPRPKGGHEQT 426
Query: 471 SKNMDTPAGLMSKSMSFKSSNLGR--ATDSKVKMLSSKPGTAQDLKGSRHGKESGVFDRK 528
SKNM+T A + S+S FKSS+LGR A +SKVKML KP T QDLKGSRH KESG DRK
Sbjct: 427 SKNMETTARMTSRSTLFKSSSLGRSSAIESKVKML-PKPATIQDLKGSRHSKESGSLDRK 485
Query: 529 TLSRIDRP----VVSASKGDQKLTPRGETA-KPSAVNHNREFKVNQDGKLNSLSKSMNNI 583
LSR DRP VVS KGDQKLTPRGET KPSAVN NRE K+NQDGKL++ SKS NNI
Sbjct: 486 YLSRNDRPVASSVVSTPKGDQKLTPRGETVIKPSAVN-NRESKINQDGKLSASSKSTNNI 544
Query: 584 GHKSRELQ---ERTSTSGHETQQNGLPRSRDTANQIDKTKDGCSDRVRSSLTNTS----- 635
KS E Q ERT S E Q+ LPRSR+TANQ++K+++ SDR+R + S
Sbjct: 545 SRKSVEPQGSSERTIASNDEALQDVLPRSRETANQVEKSRESLSDRLRPVVPTASKSSYC 604
Query: 636 ----------ECCTIGGTQELGDEVSVNATSSSKEEMHNGNSLKAAIHAALLRRPEIHKK 685
E CT G QE G E+SV A+S SKEEMH GN LKAAI AALL+RPEI++K
Sbjct: 605 QKCEEFGHSLEGCTAGNLQESGAEISVTASSISKEEMHKGNKLKAAIQAALLKRPEIYRK 664
Query: 686 KDVPERTGEFPTSGTDLKCEVSYQDRVSVSNTLKNSISTEETNAKQETLDNSTFETSKCL 745
K+V +T E PTSGT+L CE + +D+V VSNTLKNSISTEET +QE L+NST E+SKC
Sbjct: 665 KEVSSQTDEIPTSGTELNCEATSRDQVLVSNTLKNSISTEETREQQEVLENSTSESSKCS 724
Query: 746 SANNLKQLHFCPADFRSQPRKSDSVGSASGKPVVKDLLNRALEISNVISKTSAIPEYKYI 805
SA++LKQL+ CP D SQ KSD VG + KP+V+DL +A+ IS+V+SK A PEY+YI
Sbjct: 725 SASDLKQLNSCPTDLCSQLGKSDLVGLNAQKPLVRDLSRKAVAISSVVSKMLAFPEYEYI 784
Query: 806 WQGVFEVHRSGKPPDLYTGIQAHLSSCASPKVLDVVNKFLPEVSLHEVSRLSTWPSQFHQ 865
WQGVFEVHR+GKPP+L TG+QAHLSS ASPKVL+VV KF PEVSL+EVSRLSTWPSQFH
Sbjct: 785 WQGVFEVHRNGKPPELCTGVQAHLSSSASPKVLEVVTKFSPEVSLNEVSRLSTWPSQFHH 844
Query: 866 GGGAKEDNIALYFFAKDIESYERYYKSLLDHMIKNDLALKGTFDGVELLIFTSNQLPENS 925
GGA+EDNIALYFFA+D+E +R+YK LLDHMI+NDLALKG FDGVELLIF SNQLPENS
Sbjct: 845 -GGAREDNIALYFFARDVE-RQRHYKGLLDHMIRNDLALKGIFDGVELLIFPSNQLPENS 902
Query: 926 QRWNTLFFLWGIFRGRRINHSDSAKKICIPSLNVIPNEKCFPTAVMTLSETPCSPARVGA 985
QRWN L FLWG+FRGRR++HS SAK ICIPSLN +P E+ TAV+TLSE C +
Sbjct: 903 QRWNMLLFLWGVFRGRRVDHSGSAKSICIPSLNAMPVEENSSTAVVTLSER-CLSKGIDE 961
Query: 986 ESIACCGKAGSALLPSTSIEQA--------------HILKGSAPVHGQD----------- 1020
+ I KAG+ L STS +Q+ + P+ D
Sbjct: 962 KPIN-SDKAGNTLPFSTSQDQSPTIASNNTDINHQTQLCSQQVPLEMSDGTIDSKTASRV 1020
Query: 1021 ------------------------RESKPLKATRTSEMNMMMETKTNYDISVGQEDSFSS 1056
ESKP + T M+E T+ S QE++
Sbjct: 1021 SKSCQQTKFTGSSLKASVVEDERCTESKPSEEMGTGVSYKMVEASTDSASSDKQENTLCQ 1080
Query: 1057 RIPYVGNEEIGTASNISKDEISESKNNDENQQRPKRKQIEDGLDINMEAKFQGEQIETGV 1116
IP V N++ A NISK+EI E N DE+QQR KRKQ ED I++E +
Sbjct: 1081 AIPSVSNQDRDAACNISKNEILERMNCDEDQQRTKRKQKEDCHYIDLEETIDNHETHAAS 1140
Query: 1117 N 1117
N
Sbjct: 1141 N 1141
Score = 136 bits (342), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 113/303 (37%), Positives = 141/303 (46%), Gaps = 67/303 (22%)
Query: 1061 VGNEEIGTASNISKDEISESKNNDENQQRPKRKQIEDGLDINMEAKFQGEQIETGVNCQL 1120
V N+E NI+KD IS+ K DE+QQR KRK ED I++EA Q + G + QL
Sbjct: 1176 VENQETDAGINITKDNISD-KIGDEDQQRLKRKAKEDCHYIDLEAPLQEDLSTEGADYQL 1234
Query: 1121 PHDKRVRHIDLSHTVVEASAVSCQNMPWDKVNXXXXXXXXXXXXXRNFSGIHGSYSSGVK 1180
P+DK V H+D S Q MPW++VN R S I+ +SSGV
Sbjct: 1235 PNDKEVHHVD-------PSVAGLQKMPWNEVNGKLEDAESSRKKLRT-SEIYDRHSSGVG 1286
Query: 1181 DPSSGNFASHVNDFSSCSSVEVKGCKEACDEKIIHEDLGRMERTFFPSDANN-------- 1232
D KGC+EA EKII EDLG MERTFFP D N
Sbjct: 1287 D---------------------KGCEEASVEKIIREDLGTMERTFFPVDTQNINGLQSVL 1325
Query: 1233 ---KLKGPHEHGDRFQARIPDLALALGGETK-------PSPKGMLPFFAGTADKKNNQEK 1282
+KG HE + IP+L LALG ET+ PKGMLPF G A+KKNN
Sbjct: 1326 NTMAMKGIHER----ENVIPNLNLALGDETEMPPSPPPAGPKGMLPFLVGPAEKKNNHAD 1381
Query: 1283 TPDLLEDEKKNDTDSVAAXXXXXXXXXXXDKEQIRPVSES----DQHVNAP-LLLFG-KF 1336
P + D AA + EQ + S++ D H +P LLFG ++
Sbjct: 1382 RP---------EDDVAAASLSLSLSFPSSNMEQTKASSKAELLPDGHRPSPSFLLFGRRY 1432
Query: 1337 TDK 1339
TDK
Sbjct: 1433 TDK 1435
>IMGA|AC147775_11.4 Zinc finger, FYVE/PHD-type
chr01_pseudomolecule_IMGAG_V2 26454892-26452159 E
EGN_Mt071002 20080227
Length = 369
Score = 89.4 bits (220), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 78/142 (54%), Gaps = 3/142 (2%)
Query: 805 IWQGVFEVHRSGKPPDLYTGIQAHLSSCASPKVLDVVNKFLPEV-SLHEVSRLSTWPSQF 863
IW+G K G+ AHLS+ ASPKVLD + KF P V S + R WP+ F
Sbjct: 223 IWRGNLIFCDKSKTIGRVNGLLAHLSNIASPKVLDEM-KFFPHVLSADLLPRSEVWPNSF 281
Query: 864 HQGGGAKEDNIALYFFAKDIESYERYYKSLLDHMIKNDLALKGTFDGVELLIFTSNQLPE 923
+ G E +IALYFF + + + L+D +I + A++ + LLIF S+ LP
Sbjct: 282 KEEGPTDE-SIALYFFPGNRRLSIKAFDKLVDDIICTEAAVRVVTENAVLLIFPSDLLPI 340
Query: 924 NSQRWNTLFFLWGIFRGRRINH 945
Q++ T ++LWG+F+ ++ +H
Sbjct: 341 RHQKFQTKYYLWGVFKKKQTSH 362
>IMGA|CU182773_11.3 Zinc finger, FYVE/PHD-type
chr03_pseudomolecule_IMGAG_V2 36457083-36464644 E
EGN_Mt071002 20080227
Length = 542
Score = 70.9 bits (172), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 80/168 (47%), Gaps = 13/168 (7%)
Query: 787 LEISNVISKTSAIPEY-KYI------WQGVFEVHRSGKPPDLYTGIQAHLSSCASPKVLD 839
+E S + S P Y KY W G F++ + +Y G +A + K +
Sbjct: 252 MEKSKIQSFVENFPRYQKYFPSSIRAWSGQFQIRQEAASGGIYDGFEAQPPCTINRKAYN 311
Query: 840 VVNKFLPEVSLHEVSRLSTWPSQFHQGGGAKEDNIALYFFAKDI-ESYERYYKSLLDHMI 898
+ +K + L + L+ +F + +D IALYFF D E + +LL M
Sbjct: 312 LSSKIPSVLQLESLPALNVLTDEFQNYSPSLQD-IALYFFPSDNNERSRKNLNNLLKFMN 370
Query: 899 KNDLALKGTFDGVELLIFTSNQLPENSQRWNTL----FFLWGIFRGRR 942
+L L+ +GVEL +FTS++L ++S+ + +FLWG+FR ++
Sbjct: 371 DENLMLRSLINGVELFLFTSHKLSDDSRGTIAVVHEGYFLWGVFRTKK 418
Score = 60.8 bits (146), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 25/58 (43%), Positives = 34/58 (58%), Gaps = 1/58 (1%)
Query: 272 HDVKVCDICGDSGREDLLAICCRCSDGAEHTYCMREMLEKVPEGDWLCEECKHAEGSA 329
H V+ CDICG G +++ C +C EH YCM+ L +VP+ WLCE C+ GS
Sbjct: 9 HGVEPCDICGHFGFGEVIVTCSKCKVNREHVYCMKINLMEVPDY-WLCEPCQSNNGST 65
>IMGA|AC146664_14.4 Zinc finger, FYVE/PHD-type , related
chr06_pseudomolecule_IMGAG_V2 8876319-8873722 H
EGN_Mt071002 20080227
Length = 149
Score = 63.9 bits (154), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 77/153 (50%), Gaps = 14/153 (9%)
Query: 811 EVHRSGKPPDLYTGIQAHLSSCASPKVLDVVNKFLPE-VSLHEVSRLSTWPSQFHQGGGA 869
EV GK +L HLS+ A PKV + ++LP +S + + + + WP F + G
Sbjct: 2 EVSNIGKVIEL----MGHLSTLACPKVHEEA-RYLPNMISANFLQKSTVWPESF-KNSGT 55
Query: 870 KEDNIALYFFAKDIESYERYYKSLLDHMIKNDLALKGTFDGVELLIFTSNQLPENSQRWN 929
+I +YF + S + + L++ MI + LA+K +LLIF S LP + +
Sbjct: 56 NNFSIGIYFLSPHNPSVDGSFDELVEEMISDKLAIKVGVVNADLLIFPSTDLPSEYRTFQ 115
Query: 930 TLFFLWGIFRGRR-------INHSDSAKKICIP 955
+ ++LWG+FR ++ I++ +K+ P
Sbjct: 116 SRYYLWGVFRRKQTSIKNNYIDYKIEKRKLYFP 148
>IMGA|CU302337_3.4 Zinc finger, FYVE/PHD-type
chr05_pseudomolecule_IMGAG_V2 36369898-36371873 E
EGN_Mt071002 20080227
Length = 227
Score = 63.5 bits (153), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 11/124 (8%)
Query: 824 GIQAHLSSCASPKVLDVVNKFLPEV-SLHEVSRLSTWPSQFHQGGGAKEDNIALYFFAKD 882
G+ AHLS PKV + LP+V S + R WP F + G + NIALY F +
Sbjct: 98 GLMAHLSDLVCPKVWKE-TELLPDVLSADLLPRSEVWPDSFKKDGPTNK-NIALYLFPE- 154
Query: 883 IESYERYYKSLLDHMIKNDL----ALKGTFDGVELLIFTSNQLPENSQRWNTLFFLWGIF 938
YE LD++I + AL+ + +LLIF S LP Q++++ +LWG+F
Sbjct: 155 ---YEGPSMDALDNLIVEVIHAEAALRVVTENAQLLIFPSTLLPIQHQKFDSKNYLWGVF 211
Query: 939 RGRR 942
R ++
Sbjct: 212 RKKQ 215
Score = 43.1 bits (100), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 25/52 (48%)
Query: 276 VCDICGDSGREDLLAICCRCSDGAEHTYCMREMLEKVPEGDWLCEECKHAEG 327
VC CGD G ++ C C D A H YC+ + E WLCE+C G
Sbjct: 5 VCLTCGDIGFPEVRVFCNNCKDCALHRYCLDGPVIFTEEVIWLCEDCDEETG 56
>IMGA|AC143340_40.5 Zinc finger, FYVE/PHD-type
chr03_pseudomolecule_IMGAG_V2 34815936-34819491 E
EGN_Mt071002 20080227
Length = 161
Score = 50.4 bits (119), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 38/56 (67%), Gaps = 1/56 (1%)
Query: 887 ERYYKSLLDHMIKNDLALKGTFDG-VELLIFTSNQLPENSQRWNTLFFLWGIFRGR 941
E + +LD++I+ D ALK + +ELLIF+S+ LP + +R T ++LWGIF+ +
Sbjct: 103 EMIFDRVLDNVIEKDNALKAVINNNLELLIFSSHLLPPDERRICTKYYLWGIFKSK 158
>IMGA|AC143340_38.5 Nuclear protein SET; Zinc finger, FYVE/PHD-type
chr03_pseudomolecule_IMGAG_V2 34837562-34841279 H
EGN_Mt071002 20080227
Length = 390
Score = 42.7 bits (99), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/107 (27%), Positives = 49/107 (45%), Gaps = 17/107 (15%)
Query: 276 VCDICGDSGREDLLAICCRCSDGAEHTYCMREMLEKVPEGDWLCEECKHAEGSA------ 329
+C+ CG + + L +C +C +G H C+R ++ +VP G W+C +C +
Sbjct: 75 LCEQCGSGEQPEELLLCDKCDNGF-HMKCVRPIVVRVPIGPWICPKCSDVKVKKLKKLSQ 133
Query: 330 -------NLRLDAEVNKNRKVSSSSQISGKRPSESVEVAIAAKRQAL 369
LR D+ NR +SSQ + KR + + KR+ L
Sbjct: 134 KKILDFFGLRRDSLFGNNR---ASSQDAMKRRRRPRPLVVQKKRRRL 177