Miyakogusa Predicted Gene
- chr1.CM0033.650.nc
BLASTP 2.2.18 [Mar-02-2008]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= chr1.CM0033.650.nc + phase: 0
(551 letters)
Database: Medicago_aa2.0
38,834 sequences; 10,231,785 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
IMGA|AC128638_51.4 WD40-like - 3prime partial chr07_pseudomolecu... 270 8e-73
IMGA|AC126009_25.4 WD40-like chr07_pseudomolecule_IMGAG_V2 22673... 50 3e-06
IMGA|CR936327_16.4 Splicing factor motif; WD40-like chr05_pseudo... 49 7e-06
IMGA|AC174371_34.5 WD40-like chr08_pseudomolecule_IMGAG_V2 29053... 48 8e-06
IMGA|AC147435_58.4 Lissencephaly type-1-like homology motif; CTL... 44 1e-04
IMGA|AC142394_45.4 Lissencephaly type-1-like homology motif; CTL... 44 1e-04
IMGA|CU019603_21.4 Lissencephaly type-1-like homology motif; WD4... 44 2e-04
IMGA|AC149491_27.4 Cytochrome cd1-nitrite reductase-like, C-term... 44 2e-04
IMGA|AC153125_50.5 WD40-like chr08_pseudomolecule_IMGAG_V2 88700... 43 4e-04
IMGA|AC135317_6.4 WD40-like chr01_pseudomolecule_IMGAG_V2 166106... 42 7e-04
IMGA|CT485797_2.5 Lissencephaly type-1-like homology motif; CTLH... 42 0.001
>IMGA|AC128638_51.4 WD40-like - 3prime partial
chr07_pseudomolecule_IMGAG_V2 25639283-25636832 E
EGN_Mt071002 20080227
Length = 248
Score = 270 bits (691), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 140/216 (64%), Positives = 152/216 (70%), Gaps = 1/216 (0%)
Query: 26 GLKTYFKTPEGRYKLQYEKTHPSGLLHYAHGKTVTQVTLAHLKDKXXXXXXXXXXXXXXX 85
GLKTYFKTPEGRYKLQ++KTHPSGLL + HGKTV+ VTLAHLK+K
Sbjct: 31 GLKTYFKTPEGRYKLQFDKTHPSGLLQFNHGKTVSMVTLAHLKEKPAPLTPTASSSSFSA 90
Query: 86 XXXXXXXXXXXLLGGSNGSRALSFVXXXXXXXXXXXXXXXXXXXXXXXXXXVANPNFDGK 145
LLGGSNG+RALSFV VANPNFDGK
Sbjct: 91 SSGVRSAAAR-LLGGSNGNRALSFVGGNGSSKSNGGASRIGSIGSSSLSSSVANPNFDGK 149
Query: 146 GTYLIFNVGDAIFISDLNSQDKDPIKSIHFSNSNPVCHAFDQDSKDGHDLLIGLSSGDVY 205
G+YL+FN GDAI ISDLNSQDKDPIKSIHFSNSNPVCHAFDQD+KDGHDLLIGL SGDVY
Sbjct: 150 GSYLVFNAGDAILISDLNSQDKDPIKSIHFSNSNPVCHAFDQDAKDGHDLLIGLFSGDVY 209
Query: 206 SVSLRQQLQDVGKKLVGAQHYNKDGSVNNSRCTCIS 241
SVSLRQQLQDVGKK+VGA HYNKDG +NN + IS
Sbjct: 210 SVSLRQQLQDVGKKIVGAHHYNKDGILNNRQDFMIS 245
>IMGA|AC126009_25.4 WD40-like chr07_pseudomolecule_IMGAG_V2
22673342-22677128 E EGN_Mt071002 20080227
Length = 345
Score = 49.7 bits (117), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 8/101 (7%)
Query: 303 QGSINSISFSNDGAYLATVGRDGYLRVFD---YLKE----QLICGGK-SYYGALLCCAWS 354
Q I S+ S DG+YL T G D L ++D Y + +++ G + ++ LL C WS
Sbjct: 224 QDMITSMQLSPDGSYLLTNGMDCKLCIWDMRPYAPQNRCVKILEGHQHNFEKNLLKCGWS 283
Query: 355 MDGKYILTGGEDDLVQVWSMEDRKVVAWGEGHNSWVSGVAF 395
DG + G D +V +W R+++ GHN V+ F
Sbjct: 284 PDGSKVTAGSSDRMVYIWDTTSRRILYKLPGHNGSVNECVF 324
>IMGA|CR936327_16.4 Splicing factor motif; WD40-like
chr05_pseudomolecule_IMGAG_V2 8848188-8845304 E
EGN_Mt071002 20080227
Length = 501
Score = 48.5 bits (114), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/91 (25%), Positives = 50/91 (54%)
Query: 306 INSISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAWSMDGKYILTGGE 365
+ S++F +DG+ A+ G+D RV+D ++ + + + + ++S +G ++ TGGE
Sbjct: 337 VYSLAFHHDGSLAASCGKDALARVWDLRTKRSLLALEGHVKPIRGISFSPNGYHLATGGE 396
Query: 366 DDLVQVWSMEDRKVVAWGEGHNSWVSGVAFD 396
D+ ++W + K + H+ +S V F+
Sbjct: 397 DNTCRIWDLRKNKSLYTIAAHSKLISQVKFE 427
>IMGA|AC174371_34.5 WD40-like chr08_pseudomolecule_IMGAG_V2
29053898-29056542 E EGN_Mt071002 20080227
Length = 326
Score = 48.1 bits (113), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 31/130 (23%), Positives = 60/130 (46%), Gaps = 14/130 (10%)
Query: 306 INSISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAWSM---DGKYILT 362
++ + FS + ++ D LR+++Y + + + + C + S +GKY++
Sbjct: 208 VSYVKFSPNAKFILVGTLDNNLRLWNYSTGKFLKTYTGHVNSKYCISSSFSITNGKYVVG 267
Query: 363 GGEDDLVQVWSMEDRKVVAWGEGHNSWVSGVAFDSYWSSPNSNDNGETIMYRFGSVGQDT 422
G ED+ + +W ++ RK+V EGH V V+ P N M G++G D
Sbjct: 268 GSEDNCIYLWGLQSRKIVQKLEGHTDSVVSVS-----CHPTEN------MIASGALGNDK 316
Query: 423 QLLLWDLEMD 432
+ +W + D
Sbjct: 317 TVKIWTQQKD 326
Score = 42.7 bits (99), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/103 (22%), Positives = 51/103 (49%)
Query: 294 NPVARWHICQGSINSISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAW 353
+P+ ++ Q ++ ++FS+D YL + D +R++D L+ + + C +
Sbjct: 69 SPMQQYEGHQHGVSDLAFSSDSRYLVSASDDKTIRLWDVPTGSLVKTLHGHTNYVFCVNF 128
Query: 354 SMDGKYILTGGEDDLVQVWSMEDRKVVAWGEGHNSWVSGVAFD 396
+ I++G D+ V+VW ++ K + H+ V+ V F+
Sbjct: 129 NPQSNVIVSGSFDETVRVWDVKSGKCLKVLPAHSDPVTAVDFN 171
>IMGA|AC147435_58.4 Lissencephaly type-1-like homology motif; CTLH,
C-terminal to LisH motif; WD40-like
chr03_pseudomolecule_IMGAG_V2 11652773-11640514 F
EGN_Mt071002 20080227
Length = 513
Score = 44.3 bits (103), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 47/94 (50%), Gaps = 9/94 (9%)
Query: 311 FSNDGAYLATVGRDGYLRVFDY----LKEQLICGGKSYYGA----LLCCAWSMDGKYILT 362
FS DG YL + DG++ V+DY LK+ L + + +LC +S D + I +
Sbjct: 222 FSPDGQYLVSCSIDGFIEVWDYISGKLKKDLQYQAEETFMMHDEPVLCVDFSRDSEMIAS 281
Query: 363 GGEDDLVQVWSMEDRKVVAWGE-GHNSWVSGVAF 395
G D ++VW + + + E H+ V+ V+F
Sbjct: 282 GSTDGKIKVWRIRTAQCLRRLEHAHSQGVTSVSF 315
>IMGA|AC142394_45.4 Lissencephaly type-1-like homology motif; CTLH,
C-terminal to LisH motif; WD40-like
chr03_pseudomolecule_IMGAG_V2 11566822-11554563 F
EGN_Mt071002 20080227
Length = 513
Score = 44.3 bits (103), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 47/94 (50%), Gaps = 9/94 (9%)
Query: 311 FSNDGAYLATVGRDGYLRVFDY----LKEQLICGGKSYYGA----LLCCAWSMDGKYILT 362
FS DG YL + DG++ V+DY LK+ L + + +LC +S D + I +
Sbjct: 222 FSPDGQYLVSCSIDGFIEVWDYISGKLKKDLQYQAEETFMMHDEPVLCVDFSRDSEMIAS 281
Query: 363 GGEDDLVQVWSMEDRKVVAWGE-GHNSWVSGVAF 395
G D ++VW + + + E H+ V+ V+F
Sbjct: 282 GSTDGKIKVWRIRTAQCLRRLEHAHSQGVTSVSF 315
>IMGA|CU019603_21.4 Lissencephaly type-1-like homology motif; WD40
associated region in TFIID subunit; WD40-like
chr03_pseudomolecule_IMGAG_V2 19953355-19962972 E
EGN_Mt071002 20080227
Length = 669
Score = 43.5 bits (101), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/131 (24%), Positives = 55/131 (41%), Gaps = 15/131 (11%)
Query: 309 ISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAWSMDGKYILTGGEDDL 368
+ FS G Y A+ D RV+ + Q + + + C W + YI TG D
Sbjct: 467 VQFSPMGHYFASSSHDRTARVWSMDRIQPLRIMAGHLSDVDCVQWHANCNYIATGSSDKT 526
Query: 369 VQVWSMEDRKVVAWGEGHNSWVSGVAFDSYWSSPNSNDNGETIMYRFGSVG-QDTQLLLW 427
V++W ++ + V GH + ++ SP+ R+ + G +D +++W
Sbjct: 527 VRLWDVQSGECVRVFVGHRGMILSLSM-----SPDG---------RYMASGDEDGTIMMW 572
Query: 428 DLEMDEIVVPL 438
DL V PL
Sbjct: 573 DLSSGRCVTPL 583
Score = 43.5 bits (101), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 24/92 (26%), Positives = 46/92 (50%)
Query: 306 INSISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAWSMDGKYILTGGE 365
++ + + + Y+AT D +R++D + + + G +L + S DG+Y+ +G E
Sbjct: 506 VDCVQWHANCNYIATGSSDKTVRLWDVQSGECVRVFVGHRGMILSLSMSPDGRYMASGDE 565
Query: 366 DDLVQVWSMEDRKVVAWGEGHNSWVSGVAFDS 397
D + +W + + V GH S V +AF S
Sbjct: 566 DGTIMMWDLSSGRCVTPLVGHTSCVWSLAFSS 597
>IMGA|AC149491_27.4 Cytochrome cd1-nitrite reductase-like,
C-terminal haem d1 chr02_pseudomolecule_IMGAG_V2
8574069-8558070 E EGN_Mt071002 20080227
Length = 1826
Score = 43.5 bits (101), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 48/113 (42%), Gaps = 10/113 (8%)
Query: 295 PVARW-HICQGSINSISFSNDGAYLATVGRDGYLRVFDYL---------KEQLICGGKSY 344
P RW H ++ + G L R +R Y K Q I + +
Sbjct: 209 PYMRWPHTKANQVHGLHLREIGGGLPRHHRAPSIRAACYAIAKPSTMVQKMQNIKRIRGH 268
Query: 345 YGALLCCAWSMDGKYILTGGEDDLVQVWSMEDRKVVAWGEGHNSWVSGVAFDS 397
A+ C + G+Y++TG +D LV++WSME +A GH ++ +A S
Sbjct: 269 CNAVYCAIFDRSGRYVITGSDDRLVKIWSMETAYSLASCRGHVGDITDLAVSS 321
>IMGA|AC153125_50.5 WD40-like chr08_pseudomolecule_IMGAG_V2
8870067-8857000 H EGN_Mt071002 20080227
Length = 745
Score = 42.7 bits (99), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 17/93 (18%), Positives = 47/93 (50%)
Query: 306 INSISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAWSMDGKYILTGGE 365
+ S+ + + L + G+D ++++D + + + +LC W+ +G ++LT +
Sbjct: 312 VKSVDWHPTKSLLVSGGKDNLVKLWDAKSGKELSSFHGHKNTVLCVKWNQNGNWVLTASK 371
Query: 366 DDLVQVWSMEDRKVVAWGEGHNSWVSGVAFDSY 398
D +++++ + K + GH V+ +A+ +
Sbjct: 372 DQIIKLYDIRAMKELESFRGHRKDVTTLAWHPF 404
>IMGA|AC135317_6.4 WD40-like chr01_pseudomolecule_IMGAG_V2
16610696-16612398 E EGN_Mt071002 20080227
Length = 316
Score = 42.0 bits (97), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 46/91 (50%)
Query: 306 INSISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAWSMDGKYILTGGE 365
IN I++S+D Y+ + D LR++D + + + + C ++ YI++G
Sbjct: 72 INDIAWSSDSHYICSASDDKTLRIWDANTGDCVKTLRGHGHNVFCVNFNPQSNYIVSGSF 131
Query: 366 DDLVQVWSMEDRKVVAWGEGHNSWVSGVAFD 396
D+ V+VW ++ K V + H V+ V F+
Sbjct: 132 DETVRVWEVKTGKSVHVIKAHAMPVTSVDFN 162
>IMGA|CT485797_2.5 Lissencephaly type-1-like homology motif; CTLH,
C-terminal to LisH motif; WD40-like
chr05_pseudomolecule_IMGAG_V2 32812940-32817176 E
EGN_Mt071002 20080227
Length = 572
Score = 41.6 bits (96), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/80 (27%), Positives = 42/80 (52%)
Query: 303 QGSINSISFSNDGAYLATVGRDGYLRVFDYLKEQLICGGKSYYGALLCCAWSMDGKYILT 362
Q I+S+S+S + L T G + +R +D + + + L+ C W GK+IL+
Sbjct: 308 QKPISSVSWSPNDEELLTCGVEESIRRWDVSTGKCLQIYEKAGAGLVSCTWFPSGKHILS 367
Query: 363 GGEDDLVQVWSMEDRKVVAW 382
G D + +W ++ ++V +W
Sbjct: 368 GLSDKSICMWELDGKEVESW 387