Miyakogusa Predicted Gene
- Lj4g3v3061530.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v3061530.1 Non Characterized Hit- tr|I1L4J1|I1L4J1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.38716 PE,82.46,0,CRC,CRC
domain; CXC,CRC domain; TESMIN/TSO1-RELATED,NULL;
seg,NULL,CUFF.52210.1
(551 letters)
Database: Medicago_aa4.0v1
62,319 sequences; 21,947,249 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Medtr8g103320.1 | tesmin/TSO1-like CXC domain protein | HC | chr... 637 0.0
Medtr6g087590.1 | tesmin/TSO1-like CXC domain protein | HC | chr... 632 0.0
Medtr3g110122.1 | tesmin/TSO1-like CXC domain protein | HC | chr... 282 8e-76
Medtr5g006530.1 | tesmin/TSO1-like CXC domain protein | HC | chr... 100 6e-21
Medtr5g006530.2 | tesmin/TSO1-like CXC domain protein | HC | chr... 99 1e-20
Medtr1g012020.1 | cysteine-rich polycomb-like protein | HC | chr... 96 7e-20
Medtr1g103230.1 | tesmin/TSO1-like CXC domain protein | LC | chr... 54 3e-07
Medtr1g103180.1 | tesmin/TSO1-like CXC domain protein | LC | chr... 50 8e-06
>Medtr8g103320.1 | tesmin/TSO1-like CXC domain protein | HC |
chr8:43471929-43476739 | 20130731
Length = 478
Score = 637 bits (1642), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/486 (69%), Positives = 361/486 (74%), Gaps = 17/486 (3%)
Query: 1 MGEGEGGGDCPPNNASLEGVLLPSKKLARQLDFTAFGGXXXXXXXXXXXXXXXX-----X 55
MGEGEG D PP NA L+ V KKLARQLDF AFGG
Sbjct: 1 MGEGEGS-DIPPKNAPLDSV----KKLARQLDFNAFGGTPVTAPLPEHPQPSPTLPPLPA 55
Query: 56 XXXXKPESPKSKSRTNFEIKDTTTPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXX 115
KPESPKSKSR NFE KD T PKKQKQCNCKHS+CLKLYCECFASGI
Sbjct: 56 TKLGKPESPKSKSRPNFETKDAT-PKKQKQCNCKHSRCLKLYCECFASGIYCDGCNCVNC 114
Query: 116 XXXXXXEAARREAVEATLERNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKK 175
EAARREAVEATLERNPNAFRPKIASSPHGTRD +EE GE+ +L KHNKGCHCKK
Sbjct: 115 FNNVDNEAARREAVEATLERNPNAFRPKIASSPHGTRDNKEETGEVKVLVKHNKGCHCKK 174
Query: 176 SGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFIGDXXXXXXXXXXXXXXXXXX 235
SGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALF GD
Sbjct: 175 SGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQALFHGDQNNNMAYLQQAANAAI-- 232
Query: 236 TGAIGSSGFXXXXXXXXXXXXELFFGPTMKDPSVGRLGQQANHVRA-HAPSSSMSPIPGA 294
TGAIGSSG+ EL+ P+++DPS G+LGQQAN VR APSSS+SP+P
Sbjct: 233 TGAIGSSGYSSPPVARKRKGSELW--PSIRDPSFGKLGQQANPVRGPAAPSSSLSPVPVP 290
Query: 295 RVGP-TSGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKNLMDKHAEDQT 353
RVGP T GPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQA KTLTDQKNL+DKH +DQT
Sbjct: 291 RVGPSTLGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQATKTLTDQKNLIDKHTDDQT 350
Query: 354 ETSLASSTQEQLPSQKDVDVEKAMADDRSSANQADKISPGNSSSEEADVPKGRPMSPGTL 413
ETSLASS QEQLP+QK+ DVEKA+ADD SSANQ DK SP NSSS+ ADVPKGRPMSPGTL
Sbjct: 351 ETSLASSNQEQLPNQKEADVEKAVADDCSSANQTDKTSPENSSSDGADVPKGRPMSPGTL 410
Query: 414 ALMCDEQDPMFMTASSPIGTMARQCNPSSQSPYGQGMTENHAEQERIVSTKFRDFLNRVI 473
ALMCDEQD MFMTA+ +MA CN SSQ PYGQG E +AEQERIV TKFRDFLNRVI
Sbjct: 411 ALMCDEQDTMFMTAAPSTVSMAHACNTSSQLPYGQGAKEIYAEQERIVLTKFRDFLNRVI 470
Query: 474 TMGEIN 479
TMGEIN
Sbjct: 471 TMGEIN 476
>Medtr6g087590.1 | tesmin/TSO1-like CXC domain protein | HC |
chr6:32942677-32948999 | 20130731
Length = 607
Score = 632 bits (1631), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/612 (60%), Positives = 399/612 (65%), Gaps = 68/612 (11%)
Query: 1 MGEGEGGGDCPPNNA----------------------SLEGVLLPSKKLARQLDFTAF-- 36
M EGEGG DCPP N + + +PSKKLARQLDF A
Sbjct: 1 MAEGEGG-DCPPKNVVHPEFVTVTAAAATAAAAANATATAWLDVPSKKLARQLDFNAMLM 59
Query: 37 ----------------------GGXXXXXXXXXXXXXXXXXXXXXKPESPKSKSRTNFEI 74
G K ESPK +SR NFE+
Sbjct: 60 EQSKPQQQVVTQGSVMVQKPVGVGGLPMPVPAQVQTLQHSSVRVGKQESPKPRSRPNFEV 119
Query: 75 KDTTTPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXXXXEAARREAVEATLE 134
K+ T PKKQ+QCNCKHSKCLKLYCECFASGI EAARREAVEATLE
Sbjct: 120 KEGT-PKKQRQCNCKHSKCLKLYCECFASGIYCDGCNCVNCFNNVDNEAARREAVEATLE 178
Query: 135 RNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSE 194
RNPNAFRPKIASSP G RD REEAGE LIL KH+KGCHCKKSGCLKKYCECFQAN+LCSE
Sbjct: 179 RNPNAFRPKIASSPQGARDSREEAGEGLILIKHHKGCHCKKSGCLKKYCECFQANVLCSE 238
Query: 195 NCKCMDCKNFEGSEERQALFIGDXXXXXXXXXXXXXXXXXXTGAIGSSGFXXXXXXXXXX 254
NC+CMDCKNFEGSEERQALF GD TGAIGS GF
Sbjct: 239 NCRCMDCKNFEGSEERQALFRGD---QNNNVYLQQAANAAITGAIGSYGFSSPPASKKRK 295
Query: 255 XXELFFGPTMKDPSVGRLGQQANHVRAHAPSSSMSPIPGAR-VGPTSG--PSKFMYRSLL 311
ELF PT KDPS+ + GQQ N V+ APSSS SP+ AR PT G PSK YRSLL
Sbjct: 296 GQELFLWPTAKDPSISKPGQQVNLVKGPAPSSSASPVSSARGTNPTLGQSPSKLKYRSLL 355
Query: 312 ADIIQPQHLKELCSVLVLVSGQAAKTLTDQKNLMDKHAEDQTETSLASSTQEQLPSQKDV 371
+D++QP HLKELCSVLVLVSGQAAKTL DQK ++K EDQTETSLASSTQEQL SQK+V
Sbjct: 356 SDVVQPHHLKELCSVLVLVSGQAAKTLADQKKTVEKRTEDQTETSLASSTQEQLLSQKEV 415
Query: 372 DVEKAMADDRSSANQADKISPGNSSSEEADVPKGRPMSPGTLALMCDEQDPMFMTASSPI 431
DVEKAM DD SSANQ DKISPGNS S+ ADVPK RPMSPGTLALMCDEQD MFMTA+SPI
Sbjct: 416 DVEKAMDDDCSSANQTDKISPGNSCSDGADVPK-RPMSPGTLALMCDEQDSMFMTAASPI 474
Query: 432 GTMARQCNPSSQSPYGQGMTENHAEQERIVSTKFRDFLNRVITMGEINETKCSSLARSEL 491
G CN SSQ P GQG+TE +AEQERIV T+FRDFLNRVITMGEINETKCSSLARSEL
Sbjct: 475 GQTTHACNTSSQFPDGQGVTEVYAEQERIVLTQFRDFLNRVITMGEINETKCSSLARSEL 534
Query: 492 ESQKDPIINGIANASTERTQQQGATSNGVAKAI-------------GNSITSTSLVPRSP 538
E++KD I N NASTE QQ ATSNG AKA + TST +VP
Sbjct: 535 ENKKDLINNETGNASTETVHQQEATSNGDAKAAIPPMAATSTPAVPPMATTSTPVVPSDT 594
Query: 539 VPENGESKPKVE 550
V ENGESK K+E
Sbjct: 595 VAENGESKLKME 606
>Medtr3g110122.1 | tesmin/TSO1-like CXC domain protein | HC |
chr3:51054541-51048745 | 20130731
Length = 596
Score = 282 bits (721), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 189/483 (39%), Positives = 254/483 (52%), Gaps = 31/483 (6%)
Query: 63 SPKSKSRTNFEIKDTTTPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXXXXE 122
SP+S+S+ +KD T KKQK+CNCK+SKCLKLYCEC+A+GI E
Sbjct: 128 SPRSQSQNKAGLKDNTL-KKQKRCNCKNSKCLKLYCECYAAGIYCDGCNCQNCHNNLNNE 186
Query: 123 AARREAVEATLERNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKY 182
AAR+EA+ TLE+NPNAFRPKIASSP EE EI ++G+HNKGCHCKK GCLKKY
Sbjct: 187 AARKEAIGMTLEKNPNAFRPKIASSPQKPEVSMEEVSEIQLIGRHNKGCHCKK-GCLKKY 245
Query: 183 CECFQANILCSENCKCMDCKNFEGSEERQALFIGDXXXXXXXXXXXXXXXXXXTGAI--G 240
CECF AN+LCSENCKC+DCKNFEGS+ + + GA+ G
Sbjct: 246 CECFHANVLCSENCKCIDCKNFEGSDVWRIVL----QEECSLVQIRQATNAAINGAVGFG 301
Query: 241 SSGFXXXXXXXXXXXXELFFGPTMKDPSVGRLGQQANHVRAHAPSSSMSPIPGARVGPTS 300
S E F G ++ D V Q + + S+ V +
Sbjct: 302 PSISGTHITPKKRKIQESFSGKSLTDQPVSMTAQHQRVLLCIIDTVSLDLFNNHLVTLSV 361
Query: 301 GPSKFMY-RSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQK-NLMDKHAEDQTETSLA 358
S F + S+LAD++Q Q++K LCS+LV++S +AAKT + + K ++ E S+A
Sbjct: 362 VGSPFKFTMSVLADVLQTQNVKNLCSLLVVLSKEAAKTNAEMRGKAARKIKTEKYEASIA 421
Query: 359 SSTQEQLPSQKDVDVEKAMADDRSSANQADKISPGNSSSEEADVPKGRPMSPGTLALMCD 418
SS+Q S+ V R+S N A+K + + + D+ RP+SP TL LMCD
Sbjct: 422 SSSQLLQDSRNVV---------RASENHANK---DVADAVDIDI-HNRPLSPETLKLMCD 468
Query: 419 EQDPMFMTASSPIGTMARQC--NPSSQSPYGQGMTENHAEQERIVSTKFRDFLNRVITMG 476
E D MF S G N +S G T +AEQER++ TKFRD L +I +G
Sbjct: 469 ELDEMFFGNGSANGVAIDNAYQNMIQKSSNSDGYTAVYAEQERLILTKFRDVLGELIILG 528
Query: 477 EINETKCSSLARSELESQKDPIINGIANASTERTQQQGATSN------GVAKAIGNSITS 530
I ET SS + ++ +K P NG + A T+ T+N A+ G+ T
Sbjct: 529 SIKETMHSSSVKKDVSIEKTPKNNGDSGAETKGILLNNCTANCSIPVATYARTNGHDTTD 588
Query: 531 TSL 533
SL
Sbjct: 589 LSL 591
>Medtr5g006530.1 | tesmin/TSO1-like CXC domain protein | HC |
chr5:960319-955306 | 20130731
Length = 778
Score = 99.8 bits (247), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 72/133 (54%), Gaps = 7/133 (5%)
Query: 84 KQCNCKHSKCLKLYCECFASGIXXXX----XXXXXXXXXXXXEAARREAVEATLERNPNA 139
K+CNCK SKCLKLYCECFA+G+ R+ +E+ RNP A
Sbjct: 488 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKPIHEDTVLQTRKQIES---RNPLA 544
Query: 140 FRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCM 199
F PK+ S + + + +H +GC+CKKS CLKKYCEC+Q + CS +C+C
Sbjct: 545 FAPKVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSISCRCE 604
Query: 200 DCKNFEGSEERQA 212
CKN G ++ A
Sbjct: 605 GCKNAFGRKDGSA 617
>Medtr5g006530.2 | tesmin/TSO1-like CXC domain protein | HC |
chr5:959632-955396 | 20130731
Length = 554
Score = 99.0 bits (245), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 69/130 (53%), Gaps = 1/130 (0%)
Query: 84 KQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXXXXEAARREAVEATLE-RNPNAFRP 142
K+CNCK SKCLKLYCECFA+G+ +E RNP AF P
Sbjct: 264 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKPIHEDTVLQTRKQIESRNPLAFAP 323
Query: 143 KIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 202
K+ S + + + +H +GC+CKKS CLKKYCEC+Q + CS +C+C CK
Sbjct: 324 KVIRSADSVPETGIDPNKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSISCRCEGCK 383
Query: 203 NFEGSEERQA 212
N G ++ A
Sbjct: 384 NAFGRKDGSA 393
>Medtr1g012020.1 | cysteine-rich polycomb-like protein | HC |
chr1:2319368-2312974 | 20130731
Length = 867
Score = 96.3 bits (238), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/132 (40%), Positives = 66/132 (50%), Gaps = 9/132 (6%)
Query: 84 KQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXXXXEAARREAVEATLE-RNPNAFRP 142
K CNCK SKCLKLYC+CF +GI + + +E RNP AF P
Sbjct: 498 KTCNCKKSKCLKLYCDCFGAGIFCGDGCACEGCGNRVEFQDKVVETKQQIESRNPQAFAP 557
Query: 143 KIAS-----SPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCK 197
KI P+ D+ +H +GC+CK+S C KKYCECFQAN+ CS C+
Sbjct: 558 KIVPCAADVPPNNMEDVNMTTP---ASARHKRGCNCKRSKCTKKYCECFQANVGCSTGCR 614
Query: 198 CMDCKNFEGSEE 209
C C N G E
Sbjct: 615 CDGCMNAFGKRE 626
>Medtr1g103230.1 | tesmin/TSO1-like CXC domain protein | LC |
chr1:46705795-46702340 | 20130731
Length = 624
Score = 54.3 bits (129), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 32/118 (27%), Positives = 52/118 (44%), Gaps = 1/118 (0%)
Query: 96 LYCECFASGIXXXXXXXXXXXXXXXXEAARREAVEATLE-RNPNAFRPKIASSPHGTRDI 154
L C CFA+G+ + +E RNP F PK+ ++ + I
Sbjct: 450 LNCGCFAAGVYCIGPCSCQDCLNKAINEDKVLQAHRMIEYRNPPVFVPKVITNSDSSPQI 509
Query: 155 REEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEERQA 212
+++ + + C +KS C K CECF+ + CS +CKC CKN ++ +A
Sbjct: 510 VDDSDKAPASNRRRIQCKSRKSSCTNKRCECFKGGVGCSPSCKCQGCKNIYDRKDSEA 567
>Medtr1g103180.1 | tesmin/TSO1-like CXC domain protein | LC |
chr1:46682000-46677103 | 20130731
Length = 366
Score = 49.7 bits (117), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 20/35 (57%), Positives = 22/35 (62%)
Query: 169 KGCHCKKSGCLKKYCECFQANILCSENCKCMDCKN 203
K CHCKK CLK CECF A + C+ C C DC N
Sbjct: 173 KNCHCKKLECLKLCCECFAARVYCTGTCSCEDCLN 207