Miyakogusa Predicted Gene
- Lj4g3v3061530.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj4g3v3061530.1 Non Characterized Hit- tr|I1L4J1|I1L4J1_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.38716 PE,82.46,0,CRC,CRC
domain; CXC,CRC domain; TESMIN/TSO1-RELATED,NULL;
seg,NULL,CUFF.52210.1
(551 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
AT4G29000.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing...      447  e-126
AT2G20110.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing...      395  e-110
AT2G20110.2 | Symbols: | Tesmin/TSO1-like CXC domain-containing...      388  e-108
AT5G25790.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing...      246  4e-65
AT3G16160.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing...      115  1e-25
AT3G22780.1 | Symbols: TSO1, ATTSO1 | Tesmin/TSO1-like CXC domai...     108  1e-23
AT3G22760.1 | Symbols: SOL1 | Tesmin/TSO1-like CXC domain-contai...     107  2e-23
AT4G14770.1 | Symbols: TCX2, ATTCX2 | TESMIN/TSO1-like CXC 2 | c...     107  2e-23
AT3G04850.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing...      103  2e-22
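The hit summary above can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output; the names HIT_LINE, parse_evalue and parse_hit_line are hypothetical helpers introduced here for illustration. It assumes each summary line starts with the subject identifier followed by " | " and ends with the bit score and the E value. It also assumes that an E value printed without a mantissa, such as "e-126", means 1e-126.

```python
import re

# Subject ID, then " | ", free-text description, bit score, E value.
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+\|.*\s(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def parse_evalue(text):
    """Convert a printed E value to float; 'e-126' is read as 1e-126 (assumption)."""
    return float("1" + text if text.startswith("e") else text)


def parse_hit_line(line):
    """Return (subject_id, bit_score, e_value), or None for non-hit lines."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    return (
        match.group("id"),
        float(match.group("bits")),
        parse_evalue(match.group("evalue")),
    )


example = "AT5G25790.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing... 246 4e-65"
print(parse_hit_line(example))
```

Lines that do not match the pattern (headers, alignment rows) yield None, so the function can be applied to every line of the report and the non-None results kept.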
>AT4G29000.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing
protein | chr4:14293957-14296602 FORWARD LENGTH=603
Length = 603
Score = 447 bits (1151), Expect = e-126, Method: Compositional matrix adjust.
Identities = 253/468 (54%), Positives = 303/468 (64%), Gaps = 23/468 (4%)
Query: 61 PESPKSKS-RTNFEIKDTTTPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXX 119
PESPK++ R N E +D T P+K+KQCNCKHS+CLKLYCECFASG
Sbjct: 109 PESPKARGPRPNVEGRDGT-PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNV 167
Query: 120 XXEAARREAVEATLERNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCL 179
E ARREAVEATLERNP AFRPKIASSPHG RD RE+ GE+++LGKHNKGCHCKKSGCL
Sbjct: 168 DNEPARREAVEATLERNPFAFRPKIASSPHGGRDKREDIGEVVLLGKHNKGCHCKKSGCL 227
Query: 180 KKYCECFQANILCSENCKCMDCKNFEGSEERQALFIGDXXXXXXXXXXXXXXXXXXTGAI 239
KKYCECFQANILCSENCKC+DCKNFEGSEERQALF G+ TGA+
Sbjct: 228 KKYCECFQANILCSENCKCLDCKNFEGSEERQALFHGE--HSNHMAYLQQAANAAITGAV 285
Query: 240 GSSGFXXXXXXXXXXXXELFFGPTMKDPSVGRLGQQANHVRAHAPSSSMSPIPGARVG-- 297
GSSGF E+ F +KD S Q N+ R P+S SP P +R G
Sbjct: 286 GSSGFAPSPAPKRRKGQEILFNQAIKDSSRLSHFPQVNNGRTGGPTSGTSPSPVSRAGGN 345
Query: 298 PTSGPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKNLMDKHAEDQTETSL 357
+S PSKF+YRSLLADIIQP ++ LCSVLV V+G+AAKT TD++N ++ +DQTETSL
Sbjct: 346 ASSVPSKFVYRSLLADIIQPHDVRALCSVLVTVAGEAAKTSTDKRNEIENRVDDQTETSL 405
Query: 358 ASSTQEQLPSQKD-VDVEKAMADDRSSANQADKISPGNSSSEEADVPKGRPMSPGTLALM 416
ASS Q+Q + DVE D NQADK P S+S+ D K P+SP TLALM
Sbjct: 406 ASSAQDQPQGNNNAADVEMVATDH----NQADKSGPEESNSDGVDASKVTPLSPATLALM 461
Query: 417 CDEQDPMFMTAS-SPIGTMARQCNPSSQSPYGQGMTENHAEQERIVSTKFRDFLNRVITM 475
CDEQD +FM A+ SP G++ +P+ P QG +E +AEQER+V TKFRD LNR+I+
Sbjct: 462 CDEQDTIFMVAAPSPNGSV----DPNGCRPNSQGQSEIYAEQERLVLTKFRDCLNRLISY 517
Query: 476 GEINETKCSSLARSELESQKDPIINGIANASTER-TQQQGATSNGVAK 522
EI E+KC SLAR ++ IA TE QQQ NG ++
Sbjct: 518 AEIKESKCLSLARMHIQPP------AIATVKTENGIQQQVPIVNGASR 559
>AT2G20110.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing
protein | chr2:8684496-8686870 FORWARD LENGTH=571
Length = 571
Score = 395 bits (1016), Expect = e-110, Method: Compositional matrix adjust.
Identities = 232/487 (47%), Positives = 297/487 (60%), Gaps = 58/487 (11%)
Query: 60 KPESPKSKSRTNFEIKDTTTPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXX 119
+PESP S R E +D T P+K+KQCNCKHS+CLKLYCECFASG
Sbjct: 96 RPESPNSMPRPAGETRDGT-PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNV 154
Query: 120 XXEAARREAVEATLERNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCL 179
E ARR+AVE+TLERNPNAFRPKIA+SPHG RD REE G++++L +HNKGCHCKKSGCL
Sbjct: 155 ENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCL 214
Query: 180 KKYCECFQANILCSENCKCMDCKNFEGSEERQALFIGDXXXXXXXXXXXXXXXXXXTGAI 239
KKYCECFQANILCSENCKC+DCKNFEGSE RQ+LF G+ TGAI
Sbjct: 215 KKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGE---HSHNLAYLQHANAAITGAI 271
Query: 240 GSSGFXXXXXXXXXXXXELFFGPTMKDPSVGRLGQQANHVRAHAPSSSMSPIPGARVGPT 299
GSSGF E+FF KD S RLG QAN+ R + S G+R G
Sbjct: 272 GSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-QANNGR------TTSSQTGSRAGGN 324
Query: 300 S--GPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKNLMDKHAEDQTETSL 357
+ GPSK +Y+SLLA+II+P +K LCSVLV V+G+AAKTLT +K +Q ETS+
Sbjct: 325 ASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT------EKRLANQKETSV 378
Query: 358 ASSTQEQLPSQKDVDVEKAMADDRSSANQADKISPGNSSSEEADVPKGRPMSPGTLALMC 417
ASS Q+Q + EK+ +D + AD KGR +SP TLALMC
Sbjct: 379 ASSVQDQ--GHVNNKAEKSGLEDSN-----------------ADGSKGRSLSPETLALMC 419
Query: 418 DEQDPMFMTASSPIGTMARQCNPSSQSPYGQGMTENHAEQERIVSTKFRDFLNRVITMGE 477
DE+D M M A+SP ++ P+SQ P GQ +AEQE++V TKFRD LNR+I+ GE
Sbjct: 420 DERDTMLMVAASPNCSV----EPTSQLPNGQDQV--YAEQEKVVLTKFRDCLNRIISCGE 473
Query: 478 INETKCSSLARSELES------------QKDPIINGIANASTERTQQQGATSNGVAKAIG 525
+ E+ C S++R +L++ Q+ P+ NG++ + + +Q T N +
Sbjct: 474 VKESNC-SMSRMDLDTPVQTTVRIDPVVQQAPVANGVSQTAKQPSQLNTTTPN-TSSQTA 531
Query: 526 NSITSTS 532
N ++ T+
Sbjct: 532 NGVSQTA 538
>AT2G20110.2 | Symbols: | Tesmin/TSO1-like CXC domain-containing
protein | chr2:8684496-8686870 FORWARD LENGTH=578
Length = 578
Score = 388 bits (997), Expect = e-108, Method: Compositional matrix adjust.
Identities = 232/494 (46%), Positives = 297/494 (60%), Gaps = 65/494 (13%)
Query: 60 KPESPKSKSRTNFEIKDTTTPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXX 119
+PESP S R E +D T P+K+KQCNCKHS+CLKLYCECFASG
Sbjct: 96 RPESPNSMPRPAGETRDGT-PQKKKQCNCKHSRCLKLYCECFASGTYCDGCNCVNCFNNV 154
Query: 120 XXEAARREAVEATLERNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCL 179
E ARR+AVE+TLERNPNAFRPKIA+SPHG RD REE G++++L +HNKGCHCKKSGCL
Sbjct: 155 ENEPARRQAVESTLERNPNAFRPKIAASPHGGRDNREEVGDVVMLARHNKGCHCKKSGCL 214
Query: 180 KKYCECFQANILCSENCKCMDCKNFEGSEERQALFIGDXXXXXXXXXXXXXXXXXXTGAI 239
KKYCECFQANILCSENCKC+DCKNFEGSE RQ+LF G+ TGAI
Sbjct: 215 KKYCECFQANILCSENCKCLDCKNFEGSEVRQSLFHGE---HSHNLAYLQHANAAITGAI 271
Query: 240 GSSGFXXXXXXXXXXXXELFFGPTMKDPSVGRLGQQANHVRAHAPSSSMSPIPGARVGPT 299
GSSGF E+FF KD S RLG QAN+ R + S G+R G
Sbjct: 272 GSSGFASAPPPKRRKGQEIFFNQGTKDSSTHRLG-QANNGR------TTSSQTGSRAGGN 324
Query: 300 S--GPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKNLMDKHAEDQTETSL 357
+ GPSK +Y+SLLA+II+P +K LCSVLV V+G+AAKTLT +K +Q ETS+
Sbjct: 325 ASLGPSKVVYKSLLANIIKPMDVKALCSVLVAVAGEAAKTLT------EKRLANQKETSV 378
Query: 358 ASSTQEQLPSQKDVDVEKAMADDRSSANQADKISPGNSSSEEADVPKGRPMSPGTLALMC 417
ASS Q+Q + EK+ +D + AD KGR +SP TLALMC
Sbjct: 379 ASSVQDQ--GHVNNKAEKSGLEDSN-----------------ADGSKGRSLSPETLALMC 419
Query: 418 DEQDPMFMTASSPIGTMARQCNPSSQSPYGQGMTENHAEQERIVSTKFRDFLNRVITMGE 477
DE+D M M A+SP ++ P+SQ P GQ +AEQE++V TKFRD LNR+I+ GE
Sbjct: 420 DERDTMLMVAASPNCSV----EPTSQLPNGQDQV--YAEQEKVVLTKFRDCLNRIISCGE 473
Query: 478 IN-------ETKCSSLARSELES------------QKDPIINGIANASTERTQQQGATSN 518
+ E+ C S++R +L++ Q+ P+ NG++ + + +Q T N
Sbjct: 474 VKVFCLFVAESNC-SMSRMDLDTPVQTTVRIDPVVQQAPVANGVSQTAKQPSQLNTTTPN 532
Query: 519 GVAKAIGNSITSTS 532
+ N ++ T+
Sbjct: 533 -TSSQTANGVSQTA 545
>AT5G25790.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing
protein | chr5:8977233-8979181 REVERSE LENGTH=459
Length = 459
Score = 246 bits (627), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 154/419 (36%), Positives = 212/419 (50%), Gaps = 43/419 (10%)
Query: 62 ESPKSKSRTN-FEIKDTTTPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXXX 120
+ P + R N E K+ T K+QK CNCK+SKCLKLYCECFASG
Sbjct: 72 QDPSTSRRHNEVESKENTPNKQQKHCNCKNSKCLKLYCECFASGSYCNGCNCVNCHNKLE 131
Query: 121 XEAARREAVEATLERNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLK 180
E++R+ A+ LERNP+AF+PKIA SPHG +D++E ++L++GKH+KGCHC+KSGCLK
Sbjct: 132 NESSRQVAISGILERNPDAFKPKIAGSPHGMKDLQENVQQVLLIGKHSKGCHCRKSGCLK 191
Query: 181 KYCECFQANILCSENCKCMDCKNFEGSEERQALFIGDXXXXXXXXXXXXXXXXXXTGAIG 240
KYCEC+QANILCSENC+C DCKNFEGSEER+AL G AI
Sbjct: 192 KYCECYQANILCSENCRCQDCKNFEGSEERKALLHG---SQVSDTYIQQMTNAAVNRAID 248
Query: 241 SSGFXXXXXXXXXXXXELFFGPTMKDPSVGRLGQQANHVRAHAPSSSMSPIPGARVGPTS 300
S + ++ S G Q+ANHVR + +S S +P + S
Sbjct: 249 MSAYLYPPESRKRKSKDISDSVV---SSYGVQYQRANHVRRNGENSLFS-LPNNKA--VS 302
Query: 301 GPSKFMYRSLLADIIQPQHLKELCSVLVLVSGQAAKTLTDQKNLMDKHAEDQTETSLASS 360
G + YRS ++ QP H++ELCS+LV S A L+D+ DK
Sbjct: 303 GSTTSAYRSSWSNTFQPHHVRELCSLLVSNSVDVANKLSDKGRKNDKG------------ 350
Query: 361 TQEQLPSQKDVDVEKAMADDRSSANQADKISPGNSSSEEADVPKGRPMSPGTLALMCDEQ 420
PS D + A++I+ +A+ +P+SP T ALMCDE+
Sbjct: 351 -----PSSLD-----------GAQRVANEINDSPDCVLDANRMDEKPISPATRALMCDEE 394
Query: 421 DPMFMTASSPIGTMARQCNPSSQSPYGQGMTENHAEQERIVSTKFRDFLNRVITMGEIN 479
+ + Q + + G + EQER + + FRD+L ++ IN
Sbjct: 395 HEIGSEKETSARVKTSQEKEDTDTSSGI-----YLEQERQILSIFRDYLIQLSNRARIN 448
>AT3G16160.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing
protein | chr3:5473366-5475090 REVERSE LENGTH=368
Length = 368
Score = 115 bits (287), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 81/141 (57%), Gaps = 8/141 (5%)
Query: 68 SRTNFEIKDTT-----TPKKQKQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXXXXE 122
SR + E KD T T +K K C CK SKCLKLYC+CFASG+
Sbjct: 45 SREHSEAKDKTDEEGITSRKHKGCRCKQSKCLKLYCDCFASGVVCTDCDCVDCHNNSEKC 104
Query: 123 AARREAVEATLERNPNAFRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKY 182
AR A+ L RNPNAF K S D + +A G ++GC CK++ CLKKY
Sbjct: 105 DAREAAMVNVLGRNPNAFSEKALGS---LTDNQCKAAPDTKPGLLSRGCKCKRTRCLKKY 161
Query: 183 CECFQANILCSENCKCMDCKN 203
CECFQAN+LCS+NCKC++CKN
Sbjct: 162 CECFQANLLCSDNCKCINCKN 182
>AT3G22780.1 | Symbols: TSO1, ATTSO1 | Tesmin/TSO1-like CXC
domain-containing protein | chr3:8048927-8052058 FORWARD
LENGTH=695
Length = 695
Score = 108 bits (269), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 72/133 (54%), Gaps = 1/133 (0%)
Query: 84 KQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXXXXEAARREAVEATLE-RNPNAFRP 142
K+CNCK SKCLKLYCECFA+G+ A +E RNP AF P
Sbjct: 401 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEETVLATRKQIESRNPLAFAP 460
Query: 143 KIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 202
K+ + + ++A + +H +GC+CKKS C+KKYCEC+Q + CS NC+C C
Sbjct: 461 KVIRNADSIMEASDDASKTPASARHKRGCNCKKSNCMKKYCECYQGGVGCSMNCRCEGCT 520
Query: 203 NFEGSEERQALFI 215
N G ++ L I
Sbjct: 521 NVFGRKDGSLLVI 533
Score = 58.9 bits (141), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 34/54 (62%), Gaps = 4/54 (7%)
Query: 156 EEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCKNFEGSEE 209
E+AGE G+ K C+CKKS CLK YCECF A + C E C C+DC N EE
Sbjct: 392 EQAGE----GESCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEE 441
>AT3G22760.1 | Symbols: SOL1 | Tesmin/TSO1-like CXC
domain-containing protein | chr3:8044622-8047381 FORWARD
LENGTH=609
Length = 609
Score = 107 bits (268), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/138 (42%), Positives = 78/138 (56%), Gaps = 8/138 (5%)
Query: 84 KQCNCKHSKCLKLYCECFASGIXXXX----XXXXXXXXXXXXEAARREAVEATLERNPNA 139
K+CNCK SKCLKLYCECFA+G A R+ +E+ RNP A
Sbjct: 328 KRCNCKKSKCLKLYCECFAAGFYCIEPCSCINCFNKPIHKDVVLATRKQIES---RNPLA 384
Query: 140 FRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCM 199
F PK+ + ++ E+A + +H +GC+CKKS CLKKYCEC+Q + CS NC+C
Sbjct: 385 FAPKVIRNSDSIIEVGEDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCE 444
Query: 200 DCKNFEGSEERQALFIGD 217
CKN G ++ +LF D
Sbjct: 445 GCKNAFGRKD-GSLFEQD 461
>AT4G14770.1 | Symbols: TCX2, ATTCX2 | TESMIN/TSO1-like CXC 2 |
chr4:8481522-8484825 REVERSE LENGTH=674
Length = 674
Score = 107 bits (268), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 75/130 (57%), Gaps = 7/130 (5%)
Query: 84 KQCNCKHSKCLKLYCECFASGIXXXXXXX----XXXXXXXXXEAARREAVEATLERNPNA 139
K+CNCK SKCLKLYCECFA+G+ A R+ +E+ RNP A
Sbjct: 375 KRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKPIHEDVVLATRKQIES---RNPLA 431
Query: 140 FRPKIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCM 199
F PK+ + ++ ++A + +H +GC+CKKS CLKKYCEC+Q + CS NC+C
Sbjct: 432 FAPKVIRNSDSVQETGDDASKTPASARHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCE 491
Query: 200 DCKNFEGSEE 209
CKN G ++
Sbjct: 492 GCKNAFGRKD 501
>AT3G04850.1 | Symbols: | Tesmin/TSO1-like CXC domain-containing
protein | chr3:1332269-1335851 REVERSE LENGTH=639
Length = 639
Score = 103 bits (258), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 52/129 (40%), Positives = 70/129 (54%), Gaps = 1/129 (0%)
Query: 84 KQCNCKHSKCLKLYCECFASGIXXXXXXXXXXXXXX-XXEAARREAVEATLERNPNAFRP 142
K+C C+ S+CLKLYCECF++G+ E ++ E RNP AF P
Sbjct: 453 KRCKCRKSQCLKLYCECFSAGLFCGEPCSCQNCFNKPIHEDLVMKSREVIKARNPLAFAP 512
Query: 143 KIASSPHGTRDIREEAGEILILGKHNKGCHCKKSGCLKKYCECFQANILCSENCKCMDCK 202
K+ S+ D+ E + +H +GC+C+KSGC KKYCECF + CS NC+CM CK
Sbjct: 513 KVVSTSDTVIDLWVENSKTPASARHKRGCNCRKSGCSKKYCECFMMGVGCSSNCRCMGCK 572
Query: 203 NFEGSEERQ 211
N G Q
Sbjct: 573 NTFGHTNEQ 581
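Each alignment above carries a line of the form "Identities = a/n (p%), Positives = b/n (q%), Gaps = c/n (r%)". The Python sketch below is an illustration only (STAT and parse_stats are hypothetical names, not part of BLAST). It extracts the counts and recomputes the percentages. For the lines checked here, the printed percentages agree with integer division (truncation) rather than rounding, e.g. 303/468 is printed as 64%; treat that as an observation about this report, not a documented guarantee.

```python
import re

# Matches e.g. "Identities = 253/468 (54%)"
STAT = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Map each statistic name to (count, alignment_length, printed_percent)."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT.findall(line)
    }


line = "Identities = 253/468 (54%), Positives = 303/468 (64%), Gaps = 23/468 (4%)"
stats = parse_stats(line)
for name, (count, length, printed) in stats.items():
    recomputed = count * 100 // length  # truncating integer percentage
    print(name, count, length, printed, recomputed)
```

Keeping the raw counts rather than the printed percentage avoids losing precision when hits are compared, since 64% here stands for roughly 64.7%.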