Miyakogusa Predicted Gene
- Lj1g3v4691730.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4691730.1 Non Characterized Hit- tr|I1MCW3|I1MCW3_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.19572
PE,23.87,9e-18,seg,NULL; coiled-coil,NULL,CUFF.32878.1
(424 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score   E
Sequences producing significant alignments:                         (bits) Value
AT2G36650.1 | Symbols: | unknown protein; Has 35333 Blast hits ...     124 1e-28
AT3G25690.2 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein ...     73 4e-13
AT3G25690.1 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein ...     73 4e-13
AT1G52080.1 | Symbols: AR791 | actin binding protein family | ch...     68 1e-11
AT3G25690.3 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein ...     54 3e-07
>AT2G36650.1 | Symbols: | unknown protein; Has 35333 Blast hits to
34131 proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr2:15359994-15361194 FORWARD LENGTH=373
Length = 373
Score = 124 bits (311), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 137/231 (59%), Gaps = 15/231 (6%)
Query: 93  QEISGLRSQIEGMKIRELALRFQFDQYCDLKEQQSLLGEMKNMMSLETARVDLLDREISS 152
           QEI  L+S+ E ++ +E  +   F+++C+LK+Q+ +L E K+++SLE A++D   +E+ +
Sbjct: 76  QEILSLKSRFEELQRKEYEMELHFERFCNLKDQEVMLIEHKSILSLEKAQLDFFRKEVLA 135

Query: 153 METENKRLENFAVQYLRVVEQIEYWKSENXXXXXXXXXXXXXXXALTRLTKEQALKIKEE 212
           ME E+KR +   + YL++V +I+  +SEN                L R+  E + KI
Sbjct: 136 MEEEHKRGQALVIVYLKLVGEIQELRSENGLLEGKAKKLRRKSKQLYRVVNE-SRKIIGV 194

Query: 213 EAEISRNRDALGTKMNVIDKLEDEMRELQRVLDLLQDEKNELQKKLDIAEKSYHESKAFH 272
           E E  +  D L TK N++ +LE ++++++  +D+LQ+EK EL  K   +  S  E
Sbjct: 195 EKEFLKCVDELETKNNIVKELEGKVKDMEAYVDVLQEEKEELFMK---SSNSTSEM---- 247

Query: 273 LQTEAGDVSREEYKQLLDELERTKKERTDEANELIHLRWTNACLRHDLMRH 323
                  VS E+Y+++++E E  KK+  +   E+I+LRW+NACLRH++MR+
Sbjct: 248 -------VSVEDYRRIVEEYEELKKDYANGVKEVINLRWSNACLRHEVMRN 291
>AT3G25690.2 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein
family protein | chr3:9354061-9357757 FORWARD
LENGTH=1004
Length = 1004
Score = 72.8 bits (177), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 123/242 (50%), Gaps = 18/242 (7%)
Query: 94  EISGLRSQIEGMKIRELALRFQFDQYCDLKEQQSLLGEMKNMMSLETARVDLLDREISSM 153
           E+  L+  ++ ++ RE+ L  +  +Y  LKEQ+S + E++  + ++T  +D+L+  I+S+
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 154 ETENKRLENFAVQYLRVVEQIEYWKSENXXXXXXXXXXXXXXXALTRLTKE--QALKIKE 211
           + E K+L+    Q   V +++E  +++                    L K+   +L++KE
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 212 EEAEISRNRDA-LGTKMNVIDKLEDEMRELQRVLDLLQDEKNELQKKLDIAEKSYHESKA 270
           EEA    N+D  +  K+  +  LE ++ EL+R    LQ EK EL  KLD AE
Sbjct: 250 EEA---MNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIA---T 303

Query: 271 FHLQTEAGDVS--REEYKQ-------LLDELERTKKERTDEANELIHLRWTNACLRHDLM 321
               TE+  V+  REE          LL ++E  +  R  E  EL++LRW NACLR++L
Sbjct: 304 LSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELR 363

Query: 322 RH 323
            +
Sbjct: 364 NY 365
>AT3G25690.1 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein
family protein | chr3:9354061-9357757 FORWARD
LENGTH=1004
Length = 1004
Score = 72.8 bits (177), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 72/242 (29%), Positives = 123/242 (50%), Gaps = 18/242 (7%)
Query: 94  EISGLRSQIEGMKIRELALRFQFDQYCDLKEQQSLLGEMKNMMSLETARVDLLDREISSM 153
           E+  L+  ++ ++ RE+ L  +  +Y  LKEQ+S + E++  + ++T  +D+L+  I+S+
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 154 ETENKRLENFAVQYLRVVEQIEYWKSENXXXXXXXXXXXXXXXALTRLTKE--QALKIKE 211
           + E K+L+    Q   V +++E  +++                    L K+   +L++KE
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 212 EEAEISRNRDA-LGTKMNVIDKLEDEMRELQRVLDLLQDEKNELQKKLDIAEKSYHESKA 270
           EEA    N+D  +  K+  +  LE ++ EL+R    LQ EK EL  KLD AE
Sbjct: 250 EEA---MNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIA---T 303

Query: 271 FHLQTEAGDVS--REEYKQ-------LLDELERTKKERTDEANELIHLRWTNACLRHDLM 321
               TE+  V+  REE          LL ++E  +  R  E  EL++LRW NACLR++L
Sbjct: 304 LSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELR 363

Query: 322 RH 323
            +
Sbjct: 364 NY 365
>AT1G52080.1 | Symbols: AR791 | actin binding protein family |
chr1:19369788-19371862 FORWARD LENGTH=573
Length = 573
Score = 68.2 bits (165), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 113/233 (48%), Gaps = 4/233 (1%)
Query: 92  EQEISGLRSQIEGMKIRELALRFQFDQYCDLKEQQSLLGEMKNMMSLETARVDLLDREIS 151
           E EI+ LR+ +  ++ RE  L  +  +Y  LKEQQ +  E+++ + L      + + +I
Sbjct: 138 ENEINRLRNTVRALRERERCLEDKLLEYYSLKEQQKIAMELRSRLKLNQMETKVFNFKIK 197

Query: 152 SMETENKRLENFAVQYLRVVEQIEYWKSENXXXXXXXXXXXXXXXALTRLTKEQALKIKE 211
            ++ EN++L+    ++ +V+ +++  KS+                A     K++  +++E
Sbjct: 198 KLQAENEKLKAECFEHSKVLLELDMAKSQVQVLKKKLNINTQQHVAQILSLKQRVARLQE 257

Query: 212 EEAEISRNRDALGTKMNVIDKLEDEMRELQRVLDLLQDEKNELQKKLD----IAEKSYHE 267
           EE +           M  +  LE E+ EL      LQ E  EL +KL+    IA     E
Sbjct: 258 EEIKAVLPDLEADKMMQRLRDLESEINELTDTNTRLQFENFELSEKLESVQIIANSKLEE 317

Query: 268 SKAFHLQTEAGDVSREEYKQLLDELERTKKERTDEANELIHLRWTNACLRHDL 320
            +      E  +  R E ++L  ++E+ + +R  +  +L++LRW NACLR++L
Sbjct: 318 PEEIETLREDCNRLRSENEELKKDVEQLQGDRCTDLEQLVYLRWINACLRYEL 370
>AT3G25690.3 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein
family protein | chr3:9354061-9357757 FORWARD LENGTH=863
Length = 863
Score = 53.5 bits (127), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 67/128 (52%), Gaps = 16/128 (12%)
Query: 206 ALKIKEEEAEISRNRDA-LGTKMNVIDKLEDEMRELQRVLDLLQDEKNELQKKLDIAEKS 264
           +L++KEEEA    N+D  +  K+  +  LE ++ EL+R    LQ EK EL  KLD AE
Sbjct: 103 SLQMKEEEA---MNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEAR 159

Query: 265 YHESKAFHLQTEAGDVS--REEYKQL-------LDELERTKKERTDEANELIHLRWTNAC 315
                     TE+  V+  REE   L       L ++E  +  R  E  EL++LRW NAC
Sbjct: 160 IA---TLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNAC 216

Query: 316 LRHDLMRH 323
           LR++L  +
Sbjct: 217 LRYELRNY 224