Miyakogusa Predicted Gene
- Lj3g3v2318370.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v2318370.1 CUFF.43885.1
(628 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G52080.1 | Symbols: AR791 | actin binding protein family | ch... 301 6e-82
AT3G25690.2 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein ... 218 1e-56
AT3G25690.1 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein ... 218 1e-56
AT3G25690.3 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein ... 157 3e-38
AT2G36650.1 | Symbols: | unknown protein; Has 35333 Blast hits ... 63 5e-10
>AT1G52080.1 | Symbols: AR791 | actin binding protein family |
chr1:19369788-19371862 FORWARD LENGTH=573
Length = 573
Score = 301 bits (772), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 161/326 (49%), Positives = 229/326 (70%), Gaps = 5/326 (1%)
Query: 102 LSPRTK-QSGEEDEFLLPEFNDVMKDAEFGVAGNS---FKKVGPPVAYASLEKDDYEQEI 157
+SPR + E+D FLLPEF + K + V + + P+A+ S E+ D+E EI
Sbjct: 82 VSPRRECDLDEKDVFLLPEFEEEAKKLDLLVCDDCETPRSDITAPLAFPSEEEADHENEI 141
Query: 158 WQLRNMIRMLQERERSLEVQLLEYCGLKEQETVVMELQNRLKISNMEAKMFNLKVETLQS 217
+LRN +R L+ERER LE +LLEY LKEQ+ + MEL++RLK++ ME K+FN K++ LQ+
Sbjct: 142 NRLRNTVRALRERERCLEDKLLEYYSLKEQQKIAMELRSRLKLNQMETKVFNFKIKKLQA 201
Query: 218 ENWRLAEQVADHAKVLAELDAAKTKVKFLKRKIRHEAEQNKEQIINLKQRVSKLQDLESQ 277
EN +L + +H+KVL ELD AK++V+ LK+K+ +Q+ QI++LKQRV++LQ+ E +
Sbjct: 202 ENEKLKAECFEHSKVLLELDMAKSQVQVLKKKLNINTQQHVAQILSLKQRVARLQEEEIK 261
Query: 278 ATASDQEIETKLRRLKDLEAEAEQLRKTNLRLQMDNSDLARRLDSTQILANAVLEDP-EA 336
A D E + ++RL+DLE+E +L TN RLQ +N +L+ +L+S QI+AN+ LE+P E
Sbjct: 262 AVLPDLEADKMMQRLRDLESEINELTDTNTRLQFENFELSEKLESVQIIANSKLEEPEEI 321
Query: 337 DAXXXXXXXXXXXXXXXTKEVEQLQADRCSDVEELVYLRWINACLRHEMRNYQPPPGKTV 396
+ K+VEQLQ DRC+D+E+LVYLRWINACLR+E+R YQPP GKTV
Sbjct: 322 ETLREDCNRLRSENEELKKDVEQLQGDRCTDLEQLVYLRWINACLRYELRTYQPPAGKTV 381
Query: 397 ARDLSKSLSPTSEKKAKQLIVEYANN 422
ARDLS +LSPTSE+KAKQLI+EYA++
Sbjct: 382 ARDLSTTLSPTSEEKAKQLILEYAHS 407
>AT3G25690.2 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein
family protein | chr3:9354061-9357757 FORWARD
LENGTH=1004
Length = 1004
Score = 218 bits (554), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 202/321 (62%), Gaps = 9/321 (2%)
Query: 113 DEFLLPEFNDVMK-DAEFGVAG--NSFKKVGPPVAYASLEKDDYEQEIWQLRNMIRMLQE 169
D+ +LPEF D++ + E+ + N+ +K Y +E + E+ +L+ +++ L+E
Sbjct: 85 DDDILPEFEDLLSGEIEYPLPDDDNNLEKAEKERKY-EVEMAYNDGELERLKQLVKELEE 143
Query: 170 RERSLEVQLLEYCGLKEQETVVMELQNRLKISNMEAKMFNLKVETLQSENWRLAEQVADH 229
RE LE +LLEY GLKEQE+ ++ELQ +LKI +E M N+ + +LQ+E +L E+++ +
Sbjct: 144 REVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEELSQN 203
Query: 230 AKVLAELDAAKTKVKFLKRKIRHEAEQNKEQIINLKQRVSKLQDLESQATASDQEIETKL 289
V EL+ A+ K+K L+R+I+ +A Q K Q++ LKQ VS LQ E +A D E+E KL
Sbjct: 204 GIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDTEVERKL 263
Query: 290 RRLKDLEAEAEQLRKTNLRLQMDNSDLARRLDSTQ----ILANAVLEDPEADAXXXXXXX 345
+ ++DLE + +L++ N LQ + +L+ +LDS + L+N D A
Sbjct: 264 KAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAKV-REEVNN 322
Query: 346 XXXXXXXXTKEVEQLQADRCSDVEELVYLRWINACLRHEMRNYQPPPGKTVARDLSKSLS 405
K+VE LQ +R S+VEELVYLRW+NACLR+E+RNYQ P GK ARDLSK+LS
Sbjct: 323 LKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLS 382
Query: 406 PTSEKKAKQLIVEYANNTEGR 426
P S+ KAK+L++EYA + G+
Sbjct: 383 PKSQAKAKRLMLEYAGSERGQ 403
>AT3G25690.1 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein
family protein | chr3:9354061-9357757 FORWARD
LENGTH=1004
Length = 1004
Score = 218 bits (554), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 202/321 (62%), Gaps = 9/321 (2%)
Query: 113 DEFLLPEFNDVMK-DAEFGVAG--NSFKKVGPPVAYASLEKDDYEQEIWQLRNMIRMLQE 169
D+ +LPEF D++ + E+ + N+ +K Y +E + E+ +L+ +++ L+E
Sbjct: 85 DDDILPEFEDLLSGEIEYPLPDDDNNLEKAEKERKY-EVEMAYNDGELERLKQLVKELEE 143
Query: 170 RERSLEVQLLEYCGLKEQETVVMELQNRLKISNMEAKMFNLKVETLQSENWRLAEQVADH 229
RE LE +LLEY GLKEQE+ ++ELQ +LKI +E M N+ + +LQ+E +L E+++ +
Sbjct: 144 REVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKKLQEELSQN 203
Query: 230 AKVLAELDAAKTKVKFLKRKIRHEAEQNKEQIINLKQRVSKLQDLESQATASDQEIETKL 289
V EL+ A+ K+K L+R+I+ +A Q K Q++ LKQ VS LQ E +A D E+E KL
Sbjct: 204 GIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNKDTEVERKL 263
Query: 290 RRLKDLEAEAEQLRKTNLRLQMDNSDLARRLDSTQ----ILANAVLEDPEADAXXXXXXX 345
+ ++DLE + +L++ N LQ + +L+ +LDS + L+N D A
Sbjct: 264 KAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAKV-REEVNN 322
Query: 346 XXXXXXXXTKEVEQLQADRCSDVEELVYLRWINACLRHEMRNYQPPPGKTVARDLSKSLS 405
K+VE LQ +R S+VEELVYLRW+NACLR+E+RNYQ P GK ARDLSK+LS
Sbjct: 323 LKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLS 382
Query: 406 PTSEKKAKQLIVEYANNTEGR 426
P S+ KAK+L++EYA + G+
Sbjct: 383 PKSQAKAKRLMLEYAGSERGQ 403
>AT3G25690.3 | Symbols: CHUP1 | Hydroxyproline-rich glycoprotein
family protein | chr3:9354061-9357757 FORWARD LENGTH=863
Length = 863
Score = 157 bits (396), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 132/209 (63%), Gaps = 5/209 (2%)
Query: 222 LAEQVADHAKVLAELDAAKTKVKFLKRKIRHEAEQNKEQIINLKQRVSKLQDLESQATAS 281
L E+++ + V EL+ A+ K+K L+R+I+ +A Q K Q++ LKQ VS LQ E +A
Sbjct: 55 LQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKEEEAMNK 114
Query: 282 DQEIETKLRRLKDLEAEAEQLRKTNLRLQMDNSDLARRLDSTQ----ILANAVLEDPEAD 337
D E+E KL+ ++DLE + +L++ N LQ + +L+ +LDS + L+N D A
Sbjct: 115 DTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVAK 174
Query: 338 AXXXXXXXXXXXXXXXTKEVEQLQADRCSDVEELVYLRWINACLRHEMRNYQPPPGKTVA 397
K+VE LQ +R S+VEELVYLRW+NACLR+E+RNYQ P GK A
Sbjct: 175 VREEVNNLKHNNEDL-LKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISA 233
Query: 398 RDLSKSLSPTSEKKAKQLIVEYANNTEGR 426
RDLSK+LSP S+ KAK+L++EYA + G+
Sbjct: 234 RDLSKNLSPKSQAKAKRLMLEYAGSERGQ 262
>AT2G36650.1 | Symbols: | unknown protein; Has 35333 Blast hits to
34131 proteins in 2444 species: Archae - 798; Bacteria -
22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses
- 0; Other Eukaryotes - 9610 (source: NCBI BLink). |
chr2:15359994-15361194 FORWARD LENGTH=373
Length = 373
Score = 63.2 bits (152), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 61/235 (25%), Positives = 115/235 (48%), Gaps = 21/235 (8%)
Query: 155 QEIWQLRNMIRMLQERERSLEVQLLEYCGLKEQETVVMELQNRLKISNMEAKMFNLKVET 214
QEI L++ LQ +E +E+ +C LK+QE +++E ++ L + + F +V
Sbjct: 76 QEILSLKSRFEELQRKEYEMELHFERFCNLKDQEVMLIEHKSILSLEKAQLDFFRKEVLA 135
Query: 215 LQSENWRLAEQVADHAKVLAELDAAKTKVKFLKRKIRHEAEQNKEQIINLKQRVSKLQDL 274
++ E+ R V + K++ E+ +++ L+ K + ++K Q+ + K+ +
Sbjct: 136 MEEEHKRGQALVIVYLKLVGEIQELRSENGLLEGKAKKLRRKSK-QLYRVVNESRKIIGV 194
Query: 275 ESQATASDQEIETKLRRLKDLEAEAEQLRKTNLRLQMDNSDL-ARRLDSTQILANAVLED 333
E + E+ETK +K+LE + + + LQ + +L + +ST + + +ED
Sbjct: 195 EKEFLKCVDELETKNNIVKELEGKVKDMEAYVDVLQEEKEELFMKSSNSTSEMVS--VED 252
Query: 334 PEADAXXXXXXXXXXXXXXXTKEVEQLQADRCSDVEELVYLRWINACLRHE-MRN 387
+E E+L+ D + V+E++ LRW NACLRHE MRN
Sbjct: 253 ----------------YRRIVEEYEELKKDYANGVKEVINLRWSNACLRHEVMRN 291