Miyakogusa Predicted Gene
- Lj6g3v1880260.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj6g3v1880260.1 Non Chatacterized Hit- tr|K4AXN7|K4AXN7_SOLLC
Uncharacterized protein OS=Solanum lycopersicum
GN=Sol,41.73,0.000000000000009,seg,NULL; Acid proteases,Peptidase
aspartic; no description,Peptidase aspartic, catalytic; BASIC 7S
,NODE_43012_length_1315_cov_14.914829.path1.1
(215 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                   Score   E
Sequences producing significant alignments:                       (bits)   Value
AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family pr...   142   1e-34
AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family pr...   136   9e-33
AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family pr...   115   2e-26
AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family pr...   115   2e-26
AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family pr...   113   1e-25
AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family pr...    80   1e-15
AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family pr...    55   2e-08
AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family pr...    55   5e-08
AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family pr...    52   4e-07
AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family pr...    52   4e-07
AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family pr...    49   4e-06
>AT1G03220.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:787143-788444 FORWARD LENGTH=433
Length = 433
Score = 142 bits (359), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/183 (42%), Positives = 111/183 (60%), Gaps = 7/183 (3%)
Query: 12 EFQTTPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSID-KKGNGGTK 70
QTTPL++NPV+T + G S EYFI V ++ I + + P++L I+ G GGTK
Sbjct: 228 SLQTTPLLINPVSTASAFSQGEKSSEYFIGVTAIQIVEKTVPINPTLLKINASTGIGGTK 287
Query: 71 ISTISAFTELQSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPT 130
IS+++ +T L+SS+Y F E++K A+ +KRVA+V PF AC+ + + T G VP
Sbjct: 288 ISSVNPYTVLESSIYNAFTSEFVKQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPE 347
Query: 131 IDLVMQG-GAQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLL 189
I+LV+ +I GAN+MV V +V CL VDGG R S+V+G QLEDNL+
Sbjct: 348 IELVLHSKDVVWRIFGANSMVSVSDDVICLGFVDGGVNAR-----TSVVIGGFQLEDNLI 402
Query: 190 VFD 192
FD
Sbjct: 403 EFD 405
>AT1G03230.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:790110-791414 FORWARD LENGTH=434
Length = 434
Score = 136 bits (343), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 108/183 (59%), Gaps = 7/183 (3%)
Query: 12 EFQTTPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSID-KKGNGGTK 70
Q TPL++NP T G S EYFI V ++ I L + P++L I+ G GGTK
Sbjct: 229 RLQKTPLLINPGTTVFEFSKGEKSPEYFIGVTAIKIVEKTLPIDPTLLKINASTGIGGTK 288
Query: 71 ISTISAFTELQSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPT 130
IS+++ +T L+SS+Y+ F E+++ A+ +KRVA+V PF AC+ + + T G VP
Sbjct: 289 ISSVNPYTVLESSIYKAFTSEFIRQAAARSIKRVASVKPFGACFSTKNVGVTRLGYAVPE 348
Query: 131 IDLVMQG-GAQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLL 189
I LV+ +I GAN+MV V +V CL VDGG P AS+V+G QLEDNL+
Sbjct: 349 IQLVLHSKDVVWRIFGANSMVSVSDDVICLGFVDGGVNP-----GASVVIGGFQLEDNLI 403
Query: 190 VFD 192
FD
Sbjct: 404 EFD 406
>AT5G48430.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:19627892-19629112 REVERSE LENGTH=406
Length = 406
Score = 115 bits (289), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 91/157 (57%), Gaps = 13/157 (8%)
Query: 36 QEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKA 95
YF+ +K + +NGN + P+ + D+ G+GG +STI FT L+S +YR+FI + +A
Sbjct: 237 NNYFLGLKGISVNGNRILFAPNAFAFDRNGDGGVTLSTIFPFTMLRSDIYRVFIEAFSQA 296
Query: 96 ASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKN 155
S + RV++ PFE C +TT F VP IDL + G K+ ANAM V +
Sbjct: 297 TSG--IPRVSSTTPFEFCLSTTTNF------QVPRIDLELANGVIWKLSPANAMKKVSDD 348
Query: 156 VACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
VACLA V+GG A ++++G HQ+E+ L+ FD
Sbjct: 349 VACLAFVNGG-----DAAAQAVMIGIHQMENTLVEFD 380
>AT5G19100.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6408242-6409417 REVERSE LENGTH=391
Length = 391
Score = 115 bits (288), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/180 (38%), Positives = 97/180 (53%), Gaps = 35/180 (19%)
Query: 13 FQTTPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKIS 72
F +TPLI N G S EY IDVKS+ I + + G TKIS
Sbjct: 220 FASTPLIGN-----------GKSGEYLIDVKSIQIGAKTVPIP----------YGATKIS 258
Query: 73 TISAFTELQSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTID 132
T++ +T Q+S+Y+ + + + + K+ + AV PF AC+ S G VP ID
Sbjct: 259 TLAPYTVFQTSLYKALLTAFTE---NIKIAKAPAVKPFGACFYSN------GGRGVPVID 309
Query: 133 LVMQGGAQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
LV+ GGA+ +I G+N++V V KNV CL VDGG +P K IV+G Q+EDNL+ FD
Sbjct: 310 LVLSGGAKWRIYGSNSLVKVNKNVVCLGFVDGGVKP-----KYPIVIGGFQMEDNLVEFD 364
>AT5G19110.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6411720-6413170 REVERSE LENGTH=405
Length = 405
Score = 113 bits (282), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 96/176 (54%), Gaps = 15/176 (8%)
Query: 21 NPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTEL 80
NP+ I G S +Y I VKS+ + G L L P +L+ GG K+ST+ +T L
Sbjct: 215 NPIPRTLTPIKGTDSGDYLITVKSIYVGGTALKLNPDLLT------GGAKLSTVVHYTVL 268
Query: 81 QSSVYRIFIREYLKAASDSKLKRVAAVAPFEACYDSTTIFNTL-AGLNVPTIDLVMQGG- 138
Q+ +Y + + A + +V +VAPF+ C+DS T L AG NVP I++ + G
Sbjct: 269 QTDIYNALAQSFTLKAKAMGIAKVPSVAPFKHCFDSRTAGKNLTAGPNVPVIEIGLPGRI 328
Query: 139 --AQGKILGANAMVMVKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
+ GAN +V VK+ V CLA +DGG P K +V+G HQL+D++L FD
Sbjct: 329 GEVKWGFYGANTVVKVKETVMCLAFIDGGKTP-----KDLMVIGTHQLQDHMLEFD 379
>AT5G19120.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr5:6414585-6415745 FORWARD LENGTH=386
Length = 386
Score = 79.7 bits (195), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 85/161 (52%), Gaps = 25/161 (15%)
Query: 33 GASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREY 92
G+S Y I+VKS+ +NG L+++ G ++ST+ +T L+SS+Y++F Y
Sbjct: 231 GSSGNYVINVKSIRVNGEKLSVE---------GPLAVELSTVVPYTILESSIYKVFAEAY 281
Query: 93 LKAASDSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGG-AQGKILGANAMVM 151
KAA ++ V VAPF C+ S F P +DL +Q + +I G N MV
Sbjct: 282 AKAAGEA--TSVPPVAPFGLCFTSDVDF--------PAVDLALQSEMVRWRIHGKNLMVD 331
Query: 152 VKKNVACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
V V C IVDGG+ R++ IV+G QLE +L FD
Sbjct: 332 VGGGVRCSGIVDGGSS-RVNP----IVMGGLQLEGFILDFD 367
>AT1G25510.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr1:8959372-8960823 REVERSE LENGTH=483
Length = 483
Score = 55.5 bits (132), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 71/156 (45%), Gaps = 15/156 (9%)
Query: 38 YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
Y++ + + + G +L + S +D+ G+GG I + +A T LQ+ +Y ++K
Sbjct: 329 YYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTL 388
Query: 98 DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKK-NV 156
D L++ A VA F+ CY+ + + VPT+ GG + N M+ V
Sbjct: 389 D--LEKAAGVAMFDTCYN----LSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGT 442
Query: 157 ACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
CLA P S++ ++G Q + + FD
Sbjct: 443 FCLAFA-----PTASSLA---IIGNVQQQGTRVTFD 470
>AT3G25700.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:9358937-9360295 FORWARD LENGTH=452
Length = 452
Score = 54.7 bits (130), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 65/147 (44%), Gaps = 16/147 (10%)
Query: 16 TPLIVNPVATGAVTIPGGASQEYFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTIS 75
TPL+ NP++ Y++ +KSV +NG L + PS+ ID GNGGT + + +
Sbjct: 280 TPLLTNPLSP----------TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGT 329
Query: 76 AFTELQSSVYRIFIREYLKAASDSKLKRVAAVAP-FEACYDSTTIFNTLAGLNVPTIDLV 134
L YR I + KL A+ P F+ C + + + T +P +
Sbjct: 330 TLAFLAEPAYRSVIAAVRRRV---KLPIADALTPGFDLCVNVSGV--TKPEKILPRLKFE 384
Query: 135 MQGGAQGKILGANAMVMVKKNVACLAI 161
GGA N + ++ + CLAI
Sbjct: 385 FSGGAVFVPPPRNYFIETEEQIQCLAI 411
>AT3G59080.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=535
Length = 535
Score = 51.6 bits (122), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 73/155 (47%), Gaps = 12/155 (7%)
Query: 38 YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
Y++ +KS+L+ G VLN+ +I G GGT I + + + Y FI+ + +
Sbjct: 377 YYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE-FIKNKIAEKA 435
Query: 98 DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKNVA 157
K + C++ + I N + +P + + GA N+ + + +++
Sbjct: 436 KGKYPVYRDFPILDPCFNVSGIHN----VQLPELGIAFADGAVWNFPTENSFIWLNEDLV 491
Query: 158 CLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
CLA++ GT P+ A ++G +Q ++ +++D
Sbjct: 492 CLAML--GT-PK----SAFSIIGNYQQQNFHILYD 519
>AT3G59080.2 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:21836812-21838419 FORWARD LENGTH=499
Length = 499
Score = 51.6 bits (122), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 73/155 (47%), Gaps = 12/155 (7%)
Query: 38 YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
Y++ +KS+L+ G VLN+ +I G GGT I + + + Y FI+ + +
Sbjct: 341 YYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYE-FIKNKIAEKA 399
Query: 98 DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKNVA 157
K + C++ + I N + +P + + GA N+ + + +++
Sbjct: 400 KGKYPVYRDFPILDPCFNVSGIHN----VQLPELGIAFADGAVWNFPTENSFIWLNEDLV 455
Query: 158 CLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
CLA++ GT P+ A ++G +Q ++ +++D
Sbjct: 456 CLAML--GT-PK----SAFSIIGNYQQQNFHILYD 483
>AT3G18490.1 | Symbols: | Eukaryotic aspartyl protease family
protein | chr3:6349090-6350592 REVERSE LENGTH=500
Length = 500
Score = 48.5 bits (114), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 71/156 (45%), Gaps = 14/156 (8%)
Query: 38 YFIDVKSVLINGNVLNLKPSMLSIDKKGNGGTKISTISAFTELQSSVYRIFIREYLKAAS 97
Y++ + + G + L ++ +D G+GG + +A T LQ+ Y +LK
Sbjct: 345 YYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTV 404
Query: 98 DSKLKRVAAVAPFEACYDSTTIFNTLAGLNVPTIDLVMQGGAQGKILGANAMVMVKKN-V 156
+ K K ++++ F+ CYD F++L+ + VPT+ GG + N ++ V +
Sbjct: 405 NLK-KGSSSISLFDTCYD----FSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGT 459
Query: 157 ACLAIVDGGTEPRMSAVKASIVVGAHQLEDNLLVFD 192
C A P S++ ++G Q + + +D
Sbjct: 460 FCFAFA-----PTSSSLS---IIGNVQQQGTRITYD 487