Miyakogusa Predicted Gene
- Lj3g3v0965930.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj3g3v0965930.1 Non Chatacterized Hit- tr|I1MJ67|I1MJ67_SOYBN
Uncharacterized protein OS=Glycine max GN=Gma.33985 PE,92.85,0,FAMILY
NOT NAMED,NULL; Trypsin-like serine proteases,Peptidase
cysteine/serine, trypsin-like; seg,NU,CUFF.41911.1
(601 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT2G35155.1 | Symbols: | Trypsin family protein | chr2:14819209... 718 0.0
AT5G45030.2 | Symbols: | Trypsin family protein | chr5:18172981... 716 0.0
AT5G45030.1 | Symbols: | Trypsin family protein | chr5:18172981... 716 0.0
AT3G12950.1 | Symbols: | Trypsin family protein | chr3:4132798-... 705 0.0
>AT2G35155.1 | Symbols: | Trypsin family protein |
chr2:14819209-14821330 REVERSE LENGTH=579
Length = 579
Score = 718 bits (1854), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/592 (63%), Positives = 438/592 (73%), Gaps = 35/592 (5%)
Query: 10 VRCSGSTPSEESALDLERNC-CGHSNXXXXXXXXX-QPFASAGQHCESNAAYFSWP--SR 65
++ + S+ SE+SALDLERN C H + QPF QH ESNA YFSWP SR
Sbjct: 11 IQAAASSESEDSALDLERNHHCNHLSLPSSSSPSPLQPFTLNIQHAESNAPYFSWPTLSR 70
Query: 66 LNDAAEERANYFLNLQKGVLPETLGRPPKGQQATTLLELMTIRAFHSKILRCYSLGTAIG 125
LND E+RANYF NLQKGVLPET+GR P GQQATTLLELMTIRAFHSKILR +SLGTA+G
Sbjct: 71 LNDTVEDRANYFGNLQKGVLPETVGRLPSGQQATTLLELMTIRAFHSKILRRFSLGTAVG 130
Query: 126 FRIRLGVLTDIPAILVFVSRKVNKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPEP 185
FRI GVLT++PAILVFV+RKV++QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP
Sbjct: 131 FRISRGVLTNVPAILVFVARKVHRQWLNPMQCLPSALEGPGGVWCDVDVVEFQYYGAPAA 190
Query: 186 VPKEQLYTELVDDLRGGDPYIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVD 245
PKEQ+Y ELVD LRG DP IGSGSQVASQETYGTLGAIV+S+TG+ QVGFLTNRHVAVD
Sbjct: 191 TPKEQVYNELVDGLRGSDPCIGSGSQVASQETYGTLGAIVKSRTGNHQVGFLTNRHVAVD 250
Query: 246 LDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDEVWYGIFAGINPETFVRADGAFIPFA 305
LDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+ WYGIFAG NPETFVRADGAFIPFA
Sbjct: 251 LDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDQWYGIFAGTNPETFVRADGAFIPFA 310
Query: 306 DDFDMSFVTTSVRGVGDIGDVKIIDLQSPISSLIGKQVVKVGRSSGLTTGIVLAYGLEYN 365
+DF+ S VTT ++G+G+IGDV +IDLQSPI SLIGKQVVKVGRSSG TTG ++AY LEYN
Sbjct: 311 EDFNTSNVTTLIKGIGEIGDVHVIDLQSPIDSLIGKQVVKVGRSSGYTTGTIMAYALEYN 370
Query: 366 DEKGICFLTDFLVVGENQQTFDLEGDSGSLIMFKGDNGEKPRPIGIIWGGTANRGRLKLK 425
DEKGICFLTDFLV+GENQQTFDLEGDSGSLI+ G NG+KPRP+GIIWGGTANRGRLKL
Sbjct: 371 DEKGICFLTDFLVIGENQQTFDLEGDSGSLILLTGPNGQKPRPVGIIWGGTANRGRLKLI 430
Query: 426 IGQPPENWTSGVDXXXXXXXXXXXXXXXXXXXGVAVQEQRA--ASATVIGSTVGDSSTPD 483
GQ PENWTSGVD A + S T + STV SS PD
Sbjct: 431 AGQEPENWTSGVDLGRLLDLLELDLITSNHELEAAAAAREERNTSVTALDSTVSQSSPPD 490
Query: 484 GILPKDKAEEKFEPLGLHVQSIPLGVEPSSQETKPSSMETEFKVEDRVKVVPSI-EHQFI 542
+ DK +E FEP IP EF +E+ +K + EH FI
Sbjct: 491 PVPSGDKQDESFEPF------IP----------------PEFHIEEAIKPTLEVEEHIFI 528
Query: 543 -PSFIGRSPLHKNSIKDRA--ATENLSSLRNGCDEDLCVSLQLGDNEAKRRR 591
P + S ++IK + +NL +L+N +E++ +SL LG+ + K+ +
Sbjct: 529 APISVNES---TSAIKGQEIPKLDNLMALKNSSEEEVNISLHLGEPKLKKPK 577
>AT5G45030.2 | Symbols: | Trypsin family protein |
chr5:18172981-18175238 FORWARD LENGTH=607
Length = 607
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/603 (63%), Positives = 453/603 (75%), Gaps = 21/603 (3%)
Query: 1 MERPRLNMRVRCSGSTPSEESA-LDLERNCCGHSNXXXXXXXXXQPFASAGQHCESNAA- 58
ME RL++R S S+ S ESA LDL++N H QPF S QH E++AA
Sbjct: 1 MEGKRLDLRFHHSTSSQSVESAALDLDKNVYNH--IKLASSSPLQPFPSGAQHPETSAAA 58
Query: 59 -YFSWP--SRLNDAAEERANYFLNLQKGVLPETLGRPPKGQQATTLLELMTIRAFHSKIL 115
YFSWP SRLND+AE+RANYF NLQKGVLPE+ P G++ATTLLELM IRAFHSK L
Sbjct: 59 AYFSWPTSSRLNDSAEDRANYFANLQKGVLPESFDGLPTGKKATTLLELMMIRAFHSKNL 118
Query: 116 RCYSLGTAIGFRIRLGVLTDIPAILVFVSRKVNKQWLSPIQCLPTALEGPGGVWCDVDVV 175
R +SLGTAIGFRIR GVLT+I AILVFV+RKV+KQWL+P+QCLPTALEGPGGVWCDVDVV
Sbjct: 119 RRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVV 178
Query: 176 EFSYFGAPEPVPKEQLYTELVDDLRGGDPYIGSGSQVASQETYGTLGAIVRSQTGSRQVG 235
EF Y+GAP PKEQ+YTELVDDLRG IGSGSQVASQETYGTLGAIV+S+TG RQVG
Sbjct: 179 EFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQVG 238
Query: 236 FLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDEVWYGIFAGINPETFV 295
FLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD++WYGIFAG NPETFV
Sbjct: 239 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 298
Query: 296 RADGAFIPFADDFDMSFVTTSVRGVGDIGDVKIIDLQSPISSLIGKQVVKVGRSSGLTTG 355
RADGAFIPFA+DF+ + VTT+V+G+G+IGD+ DLQSP++SLIG++VVKVGRSSGLTTG
Sbjct: 299 RADGAFIPFAEDFNTNNVTTTVKGIGEIGDIHATDLQSPVNSLIGRKVVKVGRSSGLTTG 358
Query: 356 IVLAYGLEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIMFKG--DNGEKPRPIGIIW 413
++AY LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI+ + EKPRP+GIIW
Sbjct: 359 TIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIW 418
Query: 414 GGTANRGRLKLKIGQPPENWTSGVDXXXXXXXXXXXXXXXXXXXGVAVQEQR-AASATVI 472
GGTANRGRLKLK+G+ PENWTSGVD AV EQR +
Sbjct: 419 GGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAAV 478
Query: 473 GSTVGDSSTPDGILPKDKAEEKFEPLGLHVQSIPLGVEPSSQETKPSSMETEFKVEDRVK 532
STV +SS + + K E FEP+ L+VQ + + E S++ EF++ED ++
Sbjct: 479 DSTVVESSPGVCNISRCKTGENFEPINLNVQQVLI-------EDDNSNIHPEFQIEDVLE 531
Query: 533 VVPSI-EHQFIPSFIGR-SPLH-KNSIKDRAATENLSSLRNGCDED-LCVSLQLGDNEAK 588
V I EHQFIPS S LH K + + ++NLSSL+ D + SLQLG+++ K
Sbjct: 532 SVAVIEEHQFIPSSSNNGSALHQKPNGPENLESKNLSSLKTSSSGDEIGFSLQLGESDTK 591
Query: 589 RRR 591
+R+
Sbjct: 592 KRK 594
>AT5G45030.1 | Symbols: | Trypsin family protein |
chr5:18172981-18175238 FORWARD LENGTH=607
Length = 607
Score = 716 bits (1848), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 383/603 (63%), Positives = 453/603 (75%), Gaps = 21/603 (3%)
Query: 1 MERPRLNMRVRCSGSTPSEESA-LDLERNCCGHSNXXXXXXXXXQPFASAGQHCESNAA- 58
ME RL++R S S+ S ESA LDL++N H QPF S QH E++AA
Sbjct: 1 MEGKRLDLRFHHSTSSQSVESAALDLDKNVYNH--IKLASSSPLQPFPSGAQHPETSAAA 58
Query: 59 -YFSWP--SRLNDAAEERANYFLNLQKGVLPETLGRPPKGQQATTLLELMTIRAFHSKIL 115
YFSWP SRLND+AE+RANYF NLQKGVLPE+ P G++ATTLLELM IRAFHSK L
Sbjct: 59 AYFSWPTSSRLNDSAEDRANYFANLQKGVLPESFDGLPTGKKATTLLELMMIRAFHSKNL 118
Query: 116 RCYSLGTAIGFRIRLGVLTDIPAILVFVSRKVNKQWLSPIQCLPTALEGPGGVWCDVDVV 175
R +SLGTAIGFRIR GVLT+I AILVFV+RKV+KQWL+P+QCLPTALEGPGGVWCDVDVV
Sbjct: 119 RRFSLGTAIGFRIRRGVLTNIAAILVFVARKVHKQWLNPLQCLPTALEGPGGVWCDVDVV 178
Query: 176 EFSYFGAPEPVPKEQLYTELVDDLRGGDPYIGSGSQVASQETYGTLGAIVRSQTGSRQVG 235
EF Y+GAP PKEQ+YTELVDDLRG IGSGSQVASQETYGTLGAIV+S+TG RQVG
Sbjct: 179 EFQYYGAPAQTPKEQVYTELVDDLRGSGSSIGSGSQVASQETYGTLGAIVKSKTGIRQVG 238
Query: 236 FLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDEVWYGIFAGINPETFV 295
FLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD++WYGIFAG NPETFV
Sbjct: 239 FLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDLWYGIFAGTNPETFV 298
Query: 296 RADGAFIPFADDFDMSFVTTSVRGVGDIGDVKIIDLQSPISSLIGKQVVKVGRSSGLTTG 355
RADGAFIPFA+DF+ + VTT+V+G+G+IGD+ DLQSP++SLIG++VVKVGRSSGLTTG
Sbjct: 299 RADGAFIPFAEDFNTNNVTTTVKGIGEIGDIHATDLQSPVNSLIGRKVVKVGRSSGLTTG 358
Query: 356 IVLAYGLEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIMFKG--DNGEKPRPIGIIW 413
++AY LEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI+ + EKPRP+GIIW
Sbjct: 359 TIMAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLILLAAGDEKNEKPRPVGIIW 418
Query: 414 GGTANRGRLKLKIGQPPENWTSGVDXXXXXXXXXXXXXXXXXXXGVAVQEQR-AASATVI 472
GGTANRGRLKLK+G+ PENWTSGVD AV EQR +
Sbjct: 419 GGTANRGRLKLKVGEQPENWTSGVDLGRVLNLLELDLITSNEGLQAAVLEQRNGIMCAAV 478
Query: 473 GSTVGDSSTPDGILPKDKAEEKFEPLGLHVQSIPLGVEPSSQETKPSSMETEFKVEDRVK 532
STV +SS + + K E FEP+ L+VQ + + E S++ EF++ED ++
Sbjct: 479 DSTVVESSPGVCNISRCKTGENFEPINLNVQQVLI-------EDDNSNIHPEFQIEDVLE 531
Query: 533 VVPSI-EHQFIPSFIGR-SPLH-KNSIKDRAATENLSSLRNGCDED-LCVSLQLGDNEAK 588
V I EHQFIPS S LH K + + ++NLSSL+ D + SLQLG+++ K
Sbjct: 532 SVAVIEEHQFIPSSSNNGSALHQKPNGPENLESKNLSSLKTSSSGDEIGFSLQLGESDTK 591
Query: 589 RRR 591
+R+
Sbjct: 592 KRK 594
>AT3G12950.1 | Symbols: | Trypsin family protein |
chr3:4132798-4134818 REVERSE LENGTH=558
Length = 558
Score = 705 bits (1820), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/577 (65%), Positives = 437/577 (75%), Gaps = 45/577 (7%)
Query: 46 FASAGQHCESNAA-YFSWP--SRLNDAAEERANYFLNLQKG------VLPETLGRPPKGQ 96
+ S GQHCE AA YFSWP SRL++AAEERANYF NLQK V PE + PKGQ
Sbjct: 4 YGSTGQHCEFTAASYFSWPTSSRLSNAAEERANYFSNLQKEEDDDDEVSPEPVSTEPKGQ 63
Query: 97 QATTLLELMTIRAFHSKILRCYSLGTAIGFRIRLGVLTDIPAILVFVSRKVNKQWLSPIQ 156
+ATTLLELMTIRAFHSK+LRCYSLGTAIGFRIR GVLTDIPAI+VFVSRKV+KQWLSP+Q
Sbjct: 64 RATTLLELMTIRAFHSKMLRCYSLGTAIGFRIRRGVLTDIPAIIVFVSRKVHKQWLSPLQ 123
Query: 157 CLPTALEGPGGVWCDVDVVEFSYFGAP--EPVPKEQLYTELVDDLRGGDPYIGSGSQVAS 214
CLPTALEG GG+WCDVDVVEFSYFG P +P PK+ T++VD L+G DP+IGSGSQVAS
Sbjct: 124 CLPTALEGAGGIWCDVDVVEFSYFGEPDHQPTPKQTFTTDIVDHLQGSDPFIGSGSQVAS 183
Query: 215 QETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERAT 274
QET GTLGAIVRSQTG RQVGF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVERAT
Sbjct: 184 QETCGTLGAIVRSQTGGRQVGFVTNRHVAVNLDYPSQKMFHPLPPALGPGVYLGAVERAT 243
Query: 275 SFITDEVWYGIFAGINPETFVRADGAFIPFADDFDMSFVTTSVR-GVGDIGDVKIIDLQS 333
SFITD++W+GIFAG NPETFVRADGAFIPFADD+D+S VTTSV+ GVG+IG+VK I+LQS
Sbjct: 244 SFITDDLWFGIFAGTNPETFVRADGAFIPFADDYDLSRVTTSVKGGVGEIGEVKAIELQS 303
Query: 334 PISSLIGKQVVKVGRSSGLTTGIVLAYGLEYNDEKGICFLTDFLVVGENQQT-FDLEGDS 392
P+ SL+GKQVVKVGRSSGLTTG VLAY LEYNDE+G+CFLTDFLVVGEN ++ FDLEGDS
Sbjct: 304 PVGSLVGKQVVKVGRSSGLTTGTVLAYALEYNDERGVCFLTDFLVVGENHRSPFDLEGDS 363
Query: 393 GSLIMFKGDNGEKPRPIGIIWGGTANRGRLKLKIGQPPENWTSGVDXXXXXXXXXXXXXX 452
GSLI+ KG+ EK RPIGIIWGGT +RGRLKLK+G+ PE+WT+GVD
Sbjct: 364 GSLIVMKGE--EKARPIGIIWGGTGSRGRLKLKVGECPESWTTGVDLGRLLTHLQLDLIT 421
Query: 453 XXXXXGVAVQEQRAASATVIGSTVGDSSTPDGILPKDK--AEEKFEP-LG-LHVQSIPLG 508
AVQEQRAAS T + S V DSS P L K+K EEK E LG L VQ I L
Sbjct: 422 TDEGLKAAVQEQRAASTTGMSSMVADSSPPYVNLKKEKRSPEEKLEASLGPLQVQHIDL- 480
Query: 509 VEPSSQETKPSSMETEFKVEDRVKV---VPSIEHQFIPSFIGRSPLHKNSIKDRAATENL 565
E+R++ PS+EHQF+P+F G+ ++ + A + +
Sbjct: 481 -------------------EERIETKGGAPSVEHQFMPTFSGQ--CSASAWPETAREDLV 519
Query: 566 SSLRNG-CDEDLCVSLQLGDNEAKRRRSEASTSTEEP 601
+ NG CD DLCV L+LGD+ AKRRR++ + P
Sbjct: 520 AGFTNGSCDGDLCVGLRLGDDGAKRRRTQVTNERMRP 556