Miyakogusa Predicted Gene

Lj6g3v1934110.1

BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj6g3v1934110.1 CUFF.60229.1
         (703 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G30010.1 | Symbols:  | Intron maturase, type II family protei...   997   0.0  
AT5G46920.1 | Symbols:  | Intron maturase, type II family protei...   735   0.0  
AT1G74350.1 | Symbols:  | Intron maturase, type II family protei...    54   5e-07
ATMG00520.1 | Symbols: MATR | Intron maturase, type II family pr...    53   8e-07
AT5G04050.1 | Symbols:  | RNA-directed DNA polymerase (reverse t...    50   4e-06
AT5G04050.2 | Symbols:  | RNA-directed DNA polymerase (reverse t...    50   4e-06

>AT1G30010.1 | Symbols:  | Intron maturase, type II family protein |
           chr1:10513194-10515329 FORWARD LENGTH=711
          Length = 711

 Score =  997 bits (2578), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/717 (67%), Positives = 561/717 (78%), Gaps = 36/717 (5%)

Query: 2   SLRKIVEHLRCNRSFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPEQEQQEQDPH 61
           SLR IVEH  CN                                       +    QDP+
Sbjct: 10  SLRNIVEHFHCN-------------------TFHKPISSLSISPTLKSESREPSSTQDPY 50

Query: 62  SLLKQDPIEICTSLWVKTFSSPKTTTFPNLTGFLSNFDLWLFAYQRTCAHTTGTFPPRNA 121
           SLLKQDP++IC SLWVK+FSSP + TF NLTGFLS FDLW+ AYQRTCAH TGTFPPRNA
Sbjct: 51  SLLKQDPVDICLSLWVKSFSSPPSATFSNLTGFLSKFDLWVLAYQRTCAHVTGTFPPRNA 110

Query: 122 IHTHVLRDLVSLRNAVIR--GRFSWNDKTRQIIRSPYDK---TFSKPLSKRKLHAVVQSS 176
           IH + LR L+SL+NAV R  G+F WNDK  Q +RSP DK      + +SK K+  +++S 
Sbjct: 111 IHANALRSLLSLQNAVTRSGGKFRWNDKMNQYVRSPKDKISMNGGEGMSKGKVRRIIESE 170

Query: 177 EPCFQDRVVQEVLLMILEPVFEPRFSPKSHGFRPGRNAHTVIRTIRSNFAGYLWFLKGDL 236
           EP FQDRVV EVLLMILEP FE RFS KSHGFRPGRN HTVIRTIRSNFAGYLWF+KGD+
Sbjct: 171 EPIFQDRVVHEVLLMILEPFFEARFSSKSHGFRPGRNPHTVIRTIRSNFAGYLWFMKGDV 230

Query: 237 SEIFENVDVNVVMGCVEKGTRDKKVLGLIKSAL-------VARVIRREESKEELNXX--- 286
           SE+ ++VDV+VVM C++K  +D+KVLGLI+S+L       + RV+ +  +   L      
Sbjct: 231 SEMLDHVDVDVVMNCLQKVVKDRKVLGLIESSLKFSDKRVLKRVVEKHGNDNGLGTKRRI 290

Query: 287 --XXXXXXXXXXXXENEPKPDPYWLRTFFSFAPEEAAKVPSYGQCGILSPLLANVVLNEL 344
                         ++EPKPDPYWLRTF+SFAP+EAAKVPSYG CG+LSPLLANV LNEL
Sbjct: 291 EREKRNKTKKKILSDDEPKPDPYWLRTFYSFAPKEAAKVPSYGYCGVLSPLLANVCLNEL 350

Query: 345 DYMVEEMIVEFFRPSKFDAIWKHSIDDGCHNPAWPEFVPSSGKEKTRKMDYIRYGGHFLI 404
           D  +E  IVE+F P K D+IWK SI+DGCHNPAWPEFVPSSGKEKTRKMDYIRYGGHFLI
Sbjct: 351 DRFMETKIVEYFSPCKDDSIWKESIEDGCHNPAWPEFVPSSGKEKTRKMDYIRYGGHFLI 410

Query: 405 GIRGPREDAVEIRKKIVNFCESTFGLRLDNSKLEIEHITRGIQFLDHIICRRVIHPTLRY 464
           GIRGPREDAV++RK+I++FC+  FG+RLDNSKLEIEHI+RGIQFLDHIICRRVI+PTLRY
Sbjct: 411 GIRGPREDAVKMRKEIIDFCDRVFGVRLDNSKLEIEHISRGIQFLDHIICRRVIYPTLRY 470

Query: 465 TGSGGNIVSEKGVGTLLSVTASLQQCIRQFRRLELVKGDKDPEPLPCNPMLYSGQAHTNS 524
           TGSGG+IVS+KGVGTLLSV+ASL+QCIRQFRRL  VKGDKDPEPLPCNPMLYS Q+H+NS
Sbjct: 471 TGSGGSIVSKKGVGTLLSVSASLEQCIRQFRRLAFVKGDKDPEPLPCNPMLYSSQSHSNS 530

Query: 525 QMNKFLETMADWYRYADNRKKVVGFCAYVVRSSLAKLYAARYRLKSRAKVYGIASRNLSR 584
           QMNKFLETMADWY+YADNRKK VGFCAYV+RSSLAKLYAARYRLKSRAKVY IASR+LS 
Sbjct: 531 QMNKFLETMADWYKYADNRKKAVGFCAYVIRSSLAKLYAARYRLKSRAKVYSIASRDLSH 590

Query: 585 PLRESTNNSAPEYSDLLRMGLVDAIEGVQFSHMSLIPSCDYAPFARNWIPDHERVLHEYI 644
           PL ES+NNSAPEYSDLLRMGLVDAIEGVQFS MSLIPSCDY PF RNWIP+HE+VL EYI
Sbjct: 591 PLSESSNNSAPEYSDLLRMGLVDAIEGVQFSRMSLIPSCDYTPFPRNWIPNHEQVLQEYI 650

Query: 645 KLENPKFFCDLLRSIKQKGLILPQDEVSQMVWDYKTLGVRHFQPDGDKEVKNDLKEI 701
           +L++PKFFC L RSIK++GL LPQDE+S+ VWD+KTLG    + +  +E  + L+++
Sbjct: 651 RLQDPKFFCGLHRSIKREGLTLPQDEISEAVWDFKTLGAWRSKYENKREADDGLQKL 707


>AT5G46920.1 | Symbols:  | Intron maturase, type II family protein |
           chr5:19053668-19055875 FORWARD LENGTH=735
          Length = 735

 Score =  735 bits (1897), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/625 (54%), Positives = 454/625 (72%), Gaps = 8/625 (1%)

Query: 59  DPHSLLKQDPIEICTSLWVKTFSSPKTTTFPNLTGFLSNFDLWLFAYQRTCAHTTGTFPP 118
           DP +LLK+D + +C+ +W++ F  P  T   NLT +L  F+LW+ AYQ+ C    G + P
Sbjct: 65  DPANLLKEDGVSLCSQMWLENFKEPDKTA-TNLTSYLRRFELWVLAYQKVCCDELGAYVP 123

Query: 119 RNAIHTHVLRDLVSLRNAVIRGRFSWNDKTRQIIRSPYDKTFSKPLSKRKLHAVVQSSEP 178
           R++I    L +L++LRN+V+  RF W  +    I+SP DKT  + LSKRK+ A++ +++P
Sbjct: 124 RSSIQRSALENLLALRNSVLDDRFKWGSRLDFYIKSPRDKTDYESLSKRKIKAILTTTQP 183

Query: 179 C-FQDRVVQEVLLMILEPVFEPRFSPKSHGFRPGRNAHTVIRTIRSNFAGYLWFLKGDLS 237
             FQDR+VQEVLLMILEP++E RFS KS  FRPGR AHTV+R IR NFAGYLW++KGDLS
Sbjct: 184 TPFQDRIVQEVLLMILEPIYESRFSQKSFAFRPGRTAHTVLRVIRRNFAGYLWYVKGDLS 243

Query: 238 EIFENVDVNVVMGCVEKGTRDKKVLGLIKSALVARVIRREESKEELNXXXXXXXXXXXXX 297
            + + + V  V+  + +  RDKKV+ LIKSALV  V+  +    E               
Sbjct: 244 VVLDGMKVGFVISSLMRDVRDKKVIDLIKSALVTPVVTSKVEDGEKKKTKKRKYQKKRVL 303

Query: 298 XENEPKPDPYWLRTFFSFAPEEAAKVPSYGQCGILSPLLANVVLNELDYMVEEMIVEFFR 357
            E+EPKPDPYWL TFF FAPEEA K P +G CGILSPLL NV L+ELD  +E  + +F+R
Sbjct: 304 AEDEPKPDPYWLETFFGFAPEEAGKSPQWGHCGILSPLLVNVCLDELDRWMETKVKDFYR 363

Query: 358 PSKFDAIWKH---SIDDGCHNPAWPEFVPSSGKEKTRKMDYIRYGGHFLIGIRGPREDAV 414
           PSK D IW +     D G  N +WPEFVP+SG +KTRKMDY+RYGGH LIG+RGPR DA 
Sbjct: 364 PSKSDVIWNNPEGEADQG--NTSWPEFVPTSGPDKTRKMDYVRYGGHILIGVRGPRADAA 421

Query: 415 EIRKKIVNFCESTFGLRLDNSKLEIEHITRGIQFLDHIICRRVIHPTLRYTGSGGNIVSE 474
            +RK+++ F +  + LRLDN  L IEHIT+GI FLDH++CRRV++PTLRYT +GG I+SE
Sbjct: 422 TLRKELIEFVDQKYMLRLDNENLPIEHITKGIMFLDHVLCRRVVYPTLRYTATGGKIISE 481

Query: 475 KGVGTLLSVTASLQQCIRQFRRLELVKGDKDPEPLPCNPMLYSGQAHTNSQMNKFLETMA 534
           KGVGTLLSVTASL+QCI+QFR+L  +KGD+DP+P PC  M ++ QAHTN+QMNKFL T+A
Sbjct: 482 KGVGTLLSVTASLKQCIKQFRKLLFIKGDRDPDPQPCFRMFHATQAHTNNQMNKFLTTIA 541

Query: 535 DWYRYADNRKKVVGFCAYVVRSSLAKLYAARYRLKSRAKVYGIASRNLSRPLRESTNNSA 594
           +WYR+ADNRKK+V FC+Y++R SLAKLYAA+Y+L+SRAKVY  A+RNLS PL +    S 
Sbjct: 542 EWYRFADNRKKIVNFCSYIIRGSLAKLYAAKYKLRSRAKVYKFANRNLSLPLLQKKGQS- 600

Query: 595 PEYSDLLRMGLVDAIEGVQFSHMSLIPSCDYAPFARNWIPDHERVLHEYIKLENPKFFCD 654
           PEY +LLRMGL ++++G+ ++ MSL+P  DY+PF  NW P+HE+ L EY+ L+ PK   +
Sbjct: 601 PEYQNLLRMGLAESVDGLVYTRMSLVPETDYSPFPGNWRPEHEKFLIEYLTLDEPKTLEE 660

Query: 655 LLRSIKQKGLILPQDEVSQMVWDYK 679
             R I++KGL+ PQD  S +VW+YK
Sbjct: 661 QKRFIREKGLVSPQDYTSMLVWNYK 685


>AT1G74350.1 | Symbols:  | Intron maturase, type II family protein |
           chr1:27949022-27951283 REVERSE LENGTH=753
          Length = 753

 Score = 53.5 bits (127), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 85/289 (29%)

Query: 178 PCFQDRVVQEVLLMILEPVFEPRFSPKSHGFRPGRNAHTVIRTIRSNFAGYLWFLKGDLS 237
           P    +VVQE + ++LE VF P FS  SH  R GR   + ++ I +N +   W     L+
Sbjct: 131 PSVALKVVQEAIRIVLEVVFSPHFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLN 190

Query: 238 E-----IFENVDVNVVMGCVEKGTRDKKVLGLIKSALVARVIRREESKEELNXXXXXXXX 292
           +     +FEN     ++  +E+   D  +  L++S   ARV+  E               
Sbjct: 191 KKLDVSVFEN-----LLSVMEEKVEDSSLSILLRSMFEARVLNLE--------------- 230

Query: 293 XXXXXXENEPKPDPYWLRTFFSFAPEEAAKVPSYGQCGILSPLLANVVLNELDYMVEEMI 352
                               F   P    K     Q G+LS +L N+ L+  D+      
Sbjct: 231 --------------------FGGFP----KGHGLPQEGVLSRVLMNIYLDRFDH------ 260

Query: 353 VEFFRPS-KFDAIWKHSIDD----GCHNPAW-PEFVPSSGKEKTRKMDY------IRYGG 400
            EF+R S + +A+   S  D    G    +W        G + T + D        R+  
Sbjct: 261 -EFYRISMRHEALGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVALRVYCCRFMD 319

Query: 401 HFLIGIRGPREDAVEIRKKIVNF-----------------CESTFGLRL 432
                + GP++ A +IR + + F                 CE+T GLR+
Sbjct: 320 EIYFSVSGPKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRV 368


>ATMG00520.1 | Symbols: MATR | Intron maturase, type II family
           protein | chrM:144294-146312 REVERSE LENGTH=672
          Length = 672

 Score = 52.8 bits (125), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 51/90 (56%), Gaps = 1/90 (1%)

Query: 182 DRVVQEVLLMILEPVFEPRFSPKSHGFRPGRNAHTVIRTIRSNFAGYLWFLKGDLSEIFE 241
           +++++E + M+LE +++P F   SH FR G+  H+V+R I+  +    WFL+ D+ + F 
Sbjct: 14  EKIMKEAIRMVLESIYDPEFPDTSH-FRSGQGCHSVLRRIKEEWGISRWFLEFDIRKCFH 72

Query: 242 NVDVNVVMGCVEKGTRDKKVLGLIKSALVA 271
            +D + ++  +++   D K    I+    A
Sbjct: 73  TIDRHRLIQILKEEIDDPKFFYSIQKVFSA 102


>AT5G04050.1 | Symbols:  | RNA-directed DNA polymerase (reverse
           transcriptase) | chr5:1096092-1097894 FORWARD LENGTH=600
          Length = 600

 Score = 50.4 bits (119), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 178 PCFQDRVVQEVLLMILEPVFEPRFSPKSHGFRPGRNAHTVIRTIRSNFAGYLWFLKGDLS 237
           P  + +V+ E + M+LE V++ RF+  S+G R G   HT IR ++++     W+ +   +
Sbjct: 132 PNLKLKVLIEAIRMVLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFA 191

Query: 238 -EIFENVDVNVVMGCVEKGTRDKKVLGLIK 266
            E+FE  +V+++ G V +   D  ++ +IK
Sbjct: 192 REMFEERNVDILCGFVGEKINDVMLIEMIK 221


>AT5G04050.2 | Symbols:  | RNA-directed DNA polymerase (reverse
           transcriptase) | chr5:1096092-1098512 FORWARD LENGTH=695
          Length = 695

 Score = 50.4 bits (119), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 52/90 (57%), Gaps = 1/90 (1%)

Query: 178 PCFQDRVVQEVLLMILEPVFEPRFSPKSHGFRPGRNAHTVIRTIRSNFAGYLWFLKGDLS 237
           P  + +V+ E + M+LE V++ RF+  S+G R G   HT IR ++++     W+ +   +
Sbjct: 132 PNLKLKVLIEAIRMVLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFA 191

Query: 238 -EIFENVDVNVVMGCVEKGTRDKKVLGLIK 266
            E+FE  +V+++ G V +   D  ++ +IK
Sbjct: 192 REMFEERNVDILCGFVGEKINDVMLIEMIK 221
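
The per-hit statistics lines in the report above (Score, Expect, Identities, Positives, Gaps) follow a regular layout, so they can be pulled out with a small script. What follows is a minimal sketch, not part of BLAST itself: the function names `parse_hits` and `floor_percent` and the regular expressions are assumptions written against the text layout shown here, and the sketch assumes one alignment block per subject, as in this report. It also illustrates that the percentages printed in this report match integer truncation of the ratio (for example 68/289 = 23.5% is printed as 23%).

```python
import re

# Hypothetical patterns, written against the legacy BLASTP text layout above.
SCORE_RE = re.compile(
    r"Score\s*=\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\de.+-]+)"
)
IDENT_RE = re.compile(
    r"Identities\s*=\s*(\d+)/(\d+)\s*\((\d+)%\),\s*"
    r"Positives\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
    r"(?:,\s*Gaps\s*=\s*(\d+)/(\d+)\s*\((\d+)%\))?"
)


def floor_percent(count, length):
    """Percentage with the fractional part dropped (integer truncation)."""
    return (100 * count) // length


def parse_evalue(text):
    """Convert an Expect string to float; legacy output may print 'e-100'."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hits(report_text):
    """Return one dict per '>' subject header, assuming one alignment each."""
    hits = []
    current = None
    for line in report_text.splitlines():
        stripped = line.strip()
        if stripped.startswith(">"):
            # The subject identifier is the text before the first '|'.
            current = {"subject": stripped[1:].split("|")[0].strip()}
            hits.append(current)
            continue
        if current is None:
            continue  # still in the report preamble
        m = SCORE_RE.search(stripped)
        if m:
            current["bits"] = float(m.group(1))
            current["raw_score"] = int(m.group(2))
            current["evalue"] = parse_evalue(m.group(3))
            continue
        m = IDENT_RE.search(stripped)
        if m:
            current["identities"] = int(m.group(1))
            current["align_len"] = int(m.group(2))
            current["identity_pct"] = int(m.group(3))
            current["positives"] = int(m.group(4))
            current["gaps"] = int(m.group(7)) if m.group(7) else 0
    return hits


# Excerpt of two hits from the report above (alignment rows omitted).
SAMPLE = """\
>AT1G30010.1 | Symbols:  | Intron maturase, type II family protein |
           chr1:10513194-10515329 FORWARD LENGTH=711
          Length = 711

 Score =  997 bits (2578), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 482/717 (67%), Positives = 561/717 (78%), Gaps = 36/717 (5%)

>AT1G74350.1 | Symbols:  | Intron maturase, type II family protein |
           chr1:27949022-27951283 REVERSE LENGTH=753
          Length = 753

 Score = 53.5 bits (127), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 68/289 (23%), Positives = 109/289 (37%), Gaps = 85/289 (29%)
"""

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit["subject"], hit["bits"], hit["evalue"],
              floor_percent(hit["identities"], hit["align_len"]))
```

Feeding the full report text to `parse_hits` instead of `SAMPLE` should yield one record per subject. A report with several alignments for the same subject would need a list of alignments per hit rather than the single dict used in this sketch.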