Lotus
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= TM0301a.4
         (147 letters)

Database: ara_mips 
           26,719 sequences; 11,318,596 total letters

Searching..................................................done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

At2g14640 putative retroelement pol polyprotein                       100  4e-22
At3g11970 hypothetical protein                                         99  6e-22
At1g36590 hypothetical protein                                         99  6e-22
At2g05610 putative retroelement pol polyprotein                        75  1e-14
At3g31970 hypothetical protein                                         66  6e-12
At2g14650 putative retroelement pol polyprotein                        61  2e-10
At4g10580 putative reverse-transcriptase -like protein                 61  2e-10
At2g04670 putative retroelement pol polyprotein                        61  2e-10
At3g29490 hypothetical protein                                         56  8e-09
At1g36120 putative reverse transcriptase gb|AAD22339.1                 54  3e-08
At1g35370 hypothetical protein                                         52  9e-08
At4g16910 retrotransposon like protein                                 51  2e-07
At2g10780 pseudogene                                                   51  3e-07
At4g03840 putative transposon protein                                  48  2e-06
At2g07660 putative retroelement pol polyprotein                        45  1e-05
At2g06470 putative retroelement pol polyprotein                        45  1e-05
At4g07830 putative reverse transcriptase                               42  2e-04
At4g26310 unknown protein                                              32  0.16
At3g31480 hypothetical protein                                         30  0.46
At5g59120 cucumisin precursor - like                                   30  0.60

>At2g14640 putative retroelement pol polyprotein
          Length = 945

 Score =  100 bits (248), Expect = 4e-22
 Identities = 49/142 (34%), Positives = 82/142 (57%), Gaps = 2/142 (1%)

Query: 1   VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
           V +++   R+ ++  R   +L+  +YGP+ +  + G VAYRL LPEG R+HPVFH SLLK
Sbjct: 804 VLLRIQPYRQKTLFRRSSQKLSHRFYGPFQVASKHGEVAYRLTLPEGTRIHPVFHVSLLK 863

Query: 61  EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
             VG+   ++  L  L       + P +V+  R+ ++ +  +  + +QW+G   ++ TW+
Sbjct: 864 PWVGDGEPDMGQLPPLRNNGELKLQPTAVLEVRWRSQDKKRVADLLVQWEGLHIEDATWE 923

Query: 121 DTLNIRSQFP--VFNLEDKVDL 140
           +   + + FP  V NLEDKV L
Sbjct: 924 EYDQLAASFPEFVLNLEDKVRL 945


>At3g11970 hypothetical protein
          Length = 1499

 Score = 99.4 bits (246), Expect = 6e-22
 Identities = 51/132 (38%), Positives = 75/132 (56%), Gaps = 2/132 (1%)

Query: 1    VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
            V++KL   R+ SVV R   +L+  Y+GPY II R G VAY+L LP   +VHPVFH S LK
Sbjct: 1368 VYVKLQPYRQQSVVMRANQKLSPKYFGPYKIIDRCGEVAYKLALPSYSQVHPVFHVSQLK 1427

Query: 61   EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
              VGN S  + L   +  ++V    P  V+  +   RQ   + +V ++W  +P +E TW+
Sbjct: 1428 VLVGNVSTTVHLPSVM--QDVFEKVPEKVVERKMVNRQGKAVTKVLVKWSNEPLEEATWE 1485

Query: 121  DTLNIRSQFPVF 132
               +++  FP F
Sbjct: 1486 FLFDLQKTFPEF 1497


>At1g36590 hypothetical protein
          Length = 1499

 Score = 99.4 bits (246), Expect = 6e-22
 Identities = 51/132 (38%), Positives = 75/132 (56%), Gaps = 2/132 (1%)

Query: 1    VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
            V++KL   R+ SVV R   +L+  Y+GPY II R G VAY+L LP   +VHPVFH S LK
Sbjct: 1368 VYVKLQPYRQQSVVMRANQKLSPKYFGPYKIIDRCGEVAYKLALPSYSQVHPVFHVSQLK 1427

Query: 61   EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
              VGN S  + L   +  ++V    P  V+  +   RQ   + +V ++W  +P +E TW+
Sbjct: 1428 VLVGNVSTTVHLPSVM--QDVFEKVPEKVVERKMVNRQGKAVTKVLVKWSNEPLEEATWE 1485

Query: 121  DTLNIRSQFPVF 132
               +++  FP F
Sbjct: 1486 FLFDLQKTFPEF 1497


>At2g05610 putative retroelement pol polyprotein
          Length = 780

 Score = 75.5 bits (184), Expect = 1e-14
 Identities = 36/73 (49%), Positives = 46/73 (62%)

Query: 1   VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
           VF+KL   R+ SVV R   +L+  Y+GPY +I R G VAY+LQLP   +VHPVFH S L+
Sbjct: 696 VFVKLQPYRQQSVVMRSTQKLSPKYFGPYKVIDRCGEVAYKLQLPANSQVHPVFHVSQLR 755

Query: 61  EAVGNNSVELQLL 73
             VG  +    LL
Sbjct: 756 VLVGTVTTSTHLL 768


>At3g31970 hypothetical protein
          Length = 1329

 Score = 66.2 bits (160), Expect = 6e-12
 Identities = 38/132 (28%), Positives = 69/132 (51%), Gaps = 5/132 (3%)

Query: 1    VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
            V++K+  LR  N  ++    +LT  Y GP+ I++R+G VAYRL+LP+  R  H VFH S+
Sbjct: 1190 VYLKMAMLRGPNRSISET--KLTPRYMGPFRIVERVGPVAYRLELPDVMRAFHKVFHVSM 1247

Query: 59   LKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
            L++ +  ++ V  ++L+ L         P  ++  R    +   +P + + W      E 
Sbjct: 1248 LRKCLHKDDEVLAKILEDLQPNMTLEARPVRILERRIKELRRKKIPLIKVLWNCDGVTEE 1307

Query: 118  TWKDTLNIRSQF 129
            TW+    +++ F
Sbjct: 1308 TWEPEARMKASF 1319


>At2g14650 putative retroelement pol polyprotein
          Length = 1328

 Score = 61.2 bits (147), Expect = 2e-10
 Identities = 36/132 (27%), Positives = 69/132 (52%), Gaps = 5/132 (3%)

Query: 1    VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
            V++K+  LR  N  ++    +L+  Y GP+ I++R+G VAYRL+LP+  R  H VFH S+
Sbjct: 1192 VYLKMAMLRGPNRSISET--KLSPRYMGPFRIVERVGPVAYRLELPDVMRAFHKVFHVSM 1249

Query: 59   LKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
            L++ +  ++ V  ++ + L         P  V+  R    +   +P + + W      + 
Sbjct: 1250 LRKCLHKDDEVLAKIPEDLQPNMTLEARPVRVLERRIKELRRKKIPLIKVLWDCDGVTKE 1309

Query: 118  TWKDTLNIRSQF 129
            TW+    ++++F
Sbjct: 1310 TWEPEARMKARF 1321


>At4g10580 putative reverse-transcriptase -like protein
          Length = 1240

 Score = 60.8 bits (146), Expect = 2e-10
 Identities = 36/132 (27%), Positives = 68/132 (51%), Gaps = 5/132 (3%)

Query: 1    VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
            V++K+  LR  N  ++    +L+  Y GP+ I++R+  VAYRL+LP+  R  H VFH S+
Sbjct: 1101 VYLKMAMLRGPNRSISET--KLSPRYMGPFKIVERVEPVAYRLELPDVMRAFHKVFHVSM 1158

Query: 59   LKEAVGNNSVEL-QLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
            L++ +  +   L ++ + L         P  V+  R    ++  +P + + W      E 
Sbjct: 1159 LRKCLHKDDEALAKIPEDLQPNMTLEARPVRVLERRIKELRQKKIPLIKVLWDCDGVTEE 1218

Query: 118  TWKDTLNIRSQF 129
            TW+    ++++F
Sbjct: 1219 TWEPEARMKARF 1230


>At2g04670 putative retroelement pol polyprotein
          Length = 1411

 Score = 60.8 bits (146), Expect = 2e-10
 Identities = 36/133 (27%), Positives = 69/133 (51%), Gaps = 7/133 (5%)

Query: 1    VFIKLMALR--ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHAS 57
            V++K+  LR    S++     +L+  Y GP+ I++R+G VAYRL+LP+  R  H VFH  
Sbjct: 1272 VYLKMAMLRGPNRSILET---KLSPRYMGPFRIVERVGPVAYRLELPDVMRAFHKVFHVL 1328

Query: 58   LLKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADE 116
            +L++ +  ++ V +++ + L         P  V+  R    +   +P + + W      E
Sbjct: 1329 MLRKCLHKDDEVLVKIPEDLQPNMTLEARPVRVLERRIKELRRKKIPLIKVLWDCDGVTE 1388

Query: 117  PTWKDTLNIRSQF 129
             TW+    ++++F
Sbjct: 1389 ETWEPEARMKARF 1401


>At3g29490 hypothetical protein
          Length = 438

 Score = 55.8 bits (133), Expect = 8e-09
 Identities = 35/132 (26%), Positives = 66/132 (49%), Gaps = 5/132 (3%)

Query: 1   VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
           V++K+  LR  N  ++    +L+  Y GP+ I++R+G VAY L+LP+  R  H VFH S+
Sbjct: 202 VYLKMAMLRGPNRSISET--KLSLRYMGPFRIVERVGPVAYMLELPDVMRAFHKVFHVSM 259

Query: 59  LKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
           L++ +  ++ V  ++ + L            V+  R    Q   +  + + W      E 
Sbjct: 260 LRKCLHKDDEVLAKIPEDLQPNMTLEARQVRVLERRIKELQRKKISLIKVLWDCDGVTEE 319

Query: 118 TWKDTLNIRSQF 129
           TW+    ++++F
Sbjct: 320 TWQPEARMKARF 331


>At1g36120 putative reverse transcriptase gb|AAD22339.1
          Length = 1235

 Score = 53.9 bits (128), Expect = 3e-08
 Identities = 29/101 (28%), Positives = 52/101 (50%), Gaps = 2/101 (1%)

Query: 31   IIQRIGAVAYRLQLPEGGRV-HPVFHASLLKEAV-GNNSVELQLLDHLTGEEVASVHPFS 88
            I++R+G VAYRL+LP+  R  H VFH S+L++ +  ++ V  ++ + L         P  
Sbjct: 1125 IVERVGPVAYRLELPDVMRAFHNVFHVSMLRKCLHKDDEVLAKIPEDLQPNMTLEARPVR 1184

Query: 89   VITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQF 129
            V+  R    +   +P + + W      E TW+    I+++F
Sbjct: 1185 VLERRIKEVRRKKIPMIKVLWDCDGVTEETWEPEARIKARF 1225


>At1g35370 hypothetical protein
          Length = 1447

 Score = 52.4 bits (124), Expect = 9e-08
 Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 25/129 (19%)

Query: 1    VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
            V++KL   R+ SVV R   +L+  Y+GPY II++ G V                      
Sbjct: 1339 VYVKLQPYRQQSVVLRVNQKLSPKYFGPYKIIEKCGEV---------------------- 1376

Query: 61   EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
              VGN +   QL   L   ++    P  ++  +   RQ      V ++W G+P +E TWK
Sbjct: 1377 -MVGNVTTSTQLPSVL--PDIFEKAPEYILERKLVKRQGRAATMVLVKWIGEPVEEATWK 1433

Query: 121  DTLNIRSQF 129
               + + +F
Sbjct: 1434 FLFDRQQKF 1442


>At4g16910 retrotransposon like protein
          Length = 687

 Score = 51.2 bits (121), Expect = 2e-07
 Identities = 32/117 (27%), Positives = 57/117 (48%), Gaps = 10/117 (8%)

Query: 20  QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ-----LL 73
           +L   Y GPY +I+R+GAVAY+L L P+    H VFH S L++ +      ++     L 
Sbjct: 551 KLRPRYVGPYKVIERVGAVAYKLDLPPKLDAFHNVFHVSQLRKCLSEQEESMEDVPPGLK 610

Query: 74  DHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQFP 130
           +++T E      P  ++       +  ++  + I W     +E TW+    +++ FP
Sbjct: 611 ENMTVE----AWPVRIMDQMKKGTRGKSMDLLKILWNCGGREEYTWETETKMKANFP 663


>At2g10780 pseudogene
          Length = 1611

 Score = 50.8 bits (120), Expect = 3e-07
 Identities = 29/113 (25%), Positives = 54/113 (47%), Gaps = 2/113 (1%)

Query: 20   QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ-LLDHLT 77
            +L+  Y GPY +I+R+GAVAY+L L P+    H VFH S L++ + +    ++ +   L 
Sbjct: 1467 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNAFHNVFHVSQLRKCLSDQEESVEDIPPGLK 1526

Query: 78   GEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQFP 130
                    P  ++       +      + + W  +  +E TW+    +++ FP
Sbjct: 1527 ENMTVEAWPVRIMDRMTKGTRGKARDLLKVLWNCRGREEYTWETENKMKANFP 1579


>At4g03840 putative transposon protein
          Length = 973

 Score = 48.1 bits (113), Expect = 2e-06
 Identities = 30/116 (25%), Positives = 57/116 (48%), Gaps = 10/116 (8%)

Query: 20  QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ-----LL 73
           +L+  Y GPY +I+R+GAVAY+L L P+    H VFH S L++ + N    ++     L 
Sbjct: 829 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNAFHNVFHVSQLRKCLSNQEESVEDVPPGLK 888

Query: 74  DHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQF 129
           +++T E      P  ++       +  +   + + W     ++ TW+    +++ F
Sbjct: 889 ENMTVE----AWPVQIMDRMTKGTRGKSRDLLKVLWNCGGREQYTWETENKMKANF 940


>At2g07660 putative retroelement pol polyprotein
          Length = 949

 Score = 45.4 bits (106), Expect = 1e-05
 Identities = 22/53 (41%), Positives = 34/53 (63%), Gaps = 1/53 (1%)

Query: 20  QLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASLLKEAVGNNSVELQ 71
           +L+  Y GPY +I+R+GAVAY+L LP    V H VFH S L++ + +    ++
Sbjct: 859 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNVFHNVFHVSQLRKYLSDQEESVE 911


>At2g06470 putative retroelement pol polyprotein
          Length = 899

 Score = 45.4 bits (106), Expect = 1e-05
 Identities = 21/53 (39%), Positives = 34/53 (63%), Gaps = 1/53 (1%)

Query: 20  QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ 71
           +L+  Y GPY +I+R+GAVAY+L L P+    H VFH S L++ + +    ++
Sbjct: 826 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNAFHNVFHVSQLRKCLSDQEESVE 878


>At4g07830 putative reverse transcriptase
          Length = 611

 Score = 41.6 bits (96), Expect = 2e-04
 Identities = 25/99 (25%), Positives = 48/99 (48%), Gaps = 2/99 (2%)

Query: 33  QRIGAVAYRLQLPEGGRV-HPVFHASLLKEAV-GNNSVELQLLDHLTGEEVASVHPFSVI 90
           QR+G VA+RL+L +  R  H VFH S+L++ +  ++ V  ++ + L         P  V+
Sbjct: 467 QRVGPVAFRLELSDVMRAFHKVFHVSMLRKCLHKDDEVLAKIPEDLQPNMTLEARPVRVL 526

Query: 91  TSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQF 129
             R    +   +P + +        E TW+    ++++F
Sbjct: 527 ERRIKELRRKKIPLIKVLRNCDGVTEETWEPEARLKARF 565


>At4g26310 unknown protein
          Length = 258

 Score = 31.6 bits (70), Expect = 0.16
 Identities = 20/70 (28%), Positives = 33/70 (46%), Gaps = 6/70 (8%)

Query: 18  CPQLTAPYYGPYPIIQRIGAVAYRLQLPEG------GRVHPVFHASLLKEAVGNNSVELQ 71
           C + T P + P+  +QR G     +QL  G      GR   V  A   ++  G  S++++
Sbjct: 52  CCRETPPLHSPWSALQRRGVKVNAIQLRAGNVIERTGRTFRVVEAEHKQQGRGGASIQVE 111

Query: 72  LLDHLTGEEV 81
           L D  TG ++
Sbjct: 112 LRDVDTGNKL 121


>At3g31480 hypothetical protein
          Length = 338

 Score = 30.0 bits (66), Expect = 0.46
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query: 20  QLTAPYYGPYPIIQRIGAVAYR 41
           +L+  Y GP+ I++R+G VAYR
Sbjct: 187 KLSPKYMGPFRIVERVGPVAYR 208


>At5g59120 cucumisin precursor - like
          Length = 732

 Score = 29.6 bits (65), Expect = 0.60
 Identities = 30/116 (25%), Positives = 51/116 (43%), Gaps = 16/116 (13%)

Query: 27  GPYPIIQRIGAVA--YRLQLPEGGRVHPVFHASLLKEAVGNNSVELQLLDH---LTGEEV 81
           G   I++ +GAV   YR   P+   +HP+  A LL E   +    L+  D    +  +  
Sbjct: 395 GGLKIVESVGAVGLIYRTPKPDVAFIHPLPAAGLLTEDFESLVSYLESTDSPQAIVLKTE 454

Query: 82  ASVHPFSVITSRFTTRQESTL-----------PQVWIQWQGKPADEPTWKDTLNIR 126
           A  +  S + + F++R  +T+           P V I     PA EP+  DT +++
Sbjct: 455 AIFNRTSPVIASFSSRGPNTIAVDILKPDITAPGVEILAAYSPAGEPSQDDTRHVK 510


  Database: ara_mips
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,978,382
  Number of sequences in database:  6832
  
  Database: /data/blast2/ara_mips_chr2
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,737,135
  Number of sequences in database:  4184
  
  Database: /data/blast2/ara_mips_chr3
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,236,886
  Number of sequences in database:  5377
  
  Database: /data/blast2/ara_mips_chr4
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 1,748,816
  Number of sequences in database:  4030
  
  Database: /data/blast2/ara_mips_chr5
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 2,569,679
  Number of sequences in database:  6098
  
  Database: /data/blast2/ara_mips_chl
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 25,951
  Number of sequences in database:  85
  
  Database: /data/blast2/ara_mips_mit
    Posted date:  Jul 15, 2004 10:29 AM
  Number of letters in database: 21,747
  Number of sequences in database:  113
  
Lambda     K      H
   0.320    0.137    0.416 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,364,322
Number of Sequences: 26719
Number of extensions: 130940
Number of successful extensions: 313
Number of sequences better than 10.0: 25
Number of HSP's better than 10.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 282
Number of HSP's gapped (non-prelim): 27
length of query: 147
length of database: 11,318,596
effective HSP length: 90
effective length of query: 57
effective length of database: 8,913,886
effective search space: 508091502
effective search space used: 508091502
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (25.8 bits)
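
The footer values above can be cross-checked by hand. The following is a minimal sketch, not part of the BLAST output, assuming the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = search space * 2^-S'), which the report itself does not state. The constants are copied from the report, the function names are hypothetical, and because Lambda and K are printed rounded, recomputed bit scores and E-values only agree approximately with the reported ones.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LEN = 147           # "length of query"
DB_LEN = 11_318_596       # "length of database"
DB_SEQS = 26_719          # "Number of Sequences"
EFF_HSP_LEN = 90          # "effective HSP length"
GAPPED_LAMBDA = 0.267     # gapped Lambda (rounded in the report)
GAPPED_K = 0.0410         # gapped K (rounded in the report)


def effective_search_space(query_len, db_len, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # Each database sequence is shortened by the effective HSP length.
    eff_db = db_len - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam, k):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LEN, DB_SEQS, EFF_HSP_LEN
)
print(eff_q, eff_db, space)  # 57 8913886 508091502

# Top hit At2g14640: raw score 248, reported as 100 bits, Expect = 4e-22.
top_bits = bit_score(248, GAPPED_LAMBDA, GAPPED_K)
top_e = expect_value(248, space, GAPPED_LAMBDA, GAPPED_K)
print(f"{top_bits:.1f} bits, E ~ {top_e:.0e}")
```

The integer results reproduce the footer exactly (effective query length 57, effective database length 8,913,886, search space 508,091,502). The recomputed bit score and E-value for the top hit land close to the reported 100 bits and 4e-22.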

