
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0301a.4
(147 letters)
Database: ara_mips
26,719 sequences; 11,318,596 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
At2g14640 putative retroelement pol polyprotein 100 4e-22
At3g11970 hypothetical protein 99 6e-22
At1g36590 hypothetical protein 99 6e-22
At2g05610 putative retroelement pol polyprotein 75 1e-14
At3g31970 hypothetical protein 66 6e-12
At2g14650 putative retroelement pol polyprotein 61 2e-10
At4g10580 putative reverse-transcriptase -like protein 61 2e-10
At2g04670 putative retroelement pol polyprotein 61 2e-10
At3g29490 hypothetical protein 56 8e-09
At1g36120 putative reverse transcriptase gb|AAD22339.1 54 3e-08
At1g35370 hypothetical protein 52 9e-08
At4g16910 retrotransposon like protein 51 2e-07
At2g10780 pseudogene 51 3e-07
At4g03840 putative transposon protein 48 2e-06
At2g07660 putative retroelement pol polyprotein 45 1e-05
At2g06470 putative retroelement pol polyprotein 45 1e-05
At4g07830 putative reverse transcriptase 42 2e-04
At4g26310 unknown protein 32 0.16
At3g31480 hypothetical protein 30 0.46
At5g59120 cucumisin precursor - like 30 0.60
>At2g14640 putative retroelement pol polyprotein
Length = 945
Score = 100 bits (248), Expect = 4e-22
Identities = 49/142 (34%), Positives = 82/142 (57%), Gaps = 2/142 (1%)
Query: 1 VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
V +++ R+ ++ R +L+ +YGP+ + + G VAYRL LPEG R+HPVFH SLLK
Sbjct: 804 VLLRIQPYRQKTLFRRSSQKLSHRFYGPFQVASKHGEVAYRLTLPEGTRIHPVFHVSLLK 863
Query: 61 EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
VG+ ++ L L + P +V+ R+ ++ + + + +QW+G ++ TW+
Sbjct: 864 PWVGDGEPDMGQLPPLRNNGELKLQPTAVLEVRWRSQDKKRVADLLVQWEGLHIEDATWE 923
Query: 121 DTLNIRSQFP--VFNLEDKVDL 140
+ + + FP V NLEDKV L
Sbjct: 924 EYDQLAASFPEFVLNLEDKVRL 945
>At3g11970 hypothetical protein
Length = 1499
Score = 99.4 bits (246), Expect = 6e-22
Identities = 51/132 (38%), Positives = 75/132 (56%), Gaps = 2/132 (1%)
Query: 1 VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
V++KL R+ SVV R +L+ Y+GPY II R G VAY+L LP +VHPVFH S LK
Sbjct: 1368 VYVKLQPYRQQSVVMRANQKLSPKYFGPYKIIDRCGEVAYKLALPSYSQVHPVFHVSQLK 1427
Query: 61 EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
VGN S + L + ++V P V+ + RQ + +V ++W +P +E TW+
Sbjct: 1428 VLVGNVSTTVHLPSVM--QDVFEKVPEKVVERKMVNRQGKAVTKVLVKWSNEPLEEATWE 1485
Query: 121 DTLNIRSQFPVF 132
+++ FP F
Sbjct: 1486 FLFDLQKTFPEF 1497
>At1g36590 hypothetical protein
Length = 1499
Score = 99.4 bits (246), Expect = 6e-22
Identities = 51/132 (38%), Positives = 75/132 (56%), Gaps = 2/132 (1%)
Query: 1 VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
V++KL R+ SVV R +L+ Y+GPY II R G VAY+L LP +VHPVFH S LK
Sbjct: 1368 VYVKLQPYRQQSVVMRANQKLSPKYFGPYKIIDRCGEVAYKLALPSYSQVHPVFHVSQLK 1427
Query: 61 EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
VGN S + L + ++V P V+ + RQ + +V ++W +P +E TW+
Sbjct: 1428 VLVGNVSTTVHLPSVM--QDVFEKVPEKVVERKMVNRQGKAVTKVLVKWSNEPLEEATWE 1485
Query: 121 DTLNIRSQFPVF 132
+++ FP F
Sbjct: 1486 FLFDLQKTFPEF 1497
>At2g05610 putative retroelement pol polyprotein
Length = 780
Score = 75.5 bits (184), Expect = 1e-14
Identities = 36/73 (49%), Positives = 46/73 (62%)
Query: 1 VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
VF+KL R+ SVV R +L+ Y+GPY +I R G VAY+LQLP +VHPVFH S L+
Sbjct: 696 VFVKLQPYRQQSVVMRSTQKLSPKYFGPYKVIDRCGEVAYKLQLPANSQVHPVFHVSQLR 755
Query: 61 EAVGNNSVELQLL 73
VG + LL
Sbjct: 756 VLVGTVTTSTHLL 768
>At3g31970 hypothetical protein
Length = 1329
Score = 66.2 bits (160), Expect = 6e-12
Identities = 38/132 (28%), Positives = 69/132 (51%), Gaps = 5/132 (3%)
Query: 1 VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
V++K+ LR N ++ +LT Y GP+ I++R+G VAYRL+LP+ R H VFH S+
Sbjct: 1190 VYLKMAMLRGPNRSISET--KLTPRYMGPFRIVERVGPVAYRLELPDVMRAFHKVFHVSM 1247
Query: 59 LKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
L++ + ++ V ++L+ L P ++ R + +P + + W E
Sbjct: 1248 LRKCLHKDDEVLAKILEDLQPNMTLEARPVRILERRIKELRRKKIPLIKVLWNCDGVTEE 1307
Query: 118 TWKDTLNIRSQF 129
TW+ +++ F
Sbjct: 1308 TWEPEARMKASF 1319
>At2g14650 putative retroelement pol polyprotein
Length = 1328
Score = 61.2 bits (147), Expect = 2e-10
Identities = 36/132 (27%), Positives = 69/132 (52%), Gaps = 5/132 (3%)
Query: 1 VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
V++K+ LR N ++ +L+ Y GP+ I++R+G VAYRL+LP+ R H VFH S+
Sbjct: 1192 VYLKMAMLRGPNRSISET--KLSPRYMGPFRIVERVGPVAYRLELPDVMRAFHKVFHVSM 1249
Query: 59 LKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
L++ + ++ V ++ + L P V+ R + +P + + W +
Sbjct: 1250 LRKCLHKDDEVLAKIPEDLQPNMTLEARPVRVLERRIKELRRKKIPLIKVLWDCDGVTKE 1309
Query: 118 TWKDTLNIRSQF 129
TW+ ++++F
Sbjct: 1310 TWEPEARMKARF 1321
>At4g10580 putative reverse-transcriptase -like protein
Length = 1240
Score = 60.8 bits (146), Expect = 2e-10
Identities = 36/132 (27%), Positives = 68/132 (51%), Gaps = 5/132 (3%)
Query: 1 VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
V++K+ LR N ++ +L+ Y GP+ I++R+ VAYRL+LP+ R H VFH S+
Sbjct: 1101 VYLKMAMLRGPNRSISET--KLSPRYMGPFKIVERVEPVAYRLELPDVMRAFHKVFHVSM 1158
Query: 59 LKEAVGNNSVEL-QLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
L++ + + L ++ + L P V+ R ++ +P + + W E
Sbjct: 1159 LRKCLHKDDEALAKIPEDLQPNMTLEARPVRVLERRIKELRQKKIPLIKVLWDCDGVTEE 1218
Query: 118 TWKDTLNIRSQF 129
TW+ ++++F
Sbjct: 1219 TWEPEARMKARF 1230
>At2g04670 putative retroelement pol polyprotein
Length = 1411
Score = 60.8 bits (146), Expect = 2e-10
Identities = 36/133 (27%), Positives = 69/133 (51%), Gaps = 7/133 (5%)
Query: 1 VFIKLMALR--ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHAS 57
V++K+ LR S++ +L+ Y GP+ I++R+G VAYRL+LP+ R H VFH
Sbjct: 1272 VYLKMAMLRGPNRSILET---KLSPRYMGPFRIVERVGPVAYRLELPDVMRAFHKVFHVL 1328
Query: 58 LLKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADE 116
+L++ + ++ V +++ + L P V+ R + +P + + W E
Sbjct: 1329 MLRKCLHKDDEVLVKIPEDLQPNMTLEARPVRVLERRIKELRRKKIPLIKVLWDCDGVTE 1388
Query: 117 PTWKDTLNIRSQF 129
TW+ ++++F
Sbjct: 1389 ETWEPEARMKARF 1401
>At3g29490 hypothetical protein
Length = 438
Score = 55.8 bits (133), Expect = 8e-09
Identities = 35/132 (26%), Positives = 66/132 (49%), Gaps = 5/132 (3%)
Query: 1 VFIKLMALR-ENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASL 58
V++K+ LR N ++ +L+ Y GP+ I++R+G VAY L+LP+ R H VFH S+
Sbjct: 202 VYLKMAMLRGPNRSISET--KLSLRYMGPFRIVERVGPVAYMLELPDVMRAFHKVFHVSM 259
Query: 59 LKEAV-GNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEP 117
L++ + ++ V ++ + L V+ R Q + + + W E
Sbjct: 260 LRKCLHKDDEVLAKIPEDLQPNMTLEARQVRVLERRIKELQRKKISLIKVLWDCDGVTEE 319
Query: 118 TWKDTLNIRSQF 129
TW+ ++++F
Sbjct: 320 TWQPEARMKARF 331
>At1g36120 putative reverse transcriptase gb|AAD22339.1
Length = 1235
Score = 53.9 bits (128), Expect = 3e-08
Identities = 29/101 (28%), Positives = 52/101 (50%), Gaps = 2/101 (1%)
Query: 31 IIQRIGAVAYRLQLPEGGRV-HPVFHASLLKEAV-GNNSVELQLLDHLTGEEVASVHPFS 88
I++R+G VAYRL+LP+ R H VFH S+L++ + ++ V ++ + L P
Sbjct: 1125 IVERVGPVAYRLELPDVMRAFHNVFHVSMLRKCLHKDDEVLAKIPEDLQPNMTLEARPVR 1184
Query: 89 VITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQF 129
V+ R + +P + + W E TW+ I+++F
Sbjct: 1185 VLERRIKEVRRKKIPMIKVLWDCDGVTEETWEPEARIKARF 1225
>At1g35370 hypothetical protein
Length = 1447
Score = 52.4 bits (124), Expect = 9e-08
Identities = 35/129 (27%), Positives = 56/129 (43%), Gaps = 25/129 (19%)
Query: 1 VFIKLMALRENSVVTRDCPQLTAPYYGPYPIIQRIGAVAYRLQLPEGGRVHPVFHASLLK 60
V++KL R+ SVV R +L+ Y+GPY II++ G V
Sbjct: 1339 VYVKLQPYRQQSVVLRVNQKLSPKYFGPYKIIEKCGEV---------------------- 1376
Query: 61 EAVGNNSVELQLLDHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWK 120
VGN + QL L ++ P ++ + RQ V ++W G+P +E TWK
Sbjct: 1377 -MVGNVTTSTQLPSVL--PDIFEKAPEYILERKLVKRQGRAATMVLVKWIGEPVEEATWK 1433
Query: 121 DTLNIRSQF 129
+ + +F
Sbjct: 1434 FLFDRQQKF 1442
>At4g16910 retrotransposon like protein
Length = 687
Score = 51.2 bits (121), Expect = 2e-07
Identities = 32/117 (27%), Positives = 57/117 (48%), Gaps = 10/117 (8%)
Query: 20 QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ-----LL 73
+L Y GPY +I+R+GAVAY+L L P+ H VFH S L++ + ++ L
Sbjct: 551 KLRPRYVGPYKVIERVGAVAYKLDLPPKLDAFHNVFHVSQLRKCLSEQEESMEDVPPGLK 610
Query: 74 DHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQFP 130
+++T E P ++ + ++ + I W +E TW+ +++ FP
Sbjct: 611 ENMTVE----AWPVRIMDQMKKGTRGKSMDLLKILWNCGGREEYTWETETKMKANFP 663
>At2g10780 pseudogene
Length = 1611
Score = 50.8 bits (120), Expect = 3e-07
Identities = 29/113 (25%), Positives = 54/113 (47%), Gaps = 2/113 (1%)
Query: 20 QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ-LLDHLT 77
+L+ Y GPY +I+R+GAVAY+L L P+ H VFH S L++ + + ++ + L
Sbjct: 1467 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNAFHNVFHVSQLRKCLSDQEESVEDIPPGLK 1526
Query: 78 GEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQFP 130
P ++ + + + W + +E TW+ +++ FP
Sbjct: 1527 ENMTVEAWPVRIMDRMTKGTRGKARDLLKVLWNCRGREEYTWETENKMKANFP 1579
>At4g03840 putative transposon protein
Length = 973
Score = 48.1 bits (113), Expect = 2e-06
Identities = 30/116 (25%), Positives = 57/116 (48%), Gaps = 10/116 (8%)
Query: 20 QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ-----LL 73
+L+ Y GPY +I+R+GAVAY+L L P+ H VFH S L++ + N ++ L
Sbjct: 829 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNAFHNVFHVSQLRKCLSNQEESVEDVPPGLK 888
Query: 74 DHLTGEEVASVHPFSVITSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQF 129
+++T E P ++ + + + + W ++ TW+ +++ F
Sbjct: 889 ENMTVE----AWPVQIMDRMTKGTRGKSRDLLKVLWNCGGREQYTWETENKMKANF 940
>At2g07660 putative retroelement pol polyprotein
Length = 949
Score = 45.4 bits (106), Expect = 1e-05
Identities = 22/53 (41%), Positives = 34/53 (63%), Gaps = 1/53 (1%)
Query: 20 QLTAPYYGPYPIIQRIGAVAYRLQLPEGGRV-HPVFHASLLKEAVGNNSVELQ 71
+L+ Y GPY +I+R+GAVAY+L LP V H VFH S L++ + + ++
Sbjct: 859 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNVFHNVFHVSQLRKYLSDQEESVE 911
>At2g06470 putative retroelement pol polyprotein
Length = 899
Score = 45.4 bits (106), Expect = 1e-05
Identities = 21/53 (39%), Positives = 34/53 (63%), Gaps = 1/53 (1%)
Query: 20 QLTAPYYGPYPIIQRIGAVAYRLQL-PEGGRVHPVFHASLLKEAVGNNSVELQ 71
+L+ Y GPY +I+R+GAVAY+L L P+ H VFH S L++ + + ++
Sbjct: 826 KLSPRYVGPYKVIERVGAVAYKLDLPPKLNAFHNVFHVSQLRKCLSDQEESVE 878
>At4g07830 putative reverse transcriptase
Length = 611
Score = 41.6 bits (96), Expect = 2e-04
Identities = 25/99 (25%), Positives = 48/99 (48%), Gaps = 2/99 (2%)
Query: 33 QRIGAVAYRLQLPEGGRV-HPVFHASLLKEAV-GNNSVELQLLDHLTGEEVASVHPFSVI 90
QR+G VA+RL+L + R H VFH S+L++ + ++ V ++ + L P V+
Sbjct: 467 QRVGPVAFRLELSDVMRAFHKVFHVSMLRKCLHKDDEVLAKIPEDLQPNMTLEARPVRVL 526
Query: 91 TSRFTTRQESTLPQVWIQWQGKPADEPTWKDTLNIRSQF 129
R + +P + + E TW+ ++++F
Sbjct: 527 ERRIKELRRKKIPLIKVLRNCDGVTEETWEPEARLKARF 565
>At4g26310 unknown protein
Length = 258
Score = 31.6 bits (70), Expect = 0.16
Identities = 20/70 (28%), Positives = 33/70 (46%), Gaps = 6/70 (8%)
Query: 18 CPQLTAPYYGPYPIIQRIGAVAYRLQLPEG------GRVHPVFHASLLKEAVGNNSVELQ 71
C + T P + P+ +QR G +QL G GR V A ++ G S++++
Sbjct: 52 CCRETPPLHSPWSALQRRGVKVNAIQLRAGNVIERTGRTFRVVEAEHKQQGRGGASIQVE 111
Query: 72 LLDHLTGEEV 81
L D TG ++
Sbjct: 112 LRDVDTGNKL 121
>At3g31480 hypothetical protein
Length = 338
Score = 30.0 bits (66), Expect = 0.46
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 20 QLTAPYYGPYPIIQRIGAVAYR 41
+L+ Y GP+ I++R+G VAYR
Sbjct: 187 KLSPKYMGPFRIVERVGPVAYR 208
>At5g59120 cucumisin precursor - like
Length = 732
Score = 29.6 bits (65), Expect = 0.60
Identities = 30/116 (25%), Positives = 51/116 (43%), Gaps = 16/116 (13%)
Query: 27 GPYPIIQRIGAVA--YRLQLPEGGRVHPVFHASLLKEAVGNNSVELQLLDH---LTGEEV 81
G I++ +GAV YR P+ +HP+ A LL E + L+ D + +
Sbjct: 395 GGLKIVESVGAVGLIYRTPKPDVAFIHPLPAAGLLTEDFESLVSYLESTDSPQAIVLKTE 454
Query: 82 ASVHPFSVITSRFTTRQESTL-----------PQVWIQWQGKPADEPTWKDTLNIR 126
A + S + + F++R +T+ P V I PA EP+ DT +++
Sbjct: 455 AIFNRTSPVIASFSSRGPNTIAVDILKPDITAPGVEILAAYSPAGEPSQDDTRHVK 510
Database: ara_mips
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,978,382
Number of sequences in database: 6832
Database: /data/blast2/ara_mips_chr2
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,737,135
Number of sequences in database: 4184
Database: /data/blast2/ara_mips_chr3
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,236,886
Number of sequences in database: 5377
Database: /data/blast2/ara_mips_chr4
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 1,748,816
Number of sequences in database: 4030
Database: /data/blast2/ara_mips_chr5
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 2,569,679
Number of sequences in database: 6098
Database: /data/blast2/ara_mips_chl
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 25,951
Number of sequences in database: 85
Database: /data/blast2/ara_mips_mit
Posted date: Jul 15, 2004 10:29 AM
Number of letters in database: 21,747
Number of sequences in database: 113
Lambda K H
0.320 0.137 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,364,322
Number of Sequences: 26719
Number of extensions: 130940
Number of successful extensions: 313
Number of sequences better than 10.0: 25
Number of HSP's better than 10.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 282
Number of HSP's gapped (non-prelim): 27
length of query: 147
length of database: 11,318,596
effective HSP length: 90
effective length of query: 57
effective length of database: 8,913,886
effective search space: 508091502
effective search space used: 508091502
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 55 (25.8 bits)
Lotus: description of TM0301a.4