
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147202.9 + phase: 0 /pseudo
(520 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
MC50_ARATH (P92555) Hypothetical mitochondrial protein AtMg01250... 74 9e-13
LIN1_HUMAN (P08547) LINE-1 reverse transcriptase homolog 61 6e-09
POL2_MOUSE (P11369) Retrovirus-related Pol polyprotein [Contains... 60 1e-08
LIN1_NYCCO (P08548) LINE-1 reverse transcriptase homolog 60 1e-08
YTX2_XENLA (P14381) Transposon TX1 hypothetical 149 kDa protein ... 50 2e-05
PO23_POPJA (Q05118) Retrovirus-related Pol polyprotein from type... 47 9e-05
M310_ARATH (P93295) Hypothetical mitochondrial protein AtMg00310... 44 0.001
PO22_POPJA (Q03274) Retrovirus-related Pol polyprotein from type... 41 0.009
PO21_NASVI (Q03278) Retrovirus-related Pol polyprotein from type... 39 0.043
POLR_DROME (P16423) Retrovirus-related Pol polyprotein from type... 34 0.80
YMH5_CAEEL (P34472) Hypothetical protein F58A4.5 in chromosome III 34 1.1
ARAB_BACHD (Q9KBQ3) Ribulokinase (EC 2.7.1.16) 33 2.3
YO84_CAEEL (P34620) Hypothetical protein ZK1236.4 in chromosome III 32 3.1
SES1_XENLA (P58003) Sestrin 1 (p53-regulated protein PA26) (XPA26) 32 3.1
>MC50_ARATH (P92555) Hypothetical mitochondrial protein AtMg01250
(ORF102)
Length = 122
Score = 73.9 bits (180), Expect = 9e-13
Identities = 33/67 (49%), Positives = 45/67 (66%)
Query: 48 LVNQESEGPILPGSGLRKGDPLSPYLFILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFT 107
++N +G + P GLR+GDPLSPYLFILC + L+ L +A+ +G L G+RV +P
Sbjct: 13 IINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRIN 72
Query: 108 HLLFVDD 114
HLLF DD
Sbjct: 73 HLLFADD 79
>LIN1_HUMAN (P08547) LINE-1 reverse transcriptase homolog
Length = 1259
Score = 61.2 bits (147), Expect = 6e-09
Identities = 50/184 (27%), Positives = 88/184 (47%), Gaps = 8/184 (4%)
Query: 1 LILKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVN-QESEGPILP 59
+I+ +D KA+D+I ++ + + +LG D + I + +++N Q+ E P L
Sbjct: 594 MIISIDAEKAFDKIQQPFMLKPLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAPPLK 653
Query: 60 GSGLRKGDPLSPYLFILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFTHLLFVDDCFLFC 119
+G R+G PLSP L + + L I + + ++ G+++ G LF DD ++
Sbjct: 654 -TGTRQGCPLSPLLPNIVLEVLARAIRQEK---EIKGIQL--GKEEVKLSLFADDMIVYL 707
Query: 120 RAT*REVNALKNILSIYEDASGQQINFQKSEIFFSKNVGA*AKVNILNILQVNIAI*TGK 179
L ++S + SG +IN QKS+ F N + I++ L IA K
Sbjct: 708 ENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTN-NRQTESQIMSELPFTIASKRIK 766
Query: 180 YLGL 183
YLG+
Sbjct: 767 YLGI 770
>POL2_MOUSE (P11369) Retrovirus-related Pol polyprotein [Contains:
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1300
Score = 60.5 bits (145), Expect = 1e-08
Identities = 49/184 (26%), Positives = 83/184 (44%), Gaps = 8/184 (4%)
Query: 1 LILKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILPG 60
+I+ +D KA+D+I ++ +V+ R G + I + VN E I
Sbjct: 621 MIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIKVNGEKLEAIPLK 680
Query: 61 SGLRKGDPLSPYLFILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFTHLLFVDDCFLFCR 120
SG R+G PLSPYLF + + L I + + ++ G+++ G L DD ++
Sbjct: 681 SGTRQGCPLSPYLFNIVLEVLARAIRQQK---EIKGIQI--GKEEVKISLLADDMIVYIS 735
Query: 121 AT*REVNALKNILSIYEDASGQQINFQKSEIF-FSKNVGA*AKVNILNILQVNIAI*TGK 179
L N+++ + + G +IN KS F ++KN A+ I +I K
Sbjct: 736 DPKNSTRELLNLINSFGEVVGYKINSNKSMAFLYTKNKQ--AEKEIRETTPFSIVTNNIK 793
Query: 180 YLGL 183
YLG+
Sbjct: 794 YLGV 797
>LIN1_NYCCO (P08548) LINE-1 reverse transcriptase homolog
Length = 1260
Score = 60.1 bits (144), Expect = 1e-08
Identities = 49/187 (26%), Positives = 81/187 (43%), Gaps = 14/187 (7%)
Query: 1 LILKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILPG 60
+IL +D KA+D I ++ R + ++G + G + +E++ K N G L
Sbjct: 594 MILSIDAEKAFDNIQHPFMIRTLKKIGIE----GTFLKLIEAIYSKPTANIILNGVKLKS 649
Query: 61 ----SGLRKGDPLSPYLFILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFTHLLFVDDCF 116
SG R+G PLSP LF + + L I + + +H G+ LF DD
Sbjct: 650 FPLRSGTRQGCPLSPLLFNIVMEVLAIAIREEKAIKGIH-----IGSEEIKLSLFADDMI 704
Query: 117 LFCRAT*REVNALKNILSIYEDASGQQINFQKSEIFFSKNVGA*AKVNILNILQVNIAI* 176
++ T L ++ Y + SG +IN KS F N A+ + + + +
Sbjct: 705 VYLENTRDSTTKLLEVIKEYSNVSGYKINTHKSVAFIYTNNNQ-AEKTVKDSIPFTVVPK 763
Query: 177 TGKYLGL 183
KYLG+
Sbjct: 764 KMKYLGV 770
>YTX2_XENLA (P14381) Transposon TX1 hypothetical 149 kDa protein
(ORF 2)
Length = 1308
Score = 49.7 bits (117), Expect = 2e-05
Identities = 37/147 (25%), Positives = 65/147 (44%), Gaps = 6/147 (4%)
Query: 3 LKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILPGSG 62
L +D KA+DR++ +YL + F ++ G++ S + V +N P+ G G
Sbjct: 591 LSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAFGRG 650
Query: 63 LRKGDPLSPYLFILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFTHLLFVDDCFLFCRAT 122
+R+G PLS L+ L + L+ K L G+ + + DD L +
Sbjct: 651 VRQGCPLSGQLYSLAIEPFLCLLRKR-----LTGLVLKEPDMRVVLSAYADDVILVAQDL 705
Query: 123 *REVNALKNILSIYEDASGQQINFQKS 149
++ + +Y AS +IN+ KS
Sbjct: 706 -VDLERAQECQEVYAAASSARINWSKS 731
>PO23_POPJA (Q05118) Retrovirus-related Pol polyprotein from type I
retrotransposable element R2 [Contains: Reverse
transcriptase (EC 2.7.7.49); Endonuclease] (Fragment)
Length = 606
Score = 47.4 bits (111), Expect = 9e-05
Identities = 33/117 (28%), Positives = 54/117 (45%), Gaps = 8/117 (6%)
Query: 2 ILKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILPGS 61
++ +D+ KA+D ++ + R M G D +IM + ++V + I +
Sbjct: 25 VVSLDIRKAFDTVSHPAILRAMRAFGIDDGMQDFIMSTITDAYTNIVVGGRTTNKIYIRN 84
Query: 62 GLRKGDPLSPYLF-ILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFTHLLFVDDCFL 117
G+++GDPLSP LF I+ + +T L D+ G T A L F DD L
Sbjct: 85 GVKQGDPLSPVLFNIVLDELVTRLNDEQPGAS-------MTPACKIASLAFADDLLL 134
>M310_ARATH (P93295) Hypothetical mitochondrial protein AtMg00310
(ORF154)
Length = 154
Score = 43.5 bits (101), Expect = 0.001
Identities = 18/58 (31%), Positives = 35/58 (60%)
Query: 305 LKARYYRRNAIFEANKGNNLSYVWKSIHEAIPMVKQGTRWKIGNGDKIRVFQDRWLVN 362
L++RY+ +++ E + G SY W+SI ++ +G IG+G +V+ DRW+++
Sbjct: 89 LRSRYFPHSSMMECSVGTRPSYAWRSIIHGRELLSRGLLRTIGDGIHTKVWLDRWIMD 146
>PO22_POPJA (Q03274) Retrovirus-related Pol polyprotein from type I
retrotransposable element R2 [Contains: Reverse
transcriptase (EC 2.7.7.49); Endonuclease] (Fragment)
Length = 711
Score = 40.8 bits (94), Expect = 0.009
Identities = 31/117 (26%), Positives = 49/117 (41%), Gaps = 7/117 (5%)
Query: 2 ILKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEG-PILPG 60
++ +D+ KA+D ++ + R + RLG D +I + + V S+ I
Sbjct: 139 VVSLDVRKAFDTVSHSSICRALQRLGIDEGTSNYITGSLSDSTTTIRVGPGSQTRKICIR 198
Query: 61 SGLRKGDPLSPYLFILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFTHLLFVDDCFL 117
G+++GDPLSP+LF L + G G G L F DD L
Sbjct: 199 RGVKQGDPLSPFLFNAVLDELLCSLQSTPGIGG------TIGEEKIPVLAFADDLLL 249
>PO21_NASVI (Q03278) Retrovirus-related Pol polyprotein from type I
retrotransposable element R2 [Contains: Reverse
transcriptase (EC 2.7.7.49); Endonuclease] (Fragment)
Length = 1025
Score = 38.5 bits (88), Expect = 0.043
Identities = 29/113 (25%), Positives = 47/113 (40%), Gaps = 8/113 (7%)
Query: 5 VDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILPGSGLR 64
+D+ KA+D + + + R + +IM + K ++ V + I P G+R
Sbjct: 455 LDVKKAFDSVEHRSILDALRRKKLPLEMRNYIMWVYRNSKTRLEVVKTKGRWIRPARGVR 514
Query: 65 KGDPLSPYLFILCAQGLTSLIDKAEGRGDLHGVRVCTGAPTFTHLLFVDDCFL 117
+GDPLSP LF + + + G GA L+F DD L
Sbjct: 515 QGDPLSPLLFNCVMDAVLRRLPENTG--------FLMGAEKIGALVFADDLVL 559
>POLR_DROME (P16423) Retrovirus-related Pol polyprotein from type II
retrotransposable element R2DM [Contains: Protease (EC
3.4.23.-); Reverse transcriptase (EC 2.7.7.49);
Endonuclease]
Length = 1057
Score = 34.3 bits (77), Expect = 0.80
Identities = 20/75 (26%), Positives = 36/75 (47%)
Query: 2 ILKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILPGS 61
I +D+SKA+D ++ + + G + ++ E + + S +P
Sbjct: 478 IANLDVSKAFDSLSHASIYDTLRAYGAPKGFVDYVQNTYEGGGTSLNGDGWSSEEFVPAR 537
Query: 62 GLRKGDPLSPYLFIL 76
G+++GDPLSP LF L
Sbjct: 538 GVKQGDPLSPILFNL 552
>YMH5_CAEEL (P34472) Hypothetical protein F58A4.5 in chromosome III
Length = 1222
Score = 33.9 bits (76), Expect = 1.1
Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 3/86 (3%)
Query: 2 ILKVDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILP-G 60
IL D +KA+D+++ L + + G D W + + V +N+ P
Sbjct: 750 ILFFDFAKAFDKVSHPILLKKLALFGLDKLTCSWFKEFLHLRTFSVKINKFVSSNAYPIS 809
Query: 61 SGLRKGDPLSPYLFILCAQGLTSLID 86
SG+ +G P LFIL L LID
Sbjct: 810 SGVPQGSVSGPLLFILFINDL--LID 833
>ARAB_BACHD (Q9KBQ3) Ribulokinase (EC 2.7.1.16)
Length = 563
Score = 32.7 bits (73), Expect = 2.3
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 5/81 (6%)
Query: 47 VLVNQESEGPILPGSGLRKGDPLSPYLFILCAQGLTSLIDKAEGRG-DLHGVRVCTGAPT 105
+LV+ E G +L + K + + L A G +++D GRG ++H + C G P
Sbjct: 391 ILVDTELSGMLLGYTLQTKPEEIYRALLEATAFGTRAIVDAFHGRGVEVHELYACGGLPQ 450
Query: 106 FTHLLFVDDCFLFCRAT*REV 126
HLL +F T RE+
Sbjct: 451 KNHLLMQ----IFADVTNREI 467
>YO84_CAEEL (P34620) Hypothetical protein ZK1236.4 in chromosome
III
Length = 364
Score = 32.3 bits (72), Expect = 3.1
Identities = 17/70 (24%), Positives = 34/70 (48%)
Query: 5 VDMSKAYDRINWEYLRRVMLRLGFDAKWFGWIMMCVESVKYKVLVNQESEGPILPGSGLR 64
+D SKA+D+++ + L + + + W+ + + + +KV V P G+
Sbjct: 23 LDFSKAFDKVSHDILLDKLTSIKINKHLIRWLDVFLTNRSFKVKVGNTLSEPKKTVCGVP 82
Query: 65 KGDPLSPYLF 74
+G +SP LF
Sbjct: 83 QGSVISPVLF 92
>SES1_XENLA (P58003) Sestrin 1 (p53-regulated protein PA26) (XPA26)
Length = 481
Score = 32.3 bits (72), Expect = 3.1
Identities = 19/77 (24%), Positives = 39/77 (49%), Gaps = 6/77 (7%)
Query: 379 NVKELLQPGQKSWNLDVLQHYLSSQSVFEAMNTLLFNSTAYDERVWRPGKRGIYTVR--S 436
++K+LL+ G+ SW+L L H + + + ++ + F E G +T R S
Sbjct: 174 HIKQLLKAGEHSWSLAELIHAIVLLAHYHSLASFTFGCGINPE----IHSNGGHTFRPPS 229
Query: 437 ASRFCLTELVEGDYGCQ 453
S +C+ ++ G++G +
Sbjct: 230 VSNYCICDITNGNHGTE 246
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.344 0.154 0.514
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,886,785
Number of Sequences: 164201
Number of extensions: 2230188
Number of successful extensions: 7522
Number of sequences better than 10.0: 14
Number of HSP's better than 10.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 7508
Number of HSP's gapped (non-prelim): 15
length of query: 520
length of database: 59,974,054
effective HSP length: 115
effective length of query: 405
effective length of database: 41,090,939
effective search space: 16641830295
effective search space used: 16641830295
T: 11
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.6 bits)
S2: 68 (30.8 bits)
Medicago: description of AC147202.9