
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147536.4 + phase: 0 /pseudo
(805 letters)
Database: sprot
164,201 sequences; 59,974,054 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
POL2_MOUSE (P11369) Retrovirus-related Pol polyprotein [Contains...  82  7e-15
LIN1_HUMAN (P08547) LINE-1 reverse transcriptase homolog             81  1e-14
LIN1_NYCCO (P08548) LINE-1 reverse transcriptase homolog             80  2e-14
MC50_ARATH (P92555) Hypothetical mitochondrial protein AtMg01250...  65  9e-10
POLR_DROME (P16423) Retrovirus-related Pol polyprotein from type...  64  1e-09
PO23_POPJA (Q05118) Retrovirus-related Pol polyprotein from type...  57  3e-07
PO21_NASVI (Q03278) Retrovirus-related Pol polyprotein from type...  49  7e-05
PO22_POPJA (Q03274) Retrovirus-related Pol polyprotein from type...  45  8e-04
YTX2_XENLA (P14381) Transposon TX1 hypothetical 149 kDa protein ...  44  0.002
YO84_CAEEL (P34620) Hypothetical protein ZK1236.4 in chromosome III  35  0.60
PO21_SCICO (Q03279) Retrovirus-related Pol polyprotein from type...  34  1.8
TRPB_STRR6 (Q8DNM8) Tryptophan synthase beta chain (EC 4.2.1.20)     33  3.0
TRPB_STRPN (Q97P32) Tryptophan synthase beta chain (EC 4.2.1.20)     33  3.0
AI2M_YEAST (P03876) Putative COX1/OXI3 intron 2 protein              33  3.0
>POL2_MOUSE (P11369) Retrovirus-related Pol polyprotein [Contains:
Reverse transcriptase (EC 2.7.7.49); Endonuclease]
Length = 1300
Score = 81.6 bits (200), Expect = 7e-15
Identities = 62/196 (31%), Positives = 99/196 (49%), Gaps = 9/196 (4%)
Query: 1 MSKMKGNKAFMTIQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWN 60
++K+K +K M I +D EKA+D++ F+ + L + +I S I N
Sbjct: 612 INKLK-DKNHMIISLDAEKAFDKIQHPFMIKVLERSGIQGPYLNMIKAIYSKPVANIKVN 670
Query: 61 GDKTETFYPSRGIRQGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHLL 120
G+K E G RQG PLSPYLF I +E L+ I Q E K +++G+ IS L
Sbjct: 671 GEKLEAIPLKSGTRQGCPLSPYLFNIVLEVLARAIRQQKEI---KGIQIGKEEVKIS--L 725
Query: 121 FADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKINREKTQVY-FSKNVDNHLREDIIQHT 179
ADD++++ +L+ ++ F + G KIN K+ + ++KN ++I + T
Sbjct: 726 LADDMIVYISDPKNSTRELLNLINSFGEVVGYKINSNKSMAFLYTKN--KQAEKEIRETT 783
Query: 180 GFNQVNNLGKYLGANI 195
F+ V N KYLG +
Sbjct: 784 PFSIVTNNIKYLGVTL 799
>LIN1_HUMAN (P08547) LINE-1 reverse transcriptase homolog
Length = 1259
Score = 80.9 bits (198), Expect = 1e-14
Identities = 62/211 (29%), Positives = 101/211 (47%), Gaps = 8/211 (3%)
Query: 11 MTIQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWNGDKTETFYPS 70
M I ID EKA+D++ F+ + L + +KII + I+ NG K E
Sbjct: 594 MIISIDAEKAFDKIQQPFMLKPLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAPPLK 653
Query: 71 RGIRQGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHLLFADDLLLFAE 130
G RQG PLSP L I +E L+ I + E K +++G+ +S LFADD++++ E
Sbjct: 654 TGTRQGCPLSPLLPNIVLEVLARAIRQEKEI---KGIQLGKEEVKLS--LFADDMIVYLE 708
Query: 131 ASIEQAHCVLHCLDMFCQSSGQKINREKTQVYFSKNVDNHLREDIIQHTGFNQVNNLGKY 190
I A +L + F + SG KIN +K+Q + N + I+ F + KY
Sbjct: 709 NPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTN-NRQTESQIMSELPFTIASKRIKY 767
Query: 191 LGANIAPGRTS--RGHFNHIINKIQNKLSGW 219
LG + + ++ ++N+I+ + W
Sbjct: 768 LGIQLTRDVKDLFKENYKPLLNEIKEDTNKW 798
>LIN1_NYCCO (P08548) LINE-1 reverse transcriptase homolog
Length = 1260
Score = 80.5 bits (197), Expect = 2e-14
Identities = 67/232 (28%), Positives = 105/232 (44%), Gaps = 16/232 (6%)
Query: 1 MSKMKGNKAFMTIQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWN 60
++K+K NK M + ID EKA+D + F+ L + +K+I S + I+ N
Sbjct: 585 INKLK-NKDHMILSIDAEKAFDNIQHPFMIRTLKKIGIEGTFLKLIEAIYSKPTANIILN 643
Query: 61 GDKTETFYPSRGIRQGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHLL 120
G K ++F G RQG PLSP LF I ME L+ I E K + +G +S L
Sbjct: 644 GVKLKSFPLRSGTRQGCPLSPLLFNIVMEVLAIAIR---EEKAIKGIHIGSEEIKLS--L 698
Query: 121 FADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKINREKTQVYFSKNVDNHLREDIIQHTG 180
FADD++++ E + + +L + + SG KIN K+ + N +N + +
Sbjct: 699 FADDMIVYLENTRDSTTKLLEVIKEYSNVSGYKINTHKSVAFIYTN-NNQAEKTVKDSIP 757
Query: 181 FNQVNNLGKYLGANIAPG---------RTSRGHFNHIINKIQNKLSGWMFKL 223
F V KYLG + T R +NK +N W+ ++
Sbjct: 758 FTVVPKKMKYLGVYLTKDVKDLYKENYETLRKEIAEDVNKWKNIPCSWLGRI 809
>MC50_ARATH (P92555) Hypothetical mitochondrial protein AtMg01250
(ORF102)
Length = 122
Score = 64.7 bits (156), Expect = 9e-10
Identities = 33/65 (50%), Positives = 39/65 (59%)
Query: 60 NGDKTETFYPSRGIRQGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHL 119
NG PSRG+RQGDPLSPYLF++C E LS + E +RV P I+HL
Sbjct: 15 NGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRINHL 74
Query: 120 LFADD 124
LFADD
Sbjct: 75 LFADD 79
>POLR_DROME (P16423) Retrovirus-related Pol polyprotein from type II
retrotransposable element R2DM [Contains: Protease (EC
3.4.23.-); Reverse transcriptase (EC 2.7.7.49);
Endonuclease]
Length = 1057
Score = 64.3 bits (155), Expect = 1e-09
Identities = 43/144 (29%), Positives = 72/144 (49%), Gaps = 9/144 (6%)
Query: 15 IDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWNGDKTETFYPSRGIR 74
+D+ KA+D L+ + + L P + + + + +G +E F P+RG++
Sbjct: 481 LDVSKAFDSLSHASIYDTLRAYGAPKGFVDYVQNTYEGGGTSLNGDGWSSEEFVPARGVK 540
Query: 75 QGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHLLFADDLLLFAEASIE 134
QGDPLSP LF + M+RL + ++ + +VG + + FADDL+LFAE +
Sbjct: 541 QGDPLSPILFNLVMDRLLRTLPSEIGA------KVG--NAITNAAAFADDLVLFAETRMG 592
Query: 135 QAHCVLHCLDMFCQSSGQKINREK 158
+ LD F G K+N +K
Sbjct: 593 LQVLLDKTLD-FLSIVGLKLNADK 615
>PO23_POPJA (Q05118) Retrovirus-related Pol polyprotein from type I
retrotransposable element R2 [Contains: Reverse
transcriptase (EC 2.7.7.49); Endonuclease] (Fragment)
Length = 606
Score = 56.6 bits (135), Expect = 3e-07
Identities = 42/158 (26%), Positives = 71/158 (44%), Gaps = 8/158 (5%)
Query: 1 MSKMKGNKAFMTIQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWN 60
+ ++KG K + + +D+ KA+D ++ + + + I I+ I+
Sbjct: 15 LRRLKG-KTYNVVSLDIRKAFDTVSHPAILRAMRAFGIDDGMQDFIMSTITDAYTNIVVG 73
Query: 61 GDKTETFYPSRGIRQGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHLL 120
G T Y G++QGDPLSP LF I ++ L + D+ P I+ L
Sbjct: 74 GRTTNKIYIRNGVKQGDPLSPVLFNIVLDELVTRLNDEQPGASMTP------ACKIASLA 127
Query: 121 FADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKINREK 158
FADDLLL + I+ + + F ++ G +N EK
Sbjct: 128 FADDLLLLEDRDIDVPNSLATTCAYF-RTRGMTLNPEK 164
>PO21_NASVI (Q03278) Retrovirus-related Pol polyprotein from type I
retrotransposable element R2 [Contains: Reverse
transcriptase (EC 2.7.7.49); Endonuclease] (Fragment)
Length = 1025
Score = 48.5 bits (114), Expect = 7e-05
Identities = 37/129 (28%), Positives = 61/129 (46%), Gaps = 9/129 (6%)
Query: 2 SKMKGNKAFMTIQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWNG 61
++MK ++ I +D++KA+D + + + L K P ++ I ++
Sbjct: 443 ARMKIKGLYIAI-LDVKKAFDSVEHRSILDALRRKKLPLEMRNYIMWVYRNSKTRLEVVK 501
Query: 62 DKTETFYPSRGIRQGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHLLF 121
K P+RG+RQGDPLSP LF M+ + + + +G I L+F
Sbjct: 502 TKGRWIRPARGVRQGDPLSPLLFNCVMDAVLRRLPENT------GFLMG--AEKIGALVF 553
Query: 122 ADDLLLFAE 130
ADDL+L AE
Sbjct: 554 ADDLVLLAE 562
>PO22_POPJA (Q03274) Retrovirus-related Pol polyprotein from type I
retrotransposable element R2 [Contains: Reverse
transcriptase (EC 2.7.7.49); Endonuclease] (Fragment)
Length = 711
Score = 45.1 bits (105), Expect = 8e-04
Identities = 41/160 (25%), Positives = 69/160 (42%), Gaps = 10/160 (6%)
Query: 2 SKMKGNKAFMTIQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKI-MWN 60
S+ + K + + +D+ KA+D ++ + + L I +S + I +
Sbjct: 129 SRREQRKTYNVVSLDVRKAFDTVSHSSICRALQRLGIDEGTSNYITGSLSDSTTTIRVGP 188
Query: 61 GDKTETFYPSRGIRQGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYG-PLISHL 119
G +T RG++QGDPLSP+LF ++ L + P G G I L
Sbjct: 189 GSQTRKICIRRGVKQGDPLSPFLFNAVLDELLCSLQS-------TPGIGGTIGEEKIPVL 241
Query: 120 LFADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKINREKT 159
FADDLLL + + L + F + G +N +K+
Sbjct: 242 AFADDLLLLEDNDV-LLPTTLATVANFFRLRGMSLNAKKS 280
>YTX2_XENLA (P14381) Transposon TX1 hypothetical 149 kDa protein
(ORF 2)
Length = 1308
Score = 43.9 bits (102), Expect = 0.002
Identities = 45/191 (23%), Positives = 89/191 (46%), Gaps = 22/191 (11%)
Query: 13 IQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWNGDKTETFYPSRG 72
+ +D EKA+DR++ ++ L F + + + ++ + N T RG
Sbjct: 591 LSLDQEKAFDRVDHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAFGRG 650
Query: 73 IRQGDPLSPYLFVICMERLSHIIADQVESDYWK--PMRVGRYGPLISHLLFADDLLLFAE 130
+RQG PLS L+ + +E ++ ++ K MRV ++S +ADD++L A+
Sbjct: 651 VRQGCPLSGQLYSLAIEPFLCLLRKRLTGLVLKEPDMRV-----VLS--AYADDVILVAQ 703
Query: 131 --ASIEQAHCVLHCLDMFCQSSGQKINREKTQVYFSKNVDNHLREDIIQHTGFNQV---N 185
+E+A C +++ +S +IN K+ S ++ L+ D + F + +
Sbjct: 704 DLVDLERAQ---ECQEVYAAASSARINWSKS----SGLLEGSLKVDFLP-PAFRDISWES 755
Query: 186 NLGKYLGANIA 196
+ KYLG ++
Sbjct: 756 KIIKYLGVYLS 766
>YO84_CAEEL (P34620) Hypothetical protein ZK1236.4 in chromosome III
Length = 364
Score = 35.4 bits (80), Expect = 0.60
Identities = 20/78 (25%), Positives = 39/78 (49%)
Query: 15 IDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWNGDKTETFYPSRGIR 74
+D KA+D+++ + + + L K LI+ + ++ S+K+ +E G+
Sbjct: 23 LDFSKAFDKVSHDILLDKLTSIKINKHLIRWLDVFLTNRSFKVKVGNTLSEPKKTVCGVP 82
Query: 75 QGDPLSPYLFVICMERLS 92
QG +SP LF I + +S
Sbjct: 83 QGSVISPVLFGIFVNEIS 100
>PO21_SCICO (Q03279) Retrovirus-related Pol polyprotein from type I
retrotransposable element R2 [Contains: Reverse
transcriptase (EC 2.7.7.49); Endonuclease] (Fragment)
Length = 869
Score = 33.9 bits (76), Expect = 1.8
Identities = 37/149 (24%), Positives = 67/149 (44%), Gaps = 16/149 (10%)
Query: 15 IDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWNGDKTETFYPSRGIR 74
+DL KA++ + + + + + E P ++ I + + ++ + G K E G+
Sbjct: 296 LDLVKAFNSVYHSALIDAITEAGCPPGVVDYIADMYNNVITEMQFEG-KCELASILAGVY 354
Query: 75 QGDPLSPYLFVICMERLSHIIADQVESDYWKPMRVGRYGPLISHLLFADDLLLFAEASIE 134
QGDPLS LF + E+ + ++ D +RV ++DD LL A I
Sbjct: 355 QGDPLSGPLFTLAYEKALRALNNEGRFDI-ADVRVNASA-------YSDDGLLLAMTVIG 406
Query: 135 QAHCVLHCLDMFCQS---SGQKINREKTQ 160
+ H LD F ++ G +IN K++
Sbjct: 407 ----LQHNLDKFGETLAKIGLRINSRKSK 431
>TRPB_STRR6 (Q8DNM8) Tryptophan synthase beta chain (EC 4.2.1.20)
Length = 407
Score = 33.1 bits (74), Expect = 3.0
Identities = 25/95 (26%), Positives = 45/95 (47%), Gaps = 14/95 (14%)
Query: 103 YWKPMRVGRYGPLISHLLFADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKINR---EKT 159
Y +P + G YG F + L+ A +E+A+ F + Q + + +T
Sbjct: 3 YQEPNKDGFYGKFGGR--FVPETLMTAVLELEKAYRESQADPSFQEELNQLLRQYVGRET 60
Query: 160 QVYFSKNVDNHL--------REDIIQHTGFNQVNN 186
+Y++KN+ H+ RED + HTG +++NN
Sbjct: 61 PLYYAKNLTQHIGGAKIYLKRED-LNHTGAHKINN 94
>TRPB_STRPN (Q97P32) Tryptophan synthase beta chain (EC 4.2.1.20)
Length = 407
Score = 33.1 bits (74), Expect = 3.0
Identities = 25/95 (26%), Positives = 45/95 (47%), Gaps = 14/95 (14%)
Query: 103 YWKPMRVGRYGPLISHLLFADDLLLFAEASIEQAHCVLHCLDMFCQSSGQKINR---EKT 159
Y +P + G YG F + L+ A +E+A+ F + Q + + +T
Sbjct: 3 YQEPNKDGFYGKFGGR--FVPETLMTAVLELEKAYRESQADPSFQEELNQLLRQYVGRET 60
Query: 160 QVYFSKNVDNHL--------REDIIQHTGFNQVNN 186
+Y++KN+ H+ RED + HTG +++NN
Sbjct: 61 PLYYAKNLTQHIGGAKIYLKRED-LNHTGAHKINN 94
>AI2M_YEAST (P03876) Putative COX1/OXI3 intron 2 protein
Length = 789
Score = 33.1 bits (74), Expect = 3.0
Identities = 25/94 (26%), Positives = 46/94 (48%), Gaps = 11/94 (11%)
Query: 13 IQIDLEKAYDRLNWNFVEECLMECKFPSKLIKIIHHCISTLSYKIMWNG--DKTETFYPS 70
I++DL K +D + N + L E +I L YK++ G DK ++ +
Sbjct: 350 IKVDLNKCFDTIPHNMLINVLNE--------RIKDKGFMDLLYKLLRAGYVDKNNNYHNT 401
Query: 71 R-GIRQGDPLSPYLFVICMERLSHIIADQVESDY 103
GI QG +SP L I +++L + ++ E+++
Sbjct: 402 TLGIPQGSVVSPILCNIFLDKLDKYLENKFENEF 435
Database: sprot
Posted date: Nov 25, 2004 10:54 AM
Number of letters in database: 59,974,054
Number of sequences in database: 164,201
Lambda K H
0.363 0.164 0.635
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 80,656,316
Number of Sequences: 164201
Number of extensions: 3064880
Number of successful extensions: 10690
Number of sequences better than 10.0: 14
Number of HSP's better than 10.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 10663
Number of HSP's gapped (non-prelim): 21
length of query: 805
length of database: 59,974,054
effective HSP length: 118
effective length of query: 687
effective length of database: 40,598,336
effective search space: 27891056832
effective search space used: 27891056832
T: 11
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (22.0 bits)
S2: 70 (31.6 bits)
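
Note: the search statistics above can be cross-checked by hand. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^-S') and uses the gapped Lambda and K printed in this report. The function names are illustrative only. Because Lambda and K are printed in rounded form, the results should agree with the report only to the precision shown.

```python
import math

# Values copied from the report above.
QUERY_LEN = 805            # length of query
DB_LETTERS = 59_974_054    # length of database
DB_SEQS = 164_201          # number of sequences in database
EFF_HSP_LEN = 118          # effective HSP length
LAMBDA_GAPPED = 0.267      # gapped Lambda (rounded in the report)
K_GAPPED = 0.0410          # gapped K (rounded in the report)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Subtract the effective HSP length from the query and from every
    database sequence, then multiply the two effective lengths."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
# Top hit POL2_MOUSE: raw score 200, reported as 81.6 bits, Expect = 7e-15.
top_bits = bit_score(200, LAMBDA_GAPPED, K_GAPPED)
top_expect = expect_value(top_bits, space)

print(eff_query, eff_db, space)
print(round(top_bits, 1), f"{top_expect:.0e}")
```

With the numbers in this report, the effective lengths and search space come out exactly as listed (687, 40,598,336 and 27891056832), and the POL2_MOUSE hit with raw score 200 converts to about 81.6 bits with an Expect near 7e-15.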