
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146555.15 - phase: 0 /pseudo
(552 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value
pir||G86301 probable retroelement polyprotein [imported] - Arabi...   241  6e-62
gb|AAU89730.1| putative polyprotein [Solanum tuberosum]               234  4e-60
emb|CAB79271.1| putative protein [Arabidopsis thaliana] gi|30212...   223  9e-57
ref|NP_194047.2| protein kinase family protein [Arabidopsis thal...   223  9e-57
gb|AAU89728.1| putative retroelement pol polyprotein-like [Solan...   223  1e-56
gb|AAD25830.1| putative retroelement pol polyprotein [Arabidopsi...   219  1e-55
pir||E96608 probable retroelement polyprotein F25P12.89 [importe...   216  1e-54
emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7...   216  1e-54
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi...   216  2e-54
gb|AAR24647.1| At2g23330 [Arabidopsis thaliana] gi|22655202|gb|A...   216  2e-54
gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thalia...   215  3e-54
gb|AAF69161.1| F27F5.19 [Arabidopsis thaliana] gi|25301673|pir||...   215  3e-54
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi...   214  8e-54
emb|CAB77940.1| putative polyprotein [Arabidopsis thaliana] gi|4...   213  2e-53
gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsi...   209  1e-52
gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsi...   209  1e-52
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t...   209  2e-52
gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsi...   207  9e-52
gb|AAO26685.1| gag-pol polyprotein [Vitis vinifera]                   206  1e-51
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi...   206  2e-51
>pir||G86301 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989054|gb|AAG10817.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1413
Score = 241 bits (614), Expect = 6e-62
Identities = 137/325 (42%), Positives = 195/325 (59%), Gaps = 18/325 (5%)
Query: 1 FTAHTHIPTTSPSP----PITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTT 56
F H ++ P P+ +TS S P + +R P Y+ DY CN T+ST
Sbjct: 830 FFPHIYVDRNDSHPSQPLPVQETSASNV---PAEKQNSRVSRPPAYLKDYHCNSVTSST- 885
Query: 57 NNSHVQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQ 116
+PIS LS++ LS + IF ++ EP +Y +A + W M E+T L+
Sbjct: 886 -----DHPISEVLSYSSLSDPYMIFINAVNKIPEPHTYAQARQIKEWCDAMGMEITALED 940
Query: 117 TGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVA 176
GTW + LP K +GC WV K+K N DGS+ERYK LVAKGY Q EGLDY DTFS VA
Sbjct: 941 NGTWVVCSLPVGKKAVGCKWVYKIKLNADGSLERYKARLVAKGYTQTEGLDYVDTFSPVA 1000
Query: 177 KVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK-----PNQVCK 231
K+TT++L+IA+A+ L QLD++NAFL+G L E++YM +PPG S + PN VC+
Sbjct: 1001 KLTTVKLLIAVAAAKGWSLSQLDISNAFLNGSLDEEIYMTLPPGYSPRQGDSFPPNAVCR 1060
Query: 232 LSKSLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAG 291
L KSLYGLK ASR+WY K + G+ Q++ + +LFT+K+ +S++ +LVYV DI +A
Sbjct: 1061 LKKSLYGLKQASRQWYLKFSESLKALGFTQSSGDHTLFTRKSKNSYMAVLVYVDDIIIAS 1120
Query: 292 DFLSEITFIKNALNQASKSKILVSL 316
E +++AL ++SK + L +L
Sbjct: 1121 SCDRETELLRDALQRSSKLRDLGTL 1145
>gb|AAU89730.1| putative polyprotein [Solanum tuberosum]
Length = 1280
Score = 234 bits (598), Expect = 4e-60
Identities = 141/316 (44%), Positives = 174/316 (54%), Gaps = 17/316 (5%)
Query: 1 FTAHTHIP---TTSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTN 57
F AH I + PSP + + S P P + ITP PT T
Sbjct: 649 FQAHLPIQYWDSEPPSPVMFHSPESHTSSPEPTVPLSPTPITPVS--------PTILTPP 700
Query: 58 NSHVQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQT 117
S N S T LS + SL S TEPKS+DEA KH W + M+ EL L
Sbjct: 701 RS------PNIFSFTALSVPNQNIIHSLSSITEPKSFDEACKHSGWHKAMETELAALHLN 754
Query: 118 GTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAK 177
GTW++V+LPP +P+ C WV KVKHN DGSIER K LV +G Q EG+D+ +TFS V K
Sbjct: 755 GTWEVVELPPGKRPLPCKWVYKVKHNSDGSIERLKARLVVRGDIQKEGIDFSETFSPVVK 814
Query: 178 VTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVCKLSKSLY 237
+TTIR ++ +A + QLDVNNAFLHG+LHE+VYM PPG PN VC L KSLY
Sbjct: 815 ITTIRCLLTIAIKKGWRMSQLDVNNAFLHGELHEEVYMRFPPGFPPPSPNHVCLLRKSLY 874
Query: 238 GLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEI 297
GLK ASR+WY +L GY + ++ SLF K N +L+VYV DI L G SEI
Sbjct: 875 GLKQASRQWYARLAGALAYKGYSSSLNDYSLFFKTNGALISILVVYVDDILLTGSDTSEI 934
Query: 298 TFIKNALNQASKSKIL 313
I + LN + K L
Sbjct: 935 ASITDFLNSEFRVKNL 950
>emb|CAB79271.1| putative protein [Arabidopsis thaliana] gi|3021268|emb|CAA18463.1|
putative protein [Arabidopsis thaliana]
gi|7485945|pir||T04833 hypothetical protein F21P8.50 -
Arabidopsis thaliana
Length = 1240
Score = 223 bits (569), Expect = 9e-57
Identities = 122/311 (39%), Positives = 187/311 (59%), Gaps = 12/311 (3%)
Query: 9 TTSPSPPITKTSPSPPFIPPP-IRITTRNKITPTYM*DYICNIPTTSTTNNSHVQYPISN 67
T+S S I ++ +P P + + R P Y+ DY C+ + T ++ IS
Sbjct: 9 TSSSSIDIMPSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTIHD------ISQ 62
Query: 68 FLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWKIVDLPP 127
FLS+ +S + F + + EP +Y+EA + W M +E+ ++ T TW+I LPP
Sbjct: 63 FLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPP 122
Query: 128 SSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTIRLVIAL 187
+ KPIGC WV K+K+N DG+IERYK LVAKGY Q EG+D+ +TFS V K+T+++L++A+
Sbjct: 123 NKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAI 182
Query: 188 ASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK-----PNQVCKLSKSLYGLKPA 242
++I LHQLD++NAFL+GDL E++YM +PPG + + PN VC L KS+YGLK A
Sbjct: 183 SAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQA 242
Query: 243 SRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEITFIKN 302
SR+W+ K + I G+ Q+ S+ + F K F+ +LVYV DI + + + + +K+
Sbjct: 243 SRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKS 302
Query: 303 ALNQASKSKIL 313
L K + L
Sbjct: 303 QLKSCFKLRDL 313
>ref|NP_194047.2| protein kinase family protein [Arabidopsis thaliana]
Length = 1262
Score = 223 bits (569), Expect = 9e-57
Identities = 122/311 (39%), Positives = 187/311 (59%), Gaps = 12/311 (3%)
Query: 9 TTSPSPPITKTSPSPPFIPPP-IRITTRNKITPTYM*DYICNIPTTSTTNNSHVQYPISN 67
T+S S I ++ +P P + + R P Y+ DY C+ + T ++ IS
Sbjct: 9 TSSSSIDIMPSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTIHD------ISQ 62
Query: 68 FLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWKIVDLPP 127
FLS+ +S + F + + EP +Y+EA + W M +E+ ++ T TW+I LPP
Sbjct: 63 FLSYEKVSPLYHSFLVCIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPP 122
Query: 128 SSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTIRLVIAL 187
+ KPIGC WV K+K+N DG+IERYK LVAKGY Q EG+D+ +TFS V K+T+++L++A+
Sbjct: 123 NKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAI 182
Query: 188 ASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK-----PNQVCKLSKSLYGLKPA 242
++I LHQLD++NAFL+GDL E++YM +PPG + + PN VC L KS+YGLK A
Sbjct: 183 SAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQA 242
Query: 243 SRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEITFIKN 302
SR+W+ K + I G+ Q+ S+ + F K F+ +LVYV DI + + + + +K+
Sbjct: 243 SRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKS 302
Query: 303 ALNQASKSKIL 313
L K + L
Sbjct: 303 QLKSCFKLRDL 313
>gb|AAU89728.1| putative retroelement pol polyprotein-like [Solanum tuberosum]
Length = 1476
Score = 223 bits (568), Expect = 1e-56
Identities = 130/325 (40%), Positives = 181/325 (55%), Gaps = 9/325 (2%)
Query: 2 TAHTHIPTTSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTNNSHV 61
T+ IP SP ++ PP P R + R P + D+I TTST+ ++H
Sbjct: 855 TSEEIIPVASPPSAVSDDHLHPP---PERRRSYRTGKPPIWQKDFI----TTSTSRSNHC 907
Query: 62 QYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWK 121
YPIS+ + ++ LS ++ + S TEP+ Y +A W M+ E+ L+ TW+
Sbjct: 908 LYPISDNIDYSCLSSTYQCYIASSSVETEPQFYYQAANDCRWVHAMKEEIQALEDNKTWE 967
Query: 122 IVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTI 181
+V LP K IGC WV K+K+ G IER+K LVAKGYNQ EGLDY +TFS V K+ T+
Sbjct: 968 VVSLPKGKKAIGCKWVYKIKYKASGEIERFKARLVAKGYNQKEGLDYQETFSPVVKMVTL 1027
Query: 182 RLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPN--QVCKLSKSLYGL 239
R V+ LA + Q+DV NAFL GDL E+VYM +P G K +VC+L KSLYGL
Sbjct: 1028 RTVLTLAVSKGWDIQQMDVYNAFLQGDLIEEVYMQLPQGFQYDKTGDPKVCRLLKSLYGL 1087
Query: 240 KPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEITF 299
K ASR+W KLT + G+QQ+ + SL K+ D +++L+YV D+ + G L I
Sbjct: 1088 KQASRQWNVKLTTALLAAGFQQSHLDYSLMLKRTADGIVIVLIYVDDLLITGSSLQLIDD 1147
Query: 300 IKNALNQASKSKILVSLNIS*ALKF 324
K L K K L +L ++F
Sbjct: 1148 AKQVLKANFKIKDLGTLRYFLGMEF 1172
>gb|AAD25830.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25411082|pir||B84458 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1015
Score = 219 bits (559), Expect = 1e-55
Identities = 117/273 (42%), Positives = 168/273 (60%), Gaps = 22/273 (8%)
Query: 5 THIPTTSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTNNSHVQYP 64
+H + S P T TS S R P ++ DY CN+ + VQYP
Sbjct: 561 SHEDSAPKSVPTTSTSRSK-----------RESKQPAHLKDYFCNL------SRKGVQYP 603
Query: 65 ISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWKIVD 124
+S+++S+ LS + + S+ ++EP S+ +A K D W + M EL L+ T TW+I
Sbjct: 604 LSDYMSYDQLSTPYRAYICSVTKFSEPSSFFQAKKSDDWIKAMNAELQALEGTATWEICS 663
Query: 125 LPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTIRLV 184
LP + K IGC WV KVK NVDG++ERYK LVAKGY Q EG+D+ DTFS VAK+TT++ +
Sbjct: 664 LPSNKKAIGCKWVYKVKLNVDGTLERYKARLVAKGYTQQEGVDFEDTFSPVAKMTTVKTL 723
Query: 185 IALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK-----PNQVCKLSKSLYGL 239
+A+A+ LHQLD++NAFL+ DL+E++YM + PG + + PN VCKL KSLYGL
Sbjct: 724 LAVAAAKKWSLHQLDISNAFLNRDLYEEIYMNLAPGYTPKEGEEIPPNAVCKLKKSLYGL 783
Query: 240 KPASRKWYEKLTCLPITNGYQQATSNASLFTKK 272
K SR+W+ K ++ G+QQ+ ++ +LF KK
Sbjct: 784 KQDSRQWFLKFRSTLLSLGFQQSHADHTLFVKK 816
>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] - Arabidopsis
thaliana gi|9954746|gb|AAG09097.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1486
Score = 216 bits (551), Expect = 1e-54
Identities = 123/315 (39%), Positives = 183/315 (58%), Gaps = 19/315 (6%)
Query: 8 PTTSPSPPITKTSPSPPFIPPPIRITT-----RNKITPTYM*DYICNI---PTTSTTNNS 59
PT + SP S S P PP +++ R K P + DY+ + P S T
Sbjct: 895 PTATESP----ASSSSPVHPPAVQLELLGKGHRPKRPPVKLADYVTTLLHQPFPSAT--- 947
Query: 60 HVQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGT 119
YP+ N++S + S ++ + +++ S EP++Y+EA+ D WK + +E+ +L+ GT
Sbjct: 948 --PYPLDNYISSSRFSDNYQAYILAITSGNEPRNYNEAMLDDHWKGAVSHEIGSLENLGT 1005
Query: 120 WKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVT 179
W + DLPP K +GC WV ++K+ DG++ER+K LV G NQ EGLDY +TF+ VAK+
Sbjct: 1006 WTVEDLPPGKKALGCKWVFRLKYKSDGTLERHKARLVVLGNNQTEGLDYTETFAPVAKMV 1065
Query: 180 TIRLVI-ALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVCKLSKSLYG 238
T+R + + S+++ +HQ+DV+NAFLHGDL E+VYM PPG T +VC+L KSLYG
Sbjct: 1066 TVRAFLQQVVSLDW-EVHQMDVHNAFLHGDLDEEVYMQFPPGFRTGDKTKVCRLRKSLYG 1124
Query: 239 LKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEIT 298
LK A R W+ KLT G+ Q S+ SLF + +LVYV D+ + G ++ IT
Sbjct: 1125 LKQAPRCWFAKLTSALKNYGFIQDISDYSLFIFHKNGVRLHVLVYVDDLIITGTTIAVIT 1184
Query: 299 FIKNALNQASKSKIL 313
K+ L+ K L
Sbjct: 1185 EFKHYLSSCFYMKDL 1199
>emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7488558|pir||T14517
hypothetical protein 1 - wild cabbage transposon Melmoth
Length = 1131
Score = 216 bits (551), Expect = 1e-54
Identities = 120/311 (38%), Positives = 181/311 (57%), Gaps = 15/311 (4%)
Query: 10 TSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTNNSHVQYPISNFL 69
++PSP I+ S PF +I+ R + P ++ D+ C S YPIS+ L
Sbjct: 823 SAPSPNIS----SSPFSTLSPQISKRQRTVPAHLKDFHCYSVHDSA-------YPISSTL 871
Query: 70 SHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWKIVDLPPSS 129
S++ +S H + S+ + P+SY E + W + EL +++ TW +V LP
Sbjct: 872 SYSQISSHHLAYINSITNIPIPQSYAEVRQSKEWTESADKELDAMEENDTWDVVPLPKGK 931
Query: 130 KPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTIRLVIALAS 189
K IGC WV +K N DG++ER K LV KGY Q EGLDY +TFS VAK+ T++L++ + +
Sbjct: 932 KAIGCRWVHTLKFNADGTLERRKSRLVGKGYTQKEGLDYIETFSPVAKMATVKLLLKVGA 991
Query: 190 INY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK----PNQVCKLSKSLYGLKPASRK 245
FLHQLD++NAFL+G+L E++YM +P G + K PN VCKL KS+YGLK ASR+
Sbjct: 992 SKKWFLHQLDISNAFLNGELDEEIYMKLPEGYAERKGDLPPNAVCKLKKSIYGLKQASRQ 1051
Query: 246 WYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEITFIKNALN 305
W++K + G+Q+A + +LF ++ + F+ +LVYV DI +A + +K+ L
Sbjct: 1052 WFKKFSTSLFQLGFQKAHGDHTLFVRQTENDFVAVLVYVDDIVIASTDDAVAVKLKSDLK 1111
Query: 306 QASKSKILVSL 316
K + L SL
Sbjct: 1112 SFFKLRDLGSL 1122
>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|7444418|pir||T00499 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1496
Score = 216 bits (550), Expect = 2e-54
Identities = 120/322 (37%), Positives = 183/322 (56%), Gaps = 24/322 (7%)
Query: 19 TSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTNNSHVQ---------------- 62
++PS P +P + R K + D++ N + T + ++
Sbjct: 891 STPSSPGLPELLGKGCREKKKSVLLKDFVTNTTSKKKTASHNIHSPSQVLPSGLPTSLSA 950
Query: 63 --------YPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTL 114
YP+S+FL+++ S +H F +++ EPK + +AI W + M E+ L
Sbjct: 951 DSVSGKTLYPLSDFLTNSGYSANHIAFMAAILDSNEPKHFKDAILIKEWCEAMSKEIDAL 1010
Query: 115 DQTGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSL 174
+ TW I DLP K I WV K+K+N DG++ER+K LV G +Q EG+D+ +TF+
Sbjct: 1011 EANHTWDITDLPHGKKAISSKWVYKLKYNSDGTLERHKARLVVMGNHQKEGVDFKETFAP 1070
Query: 175 VAKVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVCKLSK 234
VAK+TT+R ++A+A+ +HQ+DV+NAFLHGDL E+VYM +PPG S P++VC+L K
Sbjct: 1071 VAKLTTVRTILAVAAAKDWEVHQMDVHNAFLHGDLEEEVYMRLPPGFKCSDPSKVCRLRK 1130
Query: 235 SLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFL 294
SLYGLK A R W+ KL+ G+ Q+ + SLF+ KN D+ I +LVYV D+ +AG+ L
Sbjct: 1131 SLYGLKQAPRCWFSKLSTALRNIGFTQSYEDYSLFSLKNGDTIIHVLVYVDDLIVAGNNL 1190
Query: 295 SEITFIKNALNQASKSKILVSL 316
I K+ L++ K L L
Sbjct: 1191 DAIDRFKSQLHKCFHMKDLGKL 1212
>gb|AAR24647.1| At2g23330 [Arabidopsis thaliana] gi|22655202|gb|AAM98191.1| unknown
protein [Arabidopsis thaliana]
Length = 776
Score = 216 bits (550), Expect = 2e-54
Identities = 120/322 (37%), Positives = 183/322 (56%), Gaps = 24/322 (7%)
Query: 19 TSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTNNSHVQ---------------- 62
++PS P +P + R K + D++ N + T + ++
Sbjct: 171 STPSSPGLPELLGKGCREKKKSVLLKDFVTNTTSKKKTASHNIHSPSQVLPSGLPTSLSA 230
Query: 63 --------YPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTL 114
YP+S+FL+++ S +H F +++ EPK + +AI W + M E+ L
Sbjct: 231 DSVSGKTLYPLSDFLTNSGYSANHIAFMAAILDSNEPKHFKDAILIKEWCEAMSKEIDAL 290
Query: 115 DQTGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSL 174
+ TW I DLP K I WV K+K+N DG++ER+K LV G +Q EG+D+ +TF+
Sbjct: 291 EANHTWDITDLPHGKKAISSKWVYKLKYNSDGTLERHKARLVVMGNHQKEGVDFKETFAP 350
Query: 175 VAKVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVCKLSK 234
VAK+TT+R ++A+A+ +HQ+DV+NAFLHGDL E+VYM +PPG S P++VC+L K
Sbjct: 351 VAKLTTVRTILAVAAAKDWEVHQMDVHNAFLHGDLEEEVYMRLPPGFKCSDPSKVCRLRK 410
Query: 235 SLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFL 294
SLYGLK A R W+ KL+ G+ Q+ + SLF+ KN D+ I +LVYV D+ +AG+ L
Sbjct: 411 SLYGLKQAPRCWFSKLSTALRNIGFTQSYEDYSLFSLKNGDTIIHVLVYVDDLIVAGNNL 470
Query: 295 SEITFIKNALNQASKSKILVSL 316
I K+ L++ K L L
Sbjct: 471 DAIDRFKSQLHKCFHMKDLGKL 492
>gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thaliana]
gi|25403501|pir||H86486 protein Ty1/copia-element
polyprotein [imported] - Arabidopsis thaliana
Length = 1152
Score = 215 bits (547), Expect = 3e-54
Identities = 119/317 (37%), Positives = 177/317 (55%), Gaps = 16/317 (5%)
Query: 7 IPTTSPSPPITKTSPSPPFIP-----------PPIRITTRNKITPTYM*DYI-----CNI 50
+P T+P P I + SPP P PP+R R + + DY C
Sbjct: 835 VPATNPLPAIIDVNDSPPSSPIITATPAAASPPPLRRGLRQRQENVRLKDYQTYSAQCES 894
Query: 51 PTTSTTNNSHVQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNE 110
T + N YP++N++S S S+ F ++ P++Y++AI+ W+ + E
Sbjct: 895 TQTLSDNIGTCIYPMANYVSGEIFSPSNQHFLAAISMVDPPQTYNQAIREKEWRNAVFFE 954
Query: 111 LTTLDQTGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFD 170
+ L+ GTW I LP K IG WV ++K+N +G++ERYK LVA G +Q EG+D+
Sbjct: 955 VDALEDQGTWDITKLPQGVKAIGSKWVFRIKYNSNGTVERYKARLVALGNHQKEGIDFTK 1014
Query: 171 TFSLVAKVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVC 230
TF+ V K+ T+RL++ +A+ LHQ+DV+NAFLHGDL ED+YM PPG T+ P+ VC
Sbjct: 1015 TFAPVVKMQTVRLLLDVAAAKDWELHQMDVHNAFLHGDLKEDIYMKPPPGFKTTDPSLVC 1074
Query: 231 KLSKSLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLA 290
KL KS+YGLK A R W+EKL+ + G+ Q+ + SLFT + ++VYV D+ +
Sbjct: 1075 KLKKSIYGLKQAPRCWFEKLSTSLLKFGFTQSKKDYSLFTSIRGSKVLHVIVYVDDVVIC 1134
Query: 291 GDFLSEITFIKNALNQA 307
G + E K AL +
Sbjct: 1135 GKAVRENNTSKLALGSS 1151
>gb|AAF69161.1| F27F5.19 [Arabidopsis thaliana] gi|25301673|pir||F96509 protein
F27F5.19 [imported] - Arabidopsis thaliana
Length = 1309
Score = 215 bits (547), Expect = 3e-54
Identities = 120/314 (38%), Positives = 179/314 (56%), Gaps = 15/314 (4%)
Query: 5 THIPTTSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTNNSHVQYP 64
T + P ++ + PF P + R K P + D+ C N S + YP
Sbjct: 794 TSLSDAHPHQDVSSSKALVPFDPQ----SKRQKKPPKHFQDFHCY------NNTSTILYP 843
Query: 65 ISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWKIVD 124
I +++S++Y+ + F ++ + P+ Y EA W M+ E+ + QT TW +V
Sbjct: 844 IKDYISYSYIVEPFHAFINNITNAVVPQRYSEAKDFKAWCDAMKEEIGAMIQTNTWSVVS 903
Query: 125 LPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTIRLV 184
LPP+ K IGC WV +KHN DGSIERYK LVAKGY Q E LDY +TFS VAK+T++R++
Sbjct: 904 LPPNKKAIGCKWVFTIKHNADGSIERYKARLVAKGYTQEESLDYEETFSPVAKLTSVRMM 963
Query: 185 IALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVS-----TSKPNQVCKLSKSLYGL 239
+ LA+ + QLD++NAFL+GDL E++YM IPPG + + P+ VC+L KS+YGL
Sbjct: 964 LLLAAKMKWSVLQLDISNAFLNGDLDEEIYMKIPPGYADLIGESLPPHAVCRLHKSIYGL 1023
Query: 240 KPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEITF 299
K ASR+WY KL+ G+Q++ ++ +LF K + +LVYV DI + + + +T
Sbjct: 1024 KQASRQWYLKLSNTLKGMGFQKSNADHTLFIKFASGVLMGVLVYVDDIMIVSNSDNAVTQ 1083
Query: 300 IKNALNQASKSKIL 313
L K + L
Sbjct: 1084 FTTELKSYFKLRDL 1097
>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301695|pir||D84481 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1413
Score = 214 bits (544), Expect = 8e-54
Identities = 125/324 (38%), Positives = 180/324 (54%), Gaps = 17/324 (5%)
Query: 8 PTTSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYI-----CN-------IPTTST 55
P+T+ SPP T+P PP R R P + DYI C P+TS
Sbjct: 886 PSTNVSPPQQDTTPIIENTPP--RQGKRQVQQPARLKDYILYNASCTPNTPHVLSPSTSQ 943
Query: 56 TNNS---HVQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELT 112
+++S ++QYP+++++S S H +F ++ + EPK + E +K W M E+
Sbjct: 944 SSSSIQGNLQYPLTDYISDECFSAGHKVFLAAITANDEPKHFKEDVKVKVWNDAMYKEVD 1003
Query: 113 TLDQTGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTF 172
L+ TW IVDLP IG WV K K N DG++ERYK LV +G NQIEG DY +TF
Sbjct: 1004 ALEVNKTWDIVDLPTGKVAIGSQWVYKTKFNADGTVERYKARLVVQGNNQIEGEDYTETF 1063
Query: 173 SLVAKVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVCKL 232
+ V K+TT+R ++ L + N ++Q+DV+NAFLHGDL E+VYM +PPG S P++VC+L
Sbjct: 1064 APVVKMTTVRTLLRLVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRL 1123
Query: 233 SKSLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGD 292
KSLYGLK A R W++KL+ G+ Q + S F+ + +LVYV D+ + G+
Sbjct: 1124 RKSLYGLKQAPRCWFKKLSDALKRFGFIQGYEDYSFFSYSCKGIELRVLVYVDDLIICGN 1183
Query: 293 FLSEITFIKNALNQASKSKILVSL 316
+ K L + K L L
Sbjct: 1184 DEYMVQKFKEYLGRCFSMKDLGKL 1207
>emb|CAB77940.1| putative polyprotein [Arabidopsis thaliana] gi|4325355|gb|AAD17352.1|
contains similarity to retrovirus-related polyproteins
[Arabidopsis thaliana] gi|25301678|pir||C85077 probable
polyprotein [imported] - Arabidopsis thaliana
Length = 1366
Score = 213 bits (541), Expect = 2e-53
Identities = 124/319 (38%), Positives = 181/319 (55%), Gaps = 11/319 (3%)
Query: 3 AHTHIPTTSPSPPITKTSPSPPFIPPPIRITTRNKIT--PTYM*DYICNIPTTSTTNNSH 60
AHTH T P++ T + F K T P+Y+ Y C+ +++ H
Sbjct: 801 AHTH---TRSLAPLSTTVTNDQFGNDMDNTLMPRKETRAPSYLSQYHCSNVLKEPSSSLH 857
Query: 61 -VQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGT 119
+ +S+ LS+ LS + +F ++++ EP ++ EA W M EL L T T
Sbjct: 858 GTAHSLSSHLSYDKLSNEYRLFCFAIIAEKEPTTFKEAALLQKWLDAMNVELDALVSTST 917
Query: 120 WKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVT 179
+I L + IGC WV K+K+ DG+IERYK LVA GY Q EG+DY DTFS +AK+T
Sbjct: 918 REICSLHDGKRAIGCKWVFKIKYKSDGTIERYKARLVANGYTQQEGVDYIDTFSPIAKLT 977
Query: 180 TIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK-----PNQVCKLSK 234
++RL++ALA+I+ + Q+DV NAFLHGD E++YM +P G + K VC+L K
Sbjct: 978 SVRLILALAAIHNWSISQMDVTNAFLHGDFEEEIYMQLPQGYTPRKGELLPKRPVCRLVK 1037
Query: 235 SLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFL 294
SLYGLK ASR+W+ K + + I NG+ Q+ + +LF + D+F+ LLVYV DI L +
Sbjct: 1038 SLYGLKQASRQWFHKFSGVLIQNGFMQSLFDPTLFVRVREDTFLALLVYVDDIMLVSNKD 1097
Query: 295 SEITFIKNALNQASKSKIL 313
S + +K L + K K L
Sbjct: 1098 SAVIEVKQILAKEFKLKDL 1116
>gb|AAD25646.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301701|pir||E84589 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1461
Score = 209 bits (533), Expect = 1e-52
Identities = 122/316 (38%), Positives = 183/316 (57%), Gaps = 15/316 (4%)
Query: 5 THIPTTSPSPPITKTSPSPPFIPPPIRITTRNKIT--PTYM*DYICNIPTTSTTNNSHVQ 62
T + S IT PSP I P +I+ R +IT P ++ DY C N
Sbjct: 867 TPMDPLSSGNSITSHLPSPQ-ISPSTQISKR-RITKFPAHLQDYHCYFV------NKDDS 918
Query: 63 YPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWKI 122
+PIS+ LS++ +S SH ++ ++ P+SY EA W + E+ +++T TW+I
Sbjct: 919 HPISSSLSYSQISPSHMLYINNISKIPIPQSYHEAKDSKEWCGAIDQEIGAMERTDTWEI 978
Query: 123 VDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTIR 182
LPP K +GC WV VK + DGS+ER+K +VAKGY Q EGLDY +TFS VAK+ T++
Sbjct: 979 TSLPPGKKAVGCKWVFTVKFHADGSLERFKARIVAKGYTQKEGLDYTETFSPVAKMATVK 1038
Query: 183 LVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK-----PNQVCKLSKSLY 237
L++ +++ +L+QLD++NAFL+GDL E +YM +P G + K PN VC+L KS+Y
Sbjct: 1039 LLLKVSASKKWYLNQLDISNAFLNGDLEETIYMKLPDGYADIKGTSLPPNVVCRLKKSIY 1098
Query: 238 GLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEI 297
GLK ASR+W+ K + + G+++ + +LF + FI+LLVYV DI +A
Sbjct: 1099 GLKQASRQWFLKFSNSLLALGFEKQHGDHTLFVRCIGSEFIVLLVYVDDIVIASTTEQAA 1158
Query: 298 TFIKNALNQASKSKIL 313
+ AL + K + L
Sbjct: 1159 QSLTEALKASFKLREL 1174
>gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301674|pir||D84639 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1156
Score = 209 bits (533), Expect = 1e-52
Identities = 107/248 (43%), Positives = 154/248 (61%), Gaps = 5/248 (2%)
Query: 50 IPTTSTTNNSHVQ-----YPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWK 104
+P +S+ ++S VQ YP+S+++S S H F ++ + EPK + EA++ W
Sbjct: 593 LPDSSSQSSSMVQGTSSLYPLSDYVSDDCFSAGHKAFLAAITANDEPKHFKEAVRIKVWN 652
Query: 105 QVMQNELTTLDQTGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIE 164
M E+ L+ TW IVDLPP IG WV K K+N DGSIERYK LV +G Q+E
Sbjct: 653 DAMFKEVDALEINKTWDIVDLPPGKVAIGSQWVYKTKYNADGSIERYKARLVVQGNKQVE 712
Query: 165 GLDYFDTFSLVAKVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTS 224
G DY +TF+ V K+TT+R ++ L + N ++Q+DVNNAFLHGDL E+VYM +PPG S
Sbjct: 713 GEDYNETFAPVVKMTTVRTLLRLVAANQWEVYQMDVNNAFLHGDLDEEVYMKLPPGFRHS 772
Query: 225 KPNQVCKLSKSLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYV 284
P++VC+L KSLYGLK A R W++KL+ + G+ Q + S F+ + +LVYV
Sbjct: 773 HPDKVCRLRKSLYGLKQAPRCWFKKLSDALLRFGFVQGHEDYSFFSYTRNGIELRVLVYV 832
Query: 285 YDITLAGD 292
D+ + G+
Sbjct: 833 DDLLICGN 840
>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1491
Score = 209 bits (531), Expect = 2e-52
Identities = 124/324 (38%), Positives = 178/324 (54%), Gaps = 17/324 (5%)
Query: 8 PTTSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYI-----CN-------IPTTST 55
P+T+ SPP T+P PP R R + DYI C P+TS
Sbjct: 886 PSTNVSPPQQDTTPIIENTPP--RQGKRQVQQLARLKDYILYNASCTPNTPHVLSPSTSQ 943
Query: 56 TNNS---HVQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELT 112
+++S + QYP+++++ S H +F ++ + EPK + EA+K W M E+
Sbjct: 944 SSSSIQGNSQYPLTDYIFDECFSAGHKVFLAAITANDEPKHFKEAVKVKVWNDAMYKEVD 1003
Query: 113 TLDQTGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTF 172
L+ TW IVDLP IG WV K K N DG++ERYK LV +G NQIEG DY +TF
Sbjct: 1004 ALEVNKTWDIVDLPTGKVAIGSQWVYKTKFNADGTVERYKARLVVQGNNQIEGEDYTETF 1063
Query: 173 SLVAKVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVCKL 232
+ V K+TT+R ++ L + N ++Q+DV+NAFLHGDL E+VYM +PPG S P++VC+L
Sbjct: 1064 APVVKMTTVRTLLRLVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRL 1123
Query: 233 SKSLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGD 292
KSLYGLK A R W++KL+ G+ Q + S F+ + +LVYV D+ + G+
Sbjct: 1124 RKSLYGLKQAPRCWFKKLSDALKRFGFIQGYEDYSFFSYSCKGIELRVLVYVDDLIICGN 1183
Query: 293 FLSEITFIKNALNQASKSKILVSL 316
+ K L + K L L
Sbjct: 1184 DEYMVQKFKEYLGRCFSMKDLGKL 1207
>gb|AAD26943.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301694|pir||E84535 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1454
Score = 207 bits (526), Expect = 9e-52
Identities = 118/312 (37%), Positives = 182/312 (57%), Gaps = 17/312 (5%)
Query: 16 ITKTSPSPPFIP------PPIRITTRNKITPTYM*DYICNIPTTSTTNNSHVQYPISNFL 69
I+ T+ SP +P PP + R + P ++ DY CN T S +YPIS+ +
Sbjct: 865 ISDTTHSPSSLPSQISDLPPQISSQRVRKPPAHLNDYHCN------TMQSDHKYPISSTI 918
Query: 70 SHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQTGTWKIVDLPPSS 129
S++ +S SH + ++ P +Y EA W + + E+ +++T TW+I LP
Sbjct: 919 SYSKISPSHMCYINNITKIPIPTNYAEAQDTKEWCEAVDAEIGAMEKTNTWEITTLPKGK 978
Query: 130 KPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVAKVTTIRLVIALAS 189
K +GC WV +K DG++ERYK LVAKGY Q EGLDY DTFS VAK+TTI+L++ +++
Sbjct: 979 KAVGCKWVFTLKFLADGNLERYKARLVAKGYTQKEGLDYTDTFSPVAKMTTIKLLLKVSA 1038
Query: 190 INY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSK-----PNQVCKLSKSLYGLKPASR 244
FL QLDV+NAFL+G+L E+++M IP G + K N V +L +S+YGLK ASR
Sbjct: 1039 SKKWFLKQLDVSNAFLNGELEEEIFMKIPEGYAERKGIVLPSNVVLRLKRSIYGLKQASR 1098
Query: 245 KWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDFLSEITFIKNAL 304
+W++K + ++ G+++ + +LF K F+++LVYV DI +A + + L
Sbjct: 1099 QWFKKFSSSLLSLGFKKTHGDHTLFLKMYDGEFVIVLVYVDDIVIASTSEAAAAQLTEEL 1158
Query: 305 NQASKSKILVSL 316
+Q K + L L
Sbjct: 1159 DQRFKLRDLGDL 1170
>gb|AAO26685.1| gag-pol polyprotein [Vitis vinifera]
Length = 373
Score = 206 bits (525), Expect = 1e-51
Identities = 116/289 (40%), Positives = 168/289 (57%), Gaps = 23/289 (7%)
Query: 13 SPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNIPTTSTTNNS------------- 59
S P+ P+P PP +++ +R +T C P S+++ S
Sbjct: 91 SGPVVNVPPTPAK-PPIVQVYSRRPMTTD-----TCPAPAPSSSDPSSDLDLPISLRKGK 144
Query: 60 -HVQ--YPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTTLDQ 116
H + Y I+NF+S+ +LS S S+ S+ S + PK+ EA+ H WK M E+ L+
Sbjct: 145 RHCKSIYSIANFVSYDHLSSSSSVLVASIDSISVPKTVTEALNHPGWKNAMLAEICALED 204
Query: 117 TGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFSLVA 176
TWK+VDLP K +GC WVS VK N DGS+ R K LVA+GY Q G+DY DTFS VA
Sbjct: 205 NHTWKLVDLPQGKKVVGCKWVSAVKVNPDGSVVRLKARLVARGYAQTYGVDYSDTFSPVA 264
Query: 177 KVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPG-VSTSKPNQVCKLSKS 235
K+ ++RL I++A+ +HQLD+ NAFLHGDL E+VY+ PPG V+ + +VC+L K+
Sbjct: 265 KLNSVRLFISIAASQQRMIHQLDIKNAFLHGDLEEEVYLEQPPGFVAQGEYGKVCRLKKA 324
Query: 236 LYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYV 284
LYGLK + R W+ K + G ++ + S+F KK+ I+L+VYV
Sbjct: 325 LYGLKQSPRAWFGKFSKEIQAFGMNKSEKDHSVFYKKSAAGIILLVVYV 373
>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301698|pir||C84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1501
Score = 206 bits (524), Expect = 2e-51
Identities = 114/323 (35%), Positives = 177/323 (54%), Gaps = 14/323 (4%)
Query: 8 PTTSPSPPITKTSPSPPFIPPPIRITTRNKITPTYM*DYICNI--------------PTT 53
P T +P + + P PP R + R P + DY+ P+
Sbjct: 895 PNTPTTPIVVPVASPIPVSPPKQRKSKRATHPPPKLNDYVLYNAMYTPSSIHALPADPSQ 954
Query: 54 STTNNSHVQYPISNFLSHTYLSKSHSIFAMSLVSYTEPKSYDEAIKHDCWKQVMQNELTT 113
S+T +P+++++S S SH + ++ EPK + EA++ W M E+
Sbjct: 955 SSTVPGKSLFPLTDYVSDAAFSSSHRAYLAAITDNVEPKHFKEAVQIKVWNDAMFTEVDA 1014
Query: 114 LDQTGTWKIVDLPPSSKPIGCIWVSKVKHNVDGSIERYKVCLVAKGYNQIEGLDYFDTFS 173
L+ TW IVDLPP IG WV K K+N DG++ERYK LV +G Q+EG DY +TF+
Sbjct: 1015 LEINKTWDIVDLPPGKVAIGSQWVFKTKYNSDGTVERYKARLVVQGNKQVEGEDYKETFA 1074
Query: 174 LVAKVTTIRLVIALASINY*FLHQLDVNNAFLHGDLHEDVYMAIPPGVSTSKPNQVCKLS 233
V ++TT+R ++ + N ++Q+DV+NAFLHGDL E+VYM +PPG S P++VC+L
Sbjct: 1075 PVVRMTTVRTLLRNVAANQWEVYQMDVHNAFLHGDLEEEVYMKLPPGFRHSHPDKVCRLR 1134
Query: 234 KSLYGLKPASRKWYEKLTCLPITNGYQQATSNASLFTKKNLDSFIMLLVYVYDITLAGDF 293
KSLYGLK A R W++KL+ + G+ Q+ + SLF+ + + +L+YV D+ + G+
Sbjct: 1135 KSLYGLKQAPRCWFKKLSDSLLRFGFVQSYEDYSLFSYTRNNIELRVLIYVDDLLICGND 1194
Query: 294 LSEITFIKNALNQASKSKILVSL 316
+ K+ L++ K L L
Sbjct: 1195 GYMLQKFKDYLSRCFSMKDLGKL 1217
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.336 0.146 0.457
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 841,364,824
Number of Sequences: 2540612
Number of extensions: 33731700
Number of successful extensions: 206082
Number of sequences better than 10.0: 1554
Number of HSP's better than 10.0 without gapping: 1383
Number of HSP's successfully gapped in prelim test: 176
Number of HSP's that attempted gapping in prelim test: 201744
Number of HSP's gapped (non-prelim): 2365
length of query: 552
length of database: 863,360,394
effective HSP length: 133
effective length of query: 419
effective length of database: 525,458,998
effective search space: 220167320162
effective search space used: 220167320162
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 78 (34.7 bits)
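
The trailer values above are enough to re-derive the numbers reported for the top hit (raw score 614, 241 bits, Expect = 6e-62). The sketch below assumes the standard Karlin-Altschul relations: bit score S' = (lambda * S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). It uses the gapped Lambda and K from this report. The function names are illustrative and are not part of BLAST, and because Lambda and K are printed to only three significant figures, the result should agree with the report only approximately.

```python
import math

# Values copied from the statistics trailer of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_LEN = 552              # "length of query"
DB_LETTERS = 863_360_394     # "length of database"
DB_SEQS = 2_540_612          # "Number of sequences in database"
HSP_LEN = 133                # "effective HSP length"


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Shorten the query and every database sequence by the effective HSP length."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


space = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN)
bits = bit_score(614, GAPPED_LAMBDA, GAPPED_K)   # raw score of the top hit
evalue = expect_value(bits, space)

print(space)            # compare with "effective search space"
print(round(bits))      # compare with "Score = 241 bits (614)"
print(f"{evalue:.0e}")  # compare with "Expect = 6e-62"
```

With these inputs the effective query length is 552 - 133 = 419 and the effective database length is 863,360,394 - 2,540,612 * 133 = 525,458,998, which matches the two "effective length" lines of the report.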