
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC139747.1 - phase: 0 /pseudo
(441 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAO73525.1| gag-pol polyprotein [Glycine max] 443 e-123
gb|AAC18777.1| gag-protease polyprotein [Glycine max] gi|7488678... 434 e-120
gb|AAO73523.1| gag-pol polyprotein [Glycine max] 432 e-120
gb|AAO73529.1| gag-pol polyprotein [Glycine max] 432 e-119
gb|AAO73521.1| gag-pol polyprotein [Glycine max] 430 e-119
gb|AAO73527.1| gag-pol polyprotein [Glycine max] 430 e-119
gb|AAC64917.1| gag-pol polyprotein [Glycine max] 397 e-109
dbj|BAB11308.1| copia-like retroelement pol polyprotein [Arabido... 210 6e-53
gb|AAG52949.1| gag/pol polyprotein [Arabidopsis thaliana] 210 6e-53
emb|CAB77910.1| putative transposon protein [Arabidopsis thalian... 202 2e-50
gb|AAC69114.1| putative gag-protease polyprotein [Arabidopsis th... 148 3e-34
gb|AAF18630.1| F5J5.1 [Arabidopsis thaliana] gi|25403474|pir||C8... 146 1e-33
gb|AAC95170.1| copia-like retroelement pol polyprotein [Arabidop... 145 2e-33
emb|CAB80821.1| putative transposon protein [Arabidopsis thalian... 121 5e-26
emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana] gi... 111 4e-23
gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis tha... 110 1e-22
gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis tha... 110 1e-22
gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25... 110 1e-22
emb|CAB75469.1| copia-type reverse transcriptase-like protein [A... 110 1e-22
emb|CAA69271.1| lectin receptor kinase [Arabidopsis thaliana] 110 1e-22
>gb|AAO73525.1| gag-pol polyprotein [Glycine max]
Length = 1576
Score = 443 bits (1140), Expect = e-123
Identities = 230/440 (52%), Positives = 314/440 (71%), Gaps = 5/440 (1%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G TN
Sbjct: 3 MEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTL-EILKIAHEGTTK 121
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILK HEGT+K
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACGEILKTTHEGTSK 122
Query: 122 VKSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPK 181
VK ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+++DEKLVRKILRSLPK
Sbjct: 123 VKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPK 182
Query: 182 RFDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQ 241
RFDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ +N+KK K +AF S+ E +E + +
Sbjct: 183 RFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRNEKKSKNLAFVSNDEGEEDEYDL 242
Query: 242 EDDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKG 301
+ DE +T ++ LLG+QF K++ + D+R + +NI D + E ++DEK KG
Sbjct: 243 DTDEGLTNAVGLLGKQFNKVLNRMDRRQKPHVRNI--PFDIRKGSEYHKKSDEKPSHSKG 300
Query: 302 VQCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDA 361
+QCH CEGYGHIK EC + LKK +K L+V SDD ++ + E +S + V AL GR SD
Sbjct: 301 IQCHGCEGYGHIKAECPTHLKKQRKGLSVCRSDD--TESEQESDSDRDVNALTGRFESDE 358
Query: 362 ESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRM 421
+S ++ +DEL +SY++L K+ I +Q + K + NLE E+ + E+ SEL EV
Sbjct: 359 DSSDIEITFDELAISYRKLCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISELKGEVGF 418
Query: 422 LNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 419 LNSKLENMTKSIKMLNKGSD 438
>gb|AAC18777.1| gag-protease polyprotein [Glycine max] gi|7488678|pir||T06419
gag-proteinase polyprotein - soybean retrovirus-like
element
Length = 640
Score = 434 bits (1117), Expect = e-120
Identities = 229/442 (51%), Positives = 310/442 (69%), Gaps = 8/442 (1%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++ WEHP L +G T+
Sbjct: 3 MEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKDWEHPKMLDTEGKPTDG 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILK HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+++DEKLVRKILRSLPKR
Sbjct: 123 KMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + + +
Sbjct: 183 FDMKVTAIEEAQDICNLRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLD 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LLG+QF K++ + D+R + +NI D + E R+DEK KG
Sbjct: 243 TDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNI--PFDIRKGSEYQKRSDEKPSHSKGF 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QCH CEGYGHIK EC + LKK +K L+V SDD ++ + E +S + V AL GR F AE
Sbjct: 301 QCHGCEGYGHIKAECPTHLKKQRKGLSVCRSDD--TESEQESDSDRDVNALTGR-FESAE 357
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL SY+ L K+ I +Q + K + NLE E+ + E+ SEL EV
Sbjct: 358 DSSDTDSEITFDELATSYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISELKGEV 417
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 418 GFLNSKLENMTKSIKMLNKGSD 439
>gb|AAO73523.1| gag-pol polyprotein [Glycine max]
Length = 1576
Score = 432 bits (1112), Expect = e-120
Identities = 229/442 (51%), Positives = 312/442 (69%), Gaps = 9/442 (2%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G T+
Sbjct: 3 MEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILK HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACEILKSTHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+I+DEKLVRKILRSLPKR
Sbjct: 123 KMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYDLD 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LLG+QF K++ + DKR + QNI +I +K R+D K KG+
Sbjct: 243 TDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQK--RSDVKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QCH CEGYGHI EC + LKK +K L+V SD ++ + E +S + V ALIG +F AE
Sbjct: 301 QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSD---TESEQESDSDRDVNALIG-IFETAE 356
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL SY++L K+ I +Q + K + +LE E+ + E+ SEL EV
Sbjct: 357 DSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEV 416
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 417 GFLNSKLENMTKSIKMLNKGSD 438
>gb|AAO73529.1| gag-pol polyprotein [Glycine max]
Length = 1577
Score = 432 bits (1110), Expect = e-119
Identities = 225/442 (50%), Positives = 311/442 (69%), Gaps = 8/442 (1%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G TN
Sbjct: 3 MEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILK HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+ IL+IAN+ + GE+++DEKLVRKILRSLPKR
Sbjct: 123 KMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGERMTDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLD 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LG+QF K++ + D+R + +NI ++D + E ++DEK KG+
Sbjct: 243 TDEGLTNAVVFLGKQFNKVLNRMDRRQKPHVRNI--SLDIRKGSEYQRKSDEKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QC CEGYGHIK EC + LKK +K L+V SDD ++ + E +S + V AL GR F AE
Sbjct: 301 QCRGCEGYGHIKAECPTHLKKQRKGLSVCRSDD--TESEQESDSDRDVNALTGR-FESAE 357
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL + Y+ L K+ I +Q + K + NLE E+ + E+ S+L EV
Sbjct: 358 DSSDTDSEITFDELAIFYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISKLKGEV 417
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 418 GFLNSKLENMTKSIKMLNKGSD 439
>gb|AAO73521.1| gag-pol polyprotein [Glycine max]
Length = 1574
Score = 430 bits (1106), Expect = e-119
Identities = 228/442 (51%), Positives = 311/442 (69%), Gaps = 9/442 (2%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G T+
Sbjct: 3 MEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILKI HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+I+DEKLVRKILRSLPKR
Sbjct: 123 KISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYDLN 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LLG+QF K++ + DKR + QNI +I +K ++D K KG+
Sbjct: 243 TDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQK--KSDVKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QCH CEGYGHI EC + LKK +K L+V SD ++ + E +S + V AL G +F AE
Sbjct: 301 QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSD---TESEQESDSDRDVNALTG-IFETAE 356
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL SY++L K+ I +Q + K + +LE E+ + E+ SEL EV
Sbjct: 357 DSSDTDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEV 416
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 417 GFLNSKLENMTKSIKMLNKGSD 438
>gb|AAO73527.1| gag-pol polyprotein [Glycine max]
Length = 1576
Score = 430 bits (1105), Expect = e-119
Identities = 228/442 (51%), Positives = 311/442 (69%), Gaps = 9/442 (2%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G T+
Sbjct: 3 MEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILKI HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+I+DEKLVRKILRSLPKR
Sbjct: 123 KMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYDLD 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LLG+QF K++ + DKR + QNI +I +K R+D K KG+
Sbjct: 243 TDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQK--RSDVKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QCH CEGYGHI EC + LKK +K L+V SD ++ + E +S + V AL G +F AE
Sbjct: 301 QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSD---TESEQESDSDRDVNALTG-IFETAE 356
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL SY++L K+ I +Q + K + +LE E+ + E+ SEL EV
Sbjct: 357 DSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHEEEISELKGEV 416
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L + K +KM+ +D
Sbjct: 417 GFLNSKLETMKKSIKMLNKGSD 438
>gb|AAC64917.1| gag-pol polyprotein [Glycine max]
Length = 1550
Score = 397 bits (1020), Expect = e-109
Identities = 210/417 (50%), Positives = 291/417 (69%), Gaps = 8/417 (1%)
Query: 28 MSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAEDELALGNSKALNAL 87
M FLK +D++T AV++GWEHP L +G TN LKP E+WT EDELALGNSKALNAL
Sbjct: 1 MVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKEEDELALGNSKALNAL 60
Query: 88 FNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKYENLRMLDDESIQD 147
FN VDKN+F+LI C +AKD EILK HEGT+KVK ++ QLL TK+ENL+M ++E I +
Sbjct: 61 FNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHE 120
Query: 148 YHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEARDISSFKVDELIGS 207
+H+NIL+IAN+ + GE+++DEKLVRKILRSLPKRFDMKVTAIEEA+DI + +VDELIGS
Sbjct: 121 FHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGS 180
Query: 208 LQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTLLGRQFKKIVKQYDK 267
LQ FE+ ++ + +KK K +AF S+ E +E + + + DE +T ++ LLG+QF K++ + D+
Sbjct: 181 LQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDR 240
Query: 268 RPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGVQCHECEGYGHIKIECASFLKK*KKS 327
R + +NI D + E R+DEK KG+QCH CEGYGHIK EC + LKK +K
Sbjct: 241 RQKPHVRNI--PFDIRKGSEYQKRSDEKPSHSKGIQCHGCEGYGHIKAECPTHLKKQRKG 298
Query: 328 LTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAESCSE---DLAYDELVVSYKRLNDKN 384
L+V SDD ++ + E +S + V AL GR F AE S+ ++ +DEL +SY+ L K+
Sbjct: 299 LSVCRSDD--TESEQESDSDRDVNALTGR-FESAEDSSDTDSEITFDELAISYRELCIKS 355
Query: 385 TDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRMLNSQLSNVMKQVKMMAARTD 441
I +Q + K + NLE E+ + ++ SEL E+ LNS+L N+ K +KM+ +D
Sbjct: 356 EKILQQEAQLKKVIANLEAEKEAHEDEISELKGEIGFLNSKLENMTKSIKMLNKGSD 412
>dbj|BAB11308.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1013
Score = 210 bits (535), Expect = 6e-53
Identities = 135/455 (29%), Positives = 236/455 (51%), Gaps = 45/455 (9%)
Query: 5 KEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLK 64
KE V +L+ NY +WK +M ++ + + IA GW+ PV +G +VLK
Sbjct: 5 KEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGE--DVLK 62
Query: 65 P*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKS 124
++WT AE+ A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK
Sbjct: 63 TEDQWTDAEEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKR 122
Query: 125 AKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFD 184
++ +L +++ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+
Sbjct: 123 SRIDMLASQFENLTMDESENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFE 182
Query: 185 MKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDD 244
K TA+ + D + +E++G LQ +E+ + S KG+A A S E +E+Q
Sbjct: 183 SKRTAMGTSLDTDTIDFEEVVGMLQAYELEITSGKGGYSKGVALAVSSEKNEIQ------ 236
Query: 245 EDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGVQC 304
++ +S++++ + F + +K+ +KR + NQ S R ++N + +QC
Sbjct: 237 -ELKDSMSMMAKNFSRAMKRVEKRGFA---------RNQGSDRDRDRDRDRNSKRSEIQC 286
Query: 305 HECEGYGHIKIECASFLKK*KKSLT----------------------VSWSDDDGSKGDG 342
HEC+GYGHIK EC S +K K ++ SD D D
Sbjct: 287 HECQGYGHIKAECPSLKRKDLKCSECRGIGHTKFDCIGSKSKPDRSYIAESDSDSDDEDS 346
Query: 343 ERESIKHVAALIGRVFSD---AESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITN 399
E E +K + +G + D ++S ++ ++ +S +D D+ + +
Sbjct: 347 E-EDVKGFVSFVGIIEDDNVSSDSSDSEVGCEKEEISADDESDVEMDVDGEFRKLYENWL 405
Query: 400 NLEEERVGYLEKNSELNSEVRMLNSQLSNVMKQVK 434
L +E+V +LE+ ++ ++ L +L+ V Q+K
Sbjct: 406 VLSKEKVIWLEEKVKVQEQIEQLKGELA-VANQIK 439
>gb|AAG52949.1| gag/pol polyprotein [Arabidopsis thaliana]
Length = 1643
Score = 210 bits (535), Expect = 6e-53
Identities = 135/455 (29%), Positives = 236/455 (51%), Gaps = 45/455 (9%)
Query: 5 KEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLK 64
KE V +L+ NY +WK +M ++ + + IA GW+ PV +G +VLK
Sbjct: 5 KEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGE--DVLK 62
Query: 65 P*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKS 124
++WT AE+ A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK
Sbjct: 63 TEDQWTDAEEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKR 122
Query: 125 AKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFD 184
++ +L +++ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+
Sbjct: 123 SRIDMLASQFENLTMDESENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFE 182
Query: 185 MKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDD 244
K TA+ + D + +E++G LQ +E+ + S KG+A A S E +E+Q
Sbjct: 183 SKRTAMGTSLDTDTIDFEEVVGMLQAYELEITSGKGGYSKGVALAVSSEKNEIQ------ 236
Query: 245 EDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGVQC 304
++ +S++++ + F + +K+ +KR + NQ S R ++N + +QC
Sbjct: 237 -ELKDSMSMMAKNFSRAMKRVEKRGFA---------RNQGSDRDRDRDRDRNSKRSEIQC 286
Query: 305 HECEGYGHIKIECASFLKK*KKSLT----------------------VSWSDDDGSKGDG 342
HEC+GYGHIK EC S +K K ++ SD D D
Sbjct: 287 HECQGYGHIKAECPSLKRKDLKCSECRGIGHTKFDCIGSKSKPDRSYIAESDSDSDDEDS 346
Query: 343 ERESIKHVAALIGRVFSD---AESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITN 399
E E +K + +G + D ++S ++ ++ +S +D D+ + +
Sbjct: 347 E-EDVKGFVSFVGIIEDDNVSSDSSDSEVGCEKEEISADDESDVEMDVDGEFRKLYENWL 405
Query: 400 NLEEERVGYLEKNSELNSEVRMLNSQLSNVMKQVK 434
L +E+V +LE+ ++ ++ L +L+ V Q+K
Sbjct: 406 VLSKEKVIWLEEKVKVQEQIEQLKGELA-VANQIK 439
>emb|CAB77910.1| putative transposon protein [Arabidopsis thaliana]
gi|4773881|gb|AAD29754.1| putative transposon protein
[Arabidopsis thaliana] gi|25407270|pir||H85055 probable
transposon protein [imported] - Arabidopsis thaliana
Length = 1008
Score = 202 bits (514), Expect = 2e-50
Identities = 138/476 (28%), Positives = 247/476 (50%), Gaps = 51/476 (10%)
Query: 5 KEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLK 64
KE V +L+ NY +WK +M ++ + + IA GW+ PV DG +VLK
Sbjct: 5 KEFVAVGKTIMLEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGE--DVLK 62
Query: 65 P*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKS 124
++W AE+ A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK
Sbjct: 63 TKDQWNDAEEAKAKANSRALSLIFNFVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKR 122
Query: 125 AKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFD 184
++ +L +++ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+
Sbjct: 123 SRIDMLASQFENLSMEETENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFE 182
Query: 185 MKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDD 244
K TA+ + D S +E++G LQ +E+ + S KG+A A+S + +E+Q
Sbjct: 183 SKRTAMGTSLDTDSIDFEEVVGMLQAYELEITSGKGGYSKGLALAASAKKNEIQ------ 236
Query: 245 EDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKM----------VRTDE 294
++ ++++++ + F + +++ +K+ Q D ++++ ++ +
Sbjct: 237 -ELKDTMSMMAKDFSRAMRRVEKKGFGRNQGTDRYRDRSSKRDEIQCHECQGYGHIKAEC 295
Query: 295 KNFQYKGVQCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALI 354
+ + K ++C EC G GH K +C K KS + S S+ D + GD E + IK + +
Sbjct: 296 PSLKRKDLKCSECNGLGHTKFDCVGSKSKPDKSCS-SESESDSNDGDSE-DYIKGFVSFV 353
Query: 355 GRV-------FSDAESCSEDLAYDE---------LVVSYKRLNDKNTDICKQ----LEEQ 394
G + S+A+ ED + DE + +++L D + K+ LEE+
Sbjct: 354 GIIEEKDESSDSEADGEDEDNSADEDSDIEKDVNINEEFRKLYDSWLMLSKEKVAWLEEK 413
Query: 395 ---KNITNNLEEERVGYLEKNSEL-------NSEVRMLNSQLSNVMKQVKMMAART 440
+ +T L+ E +KNSEL + R L+ +LS+ K + M+ + T
Sbjct: 414 LKVQELTEKLKGELTAANQKNSELIQKCSVAEEKNRELSQELSDTRKNIHMLNSGT 469
>gb|AAC69114.1| putative gag-protease polyprotein [Arabidopsis thaliana]
gi|25411268|pir||B84482 probable gag-proteinase
polyprotein [imported] - Arabidopsis thaliana
Length = 627
Score = 148 bits (374), Expect = 3e-34
Identities = 124/427 (29%), Positives = 200/427 (46%), Gaps = 53/427 (12%)
Query: 15 LLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAED 74
LLD Y YWK M+ ++RG E DG + KP WTA E
Sbjct: 15 LLDTKRYGYWKVCMT------------QIIRGQE-------DG--FKITKPKANWTAEEK 53
Query: 75 ELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKY 134
+ N++A+ A+FN VD++ FKLI+ C AK + L+ +HEGT+ VK + + T++
Sbjct: 54 LQSKFNARAMKAIFNGVDEDEFKLIQGCKSAKQAWDTLQKSHEGTSSVKRTRLDHIATQF 113
Query: 135 ENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEAR 194
E L+M DE I + I +AN E G+ D+KLV+K+LR LP +F + A
Sbjct: 114 EYLKMEPDEKIVKFSSKISALANEAEVMGKTYKDQKLVKKLLRCLPPKFAAHKAVMRVAG 173
Query: 195 DISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTLL 254
+ +L+G L+ E+ + K K IAF + ++ Q ++ + + LL
Sbjct: 174 NTDKISFVDLVGMLKLEEMKADQDKVKPSKNIAFNADQGSEQFQ-------EIKDGMALL 226
Query: 255 GRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNF-QYKGVQCHECEGYGHI 313
R F K +K R ID + S+ + R++ + + K +QC+EC G+GHI
Sbjct: 227 ARNFGKALK-------------RVEIDGERSRGRFSRSENDDLRKKKEIQCYECGGFGHI 273
Query: 314 KIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAESCSED------ 367
K EC + K K+ + +K + +S +LI FSD+ES E
Sbjct: 274 KPECP--ITKRKEMKCLKCKGVGHTKFECPNKSKLKEKSLIS--FSDSESDDEGEELLNF 329
Query: 368 LAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRMLNSQLS 427
+A+ S K ++D ++D C + K+ L + V + +L E L ++L+
Sbjct: 330 VAFMASSDSSKFMSDTDSD-CDEELNPKDKYRVLYDSWVQLSKDKLKLVKEKLTLEAKLA 388
Query: 428 NVMKQVK 434
NV + K
Sbjct: 389 NVSTEDK 395
>gb|AAF18630.1| F5J5.1 [Arabidopsis thaliana] gi|25403474|pir||C86482 protein
F5J5.1 [imported] - Arabidopsis thaliana
Length = 1463
Score = 146 bits (369), Expect = 1e-33
Identities = 116/433 (26%), Positives = 204/433 (46%), Gaps = 53/433 (12%)
Query: 15 LLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAED 74
LLD Y YWK RM+ ++ AV GWE P L DG + KP WTA E
Sbjct: 15 LLDTKRYGYWKVRMTQIIRGQGEDAWTAVEEGWEPPFDLTEDG--FKITKPKANWTAEEK 72
Query: 75 ELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKY 134
+ N++A+NA+ N +D++ FKLI+ C AK + L+ +HEGT+ VK + + T++
Sbjct: 73 LQSKFNARAMNAIVNGIDEDEFKLIQGCKSAKQAWDTLQKSHEGTSSVKRTRLDHIATQF 132
Query: 135 ENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEAR 194
E L+M E+I + I +AN E G+ D+KLV+K+LR LP +F + A
Sbjct: 133 EYLKMEPYETIVKFSSKISALANEAEVLGKTYKDQKLVKKLLRCLPPKFPAHKAVMRVAG 192
Query: 195 DISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTLL 254
+ +L+G L++ E+ + K K IAF + ++ Q + + + LL
Sbjct: 193 NTDKISFVDLVGMLKSEEMEPDQDKVKPSKNIAFNADQGSEQFQ-------QIKDGMALL 245
Query: 255 GRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNF-QYKGVQCHECEGYGHI 313
R F K +K+ + R ++ N D + S+ + R++ + + K +QC+
Sbjct: 246 ARNFGKALKRVE-RGQNRDSTSWSNKDGETSRGRFSRSENDDLGKKKEIQCY-------- 296
Query: 314 KIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALI----GRVFSDAES-CSEDL 368
DD S +GE E + VA + +V SD +S C +++
Sbjct: 297 --------------------DDPESDDEGE-ELLNFVAFMASSDSSKVMSDTDSDCDQEV 335
Query: 369 ----AYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSE-VRMLN 423
Y L S+ +L DK ++E + + +++ +L++ + + ++L
Sbjct: 336 NPKDEYRVLYDSWMQLKDKQKLSGITVDEN---SQDYYQKKFDWLQEECHMERDRAKLLE 392
Query: 424 SQLSNVMKQVKMM 436
+L++ KQ++M+
Sbjct: 393 RELNDKHKQIRML 405
>gb|AAC95170.1| copia-like retroelement pol polyprotein [Arabidopsis thaliana]
gi|25411196|pir||B84473 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 916
Score = 145 bits (366), Expect = 2e-33
Identities = 79/227 (34%), Positives = 135/227 (58%), Gaps = 2/227 (0%)
Query: 15 LLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAED 74
+L+ NY +WK +M ++ + + IA GW+ PV DG +VLK ++W AE+
Sbjct: 27 ILEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGE--DVLKTEDQWNDAEE 84
Query: 75 ELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKY 134
A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK ++ +L +++
Sbjct: 85 AKATANSRALSLIFNSVNQNQFKQIQNCESAKEAWDKLAKAYEGTSSVKRSRIDMLASQF 144
Query: 135 ENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEAR 194
ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+ K TA+ +
Sbjct: 145 ENLTMEETENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFESKRTAMGTSL 204
Query: 195 DISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQ 241
D +S +E++G Q +E+ + S G A S++ +L+ ++
Sbjct: 205 DTNSIDFEEVVGMFQAYELEITSGKGGYGHIKAECPSLKRKDLKCSE 251
Score = 54.3 bits (129), Expect = 7e-06
Identities = 48/181 (26%), Positives = 87/181 (47%), Gaps = 32/181 (17%)
Query: 290 VRTDEKNFQYKGVQCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKH 349
++ + + + K ++C EC+G GHIK +C K +S + S S+ D + GD E + IK
Sbjct: 235 IKAECPSLKRKDLKCSECKGLGHIKFDCVGSKSKPDRSCS-SESESDSNDGDSE-DYIKG 292
Query: 350 VAALIGRV-------FSDAESCSEDLAYDE---------LVVSYKRLNDKNTDICKQ--- 390
+ +G + S+A+ ED + DE + +++L D + K+
Sbjct: 293 FVSFVGIIEEKDESSDSEADGEDEDNSADEDSDIEKDVKINEEFRKLYDSWLMLSKEKVA 352
Query: 391 -LEEQ---KNITNNLEEERVGYLEKNSELNSEV-------RMLNSQLSNVMKQVKMMAAR 439
LEE+ + +T L+ E +KNSEL + R L+ +LS+ K++ M+ +
Sbjct: 353 WLEEKLKVQELTEKLKGELTAANQKNSELTQKCSVAEEKNRELSQELSDTRKKIHMLNSG 412
Query: 440 T 440
T
Sbjct: 413 T 413
>emb|CAB80821.1| putative transposon protein [Arabidopsis thaliana]
gi|4773900|gb|AAD29770.1| putative transposon protein
[Arabidopsis thaliana] gi|25407277|pir||E85057 probable
transposon protein [imported] - Arabidopsis thaliana
Length = 590
Score = 121 bits (303), Expect = 5e-26
Identities = 73/248 (29%), Positives = 125/248 (49%), Gaps = 5/248 (2%)
Query: 13 PPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAA 72
P LD +Y YWK + ++ +D AV GW P T D V K EW A
Sbjct: 345 PLKLDAEHYGYWKVLIKRSIQSIDMDAWFAVEDGWMPPTT--KDAKRDIVSKSRTEWIAD 402
Query: 73 EDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTT 132
E A NS+AL+ +F + +N F ++ C+ AK+ EIL+++ E T VK + +L +
Sbjct: 403 EKTAANHNSQALSVIFGSLLRNKFTQVQGCLSAKEVWEILQVSFECTNNVKRTRLDMLAS 462
Query: 133 KYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEE 192
++ENL M +ES+ D++ + I G+ D+K+V+K LRSLP +F +AI+
Sbjct: 463 EFENLTMEAEESVDDFNGKLSSITQEAVVLGKTYKDKKMVKKFLRSLPDKFQSHKSAIDV 522
Query: 193 ARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLT 252
+ + K D+++G +Q ++ K + + ++E D+ V ++ +SL
Sbjct: 523 SLNSDQLKFDQVVGMMQAYD---TDKEEILNSYATYFGAIEDDDHTVEEDAQMGTIKSLI 579
Query: 253 LLGRQFKK 260
L+ +K
Sbjct: 580 LIQSDSEK 587
Score = 58.2 bits (139), Expect = 5e-07
Identities = 29/94 (30%), Positives = 54/94 (56%)
Query: 133 KYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEE 192
+YENL+M + ++I + ++++ N GE+ SD ++V+KIL SLPKRFD+ V +++
Sbjct: 97 EYENLKMKESDNINTFMTKLIEMGNQLRVHGEEKSDYQIVQKILISLPKRFDIIVAMMKQ 156
Query: 193 ARDISSFKVDELIGSLQNFEITVNSKNDKKGKGI 226
+D++S + + + KK KG+
Sbjct: 157 TKDLTSLSAGKWCDVCERKNHNESDCWMKKNKGV 190
>emb|CAB71063.1| copia-type polyprotein [Arabidopsis thaliana]
gi|11278364|pir||T47925 copia-type polyprotein -
Arabidopsis thaliana
Length = 1352
Score = 111 bits (278), Expect = 4e-23
Identities = 81/310 (26%), Positives = 151/310 (48%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E + KK + IA ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYE-----EKKKKKEDIA----EQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>gb|AAG60117.1| copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1352
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IIEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>gb|AAG50765.1| copia-type polyprotein, putative [Arabidopsis thaliana]
gi|12321254|gb|AAG50698.1| copia-type polyprotein,
putative [Arabidopsis thaliana] gi|25301687|pir||F96614
probable copia-type polyprotein T18I24.5 [imported] -
Arabidopsis thaliana
Length = 1320
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IVEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>gb|AAD50001.1| Hypothetical protein [Arabidopsis thaliana] gi|25301681|pir||F86246
hypothetical protein [imported] - Arabidopsis thaliana
Length = 1352
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IVEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>emb|CAB75469.1| copia-type reverse transcriptase-like protein [Arabidopsis
thaliana] gi|11278363|pir||T49313 copia-type reverse
transcriptase-like protein - Arabidopsis thaliana
Length = 1272
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IVEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>emb|CAA69271.1| lectin receptor kinase [Arabidopsis thaliana]
Length = 544
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 27 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 81
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 82 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 135
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 136 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 195
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 196 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IVEQVLNMQITKEENGQSYQ---- 242
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 243 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 300
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 301 GKFGHYASEC 310
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.318 0.135 0.378
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 689,308,274
Number of Sequences: 2540612
Number of extensions: 28543577
Number of successful extensions: 111756
Number of sequences better than 10.0: 760
Number of HSP's better than 10.0 without gapping: 205
Number of HSP's successfully gapped in prelim test: 573
Number of HSP's that attempted gapping in prelim test: 110160
Number of HSP's gapped (non-prelim): 1998
length of query: 441
length of database: 863,360,394
effective HSP length: 131
effective length of query: 310
effective length of database: 530,540,222
effective search space: 164467468820
effective search space used: 164467468820
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)
Medicago: description of AC139747.1