
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC139747.1 - phase: 0 /pseudo
(441 letters)
Database: uniref100
2,790,947 sequences; 848,049,833 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
UniRef100_Q84VI0 Gag-pol polyprotein [Glycine max]                   443   e-123
UniRef100_Q84VI2 Gag-pol polyprotein [Glycine max]                   432   e-120
UniRef100_Q84VH6 Gag-pol polyprotein [Glycine max]                   432   e-119
UniRef100_Q84VI4 Gag-pol polyprotein [Glycine max]                   430   e-119
UniRef100_Q84VH8 Gag-pol polyprotein [Glycine max]                   430   e-119
UniRef100_O65147 Gag-pol polyprotein [Glycine max]                   397   e-109
UniRef100_Q9FG84 Copia-like retroelement pol polyprotein [Arabid...  210   6e-53
UniRef100_Q9C5V1 Gag/pol polyprotein [Arabidopsis thaliana]          210   6e-53
UniRef100_Q9XEC0 Putative transposon protein [Arabidopsis thaliana]  202   2e-50
UniRef100_Q9ZV83 Putative gag-protease polyprotein [Arabidopsis ...  148   3e-34
UniRef100_Q9SKW9 F5J5.1 [Arabidopsis thaliana]                       146   1e-33
UniRef100_Q9ZUF5 Copia-like retroelement pol polyprotein [Arabid...  145   2e-33
UniRef100_Q9XEB1 Putative transposon protein [Arabidopsis thaliana]  121   5e-26
UniRef100_Q9M2D1 Copia-type polyprotein [Arabidopsis thaliana]       111   4e-23
UniRef100_Q9C536 Copia-type polyprotein, putative [Arabidopsis t...  110   1e-22
UniRef100_Q9SXB2 T28P6.8 protein [Arabidopsis thaliana]              110   1e-22
UniRef100_Q9M197 Copia-type reverse transcriptase-like protein [...  110   1e-22
UniRef100_Q9C739 Copia-type polyprotein, putative [Arabidopsis t...  110   1e-22
UniRef100_Q9SFE1 T26F17.17 [Arabidopsis thaliana]                    102   2e-20
UniRef100_Q9LH44 Copia-like retrotransposable element [Arabidops...  101   4e-20
>UniRef100_Q84VI0 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 443 bits (1140), Expect = e-123
Identities = 230/440 (52%), Positives = 314/440 (71%), Gaps = 5/440 (1%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G TN
Sbjct: 3 MEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTL-EILKIAHEGTTK 121
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILK HEGT+K
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACGEILKTTHEGTSK 122
Query: 122 VKSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPK 181
VK ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+++DEKLVRKILRSLPK
Sbjct: 123 VKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERMTDEKLVRKILRSLPK 182
Query: 182 RFDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQ 241
RFDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ +N+KK K +AF S+ E +E + +
Sbjct: 183 RFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRNEKKSKNLAFVSNDEGEEDEYDL 242
Query: 242 EDDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKG 301
+ DE +T ++ LLG+QF K++ + D+R + +NI D + E ++DEK KG
Sbjct: 243 DTDEGLTNAVGLLGKQFNKVLNRMDRRQKPHVRNI--PFDIRKGSEYHKKSDEKPSHSKG 300
Query: 302 VQCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDA 361
+QCH CEGYGHIK EC + LKK +K L+V SDD ++ + E +S + V AL GR SD
Sbjct: 301 IQCHGCEGYGHIKAECPTHLKKQRKGLSVCRSDD--TESEQESDSDRDVNALTGRFESDE 358
Query: 362 ESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRM 421
+S ++ +DEL +SY++L K+ I +Q + K + NLE E+ + E+ SEL EV
Sbjct: 359 DSSDIEITFDELAISYRKLCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISELKGEVGF 418
Query: 422 LNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 419 LNSKLENMTKSIKMLNKGSD 438
>UniRef100_Q84VI2 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 432 bits (1112), Expect = e-120
Identities = 229/442 (51%), Positives = 312/442 (69%), Gaps = 9/442 (2%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G T+
Sbjct: 3 MEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILK HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACEILKSTHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+I+DEKLVRKILRSLPKR
Sbjct: 123 KMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYDLD 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LLG+QF K++ + DKR + QNI +I +K R+D K KG+
Sbjct: 243 TDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQK--RSDVKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QCH CEGYGHI EC + LKK +K L+V SD ++ + E +S + V ALIG +F AE
Sbjct: 301 QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSD---TESEQESDSDRDVNALIG-IFETAE 356
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL SY++L K+ I +Q + K + +LE E+ + E+ SEL EV
Sbjct: 357 DSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEV 416
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 417 GFLNSKLENMTKSIKMLNKGSD 438
>UniRef100_Q84VH6 Gag-pol polyprotein [Glycine max]
Length = 1577
Score = 432 bits (1110), Expect = e-119
Identities = 225/442 (50%), Positives = 311/442 (69%), Gaps = 8/442 (1%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G TN
Sbjct: 3 MEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILK HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+ IL+IAN+ + GE+++DEKLVRKILRSLPKR
Sbjct: 123 KMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGERMTDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLD 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LG+QF K++ + D+R + +NI ++D + E ++DEK KG+
Sbjct: 243 TDEGLTNAVVFLGKQFNKVLNRMDRRQKPHVRNI--SLDIRKGSEYQRKSDEKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QC CEGYGHIK EC + LKK +K L+V SDD ++ + E +S + V AL GR F AE
Sbjct: 301 QCRGCEGYGHIKAECPTHLKKQRKGLSVCRSDD--TESEQESDSDRDVNALTGR-FESAE 357
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL + Y+ L K+ I +Q + K + NLE E+ + E+ S+L EV
Sbjct: 358 DSSDTDSEITFDELAIFYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISKLKGEV 417
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 418 GFLNSKLENMTKSIKMLNKGSD 439
>UniRef100_Q84VI4 Gag-pol polyprotein [Glycine max]
Length = 1574
Score = 430 bits (1106), Expect = e-119
Identities = 228/442 (51%), Positives = 311/442 (69%), Gaps = 9/442 (2%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G T+
Sbjct: 3 MEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILKI HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+I+DEKLVRKILRSLPKR
Sbjct: 123 KISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYDLN 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LLG+QF K++ + DKR + QNI +I +K ++D K KG+
Sbjct: 243 TDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQK--KSDVKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QCH CEGYGHI EC + LKK +K L+V SD ++ + E +S + V AL G +F AE
Sbjct: 301 QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSD---TESEQESDSDRDVNALTG-IFETAE 356
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL SY++L K+ I +Q + K + +LE E+ + E+ SEL EV
Sbjct: 357 DSSDTDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEV 416
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L N+ K +KM+ +D
Sbjct: 417 GFLNSKLENMTKSIKMLNKGSD 438
>UniRef100_Q84VH8 Gag-pol polyprotein [Glycine max]
Length = 1576
Score = 430 bits (1105), Expect = e-119
Identities = 228/442 (51%), Positives = 311/442 (69%), Gaps = 9/442 (2%)
Query: 3 MDKEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNV 62
M+KEGG VN PP+LDG NY+YWK+RM FLK +D++T AV++GWEHP L +G T+
Sbjct: 3 MEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDE 62
Query: 63 LKP*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKV 122
LKP E+WT EDELALGNSKALNALFN VDKN+F+LI C +AKD EILKI HEGT+KV
Sbjct: 63 LKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKV 122
Query: 123 KSAKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKR 182
K ++ QLL TK+ENL+M ++E I D+H+NIL+IAN+ + GE+I+DEKLVRKILRSLPKR
Sbjct: 123 KMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKR 182
Query: 183 FDMKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQE 242
FDMKVTAIEEA+DI + +VDELIGSLQ FE+ ++ + +KK K +AF S+ E +E + + +
Sbjct: 183 FDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYDLD 242
Query: 243 DDEDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGV 302
DE +T ++ LLG+QF K++ + DKR + QNI +I +K R+D K KG+
Sbjct: 243 TDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQK--RSDVKPSHSKGI 300
Query: 303 QCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAE 362
QCH CEGYGHI EC + LKK +K L+V SD ++ + E +S + V AL G +F AE
Sbjct: 301 QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSD---TESEQESDSDRDVNALTG-IFETAE 356
Query: 363 SCSE---DLAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEV 419
S+ ++ +DEL SY++L K+ I +Q + K + +LE E+ + E+ SEL EV
Sbjct: 357 DSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHEEEISELKGEV 416
Query: 420 RMLNSQLSNVMKQVKMMAARTD 441
LNS+L + K +KM+ +D
Sbjct: 417 GFLNSKLETMKKSIKMLNKGSD 438
>UniRef100_O65147 Gag-pol polyprotein [Glycine max]
Length = 1550
Score = 397 bits (1020), Expect = e-109
Identities = 210/417 (50%), Positives = 291/417 (69%), Gaps = 8/417 (1%)
Query: 28 MSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAEDELALGNSKALNAL 87
M FLK +D++T AV++GWEHP L +G TN LKP E+WT EDELALGNSKALNAL
Sbjct: 1 MVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKEEDELALGNSKALNAL 60
Query: 88 FNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKYENLRMLDDESIQD 147
FN VDKN+F+LI C +AKD EILK HEGT+KVK ++ QLL TK+ENL+M ++E I +
Sbjct: 61 FNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHE 120
Query: 148 YHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEARDISSFKVDELIGS 207
+H+NIL+IAN+ + GE+++DEKLVRKILRSLPKRFDMKVTAIEEA+DI + +VDELIGS
Sbjct: 121 FHMNILEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGS 180
Query: 208 LQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTLLGRQFKKIVKQYDK 267
LQ FE+ ++ + +KK K +AF S+ E +E + + + DE +T ++ LLG+QF K++ + D+
Sbjct: 181 LQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDR 240
Query: 268 RPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGVQCHECEGYGHIKIECASFLKK*KKS 327
R + +NI D + E R+DEK KG+QCH CEGYGHIK EC + LKK +K
Sbjct: 241 RQKPHVRNI--PFDIRKGSEYQKRSDEKPSHSKGIQCHGCEGYGHIKAECPTHLKKQRKG 298
Query: 328 LTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAESCSE---DLAYDELVVSYKRLNDKN 384
L+V SDD ++ + E +S + V AL GR F AE S+ ++ +DEL +SY+ L K+
Sbjct: 299 LSVCRSDD--TESEQESDSDRDVNALTGR-FESAEDSSDTDSEITFDELAISYRELCIKS 355
Query: 385 TDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRMLNSQLSNVMKQVKMMAARTD 441
I +Q + K + NLE E+ + ++ SEL E+ LNS+L N+ K +KM+ +D
Sbjct: 356 EKILQQEAQLKKVIANLEAEKEAHEDEISELKGEIGFLNSKLENMTKSIKMLNKGSD 412
>UniRef100_Q9FG84 Copia-like retroelement pol polyprotein [Arabidopsis thaliana]
Length = 1013
Score = 210 bits (535), Expect = 6e-53
Identities = 135/455 (29%), Positives = 236/455 (51%), Gaps = 45/455 (9%)
Query: 5 KEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLK 64
KE V +L+ NY +WK +M ++ + + IA GW+ PV +G +VLK
Sbjct: 5 KEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGE--DVLK 62
Query: 65 P*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKS 124
++WT AE+ A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK
Sbjct: 63 TEDQWTDAEEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKR 122
Query: 125 AKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFD 184
++ +L +++ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+
Sbjct: 123 SRIDMLASQFENLTMDESENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFE 182
Query: 185 MKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDD 244
K TA+ + D + +E++G LQ +E+ + S KG+A A S E +E+Q
Sbjct: 183 SKRTAMGTSLDTDTIDFEEVVGMLQAYELEITSGKGGYSKGVALAVSSEKNEIQ------ 236
Query: 245 EDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGVQC 304
++ +S++++ + F + +K+ +KR + NQ S R ++N + +QC
Sbjct: 237 -ELKDSMSMMAKNFSRAMKRVEKRGFA---------RNQGSDRDRDRDRDRNSKRSEIQC 286
Query: 305 HECEGYGHIKIECASFLKK*KKSLT----------------------VSWSDDDGSKGDG 342
HEC+GYGHIK EC S +K K ++ SD D D
Sbjct: 287 HECQGYGHIKAECPSLKRKDLKCSECRGIGHTKFDCIGSKSKPDRSYIAESDSDSDDEDS 346
Query: 343 ERESIKHVAALIGRVFSD---AESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITN 399
E E +K + +G + D ++S ++ ++ +S +D D+ + +
Sbjct: 347 E-EDVKGFVSFVGIIEDDNVSSDSSDSEVGCEKEEISADDESDVEMDVDGEFRKLYENWL 405
Query: 400 NLEEERVGYLEKNSELNSEVRMLNSQLSNVMKQVK 434
L +E+V +LE+ ++ ++ L +L+ V Q+K
Sbjct: 406 VLSKEKVIWLEEKVKVQEQIEQLKGELA-VANQIK 439
>UniRef100_Q9C5V1 Gag/pol polyprotein [Arabidopsis thaliana]
Length = 1643
Score = 210 bits (535), Expect = 6e-53
Identities = 135/455 (29%), Positives = 236/455 (51%), Gaps = 45/455 (9%)
Query: 5 KEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLK 64
KE V +L+ NY +WK +M ++ + + IA GW+ PV +G +VLK
Sbjct: 5 KEFVAVGKAIMLEKGNYGHWKVKMRALIRGLGKEAWIATSVGWKAPVVKGENGE--DVLK 62
Query: 65 P*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKS 124
++WT AE+ A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK
Sbjct: 63 TEDQWTDAEEAKATANSRALSLIFNSVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKR 122
Query: 125 AKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFD 184
++ +L +++ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+
Sbjct: 123 SRIDMLASQFENLTMDESENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFE 182
Query: 185 MKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDD 244
K TA+ + D + +E++G LQ +E+ + S KG+A A S E +E+Q
Sbjct: 183 SKRTAMGTSLDTDTIDFEEVVGMLQAYELEITSGKGGYSKGVALAVSSEKNEIQ------ 236
Query: 245 EDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNFQYKGVQC 304
++ +S++++ + F + +K+ +KR + NQ S R ++N + +QC
Sbjct: 237 -ELKDSMSMMAKNFSRAMKRVEKRGFA---------RNQGSDRDRDRDRDRNSKRSEIQC 286
Query: 305 HECEGYGHIKIECASFLKK*KKSLT----------------------VSWSDDDGSKGDG 342
HEC+GYGHIK EC S +K K ++ SD D D
Sbjct: 287 HECQGYGHIKAECPSLKRKDLKCSECRGIGHTKFDCIGSKSKPDRSYIAESDSDSDDEDS 346
Query: 343 ERESIKHVAALIGRVFSD---AESCSEDLAYDELVVSYKRLNDKNTDICKQLEEQKNITN 399
E E +K + +G + D ++S ++ ++ +S +D D+ + +
Sbjct: 347 E-EDVKGFVSFVGIIEDDNVSSDSSDSEVGCEKEEISADDESDVEMDVDGEFRKLYENWL 405
Query: 400 NLEEERVGYLEKNSELNSEVRMLNSQLSNVMKQVK 434
L +E+V +LE+ ++ ++ L +L+ V Q+K
Sbjct: 406 VLSKEKVIWLEEKVKVQEQIEQLKGELA-VANQIK 439
>UniRef100_Q9XEC0 Putative transposon protein [Arabidopsis thaliana]
Length = 1008
Score = 202 bits (514), Expect = 2e-50
Identities = 138/476 (28%), Positives = 247/476 (50%), Gaps = 51/476 (10%)
Query: 5 KEGGFVNTPPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLK 64
KE V +L+ NY +WK +M ++ + + IA GW+ PV DG +VLK
Sbjct: 5 KEFVAVGKTIMLEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGE--DVLK 62
Query: 65 P*EEWTAAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKS 124
++W AE+ A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK
Sbjct: 63 TKDQWNDAEEAKAKANSRALSLIFNFVNQNQFKRIQNCESAKEAWDKLAKAYEGTSSVKR 122
Query: 125 AKFQLLTTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFD 184
++ +L +++ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+
Sbjct: 123 SRIDMLASQFENLSMEETENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFE 182
Query: 185 MKVTAIEEARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDD 244
K TA+ + D S +E++G LQ +E+ + S KG+A A+S + +E+Q
Sbjct: 183 SKRTAMGTSLDTDSIDFEEVVGMLQAYELEITSGKGGYSKGLALAASAKKNEIQ------ 236
Query: 245 EDMTESLTLLGRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKM----------VRTDE 294
++ ++++++ + F + +++ +K+ Q D ++++ ++ +
Sbjct: 237 -ELKDTMSMMAKDFSRAMRRVEKKGFGRNQGTDRYRDRSSKRDEIQCHECQGYGHIKAEC 295
Query: 295 KNFQYKGVQCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALI 354
+ + K ++C EC G GH K +C K KS + S S+ D + GD E + IK + +
Sbjct: 296 PSLKRKDLKCSECNGLGHTKFDCVGSKSKPDKSCS-SESESDSNDGDSE-DYIKGFVSFV 353
Query: 355 GRV-------FSDAESCSEDLAYDE---------LVVSYKRLNDKNTDICKQ----LEEQ 394
G + S+A+ ED + DE + +++L D + K+ LEE+
Sbjct: 354 GIIEEKDESSDSEADGEDEDNSADEDSDIEKDVNINEEFRKLYDSWLMLSKEKVAWLEEK 413
Query: 395 ---KNITNNLEEERVGYLEKNSEL-------NSEVRMLNSQLSNVMKQVKMMAART 440
+ +T L+ E +KNSEL + R L+ +LS+ K + M+ + T
Sbjct: 414 LKVQELTEKLKGELTAANQKNSELIQKCSVAEEKNRELSQELSDTRKNIHMLNSGT 469
>UniRef100_Q9ZV83 Putative gag-protease polyprotein [Arabidopsis thaliana]
Length = 627
Score = 148 bits (374), Expect = 3e-34
Identities = 124/427 (29%), Positives = 200/427 (46%), Gaps = 53/427 (12%)
Query: 15 LLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAED 74
LLD Y YWK M+ ++RG E DG + KP WTA E
Sbjct: 15 LLDTKRYGYWKVCMT------------QIIRGQE-------DG--FKITKPKANWTAEEK 53
Query: 75 ELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKY 134
+ N++A+ A+FN VD++ FKLI+ C AK + L+ +HEGT+ VK + + T++
Sbjct: 54 LQSKFNARAMKAIFNGVDEDEFKLIQGCKSAKQAWDTLQKSHEGTSSVKRTRLDHIATQF 113
Query: 135 ENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEAR 194
E L+M DE I + I +AN E G+ D+KLV+K+LR LP +F + A
Sbjct: 114 EYLKMEPDEKIVKFSSKISALANEAEVMGKTYKDQKLVKKLLRCLPPKFAAHKAVMRVAG 173
Query: 195 DISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTLL 254
+ +L+G L+ E+ + K K IAF + ++ Q ++ + + LL
Sbjct: 174 NTDKISFVDLVGMLKLEEMKADQDKVKPSKNIAFNADQGSEQFQ-------EIKDGMALL 226
Query: 255 GRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNF-QYKGVQCHECEGYGHI 313
R F K +K R ID + S+ + R++ + + K +QC+EC G+GHI
Sbjct: 227 ARNFGKALK-------------RVEIDGERSRGRFSRSENDDLRKKKEIQCYECGGFGHI 273
Query: 314 KIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALIGRVFSDAESCSED------ 367
K EC + K K+ + +K + +S +LI FSD+ES E
Sbjct: 274 KPECP--ITKRKEMKCLKCKGVGHTKFECPNKSKLKEKSLIS--FSDSESDDEGEELLNF 329
Query: 368 LAYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSEVRMLNSQLS 427
+A+ S K ++D ++D C + K+ L + V + +L E L ++L+
Sbjct: 330 VAFMASSDSSKFMSDTDSD-CDEELNPKDKYRVLYDSWVQLSKDKLKLVKEKLTLEAKLA 388
Query: 428 NVMKQVK 434
NV + K
Sbjct: 389 NVSTEDK 395
>UniRef100_Q9SKW9 F5J5.1 [Arabidopsis thaliana]
Length = 1463
Score = 146 bits (369), Expect = 1e-33
Identities = 116/433 (26%), Positives = 204/433 (46%), Gaps = 53/433 (12%)
Query: 15 LLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAED 74
LLD Y YWK RM+ ++ AV GWE P L DG + KP WTA E
Sbjct: 15 LLDTKRYGYWKVRMTQIIRGQGEDAWTAVEEGWEPPFDLTEDG--FKITKPKANWTAEEK 72
Query: 75 ELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKY 134
+ N++A+NA+ N +D++ FKLI+ C AK + L+ +HEGT+ VK + + T++
Sbjct: 73 LQSKFNARAMNAIVNGIDEDEFKLIQGCKSAKQAWDTLQKSHEGTSSVKRTRLDHIATQF 132
Query: 135 ENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEAR 194
E L+M E+I + I +AN E G+ D+KLV+K+LR LP +F + A
Sbjct: 133 EYLKMEPYETIVKFSSKISALANEAEVLGKTYKDQKLVKKLLRCLPPKFPAHKAVMRVAG 192
Query: 195 DISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTLL 254
+ +L+G L++ E+ + K K IAF + ++ Q + + + LL
Sbjct: 193 NTDKISFVDLVGMLKSEEMEPDQDKVKPSKNIAFNADQGSEQFQ-------QIKDGMALL 245
Query: 255 GRQFKKIVKQYDKRPRSIGQNIRPNIDNQPSKEKMVRTDEKNF-QYKGVQCHECEGYGHI 313
R F K +K+ + R ++ N D + S+ + R++ + + K +QC+
Sbjct: 246 ARNFGKALKRVE-RGQNRDSTSWSNKDGETSRGRFSRSENDDLGKKKEIQCY-------- 296
Query: 314 KIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKHVAALI----GRVFSDAES-CSEDL 368
DD S +GE E + VA + +V SD +S C +++
Sbjct: 297 --------------------DDPESDDEGE-ELLNFVAFMASSDSSKVMSDTDSDCDQEV 335
Query: 369 ----AYDELVVSYKRLNDKNTDICKQLEEQKNITNNLEEERVGYLEKNSELNSE-VRMLN 423
Y L S+ +L DK ++E + + +++ +L++ + + ++L
Sbjct: 336 NPKDEYRVLYDSWMQLKDKQKLSGITVDEN---SQDYYQKKFDWLQEECHMERDRAKLLE 392
Query: 424 SQLSNVMKQVKMM 436
+L++ KQ++M+
Sbjct: 393 RELNDKHKQIRML 405
>UniRef100_Q9ZUF5 Copia-like retroelement pol polyprotein [Arabidopsis thaliana]
Length = 916
Score = 145 bits (366), Expect = 2e-33
Identities = 79/227 (34%), Positives = 135/227 (58%), Gaps = 2/227 (0%)
Query: 15 LLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAED 74
+L+ NY +WK +M ++ + + IA GW+ PV DG +VLK ++W AE+
Sbjct: 27 ILEKGNYGHWKVKMRALIRGLGKEAWIATSIGWKAPVIKGEDGE--DVLKTEDQWNDAEE 84
Query: 75 ELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTKY 134
A NS+AL+ +FN V++N FK I+ C AK+ + L A+EGT+ VK ++ +L +++
Sbjct: 85 AKATANSRALSLIFNSVNQNQFKQIQNCESAKEAWDKLAKAYEGTSSVKRSRIDMLASQF 144
Query: 135 ENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEAR 194
ENL M + E+I+++ I IA+ + G+K D+KLV+K+LR LP RF+ K TA+ +
Sbjct: 145 ENLTMEETENIEEFSGKISAIASEAHNLGKKYKDKKLVKKLLRCLPSRFESKRTAMGTSL 204
Query: 195 DISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQ 241
D +S +E++G Q +E+ + S G A S++ +L+ ++
Sbjct: 205 DTNSIDFEEVVGMFQAYELEITSGKGGYGHIKAECPSLKRKDLKCSE 251
Score = 54.3 bits (129), Expect = 7e-06
Identities = 48/181 (26%), Positives = 87/181 (47%), Gaps = 32/181 (17%)
Query: 290 VRTDEKNFQYKGVQCHECEGYGHIKIECASFLKK*KKSLTVSWSDDDGSKGDGERESIKH 349
++ + + + K ++C EC+G GHIK +C K +S + S S+ D + GD E + IK
Sbjct: 235 IKAECPSLKRKDLKCSECKGLGHIKFDCVGSKSKPDRSCS-SESESDSNDGDSE-DYIKG 292
Query: 350 VAALIGRV-------FSDAESCSEDLAYDE---------LVVSYKRLNDKNTDICKQ--- 390
+ +G + S+A+ ED + DE + +++L D + K+
Sbjct: 293 FVSFVGIIEEKDESSDSEADGEDEDNSADEDSDIEKDVKINEEFRKLYDSWLMLSKEKVA 352
Query: 391 -LEEQ---KNITNNLEEERVGYLEKNSELNSEV-------RMLNSQLSNVMKQVKMMAAR 439
LEE+ + +T L+ E +KNSEL + R L+ +LS+ K++ M+ +
Sbjct: 353 WLEEKLKVQELTEKLKGELTAANQKNSELTQKCSVAEEKNRELSQELSDTRKKIHMLNSG 412
Query: 440 T 440
T
Sbjct: 413 T 413
>UniRef100_Q9XEB1 Putative transposon protein [Arabidopsis thaliana]
Length = 590
Score = 121 bits (303), Expect = 5e-26
Identities = 73/248 (29%), Positives = 125/248 (49%), Gaps = 5/248 (2%)
Query: 13 PPLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAA 72
P LD +Y YWK + ++ +D AV GW P T D V K EW A
Sbjct: 345 PLKLDAEHYGYWKVLIKRSIQSIDMDAWFAVEDGWMPPTT--KDAKRDIVSKSRTEWIAD 402
Query: 73 EDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTT 132
E A NS+AL+ +F + +N F ++ C+ AK+ EIL+++ E T VK + +L +
Sbjct: 403 EKTAANHNSQALSVIFGSLLRNKFTQVQGCLSAKEVWEILQVSFECTNNVKRTRLDMLAS 462
Query: 133 KYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEE 192
++ENL M +ES+ D++ + I G+ D+K+V+K LRSLP +F +AI+
Sbjct: 463 EFENLTMEAEESVDDFNGKLSSITQEAVVLGKTYKDKKMVKKFLRSLPDKFQSHKSAIDV 522
Query: 193 ARDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLT 252
+ + K D+++G +Q ++ K + + ++E D+ V ++ +SL
Sbjct: 523 SLNSDQLKFDQVVGMMQAYD---TDKEEILNSYATYFGAIEDDDHTVEEDAQMGTIKSLI 579
Query: 253 LLGRQFKK 260
L+ +K
Sbjct: 580 LIQSDSEK 587
Score = 58.2 bits (139), Expect = 5e-07
Identities = 29/94 (30%), Positives = 54/94 (56%)
Query: 133 KYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEE 192
+YENL+M + ++I + ++++ N GE+ SD ++V+KIL SLPKRFD+ V +++
Sbjct: 97 EYENLKMKESDNINTFMTKLIEMGNQLRVHGEEKSDYQIVQKILISLPKRFDIIVAMMKQ 156
Query: 193 ARDISSFKVDELIGSLQNFEITVNSKNDKKGKGI 226
+D++S + + + KK KG+
Sbjct: 157 TKDLTSLSAGKWCDVCERKNHNESDCWMKKNKGV 190
>UniRef100_Q9M2D1 Copia-type polyprotein [Arabidopsis thaliana]
Length = 1352
Score = 111 bits (278), Expect = 4e-23
Identities = 81/310 (26%), Positives = 151/310 (48%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E + KK + IA ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYE-----EKKKKKEDIA----EQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>UniRef100_Q9C536 Copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1320
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IVEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>UniRef100_Q9SXB2 T28P6.8 protein [Arabidopsis thaliana]
Length = 1352
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IVEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>UniRef100_Q9M197 Copia-type reverse transcriptase-like protein [Arabidopsis thaliana]
Length = 1272
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IVEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>UniRef100_Q9C739 Copia-type polyprotein, putative [Arabidopsis thaliana]
Length = 1352
Score = 110 bits (274), Expect = 1e-22
Identities = 79/310 (25%), Positives = 147/310 (46%), Gaps = 32/310 (10%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W RM L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLRMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGADQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKNDKKGKGIAFASSVELDELQVNQEDDEDMTESLTL 253
+D+ + +++L+GSLQ +E K D ++ +Q+ +E++ +
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED---------IIEQVLNMQITKEENGQSYQ---- 226
Query: 254 LGRQFKKIVKQYDKRPRSIGQNIRPNIDN------QPSKEKMVRTDEKNFQYKGVQCHEC 307
R+ V+ + G+ RP+ DN S+ + + + V+C+ C
Sbjct: 227 --RRGGGQVRGRGRGGYGNGRGWRPHEDNTNQRGENSSRGRGKGHPKSRYDKSSVKCYNC 284
Query: 308 EGYGHIKIEC 317
+GH EC
Sbjct: 285 GKFGHYASEC 294
>UniRef100_Q9SFE1 T26F17.17 [Arabidopsis thaliana]
Length = 1291
Score = 102 bits (254), Expect = 2e-20
Identities = 61/207 (29%), Positives = 110/207 (52%), Gaps = 11/207 (5%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGWEHPVTLYADGNMTNVLKP*EEWTAAE 73
P+L NYD W +M L D V +G+ P +G+++ K +
Sbjct: 11 PVLTKSNYDNWSLQMKAILGAHDVWE--IVEKGFIEPEN---EGSLSQTQKDGLRDSRKR 65
Query: 74 DELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLLTTK 133
D+ KAL ++ +D++ F+ + + AK+ E L+ +++G +VK + Q L +
Sbjct: 66 DK------KALCLIYQGLDEDTFEKVVEATSAKEAWEKLRTSYKGVDQVKKVRLQTLRGE 119
Query: 134 YENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAIEEA 193
+E L+M + E + DY +L + N+ + GEK+ D +++ K+LRSL +F+ VT IEE
Sbjct: 120 FEALQMKEGELVSDYFSRVLTVTNNLKRNGEKLDDVRIMEKVLRSLDLKFEHIVTVIEET 179
Query: 194 RDISSFKVDELIGSLQNFEITVNSKND 220
+D+ + +++L+GSLQ +E K D
Sbjct: 180 KDLEAMTIEQLLGSLQAYEEKKKKKED 206
>UniRef100_Q9LH44 Copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1499
Score = 101 bits (252), Expect = 4e-20
Identities = 63/215 (29%), Positives = 113/215 (52%), Gaps = 18/215 (8%)
Query: 14 PLLDGLNYDYWKSRMSVFLKFVDNKT*IAVLRGW---EHPVTLYADGNMTNVLKP*EEWT 70
P+ +G +Y +WK +M LK + W E+ VT + + L T
Sbjct: 10 PIFNGESYGFWKIKMITILK---------TRKLWDVIENGVTSNSSPETSPAL------T 54
Query: 71 AAEDELALGNSKALNALFNVVDKNMFKLIKQCIMAKDTLEILKIAHEGTTKVKSAKFQLL 130
D+ + + AL L + V ++F I A + L++ +G+++VK Q L
Sbjct: 55 RERDDQVMKDMMALQILQSAVSDSIFPRIAPASSATEAWNALEMEFQGSSQVKMINLQTL 114
Query: 131 TTKYENLRMLDDESIQDYHLNILDIANSFESAGEKISDEKLVRKILRSLPKRFDMKVTAI 190
+YENL+M + E+I D+ +++++N GE+ SD ++V+KIL S+P++FD V +
Sbjct: 115 RREYENLKMEEGETINDFTTKLINLSNQLRVHGEEKSDYQVVQKILISVPQQFDSIVGVL 174
Query: 191 EEARDISSFKVDELIGSLQNFEITVNSKNDKKGKG 225
E+ +D+S+ V ELIG+L+ E +N + D+ +G
Sbjct: 175 EQTKDLSTLSVTELIGTLKAHERRLNLREDRINEG 209
Database: uniref100
Posted date: Jan 5, 2005 1:24 AM
Number of letters in database: 848,049,833
Number of sequences in database: 2,790,947
Lambda      K        H
 0.318    0.135    0.378
Gapped
Lambda      K        H
 0.267    0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 699,604,159
Number of Sequences: 2790947
Number of extensions: 29641089
Number of successful extensions: 120353
Number of sequences better than 10.0: 803
Number of HSP's better than 10.0 without gapping: 212
Number of HSP's successfully gapped in prelim test: 606
Number of HSP's that attempted gapping in prelim test: 118606
Number of HSP's gapped (non-prelim): 2174
length of query: 441
length of database: 848,049,833
effective HSP length: 130
effective length of query: 311
effective length of database: 485,226,723
effective search space: 150905510853
effective search space used: 150905510853
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)
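
The effective lengths and the effective search space reported above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is not part of the BLAST output; it assumes the usual convention that the effective HSP length is subtracted once from the query and once per database sequence. All variable names are illustrative.

```python
# Values copied from the statistics block of this report.
query_length = 441
database_letters = 848_049_833
database_sequences = 2_790_947
effective_hsp_length = 130

# Assumption: the length adjustment is removed once from the query
# and once from every sequence in the database.
effective_query_length = query_length - effective_hsp_length
effective_database_length = (
    database_letters - database_sequences * effective_hsp_length
)

# The search space is the product of the two effective lengths.
effective_search_space = effective_query_length * effective_database_length

print("effective length of query:   ", effective_query_length)
print("effective length of database:", effective_database_length)
print("effective search space:      ", effective_search_space)
```

Under that assumption the three printed numbers should agree with the "effective length of query", "effective length of database", and "effective search space" lines of the report.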
Medicago: description of AC139747.1
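
The bit values printed in parentheses in this report can be related to the raw scores through the standard Karlin-Altschul relations, S' = (Lambda * S - ln K) / ln 2 for scores and E = (search space) * 2^(-S') for expectation values. The following sketch is an illustration only, not BLAST's own code. Which parameter set (gapped or ungapped) applies to which line is an assumption inferred from the printed values, and the function names are hypothetical.

```python
import math

# (Lambda, K) pairs copied from the report; the printed values are rounded.
UNGAPPED = (0.318, 0.135)
GAPPED = (0.267, 0.0410)
SEARCH_SPACE = 150_905_510_853  # "effective search space" from the report


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits: (lam * raw - ln k) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert a raw X-dropoff value to bits (no K term): lam * raw / ln 2."""
    return lam * raw / math.log(2)


def expect(bits, search_space=SEARCH_SPACE):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Assumption: S1 and X1 use the ungapped parameters, the rest the gapped ones.
print("S1:", round(bit_score(41, *UNGAPPED), 1))
print("S2:", round(bit_score(76, *GAPPED), 1))
print("X1:", round(dropoff_bits(16, UNGAPPED[0]), 1))
print("X2:", round(dropoff_bits(38, GAPPED[0]), 1))
print("X3:", round(dropoff_bits(64, GAPPED[0]), 1))

# Top hit (UniRef100_Q84VI0): raw score 1140, reported as 443 bits, e-123.
print("top hit bits:", bit_score(1140, *GAPPED))
print("top hit E:   ", expect(443))
```

Because Lambda and K are printed with only three significant digits, the recomputed bit score of the top hit should land close to, but not exactly on, the reported 443 bits. The E-value computed from the reported bit score should fall in the same order of magnitude as the reported e-123.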