
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146709.5 + phase: 0 /pseudo
(1304 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsi... 229 4e-58
emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana] gi... 225 9e-57
gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsi... 224 1e-56
ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultiv... 220 2e-55
dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsi... 218 8e-55
ref|NP_916849.1| retrovirus-related pol polyprotein from transpo... 218 1e-54
gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cult... 214 1e-53
ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cu... 210 3e-52
gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-... 210 3e-52
emb|CAA32025.1| unnamed protein product [Nicotiana tabacum] gi|1... 209 5e-52
gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsi... 209 6e-52
gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-... 206 3e-51
ref|XP_470528.1| Putative retrovirus-related pol polyprotein [Or... 205 9e-51
ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sa... 204 2e-50
emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana] ... 201 1e-49
gb|AAK29467.1| polyprotein-like [Lycopersicon chilense] 199 4e-49
ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cu... 196 4e-48
gb|AAP53029.1| putative retrotransposon-related protein [Oryza s... 193 3e-47
ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cu... 191 1e-46
dbj|BAA97536.1| retroelement pol polyprotein-like [Arabidopsis t... 186 6e-45
>gb|AAD32759.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301696|pir||F84486 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1356
Score = 229 bits (585), Expect = 4e-58
Identities = 137/391 (35%), Positives = 219/391 (55%), Gaps = 46/391 (11%)
Query: 7 DIEKFTGSNDFGLWKVKMRA---ILIQQKCVEALKGEAQMSAHLTPAEKT---------- 53
++EKF G D+ +WK K+ A IL ++ + + + L +++
Sbjct: 7 EVEKFDGRGDYTMWKEKLLAHMDILGLNTALKESESTGEKKSVLDESDEDYEEKLEKFEA 66
Query: 54 --EMNDKAVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYR 111
E KA SAI+L + D+VLR++ +E+TA +M LD LYM+K+L +R KQ+LY ++
Sbjct: 67 LEEKKKKARSAIVLSVTDRVLRKIKKESTAAAMLLALDKLYMSKALPNRIYPKQKLYSFK 126
Query: 112 MVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGK-EGT 170
M E+ + + EF +II DL N++V + DED+ + LL ALP++F+ KDT+ Y +
Sbjct: 127 MSENLSVEGNIDEFLQIITDLENMNVIISDEDQAILLLTALPKAFDQLKDTLKYSSGKSI 186
Query: 171 ITLEEVQAALRTKEL---TKFKELKVDDSG---EGLNISRGRSHNKGKGKGKNSRSKSRS 224
+TL+EV AA+ +KEL + K +KV G + N ++G+ KGKGKGK +SK +
Sbjct: 187 LTLDEVAAAIYSKELELGSVKKSIKVQAEGLYVKDKNENKGKGEQKGKGKGKKGKSKKKP 246
Query: 225 KGDGNKTQYKCFICHNPGHFKKDCPERK------------DNGGGESSVQIASKDEGYES 272
C+ C GHF+ CP + ++ GG+ ++ A+ GY
Sbjct: 247 ---------GCWTCGEEGHFRSSCPNQNKPQFKQSQVVKGESSGGKGNLAEAA---GYYV 294
Query: 273 AGALTVTSWEPEKSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIR 332
+ AL+ T E W+LD+GCSYH+ ++E+F GG VR+GN +++G+GTIR
Sbjct: 295 SEALSSTEVHLEDEWILDTGCSYHMTYKREWFHEFNEDAGGSVRMGNKTVSRVRGVGTIR 354
Query: 333 LKMFDDRDFLLKNVRYIPELKRNLISISMFD 363
+K D +L NVRYIP++ RNL+S+ F+
Sbjct: 355 VKNSDGLTIVLTNVRYIPDMDRNLLSLGTFE 385
>emb|CAB75481.1| copia-like polyprotein [Arabidopsis thaliana]
gi|11278366|pir||T47492 copia-like polyprotein -
Arabidopsis thaliana
Length = 1363
Score = 225 bits (573), Expect = 9e-57
Identities = 142/400 (35%), Positives = 225/400 (55%), Gaps = 49/400 (12%)
Query: 3 GSKWDIEKFTGSNDFGLWKVKM---------RAILIQQKCVEALKGEAQMSAHLTPAEKT 53
G++ ++EKF G D+ +WK K+ A+L + + + +++ S E+
Sbjct: 3 GARIEVEKFDGRGDYTMWKEKLLAHIDMLGLSAVLRESETPMGKERDSEKSDEDEKEERE 62
Query: 54 EMND------KAVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQL 107
+M KA S I+L + D+VLR++ +E +A +M LD LYM+K+L +R LKQ+L
Sbjct: 63 KMEAFEEKKRKARSTIVLSVSDRVLRKIKKETSAAAMLEALDRLYMSKALPNRIYLKQKL 122
Query: 108 YFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGK 167
Y ++M E+ I + EF I+ DL N++V + DED+ + LL +LP+ F+ KDT+ Y
Sbjct: 123 YSFKMSENLSIEGNIDEFLHIVADLENLNVLVSDEDQAILLLMSLPKPFDQLKDTLKYSS 182
Query: 168 EGTI-TLEEVQAALRTKELTKFKELK--VDDSGEGLNI-----SRGRSHNKGKGKGKNSR 219
T+ +L+EV AA+ ++EL +F +K + EGL + +RGRS K KGKGK S+
Sbjct: 183 GKTVLSLDEVAAAIYSREL-EFGSVKKSIKGQAEGLYVKDKAENRGRSEQKDKGKGKRSK 241
Query: 220 SKSRSKGDGNKTQYKCFICHNPGHFKKDCPER-----KDNGG--GESSVQIASKDEG--- 269
SKS+ C+IC GH K CP + K+ G GESS + EG
Sbjct: 242 SKSKR---------GCWICGEDGHLKSTCPNKNKPQFKNQGSNKGESSGGKGNLVEGSVN 292
Query: 270 -YESAG-----ALTVTSWEPEKSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKAC 323
ESAG AL+ T E W++D+GC YH+ ++E+ E + + GG VR+GN
Sbjct: 293 FVESAGMFVSEALSSTDIHLEDEWIMDTGCIYHMTHKREWLEDFDEEAGGSVRMGNKSIS 352
Query: 324 KIQGMGTIRLKMFDDRDFLLKNVRYIPELKRNLISISMFD 363
+++G+GT+R+ + L+NVRYIP++ RNL+S+ F+
Sbjct: 353 RVKGVGTVRIVNDNGLTVTLQNVRYIPDMDRNLLSLGTFE 392
>gb|AAD23679.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25412027|pir||G84599 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 838
Score = 224 bits (571), Expect = 1e-56
Identities = 137/380 (36%), Positives = 213/380 (56%), Gaps = 28/380 (7%)
Query: 7 DIEKFTGSNDFGLWKVKMRAI---------LIQQKCVEALKGEAQMSAHLTPAEKT---E 54
++EK G D+ LWK K+ A L + + +E + A+ + LT E E
Sbjct: 7 EVEKLDGEGDYVLWKEKLLAHIELLGLLEGLEEDEAIEEEESTAETDSLLTKTEDKVLKE 66
Query: 55 MNDKAVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVE 114
KA S +IL LG+ VLR+V +E TA M LD L+M KSL +R LKQ+LY Y+M +
Sbjct: 67 KRGKARSTVILSLGNHVLRKVIKEKTAAGMIRVLDKLFMAKSLPNRIYLKQRLYGYKMSD 126
Query: 115 SKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLE 174
S I E + +F K+I DL N+ V++ DED+ + LL +LP+ F+ KDT+ YGK T+ L+
Sbjct: 127 SMTIEENVNDFFKLISDLENVKVSVPDEDQAIVLLMSLPKQFDQLKDTLKYGKT-TLALD 185
Query: 175 EVQAALRTKELTKFKELK-VDDSGEGLNI-SRGRSHNKGKGKGKNSRSKSRSKGDGNKTQ 232
E+ A+R+K L K + +S + L + RGRS + K +N +S+SRSK K
Sbjct: 186 EITGAIRSKVLELGASGKMLKNSSDALFVQDRGRSEKRDKSSERN-KSQSRSKSREKKV- 243
Query: 233 YKCFICHNPGHFKKDC---PERKDNGGGESSVQIASKDEGYESAGALTV------TSWEP 283
C++C GHFKK C E+ G + ++ A AL V + E
Sbjct: 244 --CWVCGKEGHFKKQCYVWKEKNKKGNNSEKGESSNVIGQAADAAALAVREESNADNQEV 301
Query: 284 EKSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLL 343
+ W++D+GCS+H+ PR+++F + + G V++ N +I+G+G+IR++ D+ LL
Sbjct: 302 DNEWIMDTGCSFHMTPRRDWFVEFDESQTGRVKMANQTYSEIKGIGSIRIQNDDNTTVLL 361
Query: 344 KNVRYIPELKRNLISISMFD 363
KNVRY+P + +NLIS+ +
Sbjct: 362 KNVRYVPSMSKNLISMGTLE 381
>ref|XP_474090.1| OSJNBa0033G05.13 [Oryza sativa (japonica cultivar-group)]
gi|38344889|emb|CAD41912.2| OSJNBa0033G05.13 [Oryza
sativa (japonica cultivar-group)]
Length = 1181
Score = 220 bits (561), Expect = 2e-55
Identities = 131/379 (34%), Positives = 217/379 (56%), Gaps = 27/379 (7%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L QQ+ +AL G + + + EK + + KA+S I
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQELDDALSGFDKRTQDWSNDEK-KRDRKAMSYIH 63
Query: 65 LCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 124
L L + +L+EV +E TA +W KL+ + MTK L + LKQ+L+ +++ + +M+ L+
Sbjct: 64 LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSA 123
Query: 125 FNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTKE 184
F +I+ DL +++V +++D L LLC+LP S+ NF+DT+LY ++ T+TL+EV AL KE
Sbjct: 124 FKEIVADLESMEVKYDEKDLALILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHAKE 182
Query: 185 LTKFKELKVDDSGEGLNISRGRSHNKGKGKGKNSRSKSRSKGDG-------NKTQYK-CF 236
++K EG N +G + KN+ +KSR K ++ +YK C
Sbjct: 183 -----KMKKMVPSEGSNSQAEGLVVRGSQQEKNTNNKSRDKSSSSYRGRSKSRGRYKSCK 237
Query: 237 ICHNPGHFKKDC--PERKDNGGGESSVQIASKDEGY------ESAGALTVTSW----EPE 284
C GH C + KD G+ + ++EG E + A + ++ +
Sbjct: 238 YCKRDGHDISKCWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS 297
Query: 285 KSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLK 344
W+LD+ C+YH+CP +++F T E+ +GG V +G++ C++ G+GT+++KMFD L
Sbjct: 298 DQWILDTACTYHMCPNRDWFATYEVVQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLS 357
Query: 345 NVRYIPELKRNLISISMFD 363
+VR+IP LKR+LIS+ D
Sbjct: 358 DVRHIPNLKRSLISLCTLD 376
>dbj|BAB09923.1| copia-like retrotransposable element [Arabidopsis thaliana]
Length = 1342
Score = 218 bits (556), Expect = 8e-55
Identities = 140/388 (36%), Positives = 211/388 (54%), Gaps = 37/388 (9%)
Query: 7 DIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKT------------- 53
++EKF G D+ LWK K+ A + +E L E + + E +
Sbjct: 7 EVEKFDGDGDYILWKEKLLAHMEMLGLLEGLGEEEEAVVEDSTTEISDGGNQDPETATSK 66
Query: 54 -------EMNDKAVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQ 106
E KA S IIL LG+ VLR+V ++ TA M LD L+M KSL +R LKQ+
Sbjct: 67 LEDKILKEKRGKARSTIILSLGNNVLRKVIKQKTAAGMIKVLDQLFMAKSLPNRIYLKQR 126
Query: 107 LYFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYG 166
LY Y+M E+ + E + +F K+I DL N+ V + DED+ + LL +LPR F+ K+T+ Y
Sbjct: 127 LYGYKMSENMTMEENVNDFFKLISDLENVKVVVPDEDQAIVLLMSLPRQFDQLKETLKYC 186
Query: 167 KEGTITLEEVQAALRTKELTKFKELK-VDDSGEGLNI-SRGRSHNKGKGKGKNSRSKSRS 224
K T+ LEE+ +A+R+K L K + ++ +GL + RGRS +GKG KN +S+S+S
Sbjct: 187 KT-TLHLEEITSAIRSKILELGASGKLLKNNSDGLFVQDRGRSETRGKGPNKN-KSRSKS 244
Query: 225 KGDGNKTQYKCFICHNPGHFKKDC---PERKDNGGGESSVQIASKDEGYESAGALTVT-- 279
KG G C+IC GHFKK C ER G + ++ A AL V+
Sbjct: 245 KGAGK----TCWICGKEGHFKKQCYVWKERNKQGSTSERGEASTVTARVTDAAALVVSRA 300
Query: 280 ----SWEPEKSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKM 335
+ +W+LD+GCS+H+ RK++ + G VR+GN+ +++G+G +R+K
Sbjct: 301 LLGFAEVTPDTWILDTGCSFHMTCRKDWIIDFKETASGKVRMGNDTYSEVKGIGDVRIKN 360
Query: 336 FDDRDFLLKNVRYIPELKRNLISISMFD 363
D LL +VRYIPE+ +NLIS+ +
Sbjct: 361 EDGSTILLTDVRYIPEMSKNLISLGTLE 388
>ref|NP_916849.1| retrovirus-related pol polyprotein from transposon TNT 1-94-like
[Oryza sativa (japonica cultivar-group)]
Length = 425
Score = 218 bits (554), Expect = 1e-54
Identities = 122/356 (34%), Positives = 210/356 (58%), Gaps = 14/356 (3%)
Query: 4 SKWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSAI 63
SK+++ KF G+ +F LW+++++ +L QQ +AL E M + + EM +A + I
Sbjct: 7 SKFEMVKFDGTGNFVLWQMRLKDLLAQQGISKAL--EETMPEKMDAGKWEEMKAQAAATI 64
Query: 64 ILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 123
L L D V+ +V E T +W+KL SLYM+KSL + LKQQLY +M E + + +
Sbjct: 65 RLSLSDSVMYQVMDEKTPKEIWDKLASLYMSKSLTSKLYLKQQLYGLQMQEESDLRKHVD 124
Query: 124 EFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTK 183
FN+++ DL+ +DVNL DEDK + LLC+LP S+E+ T+ +GK+ TI EE+ ++L +
Sbjct: 125 VFNQLVVDLSKLDVNLYDEDKAIILLCSLPPSYEHVVTTLTHGKD-TIKTEEIISSLLAR 183
Query: 184 ELTKFKELKVDDSGEGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGH 243
+L + K+ + ++ + ++ H+ G SKS KG +C+ CH GH
Sbjct: 184 DLRRSKKNEAMEASQAESLLVKAKHDHEAGV-----SKSNEKG------ARCYKCHEFGH 232
Query: 244 FKKDCPERKDNGGGESSVQIASKDEGYESAGALTVTSWEPEKSWVLDSGCSYHICPRKEY 303
+++CP K G +S+ D S LTV++ + ++W+LDS SYH+ ++E+
Sbjct: 233 IRRNCPLLKKRKDGIASLAARGDDSDSSSHEILTVSNEKSGEAWMLDSASSYHVTSKREW 292
Query: 304 FETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIPELKRNLISI 359
F + + + GVV LG++ + + G+ ++ KM+D + LL +VR++P L+++LIS+
Sbjct: 293 FFSYKSGDFGVVYLGDDTSYHVVGVDDVKFKMYDGNEVLLSDVRHVPGLRKSLISL 348
>gb|AAP54315.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37535452|ref|NP_922028.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
gi|22094359|gb|AAM91886.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1280
Score = 214 bits (546), Expect = 1e-53
Identities = 131/377 (34%), Positives = 207/377 (54%), Gaps = 23/377 (6%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L QQ +AL G + + + EK + + KA+S I
Sbjct: 40 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEKKK-DRKAMSYIH 98
Query: 65 LCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 124
L L + +L+EV +E TA +W KL+ + MTK L + LKQ+L+ +++ + +M+ L+
Sbjct: 99 LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLST 158
Query: 125 FNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTKE 184
F +I+ DL +I+V ++ED L LLC+LP S+ NF+DT+LY + T+ L+EV AL KE
Sbjct: 159 FKEIVADLESIEVKYDEEDLGLILLCSLPSSYANFRDTILYSHD-TLILKEVYDALHAKE 217
Query: 185 LTKFKELKVDD----SGEGLNISRGRSHNKGKGKGKNSRSKSRSKG-DGNKTQYK-CFIC 238
K K++ + EGL + RGR K +S S +G ++ +YK C C
Sbjct: 218 --KMKKMVPSEGSNSQAEGL-VVRGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYC 274
Query: 239 HNPGHFKKDCPERKDNGGGESSVQIASKDEGYESAGALTVTSWEPE------------KS 286
GH +C + +D K E A +T + E
Sbjct: 275 KRDGHDISECWKLQDKDKRTGKYIPKGKKEEEGKAAVVTDEKSDTELLVAYAGCAQTSDQ 334
Query: 287 WVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNV 346
W+LD+ +YH+CP +++F T E +GG V +G++ C++ G+GT+++KMFD L +V
Sbjct: 335 WILDTAWTYHMCPNRDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGYIRTLSDV 394
Query: 347 RYIPELKRNLISISMFD 363
R+IP LKR+LIS+ D
Sbjct: 395 RHIPNLKRSLISLCTLD 411
>ref|XP_475663.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475188|gb|AAT44257.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1211
Score = 210 bits (534), Expect = 3e-52
Identities = 130/379 (34%), Positives = 212/379 (55%), Gaps = 27/379 (7%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L QQ +AL G + + + EK + + K +S I
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKTMSYIH 63
Query: 65 LCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 124
L L + +L+EV +E TA +W KL+ + MTK L + LKQ+L+ +++ + +M+ L+
Sbjct: 64 LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDGSVMDHLSA 123
Query: 125 FNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTKE 184
F KI+ DL +++V ++ED L LLC+LP S+ NF+DT+LY + T+TL+EV AL KE
Sbjct: 124 FKKIVADLESMEVKYDEEDLCLILLCSLPSSYANFRDTILYSCD-TLTLKEVYDALHAKE 182
Query: 185 LTKFKELKVDDSGEGLNISRGRSHNKGKGKGKNSRSKSRSKGDG-------NKTQYK-CF 236
++K EG N +G+ + KN+ SKSR K ++ +YK C
Sbjct: 183 -----KIKKMVPSEGSNSQAEGLVVRGRQQEKNTNSKSRDKSSSSYRGRSKSRGRYKSCK 237
Query: 237 ICHNPGHFKKDC--PERKDNGGGESSVQIASKDEGY------ESAGALTVTSW----EPE 284
GH +C + KD G+ + ++EG E + A + ++ +
Sbjct: 238 YYKRDGHDISECWKLQDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTS 297
Query: 285 KSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLK 344
W+LD+ C+YH+C +++F T E +GG V +G++ C++ G+ T+++KMFD L
Sbjct: 298 DQWILDTACTYHMCLNRDWFATYEAVQGGTVLMGDDTPCEVAGVETVQIKMFDGCIRTLS 357
Query: 345 NVRYIPELKRNLISISMFD 363
+VR+IP LKR+LIS+ D
Sbjct: 358 DVRHIPNLKRSLISLCTLD 376
>gb|AAX92861.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 1373
Score = 210 bits (534), Expect = 3e-52
Identities = 137/372 (36%), Positives = 209/372 (55%), Gaps = 19/372 (5%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMR IL Q EAL + A T AE+ + KA++ I
Sbjct: 2 KFDLPLLNYDTRFSLWQVKMRGILAQTHDYDEALDNFGKRRAEWT-AEEIRKDQKALALI 60
Query: 64 ILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 123
L L + +L+E E T+ +W KL+S+ M+K L + +K +L+ +M E ++ +
Sbjct: 61 QLHLHNDILQECLTEKTSAELWLKLESICMSKDLTSKMQMKMKLFTLKMKEEDSVITHMA 120
Query: 124 EFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTK 183
EF KI+ DL +++V +DED L LLC+LP S+ NF+DT+L ++ +TL+EV AL+ K
Sbjct: 121 EFKKIVADLVSMEVKYDDEDLGLLLLCSLPNSYANFRDTILLSRD-ELTLKEVYDALQNK 179
Query: 184 ELTKF---KELKVDDSGEGLNISRGRSHNK-GKGKGKNSRSKSRSKGDGNKTQYKCFIC- 238
E K + GE L++ RGR+ N+ K + R +S+SK GNK C C
Sbjct: 180 EKMKIMVQNDGSSSSKGEALHV-RGRTENRTSNEKNYDRRGRSKSKPPGNKK--FCVYCK 236
Query: 239 ---HNPGHFKK-DCPERKDNGGGESSVQIASKDEGYESAGALTVTSW--EPEKSWVLDSG 292
HN KK ERK+ G+ SV A+ + +S L V + W+LDS
Sbjct: 237 LKNHNIDECKKVQAKERKNKKDGKVSVASAAASDD-DSGDCLVVFAGCVAGHDEWILDSA 295
Query: 293 CSYHICPRKEYFETLE-LKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIPE 351
CS+HIC ++ +F + + +++G VVR+G++ C I G+G++++K D LKNVRYIP
Sbjct: 296 CSFHICTKRNWFSSYKPVQKGDVVRMGDDNPCAIVGIGSVQIKTDDGMTRTLKNVRYIPG 355
Query: 352 LKRNLISISMFD 363
+ RNLIS+S D
Sbjct: 356 MSRNLISLSTLD 367
>emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
gi|130582|sp|P10978|POLX_TOBAC Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease]
Length = 1328
Score = 209 bits (532), Expect = 5e-52
Identities = 127/369 (34%), Positives = 203/369 (54%), Gaps = 19/369 (5%)
Query: 3 GSKWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSA 62
G K+++ KF G N F W+ +MR +LIQQ + L +++ + + +++++A SA
Sbjct: 3 GVKYEVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASA 62
Query: 63 IILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQL 122
I L L D V+ + E TA +W +L+SLYM+K+L ++ LK+QLY M E + L
Sbjct: 63 IRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHL 122
Query: 123 TEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRT 182
FN +I LAN+ V +E+EDK + LL +LP S++N T+L+GK TI L++V +AL
Sbjct: 123 NVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKT-TIELKDVTSALLL 181
Query: 183 KELTKFKELKVDDSGEGL-NISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNP 241
E + K ++ G+ L RGRS+ + S ++ +SK C+ C+ P
Sbjct: 182 NEKMR---KKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQP 238
Query: 242 GHFKKDCPERKDNGGGESSVQ-----IASKDEGYESAGALTVTSWE-------PEKSWVL 289
GHFK+DCP + G GE+S Q A+ + ++ L + E PE WV+
Sbjct: 239 GHFKRDCPNPR-KGKGETSGQKNDDNTAAMVQNNDNV-VLFINEEEECMHLSGPESEWVV 296
Query: 290 DSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYI 349
D+ S+H P ++ F + G V++GN KI G+G I +K +LK+VR++
Sbjct: 297 DTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHV 356
Query: 350 PELKRNLIS 358
P+L+ NLIS
Sbjct: 357 PDLRMNLIS 365
>gb|AAD23690.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301702|pir||E84601 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 209 bits (531), Expect = 6e-52
Identities = 139/380 (36%), Positives = 206/380 (53%), Gaps = 46/380 (12%)
Query: 7 DIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDK-------- 58
++EKF G D+ +WK K+ A L ALK E + + + TE +K
Sbjct: 7 EVEKFDGRGDYTMWKEKLMAHLDILGLSVALKEEDDLVEKVAEMQLTEEEEKEEVLRREL 66
Query: 59 -------AVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYR 111
A SAI+L + D+VLR++ +E +A +M LD LYM+K+L +R KQ+LY ++
Sbjct: 67 LEEKRRKARSAIVLSVTDRVLRKIKKEQSAAAMLGVLDKLYMSKALPNRIYQKQKLYSFK 126
Query: 112 MVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYG-KEGT 170
M E+ I + EF +II DL N +V + DED+ + LL +LP+ F+ +DT+ YG T
Sbjct: 127 MSENLSIEGNIDEFLRIIADLENTNVLVSDEDQAILLLMSLPKPFDQLRDTLKYGLGRVT 186
Query: 171 ITLEEVQAALRTKELTKFKELK-VDDSGEGLNI-----SRGRSHNKG-KGKGKNSRSKSR 223
++L+EV AA+ +KEL K + EGL + +RGR+ +G K SRSKSR
Sbjct: 187 LSLDEVVAAIYSKELELGSNKKSIKGQAEGLFVKEKTETRGRTEQRGNNNNNKKSRSKSR 246
Query: 224 SKGDGNKTQYKCFICHNPGHFKKDCPERKDNGGGESSVQIASKDEGYESAGALTVTSWEP 283
SK C+IC ++ G S+ S+ G + AL+ T
Sbjct: 247 SKKG-------CWIC-------------GESSNGSSNY---SEANGLYVSEALSSTDIHL 283
Query: 284 EKSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLL 343
E WV+D+GCSYH+ ++E+FE L GG VR+GN K++G+GTIR+K L
Sbjct: 284 EDEWVMDTGCSYHMTYKREWFEDLNEDAGGSVRMGNKTVSKVRGIGTIRVKNEAGMVVRL 343
Query: 344 KNVRYIPELKRNLISISMFD 363
NVRYIPE+ RNL+S+ F+
Sbjct: 344 TNVRYIPEMDRNLLSLGTFE 363
>gb|AAX92941.1| retrotransposon protein, putative, Ty1-copia sub-class [Oryza
sativa (japonica cultivar-group)]
Length = 2340
Score = 206 bits (525), Expect = 3e-51
Identities = 127/376 (33%), Positives = 210/376 (55%), Gaps = 21/376 (5%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L QQ +AL G + + + EK + + KA+S I
Sbjct: 212 KYDLPLLYRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTHDWSNDEK-KRDRKAMSYIH 270
Query: 65 LCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 124
L L + +L+EV +E A +W KL+ + MTK L + LKQ L+ +++ + +M+ L+
Sbjct: 271 LHLSNNILQEVLKEEIAAGLWLKLEQICMTKDLTSKMHLKQTLFLHKLQDDGSVMDHLSA 330
Query: 125 FNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTKE 184
F +II DL +++V ++ED L LLC+LP S+ NF+DT+LY ++ T+TL+EV AL KE
Sbjct: 331 FKEIIADLESMEVKYDEEDLGLILLCSLPSSYANFRDTILYSRD-TLTLKEVYDALHVKE 389
Query: 185 LTKFKELKVDD----SGEGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYK-CFICH 239
K K++ + EGL + + K + ++ S S ++ +YK C C
Sbjct: 390 --KMKKMVPSEGSNSQAEGLIVWGRQQEKNTKNQSRDKSSSSYRGRSKSRGRYKSCKYCK 447
Query: 240 NPGHFKKDC--PERKDNGGGESSVQIASKDEGY------ESAGALTVTSW----EPEKSW 287
GH +C KD G+ + ++EG E + A + ++ + W
Sbjct: 448 RDGHDIFECWKLHDKDKRTGKYVPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQTSDQW 507
Query: 288 VLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVR 347
+L++ C YH+CP +++F T E + G V +G++ C++ G+GT+++KMFD L +VR
Sbjct: 508 ILNTACIYHMCPNRDWFATYEAVQVGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVR 567
Query: 348 YIPELKRNLISISMFD 363
+IP LKR+LIS+ D
Sbjct: 568 HIPNLKRSLISLCTLD 583
>ref|XP_470528.1| Putative retrovirus-related pol polyprotein [Oryza sativa (japonica
cultivar-group)] gi|27436748|gb|AAO13467.1| Putative
retrovirus-related pol polyprotein [Oryza sativa
(japonica cultivar-group)]
Length = 556
Score = 205 bits (521), Expect = 9e-51
Identities = 117/342 (34%), Positives = 201/342 (58%), Gaps = 14/342 (4%)
Query: 18 GLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSAIILCLGDKVLREVSR 77
G +++++ +L QQ +AL E M + + EM +A + I L L D V+ +V
Sbjct: 152 GSSQMRLKDLLAQQGISKAL--EETMPEKMDVGKWVEMKAQATAIIRLSLSDFVMYQVMD 209
Query: 78 EATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTEFNKIIDDLANIDV 137
E T +W+KL SLYM+KSL + LKQQLY +M E + + + FN+++ DL+ +DV
Sbjct: 210 EKTPKEIWDKLASLYMSKSLTSKLYLKQQLYGLQMQEESDLRKHVDVFNQLVVDLSKLDV 269
Query: 138 NLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTKELTKFKELKVDDSG 197
L+DEDK + LLC+LP SFE+ T+ +GK+ T+ EE+ ++L ++L + K+ + ++
Sbjct: 270 KLDDEDKAIILLCSLPPSFEHVVTTLTHGKD-TVKTEEIISSLLARDLRRSKKNEAMEAS 328
Query: 198 EGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKDNGGG 257
+ ++ H+ G SKS+ KG +C+ CH GH +++CP K GG
Sbjct: 329 QAESLLVKAKHDHEAGV-----SKSKEKG------ARCYKCHEFGHIRRNCPLLKKRKGG 377
Query: 258 ESSVQIASKDEGYESAGALTVTSWEPEKSWVLDSGCSYHICPRKEYFETLELKEGGVVRL 317
+S+ + S LTV++ + ++W+LDS SYH+ + E+F + + + GVV L
Sbjct: 378 IASLAARGDNSDSSSHEILTVSNEKSGEAWMLDSASSYHVTSKWEWFSSYKSGDFGVVYL 437
Query: 318 GNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIPELKRNLISI 359
GN+ + ++ G+G I+ KM+D + LL +VR++P L+++LIS+
Sbjct: 438 GNDTSYRVIGVGDIKFKMYDGNEVLLSDVRHVPGLRKSLISL 479
>ref|XP_470868.1| Putative retroelement pol polyprotein [Oryza sativa]
gi|14029020|gb|AAK52561.1| Putative retroelement pol
polyprotein [Oryza sativa]
Length = 1326
Score = 204 bits (518), Expect = 2e-50
Identities = 127/372 (34%), Positives = 210/372 (56%), Gaps = 20/372 (5%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMRAIL Q + EAL+ + + AE+ + KA+ I
Sbjct: 5 KYDLPLLDYKTRFSLWQVKMRAILAQTSDLDEALESFGKKKSTEWTAEEKRKDRKALLLI 64
Query: 64 ILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 123
L L + +L+EV +E TA +W KL+S+ M+K L + +K +L+ +++ ES ++ ++
Sbjct: 65 QLHLSNDILQEVLQEKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 124
Query: 124 EFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTK 183
F +I+ DL +I+V +DED L LLC+LP S+ NF+DT+L ++ +TL EV AL+ +
Sbjct: 125 VFKEIVVDLVSIEVQFDDEDLGLLLLCSLPSSYANFRDTILLSRD-ELTLAEVYEALQNR 183
Query: 184 ELTKFKELKVDDS----GEGLNI---SRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCF 236
E K K + D+ GE L + S R++N + K S+S+ RSK G K C
Sbjct: 184 E--KMKGMVQSDASSSKGEALQVRGRSEQRTYNDSSDRDK-SQSRGRSKSRGKKF---CK 237
Query: 237 ICHNPGHFKKDC--PERKDNGGGESSVQIASKDEGYESAGALTVTSW--EPEKSWVLDSG 292
C HF ++C + K+ + + + E +S L V + W+LD+
Sbjct: 238 YCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGCVASHDEWILDTA 297
Query: 293 CSYHICPRKEYFETLE-LKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIPE 351
CS+HIC +++F + + ++ G VVR+G++ +I G+G++++K D LK+VR+IP
Sbjct: 298 CSFHICINRDWFSSYKSVQNGDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPG 357
Query: 352 LKRNLISISMFD 363
+ RNLIS+S D
Sbjct: 358 MARNLISLSTLD 369
>emb|CAB78169.1| putative retrotransposon [Arabidopsis thaliana]
gi|4539406|emb|CAB40039.1| putative retrotransposon
[Arabidopsis thaliana] gi|7444416|pir||T04181
hypothetical protein F7L13.40 - Arabidopsis thaliana
Length = 1230
Score = 201 bits (511), Expect = 1e-49
Identities = 134/387 (34%), Positives = 194/387 (49%), Gaps = 54/387 (13%)
Query: 7 DIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEK-------------T 53
++EKF G D+ LWK K+ A + AL+ +S L E+
Sbjct: 7 EMEKFDGHGDYTLWKEKLMAHMDLLGLTVALRETQSVSDPLESEEEGKESEKGDKEALME 66
Query: 54 EMNDKAVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMV 113
E KA S I+L + D+VLR+ +E TA SM LD LYM+K+L +R LKQ+LY Y+M
Sbjct: 67 EKRQKARSTIVLSVSDQVLRKSKKEKTAPSMLEALDKLYMSKALPNRIYLKQKLYSYKMQ 126
Query: 114 ESKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGK-EGTIT 172
E+ + + EF ++I DL N +V + DED+ + LL +LP+ F+ KDT+ YG T++
Sbjct: 127 ENLSVEGNIDEFLRLIADLENTNVLVSDEDQAILLLMSLPKQFDQLKDTLKYGSGRTTLS 186
Query: 173 LEEVQAALRTKELTKFKELK-VDDSGEGLNIS---RGRSHNKGKGKGKNSRSKSRSKGDG 228
++EV AA+ +KEL K + EGL + R ++ K KG RS+SRSKG
Sbjct: 187 VDEVVAAIYSKELELGSNKKSIRGQAEGLYVKDKPETRGMSEQKEKGNKGRSRSRSKG-- 244
Query: 229 NKTQYKCFICHNPGHFKKDCPER-------KDNGGGESSVQI-----ASKDEGYESAGAL 276
C+IC GHFK CP + KD G S+ GY + AL
Sbjct: 245 ---WKGCWICGEEGHFKTSCPNKGKQQNKGKDQASGSKGEAATIKGNTSEGSGYYVSEAL 301
Query: 277 TVTSWEPEKSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMF 336
T WV+D+GC+YH+ +KE+FE L GG VR+GN K +
Sbjct: 302 HSTDVNLGNEWVMDTGCNYHMTHKKEWFEELSEDAGGTVRMGNKSTSKFR---------- 351
Query: 337 DDRDFLLKNVRYIPELKRNLISISMFD 363
V+YIP++ RNL+S+ +
Sbjct: 352 ---------VKYIPDMDRNLLSMGTLE 369
>gb|AAK29467.1| polyprotein-like [Lycopersicon chilense]
Length = 1328
Score = 199 bits (507), Expect = 4e-49
Identities = 126/368 (34%), Positives = 198/368 (53%), Gaps = 16/368 (4%)
Query: 3 GSKWDIEKFTGSND-FGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVS 61
G K+++ KF G F +W+ +M+ +LIQQ +AL G+++ + + E+++KA S
Sbjct: 3 GVKYEVAKFNGDKPVFSMWQRRMKDLLIQQGLHKALGGKSKKPESMKLEDWEELDEKAAS 62
Query: 62 AIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQ 121
AI L L D V+ + E +A +W KL++LYM+K+L ++ LK+QLY M E +
Sbjct: 63 AIRLHLTDDVVNNIVDEESACGIWTKLENLYMSKTLTNKLYLKKQLYTLHMDEGTNFLSH 122
Query: 122 LTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALR 181
L N +I LAN+ V +E+EDK + LL +LP S++ T+L+GK+ +I L++V +AL
Sbjct: 123 LNVLNGLITQLANLGVKIEEEDKRIVLLNSLPSSYDTLSTTILHGKD-SIQLKDVTSALL 181
Query: 182 TKELTKFKELKVDDSGEGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNP 241
E K ++ + + SRGRS+ + S ++ +SK C+ C P
Sbjct: 182 LNE--KMRKKPENHGQVFITESRGRSYQRSSSNYGRSGARGKSKVRSKSKARNCYNCDQP 239
Query: 242 GHFKKDCPERKDNGGGESSVQ-----IASKDEGYESAGALTVTSWE------PEKSWVLD 290
GHFK+DCP K G GESS Q A+ + + L E E WV+D
Sbjct: 240 GHFKRDCPNPK-RGKGESSGQKNDDNTAAMVQNNDDVVLLINEEEECMHLAGTESEWVVD 298
Query: 291 SGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIP 350
+ SYH P ++ F + G V++GN KI G+G I K +LK+VR++P
Sbjct: 299 TAASYHATPVRDLFCRYVAGDYGNVKMGNTSYSKIAGIGDICFKTNVGCTLVLKDVRHVP 358
Query: 351 ELKRNLIS 358
+L+ NLIS
Sbjct: 359 DLRMNLIS 366
>ref|XP_469192.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|53370655|gb|AAU89150.1| integrase core domain
containing protein [Oryza sativa (japonica
cultivar-group)] gi|40538906|gb|AAR87163.1| putative
polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1322
Score = 196 bits (498), Expect = 4e-48
Identities = 121/370 (32%), Positives = 204/370 (54%), Gaps = 16/370 (4%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMRA+L Q + EAL+ + AE+ + KA+S I
Sbjct: 5 KYDLPLLDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 64
Query: 64 ILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 123
L L + +L+EV ++ TA +W KL+S+ M+K L + +K +L+ +++ ES ++ ++
Sbjct: 65 QLHLSNDILQEVLQKKTAAELWLKLESICMSKDLTSKMHIKMKLFSHKLHESGSVLNHIS 124
Query: 124 EFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTK 183
F +I+ DL +++V +DED L LLC+LP S+ NF+ T+L ++ +TL EV AL+ +
Sbjct: 125 VFKEIVADLVSMEVQFDDEDLGLLLLCSLPSSYANFRHTILLSRD-ELTLAEVYEALQNR 183
Query: 184 ELTK--FKELKVDDSGEGLNISRGRSHNKGKGKGKN---SRSKSRSKGDGNKTQYKCFIC 238
E K + GE L + RGRS + + S+S+ RSK G K C C
Sbjct: 184 EKMKGMVQSYASSSKGEALQV-RGRSEQRTYNDSNDHDKSQSRGRSKSRGKKF---CKYC 239
Query: 239 HNPGHFKKDC--PERKDNGGGESSVQIASKDEGYESAGALTVTSW--EPEKSWVLDSGCS 294
HF ++C + K+ + + + E +S L V + W+LD+ CS
Sbjct: 240 KKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSGDCLVVFAGYVASHDEWILDTACS 299
Query: 295 YHICPRKEYFETLE-LKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIPELK 353
+HIC +++F + + ++ VVR+G++ +I G+G++++K D LK+VR+IP +
Sbjct: 300 FHICINRDWFSSYKSVQNEDVVRMGDDNPREIVGIGSVQIKTHDGMTRTLKDVRHIPGMA 359
Query: 354 RNLISISMFD 363
RNLIS+S D
Sbjct: 360 RNLISLSTLD 369
>gb|AAP53029.1| putative retrotransposon-related protein [Oryza sativa (japonica
cultivar-group)] gi|37532880|ref|NP_920742.1| putative
retrotransposon-related protein [Oryza sativa (japonica
cultivar-group)] gi|22655747|gb|AAN04164.1| Putative
retrotransposon protein [Oryza sativa (japonica
cultivar-group)] gi|16905223|gb|AAL31093.1| putative
retrotransposon-related protein [Oryza sativa]
Length = 1229
Score = 193 bits (491), Expect = 3e-47
Identities = 123/372 (33%), Positives = 206/372 (55%), Gaps = 20/372 (5%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCV-EALKGEAQMSAHLTPAEKTEMNDKAVSAI 63
K+D+ F LW+VKMRA+L Q + EAL+ + AE+ + KA+S I
Sbjct: 2 KYDLPLQDYKTRFSLWQVKMRAVLAQTSDLDEALESFGKKKTTEWTAEEKRKDRKALSLI 61
Query: 64 ILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLT 123
L L + +L++V +E TA +W KL+S+ M+K L + +K +L+ +++ ES ++ ++
Sbjct: 62 QLHLSNDILQKVLQEKTAAELWFKLESICMSKDLTSKMHIKMKLFSHKLQESGSVLNHIS 121
Query: 124 EFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTK 183
F +II DL +++V +DED L LLC+LP + NF+DT+L ++ +TL EV AL+ +
Sbjct: 122 VFKEIIADLVSMEVQFDDEDLGLLLLCSLPSLYANFRDTILLSRD-ELTLAEVYEALQNR 180
Query: 184 ELTKFKELKVDDS----GEGLNI---SRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCF 236
E K K + D+ G+ L + S R++N + K S+S+ RSK G K C
Sbjct: 181 E--KMKGMVQSDASSSKGKALQVRGRSEQRTYNDSNDRDK-SQSRGRSKSRGKKF---CK 234
Query: 237 ICHNPGHFKKDC--PERKDNGGGESSVQIASKDEGYESAGALTVTSW--EPEKSWVLDSG 292
C HF ++C + K+ + + + E +SA L + W+LD+
Sbjct: 235 YCKKKNHFIEECWKLQNKEKRKSDGKASVVTSAENSDSADCLVFFAGCVASHDEWILDTA 294
Query: 293 CSYHICPRKEYFET-LELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIPE 351
C + IC +++F + ++ G VVR+G+N +I G+G++++K D LK+VR+IP
Sbjct: 295 CLFLICINRDWFSSHKSVQNGDVVRMGDNNPREIMGIGSVQIKTHDGMTRTLKDVRHIPG 354
Query: 352 LKRNLISISMFD 363
+ RNLIS+S D
Sbjct: 355 MARNLISLSTLD 366
>ref|XP_475489.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|48475213|gb|AAT44282.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1243
Score = 191 bits (485), Expect = 1e-46
Identities = 122/362 (33%), Positives = 206/362 (56%), Gaps = 16/362 (4%)
Query: 5 KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVSAII 64
K+D+ F LW+VKMRA+L QQ +AL G + + + EK + + KA+S I
Sbjct: 5 KYDLPLLDRDTRFSLWQVKMRAVLAQQDLDDALSGFDKRTQDWSNDEK-KRDRKAISYIH 63
Query: 65 LCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 124
L L + +L+EV +E TA +W KL+ + MTK L + LKQ+L+ +++ + + +M+ L+
Sbjct: 64 LHLSNNILQEVLKEETAAGLWLKLEQICMTKDLTSKMHLKQKLFLHKLQDDESVMDHLSA 123
Query: 125 FNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALRTKE 184
F +I+ DL +++V +++D L LLC+LP S+ NF+ T+LY ++ T+TL+EV A KE
Sbjct: 124 FKEIVADLESMEVKYDEDDLGLILLCSLPSSYANFRGTILYSRD-TLTLKEVYDAFHAKE 182
Query: 185 LTKFKELKVDD----SGEGLNISRGRSHNKGKGKGKNSRSKSRSKG-DGNKTQYK-CFIC 238
K K++ + EGL + RGR K +S S +G ++ +YK C C
Sbjct: 183 --KMKKMVTSEGSNSQAEGL-VVRGRQQKKNTKNQSRDKSSSSYRGRTKSRGRYKSCKYC 239
Query: 239 HNPGHFKKDCPERKDNGGGESSVQIASKDEGYESAGALTVTSWEPE-KSWVLDSGCSYHI 297
GH +C + +D + + + K + E A VT + + + V +GC+
Sbjct: 240 KRDGHDISECWKLQDK--DKRTGKYIPKGKKEEEGKAAVVTDEKSDAELLVAYAGCAQ-- 295
Query: 298 CPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDRDFLLKNVRYIPELKRNLI 357
+++F T E +GG V +G++ C++ G+GT+++KMFD L +V++IP LKR+LI
Sbjct: 296 TSDQDWFATYEALQGGTVLMGDDTPCEVAGIGTVQIKMFDGCIRTLSDVQHIPNLKRSLI 355
Query: 358 SI 359
S+
Sbjct: 356 SL 357
>dbj|BAA97536.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1338
Score = 186 bits (471), Expect = 6e-45
Identities = 116/384 (30%), Positives = 196/384 (50%), Gaps = 33/384 (8%)
Query: 9 EKFTGSNDFGLWKVKMRAILIQQKCVEALK--------------------GEAQMSAHLT 48
E+F G D+ LWK K+ A L +ALK E + H
Sbjct: 18 ERFDGRGDYTLWKRKLLAQLEVMGISDALKEKEEKKEAVETERVKVVSSSSERRREEHKK 77
Query: 49 PAEKTEMNDKAVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLY 108
+ E +KA S IIL + D +LR + E TA M + LD LY++ L+ R LK++L+
Sbjct: 78 DHSREEKENKARSVIILSVADNILRRIRTEETAAGMISVLDKLYLSDPLSSRISLKRKLF 137
Query: 109 FYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKE 168
++M E+K + E + +F +I++DL +DV + DEDK LL +LPR E K ++ Y +E
Sbjct: 138 EFKMSENKAVEENIEDFFRIVEDLEKLDVYVSDEDKAFMLLLSLPRKLEQLKYSLDYCEE 197
Query: 169 GTITLEEVQAALRTKELTKFK-ELKVDDSGEGLNISRGRSHNKGKGKGKNSRSKSRSKGD 227
+TL V A+ KEL + E + ++ + L++ R + + + ++ K + + +
Sbjct: 198 -PLTLGRVMTAIYKKELEVAQIERQTEEEEKRLSL---RERERSDYREEQAKGKEKVRSE 253
Query: 228 GNKTQYKCFICHNPGHFKKDCPERKDNGGGESSVQI-ASKDEGYESAGALTVTSWEPEKS 286
+ + C+ C GH K +C + K N + SV+ S + S G++ + S ++
Sbjct: 254 AREKKGPCWRCGQKGHVKTECFQEKKNKSRKKSVRYEESSAQSIVSGGSVFMVSEAAARA 313
Query: 287 -------WVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMFDDR 339
W+ D+GC+ H+ RKE+FE L E G V + N+ +++G+G++R+ D
Sbjct: 314 SKGSSEEWICDTGCTSHMSSRKEWFEDLVFSESGNVSMANDTTLQVKGIGSVRILNDDGT 373
Query: 340 DFLLKNVRYIPELKRNLISISMFD 363
LL NV YIP + +NLIS+ +
Sbjct: 374 TVLLTNVMYIPGMSKNLISLGTLE 397
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.357 0.158 0.567
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,816,154,689
Number of Sequences: 2540612
Number of extensions: 65196175
Number of successful extensions: 317946
Number of sequences better than 10.0: 857
Number of HSP's better than 10.0 without gapping: 482
Number of HSP's successfully gapped in prelim test: 376
Number of HSP's that attempted gapping in prelim test: 315579
Number of HSP's gapped (non-prelim): 1574
length of query: 1304
length of database: 863,360,394
effective HSP length: 140
effective length of query: 1164
effective length of database: 507,674,714
effective search space: 590933367096
effective search space used: 590933367096
T: 11
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 81 (35.8 bits)
Medicago: description of AC146709.5