
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146853.2 - phase: 0 /pseudo
(952 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japon... 275 4e-72
gb|AAG46116.1| putative copia-like retrotransposon polyprotein [... 275 4e-72
ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (jap... 274 9e-72
gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (ja... 271 6e-71
gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cult... 255 6e-66
emb|CAE05956.3| OSJNBb0088C09.15 [Oryza sativa (japonica cultiva... 242 4e-62
gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein ... 218 8e-55
emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana] 211 1e-52
gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H... 207 1e-51
gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum] 207 2e-51
emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|32692... 205 5e-51
gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 proteas... 204 1e-50
gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hop... 195 5e-48
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 194 1e-47
emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7... 194 2e-47
gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thalia... 192 6e-47
gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum] 190 2e-46
pir||F86470 probable retroelement polyprotein [imported] - Arabi... 188 6e-46
gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cult... 188 8e-46
emb|CAB10526.1| retrotransposon like protein [Arabidopsis thalia... 187 1e-45
>gb|AAP54850.1| putative gag-pol polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37536522|ref|NP_922563.1| putative
gag-pol polyprotein [Oryza sativa (japonica
cultivar-group)] gi|13310887|gb|AAG13591.2| putative
gag/pol polyprotein [Oryza sativa]
Length = 1417
Score = 275 bits (704), Expect = 4e-72
Identities = 140/351 (39%), Positives = 213/351 (59%), Gaps = 7/351 (1%)
Query: 165 QKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATYL 224
+ + +G V RLSCPY+S QNGK+ER +R+ N+ +RT+L+H++ P SFW AL+ AT+L
Sbjct: 634 RSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDYVRTMLVHSAAPLSFWAEALQTATHL 693
Query: 225 MNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGYP 284
+N P + +P ++L P+Y+HLRVFGCLC+P ++ +KL PRS CVF+GYP
Sbjct: 694 INRRPCRATGSLTPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYP 753
Query: 285 SNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQHLMDQ 344
++HRGY+C ++S ++ RHV F E++FP+ P+P+ D+ ++ Q
Sbjct: 754 ADHRGYRCYDMVSRRVFTSRHVTFVEDVFPFRDAPSPRPSAPPPPDHGDDTIVLLPAPAQ 813
Query: 345 TQTGPPIPQPAHQPNHITSPCPNSPNITSPPQ--SNPTSPIQQNLPPIFQPTLQANTKPI 402
P PAH SP ++P+ +P + P SP + + P P A T
Sbjct: 814 HVVTPVGTAPAHDAASPPSPASSTPSSAAPAHDVAPPPSP-ETSSPASASPPRHAMT--- 869
Query: 403 TRSQHGIFKPNQKYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVP 462
TR++ GI KPN + Y + T SP P + +AL+DPNW+ AM E+++L+ N+TW LVP
Sbjct: 870 TRARAGISKPNPR-YAMTATSTLSPTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVP 928
Query: 463 RPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
RPP +I W+F+ K +DG+ +++KAR V G Q+ G+D GETFSP+
Sbjct: 929 RPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPV 979
>gb|AAG46116.1| putative copia-like retrotransposon polyprotein [Oryza sativa]
Length = 1302
Score = 275 bits (704), Expect = 4e-72
Identities = 140/351 (39%), Positives = 213/351 (59%), Gaps = 7/351 (1%)
Query: 165 QKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATYL 224
+ + +G V RLSCPY+S QNGK+ER +R+ N+ +RT+L+H++ P SFW AL+ AT+L
Sbjct: 536 RSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDYVRTMLVHSAAPLSFWAEALQTATHL 595
Query: 225 MNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGYP 284
+N P + +P ++L P+Y+HLRVFGCLC+P ++ +KL PRS CVF+GYP
Sbjct: 596 INRRPCRATGSLTPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYP 655
Query: 285 SNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQHLMDQ 344
++HRGY+C ++S ++ RHV F E++FP+ P+P+ D+ ++ Q
Sbjct: 656 ADHRGYRCYDMVSRRVFTSRHVTFVEDVFPFRDAPSPRPSAPPPPDHGDDTIVLLPAPAQ 715
Query: 345 TQTGPPIPQPAHQPNHITSPCPNSPNITSPPQ--SNPTSPIQQNLPPIFQPTLQANTKPI 402
P PAH SP ++P+ +P + P SP + + P P A T
Sbjct: 716 HVVTPVGTAPAHDAASPPSPASSTPSSAAPAHDVAPPPSP-ETSSPASASPPRHAMT--- 771
Query: 403 TRSQHGIFKPNQKYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVP 462
TR++ GI KPN + Y + T SP P + +AL+DPNW+ AM E+++L+ N+TW LVP
Sbjct: 772 TRARAGISKPNPR-YAMTATSTLSPTPSSVRAALRDPNWRAAMQAEFDALLANRTWTLVP 830
Query: 463 RPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
RPP +I W+F+ K +DG+ +++KAR V G Q+ G+D GETFSP+
Sbjct: 831 RPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPV 881
>ref|NP_916434.1| putative gag/pol polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1090
Score = 274 bits (701), Expect = 9e-72
Identities = 170/419 (40%), Positives = 230/419 (54%), Gaps = 45/419 (10%)
Query: 138 EHTSTHNFKEKLKQFNVTMAVN-VNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRST 196
E+ ST F LK F VN+ + G RLSCPYTS QNGK+ER +R+
Sbjct: 232 EYVSTQ-FGLPLKSFQADNGREFVNTAITTFLASRGTQLRLSCPYTSPQNGKAERMLRTI 290
Query: 197 NNIIRTLLIHASLPSSFWHHALEMATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFG 256
NN IRTLLI AS+P S+W AL ATYL+N P +I+ P ++L+ P ++HLRVFG
Sbjct: 291 NNSIRTLLIQASMPPSYWAEALATATYLLNRRPSSSIHQSLPFQLLHRTIPDFSHLRVFG 350
Query: 257 CLCFPLFPSSKIYKLQPRSTKCVFLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYA 316
CLC+P ++ +KL PRST CVFLGYP++H+GY+CL L +++III RHV+F+E+ FP+A
Sbjct: 351 CLCYPNLSATTPHKLSPRSTACVFLGYPTSHKGYRCLDLSTHRIIISRHVVFDESQFPFA 410
Query: 317 KLHIPQPNTYTFLDNELSPYIIQHL-MDQTQ----------TGPPIPQPAHQPNHIT--- 362
P +++ FL LSP L ++Q + P +P P+ + + T
Sbjct: 411 ATP-PAASSFDFLLQGLSPADAPSLEVEQPRPLTVAPSTEVEQPYLPLPSRRLSAGTVTV 469
Query: 363 ---SPCPNSPNI------TSPP-----QSNPTSPIQQ--------NLPPIFQPTLQ---A 397
+P +P + +PP S SP + +PP + A
Sbjct: 470 ASEAPSAGAPLVGTSSADATPPGSATRASTIVSPFRHVYTRRPVTTVPPSSSTAVTNAVA 529
Query: 398 NTKP---ITRSQHGIFKPNQKYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYNSLIK 454
+P +TRSQ G +P + T SP+P N SAL DPNW+ AM DEY L+
Sbjct: 530 APQPHSMVTRSQSGSLRPVDRLTYTATQAAASPVPANYHSALADPNWRAAMADEYKELVD 589
Query: 455 NKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
N TW LV RPP N+ WIF+HK SDG+ R+KAR V G QQ GID ETFSP+
Sbjct: 590 NGTWRLVSRPPRANIATGKWIFKHKFHSDGSLARYKARWVVRGYSQQHGIDYDETFSPV 648
>gb|AAP51971.1| putative copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|37530764|ref|NP_919684.1| putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)] gi|20042923|gb|AAM08751.1| Putative
copia-type polyprotein [Oryza sativa (japonica
cultivar-group)]
Length = 1803
Score = 271 bits (694), Expect = 6e-71
Identities = 139/351 (39%), Positives = 210/351 (59%), Gaps = 7/351 (1%)
Query: 165 QKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATYL 224
+ + +G V RLSCPY+S QNGK+ER +R+ N+ +RT+L+H++ P SFW AL+ A +L
Sbjct: 634 RSLLSLHGAVLRLSCPYSSQQNGKAERILRTINDCVRTMLVHSAAPLSFWAEALQTAMHL 693
Query: 225 MNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGYP 284
+N P + P ++L P+Y+HLRVFGCLC+P ++ +KL PRS CVF+GYP
Sbjct: 694 INRRPCRATGSLKPYQLLLGAPPTYDHLRVFGCLCYPNTIATAPHKLSPRSLACVFIGYP 753
Query: 285 SNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQHLMDQ 344
++HRGY+C ++S ++ RHV F E++FP+ P+P+ D+ ++ Q
Sbjct: 754 ADHRGYRCYDMVSRRVFTSRHVTFVEDVFPFRDAPSPRPSAPPPPDHGDDTIVLLPAPAQ 813
Query: 345 TQTGPPIPQPAHQPNHITSPCPNSPNITSPPQ--SNPTSPIQQNLPPIFQPTLQANTKPI 402
P PAH SP ++P+ +P + P SP + + P P A T
Sbjct: 814 HVVTPVGTAPAHDAASPPSPASSTPSSAAPAHDVAPPPSP-ETSSPASASPPRHAMT--- 869
Query: 403 TRSQHGIFKPNQKYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVP 462
TR++ GI KPN + Y + T SP P + AL+DPNW+ AM E+++L+ N+TW LVP
Sbjct: 870 TRARAGISKPNPR-YAMTATSTLSPTPSSVRVALRDPNWRAAMQAEFDALLANRTWTLVP 928
Query: 463 RPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
RPP +I W+F+ K +DG+ +++KAR V G Q+ G+D GETFSP+
Sbjct: 929 RPPGARIITGKWVFKTKLHADGSLDKYKARWVVRGFNQRPGVDFGETFSPV 979
>gb|AAV24907.1| hypothetical protein [Oryza sativa (japonica cultivar-group)]
gi|51854440|gb|AAU10819.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1679
Score = 255 bits (651), Expect = 6e-66
Identities = 164/436 (37%), Positives = 224/436 (50%), Gaps = 86/436 (19%)
Query: 161 NSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEM 220
NS + NG+ R+SCP+TS QNGK+ER +RS NNI+R++L A LP SFW AL
Sbjct: 827 NSAARTFFLTNGVHLRMSCPHTSPQNGKAERILRSLNNIVRSMLFQAKLPGSFWVEALHT 886
Query: 221 ATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVF 280
AT+L+N P KT++ +P LY PSY+HLRVFGC C+P ++ +KL PRST CVF
Sbjct: 887 ATHLINRHPTKTLDRHTPHFALYGTHPSYSHLRVFGCKCYPNLSATTPHKLAPRSTMCVF 946
Query: 281 LGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLH--IPQPNTYTFLDNELSPYII 338
LGYP H+GY+C LSN++II RHV+F+E+ FP+ +L + FL++ +P
Sbjct: 947 LGYPLYHKGYRCFDPLSNRVIISRHVVFDEHSFPFTELTNGVSNATDLDFLEDFTAPAQA 1006
Query: 339 Q-------HLMDQTQT----------------------------------GPPIPQPAHQ 357
+ TQT GPP P PA
Sbjct: 1007 PIGATRRPAVAPTTQTASSPMVHGLERPPPCSPTRPVSTPGGPSSPDSRLGPPSPTPALI 1066
Query: 358 PNHITSPCPNS----PNITSPPQSN--------PTSP-----IQQNLPPIFQ-------- 392
TSP P S P+ ++ P S+ PT P +LPP
Sbjct: 1067 GPASTSPGPPSAGPAPSASTCPASSTWETVARPPTPPGLPRLDGPHLPPAPHVPRRLRSV 1126
Query: 393 -------PTLQANTKPI-------TRSQHGIFKPNQKYYGLHTHVTK-SPLPRNPVSALK 437
P P+ TR++ G KP + L+ H S +P+ +AL
Sbjct: 1127 RATGAPTPLSGLEISPVVNDHVMTTRAKSGHHKPVHR---LNLHAAPLSLVPKTYRAALA 1183
Query: 438 DPNWKMAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDG 497
DP W+ AM +EYN+L+ N+TWDLVPRP VNV+ WIF+HK +DG+ +R+KAR V G
Sbjct: 1184 DPLWRAAMEEEYNALLANRTWDLVPRPAGVNVVTGKWIFKHKFHADGSLDRYKARWVLRG 1243
Query: 498 AGQQVGIDCGETFSPM 513
Q+ G+D ETFSP+
Sbjct: 1244 FTQRPGVDFDETFSPV 1259
>emb|CAE05956.3| OSJNBb0088C09.15 [Oryza sativa (japonica cultivar-group)]
gi|32487794|emb|CAE05417.1| OSJNBa0035I04.5 [Oryza
sativa (japonica cultivar-group)]
Length = 1246
Score = 242 bits (618), Expect = 4e-62
Identities = 149/378 (39%), Positives = 199/378 (52%), Gaps = 36/378 (9%)
Query: 160 VNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALE 219
VN N G + RLSCPYTS QNGK+ER IR+ NN IRTLL+ AS+P S+W L
Sbjct: 451 VNHNTTSFLAGRGSLLRLSCPYTSPQNGKAERMIRTLNNSIRTLLLQASMPPSYWAEGLA 510
Query: 220 MATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCV 279
ATYL+N P ++N P ++L+ K P+Y+ LRVFGCLC+P ++ +KL P S CV
Sbjct: 511 TATYLLNRRPSSSVNNSIPFQLLHRKIPNYSMLRVFGCLCYPNLSATAAHKLAPYSAACV 570
Query: 280 FLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQ 339
FLGYPS+H+GY CL + + +III HV+F+E FP++ + + L + +P +I
Sbjct: 571 FLGYPSSHKGYCCLNISTRRIIISCHVIFDETQFPFSGDPVDASSLDFLLQDAPAPSVI- 629
Query: 340 HLMDQTQTGPPIPQPAHQPNHITSPCPNSPNITSPPQSNPTSPIQQNLPPIFQPTL---Q 396
P QP+ +P P N+ + S ++LP QP Q
Sbjct: 630 ---------APSLAGVEQPHLPHAPFP--VNVEQRLPTGAPSTKDEHLPYYVQPAAHCGQ 678
Query: 397 ANTKPITRSQH-GIFKPN----QKYYGLHTHVTKS--------PLPRNPVSALKDPNWK- 442
+ K T H IF Y +H T S PLP + P+ +
Sbjct: 679 DDGKFHTAGCHVSIFLKQGTIVTSYPAVHVFFTTSRHGGAIPVPLPTTTGADDSAPSHRC 738
Query: 443 -------MAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVG 495
AM +E+ +LI N TW LVPRPP NV+ WIF+HK SDG+ RHKAR V
Sbjct: 739 GCTHTSPAAMAEEFKALIDNGTWRLVPRPPGANVVTGKWIFKHKFHSDGSLARHKARWVV 798
Query: 496 DGAGQQVGIDCGETFSPM 513
G QQ GID ETFSP+
Sbjct: 799 HGYSQQHGIDYDETFSPV 816
>gb|AAA57005.1| copia-like retrotransposon Hopscotch polyprotein [Zea mays]
gi|7444442|pir||T02087 gag/pol polyprotein - maize
retrotransposon Hopscotch
Length = 1439
Score = 218 bits (555), Expect = 8e-55
Identities = 155/495 (31%), Positives = 227/495 (45%), Gaps = 90/495 (18%)
Query: 94 YGPLQF*VQLVTNIMFFSWIIIQIFCGLFQCHINPKFIQYLKN*EHTSTHNFKEKLKQFN 153
+G ++ V + + F+WI + + + F ++ +H F K+ F
Sbjct: 530 FGRYKYYVSFIDDYSKFTWIYLL----RHKSDVYKSFCEF----QHLVERMFGRKIIAFQ 581
Query: 154 VTMA---VNVNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLP 210
+N++F+ + GI ++SCP+T QNG +ERK R + LL +S+P
Sbjct: 582 SDWGGEYEKLNAHFKTI----GIHHQVSCPHTHQQNGAAERKHRHIVEVGLALLAQSSMP 637
Query: 211 SSFWHHALEMATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYK 270
+W HA A YL+N P KTI ++PL L P Y+ LR+FGC C+P +K
Sbjct: 638 LKYWDHAFLAAVYLINRTPSKTIAHDTPLHKLTGATPDYSSLRIFGCACWPNLRPYNQHK 697
Query: 271 LQPRSTKCVFLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLH----------- 319
LQ RST+CVFLGY + H+G+KCL + + +I I R V+F+E++FP+A L+
Sbjct: 698 LQFRSTRCVFLGYSNMHKGFKCLDISTGRIYISRDVVFDEHVFPFASLNKNAGVKYTSEV 757
Query: 320 --IPQP----NTYTFLDNELS------PYIIQHLMD-----QTQTGPPIPQPAHQPNHIT 362
+P N T N L P++ QH + T + PA PN ++
Sbjct: 758 LLLPHDSCGNNMLTDHANNLPGSSSPLPFLAQHFLQGNSEVPTSNNTAMALPASGPNEVS 817
Query: 363 SPCPNSPNITSPPQSNPTSPIQQNL------------PPIF----------QPTLQAN-- 398
P P+ P S + + N PP+ P LQA
Sbjct: 818 VPPALVPSSLVPAASPAPTGVSANAEPAPEADSLSSGPPVATESVTGVPDADPLLQAPGS 877
Query: 399 --------------TKPITRSQHGIFKPNQ------KYYGLHTHVTKSPLPRNPVSALKD 438
P TR QHGI KP Q +Y +T+ P + AL D
Sbjct: 878 SVAHQTPDSAPLSAAAPRTRLQHGISKPKQFTDGTVRYGNAAARITE---PSSVSEALAD 934
Query: 439 PNWKMAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGA 498
P W+ AM E+ +L KN TW LVP N+I W+F+ K +DG+ +R KARLV G
Sbjct: 935 PQWRAAMEAEFQALQKNNTWTLVPPDRTRNLIDCKWVFKVKYNADGSIDRLKARLVAKGF 994
Query: 499 GQQVGIDCGETFSPM 513
QQ GID +TFSP+
Sbjct: 995 KQQYGIDYDDTFSPV 1009
>emb|CAC37623.1| copia-like polyprotein [Arabidopsis thaliana]
Length = 1466
Score = 211 bits (536), Expect = 1e-52
Identities = 139/420 (33%), Positives = 205/420 (48%), Gaps = 27/420 (6%)
Query: 118 FCGLFQCHINPKFIQYLKN*EHTSTHNFKEKLKQFNVTMAVNVNSN-FQKMCEANGIVFR 176
F F + KFI + + K+K+F SN ++ +GI R
Sbjct: 546 FSWFFPLRMKSKFISVFIAYQKLVENQLGTKIKEFQSDGGGEFTSNKLKEHFREHGIHHR 605
Query: 177 LSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATYLMNILPHKTINFE 236
+SCPYT QNG +ERK R + ++L H+ P FW A A YL N+LP +
Sbjct: 606 ISCPYTPQQNGVAERKHRHLVELGLSMLYHSHTPLKFWVEAFFTANYLSNLLPSSVLKEI 665
Query: 237 SPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGYPSNHRGYKCLGLL 296
SP + L+ + Y LRVFG C+P K PRS +CVFLGY + ++GY+CL
Sbjct: 666 SPYETLFQQKVDYTPLRVFGTACYPCLRPLAKNKFDPRSLQCVFLGYHNQYKGYRCLYPP 725
Query: 297 SNKIIICRHVLFNENIFPY-AKLHIPQPNTYTFL-----DNELSPYII--QHLMDQTQTG 348
+ K+ I RHV+F+E FP+ K H P T L +L+P + L +
Sbjct: 726 TGKVYISRHVIFDEAQFPFKEKYHSLVPKYQTTLLQAWQHTDLTPPSVPSSQLQPLARQM 785
Query: 349 PPIPQPAHQP--NHITSPCPN-SPNITSPPQSNPTSPIQQNLPPIF----------QPTL 395
P+ +QP N+ T N + +S ++ + P+ Q +L
Sbjct: 786 TPMATSENQPMMNYETEEAVNVNMETSSDEETESNDEFDHEVAPVLNDQNEDNALGQGSL 845
Query: 396 QANTKPITRSQHGIFKPNQKYYGLHTHVTKSPL--PRNPVSALKDPNWKMAMNDEYNSLI 453
+ ITRS+ GI KPN +Y + V+KS P+ +A+K P+W A+ DE + +
Sbjct: 846 ENLHPMITRSKDGIQKPNPRYALI---VSKSSFDEPKTITTAMKHPSWNAAVMDEIDRIH 902
Query: 454 KNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
TW LVP D+N++ S W+F+ K K DGT ++ KARLV G Q+ G+D ETFSP+
Sbjct: 903 MLNTWSLVPATEDMNILTSKWVFKTKLKPDGTIDKLKARLVAKGFDQEEGVDYLETFSPV 962
>gb|AAD43604.1| T3P18.3 [Arabidopsis thaliana] gi|25301688|pir||H96650 protein
T3P18.3 [imported] - Arabidopsis thaliana
Length = 1309
Score = 207 bits (528), Expect = 1e-51
Identities = 134/390 (34%), Positives = 196/390 (49%), Gaps = 27/390 (6%)
Query: 148 KLKQFNVTMAVNVNSN-FQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIH 206
K+K+F SN ++ +GI R+SCPYT QNG +ERK R + ++L H
Sbjct: 419 KIKEFQSDGGGEFTSNKLKEHFREHGIHHRISCPYTPQQNGVAERKHRHLVELGLSMLYH 478
Query: 207 ASLPSSFWHHALEMATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSS 266
+ P FW A A YL N+LP + SP + L+ + Y LRVFG C+P
Sbjct: 479 SHTPLKFWVEAFFTANYLSNLLPSSVLKEISPYETLFQQKVDYTPLRVFGTACYPCLRPL 538
Query: 267 KIYKLQPRSTKCVFLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPY-AKLHIPQPNT 325
K PRS +CVFLGY + ++GY+CL + K+ I RHV+F+E FP+ K H P
Sbjct: 539 AKNKFDPRSLQCVFLGYHNQYKGYRCLYPPTGKVYISRHVIFDEAQFPFKEKYHSLVPKY 598
Query: 326 YTFL-----DNELSPYII--QHLMDQTQTGPPIPQPAHQP--NHITSPCPN-SPNITSPP 375
T L +L+P + L + P+ +QP N+ T N + +S
Sbjct: 599 QTTLLQAWQHTDLTPPSVPSSQLQPLARQVTPMATSENQPMMNYETEEAVNVNMETSSDE 658
Query: 376 QSNPTSPIQQNLPPIF----------QPTLQANTKPITRSQHGIFKPNQKYYGLHTHVTK 425
++ + P+ Q +L+ ITRS+ GI KPN +Y + V+K
Sbjct: 659 ETESNDEFDHEVAPVLNDQNEDNALGQGSLENLHPMITRSKDGIQKPNPRYALI---VSK 715
Query: 426 SPL--PRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSD 483
S P+ +A+K P W A+ DE + + TW LVP D+N++ S W+F+ K K D
Sbjct: 716 SSFDEPKTITTAMKHPGWNAAVMDEIDRIHMLNTWSLVPATEDMNILTSKWVFKTKLKPD 775
Query: 484 GTFERHKARLVGDGAGQQVGIDCGETFSPM 513
GT ++ KARLV G Q+ G+D ETFSP+
Sbjct: 776 GTIDKLKARLVAKGFDQEEGVDYLETFSPV 805
>gb|AAU89779.1| gag-pol polyprotein-like [Solanum tuberosum]
Length = 1212
Score = 207 bits (526), Expect = 2e-51
Identities = 131/353 (37%), Positives = 183/353 (51%), Gaps = 27/353 (7%)
Query: 164 FQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATY 223
F+K GIV + SCPYT QNG +ERK R ++ RTLLI +S+PS +W AL A Y
Sbjct: 584 FKKFLLDKGIVSQHSCPYTPQQNGVAERKNRHLLDVTRTLLIESSVPSKYWVEALSTAVY 643
Query: 224 LMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGY 283
L+N LP K +N ESP LYH++P+Y+ FGC+CF P S+ KL +STKC F+GY
Sbjct: 644 LINRLPSKVLNLESPYFRLYHQNPNYSDFHTFGCVCFVHLPPSQCNKLSVQSTKCAFMGY 703
Query: 284 PSNHRGYKCLGLLSNKIIICRHVLFNEN--IFP-YAKLHIPQPNTYTFLDNELSPYIIQH 340
++ +G+ C S+K I R+V+F EN FP L P TF D S +
Sbjct: 704 STSQKGFICYDPCSHKFRISRNVVFFENQYFFPTIVDLSSVSPLLPTFEDLSSSFKRFKP 763
Query: 341 LMDQTQTGPPIPQPAHQPNHITSPCPNSPNITSPPQSNPTSPIQQNLPPIFQPTLQANTK 400
+ P +P P P T+P S N S+ + P++
Sbjct: 764 GFVYERRRPTLPYPNTDPPPETAPQLESEN------SSRSGPLE---------------- 801
Query: 401 PITRSQHGIFKPNQKYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDL 460
P RS PN +YG + ++ +P A K W+ AM +E +L +N TWD+
Sbjct: 802 PTRRSTRVSRTPN--WYGFSSTLSNISVPSCYSQASKHECWQKAMEEELLALKENDTWDI 859
Query: 461 VPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
V P +V I W++ K SDGT +R+KARLV G Q+ G+D ETF+P+
Sbjct: 860 VSCPSNVRPIGCKWVYSIKLHSDGTLDRYKARLVVLGNRQEYGVDYEETFAPV 912
>emb|CAB79576.1| putative protein [Arabidopsis thaliana] gi|3269282|emb|CAA19715.1|
putative protein [Arabidopsis thaliana]
gi|7444417|pir||T05745 hypothetical protein M4I22.20 -
Arabidopsis thaliana
Length = 1318
Score = 205 bits (522), Expect = 5e-51
Identities = 135/399 (33%), Positives = 197/399 (48%), Gaps = 47/399 (11%)
Query: 160 VNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALE 219
V+ F + +++GI +LSCP+T QNG +ERK R + ++L + +P FW A
Sbjct: 402 VSHKFLQHLQSHGIQQQLSCPHTPQQNGLAERKHRHLVELGLSMLFQSHVPHKFWVEAFF 461
Query: 220 MATYLMNILPHKTINFE-SPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKC 278
A +L+N+LP + SP + LY K P Y LR FG CFP K P S KC
Sbjct: 462 TANFLINLLPTSALKESISPYEKLYDKKPDYTSLRSFGSACFPTLRDYAENKFNPCSLKC 521
Query: 279 VFLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFP----YAKLHIPQPNT---YTFLDN 331
VFLGY ++GY+CL + ++ I RHV+F+E+++P Y LH PQP T +L +
Sbjct: 522 VFLGYNEKYKGYRCLYPPTGRLYISRHVIFDESVYPFSHTYKHLH-PQPRTPLLAAWLRS 580
Query: 332 ELSPYIIQHLMDQTQTG-------PPIPQP--------------AHQPNHITSPCPN--- 367
SP +++ PP+PQ +H N T P+
Sbjct: 581 SDSPAPSTSTSPSSRSPLFTSADFPPLPQRKTPLLPTLVPISSVSHASNITTQQSPDFDS 640
Query: 368 ------------SPNITSPPQSNPTSPIQQNLPPIFQPTLQANTKP-ITRSQHGIFKPNQ 414
+ +S S+ IQQ + Q N P +TR++ GI KPN
Sbjct: 641 ERTTDFDSASIGDSSHSSQAGSDSEETIQQASVNVHQTHASTNVHPMVTRAKVGISKPNP 700
Query: 415 KYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMW 474
+Y L +H P P+ +ALK P W AM +E + + +TW LVP D++V+ S W
Sbjct: 701 RYVFL-SHKVSYPEPKTVTAALKHPGWTGAMTEEIGNCSETQTWSLVPYKSDMHVLGSKW 759
Query: 475 IFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
+FR K +DGT + KAR+V G Q+ GID ET+SP+
Sbjct: 760 VFRTKLHADGTLNKLKARIVAKGFLQEEGIDYLETYSPV 798
>gb|AAD21687.1| Strong similarity to gi|3600044 T12H20.12 protease homolog from
Arabidopsis thaliana BAC gb|AF080119 and is a member of
the reverse transcriptase family PF|00078
gi|25301706|pir||C86438 hypothetical protein F28K20.17 -
Arabidopsis thaliana
Length = 1415
Score = 204 bits (519), Expect = 1e-50
Identities = 122/363 (33%), Positives = 189/363 (51%), Gaps = 14/363 (3%)
Query: 160 VNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALE 219
V++ + +GI R+SCPYT QNG +ERK R + ++L H+ P FW +
Sbjct: 587 VSNKLKTHLSEHGIHHRISCPYTPQQNGLAERKHRHLVELGLSMLFHSHTPQKFWVESFF 646
Query: 220 MATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCV 279
A Y++N LP + SP + L+ + P Y+ LRVFG C+P K PRS +CV
Sbjct: 647 TANYIINRLPSSVLKNLSPYEALFGEKPDYSSLRVFGSACYPCLRPLAQNKFDPRSLQCV 706
Query: 280 FLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLH---IPQ---PNTYTFLDNEL 333
FLGY S ++GY+C + K+ I R+V+FNE+ P+ + + +PQ P + N++
Sbjct: 707 FLGYNSQYKGYRCFYPPTGKVYISRNVIFNESELPFKEKYQSLVPQYSTPLLQAWQHNKI 766
Query: 334 SPYIIQHLMDQTQTGP---PIPQPAHQPNHITSPCPNSPNITSPPQSNPTSPIQQNLPPI 390
S + Q + P + +T P P S N S + N P+ + +
Sbjct: 767 SEISVPAAPVQLFSKPIDLNTYAGSQVTEQLTDPEPTSNNEGSDEEVN---PVAEEIAAN 823
Query: 391 FQPTLQANTKPITRSQHGIFKPNQKYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYN 450
+ + ++ TRS+ GI KPN + Y L T + P+ SA+K P W A+++E N
Sbjct: 824 QEQVINSHAM-TTRSKAGIQKPNTR-YALITSRMNTAEPKTLASAMKHPGWNEAVHEEIN 881
Query: 451 SLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETF 510
+ TW LVP D+N++ S W+F+ K DG+ ++ KARLV G Q+ G+D ETF
Sbjct: 882 RVHMLHTWSLVPPTDDMNILSSKWVFKTKLHPDGSIDKLKARLVAKGFDQEEGVDYLETF 941
Query: 511 SPM 513
SP+
Sbjct: 942 SPV 944
>gb|AAB61111.1| Strong similarity to Zea mays retrotransposon Hopscotch polyprotein
(gb|U12626). [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
Length = 1315
Score = 195 bits (496), Expect = 5e-48
Identities = 121/368 (32%), Positives = 177/368 (47%), Gaps = 37/368 (10%)
Query: 163 NFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMAT 222
NF + + GIV SCP T QN ERK + N+ R+L + +P S+W + A
Sbjct: 538 NFTQFYHSKGIVPYHSCPETPQQNSVVERKHQHILNVARSLFFQSHIPISYWGDCILTAV 597
Query: 223 YLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLG 282
YL+N LP + + P +VL P+Y+H++VFGCLC+ +K PR+ C F+G
Sbjct: 598 YLINRLPAPILEDKCPFEVLTKTVPTYDHIKVFGCLCYASTSPKDRHKFSPRAKACAFIG 657
Query: 283 YPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQHLM 342
YPS +GYK L L ++ II+ RHV+F+E +FP+ + Q F D
Sbjct: 658 YPSGFKGYKLLDLETHSIIVSRHVVFHEELFPFLGSDLSQEEQNFFPDLN---------- 707
Query: 343 DQTQTGPPIPQPAHQPNHITSPCPNSPNITSPPQSNPTSPIQQNLPPIFQPTLQANTKP- 401
P P Q + +P +S ++ P +NPT+ + + P Q + + KP
Sbjct: 708 -------PTPPMQRQSSDHVNPSDSSSSVEILPSANPTNNVPE---PSVQTSHRKAKKPA 757
Query: 402 ----------ITRSQHGIFK------PNQKYYGLHTHVTKSPLPRNPVSALKDPNWKMAM 445
++ + H I K N Y + K+ P N A K W+ AM
Sbjct: 758 YLQDYYCHSVVSSTPHEIRKFLSYDRINDPYLTFLACLDKTKEPSNYTEAEKLQVWRDAM 817
Query: 446 NDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGID 505
E++ L TW++ P D I WIF+ K SDG+ ER+KARLV G Q+ GID
Sbjct: 818 GAEFDFLEGTHTWEVCSLPADKRCIGCRWIFKIKYNSDGSVERYKARLVAQGYTQKEGID 877
Query: 506 CGETFSPM 513
ETFSP+
Sbjct: 878 YNETFSPV 885
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 194 bits (493), Expect = 1e-47
Identities = 131/394 (33%), Positives = 196/394 (49%), Gaps = 46/394 (11%)
Query: 160 VNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALE 219
+++ F++ + G + +LSCP +QNG +ERK R RTLLI + +P+ FW A+
Sbjct: 456 MSNAFREFLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAEAIS 515
Query: 220 MATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCV 279
A YL+N+ P ++ SP +VL+ P Y+HLRVFGC C+ L + KL +S +CV
Sbjct: 516 TAVYLINMQPSSSLQGRSPGEVLFGSPPRYDHLRVFGCTCYVLLAPRERTKLTAQSVECV 575
Query: 280 FLGYPSNHRGYKCLGLLSNKIIICRHVLFNEN-IFPYAKLHIPQ--PNTYTFLDNELSPY 336
FLGY H+GY+C + +I I R V F+EN F Y+ + P N+ +FL L P
Sbjct: 576 FLGYSLEHKGYRCYDPSARRIRISRDVTFDENKPFFYSSTNQPSSPENSISFL--YLPPI 633
Query: 337 IIQHLMDQTQTGP---PIPQPAHQPNHITSPCPN-SPNITSPPQSN-PTSPIQQNLPPIF 391
+ + P PIP P ++ P P+ SP+ SPP S+ P S ++P
Sbjct: 634 PSPESLPSSPITPSPSPIPPSVPSPTYVPPPPPSPSPSPVSPPPSHIPASSSPPHVPSTI 693
Query: 392 ----------------------QPTLQANTKPI----------TRSQHGIFKPNQKYYGL 419
QPTL+ T + R++ + PN+ + +
Sbjct: 694 TLDTFPFHYSRRPKIPNESQPSQPTLEDPTCSVDDSSPAPRYNLRARDALRAPNRDDFVV 753
Query: 420 HTHVTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHK 479
P A+ P+WK+AM++E +L + TWD+VP P I W+++ K
Sbjct: 754 GVVFE----PSTYQEAIVLPHWKLAMSEELAALERTNTWDVVPLPSHAVPITCKWVYKVK 809
Query: 480 EKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
KSDG ER+KARLV G Q G D ETF+P+
Sbjct: 810 TKSDGQVERYKARLVARGFQQAHGRDYDETFAPV 843
>emb|CAA72989.1| unnamed protein product [Brassica oleracea] gi|7488558|pir||T14517
hypothetical protein 1 - wild cabbage transposon Melmoth
Length = 1131
Score = 194 bits (492), Expect = 2e-47
Identities = 129/400 (32%), Positives = 188/400 (46%), Gaps = 53/400 (13%)
Query: 125 HINPKFIQYLKN*EHTSTHNFKEKLKQFNVTMAVNVNSNFQKMCEANGIVFRLSCPYTSS 184
HI P F+ ++ + K+K A ++ F ++ + GIV SCP T
Sbjct: 620 HIFPTFVNQIET-------QYNTKIKSVRRDNAPELS--FTELFKEKGIVSYHSCPETLE 670
Query: 185 QNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATYLMNILPHKTINFESPLKVLYH 244
QN ERK + N+ R L+ + +P +W + A +L+N P + +SP +VL
Sbjct: 671 QNSVLERKHQHLLNVARALMFQSQVPLQYWGDCVLTAAFLINRTPSPLLANKSPYEVLMG 730
Query: 245 KDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGYPSNHRGYKCLGLLSNKIIICR 304
K P Y+ LR FGCLC+ + +K PRS CVFLGYPS ++GYK L L SNKI I R
Sbjct: 731 KAPQYDQLRTFGCLCYGSTSPKQRHKFMPRSRACVFLGYPSGYKGYKLLDLESNKIYISR 790
Query: 305 HVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQHLMDQTQTGPPIPQPAHQPNHITSP 364
+V F+E+IFP AK ++ F P +T P
Sbjct: 791 NVTFHEDIFPMAKHQKMDESSLHFF----------------------------PPKVTVP 822
Query: 365 CPNSPNITSPPQSNPTSPIQQNLPPIFQPTLQANTKPI-TRSQHGIFKP---NQKYYGLH 420
SPNI+S P S + I + Q T+ A+ K S H P Y +
Sbjct: 823 SAPSPNISSSPFSTLSPQISKR-----QRTVPAHLKDFHCYSVHDSAYPISSTLSYSQIS 877
Query: 421 TH-------VTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVPRPPDVNVIRSM 473
+H +T P+P++ + W + + E +++ +N TWD+VP P I
Sbjct: 878 SHHLAYINSITNIPIPQSYAEVRQSKEWTESADKELDAMEENDTWDVVPLPKGKKAIGCR 937
Query: 474 WIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
W+ K +DGT ER K+RLVG G Q+ G+D ETFSP+
Sbjct: 938 WVHTLKFNADGTLERRKSRLVGKGYTQKEGLDYIETFSPV 977
>gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thaliana]
gi|25403501|pir||H86486 protein Ty1/copia-element
polyprotein [imported] - Arabidopsis thaliana
Length = 1152
Score = 192 bits (487), Expect = 6e-47
Identities = 126/371 (33%), Positives = 187/371 (49%), Gaps = 36/371 (9%)
Query: 172 GIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATYLMNILPHK 231
GIV ++SC YT QNG+ ERK R N+ R+LL A LP SFW ++ A YL+N P
Sbjct: 656 GIVHQISCVYTHQQNGRVERKHRHILNVARSLLFQAELPISFWEESVLTAAYLINRTPTP 715
Query: 232 TINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGYPSNHRGYK 291
++ ++P K+LY + PSY LRVFG LCF + ++ K Q R KC+F+GYP +G++
Sbjct: 716 ILDGKTPYKILYSQPPSYASLRVFGSLCFARKHTGRLDKFQERGRKCIFVGYPHGQKGWR 775
Query: 292 CLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTF------LDNELSPYIIQHLMDQT 345
+ S + R V+F E+IFP+A + N TF + + + PY + L D
Sbjct: 776 IYDIESQIFFVSRDVVFQEDIFPFAD----KKNKDTFSSPAAVIPSPILPYDDEFL-DIY 830
Query: 346 QTGP-PIPQPAHQPNHITSPCPNSPNITSPPQSNPTSPI-------QQNLPPIFQPTLQA 397
Q G P P + P+SP IT+ P + P+ Q+N+ T A
Sbjct: 831 QIGDVPATNPLPAIIDVNDSPPSSPIITATPAAASPPPLRRGLRQRQENVRLKDYQTYSA 890
Query: 398 NTK--------------PITRSQHG-IFKPNQKYYGLHTHVTKSPLPRNPVSALKDPNWK 442
+ P+ G IF P+ +++ + P N A+++ W+
Sbjct: 891 QCESTQTLSDNIGTCIYPMANYVSGEIFSPSNQHFLAAISMVDPPQTYN--QAIREKEWR 948
Query: 443 MAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQV 502
A+ E ++L TWD+ P V I S W+FR K S+GT ER+KARLV G Q+
Sbjct: 949 NAVFFEVDALEDQGTWDITKLPQGVKAIGSKWVFRIKYNSNGTVERYKARLVALGNHQKE 1008
Query: 503 GIDCGETFSPM 513
GID +TF+P+
Sbjct: 1009 GIDFTKTFAPV 1019
>gb|AAT38758.1| putative gag-pol polyprotein [Solanum demissum]
Length = 1333
Score = 190 bits (482), Expect = 2e-46
Identities = 120/359 (33%), Positives = 183/359 (50%), Gaps = 38/359 (10%)
Query: 160 VNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALE 219
++++F CE NGI L+ PYT QNG +ERK R+ + R+ L LP FW A+
Sbjct: 578 LSNDFNLFCEENGIRRELTAPYTPEQNGVAERKNRTVVEMARSSLKAKGLPDYFWGEAVA 637
Query: 220 MATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCV 279
Y +NI P K + +PL+ K P +HLR+FGC+ + L KL +STKC+
Sbjct: 638 TVVYFLNISPTKDVWNTTPLEAWNGKKPRVSHLRIFGCIAYALVNFHS--KLDEKSTKCI 695
Query: 280 FLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQ 339
F+GY + Y+ +S K+II R+V+FNE++ ++ F
Sbjct: 696 FVGYSLQSKAYRLYNPISGKVIISRNVVFNEDV------------SWNFNSG-------- 735
Query: 340 HLMDQTQTGPPIPQPAHQPNHITSPCPNSPNITSP-PQSNPTSPIQQNLPPIFQPTLQAN 398
++M Q P + A + + P S +++SP S +P + ++ PI
Sbjct: 736 NMMSNIQLLPTDEESAVDFGNSPNSSPVSSSVSSPIAPSTTVAPDESSVEPI-------- 787
Query: 399 TKPITRSQHGIFKPNQKYYG-LHTHVTKSPLPRNPV---SALKDPNWKMAMNDEYNSLIK 454
P+ RS KPN KY ++T + L +P+ A++ WK AM +E ++ +
Sbjct: 788 --PLRRSTRE-KKPNPKYSNTVNTSCQFALLVSDPICYEEAVEQSEWKNAMIEEIQAIER 844
Query: 455 NKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
N TW+LV P NVI W+FR K +DG+ ++HKARLV G QQ G+D ETFSP+
Sbjct: 845 NSTWELVDAPEGKNVIGLKWVFRTKYNADGSIQKHKARLVAKGYSQQQGVDFDETFSPV 903
>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989049|gb|AAG10812.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 188 bits (478), Expect = 6e-46
Identities = 134/422 (31%), Positives = 192/422 (44%), Gaps = 48/422 (11%)
Query: 136 N*EHTSTHNFKEKLKQFNVTMAVNVNSN-FQKMCEANGIVFRLSCPYTSSQNGKSERKIR 194
N E T+ F K+K F S F+ GI+ + SCPYT QNG +ERK R
Sbjct: 559 NFETYVTNQFNAKIKVFRTDNGGEYTSQKFRDHLAKRGIIHQTSCPYTPQQNGVAERKNR 618
Query: 195 STNNIIRTLLIHASLPSSFWHHALEMATYLMNILPHKTINFESPLKVLYHKDPSYNHLRV 254
+ R+++ H S+P FW A+ A YL+N P K ++ SP +VL + P +HLRV
Sbjct: 619 HLMEVARSMMFHTSVPKRFWGDAVLTACYLINRTPTKVLSDLSPFEVLNNTKPFIDHLRV 678
Query: 255 FGCLCFPLFPSSKIYKLQPRSTKCVFLGYPSNHRGYKCLGLLSNKIIICRHVLFN----- 309
FGC+CF L P + KL +STKC+FLGY + +GYKC N+ I R V F
Sbjct: 679 FGCVCFVLIPGEQRSKLDAKSTKCMFLGYSTTQKGYKCFDPTKNRTFISRDVKFLENQDY 738
Query: 310 ------ENIFPYAKLHIPQPNTYTFLDNEL---SPYIIQHLMDQTQTGPPIPQP------ 354
EN+ + T FL + L S QH + TQ + Q
Sbjct: 739 NNKKDWENLKDLTHSTSDRVETLKFLLDHLGNDSTSTTQHQPEMTQDQEDLNQENEEVSL 798
Query: 355 AHQPN--HITSPCPNSPNITSPPQ-----SNPTSPIQQNLPPIFQPTLQANTKPITRSQH 407
HQ N H+ PN+ + Q S+ Q LPP P L+ +T+ R +
Sbjct: 799 QHQENLTHVQEDPPNTQEHSEHVQEIQDDSSEDEEPTQVLPP--PPPLRRSTR--IRRKK 854
Query: 408 GIFKPNQ----------------KYYGLHTHVTKSPLPRNPVSALKDPNWKMAMNDEYNS 451
F N + + +++ +P+ A++ W+ A+ DE N+
Sbjct: 855 EFFNSNAVAHPFQATCSLALVPLDHQAFLSKISEHWIPQTYEEAMEVKEWRDAIADEINA 914
Query: 452 LIKNKTWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFS 511
+ +N TWD P + S W+F K KS+G ER+K RLV G Q G D ETF+
Sbjct: 915 MKRNHTWDEDDLPKGKKTVSSRWVFTIKYKSNGDIERYKTRLVARGFTQTYGSDYMETFA 974
Query: 512 PM 513
P+
Sbjct: 975 PV 976
>gb|AAP46257.1| putative polyprotein [Oryza sativa (japonica cultivar-group)]
gi|50919599|ref|XP_470160.1| putative polyprotein [Oryza
sativa (japonica cultivar-group)]
Length = 1335
Score = 188 bits (477), Expect = 8e-46
Identities = 114/357 (31%), Positives = 178/357 (48%), Gaps = 21/357 (5%)
Query: 160 VNSNFQKMCEANGIVFRLSCPYTSSQNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALE 219
++ F+K CE GI +L+ Y++ QNG +ERK R+ N++ ++L +P SFW A+
Sbjct: 592 ISKEFEKYCENAGIRRQLTAGYSAQQNGVAERKNRTINDMANSMLQDKGMPKSFWAEAVN 651
Query: 220 MATYLMNILPHKTINFESPLKVLYHKDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCV 279
A Y++N P K + +P + Y K P H+RVFGC+C+ P+ K K +S +C+
Sbjct: 652 TAVYILNRSPTKAVTNRTPFEAWYGKKPVIGHMRVFGCICYAQVPAQKRVKFDNKSDRCI 711
Query: 280 FLGYPSNHRGYKCLGLLSNKIIICRHVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQ 339
F+GY +GY+ L KIII R +F+E+ T+ + E S +
Sbjct: 712 FVGYADGIKGYRLYNLEKKKIIISRDAIFDESA------------TWNWKSPEASSTPLL 759
Query: 340 HLMDQTQTGPPIPQPAHQPNHITSPCPNSPNITSPPQSNPTSPIQQNLPPIFQPTLQANT 399
T P + +H SP P+SP ++S S+ +SP + Q + +
Sbjct: 760 PTTTITLGQPHMHGTHEVEDHTPSPQPSSP-MSSSSASSDSSPSSEE-----QISTPESA 813
Query: 400 KPITRSQHGIFKPNQKYYGLHTHVTKSPLPRNPVS---ALKDPNWKMAMNDEYNSLIKNK 456
RS + + + G H + P S A K NW AM DE + + KN
Sbjct: 814 PRRVRSMVELLESTSQQRGSEQHEFCNYSVVEPQSFQEAEKHDNWIKAMEDEIHMIEKNN 873
Query: 457 TWDLVPRPPDVNVIRSMWIFRHKEKSDGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
TW+LV RP D VI W+++ K DG+ +++KARLV G Q+ GID ET++P+
Sbjct: 874 TWELVDRPRDREVIGVKWVYKTKLNPDGSVQKYKARLVAKGFKQKPGIDYYETYAPV 930
>emb|CAB10526.1| retrotransposon like protein [Arabidopsis thaliana]
gi|7268497|emb|CAB78748.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7444421|pir||A71444 probable
LTR retrotransposon - Arabidopsis thaliana
Length = 1433
Score = 187 bits (475), Expect = 1e-45
Identities = 122/391 (31%), Positives = 188/391 (47%), Gaps = 37/391 (9%)
Query: 125 HINPKFIQYLKN*EHTSTHNFKEKLKQFNVTMAVNVNSNFQKMCEANGIVFRLSCPYTSS 184
H+ P FI + HT ++ KLK A + F + A+GIV SCP T
Sbjct: 646 HVFPAFINMV----HTQ---YQTKLKSVRSDNAHELK--FTDLFAAHGIVAYHSCPETPE 696
Query: 185 QNGKSERKIRSTNNIIRTLLIHASLPSSFWHHALEMATYLMNILPHKTINFESPLKVLYH 244
QN ERK + N+ R LL +++P FW + A +L+N LP +N +SP + L +
Sbjct: 697 QNSVVERKHQHILNVARALLFQSNIPLEFWGDCVLTAVFLINRLPTPVLNNKSPYEKLKN 756
Query: 245 KDPSYNHLRVFGCLCFPLFPSSKIYKLQPRSTKCVFLGYPSNHRGYKCLGLLSNKIIICR 304
P+Y L+ FGCLC+ + +K +PR+ CVFLGYP ++GYK L + ++ + I R
Sbjct: 757 IPPAYESLKTFGCLCYSSTSPKQRHKFEPRARACVFLGYPLGYKGYKLLDIETHAVSISR 816
Query: 305 HVLFNENIFPYAKLHIPQPNTYTFLDNELSPYIIQHLMDQTQTGPPIPQPAHQPNHITSP 364
HV+F+E+IFP+ I D P + PA + P
Sbjct: 817 HVIFHEDIFPFISSTIKD--------------------DIKDFFPLLQFPARTDD---LP 853
Query: 365 CPNSPNITSPPQSNPTSPIQQNLPPIFQPTLQANTKPITRSQ--HGIFKPNQKYYGLHTH 422
+ I + P + +S + L P F P + KP Q H + ++ +
Sbjct: 854 LEQTSIIDTHPHQDVSS--SKALVP-FDPLSKRQKKPPKHLQDFHCYNNTTEPFHAFINN 910
Query: 423 VTKSPLPRNPVSALKDPNWKMAMNDEYNSLIKNKTWDLVPRPPDVNVIRSMWIFRHKEKS 482
+T + +P+ A W AM +E ++++ TW +V PP+ I W+F K +
Sbjct: 911 ITNAVIPQRYSEAKDFKAWCDAMKEEIGAMVRTNTWSVVSLPPNKKAIGCKWVFTIKHNA 970
Query: 483 DGTFERHKARLVGDGAGQQVGIDCGETFSPM 513
DG+ ER+KARLV G Q+ G+D ETFSP+
Sbjct: 971 DGSIERYKARLVAKGYTQEEGLDYEETFSPV 1001
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.337 0.147 0.487
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,591,177,766
Number of Sequences: 2540612
Number of extensions: 68510575
Number of successful extensions: 298623
Number of sequences better than 10.0: 1133
Number of HSP's better than 10.0 without gapping: 839
Number of HSP's successfully gapped in prelim test: 326
Number of HSP's that attempted gapping in prelim test: 292561
Number of HSP's gapped (non-prelim): 3627
length of query: 952
length of database: 863,360,394
effective HSP length: 138
effective length of query: 814
effective length of database: 512,755,938
effective search space: 417383333532
effective search space used: 417383333532
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.7 bits)
S2: 80 (35.4 bits)
Medicago: description of AC146853.2