
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148227.2 + phase: 0 /pseudo
(1075 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAA36615.1| unnamed protein product [Solanum tuberosum] gi|4... 196 5e-48
gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsi... 148 8e-34
gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsi... 140 2e-31
dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis t... 140 3e-31
pir||F86470 probable retroelement polyprotein [imported] - Arabi... 134 2e-29
gb|AAT40550.1| putative receptor kinase [Solanum demissum] 132 5e-29
ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-gr... 130 2e-28
dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis t... 127 2e-27
gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsi... 121 1e-25
gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thalia... 121 1e-25
gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsi... 120 2e-25
dbj|BAD99220.1| polypeptide with an integrase domain [Petunia x ... 119 5e-25
gb|AAT38747.1| putative polyprotein [Solanum demissum] 117 2e-24
pir||E96608 probable retroelement polyprotein F25P12.89 [importe... 110 3e-22
gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsi... 108 1e-21
gb|AAU89730.1| putative polyprotein [Solanum tuberosum] 100 3e-19
emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis... 99 8e-19
gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica ... 95 1e-17
gb|AAC33963.1| contains similarity to reverse transcriptases (Pf... 86 5e-15
gb|AAO26691.1| gag-pol polyprotein [Vitis vinifera] 82 1e-13
>emb|CAA36615.1| unnamed protein product [Solanum tuberosum] gi|421954|pir||S25786
hypothetical protein 3 - potato transposon Tst1
Length = 675
Score = 196 bits (497), Expect = 5e-48
Identities = 164/532 (30%), Positives = 246/532 (45%), Gaps = 85/532 (15%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
CNL+S KL C+ F+ C FQ+ +SGK I SA+++GGLY+L D QL I
Sbjct: 32 CNLVSFRKLTRSLNCRVIFYSDLCEFQEKVSGKMIGSARESGGLYFL-DNGNNSLQLNPI 90
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQH-R 119
F S FV N VMLWH LGHPSF YL+ + P+LF + S FQCE E +K H
Sbjct: 91 --FLNSTFVLNK---VMLWHYGLGHPSFYYLRHLLPQLFRNKNPSLFQCEFCEMAKHHVD 145
Query: 120 SSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DK 179
+SFP Q Y+ SK F++IHSDVWGP+RI+ T G + I D +
Sbjct: 146 TSFPSQRYQASKPFTMIHSDVWGPSRIS---TMFGKRWFVTFIDDH------------TR 190
Query: 180 LSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETF------------------FFMENG 221
LS + L + + +++ T + + TQF E FF + G
Sbjct: 191 LSWVFLLKGKSEVKNVFE----TFHVMVETQFNEKIKIFRSDNGREFFNEQLGSFFRKTG 246
Query: 222 IVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*G------CINCCTLNK 275
+V QS+C +PQQNGI ERKNRHLLE RAL+F + L G IN
Sbjct: 247 VVHQSSCPDTPQQNGIAERKNRHLLEATRALMFTSKVPQHLWGEALLTATYLINRMPSRP 306
Query: 276 LYVISCFKP*DSSRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVG 335
L + FK S ++ + P + +H + RA K +FVG
Sbjct: 307 LEFKTPFKVFRESFPSSRLTTDLPLRVFGCTTFVHVH-------NRSKLEPRAKKCIFVG 359
Query: 336 YSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFFFRTIIFKGGNQMKIHLIFFEDLILFEN 395
Y+P++KGYKC D ++++ +VTMD+TFF +F + Q + HL ++
Sbjct: 360 YAPSQKGYKCYDPHARKIIVTMDLTFFESQLYFTTHL------QGEYHLGEDSFFVILRK 413
Query: 396 MFMSHSSRPFVSKENAP-----DNVSEHTPSMSEDVTKLVATNQNSNNDSLEPNDNQELI 450
+ + R + + N +++++ P +D + L+ Q + + P+++
Sbjct: 414 LDIK-QMRSLILQINTDVRDVGEDINKCDPRDDKDQSDLMIKTQKFKPEPVAPSND---- 468
Query: 451 QMSLHEHPYNETERKFGEVEGTWKGIIYGRRNHDKVVEDLIPQHSHESEPRE 502
+ K G E + +Y RRN + QH +S P++
Sbjct: 469 ------------KNKNGNREQKTEMQVYSRRNRTQEKRTEDSQHCQKSVPQD 508
>gb|AAD19784.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301698|pir||C84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1501
Score = 148 bits (374), Expect = 8e-34
Identities = 113/365 (30%), Positives = 165/365 (44%), Gaps = 27/365 (7%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
C L+SVSKL+ +C F DT C QD S I S ++ GG+YYL D
Sbjct: 464 CTLISVSKLLKQTQCLATFTDTLCFLQDRSSKTLIGSGEERGGVYYLTDVTPA------- 516
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRS 120
+N D LWH RLGHPSF L + +S C++ +KQ R
Sbjct: 517 -----KIHTANVDSDQALWHQRLGHPSFSVLSSLPLFSKTSSTVTSHSCDVCFRAKQTRE 571
Query: 121 SFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKL 180
FP K + FS+IH DVWGP R+ P G L I+ D+ ++ ++
Sbjct: 572 VFPESINKTEECFSLIHCDVWGPYRV---PASCGAVYFLTIVDDYSRAVWTYLLLEKSEV 628
Query: 181 SRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITER 240
++L N + + K ++ N + +F ENGI+ Q++CV +PQQNG ER
Sbjct: 629 RQVLTNFLKYAEKQFGKTVKMVRSDNGTEFMCLSSYFRENGIIHQTSCVGTPQQNGRVER 688
Query: 241 KNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKILSKCPY 300
K+RH+L +ARALLF G + L S S R ++L +
Sbjct: 689 KHRHILNVARALLFQASLPIKFWGESILTAAYLINRTPSSIL----SGRTPYEVL----H 740
Query: 301 FSRFAFKNIRMH----YFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVT 356
S+ + +R+ Y T+ + R+ +FVGY +KG+K D+ FLV+
Sbjct: 741 GSKPVYSQLRVFGSACYVHRVTRDKDKFGQRSRSCIFVGYPFGKKGWKVYDIERNEFLVS 800
Query: 357 MDVTF 361
DV F
Sbjct: 801 RDVIF 805
>gb|AAC67205.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301695|pir||D84481 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1413
Score = 140 bits (353), Expect = 2e-31
Identities = 114/372 (30%), Positives = 166/372 (43%), Gaps = 41/372 (11%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
C+L+SVSKL+ +C F DT C+ QD S I + ++ G+YYL D T I
Sbjct: 447 CSLISVSKLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDGVYYLTDAATTTVHKVDI 506
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHD--FSSFQCEIYEFSKQH 118
++ D LWH RLGHPSF L + LF G SS C++ +KQ
Sbjct: 507 TT------------DHALWHQRLGHPSFSVLSSL--PLFSGSSCSVSSRSCDVCFRAKQT 552
Query: 119 RSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*D 178
R FP + K + FS+IH DVWGP R+ P+ G L I+ DF ++
Sbjct: 553 REVFPDSSNKSTDCFSLIHCDVWGPYRV---PSSCGAVYFLTIVDDFSRSVWTYLLLAKS 609
Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGIT 238
++ +L N + K ++ N + +F E GIV Q++CV +PQQNG
Sbjct: 610 EVRSVLTNFLAYTEKQFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRV 669
Query: 239 ERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL---------NKLYVISCFKP*DSSR 289
ERK+RH+L ++RALLF G + L N L
Sbjct: 670 ERKHRHILNVSRALLFQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKPDY 729
Query: 290 NLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLN 349
+ L++ Y R T+ + R+ +FVGY +KG+K DL+
Sbjct: 730 DQLRVFGSACYAHRV-------------TRDKDKFGERSRLCIFVGYPFGQKGWKVYDLS 776
Query: 350 SKRFLVTMDVTF 361
+ F+V+ DV F
Sbjct: 777 TNEFIVSRDVVF 788
>dbj|BAA97287.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1491
Score = 140 bits (352), Expect = 3e-31
Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 41/372 (11%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
C+L+SVSKL+ +C F DT C+ QD S I + ++ G+YYL D T +
Sbjct: 447 CSLISVSKLVKQIKCLALFTDTICVLQDRFSRTLIGTGEERDGVYYLTDAATTTVHKVDV 506
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHD--FSSFQCEIYEFSKQH 118
++ D LWH RLGHPSF L + LF G SS C++ +KQ
Sbjct: 507 TT------------DHALWHQRLGHPSFSVLSSL--PLFSGSSCSVSSRSCDVCFRAKQT 552
Query: 119 RSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*D 178
R FP + K + FS+IH DVWGP R+ P+ G L I+ DF ++
Sbjct: 553 REVFPDSSNKSTDCFSLIHCDVWGPYRV---PSSCGAVYFLTIVDDFSRSVWTYLLLAKS 609
Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGIT 238
++ +L N + K ++ N + +F E GIV Q++CV +PQQNG
Sbjct: 610 EVRSVLTNFLAYTEKQFGKSVKIIRSDNGTEFMCLSSYFKEQGIVHQTSCVGTPQQNGRV 669
Query: 239 ERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL---------NKLYVISCFKP*DSSR 289
ERK+RH+L ++RALLF G + L N L
Sbjct: 670 ERKHRHILNVSRALLFQASLPIKFWGEAVMTAAYLINRTPSSIHNGLSPYELLHGCKPDY 729
Query: 290 NLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLN 349
+ L++ Y R T+ + R+ +FVGY +KG+K DL+
Sbjct: 730 DQLRVFGSACYAHRV-------------TRDKDKFGERSRLCIFVGYPFGQKGWKVYDLS 776
Query: 350 SKRFLVTMDVTF 361
+ F+V+ DV F
Sbjct: 777 TNEFIVSRDVVF 788
>pir||F86470 probable retroelement polyprotein [imported] - Arabidopsis thaliana
gi|9989049|gb|AAG10812.1| Putative retroelement
polyprotein [Arabidopsis thaliana]
Length = 1404
Score = 134 bits (336), Expect = 2e-29
Identities = 133/518 (25%), Positives = 205/518 (38%), Gaps = 82/518 (15%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
NLLSV + D C F FQD +GK I G LY LED +S
Sbjct: 390 NLLSVKRTTRDLNCYAIFGPNDVYFQDIETGKVIGEGGSKGELYVLED----------LS 439
Query: 62 SFSESFFVSNNKDDVM---LWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQH 118
S S F S + + LWH RLGHP + LK++ P + F H CE K
Sbjct: 440 PNSSSCFSSKSHLGISFNTLWHARLGHPHTRALKLMLPNISFDHT----SCEACILGKHC 495
Query: 119 RSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*D 178
+S FP K F ++HSDVW ++ +D + I + Y + D
Sbjct: 496 KSVFPKSLTIYEKCFDLVHSDVWTSPCVS----RDNNKYFVTFINEKSKYTWITLLPSKD 551
Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILT-----QF*ETFF---FMENGIVQQSTCVS 230
++ N Y ++ ++ + ++ F + GI+ Q++C
Sbjct: 552 RVFEAFTN------FETYVTNQFNAKIKVFRTDNGGEYTSQKFRDHLAKRGIIHQTSCPY 605
Query: 231 SPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL---NKLYVISCFKP*DS 287
+PQQNG+ ERKNRHL+E+AR+++F K G + C L V+S P +
Sbjct: 606 TPQQNGVAERKNRHLMEVARSMMFHTSVPKRFWGDAVLTACYLINRTPTKVLSDLSPFEV 665
Query: 288 SRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
N + F F I P Q ++ +++K +F+GYS T+KGYKC D
Sbjct: 666 LNNTKPFIDHLRVFGCVCFVLI------PGEQRSKLD-AKSTKCMFLGYSTTQKGYKCFD 718
Query: 348 LNSKRFLVTMDVTFF*K*TFFFRTIIFKGGNQMKIHLIFFEDLILFENMFMSHSSRPFVS 407
R ++ DV F + + K +K D + + H
Sbjct: 719 PTKNRTFISRDVKFLENQDYNNK----KDWENLKDLTHSTSDRVETLKFLLDHLG----- 769
Query: 408 KENAPDNVSEHTPSMSEDVTKLVATNQNSNNDSLEPNDNQELIQMSLHEHPYNETERKFG 467
N + ++H P M++D L NQ + SL+ +N +Q E P N E
Sbjct: 770 --NDSTSTTQHQPEMTQDQEDL---NQENEEVSLQHQENLTHVQ----EDPPNTQE---- 816
Query: 468 EVEGTWKGIIYGRRNHDKVVEDLIPQHSHESEPRENQP 505
H + V+++ S + EP + P
Sbjct: 817 ---------------HSEHVQEIQDDSSEDEEPTQVLP 839
>gb|AAT40550.1| putative receptor kinase [Solanum demissum]
Length = 1358
Score = 132 bits (333), Expect = 5e-29
Identities = 115/378 (30%), Positives = 163/378 (42%), Gaps = 54/378 (14%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
NL SVS+L C FFD L QD +G+ I + ++ GLYYL
Sbjct: 425 NLASVSRLTKALHCSITFFDDFFLMQDRSTGQMIGTGHESQGLYYLTS------------ 472
Query: 62 SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
S S + D L H RLGH S L+ + P L S+ CE + K R++
Sbjct: 473 --SNSLAACSITDSPDLIHKRLGHSSLSKLQKMVPSL---SSLSTLDCESCQLGKHTRAT 527
Query: 122 FPVQTYKPSK-LFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKL 180
F T S+ +FS++HSD+WGP+R++S G + I D+ K
Sbjct: 528 FSRSTEGRSESIFSLVHSDIWGPSRVSSTL---GFRYFVSFIDDY------------SKC 572
Query: 181 SRILLN*CRLSLIPLYK--FSELTMELNIL--------------TQF*ETFFFMENGIVQ 224
+ + L R L ++K F+E+ + + +QF E F GI+
Sbjct: 573 TWVFLMKDRSELFSIFKSFFAEIQNQFGVSIRTFRSDNALEYLSSQFRE--FMTHQGIIH 630
Query: 225 QSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP 284
Q+TC +PQQNG+ ERKNRHL+E AR LL G + C L S +
Sbjct: 631 QTTCPYTPQQNGVAERKNRHLIETARTLLLESNVPLRFWGDAVLTSCYLINRMPSSSIQN 690
Query: 285 *DSSRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYK 344
L P R +H P + RA K VF+GYS +KGY+
Sbjct: 691 QVPHSILFPQSHLYPIPPRVFGSTCFVHNLAP---GKDKLAPRALKCVFLGYSRVQKGYR 747
Query: 345 CLDLNSKRFLVTMDVTFF 362
C + R+L++ DVTFF
Sbjct: 748 CYSHDLHRYLMSADVTFF 765
>ref|NP_918613.1| polyprotein [Oryza sativa (japonica cultivar-group)]
Length = 1554
Score = 130 bits (328), Expect = 2e-28
Identities = 112/371 (30%), Positives = 177/371 (47%), Gaps = 38/371 (10%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
NLLSVS I +C F + CLFQ+ +G+ I + + GL+Y+ E +LG +
Sbjct: 469 NLLSVSSAIDQLKCIVVFDENSCLFQEKWTGRRIGTGVRRDGLWYINHE-----ELGLAA 523
Query: 62 SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
V + + ++ L H +LGHPSF+ L ++P LF D C+ E K RS+
Sbjct: 524 ------VVGDVEKEISLLHCQLGHPSFEILSKLYPDLFSRVDKHRLVCDACELGKHTRST 577
Query: 122 FPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMII--LDFIGYIY*KKNQR*DK 179
+ + +LF +IHSDVWGP + S G + I + +IY K++ +
Sbjct: 578 YVGIGLRNCELFILIHSDVWGPCPVTSV---SGFKWFVTFIDCHTRMTWIYMLKHK--SE 632
Query: 180 LSRILLN*CRL------SLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQ 233
+ R + +L + + + + T +N +F + + GI+ Q+TC +P
Sbjct: 633 VLRCFQDFHKLVTTQFDAKVKIIRTDNGTEYIN--NEF--VSYVSDEGIIHQTTCPGTPP 688
Query: 234 QNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTL-NKL--YVISCFKP*DSSRN 290
QNG+ ERKNRHLLE+AR+L+F K L + L N++ ++ P +
Sbjct: 689 QNGVAERKNRHLLEVARSLMFQMNVPKYLWSEAVMTAAYLINRMPSRILGMKSPAELLLG 748
Query: 291 LLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNS 350
+ F F +R H + + A K VFVGY+ ++KGYKC D
Sbjct: 749 KREFKVPPKVFGCVCF--VRDH-----RPSVGKLDPHAVKCVFVGYASSQKGYKCWDPIG 801
Query: 351 KRFLVTMDVTF 361
+R V+MDVTF
Sbjct: 802 RRLFVSMDVTF 812
>dbj|BAA97099.1| retroelement pol polyprotein-like [Arabidopsis thaliana]
Length = 1098
Score = 127 bits (320), Expect = 2e-27
Identities = 115/374 (30%), Positives = 162/374 (42%), Gaps = 43/374 (11%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
C L+SV+KL+ C F DT C QD + I + ++ G+YY L G
Sbjct: 438 CTLISVAKLLKHTGCVAIFTDTLCFLQDRFTRTLIGAGEEREGVYYFTGVLAARVNKG-- 495
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQ----CEIYEFSK 116
F ES LWH RLGHPS L + FP+ F S + C+I +K
Sbjct: 496 --FKES-------SSATLWHHRLGHPSTGVL-LSFPE--FASSSSDLEIIKSCDICYRAK 543
Query: 117 QHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR 176
Q R F K + F +IH DVWGP R P G L I+ DF ++
Sbjct: 544 QAREVFSPSLNKTTVCFELIHCDVWGPYRT---PASCGSVYFLTIVDDFSRSVWTFLMAE 600
Query: 177 *DKLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNG 236
++SR++ N C +S K + N FF E GI+ Q++CV + QQNG
Sbjct: 601 KSEVSRLIRNFCAMSERQFCKSIKTVHSDNGTEFMCLKSFFQEQGIIHQTSCVDTRQQNG 660
Query: 237 ITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKIL- 295
ERK+RH+L +AR LF + G + L +R KIL
Sbjct: 661 RVERKHRHILNVARTCLFQSHLPRKFRGESILTAIHL-------------INRTPTKILH 707
Query: 296 SKCPY----FSRFAFKNIR----MHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
K PY SR ++ +R + Y + + R+ + VFVGY +KG++ D
Sbjct: 708 GKSPYEVLFGSRPSYSALRTFGCLCYAHYRARDKDKFSERSRRCVFVGYPYGKKGWRLYD 767
Query: 348 LNSKRFLVTMDVTF 361
L +F V+ DV F
Sbjct: 768 LEKNKFFVSRDVVF 781
>gb|AAB87099.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|7444418|pir||T00499 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1496
Score = 121 bits (304), Expect = 1e-25
Identities = 106/368 (28%), Positives = 160/368 (42%), Gaps = 30/368 (8%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
C L+SVSKL+ F DT C QD I + ++ G+YY TG ++
Sbjct: 434 CTLISVSKLLKQTSSIAIFTDTFCFLQDRFLRTLIGAGEEREGVYYF-----TGVLAPRV 488
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQ-CEIYEFSKQHR 119
S F +S + LWH RLGHPS L + F C+ SKQ R
Sbjct: 489 HKASSDFAISGD-----LWHRRLGHPSTSVLLSLPECNRSSQGFDKIDSCDTCFRSKQTR 543
Query: 120 SSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DK 179
FP+ K + FS+IH DVWGP R P+ G L ++ D+ ++ +
Sbjct: 544 EVFPISNNKTMECFSLIHGDVWGPYRT---PSTTGAVYFLTLVDDYSRSVWTYLMSSKTE 600
Query: 180 LSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITE 239
+S+++ N C +S K + N T +F +GI+ Q++CV +PQQNG E
Sbjct: 601 VSQLIKNFCAMSERQFGKQVKAFRTDNGTEFMCLTPYFQTHGILHQTSCVDTPQQNGRVE 660
Query: 240 RKNRHLLEMARALLF------FH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLK 293
RK+RH+L +ARA LF IL IN L + ++ R
Sbjct: 661 RKHRHILNVARACLFQGNLPVKFWGESILTATHLINRTPSAVLKGKTPYELLFGERPSYD 720
Query: 294 ILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRF 353
+L F + +IR + + +R+ K VF+GY +K ++ DL + +
Sbjct: 721 MLRS---FGCLCYAHIR-------PRNKDKFTSRSRKCVFIGYPHGKKAWRVYDLETGKI 770
Query: 354 LVTMDVTF 361
+ DV F
Sbjct: 771 FASRDVRF 778
>gb|AAG51258.1| Ty1/copia-element polyprotein [Arabidopsis thaliana]
gi|25403501|pir||H86486 protein Ty1/copia-element
polyprotein [imported] - Arabidopsis thaliana
Length = 1152
Score = 121 bits (303), Expect = 1e-25
Identities = 114/374 (30%), Positives = 170/374 (44%), Gaps = 40/374 (10%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
C L+SV++L+ + C F D C+ QD S I ++ G+Y+L+ +
Sbjct: 446 CTLISVARLLRELHCFAIFTDKVCVIQDRTSKMLIGVGTESNGVYHLQ----------RA 495
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQ------CEIYEF 114
+ S V K + LWH+RLGHPS K L V P L DF S C++
Sbjct: 496 EVVATSANVVKWKTNKALWHMRLGHPSSKVLSSVLPSL---EDFDSCSSDLKTICDVCVR 552
Query: 115 SKQHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKN 174
+KQ R+SF K + FS IH DVWGP + + + G L I+ D ++
Sbjct: 553 AKQTRASFSESFNKAEECFSFIHYDVWGPYK---HASSCGAHYFLTIVDDHSRAVWIHLM 609
Query: 175 QR*DKLSRILLN*CRLSLIPLYKFSELTMELNILTQF*E-TFFFMENGIVQQSTCVSSPQ 233
+++ +L ++ K T+ N T+F +F E GIV Q +CV + Q
Sbjct: 610 LAKSEVASLLQQFIAMASRQFNK-QVKTVRSNNGTEFMSLKSYFAERGIVHQISCVYTHQ 668
Query: 234 QNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVIS-CFKP*DSSRNLL 292
QNG ERK+RH+L +AR+LLF + + L Y+I+ P +
Sbjct: 669 QNGRVERKHRHILNVARSLLF-----QAELPISFWEESVLTAAYLINRTPTPILDGKTPY 723
Query: 293 KILSKCP--YFSRFAFKNI---RMHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
KIL P Y S F ++ R H T + R K +FVGY +KG++ D
Sbjct: 724 KILYSQPPSYASLRVFGSLCFARKH-----TGRLDKFQERGRKCIFVGYPHGQKGWRIYD 778
Query: 348 LNSKRFLVTMDVTF 361
+ S+ F V+ DV F
Sbjct: 779 IESQIFFVSRDVVF 792
>gb|AAD23883.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301674|pir||D84639 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1156
Score = 120 bits (302), Expect = 2e-25
Identities = 104/344 (30%), Positives = 157/344 (45%), Gaps = 39/344 (11%)
Query: 28 DSISGKTIVSAKKNGGLYYLEDELETGHQLGQISSFSESFFVSNNKDDVMLWHLRLGHPS 87
D S I S ++ G+YYL D ++SS D LWH RLGHPS
Sbjct: 52 DRFSRTLIGSGEERDGVYYLTDVATAKIHTAKVSS------------DQALWHQRLGHPS 99
Query: 88 FKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSSFPVQTYKPSKLFSIIHSDVWGPNRIN 147
F L + S C++ +KQ R FPV T K + FS+IH DVWGP R+
Sbjct: 100 FSVLSSLPVLTSSSLSVGSRSCDVCFRAKQTREVFPVSTNKSIECFSLIHCDVWGPYRV- 158
Query: 148 SYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKLSRILLN*CRLSLIPLYKFSELTMELNI 207
P+ G L I+ DF ++ ++ +L N +Y + + +
Sbjct: 159 --PSSCGAVYFLTIVDDFSRAVWTYLLLAKSEVRTVLTN------FLVYTEKQFGKSVKV 210
Query: 208 LTQF*ETFF------FMENGIVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKI 261
L T F F E+GIV Q++CV +PQQNG ERK+RH+L +ARA+LF +
Sbjct: 211 LRSDNGTEFMCLASYFREHGIVHQTSCVGTPQQNGRVERKHRHILNVARAILF-----QA 265
Query: 262 LMG*GCINCCTLNKLYVISCFKP*DSSRNLLKILSKCPYFSRFAFKNIRMH----YFCP* 317
+ L Y+I+ + S N L + + S+ ++++R+ Y
Sbjct: 266 SLPIQFWGEAVLTAAYLIN--RTPTSLHNGLSPY-EILHNSKPNYEHLRVFGSACYVHRA 322
Query: 318 TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVTMDVTF 361
++ + R+ VF+GY +KG+K D+ K FLV+ DV F
Sbjct: 323 SRDKDKFGERSRLCVFIGYPFAQKGWKVFDMEKKEFLVSRDVVF 366
>dbj|BAD99220.1| polypeptide with an integrase domain [Petunia x hybrida]
Length = 492
Score = 119 bits (298), Expect = 5e-25
Identities = 94/305 (30%), Positives = 141/305 (45%), Gaps = 30/305 (9%)
Query: 71 NNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSSFPVQTYKPS 130
++ D+ LWH RLGH F +K + + +F C+I ++Q + FP T K
Sbjct: 8 DSMDESKLWHFRLGHLPFHAMKTIKTLPVTVDNKQTFPCDICPMARQSKPPFPSSTIKSK 67
Query: 131 KLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIG----YIY*KKNQR*DKLSRILLN 186
+ F +IH D WGP + PT G L I+ DF Y+ K+ L + LL+
Sbjct: 68 QCFELIHIDTWGPYNV---PTYKGERYFLTIVDDFSRATWTYLLTTKSNAFATL-KSLLS 123
Query: 187 *CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITERKNRHLL 246
K + + + F GI+ Q+TCV +PQQNG+ ERK+RHLL
Sbjct: 124 LIERQFSSKVKIIRSDNAYELGSGVIPSEFLASLGIIHQTTCVGTPQQNGVVERKHRHLL 183
Query: 247 EMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKILS-KCPYFSRFA 305
E RALL+ K G C L Y+I+ F K+L+ KCPY F
Sbjct: 184 ETCRALLYQSHLPKKFWG-----DCLLTATYLINRFPS--------KVLNGKCPYQVLFG 230
Query: 306 ----FKNIR----MHYFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVTM 357
+ +++ + + T+ + RA VF+GY +KGYK L+L + + +V+
Sbjct: 231 SLPDYSHLKSFGSLCFVSTLTRHRDKLMPRAIPGVFLGYPFAQKGYKVLNLQTSQVIVSR 290
Query: 358 DVTFF 362
DV FF
Sbjct: 291 DVKFF 295
>gb|AAT38747.1| putative polyprotein [Solanum demissum]
Length = 1336
Score = 117 bits (293), Expect = 2e-24
Identities = 88/259 (33%), Positives = 125/259 (47%), Gaps = 28/259 (10%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
NL+SVSKL + +C + HCLFQD ++ + I + GLY L++ I
Sbjct: 430 NLISVSKLTKELKCFVSLYPDHCLFQDLMTKQIIGKRHVSDGLYILDEWTPPSVACSSIV 489
Query: 62 SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
S E+ H RLGHPS LK + P+ H+ S CE F+K HR S
Sbjct: 490 SPFEA-------------HCRLGHPSLPVLKKLCPQF---HNVPSIDCESCHFAKHHRIS 533
Query: 122 F-PVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDF--IGYIY*KKNQR*D 178
P + + F ++HSDVWGP + S K G + + DF + +IY KN+
Sbjct: 534 LSPRNNKRANFAFELVHSDVWGPCPVVS---KVGFRYFVTFMDDFSRMTWIYFMKNR--S 588
Query: 179 KLSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFF---FMENGIVQQSTCVSSPQQN 235
++ N C + + S + + +F F + GI+ QS+CV +P QN
Sbjct: 589 EVFSHFSNFC-AEIKTQFNASVHILRSDNAREFMSASFQNYMNQYGILHQSSCVDTPSQN 647
Query: 236 GITERKNRHLLEMARALLF 254
G+ ERKNRHLLE AR LLF
Sbjct: 648 GVAERKNRHLLETARVLLF 666
Score = 39.3 bits (90), Expect = 0.72
Identities = 18/42 (42%), Positives = 27/42 (63%)
Query: 327 RASKRVFVGYSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFF 368
+A K VF+GYS +KGY+C R++V++DV F +FF
Sbjct: 736 KALKCVFLGYSRLQKGYRCYSPTLNRYMVSIDVVFSESISFF 777
>pir||E96608 probable retroelement polyprotein F25P12.89 [imported] -
Arabidopsis thaliana gi|9954746|gb|AAG09097.1| Putative
retroelement polyprotein [Arabidopsis thaliana]
Length = 1486
Score = 110 bits (275), Expect = 3e-22
Identities = 105/374 (28%), Positives = 167/374 (44%), Gaps = 45/374 (12%)
Query: 1 CNLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQI 60
C+L+SVS+L +RC D C+ QD + I + ++ GLY+ +ET +
Sbjct: 441 CHLISVSQLTRTRRCIFQITDKVCIVQDRTTLMLIGAGRELNGLYFFRG-VETAAAV--- 496
Query: 61 SSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRS 120
S LWH RLGHPS K L ++ F S CEI +KQ R
Sbjct: 497 --------TSKALPSSQLWHQRLGHPSSKALHLLPFSDVTSSTFDSKTCEICIQAKQTRD 548
Query: 121 SFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGYIY*KKNQR*DKL 180
FP+ + K S F ++H D+WGP R S G L ++ D+ ++ +
Sbjct: 549 PFPLSSNKTSFAFELVHCDLWGPYRTTSI---CGSRYFLTLVDDYSRAVWLYLLPSKQEA 605
Query: 181 SRILLN*CRLSLIPLYKFSELTMELNILTQF*ETF-----FFMENGIVQQSTCVSSPQQN 235
+ L N I L + T I + F FF + GI+ +++CV +PQQN
Sbjct: 606 PKHLKN-----FIALVERQYTTNIKMIRSDNGSEFICLSDFFAQKGIIHETSCVGTPQQN 660
Query: 236 GITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKIL 295
G ERK+RH+L +ARAL F + + + C L Y+I+ + LLK
Sbjct: 661 GRVERKHRHILNVARALRF-----QSGLPIEFWSYCALTAAYLIN-----RTPTPLLK-- 708
Query: 296 SKCPYFSRF----AFKNIRMH----YFCP*TQTNRQT*TRASKRVFVGYSPTRKGYKCLD 347
K P+ + ++IR+ Y + +R++K +F+GY +KG++ +
Sbjct: 709 GKTPFELIYNRPPPLQHIRIFGCICYVHNLKHGGDKFASRSNKSIFLGYPFAKKGWRVYN 768
Query: 348 LNSKRFLVTMDVTF 361
+ + V+ DV F
Sbjct: 769 IETGVVSVSRDVVF 782
>gb|AAD24600.1| putative retroelement pol polyprotein [Arabidopsis thaliana]
gi|25301700|pir||G84542 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana
Length = 1333
Score = 108 bits (270), Expect = 1e-21
Identities = 98/375 (26%), Positives = 164/375 (43%), Gaps = 48/375 (12%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
+L+S+ +L+ + RC D + QD S + + ++ GG ++ +
Sbjct: 327 DLISIGQLMDENRCVLQMSDRFLVVQDRTSRMVMGAGRRVGGTFHFRS-----------T 375
Query: 62 SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIV-FPKLFFGHDFSSFQCEIYEFSKQHRS 120
+ S V K+ LWH R+GHP+ + + ++ + + C++ +KQ R+
Sbjct: 376 EIAASVTVKEEKN-YELWHSRMGHPAARVVSLIPESSVSVSSTHLNKACDVCHRAKQTRN 434
Query: 121 SFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIG----YIY*KKNQR 176
SFP+ K ++F +I+ D+WGP R P+ G L II D+ Y+ K++
Sbjct: 435 SFPLSINKTLRIFELIYCDLWGPYRT---PSHTGARYFLTIIDDYSRGVWLYLLNDKSEA 491
Query: 177 *DKLSRILLN*CRLSLIPLYKF-SELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQN 235
L R + + S+ E LT+ FF E G++ + +CV++P++N
Sbjct: 492 PCHLKNFFAMTDRQFNVKIKTVRSDNGTEFLCLTK-----FFQEQGVIHERSCVATPERN 546
Query: 236 GITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLKIL 295
ERK+RHLL +ARAL F G C L Y+I +R +L
Sbjct: 547 DRVERKHRHLLNVARALRFQANLPIQFWG-----ECVLTAAYLI--------NRTPSSVL 593
Query: 296 SKCPYFSRFAFKNIRMHY------FCP*TQTNR---QT*TRASKRVFVGYSPTRKGYKCL 346
+ + R K R + C NR + R+ + VFVGY +KG++
Sbjct: 594 NDSTPYERLHKKQPRFDHLRVFGSLCYAHNRNRGGDKFAERSRRCVFVGYPHGQKGWRLF 653
Query: 347 DLNSKRFLVTMDVTF 361
DL F V+ DV F
Sbjct: 654 DLEQNEFFVSRDVVF 668
>gb|AAU89730.1| putative polyprotein [Solanum tuberosum]
Length = 1280
Score = 100 bits (249), Expect = 3e-19
Identities = 68/194 (35%), Positives = 101/194 (52%), Gaps = 20/194 (10%)
Query: 70 SNNKDDVMLWHLRLGHPSFKYLK----IVFPKLFFGHDFSSFQCEIYEFSKQHRSSFPVQ 125
+N D+ LWH+RLGH F +K I FP + S + C + ++Q+R FPV
Sbjct: 467 ANVVSDIALWHVRLGHLPFSAMKNLDFISFPSV------SPYICPVCPKARQNRLPFPVS 520
Query: 126 TYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIG----YIY*KKNQR*DKLS 181
+ K ++F +IH D WGP + T DG + L I+ DF +I K+ L
Sbjct: 521 SIKSKRIFELIHIDTWGPFNTS---THDGYNYFLTIVDDFSRGTWTFILKTKSNAFPVLK 577
Query: 182 RILLN*CRLSLIPLYKF-SELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITER 240
L R + + + S+ +EL +Q ET F GI+ + +CV++PQQNG+ ER
Sbjct: 578 DFLAMVERQFELKVQRIRSDNALELGRGSQ--ETMFLHSQGILHERSCVATPQQNGVVER 635
Query: 241 KNRHLLEMARALLF 254
K++HLLE AR L F
Sbjct: 636 KHKHLLEAARGLFF 649
>emb|CAB10225.1| retrovirus-related like polyprotein [Arabidopsis thaliana]
gi|7268152|emb|CAB78488.1| retrovirus-related like
polyprotein [Arabidopsis thaliana]
gi|7488175|pir||G71406 probable retrovirus-related
polyprotein - Arabidopsis thaliana
Length = 1489
Score = 99.0 bits (245), Expect = 8e-19
Identities = 104/364 (28%), Positives = 158/364 (42%), Gaps = 43/364 (11%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
NL+SVS L+ C HF+ CL Q+ G I G LY+ LET + S
Sbjct: 529 NLMSVSSLVKTISCSAHFYVDCCLIQELSQGLMI----GRGRLYHNLYILETENTSPSTS 584
Query: 62 SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
+ + F + +D LWH RLGHPS +V KL R +
Sbjct: 585 TPAACLFTGSVLNDGHLWHQRLGHPS----SVVLQKL-------------------KRLA 621
Query: 122 FPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDF--IGYIY*KKNQR*DK 179
+ S F ++H D+WGP I S +G L ++ D ++Y +N++
Sbjct: 622 YISHNNLASNPFDLVHLDIWGPFSIESI---EGFRYFLTVVDDCTRTTWVYMLRNKK--D 676
Query: 180 LSRILLN*CRLSLIPLYKFSELTMELNILTQF*ETFFFMENGIVQQSTCVSSPQQNGITE 239
+S + +L + + + + + T E+G++ +C +PQQN + E
Sbjct: 677 VSSVFPEFIKL-VSTQFNAKIKAIRSDNAPELGFTEIVKEHGMLHHFSCAYTPQQNSVVE 735
Query: 240 RKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNKLYVISCFKP*DSSRNLLK-ILSKC 298
RK++H+L +ARALLF S I M +C T + P ++++ + IL+K
Sbjct: 736 RKHQHILNVARALLF---QSNIPMQ-YWSDCVTTAVFLINRLPSPLLNNKSPYELILNKQ 791
Query: 299 PYFSRFAFKNIRMHYFCP*TQTNRQT*T-RASKRVFVGYSPTRKGYKCLDLNSKRFLVTM 357
P +S KN F R T RA VF+GY KGYK LDL S V+
Sbjct: 792 PDYS--LLKNFGCLCFVSTNAHERTKFTPRARACVFLGYPSGYKGYKVLDLESHSVTVSR 849
Query: 358 DVTF 361
+V F
Sbjct: 850 NVVF 853
>gb|AAP53905.1| putative pol polyprotein [Oryza sativa (japonica cultivar-group)]
gi|37534632|ref|NP_921618.1| putative pol polyprotein
[Oryza sativa (japonica cultivar-group)]
Length = 1688
Score = 94.7 bits (234), Expect = 1e-17
Identities = 101/394 (25%), Positives = 160/394 (39%), Gaps = 55/394 (13%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSA---KKNGGLYYLEDELETGHQLG 58
NL+SV +L D C F DT C QD +G I + K++ GLY L+
Sbjct: 248 NLISVGQLT-DTNCFVGFDDTSCFVQDRHTGAVIGTGHRQKRSCGLYILDSLSLPSSSTN 306
Query: 59 QISSFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHD--FSSFQCEIYEFSK 116
S +S S WH RLGH L + + G ++F C+ + K
Sbjct: 307 TPSVYSP--MCSTACKSFPQWHHRLGHLCGSRLATLINQGVLGSVPVDTTFVCKGCKLGK 364
Query: 117 QHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSYPTKDGLSPLLMIILDFIGY--IY*KKN 174
Q + +P T + S+ F ++HSDVWG + +P+K G + ++ + D+ Y IY K+
Sbjct: 365 QVQLPYPSSTSRSSRPFDLVHSDVWGK---SPFPSKGGHNYYVIFVDDYSRYTWIYFMKH 421
Query: 175 QR*DKLSRILLN*CRLSLIPLYK-FSELTMELNILTQF*ETF------------------ 215
R LI +Y+ F+++ I TQF
Sbjct: 422 --------------RSQLISIYQSFAQM-----IHTQFSSAIRIFRSDSGGEYMSNAFRE 462
Query: 216 FFMENGIVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GCINCCTLNK 275
F + G + Q +C + QNG+ ERK+RH++E AR LL L
Sbjct: 463 FLVSQGTLPQLSCPGAHAQNGVAERKHRHIIETARTLLIASFVPAHFWAEAISTAVYLIN 522
Query: 276 LYVISCFKP*DSSRNLLKILSKCPYFSRFAFKNIRMHYFCP*TQTNRQT*TRASKRVFVG 335
+ S + L P + + + + T ++ + VF+G
Sbjct: 523 MQPSSSLQGRSPGEVL---FGSPPRYDHLRVFGCTCYVLLAPRERTKLT-AQSVECVFLG 578
Query: 336 YSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFFF 369
YS KGY+C D +++R ++ DVTF FF+
Sbjct: 579 YSLEHKGYRCYDPSARRIRISRDVTFDENKPFFY 612
>gb|AAC33963.1| contains similarity to reverse transcriptases (Pfam; rvt.hmm,
score: 11.19) [Arabidopsis thaliana]
gi|7486705|pir||T01879 hypothetical protein F8M12.17 -
Arabidopsis thaliana
Length = 1633
Score = 86.3 bits (212), Expect = 5e-15
Identities = 115/431 (26%), Positives = 178/431 (40%), Gaps = 63/431 (14%)
Query: 32 GKTIVSAKKNGGLYYLEDELETGHQLGQISSFSESFFVSNNKDDVMLWHLRLGHPSFKYL 91
G I K LY LE Q +SFS S + ++ HPS L
Sbjct: 473 GLMIGRGKTYNNLYILET---------QRTSFSPSLPAATSR-----------HPSLPAL 512
Query: 92 KIVFPKLFFGHDFSSF--QCEIYEFSKQHRSSFPVQTYKPSKLFSIIHSDVWGPNRINSY 149
+ + + SS C I +KQ R ++ S F +IH D+WGP I S
Sbjct: 513 QKLVSSIPSLKSVSSTASHCRISPLAKQKRLAYVSHNNLASSPFDLIHLDIWGPFSIESV 572
Query: 150 PTKDGLSPLLMIILDFIG--YIY*KKNQR*DKLSRILLN*CRLSLIPLYKFSELTMELNI 207
DG L ++ D ++Y KN+ ++S I +L + Y + +
Sbjct: 573 ---DGFRYFLTLVDDCTRTTWVYMMKNK--SEVSNIFPVFVKL-IFTQYNAKIKAIRSDN 626
Query: 208 LTQF*ETFFFMENGIVQQSTCVSSPQQNGITERKNRHLLEMARALLFFH*SSKILMG*GC 267
+ + T F E G++ Q +C +PQQN + ERK++HLL +AR+LLF S + +
Sbjct: 627 VKELAFTKFVKEQGMIHQFSCAYTPQQNSVVERKHQHLLNIARSLLF---QSNVPLQ--Y 681
Query: 268 INCCTLNKLYVISCFKP*--DSSRNLLKILSKCPYFSR------FAFKNIR-MHYFCP*T 318
+ C L Y+I+ D+ +L K P ++ +A N+ + F P
Sbjct: 682 WSDCVLTAAYLINRLPSPLLDNKTPFELLLKKIPDYTLLKSCLCYASTNVHDRNKFSP-- 739
Query: 319 QTNRQT*TRASKRVFVGYSPTRKGYKCLDLNSKRFLVTMDVTFF*K*TFFFRTIIFKGGN 378
RA VF+GY KGYK LDL S +T +V F + F F+T F
Sbjct: 740 --------RARPCVFLGYPSGYKGYKVLDLESHSISITRNVVFH-ETKFPFKTSKF---- 786
Query: 379 QMKIHLIFFEDLIL-FENMFMSHSSRPFVSKENAPDN--VSEHTPSMSEDVTKLVATNQN 435
+K + F + IL S P A DN + ++ S + + L +T
Sbjct: 787 -LKESVDMFPNSILPLPAPLHFVESMPLDDDLRADDNNASTSNSASSASSIPPLPSTVNT 845
Query: 436 SNNDSLEPNDN 446
N D+L+ + N
Sbjct: 846 QNTDALDIDTN 856
>gb|AAO26691.1| gag-pol polyprotein [Vitis vinifera]
Length = 450
Score = 81.6 bits (200), Expect = 1e-13
Identities = 51/148 (34%), Positives = 71/148 (47%), Gaps = 17/148 (11%)
Query: 2 NLLSVSKLIWDKRCQTHFFDTHCLFQDSISGKTIVSAKKNGGLYYLEDELETGHQLGQIS 61
NL+SVSKL + C FF HC+FQD ++ +T + GLY L++ + +
Sbjct: 316 NLISVSKLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTA 375
Query: 62 SFSESFFVSNNKDDVMLWHLRLGHPSFKYLKIVFPKLFFGHDFSSFQCEIYEFSKQHRSS 121
S E+ H RLGHPS LK + P+ S C+ F+K HRSS
Sbjct: 376 SPVEA-------------HCRLGHPSLPVLKKLCPQF---DTLPSLDCKSCHFAKHHRSS 419
Query: 122 F-PVQTYKPSKLFSIIHSDVWGPNRINS 148
P + LF ++HSDVWGP + S
Sbjct: 420 LGPRLNKRAESLFELVHSDVWGPCPVTS 447
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.355 0.158 0.554
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,528,345,794
Number of Sequences: 2540612
Number of extensions: 57345355
Number of successful extensions: 313861
Number of sequences better than 10.0: 533
Number of HSP's better than 10.0 without gapping: 440
Number of HSP's successfully gapped in prelim test: 93
Number of HSP's that attempted gapping in prelim test: 312252
Number of HSP's gapped (non-prelim): 1308
length of query: 1075
length of database: 863,360,394
effective HSP length: 139
effective length of query: 936
effective length of database: 510,215,326
effective search space: 477561545136
effective search space used: 477561545136
T: 11
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.6 bits)
S2: 81 (35.8 bits)
Medicago: description of AC148227.2