
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC137079.15 + phase: 0 /pseudo
(864 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 437 e-122
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 433 e-121
BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Ara... 171 2e-42
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 170 2e-42
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 163 3e-40
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 143 3e-34
BM307983 137 2e-32
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 136 5e-32
TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 ... 129 6e-30
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 127 2e-29
CO982036 125 8e-29
BQ081067 weakly similar to GP|23495377|dbj orf490 {Oryza sativa ... 89 2e-27
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei... 118 1e-26
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 91 2e-25
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 114 2e-25
AI959950 112 9e-25
BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {... 72 8e-24
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 103 3e-22
TC232995 102 7e-22
AW185460 102 7e-22
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 437 bits (1123), Expect = e-122
Identities = 282/857 (32%), Positives = 431/857 (49%), Gaps = 13/857 (1%)
Frame = +1
Query: 20 SCNSVFTDCFDVWHMRLGHVSSSGLSVISKQFPF--IPCIK--NAPPCDACHYAKQKRLP 75
+C S D +WH R GH+ G+ I + IP +K C C KQ ++
Sbjct: 2038 TCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMS 2217
Query: 76 FPHSSIK---SSAPFDLLHADLWGPYSTPSFLGHKYFLTLVDDYSRFTWVIFLKTKDETQ 132
H ++ +S +LLH DL GP S G +Y +VDD+SRFTWV F++ K ET
Sbjct: 2218 --HQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSETF 2391
Query: 133 KHLKHFISYVENQFHTTLKCLRSDNGSEF--IAMTSFLLSKGIIHHKTCVETPQQNGVVE 190
+ K ++ + +K +RSD+G EF T F S+GI H + TPQQNG+VE
Sbjct: 2392 EVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVE 2571
Query: 191 RKHQHILNVARSLAFHSHVPITMWNFTVQHAVHIINRIPSPLLKFKSPFELLHKEPPSII 250
RK++ + AR + +P +W + A +I NR+ + +E+ PS+
Sbjct: 2572 RKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPSVK 2751
Query: 251 HLKVFGCLAYASTLQAHRTKFNPRARKTIFLGFKEGTKGSILYDLNSNELFVSRNVIFYE 310
H +FG Y + R K +P++ IFLG+ ++ +++ + + S NV+ +
Sbjct: 2752 HFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVV-D 2928
Query: 311 NHFPFTLATKQANIPTTSSHIDLGDPITDLSPHPISAPEFQLTSTPPSQYVSAPAVQHAI 370
+ P + ++ T+ GD + D + S +++
Sbjct: 2929 DLSPARKKDVEEDVRTS------GDNVADAAK-------------------SGENAENSD 3033
Query: 371 PVTD--SISEPTVRKSTRISQRPSYLADYHCNLPSKSCSNVSSGISSYPLSSFLSYDNCS 428
TD +I++P R STRI + + + P++ + S + S
Sbjct: 3034 SATDESNINQPDKRSSTRIQKM--HPKELIIGDPNRGVTTRSREVEIVSNS--------- 3180
Query: 429 PTYTHFCCTISSINEPKTFAQANKSECWREAMTTELNALAKNNTWSVVTLPPGKVPIGCK 488
C +S I EPK +A E W AM EL +N W +V P G IG K
Sbjct: 3181 -------CFVSKI-EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTK 3336
Query: 489 WVYKVKYHANGSIERYKARLVAQGYTQTEGVDYFDTFSPVAKLTTIRVLLSLAAIKGWHL 548
W++K K + G I R KARLVAQGYTQ EGVD+ +TF+PVA+L +IR+LL +A I + L
Sbjct: 3337 WIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKL 3516
Query: 549 EQLDVNNAFLHGDLHEEVYMALPPGY--PTINSSQVCKLNKSLYGLKQASRQWYSKLSTS 606
Q+DV +AFL+G L+EEVY+ P G+ PT + V +L K+LYGLKQA R WY +L+
Sbjct: 3517 YQMDVKSAFLNGYLNEEVYVEQPKGFADPT-HPDHVYRLKKALYGLKQAPRAWYERLTEF 3693
Query: 607 LISFGYTQSLADYSLFVKVSGASFTALLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDL 666
L GY + D +LFVK + +YVDDIV G ++ + ++F + +
Sbjct: 3694 LTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLV 3873
Query: 667 GQLRYFLGFEIARSKSGILLNQRKYTLELLEDAGTLGSKPAATPFDPSTKLGATTGTPFT 726
G+L YFLG ++ + + I L+Q +Y +++ G + TP KL
Sbjct: 3874 GELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSV 4053
Query: 727 DASSYRRLIGRLLYLTNTRPDISYSVQNLSQFVSRPMVPHYQAAQRILKYLKSSPAKGLF 786
D S YR +IG LLYLT +RPDI+Y+V +++ + P + H +RILKY+ + G+
Sbjct: 4054 DQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISHLTQVKRILKYVNGTSDYGIM 4233
Query: 787 FSSSSELKLHGFADSDWACCPDTRRSVTGYCVLLGSSLISWKSKKQSTVSRSSTEAEYRA 846
+ S L G+ D+DWA D R+S +G C LG++LISW SKKQ+ VS S+ EAEY A
Sbjct: 4234 YCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNLISWFSKKQNCVSLSTAEAEYIA 4413
Query: 847 LAHLTCELQWLNYLFHD 863
+L W+ + +
Sbjct: 4414 AGSSCSQLVWMKQMLKE 4464
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 433 bits (1114), Expect = e-121
Identities = 276/844 (32%), Positives = 424/844 (49%), Gaps = 11/844 (1%)
Frame = +1
Query: 31 VWHMRLGHVSSSGLSVISKQFPF--IPCIK--NAPPCDACHYAKQKRLPFPHSSIK---S 83
+WH R GH+ G+ I + IP +K C C KQ ++ H ++ +
Sbjct: 2074 IWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGKQVKMS--HQKLQHQTT 2247
Query: 84 SAPFDLLHADLWGPYSTPSFLGHKYFLTLVDDYSRFTWVIFLKTKDETQKHLKHFISYVE 143
S +LLH DL GP S G +Y +VDD+SRFTWV F++ K +T + K ++
Sbjct: 2248 SRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKSDTFEVFKELSLRLQ 2427
Query: 144 NQFHTTLKCLRSDNGSEF--IAMTSFLLSKGIIHHKTCVETPQQNGVVERKHQHILNVAR 201
+ +K +RSD+G EF T F S+GI H + TPQQNG+VERK++ + AR
Sbjct: 2428 REKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVERKNRTLQEAAR 2607
Query: 202 SLAFHSHVPITMWNFTVQHAVHIINRIPSPLLKFKSPFELLHKEPPSIIHLKVFGCLAYA 261
+ +P +W + A +I NR+ + +E+ P++ H +FG Y
Sbjct: 2608 VMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKPTVKHFHIFGSPCYI 2787
Query: 262 STLQAHRTKFNPRARKTIFLGFKEGTKGSILYDLNSNELFVSRNVIFYENHFPFTLATKQ 321
+ R K +P++ IFLG+ ++ +++ + + S NV+ ++ P +
Sbjct: 2788 LADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVVV-DDLTPARKKDVE 2964
Query: 322 ANIPTTSSHIDLGDPITDLSPHPISAPEFQLTSTPPSQYVSAPAVQHAIPVTDSISEPTV 381
++ T+ GD + D + +A + P+ I++P
Sbjct: 2965 EDVRTS------GDNVADTAKSAENAENSDSATDEPN-----------------INQPDK 3075
Query: 382 RKSTRISQRPSYLADYHCNLPSKSCSNVSSGISSYPLSSFLSYDNCSPTYTHFCCTISSI 441
R S RI + + + P++ + S I S C +S I
Sbjct: 3076 RPSIRIQKM--HPKELIIGDPNRGVTTRSREIEIVSNS----------------CFVSKI 3201
Query: 442 NEPKTFAQANKSECWREAMTTELNALAKNNTWSVVTLPPGKVPIGCKWVYKVKYHANGSI 501
EPK +A E W AM EL +N W +V P G IG KW++K K + G I
Sbjct: 3202 -EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVI 3378
Query: 502 ERYKARLVAQGYTQTEGVDYFDTFSPVAKLTTIRVLLSLAAIKGWHLEQLDVNNAFLHGD 561
R KARLVAQGYTQ EGVD+ +TF+PVA+L +IR+LL +A I + L Q+DV +AFL+G
Sbjct: 3379 TRNKARLVAQGYTQIEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGY 3558
Query: 562 LHEEVYMALPPGY--PTINSSQVCKLNKSLYGLKQASRQWYSKLSTSLISFGYTQSLADY 619
L+EE Y+ P G+ PT + V +L K+LYGLKQA R WY +L+ L GY + D
Sbjct: 3559 LNEEAYVEQPKGFVDPT-HPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDK 3735
Query: 620 SLFVKVSGASFTALLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDLGQLRYFLGFEIAR 679
+LFVK + +YVDDIV G ++ + ++F + +G+L YFLG ++ +
Sbjct: 3736 TLFVKQDAENLMIAQIYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQ 3915
Query: 680 SKSGILLNQRKYTLELLEDAGTLGSKPAATPFDPSTKLGATTGTPFTDASSYRRLIGRLL 739
+ I L+Q KY +++ G + TP KL D S YR +IG LL
Sbjct: 3916 MEDSIFLSQSKYAKNIVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLL 4095
Query: 740 YLTNTRPDISYSVQNLSQFVSRPMVPHYQAAQRILKYLKSSPAKGLFFSSSSELKLHGFA 799
YLT +RPDI+Y+V +++ + P + H +RILKY+ + G+ + S+ L G+
Sbjct: 4096 YLTASRPDITYAVGVCARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYC 4275
Query: 800 DSDWACCPDTRRSVTGYCVLLGSSLISWKSKKQSTVSRSSTEAEYRALAHLTCELQWLNY 859
D+DWA D R+S +G C LG++LISW SKKQ+ VS S+ EAEY A +L W+
Sbjct: 4276 DADWAGSADDRKSTSGGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQ 4455
Query: 860 LFHD 863
+ +
Sbjct: 4456 MLKE 4467
>BI317550 weakly similar to GP|9759590|dbj| polyprotein-like {Arabidopsis
thaliana}, partial (18%)
Length = 421
Score = 171 bits (432), Expect = 2e-42
Identities = 83/139 (59%), Positives = 105/139 (74%)
Frame = -2
Query: 587 KSLYGLKQASRQWYSKLSTSLISFGYTQSLADYSLFVKVSGASFTALLVYVDDIVLAGNC 646
KSLYGLKQASR+WY KL+ L+ GY QS++DYSLF G +FTALLVYVDDI+LAG+
Sbjct: 420 KSLYGLKQASRKWYEKLTNLLLKEGYIQSISDYSLFTLTKGNTFTALLVYVDDIILAGDS 241
Query: 647 ISEIKSVKTFLDNKFCIKDLGQLRYFLGFEIARSKSGILLNQRKYTLELLEDAGTLGSKP 706
I E +K LD F IK+LG+L+YFLG E+A S+ GI ++QRKY L+LL+D+G LG KP
Sbjct: 240 IDEFDRIKNVLDLAFKIKNLGKLKYFLGLEVAHSRLGITISQRKYCLDLLKDSGLLGCKP 61
Query: 707 AATPFDPSTKLGATTGTPF 725
A+TP D S KL + GTP+
Sbjct: 60 ASTPLDTSIKLHSAAGTPY 4
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 170 bits (431), Expect = 2e-42
Identities = 79/130 (60%), Positives = 97/130 (73%), Gaps = 1/130 (0%)
Frame = -2
Query: 476 VTLPPGKVPIGCKWVYKVKYHANGSIERYKARLVAQGYTQTEGVDYFDTFSPVAKLTTIR 535
V LPPGK P+GC+WVY VK G ++R KARLVA+GYTQ G+DY DTFSPVAKLTT+R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 536 VLLSLAAIKGWHLEQLDVNNAFLHGDLHEEVYMALPPGYPTINS-SQVCKLNKSLYGLKQ 594
+ L++AAI W L QLD+ NAFLHGDL E++YM PPG+ VCKL++SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 595 ASRQWYSKLS 604
+ R W+ K S
Sbjct: 46 SPRAWFGKFS 17
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 163 bits (413), Expect = 3e-40
Identities = 74/134 (55%), Positives = 95/134 (70%)
Frame = +3
Query: 438 ISSINEPKTFAQANKSECWREAMTTELNALAKNNTWSVVTLPPGKVPIGCKWVYKVKYHA 497
+SS+ P T +A WR+AM E+ AL N TW +V LPPGK +GC+WVY VK
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 498 NGSIERYKARLVAQGYTQTEGVDYFDTFSPVAKLTTIRVLLSLAAIKGWHLEQLDVNNAF 557
NG ++R KARLVA+GYTQ G++Y DTFSPV LTT+R+ L++AAI+ W L QLD+ NAF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 558 LHGDLHEEVYMALP 571
LHGDL E++YM P
Sbjct: 363 LHGDLEEDIYMEQP 404
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 143 bits (361), Expect = 3e-34
Identities = 73/151 (48%), Positives = 100/151 (65%), Gaps = 1/151 (0%)
Frame = -3
Query: 459 AMTTELNALAKNNTWSVVTLPPGKVPIGCKWVYKVKYHANGSIERYKARLVAQGYTQTEG 518
AM ELN +NN W +V P IG KWV++ K +G I R KARLVA+GY Q EG
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 519 VDYFDTFSPVAKLTTIRVLLSLAAIKGWHLEQLDVNNAFLHGDLHEEVYMALPPGYPTIN 578
+DY +T++PVA+L IR+LL+ +I + L Q+DV +AFL+G + EEVY+ PPG+ +
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 579 S-SQVCKLNKSLYGLKQASRQWYSKLSTSLI 608
+ V KL K+LYGLKQA R WY ++S L+
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFLL 6
>BM307983
Length = 406
Score = 137 bits (346), Expect = 2e-32
Identities = 66/133 (49%), Positives = 92/133 (68%), Gaps = 2/133 (1%)
Frame = +2
Query: 485 IGCKWVYKVKYHANGSIERYKARLVAQGYTQTEGVDYFDTFSPVAK-LTTIRVLLSLAAI 543
+GC+W+Y VKY A+ +++RYKARLVA+GY QT G+DY +TF+ K + + A
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 544 KGWHLEQLDVNNAFLHGDLHEEVYMALPPGYPTIN-SSQVCKLNKSLYGLKQASRQWYSK 602
GW + Q DV NAFLHG L EEVYM +PPGY N ++VC+L K+LYGLKQ+ R W+ +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 603 LSTSLISFGYTQS 615
+ +++S GY QS
Sbjct: 362 FTQAMLSLGYKQS 400
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 136 bits (342), Expect = 5e-32
Identities = 83/179 (46%), Positives = 109/179 (60%), Gaps = 2/179 (1%)
Frame = +1
Query: 498 NGSIERYKARLVAQGYTQTEGVDYFDTFSPVAKLTTIRVLLSLAAIKGWHLEQLDVNNAF 557
+G+I+++KARLVA+ YTQ G DY TFSPVAK+ + +L S+A + W L LD NAF
Sbjct: 25 SGTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAF 204
Query: 558 LHGDLHEEVYMALPPGYPT--INSSQVCKLNKSLYGLKQASRQWYSKLSTSLISFGYTQS 615
LHG L EEVYM P G+ +S+ VC+L +S YGLKQ+ R W + I Y
Sbjct: 205 LHGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGAAI--WYDSH 378
Query: 616 LADYSLFVKVSGASFTALLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDLGQLRYFLG 674
AD+S+F S L+VYVDDI + G+ I +K L +F KDLG+LRYFLG
Sbjct: 379 EADHSVFYCHSPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLG 555
>TC222001 similar to UP|C716_NEPRA (O04164) Cytochrome P450 71A6 (Fragment)
, partial (21%)
Length = 912
Score = 129 bits (324), Expect = 6e-30
Identities = 58/117 (49%), Positives = 78/117 (66%)
Frame = -2
Query: 209 VPITMWNFTVQHAVHIINRIPSPLLKFKSPFELLHKEPPSIIHLKVFGCLAYASTLQAHR 268
+P WN+ + HA ++IN IP+P L+ SP+E LH P I HL++FGCL YAST++A+R
Sbjct: 911 MPPNFWNYALLHAAYLINCIPTPFLQNTSPYERLHGHIPDISHLRIFGCLCYASTIKANR 732
Query: 269 TKFNPRARKTIFLGFKEGTKGSILYDLNSNELFVSRNVIFYENHFPFTLATKQANIP 325
K PRA IF+GFK TKG +LYDL+S+ + SRNV+FYENH + T P
Sbjct: 731 KKLEPRAHPCIFIGFKPNTKGYMLYDLHSHNIITSRNVVFYENHDAMSFNTSSLTAP 561
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial
(16%)
Length = 662
Score = 127 bits (319), Expect = 2e-29
Identities = 62/98 (63%), Positives = 75/98 (76%), Gaps = 2/98 (2%)
Frame = +3
Query: 769 AAQRILKYLKSSPAKGLFFSSSSELKLHGFADSDWACCPDTRRSVTGYCVLLGSSLISWK 828
AA R+LKYLK P KGL FS S +++ GF+D+DWA C D+ +S+T YC LGSSLISWK
Sbjct: 18 AATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLISWK 197
Query: 829 SKKQSTVSR--SSTEAEYRALAHLTCELQWLNYLFHDL 864
+KKQ+TVSR SS+EA+YRAL TCELQWL YL DL
Sbjct: 198 AKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDL 311
>CO982036
Length = 674
Score = 125 bits (314), Expect = 8e-29
Identities = 74/203 (36%), Positives = 117/203 (57%), Gaps = 3/203 (1%)
Frame = -2
Query: 633 LLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDLGQLRYFLGFEIARSKSGILLNQRKYT 692
LLVYVD I++ G+ + I+++ + L++ F +K LG+L YF+ E+ +S +L + R
Sbjct: 646 LLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEV-KSMPDLLFSLRTSI 473
Query: 693 LELLEDAGTLGSKPAATPFDPSTKLGATTGTPFTDASSYRRLIGRLLYLTNTRPDISYSV 752
E+ ++P ++P + KL + F+ + YR ++G L Y T RP+IS++V
Sbjct: 472 FEIFCRKPR*QAQPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTVIRPEISFAV 293
Query: 753 QNLSQFVSRPMVPHYQAAQRILKYLKSSPAKGLFFS---SSSELKLHGFADSDWACCPDT 809
+ QF+S P+ H+ +RIL+YLK S + GL SS L + GF D+DWA D
Sbjct: 292 NKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCDADWASAVDD 113
Query: 810 RRSVTGYCVLLGSSLISWKSKKQ 832
+RS +G V LG +LISW KQ
Sbjct: 112 KRSTSGAAVFLGPNLISWWXXKQ 44
>BQ081067 weakly similar to GP|23495377|dbj orf490 {Oryza sativa (japonica
cultivar-group)}, partial (18%)
Length = 430
Score = 89.0 bits (219), Expect(2) = 2e-27
Identities = 41/89 (46%), Positives = 60/89 (67%)
Frame = +1
Query: 443 EPKTFAQANKSECWREAMTTELNALAKNNTWSVVTLPPGKVPIGCKWVYKVKYHANGSIE 502
EP T QA S WR+AM + +AL +N T ++ +LP GK I CKWV+++K + G++
Sbjct: 31 EPSTVKQALISPPWRQAMQADFDALMENKTLTLTSLPSGKAAIDCKWVFRIKENLYGTLN 210
Query: 503 RYKARLVAQGYTQTEGVDYFDTFSPVAKL 531
RY++RLVA+G+ G DY +TFSPV +L
Sbjct: 211 RYRSRLVAKGFHLKFGCDYSETFSPVIEL 297
Score = 53.1 bits (126), Expect(2) = 2e-27
Identities = 24/42 (57%), Positives = 31/42 (73%)
Frame = +3
Query: 533 TIRVLLSLAAIKGWHLEQLDVNNAFLHGDLHEEVYMALPPGY 574
TIR++L +A W L+Q+D+NNAFLHG L EEVYM PG+
Sbjct: 300 TIRLILFIALTNHWPLQQVDINNAFLHGLLTEEVYMVQLPGF 425
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 118 bits (296), Expect = 1e-26
Identities = 63/181 (34%), Positives = 105/181 (57%), Gaps = 2/181 (1%)
Frame = +2
Query: 33 HMRLGHVSSSGLSVISKQFPFIPCIKNAPPCDACHYAKQKRLPFPHSSIKSSAPFDLLHA 92
H RLGH S L ++ P + IK+ C++C K R H + +PF ++H
Sbjct: 317 HERLGHPHLSKLKIM---VPSLEKIKDLF-CESCQLGKHVRSSXRHVESRVDSPFLVIHX 484
Query: 93 DLWGPYSTPSFLGHKYFLTLVDDYSRFTWVIFLKTKDETQKHLKHFISYVENQFHTTLKC 152
D+WGP S + ++YF+T +D++S+ T V +K + E L ++ ++ QF T+K
Sbjct: 485 DIWGPNRVSS-MSYRYFVTFIDEFSQCTRVFLMKERSEILSFLTS-VNKIKTQFGKTIKI 658
Query: 153 LRSDNGSEFIA--MTSFLLSKGIIHHKTCVETPQQNGVVERKHQHILNVARSLAFHSHVP 210
LRSDN E+ + ++ F ++GI+H +C TPQQN + ERK++H++ AR+L H++ P
Sbjct: 659 LRSDNAKEYFSSVISPFXSAQGILHQFSCPHTPQQNDIAERKNRHLVETARTLLLHANEP 838
Query: 211 I 211
I
Sbjct: 839 I 841
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 91.3 bits (225), Expect(2) = 2e-25
Identities = 59/186 (31%), Positives = 90/186 (47%)
Frame = +3
Query: 629 SFTALLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDLGQLRYFLGFEIARSKSGILLNQ 688
+F + +YVDDI+ K + + F G+L++ LG +I + GI ++Q
Sbjct: 483 TFLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQ 662
Query: 689 RKYTLELLEDAGTLGSKPAATPFDPSTKLGATTGTPFTDASSYRRLIGRLLYLTNTRPDI 748
KYT L+ +KP ATP ST + T Y +I L YLT++RPDI
Sbjct: 663 EKYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDI 842
Query: 749 SYSVQNLSQFVSRPMVPHYQAAQRILKYLKSSPAKGLFFSSSSELKLHGFADSDWACCPD 808
+ V ++F S P + H A +RIL+YL + L+F SE L G+ D +A
Sbjct: 843 VFVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKV 1022
Query: 809 TRRSVT 814
R+S +
Sbjct: 1023ERKSTS 1040
Score = 43.9 bits (102), Expect(2) = 2e-25
Identities = 19/36 (52%), Positives = 27/36 (74%)
Frame = +2
Query: 589 LYGLKQASRQWYSKLSTSLISFGYTQSLADYSLFVK 624
+YGLKQA R WY +LS+ L+S G+T+ + D +LF K
Sbjct: 362 VYGLKQALRAWYERLSSFLVSNGFTRGITDPALFRK 469
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 114 bits (284), Expect = 2e-25
Identities = 60/138 (43%), Positives = 87/138 (62%), Gaps = 1/138 (0%)
Frame = +2
Query: 615 SLADYSLFV-KVSGASFTALLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDLGQLRYFL 673
S AD+S+F S L+VYVDDI++ ++I +K L N F KDL L+YFL
Sbjct: 8 SEADHSVFYCHTSPGKCVYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFL 187
Query: 674 GFEIARSKSGILLNQRKYTLELLEDAGTLGSKPAATPFDPSTKLGATTGTPFTDASSYRR 733
G E+A+S G++++QRKY L++LE+ G + +P DP+ KL A + D YRR
Sbjct: 188 GIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRR 367
Query: 734 LIGRLLYLTNTRPDISYS 751
L+G+L+YLT TRPDIS++
Sbjct: 368 LVGKLIYLTITRPDISFA 421
>AI959950
Length = 466
Score = 112 bits (279), Expect = 9e-25
Identities = 61/131 (46%), Positives = 82/131 (62%), Gaps = 1/131 (0%)
Frame = -1
Query: 458 EAMTTELNALAKNNTWSVVTLPPGKVPIGCKWVYKVKYHANGSIERYKARLVAQGYTQTE 517
+AM EL+ KNN +V LP K +G KW++ K +G + RYKARLVA+GY+Q E
Sbjct: 394 KAMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQE 215
Query: 518 GVDYFDTFSPVAKLTTIRVLLSLAAIKGWHLEQLDVNNAFLHGDLHEEVYMALPPGYPTI 577
G+DY TF+ VA+L I +LLS A L Q+DV +AFL+G + +EVY+ PPG+
Sbjct: 214 GIDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFENE 35
Query: 578 NSSQ-VCKLNK 587
Q V KLNK
Sbjct: 34 TLHQHVFKLNK 2
>BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 437
Score = 72.0 bits (175), Expect(2) = 8e-24
Identities = 34/64 (53%), Positives = 47/64 (73%)
Frame = +2
Query: 633 LLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDLGQLRYFLGFEIARSKSGILLNQRKYT 692
L+VYVDDIV+ GN +I +K L + F KDLG+ YFLG E+A+SK GI+++QRKY
Sbjct: 41 LMVYVDDIVITGNDQGKIAQLKGHLFSHFQTKDLGKFEYFLGIEVAQSKDGIIISQRKYA 220
Query: 693 LELL 696
L++L
Sbjct: 221 LDIL 232
Score = 57.8 bits (138), Expect(2) = 8e-24
Identities = 28/71 (39%), Positives = 42/71 (58%)
Frame = +1
Query: 696 LEDAGTLGSKPAATPFDPSTKLGATTGTPFTDASSYRRLIGRLLYLTNTRPDISYSVQNL 755
+ G +P + DP+ KL G P++D+ YR L+G+L+YLT TRP+IS+ V +
Sbjct: 220 IRHTGMSDCRPIDSLMDPNKKLLPNQGKPYSDSERYRILVGKLIYLTITRPNISFVVGVV 399
Query: 756 SQFVSRPMVPH 766
SQF+ P H
Sbjct: 400 SQFMQSPHNDH 432
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein,
partial (7%)
Length = 804
Score = 103 bits (257), Expect = 3e-22
Identities = 52/141 (36%), Positives = 84/141 (58%), Gaps = 3/141 (2%)
Frame = +1
Query: 727 DASSYRRLIGRLLYLTNTRPDISYSVQNLSQFVSRPMVPHYQAAQRILKYLKSSPAKGLF 786
D + +RRLIG L YL N+RP+I ++V +S+F+ RP + H QAA+R+L+ +K + G+
Sbjct: 10 DVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVL 189
Query: 787 F---SSSSELKLHGFADSDWACCPDTRRSVTGYCVLLGSSLISWKSKKQSTVSRSSTEAE 843
F + S + L G+ DSDW P+ +S GY + + ++ SKKQ ++ S+ EAE
Sbjct: 190 FPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAE 369
Query: 844 YRALAHLTCELQWLNYLFHDL 864
Y A + C+ W+ L +L
Sbjct: 370 YVAASLGACQAVWMMNLLEEL 432
>TC232995
Length = 1009
Score = 102 bits (254), Expect = 7e-22
Identities = 57/170 (33%), Positives = 90/170 (52%), Gaps = 1/170 (0%)
Frame = +2
Query: 571 PPGYPTINS-SQVCKLNKSLYGLKQASRQWYSKLSTSLISFGYTQSLADYSLFVKVSGAS 629
PPG+ + + V KL K+LYGLKQA R WY +LS L+ +++ D +LF+K
Sbjct: 11 PPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRKHND 190
Query: 630 FTALLVYVDDIVLAGNCISEIKSVKTFLDNKFCIKDLGQLRYFLGFEIARSKSGILLNQR 689
+ +YVDDI+ S K + ++F + +G+L+YFLG +I +++ GI +NQ
Sbjct: 191 ILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFINQS 370
Query: 690 KYTLELLEDAGTLGSKPAATPFDPSTKLGATTGTPFTDASSYRRLIGRLL 739
KY EL++ G +K +TP + L D YR IG ++
Sbjct: 371 KYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520
>AW185460
Length = 411
Score = 102 bits (254), Expect = 7e-22
Identities = 53/126 (42%), Positives = 79/126 (62%), Gaps = 2/126 (1%)
Frame = +2
Query: 725 FTDASSYRRLIGRLLYLTN--TRPDISYSVQNLSQFVSRPMVPHYQAAQRILKYLKSSPA 782
F D R LY+ + TRPDI Y+ LS+F+ P H+ A +RIL+YL+ + A
Sbjct: 29 FMDRGFRRSKSEPTLYIKSQATRPDIMYATSLLSRFMQSPSQIHFGAGKRILRYLQGTKA 208
Query: 783 KGLFFSSSSELKLHGFADSDWACCPDTRRSVTGYCVLLGSSLISWKSKKQSTVSRSSTEA 842
G+++++ + +L G+ DSDWA D +S +GY LGS + SW SKKQ+TV++S+ EA
Sbjct: 209 FGIWYTTETNSELLGYTDSDWAGSTDDMKSTSGYAFSLGSGMFSWASKKQATVAQSTAEA 388
Query: 843 EYRALA 848
EY A+A
Sbjct: 389 EYVAVA 406
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.134 0.410
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 44,980,155
Number of Sequences: 63676
Number of extensions: 747910
Number of successful extensions: 4791
Number of sequences better than 10.0: 143
Number of HSP's better than 10.0 without gapping: 4623
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 4747
length of query: 864
length of database: 12,639,632
effective HSP length: 105
effective length of query: 759
effective length of database: 5,953,652
effective search space: 4518821868
effective search space used: 4518821868
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)
Medicago: description of AC137079.15