
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148918.3 + phase: 0
(1351 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 801 0.0
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 799 0.0
TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement p... 256 5e-68
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 218 2e-56
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 181 2e-45
BI321712 166 6e-41
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 146 7e-41
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 164 2e-40
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 159 8e-39
TC232995 150 4e-36
BU549979 148 2e-35
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 144 3e-34
CO981347 96 3e-34
BG508993 143 4e-34
AI959950 137 3e-32
TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (F... 134 2e-31
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 132 8e-31
CO983516 130 4e-30
AW185460 130 4e-30
BM307983 126 6e-29
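
The summary rows above can be read programmatically. The following is a minimal sketch, assuming each row ends with the bit score and then the E-value, and that the first token is the database identifier; the function name `parse_hit_line` is hypothetical and is not part of BLAST.

```python
def parse_hit_line(line):
    """Split one summary row into (identifier, description, bits, e_value).

    Assumption: the last two whitespace-separated tokens are the bit score
    and the E-value; everything between the identifier and those tokens is
    the (possibly truncated, possibly empty) description.
    """
    head, bits, evalue = line.rsplit(None, 2)
    parts = head.split(None, 1)
    ident = parts[0]
    desc = parts[1] if len(parts) > 1 else ""
    # Some BLAST versions print E-values such as "e-100" with no mantissa.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return ident, desc, float(bits), float(evalue)


# Three rows copied from the summary above.
rows = [
    "TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 799 0.0",
    "TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement p... 256 5e-68",
    "BI321712 166 6e-41",
]
hits = [parse_hit_line(r) for r in rows]
```

Rows without a description, such as BI321712, yield an empty description string rather than failing.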
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 801 bits (2069), Expect = 0.0
Identities = 449/1140 (39%), Positives = 654/1140 (56%), Gaps = 29/1140 (2%)
Frame = +1
Query: 238 GRGRGSFRGRGRGNFNQWRDNNYNNFNPSHQGKGGNNFGSNNRGRGRGYYNQERTNNGCF 297
G+ G+ RG G + + R F P+ G +R G +R C
Sbjct: 1342 GKNVGNQRGLGFNHKSAGR-TTMTEFVPAKNSTGATMSQHRSRHHGTQQKKSKRKKWRCH 1518
Query: 298 NCGKYGHKAADCRYKHQANMAENSYQHFGESSQNQH------------SLFLASNTLSEE 345
CGKYGH C + H + H G S + SL + ++ +
Sbjct: 1519 YCGKYGHIKPFCYHLH-------GHPHHGTQSSSSGRKMMWVPKHKIVSLVVHTSLRASA 1677
Query: 346 ENIWYLDTGCSNHMCGKKELFSSLDETVKSTVKFGNNSNIPIEGKGQIAIRLKDGSQNFI 405
+ WYLD+GCS HM G KE +++ S V FG+ S I G G++ + DG + +
Sbjct: 1678 KEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKL---VHDGLPS-L 1845
Query: 406 GDVFYAPGLHHNLLSMGQLSEKDYNMQIHKGYCTLIDGNGRFITKVKMSHNRLFPLRIQH 465
V GL NL+S+ QL ++ +N+ K C + + + K S + + Q
Sbjct: 1846 NKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQE 2025
Query: 466 DQFSCLSSIIPNDDW-LWHMRFGHFHFSGLNYLSRKEYVSGLPVVKIPSG-VCETCQMGK 523
+S D+ +WH RFGH H G+ + K V G+P +KI G +C CQ+GK
Sbjct: 2026 TSYSSTCLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQIGK 2205
Query: 524 KHRESFPTGKSWRAKKLLEIVHSDLCS-VEIPTPGGCRYFITFIDDFSRKAWVYFLKQKS 582
+ + S + ++LE++H DL +++ + GG RY +DDFSR WV F+++KS
Sbjct: 2206 QVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIREKS 2385
Query: 583 EAVDSFKTFKAFVEKQSGCPIKALRTDRGQEYLVG--TDFFEQHGIQHQLTTRYTPQQNG 640
+ + FK ++++ C IK +R+D G+E+ T+F GI H+ + TPQQNG
Sbjct: 2386 DTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNG 2565
Query: 641 VAERKNRTIMDMVRCMLKAKQMPKEFWAEAVATAVYILNRCPTKSVQEKTPEEAGSGRRP 700
+ ERKNRT+ + R ML AK++P WAEA+ TA YI NR + T E GR+P
Sbjct: 2566 IVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGRKP 2745
Query: 701 SIRHLRVFGCIAYAHVPDQIRKKLDDKGERCIFIGYCSNSKAYKLYNPETKKVIISRDVT 760
+++H +FG Y + R+K+D K + IF+GY +NS+AY+++N T+ V+ S +V
Sbjct: 2746 TVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESINVV 2925
Query: 761 FDEGGMWNWSSKSQKEPIVTPNDYEEED----EHVDTTPDEPDEPETSNREKRN--RRLP 814
D+ + K +E + T D + E+ + + DEP + +KR R
Sbjct: 2926 VDD--LTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPSIRIQK 3099
Query: 815 ARLQDCVLGTDN------DPSDEEIINFALFADCEPVTFEEASRDENWIKAMDEEINAIE 868
++ ++G N E + N + EP +EA DE WI AM EE+ +
Sbjct: 3100 MHPKELIIGDPNRGVTTRSREIEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEELEQFK 3279
Query: 869 KNKTWELTELPPDKKPIGVKWVYKTKYKPSGEIDRYKARLVAKGYKQKPGIDYFEVFAPV 928
+N+ WEL P IG KW++K K G I R KARLVA+GY Q G+D+ E FAPV
Sbjct: 3280 RNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFAPV 3459
Query: 929 ARLDTIRMLISLSAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAGYVVRGKEDKVYRLKK 988
ARL++IR+L+ ++ +K++QMDVKSAFLNG L EE YVEQP G+V D VYRLKK
Sbjct: 3460 ARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDHVYRLKK 3639
Query: 989 ALYGLKQAPRAWYKKIDSYFIQNGFQRCPFEHTLYIKFIDPGDVLIVCLYVDDLIFTGNN 1048
ALYGLKQAPRAWY+++ + Q G+++ + TL++K D +++I +YVDD++F G +
Sbjct: 3640 ALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQ-DAENLMIAQIYVDDIVFGGMS 3816
Query: 1049 SKMIAEFRGAMISYFEMTDLGLMSYFLGIEVIQQKDGIFISQKKYASDILKKFKMEHSKP 1108
++M+ F M S FEM+ +G ++YFLG++V Q +D IF+SQ KYA +I+KKF ME++
Sbjct: 3817 NEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGMENASH 3996
Query: 1109 ISTPVEEKLKLTRESDGKRVDSTHYKSLIGSLRYLTATRPDIVYGVGLLSRYMEDPCVSH 1168
TP LKL+++ G VD + Y+S+IGSL YLTA+RPDI Y VG+ +RY +P +SH
Sbjct: 3997 KRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKISH 4176
Query: 1169 LQGAKRILRYIKGTLTEGIFYGNNSDVKLVGYTDSDWAGDTETRKSTSGYAFHLGTGAIS 1228
L KRIL+Y+ GT GI Y + SD LVGY D+DWAG + RKSTSG F+LGT IS
Sbjct: 4177 LNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLGTNLIS 4356
Query: 1229 WSSKKQHVVALSTAEAEYITATSCATQTVWLRRILEVMHHEQNTPTKIYCDNKSAIALSK 1288
W SKKQ+ V+LSTAEAEYI A S +Q VW++++L+ + EQ+ T +YCDN SAI +SK
Sbjct: 4357 WFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMT-LYCDNMSAINISK 4533
Query: 1289 NPVFHGRSKHIDIRFHKIRELIAEKEVVIEYCPTKEQIADIFTKPLKIESFYKLKKMLGM 1348
NPV H R+KHIDIR H IR+L+ +K + +E+ T+EQIADIFTK L F KL+ LG+
Sbjct: 4534 NPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDANQFEKLRGKLGI 4713
Score = 104 bits (259), Expect = 3e-22
Identities = 64/189 (33%), Positives = 99/189 (51%)
Frame = +1
Query: 10 WSGPKLNSELDFNYWEFMMTTHLKAHNIWSYVESGLQQGADELARRRDQLALSQILQGID 69
W PK+ LD E T LK W+ E DELA + AL+ + G+D
Sbjct: 139 WEHPKM---LDT---EGKPTNELKPEEDWTKEE-------DELALGNSK-ALNALFNGVD 276
Query: 70 YSIFGKIANAKTSKEAWDILKLSHKGVEKAQKSKLQSLRREYERYEMSSSETVDQYFTRV 129
+IF I +K+AW+ILK +H+G K + S+LQ L ++E +M E + + +
Sbjct: 277 KNIFRLINTCTVAKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMNI 456
Query: 130 INIVNKMRVYGEDIQDSKVVEKILRTMPMKYDHVVTTILESHDTDTLSVAELQGSIESHV 189
+ I N GE + D K+V KILR++P ++D VT I E+ D + V EL GS+++
Sbjct: 457 LEIANACTALGERMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFE 636
Query: 190 NRILEKTEK 198
+ ++TEK
Sbjct: 637 LGLSDRTEK 663
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 799 bits (2063), Expect = 0.0
Identities = 462/1202 (38%), Positives = 680/1202 (56%), Gaps = 31/1202 (2%)
Frame = +1
Query: 178 VAELQGSIESHVNRILEKTEKVKEEALKSQVN-LNNVAESSQMGEARARDNFNNGGRGNF 236
+A+L+ E+H I E LK +V LN+ E+ +
Sbjct: 1183 IADLEAEKEAHKEEISE---------LKGEVGFLNSKLENMTKSIKMLNKGSDTLDEVLL 1335
Query: 237 RGRGRGSFRGRGRGNFNQWRDNNYNNFNPSHQGKGGNNFGSNNRGRGRGYYNQERTNNGC 296
G+ G+ RG G + R F P+ G +R G +R C
Sbjct: 1336 LGKNAGNQRGLGFNPKSAGR-TTMTEFVPAKNRTGATMSQHRSRHHGMQQKKSKRKKWRC 1512
Query: 297 FNCGKYGHKAADCRYKHQANMAENSYQHFGESSQNQH------------SLFLASNTLSE 344
CGKYGH C + H + H G S N SL + ++ +
Sbjct: 1513 HYCGKYGHIKPFCYHLH-------GHPHHGTQSSNSRKKMMWVPKHKAVSLVVHTSLRAS 1671
Query: 345 EENIWYLDTGCSNHMCGKKELFSSLDETVKSTVKFGNNSNIPIEGKGQIAIRLKDGSQNF 404
+ WYLD+GCS HM G KE +++ S V FG+ S I G G++ + DG +
Sbjct: 1672 AKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKL---VHDGLPS- 1839
Query: 405 IGDVFYAPGLHHNLLSMGQLSEKDYNMQIHKGYCTLIDGNGRFITKVKMSHNRLFPLRIQ 464
+ V GL NL+S+ QL ++ +N+ K C + + + K S + + Q
Sbjct: 1840 LNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWTPQ 2019
Query: 465 HDQFS--CLSSIIPNDDWLWHMRFGHFHFSGLNYLSRKEYVSGLPVVKIPSG-VCETCQM 521
+S CLSS ++ +WH RFGH H G+ + K V G+P +KI G +C CQ+
Sbjct: 2020 ETSYSSTCLSSK-EDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGECQI 2196
Query: 522 GKKHRESFPTGKSWRAKKLLEIVHSDLCS-VEIPTPGGCRYFITFIDDFSRKAWVYFLKQ 580
GK+ + S + ++LE++H DL +++ + GG RY +DDFSR WV F+++
Sbjct: 2197 GKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFIRE 2376
Query: 581 KSEAVDSFKTFKAFVEKQSGCPIKALRTDRGQEYLVG--TDFFEQHGIQHQLTTRYTPQQ 638
KSE + FK ++++ C IK +R+D G+E+ T+F GI H+ + TPQQ
Sbjct: 2377 KSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQ 2556
Query: 639 NGVAERKNRTIMDMVRCMLKAKQMPKEFWAEAVATAVYILNRCPTKSVQEKTPEEAGSGR 698
NG+ ERKNRT+ + R ML AK++P WAEA+ TA YI NR + T E GR
Sbjct: 2557 NGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWKGR 2736
Query: 699 RPSIRHLRVFGCIAYAHVPDQIRKKLDDKGERCIFIGYCSNSKAYKLYNPETKKVIISRD 758
+PS++H +FG Y + R+K+D K + IF+GY +NS+AY+++N T+ V+ S +
Sbjct: 2737 KPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMESIN 2916
Query: 759 VTFDEGGMWNWSSKSQKEPIVTPNDY----EEEDEHVDTTPDEPDEPETSNREKRNRRLP 814
V D+ + K +E + T D + E+ + + DE + +KR+
Sbjct: 2917 VVVDD--LSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSSTRI 3090
Query: 815 ARL--QDCVLGTDND-----PSDEEIINFALFAD-CEPVTFEEASRDENWIKAMDEEINA 866
++ ++ ++G N + EI++ + F EP +EA DE WI AM EE+
Sbjct: 3091 QKMHPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEELEQ 3270
Query: 867 IEKNKTWELTELPPDKKPIGVKWVYKTKYKPSGEIDRYKARLVAKGYKQKPGIDYFEVFA 926
++N+ WEL P IG KW++K K G I R KARLVA+GY Q G+D+ E FA
Sbjct: 3271 FKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDETFA 3450
Query: 927 PVARLDTIRMLISLSAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAGYVVRGKEDKVYRL 986
PVARL++IR+L+ ++ +K++QMDVKSAFLNG L EEVYVEQP G+ D VYRL
Sbjct: 3451 PVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVYRL 3630
Query: 987 KKALYGLKQAPRAWYKKIDSYFIQNGFQRCPFEHTLYIKFIDPGDVLIVCLYVDDLIFTG 1046
KKALYGLKQAPRAWY+++ + Q G+++ + TL++K D +++I +YVDD++F G
Sbjct: 3631 KKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQ-DAENLMIAQIYVDDIVFGG 3807
Query: 1047 NNSKMIAEFRGAMISYFEMTDLGLMSYFLGIEVIQQKDGIFISQKKYASDILKKFKMEHS 1106
+++M+ F M S FEM+ +G ++YFLG++V Q +D IF+SQ +YA +I+KKF ME++
Sbjct: 3808 MSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGMENA 3987
Query: 1107 KPISTPVEEKLKLTRESDGKRVDSTHYKSLIGSLRYLTATRPDIVYGVGLLSRYMEDPCV 1166
TP LKL+++ G VD + Y+S+IGSL YLTA+RPDI Y VG+ +RY +P +
Sbjct: 3988 SHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQANPKI 4167
Query: 1167 SHLQGAKRILRYIKGTLTEGIFYGNNSDVKLVGYTDSDWAGDTETRKSTSGYAFHLGTGA 1226
SHL KRIL+Y+ GT GI Y + S+ LVGY D+DWAG + RKSTSG F+LG
Sbjct: 4168 SHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLGNNL 4347
Query: 1227 ISWSSKKQHVVALSTAEAEYITATSCATQTVWLRRILEVMHHEQNTPTKIYCDNKSAIAL 1286
ISW SKKQ+ V+LSTAEAEYI A S +Q VW++++L+ + EQ+ T +YCDN SAI +
Sbjct: 4348 ISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMT-LYCDNMSAINI 4524
Query: 1287 SKNPVFHGRSKHIDIRFHKIRELIAEKEVVIEYCPTKEQIADIFTKPLKIESFYKLKKML 1346
SKNPV H R+KHIDIR H IR+L+ +K + +++ T+EQIADIFTK L F KL+ L
Sbjct: 4525 SKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFEKLRGKL 4704
Query: 1347 GM 1348
G+
Sbjct: 4705 GI 4710
Score = 104 bits (259), Expect = 3e-22
Identities = 63/209 (30%), Positives = 103/209 (49%), Gaps = 23/209 (11%)
Frame = +1
Query: 13 PKLNSELDFNYWEFMMTTHLKA--HNIWSYVESGLQ---------QGADEL------ARR 55
P + ++ YW+ M LK+ W V G + + DEL +
Sbjct: 37 PPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKE 216
Query: 56 RDQLALSQ------ILQGIDYSIFGKIANAKTSKEAWDILKLSHKGVEKAQKSKLQSLRR 109
D+LAL + G+D +IF I +K+AW+ILK++H+G K + S+LQ L
Sbjct: 217 EDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVKMSRLQLLAT 396
Query: 110 EYERYEMSSSETVDQYFTRVINIVNKMRVYGEDIQDSKVVEKILRTMPMKYDHVVTTILE 169
++E +M E + + ++ I N GE I D K+V KILR++P ++D VT I E
Sbjct: 397 KFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLPKRFDMKVTAIEE 576
Query: 170 SHDTDTLSVAELQGSIESHVNRILEKTEK 198
+ D + V EL GS+++ + ++ EK
Sbjct: 577 AQDICNMRVDELIGSLQTFELGLSDRAEK 663
>TC222253 similar to UP|Q9ZQE4 (Q9ZQE4) Copia-like retroelement pol
polyprotein, partial (4%)
Length = 919
Score = 256 bits (654), Expect = 5e-68
Identities = 124/177 (70%), Positives = 145/177 (81%)
Frame = +1
Query: 488 HFHFSGLNYLSRKEYVSGLPVVKIPSGVCETCQMGKKHRESFPTGKSWRAKKLLEIVHSD 547
+FHF + +++ V LP++ I GVC+TC++GKKHRESFPTGKSWR KKLL+IVH D
Sbjct: 250 NFHFLD*-IIFQEKIVYDLPIMNILDGVCDTCEIGKKHRESFPTGKSWRMKKLLKIVHLD 426
Query: 548 LCSVEIPTPGGCRYFITFIDDFSRKAWVYFLKQKSEAVDSFKTFKAFVEKQSGCPIKALR 607
LC+VEIPT G YFITFIDDFS+K WVYFLKQKSEA ++FK FKAF EKQ+GC +KAL
Sbjct: 427 LCTVEIPTHGDNNYFITFIDDFSKKMWVYFLKQKSEACNAFKMFKAFAEKQNGCKVKALI 606
Query: 608 TDRGQEYLVGTDFFEQHGIQHQLTTRYTPQQNGVAERKNRTIMDMVRCMLKAKQMPK 664
D+GQEYL T FFE+HGIQHQLTT+YTPQ NGV ERKN+TIMDMVRCMLKAKQ K
Sbjct: 607 IDKGQEYLSYTIFFEKHGIQHQLTTKYTPQHNGVTERKNKTIMDMVRCMLKAKQSVK 777
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 218 bits (554), Expect = 2e-56
Identities = 105/218 (48%), Positives = 147/218 (67%), Gaps = 3/218 (1%)
Frame = +1
Query: 1128 VDSTHYKSLIGSLRYLTATRPDIVYGVGLLSRYMEDPCVSHLQGAKRILRYIKGTLTEGI 1187
VD T ++ LIGSLRYL +RP+I + V L+SR+M+ P +SH+Q AKR+LR IKGT+ G+
Sbjct: 7 VDVTEFRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGV 186
Query: 1188 ---FYGNNSDVKLVGYTDSDWAGDTETRKSTSGYAFHLGTGAISWSSKKQHVVALSTAEA 1244
F + L+GYTDSDW D E KST GY F ++ SSKKQ V+ALST EA
Sbjct: 187 LFPFKAKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEA 366
Query: 1245 EYITATSCATQTVWLRRILEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIRFH 1304
EY+ A+ A Q VW+ +LE + + P + DNKSAI L+K+P HGRSKHI++RFH
Sbjct: 367 EYVAASLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFH 546
Query: 1305 KIRELIAEKEVVIEYCPTKEQIADIFTKPLKIESFYKL 1342
IR+ +++ V +EYC +EQ+AD+ TKP+++ F ++
Sbjct: 547 YIRDQVSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQI 660
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 181 bits (459), Expect = 2e-45
Identities = 86/152 (56%), Positives = 113/152 (73%)
Frame = -3
Query: 859 AMDEEINAIEKNKTWELTELPPDKKPIGVKWVYKTKYKPSGEIDRYKARLVAKGYKQKPG 918
AM EE+N E+N W+L E P + IG KWV++ K G I R KARLVAKGY Q+ G
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 919 IDYFEVFAPVARLDTIRMLISLSAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAGYVVRG 978
IDY E +APVARL+ IRML++ + N+K++QMDVKSAFLNG ++EEVYVEQP G+ +
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 979 KEDKVYRLKKALYGLKQAPRAWYKKIDSYFIQ 1010
K VY+L+KALYGLKQAPRAWY++I ++ ++
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>BI321712
Length = 399
Score = 166 bits (420), Expect = 6e-41
Identities = 80/132 (60%), Positives = 103/132 (77%)
Frame = -3
Query: 1031 DVLIVCLYVDDLIFTGNNSKMIAEFRGAMISYFEMTDLGLMSYFLGIEVIQQKDGIFISQ 1090
++ +CLYVDDLIFTGNN M EF+ M + FEMTD+GLM+Y+LGIEV Q+ GIFI+Q
Sbjct: 397 EIF*LCLYVDDLIFTGNNPSMFEEFKKDMSNEFEMTDMGLMAYYLGIEVKQEDKGIFITQ 218
Query: 1091 KKYASDILKKFKMEHSKPISTPVEEKLKLTRESDGKRVDSTHYKSLIGSLRYLTATRPDI 1150
+ YA ++LKKFKM+ + P+ TP+E KL++ G+ VD T YKSLIGSLRYLT TRPDI
Sbjct: 217 EGYAKEVLKKFKMDDANPVGTPMECGSKLSKHEKGENVDPTLYKSLIGSLRYLTCTRPDI 38
Query: 1151 VYGVGLLSRYME 1162
+Y VG++SRYME
Sbjct: 37 LYVVGVVSRYME 2
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 146 bits (369), Expect(2) = 7e-41
Identities = 74/184 (40%), Positives = 112/184 (60%)
Frame = +3
Query: 1033 LIVCLYVDDLIFTGNNSKMIAEFRGAMISYFEMTDLGLMSYFLGIEVIQQKDGIFISQKK 1092
LI+ +YVDD+IF + +M EF M FE + G + + LG+++IQ+ GIFI Q+K
Sbjct: 489 LIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQEK 668
Query: 1093 YASDILKKFKMEHSKPISTPVEEKLKLTRESDGKRVDSTHYKSLIGSLRYLTATRPDIVY 1152
Y LK+F+M+ +KP++TP+ + ++ G Y +I SL YLT++RPDIV+
Sbjct: 669 YTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIVF 848
Query: 1153 GVGLLSRYMEDPCVSHLQGAKRILRYIKGTLTEGIFYGNNSDVKLVGYTDSDWAGDTETR 1212
V L +R+ P +SH+ KRILRY+ GT +++ S+ L+GY D +AGD R
Sbjct: 849 VVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVER 1028
Query: 1213 KSTS 1216
KSTS
Sbjct: 1029 KSTS 1040
Score = 40.8 bits (94), Expect(2) = 7e-41
Identities = 17/36 (47%), Positives = 25/36 (69%)
Frame = +2
Query: 990 LYGLKQAPRAWYKKIDSYFIQNGFQRCPFEHTLYIK 1025
+YGLKQA RAWY+++ S+ + NGF R + L+ K
Sbjct: 362 VYGLKQALRAWYERLSSFLVSNGFTRGITDPALFRK 469
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 164 bits (416), Expect = 2e-40
Identities = 77/139 (55%), Positives = 103/139 (73%)
Frame = +3
Query: 1210 ETRKSTSGYAFHLGTGAISWSSKKQHVVALSTAEAEYITATSCATQTVWLRRILEVMHHE 1269
+ RKST+G+ F +G A +W SKKQ +V LST EAEY+ ATSC +WLR +L+ +
Sbjct: 9 DDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKMP 188
Query: 1270 QNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIRFHKIRELIAEKEVVIEYCPTKEQIADI 1329
Q P +I DNKSA+AL+KNPVFH +SKHID R+H IRE I +KEV ++Y +++Q ADI
Sbjct: 189 QEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAADI 368
Query: 1330 FTKPLKIESFYKLKKMLGM 1348
FTKPLK+E+F KL+ MLG+
Sbjct: 369 FTKPLKLETFVKLRSMLGV 425
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 159 bits (402), Expect = 8e-39
Identities = 69/126 (54%), Positives = 99/126 (77%)
Frame = -2
Query: 878 LPPDKKPIGVKWVYKTKYKPSGEIDRYKARLVAKGYKQKPGIDYFEVFAPVARLDTIRML 937
LPP K P+G +WVY K P+GE+DR KARLVAKGY Q GIDY + F+PVA+L T+R+
Sbjct: 400 LPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVRLF 221
Query: 938 ISLSAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAGYVVRGKEDKVYRLKKALYGLKQAP 997
++++A +W +HQ+D+K+AFL+G LEE++Y+EQP G+V +G+ V +L ++LYGLKQ+P
Sbjct: 220 LAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQSP 41
Query: 998 RAWYKK 1003
RAW+ K
Sbjct: 40 RAWFGK 23
>TC232995
Length = 1009
Score = 150 bits (379), Expect = 4e-36
Identities = 72/173 (41%), Positives = 113/173 (64%)
Frame = +2
Query: 968 VEQPAGYVVRGKEDKVYRLKKALYGLKQAPRAWYKKIDSYFIQNGFQRCPFEHTLYIKFI 1027
VEQP G+ + K + VY+L+KALYGLKQAPRAWY+++ ++ ++ F R + TL+IK
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKR- 178
Query: 1028 DPGDVLIVCLYVDDLIFTGNNSKMIAEFRGAMISYFEMTDLGLMSYFLGIEVIQQKDGIF 1087
D+L+V +YVDD+IF N + EF M S FEM+ +G + YFLG+++ Q + GIF
Sbjct: 179 KHNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIF 358
Query: 1088 ISQKKYASDILKKFKMEHSKPISTPVEEKLKLTRESDGKRVDSTHYKSLIGSL 1140
I+Q KY +++K+F M+ +K +STP+ L ++ G+ +D Y+ IG +
Sbjct: 359 INQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEV 517
>BU549979
Length = 615
Score = 148 bits (373), Expect = 2e-35
Identities = 72/186 (38%), Positives = 116/186 (61%), Gaps = 2/186 (1%)
Frame = -1
Query: 1156 LLSRYMEDPCVSHLQGAKRILRYIKGTLTEGIFYGNNSDVKLVGYTDSDWAGDTETRKST 1215
+L RY +P + H + AK+++RY++GT + Y + ++++GY+DSD+AG ++R+ST
Sbjct: 615 VLGRYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRST 436
Query: 1216 SGYAFHLGTGAISWSSKKQHVVALSTAEAEYITATSCATQTVWLRRILEVMH--HEQNTP 1273
SGY F L G +SW S KQ ++A ST E E++ + VWL+ + + + P
Sbjct: 435 SGYIFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRP 256
Query: 1274 TKIYCDNKSAIALSKNPVFHGRSKHIDIRFHKIRELIAEKEVVIEYCPTKEQIADIFTKP 1333
K+YCDN +A+ ++KN RSKHIDI++ IRE + EK+VVIE+ T+ I D TK
Sbjct: 255 LKLYCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKG 76
Query: 1334 LKIESF 1339
+ ++F
Sbjct: 75 MTPKNF 58
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 144 bits (363), Expect = 3e-34
Identities = 64/128 (50%), Positives = 91/128 (71%)
Frame = +3
Query: 844 PVTFEEASRDENWIKAMDEEINAIEKNKTWELTELPPDKKPIGVKWVYKTKYKPSGEIDR 903
P T EA W +AM +E+ A+E N TWEL LPP K +G +WVY K P+G++DR
Sbjct: 21 PSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGPNGKVDR 200
Query: 904 YKARLVAKGYKQKPGIDYFEVFAPVARLDTIRMLISLSAQNNWKIHQMDVKSAFLNGTLE 963
KARLVAKGY Q GI+Y + F+PV L T+R+ ++++A +W +HQ+D+K+AFL+G LE
Sbjct: 201 LKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAFLHGDLE 380
Query: 964 EEVYVEQP 971
E++Y+EQP
Sbjct: 381 EDIYMEQP 404
>CO981347
Length = 624
Score = 96.3 bits (238), Expect(3) = 3e-34
Identities = 48/112 (42%), Positives = 69/112 (60%), Gaps = 2/112 (1%)
Frame = +2
Query: 588 FKTFKAFVEKQSGCPIKALRTDRGQEYLVG--TDFFEQHGIQHQLTTRYTPQQNGVAERK 645
F+ + Q G +K LRTD G E+++ +F + GI+ +TP QNG+AER
Sbjct: 116 FRE*HTLIGNQLGTKLKVLRTDNGLEFVLEQFNEFCRKIGIKRHKIVPHTP*QNGLAERM 295
Query: 646 NRTIMDMVRCMLKAKQMPKEFWAEAVATAVYILNRCPTKSVQEKTPEEAGSG 697
N TI++ VRCML + ++PK FW EA T Y++NRCP+ ++ KTP EA SG
Sbjct: 296 NMTILERVRCMLLSARLPKTFWGEAANTTSYLINRCPSSTLGFKTPMEAWSG 451
Score = 47.0 bits (110), Expect(3) = 3e-34
Identities = 19/32 (59%), Positives = 24/32 (74%)
Frame = +3
Query: 553 IPTPGGCRYFITFIDDFSRKAWVYFLKQKSEA 584
+ T GG YF+T IDDFSR+ W+Y LK KSE+
Sbjct: 12 VKTHGGSSYFLTIIDDFSRRVWLYVLKNKSES 107
Score = 42.4 bits (98), Expect(3) = 3e-34
Identities = 22/55 (40%), Positives = 32/55 (58%)
Frame = +3
Query: 695 GSGRRPSIRHLRVFGCIAYAHVPDQIRKKLDDKGERCIFIGYCSNSKAYKLYNPE 749
G + P+ L+VFG +A+ HV + KLD + +C+FIGY K YKL+ E
Sbjct: 444 GVVKPPNYSGLKVFGSLAFDHVK---QGKLDARAVKCVFIGYPKGVKRYKLWKLE 599
>BG508993
Length = 374
Score = 143 bits (361), Expect = 4e-34
Identities = 68/123 (55%), Positives = 87/123 (70%)
Frame = +1
Query: 1180 KGTLTEGIFYGNNSDVKLVGYTDSDWAGDTETRKSTSGYAFHLGTGAISWSSKKQHVVAL 1239
KGT+ G+FY +++ KLVG+ DSD+AGD + RKST+G+ F +G +WSSKKQ +V L
Sbjct: 4 KGTIDFGLFYSPSNNYKLVGFCDSDFAGDVDDRKSTTGFVFFMGDCVFTWSSKKQGIVTL 183
Query: 1240 STAEAEYITATSCATQTVWLRRILEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHI 1299
T EAEY+ ATSC +WLRR+LE + Q TKIY DN+SA L+KN VFH RSKHI
Sbjct: 184 FTCEAEYVAATSCTCHAIWLRRLLEELQLLQKESTKIYVDNRSAQELAKNSVFHERSKHI 363
Query: 1300 DIR 1302
D R
Sbjct: 364 DTR 372
>AI959950
Length = 466
Score = 137 bits (345), Expect = 3e-32
Identities = 67/132 (50%), Positives = 93/132 (69%)
Frame = -1
Query: 857 IKAMDEEINAIEKNKTWELTELPPDKKPIGVKWVYKTKYKPSGEIDRYKARLVAKGYKQK 916
+KAM EE++ +KN +L +LP KK +GVKW++ K G++ RYKARLVAKGY Q+
Sbjct: 397 MKAMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQ 218
Query: 917 PGIDYFEVFAPVARLDTIRMLISLSAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAGYVV 976
GIDY + FA VARL+ I +L+S + +N K++QMDVKSAFLNG +++EVYVEQP G+
Sbjct: 217 EGIDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFEN 38
Query: 977 RGKEDKVYRLKK 988
V++L K
Sbjct: 37 ETLHQHVFKLNK 2
>TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (Fragment),
partial (16%)
Length = 562
Score = 134 bits (338), Expect = 2e-31
Identities = 62/122 (50%), Positives = 91/122 (73%)
Frame = +1
Query: 1033 LIVCLYVDDLIFTGNNSKMIAEFRGAMISYFEMTDLGLMSYFLGIEVIQQKDGIFISQKK 1092
LIV LYVDDL+ T ++++++ EF+ M+ FEMT+LGLM+YFLGIE+ Q ++ + I Q+K
Sbjct: 193 LIVSLYVDDLLVTRDDARLVEEFKQEMMQAFEMTNLGLMTYFLGIEIKQSQNKVLICQRK 372
Query: 1093 YASDILKKFKMEHSKPISTPVEEKLKLTRESDGKRVDSTHYKSLIGSLRYLTATRPDIVY 1152
YA +ILKKF+ME K +STP+ +K K + ++D +Y+SLIG L YLTATRPDI++
Sbjct: 373 YAKEILKKFQMEECKSVSTPMNQKEKFNKVDGADKIDEGYYRSLIGCLMYLTATRPDILF 552
Query: 1153 GV 1154
+
Sbjct: 553 AI 558
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 132 bits (333), Expect = 8e-31
Identities = 62/142 (43%), Positives = 92/142 (64%)
Frame = +2
Query: 1194 DVKLVGYTDSDWAGDTETRKSTSGYAFHLGTGAISWSSKKQHVVALSTAEAEYITATSCA 1253
+ +L GY D+DWAG R+STSGY +G +SW SKKQ VVA S+AEAEY +
Sbjct: 8 NTQLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVT 187
Query: 1254 TQTVWLRRILEVMHHEQNTPTKIYCDNKSAIALSKNPVFHGRSKHIDIRFHKIRELIAEK 1313
+ +W+++ L+ + + K+YCDN++A+ ++ NPVFH R+KHI+I H IRE + K
Sbjct: 188 CELMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSK 367
Query: 1314 EVVIEYCPTKEQIADIFTKPLK 1335
E+V E+ + +Q DI TK L+
Sbjct: 368 EIVTEFIGSNDQPVDILTKSLR 433
>CO983516
Length = 724
Score = 130 bits (327), Expect = 4e-30
Identities = 63/120 (52%), Positives = 89/120 (73%)
Frame = +2
Query: 925 FAPVARLDTIRMLISLSAQNNWKIHQMDVKSAFLNGTLEEEVYVEQPAGYVVRGKEDKVY 984
F PVARL++IR+L+ ++ +K++QMDVKSAFLNG L EEVYVEQP G++ D VY
Sbjct: 365 FHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPDHVY 544
Query: 985 RLKKALYGLKQAPRAWYKKIDSYFIQNGFQRCPFEHTLYIKFIDPGDVLIVCLYVDDLIF 1044
RLKKALYGLKQAPRAWY+++ Q G+++ + TL++K D +++I +YVDD++F
Sbjct: 545 RLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVK-QDAENLMIAQIYVDDIVF 721
>AW185460
Length = 411
Score = 130 bits (327), Expect = 4e-30
Identities = 62/103 (60%), Positives = 76/103 (73%)
Frame = +2
Query: 1145 ATRPDIVYGVGLLSRYMEDPCVSHLQGAKRILRYIKGTLTEGIFYGNNSDVKLVGYTDSD 1204
ATRPDI+Y LLSR+M+ P H KRILRY++GT GI+Y ++ +L+GYTDSD
Sbjct: 89 ATRPDIMYATSLLSRFMQSPSQIHFGAGKRILRYLQGTKAFGIWYTTETNSELLGYTDSD 268
Query: 1205 WAGDTETRKSTSGYAFHLGTGAISWSSKKQHVVALSTAEAEYI 1247
WAG T+ KSTSGYAF LG+G SW+SKKQ VA STAEAEY+
Sbjct: 269 WAGSTDDMKSTSGYAFSLGSGMFSWASKKQATVAQSTAEAEYV 397
Score = 37.4 bits (85), Expect = 0.045
Identities = 15/27 (55%), Positives = 19/27 (69%)
Frame = +2
Query: 999 AWYKKIDSYFIQNGFQRCPFEHTLYIK 1025
AWY +I+ YF+ GF+R E TLYIK
Sbjct: 2 AWYSRINQYFMDRGFRRSKSEPTLYIK 82
>BM307983
Length = 406
Score = 126 bits (317), Expect = 6e-29
Identities = 65/132 (49%), Positives = 86/132 (64%), Gaps = 1/132 (0%)
Frame = +2
Query: 885 IGVKWVYKTKYKPSGEIDRYKARLVAKGYKQKPGIDYFEVFAPVAR-LDTIRMLISLSAQ 943
+G +W+Y KY +DRYKARLVAKGY Q GIDY E FA + + + AQ
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 944 NNWKIHQMDVKSAFLNGTLEEEVYVEQPAGYVVRGKEDKVYRLKKALYGLKQAPRAWYKK 1003
W++HQ DVK+AFL+G+LEEEVY+E P GY +KV RLKKALYGLKQ+PRAW+ +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 1004 IDSYFIQNGFQR 1015
+ G+++
Sbjct: 362 FTQAMLSLGYKQ 397
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.318 0.135 0.404
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 60,620,902
Number of Sequences: 63676
Number of extensions: 894839
Number of successful extensions: 5349
Number of sequences better than 10.0: 241
Number of HSP's better than 10.0 without gapping: 4764
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 5070
length of query: 1351
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1242
effective length of database: 5,698,948
effective search space: 7078093416
effective search space used: 7078093416
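
The length figures above are mutually consistent, which the following sketch checks. It assumes, as the numbers suggest for a TBLASTN search, that the database length is the nucleotide letter count divided by 3, and that the effective HSP length is subtracted once from the query and once per database sequence. The variable names are illustrative only.

```python
# Values copied from this report.
query_len = 1351          # length of query
db_letters = 37_918_896   # total nucleotide letters in GMGI
n_seqs = 63_676           # number of sequences in database
hsp_len = 109             # effective HSP length

# Assumption: translated database length = nucleotide letters / 3.
db_len_aa = db_letters // 3

# Effective lengths after removing the expected HSP length.
eff_query = query_len - hsp_len
eff_db = db_len_aa - n_seqs * hsp_len

# Effective search space is the product of the two effective lengths.
search_space = eff_query * eff_db
```

With these inputs the sketch reproduces the reported database length (12,639,632), effective query length (1242), effective database length (5,698,948), and effective search space (7078093416).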
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)
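
The raw scores and bit scores in this report can be related through the Karlin-Altschul parameters listed above. The sketch below assumes the standard conversion S' = (Lambda * S - ln K) / ln 2 and the estimate E = search space * 2^(-S'); the function names are hypothetical. Because Lambda and K are printed to only three significant digits, results agree with the report only approximately.

```python
import math


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits using Lambda and K."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Approximate E-value for a bit score in a given search space."""
    return search_space * 2.0 ** (-bits)


# Parameters copied from this report.
UNGAPPED = (0.318, 0.135)    # Lambda, K
GAPPED = (0.267, 0.0410)     # Lambda, K
SEARCH_SPACE = 7078093416

s1_bits = bit_score(41, *UNGAPPED)    # report lists S1: 41 (21.7 bits)
s2_bits = bit_score(65, *GAPPED)      # report lists S2: 65 (29.6 bits)
hit_bits = bit_score(654, *GAPPED)    # TC222253: 256 bits (654)
hit_e = expect(hit_bits, SEARCH_SPACE)  # report lists Expect = 5e-68
```

The ungapped parameters reproduce S1 and the gapped parameters reproduce S2, which suggests that the two cutoffs are expressed under different scoring statistics.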
Medicago: description of AC148918.3