
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0166.7
(1602 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 994 0.0
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 988 0.0
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 231 2e-60
TC232995 226 6e-59
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 219 8e-57
TC213445 124 7e-48
BM143109 173 6e-43
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 172 1e-42
AI959950 168 2e-41
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl... 159 9e-39
CO983516 152 1e-36
AI855982 147 3e-35
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 140 4e-33
AI966222 112 5e-30
BU549979 127 4e-29
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 127 5e-29
CO982036 126 9e-29
BG508993 118 2e-26
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 117 3e-26
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 113 6e-25
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 994 bits (2571), Expect = 0.0
Identities = 500/1020 (49%), Positives = 681/1020 (66%)
Frame = +1
Query: 569 MLQISLIAPLKHQSWYLDSGCSRHMTGEKRMFRELKLKPGGEVGFGGNEKGKIIGTGTIC 628
++ SL A K + WYLDSGCSRHMTG K ++ V FG KGKIIG G +
Sbjct: 1645 VVHTSLRASAK-EDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLV 1821
Query: 629 VDSSPCIDNVLLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNI 688
D P ++ VLLV GLT NL+SISQL D+G++V F + C ++ ++ S+ K+N
Sbjct: 1822 HDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNC 2001
Query: 689 YKIRLSELEAQNVKCLLSVNEEQWVWHRRLGHASMRKISQLSKLNLVRGLPNLKFASDAL 748
Y E + CL S +E +WH+R GH +R + ++ VRG+PNLK +
Sbjct: 2002 YLWTPQETSYSST-CLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRI 2178
Query: 749 CETCQKGKFTKVPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTW 808
C CQ GK K+ + +TSR LELLH+DL GP++ ES+GGKRY V+VDD+SR+TW
Sbjct: 2179 CGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTW 2358
Query: 809 VKFLTRKDESHAVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFESLFDSYGIAHDFSC 868
V F+ K E+ VF ++Q EK C I R+RSDHG EFEN +F S GI H+FS
Sbjct: 2359 VNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSA 2538
Query: 869 PRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIQNRISVRPILNKTPY 928
TPQQNG+VERKNRTLQE AR ML + + WAEA+NTACYI NR+++R T Y
Sbjct: 2539 AITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLY 2718
Query: 929 ELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDPKSSKCLLLGYSDRSKGFRFYNTDAKT 988
E+WK KP++ +FH FG CY+L +++ K DPKS + LGYS S+ +R +N+ +T
Sbjct: 2719 EIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRT 2898
Query: 989 IEESIHVRFDDKLDSDQSKLVEKFADLSINVSDKGKAPEEVEPEEDEPEEEAGPSNSQTL 1048
+ ESI+V DD + + + E NV+D K+ E E D +E+ +
Sbjct: 2899 VMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAE-NSDSATDESNINQPDKR 3075
Query: 1049 KKSRITAAHPKELILGNKDEPVRTRSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWI 1108
+RI HPKELI+G+ + V TRS E ++S VS IEPK++ EAL D+ WI
Sbjct: 3076 SSTRIQKMHPKELIIGDPNRGVTTRSR----EVEIVSNSCFVSKIEPKNVKEALTDEFWI 3243
Query: 1109 LAMKEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQE 1168
AM+EEL QF +N+VW LV +PE +VIGTKW+F+NK NE+G + RNKARLVAQGY+Q E
Sbjct: 3244 NAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIE 3423
Query: 1169 GIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSAFLNGYISEEVYVHQPPGFEDE 1228
G+D+ ETFAPVARLE+IRLL+ + L+QMDVKSAFLNGY++EEVYV QP GF D
Sbjct: 3424 GVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADP 3603
Query: 1229 KKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIY 1288
PDHV++LKK+LYGLKQAPRAWYERL+ FL + + +G +D TLF K ++++I QIY
Sbjct: 3604 THPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIY 3783
Query: 1289 VDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKELL 1348
VDDI+FG + + + F + MQ+EFEMS++GEL YFLG+QV Q + ++ QS+Y K ++
Sbjct: 3784 VDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIV 3963
Query: 1349 KKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHLCA 1408
KKF M ++ +TP L K++ V Q LYR MIGSLLYLTASRPDI ++V +CA
Sbjct: 3964 KKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCA 4143
Query: 1409 RFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGN 1468
R+Q++P+ +HLT VKRIL+Y+ GT++ G+MY S L GYCDAD+AG +RKSTSG
Sbjct: 4144 RYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGG 4323
Query: 1469 CQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQIFESNIPIYCD 1528
C +LG+NL+SW SK+Q+ ++LSTAEAEYI+A +Q++WMK L++Y + + + +YCD
Sbjct: 4324 CFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTLYCD 4503
Query: 1529 NTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAEDRF 1588
N +AI++SKNP+ HSR KHI++++H+IRD V V+ LK VDT+ Q ADIFTK L ++F
Sbjct: 4504 NMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQF 4683
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 988 bits (2554), Expect = 0.0
Identities = 499/1022 (48%), Positives = 681/1022 (65%), Gaps = 2/1022 (0%)
Frame = +1
Query: 569 MLQISLIAPLKHQSWYLDSGCSRHMTGEKRMFRELKLKPGGEVGFGGNEKGKIIGTGTIC 628
++ SL A K + WYLDSGCSRHMTG K ++ V FG KGKI G G +
Sbjct: 1648 VVHTSLRASAK-EDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKLV 1824
Query: 629 VDSSPCIDNVLLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNI 688
D P ++ VLLV GLT NL+SISQL D+G++V F + C ++ ++ S+ K+N
Sbjct: 1825 HDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNC 2004
Query: 689 YKIRLSELEAQNVKCLLSVNEEQWVWHRRLGHASMRKISQLSKLNLVRGLPNLKFASDAL 748
Y E + CL S +E +WH+R GH +R + ++ VRG+PNLK +
Sbjct: 2005 YLWTPQETSYSST-CLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRI 2181
Query: 749 CETCQKGKFTKVPFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTW 808
C CQ GK K+ + +TSR LELLH+DL GP++ ES+GGKRY V+VDD+SR+TW
Sbjct: 2182 CGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTW 2361
Query: 809 VKFLTRKDESHAVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFESLFDSYGIAHDFSC 868
V F+ K ++ VF ++Q EK C I R+RSDHG EFEN KF S GI H+FS
Sbjct: 2362 VNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSA 2541
Query: 869 PRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIQNRISVRPILNKTPY 928
TPQQNG+VERKNRTLQE AR ML + + WAEA+NTACYI NR+++R T Y
Sbjct: 2542 AITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLY 2721
Query: 929 ELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDPKSSKCLLLGYSDRSKGFRFYNTDAKT 988
E+WK KP + +FH FG CY+L +++ K DPKS + LGYS S+ +R +N+ +T
Sbjct: 2722 EIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRT 2901
Query: 989 IEESIHVRFDDKLDSDQSKLVEKFADLSINVSDKGKAPEEVEPEEDEPEEEAGPSNSQTL 1048
+ ESI+V DD + + + E NV+D K+ E E + +E P+ +Q
Sbjct: 2902 VMESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDE---PNINQPD 3072
Query: 1049 KKS--RITAAHPKELILGNKDEPVRTRSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKD 1106
K+ RI HPKELI+G+ + V TRS E ++S VS IEPK++ EAL D+
Sbjct: 3073 KRPSIRIQKMHPKELIIGDPNRGVTTRSR----EIEIVSNSCFVSKIEPKNVKEALTDEF 3240
Query: 1107 WILAMKEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQ 1166
WI AM+EEL QF +N+VW LV +PE +VIGTKW+F+NK NE+G + RNKARLVAQGY+Q
Sbjct: 3241 WINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQ 3420
Query: 1167 QEGIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSAFLNGYISEEVYVHQPPGFE 1226
EG+D+ ETFAPVARLE+IRLL+ + L+QMDVKSAFLNGY++EE YV QP GF
Sbjct: 3421 IEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFV 3600
Query: 1227 DEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQ 1286
D PDHV++LKK+LYGLKQAPRAWYERL+ FL + + +G +D TLF K ++++I Q
Sbjct: 3601 DPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQ 3780
Query: 1287 IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQSKYTKE 1346
IYVDDI+FG + + + F + MQ+EFEMS++GEL YFLG+QV Q + ++ QSKY K
Sbjct: 3781 IYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKN 3960
Query: 1347 LLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHL 1406
++KKF M ++ +TP L K++ V Q LYR MIGSLLYLTASRPDI ++V +
Sbjct: 3961 IVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGV 4140
Query: 1407 CARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTS 1466
CAR+Q++P+ +HL VKRIL+Y+ GT++ G+MY S+ L GYCDAD+AG +RKSTS
Sbjct: 4141 CARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTS 4320
Query: 1467 GNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQIFESNIPIY 1526
G C +LG+NL+SW SK+Q+ ++LSTAEAEYI+A +Q++WMK L++Y + + + +Y
Sbjct: 4321 GGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTLY 4500
Query: 1527 CDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAED 1586
CDN +AI++SKNP+ HSR KHI++++H+IRD V V+ L+ VDT+ Q ADIFTK L +
Sbjct: 4501 CDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDAN 4680
Query: 1587 RF 1588
+F
Sbjct: 4681 QF 4686
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 231 bits (589), Expect = 2e-60
Identities = 110/153 (71%), Positives = 132/153 (85%)
Frame = -3
Query: 1109 LAMKEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQE 1168
+AM+EELNQF +N+VW LV+KPEN VIGTKWVFRNKL+E G ++RNKARLVA+GY+Q+E
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 1169 GIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSAFLNGYISEEVYVHQPPGFEDE 1228
GIDY ET+APVARLE IR+L+++ N L+QMDVKSAFLNG I EEVYV QPPGFE
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 1229 KKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLE 1261
KP HV+KL+K+LYGLKQAPRAWYER+S+FLLE
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>TC232995
Length = 1009
Score = 226 bits (576), Expect = 6e-59
Identities = 112/173 (64%), Positives = 133/173 (76%)
Frame = +2
Query: 1219 VHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTY 1278
V QPPGFE KP+HV+KL+K+LYGLKQAPRAWYERLS+FLLE EF RGKVDTTLF K
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 1279 KDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYI 1338
+DIL+VQIYVDDIIFGS N SLCKEFS MQ+EFEMSMMGELKYFLG+Q+ QT G +I
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 1339 HQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLL 1391
+QSKY KEL+K+F M + TPM C L+K++ + K YR IG ++
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 219 bits (558), Expect = 8e-57
Identities = 122/247 (49%), Positives = 159/247 (63%), Gaps = 2/247 (0%)
Frame = +3
Query: 1224 GFEDEKKPDHVFKLKKSL--YGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDD 1281
GFED+++P HVF + L G+K ++ S ++ K+
Sbjct: 330 GFEDKERPCHVFMV*NKL*ELGMKG*VHF*FQMDSPEE*RTPHYSERLK--------KET 485
Query: 1282 ILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQS 1341
LI+ IYVDDIIFG+ ++ +CKEF E+M+ FE SM GELK+ LG+Q+ Q G +IHQ
Sbjct: 486 FLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQE 665
Query: 1342 KYTKELLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTASRPDIL 1401
KYTK LK+F M E+ TPMH + I++K++K K Y GMI SL YLT+SRPDI+
Sbjct: 666 KYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIV 845
Query: 1402 FSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTE 1461
F V LCARFQS P+ +H+TAVKRILRYL GTTN L +KK SE+ L GYCD +AGD+ E
Sbjct: 846 FVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVE 1025
Query: 1462 RKSTSGN 1468
RKSTS N
Sbjct: 1026 RKSTSRN 1046
>TC213445
Length = 705
Score = 124 bits (310), Expect(2) = 7e-48
Identities = 58/98 (59%), Positives = 77/98 (78%)
Frame = +1
Query: 1459 RTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQI 1518
+T+R+STS C F+GS LVSW SK+Q+++ LSTAEAEYISA Q+ WM+ QL DY +
Sbjct: 400 KTDRESTSDTCHFIGSALVSWHSKKQNSVVLSTAEAEYISARSYYAQIFWMRQQLFDYGL 579
Query: 1519 FESNIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIR 1556
+IPI CDNT+AI+LSKN IL+SR KHIE+++HF+R
Sbjct: 580 KLDHIPIRCDNTSAINLSKNHILYSRTKHIEIRHHFLR 693
Score = 87.4 bits (215), Expect(2) = 7e-48
Identities = 40/68 (58%), Positives = 51/68 (74%)
Frame = +2
Query: 1386 MIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEY 1445
MI S LYL+ SRP I+FSV +C R+Q++P+E+HL+ +KRI+RYL G NLGL Y K S Y
Sbjct: 197 MIESFLYLSTSRPHIMFSVCMCVRYQANPKESHLSVIKRIMRYLLGIINLGLWYPKNSSY 376
Query: 1446 KLSGYCDA 1453
L GY DA
Sbjct: 377 NLVGYSDA 400
>BM143109
Length = 415
Score = 173 bits (438), Expect = 6e-43
Identities = 86/133 (64%), Positives = 103/133 (76%)
Frame = +1
Query: 1221 QPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKD 1280
QPP ++ +KP+HVFKLKK LYGLKQA RAWYE LS FLL+ F +GKVDT LF +
Sbjct: 4 QPPVRKNSEKPNHVFKLKKVLYGLKQALRAWYELLSKFLLDKGFSKGKVDTNLFI*KKLN 183
Query: 1281 DILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEGTYIHQ 1340
DIL+VQIYVDDIIFGS N SLCK+FS+ MQ EFEMSMM EL +FLG+Q+ QT G +I Q
Sbjct: 184 DILLVQIYVDDIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNGIFISQ 363
Query: 1341 SKYTKELLKKFNM 1353
SKY K+L+ +F M
Sbjct: 364 SKYCKDLIHRFGM 402
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 172 bits (435), Expect = 1e-42
Identities = 90/217 (41%), Positives = 138/217 (63%), Gaps = 4/217 (1%)
Frame = +1
Query: 1383 YRGMIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYK-- 1440
+R +IGSL YL SRP+I F+V L +RF PR +H+ A KR+LR +KGT G+++
Sbjct: 22 FRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLFPFK 201
Query: 1441 -KTSEYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISA 1499
K+ + L GY D+D+ D + KST G V+ +SK+Q IALST EAEY++A
Sbjct: 202 AKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAEYVAA 381
Query: 1500 AICSTQMLWMKHQLEDYQIFESN-IPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDY 1558
++ + Q +WM + LE+ ++ E + + DN +AI+L+K+P LH R+KHIE+++H+IRD
Sbjct: 382 SLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHYIRDQ 561
Query: 1559 VQKGVLLLKFVDTDHQWADIFTKPLAEDRFNFILKNL 1595
V KG + +++ + Q AD+ TKP+ RF I L
Sbjct: 562 VSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>AI959950
Length = 466
Score = 168 bits (426), Expect = 2e-41
Identities = 86/130 (66%), Positives = 102/130 (78%)
Frame = -1
Query: 1110 AMKEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEG 1169
AM+EEL+QF KN+V LVK P+ V+G KW+F NKL+E G VVR KARLVA+GYSQQEG
Sbjct: 391 AMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQEG 212
Query: 1170 IDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSAFLNGYISEEVYVHQPPGFEDEK 1229
IDY +TFA VARLE I +L+SF+ N+ L+QMDVKSAFLNG I +EVYV QPPGFE+E
Sbjct: 211 IDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFENET 32
Query: 1230 KPDHVFKLKK 1239
HVFKL K
Sbjct: 31 LHQHVFKLNK 2
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 159 bits (402), Expect = 9e-39
Identities = 74/111 (66%), Positives = 95/111 (84%)
Frame = +3
Query: 1094 EPKSIDEALQDKDWILAMKEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVV 1153
EPK+I EA+ D +WI+ M+EELNQF +N+VW LV+KPEN VIGTKWVFRNKL+E G ++
Sbjct: 3 EPKNIKEAIVDDNWIIVMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIII 182
Query: 1154 RNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDV 1204
RNKARLVA+GY+Q+EGIDY ET+APVARLEAIR+L++++ N L+QMDV
Sbjct: 183 RNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMNFKLYQMDV 335
>CO983516
Length = 724
Score = 152 bits (384), Expect = 1e-36
Identities = 73/120 (60%), Positives = 92/120 (75%)
Frame = +2
Query: 1176 FAPVARLEAIRLLISFSVNHNIILHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVF 1235
F PVARLE+IRLL+ + L+QMDVKSAFLNGY++EEVYV QP GF D PDHV+
Sbjct: 365 FHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPDHVY 544
Query: 1236 KLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFG 1295
+LKK+LYGLKQAPRAWYERL+ L + + +G +D TLF K ++++I QIYVDDI+FG
Sbjct: 545 RLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFG 724
>AI855982
Length = 484
Score = 147 bits (372), Expect = 3e-35
Identities = 78/165 (47%), Positives = 111/165 (67%)
Frame = +2
Query: 1058 PKELILGNKDEPVRTRSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMKEELNQ 1117
P + I+G+ + V TR + + L + VS+IEPK+I EA+ D +WI+AM+EELNQ
Sbjct: 2 PLDNIIGDISKGVTTRHSLKD----LCNNMAFVSMIEPKNIKEAIVDDNWIIAMQEELNQ 169
Query: 1118 FSKNDVWSLVKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFA 1177
F +N+VW LV+KP+N VI TKWVFRNKL+E ++ +KARLVA+GY+Q +G+DY T+A
Sbjct: 170 FERNNVWKLVEKPDNYPVI*TKWVFRNKLDEHRIIIIHKARLVAEGYNQVDGLDYEHTYA 349
Query: 1178 PVARLEAIRLLISFSVNHNIILHQMDVKSAFLNGYISEEVYVHQP 1222
+ARL I + +S+ N L+ SA L+G + EVYV QP
Sbjct: 350 SIARL*VIIMPLSYVYIMNSTLYHYACVSALLHGLLLHEVYVDQP 484
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 140 bits (353), Expect = 4e-33
Identities = 65/151 (43%), Positives = 100/151 (66%), Gaps = 1/151 (0%)
Frame = +2
Query: 1446 KLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQ 1505
+LSGYCDAD+AG +R+STSG C F+G NLVSW SK+Q+ +A S+AEAEY S A+ + +
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1506 MLWMKHQLEDYQIFES-NIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVL 1564
++W+K L++ + E + +YCDN AA+ ++ NP+ H R KHIE+ HFIR+ + +
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 1565 LLKFVDTDHQWADIFTKPLAEDRFNFILKNL 1595
+ +F+ ++ Q DI TK L + + L
Sbjct: 374 VTEFIGSNDQPVDILTKSLRGPKIQIVCSKL 466
>AI966222
Length = 430
Score = 112 bits (280), Expect(2) = 5e-30
Identities = 50/87 (57%), Positives = 62/87 (70%)
Frame = +1
Query: 887 EMARTMLQETGMAKHFWAEAVNTACYIQNRISVRPILNKTPYELWKNIKPNISYFHPFGC 946
EMART L + KHF AE +N CY+QN+I +RPIL +TPYELWK KPNISYF+PF C
Sbjct: 1 EMARTTLNDNLTPKHF*AEVMNIVCYLQNKIYIRPILKRTPYELWKGRKPNISYFYPFRC 180
Query: 947 VCYVLNTKDRLHKFDPKSSKCLLLGYS 973
C+++NTKD L K D KS + + YS
Sbjct: 181 KCFIINTKDNLGKIDSKSDCGIFIAYS 261
Score = 38.9 bits (89), Expect(2) = 5e-30
Identities = 20/44 (45%), Positives = 29/44 (65%), Gaps = 1/44 (2%)
Frame = +2
Query: 976 SKGFRFYNTDAKTIEESIHVRF-DDKLDSDQSKLVEKFADLSIN 1018
SK FR YN+ IEE+IH+RF +K + + +L E FADL ++
Sbjct: 269 SKAFRVYNSGTLVIEEAIHIRFGKNKPNKELLELDESFADLRLD 400
>BU549979
Length = 615
Score = 127 bits (319), Expect = 4e-29
Identities = 65/183 (35%), Positives = 111/183 (60%), Gaps = 3/183 (1%)
Frame = -1
Query: 1409 RFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGN 1468
R+QS+P H K+++RYL+GT + LMYK+T+ ++ GY D+D+AG R+STSG
Sbjct: 606 RYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGY 427
Query: 1469 CQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQIFES---NIPI 1525
L +VSW S +Q+ IA ST E E++ ++ +W+K + ++ +S + +
Sbjct: 426 IFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKL 247
Query: 1526 YCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAE 1585
YCDN AA+ ++KN +R+KHI++KY IR+ V++ ++++ V+T+ D TK +
Sbjct: 246 YCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTP 67
Query: 1586 DRF 1588
F
Sbjct: 66 KNF 58
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 127 bits (318), Expect = 5e-29
Identities = 61/130 (46%), Positives = 90/130 (68%)
Frame = -2
Query: 1127 VKKPENVHVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIR 1186
V P +G +WV+ K+ G+V R KARLVA+GY+Q GIDY +TF+PVA+L +R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 1187 LLISFSVNHNIILHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQ 1246
L ++ + + LHQ+D+K+AFL+G + E++Y+ QPPGF + + V KL +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1247 APRAWYERLS 1256
+PRAW+ + S
Sbjct: 46 SPRAWFGKFS 17
>CO982036
Length = 674
Score = 126 bits (316), Expect = 9e-29
Identities = 76/212 (35%), Positives = 119/212 (55%), Gaps = 5/212 (2%)
Frame = -2
Query: 1278 YKDDILIVQ--IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEG 1335
YK IL V +YVD II GS+ +L + + + + F + ++G+L YF+ I+V P+
Sbjct: 673 YKTHILTVYLLVYVDIIITGSSC-TLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDL 497
Query: 1336 TYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEDKSGKVCQKLYRGMIGSLLYLTA 1395
+ ++ + +K ++ +PM TC L K D YR ++G+L Y T
Sbjct: 496 LFSLRTSIFEIFCRKPR*QAQPIS-SPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTV 320
Query: 1396 SRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYK---KTSEYKLSGYCD 1452
RP+I F+V+ +F S+P ++H T VKRILRYLKG+ + GL K + + G+CD
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 1453 ADYAGDRTERKSTSGNCQFLGSNLVSWASKRQ 1484
AD+A +++STSG FLG NL+SW +Q
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>BG508993
Length = 374
Score = 118 bits (296), Expect = 2e-26
Identities = 52/123 (42%), Positives = 82/123 (66%), Gaps = 1/123 (0%)
Frame = +1
Query: 1430 KGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIAL 1489
KGT + GL Y ++ YKL G+CD+D+AGD +RKST+G F+G + +W+SK+Q + L
Sbjct: 4 KGTIDFGLFYSPSNNYKLVGFCDSDFAGDVDDRKSTTGFVFFMGDCVFTWSSKKQGIVTL 183
Query: 1490 STAEAEYISAAICSTQMLWMKHQLEDYQIFE-SNIPIYCDNTAAISLSKNPILHSRAKHI 1548
T EAEY++A C+ +W++ LE+ Q+ + + IY DN +A L+KN + H R+KHI
Sbjct: 184 FTCEAEYVAATSCTCHAIWLRRLLEELQLLQKESTKIYVDNRSAQELAKNSVFHERSKHI 363
Query: 1549 EVK 1551
+ +
Sbjct: 364 DTR 372
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 117 bits (294), Expect = 3e-26
Identities = 55/131 (41%), Positives = 88/131 (66%), Gaps = 1/131 (0%)
Frame = +3
Query: 1459 RTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQI 1518
R +RKST+G F+G +W SK+Q + LST EAEY++A C +W+++ L++ ++
Sbjct: 6 RDDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKM 185
Query: 1519 -FESNIPIYCDNTAAISLSKNPILHSRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWAD 1577
E + I DN +A++L+KNP+ H ++KHI+ +YHFIR+ ++K + LK+V + Q AD
Sbjct: 186 PQEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAAD 365
Query: 1578 IFTKPLAEDRF 1588
IFTKPL + F
Sbjct: 366 IFTKPLKLETF 398
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 113 bits (283), Expect = 6e-25
Identities = 56/134 (41%), Positives = 81/134 (59%)
Frame = +3
Query: 1089 LVSLIEPKSIDEALQDKDWILAMKEELNQFSKNDVWSLVKKPENVHVIGTKWVFRNKLNE 1148
L SL P +I EAL W AM +E+ N W LV P +G +WV+ K+
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 1149 KGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSAF 1208
G V R KARLVA+GY+Q GI+Y +TF+PV L +RL ++ + + LHQ+D+K+AF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 1209 LNGYISEEVYVHQP 1222
L+G + E++Y+ QP
Sbjct: 363 LHGDLEEDIYMEQP 404
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.332 0.142 0.437
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 68,898,298
Number of Sequences: 63676
Number of extensions: 968294
Number of successful extensions: 6202
Number of sequences better than 10.0: 145
Number of HSP's better than 10.0 without gapping: 6021
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 6142
length of query: 1602
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1492
effective length of database: 5,635,272
effective search space: 8407825824
effective search space used: 8407825824
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 66 (30.0 bits)
Lotus: description of TM0166.7