
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0128.9
(1596 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 991 0.0
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 985 0.0
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 229 6e-60
TC232995 224 3e-58
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 217 4e-56
TC213445 126 1e-48
BM143109 172 1e-42
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 171 3e-42
AI959950 168 2e-41
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl... 157 3e-38
CO983516 152 9e-37
AI855982 145 1e-34
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 140 4e-33
AI966222 113 2e-30
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 127 3e-29
BU549979 126 9e-29
CO982036 122 1e-27
BG508993 120 6e-27
TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%) 116 9e-26
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 114 3e-25
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 991 bits (2561), Expect = 0.0
Identities = 497/1020 (48%), Positives = 680/1020 (65%)
Frame = +1
Query: 563 MLQISLIAPLKHQSWYLDSGCSRHMTGEKRMFRELKLKPGGEVGFGGNEKGKIVGTGTIC 622
++ SL A K + WYLDSGCSRHMTG K ++ V FG KGKI+G G +
Sbjct: 1645 VVHTSLRASAK-EDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLV 1821
Query: 623 VDSSPCIDNVLLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNI 682
D P ++ VLLV GLT NL+SISQL D+G++V F + C ++ ++ S+ K+N
Sbjct: 1822 HDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNC 2001
Query: 683 YKIRLSELEAQNVKCLLSVDEEQWVWHRRLGHASMRKISQLSKLNLVRGLPNLKFASDAL 742
Y E + CL S ++E +WH+R GH +R + ++ VRG+PNLK +
Sbjct: 2002 YLWTPQETSYSST-CLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRI 2178
Query: 743 CEACQKGKFTKVPFKAKNVVSTSRPLELLHIDLFGPVKIESIGGKRYGMVIVDDYSRWTW 802
C CQ GK K+ + +TSR LELLH+DL GP+++ES+GGKRY V+VDD+SR+TW
Sbjct: 2179 CGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTW 2358
Query: 803 VKFLTRKDESHVVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFESLFDSYGIAHDFSC 862
V F+ K E+ VF ++Q EK C I R+RSDHG EFEN +F S GI H+FS
Sbjct: 2359 VNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSA 2538
Query: 863 PRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIQNRISVRPILNKTPY 922
TPQQNG+VERKNRTLQE AR ML + + WAEA+NTACYI NR+++R T Y
Sbjct: 2539 AITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLY 2718
Query: 923 ELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKT 982
E+WK KP++ +FH FG CY+L +++ K D KS + LGYS S+ +R +N+ +T
Sbjct: 2719 EIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRT 2898
Query: 983 IEESIHVRFDDKLDSDQSKLVEKFADLSINVSDKGKAPEEVEPEEDEPEEEAGPSNSQTL 1042
+ ESI+V DD + + + E NV+D K+ E E D +E+ +
Sbjct: 2899 VMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAE-NSDSATDESNINQPDKR 3075
Query: 1043 KKSRITAAHPKELILGNKDEPVRTKSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWI 1102
+RI HPKELI+G+ + V T+S E ++S VS IEPK++ EAL D+ WI
Sbjct: 3076 SSTRIQKMHPKELIIGDPNRGVTTRSR----EVEIVSNSCFVSKIEPKNVKEALTDEFWI 3243
Query: 1103 LAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQE 1162
AM+EEL QF +N+VW LV +PE VIGTKW+F+NK NE+G + RNKARLVAQGY+Q E
Sbjct: 3244 NAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIE 3423
Query: 1163 GIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDE 1222
G+D+ ETFAPVARLE+IRLL+ + L+QMDVKSAFLNGY++EEVYV QP GF D
Sbjct: 3424 GVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADP 3603
Query: 1223 KKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIY 1282
PDHV++LKK+LYGLKQAPRAWYERL+ FL + + +G +D TLF K ++++I QIY
Sbjct: 3604 THPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIY 3783
Query: 1283 VDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEGTYIHQSKYTKELL 1342
VDDI+FG + + + F + MQ+EFEMS++GEL YF+G+QV Q + ++ QS+Y K ++
Sbjct: 3784 VDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIV 3963
Query: 1343 KKFNMLESTVAKTPMHPTCILEKEYKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHLCA 1402
KKF M ++ +TP L K+ V Q LYR MIGSLLYLTASRPDI ++V +CA
Sbjct: 3964 KKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCA 4143
Query: 1403 RFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGN 1462
R+Q++P+ +HLT VKRIL+Y+ GT++ G+MY S L GYCDAD+AG +RKSTSG
Sbjct: 4144 RYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGG 4323
Query: 1463 CQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILESNIPIYCD 1522
C +LG+NL+SW SK+Q+ ++LSTAEAEYI+A +Q++WMK L++Y + + + +YCD
Sbjct: 4324 CFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTLYCD 4503
Query: 1523 NTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAEDRF 1582
N +AI++SKNP+ HSR KHI+++HH+IRD V V+ LK VDT+ Q ADIFTK L ++F
Sbjct: 4504 NMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQF 4683
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 985 bits (2546), Expect = 0.0
Identities = 497/1022 (48%), Positives = 680/1022 (65%), Gaps = 2/1022 (0%)
Frame = +1
Query: 563 MLQISLIAPLKHQSWYLDSGCSRHMTGEKRMFRELKLKPGGEVGFGGNEKGKIVGTGTIC 622
++ SL A K + WYLDSGCSRHMTG K ++ V FG KGKI G G +
Sbjct: 1648 VVHTSLRASAK-EDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGKITGMGKLV 1824
Query: 623 VDSSPCIDNVLLVDGLTHNLLSISQLADKGYDVIFNQKSCRAVSQIDGSVLFNSKRKNNI 682
D P ++ VLLV GLT NL+SISQL D+G++V F + C ++ ++ S+ K+N
Sbjct: 1825 HDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNC 2004
Query: 683 YKIRLSELEAQNVKCLLSVDEEQWVWHRRLGHASMRKISQLSKLNLVRGLPNLKFASDAL 742
Y E + CL S ++E +WH+R GH +R + ++ VRG+PNLK +
Sbjct: 2005 YLWTPQETSYSST-CLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRI 2181
Query: 743 CEACQKGKFTKVPFKAKNVVSTSRPLELLHIDLFGPVKIESIGGKRYGMVIVDDYSRWTW 802
C CQ GK K+ + +TSR LELLH+DL GP+++ES+GGKRY V+VDD+SR+TW
Sbjct: 2182 CGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTW 2361
Query: 803 VKFLTRKDESHVVFSTFIAQVQNEKACRIVRVRSDHGGEFENDKFESLFDSYGIAHDFSC 862
V F+ K ++ VF ++Q EK C I R+RSDHG EFEN KF S GI H+FS
Sbjct: 2362 VNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSA 2541
Query: 863 PRTPQQNGVVERKNRTLQEMARTMLQETGMAKHFWAEAVNTACYIQNRISVRPILNKTPY 922
TPQQNG+VERKNRTLQE AR ML + + WAEA+NTACYI NR+++R T Y
Sbjct: 2542 AITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLY 2721
Query: 923 ELWKNIKPNISYFHPFGCVCYVLNTKDRLHKFDAKSSKCLLLGYSERSKGFRFYNTDAKT 982
E+WK KP + +FH FG CY+L +++ K D KS + LGYS S+ +R +N+ +T
Sbjct: 2722 EIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRT 2901
Query: 983 IEESIHVRFDDKLDSDQSKLVEKFADLSINVSDKGKAPEEVEPEEDEPEEEAGPSNSQTL 1042
+ ESI+V DD + + + E NV+D K+ E E + +E P+ +Q
Sbjct: 2902 VMESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDE---PNINQPD 3072
Query: 1043 KKS--RITAAHPKELILGNKDEPVRTKSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKD 1100
K+ RI HPKELI+G+ + V T+S E ++S VS IEPK++ EAL D+
Sbjct: 3073 KRPSIRIQKMHPKELIIGDPNRGVTTRSR----EIEIVSNSCFVSKIEPKNVKEALTDEF 3240
Query: 1101 WILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQ 1160
WI AM+EEL QF +N+VW LV +PE VIGTKW+F+NK NE+G + RNKARLVAQGY+Q
Sbjct: 3241 WINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQ 3420
Query: 1161 QEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFE 1220
EG+D+ ETFAPVARLE+IRLL+ + L+QMDVKSAFLNGY++EE YV QP GF
Sbjct: 3421 IEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFV 3600
Query: 1221 DEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQ 1280
D PDHV++LKK+LYGLKQAPRAWYERL+ FL + + +G +D TLF K ++++I Q
Sbjct: 3601 DPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQ 3780
Query: 1281 IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEGTYIHQSKYTKE 1340
IYVDDI+FG + + + F + MQ+EFEMS++GEL YF+G+QV Q + ++ QSKY K
Sbjct: 3781 IYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKN 3960
Query: 1341 LLKKFNMLESTVAKTPMHPTCILEKEYKSGKVCQKLYRGMIGSLLYLTASRPDILFSVHL 1400
++KKF M ++ +TP L K+ V Q LYR MIGSLLYLTASRPDI ++V +
Sbjct: 3961 IVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGV 4140
Query: 1401 CARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTS 1460
CAR+Q++P+ +HL VKRIL+Y+ GT++ G+MY S+ L GYCDAD+AG +RKSTS
Sbjct: 4141 CARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTS 4320
Query: 1461 GNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILESNIPIY 1520
G C +LG+NL+SW SK+Q+ ++LSTAEAEYI+A +Q++WMK L++Y + + + +Y
Sbjct: 4321 GGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTLY 4500
Query: 1521 CDNTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAED 1580
CDN +AI++SKNP+ HSR KHI+++HH+IRD V V+ L+ VDT+ Q ADIFTK L +
Sbjct: 4501 CDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDAN 4680
Query: 1581 RF 1582
+F
Sbjct: 4681 QF 4686
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 229 bits (585), Expect = 6e-60
Identities = 109/153 (71%), Positives = 132/153 (86%)
Frame = -3
Query: 1103 LAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQE 1162
+AM+EELNQF +N+VW LV+KPE+ VIGTKWVFRNKL+E G ++RNKARLVA+GY+Q+E
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 1163 GIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDE 1222
GIDY ET+APVARLE IR+L+++ N L+QMDVKSAFLNG I EEVYV QPPGFE
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 1223 KKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLE 1255
KP HV+KL+K+LYGLKQAPRAWYER+S+FLLE
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>TC232995
Length = 1009
Score = 224 bits (570), Expect = 3e-58
Identities = 111/173 (64%), Positives = 132/173 (76%)
Frame = +2
Query: 1213 VHQPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTY 1272
V QPPGFE KP+HV+KL+K+LYGLKQAPRAWYERLS+FLLE EF RGKVDTTLF K
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 1273 KDDILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEGTYI 1332
+DIL+VQIYVDDIIFGS N SLCKEFS MQ+EFEMSMMGELKYF+G+Q+ QT G +I
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 1333 HQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEYKSGKVCQKLYRGMIGSLL 1385
+QSKY KEL+K+F M + TPM C L+K+ + K YR IG ++
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEVV 520
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 217 bits (552), Expect = 4e-56
Identities = 121/247 (48%), Positives = 158/247 (62%), Gaps = 2/247 (0%)
Frame = +3
Query: 1218 GFEDEKKPDHVFKLKKSL--YGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDD 1275
GFED+++P HVF + L G+K ++ S ++ K+
Sbjct: 330 GFEDKERPCHVFMV*NKL*ELGMKG*VHF*FQMDSPEE*RTPHYSERLK--------KET 485
Query: 1276 ILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEGTYIHQS 1335
LI+ IYVDDIIFG+ ++ +CKEF E+M+ FE SM GELK+ +G+Q+ Q G +IHQ
Sbjct: 486 FLIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQE 665
Query: 1336 KYTKELLKKFNMLESTVAKTPMHPTCILEKEYKSGKVCQKLYRGMIGSLLYLTASRPDIL 1395
KYTK LK+F M E+ TPMH + I++K+ K K Y GMI SL YLT+SRPDI+
Sbjct: 666 KYTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIV 845
Query: 1396 FSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTE 1455
F V LCARFQS P+ +H+TAVKRILRYL GTTN L +KK SE+ L GYCD +AGD+ E
Sbjct: 846 FVVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVE 1025
Query: 1456 RKSTSGN 1462
RKSTS N
Sbjct: 1026 RKSTSRN 1046
>TC213445
Length = 705
Score = 126 bits (317), Expect(2) = 1e-48
Identities = 59/98 (60%), Positives = 77/98 (78%)
Frame = +1
Query: 1453 RTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQI 1512
+T+R+STS C F+GS LVSW SK+Q+++ LSTAEAEYISA Q+ WM+ QL DY +
Sbjct: 400 KTDRESTSDTCHFIGSALVSWHSKKQNSVVLSTAEAEYISARSYYAQIFWMRQQLFDYGL 579
Query: 1513 LESNIPIYCDNTAAISLSKNPILHSRAKHIEVKHHFIR 1550
+IPI CDNT+AI+LSKN IL+SR KHIE++HHF+R
Sbjct: 580 KLDHIPIRCDNTSAINLSKNHILYSRTKHIEIRHHFLR 693
Score = 87.4 bits (215), Expect(2) = 1e-48
Identities = 40/68 (58%), Positives = 51/68 (74%)
Frame = +2
Query: 1380 MIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEY 1439
MI S LYL+ SRP I+FSV +C R+Q++P+E+HL+ +KRI+RYL G NLGL Y K S Y
Sbjct: 197 MIESFLYLSTSRPHIMFSVCMCVRYQANPKESHLSVIKRIMRYLLGIINLGLWYPKNSSY 376
Query: 1440 KLSGYCDA 1447
L GY DA
Sbjct: 377 NLVGYSDA 400
>BM143109
Length = 415
Score = 172 bits (436), Expect = 1e-42
Identities = 85/133 (63%), Positives = 103/133 (76%)
Frame = +1
Query: 1215 QPPGFEDEKKPDHVFKLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKD 1274
QPP ++ +KP+HVFKLKK LYGLKQA RAWYE LS FLL+ F +GKVDT LF +
Sbjct: 4 QPPVRKNSEKPNHVFKLKKVLYGLKQALRAWYELLSKFLLDKGFSKGKVDTNLFI*KKLN 183
Query: 1275 DILIVQIYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEGTYIHQ 1334
DIL+VQIYVDDIIFGS N SLCK+FS+ MQ EFEMSMM EL +F+G+Q+ QT G +I Q
Sbjct: 184 DILLVQIYVDDIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNGIFISQ 363
Query: 1335 SKYTKELLKKFNM 1347
SKY K+L+ +F M
Sbjct: 364 SKYCKDLIHRFGM 402
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein, partial
(7%)
Length = 804
Score = 171 bits (432), Expect = 3e-42
Identities = 90/217 (41%), Positives = 137/217 (62%), Gaps = 4/217 (1%)
Frame = +1
Query: 1377 YRGMIGSLLYLTASRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYK-- 1434
+R +IGSL YL SRP+I F+V L +RF PR +H+ A KR+LR +KGT G+++
Sbjct: 22 FRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLFPFK 201
Query: 1435 -KTSEYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISA 1493
K+ + L GY D+D+ D + KST G V+ +SK+Q IALST EAEY++A
Sbjct: 202 AKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAEYVAA 381
Query: 1494 AICSTQMLWMKHQLEDYQILESN-IPIYCDNTAAISLSKNPILHSRAKHIEVKHHFIRDY 1552
++ + Q +WM + LE+ ++ E + + DN +AI+L+K+P LH R+KHIE++ H+IRD
Sbjct: 382 SLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHYIRDQ 561
Query: 1553 VQKGVLLLKFVDTDHQWADIFTKPLAEDRFNFILKNL 1589
V KG + +++ + Q AD+ TKP+ RF I L
Sbjct: 562 VSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>AI959950
Length = 466
Score = 168 bits (426), Expect(2) = 2e-41
Identities = 86/130 (66%), Positives = 102/130 (78%)
Frame = -1
Query: 1104 AMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEG 1163
AM+EEL+QF KN+V LVK P+ V+G KW+F NKL+E G VVR KARLVA+GYSQQEG
Sbjct: 391 AMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQEG 212
Query: 1164 IDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEK 1223
IDY +TFA VARLE I +L+SF+ N+ L+QMDVKSAFLNG I +EVYV QPPGFE+E
Sbjct: 211 IDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFENET 32
Query: 1224 KPDHVFKLKK 1233
HVFKL K
Sbjct: 31 LHQHVFKLNK 2
Score = 21.2 bits (43), Expect(2) = 2e-41
Identities = 8/16 (50%), Positives = 13/16 (81%)
Frame = -2
Query: 1083 LVSLIEPKSIDEALQD 1098
L+ ++PK IDEA++D
Sbjct: 453 LIFEMKPKHIDEAIKD 406
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 157 bits (398), Expect = 3e-38
Identities = 73/111 (65%), Positives = 95/111 (84%)
Frame = +3
Query: 1088 EPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVV 1147
EPK+I EA+ D +WI+ M+EELNQF +N+VW LV+KPE+ VIGTKWVFRNKL+E G ++
Sbjct: 3 EPKNIKEAIVDDNWIIVMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIII 182
Query: 1148 RNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDV 1198
RNKARLVA+GY+Q+EGIDY ET+APVARLEAIR+L++++ N L+QMDV
Sbjct: 183 RNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMNFKLYQMDV 335
>CO983516
Length = 724
Score = 152 bits (385), Expect = 9e-37
Identities = 73/120 (60%), Positives = 92/120 (75%)
Frame = +2
Query: 1170 FAPVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVF 1229
F PVARLE+IRLL+ + L+QMDVKSAFLNGY++EEVYV QP GF D PDHV+
Sbjct: 365 FHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPDHVY 544
Query: 1230 KLKKSLYGLKQAPRAWYERLSSFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDIIFG 1289
+LKK+LYGLKQAPRAWYERL+ L + + +G +D TLF K ++++I QIYVDDI+FG
Sbjct: 545 RLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFG 724
>AI855982
Length = 484
Score = 145 bits (366), Expect = 1e-34
Identities = 78/165 (47%), Positives = 111/165 (67%)
Frame = +2
Query: 1052 PKELILGNKDEPVRTKSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMEEELNQ 1111
P + I+G+ + V T R S + L + VS+IEPK+I EA+ D +WI+AM+EELNQ
Sbjct: 2 PLDNIIGDISKGVTT----RHSLKDLCNNMAFVSMIEPKNIKEAIVDDNWIIAMQEELNQ 169
Query: 1112 FSKNDVWSLVKKPESVLVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFA 1171
F +N+VW LV+KP++ VI TKWVFRNKL+E ++ +KARLVA+GY+Q +G+DY T+A
Sbjct: 170 FERNNVWKLVEKPDNYPVI*TKWVFRNKLDEHRIIIIHKARLVAEGYNQVDGLDYEHTYA 349
Query: 1172 PVARLEAIRLLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQP 1216
+ARL I + +S+ N L+ SA L+G + EVYV QP
Sbjct: 350 SIARL*VIIMPLSYVYIMNSTLYHYACVSALLHGLLLHEVYVDQP 484
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 140 bits (353), Expect = 4e-33
Identities = 65/151 (43%), Positives = 100/151 (66%), Gaps = 1/151 (0%)
Frame = +2
Query: 1440 KLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQ 1499
+LSGYCDAD+AG +R+STSG C F+G NLVSW SK+Q+ +A S+AEAEY S A+ + +
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 1500 MLWMKHQLEDYQILES-NIPIYCDNTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVL 1558
++W+K L++ + E + +YCDN AA+ ++ NP+ H R KHIE+ HFIR+ + +
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 1559 LLKFVDTDHQWADIFTKPLAEDRFNFILKNL 1589
+ +F+ ++ Q DI TK L + + L
Sbjct: 374 VTEFIGSNDQPVDILTKSLRGPKIQIVCSKL 466
>AI966222
Length = 430
Score = 113 bits (283), Expect(2) = 2e-30
Identities = 50/88 (56%), Positives = 64/88 (71%)
Frame = +1
Query: 881 EMARTMLQETGMAKHFWAEAVNTACYIQNRISVRPILNKTPYELWKNIKPNISYFHPFGC 940
EMART L + KHF AE +N CY+QN+I +RPIL +TPYELWK KPNISYF+PF C
Sbjct: 1 EMARTTLNDNLTPKHF*AEVMNIVCYLQNKIYIRPILKRTPYELWKGRKPNISYFYPFRC 180
Query: 941 VCYVLNTKDRLHKFDAKSSKCLLLGYSE 968
C+++NTKD L K D+KS + + YS+
Sbjct: 181 KCFIINTKDNLGKIDSKSDCGIFIAYSK 264
Score = 39.3 bits (90), Expect(2) = 2e-30
Identities = 22/51 (43%), Positives = 32/51 (62%), Gaps = 1/51 (1%)
Frame = +2
Query: 963 LLGYSERSKGFRFYNTDAKTIEESIHVRF-DDKLDSDQSKLVEKFADLSIN 1012
LL + SK FR YN+ IEE+IH+RF +K + + +L E FADL ++
Sbjct: 248 LLHTLKLSKAFRVYNSGTLVIEEAIHIRFGKNKPNKELLELDESFADLRLD 400
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 127 bits (320), Expect = 3e-29
Identities = 61/130 (46%), Positives = 90/130 (68%)
Frame = -2
Query: 1121 VKKPESVLVIGTKWVFRNKLNEKGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIR 1180
V P +G +WV+ K+ G+V R KARLVA+GY+Q GIDY +TF+PVA+L +R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 1181 LLISFSVNHNIVLHQMDVKSAFLNGYISEEVYVHQPPGFEDEKKPDHVFKLKKSLYGLKQ 1240
L ++ + + LHQ+D+K+AFL+G + E++Y+ QPPGF + + V KL +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 1241 APRAWYERLS 1250
+PRAW+ + S
Sbjct: 46 SPRAWFGKFS 17
>BU549979
Length = 615
Score = 126 bits (316), Expect = 9e-29
Identities = 64/183 (34%), Positives = 112/183 (60%), Gaps = 3/183 (1%)
Frame = -1
Query: 1403 RFQSDPRETHLTAVKRILRYLKGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGN 1462
R+QS+P H K+++RYL+GT + LMYK+T+ ++ GY D+D+AG R+STSG
Sbjct: 606 RYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGY 427
Query: 1463 CQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILES---NIPI 1519
L +VSW S +Q+ IA ST E E++ ++ +W+K + ++++S + +
Sbjct: 426 IFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKL 247
Query: 1520 YCDNTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAE 1579
YCDN AA+ ++KN +R+KHI++K+ IR+ V++ ++++ V+T+ D TK +
Sbjct: 246 YCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTP 67
Query: 1580 DRF 1582
F
Sbjct: 66 KNF 58
>CO982036
Length = 674
Score = 122 bits (307), Expect = 1e-27
Identities = 75/212 (35%), Positives = 118/212 (55%), Gaps = 5/212 (2%)
Frame = -2
Query: 1272 YKDDILIVQ--IYVDDIIFGSANQSLCKEFSEMMQAEFEMSMMGELKYFMGIQVDQTPEG 1329
YK IL V +YVD II GS+ +L + + + + F + ++G+L YF+ I+V P+
Sbjct: 673 YKTHILTVYLLVYVDIIITGSSC-TLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDL 497
Query: 1330 TYIHQSKYTKELLKKFNMLESTVAKTPMHPTCILEKEYKSGKVCQKLYRGMIGSLLYLTA 1389
+ ++ + +K ++ +PM TC L K YR ++G+L Y T
Sbjct: 496 LFSLRTSIFEIFCRKPR*QAQPIS-SPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTV 320
Query: 1390 SRPDILFSVHLCARFQSDPRETHLTAVKRILRYLKGTTNLGLMYK---KTSEYKLSGYCD 1446
RP+I F+V+ +F S+P ++H T VKRILRYLKG+ + GL K + + G+CD
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 1447 ADYAGDRTERKSTSGNCQFLGSNLVSWASKRQ 1478
AD+A +++STSG FLG NL+SW +Q
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>BG508993
Length = 374
Score = 120 bits (300), Expect = 6e-27
Identities = 53/123 (43%), Positives = 83/123 (67%), Gaps = 1/123 (0%)
Frame = +1
Query: 1424 KGTTNLGLMYKKTSEYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIAL 1483
KGT + GL Y ++ YKL G+CD+D+AGD +RKST+G F+G + +W+SK+Q + L
Sbjct: 4 KGTIDFGLFYSPSNNYKLVGFCDSDFAGDVDDRKSTTGFVFFMGDCVFTWSSKKQGIVTL 183
Query: 1484 STAEAEYISAAICSTQMLWMKHQLEDYQILE-SNIPIYCDNTAAISLSKNPILHSRAKHI 1542
T EAEY++A C+ +W++ LE+ Q+L+ + IY DN +A L+KN + H R+KHI
Sbjct: 184 FTCEAEYVAATSCTCHAIWLRRLLEELQLLQKESTKIYVDNRSAQELAKNSVFHERSKHI 363
Query: 1543 EVK 1545
+ +
Sbjct: 364 DTR 372
>TC213888 similar to UP|Q9SFE1 (Q9SFE1) T26F17.17, partial (11%)
Length = 493
Score = 116 bits (290), Expect = 9e-26
Identities = 54/131 (41%), Positives = 88/131 (66%), Gaps = 1/131 (0%)
Frame = +3
Query: 1453 RTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQI 1512
R +RKST+G F+G +W SK+Q + LST EAEY++A C +W+++ L++ ++
Sbjct: 6 RDDRKSTTGFVFFMGDTAFTWMSKKQPIVTLSTCEAEYVAATSCVCHAIWLRNLLKELKM 185
Query: 1513 -LESNIPIYCDNTAAISLSKNPILHSRAKHIEVKHHFIRDYVQKGVLLLKFVDTDHQWAD 1571
E + I DN +A++L+KNP+ H ++KHI+ ++HFIR+ ++K + LK+V + Q AD
Sbjct: 186 PQEEPMEICVDNKSALALAKNPVFHEKSKHIDTRYHFIRECIEKKEVKLKYVMSQDQAAD 365
Query: 1572 IFTKPLAEDRF 1582
IFTKPL + F
Sbjct: 366 IFTKPLKLETF 398
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 114 bits (285), Expect = 3e-25
Identities = 56/134 (41%), Positives = 81/134 (59%)
Frame = +3
Query: 1083 LVSLIEPKSIDEALQDKDWILAMEEELNQFSKNDVWSLVKKPESVLVIGTKWVFRNKLNE 1142
L SL P +I EAL W AM +E+ N W LV P +G +WV+ K+
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 1143 KGDVVRNKARLVAQGYSQQEGIDYTETFAPVARLEAIRLLISFSVNHNIVLHQMDVKSAF 1202
G V R KARLVA+GY+Q GI+Y +TF+PV L +RL ++ + + LHQ+D+K+AF
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 1203 LNGYISEEVYVHQP 1216
L+G + E++Y+ QP
Sbjct: 363 LHGDLEEDIYMEQP 404
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.332 0.142 0.437
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 68,651,079
Number of Sequences: 63676
Number of extensions: 963056
Number of successful extensions: 8040
Number of sequences better than 10.0: 144
Number of HSP's better than 10.0 without gapping: 7778
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 7980
length of query: 1596
length of database: 12,639,632
effective HSP length: 110
effective length of query: 1486
effective length of database: 5,635,272
effective search space: 8374014192
effective search space used: 8374014192
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.5 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0128.9