
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0101.5
(967 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 884 0.0
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 882 0.0
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 214 1e-55
TC232995 207 2e-53
TC211311 weakly similar to UP|O24587 (O24587) Pol protein, parti... 202 4e-52
TC213445 116 5e-46
BM143109 161 1e-39
AI959950 161 2e-39
BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Gl... 154 1e-37
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 147 3e-35
CO983516 146 4e-35
AI855982 142 9e-34
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 126 5e-29
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 122 6e-28
CO982036 117 2e-26
BU549979 115 9e-26
BG508993 114 3e-25
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 112 8e-25
AI966222 110 2e-24
AW185460 108 1e-23
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 884 bits (2284), Expect = 0.0
Identities = 445/956 (46%), Positives = 630/956 (65%), Gaps = 1/956 (0%)
Frame = +1
Query: 1 PCIDNVLLVDGLNHNLLSISQLADKGYDIIFNQKSCRAVSQIDGSVLFNSKRRNNTYKIR 60
P ++ VLLV GL NL+SISQL D+G+++ F + C ++ ++ S+ ++N Y
Sbjct: 1834 PSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWT 2013
Query: 61 LFELETQKVKCLLSVNEEQWV*HRRLGHASMRKIS*LCKLDLVRGLPTLKFSSDALCEAC 120
E CL S +E + H+R GH +R + + VRG+P LK +C C
Sbjct: 2014 PQETSYSST-CLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGEC 2190
Query: 121 QKGKFTKVSFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTWVKFI 180
Q GK K+S + +TSR LELLH+DL GP++ ES+GGKRY V+VDD+SR+TWV FI
Sbjct: 2191 QIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFI 2370
Query: 181 SRKDESHSVFSTFIVQVQNENACRIMRVRSDHGGVFENDKFESLFDSYGIAYDFSCPRTP 240
K E+ VF +++Q E C I R+RSDHG FEN +F S GI ++FS TP
Sbjct: 2371 REKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITP 2550
Query: 241 QQNGVVERKNRTLQEMSRTMLQEIDMAKHFWAEAVNTSCYIQNRISVRPILNKTPYELWK 300
QQNG+VERKNRTLQE +R ML ++ + WAEA+NT+CYI NR+++R T YE+WK
Sbjct: 2551 QQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWK 2730
Query: 301 KVKPNISYFHPFGCVCYALNTKDRLHKFDSKSSKCLLLGYSKRSKGFRIYNTDAKTIEEY 360
KP++ +FH FG CY L +++ K D KS + LGYS S+ +R++N+ +T+ E
Sbjct: 2731 GRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMES 2910
Query: 361 IHVRFDDKLDSDQSKLVEKFVDMSINVSDKGKAPEEAEPEEDSPEEVGPSDPQPQKKSRI 420
I+V DD + + + E NV+D K+ E AE + + +E + P + +RI
Sbjct: 2911 INVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSSTRI 3090
Query: 421 VASHPKELILGNKDEPVRTRSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMEE 480
HPKELI+G+ + V TRS E ++S VS IEPK++ EAL D+ WI AM+E
Sbjct: 3091 QKMHPKELIIGDPNRGVTTRS----REVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQE 3258
Query: 481 ELNQFSKNDVWNIVKKPQGVHIIGTKWVFRNKLNEKGDVVRNKARLVAQGYRQQEGIDYT 540
EL QF +N+VW +V +P+G ++IGTKW+F+NK NE+G + RNKARLVAQGY Q EG+D+
Sbjct: 3259 ELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFD 3438
Query: 541 ETFAPVARLEAIRLLISFSVNHNIILHQMDVKSVFLNGYISEEVYVHQPPGFEDEKNPDH 600
ETFAPVARLE+IRLL+ + L+QMDVKS FLNGY++EEVYV QP GF D +PDH
Sbjct: 3439 ETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDH 3618
Query: 601 VFNLKKSLYGLKQAPRAWYERL-RFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDII 659
V+ LKK+LYGLKQAPRAWYERL FL + + +G +D TLF K ++++I QIYVDDI+
Sbjct: 3619 VYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIV 3798
Query: 660 FGSANSSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEATYIHQSKYTKELLKKFNM 719
FG ++ + + F + MQ+EFEMS++GEL YFLG+QV Q ++ ++ QS+Y K ++KKF M
Sbjct: 3799 FGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKNIVKKFGM 3978
Query: 720 TESIIAKTLMHPTCILEKEDASGKVCQKLYRGMICSFLYLTASRPDILFSEHLCARFQSD 779
+ +T L K++A V Q LYR MI S LYLTASRPDI ++ +CAR+Q++
Sbjct: 3979 ENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQAN 4158
Query: 780 PRETHLTVVKRILRYLKGTTNLGLLYKKTSEYKLSGYCDADYAGDRTERKSTSGNCQFLG 839
P+ +HLT VKRIL+Y+ GT++ G++Y S L GYCDAD+AG +RKSTSG C +LG
Sbjct: 4159 PKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTSGGCFYLG 4338
Query: 840 SNLVSWASKRQSTIALSTAEAEYISIAICNT*MLWMKHQLEDYQILESNIPIYCDNTAAI 899
+NL+SW SK+Q+ ++LSTAEAEYI+ + ++WMK L++Y + + + +YCDN +AI
Sbjct: 4339 NNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTLYCDNMSAI 4518
Query: 900 SLSKNPILHSRAKYIEVKYHFIRDYVQKGYFF*SSLILTINSADIFTKPLAEDRFK 955
++SKNP+ HSR K+I++++H+IRD V + ADIFTK L ++F+
Sbjct: 4519 NISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDANQFE 4686
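Note on subject coordinates: in a TBLASTN report the Sbjct positions count nucleotides of the database sequence, so each aligned subject residue (gaps excluded) advances the coordinate by three. The sketch below is one way to check a Frame line and the number of residues on a Sbjct line against the coordinates. The function names are illustrative and not part of BLAST, and the minus-strand formula assumes the usual convention of counting frames from the 3' end of the subject.

```python
def sbjct_frame(start, end, subject_length):
    """Reading frame implied by the first Sbjct coordinate of an HSP.

    start > end means the hit lies on the minus strand.
    """
    if start <= end:
        return (start - 1) % 3 + 1
    # Minus strand: frames are counted from the 3' end of the subject.
    return -((subject_length - start) % 3 + 1)


def residues_spanned(start, end):
    """Number of subject residues covered by a nucleotide interval."""
    return (abs(end - start) + 1) // 3


# First Sbjct line of the TC204439 alignment (Length = 4731, Frame = +1).
print(sbjct_frame(1834, 2013, 4731), residues_spanned(1834, 2013))
# First Sbjct line of the AI855818 alignment (Length = 463, Frame = -3).
print(sbjct_frame(461, 282, 463), residues_spanned(461, 282))
```

The second Sbjct line of the TC204439 alignment (2014 to 2190) spans 59 residues rather than 60 because it contains the single gap reported in the Gaps field.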
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 882 bits (2278), Expect = 0.0
Identities = 444/956 (46%), Positives = 627/956 (65%), Gaps = 1/956 (0%)
Frame = +1
Query: 1 PCIDNVLLVDGLNHNLLSISQLADKGYDIIFNQKSCRAVSQIDGSVLFNSKRRNNTYKIR 60
P ++ VLLV GL NL+SISQL D+G+++ F + C ++ ++ S+ ++N Y
Sbjct: 1837 PSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCYLWT 2016
Query: 61 LFELETQKVKCLLSVNEEQWV*HRRLGHASMRKIS*LCKLDLVRGLPTLKFSSDALCEAC 120
E CL S +E + H+R GH +R + + VRG+P LK +C C
Sbjct: 2017 PQETSYSST-CLFSKEDEVKIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICGEC 2193
Query: 121 QKGKFTKVSFKAKNVVSTSRPLELLHIDLFGPVKTESIGGKRYGMVIVDDYSRWTWVKFI 180
Q GK K+S + +TSR LELLH+DL GP++ ES+GGKRY V+VDD+SR+TWV FI
Sbjct: 2194 QIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVNFI 2373
Query: 181 SRKDESHSVFSTFIVQVQNENACRIMRVRSDHGGVFENDKFESLFDSYGIAYDFSCPRTP 240
K ++ VF +++Q E C I R+RSDHG FEN KF S GI ++FS TP
Sbjct: 2374 REKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITP 2553
Query: 241 QQNGVVERKNRTLQEMSRTMLQEIDMAKHFWAEAVNTSCYIQNRISVRPILNKTPYELWK 300
QQNG+VERKNRTLQE +R ML ++ + WAEA+NT+CYI NR+++R T YE+WK
Sbjct: 2554 QQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEIWK 2733
Query: 301 KVKPNISYFHPFGCVCYALNTKDRLHKFDSKSSKCLLLGYSKRSKGFRIYNTDAKTIEEY 360
KP + +FH FG CY L +++ K D KS + LGYS S+ +R++N+ +T+ E
Sbjct: 2734 GRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVMES 2913
Query: 361 IHVRFDDKLDSDQSKLVEKFVDMSINVSDKGKAPEEAEPEEDSPEEVGPSDPQPQKKSRI 420
I+V DD + + + E NV+D K+ E AE + + +E + P + RI
Sbjct: 2914 INVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDEPNINQPDKRPSIRI 3093
Query: 421 VASHPKELILGNKDEPVRTRSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMEE 480
HPKELI+G+ + V TRS E ++S VS IEPK++ EAL D+ WI AM+E
Sbjct: 3094 QKMHPKELIIGDPNRGVTTRS----REIEIVSNSCFVSKIEPKNVKEALTDEFWINAMQE 3261
Query: 481 ELNQFSKNDVWNIVKKPQGVHIIGTKWVFRNKLNEKGDVVRNKARLVAQGYRQQEGIDYT 540
EL QF +N+VW +V +P+G ++IGTKW+F+NK NE+G + RNKARLVAQGY Q EG+D+
Sbjct: 3262 ELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFD 3441
Query: 541 ETFAPVARLEAIRLLISFSVNHNIILHQMDVKSVFLNGYISEEVYVHQPPGFEDEKNPDH 600
ETFAPVARLE+IRLL+ + L+QMDVKS FLNGY++EE YV QP GF D +PDH
Sbjct: 3442 ETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFVDPTHPDH 3621
Query: 601 VFNLKKSLYGLKQAPRAWYERL-RFLLENEFVRGKVDTTLFCKTYKDDILIVQIYVDDII 659
V+ LKK+LYGLKQAPRAWYERL FL + + +G +D TLF K ++++I QIYVDDI+
Sbjct: 3622 VYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIV 3801
Query: 660 FGSANSSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEATYIHQSKYTKELLKKFNM 719
FG ++ + + F + MQ+EFEMS++GEL YFLG+QV Q ++ ++ QSKY K ++KKF M
Sbjct: 3802 FGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKNIVKKFGM 3981
Query: 720 TESIIAKTLMHPTCILEKEDASGKVCQKLYRGMICSFLYLTASRPDILFSEHLCARFQSD 779
+ +T L K++A V Q LYR MI S LYLTASRPDI ++ +CAR+Q++
Sbjct: 3982 ENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGVCARYQAN 4161
Query: 780 PRETHLTVVKRILRYLKGTTNLGLLYKKTSEYKLSGYCDADYAGDRTERKSTSGNCQFLG 839
P+ +HL VKRIL+Y+ GT++ G++Y S+ L GYCDAD+AG +RKSTSG C +LG
Sbjct: 4162 PKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTSGGCFYLG 4341
Query: 840 SNLVSWASKRQSTIALSTAEAEYISIAICNT*MLWMKHQLEDYQILESNIPIYCDNTAAI 899
+NL+SW SK+Q+ ++LSTAEAEYI+ + ++WMK L++Y + + + +YCDN +AI
Sbjct: 4342 TNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVEQDVMTLYCDNMSAI 4521
Query: 900 SLSKNPILHSRAKYIEVKYHFIRDYVQKGYFF*SSLILTINSADIFTKPLAEDRFK 955
++SKNP+ HSR K+I++++H+IRD V + ADIFTK L ++F+
Sbjct: 4522 NISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDANQFE 4689
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 214 bits (546), Expect = 1e-55
Identities = 102/153 (66%), Positives = 125/153 (81%), Gaps = 1/153 (0%)
Frame = -3
Query: 476 LAMEEELNQFSKNDVWNIVKKPQGVHIIGTKWVFRNKLNEKGDVVRNKARLVAQGYRQQE 535
+AM+EELNQF +N+VW +V+KP+ +IGTKWVFRNKL+E G ++RNKARLVA+GY Q+E
Sbjct: 461 IAMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEE 282
Query: 536 GIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSVFLNGYISEEVYVHQPPGFEDE 595
GIDY ET+APVARLE IR+L+++ N L+QMDVKS FLNG I EEVYV QPPGFE
Sbjct: 281 GIDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIP 102
Query: 596 KNPDHVFNLKKSLYGLKQAPRAWYERL-RFLLE 627
P HV+ L+K+LYGLKQAPRAWYER+ FLLE
Sbjct: 101 DKPTHVYKLQKALYGLKQAPRAWYERISNFLLE 3
>TC232995
Length = 1009
Score = 207 bits (527), Expect = 2e-53
Identities = 106/169 (62%), Positives = 125/169 (73%), Gaps = 1/169 (0%)
Frame = +2
Query: 586 VHQPPGFEDEKNPDHVFNLKKSLYGLKQAPRAWYERL-RFLLENEFVRGKVDTTLFCKTY 644
V QPPGFE P+HV+ L+K+LYGLKQAPRAWYERL FLLE EF RGKVDTTLF K
Sbjct: 2 VEQPPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRK 181
Query: 645 KDDILIVQIYVDDIIFGSANSSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEATYI 704
+DIL+VQIYVDDIIFGS N SLCKEFS MQ+EFEMSMMGELKYFLG+Q+ QT +I
Sbjct: 182 HNDILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFI 361
Query: 705 HQSKYTKELLKKFNMTESIIAKTLMHPTCILEKEDASGKVCQKLYRGMI 753
+QSKY KEL+K+F M + T M C L+K+++ + K YR I
Sbjct: 362 NQSKYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAI 508
>TC211311 weakly similar to UP|O24587 (O24587) Pol protein, partial (15%)
Length = 1213
Score = 202 bits (515), Expect = 4e-52
Identities = 116/246 (47%), Positives = 151/246 (61%), Gaps = 2/246 (0%)
Frame = +3
Query: 591 GFEDEKNPDHVFNLKKSL--YGLKQAPRAWYERLRFLLENEFVRGKVDTTLFCKTYKDDI 648
GFED++ P HVF + L G+K + F + + + K+
Sbjct: 330 GFEDKERPCHVFMV*NKL*ELGMKG*-------VHF*FQMDSPEE*RTPHYSERLKKETF 488
Query: 649 LIVQIYVDDIIFGSANSSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEATYIHQSK 708
LI+ IYVDDIIFG+ + +CKEF E+M+ FE SM GELK+ LG+Q+ Q +IHQ K
Sbjct: 489 LIIHIYVDDIIFGATSKRMCKEFFELMKDGFETSMKGELKFLLGLQIIQKVYGIFIHQEK 668
Query: 709 YTKELLKKFNMTESIIAKTLMHPTCILEKEDASGKVCQKLYRGMICSFLYLTASRPDILF 768
YTK LK+F M E+ T MH + I++K++ K Y GMI S YLT+SRPDI+F
Sbjct: 669 YTKSHLKRFRMDEAKPMATPMHRSTIIDKDEKGNHTS*KEYSGMIDSLSYLTSSRPDIVF 848
Query: 769 SEHLCARFQSDPRETHLTVVKRILRYLKGTTNLGLLYKKTSEYKLSGYCDADYAGDRTER 828
LCARFQS P+ +H+T VKRILRYL GTTN L +KK SE+ L GYCD +AGD+ ER
Sbjct: 849 VVCLCARFQSYPKISHVTAVKRILRYLVGTTNHCLWFKKRSEFDLLGYCDVYFAGDKVER 1028
Query: 829 KSTSGN 834
KSTS N
Sbjct: 1029 KSTSRN 1046
>TC213445
Length = 705
Score = 116 bits (291), Expect(2) = 5e-46
Identities = 55/98 (56%), Positives = 75/98 (76%)
Frame = +1
Query: 825 RTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISIAICNT*MLWMKHQLEDYQI 884
+T+R+STS C F+GS LVSW SK+Q+++ LSTAEAEYIS + WM+ QL DY +
Sbjct: 400 KTDRESTSDTCHFIGSALVSWHSKKQNSVVLSTAEAEYISARSYYAQIFWMRQQLFDYGL 579
Query: 885 LESNIPIYCDNTAAISLSKNPILHSRAKYIEVKYHFIR 922
+IPI CDNT+AI+LSKN IL+SR K+IE+++HF+R
Sbjct: 580 KLDHIPIRCDNTSAINLSKNHILYSRTKHIEIRHHFLR 693
Score = 87.8 bits (216), Expect(2) = 5e-46
Identities = 41/68 (60%), Positives = 52/68 (76%)
Frame = +2
Query: 752 MICSFLYLTASRPDILFSEHLCARFQSDPRETHLTVVKRILRYLKGTTNLGLLYKKTSEY 811
MI SFLYL+ SRP I+FS +C R+Q++P+E+HL+V+KRI+RYL G NLGL Y K S Y
Sbjct: 197 MIESFLYLSTSRPHIMFSVCMCVRYQANPKESHLSVIKRIMRYLLGIINLGLWYPKNSSY 376
Query: 812 KLSGYCDA 819
L GY DA
Sbjct: 377 NLVGYSDA 400
>BM143109
Length = 415
Score = 161 bits (408), Expect = 1e-39
Identities = 82/133 (61%), Positives = 100/133 (74%), Gaps = 1/133 (0%)
Frame = +1
Query: 588 QPPGFEDEKNPDHVFNLKKSLYGLKQAPRAWYERL-RFLLENEFVRGKVDTTLFCKTYKD 646
QPP ++ + P+HVF LKK LYGLKQA RAWYE L +FLL+ F +GKVDT LF +
Sbjct: 4 QPPVRKNSEKPNHVFKLKKVLYGLKQALRAWYELLSKFLLDKGFSKGKVDTNLFI*KKLN 183
Query: 647 DILIVQIYVDDIIFGSANSSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEATYIHQ 706
DIL+VQIYVDDIIFGS N SLCK+FS+ MQ EFEMSMM EL +FLG+Q+ QT +I Q
Sbjct: 184 DILLVQIYVDDIIFGSTNDSLCKKFSQDMQNEFEMSMMRELNFFLGLQIKQTKNGIFISQ 363
Query: 707 SKYTKELLKKFNM 719
SKY K+L+ +F M
Sbjct: 364 SKYCKDLIHRFGM 402
>AI959950
Length = 466
Score = 161 bits (407), Expect(2) = 2e-39
Identities = 81/130 (62%), Positives = 99/130 (75%)
Frame = -1
Query: 477 AMEEELNQFSKNDVWNIVKKPQGVHIIGTKWVFRNKLNEKGDVVRNKARLVAQGYRQQEG 536
AM+EEL+QF KN+V +VK P+ ++G KW+F NKL+E G VVR KARLVA+GY QQEG
Sbjct: 391 AMQEELDQFQKNNV*KLVKLPKRKKVVGVKWIFCNKLDEDGKVVRYKARLVAKGYSQQEG 212
Query: 537 IDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSVFLNGYISEEVYVHQPPGFEDEK 596
IDY +TFA VARLE I +L+SF+ N+ L+QMDVKS FLNG I +EVYV QPPGFE+E
Sbjct: 211 IDYPKTFALVARLEVICILLSFATYSNMKLYQMDVKSAFLNGLIQKEVYVEQPPGFENET 32
Query: 597 NPDHVFNLKK 606
HVF L K
Sbjct: 31 LHQHVFKLNK 2
Score = 21.2 bits (43), Expect(2) = 2e-39
Identities = 8/16 (50%), Positives = 13/16 (81%)
Frame = -2
Query: 456 LVSLIEPKSIDEALQD 471
L+ ++PK IDEA++D
Sbjct: 453 LIFEMKPKHIDEAIKD 406
>BF596801 weakly similar to GP|29423270|g gag-pol polyprotein {Glycine max},
partial (7%)
Length = 336
Score = 154 bits (390), Expect = 1e-37
Identities = 70/111 (63%), Positives = 93/111 (83%)
Frame = +3
Query: 461 EPKSIDEALQDKDWILAMEEELNQFSKNDVWNIVKKPQGVHIIGTKWVFRNKLNEKGDVV 520
EPK+I EA+ D +WI+ M+EELNQF +N+VW +V+KP+ +IGTKWVFRNKL+E G ++
Sbjct: 3 EPKNIKEAIVDDNWIIVMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIII 182
Query: 521 RNKARLVAQGYRQQEGIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDV 571
RNKARLVA+GY Q+EGIDY ET+APVARLEAIR+L++++ N L+QMDV
Sbjct: 183 RNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMNFKLYQMDV 335
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein,
partial (7%)
Length = 804
Score = 147 bits (370), Expect = 3e-35
Identities = 84/217 (38%), Positives = 126/217 (57%), Gaps = 4/217 (1%)
Frame = +1
Query: 749 YRGMICSFLYLTASRPDILFSEHLCARFQSDPRETHLTVVKRILRYLKGTTNLGLLYK-- 806
+R +I S YL SRP+I F+ L +RF PR +H+ KR+LR +KGT G+L+
Sbjct: 22 FRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLFPFK 201
Query: 807 -KTSEYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISI 865
K+ + L GY D+D+ D + KST G V+ +SK+Q IALST EAEY++
Sbjct: 202 AKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAEYVAA 381
Query: 866 AICNT*MLWMKHQLEDYQILESN-IPIYCDNTAAISLSKNPILHSRAKYIEVKYHFIRDY 924
++ +WM + LE+ ++ E + + DN +AI+L+K+P LH R+K+IE+++H+IRD
Sbjct: 382 SLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHYIRDQ 561
Query: 925 VQKGYFF*SSLILTINSADIFTKPLAEDRFKFILKNL 961
V KG AD+ TKP+ RFK I L
Sbjct: 562 VSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSEL 672
>CO983516
Length = 724
Score = 146 bits (369), Expect = 4e-35
Identities = 73/120 (60%), Positives = 91/120 (75%), Gaps = 1/120 (0%)
Frame = +2
Query: 543 FAPVARLEAIRLLISFSVNHNIILHQMDVKSVFLNGYISEEVYVHQPPGFEDEKNPDHVF 602
F PVARLE+IRLL+ + L+QMDVKS FLNGY++EEVYV QP GF D +PDHV+
Sbjct: 365 FHPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFIDPTHPDHVY 544
Query: 603 NLKKSLYGLKQAPRAWYERLRFLLENE-FVRGKVDTTLFCKTYKDDILIVQIYVDDIIFG 661
LKK+LYGLKQAPRAWYERL LL + + +G +D TLF K ++++I QIYVDDI+FG
Sbjct: 545 RLKKALYGLKQAPRAWYERLTELLTQQGYRKGGIDKTLFVKQDAENLMIAQIYVDDIVFG 724
>AI855982
Length = 484
Score = 142 bits (357), Expect = 9e-34
Identities = 74/165 (44%), Positives = 107/165 (64%)
Frame = +2
Query: 425 PKELILGNKDEPVRTRSAFRPSEETLLSLKGLVSLIEPKSIDEALQDKDWILAMEEELNQ 484
P + I+G+ + V TR + + L + VS+IEPK+I EA+ D +WI+AM+EELNQ
Sbjct: 2 PLDNIIGDISKGVTTRHSLKD----LCNNMAFVSMIEPKNIKEAIVDDNWIIAMQEELNQ 169
Query: 485 FSKNDVWNIVKKPQGVHIIGTKWVFRNKLNEKGDVVRNKARLVAQGYRQQEGIDYTETFA 544
F +N+VW +V+KP +I TKWVFRNKL+E ++ +KARLVA+GY Q +G+DY T+A
Sbjct: 170 FERNNVWKLVEKPDNYPVI*TKWVFRNKLDEHRIIIIHKARLVAEGYNQVDGLDYEHTYA 349
Query: 545 PVARLEAIRLLISFSVNHNIILHQMDVKSVFLNGYISEEVYVHQP 589
+ARL I + +S+ N L+ S L+G + EVYV QP
Sbjct: 350 SIARL*VIIMPLSYVYIMNSTLYHYACVSALLHGLLLHEVYVDQP 484
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment),
partial (30%)
Length = 687
Score = 126 bits (316), Expect = 5e-29
Identities = 62/151 (41%), Positives = 93/151 (61%), Gaps = 1/151 (0%)
Frame = +2
Query: 812 KLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISIAICNT* 871
+LSGYCDAD+AG +R+STSG C F+G NLVSW SK+Q+ +A S+AEAEY S+A+
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 872 MLWMKHQLEDYQILES-NIPIYCDNTAAISLSKNPILHSRAKYIEVKYHFIRDYVQKGYF 930
++W+K L++ + E + +YCDN AA+ ++ NP+ H R K+IE+ HFIR+ +
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 931 F*SSLILTINSADIFTKPLAEDRFKFILKNL 961
+ DI TK L + + + L
Sbjct: 374 VTEFIGSNDQPVDILTKSLRGPKIQIVCSKL 466
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 122 bits (307), Expect = 6e-28
Identities = 59/126 (46%), Positives = 85/126 (66%)
Frame = -2
Query: 494 VKKPQGVHIIGTKWVFRNKLNEKGDVVRNKARLVAQGYRQQEGIDYTETFAPVARLEAIR 553
V P G +G +WV+ K+ G+V R KARLVA+GY Q GIDY +TF+PVA+L +R
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 554 LLISFSVNHNIILHQMDVKSVFLNGYISEEVYVHQPPGFEDEKNPDHVFNLKKSLYGLKQ 613
L ++ + + LHQ+D+K+ FL+G + E++Y+ QPPGF + V L +SLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 614 APRAWY 619
+PRAW+
Sbjct: 46 SPRAWF 29
>CO982036
Length = 674
Score = 117 bits (293), Expect = 2e-26
Identities = 73/212 (34%), Positives = 115/212 (53%), Gaps = 5/212 (2%)
Frame = -2
Query: 644 YKDDILIVQ--IYVDDIIFGSANSSLCKEFSEMMQAEFEMSMMGELKYFLGIQVDQTPEA 701
YK IL V +YVD II GS+ +L + + + + F + ++G+L YF+ I+V P+
Sbjct: 673 YKTHILTVYLLVYVDIIITGSS-CTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDL 497
Query: 702 TYIHQSKYTKELLKKFNMTESIIAKTLMHPTCILEKEDASGKVCQKLYRGMICSFLYLTA 761
+ ++ + +K I+ M TC L K D+ YR ++ + Y T
Sbjct: 496 LFSLRTSIFEIFCRKPR*QAQPISSP-MTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTV 320
Query: 762 SRPDILFSEHLCARFQSDPRETHLTVVKRILRYLKGTTNLGLLYK---KTSEYKLSGYCD 818
RP+I F+ + +F S+P ++H T VKRILRYLKG+ + GL K + + G+CD
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 819 ADYAGDRTERKSTSGNCQFLGSNLVSWASKRQ 850
AD+A +++STSG FLG NL+SW +Q
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>BU549979
Length = 615
Score = 115 bits (288), Expect = 9e-26
Identities = 62/184 (33%), Positives = 106/184 (56%), Gaps = 3/184 (1%)
Frame = -1
Query: 775 RFQSDPRETHLTVVKRILRYLKGTTNLGLLYKKTSEYKLSGYCDADYAGDRTERKSTSGN 834
R+QS+P H K+++RYL+GT + L+YK+T+ ++ GY D+D+AG R+STSG
Sbjct: 606 RYQSNPGIDHWKTAKKVMRYLQGTKDYMLMYKQTNCLEVIGYSDSDFAGCVDSRRSTSGY 427
Query: 835 CQFLGSNLVSWASKRQSTIALSTAEAEYISIAICNT*MLWMKHQLEDYQILES---NIPI 891
L +VSW S +Q+ IA ST E E++ + +W+K + ++++S + +
Sbjct: 426 IFMLADGVVSWRSSKQTLIATSTMEVEFVPCFEATSHGVWLKSFMSSLRVVDSISRPLKL 247
Query: 892 YCDNTAAISLSKNPILHSRAKYIEVKYHFIRDYVQKGYFF*SSLILTINSADIFTKPLAE 951
YCDN AA+ ++KN +R+K+I++KY IR+ V++ + + D TK +
Sbjct: 246 YCDNFAAVFMAKNNKSGNRSKHIDIKYLVIRERVKEKKVVIEHVNTELMIVDPLTKGMTP 67
Query: 952 DRFK 955
FK
Sbjct: 66 KNFK 55
>BG508993
Length = 374
Score = 114 bits (284), Expect = 3e-25
Identities = 51/123 (41%), Positives = 81/123 (65%), Gaps = 1/123 (0%)
Frame = +1
Query: 796 KGTTNLGLLYKKTSEYKLSGYCDADYAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIAL 855
KGT + GL Y ++ YKL G+CD+D+AGD +RKST+G F+G + +W+SK+Q + L
Sbjct: 4 KGTIDFGLFYSPSNNYKLVGFCDSDFAGDVDDRKSTTGFVFFMGDCVFTWSSKKQGIVTL 183
Query: 856 STAEAEYISIAICNT*MLWMKHQLEDYQILE-SNIPIYCDNTAAISLSKNPILHSRAKYI 914
T EAEY++ C +W++ LE+ Q+L+ + IY DN +A L+KN + H R+K+I
Sbjct: 184 FTCEAEYVAATSCTCHAIWLRRLLEELQLLQKESTKIYVDNRSAQELAKNSVFHERSKHI 363
Query: 915 EVK 917
+ +
Sbjct: 364 DTR 372
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 112 bits (280), Expect = 8e-25
Identities = 55/134 (41%), Positives = 80/134 (59%)
Frame = +3
Query: 456 LVSLIEPKSIDEALQDKDWILAMEEELNQFSKNDVWNIVKKPQGVHIIGTKWVFRNKLNE 515
L SL P +I EAL W AM +E+ N W +V P G +G +WV+ K+
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 516 KGDVVRNKARLVAQGYRQQEGIDYTETFAPVARLEAIRLLISFSVNHNIILHQMDVKSVF 575
G V R KARLVA+GY Q GI+Y +TF+PV L +RL ++ + + LHQ+D+K+ F
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 576 LNGYISEEVYVHQP 589
L+G + E++Y+ QP
Sbjct: 363 LHGDLEEDIYMEQP 404
>AI966222
Length = 430
Score = 110 bits (276), Expect = 2e-24
Identities = 51/88 (57%), Positives = 63/88 (70%)
Frame = +1
Query: 255 EMSRTMLQEIDMAKHFWAEAVNTSCYIQNRISVRPILNKTPYELWKKVKPNISYFHPFGC 314
EM+RT L + KHF AE +N CY+QN+I +RPIL +TPYELWK KPNISYF+PF C
Sbjct: 1 EMARTTLNDNLTPKHF*AEVMNIVCYLQNKIYIRPILKRTPYELWKGRKPNISYFYPFRC 180
Query: 315 VCYALNTKDRLHKFDSKSSKCLLLGYSK 342
C+ +NTKD L K DSKS + + YSK
Sbjct: 181 KCFIINTKDNLGKIDSKSDCGIFIAYSK 264
Score = 39.3 bits (90), Expect = 0.008
Identities = 23/61 (37%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Frame = +2
Query: 327 KFDSKSSKCLLLGYSKRSKGFRIYNTDAKTIEEYIHVRF-DDKLDSDQSKLVEKFVDMSI 385
K K + LL K SK FR+YN+ IEE IH+RF +K + + +L E F D+ +
Sbjct: 218 KLTQKVTVEYLLHTLKLSKAFRVYNSGTLVIEEAIHIRFGKNKPNKELLELDESFADLRL 397
Query: 386 N 386
+
Sbjct: 398 D 400
>AW185460
Length = 411
Score = 108 bits (270), Expect = 1e-23
Identities = 52/106 (49%), Positives = 72/106 (67%)
Frame = +2
Query: 761 ASRPDILFSEHLCARFQSDPRETHLTVVKRILRYLKGTTNLGLLYKKTSEYKLSGYCDAD 820
A+RPDI+++ L +RF P + H KRILRYL+GT G+ Y + +L GY D+D
Sbjct: 89 ATRPDIMYATSLLSRFMQSPSQIHFGAGKRILRYLQGTKAFGIWYTTETNSELLGYTDSD 268
Query: 821 YAGDRTERKSTSGNCQFLGSNLVSWASKRQSTIALSTAEAEYISIA 866
+AG + KSTSG LGS + SWASK+Q+T+A STAEAEY+++A
Sbjct: 269 WAGSTDDMKSTSGYAFSLGSGMFSWASKKQATVAQSTAEAEYVAVA 406
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.322 0.137 0.409
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 42,023,829
Number of Sequences: 63676
Number of extensions: 589799
Number of successful extensions: 2863
Number of sequences better than 10.0: 132
Number of HSP's better than 10.0 without gapping: 2774
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2816
length of query: 967
length of database: 12,639,632
effective HSP length: 106
effective length of query: 861
effective length of database: 5,889,976
effective search space: 5071269336
effective search space used: 5071269336
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (29.3 bits)
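Note on the statistics block: the figures above can be related to the reported scores with the standard Karlin-Altschul formulas, bit score = (lambda * S - ln K) / ln 2 and E = search space * 2^(-bit score). The following is a minimal sketch under that assumption, not the BLAST implementation. The constant and function names are illustrative, and because the printed lambda and K are rounded, the results agree with the report only approximately. Hits reported with "Expect(2)" use sum statistics over several HSPs, which this sketch does not cover.

```python
import math

# Values copied from the statistics block above.
QUERY_LEN = 967            # length of query
DB_LEN = 12_639_632        # length of database (translated)
NUM_SEQS = 63_676          # Number of Sequences
HSP_LEN = 106              # effective HSP length
GAPPED_LAMBDA, GAPPED_K = 0.267, 0.0410
UNGAPPED_LAMBDA, UNGAPPED_K = 0.322, 0.137


def effective_search_space(query_len, db_len, num_seqs, hsp_len):
    """Product of the effective query and database lengths."""
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw alignment score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(raw, lam, k, space):
    """Expected number of chance hits scoring at least `raw`."""
    return space * 2.0 ** (-bit_score(raw, lam, k))


space = effective_search_space(QUERY_LEN, DB_LEN, NUM_SEQS, HSP_LEN)
print(space)  # 5071269336, matching "effective search space"

# Raw score 2284 (TC204439): close to the reported 884 bits.
print(round(bit_score(2284, GAPPED_LAMBDA, GAPPED_K), 1))
# Raw score 546 (AI855818): close to the reported Expect of 1e-55.
print(expect(546, GAPPED_LAMBDA, GAPPED_K, space))
# S1 = 41 uses the ungapped parameters: close to the reported 21.9 bits.
print(round(bit_score(41, UNGAPPED_LAMBDA, UNGAPPED_K), 1))
```

The database length of 12,639,632 is one third of the 37,918,896 nucleotide letters, which is consistent with the search being run against translated sequence.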
Lotus: description of TM0101.5