
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144727.8 + phase: 0 /pseudo
(770 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete 405 e-113
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co... 405 e-113
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag... 140 2e-33
BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vi... 133 3e-31
TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, parti... 129 4e-30
CO982036 128 1e-29
CO981879 96 6e-29
TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polypro... 123 3e-28
BI427153 121 1e-27
CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {V... 119 4e-27
BM307983 117 1e-26
TC232995 116 3e-26
AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza ... 115 7e-26
BE211208 115 7e-26
TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, ... 114 1e-25
BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberos... 112 8e-25
CO983154 111 1e-24
BI321712 105 8e-23
BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {... 68 2e-22
TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotr... 104 2e-22
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 405 bits (1041), Expect = e-113
Identities = 251/792 (31%), Positives = 409/792 (50%), Gaps = 28/792 (3%)
Frame = +1
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+FK+ ++ + +K RSD E+ + F E+ +GI + S TPQQNG+ ER
Sbjct: 2395 VFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAITPQQNGIVER 2574
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSP--FFHLFKFQL-D 118
KNR L + R +L +P W EA+ T ++ NR + + +P + ++K +
Sbjct: 2575 KNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNR---VTLRRGTPTTLYEIWKGRKPS 2745
Query: 119 YSDLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTF 178
H FG C++ ++ K+ +S F+GYS + + + ++ SR T
Sbjct: 2746 VKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFN--------SRTRT- 2898
Query: 179 FYNQFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDP 238
++ SI+ ++D++ + + + K + + T+
Sbjct: 2899 -----VMESINVVVDDLSPARKKDV-EEDVRTSGDNVADAAKSGENAENSDSATDESNIN 3060
Query: 239 EPVEPRRSSRTSRA---------PDRFSPDRYGSKHTSLTASLSSISIPTCYSQVVKDVR 289
+P + R S+R + P+R R + S P + + D
Sbjct: 3061 QP-DKRSSTRIQKMHPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTDEF 3237
Query: 290 WIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQ 349
WI AM EE++ + N W++V P +G KW++ K N +G + R ARLV Q
Sbjct: 3238 WINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQ 3417
Query: 350 EYGIDYDETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLF 395
G+D+DETFAPVA++ ++ +QMDVK+ FL+G L E++Y+ P+G
Sbjct: 3418 IEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFA 3597
Query: 396 SSSKG--VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLL 453
+ V +LK++LYGLKQAPRAWYE+ L + + +LF+ + + +++
Sbjct: 3598 DPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQ 3777
Query: 454 LYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATY 513
+YVDD+V G N ++ +Q+ + F M +G L YFLGL+V IFL Q +YA
Sbjct: 3778 IYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSRYAKN 3957
Query: 514 LISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQ 573
++ G+++ + TP ++K +D+ D LYR ++GSL LT +RPDI++ V
Sbjct: 3958 IVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGV 4137
Query: 574 VSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVT 633
+++ +P+ HL V+RI++Y+ G+S G+ + + P L Y DA+WAG D R S +
Sbjct: 4138 CARYQANPKISHLTQVKRILKYVNGTSDYGIMYCHCSNPMLVGYCDADWAGSADDRKSTS 4317
Query: 634 SWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSL 693
C +L ++LISW SKKQ VS S+ E+EY A ++CS+++W++ L E + +L
Sbjct: 4318 GGCFYLGNNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVE-QDVMTL 4494
Query: 694 YADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
Y DN SAI NPV H RTK I++ H IRD D+ +I+L HV T+ QIADI TKA+
Sbjct: 4495 YCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLKHVDTEEQIADIFTKALDA 4674
Query: 754 PRHQFLVGKLMI 765
+ + L GKL I
Sbjct: 4675 NQFEKLRGKLGI 4710
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 405 bits (1040), Expect = e-113
Identities = 253/792 (31%), Positives = 408/792 (50%), Gaps = 28/792 (3%)
Frame = +1
Query: 2 MFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAER 61
+FK+ ++ + +K RSD E+ + +F E+ +GI + S TPQQNG+ ER
Sbjct: 2398 VFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSEGITHEFSAAITPQQNGIVER 2577
Query: 62 KNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSP--FFHLFKFQLD- 118
KNR L + R +L +P W EA+ T ++ NR + + +P + ++K +
Sbjct: 2578 KNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNR---VTLRRGTPTTLYEIWKGRKPT 2748
Query: 119 YSDLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHSFRVSRNVTF 178
H FG C++ ++ K+ +S F+GYS + + + ++ SR T
Sbjct: 2749 VKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFN--------SRTRT- 2901
Query: 179 FYNQFMLHSISPDINDITILPNFSIMPQSIERYKPEFTYVRKCIKQIPTAPPDTEPPPDP 238
++ SI+ ++D+T + + + K + + T+ P
Sbjct: 2902 -----VMESINVVVDDLTPARKKDV-EEDVRTSGDNVADTAKSAENAENSDSATDEPNIN 3063
Query: 239 EPVEPRRSSRTSRA---------PDRFSPDRYGSKHTSLTASLSSISIPTCYSQVVKDVR 289
+P + R S R + P+R R + S P + + D
Sbjct: 3064 QP-DKRPSIRIQKMHPKELIIGDPNRGVTTRSREIEIVSNSCFVSKIEPKNVKEALTDEF 3240
Query: 290 WIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQ 349
WI AM EE++ + N W++V P +G KW++ K N +G + R ARLV Q
Sbjct: 3241 WINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQ 3420
Query: 350 EYGIDYDETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLF 395
G+D+DETFAPVA++ ++ +QMDVK+ FL+G L E+ Y+ P+G
Sbjct: 3421 IEGVDFDETFAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEAYVEQPKGFV 3600
Query: 396 SSSKG--VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTGIVLLL 453
+ V +LK++LYGLKQAPRAWYE+ L + + +LF+ + + +++
Sbjct: 3601 DPTHPDHVYRLKKALYGLKQAPRAWYERLTEFLTQQGYRKGGIDKTLFVKQDAENLMIAQ 3780
Query: 454 LYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYATY 513
+YVDD+V G N ++ +Q+ + F M +G L YFLGL+V IFL Q KYA
Sbjct: 3781 IYVDDIVFGGMSNEMLRHFVQQMQSEFEMSLVGELTYFLGLQVKQMEDSIFLSQSKYAKN 3960
Query: 514 LISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQ 573
++ G+++ + TP ++K +D+ D LYR ++GSL LT +RPDI++ V
Sbjct: 3961 IVKKFGMENASHKRTPAPTHLKLSKDEAGTSVDQSLYRSMIGSLLYLTASRPDITYAVGV 4140
Query: 574 VSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVT 633
+++ +P+ HL V+RI++Y+ G+S G+ + + L Y DA+WAG D R S +
Sbjct: 4141 CARYQANPKISHLNQVKRILKYVNGTSDYGIMYCHCSDSMLVGYCDADWAGSADDRKSTS 4320
Query: 634 SWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRGFLAELGFP*TEPTSL 693
C +L ++LISW SKKQ VS S+ E+EY A ++CS+++W++ L E + +L
Sbjct: 4321 GGCFYLGTNLISWFSKKQNCVSLSTAEAEYIAAGSSCSQLVWMKQMLKEYNVE-QDVMTL 4497
Query: 694 YADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LISLPHVSTQLQIADILTKAVPR 753
Y DN SAI NPV H RTK I++ H IRD D+ +I+L HV T+ QIADI TKA+
Sbjct: 4498 YCDNMSAINISKNPVQHSRTKHIDIRHHYIRDLVDDKVITLEHVDTEEQIADIFTKALDA 4677
Query: 754 PRHQFLVGKLMI 765
+ + L GKL I
Sbjct: 4678 NQFEKLRGKLGI 4713
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment),
partial (30%)
Length = 687
Score = 140 bits (353), Expect = 2e-33
Identities = 70/155 (45%), Positives = 98/155 (63%)
Frame = +2
Query: 613 KLSAYSDANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSE 672
+LS Y DA+WAGCP R S + +C+F+ +L+SWKSKKQ V++SS E+EYR+M E
Sbjct: 14 QLSGYCDADWAGCPMDRRSTSGYCVFIGGNLVSWKSKKQTVVARSSAEAEYRSMAMVTCE 193
Query: 673 IIWLRGFLAELGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDAYDE*LI 732
++W++ FL EL F LY DN +A+ +NPVFHERTK IE+DCH IR+ I
Sbjct: 194 LMWIKQFLQELRFCEELQMKLYCDNQAALHIASNPVFHERTKHIEIDCHFIREKLLSKEI 373
Query: 733 SLPHVSTQLQIADILTKAVPRPRHQFLVGKLMIFD 767
+ + Q DILTK++ P+ Q + KL +D
Sbjct: 374 VTEFIGSNDQPVDILTKSLRGPKIQIVCSKLGAYD 478
>BF596070 similar to GP|27901698|gb gag-pol polyprotein {Vitis vinifera},
partial (34%)
Length = 407
Score = 133 bits (334), Expect = 3e-31
Identities = 68/129 (52%), Positives = 83/129 (63%), Gaps = 16/129 (12%)
Frame = -2
Query: 310 VSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDETFAPVAKMTTV- 368
V PP P+GC+WVY+VK+ G ++R ARLV Q YGIDY +TF+PVAK+TTV
Sbjct: 406 VPLPPGKTPVGCRWVYTVKVGPTGEVDRLKARLVAKGYTQVYGIDYCDTFSPVAKLTTVR 227
Query: 369 -------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG--VCKLKRSLYGLKQ 413
HQ+D+KN FLHGDL EDIYM P G + + VCKL RSLYGLKQ
Sbjct: 226 LFLAMAAICHWPLHQLDIKNAFLHGDLEEDIYMEQPPGFVAQGEYGLVCKLHRSLYGLKQ 47
Query: 414 APRAWYEKF 422
+PRAW+ KF
Sbjct: 46 SPRAWFGKF 20
>TC234303 weakly similar to UP|Q8W153 (Q8W153) Polyprotein, partial (10%)
Length = 558
Score = 129 bits (325), Expect = 4e-30
Identities = 78/179 (43%), Positives = 102/179 (56%), Gaps = 17/179 (9%)
Frame = +1
Query: 333 GSLNRYNARLVTLRNKQEYGIDYDETFAPVAKMTTVHQM--------------DVKNIFL 378
G+++++ ARLV Q YG DY TF+PVAKM VH + D KN FL
Sbjct: 28 GTIDQFKARLVAKSYTQVYGQDYTGTFSPVAKMAYVHLLWSMAVVCHWPLF*LDAKNAFL 207
Query: 379 HGDLAEDIYMTPPQGLFS---SSKGVCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQ 435
HG L E++YM P G + SS VC+L RS YGLKQ+PRAW +C + + + SH
Sbjct: 208 HGYLEEEVYMEQPLGFVAQGESSNMVCQLCRSFYGLKQSPRAWPFLYCGAAIWYD-SHEA 384
Query: 436 YNSSLFIHRTSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGL 494
+S + H + G + L++YVDD+ ITGSD I LK L F KDLG L YFLG+
Sbjct: 385 DHSVFYCH-SPQGCIYLIVYVDDIGITGSDQHGIT*LK*XLCCQFQTKDLGKLRYFLGI 558
>CO982036
Length = 674
Score = 128 bits (321), Expect = 1e-29
Identities = 79/212 (37%), Positives = 115/212 (53%), Gaps = 3/212 (1%)
Frame = -2
Query: 443 HRTSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKG 502
++T V LL+YVD ++ITGS IQ L +L++SF +K LG L YF+ +EV S
Sbjct: 673 YKTHILTVYLLVYVD-IIITGSSCTLIQNLTSKLNSSFPLKLLGKLDYFVEIEVKSMPDL 497
Query: 503 IFLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTI 562
+F + Q+ P+ +P+ K + D D+ P YR +VG+L T+
Sbjct: 496 LFSLRTSIFEIFCRKPR*QA-QPISSPMTTTCKLSKSDSDLFSGPTFYRSVVGALQYTTV 320
Query: 563 TRPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFF--SIGNYP-KLSAYSD 619
RP+ISF V +V QFM +P H V+RI+RYLKGS GL +I + P + + D
Sbjct: 319 IRPEISFAVNKVCQFMSNPLDSHWTEVKRILRYLKGSLSYGL*LKPAISSQPLPIRGFCD 140
Query: 620 ANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQ 651
A+WA D + S + +FL +LISW KQ
Sbjct: 139 ADWASAVDDKRSTSGAAVFLGPNLISWWXXKQ 44
>CO981879
Length = 576
Score = 96.3 bits (238), Expect(2) = 6e-29
Identities = 46/97 (47%), Positives = 62/97 (63%)
Frame = -1
Query: 1 SMFKKFLTYVETQFQSSVKIFRSDSSGEYMSHEFQEYLQHKGILSQRSCPNTPQQNGLAE 60
S+FK F ++TQFQ +K+FRSD+ EY + + GI+ Q SC +TPQQNG+AE
Sbjct: 573 SIFKTFFQMIQTQFQVKIKVFRSDNGREYFNKHLSKXXLENGIIHQSSCVDTPQQNGVAE 394
Query: 61 RKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINR 97
RKNRHL +V R+LLF P W EA+ T +L N+
Sbjct: 393 RKNRHLXEVARALLFQNKAPKYXWGEAILTGTYLKNK 283
Score = 50.4 bits (119), Expect(2) = 6e-29
Identities = 28/89 (31%), Positives = 48/89 (53%), Gaps = 6/89 (6%)
Frame = -2
Query: 97 RLPSIVIEFDSPFFHLFKFQLDYS------DLHTFGCVCFVHLPPFEKHKLGVQSVQCAF 150
R+PS ++ F +P +F + L FGC FVH+ + KL ++ +C F
Sbjct: 284 RMPSKILNFRTPL-DVFTSAFPNNRLSCTLPLKIFGCTVFVHIHEPNQGKLEPRAKKCVF 108
Query: 151 MGYSNSHKGFVCYDVSNHSFRVSRNVTFF 179
+GY+ + KG+ C+D ++ V+ +VTFF
Sbjct: 107 VGYAPNQKGYKCFDPTSKKTFVTIDVTFF 21
>TC223792 weakly similar to UP|Q9FH39 (Q9FH39) Copia-type polyprotein,
partial (7%)
Length = 804
Score = 123 bits (309), Expect = 3e-28
Identities = 68/218 (31%), Positives = 116/218 (53%), Gaps = 3/218 (1%)
Frame = +1
Query: 550 YRQLVGSLN*LTITRPDISFVVQQVSQFMHSPRHLHLAAVRRIIRYLKGSSHCGLFFSI- 608
+R+L+GSL L +RP+I F V +S+FM PR H+ A +R++R +KG+ G+ F
Sbjct: 22 FRRLIGSLRYLCNSRPNICFAVSLISRFMKRPRLSHMQAAKRVLRLIKGTIGSGVLFPFK 201
Query: 609 --GNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAM 666
P L Y+D++W P+ S + + ++ SKKQ ++ S+ E+EY A
Sbjct: 202 AKSGKPDLLGYTDSDWKRDPEQEKSTGGYLFMYNDAPVA*SSKKQDVIALSTCEAEYVAA 381
Query: 667 YAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRDA 726
+ +W+ L EL +P +L DN SAI +P H R+K IE+ H IRD
Sbjct: 382 SLGACQAVWMMNLLEELKLRERKPVNLLIDNKSAINLAKHPTLHGRSKHIELRFHYIRDQ 561
Query: 727 YDE*LISLPHVSTQLQIADILTKAVPRPRHQFLVGKLM 764
+ +++ + + Q+AD++TK + R + + +L+
Sbjct: 562 VSKGNVTVEYCKAEEQLADLMTKPIQVSRFKQICSELV 675
>BI427153
Length = 422
Score = 121 bits (303), Expect = 1e-27
Identities = 62/135 (45%), Positives = 81/135 (59%), Gaps = 1/135 (0%)
Frame = +1
Query: 46 QRSCPNTPQQNGLAERKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEF 105
Q +CP+TPQQNG+AERKN HLL+ RSL+ + +P W +A+ T FLINR+PS +E
Sbjct: 4 QSTCPHTPQQNGIAERKNHHLLETARSLMLNSNVPTHHWGDAVLTACFLINRMPSSSLEN 183
Query: 106 DSPFFHLFKFQ-LDYSDLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYD 164
P +F L Y FGC CFVH KL +SV+C F+GYS KG+ CY
Sbjct: 184 QIPHSIVFPNDLLFYVSPKVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKGYTCYF 363
Query: 165 VSNHSFRVSRNVTFF 179
+ + +S NVTFF
Sbjct: 364 PNMRRYYMSANVTFF 408
>CA784773 weakly similar to GP|27901698|gb gag-pol polyprotein {Vitis
vinifera}, partial (34%)
Length = 409
Score = 119 bits (299), Expect = 4e-27
Identities = 59/134 (44%), Positives = 84/134 (62%), Gaps = 14/134 (10%)
Frame = +3
Query: 272 LSSISIPTCYSQVVKDVRWIKAMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNS 331
LSS+++P+ + + W +AM +E+QAL+ N TW++V PP +GC+WVY+VK+
Sbjct: 3 LSSLTVPSTIREALDHPGWRQAMVDEMQALENNGTWELVPLPPGKTTVGCRWVYTVKVGP 182
Query: 332 DGSLNRYNARLVTLRNKQEYGIDYDETFAPVAKMTTV--------------HQMDVKNIF 377
+G ++R ARLV Q YGI+Y +TF+PV +TTV HQ+D+KN F
Sbjct: 183 NGKVDRLKARLVAKGYTQVYGIEYCDTFSPVFFLTTVRLFLAMAAIRHWPLHQLDIKNAF 362
Query: 378 LHGDLAEDIYMTPP 391
LHGDL EDIYM P
Sbjct: 363 LHGDLEEDIYMEQP 404
>BM307983
Length = 406
Score = 117 bits (294), Expect = 1e-26
Identities = 61/134 (45%), Positives = 83/134 (61%), Gaps = 17/134 (12%)
Frame = +2
Query: 319 MGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYGIDYDETFAPVAK-------------- 364
+GC+W+Y+VK +D +L+RY ARLV Q YGIDY+ETFA K
Sbjct: 2 VGCRWIYTVKY*ADDTLDRYKARLVAKGYIQTYGIDYEETFAQWQK*IQSGSSSP*QQAQ 181
Query: 365 -MTTVHQMDVKNIFLHGDLAEDIYMTPPQGLFSSSKG--VCKLKRSLYGLKQAPRAWYEK 421
+HQ DVKN FLHG L E++YM P G +S+ G VC+LK++LYGLKQ+PRAW+ +
Sbjct: 182 FGWEMHQFDVKNAFLHGSLEEEVYMEIPPGYGASNGGNKVCRLKKALYGLKQSPRAWFGR 361
Query: 422 FCSSLLGFCFSHSQ 435
F ++L + SQ
Sbjct: 362 FTQAMLSLGYKQSQ 403
>TC232995
Length = 1009
Score = 116 bits (291), Expect = 3e-26
Identities = 62/169 (36%), Positives = 95/169 (55%), Gaps = 2/169 (1%)
Frame = +2
Query: 391 PQGLFSSSKG--VCKLKRSLYGLKQAPRAWYEKFCSSLLGFCFSHSQYNSSLFIHRTSTG 448
P G S K V KL+++LYGLKQAPRAWYE+ + LL FS + +++LFI R
Sbjct: 11 PPGFEISDKPNHVYKLQKALYGLKQAPRAWYERLSNFLLEKEFSRGKVDTTLFIKRKHND 190
Query: 449 IVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQH 508
I+L+ +YVDD++ ++++ + + + F M +G L YFLGL++ T GIF++Q
Sbjct: 191 ILLVQIYVDDIIFGSTNDSLCKEFSLDMQSEFEMSMMGELKYFLGLQIKQTQ*GIFINQS 370
Query: 509 KYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSL 557
KY LI G+ S + TP+ N +D+ D YR +G +
Sbjct: 371 KYCKELIKRFGMDSAKHMSTPMSTNCYLDKDESGQSIDIKQYRDAIGEV 517
>AI855818 weakly similar to GP|21741393|e OSJNBb0051N19.6 {Oryza sativa
(japonica cultivar-group)}, partial (10%)
Length = 463
Score = 115 bits (288), Expect = 7e-26
Identities = 63/151 (41%), Positives = 88/151 (57%), Gaps = 16/151 (10%)
Frame = -3
Query: 293 AMNEEIQALQENLTWDIVSCPPDIKPMGCKWVYSVKLNSDGSLNRYNARLVTLRNKQEYG 352
AM EE+ + N W +V P + +G KWV+ KL+ G + R ARLV QE G
Sbjct: 458 AMQEELNQFERNNVWKLVEKPENYPVIGTKWVFRNKLDEHGIIIRNKARLVAKGYNQEEG 279
Query: 353 IDYDETFAPVAKMTTV--------------HQMDVKNIFLHGDLAEDIYMTPPQGLFSSS 398
IDY+ET+APVA++ + +QMDVK+ FL+G + E++Y+ P G
Sbjct: 278 IDYEETYAPVARLEVIRMLLAYVSIMNFKLYQMDVKSAFLNGLIQEEVYVEQPPGFEIPD 99
Query: 399 K--GVCKLKRSLYGLKQAPRAWYEKFCSSLL 427
K V KL+++LYGLKQAPRAWYE+ + LL
Sbjct: 98 KPTHVYKLQKALYGLKQAPRAWYERISNFLL 6
>BE211208
Length = 413
Score = 115 bits (288), Expect = 7e-26
Identities = 57/130 (43%), Positives = 82/130 (62%), Gaps = 1/130 (0%)
Frame = +2
Query: 449 IVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKG-IFLHQ 507
+V LL+YVDD++ITG N IQ L L+++F +K LG L YFLG+EVH T G + L Q
Sbjct: 23 LVYLLVYVDDIIITGRSNYLIQSLVHHLNSNFSLKQLGQLDYFLGIEVHHTPTGSVLLTQ 202
Query: 508 HKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDI 567
KY L+ + P+ +P+ N++ ++ D+L DP +YR +VG+L TITRP+I
Sbjct: 203 SKYICDLLHKTDMAEAKPISSPMVTNLRLSKNGDDLLSDPTMYRSVVGALQYPTITRPEI 382
Query: 568 SFVVQQVSQF 577
SF +V QF
Sbjct: 383 SFAANKVCQF 412
>TC231544 weakly similar to UP|Q9FLR2 (Q9FLR2) Polyprotein-like, partial
(16%)
Length = 662
Score = 114 bits (286), Expect = 1e-25
Identities = 74/186 (39%), Positives = 108/186 (57%), Gaps = 4/186 (2%)
Frame = +3
Query: 586 LAAVRRIIRYLKGSSHCGLFFSIGNYPKLSAYSDANWAGCPDTRHSVTSWCMFLRSSLIS 645
L A R+++YLKG GL FS + ++ +SDA+WA C D+ S+T +C FL SSLIS
Sbjct: 12 LCAATRVLKYLKGCPRKGLSFSRESPIQILGFSDADWATCIDSSKSITWYCFFLGSSLIS 191
Query: 646 WKSKKQARVSKSSTESE--YRAMYAACSEIIWLRGFLAELGFP*TEPTSLYADNTSAI-* 702
WK+KKQ VS+SS+ SE YRA+ + E+ WL L +L T +Y DN SA+ *
Sbjct: 192 WKAKKQNTVSRSSSSSEAKYRALTSTTCELQWLTYLLKDL-----HVTLIYCDNQSALQ* 356
Query: 703 FVANPVFHERTKLIEVDCHSIRDAYDE*LI-SLPHVSTQLQIADILTKAVPRPRHQFLVG 761
++H + +E+DCH +R+ + L+ L VS+ Q+ADI TKA+ +
Sbjct: 357 LPIKVIYHGQ---LEIDCHIVREKTQQGLMHCLLPVSSSNQLADIFTKALSPKLFSSNLS 527
Query: 762 KLMIFD 767
KL + D
Sbjct: 528 KLGLSD 545
>BI787454 weakly similar to GP|21434|emb|CA ORF4 {Solanum tuberosum}, partial
(21%)
Length = 421
Score = 112 bits (279), Expect = 8e-25
Identities = 62/137 (45%), Positives = 85/137 (61%), Gaps = 1/137 (0%)
Frame = +2
Query: 434 SQYNSSLFIHRTSTG-IVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFL 492
S+ + S+F TS G V L++YVDD++IT D I +LKE L F KDL L YFL
Sbjct: 8 SEADHSVFYCHTSPGKCVYLMVYVDDIMITKKDATKIVQLKEHLFNHFQTKDLRYLKYFL 187
Query: 493 GLEVHSTSKGIFLHQHKYATYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQ 552
G+EV + G+ + Q KYA ++ G+Q+ VD+P++ N+K ++ PDP YR+
Sbjct: 188 GIEVAQSGDGVVISQRKYALDILEETGMQNCRLVDSPMDPNLKLMAYQSEVYPDPERYRR 367
Query: 553 LVGSLN*LTITRPDISF 569
LVG L LTITRPDISF
Sbjct: 368 LVGKLIYLTITRPDISF 418
>CO983154
Length = 568
Score = 111 bits (278), Expect = 1e-24
Identities = 67/171 (39%), Positives = 94/171 (54%), Gaps = 14/171 (8%)
Frame = +3
Query: 51 NTPQQNGLAERKNRHLLDVTRSLLF*AFIPPRFWVEALATTVFLINRLPSIVIEFDSPFF 110
+TPQQNG+AERKNRHLL+ RSL+ +P W +A+ T+ FLINR+PS +E P
Sbjct: 3 HTPQQNGIAERKNRHLLETARSLMLNLNVPIHHWGDAVLTSCFLINRMPSSSLENQIPHS 182
Query: 111 HLFKFQ-LDYSDLHTFGCVCFVHLPPFEKHKLGVQSVQCAFMGYSNSHKGFVCYDVSNHS 169
+F L + FGC CFVH KL +SV+C F+GYS KG+ CY +
Sbjct: 183 LVFPHDPLFHVSPKVFGCTCFVHDLSPGLDKLSARSVKCVFLGYSRLQKGYKCYSPTMRR 362
Query: 170 FRVSRNVTFFYN-QFMLHSI--SPDINDITILP----------NFSIMPQS 207
+ +S +VTFF + F S+ S + ++ +P N SI+P S
Sbjct: 363 YYMSADVTFFEDTPFFSPSVDHSSSLQEVLPIPSPYPLXNSGQNVSIVPSS 515
>BI321712
Length = 399
Score = 105 bits (262), Expect = 8e-23
Identities = 52/127 (40%), Positives = 76/127 (58%)
Frame = -3
Query: 452 LLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIFLHQHKYA 511
L LYVDD++ TG++ + + K+ + F M D+G + Y+LG+EV KGIF+ Q YA
Sbjct: 385 LCLYVDDLIFTGNNPSMFEEFKKDMSNEFEMTDMGLMAYYLGIEVKQEDKGIFITQEGYA 206
Query: 512 TYLISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVV 571
++ + NPV TP+E K + + DP LY+ L+GSL LT TRPDI +VV
Sbjct: 205 KEVLKKFKMDDANPVGTPMECGSKLSKHEKGENVDPTLYKSLIGSLRYLTCTRPDILYVV 26
Query: 572 QQVSQFM 578
VS++M
Sbjct: 25 GVVSRYM 5
>BM527454 weakly similar to GP|27901709|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 437
Score = 67.8 bits (164), Expect(2) = 2e-22
Identities = 33/67 (49%), Positives = 44/67 (65%)
Frame = +2
Query: 445 TSTGIVLLLLYVDDMVITGSDNAFIQRLKEQLHASFHMKDLGNLHYFLGLEVHSTSKGIF 504
+S+ V L++YVDD+VITG+D I +LK L + F KDLG YFLG+EV + GI
Sbjct: 20 SSSRCVYLMVYVDDIVITGNDQGKIAQLKGHLFSHFQTKDLGKFEYFLGIEVAQSKDGII 199
Query: 505 LHQHKYA 511
+ Q KYA
Sbjct: 200 ISQRKYA 220
Score = 57.4 bits (137), Expect(2) = 2e-22
Identities = 33/71 (46%), Positives = 41/71 (57%)
Frame = +1
Query: 515 ISMAGLQSTNPVDTPLEVNVKYHRDDGDILPDPLLYRQLVGSLN*LTITRPDISFVVQQV 574
I G+ P+D+ ++ N K + G D YR LVG L LTITRP+ISFVV V
Sbjct: 220 IRHTGMSDCRPIDSLMDPNKKLLPNQGKPYSDSERYRILVGKLIYLTITRPNISFVVGVV 399
Query: 575 SQFMHSPRHLH 585
SQFM SP + H
Sbjct: 400 SQFMQSPHNDH 432
>TC225402 weakly similar to UP|Q6I923 (Q6I923) Copia-like retrotransposon
Hopscotch polyprotein, partial (7%)
Length = 1446
Score = 104 bits (259), Expect = 2e-22
Identities = 50/107 (46%), Positives = 69/107 (63%)
Frame = +2
Query: 619 DANWAGCPDTRHSVTSWCMFLRSSLISWKSKKQARVSKSSTESEYRAMYAACSEIIWLRG 678
DANWA P R S +C+ + +L+ WKS K V++SS E+EY+AM A E+IW++
Sbjct: 8 DANWAVSPIDRGSTLGYCVSIGENLVLWKSNK*NVVARSSAEAEYKAMTVATCELIWIKQ 187
Query: 679 FLAELGFP*TEPTSLYADNTSAI*FVANPVFHERTKLIEVDCHSIRD 725
L EL F T+ L DN +A+ +NPVFHERTK IE+DCH +R+
Sbjct: 188 LLQELKFGSTQQMKLCCDNQAALHIASNPVFHERTKHIEIDCHFVRE 328
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.326 0.139 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 39,294,890
Number of Sequences: 63676
Number of extensions: 684061
Number of successful extensions: 9051
Number of sequences better than 10.0: 234
Number of HSP's better than 10.0 without gapping: 6507
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 8084
length of query: 770
length of database: 12,639,632
effective HSP length: 105
effective length of query: 665
effective length of database: 5,953,652
effective search space: 3959178580
effective search space used: 3959178580
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 63 (28.9 bits)
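
The search-space figures above can be reproduced from the other numbers in this report. The sketch below is a minimal illustration, not the program's own code. It assumes the database length is counted in translated residues (nucleotide letters divided by 3, which matches the "length of database" line) and that the standard Karlin-Altschul relations are applied with the gapped Lambda and K. The function names are hypothetical. Because the printed Lambda and K are rounded, recomputed bit scores and E values agree with the report only approximately.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 770            # length of query
DB_LETTERS = 37_918_896    # nucleotide letters in database
DB_SEQS = 63_676           # number of sequences in database
HSP_LEN = 105              # effective HSP length
LAMBDA_GAPPED = 0.267      # gapped Lambda (rounded as printed)
K_GAPPED = 0.0410          # gapped K (rounded as printed)


def effective_search_space():
    """Return (effective query length, effective db length, search space)."""
    # Assumption: TBLASTN measures the database in translated residues.
    db_len_residues = DB_LETTERS // 3
    eff_query = QUERY_LEN - HSP_LEN
    eff_db = db_len_residues - DB_SEQS * HSP_LEN
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score, search_space):
    """Karlin-Altschul expectation: E = K * (m * n) * exp(-Lambda * S)."""
    return K_GAPPED * search_space * math.exp(-LAMBDA_GAPPED * raw_score)


if __name__ == "__main__":
    eff_query, eff_db, space = effective_search_space()
    print(eff_query, eff_db, space)
    # Raw score 353 is the TC231899 hit (reported as 140 bits, E = 2e-33).
    print(round(bit_score(353), 1), f"{expect_value(353, space):.1e}")
```

The first three values match the "effective length of query", "effective length of database", and "effective search space" lines exactly. The bit score and E value for the TC231899 hit come out close to the reported 140 bits and 2e-33.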