
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC135101.6 + phase: 0 /pseudo
(1281 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value
AW704309 weakly similar to GP|6642775|gb| gag-pol polyprotein {V...   92   8e-36
TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (F...   78   2e-14
BG840035 weakly similar to GP|9294121|dbj copia-like retrotransp...   73   9e-13
BE805076 weakly similar to GP|9294121|dbj copia-like retrotransp...   69   1e-11
NP004897 gag-protease polyprotein                                     61   3e-09
TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, co...   60   6e-09
TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete             58   2e-08
AW185460                                                              57   7e-08
BU083646 weakly similar to GP|6642775|gb| gag-pol polyprotein {V...   53   8e-07
BE804023                                                              49   1e-05
CO983112                                                              47   5e-05
CO981347                                                              43   8e-04
BI701169                                                              43   0.001
TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotei...   34   0.36
TC205511 similar to UP|Q9MAH1 (Q9MAH1) F12M16.20 (At1g53300), pa...   32   1.8
TC209363 similar to UP|Q6NLD7 (Q6NLD7) At1g64640, partial (46%)       31   3.1
BM188122 weakly similar to PIR|I46200|I4620 retrovirus-related r...   31   3.1
TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Frag...   30   8.9
>AW704309 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (23%)
Length = 423
Score = 91.7 bits (226), Expect(2) = 8e-36
Identities = 53/90 (58%), Positives = 63/90 (69%)
Frame = -2
Query: 114 SLSCRK*KSLRQSKNTQTNCCLLQTRLDYLALDLLIPEL*NFFWLQCLKDMRHL*PV*RT 173
+LS + *KS RQSKNTQTNC +L TR + LLI EL* W +C + M+HL* RT
Sbjct: 272 NLSFKG*KSQRQSKNTQTNCWVLSTR*SCWEVILLIREL*KKIW*RCRRGMKHL*LHWRT 93
Query: 174 QRILVRSLWQKSYMPCKPKNNEGL*GKIVL 203
QRI +S WQK YMPCK K++EG *GKIVL
Sbjct: 92 QRIYRKSHWQKCYMPCKLKSSEG**GKIVL 3
Score = 79.0 bits (193), Expect(2) = 8e-36
Identities = 36/48 (75%), Positives = 43/48 (89%)
Frame = -1
Query: 65 AKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIRSIQMLNL 112
A+SCLF GV + +F RIMTLK+PKAIWD LKEEYAGD+RIRS+Q+LNL
Sbjct: 423 ARSCLFTGVSQMIFIRIMTLKSPKAIWDCLKEEYAGDDRIRSMQVLNL 280
>TC232593 weakly similar to UP|Q9XG91 (Q9XG91) Tpv2-1c protein (Fragment),
partial (16%)
Length = 562
Score = 78.2 bits (191), Expect = 2e-14
Identities = 60/117 (51%), Positives = 73/117 (62%)
Frame = +2
Query: 967 FLCMLMIF**QEAT*H*LKNSSKK*KMSLR*LILVL*LIFWAWKSLKRRMRFLFVRKNRQ 1026
FL MLM F** E *L++SSKK* L+*LILV *LIF +S K R + V+ N Q
Sbjct: 200 FLSMLMTF**LEMMQG*LRSSSKK*CKLLK*LILVS*LIFLELRSSKVRTKC*SVKGNMQ 379
Query: 1027 RRFLKNFSLMNAKQ*TLLCIKRRSSSRMMGLIKLMKLILEV*LGV*CILLLKGLTSY 1083
++F ++F NA IKRRSS+R LIKLMK I+ *L V*CI L +G T Y
Sbjct: 380 KKF*RSFKWRNANLLAHQ*IKRRSSTR*TVLIKLMKDIIGA*LDV*CISLQQGQTFY 550
>BG840035 weakly similar to GP|9294121|dbj copia-like retrotransposable element
{Arabidopsis thaliana}, partial (3%)
Length = 804
Score = 72.8 bits (177), Expect = 9e-13
Identities = 54/121 (44%), Positives = 68/121 (55%)
Frame = +3
Query: 1161 GVQRNKRL*LSQQQKLSL*LQQQL*IKLFG*GIS*QIWA*SKSKVHRFLLIIKQLFPYHI 1220
GVQ++K + Q L QL*IKLFG*G+ I+ +K +HRFLLIIKQLFP+ I
Sbjct: 204 GVQKSKTQRQNMWQPL------QL*IKLFG*GVYLLIYIWNKRNLHRFLLIIKQLFPFQI 365
Query: 1221 IQYFMGRPSTLMSSYFT*EKCKKMVM*I*FIVKQKIKLLICLPSLFHSAGLNF*RRS*EF 1280
I +FM S SS F * K ++ V *F + KIK+ I F L F* +* F
Sbjct: 366 IMFFMAGLSISRSSCFF*GKHREKVKSN*FTAEVKIKVQIS*QKFFQKLDLKF*EIN*VF 545
Query: 1281 A 1281
A
Sbjct: 546 A 548
>BE805076 weakly similar to GP|9294121|dbj copia-like retrotransposable
element {Arabidopsis thaliana}, partial (2%)
Length = 388
Score = 69.3 bits (168), Expect = 1e-11
Identities = 32/90 (35%), Positives = 50/90 (55%)
Frame = +1
Query: 2 SKVAPPLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTK 61
S ++ P+F+G NY+ W VK+E YL + D+W+ +EE + +P + +Q K K+ K K
Sbjct: 118 SAISVPIFNGENYDFWRVKMETYLSSQDLWDIVEEGFTIPADTSALNASQEKELKKNKQK 297
Query: 62 KAKAKSCLFAGVLETVFTRIMTLKTPKAIW 91
+K L + +F RIM KT K W
Sbjct: 298 NSKTLFTLQQAETDPIFPRIMGAKTAKEAW 387
>NP004897 gag-protease polyprotein
Length = 1923
Score = 61.2 bits (147), Expect = 3e-09
Identities = 34/111 (30%), Positives = 55/111 (48%), Gaps = 11/111 (9%)
Frame = +1
Query: 6 PPLFDGNNYELWAVKIEAYLEALD--VWEAIEEDYEVPPLPN---------NPTMAQLKY 54
PP+ DG NYE W ++ A+L++LD W+A+ +D+E P + + P K
Sbjct: 37 PPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKDWEHPKMLDTEGKPTDGLKPEEDWTKE 216
Query: 55 HKERKTKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIR 105
E +KA + LF GV + +F I T K W+ LK + G +++
Sbjct: 217 EDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVK 369
>TC204438 homologue to UP|Q84VH6 (Q84VH6) Gag-pol polyprotein, complete
Length = 4734
Score = 60.1 bits (144), Expect = 6e-09
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 11/111 (9%)
Frame = +1
Query: 6 PPLFDGNNYELWAVKIEAYLEALD--VWEAIEEDYEVPPL------PNN---PTMAQLKY 54
PP+ DG NYE W ++ A+L++LD W+A+ + +E P + P N P K
Sbjct: 37 PPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTNELKPEEDWTKE 216
Query: 55 HKERKTKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIR 105
E +KA + LF GV + +F I T K W+ LK + G +++
Sbjct: 217 EDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTSKVK 369
Score = 31.2 bits (69), Expect = 3.1
Identities = 33/110 (30%), Positives = 45/110 (40%)
Frame = +2
Query: 522 LLLLTT*PECAGFIFLNTNQK*LQCFGNSKFMLKIKAIAVFKFRGQIMEKSMYQISFNNF 581
+LL P+ G NQ L+ + K K + G M +S+ S N
Sbjct: 2327 MLLWMISPDLPGSTLSERNQTPLKYSRS*V*DFKEKKTVSSRESGVTMAESLKTASLLNS 2506
Query: 582 VMK*GSNIS*LLLTLLNKME*VKGKIDQSWRWLGVCFIKRNCQRNYGLKP 631
S +S L NKM *+KGK + LG CF+ +N GLKP
Sbjct: 2507 AHLKASLMSSLQPLHHNKMA*LKGKTGLCKKLLGSCFMPKNFPIISGLKP 2656
>TC204439 UP|Q84VI4 (Q84VI4) Gag-pol polyprotein, complete
Length = 4731
Score = 58.2 bits (139), Expect = 2e-08
Identities = 33/111 (29%), Positives = 55/111 (48%), Gaps = 11/111 (9%)
Frame = +1
Query: 6 PPLFDGNNYELWAVKIEAYLEALD--VWEAIEEDYEVPPLPN---------NPTMAQLKY 54
PP+ DG+NYE W ++ A+L++LD W+A+ + +E P + + P K
Sbjct: 37 PPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPTDELKPEEDWTKE 216
Query: 55 HKERKTKKAKAKSCLFAGVLETVFTRIMTLKTPKAIWDYLKEEYAGDERIR 105
E +KA + LF GV + +F I T K W+ LK + G +++
Sbjct: 217 EDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTSKVK 369
>AW185460
Length = 411
Score = 56.6 bits (135), Expect = 7e-08
Identities = 46/103 (44%), Positives = 52/103 (49%)
Frame = +3
Query: 1075 LLLKGLTSYSQLACCQDSCIVLVNCT*RQQKEL*GTSKELSIME*STARSKNSNYLATLT 1134
L L GLT QDSC V T Q+KE *G KE NYLAT T
Sbjct: 84 LKLHGLT*CMLQVFYQDSCKAQVKYTLEQEKEF*GIYKEQKRSVYGILPKPTQNYLATPT 263
Query: 1135 VIGQVAWMI*RVLQGTVST*VQVFSLGVQRNKRL*LSQQQKLS 1177
VIGQV M *RV +S * + SLG +R+K * +QQQK S
Sbjct: 264 VIGQVQQMT*RVPLAMLSH*DRECSLGRRRSKLQ*HNQQQKQS 392
>BU083646 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (19%)
Length = 428
Score = 53.1 bits (126), Expect = 8e-07
Identities = 51/140 (36%), Positives = 73/140 (51%), Gaps = 1/140 (0%)
Frame = +2
Query: 289 SKRRKTISLLQHV-FPAAVHVNLG*LIVDALIT*PTIKLYSRIWRTLKSNGSELEMVYIF 347
+ +++ LL HV P A+H +G*LIV T*P I + L S EM +I
Sbjct: 8 TNHKRSSCLLYHVLLPVALH-KVG*LIVVVQTT*PMIVNSLQNLMRLFFLKSRSEMKHIL 184
Query: 348 QLKAREQ*L*QATQVRKISLMFYMCLKLIRTCLVLDNYLKKGSKLFLKIRIV*LRTQQAK 407
KA++ + TQV M YM KL++T V + KK K LK +I *L+TQ+ +
Sbjct: 185 M*KAKKLWQFKDTQV*N*FPMCYMSRKLVKTF*VFLSCSKKVIKCCLKTKIA*LKTQKVE 364
Query: 408 KCSK*K*RARIFHLIQ*RRS 427
+CS K*+A + I * +S
Sbjct: 365 RCSIFK*KA*VLPWIS*TKS 424
>BE804023
Length = 407
Score = 48.9 bits (115), Expect = 1e-05
Identities = 37/116 (31%), Positives = 53/116 (44%), Gaps = 7/116 (6%)
Frame = -1
Query: 186 YMPCKPKNNEGL*GKIVLLKVLYQPRANNHTL*RVIQQVV-------NTTKERVRRKVFH 238
YM CK + E *GK LK YQP R I+ ++ ++ + R
Sbjct: 401 YMSCKRRIRED**GKNQELKEFYQPGIRMQPNTRKIRTLMEWDQVQHQPMQKEIGRNFIR 222
Query: 239 LVNTAEKWVTHHLNVGEGQMQSAASAMK*VMKM*FVKQKSNKLQLKQKMYSKRRKT 294
L NT E+ H +V + M SA S +K VMK+ F K N + Q++ +RR T
Sbjct: 221 LANTVERKDILHSDVRKDLMLSAVSVIKWVMKLLFAKTGINHIMKLQRLLIRRRST 54
>CO983112
Length = 653
Score = 47.0 bits (110), Expect = 5e-05
Identities = 25/63 (39%), Positives = 38/63 (59%)
Frame = +3
Query: 3 KVAPPLFDGNNYELWAVKIEAYLEALDVWEAIEEDYEVPPLPNNPTMAQLKYHKERKTKK 62
+V +FDG +Y+LW ++I++YLE LD EA+EED+ V L + +L+ H T K
Sbjct: 42 QVFSSIFDG*SYDLWTLRIKSYLEILDQ*EAMEEDFNVSLLF*YGSTKELRMHIHSCTNK 221
Query: 63 AKA 65
A
Sbjct: 222 KAA 230
>CO981347
Length = 624
Score = 43.1 bits (100), Expect = 8e-04
Identities = 36/110 (32%), Positives = 55/110 (49%)
Frame = +3
Query: 549 NSKFMLKIKAIAVFKFRGQIMEKSMYQISFNNFVMK*GSNIS*LLLTLLNKME*VKGKID 608
N +L+I + KF G M S++ S +F K* S + LT ++M * KG I
Sbjct: 123 NDILLLEINLVQN*KF*GLTMAWSLF*SSSMSFAGK*ASKGTK*SLTHHSRMV*QKG*I* 302
Query: 609 QSWRWLGVCFIKRNCQRNYGLKPQVPLFSYKIDSLHELCKIKLHLRLGLV 658
W+ G C+ ++CQR +G K Q ID+LH+ + +LG+V
Sbjct: 303 PFWKE*GACY*VQDCQRPFGEKLQTQHHI*LIDALHQP*VSRHQWKLGVV 452
>BI701169
Length = 407
Score = 42.7 bits (99), Expect = 0.001
Identities = 31/85 (36%), Positives = 46/85 (53%)
Frame = +3
Query: 533 GFIFLNTNQK*LQCFGNSKFMLKIKAIAVFKFRGQIMEKSMYQISFNNFVMK*GSNIS*L 592
G IFL+ N + L+ +SK +L+ K + K I+E++ Q+SF + VM + *
Sbjct: 150 GCIFLSIN*RCLRISKSSKPLLRKKVVLKLKP*DPIVEENSNQMSFKSIVMIMEFDDP*W 329
Query: 593 LLTLLNKME*VKGKIDQSWRWLGVC 617
NKME KG+I+QS W G C
Sbjct: 330 C*DPPNKMEWQKGRIEQS*TWQGAC 404
>TC235104 weakly similar to UP|Q850H8 (Q850H8) Gag-pol polyprotein
(Fragment), partial (28%)
Length = 865
Score = 34.3 bits (77), Expect = 0.36
Identities = 22/62 (35%), Positives = 33/62 (52%)
Frame = +3
Query: 554 LKIKAIAVFKFRGQIMEKSMYQISFNNFVMK*GSNIS*LLLTLLNKME*VKGKIDQSWRW 613
LK+ + + G IM K+ + + F + +IS L+L LLN+M KG+ID R
Sbjct: 627 LKLSLVRQLRSLGVIMRKNTFLLLFLLXXQRRAFSISSLVLILLNRMTLPKGRIDILLRL 806
Query: 614 LG 615
LG
Sbjct: 807 LG 812
>TC205511 similar to UP|Q9MAH1 (Q9MAH1) F12M16.20 (At1g53300), partial (50%)
Length = 1458
Score = 32.0 bits (71), Expect = 1.8
Identities = 20/70 (28%), Positives = 36/70 (50%), Gaps = 5/70 (7%)
Frame = +2
Query: 13 NYELWAVKIEAYLEALDVWEAIEEDYEV--PPLPNNPTMAQLKYHKE---RKTKKAKAKS 67
NY ++ A L+ WE +DYE+ LPN+ +A+ +H + +K++ + K+
Sbjct: 563 NYTKALLRRAASNSKLERWEEAVKDYEILRKELPNDNEVAESLFHAQVALKKSRGEEVKN 742
Query: 68 CLFAGVLETV 77
F G +E V
Sbjct: 743 LKFGGEVEEV 772
>TC209363 similar to UP|Q6NLD7 (Q6NLD7) At1g64640, partial (46%)
Length = 1016
Score = 31.2 bits (69), Expect = 3.1
Identities = 14/36 (38%), Positives = 20/36 (54%)
Frame = -1
Query: 608 DQSWRWLGVCFIKRNCQRNYGLKPQVPLFSYKIDSL 643
D W+W G +K+N + L ++PLF Y I SL
Sbjct: 437 DFFWQWPGSPLVKKNWPFDVMLNKELPLFMYNIGSL 330
>BM188122 weakly similar to PIR|I46200|I4620 retrovirus-related reverse
transcriptase homolog - pine (Pinus coulteri)
retrotransposon copia-like, partial (53%)
Length = 420
Score = 31.2 bits (69), Expect = 3.1
Identities = 24/60 (40%), Positives = 31/60 (51%)
Frame = +2
Query: 951 LKLHFMSSMLVLIF*SFLCMLMIF**QEAT*H*LKNSSKK*KMSLR*LILVL*LIFWAWK 1010
+KL +MS V+I+ FL +LM *Q +NS K LR* L L FWAW+
Sbjct: 149 VKLSYMSKKNVVIYL*FLFILMTCL*Q*LMRSWWENSKLKCSNYLR*QTLASCLTFWAWR 328
>TC231899 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment), partial
(30%)
Length = 687
Score = 29.6 bits (65), Expect = 8.9
Identities = 19/51 (37%), Positives = 28/51 (54%)
Frame = +3
Query: 1128 NYLATLTVIGQVAWMI*RVLQGTVST*VQVFSLGVQRNKRL*LSQQQKLSL 1178
NY + +IG I + Q VS+ ++ LG RN+RL*L QKL++
Sbjct: 15 NYQDIVMLIGLAVPWIGGLHQAIVSSLEEILFLGKARNRRL*LGLVQKLNI 167
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.361 0.159 0.550
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,733,843
Number of Sequences: 63676
Number of extensions: 818278
Number of successful extensions: 8817
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 4770
Number of HSP's successfully gapped in prelim test: 278
Number of HSP's that attempted gapping in prelim test: 3894
Number of HSP's gapped (non-prelim): 5367
length of query: 1281
length of database: 12,639,632
effective HSP length: 108
effective length of query: 1173
effective length of database: 5,762,624
effective search space: 6759557952
effective search space used: 6759557952
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.9 bits)
S2: 65 (29.6 bits)
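
The figures in the statistics block above can be cross-checked against each other. The sketch below is an illustration only, not part of the BLAST output: it applies the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, and E = K * m * n * exp(-lambda * S)) to the gapped Lambda and K values printed in this report. The function names are hypothetical, and the way the effective database length is derived (nucleotide letters divided by 3 for the translated search, minus one effective HSP length per sequence) is an assumption inferred from the numbers reported here. Hits marked Expect(2), such as AW704309, combine two HSPs and are not expected to match the single-HSP E-value computed this way.

```python
import math

# Values copied from the statistics block of this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_LEN = 1281          # "length of query"
DB_LETTERS = 37_918_896   # "Number of letters in database"
DB_SEQS = 63_676          # "Number of sequences in database"
HSP_LEN = 108             # "effective HSP length"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_query_length(query_len=QUERY_LEN, hsp_len=HSP_LEN):
    """Query length minus the effective HSP length."""
    return query_len - hsp_len


def effective_db_length(db_letters=DB_LETTERS, db_seqs=DB_SEQS, hsp_len=HSP_LEN):
    """Assumption: nucleotide letters / 3, minus one HSP length per sequence."""
    return db_letters // 3 - db_seqs * hsp_len


def search_space():
    """Effective search space: effective query length times effective database length."""
    return effective_query_length() * effective_db_length()


def expect(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Single-HSP E-value for a raw score over the effective search space."""
    return search_space() * k * math.exp(-lam * raw_score)


if __name__ == "__main__":
    print("bits for raw score 226:", round(bit_score(226), 1))
    print("effective database length:", effective_db_length())
    print("effective search space:", search_space())
    print("E-value for raw score 191: %.1e" % expect(191))
```

Because Lambda and K are printed to only three significant figures, E-values recomputed this way can differ slightly in the last digit from those shown in the report.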