
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149600.1 - phase: 0
(494 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AV780009 160 4e-40
BP075943 128 2e-30
BP066094 110 7e-28
AV424544 87 8e-18
BP043850 85 3e-17
BP084332 84 4e-17
TC16929 weakly similar to UP|Q9LFY6 (Q9LFY6) T7N9.5, partial (4%) 82 2e-16
AV766665 81 4e-16
TC15664 weakly similar to UP|O81617 (O81617) F8M12.17 protein, p... 77 8e-15
TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela... 70 8e-13
BP057233 69 2e-12
TC8952 similar to UP|O82607 (O82607) T2L5.9 protein, partial (3%) 65 3e-11
AV777635 46 2e-05
TC10484 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragm... 38 4e-05
TC18909 40 2e-04
BP052124 42 3e-04
AV772199 32 5e-04
TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-rela... 40 7e-04
AV767042 40 7e-04
TC19474 weakly similar to UP|Q9FJA1 (Q9FJA1) Similarity to retro... 37 0.010
>AV780009
Length = 529
Score = 160 bits (405), Expect = 4e-40
Identities = 72/159 (45%), Positives = 107/159 (67%)
Frame = -1
Query: 313 RILRYLKSTPGKGILFSNNGHLRVEGYTDADWAGSADDRRSTSGYFTFVGGNLVTWRSKK 372
R+LRY+K P +G+ FS + L+++ Y+D+DWAG D RRS +GY F+G +L++WR+KK
Sbjct: 517 RVLRYVKGAPAQGLFFSADSPLKLQAYSDSDWAGCPDTRRSVTGYSIFLGTSLISWRTKK 338
Query: 373 QPVVARSSAEVEFRGMALGMCELLWVKSVLSDLGFEPKEAMSLYCDNTSAIEIAHNPVQH 432
Q V+RSS+E E+R +A +CE+ W+ + L + L+CDN SA+ IAHNP H
Sbjct: 337 QTTVSRSSSEAEYRALAATVCEVQWLSYLFQFLKLNVPLPVPLFCDNQSALHIAHNPTFH 158
Query: 433 DRTKHVEIDRHFIKEKLDAGTIVFPFVRSEQQLADMLTK 471
+RTKH+E+D H ++ KL AG I + + QLAD+ TK
Sbjct: 157 ERTKHIELDCHVVRAKLQAGLIHLLPISTHHQLADIFTK 41
>BP075943
Length = 547
Score = 128 bits (321), Expect = 2e-30
Identities = 67/161 (41%), Positives = 100/161 (61%), Gaps = 1/161 (0%)
Frame = -3
Query: 169 GKITALIIYVDDMIVTGNNQDEISSLQKYLTSKFEMKQLGNLKYFMGIEVARSKHGIFLC 228
G TAL++YVDD+++ GN+ EI ++ L +F +K LG KYF+G+E+ARS GI L
Sbjct: 536 GSFTALLLYVDDVLLAGNDMHEIQLVKSSLHDQFRIKDLGEAKYFLGLEIARSTSGIVLN 357
Query: 229 QRKYTLDLLSETGLLASKSAETPIEQNHK-LFHCLNSNITDKGRYQSLVGKLIYLSHTRP 287
QRKY L L+S++G + N + L + +TD G Y+ +VG+L+YL+ TRP
Sbjct: 356 QRKYALQLISDSGHFGFQPRFYSHGTNSQTLGTNTGTPLTDIGSYRRIVGRLLYLNTTRP 177
Query: 288 DITYAVNVVSQFMHDPRKPHMDVVERILRYLKSTPGKGILF 328
DIT+AVN +SQF+ P H + L+YL +PG G+ +
Sbjct: 176 DITFAVNQLSQFLSAPTDIHEQQLTGFLKYL*GSPGSGLFY 54
>BP066094
Length = 532
Score = 110 bits (275), Expect(2) = 7e-28
Identities = 59/149 (39%), Positives = 88/149 (58%), Gaps = 1/149 (0%)
Frame = +2
Query: 70 AKLNTVRILLSLAANQDWPLLQFDVKNAFLHGEISEDIYMDTPPGMV-YSNGLKVCKLKK 128
+K +R+L+S + N + L Q +VK+AFL+G ISE++Y+ PPG N + KLKK
Sbjct: 86 SKTEAIRLLISFSVNHNIILHQMNVKSAFLNGYISEEVYVHQPPGXEDEKNSDHIFKLKK 265
Query: 129 ALYGLKQSPRAWFGRFTKSMKTFGYKASNSDHTLFLKRGEGKITALIIYVDDMIVTGNNQ 188
+LYGLKQ+PRAW+ R + + D TLF K + I + IYVDD+I N
Sbjct: 266 SLYGLKQAPRAWYERLSSFLLENEXVRGKVDTTLFCKTYKDDILIVQIYVDDIIFGSANP 445
Query: 189 DEISSLQKYLTSKFEMKQLGNLKYFMGIE 217
+ + ++FEM+ +G LKYF+GI+
Sbjct: 446 SLCKEFSEMMQAEFEMRMMGELKYFLGIQ 532
Score = 30.4 bits (67), Expect(2) = 7e-28
Identities = 12/18 (66%), Positives = 15/18 (82%)
Frame = +3
Query: 54 YTQSYGVDYQETFAPVAK 71
Y+Q G+DY ETFAPVA+
Sbjct: 39 YSQQ*GIDYTETFAPVAR 92
>AV424544
Length = 276
Score = 86.7 bits (213), Expect = 8e-18
Identities = 40/90 (44%), Positives = 59/90 (65%)
Frame = +3
Query: 143 RFTKSMKTFGYKASNSDHTLFLKRGEGKITALIIYVDDMIVTGNNQDEISSLQKYLTSKF 202
+ + + GY S DH+LF K + T +++YVDD+I+ GN+ +EI ++ L +F
Sbjct: 6 KLSSYLHILGYIQSAHDHSLFTKFRDASFTVILVYVDDLILAGNDLNEIQCVKNKLDIQF 185
Query: 203 EMKQLGNLKYFMGIEVARSKHGIFLCQRKY 232
+K LG LKYF+G+EVARS G+FL QRKY
Sbjct: 186 RIKDLGTLKYFLGLEVARSSCGLFLSQRKY 275
>BP043850
Length = 515
Score = 84.7 bits (208), Expect = 3e-17
Identities = 39/92 (42%), Positives = 61/92 (65%)
Frame = -1
Query: 394 ELLWVKSVLSDLGFEPKEAMSLYCDNTSAIEIAHNPVQHDRTKHVEIDRHFIKEKLDAGT 453
E++W++ +LS+LGF + L+ DNTSAI+IA NPV H+ T+H+E+D H ++E D
Sbjct: 512 EIIWLRGLLSELGFLQSQPTPLHADNTSAIQIAANPVYHEWTRHIEVDCHSVREAYDRRV 333
Query: 454 IVFPFVRSEQQLADMLTKGVSSKVFNESLLKL 485
I P V + Q+AD+LTK ++ + N + KL
Sbjct: 332 ITLPHVSTSVQIADILTKSLTRQRHNFLVSKL 237
>BP084332
Length = 368
Score = 84.3 bits (207), Expect = 4e-17
Identities = 41/113 (36%), Positives = 66/113 (58%)
Frame = +2
Query: 373 QPVVARSSAEVEFRGMALGMCELLWVKSVLSDLGFEPKEAMSLYCDNTSAIEIAHNPVQH 432
Q +A S+AE E+ A+ ++LW+K L D + +YCDNT+AI ++ NP+ H
Sbjct: 32 QSTIALSTAEAEYISAAICSTQMLWMKHQLEDYQILESN-IPIYCDNTAAISLSKNPILH 208
Query: 433 DRTKHVEIDRHFIKEKLDAGTIVFPFVRSEQQLADMLTKGVSSKVFNESLLKL 485
R KH+E+ HFI++ + G ++ FV ++ Q AD+ TK ++ FN L L
Sbjct: 209 SRAKHIEVKYHFIRDYVQKGVLLLKFVDTDHQWADIFTKPLAEDRFNFILKNL 367
>TC16929 weakly similar to UP|Q9LFY6 (Q9LFY6) T7N9.5, partial (4%)
Length = 553
Score = 82.0 bits (201), Expect = 2e-16
Identities = 40/97 (41%), Positives = 61/97 (62%)
Frame = +1
Query: 397 WVKSVLSDLGFEPKEAMSLYCDNTSAIEIAHNPVQHDRTKHVEIDRHFIKEKLDAGTIVF 456
W+ +L DL ++ +YCDN SA IA NPV H+RTKH+EID H ++E++ G I
Sbjct: 4 WLTYLLQDLKVPFEQPALVYCDNNSARHIAANPVFHERTKHIEIDCHIVRERIQKGLIHL 183
Query: 457 PFVRSEQQLADMLTKGVSSKVFNESLLKLGMCDIHAP 493
+ S + LAD+ TK +S + F++ KLG+ +I +P
Sbjct: 184 LPISSSEPLADIYTKALSPQNFHQICAKLGLINICSP 294
>AV766665
Length = 601
Score = 80.9 bits (198), Expect = 4e-16
Identities = 39/74 (52%), Positives = 50/74 (66%)
Frame = +3
Query: 313 RILRYLKSTPGKGILFSNNGHLRVEGYTDADWAGSADDRRSTSGYFTFVGGNLVTWRSKK 372
RI YLK+ +G LF G ++G+T AD+ GS DR ST GY+ F+ GNLVTWRSK+
Sbjct: 372 RIF*YLKANSRRGPLFQKEGKSSMDGFTYADYLGSIVDRLSTMGYYMFLSGNLVTWRSKQ 551
Query: 373 QPVVARSSAEVEFR 386
Q ++ARSS E E R
Sbjct: 552 QNIIARSSGEAELR 593
>TC15664 weakly similar to UP|O81617 (O81617) F8M12.17 protein, partial (4%)
Length = 670
Score = 76.6 bits (187), Expect = 8e-15
Identities = 42/100 (42%), Positives = 59/100 (59%)
Frame = +1
Query: 393 CELLWVKSVLSDLGFEPKEAMSLYCDNTSAIEIAHNPVQHDRTKHVEIDRHFIKEKLDAG 452
CEL W+ +L DL LYCD+ SA IA N V H+RTKH++ID H ++EKL A
Sbjct: 1 CEL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAK 180
Query: 453 TIVFPFVRSEQQLADMLTKGVSSKVFNESLLKLGMCDIHA 492
+ S Q AD+LTK + S F+ + KLG+ +I++
Sbjct: 181 LFHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYS 300
>TC12832 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease] , partial (9%)
Length = 747
Score = 70.1 bits (170), Expect = 8e-13
Identities = 44/147 (29%), Positives = 72/147 (48%), Gaps = 9/147 (6%)
Frame = +2
Query: 144 FTKSMKTFGYKASNSDHTLFLKR-GEGKITALIIYVDDMIVTGNNQDEISSLQKYLTSKF 202
F + + GY +SDH + KR + L++YVDDM+V G N+D + L+ L +F
Sbjct: 2 FDSFIMSLGYNRLSSDHCTYHKRFDDNDFIILLLYVDDMLVVGPNKDRVQELKAQLAREF 181
Query: 203 EMKQLGNLKYFMGIEVARSK--HGIFLCQRKYTLDLLSETGLLASKSAETPIEQNHKLFH 260
+MK LG +G+++ R + I+L Q+ Y +L + TP+ N+KL
Sbjct: 182 DMKDLGPANKILGMQIHRDRKDRRIWLSQKNYLQKVLRRFNMQDCNPISTPLPVNYKLSS 361
Query: 261 CL----NSNITDKGR--YQSLVGKLIY 281
+ + + R Y S VG L+Y
Sbjct: 362 SMIPSSEAERMEMSRVPYASAVGSLMY 442
>BP057233
Length = 473
Score = 68.9 bits (167), Expect = 2e-12
Identities = 35/79 (44%), Positives = 53/79 (66%)
Frame = -2
Query: 415 LYCDNTSAIEIAHNPVQHDRTKHVEIDRHFIKEKLDAGTIVFPFVRSEQQLADMLTKGVS 474
L+CDN SA +A NPV H R+KH+EID H+I++++ +V +V + Q+AD LTK +S
Sbjct: 445 LWCDNLSAKALASNPVLHARSKHIEIDVHYIRDQVLQNEVVVAYVPTTDQIADCLTKPLS 266
Query: 475 SKVFNESLLKLGMCDIHAP 493
F++ KLG+ IH+P
Sbjct: 265 HTRFSQLRDKLGV--IHSP 215
>TC8952 similar to UP|O82607 (O82607) T2L5.9 protein, partial (3%)
Length = 550
Score = 64.7 bits (156), Expect = 3e-11
Identities = 33/82 (40%), Positives = 48/82 (58%)
Frame = +3
Query: 393 CELLWVKSVLSDLGFEPKEAMSLYCDNTSAIEIAHNPVQHDRTKHVEIDRHFIKEKLDAG 452
CE LW+ L+DL + +YCDN SA+ +A N V H RT+++EID H + K+ G
Sbjct: 18 CEALWLTYALADLRIASLLLVVIYCDNRSALHLAANSVFHKRTENIEIDCHIV*VKVLFG 197
Query: 453 TIVFPFVRSEQQLADMLTKGVS 474
+ V S Q+AD+ TK +S
Sbjct: 198 ILHLLHVPSSDQVADVFTKTIS 263
>AV777635
Length = 382
Score = 45.8 bits (107), Expect = 2e-05
Identities = 20/32 (62%), Positives = 27/32 (83%)
Frame = -2
Query: 297 SQFMHDPRKPHMDVVERILRYLKSTPGKGILF 328
SQFMHDP + H+D RIL+YLK++PG+G+LF
Sbjct: 381 SQFMHDPHERHLD---RILQYLKASPGRGLLF 295
>TC10484 similar to UP|Q850H7 (Q850H7) Gag-pol polyprotein (Fragment),
partial (20%)
Length = 479
Score = 38.1 bits (87), Expect(2) = 4e-05
Identities = 15/33 (45%), Positives = 22/33 (66%)
Frame = +3
Query: 397 WVKSVLSDLGFEPKEAMSLYCDNTSAIEIAHNP 429
WV +LS++G E M L+CDN +A+ I+ NP
Sbjct: 3 WVGQILSEMGIERISPMPLWCDNQAALHISSNP 101
Score = 25.8 bits (55), Expect(2) = 4e-05
Identities = 8/12 (66%), Positives = 11/12 (91%)
Frame = +2
Query: 432 HDRTKHVEIDRH 443
H+RTKH+E+D H
Sbjct: 113 HERTKHIEVDCH 148
>TC18909
Length = 621
Score = 40.4 bits (93), Expect = 7e-04
Identities = 27/85 (31%), Positives = 38/85 (43%), Gaps = 1/85 (1%)
Frame = +2
Query: 285 TRPDITYAVNVVSQFMHDPRKPHMDVVERILRYL-KSTPGKGILFSNNGHLRVEGYTDAD 343
T P I+++VN V F+ + + H + IL K+ G N ++DAD
Sbjct: 326 TSPQISFSVNKVC*FLSESLEEHGTAGKCILNVT*KAL*TMGFFIQTN----FPAFSDAD 493
Query: 344 WAGSADDRRSTSGYFTFVGGNLVTW 368
WA + DDRRST F G W
Sbjct: 494 WASNVDDRRSTFWEIVFTLGPKFRW 568
Score = 31.6 bits (70), Expect(2) = 2e-04
Identities = 16/22 (72%), Positives = 17/22 (76%)
Frame = +2
Query: 61 DYQETFAPVAKLNTVRILLSLA 82
DY ET +PV K TVRILLSLA
Sbjct: 62 DYTETVSPVVKPVTVRILLSLA 127
Score = 29.6 bits (65), Expect(2) = 2e-04
Identities = 12/18 (66%), Positives = 17/18 (93%)
Frame = +1
Query: 40 GTIDRYKARLVAKGYTQS 57
G+I++YKARLVAKG+ Q+
Sbjct: 1 GSINKYKARLVAKGFHQT 54
>BP052124
Length = 467
Score = 41.6 bits (96), Expect = 3e-04
Identities = 18/39 (46%), Positives = 24/39 (61%)
Frame = -2
Query: 416 YCDNTSAIEIAHNPVQHDRTKHVEIDRHFIKEKLDAGTI 454
+CDN SA+ +A P+ H R H E+D + KEKL G I
Sbjct: 454 FCDNNSALTLAPRPI*HSRPVHFEVDCPYPKEKLGTGLI 338
>AV772199
Length = 495
Score = 31.6 bits (70), Expect(2) = 5e-04
Identities = 16/26 (61%), Positives = 18/26 (68%)
Frame = +2
Query: 425 IAHNPVQHDRTKHVEIDRHFIKEKLD 450
IA+N Q DRTKH EI H IKEK +
Sbjct: 182 IANNSNQ*DRTKHKEIH*HSIKEKFE 259
Score = 28.5 bits (62), Expect(2) = 5e-04
Identities = 14/25 (56%), Positives = 17/25 (68%)
Frame = +1
Query: 365 LVTWRSKKQPVVARSSAEVEFRGMA 389
+V RSKKQ + ARSS E EF+ A
Sbjct: 16 IVI*RSKKQEMAARSSVETEFQARA 90
>TC19389 weakly similar to UP|POLX_TOBAC (P10978) Retrovirus-related Pol
polyprotein from transposon TNT 1-94 [Contains: Protease
; Reverse transcriptase ; Endonuclease] , partial (6%)
Length = 498
Score = 40.4 bits (93), Expect = 7e-04
Identities = 34/134 (25%), Positives = 58/134 (42%)
Frame = +3
Query: 359 TFVGGNLVTWRSKKQPVVARSSAEVEFRGMALGMCELLWVKSVLSDLGFEPKEAMSLYCD 418
TF GG V W S+ Q VA S+AE EF ELLW+K+ L + F +
Sbjct: 9 TFAGG-AVAWPSRLQKCVALSTAEAEFIAATEACHELLWMKNFLQNAWFHSHPILCCIVI 185
Query: 419 NTSAIEIAHNPVQHDRTKHVEIDRHFIKEKLDAGTIVFPFVRSEQQLADMLTKGVSSKVF 478
+ +A + + +++ L++ + + ++ ADM+TK + +
Sbjct: 186 TKALFTLARILLFIQDPSTLMFVIIGLRDVLNSKLLELEKIHTDDDGADMMTKSLPRE-- 359
Query: 479 NESLLKLGMCDIHA 492
KL +CD+ A
Sbjct: 360 -----KLEVCDMIA 386
>AV767042
Length = 444
Score = 40.4 bits (93), Expect = 7e-04
Identities = 16/44 (36%), Positives = 30/44 (67%)
Frame = -3
Query: 56 QSYGVDYQETFAPVAKLNTVRILLSLAANQDWPLLQFDVKNAFL 99
Q+ GVD +TF+PV K +R + ++A ++ WP+ Q +V+N ++
Sbjct: 349 QTAGVDCNKTFSPVVKPAPIRTVFTIALSRSWPIPQLNVQNLWI 218
>TC19474 weakly similar to UP|Q9FJA1 (Q9FJA1) Similarity to retroelement pol
polyprotein, partial (3%)
Length = 517
Score = 36.6 bits (83), Expect = 0.010
Identities = 16/26 (61%), Positives = 21/26 (80%)
Frame = -1
Query: 364 NLVTWRSKKQPVVARSSAEVEFRGMA 389
NL++W+SK+ +VARS AE EFR MA
Sbjct: 508 NLISWKSKETIIVARSRAEAEFRVMA 431
Score = 33.1 bits (74), Expect = 0.11
Identities = 17/38 (44%), Positives = 22/38 (57%)
Frame = -2
Query: 434 RTKHVEIDRHFIKEKLDAGTIVFPFVRSEQQLADMLTK 471
R KH+EID HF+ ++ I FV S QL D+ TK
Sbjct: 309 RAKHIEIDFHFL*DRRLYLDISTRFVNSNDQLTDVFTK 196
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.320 0.136 0.403
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,995,697
Number of Sequences: 28460
Number of extensions: 102735
Number of successful extensions: 505
Number of sequences better than 10.0: 53
Number of HSP's better than 10.0 without gapping: 496
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 500
length of query: 494
length of database: 4,897,600
effective HSP length: 94
effective length of query: 400
effective length of database: 2,222,360
effective search space: 888944000
effective search space used: 888944000
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (26.6 bits)
Medicago: description of AC149600.1