
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146805.1 - phase: 0 /pseudo
(1445 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
NP595172 polyprotein [Glycine max] 278 2e-74
TC234828 208 1e-53
NP395548 reverse transcriptase [Glycine max] 138 2e-32
CF922488 124 2e-28
NP395547 reverse transcriptase [Glycine max] 123 7e-28
NP334778 reverse transcriptase [Glycine max] 105 1e-22
TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, part... 94 4e-19
BI317507 64 6e-10
CF922531 60 9e-09
BM142940 40 2e-08
BE806882 similar to GP|27764548|gb polyprotein {Glycine max}, pa... 54 4e-07
CD487724 47 6e-05
BG725601 similar to PIR|H86337|H86 protein F5M15.26 [imported] -... 44 7e-04
BU549069 35 0.31
TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%) 32 2.0
CO981082 32 2.6
BE348096 31 3.5
CD408465 31 4.5
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 278 bits (710), Expect = 2e-74
Identities = 152/375 (40%), Positives = 225/375 (59%), Gaps = 28/375 (7%)
Frame = +1
Query: 387 PLSMFGRDFEMDLVCLPLSGMDVILGMNWL--------EYKHVHINCFSKSVYFSSAEEE 438
PL + G++ ++ + L +SG DVILG WL +Y + + F + + E
Sbjct: 1357 PLHIQGQEVKVPVYLLQISGADVILGSTWLATLGPHVADYAALTLKFFQNDKFITLQGE- 1533
Query: 439 CGAEFLSTKQLKQMERDGILMFSLMASLSIENQAVIDKLQ------VVCDFPEVYPDEIP 492
G + QL R L + SIE I +Q + D P E+
Sbjct: 1534 -GNSEATQAQLHHFRR-------LQNTKSIEECFAIQLIQKEVPEDTLKDLPTNIDPELA 1689
Query: 493 --------------DVPPEREVEFSINLVPGTKPVSMAPYRMSASELSELKKQLEDLLEK 538
+PP+RE + +I L G+ PV + PYR ++ +++K ++++L +
Sbjct: 1690 ILLHTYAQVFAVPASLPPQREQDHAIPLKQGSGPVKVRPYRYPHTQKDQIEKMIQEMLVQ 1869
Query: 539 KFVRPSVSPWGAPVLLVKKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVF 598
++PS SP+ P+LLVKKKDGS R C DYR LN +T+K+ +P+P +D+L+D+L GA+ F
Sbjct: 1870 GIIQPSNSPFSLPILLVKKKDGSWRFCTDYRALNAITVKDSFPMPTVDELLDELHGAQYF 2049
Query: 599 SKIDLRSGYHQIKVKDEDMQKTALRTRYGHYEYKVMPFGVTNAPRVFMEYMNRIFHAFLD 658
SK+DLRSGYHQI V+ ED +KTA RT +GHYE+ VMPFG+TNAP F MN+IF L
Sbjct: 2050 SKLDLRSGYHQILVQPEDREKTAFRTHHGHYEWLVMPFGLTNAPATFQCLMNKIFQFALR 2229
Query: 659 RFVVVFIDDILIYSKTEEEHAEHLKIVLQVLKEKKLYAKLSKCEFWLKEVSFLGHVISGD 718
+FV+VF DDILIYS + ++H +HL+ VLQ LK+ +L+A+LSKC F EV +LGH +SG
Sbjct: 2230 KFVLVFFDDILIYSASWKDHLKHLESVLQTLKQHQLFARLSKCSFGDTEVDYLGHKVSGL 2409
Query: 719 GIAVDPSKVEAVSQW 733
G++++ +KV+AV W
Sbjct: 2410 GVSMENTKVQAVLDW 2454
>TC234828
Length = 857
Score = 208 bits (530), Expect = 1e-53
Identities = 102/217 (47%), Positives = 147/217 (67%), Gaps = 4/217 (1%)
Frame = +3
Query: 339 YYTSATHCFIAFDCVSALGLDLSDMS*EMVVETPAKGSVTTSLVCLRCPLSMFGRDFEMD 398
Y + ATH FI+ CV LGL +++ +MVV TP VTTS VCL+CP+ + GR F D
Sbjct: 225 YDSGATHSFISHACVERLGLCATELPYDMVVSTPTSEPVTTSRVCLKCPIIVEGRSFMAD 404
Query: 399 LVCLPLSGMDVILGMNWLEYKHVHINCFSKSVYFSSAEEECGAEFLSTKQLKQ----MER 454
L+CLPL+ +DVILGM+WL H+ ++C K + F G + + ++ LK+ E
Sbjct: 405 LICLPLAHLDVILGMDWLSTNHIFLDCKEKMLVF-------GGDVVPSEPLKEDAANEET 563
Query: 455 DGILMFSLMASLSIENQAVIDKLQVVCDFPEVYPDEIPDVPPEREVEFSINLVPGTKPVS 514
+ + + ++ S+ +E A + + VV +FPEV+PD++ ++PPEREVEF I++VPG PVS
Sbjct: 564 EDVRTYMVLFSMYVEEDAEVSCIPVVSEFPEVFPDDVCELPPEREVEFIIDVVPGANPVS 743
Query: 515 MAPYRMSASELSELKKQLEDLLEKKFVRPSVSPWGAP 551
+APYRMS EL+E+K Q++DLL K+FVRPS SPWGAP
Sbjct: 744 IAPYRMSPVELAEVKAQVQDLLSKQFVRPSASPWGAP 854
>NP395548 reverse transcriptase [Glycine max]
Length = 762
Score = 138 bits (347), Expect = 2e-32
Identities = 78/224 (34%), Positives = 124/224 (54%), Gaps = 19/224 (8%)
Frame = +1
Query: 528 LKKQLEDLLEKKFVRP-SVSPWGAPVLLVKKKDG------------------SMRLCIDY 568
++K++ LLE + P S S W +PVL+V KK+G S +LCIDY
Sbjct: 1 VRKEVLKLLEVGLIYPISDSAWVSPVLVVSKKEGMTVIRNEKNDLIPTRTVTSWKLCIDY 180
Query: 569 RQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQIKVKDEDMQKTALRTRYGH 628
R+LN+ T K+ +PLP +D ++++L G + +D GY+QI V +D +K A +G
Sbjct: 181 RKLNEATRKDHFPLPFMDQMLERLAGHAYYCFLDAYFGYNQIVVDPKDQEKMAFTCPFGV 360
Query: 629 YEYKVMPFGVTNAPRVFMEYMNRIFHAFLDRFVVVFIDDILIYSKTEEEHAEHLKIVLQV 688
+ Y+ +PFG+ NAP F M IF +++ + VF+DD ++ + E + L++VLQ
Sbjct: 361 FAYRRIPFGLCNAPTTFQMCMLAIFADIVEKSIEVFMDDFSVFVPSLESCLKKLEMVLQR 540
Query: 689 LKEKKLYAKLSKCEFWLKEVSFLGHVISGDGIAVDPSKVEAVSQ 732
E L KC F ++E LGH IS GI VD +K++ + +
Sbjct: 541 CVETNLVLNWEKCHFMVREGIVLGHKISTRGIEVDQTKIDVIEK 672
>CF922488
Length = 741
Score = 124 bits (312), Expect = 2e-28
Identities = 65/176 (36%), Positives = 106/176 (59%)
Frame = +3
Query: 555 VKKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQIKVKD 614
V K+DG + +C+DYR LN + K+++PLP I+ L+D FS +D SGY+QIK+
Sbjct: 3 VLKEDGKV*MCVDYRDLN*ASPKDKFPLPHINVLVDNTTSFSQFSFMDGFSGYNQIKIAP 182
Query: 615 EDMQKTALRTRYGHYEYKVMPFGVTNAPRVFMEYMNRIFHAFLDRFVVVFIDDILIYSKT 674
EDM+KT T +G + YK M FG+ N + M +F + + + V++DD+++ S+T
Sbjct: 183 EDMEKTTFITLWGTFCYKAMSFGLKNVGATYQRAMVALF*DMMHKEIEVYMDDMIVKSRT 362
Query: 675 EEEHAEHLKIVLQVLKEKKLYAKLSKCEFWLKEVSFLGHVISGDGIAVDPSKVEAV 730
EEEH +L+ + + L++ +L +KC F +K L + S GI VD +KV+ +
Sbjct: 363 EEEHLVNLRKLFRRLRKYRLRLNPAKCMFEVKSRKLLDFIDS*RGIEVDSNKVKVI 530
>NP395547 reverse transcriptase [Glycine max]
Length = 762
Score = 123 bits (308), Expect = 7e-28
Identities = 74/224 (33%), Positives = 115/224 (51%), Gaps = 19/224 (8%)
Frame = +1
Query: 528 LKKQLEDLLEKKFVRP-SVSPWGAPVLLVKKKDGSM------------------RLCIDY 568
++K++ LLE + P S S W +PV +V KK G R+CIDY
Sbjct: 1 VRKEVFKLLEAGLIYPISDSSWVSPVQVVPKKGGMTVVKNDRNELIPTRRVTRWRMCIDY 180
Query: 569 RQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQIKVKDEDMQKTALRTRYGH 628
R+LN+ T K+ YPLP +D ++ +L + +D SGY+QI V +D +KTA +
Sbjct: 181 RKLNEATRKDHYPLPFMDQMLKRLARQSFYRFLDGYSGYNQIAVDPQDQEKTAFTCPFSV 360
Query: 629 YEYKVMPFGVTNAPRVFMEYMNRIFHAFLDRFVVVFIDDILIYSKTEEEHAEHLKIVLQV 688
+ Y+ MPFG+ NA F M IF +++ + VF+DD + + +L+ VLQ
Sbjct: 361 FAYRRMPFGLCNASTTFQRCMMAIFDDMVEKCIEVFMDDFSFFGASFGNCLANLEKVLQR 540
Query: 689 LKEKKLYAKLSKCEFWLKEVSFLGHVISGDGIAVDPSKVEAVSQ 732
++ L KC F ++E LGH IS GI V K++ + +
Sbjct: 541 CEKSNLVLNWEKCHFMVQEGIVLGHKISKRGIEVVKEKLDVIDK 672
>NP334778 reverse transcriptase [Glycine max]
Length = 431
Score = 105 bits (262), Expect = 1e-22
Identities = 53/143 (37%), Positives = 85/143 (59%)
Frame = +3
Query: 563 RLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYHQIKVKDEDMQKTAL 622
R+C+DYR LN+ + K+ +PLP ID LM + +FS +D SGY+QIK+ EDM+KT
Sbjct: 3 RMCVDYRDLNRASPKDNFPLPHIDILMANMASFALFSFMDGFSGYNQIKMAPEDMEKTTF 182
Query: 623 RTRYGHYEYKVMPFGVTNAPRVFMEYMNRIFHAFLDRFVVVFIDDILIYSKTEEEHAEHL 682
T +G + YKVM FG+ N + M +F + + + ++D+++ S+ EEEH +L
Sbjct: 183 ITLWGTFCYKVMSFGLKNFGATYHRAMVALFQDMMHKEIEAYVDEMIAKSRMEEEHLVNL 362
Query: 683 KIVLQVLKEKKLYAKLSKCEFWL 705
+ + L++ +L KC F L
Sbjct: 363 QNLFGQLRKYRLRLNPRKCVFGL 431
>TC233837 similar to UP|Q6WAY3 (Q6WAY3) Gag/pol polyprotein, partial (6%)
Length = 402
Score = 94.0 bits (232), Expect = 4e-19
Identities = 48/133 (36%), Positives = 79/133 (59%)
Frame = +2
Query: 598 FSKIDLRSGYHQIKVKDEDMQKTALRTRYGHYEYKVMPFGVTNAPRVFMEYMNRIFHAFL 657
FS +D SGY+QI + ED++KT T +G + Y+VM FG+ N + M +FH +
Sbjct: 2 FSFMDGFSGYNQI*MAREDVEKTTFVTLWGTFSYRVMAFGLKNTGATYQRAMVALFHDMM 181
Query: 658 DRFVVVFIDDILIYSKTEEEHAEHLKIVLQVLKEKKLYAKLSKCEFWLKEVSFLGHVISG 717
+ + V++DD++ S+TE EH +L + L++ +L +KC F +K LG ++S
Sbjct: 182 HKEIEVYVDDMIAKSRTETEHLVNLCKLFGRLQKYQLKLNPTKCTFGVKSGKLLGFIVSQ 361
Query: 718 DGIAVDPSKVEAV 730
GI +DP KV+A+
Sbjct: 362 KGIEIDPEKVKAL 400
>BI317507
Length = 359
Score = 63.5 bits (153), Expect = 6e-10
Identities = 31/67 (46%), Positives = 44/67 (65%)
Frame = -1
Query: 667 DILIYSKTEEEHAEHLKIVLQVLKEKKLYAKLSKCEFWLKEVSFLGHVISGDGIAVDPSK 726
+ILIYS + H HL VL VLK+++L A KC F + +LGHVIS D +A+D +K
Sbjct: 353 NILIYSPDWKSHIMHLTAVLDVLKKERLVANRKKCYFSQTTIEYLGHVISKDCVAMDSNK 174
Query: 727 VEAVSQW 733
V++V +W
Sbjct: 173 VKSVIEW 153
>CF922531
Length = 602
Score = 59.7 bits (143), Expect = 9e-09
Identities = 39/108 (36%), Positives = 61/108 (56%), Gaps = 6/108 (5%)
Frame = -2
Query: 458 LMFSLMASLSIENQAVIDKL-----QVVCDFPEVYPDEIP-DVPPEREVEFSINLVPGTK 511
L+ S SLS I+ L +++ +F +++P EIP +PP R +E I+LVP
Sbjct: 325 LLLSRETSLSTVTIPTIETLPPKVQELLHEFGDIFPKEIPLGLPPLRGIEHQIDLVPRAS 146
Query: 512 PVSMAPYRMSASELSELKKQLEDLLEKKFVRPSVSPWGAPVLLVKKKD 559
+ YR + E E++ Q+++LLEK +V+ S+S VLLV KKD
Sbjct: 145 LPNRPTYRTNPQETKEIESQVKELLEKGWVQESLSLCVVLVLLVPKKD 2
>BM142940
Length = 422
Score = 40.0 bits (92), Expect(2) = 2e-08
Identities = 20/34 (58%), Positives = 23/34 (66%)
Frame = -1
Query: 661 VVVFIDDILIYSKTEEEHAEHLKIVLQVLKEKKL 694
+ F DILIYS E +H +HLKIVLQ KE KL
Sbjct: 260 ICFFSYDILIYSINEGKHEKHLKIVLQTFKEAKL 159
Score = 38.1 bits (87), Expect(2) = 2e-08
Identities = 16/41 (39%), Positives = 24/41 (58%)
Frame = -3
Query: 693 KLYAKLSKCEFWLKEVSFLGHVISGDGIAVDPSKVEAVSQW 733
K+ A KC F K+ +L H+IS G++ DP K+E + W
Sbjct: 165 KVGANRKKCSFGCKKFEYLVHMISVKGVSADPKKLEVMVAW 43
>BE806882 similar to GP|27764548|gb polyprotein {Glycine max}, partial (3%)
Length = 153
Score = 54.3 bits (129), Expect = 4e-07
Identities = 26/51 (50%), Positives = 34/51 (65%)
Frame = -1
Query: 638 VTNAPRVFMEYMNRIFHAFLDRFVVVFIDDILIYSKTEEEHAEHLKIVLQV 688
+TNAP F MN IF L ++V+VF DDIL+YS T EH HL++V +V
Sbjct: 153 LTNAPTSFQCLMNHIFQHALRKYVLVFFDDILVYSSTWHEHLCHLEVVFKV 1
>CD487724
Length = 676
Score = 47.0 bits (110), Expect = 6e-05
Identities = 48/217 (22%), Positives = 92/217 (42%), Gaps = 22/217 (10%)
Frame = +3
Query: 343 ATHCFIAFDCVSALGLDLSDMS*EMVVETPAKGSVTTSLVCLRCPLSMFGRDFEMDLVCL 402
+TH F+ VS LGL + V + + +C P+S+ +F + L L
Sbjct: 48 STHNFVQQPLVSQLGLPCRSTP-PLRVMVGNGHHLKCTTICEAIPISIQNIEFLVHLYVL 224
Query: 403 PLSGMDVILGMNWLEYKH---VHINCFSKSVYFS------SAEEECGAEFLSTKQLKQME 453
P+ G +++LG+ WL+ V N S ++ E E L+ QL+++
Sbjct: 225 PIVGANIVLGVQWLKTLGPILVDYNSLSMQFFYQHRLVQLKGESEAQLGLLNHHQLRRLH 404
Query: 454 RDGILMFSLMASLSIEN-------------QAVIDKLQVVCDFPEVYPDEIPDVPPEREV 500
+ + ++ EN Q ++D+ + P+ +PP RE
Sbjct: 405 QTHEPVTYFHIAILTENTSPTSSPPLPQPIQHLLDQFSALFQ*PQ-------GLPPARET 563
Query: 501 EFSINLVPGTKPVSMAPYRMSASELSELKKQLEDLLE 537
+ I+L+P ++PV+M Y +E++ Q+ +L+
Sbjct: 564 DHHIHLLP*SEPVNMRLY*YPHY*NNEIEHQVNLMLQ 674
>BG725601 similar to PIR|H86337|H86 protein F5M15.26 [imported] - Arabidopsis
thaliana, partial (1%)
Length = 285
Score = 43.5 bits (101), Expect = 7e-04
Identities = 20/41 (48%), Positives = 26/41 (62%)
Frame = -3
Query: 552 VLLVKKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQL 592
V++VKK +G R+C DY LN K+ YPLP ID + D L
Sbjct: 124 VVMVKKPNGKWRICTDYIDLN*ACPKDAYPLPNIDHMTDGL 2
>BU549069
Length = 615
Score = 34.7 bits (78), Expect = 0.31
Identities = 21/65 (32%), Positives = 33/65 (50%)
Frame = -3
Query: 1310 SHSYDWCRTCIEVKEVDPEVYWSVSDIRKSWNGGLSSGFTTASFKFARRFPCVATSEVCS 1369
SHS DW + E+ + +Y S + KS + G+ + T +F ++ CV+T V
Sbjct: 610 SHSKDWGWSSTEIPKTHTSLYRSFPNS*KSXSCGIPNCITPITF*SSQCLSCVSTPYVYP 431
Query: 1370 GSISC 1374
SISC
Sbjct: 430 *SISC 416
>TC211067 similar to UP|Q9LQH2 (Q9LQH2) F15O4.13, partial (9%)
Length = 589
Score = 32.0 bits (71), Expect = 2.0
Identities = 10/31 (32%), Positives = 20/31 (64%)
Frame = +2
Query: 703 FWLKEVSFLGHVISGDGIAVDPSKVEAVSQW 733
F + + F G V+ +G+ +DP K++A+ +W
Sbjct: 2 FCVDNIFFSGFVVGRNGVQMDPEKIKAIQEW 94
>CO981082
Length = 670
Score = 31.6 bits (70), Expect = 2.6
Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 1/68 (1%)
Frame = +1
Query: 366 EMVVETPAKGSV-TTSLVCLRCPLSMFGRDFEMDLVCLPLSGMDVILGMNWLEYKHVHIN 424
E+ V +P K T S++CL +SMF F ++C ++ +YKH H
Sbjct: 229 ELWVTSPTKAKAFTMSIICLCSCISMFPGHFPFSIICTVSEMQTNSETISTTKYKHTH-- 402
Query: 425 CFSKSVYF 432
+ VYF
Sbjct: 403 TLNSCVYF 426
>BE348096
Length = 445
Score = 31.2 bits (69), Expect = 3.5
Identities = 17/39 (43%), Positives = 24/39 (60%)
Frame = -1
Query: 826 RR*SGGLCFKTVENS*EELPDA*SRVGGCGLCIEDMETL 864
RR S LC ++S ++L A*SRVG G+C+ +E L
Sbjct: 187 RRPSSDLCSSIGQDSLKDLSYA*SRVGSHGVCLHILEAL 71
>CD408465
Length = 630
Score = 30.8 bits (68), Expect = 4.5
Identities = 14/40 (35%), Positives = 22/40 (55%), Gaps = 1/40 (2%)
Frame = -1
Query: 486 VYPDEIPD-VPPEREVEFSINLVPGTKPVSMAPYRMSASE 524
++P EIP +PP R +E ++L+PG S Y+ E
Sbjct: 630 IFPKEIPHGLPPSRGIEHQVDLLPGASLPSRPTYKSYPQE 511
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.360 0.161 0.597
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 64,996,846
Number of Sequences: 63676
Number of extensions: 924836
Number of successful extensions: 8985
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 4780
Number of HSP's successfully gapped in prelim test: 294
Number of HSP's that attempted gapping in prelim test: 4046
Number of HSP's gapped (non-prelim): 5389
length of query: 1445
length of database: 12,639,632
effective HSP length: 109
effective length of query: 1336
effective length of database: 5,698,948
effective search space: 7613794528
effective search space used: 7613794528
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.8 bits)
S2: 65 (29.6 bits)
Medicago: description of AC146805.1