
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147714.2 - phase: 0 /pseudo
(971 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BF003873 similar to GP|14715222|em putative polyprotein {Cicer a... 228 7e-60
BI265348 196 3e-50
TC77595 weakly similar to PIR|T18350|T18350 probable pol polypro... 96 7e-20
BG454871 weakly similar to GP|10140673|g putative gag-pol polypr... 72 9e-17
BG644699 similar to PIR|T07863|T078 probable polyprotein - pinea... 65 1e-10
TC84979 63 6e-10
BG587176 weakly similar to PIR|G84493|G84 probable retroelement ... 55 1e-07
TC91755 weakly similar to PIR|G75077|G75077 hypothetical protein... 55 1e-07
BF520058 40 6e-05
BG646285 46 8e-05
BE941429 weakly similar to GP|12323718|gb| hypothetical protein;... 43 7e-04
BG586308 weakly similar to PIR|F84528|F8 probable retroelement p... 34 0.24
BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana... 33 0.53
BQ122462 33 0.53
BF636649 similar to GP|21628724|emb OSJNBa0033H08.7 {Oryza sativ... 32 0.91
TC76794 similar to GP|7416846|dbj|BAA94084.1 NAD-dependent sorbi... 30 4.5
BE325109 similar to PIR|T04011|T040 hypothetical protein T5L19.2... 30 5.9
BG586326 similar to PIR|G84493|G8 probable retroelement pol poly... 30 5.9
BF647164 weakly similar to GP|6691191|gb F7F22.15 {Arabidopsis t... 29 7.7
>BF003873 similar to GP|14715222|em putative polyprotein {Cicer arietinum},
partial (82%)
Length = 559
Score = 228 bits (582), Expect = 7e-60
Identities = 114/126 (90%), Positives = 117/126 (92%)
Frame = +2
Query: 846 GVXRALKSKKLTPKFIGPYQILERVGTVAYRVGLPPHLSNLHNVFHVSQLRKYVPDPSHV 905
GV RALKSKKLT +FIGPYQI ERVGTVAYRVGLPPHL NLH+VFHVSQLRKYVPDPSHV
Sbjct: 2 GVGRALKSKKLTVRFIGPYQISERVGTVAYRVGLPPHLLNLHDVFHVSQLRKYVPDPSHV 181
Query: 906 IQSDDVQVRDNLTVETLPVRIDDRKVKMLRGKEIPLVRVVWTGATSESLTWELESKMLES 965
IQSDDVQVRDNLTVETLPVRIDDRKVK LRGKEIPLVRVVW A ESLTWELESKM+ES
Sbjct: 182 IQSDDVQVRDNLTVETLPVRIDDRKVKTLRGKEIPLVRVVWDRANGESLTWELESKMVES 361
Query: 966 YPELFA 971
YPELFA
Sbjct: 362 YPELFA 379
>BI265348
Length = 556
Score = 196 bits (499), Expect = 3e-50
Identities = 121/181 (66%), Positives = 132/181 (72%), Gaps = 7/181 (3%)
Frame = +1
Query: 144 IQLTFGND*RRR--EL*VWGFQTNEN*YNK-RSLGIVGFI*ITSPSQTKLKAYNVMLDLI 200
IQ T ND* + + VW F N K +SLGI+G * + YN+MLDLI
Sbjct: 22 IQSTLDND*TKGL*DRKVWVF*LKVNNKRKDKSLGIIGPS**QVLHKPNY*TYNLMLDLI 201
Query: 201 SQDSLSYVPLATKPISLT*LLLDFGSFNIQAYFADLIVISSLEDRNYMSQRLQRLFRCLC 260
SQDS S+VPLATKPISLT*LLLDFGS N QAYFADLI+ISSLEDRNY+SQRL RLFRCL
Sbjct: 202 SQDSFSFVPLATKPISLT*LLLDFGSSNNQAYFADLIIISSLEDRNYISQRL*RLFRCLY 381
Query: 261 NPCLNSLV*LSKLHFRFMN*YLE*WINNYQTLHFRFHS----LIMVDPC*SGEFR*EIRV 316
N CLNSLV +SKLHFRFMN*YL *WINNYQTLHFRFHS LI V +GE R EIRV
Sbjct: 382 NLCLNSLVRISKLHFRFMN*YLR*WINNYQTLHFRFHSFDNGLIHVK---AGELRLEIRV 552
Query: 317 T 317
T
Sbjct: 553 T 555
>TC77595 weakly similar to PIR|T18350|T18350 probable pol polyprotein - rice
blast fungus gypsy retroelement (fragment), partial
(14%)
Length = 1708
Score = 95.9 bits (237), Expect = 7e-20
Identities = 71/229 (31%), Positives = 108/229 (47%), Gaps = 6/229 (2%)
Frame = +2
Query: 680 GVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQSERTIQSLEDLLRICVL 739
G+P SIVSDR + RFW+ G LS++YHPQTDG +ER Q ++ +LR V
Sbjct: 563 GMPQSIVSDRGSNWVGRFWREFCRLTGVTQLLSTSYHPQTDGGTERWNQEIQAVLRAYVC 742
Query: 740 EQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEALYGRRCRTPLCWFESGERVVLGPE---- 795
W LP ++ N ++SSIG PF +G P+ E VV E
Sbjct: 743 WSQDNWGDLLPTVQLALRNRHNSSIGATPFFVEHGYHV-DPIPTVEDTGGVVSEGEAAAQ 919
Query: 796 -IVQQTTEKVQMIQEKMKASQSRQKSYHDKRRKDLE-FQEGDHVFLRVTPMTGVXRALKS 853
+V++ + IQ ++ A+Q R ++ +KRR + +Q GD V+L V+ + K
Sbjct: 920 LLVKRMKDVTGFIQAEIVAAQQRSEASANKRRCPADRYQVGDKVWLNVSNYKSPRPSKKL 1099
Query: 854 KKLTPKFIGPYQILERVGTVAYRVGLPPHLSNLHNVFHVSQLRKYVPDP 902
L K Y++ V + +P ++ FHV LR+ DP
Sbjct: 1100DWLHHK----YEVTRFVTPHVVELNVP---GTVYPKFHVDLLRRAASDP 1225
>BG454871 weakly similar to GP|10140673|g putative gag-pol polyprotein {Oryza
sativa (japonica cultivar-group)}, partial (7%)
Length = 674
Score = 72.4 bits (176), Expect(2) = 9e-17
Identities = 37/81 (45%), Positives = 46/81 (56%)
Frame = +2
Query: 695 SRFWKSLQEALGSKLRLSSAYHPQTDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEF 754
S FWK L + G+ L +SSAYHP +DGQSE + E LR + W P E+
Sbjct: 32 SNFWKQLFKLHGTILTMSSAYHP*SDGQSEALNKGXEMYLRCLMFTDPLKWSKAFPWAEY 211
Query: 755 TYNNSYHSSIGMAPFEALYGR 775
YN SY+ S M PF+ALYGR
Sbjct: 212 WYNTSYNISAAMTPFKALYGR 274
Score = 33.5 bits (75), Expect(2) = 9e-17
Identities = 22/73 (30%), Positives = 34/73 (46%), Gaps = 2/73 (2%)
Frame = +1
Query: 814 SQSRQKSYHDKRRKDLEFQEGDHVFLRVTPMTGVXRALK--SKKLTPKFIGPYQILERVG 871
+Q K DK+R+ EFQ G+HV +++ P AL+ K +P F +
Sbjct: 388 AQQTMKHQADKKRRHFEFQLGEHVLVKLQPYQQSSVALRKYQKFGSPNFGSLLTVCSL*V 567
Query: 872 TVAYRVGLPPHLS 884
A+ PP+LS
Sbjct: 568 ESAFHCKSPPYLS 606
>BG644699 similar to PIR|T07863|T078 probable polyprotein - pineapple
retrotransposon dea1 (fragment), partial (5%)
Length = 231
Score = 65.1 bits (157), Expect = 1e-10
Identities = 31/74 (41%), Positives = 50/74 (66%), Gaps = 1/74 (1%)
Frame = +2
Query: 835 DHVFLRVTPMT-GVXRALKSKKLTPKFIGPYQILERVGTVAYRVGLPPHLSNLHNVFHVS 893
+ V L+V P G R K KL+ ++IGP+++++R+G VAY + LPP LS +H VFHVS
Sbjct: 2 EQVLLKVLPTERGDCRFGKRGKLSLRYIGPFEVIKRIGEVAYELALPPGLSGVHPVFHVS 181
Query: 894 QLRKYVPDPSHVIQ 907
++Y D +++I+
Sbjct: 182 MFKRYHGDGNYIIR 223
>TC84979
Length = 641
Score = 62.8 bits (151), Expect = 6e-10
Identities = 38/49 (77%), Positives = 40/49 (81%)
Frame = -1
Query: 469 PSNIHS*NKIKCNL**TILKRK*Y*YIIK*NLNKTKLKSI*K*SISDSS 517
PSNIHS*N IKCNL**TI KRK*Y*Y I NL K KL +I*K* I+DSS
Sbjct: 641 PSNIHS*NIIKCNL**TISKRK*Y*YKINLNLIKIKLNNI*K*IINDSS 495
>BG587176 weakly similar to PIR|G84493|G84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 729
Score = 55.5 bits (132), Expect = 1e-07
Identities = 26/54 (48%), Positives = 35/54 (64%)
Frame = -1
Query: 917 LTVETLPVRIDDRKVKMLRGKEIPLVRVVWTGATSESLTWELESKMLESYPELF 970
L +ET PVRI DR K +R K I +V++VW + E +TWE E++M YPE F
Sbjct: 717 LDLETRPVRILDRMEKAMRKKPIQMVKIVWDCSGREEITWETEARMKADYPEWF 556
>TC91755 weakly similar to PIR|G75077|G75077 hypothetical protein PAB1697 -
Pyrococcus abyssi (strain Orsay), partial (4%)
Length = 746
Score = 55.5 bits (132), Expect = 1e-07
Identities = 36/71 (50%), Positives = 45/71 (62%), Gaps = 7/71 (9%)
Frame = +3
Query: 392 DGWLLYL*HTLDLKSKQLH-------LG*K*SSLKSKQKAAKGTLEGGTAVPRLARAVPS 444
DG LLYL*+T+D KSKQ++ + + S+ K++K LEGGTAVPRLARAVPS
Sbjct: 48 DG*LLYL*YTID*KSKQVYS*NIPR*IVRQNRIALSQSKSSKRHLEGGTAVPRLARAVPS 227
Query: 445 F*LWGDIFLHP 455
F G L P
Sbjct: 228 FLTCGRYILFP 260
Score = 40.8 bits (94), Expect(2) = 1e-06
Identities = 30/59 (50%), Positives = 32/59 (53%), Gaps = 7/59 (11%)
Frame = +1
Query: 421 KQKAAKGTLEGGTAVPRLARAVPSF*LWGDIF-------LHPIGSKSLSFAPRLNPSNI 472
K KAAKG +F*L GDIF L PI SKSLSFAPRL+PSNI
Sbjct: 157 KAKAAKGI*RVARPCQG*HGPCQAF*LVGDIFCFPIFMFLCPICSKSLSFAPRLSPSNI 333
Score = 30.4 bits (67), Expect(2) = 1e-06
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 9/61 (14%)
Frame = +2
Query: 377 KQLSLIVFQVKKNICDGWL---------LYL*HTLDLKSKQLHLG*K*SSLKSKQKAAKG 427
KQ+SLIVFQVK N+ WL L +* ++ L+ + K*+S +SKQK K
Sbjct: 2 KQMSLIVFQVKMNV-**WLATLFIVHN*LKI*ASIFLEYSSVDCAPK*NSFESKQKQQKA 178
Query: 428 T 428
+
Sbjct: 179 S 181
>BF520058
Length = 616
Score = 40.0 bits (92), Expect(2) = 6e-05
Identities = 19/24 (79%), Positives = 20/24 (83%)
Frame = -1
Query: 387 KKNICDGWLLYL*HTLDLKSKQLH 410
K NICDG LLYL* T DL+SKQLH
Sbjct: 556 KMNICDG*LLYL*FTFDLESKQLH 485
Score = 25.4 bits (54), Expect(2) = 6e-05
Identities = 13/22 (59%), Positives = 15/22 (68%)
Frame = -3
Query: 414 K*SSLKSKQKAAKGTLEGGTAV 435
K*SSLKSKQK KG + T +
Sbjct: 458 K*SSLKSKQKQQKGIMGRATFI 393
>BG646285
Length = 640
Score = 45.8 bits (107), Expect = 8e-05
Identities = 32/61 (52%), Positives = 38/61 (61%), Gaps = 1/61 (1%)
Frame = -1
Query: 118 LMR*NRKCTVLPK*YKKIIIPTGSSLIQLTFGND*RRREL*VWGFQ-TNEN*YNKRSLGI 176
LM *N KCTVL K* K I+PTGS L QLT +D*+R V F EN KR+LG+
Sbjct: 295 LMG*NYKCTVLSK**K*SIVPTGSCLSQLTLVDD*KRLSCKVGVFNWLKENIIIKRALGL 116
Query: 177 V 177
+
Sbjct: 115 L 113
Score = 36.2 bits (82), Expect = 0.063
Identities = 26/53 (49%), Positives = 31/53 (58%), Gaps = 5/53 (9%)
Frame = -3
Query: 160 WGFQTNEN*YN-KRSLGIVGFI*ITSPSQTKLKAYNVMLDLISQD----SLSY 207
WGFQ E YN K+S GI+G * + K N+MLDLIS+D SLSY
Sbjct: 170 WGFQLVERKYNNKKSFGIIGSA**QILHKPKP*T*NLMLDLISKDRFLFSLSY 12
>BE941429 weakly similar to GP|12323718|gb| hypothetical protein; 28267-27009
{Arabidopsis thaliana}, partial (7%)
Length = 493
Score = 42.7 bits (99), Expect = 7e-04
Identities = 37/84 (44%), Positives = 45/84 (53%), Gaps = 8/84 (9%)
Frame = -2
Query: 115 ICPLMR*NRKCTVLPK*YKKIIIPTGS-------SLIQLTFGND*RRREL*VWGFQTNEN 167
+C LMR*NRKCTVL K* I+PTGS +I+L F ++ +GF
Sbjct: 306 LCALMR*NRKCTVLSK**N*SIVPTGSYEFNQL*IMIRLKF------YKIESFGFLIESK 145
Query: 168 *YNKR-SLGIVGFI*ITSPSQTKL 190
* KR FI*+TS SQTKL
Sbjct: 144 **KKR*KPWDYWFI*LTSSSQTKL 73
Score = 38.5 bits (88), Expect = 0.013
Identities = 21/42 (50%), Positives = 27/42 (64%)
Frame = -3
Query: 172 RSLGIVGFI*ITSPSQTKLKAYNVMLDLISQDSLSYVPLATK 213
+SLGI+G + YN+MLDL S+DS S+VPLATK
Sbjct: 128 KSLGIIGSSN*QVLHKPNY*TYNLMLDLTSKDSFSFVPLATK 3
>BG586308 weakly similar to PIR|F84528|F8 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(7%)
Length = 686
Score = 34.3 bits (77), Expect = 0.24
Identities = 20/70 (28%), Positives = 36/70 (50%)
Frame = -2
Query: 680 GVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQSERTIQSLEDLLRICVL 739
G+P IV+D F S ++ E +L +S +PQ++GQ+E + + + D L+ +
Sbjct: 682 GLPYEIVTDNGSHFISNKFREFCERWRIRLNTASPRYPQSNGQAEASNKIIIDGLKKRLD 503
Query: 740 EQGGTWDSHL 749
+ G W L
Sbjct: 502 LKKGCWADEL 473
>BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana}, partial
(10%)
Length = 624
Score = 33.1 bits (74), Expect = 0.53
Identities = 14/45 (31%), Positives = 28/45 (62%)
Frame = +2
Query: 680 GVPSSIVSDRDPRFTSRFWKSLQEALGSKLRLSSAYHPQTDGQSE 724
GVP ++SD F ++ ++ L + G + ++++AYHPQ +S+
Sbjct: 482 GVPRVVISDGGSHFINKVFEKLLKKNGVRHKVATAYHPQKAERSK 616
>BQ122462
Length = 757
Score = 33.1 bits (74), Expect = 0.53
Identities = 27/45 (60%), Positives = 27/45 (60%), Gaps = 2/45 (4%)
Frame = -2
Query: 463 FAPRLNPSNI--HS*NKIKCNL**TILKRK*Y*YIIK*NLNKTKL 505
FAPRLNPSNI *N L** KRK Y Y *NLNKT L
Sbjct: 756 FAPRLNPSNIILEI*N---MQL**YNSKRKYYKYKYN*NLNKTIL 631
>BF636649 similar to GP|21628724|emb OSJNBa0033H08.7 {Oryza sativa}, partial
(4%)
Length = 653
Score = 32.3 bits (72), Expect = 0.91
Identities = 35/166 (21%), Positives = 66/166 (39%), Gaps = 4/166 (2%)
Frame = +1
Query: 719 TDGQSERTIQSLEDLLRICVLEQGGTWDSHLPLIEFTYNNSYHSSIGMAPFEALYGRRCR 778
+D Q+ +LE L EQ G + L E YN ++H + G PF+ +Y
Sbjct: 7 SDEQAGLLNHTLETHLLYFTSEQQGV*NFFLTWAECLYNTNFHRTAGCTPFKVVY---VV 177
Query: 779 TPLCWFESGERVVLGPEIVQQTTEKVQMIQEKMKASQSRQKSY----HDKRRKDLEFQEG 834
L F ++ E + +++ + + +Y H +R + +
Sbjct: 178 AHLQKFVVARDLIYRNEGLHKSST*TSFGRGTRAYEALSRPAYETC*HPRRPLSIVYTRD 357
Query: 835 DHVFLRVTPMTGVXRALKSKKLTPKFIGPYQILERVGTVAYRVGLP 880
+V P K GPYQ+++++G+VA+++ LP
Sbjct: 358 RTYEWQVLP-----------KYVA*CYGPYQVIKQIGSVAFKL*LP 462
>TC76794 similar to GP|7416846|dbj|BAA94084.1 NAD-dependent sorbitol
dehydrogenase {Prunus persica}, partial (93%)
Length = 1691
Score = 30.0 bits (66), Expect = 4.5
Identities = 12/21 (57%), Positives = 17/21 (80%)
Frame = -2
Query: 884 SNLHNVFHVSQLRKYVPDPSH 904
SNL N+FHVS + YVP+P++
Sbjct: 1690 SNLINIFHVS*FKVYVPNPTN 1628
>BE325109 similar to PIR|T04011|T040 hypothetical protein T5L19.200 -
Arabidopsis thaliana, partial (9%)
Length = 430
Score = 29.6 bits (65), Expect = 5.9
Identities = 14/43 (32%), Positives = 25/43 (57%)
Frame = +1
Query: 699 KSLQEALGSKLRLSSAYHPQTDGQSERTIQSLEDLLRICVLEQ 741
KSLQ G++++L + P+ D ERT+Q D +I + ++
Sbjct: 79 KSLQTKTGARIQLIPQHLPEGDDSKERTVQVTGDKRQIEIAQE 207
>BG586326 similar to PIR|G84493|G8 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (13%)
Length = 736
Score = 29.6 bits (65), Expect = 5.9
Identities = 26/64 (40%), Positives = 30/64 (46%)
Frame = +3
Query: 23 QDSSEFMRRITRRMI*SWRLWSLY*RYGDIACMVRGLRCLVITRA*SICSIRKN*I*GIE 82
Q S E MR T MI* W *R+G CMV R + + *SI +* *G
Sbjct: 126 QGS*ENMRETTPPMI*KWLR*YSP*RFGAHTCMVPRFRYIRTIKV*SIFLPSLS*T*GRG 305
Query: 83 GGSN 86
GG N
Sbjct: 306 GGWN 317
>BF647164 weakly similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana},
partial (6%)
Length = 469
Score = 29.3 bits (64), Expect = 7.7
Identities = 19/58 (32%), Positives = 29/58 (49%)
Frame = +1
Query: 809 EKMKASQSRQKSYHDKRRKDLEFQEGDHVFLRVTPMTGVXRALKSKKLTPKFIGPYQI 866
E K + R K +HD+R EF+EG+ V L + + L KL + GP+Q+
Sbjct: 142 ENAKIYKERTKKWHDRRIIRREFREGELVLLFNSRL-----KLFPGKLRSHWSGPFQV 300
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.350 0.155 0.524
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 27,667,229
Number of Sequences: 36976
Number of extensions: 364592
Number of successful extensions: 3071
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 1258
Number of HSP's successfully gapped in prelim test: 129
Number of HSP's that attempted gapping in prelim test: 1775
Number of HSP's gapped (non-prelim): 1455
length of query: 971
length of database: 9,014,727
effective HSP length: 105
effective length of query: 866
effective length of database: 5,132,247
effective search space: 4444525902
effective search space used: 4444525902
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 63 (28.9 bits)
Medicago: description of AC147714.2