
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC143339.8 + phase: 0 /pseudo
(518 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imp... 198 5e-51
TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non... 153 1e-37
BE248682 similar to GP|18568269|gb putative gag-pol polyprotein ... 115 4e-26
CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [i... 103 2e-22
AW686588 76 4e-14
TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarot... 74 2e-13
BE999296 65 5e-11
BE941032 63 3e-10
BF650593 62 4e-10
TC82520 59 3e-09
BF520135 57 1e-08
BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-li... 57 2e-08
BG456581 52 4e-07
AW774658 similar to GP|2808681|emb| Hcr9-4B {Lycopersicon hirsut... 50 2e-06
BG647708 weakly similar to GP|13786450|gb| putative reverse tran... 50 2e-06
TC83226 weakly similar to PIR|G86419|G86419 probable reverse tra... 42 7e-04
BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At... 41 0.001
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p... 41 0.001
BF006686 40 0.002
BG585866 40 0.003
>TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imported] -
Arabidopsis thaliana, partial (6%)
Length = 951
Score = 198 bits (503), Expect = 5e-51
Identities = 107/197 (54%), Positives = 137/197 (69%)
Frame = +2
Query: 1 ICTATMYVLVNGSPTPEFPLERGLRKGDPLSQFLFLLAAEGLNVMMQSMVEINIFSGYSI 60
+ TAT VLVNGSPT NV+M+S+V+ +F+ YS
Sbjct: 422 VSTATTSVLVNGSPT---------------------------NVLMKSLVQTQLFTRYSF 520
Query: 61 GEHDSMVISHLQFADDTLLVGVKSWANVRALRAVLLLLR*CRSGLKMNFHKSLLVGVNVS 120
G + +V+SHLQFA+DTLL+ K+WAN+RALRA L++ * SGLK+NFHKS LV VN++
Sbjct: 521 GVVNPVVVSHLQFANDTLLLETKNWANIRALRAALVIF-*AMSGLKVNFHKSGLVCVNIA 697
Query: 121 ESWLNEATTMLGCKVGKVPFLYLGLSIGGDPRRLLFWESVVDRVKKRLSGWRSKFLSFGG 180
SWL+EA ++L KVGKVPFLYLG+ I G+ RRL FWE +V+R+K RL+GW S+FLSFGG
Sbjct: 698 PSWLSEAASVLSWKVGKVPFLYLGMPIEGNSRRLSFWEPIVNRIKARLTGWNSRFLSFGG 877
Query: 181 RLVLLKVFLSSLHVYAL 197
RLVLLK L+SL VYAL
Sbjct: 878 RLVLLKSVLTSLSVYAL 928
>TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non-LTR
retroelement reverse transcriptase {Oryza sativa
(japonica cultivar-group)}, partial (2%)
Length = 1262
Score = 153 bits (387), Expect = 1e-37
Identities = 73/144 (50%), Positives = 94/144 (64%), Gaps = 2/144 (1%)
Frame = +2
Query: 375 WQWRRRLWAWEEELLGECRILLSDITLQHT--DRWVWRLDPSNGYSVNGVYQMLTSQPVQ 432
W+W RRL+ WEEE + EC ILL++ LQ D+W W LDP NGYSV Y+ +TS
Sbjct: 422 WEWTRRLFVWEEECVRECCILLNNFVLQDNVNDKWRWLLDPVNGYSVKVFYRYITSTGHI 601
Query: 433 TSEVLLDLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVIGCGDFET 492
+ L+D +WHK +P KVS+F WRLL+NRLPTKDNL GV+ + N CV GC D E+
Sbjct: 602 SDRSLVDDVWHKHIPSKVSLFVWRLLRNRLPTKDNLVHRGVLLAT--NAACVCGCVDSES 775
Query: 493 AQHLFRSCPFYVALWGQIRSWLGI 516
HLF C + +LW +R+WLGI
Sbjct: 776 TTHLFLHCNVFCSLWSLVRNWLGI 847
>BE248682 similar to GP|18568269|gb putative gag-pol polyprotein {Zea mays},
partial (1%)
Length = 441
Score = 115 bits (288), Expect = 4e-26
Identities = 59/132 (44%), Positives = 88/132 (65%)
Frame = +3
Query: 17 EFPLERGLRKGDPLSQFLFLLAAEGLNVMMQSMVEINIFSGYSIGEHDSMVISHLQFADD 76
E ++RGL++GDPL+ FLFLL AEG++ +M++ V N+F G+ + + V SHLQ+ADD
Sbjct: 9 EISVQRGLKQGDPLAPFLFLLVAEGISGLMKNAVNRNLFQGFDVKRGGTRV-SHLQYADD 185
Query: 77 TLLVGVKSWANVRALRAVLLLLR*CRSGLKMNFHKSLLVGVNVSESWLNEATTMLGCKVG 136
TL +G+ + N+ L+A+L SGLK+NFHKS L+G+NV ++ A L C+
Sbjct: 186 TLCIGMPTVDNLWTLKALLQGFE-MASGLKVNFHKSSLIGINVPRDFMEAACRFLNCREE 362
Query: 137 KVPFLYLGLSIG 148
+PF+YLGL G
Sbjct: 363 SIPFIYLGLPGG 398
>CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [imported] -
Sulfolobus solfataricus, partial (3%)
Length = 789
Score = 103 bits (256), Expect = 2e-22
Identities = 45/77 (58%), Positives = 58/77 (74%)
Frame = -2
Query: 439 DLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVIGCGDFETAQHLFR 498
++IWH+QVPLKVS+FAWRLL++RLPTK NL GVIP + CV GCG E+AQHLF
Sbjct: 590 EMIWHRQVPLKVSVFAWRLLRDRLPTKSNLIYRGVIP--TEAGLCVSGCGALESAQHLFL 417
Query: 499 SCPFYVALWGQIRSWLG 515
SC ++ +LW +R W+G
Sbjct: 416 SCSYFASLWSLVRDWIG 366
>AW686588
Length = 567
Score = 75.9 bits (185), Expect = 4e-14
Identities = 36/72 (50%), Positives = 49/72 (68%)
Frame = +1
Query: 446 VPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVIGCGDFETAQHLFRSCPFYVA 505
VPLKVSI AWRL+++RLPTK NL + + + CV+GCG ETA HLF C + A
Sbjct: 142 VPLKVSILAWRLIRDRLPTKANLVRRRCL--AVEAAGCVVGCGIAETANHLFLHCATFGA 315
Query: 506 LWGQIRSWLGIA 517
+W IR+W+G++
Sbjct: 316 VWQHIRAWIGVS 351
>TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarotenoid
dioxygenase1 {Pisum sativum}, partial (43%)
Length = 1865
Score = 73.6 bits (179), Expect = 2e-13
Identities = 49/147 (33%), Positives = 72/147 (48%), Gaps = 9/147 (6%)
Frame = -2
Query: 377 WRR-RLWAWEEELLGECRILLSDITLQH-TDRWVWRLDPSNGYSVNGVY------QMLTS 428
WRR L+ WE+E L E L + L++ D WVW+ D +SVN Y ++L
Sbjct: 538 WRRLELFEWEKERLLELLGRLEGVVLRYWADIWVWKPDKEGVFSVNSCYFLLQNLRLLED 359
Query: 429 QPVQTSEVLLDLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVI-GC 487
+ EV+ +W + P KV F+W L +R+PT NL ++ D++ CV GC
Sbjct: 358 RLSYEEEVIFRELWKSKAPAKVLAFSWTLFLDRIPTMVNLGKRRLL-RVEDSKRCVFCGC 182
Query: 488 GDFETAQHLFRSCPFYVALWGQIRSWL 514
D ET HLF C + ++ WL
Sbjct: 181 QD-ETVVHLFLHCDVISKV*REVMRWL 104
>BE999296
Length = 384
Score = 65.5 bits (158), Expect = 5e-11
Identities = 37/99 (37%), Positives = 46/99 (46%)
Frame = -1
Query: 419 VNGVYQMLTSQPVQTSEVLLDLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSH 478
++G YQ L S +D +WH +PLKV F R+L+N LPTKDN VI H
Sbjct: 306 IHGAYQFLMSADAPLDREYIDSVWHNHIPLKVCFFVLRVLRNCLPTKDNFVRRRVIHEEH 127
Query: 479 DNQHCVIGCGDFETAQHLFRSCPFYVALWGQIRSWLGIA 517
C GC ET LF LW + WL I+
Sbjct: 126 --MLCPTGCSFKETTDDLF--------LWPLV*QWLHIS 40
>BE941032
Length = 435
Score = 62.8 bits (151), Expect = 3e-10
Identities = 37/93 (39%), Positives = 49/93 (51%), Gaps = 1/93 (1%)
Frame = +2
Query: 423 YQML-TSQPVQTSEVLLDLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQ 481
YQ L S P+ TS + + + ++ SIF W +L NR PTKDNL GVI + Q
Sbjct: 59 YQCL*FSSPISTSLQMTTVYFDIKMFF*KSIFLWCVLLNRFPTKDNLLKRGVISAIY--Q 232
Query: 482 HCVIGCGDFETAQHLFRSCPFYVALWGQIRSWL 514
CV CG+ A HLF C F+ +W + WL
Sbjct: 233 SCVGECGNLYDATHLFLHCNFFRQIWINVSDWL 331
>BF650593
Length = 486
Score = 62.4 bits (150), Expect = 4e-10
Identities = 30/60 (50%), Positives = 35/60 (58%)
Frame = +3
Query: 443 HKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVIGCGDFETAQHLFRSCPF 502
HK VPLKVS WRL QN L T+DNL GV+ ++ CV CG E+ H F CPF
Sbjct: 297 HKSVPLKVSCLVWRLFQNXLATRDNLSKRGVL--DQNSIXCVXDCGREESVSHFFFECPF 470
>TC82520
Length = 833
Score = 59.3 bits (142), Expect = 3e-09
Identities = 35/90 (38%), Positives = 49/90 (53%), Gaps = 2/90 (2%)
Frame = +3
Query: 421 GVYQMLT--SQPVQTSEVLLDLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSH 478
G Y+ LT QP+ ++V D +W K +P KVS+F WRL NRLPTK NL V+ +H
Sbjct: 126 GTYRFLTISGQPLDRNQV--DDVWQKNIPSKVSMFVWRLFHNRLPTKVNLMQRHVLQQTH 299
Query: 479 DNQHCVIGCGDFETAQHLFRSCPFYVALWG 508
C+ G QH+ ++AL+G
Sbjct: 300 --TACISGVA-IRKRQHICFYIVIFLALFG 380
Score = 46.6 bits (109), Expect = 2e-05
Identities = 41/140 (29%), Positives = 57/140 (40%), Gaps = 6/140 (4%)
Frame = +2
Query: 381 LWAWEEELLGECRILLSDITLQHT--DRWVWRLDPSNGYSVNGV----YQMLTSQPVQTS 434
L+AWEEE + E LL + LQ D W LDP GY+ + Y T +
Sbjct: 2 LFAWEEESVREWYALLHNTVLQENVHDVCRWLLDPI*GYTEGNISLSHYLWTTIG*ESS* 181
Query: 435 EVLLDLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVIGCGDFETAQ 494
L K V ++V+ + T + S ++ GCGD ETA
Sbjct: 182 RCLAKEYSFKGVYVRVASLSQ*-------TSHEG*FNAATCSSTNSHGLHFGCGDSETAT 340
Query: 495 HLFRSCPFYVALWGQIRSWL 514
HLF C + +LW + WL
Sbjct: 341 HLFLHCDIFGSLWSHVLRWL 400
>BF520135
Length = 202
Score = 57.4 bits (137), Expect = 1e-08
Identities = 29/61 (47%), Positives = 35/61 (56%)
Frame = +3
Query: 440 LIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVIGCGDFETAQHLFRS 499
L+W K+ KVSIFAWRL RLPTK N+ G++ HD CV C E+ HLF
Sbjct: 12 LLWRKEDLSKVSIFAWRLFHGRLPTKANVFKRGIV--HHDAHMCVTRCRLIESDVHLFLH 185
Query: 500 C 500
C
Sbjct: 186 C 188
>BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-like protein
{Arabidopsis thaliana}, partial (18%)
Length = 789
Score = 57.0 bits (136), Expect = 2e-08
Identities = 37/114 (32%), Positives = 58/114 (50%)
Frame = -3
Query: 1 ICTATMYVLVNGSPTPEFPLERGLRKGDPLSQFLFLLAAEGLNVMMQSMVEINIFSGYSI 60
+ T + L+NG P RGLR+GDPLS +LF+L E L+ + Q + G +
Sbjct: 451 VSTVSYSFLINGGPQGRVLPSRGLRQGDPLSPYLFILCTEVLSGLCQQALRKGTLPGVKV 272
Query: 61 GEHDSMVISHLQFADDTLLVGVKSWANVRALRAVLLLLR*CRSGLKMNFHKSLL 114
+ I+HL FADDT+ G + ++ L +++ R SG +N KS +
Sbjct: 271 A-RNCPPINHLLFADDTMFFGKSNASSCAILLSIMDKYR-AASGRCIN*TKSAI 116
>BG456581
Length = 683
Score = 52.4 bits (124), Expect = 4e-07
Identities = 35/93 (37%), Positives = 40/93 (42%)
Frame = +2
Query: 422 VYQMLTSQPVQTSEVLLDLIWHKQVPLKVSIFAWRLLQNRLPTKDNLCVHGVIPHSHDNQ 481
VY LTS + IW+ +P S WRL +RLPT DNL G S
Sbjct: 29 VYSFLTSHT--SCAPWASTIWNSCIPPSHSFICWRLAHDRLPTDDNLSSRGCALVS---- 190
Query: 482 HCVIGCGDFETAQHLFRSCPFYVALWGQIRSWL 514
C ET+ HLF C F V LW SWL
Sbjct: 191 MCSFCLEQVETSDHLFLRCKFVVTLW----SWL 277
>AW774658 similar to GP|2808681|emb| Hcr9-4B {Lycopersicon hirsutum}, partial
(4%)
Length = 665
Score = 50.4 bits (119), Expect = 2e-06
Identities = 24/40 (60%), Positives = 29/40 (72%)
Frame = -1
Query: 48 SMVEINIFSGYSIGEHDSMVISHLQFADDTLLVGVKSWAN 87
+++ +I SIG H V SHLQFADDTLL+GVKSWAN
Sbjct: 446 NLLSHSICLNLSIGMHSLTVFSHLQFADDTLLLGVKSWAN 327
>BG647708 weakly similar to GP|13786450|gb| putative reverse transcriptase
{Oryza sativa}, partial (9%)
Length = 708
Score = 50.4 bits (119), Expect = 2e-06
Identities = 37/111 (33%), Positives = 57/111 (51%)
Frame = +1
Query: 21 ERGLRKGDPLSQFLFLLAAEGLNVMMQSMVEINIFSGYSIGEHDSMVISHLQFADDTLLV 80
E+GLR+GDPLS +LF+L A L+ +++ G + D I+HL FADD+LL
Sbjct: 7 EKGLRQGDPLSPYLFILCANVLSGLLKREGNKQNLHGIQVARSDPK-ITHLLFADDSLLF 183
Query: 81 GVKSWANVRALRAVLLLLR*CRSGLKMNFHKSLLVGVNVSESWLNEATTML 131
+ + VL + SG +NF KS V+ S++ N+ M+
Sbjct: 184 ARANLTEAATIMQVLHSYQ-SASGQLVNFEKS---EVSYSQNVPNQEKEMI 324
>TC83226 weakly similar to PIR|G86419|G86419 probable reverse transcriptase
100033-105622 [imported] - Arabidopsis thaliana, partial
(2%)
Length = 885
Score = 41.6 bits (96), Expect = 7e-04
Identities = 32/114 (28%), Positives = 45/114 (39%), Gaps = 14/114 (12%)
Frame = +3
Query: 408 VWRLDPSNGYSVNGVYQMLTS---QPVQTSEVLLD--LIWHKQVPLKV----SIFAWRLL 458
+W +P+ YSV Y L + Q + + D LIW K L + WR+L
Sbjct: 3 MWMHNPTGIYSVKSGYNTLRTWQTQQINNTSTSSDETLIWKKIWSLHTIPRHKVLLWRIL 182
Query: 459 QNRLPTKDNLCVHGV-----IPHSHDNQHCVIGCGDFETAQHLFRSCPFYVALW 507
+ LP + +L G+ P H ET HLF SCP +W
Sbjct: 183 NDSLPVRSSLRKRGIQCYPLCPRCHSKT---------ETITHLFMSCPLSKRVW 317
>BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
[imported] - Arabidopsis thaliana, partial (10%)
Length = 767
Score = 41.2 bits (95), Expect = 0.001
Identities = 32/116 (27%), Positives = 45/116 (38%), Gaps = 13/116 (11%)
Frame = -3
Query: 405 DRWVWRLDPSNGYSVNGVYQMLTS------------QPVQTSEVLLDLIWHKQVPLKVSI 452
D + W S YSV Y + T+ QP + + L +W KV
Sbjct: 426 DSYSWEYSKSGHYSVKSGYYVQTNIIAAANQRGTVDQP--SLDDLYQRVWKYNTSPKVRH 253
Query: 453 FAWRLLQNRLPTKDNLCVHGVIPHSHDNQHCVIGCG-DFETAQHLFRSCPFYVALW 507
F WR + N LPT N+ H + C CG + ET H+ CP+ +W
Sbjct: 252 FLWRCISNSLPTAANMRSR----HISKDGSC-SRCGMESETVNHILFQCPYARLIW 100
>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 831
Score = 41.2 bits (95), Expect = 0.001
Identities = 19/57 (33%), Positives = 34/57 (59%)
Frame = +1
Query: 109 FHKSLLVGVNVSESWLNEATTMLGCKVGKVPFLYLGLSIGGDPRRLLFWESVVDRVK 165
F + + +N+ ES++ + L C V +VPF +LGL IG +P+R + V+D ++
Sbjct: 256 FSRVNFMALNLEESFVEASPNFLLCNVNEVPFCFLGLPIGANPKRSSTRKPVLDSLQ 426
>BF006686
Length = 325
Score = 40.0 bits (92), Expect = 0.002
Identities = 17/32 (53%), Positives = 22/32 (68%)
Frame = +3
Query: 154 LLFWESVVDRVKKRLSGWRSKFLSFGGRLVLL 185
L WE +++ V K L W +K LSFGGR+VLL
Sbjct: 228 LPMWEPLLEHVNKMLKSWGNKLLSFGGRIVLL 323
>BG585866
Length = 828
Score = 39.7 bits (91), Expect = 0.003
Identities = 27/109 (24%), Positives = 43/109 (38%), Gaps = 3/109 (2%)
Frame = +3
Query: 405 DRWVWRLDPSNGYSVNGVYQMLTSQP--VQTSEVLLDLIWHKQVPLKVSIFAWRLLQNRL 462
D ++W + + Y+ Y + SQ V + IW ++P K F W N +
Sbjct: 348 DAYIWPHNSNGVYTAKSGYSWILSQTETVNYNNSSWSWIWRLKIPEKYKFFLWLACHNAV 527
Query: 463 PTKDNLCVHGVIPHSHDNQHCVIGCGDFETA-QHLFRSCPFYVALWGQI 510
PT L ++ N CG+ E + H R C F +W +I
Sbjct: 528 PTLSLLNHRNMV-----NSAICSRCGEHEESFFHCVRDCRFSKIIWHKI 659
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.338 0.150 0.517
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 19,920,393
Number of Sequences: 36976
Number of extensions: 345947
Number of successful extensions: 2665
Number of sequences better than 10.0: 55
Number of HSP's better than 10.0 without gapping: 2591
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2644
length of query: 518
length of database: 9,014,727
effective HSP length: 100
effective length of query: 418
effective length of database: 5,317,127
effective search space: 2222559086
effective search space used: 2222559086
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 61 (28.1 bits)
Medicago: description of AC143339.8