
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC145219.13 - phase: 0 /pseudo
(702 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non... 140 2e-33
AW686588 126 3e-29
CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [i... 101 9e-22
TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imp... 79 6e-15
TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarot... 77 3e-14
BE941032 63 3e-10
BE999296 63 3e-10
BF520135 62 1e-09
BF650593 60 2e-09
TC82520 59 8e-09
BG456581 48 1e-05
BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At... 47 2e-05
TC83226 weakly similar to PIR|G86419|G86419 probable reverse tra... 42 8e-04
BG585866 37 0.026
TC81859 similar to GP|18491135|gb|AAL69536.1 At1g04290/F19P19_27... 36 0.044
TC80531 similar to GP|3426042|gb|AAC32241.1| unknown protein {Ar... 35 0.075
AW980456 35 0.098
TC93136 35 0.13
BG586862 32 0.64
BQ124492 weakly similar to GP|9909168|dbj| putative transposable... 31 1.4
>TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non-LTR
retroelement reverse transcriptase {Oryza sativa
(japonica cultivar-group)}, partial (2%)
Length = 1262
Score = 140 bits (353), Expect = 2e-33
Identities = 67/149 (44%), Positives = 91/149 (60%)
Frame = +2
Query: 554 LGGRMEEGAWNWRRPLWVWEEELVEECRKLLNGVVLQSDISDRWLWESNNDDVYTVRGAY 613
LG ++ AW W R L+VWEEE V EC LLN VLQ +++D+W W + + Y+V+ Y
Sbjct: 395 LGWTVDGRAWEWTRRLFVWEEECVRECCILLNNFVLQDNVNDKWRWLLDPVNGYSVKVFY 574
Query: 614 QILTTMDDPPIVGVGDLVWHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCAD 673
+ +T+ + D VWHK +P KVS+ W LLR+RLPTK NLV RG L A C
Sbjct: 575 RYITSTGHISDRSLVDDVWHKHIPSKVSLFVWRLLRNRLPTKDNLVHRGVLLATNAACVC 754
Query: 674 GCGIAETANHLFLHCTTFGAVWQHIRAWI 702
GC +E+ HLFLHC F ++W +R W+
Sbjct: 755 GCVDSESTTHLFLHCNVFCSLWSLVRNWL 841
Score = 31.6 bits (70), Expect = 1.1
Identities = 16/28 (57%), Positives = 17/28 (60%)
Frame = +2
Query: 443 C*VNGVGECWWIEVVFGIGCWW*DMVRR 470
C* NGVG WWI+ GI C D VRR
Sbjct: 62 C*ENGVGVFWWIKRGCGIEC*KLDTVRR 145
>AW686588
Length = 567
Score = 126 bits (316), Expect = 3e-29
Identities = 58/67 (86%), Positives = 60/67 (88%)
Frame = +1
Query: 636 VPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLFLHCTTFGAVW 695
VPLKVSI+AW L+RDRLPTK NLVRR CL VEAAGC GCGIAETANHLFLHC TFGAVW
Sbjct: 142 VPLKVSILAWRLIRDRLPTKANLVRRRCLAVEAAGCVVGCGIAETANHLFLHCATFGAVW 321
Query: 696 QHIRAWI 702
QHIRAWI
Sbjct: 322 QHIRAWI 342
>CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [imported] -
Sulfolobus solfataricus, partial (3%)
Length = 789
Score = 101 bits (252), Expect = 9e-22
Identities = 43/79 (54%), Positives = 56/79 (70%)
Frame = -2
Query: 624 IVGVGDLVWHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANH 683
I+ +++WH+QVPLKVS+ AW LLRDRLPTK NL+ RG + EA C GCG E+A H
Sbjct: 605 ILPYAEMIWHRQVPLKVSVFAWRLLRDRLPTKSNLIYRGVIPTEAGLCVSGCGALESAQH 426
Query: 684 LFLHCTTFGAVWQHIRAWI 702
LFL C+ F ++W +R WI
Sbjct: 425 LFLSCSYFASLWSLVRDWI 369
>TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imported] -
Arabidopsis thaliana, partial (6%)
Length = 951
Score = 79.0 bits (193), Expect = 6e-15
Identities = 62/143 (43%), Positives = 73/143 (50%)
Frame = +3
Query: 241 CLKGTQLVMTCQ**FLTFNLLMTLCYLEIKVGLMSEL*GPFCICLRGCLACE*IFIRVCW 300
CL G L Q + FNL TLC ++K+G S L*G + R CL *IFIRV W
Sbjct: 501 CLLGIVLGWLIQLWYHIFNLQTTLCC*KLKIGQTSVL*GLHLLFFRLCLV*R*IFIRVAW 680
Query: 301 LELISRILG*MRPSLF*IAKLVKLHFCIWVFLLVVIRRG*HFGSLC*RPLSLDYRGGKIA 360
IS +LG*+R LF*+ K + HFC V L I * FG+L * L G +
Sbjct: 681 FV*ISLLLG*VRRLLF*VGKWARYHFCT*VCLSREILGA*VFGNLL*IA*KLG*LVGIVV 860
Query: 361 FSLLGVGWFYSSLS*PHCLSMPF 383
F LL F S *P CLSM F
Sbjct: 861 FYLLAAV*FC*SRF*PLCLSMRF 929
Score = 39.7 bits (91), Expect(2) = 3e-08
Identities = 31/74 (41%), Positives = 39/74 (51%)
Frame = +3
Query: 54 SFTAMVN*LEVSITLS*LSFQKLIALNVLMTFDLFLWLDLFTKFYRRF*RID*KRFWERL 113
SFTA+ N +V LS* SF +L L+ LM F LFLW ++ KF +* I
Sbjct: 18 SFTAIGNYSKVLTPLS*PSFLRLTTLSALMIFVLFLWWEVCIKF*ANY*LIVCVWLLVL* 197
Query: 114 FRIRRLPS*KIGKF 127
FR+R KI KF
Sbjct: 198 FRMRSQRLLKIDKF 239
Score = 36.6 bits (83), Expect(2) = 3e-08
Identities = 19/35 (54%), Positives = 23/35 (65%)
Frame = +2
Query: 135 MRLSMRLARRKRN*YCLRWISKRLMIRSIGDTWML 169
MRL MRL ++ CLRWI KRL+ SIG W+L
Sbjct: 263 MRLWMRLRN*RKIFCCLRWILKRLITLSIGLIWIL 367
>TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarotenoid
dioxygenase1 {Pisum sativum}, partial (43%)
Length = 1865
Score = 76.6 bits (187), Expect = 3e-14
Identities = 49/146 (33%), Positives = 75/146 (50%), Gaps = 8/146 (5%)
Frame = -2
Query: 565 WRR-PLWVWEEELVEECRKLLNGVVLQSDISDRWLWESNNDDVYTVRGAYQILTTMD--D 621
WRR L+ WE+E + E L GVVL+ +D W+W+ + + V++V Y +L + +
Sbjct: 538 WRRLELFEWEKERLLELLGRLEGVVLRY-WADIWVWKPDKEGVFSVNSCYFLLQNLRLLE 362
Query: 622 PPIVGVGDLV----WHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGI 677
+ +++ W + P KV +W L DR+PT VNL +R L VE + CG
Sbjct: 361 DRLSYEEEVIFRELWKSKAPAKVLAFSWTLFLDRIPTMVNLGKRRLLRVEDSKRCVFCGC 182
Query: 678 A-ETANHLFLHCTTFGAVWQHIRAWI 702
ET HLFLHC V + + W+
Sbjct: 181 QDETVVHLFLHCDVISKV*REVMRWL 104
>BE941032
Length = 435
Score = 63.2 bits (152), Expect = 3e-10
Identities = 26/62 (41%), Positives = 35/62 (55%)
Frame = +2
Query: 641 SIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLFLHCTTFGAVWQHIRA 700
SI WC+L +R PTK NL++RG + C CG A HLFLHC F +W ++
Sbjct: 146 SIFLWCVLLNRFPTKDNLLKRGVISAIYQSCVGECGNLYDATHLFLHCNFFRQIWINVSD 325
Query: 701 WI 702
W+
Sbjct: 326 WL 331
>BE999296
Length = 384
Score = 63.2 bits (152), Expect = 3e-10
Identities = 34/78 (43%), Positives = 41/78 (51%)
Frame = -1
Query: 609 VRGAYQILTTMDDPPIVGVGDLVWHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEA 668
+ GAYQ L + D P D VWH +PLKV +LR+ LPTK N VRR + E
Sbjct: 306 IHGAYQFLMSADAPLDREYIDSVWHNHIPLKVCFFVLRVLRNCLPTKDNFVRRRVIHEEH 127
Query: 669 AGCADGCGIAETANHLFL 686
C GC ET + LFL
Sbjct: 126 MLCPTGCSFKETTDDLFL 73
>BF520135
Length = 202
Score = 61.6 bits (148), Expect = 1e-09
Identities = 29/63 (46%), Positives = 37/63 (58%)
Frame = +3
Query: 630 LVWHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLFLHCT 689
L+W K+ KVSI AW L RLPTK N+ +RG + +A C C + E+ HLFLHC
Sbjct: 12 LLWRKEDLSKVSIFAWRLFHGRLPTKANVFKRGIVHHDAHMCVTRCRLIESDVHLFLHCD 191
Query: 690 TFG 692
G
Sbjct: 192 VLG 200
>BF650593
Length = 486
Score = 60.5 bits (145), Expect = 2e-09
Identities = 28/63 (44%), Positives = 36/63 (56%)
Frame = +3
Query: 633 HKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLFLHCTTFG 692
HK VPLKVS + W L ++ L T+ NL +RG LD + C CG E+ +H F C F
Sbjct: 297 HKSVPLKVSCLVWRLFQNXLATRDNLSKRGVLDQNSIXCVXDCGREESVSHFFFEC-PFS 473
Query: 693 AVW 695
VW
Sbjct: 474 XVW 482
>TC82520
Length = 833
Score = 58.5 bits (140), Expect = 8e-09
Identities = 33/88 (37%), Positives = 44/88 (49%)
Frame = +3
Query: 608 TVRGAYQILTTMDDPPIVGVGDLVWHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVE 667
T RG Y+ LT P D VW K +P KVS+ W L +RLPTKVNL++R L
Sbjct: 117 TRRGTYRFLTISGQPLDRNQVDDVWQKNIPSKVSMFVWRLFHNRLPTKVNLMQRHVLQQT 296
Query: 668 AAGCADGCGIAETANHLFLHCTTFGAVW 695
C G I + H+ + F A++
Sbjct: 297 HTACISGVAIRK-RQHICFYIVIFLALF 377
Score = 55.1 bits (131), Expect = 9e-08
Identities = 40/137 (29%), Positives = 59/137 (42%), Gaps = 3/137 (2%)
Frame = +2
Query: 569 LWVWEEELVEECRKLLNGVVLQSDISDRWLWESNNDDVYT---VRGAYQILTTMDDPPIV 625
L+ WEEE V E LL+ VLQ ++ D W + YT + ++ + TT+
Sbjct: 2 LFAWEEESVREWYALLHNTVLQENVHDVCRWLLDPI*GYTEGNISLSHYLWTTIG*ES-- 175
Query: 626 GVGDLVWHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLF 685
K+ K + L + C + G GCG +ETA HLF
Sbjct: 176 --S*RCLAKEYSFKGVYVRVASLSQ*TSHEG*FNAATCSSTNSHGLHFGCGDSETATHLF 349
Query: 686 LHCTTFGAVWQHIRAWI 702
LHC FG++W H+ W+
Sbjct: 350 LHCDIFGSLWSHVLRWL 400
>BG456581
Length = 683
Score = 48.1 bits (113), Expect = 1e-05
Identities = 25/65 (38%), Positives = 33/65 (50%)
Frame = +2
Query: 631 VWHKQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLFLHCTT 690
+W+ +P S + W L DRLPT NL RGC V C+ ET++HLFL C
Sbjct: 80 IWNSCIPPSHSFICWRLAHDRLPTDDNLSSRGCALVSM--CSFCLEQVETSDHLFLRCKF 253
Query: 691 FGAVW 695
+W
Sbjct: 254 VVTLW 268
>BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
[imported] - Arabidopsis thaliana, partial (10%)
Length = 767
Score = 47.4 bits (111), Expect = 2e-05
Identities = 29/112 (25%), Positives = 45/112 (39%), Gaps = 11/112 (9%)
Frame = -3
Query: 595 DRWLWESNNDDVYTVRGAYQILT----------TMDDPPIVGVGDLVWHKQVPLKVSIMA 644
D + WE + Y+V+ Y + T T+D P + + VW KV
Sbjct: 426 DSYSWEYSKSGHYSVKSGYYVQTNIIAAANQRGTVDQPSLDDLYQRVWKYNTSPKVRHFL 247
Query: 645 WCLLRDRLPTKVNLVRRGCLDVEAAGCADGCGI-AETANHLFLHCTTFGAVW 695
W + + LPT N+ R + G CG+ +ET NH+ C +W
Sbjct: 246 WRCISNSLPTAANMRSR---HISKDGSCSRCGMESETVNHILFQCPYARLIW 100
>TC83226 weakly similar to PIR|G86419|G86419 probable reverse transcriptase
100033-105622 [imported] - Arabidopsis thaliana, partial
(2%)
Length = 885
Score = 42.0 bits (97), Expect = 8e-04
Identities = 30/109 (27%), Positives = 43/109 (38%), Gaps = 11/109 (10%)
Frame = +3
Query: 598 LWESNNDDVYTVRGAYQILTTMDDPPIVGVGD-----LVWHKQVPLKV----SIMAWCLL 648
+W N +Y+V+ Y L T I L+W K L ++ W +L
Sbjct: 3 MWMHNPTGIYSVKSGYNTLRTWQTQQINNTSTSSDETLIWKKIWSLHTIPRHKVLLWRIL 182
Query: 649 RDRLPTKVNLVRRG--CLDVEAAGCADGCGIAETANHLFLHCTTFGAVW 695
D LP + +L +RG C + C ET HLF+ C VW
Sbjct: 183 NDSLPVRSSLRKRGIQCYPL----CPRCHSKTETITHLFMSCPLSKRVW 317
>BG585866
Length = 828
Score = 37.0 bits (84), Expect = 0.026
Identities = 25/100 (25%), Positives = 42/100 (42%), Gaps = 2/100 (2%)
Frame = +3
Query: 591 SDISDRWLWESNNDDVYTVRGAYQILTTMDDPPIVGVG--DLVWHKQVPLKVSIMAWCLL 648
+ I D ++W N++ VYT + Y + + + +W ++P K W
Sbjct: 336 ASIGDAYIWPHNSNGVYTAKSGYSWILSQTETVNYNNSSWSWIWRLKIPEKYKFFLWLAC 515
Query: 649 RDRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLFLHC 688
+ +PT L R V +A C+ CG E + F HC
Sbjct: 516 HNAVPTLSLLNHRNM--VNSAICS-RCGEHEES---FFHC 617
>TC81859 similar to GP|18491135|gb|AAL69536.1 At1g04290/F19P19_27
{Arabidopsis thaliana}, partial (65%)
Length = 960
Score = 36.2 bits (82), Expect = 0.044
Identities = 17/46 (36%), Positives = 26/46 (55%)
Frame = +2
Query: 650 DRLPTKVNLVRRGCLDVEAAGCADGCGIAETANHLFLHCTTFGAVW 695
DRLPTK N++ R +++++ C G + ET+ LF C F W
Sbjct: 617 DRLPTKYNVLPREINNLDSSFCVGGWEVNETSQSLFF*CPLF*QGW 754
>TC80531 similar to GP|3426042|gb|AAC32241.1| unknown protein {Arabidopsis
thaliana}, partial (27%)
Length = 999
Score = 35.4 bits (80), Expect = 0.075
Identities = 18/54 (33%), Positives = 27/54 (49%)
Frame = -3
Query: 553 ILGGRMEEGAWNWRRPLWVWEEELVEECRKLLNGVVLQSDISDRWLWESNNDDV 606
+L G M G W W WVWE E ++ K + G+ L+ + +W+W N V
Sbjct: 211 LLTGTMISGFWVWVWA-WVWEREEEQKDLKEMVGLSLEESLFMKWVWVR*NSGV 53
>AW980456
Length = 779
Score = 35.0 bits (79), Expect = 0.098
Identities = 20/68 (29%), Positives = 32/68 (46%), Gaps = 4/68 (5%)
Frame = -2
Query: 595 DRWLWESNNDDVYTVRGAYQILTT--MDDPPIVGVGDL--VWHKQVPLKVSIMAWCLLRD 650
DR +W+ Y V+ AY+ D + G+ +W +VP KV + W + R
Sbjct: 208 DRLIWKDEKHGKYYVKSAYRFCVEELFDSSYLHRPGNWSGIWKLKVPPKVQNLVWRMCRG 29
Query: 651 RLPTKVNL 658
LPT++ L
Sbjct: 28 CLPTRIRL 5
>TC93136
Length = 722
Score = 34.7 bits (78), Expect = 0.13
Identities = 12/24 (50%), Positives = 16/24 (66%)
Frame = +1
Query: 679 ETANHLFLHCTTFGAVWQHIRAWI 702
ET++HLFLHC +VW I W+
Sbjct: 262 ETSSHLFLHCPFLSSVWSKILGWL 333
>BG586862
Length = 804
Score = 32.3 bits (72), Expect = 0.64
Identities = 25/75 (33%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Frame = -1
Query: 623 PIVGVGDLVWH-KQVPLKVSIMAWCLLRDRLPTKVNLVRRGCLDVEAAGCADGC-GIAET 680
PI + + VW K +P S + W LL + LP K L +RG + + C ET
Sbjct: 678 PI*FIREKVWGIKTIPRHKSFL-WRLLHNALPVKDELHKRG---IRCSLLCPRCESKIET 511
Query: 681 ANHLFLHCTTFGAVW 695
HLFL+C W
Sbjct: 510 VQHLFLNCEVTQKEW 466
>BQ124492 weakly similar to GP|9909168|dbj| putative transposable element
Tip100 protein {Oryza sativa (japonica cultivar-group)},
partial (8%)
Length = 694
Score = 31.2 bits (69), Expect = 1.4
Identities = 10/24 (41%), Positives = 15/24 (61%)
Frame = +2
Query: 679 ETANHLFLHCTTFGAVWQHIRAWI 702
+TA HLFL C F ++W + W+
Sbjct: 530 KTARHLFLDCDIFSSLWSQVWLWL 601
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.363 0.165 0.639
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,600,862
Number of Sequences: 36976
Number of extensions: 398693
Number of successful extensions: 4753
Number of sequences better than 10.0: 48
Number of HSP's better than 10.0 without gapping: 1901
Number of HSP's successfully gapped in prelim test: 184
Number of HSP's that attempted gapping in prelim test: 2710
Number of HSP's gapped (non-prelim): 2284
length of query: 702
length of database: 9,014,727
effective HSP length: 103
effective length of query: 599
effective length of database: 5,206,199
effective search space: 3118513201
effective search space used: 3118513201
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (22.0 bits)
S2: 62 (28.5 bits)
Medicago: description of AC145219.13