
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0118a.5
(351 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG585866 92 3e-19
BG585499 72 2e-13
TC83226 weakly similar to PIR|G86419|G86419 probable reverse tra... 54 1e-07
BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At... 53 2e-07
CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [i... 44 7e-05
BG586862 44 1e-04
TC82520 31 0.015
TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarot... 35 0.033
TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non... 33 0.16
AJ496898 similar to GP|23171295|gb CG3731-PA {Drosophila melanog... 33 0.16
AW980456 32 0.28
BG452711 32 0.37
BG644917 homologue to GP|10177015|dbj| ubiquitin-like protein {A... 29 0.39
AW686588 31 0.82
BG586638 homologue to GP|9279730|dbj formin-like protein {Arabid... 30 1.8
BG456581 29 2.4
TC88753 similar to PIR|T10586|T10586 small nuclear ribonucleopro... 29 2.4
TC89483 similar to GP|18377662|gb|AAL66981.1 unknown protein {Ar... 29 3.1
AJ501131 28 4.1
TC91679 similar to PIR|E84908|E84908 hypothetical protein At2g46... 28 5.3
>BG585866
Length = 828
Score = 91.7 bits (226), Expect(2) = 3e-19
Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 1/131 (0%)
Frame = +3
Query: 1 IWGPSPTGCYTAREAYGW-LNNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNKALQTN 59
IW + G YTA+ Y W L+ + N W W+W+LK+PEK F+WL + A+ T
Sbjct: 357 IWPHNSNGVYTAKSGYSWILSQTETVNYNNSSWSWIWRLKIPEKYKFFLWLACHNAVPTL 536
Query: 60 GNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELWLRLGATTWRSFATMDVEAWITS 119
++ S SRC EE+ HC+RDC S+ +W ++G ++ F++ V+ W+
Sbjct: 537 SLLNHRNMVNSAICSRCGEHEESFFHCVRDCRFSKIIWHKIGFSSPDFFSSSSVQDWLKD 716
Query: 120 LARSNHAISFL 130
+ +FL
Sbjct: 717 GISCHRPTTFL 749
Score = 20.8 bits (42), Expect(2) = 3e-19
Identities = 6/13 (46%), Positives = 8/13 (61%)
Frame = +2
Query: 132 GIWSVWLWRNNMC 144
G+W +W R MC
Sbjct: 752 GLWWIWRHRTLMC 790
>BG585499
Length = 792
Score = 72.4 bits (176), Expect = 2e-13
Identities = 37/126 (29%), Positives = 64/126 (50%), Gaps = 10/126 (7%)
Frame = +3
Query: 32 WQWVWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCP 91
W+ +W + P + F+WL+ + + TN R R + C A+ET+LH L DC
Sbjct: 225 WKMLWGWRGPHRTQTFMWLVAHGCILTNYRRSRWGTRVLATCPCCGNADETVLHVLCDCR 404
Query: 92 HSRELWLRLGATTW--RSFATMDVEAWI-TSLARSNHAIS-------FLSGIWSVWLWRN 141
+ ++W+RL + W F+ D W+ +L++ ++ +S F++ W +W WRN
Sbjct: 405 PASQVWIRLVPSDWITNFFSFDDCRDWVFKNLSKRSNGVSKFKWQPTFMTTCWHMWTWRN 584
Query: 142 NMCFEE 147
FEE
Sbjct: 585 KAIFEE 602
>TC83226 weakly similar to PIR|G86419|G86419 probable reverse transcriptase
100033-105622 [imported] - Arabidopsis thaliana, partial
(2%)
Length = 885
Score = 53.5 bits (127), Expect = 1e-07
Identities = 28/105 (26%), Positives = 44/105 (41%), Gaps = 8/105 (7%)
Frame = +3
Query: 1 IWGPSPTGCYTAREAYGWL--------NNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLIL 52
+W +PTG Y+ + Y L NN W+ +W L + + +W IL
Sbjct: 3 MWMHNPTGIYSVKSGYNTLRTWQTQQINNTSTSSDETLIWKKIWSLHTIPRHKVLLWRIL 182
Query: 53 NKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELW 97
N +L + + + P RC + ETI H CP S+ +W
Sbjct: 183 NDSLPVRSSLRKRGIQCYPLCPRCHSKTETITHLFMSCPLSKRVW 317
>BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
[imported] - Arabidopsis thaliana, partial (10%)
Length = 767
Score = 52.8 bits (125), Expect = 2e-07
Identities = 31/105 (29%), Positives = 48/105 (45%), Gaps = 9/105 (8%)
Frame = -3
Query: 2 WGPSPTGCYTAREAYGWLNNL-DHEGQNGRC--------WQWVWKLKVPEKM*LFVWLIL 52
W S +G Y+ + Y N+ Q G +Q VWK K+ F+W +
Sbjct: 414 WEYSKSGHYSVKSGYYVQTNIIAAANQRGTVDQPSLDDLYQRVWKYNTSPKVRHFLWRCI 235
Query: 53 NKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELW 97
+ +L T N H+++ S SRC ET+ H L CP++R +W
Sbjct: 234 SNSLPTAANMRSRHISKDGSCSRCGMESETVNHILFQCPYARLIW 100
>CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [imported] -
Sulfolobus solfataricus, partial (3%)
Length = 789
Score = 44.3 bits (103), Expect = 7e-05
Identities = 34/131 (25%), Positives = 53/131 (39%), Gaps = 15/131 (11%)
Frame = -2
Query: 33 QWVWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSR---SRCSAAEETILHCLRD 89
+ +W +VP K+ +F W +L L T N + + + S C A E H
Sbjct: 590 EMIWHRQVPLKVSVFAWRLLRDRLPTKSNLIYRGVIPTEAGLCVSGCGALESA-QHLFLS 414
Query: 90 CPHSRELWLRLGATTWRSFATMDVEA-------WITSLARSNHAISFLSGIWSVWLW--- 139
C + LW + W F +D ++ S + + SFL IW + W
Sbjct: 413 CSYFASLWSLV--RDWIGFVGVDTNVLSDHFVQFVHSTGGNKASQSFLQLIWLLCAWVLW 240
Query: 140 --RNNMCFEET 148
RNNMCF ++
Sbjct: 239 TERNNMCFNDS 207
>BG586862
Length = 804
Score = 43.5 bits (101), Expect = 1e-04
Identities = 30/117 (25%), Positives = 52/117 (43%), Gaps = 5/117 (4%)
Frame = -1
Query: 35 VWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCLRDCPHSR 94
VW +K + F+W +L+ AL + + S RC + ET+ H +C ++
Sbjct: 654 VWGIKTIPRHKSFLWRLLHNALPVKDELHKRGIRCSLLCPRCESKIETVQHLFLNCEVTQ 475
Query: 95 ELWL--RLGATTWRSFATMDVEAWITSLARSNH---AISFLSGIWSVWLWRNNMCFE 146
+ W +LG + S + WIT+ N I+ + ++S+W RN FE
Sbjct: 474 KEWFGSQLG-INFHSSGVLHFHDWITNFILKNDEETIIALTALLYSIWHARNQKVFE 307
>TC82520
Length = 833
Score = 31.2 bits (69), Expect(2) = 0.015
Identities = 12/35 (34%), Positives = 20/35 (56%)
Frame = +3
Query: 35 VWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQ 69
VW+ +P K+ +FVW + + L T N + H+ Q
Sbjct: 186 VWQKNIPSKVSMFVWRLFHNRLPTKVNLMQRHVLQ 290
Score = 24.3 bits (51), Expect(2) = 0.015
Identities = 29/122 (23%), Positives = 43/122 (34%), Gaps = 15/122 (12%)
Frame = +2
Query: 81 ETILHCLRDCPHSRELW----------LRLGATTWRSFATMDVEAWITSLARSNHAISFL 130
ET H C LW L L A + F A S I +
Sbjct: 329 ETATHLFLHCDIFGSLWSHVLRWLHLLLVLPADIRQFFIQFTSMAGSPRFTHSFLQIMWF 508
Query: 131 SGIWSVWLWRNNMCFEET---PWNLAEAWRRLSHVHDEMLQTSQDWSPGD--LNSLLCVR 185
+ +W +W RNN F+ + P E + S + + Q + +S D + LLC+
Sbjct: 509 ASVWVLWKKRNNRVFQNSLSDPSTFVEQVKMHSFLWLKFQQATFSFSYHDWWKHPLLCMG 688
Query: 186 WH 187
H
Sbjct: 689 VH 694
>TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarotenoid
dioxygenase1 {Pisum sativum}, partial (43%)
Length = 1865
Score = 35.4 bits (80), Expect = 0.033
Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 8/98 (8%)
Frame = -2
Query: 1 IWGPSPTGCYTAREAYGWLNNLDH-----EGQNGRCWQWVWKLKVPEKM*LFVWLILNKA 55
+W P G ++ Y L NL + ++ +WK K P K+ F W +
Sbjct: 439 VWKPDKEGVFSVNSCYFLLQNLRLLEDRLSYEEEVIFRELWKSKAPAKVLAFSWTLFLDR 260
Query: 56 LQTNGNRFRCHLAQSPSRSR---CSAAEETILHCLRDC 90
+ T N + L + R C +ET++H C
Sbjct: 259 IPTMVNLGKRRLLRVEDSKRCVFCGCQDETVVHLFLHC 146
>TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non-LTR
retroelement reverse transcriptase {Oryza sativa
(japonica cultivar-group)}, partial (2%)
Length = 1262
Score = 33.1 bits (74), Expect = 0.16
Identities = 25/98 (25%), Positives = 37/98 (37%), Gaps = 2/98 (2%)
Frame = +2
Query: 2 WGPSPTGCYTAREAYGWLNNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNKALQTNGN 61
W P Y+ + Y ++ + H VW +P K+ LFVW +L L T N
Sbjct: 530 WLLDPVNGYSVKVFYRYITSTGHISDRSLVDD-VWHKHIPSKVSLFVWRLLRNRLPTKDN 706
Query: 62 R-FRCHLAQSPSRSRCSAAE-ETILHCLRDCPHSRELW 97
R L + + C + E+ H C LW
Sbjct: 707 LVHRGVLLATNAACVCGCVDSESTTHLFLHCNVFCSLW 820
>AJ496898 similar to GP|23171295|gb CG3731-PA {Drosophila melanogaster},
partial (35%)
Length = 698
Score = 33.1 bits (74), Expect = 0.16
Identities = 16/48 (33%), Positives = 24/48 (49%), Gaps = 8/48 (16%)
Frame = +2
Query: 120 LARSNHAISFLS--------GIWSVWLWRNNMCFEETPWNLAEAWRRL 159
+A N A SF S G+W V+ + M E WN+ EAW+++
Sbjct: 152 VAEDNCAHSFQSFNTCYKDTGLWGVYFVSDGMTIENMVWNIQEAWKKM 295
>AW980456
Length = 779
Score = 32.3 bits (72), Expect = 0.28
Identities = 23/72 (31%), Positives = 30/72 (40%), Gaps = 6/72 (8%)
Frame = -2
Query: 1 IWGPSPTGCYTAREAYGWL------NNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNK 54
IW G Y + AY + ++ H N W +WKLKVP K+ VW +
Sbjct: 199 IWKDEKHGKYYVKSAYRFCVEELFDSSYLHRPGN---WSGIWKLKVPPKVQNLVWRMCRG 29
Query: 55 ALQTNGNRFRCH 66
L T R R H
Sbjct: 28 CLPT---RIRLH 2
>BG452711
Length = 672
Score = 32.0 bits (71), Expect = 0.37
Identities = 19/95 (20%), Positives = 35/95 (36%)
Frame = +3
Query: 3 GPSPTGCYTAREAYGWLNNLDHEGQNGRCWQWVWKLKVPEKM*LFVWLILNKALQTNGNR 62
G T + Y WL + W+ +L P + LF+ + +
Sbjct: 198 GSGSTAHLIVSKGYWWLMGVHTSFLGKES*NWISRLCAPSNIKLFL*QL*RDYVHFRSIL 377
Query: 63 FRCHLAQSPSRSRCSAAEETILHCLRDCPHSRELW 97
C+L S C+ + +LH L C ++++W
Sbjct: 378 LFCNLISSNLCPICNQRSQDMLHALFSCTRAKDVW 482
>BG644917 homologue to GP|10177015|dbj| ubiquitin-like protein {Arabidopsis
thaliana}, partial (98%)
Length = 751
Score = 28.9 bits (63), Expect(2) = 0.39
Identities = 10/15 (66%), Positives = 10/15 (66%)
Frame = +3
Query: 144 CFEETPWNLAEAWRR 158
CF WNL EAWRR
Sbjct: 684 CFANCKWNLGEAWRR 728
Score = 21.6 bits (44), Expect(2) = 0.39
Identities = 7/16 (43%), Positives = 10/16 (61%)
Frame = +2
Query: 130 LSGIWSVWLWRNNMCF 145
+ G W VWL+ +CF
Sbjct: 626 MRGFWRVWLFWILICF 673
>AW686588
Length = 567
Score = 30.8 bits (68), Expect = 0.82
Identities = 33/131 (25%), Positives = 45/131 (34%), Gaps = 14/131 (10%)
Frame = +1
Query: 33 QWVWKLKVPEKM*LFVWLILNKALQTNGN--RFRCHLAQSPSRSRCSAAEETILHCLRDC 90
Q W L VP K+ + W ++ L T N R RC ++ ET H C
Sbjct: 124 QTTW-L*VPLKVSILAWRLIRDRLPTKANLVRRRCLAVEAAGCVVGCGIAETANHLFLHC 300
Query: 91 PHSRELWLRLGATTWRSFATMDVE------------AWITSLARSNHAISFLSGIWSVWL 138
+W + A W + D T RS + +L +W VW
Sbjct: 301 ATFGAVWQHIRA--WIGVSGADPHDLSDHFIQFITCTGHTRARRSFMQLIWLLCVWMVWN 474
Query: 139 WRNNMCFEETP 149
RNN F P
Sbjct: 475 ERNNRLFN*YP 507
>BG586638 homologue to GP|9279730|dbj formin-like protein {Arabidopsis
thaliana}, partial (1%)
Length = 723
Score = 29.6 bits (65), Expect = 1.8
Identities = 15/53 (28%), Positives = 25/53 (46%)
Frame = +2
Query: 87 LRDCPHSRELWLRLGATTWRSFATMDVEAWITSLARSNHAISFLSGIWSVWLW 139
L +C + LW + W +D +W+ SL N A+ +S +W W+W
Sbjct: 548 LENCCYV*LLWCWI----WICCGRVDSVSWLISL--ENTAVVVMSKLWGFWIW 688
>BG456581
Length = 683
Score = 29.3 bits (64), Expect = 2.4
Identities = 30/126 (23%), Positives = 46/126 (35%), Gaps = 11/126 (8%)
Frame = +2
Query: 31 CWQW---VWKLKVPEKM*LFVWLILNKALQTNGNRFRCHLAQSPSRSRCSAAEETILHCL 87
C W +W +P W + + L T+ N A S C ET H
Sbjct: 59 CAPWASTIWNSCIPPSHSFICWRLAHDRLPTDDNLSSRGCALVSMCSFCLEQVETSDHLF 238
Query: 88 RDCPHSRELW------LRLGATTWRSFATM--DVEAWITSLARSNHAISFLSGIWSVWLW 139
C LW LR+G + SF + + +S R + + + + S+W
Sbjct: 239 LRCKFVVTLWSWLCSQLRVGLD-FSSFKALLSSLPRHCSSQVRDLYVAAVVHMVHSIWWA 415
Query: 140 RNNMCF 145
RNN+ F
Sbjct: 416 RNNVRF 433
>TC88753 similar to PIR|T10586|T10586 small nuclear
ribonucleoprotein-associated protein homolog F9F13.90 -
Arabidopsis thaliana, partial (77%)
Length = 1273
Score = 29.3 bits (64), Expect = 2.4
Identities = 25/112 (22%), Positives = 41/112 (36%), Gaps = 5/112 (4%)
Frame = +1
Query: 97 WLRLGATTWRSFATMDVEAWITSLARSNHAI-SFLSGIWSVWLWRNNMCFEETPWNLAEA 155
W A+TW + + +W S A + S+ +G ++W ET W+
Sbjct: 592 WSACYASTWSNAVSWT--SWTRSTADGEGSTASYAAG--AIWA--------ETRWSSTTI 735
Query: 156 WRRLSHVHDEMLQTSQDWSPGDLNSLLCVRWHPPAR----GGSN*MWMTVTW 203
W S V E + WS G+ S W+ + S+ W + W
Sbjct: 736 WYATSSVWAETNGATSSWSDGERTSCSSSAWNAASASSWYASSSWKWCSCVW 891
>TC89483 similar to GP|18377662|gb|AAL66981.1 unknown protein {Arabidopsis
thaliana}, partial (35%)
Length = 1711
Score = 28.9 bits (63), Expect = 3.1
Identities = 9/13 (69%), Positives = 10/13 (76%)
Frame = +1
Query: 132 GIWSVWLWRNNMC 144
GIWS WLW+ N C
Sbjct: 802 GIWSGWLWKKNNC 840
>AJ501131
Length = 451
Score = 28.5 bits (62), Expect = 4.1
Identities = 12/29 (41%), Positives = 14/29 (47%)
Frame = -1
Query: 129 FLSGIWSVWLWRNNMCFEETPWNLAEAWR 157
FL W V+LW CF W+L WR
Sbjct: 262 FLWSGWFVFLWSGFGCFVFRLWSLVSLWR 176
>TC91679 similar to PIR|E84908|E84908 hypothetical protein At2g46890
[imported] - Arabidopsis thaliana, partial (77%)
Length = 1156
Score = 28.1 bits (61), Expect = 5.3
Identities = 16/38 (42%), Positives = 20/38 (52%)
Frame = +1
Query: 120 LARSNHAISFLSGIWSVWLWRNNMCFEETPWNLAEAWR 157
L RSN AI L+ +WS+ L N E+ W E WR
Sbjct: 292 LWRSNIAI-LLTWVWSIRLTHNYFRREKWQWGAREDWR 402
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.342 0.147 0.572
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,732,704
Number of Sequences: 36976
Number of extensions: 277843
Number of successful extensions: 3138
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 3057
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 3132
length of query: 351
length of database: 9,014,727
effective HSP length: 97
effective length of query: 254
effective length of database: 5,428,055
effective search space: 1378725970
effective search space used: 1378725970
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.5 bits)
S2: 59 (27.3 bits)
Lotus: description of TM0118a.5