
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0048.4
(422 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC88200 similar to GP|6671963|gb|AAF23222.1| unknown protein {Ar... 729 0.0
BG587229 similar to PIR|T05664|T05 hypothetical protein F22I13.1... 107 7e-24
TC83459 similar to GP|14334724|gb|AAK59540.1 unknown protein {Ar... 90 2e-18
TC91388 similar to GP|14334724|gb|AAK59540.1 unknown protein {Ar... 56 3e-08
TC84836 similar to PIR|T05165|T05165 hypothetical protein F18E5.... 52 4e-07
TC83919 similar to GP|17065412|gb|AAL32860.1 Unknown protein {Ar... 32 0.46
TC78374 31 1.0
TC92429 similar to PIR|T46092|T46092 hypothetical protein T20E23... 30 1.3
TC85922 similar to SP|P56807|RR18_ARATH Chloroplast 30S ribosoma... 30 1.8
AW774707 similar to GP|22795253|gb unknown protein {Oryza sativa... 29 3.0
TC91992 similar to GP|8099125|dbj|BAA90497.1 rice EST C27893 cor... 29 3.0
TC78063 similar to GP|14194099|gb|AAK56244.1 AT3g52990/F8J2_160 ... 29 3.9
TC78728 similar to PIR|E96523|E96523 hypothetical protein F11A17... 28 6.7
BQ144271 weakly similar to GP|21322711|em pherophorin-dz1 protei... 28 8.7
TC85714 similar to GP|14194099|gb|AAK56244.1 AT3g52990/F8J2_160 ... 28 8.7
TC92575 weakly similar to GP|21428434|gb|AAM49877.1 LD13350p {Dr... 28 8.7
>TC88200 similar to GP|6671963|gb|AAF23222.1| unknown protein {Arabidopsis
thaliana}, partial (88%)
Length = 1788
Score = 729 bits (1881), Expect = 0.0
Identities = 357/418 (85%), Positives = 387/418 (92%)
Frame = +1
Query: 5 LPIYLYIVAFLCTIGAVSLALLHIYKHLVNYTEPTYQRYIVRIVFMVPVYALMSFLSLVL 64
LP++ Y++AF CT+GA++LA+LHIY+HL+NYTEPTYQR+IVRIVFMVPVYALMSFLSLVL
Sbjct: 145 LPLFFYVIAFFCTVGAIALAILHIYRHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVL 324
Query: 65 PASSIYFNSIREVYEAWVIYNFLSLCLAWVGGPGAVVLSLSGRVLKPSWFLMTCCLSPIP 124
P SIYFNSIREVYEAWVIYNFLSLCLAWVGGPG+VV+SLSGRVLKPS LMTCC PIP
Sbjct: 325 PRLSIYFNSIREVYEAWVIYNFLSLCLAWVGGPGSVVISLSGRVLKPSVCLMTCCFPPIP 504
Query: 125 LDGRFIRKCKQGGLQFVILKPILVAVTLILYVKGKYKDGNFSPKQSYLYLTIIYTFSYTM 184
LDGRFIRKCKQG LQFVILKPILV VTLILY KGKYKDGNF+PKQSYLYLTIIYTFSYTM
Sbjct: 505 LDGRFIRKCKQGCLQFVILKPILVVVTLILYAKGKYKDGNFNPKQSYLYLTIIYTFSYTM 684
Query: 185 ALYALALFYVACKDLLHPFNPVPKFIIIKSVVFLTYWQGVLVFLAAKSKFIKDADEAAIL 244
ALYALALFYVACKDLL PFNPVPKFIIIKSVVFLTYWQGVL FLAAKS FI+DADEAA+L
Sbjct: 685 ALYALALFYVACKDLLQPFNPVPKFIIIKSVVFLTYWQGVLFFLAAKSGFIQDADEAALL 864
Query: 245 QDFIICVEMLVAAVGHFYAFPYKEYAGANIGGVSRGFTASLGHALMLNDFYHDTVHQFAP 304
Q+FIICVEML+AAVGHFYAFPYKEYAGANIGG SRG TASLGHAL LNDFYHDTVHQFAP
Sbjct: 865 QNFIICVEMLIAAVGHFYAFPYKEYAGANIGG-SRGLTASLGHALKLNDFYHDTVHQFAP 1041
Query: 305 TYHDYVLYNHGEGEEGTRKYRSRTFVPLGPEMDAVRRNKHMFENKVDDIQLSRFSSSNSS 364
TYHDYVLYNH EGEEGTRKYRSRTFVP+GPEMD VR+NKH+ NKVDDIQL+ F SS+SS
Sbjct: 1042TYHDYVLYNHSEGEEGTRKYRSRTFVPIGPEMDNVRKNKHITGNKVDDIQLTSF-SSDSS 1218
Query: 365 TPSNSGPISDVSHSDAMKSSLLVDVSNSVSVPYDLTLIDLDLSDHPEKVPAVDKAGTR 422
TPSNSG + D S+SDA+KSSLLVDVS S S+PYDLTLIDLD+S +PE+VPA DKAG R
Sbjct: 1219TPSNSGSLPDASNSDAIKSSLLVDVSTSASIPYDLTLIDLDVSSYPEEVPAADKAGLR 1392
>BG587229 similar to PIR|T05664|T05 hypothetical protein F22I13.130 -
Arabidopsis thaliana, partial (46%)
Length = 690
Score = 107 bits (268), Expect = 7e-24
Identities = 73/227 (32%), Positives = 111/227 (48%), Gaps = 24/227 (10%)
Frame = +3
Query: 58 SFLSLVLPASSIYFNSIREVYEAWVIYNFLSLCLAWVGGPGAVV--LSLSGRV------- 108
SF+SLV P+ S+ +R+ YE++ +Y F +A +GG + + GR
Sbjct: 9 SFVSLVNPSISVDCAILRDCYESFAMYCFGRYLVACLGGEDRTLDFMEKEGRATFKTPLL 188
Query: 109 ----------LKPSWFLMTCCLSPIPLDGRFIRKCKQGGLQFVILKPILVAVTLILYVKG 158
+ F + L P L RF + K G +Q++I+K + +IL G
Sbjct: 189 RHYHSSHSPGIVKHPFPIKYFLKPWILGPRFYQIVKFGIVQYMIIKSFTAILAVILEAFG 368
Query: 159 KYKDGNFSPKQSYLYLTIIYTFSYTMALYALALFYVACKDLLHPFNPVPKFIIIKSVVFL 218
Y +G F Y Y+ ++ FS + ALY L FY KD L P+ KF+ KS+VFL
Sbjct: 369 VYCEGEFKLGCGYPYVAVVLNFSQSWALYCLVQFYTVTKDELAHIKPLAKFLTFKSIVFL 548
Query: 219 TYWQGVLV-----FLAAKSKFIKDADEAAILQDFIICVEMLVAAVGH 260
T+WQGV + F KS + + +QDFIIC+EM +A++ H
Sbjct: 549 TWWQGVAIALLYTFGLFKSPIAQGLQFKSSVQDFIICIEMGIASIVH 689
>TC83459 similar to GP|14334724|gb|AAK59540.1 unknown protein {Arabidopsis
thaliana}, partial (42%)
Length = 1248
Score = 89.7 bits (221), Expect = 2e-18
Identities = 49/102 (48%), Positives = 57/102 (55%), Gaps = 3/102 (2%)
Frame = +1
Query: 171 YLYLTIIYTFSYTMALYALALFYVACKDLLHPFNPVPKFIIIKSVVFLTYWQGVLVFLAA 230
Y YL I FS T ALY L FY KD L P P+ KF+ KS+VFLT+WQGV V
Sbjct: 175 YPYLASILNFSQTWALYCLVQFYSVIKDKLEPIKPLAKFLTFKSIVFLTWWQGVAVAFLF 354
Query: 231 KSKFIKDA---DEAAILQDFIICVEMLVAAVGHFYAFPYKEY 269
K A + +QD+IIC+EM VAAV H Y FP Y
Sbjct: 355 SMGAFKGALAQELRTRIQDYIICIEMGVAAVVHLYVFPAVPY 480
>TC91388 similar to GP|14334724|gb|AAK59540.1 unknown protein {Arabidopsis
thaliana}, partial (14%)
Length = 691
Score = 55.8 bits (133), Expect = 3e-08
Identities = 26/71 (36%), Positives = 47/71 (65%)
Frame = +2
Query: 13 AFLCTIGAVSLALLHIYKHLVNYTEPTYQRYIVRIVFMVPVYALMSFLSLVLPASSIYFN 72
A + + A+ L++ I++HL Y +P Q++++ ++ MVPVYAL SFLSL+ +++
Sbjct: 479 ASIFVLVALVLSMYLIFEHLAAYNQPEEQKFLIGLILMVPVYALESFLSLLDSSAAFNCE 658
Query: 73 SIREVYEAWVI 83
IR+ YEA+ +
Sbjct: 659 VIRDCYEAFAL 691
>TC84836 similar to PIR|T05165|T05165 hypothetical protein F18E5.190 -
Arabidopsis thaliana, partial (53%)
Length = 634
Score = 52.0 bits (123), Expect = 4e-07
Identities = 33/87 (37%), Positives = 52/87 (58%), Gaps = 4/87 (4%)
Frame = +3
Query: 7 IYLYIVAFLCTIGAVSLALLHIYKHLVNYTEPTYQRYIVRIVFMVPVYALMSFLSLV-LP 65
I +Y AF C + ++ L + +HL + P Q+ I+ I+ M P+YA++SF+ L+ +
Sbjct: 144 ITVYGSAF-CVMLSMHFTLQLLSQHLFYWKNPKEQKAIIIIILMAPIYAIVSFVGLLDIR 320
Query: 66 ASSIYF---NSIREVYEAWVIYNFLSL 89
S +F SI+E YEA+VI FLSL
Sbjct: 321 GSKEFFTLLESIKECYEAFVIAKFLSL 401
>TC83919 similar to GP|17065412|gb|AAL32860.1 Unknown protein {Arabidopsis
thaliana}, partial (61%)
Length = 926
Score = 32.0 bits (71), Expect = 0.46
Identities = 20/64 (31%), Positives = 35/64 (54%)
Frame = +2
Query: 5 LPIYLYIVAFLCTIGAVSLALLHIYKHLVNYTEPTYQRYIVRIVFMVPVYALMSFLSLVL 64
+P Y+V FLCT+GA +L Y L++ + T+++ + + F V V + + SLV
Sbjct: 587 IPKGKYVVGFLCTLGASAL-----YSLLLSLMQLTFEKVLKKETFSV-VLEMQIYTSLVA 748
Query: 65 PASS 68
+S
Sbjct: 749 TCAS 760
>TC78374
Length = 1185
Score = 30.8 bits (68), Expect = 1.0
Identities = 14/27 (51%), Positives = 18/27 (65%)
Frame = -1
Query: 343 KHMFENKVDDIQLSRFSSSNSSTPSNS 369
+H+F +V + SRFSS N ST SNS
Sbjct: 1110 EHLFNTRVSSVGGSRFSSGNDSTFSNS 1030
>TC92429 similar to PIR|T46092|T46092 hypothetical protein T20E23.210 -
Arabidopsis thaliana, partial (10%)
Length = 675
Score = 30.4 bits (67), Expect = 1.3
Identities = 12/26 (46%), Positives = 17/26 (65%)
Frame = +3
Query: 171 YLYLTIIYTFSYTMALYALALFYVAC 196
YL++ + Y F YTM +AL L +AC
Sbjct: 48 YLFVLMAYKFQYTMKYFALFLALIAC 125
>TC85922 similar to SP|P56807|RR18_ARATH Chloroplast 30S ribosomal protein
S18. [Mouse-ear cress] {Arabidopsis thaliana}, partial
(74%)
Length = 3196
Score = 30.0 bits (66), Expect = 1.8
Identities = 18/57 (31%), Positives = 32/57 (55%), Gaps = 4/57 (7%)
Frame = -3
Query: 8 YLYIVAFLCTIGAVSLALLHIYKHLVNYTEPT----YQRYIVRIVFMVPVYALMSFL 60
Y + FL ++ L LL+IY +++ Y+ + Y YIV I + P+Y+ +S+L
Sbjct: 1748 YFKFLYFLLYFNSIILMLLYIYLYIL-YSLCSCYYYYTLYIVNIYIIYPIYSSISYL 1581
>AW774707 similar to GP|22795253|gb unknown protein {Oryza sativa (japonica
cultivar-group)}, partial (15%)
Length = 729
Score = 29.3 bits (64), Expect = 3.0
Identities = 24/74 (32%), Positives = 38/74 (50%), Gaps = 6/74 (8%)
Frame = +3
Query: 340 RRNKHMFENKVDDIQLSR----FSSSNSSTPSNSGPISDVSHSDAMKS--SLLVDVSNSV 393
RRN H+ +NK+D S FS SN+ S + S + K+ + L++++ +
Sbjct: 6 RRNHHILKNKIDSHLQSNGPLCFSPSNNMWGSKN*CSSLFNQH*YSKNDCTKLINITIAR 185
Query: 394 SVPYDLTLIDLDLS 407
P LTLI +DLS
Sbjct: 186 KAPQVLTLIHIDLS 227
>TC91992 similar to GP|8099125|dbj|BAA90497.1 rice EST C27893 corresponds to
a region of the predicated gene; unknown protein {Oryza
sativa}, partial (28%)
Length = 733
Score = 29.3 bits (64), Expect = 3.0
Identities = 18/60 (30%), Positives = 25/60 (41%), Gaps = 1/60 (1%)
Frame = +1
Query: 262 YAFPYKEYAGANIGGVSRGFTASLGHALMLNDFYHDTVHQFAPTYHDYVL-YNHGEGEEG 320
Y +PY G+ IGG+ G A+ G H H +H Y Y HG+ + G
Sbjct: 313 YGYPYPSGRGSGIGGLIAGAAAAYG--------AHHLSHGHGGYHHGYGYGYGHGKYKHG 468
>TC78063 similar to GP|14194099|gb|AAK56244.1 AT3g52990/F8J2_160
{Arabidopsis thaliana}, complete
Length = 1843
Score = 28.9 bits (63), Expect = 3.9
Identities = 23/105 (21%), Positives = 41/105 (38%)
Frame = +2
Query: 304 PTYHDYVLYNHGEGEEGTRKYRSRTFVPLGPEMDAVRRNKHMFENKVDDIQLSRFSSSNS 363
P YH L N + T+K + +GPE+ V + + D L N
Sbjct: 206 PEYHQETLENLRAAIKSTKKLCAVMLDTVGPELQVVNKTDRPITLEA-DTSLVLTPDQNK 382
Query: 364 STPSNSGPISDVSHSDAMKSSLLVDVSNSVSVPYDLTLIDLDLSD 408
SN P++ S A+K + + + + T + L++S+
Sbjct: 383 EATSNLLPVNFSGLSKAVKKGDTIFIGKYLFTGSETTSVWLEVSE 517
>TC78728 similar to PIR|E96523|E96523 hypothetical protein F11A17.10
[imported] - Arabidopsis thaliana, partial (68%)
Length = 762
Score = 28.1 bits (61), Expect = 6.7
Identities = 14/53 (26%), Positives = 26/53 (48%), Gaps = 7/53 (13%)
Frame = +1
Query: 331 PLGPEMDAVRRNKHMFENKVDDIQLSRFSSSNSSTPS-------NSGPISDVS 376
P P + R NKH+F +DD ++ +S+++ + SGP +V+
Sbjct: 241 PERPRLSVFRSNKHLFVQVIDDTKMHTLASASTMQKAIAEELNFTSGPTIEVA 399
>BQ144271 weakly similar to GP|21322711|em pherophorin-dz1 protein {Volvox
carteri f. nagariensis}, partial (5%)
Length = 919
Score = 27.7 bits (60), Expect = 8.7
Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 1/65 (1%)
Frame = -3
Query: 35 YTEPTYQRYIVRIVFMVPVYALMSFL-SLVLPASSIYFNSIREVYEAWVIYNFLSLCLAW 93
+T P+Y ++ R++ P Y L S+V S + F WV++ L W
Sbjct: 431 FTYPSYPLFLFRVISFSPFYFPGGPLGSIVGSPSGLGFRCD----VGWVVFMCWLLLSYW 264
Query: 94 VGGPG 98
GGPG
Sbjct: 263 EGGPG 249
>TC85714 similar to GP|14194099|gb|AAK56244.1 AT3g52990/F8J2_160
{Arabidopsis thaliana}, complete
Length = 2146
Score = 27.7 bits (60), Expect = 8.7
Identities = 22/105 (20%), Positives = 45/105 (41%)
Frame = +1
Query: 304 PTYHDYVLYNHGEGEEGTRKYRSRTFVPLGPEMDAVRRNKHMFENKVDDIQLSRFSSSNS 363
P YH L N +GT+K + +G EM V +++ ++ D Q+ +
Sbjct: 406 PEYHQETLENLKTAIKGTKKLCAVMLDTVGAEMQVVNKSETTISLEI-DAQVVLTPNQGQ 582
Query: 364 STPSNSGPISDVSHSDAMKSSLLVDVSNSVSVPYDLTLIDLDLSD 408
S PI+ + A+K+ + + + + T + L++S+
Sbjct: 583 EASSEILPINFDGLAQAVKTGDTIFIGQYLFTGSETTSVWLEVSE 717
>TC92575 weakly similar to GP|21428434|gb|AAM49877.1 LD13350p {Drosophila
melanogaster}, partial (3%)
Length = 557
Score = 27.7 bits (60), Expect = 8.7
Identities = 16/46 (34%), Positives = 24/46 (51%)
Frame = +3
Query: 139 QFVILKPILVAVTLILYVKGKYKDGNFSPKQSYLYLTIIYTFSYTM 184
QFVILKPI++ + L NF P ++L L + FS+ +
Sbjct: 444 QFVILKPIIIFLILF----------NFIPSINFLILF*FFPFSFLL 551
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.325 0.141 0.428
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,546,859
Number of Sequences: 36976
Number of extensions: 237630
Number of successful extensions: 1851
Number of sequences better than 10.0: 33
Number of HSP's better than 10.0 without gapping: 1825
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1844
length of query: 422
length of database: 9,014,727
effective HSP length: 99
effective length of query: 323
effective length of database: 5,354,103
effective search space: 1729375269
effective search space used: 1729375269
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 60 (27.7 bits)
Lotus: description of TM0048.4