
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0106.4
(423 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC93172 44 2e-04
BI270589 similar to PIR|C96829|C96 unknown protein F19K16.21 [im... 40 0.001
TC86622 weakly similar to GP|20197623|gb|AAM15156.1 putative myo... 40 0.002
TC86877 similar to GP|8953376|emb|CAB96649.1 putative protein {A... 37 0.019
TC80942 GP|21112489|gb|AAM40721.1 conserved hypothetical protein... 34 0.093
TC86727 similar to GP|4432859|gb|AAD20707.1| unknown protein {Ar... 34 0.12
AL365936 33 0.21
TC88788 weakly similar to SP|P05790|FBOH_BOMMO Fibroin heavy cha... 32 0.60
TC76445 similar to PIR|T07623|T07623 extensin homolog HRGP2 - so... 31 1.0
BQ751066 31 1.0
TC87847 similar to GP|15929245|gb|AAH15068.1 Similar to adaptor-... 30 1.3
TC92273 similar to GP|21622503|emb|CAD37050. related to OSBP-rel... 30 2.3
CA860047 weakly similar to GP|15081680|gb| AT3g12390/T2E22_130 {... 30 2.3
TC81780 homologue to GP|20521948|dbj|BAB13408. KIAA1582 protein ... 30 2.3
TC76311 homologue to GP|3204132|emb|CAA07235.1 extensin {Cicer a... 30 2.3
TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~un... 29 3.0
TC88854 similar to GP|7767667|gb|AAF69164.1| F27F5.26 {Arabidops... 29 3.9
TC76326 homologue to PIR|C29356|C29356 hydroxyproline-rich glyco... 29 3.9
BQ752101 similar to PIR|S57177|S57 branched-chain-amino-acid tra... 29 3.9
TC83200 weakly similar to GP|21554083|gb|AAM63164.1 unknown {Ara... 28 5.1
>TC93172
Length = 861
Score = 43.5 bits (101), Expect = 2e-04
Identities = 36/186 (19%), Positives = 84/186 (44%), Gaps = 9/186 (4%)
Frame = +1
Query: 230 IEAALVKLKGLVFDAGFIGSVRVNPQLGQEVKELLAFL--LSQQLDQIQADALVELQTLV 287
I+ L +L+ LVF + + + LG+EVK LL L + +L + Q+ + + +
Sbjct: 115 IKMLLDELRDLVFSRNLLKHLPNDVTLGEEVKALLVKLNYRANELSEKQSSGITDFARIF 294
Query: 288 VDIVATLDKAIAVEQSIETKKAQVASNTNGLLPAKEEVLK----LTARRTLIAA---ELS 340
+ +D+ +++ + + L +K +++K ++A I A E+
Sbjct: 295 TEATVNIDEGKLGNVTLQHLNVDHKDSMSKLQASKYKIMKFDESISAAEDKIKARDVEIE 474
Query: 341 SVDARLDELHREVARLEKQKAMLIDEGPAVNSRLNTLAAECTGVMLSTSQLSKEVAAEQR 400
+ A++ L + +++++K+ L D + + E V +T Q +++ ++
Sbjct: 475 DIKAQIRLLEEKARKVQQEKSQLEDACSKCKEKRTEIVEEAKNVASTTIQTREKIDNLKK 654
Query: 401 MKQSLD 406
K+ LD
Sbjct: 655 KKRELD 672
>BI270589 similar to PIR|C96829|C96 unknown protein F19K16.21 [imported] -
Arabidopsis thaliana, partial (20%)
Length = 651
Score = 40.4 bits (93), Expect = 0.001
Identities = 41/146 (28%), Positives = 67/146 (45%), Gaps = 2/146 (1%)
Frame = +3
Query: 268 LSQQLDQIQADALVELQTLVVDIVATLDKAIAVEQSIETKKAQVASNTNGLLPAKEEVLK 327
L Q L + + A+ + L DI + A E+ E QV +T LL E +
Sbjct: 33 LRQTLSRKEQQAVFKEDMLCRDIEDLQKRYQASERRCEELITQVPESTRPLLRQIEAMQD 212
Query: 328 LTARRTLI-AAELSSVDARLDELHREVARLEKQKAMLIDEGPAVNSRLNTLAAECTGVML 386
ARR AA ++++RL E + A E+++ + D SR+N L A+ + +
Sbjct: 213 SNARRAEAWAAVERTLNSRLQEAEAKAATAEERERSVNDRLSQTLSRINVLEAQISCLRA 392
Query: 387 STSQLSKEVAAE-QRMKQSLDCRIAA 411
+QLS+ + E QR +S +AA
Sbjct: 393 EQTQLSRTLEKERQRAAESRQEYLAA 470
>TC86622 weakly similar to GP|20197623|gb|AAM15156.1 putative myosin heavy
chain {Arabidopsis thaliana}, partial (17%)
Length = 2546
Score = 40.0 bits (92), Expect = 0.002
Identities = 42/192 (21%), Positives = 81/192 (41%), Gaps = 12/192 (6%)
Frame = +2
Query: 227 DSRIEAALVKLKGLVFDAGFIGSVRVNPQLGQEVKELLAFLLSQQLDQIQADA-LVELQT 285
DS +++ L K+K + + G+ + L E +E L+ L S Q + +VE +
Sbjct: 509 DSEVQSLLEKIK--ILEENIAGAGEQSISLKSEFEESLSKLASLQSENEDLKRQIVEAEK 682
Query: 286 LVVDIVATLDKAIAVEQSIETKKAQVASNTNGLLPAKEEVLKLTARRTLIAAELSSVDAR 345
+ + + ++TK ++ + N ++ KE + + AEL+ V ++
Sbjct: 683 KTSQSFSENELLVGTNIQLKTKIDELQESLNSVVSEKEVTAQELVSHKNLLAELNDVQSK 862
Query: 346 LDELH--REVARLEKQKAM---------LIDEGPAVNSRLNTLAAECTGVMLSTSQLSKE 394
E+H EV LE + + E +N +LNTL + + + Q +
Sbjct: 863 SSEIHSANEVRILEVESKLQEALQKHTEKESETKELNEKLNTLEGQ---IKIYEEQAHEA 1033
Query: 395 VAAEQRMKQSLD 406
VAA + K L+
Sbjct: 1034VAAAENRKAELE 1069
>TC86877 similar to GP|8953376|emb|CAB96649.1 putative protein {Arabidopsis
thaliana}, partial (12%)
Length = 1558
Score = 36.6 bits (83), Expect = 0.019
Identities = 36/119 (30%), Positives = 55/119 (45%), Gaps = 4/119 (3%)
Frame = +2
Query: 259 EVKELLAFLLSQQLDQIQADALVELQTLVVDIVATLDKAIAVEQSIETKKAQVASNTNGL 318
+VKELLA L + Q+ L VV+ +A K I +S+E KA S L
Sbjct: 1076 KVKELLAILNDVESAQLHVTWL----RTVVNEIAEYIKLIDEHRSVEAAKANSDSEMESL 1243
Query: 319 ---LPAKEEVLKLTARR-TLIAAELSSVDARLDELHREVARLEKQKAMLIDEGPAVNSR 373
L +K E+L R T I ++ + RL EL + + LEK + + + ++SR
Sbjct: 1244 RKELESKVEILTQKEREVTDIKTKIEGIRERLGELEMKSSDLEKNRLSIKSKVDNLDSR 1420
>TC80942 GP|21112489|gb|AAM40721.1 conserved hypothetical protein
{Xanthomonas campestris pv. campestris str. ATCC 33913},
partial (9%)
Length = 482
Score = 34.3 bits (77), Expect = 0.093
Identities = 27/104 (25%), Positives = 45/104 (42%), Gaps = 18/104 (17%)
Frame = -3
Query: 67 YTKECRDNLS-NFP-PQDMQNLPSPAREEN-----ILHDEVQGGEDPIIQPDQVIGVMPP 119
+T + + N + FP P+ + ++P P R N H Q P ++G+ P
Sbjct: 384 FTHQTQQNQTMKFPKPKQIMSMPQPPRLRNQSLK*FTHQTPQNLTMKFPNPKLIMGMPRP 205
Query: 120 PKA---DVLASAHSEPP--------PRQLQTSPQQPRLSPQRIQ 152
PK+ + S H P P+Q+ + PQ PRL Q ++
Sbjct: 204 PKSRNPSLKQSTHQTPQNQTMKFPKPKQIMSMPQPPRLRNQSLK 73
>TC86727 similar to GP|4432859|gb|AAD20707.1| unknown protein {Arabidopsis
thaliana}, partial (27%)
Length = 1852
Score = 33.9 bits (76), Expect = 0.12
Identities = 46/219 (21%), Positives = 85/219 (38%), Gaps = 8/219 (3%)
Frame = +2
Query: 26 SGLESSSSSLDSTESDPVWDKLTFYFNSKPIAWQHPGCEPYYTKECRDNLSNFPPQDMQN 85
S +++ + E + KL F P +Q P K+ + P +
Sbjct: 884 SNIQAKTKESQEAEIKRLRKKLAFKATPMPSFYQEPTPSRVELKKIPTTRAKSPKLGRKK 1063
Query: 86 LPSPAREENILHDEVQGGEDPIIQPDQVIGVMPPPKADVLASAHSEPPPRQLQTSPQQPR 145
+ + E ++ + + + D+ + P K +H +P Q ++ P PR
Sbjct: 1064 SSTMSSELDV--NSNSSAQQCRLSLDEKVSQNNPTKG----ISHVQPKKPQRRSLP--PR 1219
Query: 146 LSPQRIQQD----ARAALISPHAKGGSASS----KTAASSHTSSERLSALLVEDPLSIFQ 197
L+P+RI AR + + H + S SS T S T E++ A + S F
Sbjct: 1220 LTPERISSSNSVTARTSSKAVHDEKTSLSSVTTEVTTLSVATREEKVEAAAAIEENSAFS 1399
Query: 198 SFFDGTLDLESPPRQAEIAETAQSGGVPDDSRIEAALVK 236
GT L P ++AE+ +G + + + + LV+
Sbjct: 1400 DETSGTPSLNIEP---DVAESQLNGDIVIEDKPQLILVQ 1507
>AL365936
Length = 441
Score = 33.1 bits (74), Expect = 0.21
Identities = 27/104 (25%), Positives = 44/104 (41%), Gaps = 18/104 (17%)
Frame = +2
Query: 67 YTKECRDNLS-NFP-PQDMQNLPSPAREEN-----ILHDEVQGGEDPIIQPDQVIGVMPP 119
+T + NL+ FP P+ + +P P + N H Q P ++G+ P
Sbjct: 83 FTHQTPQNLTMKFPNPKLIMGMPRPPKSRNPSLKQSTHQTPQNQTMKFPNPKLIMGMPRP 262
Query: 120 PKA---DVLASAHSEPP--------PRQLQTSPQQPRLSPQRIQ 152
PK+ + S H P P+Q+ + PQ PRL Q ++
Sbjct: 263 PKSRNPSLKQSTHQTPQNQTMKFPKPKQIMSMPQPPRLRNQSLK 394
>TC88788 weakly similar to SP|P05790|FBOH_BOMMO Fibroin heavy chain
precursor (Fib-H) (H-fibroin). [Silk moth] {Bombyx
mori}, partial (2%)
Length = 782
Score = 31.6 bits (70), Expect = 0.60
Identities = 27/89 (30%), Positives = 36/89 (40%), Gaps = 3/89 (3%)
Frame = -2
Query: 87 PSPAREENILHDEVQGGEDPIIQPDQVIGVMPPPKADVLASAHSEPPPRQLQTSPQQPRL 146
P PAR+ EDP PD + P P+ D + PPP + P
Sbjct: 355 PDPARDPAWEPASEPAPEDPDPDPDPEVEAEPDPEPDPAPAPEPAPPPPPSPSPKPAPNP 176
Query: 147 SPQRIQQDAR---AALISPHAKGGSASSK 172
P + + R AAL+S HA A+SK
Sbjct: 175 IPTPPKSNPRSFLAALLS-HAITFIATSK 92
>TC76445 similar to PIR|T07623|T07623 extensin homolog HRGP2 - soybean
(fragment), partial (76%)
Length = 821
Score = 30.8 bits (68), Expect = 1.0
Identities = 35/139 (25%), Positives = 41/139 (29%), Gaps = 9/139 (6%)
Frame = +2
Query: 15 KPPPIALSSSPSGLESSSSSLDSTESDPVWDKLTFYFNSKPIAWQHPGCEPYYTKECRDN 74
K PP S PS S S P +Y+ S P P YY +
Sbjct: 29 KSPPPPSPSPPSPYYYKSPPPPSPSPPP-----PYYYKSPPPPSPSPPPPYYYKSPPPPS 193
Query: 75 LSNFPPQDMQNLPSPAREENILHDEVQGGEDPIIQPDQVIGVMPPPKADVLASAH----- 129
S PP Q+ P P+ PI P PPP H
Sbjct: 194 PSPPPPYHYQSPPPPS---------------PISHPPNYYKSPPPPSPSPPPPYHYVSPP 328
Query: 130 ----SEPPPRQLQTSPQQP 144
S PPP + SP P
Sbjct: 329 PPVKSPPPPAYIYASPPPP 385
>BQ751066
Length = 750
Score = 30.8 bits (68), Expect = 1.0
Identities = 20/59 (33%), Positives = 29/59 (48%), Gaps = 3/59 (5%)
Frame = +2
Query: 132 PPPRQ---LQTSPQQPRLSPQRIQQDARAALISPHAKGGSASSKTAASSHTSSERLSAL 187
PPP L+ SP RL P + ++ +SP A AS K A ++ + S LSA+
Sbjct: 296 PPPSTQWLLRQSPPTARLPPMAMSTMTQST*LSPSAALARASPKRAIATRSPSSTLSAV 472
>TC87847 similar to GP|15929245|gb|AAH15068.1 Similar to adaptor-related
protein complex AP-3 beta 1 subunit {Mus musculus},
partial (2%)
Length = 931
Score = 30.4 bits (67), Expect = 1.3
Identities = 31/136 (22%), Positives = 50/136 (35%), Gaps = 3/136 (2%)
Frame = +1
Query: 68 TKECRDNLSNFPPQDMQNLPSPAREENILHDEVQGGEDPIIQPDQVIGVMPPPKADVLAS 127
+K+ + N PP + P PA E+ + G + +P+ PPP V
Sbjct: 277 SKDSKPASKNPPPSTPISNPKPAESESGSESGSESGTESDSEPEHKSEPTPPPNPKVKPL 456
Query: 128 AHSEPPPRQLQTSPQQ---PRLSPQRIQQDARAALISPHAKGGSASSKTAASSHTSSERL 184
A S+P Q Q Q P S + ++ S +K + ++ + E
Sbjct: 457 A-SKPMKTQTQAQAQSTPVPIKSGTKRVAESSTGNDSKRSKKKTTTAGGGSDDENEVEED 633
Query: 185 SALLVEDPLSIFQSFF 200
+ L ED FQ F
Sbjct: 634 AKLTGEDSKKNFQRVF 681
>TC92273 similar to GP|21622503|emb|CAD37050. related to OSBP-related
protein 7 {Neurospora crassa}, partial (36%)
Length = 1517
Score = 29.6 bits (65), Expect = 2.3
Identities = 15/33 (45%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Frame = -1
Query: 133 PPRQLQTSPQQPR-LSPQRIQQDARAALISPHA 164
PPR L SP+Q + L+ QR+++ A AL+ P A
Sbjct: 530 PPRNLSLSPKQLKSLAKQRVERFANCALLRPAA 432
>CA860047 weakly similar to GP|15081680|gb| AT3g12390/T2E22_130 {Arabidopsis
thaliana}, partial (37%)
Length = 543
Score = 29.6 bits (65), Expect = 2.3
Identities = 18/40 (45%), Positives = 23/40 (57%)
Frame = -3
Query: 16 PPPIALSSSPSGLESSSSSLDSTESDPVWDKLTFYFNSKP 55
P IA+SSS SG+ SS SS DST + +F+S P
Sbjct: 295 PDGIAVSSSNSGISSSESSSDST--------VVVFFSSAP 200
>TC81780 homologue to GP|20521948|dbj|BAB13408. KIAA1582 protein {Homo
sapiens}, partial (1%)
Length = 787
Score = 29.6 bits (65), Expect = 2.3
Identities = 14/35 (40%), Positives = 23/35 (65%)
Frame = +3
Query: 9 ATCFSSKPPPIALSSSPSGLESSSSSLDSTESDPV 43
AT FSS PPPI SS+ + +++++ +T S P+
Sbjct: 9 ATIFSSSPPPIFSSSTTTTTTTTTTTTTTTISKPL 113
>TC76311 homologue to GP|3204132|emb|CAA07235.1 extensin {Cicer arietinum},
partial (77%)
Length = 973
Score = 29.6 bits (65), Expect = 2.3
Identities = 33/133 (24%), Positives = 41/133 (30%)
Frame = +2
Query: 12 FSSKPPPIALSSSPSGLESSSSSLDSTESDPVWDKLTFYFNSKPIAWQHPGCEPYYTKEC 71
+ S PPP S SP S + S P +Y+ S P P +Y
Sbjct: 83 YKSPPPP---SPSPPPPYYYKSPPPPSPSPPP----PYYYQSPPPPSPTPHTPYHYKSPP 241
Query: 72 RDNLSNFPPQDMQNLPSPAREENILHDEVQGGEDPIIQPDQVIGVMPPPKADVLASAHSE 131
S PP + P P H P P + PPP S
Sbjct: 242 PPTASPPPPYHYVSPPPPTSSPPPYHYTSPPPPSPAPAPTYIYKSPPPP-------MKSP 400
Query: 132 PPPRQLQTSPQQP 144
PPP + SP P
Sbjct: 401 PPPVYIYASPPPP 439
>TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~unknown
protein {Arabidopsis thaliana}, partial (46%)
Length = 2077
Score = 29.3 bits (64), Expect = 3.0
Identities = 20/47 (42%), Positives = 26/47 (54%), Gaps = 6/47 (12%)
Frame = -2
Query: 13 SSKPPPIAL-----SSSPSGLESSSSSLDSTE-SDPVWDKLTFYFNS 53
SS PPP+ SSS S L SSSSS S++ S P F+F++
Sbjct: 576 SSIPPPLTFSFSSSSSSSSSLPSSSSSSSSSKSSSPSLPPFAFFFST 436
>TC88854 similar to GP|7767667|gb|AAF69164.1| F27F5.26 {Arabidopsis
thaliana}, partial (22%)
Length = 1043
Score = 28.9 bits (63), Expect = 3.9
Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Frame = +1
Query: 140 SPQQPRLSPQRIQQDARAALISPHAKGGSASSK--TAASSHTSSE 182
S P+ P+R+ + R +L SP ++GGS+S+ + TSSE
Sbjct: 34 SRPNPQQYPRRLSEYVRRSLFSPGSEGGSSSNNYPSLRGPSTSSE 168
>TC76326 homologue to PIR|C29356|C29356 hydroxyproline-rich glycoprotein
(clone Hyp2.13) - kidney bean (fragment), partial (32%)
Length = 777
Score = 28.9 bits (63), Expect = 3.9
Identities = 33/138 (23%), Positives = 41/138 (28%), Gaps = 9/138 (6%)
Frame = +1
Query: 16 PPPIALSSSPSGLESSSSSLDSTESDPVWDKLTFYFNSKPIAWQHPGCEPYYTKECRDNL 75
PPP + S P S + P +Y+ S P P YY +
Sbjct: 10 PPPPSPSPPPPYYYKSPPPPSPSPPPP------YYYKSPPPPSPSPPPPYYYKSPPPPSP 171
Query: 76 SNFPPQDMQNLPSPAREENILHDEVQGGEDPIIQPDQVIGVMPPPKADVLASAH------ 129
S PP Q+ P P+ PI P PPP H
Sbjct: 172 SPPPPYHYQSPPPPS---------------PISHPPYYYKSPPPPSPSPPPPYHYVSPPP 306
Query: 130 ---SEPPPRQLQTSPQQP 144
S PPP + SP P
Sbjct: 307 PVKSPPPPTYIYASPPPP 360
>BQ752101 similar to PIR|S57177|S57 branched-chain-amino-acid transaminase
(EC 2.6.1.42) BAT2 cytosolic - yeast, partial (46%)
Length = 720
Score = 28.9 bits (63), Expect = 3.9
Identities = 13/31 (41%), Positives = 18/31 (57%)
Frame = -1
Query: 2 SKMAVPRATCFSSKPPPIALSSSPSGLESSS 32
S A+P CF+S PPP A ++ P+ S S
Sbjct: 696 SASALPAPPCFTSSPPPSAPTTLPASKPSPS 604
>TC83200 weakly similar to GP|21554083|gb|AAM63164.1 unknown {Arabidopsis
thaliana}, partial (71%)
Length = 565
Score = 28.5 bits (62), Expect = 5.1
Identities = 18/57 (31%), Positives = 26/57 (45%)
Frame = -3
Query: 106 PIIQPDQVIGVMPPPKADVLASAHSEPPPRQLQTSPQQPRLSPQRIQQDARAALISP 162
P QP +PPP++ V +S+ PP L SP P L P + ++SP
Sbjct: 476 PPKQPKAQKPXLPPPQSPV---PNSQAPPPSLSPSPPPPILVPPSQVLSPPSPILSP 315
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.312 0.128 0.352
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,405,874
Number of Sequences: 36976
Number of extensions: 181960
Number of successful extensions: 1323
Number of sequences better than 10.0: 67
Number of HSP's better than 10.0 without gapping: 1170
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1269
length of query: 423
length of database: 9,014,727
effective HSP length: 99
effective length of query: 324
effective length of database: 5,354,103
effective search space: 1734729372
effective search space used: 1734729372
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 60 (27.7 bits)
Lotus: description of TM0106.4