
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0010.8
(430 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BQ750853 similar to GP|13562018|gb fibroin 3 {Plectreurys tristi... 41 8e-04
TC81880 similar to PIR|B85431|B85431 trichohyalin like protein [... 36 0.025
TC88557 weakly similar to GP|17065320|gb|AAL32814.1 Unknown prot... 35 0.073
TC88558 weakly similar to GP|18481426|gb|AAL73441.1 telomere rep... 35 0.073
TC81530 similar to GP|10177440|dbj|BAB10736. gb|AAC55944.1~gene_... 34 0.095
AW776489 similar to GP|18481426|gb| telomere repeat binding fact... 34 0.12
TC76659 weakly similar to PIR|T09127|T09127 probable erythrocyte... 33 0.16
BQ137255 homologue to PIR|B34768|B347 ORF5 protein - Orf virus (... 33 0.16
TC83557 weakly similar to PIR|H96535|H96535 hypothetical protein... 33 0.16
TC88509 similar to PIR|T01826|T01826 microfibril-associated prot... 33 0.21
TC87229 homologue to GP|16226927|gb|AAL16300.1 At1g63720/F24D7_9... 33 0.28
AW685995 similar to PIR|I51618|I516 nucleolar phosphoprotein - A... 32 0.36
BQ137234 similar to EGAD|35719|3709 hypothetical protein {Burkho... 32 0.47
BQ144502 weakly similar to GP|6523547|emb hydroxyproline-rich gl... 31 1.1
BQ137254 similar to GP|22946423|gb| CG31813-PA {Drosophila melan... 30 1.4
BQ135692 GP|18676552|d FLJ00173 protein {Homo sapiens}, partial ... 30 1.4
TC85190 similar to GP|8894548|emb|CAB95829.1 hypothetical protei... 29 3.1
TC85152 similar to GP|8894548|emb|CAB95829.1 hypothetical protei... 29 3.1
TC87527 similar to GP|21689681|gb|AAM67462.1 unknown protein {Ar... 29 4.0
AL369112 weakly similar to GP|13540393|gb| histone H1 {Pisum sat... 29 4.0
>BQ750853 similar to GP|13562018|gb fibroin 3 {Plectreurys tristis}, partial
(0%)
Length = 692
Score = 41.2 bits (95), Expect = 8e-04
Identities = 25/71 (35%), Positives = 36/71 (50%), Gaps = 10/71 (14%)
Frame = -3
Query: 298 KPLVKDVRPPQKPPDVSICDGAPVMA-------TRPPL---KLPDMFMCGRDPCFYAGES 347
KP+ + +RP Q+P V + G P+ TRPPL +LP R+P + +
Sbjct: 624 KPVRQGLRPGQQPVCVQLPRGQPLRCHLPPHSRTRPPLPALQLPPRQPLPRNPAAQSSPA 445
Query: 348 WQLVTANRPPA 358
W +VT RPPA
Sbjct: 444 WSVVTTTRPPA 412
>TC81880 similar to PIR|B85431|B85431 trichohyalin like protein [imported] -
Arabidopsis thaliana, partial (4%)
Length = 1043
Score = 36.2 bits (82), Expect = 0.025
Identities = 32/95 (33%), Positives = 43/95 (44%), Gaps = 4/95 (4%)
Frame = +2
Query: 8 ALDRLD-ESLKRSHEAARRREEVRQGEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVA 66
A+DR E+ R++ R R E AA++R AE Q A + + A A
Sbjct: 599 AVDRATFEARDRAYAEGRERAE-------RAAFDRATAEARQRALAEAKERLEKACAEAR 757
Query: 67 MAEEENAAAVEREVEAER---ERAAAEAEEDAAEK 98
+ A E ++AER ERA AEA E A EK
Sbjct: 758 DKSYTDKATAEARLKAERAAVERATAEARERAMEK 862
>TC88557 weakly similar to GP|17065320|gb|AAL32814.1 Unknown protein
{Arabidopsis thaliana}, partial (38%)
Length = 673
Score = 34.7 bits (78), Expect = 0.073
Identities = 22/45 (48%), Positives = 28/45 (61%)
Frame = +2
Query: 60 AAAAAVAMAEEENAAAVEREVEAERERAAAEAEEDAAEKYEQSAT 104
AAAAA A+AE E AA+ A RE AEAE +AA + ++AT
Sbjct: 365 AAAAAKAVAEAE--AAIAEAETAAREAETAEAEAEAARVFAKAAT 493
>TC88558 weakly similar to GP|18481426|gb|AAL73441.1 telomere repeat binding
factor 2 {Arabidopsis thaliana}, partial (27%)
Length = 802
Score = 34.7 bits (78), Expect = 0.073
Identities = 22/45 (48%), Positives = 28/45 (61%)
Frame = +3
Query: 60 AAAAAVAMAEEENAAAVEREVEAERERAAAEAEEDAAEKYEQSAT 104
AAAAA A+AE E AA+ A RE AEAE +AA + ++AT
Sbjct: 237 AAAAAKAVAEAE--AAIAEAETAAREAETAEAEAEAARVFAKAAT 365
>TC81530 similar to GP|10177440|dbj|BAB10736.
gb|AAC55944.1~gene_id:K9H21.2~similar to unknown protein
{Arabidopsis thaliana}, partial (24%)
Length = 1135
Score = 34.3 bits (77), Expect = 0.095
Identities = 25/77 (32%), Positives = 38/77 (48%)
Frame = +1
Query: 11 RLDESLKRSHEAARRREEVRQGEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVAMAEE 70
R +++ ++ E A ++E R E+ E ++ A + EAKAA A AE
Sbjct: 364 RFADTILKAQEKALEKDEKRDPEKLRIEREELERRQKEEKARLQA-EAKAAEEARRKAEA 540
Query: 71 ENAAAVEREVEAERERA 87
E AA +R+ E ERE A
Sbjct: 541 EAAAEAKRKRELEREAA 591
>AW776489 similar to GP|18481426|gb| telomere repeat binding factor 2
{Arabidopsis thaliana}, partial (18%)
Length = 640
Score = 33.9 bits (76), Expect = 0.12
Identities = 20/60 (33%), Positives = 32/60 (53%)
Frame = +1
Query: 44 AEYEQYMAAYERYEAKAAAAAVAMAEEENAAAVEREVEAERERAAAEAEEDAAEKYEQSA 103
++ E ++ A+ AAAA A A E A+ + A RE AEAE +AA+ + ++A
Sbjct: 178 SQIEAELSKVRGMSAQEAAAAAAKAVAEAEVAIAQAEAAAREAEIAEAEAEAAQVFAKAA 357
>TC76659 weakly similar to PIR|T09127|T09127 probable erythrocyte-binding
protein MAEBL - Plasmodium yoelii, partial (2%)
Length = 843
Score = 33.5 bits (75), Expect = 0.16
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 2/78 (2%)
Frame = +1
Query: 27 EEVRQGEEAAAAYERHVAEYEQYMAA--YERYEAKAAAAAVAMAEEENAAAVEREVEAER 84
E+ RQ E R+ A++E+ + ++ YE K A EEE+ +E E EAE
Sbjct: 415 EKRRQSELFEIRRRRNSAKFEEQRMSIFFKNYEEKFCAEMERWWEEEDKKFLEEEKEAE- 591
Query: 85 ERAAAEAEEDAAEKYEQS 102
E E EED + E++
Sbjct: 592 EELLKEIEEDRKQAEEEA 645
>BQ137255 homologue to PIR|B34768|B347 ORF5 protein - Orf virus (strain NZ2),
partial (10%)
Length = 1161
Score = 33.5 bits (75), Expect = 0.16
Identities = 32/104 (30%), Positives = 42/104 (39%), Gaps = 12/104 (11%)
Frame = +3
Query: 10 DRLDESLKRSHEAARRREEVRQGEEAAAAYERHVAEYEQYMAAYERY--EAKAAA----- 62
D +DES ARRR+ R+ A AA R E + A + R E +A A
Sbjct: 831 DEIDESADTRAARARRRDSRRRASRATAARARQTKERRERQATHARVLDETRAHADGRDT 1010
Query: 63 -AAVAMAEEENAAAVEREVEAE----RERAAAEAEEDAAEKYEQ 101
A M + +AA E A+ R RA A A A E+
Sbjct: 1011 RARATMTRDVASAARHGETRADACAARARARAAARTGRARGSER 1142
>TC83557 weakly similar to PIR|H96535|H96535 hypothetical protein F2J10.16
[imported] - Arabidopsis thaliana, partial (24%)
Length = 552
Score = 33.5 bits (75), Expect = 0.16
Identities = 27/62 (43%), Positives = 33/62 (52%), Gaps = 2/62 (3%)
Frame = +3
Query: 27 EEVRQ--GEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVAMAEEENAAAVEREVEAER 84
E++R +EAAA R VAE E MA E +A AA E A A+E VEAER
Sbjct: 48 EKIRSMSAQEAAAYAARAVAEAEALMAEAEEATKEAEAA------EAEADAMEAFVEAER 209
Query: 85 ER 86
+R
Sbjct: 210 KR 215
Score = 32.0 bits (71), Expect = 0.47
Identities = 19/45 (42%), Positives = 25/45 (55%)
Frame = +3
Query: 58 AKAAAAAVAMAEEENAAAVEREVEAERERAAAEAEEDAAEKYEQS 102
A+ AAA A A E A + EA +E AAEAE DA E + ++
Sbjct: 69 AQEAAAYAARAVAEAEALMAEAEEATKEAEAAEAEADAMEAFVEA 203
>TC88509 similar to PIR|T01826|T01826 microfibril-associated protein homolog
T15F16.8 - Arabidopsis thaliana, partial (83%)
Length = 1306
Score = 33.1 bits (74), Expect = 0.21
Identities = 30/91 (32%), Positives = 41/91 (44%), Gaps = 2/91 (2%)
Frame = +1
Query: 10 DRLDESLKRSHEAARRREEVRQGEEAAAAYERHVAEYEQYMAAYERYEAKAAA--AAVAM 67
D + E +R E R+R++ EEA E E E+ YE + VAM
Sbjct: 427 DAMAERRRRIKEKLRQRDQ----EEALPQEEEEEEEEEEEEEEESDYETDSDEEYTGVAM 594
Query: 68 AEEENAAAVEREVEAERERAAAEAEEDAAEK 98
+ ER+ AERER EAEE+A E+
Sbjct: 595 VKPVFVPKSERDTIAERERL--EAEEEALEE 681
>TC87229 homologue to GP|16226927|gb|AAL16300.1 At1g63720/F24D7_9 {Arabidopsis
thaliana}, partial (27%)
Length = 1686
Score = 32.7 bits (73), Expect = 0.28
Identities = 27/95 (28%), Positives = 45/95 (46%), Gaps = 10/95 (10%)
Frame = +1
Query: 33 EEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVAMAEEEN----------AAAVEREVEA 82
++A+++ E Q+ +++ AAAAA +EEN E ++
Sbjct: 1204 QKASSSVENKPPASSQWTKVLSKFKNDAAAAAKTTDKEENHSIENECDDKQVVTETLIDT 1383
Query: 83 ERERAAAEAEEDAAEKYEQSATVTVFFGSSKAFSF 117
++R AAEA D EK QS T++ S+K F+F
Sbjct: 1384 TKQRKAAEATVD--EKDHQSLTLS--SSSTKEFNF 1476
>AW685995 similar to PIR|I51618|I516 nucleolar phosphoprotein - African
clawed frog, partial (5%)
Length = 516
Score = 32.3 bits (72), Expect = 0.36
Identities = 20/98 (20%), Positives = 47/98 (47%)
Frame = +3
Query: 7 KALDRLDESLKRSHEAARRREEVRQGEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVA 66
K + +E K+ + + + EE +++ E + ++ A KAAA +
Sbjct: 99 KKAAKAEEEKKKQQKKKEEKPAPMEVEEDSSSSEESSSSEDEAPAKKAAPAKKAAAKKES 278
Query: 67 MAEEENAAAVEREVEAERERAAAEAEEDAAEKYEQSAT 104
+EEE++++ E E+ A A+++++ + E S++
Sbjct: 279 SSEEESSSSESESEEEEKPAKKAAAKKESSSEEESSSS 392
Score = 28.9 bits (63), Expect = 4.0
Identities = 25/96 (26%), Positives = 44/96 (45%), Gaps = 9/96 (9%)
Frame = +3
Query: 12 LDESLKRSHEAARRREEVRQGEEA----AAAYERHVAEYEQYMAAYERYEA-----KAAA 62
++E S E++ +E + A AAA + +E E + E E KAAA
Sbjct: 174 VEEDSSSSEESSSSEDEAPAKKAAPAKKAAAKKESSSEEESSSSESESEEEEKPAKKAAA 353
Query: 63 AAVAMAEEENAAAVEREVEAERERAAAEAEEDAAEK 98
+ +EEE++++ E E E+ A E++ A +
Sbjct: 354 KKESSSEEESSSSEEESDEEEKNEARRR*EQEEARR 461
>BQ137234 similar to EGAD|35719|3709 hypothetical protein {Burkholderia
cepacia}, partial (6%)
Length = 1184
Score = 32.0 bits (71), Expect = 0.47
Identities = 27/97 (27%), Positives = 43/97 (43%), Gaps = 9/97 (9%)
Frame = +3
Query: 17 KRSHEAARRREEVRQGEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVAMAEEE----- 71
+++ AARRR G+E A ++ + A ER + K+A A E E
Sbjct: 885 RKARGAARRRRARTHGQERAHEQQKTSERRARRKEATERDDRKSADKRQAKEEREERNRE 1064
Query: 72 ---NAAAVERE-VEAERERAAAEAEEDAAEKYEQSAT 104
N ++E VEA+ + + AEE+A + AT
Sbjct: 1065 RPANDTRTQKETVEAQTKTTSGGAEEEATAAATRGAT 1175
>BQ144502 weakly similar to GP|6523547|emb hydroxyproline-rich glycoprotein
DZ-HRGP {Volvox carteri f. nagariensis}, partial (39%)
Length = 1358
Score = 30.8 bits (68), Expect = 1.1
Identities = 32/117 (27%), Positives = 41/117 (34%), Gaps = 3/117 (2%)
Frame = -3
Query: 304 VRPPQKPPDVSICDGAPVMATRPPL---KLPDMFMCGRDPCFYAGESWQLVTANRPPAKP 360
+ PP PP S APV P + P R P + Q RPPA+P
Sbjct: 576 IPPPTSPPPAS--PPAPVPPQSPSTIRGRAPPPAPPPRRPPARPPPAPQRPRPRRPPARP 403
Query: 361 PDSTKDGRGRKRDEGVKGSILAVTVKGKRPPKSGCLLNPSPSPIRVGQAQVVQRRSW 417
P GR+R K ++ PK G + +PSP RR W
Sbjct: 402 PS------GRRR---------GARAKVRKLPKGGTPGSRAPSPAPPSPTHPAPRRPW 277
>BQ137254 similar to GP|22946423|gb| CG31813-PA {Drosophila melanogaster},
partial (12%)
Length = 1220
Score = 30.4 bits (67), Expect = 1.4
Identities = 26/98 (26%), Positives = 40/98 (40%), Gaps = 7/98 (7%)
Frame = +2
Query: 8 ALDRLDESLKRSHEAARRREEVRQGEEAAAAYERHVAEYEQYMAAYERYE----AKAAAA 63
A R + + R HE+ARR E+ R+G E A+ A ++ R + A
Sbjct: 827 AATRAESA*GREHESARRSEQQRRGHETRHAHRTRAATTQKARKRGGRATRAGLRRTRAT 1006
Query: 64 AVAMAEEEN---AAAVEREVEAERERAAAEAEEDAAEK 98
A + N A+A R ER AA+ + A +
Sbjct: 1007 PTARTQRRNQ*TASARGRTATGERAHAASTSAAHATPR 1120
>BQ135692 GP|18676552|d FLJ00173 protein {Homo sapiens}, partial (1%)
Length = 1156
Score = 30.4 bits (67), Expect = 1.4
Identities = 19/64 (29%), Positives = 25/64 (38%), Gaps = 14/64 (21%)
Frame = +2
Query: 318 GAPVMATRPPLKLPD--------------MFMCGRDPCFYAGESWQLVTANRPPAKPPDS 363
GA + PPL +PD MF R PC +G W+ T P +
Sbjct: 374 GAAAFVSVPPLVVPDGPGEGTAWSRPMRSMFGAFRSPCDPSGPRWRCGTHAMEKTTEPKN 553
Query: 364 TKDG 367
T+DG
Sbjct: 554 TRDG 565
>TC85190 similar to GP|8894548|emb|CAB95829.1 hypothetical protein {Cicer
arietinum}, partial (50%)
Length = 1496
Score = 29.3 bits (64), Expect = 3.1
Identities = 29/104 (27%), Positives = 43/104 (40%), Gaps = 1/104 (0%)
Frame = +3
Query: 27 EEVRQGEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVAMAEEENAAAVEREVEAERER 86
E + EE A V E EQ +A AA A AA ++E A+
Sbjct: 141 EPQKPAEEVATTTSETVVEKEQ--------QADGVVAAAVTAAAVTAATTDKEAVADPPP 296
Query: 87 AAA-EAEEDAAEKYEQSATVTVFFGSSKAFSFGFLNKSCVLPKI 129
A A EAE+ A ++ A TV S + S F ++ V+ ++
Sbjct: 297 AVADEAEKPAEVVADKVADETVVDESKVSQSVSFKEETNVVSEL 428
>TC85152 similar to GP|8894548|emb|CAB95829.1 hypothetical protein {Cicer
arietinum}, partial (18%)
Length = 775
Score = 29.3 bits (64), Expect = 3.1
Identities = 29/104 (27%), Positives = 43/104 (40%), Gaps = 1/104 (0%)
Frame = +1
Query: 27 EEVRQGEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAVAMAEEENAAAVEREVEAERER 86
E + EE A V E EQ +A AA A AA ++E A+
Sbjct: 145 EPQKPAEEVATTTSETVVEKEQ--------QADGVVAAAVTAAAVTAATTDKEAVADPPP 300
Query: 87 AAA-EAEEDAAEKYEQSATVTVFFGSSKAFSFGFLNKSCVLPKI 129
A A EAE+ A ++ A TV S + S F ++ V+ ++
Sbjct: 301 AVADEAEKPAEVVADKVADETVVDESKVSQSVSFKEETNVVSEL 432
>TC87527 similar to GP|21689681|gb|AAM67462.1 unknown protein {Arabidopsis
thaliana}, partial (96%)
Length = 1582
Score = 28.9 bits (63), Expect = 4.0
Identities = 22/87 (25%), Positives = 37/87 (42%), Gaps = 1/87 (1%)
Frame = +2
Query: 7 KALDRLDESLKRSHEAARRREEVRQ-GEEAAAAYERHVAEYEQYMAAYERYEAKAAAAAV 65
+A + +E + EAA RR EVR+ E E+ + + ++ K A +
Sbjct: 836 RAAKKREEEADKKAEAAARRAEVRRLAELEEKELEKMIKKPDKKANRVSIPVPKVTEAEL 1015
Query: 66 AMAEEENAAAVEREVEAERERAAAEAE 92
EE A ++ E ++R A E E
Sbjct: 1016 RKRREEEQAMAMKKAEEAKKRTAGEDE 1096
>AL369112 weakly similar to GP|13540393|gb| histone H1 {Pisum sativum},
partial (20%)
Length = 371
Score = 28.9 bits (63), Expect = 4.0
Identities = 24/67 (35%), Positives = 33/67 (48%)
Frame = +2
Query: 37 AAYERHVAEYEQYMAAYERYEAKAAAAAVAMAEEENAAAVEREVEAERERAAAEAEEDAA 96
AA + A Q AA ++ AAAA A A ++A A + E E+AA + E AA
Sbjct: 47 AAKKGETASKSQKAAAKKK---AAAAAKKADAGAKDAKADTKAAEKPAEKAAEKPAEKAA 217
Query: 97 EKYEQSA 103
EK + A
Sbjct: 218 EKPAEKA 238
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.318 0.137 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,410,667
Number of Sequences: 36976
Number of extensions: 253501
Number of successful extensions: 2001
Number of sequences better than 10.0: 55
Number of HSP's better than 10.0 without gapping: 1445
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1784
length of query: 430
length of database: 9,014,727
effective HSP length: 99
effective length of query: 331
effective length of database: 5,354,103
effective search space: 1772208093
effective search space used: 1772208093
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.7 bits)
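The bit scores in the alignments above appear to follow from the raw scores (the numbers in parentheses) and the gapped Lambda and K values in this footer, using the standard Karlin-Altschul conversion S' = (Lambda * S - ln K) / ln 2. The following is a minimal sketch of that conversion, not part of the BLAST output; the function names are illustrative. The Expect estimate uses the "effective search space" figure and is only approximate: it will not necessarily reproduce the reported values digit for digit (for example, the report gives 8e-04 for the first hit), because the footer parameters are rounded and the program may apply further per-sequence adjustments.

```python
import math

# Gapped Karlin-Altschul parameters copied from the report footer.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
# "effective search space" line from the report footer.
SEARCH_SPACE = 1772208093


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_expect(raw_score, search_space=SEARCH_SPACE):
    """Rough E-value estimate: search space times 2^(-bit score).

    Assumption: a single global search space is used; the real program
    may adjust lengths per sequence, so treat this as an estimate only.
    """
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from the first three alignments in the report.
    for raw in (95, 82, 78):
        print(raw, round(bit_score(raw), 1), f"{approx_expect(raw):.2g}")
```

With the footer values, the raw scores 95, 82, and 78 convert to 41.2, 36.2, and 34.7 bits, matching the Score lines reported for BQ750853, TC81880, and TC88557.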