
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0199.3
(462 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC91682 45 8e-12
TC80516 47 1e-05
BG583811 42 5e-04
CA919517 42 7e-04
AW774547 similar to GP|17221128|gb| glycoprotein gp2 {Equine her... 41 9e-04
TC79449 weakly similar to GP|14423394|gb|AAK62379.1 Unknown prot... 39 0.006
TC78344 similar to GP|13540405|gb|AAK29456.1 histone H1 {Lens cu... 36 0.027
TC90963 similar to PIR|A84431|A84431 probable C2H2-type zinc fin... 36 0.036
TC80649 similar to PIR|T48006|T48006 hypothetical protein T17J13... 32 0.52
TC81054 similar to GP|12321513|gb|AAG50816.1 unknown protein {Ar... 31 0.88
BQ151174 similar to GP|11036868|gb| PxORF73 peptide {Plutella xy... 31 0.88
TC84589 weakly similar to GP|8885570|dbj|BAA97500.1 gene_id:F15L... 30 1.5
BQ141665 similar to GP|18568267|gb putative polyprotein {Zea may... 30 1.5
AL372026 30 2.6
BQ141693 weakly similar to GP|160968|gb|AA eggshell protein {Sch... 30 2.6
TC76621 homologue to SP|O22582|H2B_GOSHI Histone H2B. [Upland co... 30 2.6
TC89338 29 3.4
TC90281 similar to SP|P54225|HEMZ_SYNY3 Ferrochelatase (EC 4.99.... 29 3.4
BQ150260 29 3.4
TC85835 similar to PIR|T12180|T12180 probable transcription fact... 29 4.4
>TC91682
Length = 1093
Score = 45.1 bits (105), Expect(2) = 8e-12
Identities = 18/45 (40%), Positives = 30/45 (66%)
Frame = +1
Query: 414 KCPKQKNLTDCGYYVLKYMRDIIIAGDSQTLQEVVILVIKYIVFI 458
+CP Q N DCGY+V+++M++II+A L+ V + V+ Y F+
Sbjct: 751 QCPMQTNGIDCGYFVMRFMKEIILANQDMILENVCM*VLIYC*FV 885
Score = 42.7 bits (99), Expect(2) = 8e-12
Identities = 37/115 (32%), Positives = 57/115 (49%), Gaps = 2/115 (1%)
Frame = +3
Query: 302 LVQNQLTERFFFLSPHNLSAYESRVSQYIADALKQNEQPNRVTFAPYNMG-DHWVLLVIY 360
LV +++ F+ PH Y R+++ + L ++ N++ FA N+G +HWVLLVI
Sbjct: 339 LVGWKISSHFY---PH---IYLKRINK--GNILLTHKFKNKLIFALVNLGLNHWVLLVIN 494
Query: 361 TSEFAIEYFDSF-DGEPTDDVHMKNIFDAGLMIYHANSNDVPKKKVKYIKWRKMK 414
I Y DS +G P DV K +A M + N K K I W+++K
Sbjct: 495 PGAEMIYYMDSLPEGHPNIDVVRKKFMNA--MCICRSLNPKLKSKSSIIPWKEIK 653
Score = 28.1 bits (61), Expect = 7.5
Identities = 12/29 (41%), Positives = 16/29 (54%)
Frame = +2
Query: 289 VICVYMRFLYSEVLVQNQLTERFFFLSPH 317
V+C RFL ++ L +F FLSPH
Sbjct: 293 VVCSVSRFLSENLIKPRGLENKFSFLSPH 379
>TC80516
Length = 548
Score = 47.4 bits (111), Expect = 1e-05
Identities = 22/52 (42%), Positives = 33/52 (63%)
Frame = +3
Query: 386 FDAGLMIYHANSNDVPKKKVKYIKWRKMKCPKQKNLTDCGYYVLKYMRDIII 437
FD+ + I + N K K I W+K+KCP Q N DCGY+V+++M++I I
Sbjct: 33 FDSAMCICRS-LNPKLKSKSSIIPWKKIKCPIQTNGIDCGYFVMRFMKEINI 185
>BG583811
Length = 548
Score = 42.0 bits (97), Expect = 5e-04
Identities = 15/34 (44%), Positives = 25/34 (73%)
Frame = +1
Query: 412 KMKCPKQKNLTDCGYYVLKYMRDIIIAGDSQTLQ 445
++KCP Q N DCGY+V+++M++II+A L+
Sbjct: 1 EIKCPMQTNGIDCGYFVMRFMKEIILANQDMILE 102
>CA919517
Length = 569
Score = 41.6 bits (96), Expect = 7e-04
Identities = 21/42 (50%), Positives = 26/42 (61%)
Frame = -2
Query: 406 KYIKWRKMKCPKQKNLTDCGYYVLKYMRDIIIAGDSQTLQEV 447
K KW + K KQ N DCGYYV+K M DII A +++ EV
Sbjct: 424 KKAKWFRPKPRKQPNGNDCGYYVMKNMLDIISANITKSWMEV 299
>AW774547 similar to GP|17221128|gb| glycoprotein gp2 {Equine herpesvirus 4},
partial (4%)
Length = 582
Score = 41.2 bits (95), Expect = 9e-04
Identities = 17/37 (45%), Positives = 24/37 (63%)
Frame = +3
Query: 402 KKKVKYIKWRKMKCPKQKNLTDCGYYVLKYMRDIIIA 438
K K I W K KCP Q N GY+V+++M++II+A
Sbjct: 288 KSKSSIIPWMKTKCPMQTNGIGFGYFVMQFMKEIILA 398
>TC79449 weakly similar to GP|14423394|gb|AAK62379.1 Unknown protein
{Arabidopsis thaliana}, partial (44%)
Length = 1486
Score = 38.5 bits (88), Expect = 0.006
Identities = 24/90 (26%), Positives = 36/90 (39%), Gaps = 3/90 (3%)
Frame = +2
Query: 345 FAPYNMGDHWVLLVIYTSEFAIEYFDSFDGEPTDDVHMKNIFDAGLMIYHANSNDVPKKK 404
F P + HW L VI + +Y DS G + + + Y D K
Sbjct: 1190 FVPIHKEIHWCLAVINKRDAKFQYLDSLKGMD------RRVLEVLARYYVDEVKDKTGKD 1351
Query: 405 VKYIKWRK---MKCPKQKNLTDCGYYVLKY 431
+ W K P+Q+N DCG +++KY
Sbjct: 1352 IDVSTWEKEYVEDLPEQENGFDCGVFMIKY 1441
>TC78344 similar to GP|13540405|gb|AAK29456.1 histone H1 {Lens culinaris},
partial (86%)
Length = 1215
Score = 36.2 bits (82), Expect = 0.027
Identities = 23/67 (34%), Positives = 31/67 (45%)
Frame = +2
Query: 147 KATSRAKGITAVQAKKPAGQVKKPAERANKPVAQGEKVHEQAKKQVDKGKKDVGQGKRAP 206
K + + +A AKKPA KP +A PVA+ +AK K K + K AP
Sbjct: 446 KGSYKLPAKSAAPAKKPAAAKPKPKPKAKAPVAKAPAAKSKAKAAPAKAKAK-AKAKAAP 622
Query: 207 GQAKKQA 213
+AK A
Sbjct: 623 AKAKPAA 643
>TC90963 similar to PIR|A84431|A84431 probable C2H2-type zinc finger protein
[imported] - Arabidopsis thaliana, partial (21%)
Length = 1769
Score = 35.8 bits (81), Expect = 0.036
Identities = 24/85 (28%), Positives = 40/85 (46%), Gaps = 4/85 (4%)
Frame = +2
Query: 135 STMTPASSDNNHK----ATSRAKGITAVQAKKPAGQVKKPAERANKPVAQGEKVHEQAKK 190
S +P S N+++ T+ G+ VQ Q K+ A A E+ +QAKK
Sbjct: 1058 SNSSPKESSNSNEKVQSTTTNNMGLLRVQE-----QAKEQLRVAMAEKAYAEEARKQAKK 1222
Query: 191 QVDKGKKDVGQGKRAPGQAKKQADK 215
Q++ +++ KR QA+ + DK
Sbjct: 1223 QIEMAEQEFNNAKRIRQQAQSELDK 1297
>TC80649 similar to PIR|T48006|T48006 hypothetical protein T17J13.100 -
Arabidopsis thaliana, partial (51%)
Length = 1058
Score = 32.0 bits (71), Expect = 0.52
Identities = 23/58 (39%), Positives = 33/58 (56%), Gaps = 1/58 (1%)
Frame = +1
Query: 142 SDNNHKATSRAKGITAVQAKKPAGQVKKPAERANKPVAQGEKVHEQAKK-QVDKGKKD 198
S+ H+ +A + AVQ +KPA +KPA R P+ KV QAKK +VD+G +
Sbjct: 544 SNTVHELKEKAS-VLAVQEEKPAAGKRKPASR---PLNMIIKVKPQAKKAKVDEGNTE 705
>TC81054 similar to GP|12321513|gb|AAG50816.1 unknown protein {Arabidopsis
thaliana}, partial (32%)
Length = 1050
Score = 31.2 bits (69), Expect = 0.88
Identities = 18/57 (31%), Positives = 26/57 (45%)
Frame = -2
Query: 226 SKDKLQNCKFVWAYARSVLKSEDLLKIPMPENILNTNTEEEFVEQIGEEQVNEIYYH 282
SK LQ Y LKS K P+N+++ T+ + QI E VN ++H
Sbjct: 1004 SKISLQKKPVYVGYNIQELKSSSNSKSNSPKNLVSPETQNTYDSQISGEDVNRNFHH 834
>BQ151174 similar to GP|11036868|gb| PxORF73 peptide {Plutella xylostella
granulovirus}, partial (42%)
Length = 909
Score = 31.2 bits (69), Expect = 0.88
Identities = 33/109 (30%), Positives = 43/109 (39%), Gaps = 1/109 (0%)
Frame = +1
Query: 150 SRAKGITAVQAKKPAGQVKKPAERANKPVAQGEKVHEQAKKQVDKGKKDVGQGKRAPG-Q 208
++ KG T K P G+ P R A+G KKQ + GK RAPG +
Sbjct: 556 TKKKGET--HKKSPGGRTPPPTPRPGGGGARGAPAPPPQKKQREGGKSG---PPRAPGHE 720
Query: 209 AKKQADKGTEIQPLGASSKDKLQNCKFVWAYARSVLKSEDLLKIPMPEN 257
KK A + P GA K+ A+AR K + K P N
Sbjct: 721 EKKPAATSRKRPPRGAREKNPA-------AHAREREKKKTPKKNPQNHN 846
>TC84589 weakly similar to GP|8885570|dbj|BAA97500.1
gene_id:F15L12.8~unknown protein {Arabidopsis thaliana},
partial (63%)
Length = 666
Score = 30.4 bits (67), Expect = 1.5
Identities = 24/88 (27%), Positives = 33/88 (37%), Gaps = 3/88 (3%)
Frame = +3
Query: 351 GDHWVLLVIYTSEFAIEYFDS---FDGEPTDDVHMKNIFDAGLMIYHANSNDVPKKKVKY 407
G HW LL Y + + DS + P ++ + GL K Y
Sbjct: 387 GSHWSLLAYYRNANVFVHHDSCRSMNATPAKKLYKAVVGYMGL--------SESGSKAGY 542
Query: 408 IKWRKMKCPKQKNLTDCGYYVLKYMRDI 435
++W P+Q N DCG YV R I
Sbjct: 543 LEWTDS--PRQANGYDCGLYVTAIARVI 620
>BQ141665 similar to GP|18568267|gb putative polyprotein {Zea mays}, partial
(1%)
Length = 1195
Score = 30.4 bits (67), Expect = 1.5
Identities = 23/74 (31%), Positives = 34/74 (45%)
Frame = +3
Query: 151 RAKGITAVQAKKPAGQVKKPAERANKPVAQGEKVHEQAKKQVDKGKKDVGQGKRAPGQAK 210
RAKG A + KPAE+ P+A E + V +G D G+GK+ + +
Sbjct: 306 RAKGEERTGAGPRTKRATKPAEKDPPPMALAEN-----NRAVTRG--DKGRGKQPERRGR 464
Query: 211 KQADKGTEIQPLGA 224
Q +G +P GA
Sbjct: 465 SQKARGEPGRPPGA 506
>AL372026
Length = 462
Score = 29.6 bits (65), Expect = 2.6
Identities = 21/60 (35%), Positives = 28/60 (46%), Gaps = 3/60 (5%)
Frame = +2
Query: 383 KNIF---DAGLMIYHANSNDVPKKKVKYIKWRKMKCPKQKNLTDCGYYVLKYMRDIIIAG 439
KNI D+ L YH V KKK W C +Q +CGYY++ +M I+ G
Sbjct: 5 KNIIQTVDSALDEYH-KLQGVQKKKPT---WIVPVCQRQPESYECGYYIMIHMLKIVSDG 172
>BQ141693 weakly similar to GP|160968|gb|AA eggshell protein {Schistosoma
japonicum}, partial (12%)
Length = 1226
Score = 29.6 bits (65), Expect = 2.6
Identities = 14/36 (38%), Positives = 19/36 (51%)
Frame = -2
Query: 161 KKPAGQVKKPAERANKPVAQGEKVHEQAKKQVDKGK 196
K P G+ KP R KP +GEK + K+ +GK
Sbjct: 124 KGPKGKNTKPTPRITKPEREGEKGERKEKRGGKRGK 17
>TC76621 homologue to SP|O22582|H2B_GOSHI Histone H2B. [Upland cotton]
{Gossypium hirsutum}, partial (89%)
Length = 589
Score = 29.6 bits (65), Expect = 2.6
Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 3/71 (4%)
Frame = +1
Query: 151 RAKGITAVQAKKPAGQVKKPAERANKPVAQGEKVH---EQAKKQVDKGKKDVGQGKRAPG 207
R K IT+ Q P K E +GE++H E + ++ KG+K+ +GKR
Sbjct: 13 RGKTITSFQIPNPFNGTKGREE-----TCRGEEIHRRRESSGREEAKGRKEASKGKRFSL 177
Query: 208 QAKKQADKGTE 218
+++ +K E
Sbjct: 178 WREEEEEKQEE 210
>TC89338
Length = 1088
Score = 29.3 bits (64), Expect = 3.4
Identities = 12/26 (46%), Positives = 18/26 (69%)
Frame = +1
Query: 422 TDCGYYVLKYMRDIIIAGDSQTLQEV 447
+DCGYYV+K M DI+ A + + +V
Sbjct: 718 SDCGYYVMKNMLDIVSANITTSWMKV 795
>TC90281 similar to SP|P54225|HEMZ_SYNY3 Ferrochelatase (EC 4.99.1.1)
(Protoheme ferro-lyase) (Heme synthetase). [strain PCC
6803], partial (4%)
Length = 1079
Score = 29.3 bits (64), Expect = 3.4
Identities = 20/72 (27%), Positives = 30/72 (40%)
Frame = +2
Query: 130 PKRLVSTMTPASSDNNHKATSRAKGITAVQAKKPAGQVKKPAERANKPVAQGEKVHEQAK 189
PK LV+ PAS + K + + Q K+ KP+AQG+ + A
Sbjct: 101 PKSLVAPRKPASVARKFVVRAEEKSVVDQAEEAFKQQAKQVENTVQKPLAQGQPATD-AN 277
Query: 190 KQVDKGKKDVGQ 201
DK K+ G+
Sbjct: 278 NSWDKEIKEAGK 313
>BQ150260
Length = 519
Score = 29.3 bits (64), Expect = 3.4
Identities = 12/26 (46%), Positives = 15/26 (57%)
Frame = +3
Query: 21 QAQKKADVQPGDSPCYSGEGSCPPSP 46
+A+KK P SP +S E CPP P
Sbjct: 135 KAKKKGPYPPNRSPFFSKEDRCPPPP 212
>TC85835 similar to PIR|T12180|T12180 probable transcription factor - fava
bean, partial (79%)
Length = 1737
Score = 28.9 bits (63), Expect = 4.4
Identities = 20/82 (24%), Positives = 36/82 (43%)
Frame = +3
Query: 121 DAIGTFVAWPKRLVSTMTPASSDNNHKATSRAKGITAVQAKKPAGQVKKPAERANKPVAQ 180
D +G P +L+ A+ A + A + K G KKPA+ +KP+
Sbjct: 231 DLLGDDAEDPSQLI-----AAEQQKAAAAAAAAPKKGLDQGKQTGAAKKPAQLPSKPLPP 395
Query: 181 GEKVHEQAKKQVDKGKKDVGQG 202
+ V E ++ + +G + G+G
Sbjct: 396 SQAVRE-SRNEPSRGGRGGGRG 458
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.316 0.133 0.388
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,411,062
Number of Sequences: 36976
Number of extensions: 183746
Number of successful extensions: 1082
Number of sequences better than 10.0: 50
Number of HSP's better than 10.0 without gapping: 1073
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1079
length of query: 462
length of database: 9,014,727
effective HSP length: 99
effective length of query: 363
effective length of database: 5,354,103
effective search space: 1943539389
effective search space used: 1943539389
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.7 bits)
Lotus: description of TM0199.3