
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0101.14
(1733 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alterna... 155 1e-37
BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin... 99 2e-20
BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - ... 65 2e-10
BQ151174 similar to GP|11036868|gb| PxORF73 peptide {Plutella xy... 39 0.018
TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 35 0.25
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 35 0.33
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p... 34 0.43
AW696163 similar to GP|13359054|dbj contains ESTs D15403(C0585) ... 34 0.56
TC86939 similar to GP|21703147|gb|AAM74513.1 At1g07230/F10K1_4 {... 34 0.56
TC91298 similar to GP|3928086|gb|AAC79612.1| unknown protein {Ar... 34 0.56
BQ144306 weakly similar to GP|23093252|gb| CG13731-PA {Drosophil... 33 0.96
TC82733 similar to GP|10177404|dbj|BAB10535. gene_id:K24M7.12~pi... 33 0.96
AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing fact... 33 1.3
TC84709 similar to GP|16326133|dbj|BAB70510. Myb {Nicotiana taba... 33 1.3
TC91327 similar to GP|22597168|gb|AAN03471.1 unknown protein {Gl... 32 1.6
AL378279 homologue to GP|17133068|dbj cobyrinic acid a c-diamide... 32 2.1
TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-inf... 32 2.1
TC79800 similar to SP|O43280|TREA_HUMAN Trehalase precursor (EC ... 32 2.1
TC77663 similar to GP|12744987|gb|AAK06873.1 unknown protein {Ar... 32 2.1
>TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alternaria
alternata}, partial (21%)
Length = 1540
Score = 155 bits (392), Expect = 1e-37
Identities = 115/403 (28%), Positives = 190/403 (46%), Gaps = 17/403 (4%)
Frame = +1
Query: 1274 ENIQSSICSDLPNAFWERKS-----------HMVELPYEKDFSDKQIPTKARPIQMNEEL 1322
+++ + + + P+ F K+ H + L +KD +D +P +EL
Sbjct: 325 KHVPAGVLEEFPDLFNPEKAYQVPASRGLLDHAIPLIPDKDGNDPPLPWGPLYGMSRQEL 504
Query: 1323 LQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQALCWIRY 1382
L +K + DLL K I+ S S +V K G R ++Y+ LN RY
Sbjct: 505 LVL-KKTLEDLLDKGFIKASGSAAGAPVLFVRKPG----GGIRFCVDYRALNAITKKDRY 669
Query: 1383 PIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVMPFGLKN 1442
P+P + L R+ A+ F+K D+ + F +++++++D+ KTAF +G +EW V PFGL
Sbjct: 670 PLPLISETLRRVAGARWFTKLDVVAAFHKMRIKDEDQEKTAFRTRYGLFEWIVCPFGLTG 849
Query: 1443 APSEFQRIMNEIFNPY-SNFTIVYIDDVLIFSQ-SIDQHFKHLNTFISVIKKNGLAVSKT 1500
AP+ FQR +N+ + + +F YIDDVLI++ S H + + + GL++
Sbjct: 850 APATFQRYINKTLHEFLDDFVTAYIDDVLIYTTGSKKDHEAQVRRVLRRLADAGLSLDPK 1029
Query: 1501 KVSLFQTKIRFLGHNIHQGTIIPIN--RAIEFTDKFPDQIIDKTQLQRFLGCLNYVADFC 1558
K T ++++G + G + + + D P + + FLG NY DF
Sbjct: 1030 KCEFSVTTVKYVGFILTAGKGVSCDPLKLAAIRDWLPPGSVKGA--RSFLGFCNYYKDFI 1203
Query: 1559 PQLSTIIKLLHDRLKKD-PPPWSDVHTNVVKQIKLRIKNLPCLYLPNPQAFKIVETDASD 1617
P S I + L +KD P W ++K P L + +P+A VETD S
Sbjct: 1204 PGYSEITEPLTRLTRKDFPFRWGAEQEAAFTKLKRLFAEEPVLRMFDPEAVTTVETDCSG 1383
Query: 1618 IGFGGILKQKIFDN-EQIIAFTSKHWNPAQQNYSTVKKEVLAI 1659
GG+L Q+ +AF S+ +PA+ NY KE+LA+
Sbjct: 1384 FALGGVLTQEDGTGAAHPVAFHSQRLSPAEYNYPIHDKELLAV 1512
>BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin cluster;
Ty3-Gypsy type {Oryza sativa}, partial (15%)
Length = 716
Score = 98.6 bits (244), Expect = 2e-20
Identities = 60/180 (33%), Positives = 97/180 (53%), Gaps = 1/180 (0%)
Frame = +2
Query: 1317 QMNEELLQFCQKEINDLLQKKLIRRSKSPWSCATFYVNKQAEIERGTPRLVINYKPLNQA 1376
++N L+ + ++ DLL+K I+ S P ++ K+ G R+ I+Y LN
Sbjct: 92 RINPLKLKVLKLQLKDLLEKGFIQPSIYP*GVVVLFLKKKD----GFLRMSIDYPQLNNV 259
Query: 1377 LCWIRYPIPNKKDLLARLHDAKIFSKFDMKSGFWQIQLQEKDRYKTAFTVPFGQYEWNVM 1436
I+YP+P +L L +K F K D++ G Q ++ +D KTAF + +G YE VM
Sbjct: 260 NIKIKYPLPLIDELFDNLQGSKWFFKIDLRLG*HQHRVIGEDVPKTAFRIRYGHYEILVM 439
Query: 1437 PFGLKNAPSEFQRIMNEIFNPY-SNFTIVYIDDVLIFSQSIDQHFKHLNTFISVIKKNGL 1495
FG N P F +MN +F Y + IV+ +D+LI+S++ ++H HL + V+K GL
Sbjct: 440 SFG*TNPPMAFMELMNRVFQDYLDSLVIVFSNDILIYSKNENEHENHLRLALKVLKDIGL 619
>BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - Arabidopsis
thaliana, partial (13%)
Length = 763
Score = 65.1 bits (157), Expect = 2e-10
Identities = 38/102 (37%), Positives = 54/102 (52%), Gaps = 1/102 (0%)
Frame = +2
Query: 1418 DRYKTAFTVPFGQYEWNVMPFGLKNAPSEFQRIMNEIF-NPYSNFTIVYIDDVLIFSQSI 1476
D KTAF G Y + VMPFGLKNA S +QR++N +F + N VYIDD+L+ S
Sbjct: 14 DLEKTAFITDRGTYCYKVMPFGLKNAGSTYQRLVNRMFADKLGNTMEVYIDDMLVKSLRA 193
Query: 1477 DQHFKHLNTFISVIKKNGLAVSKTKVSLFQTKIRFLGHNIHQ 1518
H HL + + + ++ K + T FLG+ + Q
Sbjct: 194 TDHLNHLKE*FKTLDEYIMKLNPAKCTFGVTSGEFLGYIVTQ 319
>BQ151174 similar to GP|11036868|gb| PxORF73 peptide {Plutella xylostella
granulovirus}, partial (42%)
Length = 909
Score = 38.9 bits (89), Expect = 0.018
Identities = 22/59 (37%), Positives = 27/59 (45%), Gaps = 8/59 (13%)
Frame = +2
Query: 843 PKKPK--PRKHDPPPKQQWRRNSSRNHDHRKPKPRSKPH------STQAAKNPPENRPS 893
PKK K P PPP+ + R NH H KP+ + PH T NPP N P+
Sbjct: 110 PKKRKTPPEASHPPPRPRPPRPPQPNHQHDKPQKTTHPHHPHTPPPTPHPPNPPPNHPN 286
Score = 30.4 bits (67), Expect = 6.2
Identities = 14/54 (25%), Positives = 26/54 (47%), Gaps = 4/54 (7%)
Frame = +2
Query: 843 PKKPKPRKHDPPPKQQWRR----NSSRNHDHRKPKPRSKPHSTQAAKNPPENRP 892
P+ P P+ PPK++ + N + ++K P+ + +A+ PP RP
Sbjct: 5 PRPPNPQPPSDPPKKRTPKPETPNRKKPQKNKKEPPKKRKTPPEASHPPPRPRP 166
>TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (16%)
Length = 421
Score = 35.0 bits (79), Expect = 0.25
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 3/73 (4%)
Frame = +3
Query: 844 KKPKPRKHDPPPKQQWR---RNSSRNHDHRKPKPRSKPHSTQAAKNPPENRPSQGKNVTC 900
KK K K + + R R+ SR+ R K RS HS + A ++ ++ C
Sbjct: 171 KKEKNLKMSSDSRSRSRSRSRSRSRSRSPRIRKIRSDRHSYRDAPYRRDSSRGFSRDNLC 350
Query: 901 YNCGKPGHISRYC 913
NC +PGH +R C
Sbjct: 351 KNCKRPGHYAREC 389
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 34.7 bits (78), Expect = 0.33
Identities = 11/20 (55%), Positives = 15/20 (75%)
Frame = +3
Query: 895 GKNVTCYNCGKPGHISRYCR 914
G ++ CY CG+PGH +R CR
Sbjct: 321 GSDLKCYECGEPGHFARECR 380
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 34.3 bits (77), Expect = 0.43
Identities = 11/20 (55%), Positives = 15/20 (75%)
Frame = +1
Query: 895 GKNVTCYNCGKPGHISRYCR 914
G ++ CY CG+PGH +R CR
Sbjct: 304 GDDLKCYECGEPGHFARECR 363
>AW696163 similar to GP|13359054|dbj contains ESTs D15403(C0585)
C98080(C0585)~unknown protein {Oryza sativa (japonica
cultivar-group)}, partial (9%)
Length = 571
Score = 33.9 bits (76), Expect = 0.56
Identities = 18/54 (33%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Frame = +3
Query: 929 EDKINNLLIQTSDEEESASSDSEVSEDLNQIQNDDDPQS---SSSINVLTNEQD 979
ED NN + Q SD+ + SD EVS + +++D P + ++S +L++E D
Sbjct: 126 EDYDNNQVAQVSDDNDDNDSDGEVSSASSGYKSEDSPANDIDANSAGLLSSEDD 287
>TC86939 similar to GP|21703147|gb|AAM74513.1 At1g07230/F10K1_4 {Arabidopsis
thaliana}, partial (91%)
Length = 2037
Score = 33.9 bits (76), Expect = 0.56
Identities = 26/80 (32%), Positives = 35/80 (43%), Gaps = 4/80 (5%)
Frame = +3
Query: 843 PKKPKPRKH---DPPPKQQWRRNSSRNH-DHRKPKPRSKPHSTQAAKNPPENRPSQGKNV 898
P P P +H +PPP++ ++NH +R K +P S A NP NR V
Sbjct: 45 PLLPPPHRHRR*NPPPQKTQNHRPNQNHSSNRNGKSLLRPCSRLAETNPTRNRRINRH*V 224
Query: 899 TCYNCGKPGHISRYCRLKRR 918
C KP + LKRR
Sbjct: 225 KSDPCLKP-FFPKNPSLKRR 281
>TC91298 similar to GP|3928086|gb|AAC79612.1| unknown protein {Arabidopsis
thaliana}, partial (66%)
Length = 777
Score = 33.9 bits (76), Expect = 0.56
Identities = 15/44 (34%), Positives = 19/44 (43%)
Frame = +1
Query: 844 KKPKPRKHDPPPKQQWRRNSSRNHDHRKPKPRSKPHSTQAAKNP 887
+KPKP P P QW +H +P KPHS +P
Sbjct: 109 EKPKPSSIQPNPNHQWPPQHRHHHRQNLHQPCQKPHSHATQFSP 240
>BQ144306 weakly similar to GP|23093252|gb| CG13731-PA {Drosophila
melanogaster}, partial (4%)
Length = 1261
Score = 33.1 bits (74), Expect = 0.96
Identities = 17/46 (36%), Positives = 18/46 (38%), Gaps = 1/46 (2%)
Frame = -1
Query: 843 PKKPKPRKHDPPPKQQWRRNSSRNHDHRKPKP-RSKPHSTQAAKNP 887
P P H PPP S R H R P P R +PH Q P
Sbjct: 523 PFSSLPSHHPPPPPHTLHLPSPRQHPPRPPPPTRQRPHPPQPTPTP 386
>TC82733 similar to GP|10177404|dbj|BAB10535.
gene_id:K24M7.12~pir||S42136~similar to unknown protein
{Arabidopsis thaliana}, partial (57%)
Length = 710
Score = 33.1 bits (74), Expect = 0.96
Identities = 17/67 (25%), Positives = 27/67 (39%)
Frame = +3
Query: 850 KHDPPPKQQWRRNSSRNHDHRKPKPRSKPHSTQAAKNPPENRPSQGKNVTCYNCGKPGHI 909
K DP P + +KP SKP + + P +P +C+ C HI
Sbjct: 144 KVDPTPPNDPSKKKKNKFKRKKPDSNSKPRTGKRPLRVPGMKPGD----SCFICKGLDHI 311
Query: 910 SRYCRLK 916
+++C K
Sbjct: 312 AKFCTQK 332
>AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing factor
[imported] - Arabidopsis thaliana, partial (62%)
Length = 508
Score = 32.7 bits (73), Expect = 1.3
Identities = 10/19 (52%), Positives = 14/19 (73%)
Frame = +2
Query: 895 GKNVTCYNCGKPGHISRYC 913
G ++ CY CG+PGH +R C
Sbjct: 329 GSDLKCYXCGEPGHFARXC 385
>TC84709 similar to GP|16326133|dbj|BAB70510. Myb {Nicotiana tabacum},
partial (8%)
Length = 392
Score = 32.7 bits (73), Expect = 1.3
Identities = 16/38 (42%), Positives = 24/38 (63%)
Frame = +2
Query: 698 DAVNSLIFTIAQHFVGDPSLIKDRSGDLLSNLKCKSLG 735
D+ +L+ + A+ F G PS++K R DLLS L K +G
Sbjct: 134 DSPEALLKSAAKTFAGTPSILKKRCRDLLSPLSDKRIG 247
>TC91327 similar to GP|22597168|gb|AAN03471.1 unknown protein {Glycine max},
partial (70%)
Length = 738
Score = 32.3 bits (72), Expect = 1.6
Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 3/54 (5%)
Frame = +1
Query: 843 PKKP-KPRKHDPPPKQQWRRNSSRNHDHRKPKPRSKPHSTQAAKNP--PENRPS 893
PK+P KP++ + P + + +N + + KPK KP + K P P+ +P+
Sbjct: 316 PKEPVKPKEPEKPKEPEKPKNPEKPKEPEKPKEPEKPKEPEKPKEPEKPKEKPA 477
>AL378279 homologue to GP|17133068|dbj cobyrinic acid a c-diamide synthase
{Nostoc sp. PCC 7120}, partial (2%)
Length = 321
Score = 32.0 bits (71), Expect = 2.1
Identities = 20/66 (30%), Positives = 27/66 (40%), Gaps = 15/66 (22%)
Frame = +1
Query: 843 PKKPKPRKHD------------PPPKQQWRRNSSRN---HDHRKPKPRSKPHSTQAAKNP 887
PK P R H P P+QQWR + S N KP + P+ A +P
Sbjct: 118 PKTPPFRSHQNFPLPNFTSESPPSPRQQWR*HLSENSPPSPPAKPPSPTSPNPQSAVPHP 297
Query: 888 PENRPS 893
P+ P+
Sbjct: 298 PKKSPT 315
>TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-infected
erythrocyte surface antigen {Plasmodium falciparum},
partial (2%)
Length = 2007
Score = 32.0 bits (71), Expect = 2.1
Identities = 39/175 (22%), Positives = 65/175 (36%), Gaps = 8/175 (4%)
Frame = +1
Query: 925 EPEIEDKINNLL---IQTSDEEESASSDS-----EVSEDLNQIQNDDDPQSSSSINVLTN 976
E + E + NN +QT+D +D+ E SED N + D SS++N N
Sbjct: 844 EQKQEQEQNNTTKDDVQTTDTSSQNGNDTTEKQNETSEDANSKKED-----SSALNTTPN 1008
Query: 977 EQDLLFRAINSIPDPDEKKIYLERLKFTLEDKPPKNPITTNKFNLRDTFRRLEKSTIKPV 1036
+D D T TN +DT +
Sbjct: 1009 NEDSKSGVAGDQADST-----------TTTSSSETQDGNTNHGEYKDTTNENPEKNSGQE 1155
Query: 1037 TIQDLQSEVHTLQAEVKSLKQIQISQQLILDKLTEENSEESSSSSSTPNSASNNN 1091
Q+ S +T + + ++Q++ D +E+ +ESSS+ S S+ N+N
Sbjct: 1156 GTQESGSSSNTFDNKDAASNKVQLTTTS--DTSSEQKKDESSSAESKSESSQNDN 1314
>TC79800 similar to SP|O43280|TREA_HUMAN Trehalase precursor (EC 3.2.1.28)
(Alpha alpha-trehalase) (Alpha alpha-trehalose
glucohydrolase)., partial (2%)
Length = 654
Score = 32.0 bits (71), Expect = 2.1
Identities = 13/52 (25%), Positives = 24/52 (46%)
Frame = -3
Query: 840 QGCPKKPKPRKHDPPPKQQWRRNSSRNHDHRKPKPRSKPHSTQAAKNPPENR 891
Q P + H+PPP + +++ + DH P P +KP + + + R
Sbjct: 451 QPLPSSSPQKHHNPPPASPQQPSTASHTDHAYPPPSTKPGRREHQRREKQRR 296
>TC77663 similar to GP|12744987|gb|AAK06873.1 unknown protein {Arabidopsis
thaliana}, partial (11%)
Length = 1286
Score = 32.0 bits (71), Expect = 2.1
Identities = 20/64 (31%), Positives = 38/64 (59%), Gaps = 4/64 (6%)
Frame = +3
Query: 1039 QDLQSEVHTLQAEVKSLK----QIQISQQLILDKLTEENSEESSSSSSTPNSASNNNVGD 1094
++L E+ ++AE K LK Q + +L K+ N ESSSSSS+ + +S+++ G+
Sbjct: 369 EELLRELEKVRAEEKELKKKMKQEKKKAKLKPSKMKTCNKSESSSSSSSESESSDSDCGE 548
Query: 1095 FLEI 1098
+++
Sbjct: 549 VVDM 560
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.339 0.149 0.490
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 57,672,882
Number of Sequences: 36976
Number of extensions: 907616
Number of successful extensions: 8432
Number of sequences better than 10.0: 59
Number of HSP's better than 10.0 without gapping: 4502
Number of HSP's successfully gapped in prelim test: 446
Number of HSP's that attempted gapping in prelim test: 2961
Number of HSP's gapped (non-prelim): 5869
length of query: 1733
length of database: 9,014,727
effective HSP length: 110
effective length of query: 1623
effective length of database: 4,947,367
effective search space: 8029576641
effective search space used: 8029576641
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.8 bits)
S2: 65 (29.6 bits)
Lotus: description of TM0101.14