
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0252c.1
(325 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Ci... 111 4e-25
TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA PO... 51 7e-07
TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulch... 50 2e-06
BG647713 homologue to GP|15042313|gb| 232R {Chilo iridescent vir... 49 3e-06
TC92636 homologue to GP|15042313|gb|AAK82093.1 232R {Chilo iride... 40 0.001
AL366725 39 0.004
TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.... 36 0.018
BF649369 35 0.051
BG644717 33 0.11
TC84935 similar to PIR|G96631|G96631 probable RNA-binding protei... 33 0.15
BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10.... 33 0.15
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p... 33 0.15
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 33 0.20
TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 32 0.33
BG644741 31 0.74
AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing fact... 31 0.74
TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA... 30 0.97
TC87381 30 1.3
BG447595 29 2.2
TC82733 similar to GP|10177404|dbj|BAB10535. gene_id:K24M7.12~pi... 28 3.7
>TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Cicer
arietinum}, partial (8%)
Length = 516
Score = 111 bits (277), Expect = 4e-25
Identities = 55/140 (39%), Positives = 79/140 (56%)
Frame = +2
Query: 66 DQNRGLNNFIRQNPPKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAKVGLATYLLLGDAE 125
D R L F+R +PP F G PD A W++EIE+IF V+Q E KV T++L +A+
Sbjct: 92 DGTRMLETFLRNHPPTFKGRYAPDGA*KWLKEIERIFRVMQCFETQKVQFGTHMLAEEAD 271
Query: 126 YWWRGARGMMEANHVEVNWNSFRAAFLEKYFPDSARDERESQFLTLRQGSMTIPEYAAKL 185
WW ++E + V W FR FL +YFP+ R ++E +FL L+QG M++ EYAAK
Sbjct: 272 DWWISLLPVLEQDDAVVTWAMFRKEFLGRYFPEDVRGKKEIEFLELKQGDMSVTEYAAKF 451
Query: 186 ESLAKHFRFFRDQVDEPYMC 205
LA + + + E C
Sbjct: 452 VELATFYPHYSAETAEFSKC 511
>TC87383 similar to GP|19168656|emb|CAD26175. DNA-DIRECTED RNA POLYMERASE II
{Encephalitozoon cuniculi}, partial (0%)
Length = 1247
Score = 50.8 bits (120), Expect = 7e-07
Identities = 45/188 (23%), Positives = 75/188 (38%), Gaps = 2/188 (1%)
Frame = -2
Query: 15 KRIQNMVNANQLAEMVATLVQAMTVQTNDNAQRRAAEDARELHLRQREASLDQNRGLNNF 74
+++Q +VNA Q L++A + D+ + +R ++RE + N
Sbjct: 880 QQLQEIVNAQQ------ALLEAQQKRFKDHVSSSDSLSSRSSRSQRREFQM-------ND 740
Query: 75 IRQNPPKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAKVGLATYLLLGDAEYWWRGA--R 132
I+ + P F G PD+ W+Q +E++F+ + E KV + L A WW R
Sbjct: 739 IK*DIPDFEGNLQPDDLLDWLQIMERLFKYKEVLEEQKVKIVAAKLKKLASIWWENVKRR 560
Query: 133 GMMEANHVEVNWNSFRAAFLEKYFPDSARDERESQFLTLRQGSMTIPEYAAKLESLAKHF 192
E W R KY P Q + T P+ + K S +HF
Sbjct: 559 RKREGKSKIKTWEKMRQKLTRKYLPPH-----------YYQDNYTQPQLSKK--SSYRHF 419
Query: 193 RFFRDQVD 200
++Q+D
Sbjct: 418 SPTKNQID 395
>TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulchellus},
partial (7%)
Length = 2304
Score = 49.7 bits (117), Expect = 2e-06
Identities = 41/152 (26%), Positives = 57/152 (36%), Gaps = 8/152 (5%)
Frame = +2
Query: 14 PKRIQNMVNANQLAEMVATLVQAMTVQTNDNA-----QRRAAEDARELHLRQREASLDQN 68
P R QN + ++ +M + Q + A QRR D +S Q
Sbjct: 413 PPRRQNERSLQEMEDMRRQIQQLQEIINAQQALLEAEQRRFEGDVSYSDSSSSRSSHSQR 592
Query: 69 RGLN-NFIRQNPPKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAKVGLATYLLLGDAEYW 127
R L N I+ + P F G D+ W+Q IE++FE + E KV + L A W
Sbjct: 593 RQLQMNDIKVDIPDFEGNLQLDDFLDWLQTIERVFEYKEVPEEQKVKIVAAKLKKHALIW 772
Query: 128 WRG--ARGMMEANHVEVNWNSFRAAFLEKYFP 157
W R E W+ R KY P
Sbjct: 773 WENLKRRRKREGKSKIKTWDKMRQKLTRKYLP 868
>BG647713 homologue to GP|15042313|gb| 232R {Chilo iridescent virus}, partial
(1%)
Length = 726
Score = 48.5 bits (114), Expect = 3e-06
Identities = 34/141 (24%), Positives = 58/141 (41%), Gaps = 2/141 (1%)
Frame = +2
Query: 17 IQNMVNANQLAEMVATLVQAMTVQTNDNAQRRAAEDARELHLRQREASLDQNRGLNNFIR 76
+Q VNA Q L++A + +D+ + +R +R+ + + I+
Sbjct: 197 LQETVNAQQ------ALLEAQRRRNDDDGSGSDSSSSRSSRSHRRQTRMSK-------IK 337
Query: 77 QNPPKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAKVGLATYLLLGDAEYWWRGARGM-- 134
+ P F G PDE W+Q IE++F+ + +E KV + L A WW+ +
Sbjct: 338 VDIPDF*GKLQPDEFVDWLQTIERVFKYKEVAEEQKVKIVAAKLKKHASIWWKNLKRKRN 517
Query: 135 MEANHVEVNWNSFRAAFLEKY 155
E W+ R KY
Sbjct: 518 CEGKSKIKTWDKMRQKLTRKY 580
>TC92636 homologue to GP|15042313|gb|AAK82093.1 232R {Chilo iridescent
virus}, partial (1%)
Length = 772
Score = 40.4 bits (93), Expect = 0.001
Identities = 29/96 (30%), Positives = 41/96 (42%), Gaps = 8/96 (8%)
Frame = +3
Query: 25 QLAEMVATLVQAMTVQTNDNAQ--------RRAAEDARELHLRQREASLDQNRGLNNFIR 76
Q EM Q +Q NAQ RR +D R + + + L N I+
Sbjct: 249 QEMEMEEMRRQIQELQETVNAQQAILEAERRRVDDDGSSDSSSSRSSRSHRRKTLMNDIK 428
Query: 77 QNPPKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAK 112
+ P F G PDE W+Q IE++FE + GA+
Sbjct: 429 VDIPDFEGELQPDEFVDWLQAIERVFEYKEIPRGAQ 536
>AL366725
Length = 485
Score = 38.5 bits (88), Expect = 0.004
Identities = 32/134 (23%), Positives = 49/134 (35%)
Frame = +2
Query: 190 KHFRFFRDQVDEPYMCKRFVRGLRADIEDSVRPLGIMRFQALVEKATEVELMKNRRMDRA 249
K + + + E C +F GLR DI+ R +G + + + + +
Sbjct: 2 KFYPHYAAETAEFSKCIKFENGLRPDIK---RAIGYQQLRVFPDLVNTCRIYEEDTKAHD 172
Query: 250 GTGGPMRTSSRSYQGKGKLQRKKPYQRPTGEGFTPGLYRPTIAAAGGAGSQAGSREITCF 309
+T KG+ R KPY P +G + + + EI CF
Sbjct: 173 KVVNERKT-------KGQ*SRPKPYSAPADKG------KQRMVDDRRPKKKDAPAEIVCF 313
Query: 310 KCGEIGHYSTKCPK 323
GE GH S CPK
Sbjct: 314 NYGEKGHKSNVCPK 355
>TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.150 -
Arabidopsis thaliana, partial (17%)
Length = 378
Score = 36.2 bits (82), Expect = 0.018
Identities = 13/30 (43%), Positives = 17/30 (56%)
Frame = +1
Query: 295 GGAGSQAGSREITCFKCGEIGHYSTKCPKG 324
GG G G +C+ CGE GH++ CP G
Sbjct: 97 GGGGGGGGGGGGSCYSCGESGHFARDCPTG 186
>BF649369
Length = 631
Score = 34.7 bits (78), Expect = 0.051
Identities = 35/160 (21%), Positives = 65/160 (39%)
Frame = +3
Query: 80 PKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAKVGLATYLLLGDAEYWWRGARGMMEANH 139
P F G D+ WI E F+V T + +V L+ + G +W+ ++
Sbjct: 135 PLFEG----DDPVAWITRAEIYFDVQNTPDDMRVKLSRLSMEGPTIHWF----NLLMETE 290
Query: 140 VEVNWNSFRAAFLEKYFPDSARDERESQFLTLRQGSMTIPEYAAKLESLAKHFRFFRDQV 199
+++ + A + +Y D R E + L+ + ++ E+ E L+ ++
Sbjct: 291 DDLSREKLKKALIARY--DGRRLENPFEELSTLRQIGSVEEFVEAFELLSSQV----GRL 452
Query: 200 DEPYMCKRFVRGLRADIEDSVRPLGIMRFQALVEKATEVE 239
E F+ GL+A I VR L ++ A +VE
Sbjct: 453 PEEQYLGYFMSGLKAHIRRRVRTLNPTTRMQMMRIAKDVE 572
>BG644717
Length = 267
Score = 33.5 bits (75), Expect = 0.11
Identities = 17/43 (39%), Positives = 25/43 (57%)
Frame = -2
Query: 78 NPPKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAKVGLATYLL 120
N P+F G ++ ++ EI+KIFEV+ S V LA+Y L
Sbjct: 266 NSPEFLGSQINEDPQNFLDEIKKIFEVMHVSGNDLVELASYQL 138
>TC84935 similar to PIR|G96631|G96631 probable RNA-binding protein F8A5.17
[imported] - Arabidopsis thaliana, partial (41%)
Length = 552
Score = 33.1 bits (74), Expect = 0.15
Identities = 16/40 (40%), Positives = 22/40 (55%), Gaps = 4/40 (10%)
Frame = +2
Query: 287 YRPTIAAAG----GAGSQAGSREITCFKCGEIGHYSTKCP 322
YR ++ G GAG + G + CFKCG GH++ CP
Sbjct: 404 YRGGFSSGGRGSYGAGDRVGQDD--CFKCGRPGHWARDCP 517
>BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10.5 [imported]
- Arabidopsis thaliana, partial (5%)
Length = 627
Score = 33.1 bits (74), Expect = 0.15
Identities = 11/34 (32%), Positives = 19/34 (55%)
Frame = -1
Query: 289 PTIAAAGGAGSQAGSREITCFKCGEIGHYSTKCP 322
P+++AA +G C+KC + GH++ CP
Sbjct: 459 PSMSAANRVSGGSGGASGNCYKCNQPGHWANNCP 358
Score = 32.7 bits (73), Expect = 0.20
Identities = 13/38 (34%), Positives = 22/38 (57%)
Frame = -1
Query: 285 GLYRPTIAAAGGAGSQAGSREITCFKCGEIGHYSTKCP 322
G Y T++ +GGA + C+KC + GH+++ CP
Sbjct: 549 GAYVNTVSGSGGASGK-------CYKCQQPGHWASNCP 457
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 33.1 bits (74), Expect = 0.15
Identities = 12/34 (35%), Positives = 21/34 (61%), Gaps = 6/34 (17%)
Frame = +1
Query: 294 AGGAGSQAGSR------EITCFKCGEIGHYSTKC 321
+GG G + G R ++ C++CGE GH++ +C
Sbjct: 259 SGGGGGRGGGRGGRGGDDLKCYECGEPGHFAREC 360
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 32.7 bits (73), Expect = 0.20
Identities = 10/27 (37%), Positives = 17/27 (62%)
Frame = +3
Query: 295 GGAGSQAGSREITCFKCGEIGHYSTKC 321
G G G ++ C++CGE GH++ +C
Sbjct: 297 GRGGGGGGGSDLKCYECGEPGHFAREC 377
>TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (96%)
Length = 1286
Score = 32.0 bits (71), Expect = 0.33
Identities = 26/96 (27%), Positives = 42/96 (43%), Gaps = 19/96 (19%)
Frame = +1
Query: 246 MDRAGTGGPMRTSSRSYQGKGKLQRKKPYQRPTGEGFT----------PGLYR---PTIA 292
MDR+ + P+ RS + R+ PY+R + GF+ PG Y P +A
Sbjct: 307 MDRSRSRSPVDRRIRSERFS---HREAPYRRDSRRGFSQDNLCKNCKRPGHYVRECPNVA 477
Query: 293 AA------GGAGSQAGSREITCFKCGEIGHYSTKCP 322
G S+ ++ + C+ C E GH ++ CP
Sbjct: 478 VCHNCSLPGHIASECSTKSL-CWNCKEPGHMASSCP 582
>BG644741
Length = 735
Score = 30.8 bits (68), Expect = 0.74
Identities = 24/97 (24%), Positives = 46/97 (46%), Gaps = 5/97 (5%)
Frame = -2
Query: 120 LLGDAEYWWRGARGMMEANHVEVNWNSFRAAFLEKYFPDSAR---DERESQFLTL--RQG 174
L+G+A+ W+ + N + WN R FL +Y+P S + ++R + F+ L
Sbjct: 566 LMGEADIWFTE----LPYNSI-FTWNQLRDVFLARYYPVSKKLNHNDRVNNFVALPGESV 402
Query: 175 SMTIPEYAAKLESLAKHFRFFRDQVDEPYMCKRFVRG 211
S + + + L S+ H ++D+ + + F RG
Sbjct: 401 SSSWDRFTSFLRSVPNH------RIDDDSLKEYFYRG 309
>AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing factor
[imported] - Arabidopsis thaliana, partial (62%)
Length = 508
Score = 30.8 bits (68), Expect = 0.74
Identities = 12/28 (42%), Positives = 17/28 (59%), Gaps = 1/28 (3%)
Frame = +2
Query: 295 GGAG-SQAGSREITCFKCGEIGHYSTKC 321
GG G S G ++ C+ CGE GH++ C
Sbjct: 302 GGRGRSGGGGSDLKCYXCGEPGHFARXC 385
>TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA helicase
{Oryza sativa}, partial (7%)
Length = 624
Score = 30.4 bits (67), Expect = 0.97
Identities = 20/71 (28%), Positives = 33/71 (46%)
Frame = +1
Query: 252 GGPMRTSSRSYQGKGKLQRKKPYQRPTGEGFTPGLYRPTIAAAGGAGSQAGSREITCFKC 311
G R RSY+ + +R + + + G + + +++ S AG TCF C
Sbjct: 19 GDSSRRGGRSYKSGNSWSKP---ERSSRDDWLIGGRQSSRSSSSPNRSFAG----TCFTC 177
Query: 312 GEIGHYSTKCP 322
GE GH ++ CP
Sbjct: 178 GESGHRASDCP 210
>TC87381
Length = 814
Score = 30.0 bits (66), Expect = 1.3
Identities = 19/60 (31%), Positives = 26/60 (42%)
Frame = +3
Query: 73 NFIRQNPPKFTGGTDPDEADLWIQEIEKIFEVLQTSEGAKVGLATYLLLGDAEYWWRGAR 132
N I+ + P F G DE +Q IE +FE + E KV + L A WW +
Sbjct: 615 NDIKVDIPDFEGELQSDEFVD*LQAIECVFEYKEIPEDHKVKVVAV*LKKHALIWWENLK 794
>BG447595
Length = 309
Score = 29.3 bits (64), Expect = 2.2
Identities = 17/37 (45%), Positives = 20/37 (53%)
Frame = -2
Query: 139 HVEVNWNSFRAAFLEKYFPDSARDERESQFLTLRQGS 175
H E NW SFRA F EK S R + + LR+GS
Sbjct: 278 HSERNWGSFRA*FGEKQ*IYSRRGQNWQEKWKLREGS 168
>TC82733 similar to GP|10177404|dbj|BAB10535.
gene_id:K24M7.12~pir||S42136~similar to unknown protein
{Arabidopsis thaliana}, partial (57%)
Length = 710
Score = 28.5 bits (62), Expect = 3.7
Identities = 11/24 (45%), Positives = 13/24 (53%)
Frame = +3
Query: 300 QAGSREITCFKCGEIGHYSTKCPK 323
+ G+ CF C E GH S CPK
Sbjct: 489 EGGTMFAQCFVCKEQGHLSKNCPK 560
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.321 0.136 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,488,505
Number of Sequences: 36976
Number of extensions: 90865
Number of successful extensions: 402
Number of sequences better than 10.0: 45
Number of HSP's better than 10.0 without gapping: 389
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 401
length of query: 325
length of database: 9,014,727
effective HSP length: 96
effective length of query: 229
effective length of database: 5,465,031
effective search space: 1251492099
effective search space used: 1251492099
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.9 bits)
Lotus: description of TM0252c.1