
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC143337.6 + phase: 0
(276 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
BF635063 weakly similar to PIR|F84486|F84 probable retroelement ...     348  1e-96
BG646342 weakly similar to PIR|F84486|F84 probable retroelement ...     138  2e-33
BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x pa...      64  6e-11
TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-...      41  4e-04
TC81230                                                                  40  0.001
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR...      40  0.001
AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing fact...      39  0.003
TC84935 similar to PIR|G96631|G96631 probable RNA-binding protei...      36  0.014
BG644747                                                                 36  0.014
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p...      36  0.018
TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4....      35  0.031
TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA...      32  0.27
BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10....      30  0.77
TC87639 GP|9663153|emb|CAC01132.1 transport-secretion protein 2....      30  1.0
TC82733 similar to GP|10177404|dbj|BAB10535. gene_id:K24M7.12~pi...      29  1.7
BG587684 similar to GP|12858190|dbj evidence:NAS~putative~unclas...      29  1.7
TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finge...      28  2.9
TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helica...      28  2.9
TC91834 similar to PIR|T08416|T08416 disease resistance protein ...      28  3.8
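The summary table above can be read programmatically by splitting each line from the right, since the last two fields are always the bit score and the E-value. The following is a minimal sketch, not part of the BLAST output; the function name `parse_hit_line` and the E-value cutoff of 0.001 are assumptions made for illustration.

```python
def parse_hit_line(line):
    """Split one summary line into (accession, description, bit score, E-value).

    Assumes the last two whitespace-separated fields are the bit score and
    the E-value, and the first field is the accession.
    """
    head, score, evalue = line.rsplit(None, 2)
    accession, _, description = head.partition(" ")
    return accession, description.strip(), float(score), float(evalue)


# Three lines copied from the table above; TC81230 has no description.
hits = [
    parse_hit_line("BF635063 weakly similar to PIR|F84486|F84 probable retroelement ... 348 1e-96"),
    parse_hit_line("TC81230 40 0.001"),
    parse_hit_line("TC91834 similar to PIR|T08416|T08416 disease resistance protein ... 28 3.8"),
]

# Hypothetical cutoff: keep hits whose E-value is at most 0.001.
significant = [acc for acc, _desc, _bits, e in hits if e <= 0.001]
print(significant)
```

Splitting from the right keeps the parser indifferent to how many words the truncated description contains.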
>BF635063 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 677
Score = 348 bits (893), Expect = 1e-96
Identities = 180/189 (95%), Positives = 184/189 (97%)
Frame = -2
Query: 1   MGSKWDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEGSLPVTMSRAEKTKMVDKARS 60
           MGSK DIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGE SLPVTMSRAEKT+MVDKARS
Sbjct: 568 MGSKRDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEVSLPVTMSRAEKTEMVDKARS 389
Query: 61  AVVLCLGDKVLREVAKEATAASMWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQ 120
           A+VLCLGDKVLREVAKE TAASMWAKL SLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQ
Sbjct: 388 AIVLCLGDKVLREVAKERTAASMWAKL*SLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQ 209
Query: 121 LTEFNKILDDLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALR 180
           LTEFNKILDDLENIEVQLEDE+KAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALR
Sbjct: 208 LTEFNKILDDLENIEVQLEDEEKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALR 29
Query: 181 TKDLTKSKD 189
           TK+LTKS D
Sbjct: 28  TKELTKSND 2
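Because this is a TBLASTN search, the Sbjct coordinates above are nucleotide positions on the EST, while the Query coordinates are amino-acid positions. The sketch below checks two things for the alignment just shown: that the subject span covers three nucleotides per aligned residue, and that the reported frame follows from the start coordinate. The frame formulas are an assumption that happens to agree with the frames printed in this report; the function names are hypothetical.

```python
def tblastn_frame(sbjct_start, sbjct_end, subject_length):
    """Infer the translation frame from nucleotide coordinates (assumed convention).

    Plus strand (start <= end): frame = (start - 1) % 3 + 1.
    Minus strand (start > end): frame = -((length - start) % 3 + 1).
    """
    if sbjct_start <= sbjct_end:
        return (sbjct_start - 1) % 3 + 1
    return -((subject_length - sbjct_start) % 3 + 1)


def aligned_codons(sbjct_start, sbjct_end):
    """Number of codons covered by a subject span given in nucleotides."""
    span = abs(sbjct_end - sbjct_start) + 1
    if span % 3 != 0:
        raise ValueError("subject span is not a whole number of codons")
    return span // 3


# BF635063: Length = 677, Sbjct runs 568 -> 2, Query runs 1 -> 189, Frame = -2.
print(tblastn_frame(568, 2, 677))   # frame inferred from the coordinates
print(aligned_codons(568, 2))       # codons on the subject
print(189 - 1 + 1)                  # residues on the query (no gaps in this HSP)
```

The codon count equals the query span only because this HSP has no gaps; for gapped HSPs the two lengths can differ.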
>BG646342 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 599
Score = 138 bits (348), Expect = 2e-33
Identities = 74/94 (78%), Positives = 80/94 (84%)
Frame = +2
Query: 58  ARSAVVLCLGDKVLREVAKEATAASMWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAI 117
           ARSA+VLCLGDKVLREVAKE TA SM AKLE LYMTKSLAHRQFLKQQLYSF+MVESKAI
Sbjct: 2   ARSAIVLCLGDKVLREVAKEPTATSMCAKLEYLYMTKSLAHRQFLKQQLYSFKMVESKAI 181
Query: 118 MEQLTEFNKILDDLENIEVQLEDEDKAILLLCAL 151
            E L EFNKI+ DLENIEV LED   A+++ C L
Sbjct: 182 TELLVEFNKIIGDLENIEVHLEDAG-ALMVWCCL 280
Score = 54.3 bits (129), Expect = 5e-08
Identities = 23/26 (88%), Positives = 25/26 (95%)
Frame = +2
Query: 251 EDAGALMVWCCLEEEKGDVSHLGIDA 276
           EDAGALMVWCCLE+E+GDVSHLG DA
Sbjct: 245 EDAGALMVWCCLEDEEGDVSHLGNDA 322
>BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x paradisi},
partial (3%)
Length = 658
Score = 63.9 bits (154), Expect = 6e-11
Identities = 37/138 (26%), Positives = 73/138 (52%), Gaps = 7/138 (5%)
Frame = +2
Query: 64 LCLG-------DKVLREVAKEATAASMWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKA 116
+CLG D + +A +W KLE+ YM + ++FL +++MV++K+
Sbjct: 197 ICLGHILNGMSDSLFDIYQSSPSAKDLWDKLETRYMREDATSKKFLVSHFNNYKMVDNKS 376
Query: 117 IMEQLTEFNKILDDLENIEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQ 176
+MEQL E +IL++ + + +++ ++ LP S++ FK TM + KE ++LE++
Sbjct: 377 VMEQLYEIERILNNYKQHNMNMDETIIVSSIIDKLPPSWKDFKRTMKHKKE-DISLEQLG 553
Query: 177 AALRTKDLTKSKDLTHEH 194
LR + + ++ H
Sbjct: 554 NHLRLXEEYRKQEGIKNH 607
>TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-1 {Triticum
aestivum}, partial (39%)
Length = 630
Score = 41.2 bits (95), Expect = 4e-04
Identities = 24/69 (34%), Positives = 31/69 (44%), Gaps = 4/69 (5%)
Frame = +3
Query: 185 TKSKDLTHEHGEGLSVTR----GNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKING 240
TK+ D+T GE L V + G GGGRG R C+ C GH +DC + +
Sbjct: 228 TKAVDVTGPKGEPLQVRQDNHGGGGGGRGFRGGERRNGGGGCYTCGDTGHIARDCDRSDR 407
Query: 241 NSAQIVSEG 249
N S G
Sbjct: 408 NDRNDRSGG 434
Score = 33.9 bits (76), Expect = 0.070
Identities = 22/74 (29%), Positives = 30/74 (39%)
Frame = +3
Query: 202 RGNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKINGNSAQIVSEGYEDAGALMVWCC 261
R GGG G+R ++ C+ C HF +DC + GN+ GY G C
Sbjct: 423 RSGGGGGGDRDRA-------CYTCGSFEHFARDCMRGGGNNNN-GGGGYGGGGTSCYRC- 575
Query: 262 LEEEKGDVSHLGID 275
G V H+ D
Sbjct: 576 -----GGVGHIARD 602
>TC81230
Length = 958
Score = 40.0 bits (92), Expect = 0.001
Identities = 38/171 (22%), Positives = 68/171 (39%), Gaps = 12/171 (7%)
Frame = +1
Query: 80 AASMWAKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEFNKILDDLENIEVQLE 139
A +W L+ Y L+H+ L + L + + + + E L + I + L + E L+
Sbjct: 475 AKEVWDHLKQRYTISDLSHQYQLLKDLSNLKQQSGQPVYEFLAQMEVIWNQLTSCEPSLK 654
Query: 140 DED------------KAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKDLTKS 187
D + I L AL +E + + L+ + TLE L++++ T+
Sbjct: 655 DATDMKTYETHRNRVRLIQFLMALTDEYEPVRASSLH-QNPLPTLENALPCLKSEE-TRL 828
Query: 188 KDLTHEHGEGLSVTRGNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKI 238
+ + + +VT N + C +C K GH DCP I
Sbjct: 829 QLVPPKADLAFAVT--------------NNATKPCRHCQKSGHSFSDCPTI 939
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 39.7 bits (91), Expect = 0.001
Identities = 16/41 (39%), Positives = 21/41 (51%)
Frame = +3
Query: 203 GNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKINGNSA 243
G GGGRG G S +C+ C + GHF ++C G A
Sbjct: 279 GGGGGRGRGGGGGGGSDLKCYECGEPGHFARECRNRGGGGA 401
>AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing factor
[imported] - Arabidopsis thaliana, partial (62%)
Length = 508
Score = 38.5 bits (88), Expect = 0.003
Identities = 16/40 (40%), Positives = 20/40 (50%)
Frame = +2
Query: 203 GNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKINGNS 242
G GGG G R G S +C+ C + GHF + C G S
Sbjct: 287 GGGGGGGRGRSGGGGSDLKCYXCGEPGHFARXCNSSPGGS 406
>TC84935 similar to PIR|G96631|G96631 probable RNA-binding protein F8A5.17
[imported] - Arabidopsis thaliana, partial (41%)
Length = 552
Score = 36.2 bits (82), Expect = 0.014
Identities = 13/41 (31%), Positives = 22/41 (52%)
Frame = +2
Query: 204 NGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKINGNSAQ 244
+ GGRG+ + +CF C + GH+ +DCP G+ +
Sbjct: 419 SSGGRGSYGAGDRVGQDDCFKCGRPGHWARDCPLAGGDGGR 541
>BG644747
Length = 685
Score = 36.2 bits (82), Expect = 0.014
Identities = 24/104 (23%), Positives = 52/104 (49%), Gaps = 1/104 (0%)
Frame = +1
Query: 57 KARSAVVLCLGDKVLREVAKE-ATAASMWAKLESLYMTKSLAHRQFLKQQLYSFRMVESK 115
K R + CL D + +++ +W L+S+Y + ++ + F+MV++K
Sbjct: 247 KCRYHIFKCLYDNFYDYYDRTYSSSKKIWKALQSMYDIEDARA*KYTDS*FFRFKMVDNK 426
Query: 116 AIMEQLTEFNKILDDLENIEVQLEDEDKAILLLCALPKSFESFK 159
++++Q +F I+ L + EV++ D ++ LP S + F+
Sbjct: 427 SMVDQAQDFIMIVRYLRSKEVKIGDNLIVCGIVDKLPPS*KKFQ 558
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 35.8 bits (81), Expect = 0.018
Identities = 13/33 (39%), Positives = 18/33 (54%)
Frame = +1
Query: 203 GNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDC 235
G GGGRG R +C+ C + GHF ++C
Sbjct: 262 GGGGGRGGGRGGRGGDDLKCYECGEPGHFAREC 360
>TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.150 -
Arabidopsis thaliana, partial (17%)
Length = 378
Score = 35.0 bits (79), Expect = 0.031
Identities = 17/50 (34%), Positives = 25/50 (50%)
Frame = +1
Query: 187 SKDLTHEHGEGLSVTRGNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCP 236
+++ T G G G GGG G G+ C++C + GHF +DCP
Sbjct: 46 ARECTSGGGGGGGRYGGGGGGGGGGGGGGS-----CYSCGESGHFARDCP 180
>TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA helicase
{Oryza sativa}, partial (7%)
Length = 624
Score = 32.0 bits (71), Expect = 0.27
Identities = 15/39 (38%), Positives = 20/39 (50%), Gaps = 3/39 (7%)
Frame = +1
Query: 206 GGRGNRRKSGNKSRF---ECFNCHKMGHFKKDCPKINGN 241
GGR + R S + +R CF C + GH DCP G+
Sbjct: 109 GGRQSSRSSSSPNRSFAGTCFTCGESGHRASDCPNKRGD 225
>BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10.5 [imported]
- Arabidopsis thaliana, partial (5%)
Length = 627
Score = 30.4 bits (67), Expect = 0.77
Identities = 17/55 (30%), Positives = 26/55 (46%)
Frame = -1
Query: 200 VTRGNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKINGNSAQIVSEGYEDAG 254
V+ G+GG GN C+ C++ GH+ +CP + SA S G + G
Sbjct: 435 VSGGSGGASGN-----------CYKCNQPGHWANNCPNM---SAAPQSHGNSNTG 313
Score = 27.7 bits (60), Expect = 5.0
Identities = 15/55 (27%), Positives = 26/55 (47%)
Frame = -1
Query: 195 GEGLSVTRGNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDCPKINGNSAQIVSEG 249
G ++ G+GG G +C+ C + GH+ +CP + ++A VS G
Sbjct: 549 GAYVNTVSGSGGASG-----------KCYKCQQPGHWASNCPSM--SAANRVSGG 424
>TC87639 GP|9663153|emb|CAC01132.1 transport-secretion protein 2.2 (TTS-2.2)
{Homo sapiens}, partial (2%)
Length = 1522
Score = 30.0 bits (66), Expect = 1.0
Identities = 37/183 (20%), Positives = 64/183 (34%), Gaps = 32/183 (17%)
Frame = -1
Query: 85 AKLESLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEFNKILDD-----------LEN 133
A L+ Y+ ++ R QL S R +++ + L F K+L D +
Sbjct: 778 AYLDRTYLDPNIQSRAVA--QLQSLRQKDTERLATFLPRFEKVLADAGGYSWPDVVQISL 605
Query: 134 IEVQLEDEDKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKDLTKSKDLTHE 193
+E L K +L+ LP + + L +VQ + K+
Sbjct: 604 LETALVPRLKELLITVELPTVYSQW-------------LSKVQDIAWKMERMKTPPTRWA 464
Query: 194 HGEGLSVTRGNGGG-------RGNRRKSGNKSRF--------------ECFNCHKMGHFK 232
L V++ G RR+ G+ S EC++CH+ GH
Sbjct: 463 PATRLPVSKDRDGDMMMTGAIHKQRRRRGSSSSVSSAEGAPPPRRDMRECYSCHERGHIA 284
Query: 233 KDC 235
++C
Sbjct: 283 RNC 275
>TC82733 similar to GP|10177404|dbj|BAB10535.
gene_id:K24M7.12~pir||S42136~similar to unknown protein
{Arabidopsis thaliana}, partial (57%)
Length = 710
Score = 29.3 bits (64), Expect = 1.7
Identities = 9/17 (52%), Positives = 12/17 (69%)
Frame = +3
Query: 221 ECFNCHKMGHFKKDCPK 237
+CF C + GH K+CPK
Sbjct: 510 QCFVCKEQGHLSKNCPK 560
>BG587684 similar to GP|12858190|dbj evidence:NAS~putative~unclassifiable
{Mus musculus}, partial (14%)
Length = 819
Score = 29.3 bits (64), Expect = 1.7
Identities = 11/16 (68%), Positives = 13/16 (80%)
Frame = +3
Query: 1 MGSKWDIEKFTGDNDF 16
M +KWDIE F G+NDF
Sbjct: 735 MDTKWDIEIFIGENDF 782
>TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (96%)
Length = 1286
Score = 28.5 bits (62), Expect = 2.9
Identities = 9/15 (60%), Positives = 10/15 (66%)
Frame = +1
Query: 222 CFNCHKMGHFKKDCP 236
C NC K GH +DCP
Sbjct: 730 CNNCRKTGHLARDCP 774
Score = 28.5 bits (62), Expect = 2.9
Identities = 11/34 (32%), Positives = 15/34 (43%)
Frame = +1
Query: 202 RGNGGGRGNRRKSGNKSRFECFNCHKMGHFKKDC 235
RG GG + G C +C + GH +DC
Sbjct: 853 RGGGGSLRGGYRDGGFRDVVCRSCQQFGHMSRDC 954
Score = 28.1 bits (61), Expect = 3.8
Identities = 22/71 (30%), Positives = 32/71 (44%), Gaps = 10/71 (14%)
Frame = +1
Query: 210 NRRKSGNKSRFECFN------CHKMGHFKKDCPKIN----GNSAQIVSEGYEDAGALMVW 259
N RK+G+ +R +C N C+ GH + CPK N + GY D G V
Sbjct: 736 NCRKTGHLAR-DCPNDPICNLCNISGHVARQCPKSNVIGDRGGGGSLRGGYRDGGFRDVV 912
Query: 260 CCLEEEKGDVS 270
C ++ G +S
Sbjct: 913 CRSCQQFGHMS 945
Score = 26.9 bits (58), Expect = 8.5
Identities = 7/17 (41%), Positives = 12/17 (70%)
Frame = +1
Query: 222 CFNCHKMGHFKKDCPKI 238
C NC + GH+ ++CP +
Sbjct: 424 CKNCKRPGHYVRECPNV 474
>TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helicase {Oryza
sativa}, partial (3%)
Length = 737
Score = 28.5 bits (62), Expect = 2.9
Identities = 14/38 (36%), Positives = 18/38 (46%), Gaps = 3/38 (7%)
Frame = +3
Query: 202 RGNGGGRGNRRKSGNKSRF---ECFNCHKMGHFKKDCP 236
R +G NR S N+ CF+C + GH DCP
Sbjct: 78 RSSGYSSSNRSSSPNRRGSYGGACFSCGQPGHRASDCP 191
>TC91834 similar to PIR|T08416|T08416 disease resistance protein homolog
F18B3.230 - Arabidopsis thaliana, partial (3%)
Length = 803
Score = 28.1 bits (61), Expect = 3.8
Identities = 14/34 (41%), Positives = 24/34 (70%), Gaps = 1/34 (2%)
Frame = +2
Query: 111 MVESKAIMEQL-TEFNKILDDLENIEVQLEDEDK 143
+VE + ++ L ++FN I DDLE+I+ L+D D+
Sbjct: 107 VVEERTLVTGLESDFNDIKDDLESIQSFLKDADR 208
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.316 0.132 0.382
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,600,684
Number of Sequences: 36976
Number of extensions: 86518
Number of successful extensions: 601
Number of sequences better than 10.0: 48
Number of HSP's better than 10.0 without gapping: 578
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 599
length of query: 276
length of database: 9,014,727
effective HSP length: 95
effective length of query: 181
effective length of database: 5,502,007
effective search space: 995863267
effective search space used: 995863267
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (26.9 bits)
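The figures in this statistics block can be cross-checked with a few lines of arithmetic. The sketch below assumes the usual Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = search space * 2^-S') and assumes that the effective lengths are obtained by subtracting the effective HSP length from the query and from each database sequence; those assumptions reproduce the numbers printed above, but the variable and function names are illustrative only.

```python
import math

# Values copied from the report.
query_len = 276            # length of query (amino acids)
db_letters = 27_044_181    # nucleotide letters in MTGI
db_seqs = 36_976           # sequences in MTGI
hsp_len = 95               # effective HSP length

# TBLASTN translates the database, so its length is counted in codons.
db_len_aa = db_letters // 3
eff_query = query_len - hsp_len
eff_db = db_len_aa - db_seqs * hsp_len
search_space = eff_query * eff_db


def bit_score(raw, lam, k):
    """Convert a raw score to bits with the given lambda and K."""
    return (lam * raw - math.log(k)) / math.log(2)


def expect(bits, space):
    """Expected number of chance hits with at least this bit score."""
    return space * 2.0 ** (-bits)


print(db_len_aa, eff_query, eff_db, search_space)

# Top hit: raw score 893 with the gapped parameters (lambda 0.267, K 0.0410).
top_bits = bit_score(893, 0.267, 0.0410)
print(top_bits, expect(top_bits, search_space))

# Cutoffs: S1 uses the ungapped parameters, S2 the gapped ones.
print(bit_score(41, 0.316, 0.132), bit_score(58, 0.267, 0.0410))
```

Because lambda and K are printed to only three significant digits, the recomputed bit scores and E-value agree with the report to within rounding rather than exactly.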
Medicago: description of AC143337.6