
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146709.5 + phase: 0 /pseudo
(1304 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BF635063 weakly similar to PIR|F84486|F84 probable retroelement ... 290 2e-78
TC89912 weakly similar to PIR|B84512|B84512 probable retroelemen... 162 6e-40
BG646342 weakly similar to PIR|F84486|F84 probable retroelement ... 117 2e-26
BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x pa... 64 5e-10
AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {V... 54 5e-07
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 45 2e-04
TC93065 45 2e-04
TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative ret... 42 0.002
AW736233 GP|21105451|gb small nuclear ribonucleoprotein D1 {Dani... 41 0.003
AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing fact... 35 0.19
TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 34 0.32
TC84935 similar to PIR|G96631|G96631 probable RNA-binding protei... 33 0.55
TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-... 33 0.55
TC81230 33 0.94
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p... 32 1.6
CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotia... 32 1.6
TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA... 32 2.1
TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helica... 32 2.1
TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.... 30 4.7
TC88667 similar to GP|6728982|gb|AAF26980.1| unknown protein {Ar... 30 6.1
>BF635063 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 677
Score = 290 bits (743), Expect = 2e-78
Identities = 147/186 (79%), Positives = 163/186 (87%)
Frame = -2
Query: 2 MGSKWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMSAHLTPAEKTEMNDKAVS 61
MGSK DIEKFTG NDFGLWKVKM A+LIQQKC +ALKGE + ++ AEKTEM DKA S
Sbjct: 568 MGSKRDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEVSLPVTMSRAEKTEMVDKARS 389
Query: 62 AIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQ 121
AI+LCLGDKVLREV++E TA SMW KL SLYMTKSLAHRQ LKQQLY +RMVESK IMEQ
Sbjct: 388 AIVLCLGDKVLREVAKERTAASMWAKL*SLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQ 209
Query: 122 LTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALR 181
LTEFNKI+DDL NI+V LEDE+K + LLCALP+SFE+FKDTMLYGKEGT+TLEEVQAALR
Sbjct: 208 LTEFNKILDDLENIEVQLEDEEKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALR 29
Query: 182 TKELTK 187
TKELTK
Sbjct: 28 TKELTK 11
>TC89912 weakly similar to PIR|B84512|B84512 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(10%)
Length = 814
Score = 162 bits (411), Expect = 6e-40
Identities = 94/164 (57%), Positives = 114/164 (69%)
Frame = +2
Query: 1125 KL*SGF*DI*MDL*EVV*GIQGHNQVGMLWRDM*MRTMQVT*TLENLCQILCLHCSVQR* 1184
KL*SG * I*M L* V* Q + LWR M M+TM+ T ENL ++LCL +
Sbjct: 2 KL*SGC*SI*MSL*RAV*STQKQLKRKTLWRGMLMQTMRAMWTQENLYRVLCLLSMARLL 181
Query: 1185 LGRQINNQL*LFQQLKQST*PLLKGSRKPYG*KV*LEKWGLVKDV*RYIVIAKVPFIWQI 1244
+GRQINN * +QQLK+ST PL KG + PYG*KV*L L+K++*RYIVI KVPF W+I
Sbjct: 182 VGRQINNPW*HYQQLKRSTLPL*KG*KMPYG*KV*LVS*ELLKNM*RYIVIVKVPFTWRI 361
Query: 1245 IRCIMRGQSTLTFVCTSSET*LRLRRSW*RKWHRKTIQQTCSPN 1288
I+CIMRG STLTF CT ET*L +R W +KWHRK I++ C P+
Sbjct: 362 IKCIMRGLSTLTFACTLLET*LNQKRLWWKKWHRKRIRRMCLPS 493
>BG646342 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 599
Score = 117 bits (294), Expect = 2e-26
Identities = 62/92 (67%), Positives = 72/92 (77%)
Frame = +2
Query: 59 AVSAIILCLGDKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPI 118
A SAI+LCLGDKVLREV++E TA SM KL+ LYMTKSLAHRQ LKQQLY ++MVESK I
Sbjct: 2 ARSAIVLCLGDKVLREVAKEPTATSMCAKLEYLYMTKSLAHRQFLKQQLYSFKMVESKAI 181
Query: 119 MEQLTEFNKIIDDLANIDVNLEDEDKVLHLLC 150
E L EFNKII DL NI+V+LED ++ C
Sbjct: 182 TELLVEFNKIIGDLENIEVHLEDAGALMVWCC 277
>BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x paradisi},
partial (3%)
Length = 658
Score = 63.5 bits (153), Expect = 5e-10
Identities = 38/127 (29%), Positives = 70/127 (54%), Gaps = 7/127 (5%)
Frame = +2
Query: 65 LCLG-------DKVLREVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKP 117
+CLG D + +A +W+KL++ YM + ++ L Y+MV++K
Sbjct: 197 ICLGHILNGMSDSLFDIYQSSPSAKDLWDKLETRYMREDATSKKFLVSHFNNYKMVDNKS 376
Query: 118 IMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQ 177
+MEQL E +I+++ ++N+++ V ++ LP S+++FK TM + KE I+LE++
Sbjct: 377 VMEQLYEIERILNNYKQHNMNMDETIIVSSIIDKLPPSWKDFKRTMKHKKE-DISLEQLG 553
Query: 178 AALRTKE 184
LR E
Sbjct: 554 NHLRLXE 574
>AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (18%)
Length = 567
Score = 53.5 bits (127), Expect = 5e-07
Identities = 25/83 (30%), Positives = 48/83 (57%)
Frame = +3
Query: 277 TVTSWEPEKSWVLDSGCSYHICPRKEYFETLELKEGGVVRLGNNKACKIQGMGTIRLKMF 336
T + +P K W++DSGC++H+ ++ F+ L VR+ N +++G+GT+ +K
Sbjct: 78 TFATKQPSKYWLIDSGCTHHMTHDRDLFKELNKSTISKVRMLNGAHIEVEGIGTVLVKSH 257
Query: 337 DDRDFLLKNVRYIPELKRNLISI 359
+ NV Y P+L ++L+S+
Sbjct: 258 SGYK-QISNVLYAPKLNQSLLSV 323
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 45.1 bits (105), Expect = 2e-04
Identities = 19/63 (30%), Positives = 26/63 (41%)
Frame = +3
Query: 195 DSGEGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKDN 254
D G + + G G G + R G G + KC+ C PGHF ++C R
Sbjct: 216 DGKNGWRVELSHNSRSGGGGGGGGGGRGRGGGGGGGSDLKCYECGEPGHFARECRNRGGG 395
Query: 255 GGG 257
G G
Sbjct: 396 GAG 404
>TC93065
Length = 783
Score = 45.1 bits (105), Expect = 2e-04
Identities = 40/173 (23%), Positives = 80/173 (46%), Gaps = 14/173 (8%)
Frame = +2
Query: 103 LKQQLYFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKVLHLLCALPRSFENFKDT 162
L+++ +M E++ + E +K++ + + +L D+ V +L LP FE +
Sbjct: 215 LRREFEALKMKETETVREFSDRISKVVTQIRLLGEDLSDQRVVEKILVCLPEMFEAKISS 394
Query: 163 MLYGKE-GTITLEEVQAALRTKELTKFKELKVDDSGEGLNISRGRSHNKGKGKGKNSRSK 221
+ K IT+ E+ AL+ E + + L+++++ EG ++ +NKGK + S K
Sbjct: 395 LEENKNFSEITVAELVNALQASE--QRRSLRMEENVEGAFLA----NNKGKNQSFKSFGK 556
Query: 222 SR------SKGDGNKTQY-------KCFICHNPGHFKKDCPERKDNGGGESSV 261
+ K D + ++ KC C+ GH +K C + + E+ V
Sbjct: 557 KKFPPCPHCKKDTHLDKFCWYRPGVKCRACNQLGHVEKVCKNKTNQQEQEARV 715
>TC93066 weakly similar to GP|19920130|gb|AAM08562.1 Putative retroelement
{Oryza sativa} [Oryza sativa (japonica cultivar-group)],
partial (10%)
Length = 823
Score = 41.6 bits (96), Expect = 0.002
Identities = 66/227 (29%), Positives = 98/227 (43%), Gaps = 2/227 (0%)
Frame = +2
Query: 472 LSMFIRIFGVQHRRRLMGEVHISCLSLMIILEEYGFTF*RRKVMLLKNSRNGIHL*KIRL 531
L +FI FG + M + I LSLMI L +GF F K+ L +SR+G L K+R
Sbjct: 98 LIIFILTFGDLQKLLPMEDAAI**LSLMIFLGRFGFIFCGIKMRLFPHSRSGEFLLKLRQ 277
Query: 532 VPN*KC*ELTMAWSLFQNSLMSFAGRKV*RGIEPWHTHLNRMVLLKE*TGLCWSA*GVCC 591
*+ + S +LMS A V +P+ N+ VL E*+GL VC
Sbjct: 278 GRM*RSS*QIID*SSVVVTLMSSAQIMVLLDTKPFQGIPNKTVLQNE*SGLYLRELDVCS 457
Query: 592 *KLD--CPRVSGERLLVLQHI*LTDVHQQG*ISRHLWRFGVEDRQTTLT*KSSEL*RLPM 649
L + G R +L T +H Q S+ FG L *+ ++ + +
Sbjct: 458 QMLGYRIDVIFGSRQHLLHVTWSTVLHIQHLTSKFQKIFGQVILLIILI*EFLDVQHMHL 637
Query: 650 SGKTNLMLEL*SVFSWVILKV*KVIDCGRWNLENQNLL*AGMLLLMR 696
S N + E S + +++ + I CG ++N M LLMR
Sbjct: 638 SMMAN*LQEPVSAYFFLMHLSLRGIVCGALIQNHKN*FLVEM*LLMR 778
>AW736233 GP|21105451|gb small nuclear ribonucleoprotein D1 {Danio rerio},
partial (14%)
Length = 510
Score = 40.8 bits (94), Expect = 0.003
Identities = 27/100 (27%), Positives = 40/100 (40%)
Frame = +1
Query: 198 EGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKDNGGG 257
E N RGR +G+G+G +G N + +C IC H C R D
Sbjct: 214 EYYNAGRGRGRGRGRGRG---------RGRSNSNRLQCQICARNNHDAARCCFRYDQASS 366
Query: 258 ESSVQIASKDEGYESAGALTVTSWEPEKSWVLDSGCSYHI 297
+ A E Y ++ + + W P DSG S+H+
Sbjct: 367 SQAHHRAPPSE-YAASSSYSEAPWYP------DSGASHHL 465
>AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing factor
[imported] - Arabidopsis thaliana, partial (62%)
Length = 508
Score = 35.0 bits (79), Expect = 0.19
Identities = 17/56 (30%), Positives = 21/56 (37%), Gaps = 6/56 (10%)
Frame = +2
Query: 207 SHNKGKGKGKNSRSKS------RSKGDGNKTQYKCFICHNPGHFKKDCPERKDNGG 256
SHN G G R R + G + KC+ C PGHF + C G
Sbjct: 242 SHNSKTGGGGGGRGGGGGGGGGRGRSGGGGSDLKCYXCGEPGHFARXCNSSPGGSG 409
>TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (16%)
Length = 421
Score = 34.3 bits (77), Expect = 0.32
Identities = 22/77 (28%), Positives = 33/77 (42%), Gaps = 10/77 (12%)
Frame = +3
Query: 183 KELTKFKELKVDDSGEGLNISRGRSHNKGKG-KGKNSRSKSRSKGD---------GNKTQ 232
KE+ K K LK+ + SR RS ++ + + + RS S D G
Sbjct: 162 KEVKKEKNLKMSSDSRSRSRSRSRSRSRSRSPRIRKIRSDRHSYRDAPYRRDSSRGFSRD 341
Query: 233 YKCFICHNPGHFKKDCP 249
C C PGH+ ++CP
Sbjct: 342 NLCKNCKRPGHYARECP 392
>TC84935 similar to PIR|G96631|G96631 probable RNA-binding protein F8A5.17
[imported] - Arabidopsis thaliana, partial (41%)
Length = 552
Score = 33.5 bits (75), Expect = 0.55
Identities = 17/44 (38%), Positives = 23/44 (51%), Gaps = 1/44 (2%)
Frame = +2
Query: 214 KGKNSRSKSRSKGDGNKT-QYKCFICHNPGHFKKDCPERKDNGG 256
+G S S G G++ Q CF C PGH+ +DCP +GG
Sbjct: 407 RGGFSSGGRGSYGAGDRVGQDDCFKCGRPGHWARDCPLAGGDGG 538
>TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-1 {Triticum
aestivum}, partial (39%)
Length = 630
Score = 33.5 bits (75), Expect = 0.55
Identities = 20/66 (30%), Positives = 29/66 (43%), Gaps = 5/66 (7%)
Frame = +3
Query: 197 GEGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCP-----ER 251
GE L + + ++ G G G+ R R G G C+ C + GH +DC +R
Sbjct: 258 GEPLQVRQ--DNHGGGGGGRGFRGGERRNGGGG-----CYTCGDTGHIARDCDRSDRNDR 416
Query: 252 KDNGGG 257
D GG
Sbjct: 417 NDRSGG 434
Score = 30.4 bits (67), Expect = 4.7
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 3/45 (6%)
Frame = +3
Query: 216 KNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPE---RKDNGGG 257
+N R+ G G C+ C + HF +DC +NGGG
Sbjct: 405 RNDRNDRSGGGGGGDRDRACYTCGSFEHFARDCMRGGGNNNNGGG 539
>TC81230
Length = 958
Score = 32.7 bits (73), Expect = 0.94
Identities = 26/123 (21%), Positives = 52/123 (42%), Gaps = 12/123 (9%)
Frame = +1
Query: 74 EVSREATAVSMWNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTEFNKIIDDLA 133
+ R A +W+ L Y L+H+ L + L + +P+ E L + I + L
Sbjct: 454 QFGRFENAKEVWDHLKQRYTISDLSHQYQLLKDLSNLKQQSGQPVYEFLAQMEVIWNQLT 633
Query: 134 NIDVNLEDED------------KVLHLLCALPRSFENFKDTMLYGKEGTITLEEVQAALR 181
+ + +L+D +++ L AL +E + + L+ + TLE L+
Sbjct: 634 SCEPSLKDATDMKTYETHRNRVRLIQFLMALTDEYEPVRASSLH-QNPLPTLENALPCLK 810
Query: 182 TKE 184
++E
Sbjct: 811 SEE 819
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 32.0 bits (71), Expect = 1.6
Identities = 14/42 (33%), Positives = 19/42 (44%)
Frame = +1
Query: 207 SHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDC 248
SHN G G + +G KC+ C PGHF ++C
Sbjct: 244 SHNSKSGGGGG---RGGGRGGRGGDDLKCYECGEPGHFAREC 360
>CB893680 weakly similar to GP|1167523|db ORF(AA 1-1338) {Nicotiana tabacum},
partial (7%)
Length = 780
Score = 32.0 bits (71), Expect = 1.6
Identities = 24/57 (42%), Positives = 30/57 (52%), Gaps = 1/57 (1%)
Frame = -3
Query: 888 LLTCIILS*NKWM*RPHFYMESWKKLSICNN*KVL*KTIQK-CVC*RNLCMG*SKVQ 943
LL I N WM*R HF++E+ ++ C N K K K *R CM *+KVQ
Sbjct: 508 LLPLKIFILNSWM*RLHFFVET*LRIYTCTNLKDSHKKWGKWWEN*RRACMD*NKVQ 338
>TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA helicase
{Oryza sativa}, partial (7%)
Length = 624
Score = 31.6 bits (70), Expect = 2.1
Identities = 22/71 (30%), Positives = 28/71 (38%), Gaps = 12/71 (16%)
Frame = +1
Query: 194 DDSGEGLNISRGRSHNKGKGKGKNSRSK-----------SRSKGDGNKT-QYKCFICHNP 241
DD G+ GRS+ G K RS SRS N++ CF C
Sbjct: 10 DDFGDSSRRG-GRSYKSGNSWSKPERSSRDDWLIGGRQSSRSSSSPNRSFAGTCFTCGES 186
Query: 242 GHFKKDCPERK 252
GH DCP ++
Sbjct: 187 GHRASDCPNKR 219
>TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helicase {Oryza
sativa}, partial (3%)
Length = 737
Score = 31.6 bits (70), Expect = 2.1
Identities = 14/42 (33%), Positives = 19/42 (44%)
Frame = +3
Query: 212 KGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKD 253
+ G +S ++S S CF C PGH DCP +D
Sbjct: 78 RSSGYSSSNRSSSPNRRGSYGGACFSCGQPGHRASDCPN*RD 203
>TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.150 -
Arabidopsis thaliana, partial (17%)
Length = 378
Score = 30.4 bits (67), Expect = 4.7
Identities = 16/53 (30%), Positives = 23/53 (43%)
Frame = +1
Query: 197 GEGLNISRGRSHNKGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCP 249
GE +++R + G G G+ G G C+ C GHF +DCP
Sbjct: 28 GESGHMARECTSGGGGGGGRYGGGGGGGGGGGGGGS--CYSCGESGHFARDCP 180
>TC88667 similar to GP|6728982|gb|AAF26980.1| unknown protein {Arabidopsis
thaliana}, partial (79%)
Length = 1018
Score = 30.0 bits (66), Expect = 6.1
Identities = 13/28 (46%), Positives = 20/28 (71%)
Frame = -1
Query: 137 VNLEDEDKVLHLLCALPRSFENFKDTML 164
+NLE +DK +H + AL RS ++ + TML
Sbjct: 946 LNLEPKDKRMHAIAALKRSLKSLRITML 863
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.357 0.158 0.567
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 42,712,371
Number of Sequences: 36976
Number of extensions: 615104
Number of successful extensions: 6364
Number of sequences better than 10.0: 42
Number of HSP's better than 10.0 without gapping: 1820
Number of HSP's successfully gapped in prelim test: 283
Number of HSP's that attempted gapping in prelim test: 4406
Number of HSP's gapped (non-prelim): 2350
length of query: 1304
length of database: 9,014,727
effective HSP length: 108
effective length of query: 1196
effective length of database: 5,021,319
effective search space: 6005497524
effective search space used: 6005497524
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 64 (29.3 bits)
Medicago: description of AC146709.5