
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144730.2 - phase: 0 /pseudo
(866 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BF635063 weakly similar to PIR|F84486|F84 probable retroelement ... 276 2e-74
BG646342 weakly similar to PIR|F84486|F84 probable retroelement ... 117 2e-26
BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x pa... 52 1e-06
AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {V... 49 1e-05
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 48 1e-05
BQ122739 40 0.005
TC93065 37 0.025
AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing fact... 37 0.033
TC84935 similar to PIR|G96631|G96631 probable RNA-binding protei... 37 0.033
AW736233 GP|21105451|gb small nuclear ribonucleoprotein D1 {Dani... 37 0.033
TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 36 0.073
TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-... 35 0.095
TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.... 33 0.36
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p... 32 0.80
TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA... 32 1.0
CA859315 homologue to GP|23504972|em hypothetical protein {Plasm... 32 1.4
TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 31 2.3
BQ151242 similar to PIR|G86292|G86 hypothetical protein AAF82153... 31 2.3
TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helica... 30 4.0
BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10.... 29 6.8
>BF635063 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 677
Score = 276 bits (706), Expect = 2e-74
Identities = 143/183 (78%), Positives = 159/183 (86%)
Frame = -2
Query: 1 KWDIEKFTGSNDFGLWKVKMRAILIQQKCVEALKGEAQMDVHLTPAEKTEMNDKAVSAII 60
K DIEKFTG NDFGLWKVKM A+LIQQKC +ALKGE + V ++ AEKTEM DKA SAI+
Sbjct: 559 KRDIEKFTGDNDFGLWKVKMEAVLIQQKCEKALKGEVSLPVTMSRAEKTEMVDKARSAIV 380
Query: 61 LCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPIMEQLTE 120
LCLGDKVLREV++E TA SM KL SLYMTKSLAHRQ LKQQLY +RMVESK IMEQLTE
Sbjct: 379 LCLGDKVLREVAKERTAASMWAKL*SLYMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTE 200
Query: 121 FNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQAALRTKE 180
FNKI+DDL NI+V LEDE+KA+ L CALP+SFE+FKDTMLYGK GT+TLEEVQAALRTKE
Sbjct: 199 FNKILDDLENIEVQLEDEEKAILLLCALPKSFESFKDTMLYGKEGTVTLEEVQAALRTKE 20
Query: 181 LTK 183
LTK
Sbjct: 19 LTK 11
>BG646342 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 599
Score = 117 bits (292), Expect = 2e-26
Identities = 65/94 (69%), Positives = 74/94 (78%)
Frame = +2
Query: 55 AVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKPI 114
A SAI+LCLGDKVLREV++E TA SM KL+ LYMTKSLAHRQ LKQQLY ++MVESK I
Sbjct: 2 ARSAIVLCLGDKVLREVAKEPTATSMCAKLEYLYMTKSLAHRQFLKQQLYSFKMVESKAI 181
Query: 115 MEQLTEFNKIIDDLANIDVNLEDEDKALHLPCAL 148
E L EFNKII DL NI+V+LED AL + C L
Sbjct: 182 TELLVEFNKIIGDLENIEVHLEDAG-ALMVWCCL 280
>BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x paradisi},
partial (3%)
Length = 658
Score = 52.0 bits (123), Expect = 1e-06
Identities = 35/127 (27%), Positives = 66/127 (51%), Gaps = 7/127 (5%)
Frame = +2
Query: 61 LCLG-------DKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQCLKQQLYFYRMVESKP 113
+CLG D + +A + +KL++ YM + ++ L Y+MV++K
Sbjct: 197 ICLGHILNGMSDSLFDIYQSSPSAKDLWDKLETRYMREDATSKKFLVSHFNNYKMVDNKS 376
Query: 114 IMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDTMLYGK*GTITLEEVQ 173
+MEQL E +I+++ ++N+++ + LP S+++FK TM + K I+LE++
Sbjct: 377 VMEQLYEIERILNNYKQHNMNMDETIIVSSIIDKLPPSWKDFKRTMKHKK-EDISLEQLG 553
Query: 174 AALRTKE 180
LR E
Sbjct: 554 NHLRLXE 574
>AJ499215 weakly similar to GP|6642775|gb| gag-pol polyprotein {Vitis
vinifera}, partial (18%)
Length = 567
Score = 48.5 bits (114), Expect = 1e-05
Identities = 26/91 (28%), Positives = 48/91 (52%)
Frame = +3
Query: 273 TVTSWEPEKGWVLDSGCSYHICPRKEYFEMLELEEGGVVCLGNNKACKIQVIGTIRLKMF 332
T + +P K W++DSGC++H+ ++ F+ L V + N +++ IGT+ +K
Sbjct: 78 TFATKQPSKYWLIDSGCTHHMTHDRDLFKELNKSTISKVRMLNGAHIEVEGIGTVLVKSH 257
Query: 333 DDRDFLLKDVRYIPKLRRNLISISMFDGLGY 363
+ +V Y PKL ++L+S+ GY
Sbjct: 258 SGYK-QISNVLYAPKLNQSLLSVPQLLTKGY 347
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 48.1 bits (113), Expect = 1e-05
Identities = 21/63 (33%), Positives = 27/63 (42%)
Frame = +3
Query: 191 DSGEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKGN 250
D G V + G G G + R G G + KC+ C PGHF ++C R G
Sbjct: 216 DGKNGWRVELSHNSRSGGGGGGGGGGRGRGGGGGGGSDLKCYECGEPGHFARECRNRGGG 395
Query: 251 GGG 253
G G
Sbjct: 396 GAG 404
>BQ122739
Length = 575
Score = 39.7 bits (91), Expect = 0.005
Identities = 25/75 (33%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Frame = +1
Query: 355 ISMFDGLGYCTRIERGVMRISHGALIIAKGSKIHGLYILEGSTVIADASVASVD-TLDVT 413
I + D GY IE G +RI++ ++++ KG +GL +L G T + A + L +
Sbjct: 214 ILLLDDQGYIFNIEDGDLRITNDSMVLMKGKLENGLSLL*GRTSMDTADAIYIRCNLVSS 393
Query: 414 KLWHLRLGHVSERGI 428
+ R+GHVS+ GI
Sbjct: 394 RSSAYRMGHVSKGGI 438
>TC93065
Length = 783
Score = 37.4 bits (85), Expect = 0.025
Identities = 35/159 (22%), Positives = 72/159 (45%), Gaps = 10/159 (6%)
Frame = +2
Query: 99 LKQQLYFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCALPRSFENFKDT 158
L+++ +M E++ + E +K++ + + +L D+ + LP FE +
Sbjct: 215 LRREFEALKMKETETVREFSDRISKVVTQIRLLGEDLSDQRVVEKILVCLPEMFEAKISS 394
Query: 159 MLYGK*-GTITLEEVQAALRTKELTKFKELKVEDSGEGLNVSRERSQNRG-KGKGKNSRS 216
+ K IT+ E+ AL+ E + + L++E++ EG ++ + +N+ K GK
Sbjct: 395 LEENKNFSEITVAELVNALQASE--QRRSLRMEENVEGAFLANNKGKNQSFKSFGKKKFP 568
Query: 217 KS-RSKGDGNKTQY-------KCFICHNPGHFKKDCPER 247
K D + ++ KC C+ GH +K C +
Sbjct: 569 PCPHCKKDTHLDKFCWYRPGVKCRACNQLGHVEKVCKNK 685
>AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing factor
[imported] - Arabidopsis thaliana, partial (62%)
Length = 508
Score = 37.0 bits (84), Expect = 0.033
Identities = 25/86 (29%), Positives = 31/86 (35%), Gaps = 6/86 (6%)
Frame = +2
Query: 177 RTKELTKFKELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKS----RSKGDGNKTQYKCF 232
R L +EL D G V + G G G R + G + KC+
Sbjct: 179 RRDALDAIREL---DGKNGWRVELSHNSKTGGGGGGRGGGGGGGGGRGRSGGGGSDLKCY 349
Query: 233 ICHNPGHFKKDCPERKGNGGG--NPS 256
C PGHF + C G G NPS
Sbjct: 350 XCGEPGHFARXCNSSPGGSGXRRNPS 427
>TC84935 similar to PIR|G96631|G96631 probable RNA-binding protein F8A5.17
[imported] - Arabidopsis thaliana, partial (41%)
Length = 552
Score = 37.0 bits (84), Expect = 0.033
Identities = 24/76 (31%), Positives = 38/76 (49%), Gaps = 8/76 (10%)
Frame = +2
Query: 185 KELKVEDSGEGLNVSRERSQNRGKGKGKNSRSKSR-------SKGDGNKT-QYKCFICHN 236
KE+ + G+ + +S ++Q R G + R + S G G++ Q CF C
Sbjct: 314 KEMDGREIGDRI-ISVNKAQPRMGGDDADQRYRGGFSSGGRGSYGAGDRVGQDDCFKCGR 490
Query: 237 PGHFKKDCPERKGNGG 252
PGH+ +DCP G+GG
Sbjct: 491 PGHWARDCPLAGGDGG 538
>AW736233 GP|21105451|gb small nuclear ribonucleoprotein D1 {Danio rerio},
partial (14%)
Length = 510
Score = 37.0 bits (84), Expect = 0.033
Identities = 34/129 (26%), Positives = 53/129 (40%), Gaps = 2/129 (1%)
Frame = +1
Query: 167 ITLEEVQAALRTKEL-TKFKELKVEDSG-EGLNVSRERSQNRGKGKGKNSRSKSRSKGDG 224
+T V A L EL +K + ++ G E N R R + RG+G+G +G
Sbjct: 127 LTNPSVSANLAQSELPSKPGNSESQEVGTEYYNAGRGRGRGRGRGRG---------RGRS 279
Query: 225 NKTQYKCFICHNPGHFKKDCPERKGNGGGNPSVQIASNEEGYESAGALTVTSWEPEKGWV 284
N + +C IC H C R + + A E Y ++ + + W P
Sbjct: 280 NSNRLQCQICARNNHDAARCCFRYDQASSSQAHHRAPPSE-YAASSSYSEAPWYP----- 441
Query: 285 LDSGCSYHI 293
DSG S+H+
Sbjct: 442 -DSGASHHL 465
>TC83119 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (16%)
Length = 421
Score = 35.8 bits (81), Expect = 0.073
Identities = 23/77 (29%), Positives = 34/77 (43%), Gaps = 10/77 (12%)
Frame = +3
Query: 179 KELTKFKELKVEDSGEGLNVSRERSQNRGKG-KGKNSRSKSRSKGD---------GNKTQ 228
KE+ K K LK+ + SR RS++R + + + RS S D G
Sbjct: 162 KEVKKEKNLKMSSDSRSRSRSRSRSRSRSRSPRIRKIRSDRHSYRDAPYRRDSSRGFSRD 341
Query: 229 YKCFICHNPGHFKKDCP 245
C C PGH+ ++CP
Sbjct: 342 NLCKNCKRPGHYARECP 392
>TC81207 similar to GP|21322752|dbj|BAB78536. cold shock protein-1 {Triticum
aestivum}, partial (39%)
Length = 630
Score = 35.4 bits (80), Expect = 0.095
Identities = 22/69 (31%), Positives = 30/69 (42%), Gaps = 8/69 (11%)
Frame = +3
Query: 193 GEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCP------- 245
GE L V ++ + G G G+ R R G G C+ C + GH +DC
Sbjct: 258 GEPLQVRQDN--HGGGGGGRGFRGGERRNGGGG-----CYTCGDTGHIARDCDRSDRNDR 416
Query: 246 -ERKGNGGG 253
+R G GGG
Sbjct: 417 NDRSGGGGG 443
Score = 30.8 bits (68), Expect = 2.3
Identities = 15/45 (33%), Positives = 20/45 (44%), Gaps = 3/45 (6%)
Frame = +3
Query: 212 KNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCPERKG---NGGG 253
+N R+ G G C+ C + HF +DC G NGGG
Sbjct: 405 RNDRNDRSGGGGGGDRDRACYTCGSFEHFARDCMRGGGNNNNGGG 539
>TC89725 similar to PIR|T05494|T05494 glycine-rich protein T19K4.150 -
Arabidopsis thaliana, partial (17%)
Length = 378
Score = 33.5 bits (75), Expect = 0.36
Identities = 17/53 (32%), Positives = 24/53 (45%)
Frame = +1
Query: 193 GEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCP 245
GE +++RE + G G G+ G G C+ C GHF +DCP
Sbjct: 28 GESGHMARECTSGGGGGGGRYGGGGGGGGGGGGGGS--CYSCGESGHFARDCP 180
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 32.3 bits (72), Expect = 0.80
Identities = 16/51 (31%), Positives = 23/51 (44%)
Frame = +1
Query: 194 EGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDC 244
+G N R + + K G R R G+ KC+ C PGHF ++C
Sbjct: 214 DGKNGWRVQLSHNSKSGGGGGRGGGRGGRGGD--DLKCYECGEPGHFAREC 360
>TC83030 weakly similar to GP|18855061|gb|AAL79753.1 putative RNA helicase
{Oryza sativa}, partial (7%)
Length = 624
Score = 32.0 bits (71), Expect = 1.0
Identities = 14/37 (37%), Positives = 19/37 (50%), Gaps = 1/37 (2%)
Frame = +1
Query: 215 RSKSRSKGDGNKT-QYKCFICHNPGHFKKDCPERKGN 250
R SRS N++ CF C GH DCP ++G+
Sbjct: 115 RQSSRSSSSPNRSFAGTCFTCGESGHRASDCPNKRGD 225
>CA859315 homologue to GP|23504972|em hypothetical protein {Plasmodium
falciparum 3D7}, partial (1%)
Length = 643
Score = 31.6 bits (70), Expect = 1.4
Identities = 28/120 (23%), Positives = 50/120 (41%), Gaps = 10/120 (8%)
Frame = +1
Query: 39 MDVHLTPAEKTEMNDKAVSAIILCLGDKVLREVSRESTAVSMRNKLDSLYMTKSLAHRQC 98
++V+ +K + ND+ A+IL K T S N+LD+L+ Q
Sbjct: 274 INVNEMAKKKKKENDEKKKAVILAYRAKAQALAVLSDTTTSSSNELDTLFEKTWRELAQW 453
Query: 99 LKQ----------QLYFYRMVESKPIMEQLTEFNKIIDDLANIDVNLEDEDKALHLPCAL 148
L Q+Y R ++ L NK +++L + +++ +KAL+L L
Sbjct: 454 LDSIPPTSDFKNLQIYIIRERKAHRFGNALKALNKYLNELGLTNDTIKEYEKALNLKSKL 633
>TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (96%)
Length = 1286
Score = 30.8 bits (68), Expect = 2.3
Identities = 19/68 (27%), Positives = 27/68 (38%), Gaps = 19/68 (27%)
Frame = +1
Query: 197 NVSRERSQNRGKGKGKNSRSKS-------------------RSKGDGNKTQYKCFICHNP 237
+VSR RS++R + SRS+S R G C C P
Sbjct: 265 SVSRSRSKSRSRSPMDRSRSRSPVDRRIRSERFSHREAPYRRDSRRGFSQDNLCKNCKRP 444
Query: 238 GHFKKDCP 245
GH+ ++CP
Sbjct: 445 GHYVRECP 468
>BQ151242 similar to PIR|G86292|G86 hypothetical protein AAF82153.1
[imported] - Arabidopsis thaliana, partial (5%)
Length = 1371
Score = 30.8 bits (68), Expect = 2.3
Identities = 12/36 (33%), Positives = 21/36 (58%)
Frame = +1
Query: 193 GEGLNVSRERSQNRGKGKGKNSRSKSRSKGDGNKTQ 228
GEG E+ + G+G G+ ++ K + G G+KT+
Sbjct: 184 GEGRRTKGEKKRGGGRGGGREAQGKKKGGGRGSKTR 291
>TC89153 similar to GP|18855061|gb|AAL79753.1 putative RNA helicase {Oryza
sativa}, partial (3%)
Length = 737
Score = 30.0 bits (66), Expect = 4.0
Identities = 13/38 (34%), Positives = 17/38 (44%)
Frame = +3
Query: 208 KGKGKNSRSKSRSKGDGNKTQYKCFICHNPGHFKKDCP 245
+ G +S ++S S CF C PGH DCP
Sbjct: 78 RSSGYSSSNRSSSPNRRGSYGGACFSCGQPGHRASDCP 191
>BG645355 similar to PIR|G96590|G965 hypothetical protein T24C10.5 [imported]
- Arabidopsis thaliana, partial (5%)
Length = 627
Score = 29.3 bits (64), Expect = 6.8
Identities = 13/43 (30%), Positives = 18/43 (41%), Gaps = 6/43 (13%)
Frame = -1
Query: 230 KCFICHNPGHFKKDCPER------KGNGGGNPSVQIASNEEGY 266
KC+ C PGH+ +CP G GG N+ G+
Sbjct: 504 KCYKCQQPGHWASNCPSMSAANRVSGGSGGASGNCYKCNQPGH 376
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.344 0.151 0.505
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 27,070,410
Number of Sequences: 36976
Number of extensions: 375520
Number of successful extensions: 3327
Number of sequences better than 10.0: 50
Number of HSP's better than 10.0 without gapping: 1375
Number of HSP's successfully gapped in prelim test: 161
Number of HSP's that attempted gapping in prelim test: 1875
Number of HSP's gapped (non-prelim): 1639
length of query: 866
length of database: 9,014,727
effective HSP length: 104
effective length of query: 762
effective length of database: 5,169,223
effective search space: 3938947926
effective search space used: 3938947926
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.6 bits)
S2: 63 (28.9 bits)
Medicago: description of AC144730.2