
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC134822.5 - phase: 0
(195 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BF635063 weakly similar to PIR|F84486|F84 probable retroelement ... 162 8e-41
BG646342 weakly similar to PIR|F84486|F84 probable retroelement ... 82 2e-16
BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x pa... 49 2e-06
AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing fact... 35 0.018
TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR... 35 0.018
BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR p... 34 0.041
TC78550 weakly similar to GP|21689669|gb|AAM67456.1 unknown prot... 32 0.12
TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finge... 31 0.27
AW776282 similar to PIR|T12180|T12 probable transcription factor... 31 0.35
TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulch... 31 0.35
TC87196 similar to PIR|C85041|C85041 probable DNA-binding protei... 30 0.59
TC81696 weakly similar to GP|18377708|gb|AAL67004.1 unknown prot... 30 0.59
TC87639 GP|9663153|emb|CAC01132.1 transport-secretion protein 2.... 30 0.59
TC90683 similar to GP|22136732|gb|AAM91685.1 unknown protein {Ar... 30 0.78
TC84935 similar to PIR|G96631|G96631 probable RNA-binding protei... 29 1.0
TC81230 29 1.3
TC82883 28 2.3
TC92733 similar to PIR|H84731|H84731 hypothetical protein At2g32... 28 2.3
BF633350 similar to GP|17473640|gb| unknown protein {Arabidopsis... 28 3.0
TC88777 similar to GP|15081626|gb|AAK82468.1 AT5g26830/F2P16_90 ... 28 3.0
>BF635063 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 677
Score = 162 bits (410), Expect = 8e-41
Identities = 79/99 (79%), Positives = 90/99 (90%)
Frame = -2
Query: 7 YKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCS 66
Y TKSLAHRQ LKQQLYSF+M+ESK+I EQL EFNKILDDL NIEV +EDE+KA+LLLC+
Sbjct: 298 YMTKSLAHRQFLKQQLYSFRMVESKAIMEQLTEFNKILDDLENIEVQLEDEEKAILLLCA 119
Query: 67 LPKSFEHFKDTILYGKEGTTTLEEIQSALRTKKLTKSKD 105
LPKSFE FKDT+LYGKEGT TLEE+Q+ALRTK+LTKS D
Sbjct: 118 LPKSFESFKDTMLYGKEGTVTLEEVQAALRTKELTKSND 2
>BG646342 weakly similar to PIR|F84486|F84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(4%)
Length = 599
Score = 81.6 bits (200), Expect = 2e-16
Identities = 42/61 (68%), Positives = 50/61 (81%)
Frame = +2
Query: 7 YKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCS 66
Y TKSLAHRQ LKQQLYSFKM+ESK+I+E L EFNKI+ DL NIEV++ED AL++ C
Sbjct: 101 YMTKSLAHRQFLKQQLYSFKMVESKAITELLVEFNKIIGDLENIEVHLEDAG-ALMVWCC 277
Query: 67 L 67
L
Sbjct: 278 L 280
Score = 35.4 bits (80), Expect = 0.014
Identities = 18/26 (69%), Positives = 21/26 (80%), Gaps = 1/26 (3%)
Frame = +2
Query: 170 EDAGALVV-SCWEDDEGEVSHLDIDA 194
EDAGAL+V C ED+EG+VSHL DA
Sbjct: 245 EDAGALMVWCCLEDEEGDVSHLGNDA 322
>BF648150 similar to GP|14586969|gb| pol polyprotein {Citrus x paradisi},
partial (3%)
Length = 658
Score = 48.5 bits (114), Expect = 2e-06
Identities = 27/95 (28%), Positives = 54/95 (56%)
Frame = +2
Query: 2 KVLQKYKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMEDEDKAL 61
K+ +Y + ++ L ++KM+++KS+ EQL E +IL++ +NM++
Sbjct: 284 KLETRYMREDATSKKFLVSHFNNYKMVDNKSVMEQLYEIERILNNYKQHNMNMDETIIVS 463
Query: 62 LLLCSLPKSFEHFKDTILYGKEGTTTLEEIQSALR 96
++ LP S++ FK T+ + KE +LE++ + LR
Sbjct: 464 SIIDKLPPSWKDFKRTMKHKKE-DISLEQLGNHLR 565
>AJ388976 similar to PIR|E84638|E84 probable RSZp22 splicing factor
[imported] - Arabidopsis thaliana, partial (62%)
Length = 508
Score = 35.0 bits (79), Expect = 0.018
Identities = 16/33 (48%), Positives = 19/33 (57%)
Frame = +2
Query: 120 GNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKR 152
G GGG G RG S G + KC+ C + GHF R
Sbjct: 284 GGGGGGGGRGRSGGGGSD-LKCYXCGEPGHFAR 379
>TC87868 similar to PIR|T05112|T05112 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (91%)
Length = 860
Score = 35.0 bits (79), Expect = 0.018
Identities = 20/64 (31%), Positives = 29/64 (45%)
Frame = +3
Query: 90 EIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGH 149
+ Q A+R + + NS G GGGRG G G+ KC++C + GH
Sbjct: 189 DAQDAIRDLDGKNGWRVELSHNSRSGGGGGGGGGGRGRGGGGGGGSD--LKCYECGEPGH 362
Query: 150 FKRD 153
F R+
Sbjct: 363 FARE 374
>BG450974 similar to PIR|T05112|T05 splicing factor 9G8-like SR protein
RSZp22 [validated] - Arabidopsis thaliana, partial (54%)
Length = 364
Score = 33.9 bits (76), Expect = 0.041
Identities = 15/34 (44%), Positives = 20/34 (58%)
Frame = +1
Query: 120 GNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
G GGGRG RG + KC++C + GHF R+
Sbjct: 274 GRGGGRGGRGG------DDLKCYECGEPGHFARE 357
>TC78550 weakly similar to GP|21689669|gb|AAM67456.1 unknown protein
{Arabidopsis thaliana}, partial (69%)
Length = 1486
Score = 32.3 bits (72), Expect = 0.12
Identities = 25/78 (32%), Positives = 40/78 (51%), Gaps = 6/78 (7%)
Frame = +1
Query: 5 QKYKTKSLAHRQLLKQQLYSFKMLESKSISEQL-----AEFNKI-LDDLANIEVNMEDED 58
+K K + +H++L +L + LE K I E+ AE KI + DL N ++ED
Sbjct: 277 EKLKESAQSHQKLSPSELQK-RQLEIKEIMEKTKMLSDAELMKIAIKDLNNASTSLEDRY 453
Query: 59 KALLLLCSLPKSFEHFKD 76
+ALL L L + ++ D
Sbjct: 454 RALLELLELVEPLDNAND 507
>TC88387 similar to GP|13357253|gb|AAK20050.1 putative zinc finger protein
{Oryza sativa (japonica cultivar-group)}, partial (96%)
Length = 1286
Score = 31.2 bits (69), Expect = 0.27
Identities = 13/32 (40%), Positives = 15/32 (46%)
Frame = +1
Query: 122 GGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
GGG RG + G C C +FGH RD
Sbjct: 856 GGGGSLRGGYRDGGFRDVVCRSCQQFGHMSRD 951
>AW776282 similar to PIR|T12180|T12 probable transcription factor - fava
bean, partial (48%)
Length = 688
Score = 30.8 bits (68), Expect = 0.35
Identities = 29/109 (26%), Positives = 45/109 (40%), Gaps = 3/109 (2%)
Frame = +2
Query: 29 ESKSISEQLAEFNKILDDLANIEVNMEDEDKALLLLCSLPKSFEHFKDTILYGKEGTTTL 88
E ++ +AE NK D AN E EDK + L E ++ + ++ TL
Sbjct: 164 EKPAVEADVAEGNK--DSPANEAEEKEPEDKEMTL--------EEYEKVLEEKRKALQTL 313
Query: 89 EEIQSALRTKKLTKSKDLRA---NENSEGLCVSRGNGGGRGNRGSSKSG 134
+ ++ + +K KS + E S G G GRG RG+ G
Sbjct: 314 KTEEAYDKEEKAKKSVSINEFLKPAEGESHYNSXGRGRGRGGRGARGGG 460
>TC87382 similar to EGAD|146423|156195 vitellogenin {Anolis pulchellus},
partial (7%)
Length = 2304
Score = 30.8 bits (68), Expect = 0.35
Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 6/65 (9%)
Frame = +2
Query: 125 RGNRGSSKSGNKERYKCFKCHKFGHFKRD------FSEDNENFAQVVSEEYEDAGALVVS 178
R R + K N KCF C +GH D F+ NE + EE ED V
Sbjct: 998 RPQRNTIKERNTNIPKCFICQGYGHIALDCVNQKVFTIVNEEINNIFEEERED----VYE 1165
Query: 179 CWEDD 183
+ED+
Sbjct: 1166 SFEDE 1180
>TC87196 similar to PIR|C85041|C85041 probable DNA-binding protein
[imported] - Arabidopsis thaliana, partial (32%)
Length = 2258
Score = 30.0 bits (66), Expect = 0.59
Identities = 21/64 (32%), Positives = 34/64 (52%), Gaps = 3/64 (4%)
Frame = +2
Query: 95 LRTKKLTKSKDLRAN---ENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFK 151
L+ K+L K ++ AN + S G+ RG+G G+ + GSSK G+ + + G +
Sbjct: 470 LKDKRLLK-EEANANGRQDRSSGVIQDRGSGLGQDSCGSSKHGDYKYLDPKEVESNGLYN 646
Query: 152 RDFS 155
RD S
Sbjct: 647 RDLS 658
>TC81696 weakly similar to GP|18377708|gb|AAL67004.1 unknown protein
{Arabidopsis thaliana}, partial (21%)
Length = 715
Score = 30.0 bits (66), Expect = 0.59
Identities = 22/61 (36%), Positives = 27/61 (44%)
Frame = +1
Query: 101 TKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRDFSEDNEN 160
TKS R + SEGL SR NG G +G + + H+F H R E EN
Sbjct: 142 TKSAKARKVQFSEGLFESRSNGPTSGGKGDKVANGGKSSAAKDPHQFEH--RVDQELPEN 315
Query: 161 F 161
F
Sbjct: 316 F 318
>TC87639 GP|9663153|emb|CAC01132.1 transport-secretion protein 2.2 (TTS-2.2)
{Homo sapiens}, partial (2%)
Length = 1522
Score = 30.0 bits (66), Expect = 0.59
Identities = 30/152 (19%), Positives = 61/152 (39%), Gaps = 19/152 (12%)
Frame = -1
Query: 21 QLYSFKMLESKSISEQLAEFNKILDD-----------LANIEVNMEDEDKALLLLCSLPK 69
QL S + +++ ++ L F K+L D ++ +E + K LL+ LP
Sbjct: 724 QLQSLRQKDTERLATFLPRFEKVLADAGGYSWPDVVQISLLETALVPRLKELLITVELPT 545
Query: 70 SFEHFKDTILYGKEGTTTLEEIQSA----LRTKKLTKSKDLRANENSEGLCVSRGNGGGR 125
+ + + ++ +E +++ +L SKD + G + G
Sbjct: 544 VYSQWLSKV---QDIAWKMERMKTPPTRWAPATRLPVSKDRDGDMMMTGAIHKQRRRRGS 374
Query: 126 GNRGSSKSG----NKERYKCFKCHKFGHFKRD 153
+ SS G ++ +C+ CH+ GH R+
Sbjct: 373 SSSVSSAEGAPPPRRDMRECYSCHERGHIARN 278
>TC90683 similar to GP|22136732|gb|AAM91685.1 unknown protein {Arabidopsis
thaliana}, partial (19%)
Length = 554
Score = 29.6 bits (65), Expect = 0.78
Identities = 21/53 (39%), Positives = 29/53 (54%), Gaps = 2/53 (3%)
Frame = +3
Query: 92 QSALRTKKLTKSKDLRANENSEGLCVSRGN--GGGRGNRGSSKSGNKERYKCF 142
Q A + K+++ +K L+ L V GN GGG NR S+S KER+K F
Sbjct: 24 QHANQYKRISWAKILQR------LTVQGGNSSGGGDSNRNISRSSVKERFKTF 164
>TC84935 similar to PIR|G96631|G96631 probable RNA-binding protein F8A5.17
[imported] - Arabidopsis thaliana, partial (41%)
Length = 552
Score = 29.3 bits (64), Expect = 1.0
Identities = 13/33 (39%), Positives = 20/33 (60%)
Frame = +2
Query: 121 NGGGRGNRGSSKSGNKERYKCFKCHKFGHFKRD 153
+ GGRG+ G+ ++ CFKC + GH+ RD
Sbjct: 419 SSGGRGSYGAGDRVGQD--DCFKCGRPGHWARD 511
>TC81230
Length = 958
Score = 28.9 bits (63), Expect = 1.3
Identities = 14/52 (26%), Positives = 27/52 (51%)
Frame = +1
Query: 5 QKYKTKSLAHRQLLKQQLYSFKMLESKSISEQLAEFNKILDDLANIEVNMED 56
Q+Y L+H+ L + L + K + + E LA+ I + L + E +++D
Sbjct: 502 QRYTISDLSHQYQLLKDLSNLKQQSGQPVYEFLAQMEVIWNQLTSCEPSLKD 657
>TC82883
Length = 571
Score = 28.1 bits (61), Expect = 2.3
Identities = 20/64 (31%), Positives = 26/64 (40%), Gaps = 3/64 (4%)
Frame = +3
Query: 83 EGTTTLEEIQSALRTKKLTKSKDLRANENSEGLCVSRGNGGGRGNRGSSKSGNKE---RY 139
E T E L K+L K K+ AN + R +GSSK NKE +Y
Sbjct: 60 EDNTMKNEKGPKLHKKRLGKKKNYNANSRKQR----------RSGKGSSKRSNKE*ELKY 209
Query: 140 KCFK 143
C +
Sbjct: 210 NCIQ 221
>TC92733 similar to PIR|H84731|H84731 hypothetical protein At2g32340
[imported] - Arabidopsis thaliana, partial (22%)
Length = 667
Score = 28.1 bits (61), Expect = 2.3
Identities = 15/55 (27%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Frame = +3
Query: 45 DDLANIEVNMEDEDKALLLLCSLPKSFEHFKDTILYGKEG--TTTLEEIQSALRT 97
DDL N+ M+D D L++ + K F +T+++ ++ ++TL ++ S R+
Sbjct: 108 DDLNNMLKEMDDSDMLTLVIQEMSKEFPTLMETLVHERDQYMSSTLLKVASESRS 272
>BF633350 similar to GP|17473640|gb| unknown protein {Arabidopsis thaliana},
partial (15%)
Length = 513
Score = 27.7 bits (60), Expect = 3.0
Identities = 11/17 (64%), Positives = 12/17 (69%)
Frame = -2
Query: 140 KCFKCHKFGHFKRDFSE 156
KCFK H+FGHF SE
Sbjct: 353 KCFKEHQFGHFSGSDSE 303
>TC88777 similar to GP|15081626|gb|AAK82468.1 AT5g26830/F2P16_90
{Arabidopsis thaliana}, partial (24%)
Length = 822
Score = 27.7 bits (60), Expect = 3.0
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 4/57 (7%)
Frame = +2
Query: 26 KMLESKSISEQLAEFNKIL----DDLANIEVNMEDEDKALLLLCSLPKSFEHFKDTI 78
+ ++ K QLA++N IL ++ +V++ DKA + ++ EHFKD +
Sbjct: 359 RKIQKKVREAQLAQYNYILVVGEEEAKTGQVSVRVRDKADHSVMTIENLLEHFKDEV 529
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.313 0.132 0.368
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,365,643
Number of Sequences: 36976
Number of extensions: 64323
Number of successful extensions: 344
Number of sequences better than 10.0: 49
Number of HSP's better than 10.0 without gapping: 339
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 342
length of query: 195
length of database: 9,014,727
effective HSP length: 91
effective length of query: 104
effective length of database: 5,649,911
effective search space: 587590744
effective search space used: 587590744
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (26.2 bits)
Medicago: description of AC134822.5