
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC142096.6 - phase: 0 /pseudo
(930 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AW774707 similar to GP|22795253|gb unknown protein {Oryza sativa... 295 7e-80
AW774803 weakly similar to GP|22795253|gb| unknown protein {Oryz... 236 3e-62
TC77517 similar to PIR|T51159|T51159 HMG protein [imported] - Ar... 37 0.027
TC77438 homologue to PIR|T06807|T06807 nucleosome assembly prote... 36 0.078
TC89852 similar to GP|15620063|gb|AAL03490.1 unknown {Rickettsia... 36 0.078
AL371165 homologue to PIR|D84685|D846 hypothetical protein At2g2... 35 0.13
TC89048 weakly similar to PIR|T46215|T46215 hypothetical protein... 35 0.17
CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect... 33 0.51
TC93070 weakly similar to GP|17065230|gb|AAL32769.1 Unknown prot... 33 0.66
TC84149 weakly similar to GP|21751020|dbj|BAC03887. unnamed prot... 32 0.86
TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~un... 32 1.1
TC83157 similar to GP|21595368|gb|AAM66095.1 unknown {Arabidopsi... 31 1.9
BE248780 similar to GP|5929884|gb|A nucleolin-related protein NR... 31 1.9
TC87692 similar to GP|21537190|gb|AAM61531.1 putative SKP1-like ... 31 1.9
TC86645 similar to PIR|H86321|H86321 hypothetical protein AAF271... 31 2.5
TC90460 similar to GP|15293207|gb|AAK93714.1 unknown protein {Ar... 31 2.5
CA991433 weakly similar to GP|2117310|emb| hypothetical serine-r... 30 3.3
BG646240 similar to GP|18491263|gb| F14N22.7/F14N22.7 {Arabidops... 30 3.3
TC90776 similar to GP|7269000|emb|CAB80733.1 contains EST gb:AI9... 30 3.3
AW257087 homologue to GP|6683126|dbj KIAA0339 protein {Homo sapi... 30 3.3
>AW774707 similar to GP|22795253|gb unknown protein {Oryza sativa (japonica
cultivar-group)}, partial (15%)
Length = 729
Score = 295 bits (754), Expect = 7e-80
Identities = 162/167 (97%), Positives = 164/167 (98%)
Frame = -1
Query: 764 RIYQCL**LEFVSRNIFTNFEITT*DSRTEKYAKCITG*N*RCG*AD*VESG*TSYFEAT 823
RIYQCL**LEFV RNIFTNFEITT*DS TEKYAKCITG*N*RCG*AD*VESG*TSYFEAT
Sbjct: 729 RIYQCL**LEFVPRNIFTNFEITT*DS*TEKYAKCITG*N*RCG*AD*VESG*TSYFEAT 550
Query: 824 TSNAETKASSNQITES*V*GKLR*G*RL*SRPCAS*KKKIEERGKA*S*RGCS*NT*RQL 883
TSNAETKASSNQITES*V*GKL *G*RL*SRPCAS*KKKIEERG+A*S*RGCS*NT*RQL
Sbjct: 549 TSNAETKASSNQITES*V*GKLC*G*RL*SRPCAS*KKKIEERGEA*S*RGCS*NT*RQL 370
Query: 884 LLA*CEGKRKVFNGESKS*EVWKD*SFPSRARTCFQIWTTRKREEEK 930
LLA*CEGKRKVFNGE+KS*EVWKD*SFPSRARTCFQIWTTRKREEEK
Sbjct: 369 LLA*CEGKRKVFNGENKS*EVWKD*SFPSRARTCFQIWTTRKREEEK 229
>AW774803 weakly similar to GP|22795253|gb| unknown protein {Oryza sativa
(japonica cultivar-group)}, partial (7%)
Length = 397
Score = 236 bits (602), Expect = 3e-62
Identities = 120/120 (100%), Positives = 120/120 (100%)
Frame = +3
Query: 1 MAKSSKRSIANGTNTSNKSTSKKKKNNKMGPEAVAMKAKAQKTNTNPFESIWSRRKFEVM 60
MAKSSKRSIANGTNTSNKSTSKKKKNNKMGPEAVAMKAKAQKTNTNPFESIWSRRKFEVM
Sbjct: 36 MAKSSKRSIANGTNTSNKSTSKKKKNNKMGPEAVAMKAKAQKTNTNPFESIWSRRKFEVM 215
Query: 61 GQKRKGDTKRMGLARSLAVEKRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILR 120
GQKRKGDTKRMGLARSLAVEKRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILR
Sbjct: 216 GQKRKGDTKRMGLARSLAVEKRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILR 395
>TC77517 similar to PIR|T51159|T51159 HMG protein [imported] - Arabidopsis
thaliana, partial (70%)
Length = 907
Score = 37.4 bits (85), Expect = 0.027
Identities = 46/178 (25%), Positives = 70/178 (38%), Gaps = 8/178 (4%)
Frame = +3
Query: 12 GTNTSNKSTSKKKKNNKMGPEAVAMKA-KAQKTNTNPFESIWSRRKFEVMGQKRKGDTKR 70
GT +++ K + K+G A K K KT K E +K KR
Sbjct: 138 GTAKTSRDALKPVDDRKVGKRKAAAKPEKVPKT------------KKEKKAKKDPNKPKR 281
Query: 71 MGLARSLAVEKRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILRSQR------- 123
A + +E +KT E + K+ + + GE + L KA ++
Sbjct: 282 PPSAFFVFLEDFRKTFKAE-NPNVKAVSAVGKAGGEKWKSLTKAEKAPYEAKAAKRKVEY 458
Query: 124 ERQLNVKSSKKSKYHLSEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDET*NQEGSDE 181
E+ +N ++K S DDDE E + EDE GEDD + DE + E D+
Sbjct: 459 EKLMNAYNNKPSS-----ADDDEEESDKDNSEVNNEDEASGEDDHQDDEEEDDEDEDD 617
>TC77438 homologue to PIR|T06807|T06807 nucleosome assembly protein 1 -
garden pea, partial (93%)
Length = 1607
Score = 35.8 bits (81), Expect = 0.078
Identities = 35/140 (25%), Positives = 61/140 (43%), Gaps = 19/140 (13%)
Frame = +2
Query: 81 KRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILRSQRERQLNVKSSKKSKY--- 137
K K + K + + F + E+DE +D+ L++Q E+ ++ S+ + K
Sbjct: 803 KNAKPITKTETCESFFNFFNPPEVPEDDEDIDEDMAEELQNQMEQDYDIGSTIRDKIIPH 982
Query: 138 ----------------HLSEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDET*NQEGSDE 181
L +EDDDE + D D+ EDE EDDDE +E + + +
Sbjct: 983 AVSWFTGEAAQGEEFGDLDDEDDDEDD--DAEDDDEDEDEDEDEDDDEDEE---ETKTKK 1147
Query: 182 RSYCKKQVLQG*KSKRKGKR 201
+S KK + ++G+R
Sbjct: 1148KSSAKKSGIAQLGEGQQGER 1207
>TC89852 similar to GP|15620063|gb|AAL03490.1 unknown {Rickettsia conorii},
partial (22%)
Length = 1001
Score = 35.8 bits (81), Expect = 0.078
Identities = 38/144 (26%), Positives = 59/144 (40%), Gaps = 1/144 (0%)
Frame = +1
Query: 4 SSKRSIANGTNTSNKSTSKKKKNNKMGPEAVAMKAKAQKTNTNPFESIWSRRKFEVMGQK 63
S+ + NG T +S KKKKNNK E A++ A N K
Sbjct: 505 STDAKVDNGVETEKRSKHKKKKNNKSNIEGGAIEQNAVTEN------------------K 630
Query: 64 RKGDTK-RMGLARSLAVEKRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILRSQ 122
K D + EKR K +K+ ++S + +IG+ D ++ AI +
Sbjct: 631 VKDDLSIDAKVENGAETEKRSKHKMKKKDKSISEGD-AKEQIGDPDATNEE---AIPEEK 798
Query: 123 RERQLNVKSSKKSKYHLSEEDDDE 146
K SKK K +S+E+D++
Sbjct: 799 -------KDSKKRKRPISKENDEQ 849
>AL371165 homologue to PIR|D84685|D846 hypothetical protein At2g28480
[imported] - Arabidopsis thaliana, partial (11%)
Length = 487
Score = 35.0 bits (79), Expect = 0.13
Identities = 25/73 (34%), Positives = 33/73 (44%), Gaps = 12/73 (16%)
Frame = -1
Query: 121 SQRERQLNVKSSKKSKYHLSE-----------EDDDEFE-GIDGLGRDDFEDEMLGEDDD 168
S +R + K+ S Y+LSE E +D FE G + D ED MLG DDD
Sbjct: 364 SMEKRNHDKKNLDSSSYYLSETESDSSQTELSESEDNFENGNLSMSESDSEDSMLGSDDD 185
Query: 169 ETDET*NQEGSDE 181
+ E + DE
Sbjct: 184 QEREVYLTKMQDE 146
>TC89048 weakly similar to PIR|T46215|T46215 hypothetical protein T8P19.220
- Arabidopsis thaliana, partial (35%)
Length = 1329
Score = 34.7 bits (78), Expect = 0.17
Identities = 20/72 (27%), Positives = 36/72 (49%), Gaps = 5/72 (6%)
Frame = +2
Query: 122 QRERQLNVKSSKKSKYHLSEED-----DDEFEGIDGLGRDDFEDEMLGEDDDETDET*NQ 176
Q E + + K + ++E+D DE + IDG D +DE G D+D+ D+ +
Sbjct: 341 QVENEAKDEPKPKEEVEVTEQDAEVAESDEKKEIDGHEDSDKDDEDEGGDEDDADDDEEE 520
Query: 177 EGSDERSYCKKQ 188
+ +E+ KK+
Sbjct: 521 DAGEEKKGVKKE 556
>CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect virus}
[Heliothis zea virus 1], partial (2%)
Length = 803
Score = 33.1 bits (74), Expect = 0.51
Identities = 20/60 (33%), Positives = 32/60 (53%)
Frame = -1
Query: 129 VKSSKKSKYHLSEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDET*NQEGSDERSYCKKQ 188
V +S K +Y + ++DD+E E D+ EDE ED++E D N D S+ K++
Sbjct: 680 VDNSDKVRYKMDDDDDEEEE-------DEDEDENENEDEEEEDWL-NYNNLDSNSHSKRR 525
>TC93070 weakly similar to GP|17065230|gb|AAL32769.1 Unknown protein
{Arabidopsis thaliana}, partial (19%)
Length = 679
Score = 32.7 bits (73), Expect = 0.66
Identities = 31/95 (32%), Positives = 44/95 (45%), Gaps = 6/95 (6%)
Frame = +1
Query: 93 STKSSEFIDRRIGENDEGLDDFGKAILRSQRERQLNVKSSKKSKY--HLSEED-DDEFEG 149
S+ SS F+ RR K + S +K+++KS Y LS D DDE E
Sbjct: 100 SSSSSSFLKRR------------KFTVLSAHSNPKILKTNRKSTYGKFLSPYDSDDEIEE 243
Query: 150 ID---GLGRDDFEDEMLGEDDDETDET*NQEGSDE 181
+D D+ EDE +DDD+ DE + +G E
Sbjct: 244 MDFEDDEDEDEDEDEDDDDDDDDDDEDDDDDGFAE 348
>TC84149 weakly similar to GP|21751020|dbj|BAC03887. unnamed protein product
{Homo sapiens}, partial (16%)
Length = 658
Score = 32.3 bits (72), Expect = 0.86
Identities = 21/58 (36%), Positives = 28/58 (48%), Gaps = 5/58 (8%)
Frame = +1
Query: 119 LRSQRERQLNVKSSKKSKYHLSEED-----DDEFEGIDGLGRDDFEDEMLGEDDDETD 171
LR+ S + H+ +ED DDE D LG DD ED + G+DDD+ D
Sbjct: 457 LRNSSYSHPAANESTRDYDHVDDEDRLGGNDDE----DRLGGDDDEDRLGGDDDDDDD 618
>TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~unknown
protein {Arabidopsis thaliana}, partial (46%)
Length = 2077
Score = 32.0 bits (71), Expect = 1.1
Identities = 20/60 (33%), Positives = 30/60 (49%), Gaps = 8/60 (13%)
Frame = +2
Query: 130 KSSKKSKYHLSEEDDDEFEGIDGLGRDDFEDEML--------GEDDDETDET*NQEGSDE 181
K KK + L EE+ D+F+ DDFE GEDD E ++ ++EGS++
Sbjct: 344 KLEKKKRVELEEEESDDFDEELDDDDDDFEGVEKKKAKGGSEGEDDFEEEDDEDEEGSED 523
Score = 28.9 bits (63), Expect = 9.6
Identities = 26/94 (27%), Positives = 42/94 (44%)
Frame = +2
Query: 79 VEKRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILRSQRERQLNVKSSKKSKYH 138
+EK+K+ L+E E S+ D + ++D DDF + +K K
Sbjct: 347 LEKKKRVELEEEE-----SDDFDEELDDDD---DDF---------------EGVEKKKAK 457
Query: 139 LSEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDE 172
E +D+FE DD ++E ++DDE DE
Sbjct: 458 GGSEGEDDFE-----EEDDEDEEGSEDEDDEEDE 544
>TC83157 similar to GP|21595368|gb|AAM66095.1 unknown {Arabidopsis
thaliana}, partial (57%)
Length = 1108
Score = 31.2 bits (69), Expect = 1.9
Identities = 15/41 (36%), Positives = 26/41 (62%)
Frame = +3
Query: 133 KKSKYHLSEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDET 173
K+ H E++DD+ +G D DD+++E G+ +DE D+T
Sbjct: 315 KEGGTHEIEDEDDDSDGDDDDDDDDYDEEE-GDYEDEDDDT 434
>BE248780 similar to GP|5929884|gb|A nucleolin-related protein NRP {Rattus
norvegicus}, partial (4%)
Length = 561
Score = 31.2 bits (69), Expect = 1.9
Identities = 15/41 (36%), Positives = 26/41 (62%)
Frame = +2
Query: 133 KKSKYHLSEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDET 173
K+ H E++DD+ +G D DD+++E G+ +DE D+T
Sbjct: 308 KEGGTHEIEDEDDDSDGDDDDDDDDYDEEE-GDYEDEDDDT 427
>TC87692 similar to GP|21537190|gb|AAM61531.1 putative SKP1-like protein
{Arabidopsis thaliana}, partial (50%)
Length = 1539
Score = 31.2 bits (69), Expect = 1.9
Identities = 29/109 (26%), Positives = 47/109 (42%), Gaps = 18/109 (16%)
Frame = +3
Query: 81 KRKKTLLKEYEQSTKSSEFIDRRIGENDEGLDDFGKAILRSQRERQLNVKSSKKSKY--- 137
K+++ LLKE E K+ + + E++ +DD I +E + K+ KK K
Sbjct: 792 KKRRELLKERESIKKNVD-----VEEDERSVDDLLSFINGDPKEIKTTSKNKKKRKKGQH 956
Query: 138 -------------HLSEEDDDEFE--GIDGLGRDDFEDEMLGEDDDETD 171
+ S E D FE G+ G +F+D+ + DDE D
Sbjct: 957 KKNVKVNGHDDIGNQSAETDKPFETSGLHGDFMVEFDDDNSDDIDDEID 1103
>TC86645 similar to PIR|H86321|H86321 hypothetical protein AAF27100.1
[imported] - Arabidopsis thaliana, partial (76%)
Length = 1116
Score = 30.8 bits (68), Expect = 2.5
Identities = 17/33 (51%), Positives = 21/33 (63%)
Frame = +3
Query: 140 SEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDE 172
+EEDDDE G G DD ED+ EDDD+ +E
Sbjct: 738 AEEDDDE-AGDAGKDDDDSEDDDDQEDDDDDEE 833
>TC90460 similar to GP|15293207|gb|AAK93714.1 unknown protein {Arabidopsis
thaliana}, partial (28%)
Length = 806
Score = 30.8 bits (68), Expect = 2.5
Identities = 26/79 (32%), Positives = 37/79 (45%), Gaps = 1/79 (1%)
Frame = +3
Query: 103 RIGENDEGLDDFGKAILRSQRERQLNVKSSKKSKYHLSEEDDDEFEGIDGLGRDDFEDEM 162
R G ND+ L + L + R K K +E+DDD+ E I+ +G DD D+
Sbjct: 288 RKGNNDQNLQETMDKELDTHRLEH-GPKKRKIPDGTNNEDDDDDVEDIN-VGEDDMMDDE 461
Query: 163 LGEDDD-ETDET*NQEGSD 180
L +D DET E S+
Sbjct: 462 LDDDSGRRIDETIKTETSN 518
>CA991433 weakly similar to GP|2117310|emb| hypothetical serine-rich protein
(52 consecutive serines); contains WSC domain; possibly
involved, partial (9%)
Length = 414
Score = 30.4 bits (67), Expect = 3.3
Identities = 10/29 (34%), Positives = 21/29 (71%)
Frame = +1
Query: 144 DDEFEGIDGLGRDDFEDEMLGEDDDETDE 172
D+E +G+D + D +D+ + ED+++T+E
Sbjct: 85 DEETQGVDHVEEDQKQDDSMDEDEEDTEE 171
>BG646240 similar to GP|18491263|gb| F14N22.7/F14N22.7 {Arabidopsis
thaliana}, partial (26%)
Length = 775
Score = 30.4 bits (67), Expect = 3.3
Identities = 17/55 (30%), Positives = 30/55 (53%), Gaps = 2/55 (3%)
Frame = +3
Query: 130 KSSKKSKYHLSEEDDDEFEGIDGLGRDDFEDEMLGEDDDE--TDET*NQEGSDER 182
+ + +S ++ E++DE + + D +DE L +DDDE + T GS+ER
Sbjct: 288 EEASRSDQDVNNEENDEACDHELVNNGDDDDEKLAKDDDEGSSSSTKRSSGSNER 452
>TC90776 similar to GP|7269000|emb|CAB80733.1 contains EST gb:AI998867.1
{Arabidopsis thaliana}, partial (17%)
Length = 1264
Score = 30.4 bits (67), Expect = 3.3
Identities = 21/86 (24%), Positives = 42/86 (48%), Gaps = 11/86 (12%)
Frame = +2
Query: 79 VEKRKKTLLKEYEQSTKS---------SEFIDR--RIGENDEGLDDFGKAILRSQRERQL 127
++ RK+++ K +++ K+ S F + +N G+D K + R +
Sbjct: 524 IDSRKESVFKNFDEIVKNPGPKTTYEVSIFASDTWKKAKNKNGIDTDIKKSSKFTRSVRH 703
Query: 128 NVKSSKKSKYHLSEEDDDEFEGIDGL 153
NVK+S+K + + DDE + +DG+
Sbjct: 704 NVKNSEKDQLGEDSDTDDEGQMVDGI 781
>AW257087 homologue to GP|6683126|dbj KIAA0339 protein {Homo sapiens},
partial (2%)
Length = 466
Score = 30.4 bits (67), Expect = 3.3
Identities = 20/54 (37%), Positives = 26/54 (48%)
Frame = -3
Query: 131 SSKKSKYHLSEEDDDEFEGIDGLGRDDFEDEMLGEDDDETDET*NQEGSDERSY 184
SS S L +ED+DE E D+ EDE EDD+E E E + S+
Sbjct: 407 SSSSSSSLLDDEDEDEDE-------DEDEDEDEDEDDEELLEESESESESDSSF 267
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.377 0.171 0.654
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 25,567,983
Number of Sequences: 36976
Number of extensions: 347207
Number of successful extensions: 6806
Number of sequences better than 10.0: 52
Number of HSP's better than 10.0 without gapping: 1238
Number of HSP's successfully gapped in prelim test: 262
Number of HSP's that attempted gapping in prelim test: 5274
Number of HSP's gapped (non-prelim): 1768
length of query: 930
length of database: 9,014,727
effective HSP length: 105
effective length of query: 825
effective length of database: 5,132,247
effective search space: 4234103775
effective search space used: 4234103775
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 13 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 35 (21.6 bits)
S2: 63 (28.9 bits)
Medicago: description of AC142096.6