
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC121237.5 + phase: 2 /pseudo
(194 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC77833 similar to PIR|T09559|T09559 hypothetical protein L73G19... 256 3e-77
TC89483 similar to GP|18377662|gb|AAL66981.1 unknown protein {Ar... 35 0.018
TC80239 similar to GP|7208779|emb|CAB76912.1 hypothetical protei... 35 0.018
BQ752218 similar to GP|19170914|emb hypothetical protein {Enceph... 34 0.041
TC83675 similar to PIR|T14321|T14321 nuclear matrix constituent ... 34 0.041
TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~un... 33 0.091
AW685995 similar to PIR|I51618|I516 nucleolar phosphoprotein - A... 32 0.16
TC85955 similar to GP|3860319|emb|CAA10127.1 nucleolar protein {... 32 0.20
AJ503517 similar to GP|8572252|gb|A erythropoietin receptor {Sus... 32 0.20
TC87387 homologue to PIR|T47775|T47775 hypothetical protein F24I... 31 0.26
BQ147280 similar to PIR|C86333|C863 hypothetical protein AAF7991... 30 0.45
TC85442 similar to GP|13543783|gb|AAH06040.1 Unknown (protein fo... 30 0.59
TC83625 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana taba... 30 0.77
TC81816 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana taba... 30 0.77
BG455563 similar to PIR|T05151|T051 hypothetical protein F18E5.5... 29 1.0
CA920908 homologue to PIR|T06377|T06 SAR DNA-binding protein-1 -... 29 1.3
AW690594 similar to GP|23498163|emb hypothetical protein {Plasmo... 29 1.3
TC88253 weakly similar to GP|15810597|gb|AAL07186.1 unknown prot... 29 1.3
TC83583 similar to PIR|T06379|T06379 SAR DNA-binding protein 2 -... 28 1.7
CA860047 weakly similar to GP|15081680|gb| AT3g12390/T2E22_130 {... 28 2.2
>TC77833 similar to PIR|T09559|T09559 hypothetical protein L73G19.50 -
Arabidopsis thaliana, partial (49%)
Length = 1472
Score = 256 bits (653), Expect(2) = 3e-77
Identities = 127/128 (99%), Positives = 127/128 (99%)
Frame = +2
Query: 44 GQQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDE 103
GQQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISEL KKKQKKKEEQQRLDE
Sbjct: 620 GQQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELGKKKQKKKEEQQRLDE 799
Query: 104 EGAAIAEAVALHVLLDEDSDDSYKVECKTWDDYNNNLDFFMSGKRACFPNLDGSTWSVTS 163
EGAAIAEAVALHVLLDEDSDDSYKVECKTWDDYNNNLDFFMSGKRACFPNLDGSTWSVTS
Sbjct: 800 EGAAIAEAVALHVLLDEDSDDSYKVECKTWDDYNNNLDFFMSGKRACFPNLDGSTWSVTS 979
Query: 164 QNGKWSIS 171
QNGKWSIS
Sbjct: 980 QNGKWSIS 1003
Score = 87.0 bits (214), Expect = 4e-18
Identities = 44/44 (100%), Positives = 44/44 (100%)
Frame = +1
Query: 1 WESASVVVS*EEWGQGMDLRVLMNFL*CCLLSFLSGIA*FVK*G 44
WESASVVVS*EEWGQGMDLRVLMNFL*CCLLSFLSGIA*FVK*G
Sbjct: 262 WESASVVVS*EEWGQGMDLRVLMNFL*CCLLSFLSGIA*FVK*G 393
Score = 49.7 bits (117), Expect(2) = 3e-77
Identities = 23/23 (100%), Positives = 23/23 (100%)
Frame = +1
Query: 172 FRTVRKQCTRTTLRSRMGSYKIL 194
FRTVRKQCTRTTLRSRMGSYKIL
Sbjct: 1003 FRTVRKQCTRTTLRSRMGSYKIL 1071
>TC89483 similar to GP|18377662|gb|AAL66981.1 unknown protein {Arabidopsis
thaliana}, partial (35%)
Length = 1711
Score = 35.0 bits (79), Expect = 0.018
Identities = 19/80 (23%), Positives = 43/80 (53%)
Frame = +1
Query: 45 QQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDEE 104
++ R ++ + + E DR+++ ++KRR+E+ + E E ++++K++E+Q E+
Sbjct: 169 REKERVLERYEREAERDRIRKEREQKRRIEEVERQFELQLKEWEYREREKEKERQYEKEK 348
Query: 105 GAAIAEAVALHVLLDEDSDD 124
+L DE+ DD
Sbjct: 349 EKDRERKRRKEILYDEEDDD 408
>TC80239 similar to GP|7208779|emb|CAB76912.1 hypothetical protein {Cicer
arietinum}, partial (43%)
Length = 930
Score = 35.0 bits (79), Expect = 0.018
Identities = 19/62 (30%), Positives = 39/62 (62%), Gaps = 1/62 (1%)
Frame = +3
Query: 45 QQHARKTKKKQVKDELDRLKQA-EKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDE 103
++ K K+K+ +E+++ K+A ++KKR EK A A+ ++ +QK+KE ++R +
Sbjct: 129 KEEEAKLKEKKRLEEIEKAKEALQRKKRNAEK--AQQRALYKAQKEAEQKEKEREKRAKK 302
Query: 104 EG 105
+G
Sbjct: 303 KG 308
>BQ752218 similar to GP|19170914|emb hypothetical protein {Encephalitozoon
cuniculi}, partial (5%)
Length = 637
Score = 33.9 bits (76), Expect = 0.041
Identities = 20/55 (36%), Positives = 31/55 (56%), Gaps = 2/55 (3%)
Frame = -2
Query: 52 KKKQVKDELDRLKQAEKKKRRLEKALATSAAIISEL--EKKKQKKKEEQQRLDEE 104
KK+ K E K+AEKKK++ + A A E +K K++KKE +++ EE
Sbjct: 633 KKRPKKKEEKAKKEAEKKKKKAREEAAKKAKKAREAAYKKAKEEKKEAEKKAKEE 469
>TC83675 similar to PIR|T14321|T14321 nuclear matrix constituent protein 1 -
carrot, partial (17%)
Length = 905
Score = 33.9 bits (76), Expect = 0.041
Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 9/77 (11%)
Frame = +3
Query: 55 QVKDELDR--------LKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLD-EEG 105
+VKD L++ L +AEK++ L KAL + +LEK ++ + E ++
Sbjct: 39 EVKDALEQEKAAHLFALSEAEKREENLRKALGVEKECVLDLEKALREMRSEHAKIKFAAD 218
Query: 106 AAIAEAVALHVLLDEDS 122
+ +AEA AL ++E S
Sbjct: 219 SKLAEANALIASVEEKS 269
>TC87688 similar to GP|10177535|dbj|BAB10930. gene_id:K1F13.21~unknown protein
{Arabidopsis thaliana}, partial (46%)
Length = 2077
Score = 32.7 bits (73), Expect = 0.091
Identities = 18/44 (40%), Positives = 28/44 (62%)
Frame = +1
Query: 53 KKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKE 96
K VK+E + L QAE+K+RR K A +++LEKK + +K+
Sbjct: 1501 KGDVKEEAE-LTQAERKRRRANKKRKFKAEAVNKLEKKARVEKK 1629
>AW685995 similar to PIR|I51618|I516 nucleolar phosphoprotein - African
clawed frog, partial (5%)
Length = 516
Score = 32.0 bits (71), Expect = 0.16
Identities = 22/65 (33%), Positives = 34/65 (51%), Gaps = 3/65 (4%)
Frame = +3
Query: 50 KTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQ---QRLDEEGA 106
KT KK +E KK ++EK AA E +KK+QKKKEE+ ++E+ +
Sbjct: 33 KTSKKVKAEE-------PKKVEKVEKTKGKKAAKAEEEKKKQQKKKEEKPAPMEVEEDSS 191
Query: 107 AIAEA 111
+ E+
Sbjct: 192 SSEES 206
>TC85955 similar to GP|3860319|emb|CAA10127.1 nucleolar protein {Cicer
arietinum}, partial (98%)
Length = 1981
Score = 31.6 bits (70), Expect = 0.20
Identities = 18/57 (31%), Positives = 33/57 (57%)
Frame = +2
Query: 48 ARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDEE 104
A+KTKKK + K A+ ++KA + + + +K+KKK+E+++LD+E
Sbjct: 1418 AKKTKKK-------KQKAADGDDMAVDKAAEITNGDAEDHKSEKKKKKKEKRKLDQE 1567
>AJ503517 similar to GP|8572252|gb|A erythropoietin receptor {Sus scrofa},
partial (3%)
Length = 591
Score = 31.6 bits (70), Expect = 0.20
Identities = 25/84 (29%), Positives = 36/84 (42%)
Frame = -1
Query: 49 RKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDEEGAAI 108
+ K Q +++ Q E KK + K E +++K K KE ++DEEG
Sbjct: 393 KNQKHHQNQNQNQTESQKEDKKSKRNKV---------EEDRRKNKDKE*DDQVDEEGNKK 241
Query: 109 AEAVALHVLLDEDSDDSYKVECKT 132
+ A H LLD ECKT
Sbjct: 240 EDPKADHCLLDH--------ECKT 193
>TC87387 homologue to PIR|T47775|T47775 hypothetical protein F24I3.230 -
Arabidopsis thaliana, partial (14%)
Length = 1074
Score = 31.2 bits (69), Expect = 0.26
Identities = 13/53 (24%), Positives = 33/53 (61%)
Frame = +3
Query: 45 QQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEE 97
++ +K K++ +D +++++KKK++ ++ + + + EKKK+KK +E
Sbjct: 456 EKEKKKKHKEKGEDGSPEVEKSDKKKKKHKETSEVGSPEVDKSEKKKKKKDKE 614
Score = 29.3 bits (64), Expect = 1.0
Identities = 19/56 (33%), Positives = 35/56 (61%), Gaps = 3/56 (5%)
Frame = +3
Query: 50 KTKKKQVKD---ELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLD 102
K K K+V D E++ K+ +KKK++ +K +A+ ++EK+K KKK +++ D
Sbjct: 333 KVKTKKVDDAAVEVEDDKKEKKKKKKKDKENGAAASDEEKVEKEK-KKKHKEKGED 497
>BQ147280 similar to PIR|C86333|C863 hypothetical protein AAF79914.1
[imported] - Arabidopsis thaliana, partial (10%)
Length = 666
Score = 30.4 bits (67), Expect = 0.45
Identities = 16/51 (31%), Positives = 31/51 (60%)
Frame = +2
Query: 48 ARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQ 98
+R KK++KD+ D + + K + K ++ + + E++KKK+ KKEE+
Sbjct: 245 SRSKVKKELKDDDDDSDEDDDDKP-IAKKISKTKVVKEEVKKKKKVKKEEE 394
>TC85442 similar to GP|13543783|gb|AAH06040.1 Unknown (protein for MGC:7642)
{Mus musculus}, partial (62%)
Length = 585
Score = 30.0 bits (66), Expect = 0.59
Identities = 19/64 (29%), Positives = 33/64 (50%)
Frame = +1
Query: 42 K*GQQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRL 101
+* QQ +++ KK Q R K KKK+R +K + K ++KKK +++R
Sbjct: 43 Q*DQQRSQRRKKSQ------RKKNQRKKKKRKKKQRRKKRRKKGKRRKNQRKKKRKRKRR 204
Query: 102 DEEG 105
++G
Sbjct: 205 KKKG 216
>TC83625 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana tabacum},
partial (9%)
Length = 908
Score = 29.6 bits (65), Expect = 0.77
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 1/76 (1%)
Frame = +2
Query: 50 KTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEE-QQRLDEEGAAI 108
KT + KD+ D+ K+ ++KK K + EKKK++KKE+ ++ D++G
Sbjct: 608 KTDVDEGKDKKDKEKKKKEKKEENVKGEEEDGDEKKDKEKKKKEKKEKGKEDKDKDGEEK 787
Query: 109 AEAVALHVLLDEDSDD 124
D++ DD
Sbjct: 788 KSKKDKEKKKDKNEDD 835
Score = 29.3 bits (64), Expect = 1.0
Identities = 16/46 (34%), Positives = 25/46 (53%)
Frame = +2
Query: 59 ELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDEE 104
E D K+ +K K + +K + EKKK++KKEE + +EE
Sbjct: 560 EKDDEKKEKKDKEKKDKTDVDEGKDKKDKEKKKKEKKEENVKGEEE 697
Score = 28.5 bits (62), Expect = 1.7
Identities = 17/64 (26%), Positives = 37/64 (57%), Gaps = 4/64 (6%)
Frame = +2
Query: 45 QQHARKTKKKQVKDELDRLK----QAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQR 100
++ +K K+K+ K ++D K + +KKK + E+ + E + K++KKKE++++
Sbjct: 572 EKKEKKDKEKKDKTDVDEGKDKKDKEKKKKEKKEENVKGEEEDGDEKKDKEKKKKEKKEK 751
Query: 101 LDEE 104
E+
Sbjct: 752 GKED 763
>TC81816 similar to GP|8096269|dbj|BAA95789.1 KED {Nicotiana tabacum},
partial (13%)
Length = 663
Score = 29.6 bits (65), Expect = 0.77
Identities = 21/76 (27%), Positives = 37/76 (48%), Gaps = 1/76 (1%)
Frame = +3
Query: 50 KTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEE-QQRLDEEGAAI 108
KT + KD+ D+ K+ ++KK K + EKKK++KKE+ ++ D++G
Sbjct: 216 KTDVDEGKDKKDKEKKKKEKKEENVKGEEEDGDEKKDKEKKKKEKKEKGKEDKDKDGEEK 395
Query: 109 AEAVALHVLLDEDSDD 124
D++ DD
Sbjct: 396 KSKKDKEKKKDKNEDD 443
Score = 29.3 bits (64), Expect = 1.0
Identities = 16/46 (34%), Positives = 25/46 (53%)
Frame = +3
Query: 59 ELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDEE 104
E D K+ +K K + +K + EKKK++KKEE + +EE
Sbjct: 168 EKDDEKKEKKDKEKKDKTDVDEGKDKKDKEKKKKEKKEENVKGEEE 305
Score = 28.5 bits (62), Expect = 1.7
Identities = 17/64 (26%), Positives = 37/64 (57%), Gaps = 4/64 (6%)
Frame = +3
Query: 45 QQHARKTKKKQVKDELDRLK----QAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQR 100
++ +K K+K+ K ++D K + +KKK + E+ + E + K++KKKE++++
Sbjct: 180 EKKEKKDKEKKDKTDVDEGKDKKDKEKKKKEKKEENVKGEEEDGDEKKDKEKKKKEKKEK 359
Query: 101 LDEE 104
E+
Sbjct: 360 GKED 371
Score = 28.1 bits (61), Expect = 2.2
Identities = 22/82 (26%), Positives = 33/82 (39%)
Frame = +3
Query: 50 KTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDEEGAAIA 109
K KKK ++ D + KKK+ + KK+KKKEE ++ EEG
Sbjct: 411 KEKKKDKNEDDDEGEDGSKKKKN---------------KDKKEKKKEEDEK--EEGKVSV 539
Query: 110 EAVALHVLLDEDSDDSYKVECK 131
+ + E + K E K
Sbjct: 540 RDIDIEETAKEGKEKKKKKEDK 605
>BG455563 similar to PIR|T05151|T051 hypothetical protein F18E5.50 -
Arabidopsis thaliana, partial (7%)
Length = 663
Score = 29.3 bits (64), Expect = 1.0
Identities = 18/67 (26%), Positives = 34/67 (49%), Gaps = 7/67 (10%)
Frame = +3
Query: 45 QQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSA-------AIISELEKKKQKKKEE 97
Q++ + KKK+ D ++ ++ KKK++ +K + E+EKKK K+
Sbjct: 273 QRNMKMKKKKKQNDVVEVEIRSNKKKKKSKKNDVVDVQLDLIRMVLQREVEKKKNPNKKN 452
Query: 98 QQRLDEE 104
+ +DEE
Sbjct: 453 VKNVDEE 473
>CA920908 homologue to PIR|T06377|T06 SAR DNA-binding protein-1 - garden pea,
partial (37%)
Length = 748
Score = 28.9 bits (63), Expect = 1.3
Identities = 13/49 (26%), Positives = 30/49 (60%)
Frame = -2
Query: 52 KKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQR 100
KKK+ K++ + K+ E+ ++ A+ S +KKK+KK++++++
Sbjct: 147 KKKEKKEKKKKEKKEEEDTQKSNTAMDEDTQEPSAADKKKEKKEKKEKK 1
>AW690594 similar to GP|23498163|emb hypothetical protein {Plasmodium
falciparum 3D7}, partial (10%)
Length = 633
Score = 28.9 bits (63), Expect = 1.3
Identities = 15/52 (28%), Positives = 30/52 (56%)
Frame = +1
Query: 49 RKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQR 100
RKTK K K +++ ++ + R +K ++ + KKK+KKK++++R
Sbjct: 61 RKTKVK*KKKKMNMTRKKKMNMMRKKKKKKKKKKKMNMMTKKKKKKKKKKRR 216
Score = 28.9 bits (63), Expect = 1.3
Identities = 17/65 (26%), Positives = 41/65 (62%)
Frame = +1
Query: 41 VK*GQQHARKTKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQR 100
VK* ++ T+KK++ + + + K+ +KKK+++ ++++ +KKK+KKK ++
Sbjct: 73 VK*KKKKMNMTRKKKM-NMMRKKKKKKKKKKKMN--------MMTKKKKKKKKKKRRRRM 225
Query: 101 LDEEG 105
+ ++G
Sbjct: 226 MMKKG 240
>TC88253 weakly similar to GP|15810597|gb|AAL07186.1 unknown protein
{Arabidopsis thaliana}, partial (51%)
Length = 1122
Score = 28.9 bits (63), Expect = 1.3
Identities = 15/45 (33%), Positives = 29/45 (64%)
Frame = +3
Query: 56 VKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQKKKEEQQR 100
VK +D K ++ +R +K A+I+ + EKK+++KKEE+++
Sbjct: 621 VKGVIDPTKLVDEVFKRTKK----QASIVKKEEKKEEEKKEEEKK 743
>TC83583 similar to PIR|T06379|T06379 SAR DNA-binding protein 2 - garden
pea, partial (33%)
Length = 852
Score = 28.5 bits (62), Expect = 1.7
Identities = 14/43 (32%), Positives = 23/43 (52%)
Frame = +2
Query: 51 TKKKQVKDELDRLKQAEKKKRRLEKALATSAAIISELEKKKQK 93
T KK+ K E K+ EKK+ +E ++ + +KKK+K
Sbjct: 395 TDKKKEKKEKKEKKKKEKKEEEVEDVEEPEEEVVKKEKKKKKK 523
>CA860047 weakly similar to GP|15081680|gb| AT3g12390/T2E22_130 {Arabidopsis
thaliana}, partial (37%)
Length = 543
Score = 28.1 bits (61), Expect = 2.2
Identities = 20/58 (34%), Positives = 27/58 (46%)
Frame = +3
Query: 67 EKKKRRLEKALATSAAIISELEKKKQKKKEEQQRLDEEGAAIAEAVALHVLLDEDSDD 124
E K + + + + + SE EKKKQ K Q + EE E V +EDSDD
Sbjct: 87 EPKIQEITEENSAEGSKPSESEKKKQSVKI--QEITEESGGALEKKTTTVESEEDSDD 254
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.328 0.137 0.418
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,603,771
Number of Sequences: 36976
Number of extensions: 74179
Number of successful extensions: 827
Number of sequences better than 10.0: 60
Number of HSP's better than 10.0 without gapping: 721
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 783
length of query: 194
length of database: 9,014,727
effective HSP length: 91
effective length of query: 103
effective length of database: 5,649,911
effective search space: 581940833
effective search space used: 581940833
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 56 (26.2 bits)
Medicago: description of AC121237.5