
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC145767.1 - phase: 0
(191 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC89847 similar to GP|20127030|gb|AAM10939.1 putative bHLH trans... 280 3e-76
TC86368 similar to GP|15451010|gb|AAK96776.1 Unknown protein {Ar... 145 9e-36
TC86369 similar to GP|15451010|gb|AAK96776.1 Unknown protein {Ar... 141 2e-34
TC86370 similar to GP|15451010|gb|AAK96776.1 Unknown protein {Ar... 122 1e-30
TC78236 weakly similar to GP|15451010|gb|AAK96776.1 Unknown prot... 123 5e-29
TC89281 similar to GP|11994197|dbj|BAB01300. emb|CAA18500.1~gene... 66 9e-12
TC88472 similar to GP|19423958|gb|AAL87269.1 unknown protein {Ar... 50 5e-07
AJ388878 35 0.014
TC91697 similar to GP|10177356|dbj|BAB10699. contains similarity... 33 0.052
TC91264 weakly similar to PIR|T27543|T27543 hypothetical protein... 33 0.088
TC80007 33 0.088
TC79050 homologue to GP|18030195|gb|AAG00027.2 Hypothetical prot... 32 0.12
TC79945 similar to GP|19310501|gb|AAL84984.1 AT5g59210/mnc17_100... 32 0.15
TC77220 similar to GP|22597162|gb|AAN03468.1 bZIP transcription ... 32 0.20
TC79250 similar to PIR|T06329|T06329 symbiotic ammonium transpor... 31 0.26
TC82414 similar to PIR|T49915|T49915 pre-mRNA splicing factor AT... 30 0.44
TC79565 similar to SP|Q9ZQ70|WRK3_ARATH Probable WRKY transcript... 30 0.75
TC86963 similar to GP|15148918|gb|AAK84886.1 homeodomain leucine... 30 0.75
TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein... 30 0.75
AJ500404 29 1.3
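The summary list above has a regular shape: a sequence identifier, an optional (possibly truncated) description, a bit score, and an E-value. As a minimal sketch, the Python below splits such lines from the right. The names parse_hit_line and significant_hits are hypothetical helpers and are not part of BLAST. Handling of an E-value printed without a leading digit (such as "e-100") is an assumption about old NCBI BLAST output, and no such value occurs in this report.

```python
def parse_hit_line(line):
    """Split one summary line into (id, description, bit score, E-value)."""
    # The last two whitespace-separated fields are always score and E-value.
    head, bits, evalue = line.rsplit(None, 2)
    seq_id, _, desc = head.partition(" ")
    # Assumption: very small E-values may be printed as "e-100"; none appear here.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, desc.strip(), float(bits), float(evalue)


def significant_hits(lines, max_evalue=1e-5):
    """Return identifiers of hits whose E-value is at or below max_evalue."""
    hits = [parse_hit_line(line) for line in lines if line.strip()]
    return [seq_id for seq_id, _, _, evalue in hits if evalue <= max_evalue]


# Four lines copied from the list above (descriptions left truncated as printed).
sample = [
    "TC89847 similar to GP|20127030|gb|AAM10939.1 putative bHLH trans... 280 3e-76",
    "TC88472 similar to GP|19423958|gb|AAL87269.1 unknown protein {Ar... 50 5e-07",
    "AJ388878 35 0.014",
    "AJ500404 29 1.3",
]

if __name__ == "__main__":
    for line in sample:
        print(parse_hit_line(line))
    print(significant_hits(sample))
```

The 1e-5 cutoff is only an illustrative choice; the report itself lists hits up to E = 10.0.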
>TC89847 similar to GP|20127030|gb|AAM10939.1 putative bHLH transcription
factor {Arabidopsis thaliana}, partial (42%)
Length = 1030
Score = 280 bits (715), Expect = 3e-76
Identities = 138/138 (100%), Positives = 138/138 (100%)
Frame = +3
Query: 54 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE 113
RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE
Sbjct: 348 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE 527
Query: 114 LREEKLVLKADKEKIEKQLKSMPVSPAGFMPPPPMAAYQASVNKMAVYPNYGYIPMWHYL 173
LREEKLVLKADKEKIEKQLKSMPVSPAGFMPPPPMAAYQASVNKMAVYPNYGYIPMWHYL
Sbjct: 528 LREEKLVLKADKEKIEKQLKSMPVSPAGFMPPPPMAAYQASVNKMAVYPNYGYIPMWHYL 707
Query: 174 PQSARDTSQDHELRPPAA 191
PQSARDTSQDHELRPPAA
Sbjct: 708 PQSARDTSQDHELRPPAA 761
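Because TBLASTN translates the nucleotide database, the Sbjct coordinates are nucleotide positions while the Query coordinates are amino-acid positions. The sketch below relates the two for one HSP. The function names are hypothetical, and the frame convention used is an assumption that was checked only against the Frame, Length, and Sbjct values printed in this report.

```python
def hsp_frame(start, end, subject_length):
    """Reading frame implied by 1-based nucleotide coordinates of an HSP.

    Forward hits are printed with start <= end; reverse-strand hits
    are printed with start > end.
    """
    if start <= end:
        return (start - 1) % 3 + 1
    return -((subject_length - start) % 3 + 1)


def aligned_codons(start, end):
    """Number of subject codons spanned by an HSP (gaps excluded)."""
    span = abs(end - start) + 1
    if span % 3 != 0:
        raise ValueError("nucleotide span is not a whole number of codons")
    return span // 3


if __name__ == "__main__":
    # TC89847: Sbjct 348..761, Length = 1030, reported Frame = +3, 138 residues
    print(hsp_frame(348, 761, 1030), aligned_codons(348, 761))
    # AJ388878: Sbjct 412..110, Length = 436, reported Frame = -1
    print(hsp_frame(412, 110, 436))
```

For TC89847 this gives frame +3 and 138 codons, matching the 138/138 identities reported for that hit.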
>TC86368 similar to GP|15451010|gb|AAK96776.1 Unknown protein {Arabidopsis
thaliana}, partial (71%)
Length = 1071
Score = 145 bits (366), Expect = 9e-36
Identities = 78/144 (54%), Positives = 105/144 (72%), Gaps = 6/144 (4%)
Frame = +2
Query: 54 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE 113
+F +L ++LEPGRP +TDK AIL DA+R+++QL+ EAQ+LK+SN L E+IK LK EKNE
Sbjct: 320 KFVELGSILEPGRPPKTDKAAILIDAVRMVTQLRGEAQKLKDSNSGLQEKIKELKVEKNE 499
Query: 114 LREEKLVLKADKEKIEKQLKSMPVSPAGFMPPPPM--AAY----QASVNKMAVYPNYGYI 167
LR+EK LKA+KEK+E+Q+KSM P GF+ PP AA+ QA NK+ + +Y +
Sbjct: 500 LRDEKQRLKAEKEKLEQQVKSMNTQP-GFLTHPPAIPAAFAHQGQAPSNKLMPFMSYPGV 676
Query: 168 PMWHYLPQSARDTSQDHELRPPAA 191
MW ++P +A DTSQDH LRPP A
Sbjct: 677 AMWQFMPPAAVDTSQDHVLRPPVA 748
>TC86369 similar to GP|15451010|gb|AAK96776.1 Unknown protein {Arabidopsis
thaliana}, partial (74%)
Length = 1500
Score = 141 bits (355), Expect = 2e-34
Identities = 75/144 (52%), Positives = 102/144 (70%), Gaps = 6/144 (4%)
Frame = +1
Query: 54 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE 113
+F +L ++LEPG P +TDK AIL DA+R+++QL+ EAQ+LK++N L E+IK LK EKNE
Sbjct: 742 KFIELGSILEPGGPAKTDKAAILIDAVRMVTQLRGEAQKLKDANSGLQEKIKELKVEKNE 921
Query: 114 LREEKLVLKADKEKIEKQLKSMPVSPAGFMPPPP------MAAYQASVNKMAVYPNYGYI 167
LR+EK LKA+KEK+E+QLKSM +P F+P P A QA NK+ + +Y +
Sbjct: 922 LRDEKQRLKAEKEKLEQQLKSMN-APPSFLPTPTALPAAFAAQGQAHGNKLVPFISYPGV 1098
Query: 168 PMWHYLPQSARDTSQDHELRPPAA 191
MW ++P +A DTSQDH LRPP A
Sbjct: 1099 AMWQFMPPAAVDTSQDHVLRPPVA 1170
>TC86370 similar to GP|15451010|gb|AAK96776.1 Unknown protein {Arabidopsis
thaliana}, partial (80%)
Length = 1050
Score = 122 bits (306), Expect(2) = 1e-30
Identities = 65/130 (50%), Positives = 92/130 (70%), Gaps = 6/130 (4%)
Frame = +3
Query: 54 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE 113
+F +L ++LEPGRP +TDK AIL DA+R+++QL+ EAQ+LK++N L E+IK LK EKNE
Sbjct: 531 KFIELGSILEPGRPAKTDKAAILIDAVRMVTQLRGEAQKLKDANSGLQEKIKELKVEKNE 710
Query: 114 LREEKLVLKADKEKIEKQLKSMPVSPAGFMPPPP------MAAYQASVNKMAVYPNYGYI 167
LR+EK LKA+KEK+E+QLKSM +P F+P P A QA NK+ + +Y +
Sbjct: 711 LRDEKQRLKAEKEKLEQQLKSMN-APPSFLPTPTALPAAFAAQGQAHGNKLVPFISYPGV 887
Query: 168 PMWHYLPQSA 177
MW ++P +A
Sbjct: 888 AMWQFMPPAA 917
Score = 27.3 bits (59), Expect(2) = 1e-30
Identities = 11/13 (84%), Positives = 11/13 (84%)
Frame = +1
Query: 179 DTSQDHELRPPAA 191
DTSQDH LRPP A
Sbjct: 922 DTSQDHVLRPPVA 960
>TC78236 weakly similar to GP|15451010|gb|AAK96776.1 Unknown protein
{Arabidopsis thaliana}, partial (61%)
Length = 1303
Score = 123 bits (308), Expect = 5e-29
Identities = 66/138 (47%), Positives = 94/138 (67%)
Frame = +1
Query: 54 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE 113
RF +LS+VLEP +TDK ++L+DA+RV++QL+ EA+ LKE N++L E++K LKAEK E
Sbjct: 397 RFMELSSVLEPDTLPKTDKVSLLNDAVRVVTQLRNEAERLKERNDELREKVKELKAEKKE 576
Query: 114 LREEKLVLKADKEKIEKQLKSMPVSPAGFMPPPPMAAYQASVNKMAVYPNYGYIPMWHYL 173
LR+EK LK DKEK+E+Q+K V + F+ A Q + +K+ + Y I MW ++
Sbjct: 577 LRDEKNKLKLDKEKLEQQVKLASVQ-SNFLSNAMAAKGQTANHKLMPFIGYPGISMWQFM 753
Query: 174 PQSARDTSQDHELRPPAA 191
+ DTSQDH LRPP A
Sbjct: 754 SPATVDTSQDHLLRPPVA 807
>TC89281 similar to GP|11994197|dbj|BAB01300.
emb|CAA18500.1~gene_id:MPN9.10~similar to unknown
protein {Arabidopsis thaliana}, partial (43%)
Length = 733
Score = 65.9 bits (159), Expect = 9e-12
Identities = 42/125 (33%), Positives = 64/125 (50%), Gaps = 14/125 (11%)
Frame = +1
Query: 54 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNE 113
+F +L +L+P RP + DK IL D +++L L ++ +LK+ L EE + L EKN+
Sbjct: 265 QFLELGNILDPDRP-KNDKATILGDTVQLLKDLSSQVSKLKDEYTMLNEESRELSQEKND 441
Query: 114 LREEKLVLKADKEKIEKQLK--------------SMPVSPAGFMPPPPMAAYQASVNKMA 159
LREEK LK+D E + Q + S+ ++P + P PM S+ M
Sbjct: 442 LREEKASLKSDIENLNNQYQLQLRTMYPWPAMDHSVMMAPPSYPYPVPMPVPAGSI-PMQ 618
Query: 160 VYPNY 164
YP Y
Sbjct: 619 PYPYY 633
>TC88472 similar to GP|19423958|gb|AAL87269.1 unknown protein {Arabidopsis
thaliana}, partial (46%)
Length = 1019
Score = 50.1 bits (118), Expect = 5e-07
Identities = 31/93 (33%), Positives = 48/93 (51%)
Frame = +2
Query: 55 FCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNEL 114
F DL+ L+ P K +IL +A R+L L + Q LK+ N LL E + EKNEL
Sbjct: 353 FLDLANALDLSEP-NNGKASILIEASRLLKDLLCQIQSLKKENVSLLSESHYVTMEKNEL 529
Query: 115 REEKLVLKADKEKIEKQLKSMPVSPAGFMPPPP 147
+EE L+ EK++ ++++ + PP
Sbjct: 530 KEENSSLETQIEKLQGEIQARIAQSKPDLNAPP 628
>AJ388878
Length = 436
Score = 35.4 bits (80), Expect = 0.014
Identities = 36/106 (33%), Positives = 56/106 (51%), Gaps = 9/106 (8%)
Frame = -1
Query: 63 EPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNELREEKLVLK 122
+PG PV T+ PA L+++ + S +L+ S ++ +E ++ L+A E ++L+
Sbjct: 412 DPGSPVSTEHPARLEESYQ--SP*ANL*LKLRHSPDQTVE-LRTLQAVAQA--PE*IMLR 248
Query: 123 --------ADKEKIEKQLKSMPVSPAGFMP-PPPMAAYQASVNKMA 159
ADK + +QL S PVSP G P M AYQASV + A
Sbjct: 247 *RAFSA*RADKVILHEQLPSTPVSPPG*KPLLRRMWAYQASVTQNA 110
>TC91697 similar to GP|10177356|dbj|BAB10699. contains similarity to unknown
protein~gb|AAF20227.1~gene_id:K15N18.15 {Arabidopsis
thaliana}, partial (76%)
Length = 862
Score = 33.5 bits (75), Expect = 0.052
Identities = 19/48 (39%), Positives = 29/48 (59%)
Frame = +2
Query: 87 KTEAQELKESNEKLLEEIKCLKAEKNELREEKLVLKADKEKIEKQLKS 134
K E ELKE EK +EIK LK E + L E +K + E+ +K++++
Sbjct: 263 KKENVELKEKEEKASKEIKQLKKELSTLTEGLKKVKMESEEKDKRVET 406
>TC91264 weakly similar to PIR|T27543|T27543 hypothetical protein ZC395.10 -
Caenorhabditis elegans, partial (24%)
Length = 767
Score = 32.7 bits (73), Expect = 0.088
Identities = 15/58 (25%), Positives = 36/58 (61%)
Frame = +2
Query: 76 LDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNELREEKLVLKADKEKIEKQLK 133
+DD ++ K E+ E+ E ++ E K +K+E+ E++ E+ +K+++++I+ + K
Sbjct: 440 VDDTTKINDSTKEESSEISEESK---SEEKEIKSEEKEIKSEEKEIKSEEKEIKSEEK 604
>TC80007
Length = 1599
Score = 32.7 bits (73), Expect = 0.088
Identities = 23/75 (30%), Positives = 38/75 (50%)
Frame = +1
Query: 56 CDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNELR 115
C+ A LE + ++ + I V K A + K ++ EE K LKA+++ELR
Sbjct: 1060 CNQLATLEQRKKELEEQINAIKANISVFQSAKITATKRKR---EVFEEAKTLKAQRDELR 1230
Query: 116 EEKLVLKADKEKIEK 130
E+ LK ++E +K
Sbjct: 1231 EQVPHLKDEREVAKK 1275
>TC79050 homologue to GP|18030195|gb|AAG00027.2 Hypothetical protein W01C8.3
{Caenorhabditis elegans}, partial (1%)
Length = 494
Score = 32.3 bits (72), Expect = 0.12
Identities = 20/54 (37%), Positives = 30/54 (55%)
Frame = -2
Query: 77 DDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNELREEKLVLKADKEKIEK 130
D ++ +S L+ E EL + N + EEIK L E ELR + + +EKIE+
Sbjct: 331 DQSVTKISALEKERDELVQENNEKKEEIKKLTLEMEELRSKG---EEMREKIEE 179
>TC79945 similar to GP|19310501|gb|AAL84984.1 AT5g59210/mnc17_100
{Arabidopsis thaliana}, partial (41%)
Length = 957
Score = 32.0 bits (71), Expect = 0.15
Identities = 19/56 (33%), Positives = 30/56 (52%)
Frame = +3
Query: 83 LSQLKTEAQELKESNEKLLEEIKCLKAEKNELREEKLVLKADKEKIEKQLKSMPVS 138
++ L +E QEL+E + E AE ++ +K DK+K++KQL M VS
Sbjct: 780 IASLMSEKQELEEKLNSMSREA----AEVSDKATQKTFTMEDKQKLDKQLHDMGVS 935
>TC77220 similar to GP|22597162|gb|AAN03468.1 bZIP transcription factor ATB2
{Glycine max}, partial (65%)
Length = 1341
Score = 31.6 bits (70), Expect = 0.20
Identities = 23/82 (28%), Positives = 37/82 (45%)
Frame = +3
Query: 63 EPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEKLLEEIKCLKAEKNELREEKLVLK 122
E R R K LDD LSQL+ E Q++ S + +++E + LR + L
Sbjct: 678 ESARRSRMRKQKHLDDLAVQLSQLRNENQQILTSVNLTTQRFLAVESENSVLRAQLNELN 857
Query: 123 ADKEKIEKQLKSMPVSPAGFMP 144
+ E + + + M V+ F P
Sbjct: 858 SRFESLNEIINFMNVANGVFEP 923
>TC79250 similar to PIR|T06329|T06329 symbiotic ammonium transport protein
SAT1 - soybean, partial (34%)
Length = 1145
Score = 31.2 bits (69), Expect = 0.26
Identities = 20/61 (32%), Positives = 36/61 (58%), Gaps = 1/61 (1%)
Frame = +2
Query: 54 RFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQLKTEAQELKESNEK-LLEEIKCLKAEKN 112
+F LSA+L + + DK ++L DAI + QL+ + + L+E N+K +E + + EK
Sbjct: 554 KFIALSALLPDLK--KMDKASVLGDAINHVKQLQEKVKLLEEKNQKNNVESVSMVYVEKT 727
Query: 113 E 113
+
Sbjct: 728 K 730
>TC82414 similar to PIR|T49915|T49915 pre-mRNA splicing factor ATP-dependent
RNA helicase-like protein - Arabidopsis thaliana,
partial (13%)
Length = 740
Score = 30.4 bits (67), Expect = 0.44
Identities = 16/41 (39%), Positives = 25/41 (60%)
Frame = +3
Query: 93 LKESNEKLLEEIKCLKAEKNELREEKLVLKADKEKIEKQLK 133
+KES+ LLE K K EK + EE LK ++ ++E++ K
Sbjct: 282 VKESDTSLLEHKKKQKREKTAMEEEMENLKKEQAELERENK 404
>TC79565 similar to SP|Q9ZQ70|WRK3_ARATH Probable WRKY transcription factor 3
(WRKY DNA-binding protein 3). [Mouse-ear cress], partial
(35%)
Length = 1602
Score = 29.6 bits (65), Expect = 0.75
Identities = 15/51 (29%), Positives = 27/51 (52%)
Frame = -1
Query: 2 SLIIVVLRHTETPRHERYAYMCTITKHLLSVLYCVLLLHMIKGCSFIVIVC 52
+L+ + H++T +H+ C + H+L +LY + LH I CS + C
Sbjct: 1527 ALVKTLATHSKTKKHQTLKITCEL*-HMLPILYKI*KLHYIYICSSFNLRC 1378
>TC86963 similar to GP|15148918|gb|AAK84886.1 homeodomain leucine zipper
protein HDZ2 {Phaseolus vulgaris}, partial (87%)
Length = 1519
Score = 29.6 bits (65), Expect = 0.75
Identities = 17/51 (33%), Positives = 26/51 (50%)
Frame = +1
Query: 86 LKTEAQELKESNEKLLEEIKCLKAEKNELREEKLVLKADKEKIEKQLKSMP 136
LK + L + N+KL EE+ LK + ++ DKEK+ + KS P
Sbjct: 946 LKDDYDNLLQENDKLKEEVNSLKNK---------LIPRDKEKVNSEDKSSP 1071
>TC80288 weakly similar to PIR|T06029|T06029 hypothetical protein T28I19.100
- Arabidopsis thaliana, partial (14%)
Length = 1460
Score = 29.6 bits (65), Expect = 0.75
Identities = 13/53 (24%), Positives = 35/53 (65%), Gaps = 2/53 (3%)
Frame = +1
Query: 83 LSQLKTEAQELKESNEKLLEEIK--CLKAEKNELREEKLVLKADKEKIEKQLK 133
++ + + +E KE++ KL +++K C+K EKN ++++ + + + K++ ++K
Sbjct: 451 VNTMNSNKKEKKETSTKLKKKVKTMCMKGEKNRMKKKISMEQKCRRKMKAKVK 609
>AJ500404
Length = 630
Score = 28.9 bits (63), Expect = 1.3
Identities = 29/113 (25%), Positives = 47/113 (40%), Gaps = 13/113 (11%)
Frame = -2
Query: 29 LLSVLYCVLLLHMIKGCSFIVIVCCRFCDLSAVLEPGRPVRTDKPAILDDAIRVLSQ--- 85
L SVLY L+ KGC+ + C L + VR P+ +++L
Sbjct: 383 LESVLYIKWLVEKYKGCNKLFFCCSDESRLQPIQSYCTTVRLSSPS-TQQIVKILEYIVQ 207
Query: 86 ---LKTEAQELK----ESNEKLLEEIKCLKA---EKNELREEKLVLKADKEKI 128
+K + +K S L + I+ L+A KN L ++ LVL ++ I
Sbjct: 206 EEGIKLSHESIKSIVLRSKNNLRQAIRSLEATYRNKNALNDDDLVLTGWEDDI 48
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.321 0.136 0.405
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,542,897
Number of Sequences: 36976
Number of extensions: 100298
Number of successful extensions: 893
Number of sequences better than 10.0: 70
Number of HSP's better than 10.0 without gapping: 873
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 884
length of query: 191
length of database: 9,014,727
effective HSP length: 91
effective length of query: 100
effective length of database: 5,649,911
effective search space: 564991100
effective search space used: 564991100
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 55 (25.8 bits)
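The statistics above are mutually consistent, and the arithmetic can be reproduced. The sketch below uses the standard Karlin-Altschul conversions, bit score = (lambda * S - ln K) / ln 2 and E = search space * 2^(-bit score). Treating the database length as total letters divided by 3, and subtracting the effective HSP length once per sequence, are relationships inferred from the printed numbers and are not claimed to be BLAST's exact internal procedure. Lambda and K are the rounded values printed in the report, so results agree only to the precision shown.

```python
import math

# Values copied from the report
LAMBDA_UNGAPPED, K_UNGAPPED = 0.321, 0.136
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410
QUERY_LEN = 191
DB_LETTERS = 27_044_181
DB_SEQS = 36_976
HSP_LEN = 91


def bit_score(raw, lam, k):
    """Convert a raw score to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def drop_off_bits(raw, lam):
    """Convert an X drop-off value to bits (no K term)."""
    return lam * raw / math.log(2)


# Inferred relationships (assumptions checked against the printed values)
db_len = DB_LETTERS // 3                # printed: 9,014,727
eff_query = QUERY_LEN - HSP_LEN         # printed: 100
eff_db = db_len - DB_SEQS * HSP_LEN     # printed: 5,649,911
search_space = eff_query * eff_db       # printed: 564991100


def expect(bits):
    """E-value for a given bit score in this search space."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    top = bit_score(715, LAMBDA_GAPPED, K_GAPPED)   # top hit, raw score 715
    print(round(top), expect(top))
    print(round(drop_off_bits(16, LAMBDA_UNGAPPED), 1))           # X1
    print(round(bit_score(55, LAMBDA_GAPPED, K_GAPPED), 1))       # S2
```

With these inputs the top hit (raw score 715) comes out at about 280 bits with an E-value near 3e-76, in line with the values reported for TC89847.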