
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC138579.4 + phase: 0
(835 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG587681 similar to GP|4538624|emb| hypothetical protein {Nicoti... 152 4e-37
TC90728 similar to GP|7715599|gb|AAF68117.1| F20B17.17 {Arabidop... 141 9e-34
CB892335 similar to GP|7715599|gb| F20B17.17 {Arabidopsis thalia... 112 4e-25
TC82359 weakly similar to GP|22655127|gb|AAM98154.1 putative pro... 107 8e-24
AL370071 similar to GP|15795135|dbj transposase-like protein {Ar... 64 3e-10
TC82286 weakly similar to PIR|B96544|B96544 hypothetical protein... 36 0.070
BF004402 similar to GP|7268572|emb| predicted protein of unknown... 35 0.091
TC89058 similar to GP|19071218|gb|AAL84162.1 putative heavy meta... 33 0.35
TC90414 32 0.77
BQ751257 32 0.77
CA857944 32 1.0
CB892426 similar to PIR|T05689|T0 hypothetical protein F20M13.17... 30 2.9
TC90737 30 5.0
TC90610 similar to GP|18766642|gb|AAL79042.1 NIMA-related protei... 29 8.5
AW688516 PIR|C96516|C965 F16N3.15 [imported] - Arabidopsis thali... 29 8.5
>BG587681 similar to GP|4538624|emb| hypothetical protein {Nicotiana
tabacum}, partial (12%)
Length = 412
Score = 152 bits (385), Expect = 4e-37
Identities = 73/76 (96%), Positives = 75/76 (98%)
Frame = +1
Query: 301 KLRKEVFSSICKFFCHAGIPLQAADSVYFHKMLELAGQYGQGLACPSSQLISGRFLQEEI 360
++ KEVFSSICKFFCHAGIPLQAADSVYFHKMLELAGQYGQGLACPSSQLISGRFLQEEI
Sbjct: 127 EITKEVFSSICKFFCHAGIPLQAADSVYFHKMLELAGQYGQGLACPSSQLISGRFLQEEI 306
Query: 361 NSIKNYLAEYKASWAI 376
NSIKNYLAEYKASWAI
Sbjct: 307 NSIKNYLAEYKASWAI 354
Score = 122 bits (305), Expect = 7e-28
Identities = 69/137 (50%), Positives = 79/137 (57%)
Frame = +2
Query: 259 SSNTSPEPALRRSRLDSFYLKHPTNQNLQTCKQLKVKTGPTKKLRKEVFSSICKFFCHAG 318
SSNTSPEPALRRSRLDSFYLKHPTNQNLQTCKQLKVKTGPTKKL ++ F F
Sbjct: 2 SSNTSPEPALRRSRLDSFYLKHPTNQNLQTCKQLKVKTGPTKKLPRKFFLQFASSFAMQE 181
Query: 319 IPLQAADSVYFHKMLELAGQYGQGLACPSSQLISGRFLQEEINSIKNYLAEYKASWAITG 378
+ F K + + L F + + ++ L + T
Sbjct: 182 FLYKLQTLYIFIKCWNWLVNMDKDWHAHPANLFRVAFCRRKSIPLRITLLNIRLPGQFTC 361
Query: 379 CSIMADSWRDAQGRTII 395
CSIMADSWRDAQGRTII
Sbjct: 362 CSIMADSWRDAQGRTII 412
>TC90728 similar to GP|7715599|gb|AAF68117.1| F20B17.17 {Arabidopsis
thaliana}, partial (58%)
Length = 1171
Score = 141 bits (356), Expect = 9e-34
Identities = 89/298 (29%), Positives = 159/298 (52%), Gaps = 2/298 (0%)
Frame = +3
Query: 531 SSFATLQNLLDHRVSLRRMFLSNKW-MSSRFSSSSQGKEVQKIVLNVTFWKKMQSVRNSV 589
S+F +LQ +L R L+ MF S ++ + + +++ Q I + FW+ ++
Sbjct: 12 STFLSLQTMLKLRTRLKHMFHSPEYALDTSYANKPQSLSCIAIAEDGDFWRTVEECVAIS 191
Query: 590 YPILQVFQKVSSGESLSMPYIYNDLYRAKLAIKSIHGDDARKYEPFWKVIDRHCNSLFCH 649
P L+V ++VS G+ ++ IY + RAK +I++ + D K + F ++D+
Sbjct: 192 EPFLKVLREVSEGKP-TVGSIYELMTRAKESIRTYYIMDENKCKTFLDIVDKKWRDQLHS 368
Query: 650 PLYLAAYFLNPSYRYRQDFVSHSDVVRGLNECIVRL-ELDNMRRISASMQIPHYNSAQDD 708
PL+ AA FLNPS +Y + S + + +L + +MRR + QI + A
Sbjct: 369 PLHAAAAFLNPSIQYNPEIKFLSSIKEDFYHVLEKLLPVPDMRR-DITNQIYTFTKAHGM 545
Query: 709 FGTELAISTRTGLEPAAWWQQHGISCLELQRIAVRILSQTCSSFACEHDGSMYDQIYSKR 768
FG L R + P WW+Q+G S LQR+A+RILSQ CS+F+ + S + QI+S++
Sbjct: 546 FGCSLTKEARNTVAPWLWWEQYGDSAPGLQRVAIRILSQVCSTFSFQRQWSTFRQIHSEK 725
Query: 769 KNRLSQKKLNDIMYVHYNLRLRECQVRKRSRESKSTSAENVLQEHLLGDWIVDTTAQS 826
KN++ ++ LND++Y++YNL+L Q+ +S E +++ + +W+ + S
Sbjct: 726 KNKIDRETLNDLVYINYNLKLNR-QMSAKSLEVDLLQFDDI---DMTSEWVEENETVS 887
>CB892335 similar to GP|7715599|gb| F20B17.17 {Arabidopsis thaliana}, partial
(27%)
Length = 788
Score = 112 bits (281), Expect = 4e-25
Identities = 58/176 (32%), Positives = 97/176 (54%)
Frame = +2
Query: 309 SICKFFCHAGIPLQAADSVYFHKMLELAGQYGQGLACPSSQLISGRFLQEEINSIKNYLA 368
SI FF + A S + M++ + G G PS++++ +L+ + +
Sbjct: 257 SIALFFFENKLDFSVARSSSYQLMIDAITKCGPGFTGPSAEILKTIWLERIKSEVGLQSK 436
Query: 369 EYKASWAITGCSIMADSWRDAQGRTIINFLVSSPHGVYFVSSVDATNVVEDATYLFKLLD 428
+ + WA TGC+I+AD+W D + + IINFLVSSP ++F SVDA+ ++ +L L D
Sbjct: 437 DVEKEWATTGCTIIADTWTDYKSKAIINFLVSSPSRIFFHKSVDASAYFKNTKWLADLFD 616
Query: 429 KVVEELGEENVVQVITENTPNYKAAGKMLEERRRNLFWTPCAIYCINQVLEDFLKI 484
V++E G ENVVQ+I +++ NY G + + +F +PCA +N + D K+
Sbjct: 617 SVIQEFGPENVVQIIMDSSFNYTGIGNHIVQNYGTIFVSPCASQWLNLIFGDSPKM 784
>TC82359 weakly similar to GP|22655127|gb|AAM98154.1 putative protein
{Arabidopsis thaliana}, partial (14%)
Length = 907
Score = 107 bits (268), Expect(2) = 8e-24
Identities = 58/166 (34%), Positives = 91/166 (53%)
Frame = +3
Query: 654 AAYFLNPSYRYRQDFVSHSDVVRGLNECIVRLELDNMRRISASMQIPHYNSAQDDFGTEL 713
A ++LNP + Y +++ G+ +CI RL D + S ++ Y SA DFG ++
Sbjct: 57 AGFYLNPKFFYSIQGDVPNEIRSGMLDCIERLVPDTRVQDKISKELNLYKSAAGDFGRKM 236
Query: 714 AISTRTGLEPAAWWQQHGISCLELQRIAVRILSQTCSSFACEHDGSMYDQIYSKRKNRLS 773
AI R L P+ WW +G C L R+A+RILSQT S C+ + ++QI + R N +
Sbjct: 237 AIRARDNLLPSEWWSTYGGGCPNLSRLAIRILSQTSSVMFCKRNQIPFEQIINTR-NYIE 413
Query: 774 QKKLNDIMYVHYNLRLRECQVRKRSRESKSTSAENVLQEHLLGDWI 819
++ D+++VHYNLRLR+ + K S S +N+ + DWI
Sbjct: 414 RQHFTDLVFVHYNLRLRQMFMNKEQESSDPLSFDNICN---VEDWI 542
Score = 21.6 bits (44), Expect(2) = 8e-24
Identities = 7/17 (41%), Positives = 12/17 (70%)
Frame = +2
Query: 636 WKVIDRHCNSLFCHPLY 652
W +I + +SL+ HPL+
Sbjct: 2 WNIIHQRWDSLWHHPLH 52
>AL370071 similar to GP|15795135|dbj transposase-like protein {Arabidopsis
thaliana}, partial (3%)
Length = 370
Score = 63.5 bits (153), Expect = 3e-10
Identities = 32/91 (35%), Positives = 43/91 (47%)
Frame = +3
Query: 637 KVIDRHCNSLFCHPLYLAAYFLNPSYRYRQDFVSHSDVVRGLNECIVRLELDNMRRISAS 696
K+ID + F PL+ A YFLN Y Y F V RGL CI R+ D+ R
Sbjct: 60 KIIDERWDKQFHSPLHAAGYFLNAQYHYSPGFRDDVKVKRGLQHCITRMVTDHEERSKIE 239
Query: 697 MQIPHYNSAQDDFGTELAISTRTGLEPAAWW 727
+Q+ ++ + FG +AI T P WW
Sbjct: 240 IQLDDFDKQANQFGHPIAIITADMEIPPIWW 332
>TC82286 weakly similar to PIR|B96544|B96544 hypothetical protein F4M15.4
[imported] - Arabidopsis thaliana, partial (5%)
Length = 1782
Score = 35.8 bits (81), Expect = 0.070
Identities = 27/105 (25%), Positives = 43/105 (40%), Gaps = 1/105 (0%)
Frame = +2
Query: 133 DPGWEHGVAQDERK-KKVKCSYCEKVVSGGINRFKQHLARIPGEVAPCKSAPEEVYLKIK 191
D GW++ +++ KKV C YC +GGI+R KQH + + + +
Sbjct: 200 DIGWKYNSLKNKSNIKKVTCDYCLLESTGGISRAKQHQGERGKQGEQVSKGKGKKVVIVD 379
Query: 192 ENMKWHRTGKRHRQPEAKDLMPFYPKSDNEDDEYEQQEDTLHHMN 236
E T + E E++E E++ED HMN
Sbjct: 380 EYDPEFITDWGDEEKE------------EEEEENEEKEDEAEHMN 478
Score = 33.1 bits (74), Expect = 0.45
Identities = 16/37 (43%), Positives = 22/37 (59%), Gaps = 1/37 (2%)
Frame = +2
Query: 11 DPGWDHGIAQDERK-KKVRCNYCGKVVSGGIYRLKQH 46
D GW + +++ KKV C+YC +GGI R KQH
Sbjct: 200 DIGWKYNSLKNKSNIKKVTCDYCLLESTGGISRAKQH 310
>BF004402 similar to GP|7268572|emb| predicted protein of unknown function
{Arabidopsis thaliana}, partial (14%)
Length = 438
Score = 35.4 bits (80), Expect = 0.091
Identities = 25/92 (27%), Positives = 37/92 (40%), Gaps = 1/92 (1%)
Frame = +1
Query: 150 KCSYCEKVVSGGINRFKQHLARIPGEVAPCKSAPEEVYLKIKENMKWH-RTGKRHRQPEA 208
K +C + G H +R GE C P L+ +N+ WH + +Q ++
Sbjct: 85 KNRFCANTLLSGSLFCGNHNSRAEGEWIQCPIDPSHSVLE--QNLNWHVKRCPLLKQVQS 258
Query: 209 KDLMPFYPKSDNEDDEYEQQEDTLHHMNKEAL 240
PFY K N + EQQE+ N L
Sbjct: 259 LSDQPFYKKGINAGSDGEQQEEETSGFNDSKL 354
>TC89058 similar to GP|19071218|gb|AAL84162.1 putative heavy metal
transporter {Arabidopsis thaliana}, partial (10%)
Length = 1618
Score = 33.5 bits (75), Expect = 0.35
Identities = 21/64 (32%), Positives = 32/64 (49%), Gaps = 2/64 (3%)
Frame = -1
Query: 11 DPGWDHGIAQDERKKKV--RCNYCGKVVSGGIYRLKQHLARVSGEVTYCEKAPEEVYLKM 68
D G DH DER +++ ++CG V IY H + EVT C K P +++ +
Sbjct: 472 DGGDDH---DDERSRRLCRHIHFCGHDVDLTIYNFFLHACALLAEVTCCLK*PTDMFQRQ 302
Query: 69 KENL 72
+E L
Sbjct: 301 QELL 290
>TC90414
Length = 621
Score = 32.3 bits (72), Expect = 0.77
Identities = 37/137 (27%), Positives = 56/137 (40%), Gaps = 5/137 (3%)
Frame = +1
Query: 61 PEEVYLKMKENLEGCRSNKKQKQVDAQAYMNFQSNDDEDDEEQVGCRSKGKQLMDGRNVS 120
P+ YLKM + + Q D Q+ DE + E S+ + + +VS
Sbjct: 214 PQSFYLKMA-------NEEVQSPNDVSPTQQTQTALDETNVE-----SQEPEAVREPSVS 357
Query: 121 VNLTPLRSLGYVDPGWEHGVAQD-ERKKKVKCSYCEKVVSG----GINRFKQHLARIPGE 175
N L+S+ W H Q + K K C YC+K + G G + K HL+
Sbjct: 358 PNKRGLKSVY-----WRHYKRQKFDGKFKAICKYCDKKLGGETTNGTSHLKDHLSIC--- 513
Query: 176 VAPCKSAPEEVYLKIKE 192
A K +P + LK+ E
Sbjct: 514 AARNKRSPMQALLKVSE 564
>BQ751257
Length = 707
Score = 32.3 bits (72), Expect = 0.77
Identities = 17/26 (65%), Positives = 18/26 (68%)
Frame = +2
Query: 540 LDHRVSLRRMFLSNKWMSSRFSSSSQ 565
L H SLRR+FLSN M S SSSSQ
Sbjct: 284 LSHIPSLRRIFLSNAMMRSSISSSSQ 361
>CA857944
Length = 796
Score = 32.0 bits (71), Expect = 1.0
Identities = 18/42 (42%), Positives = 24/42 (56%)
Frame = -2
Query: 38 GGIYRLKQHLARVSGEVTYCEKAPEEVYLKMKENLEGCRSNK 79
GGI K HL RV G +T+C + EE+ K+K N R N+
Sbjct: 690 GGI*SKKIHLKRVMGIITHC-NSDEEISRKIKHNR*TARKNR 568
>CB892426 similar to PIR|T05689|T0 hypothetical protein F20M13.170 -
Arabidopsis thaliana, partial (15%)
Length = 580
Score = 30.4 bits (67), Expect = 2.9
Identities = 30/95 (31%), Positives = 42/95 (43%), Gaps = 8/95 (8%)
Frame = +1
Query: 257 GMSSNTSPEPALRRSRLDSFYLKHPTNQNLQTCKQ----LKVKT----GPTKKLRKEVFS 308
G SSN S PAL R F + + TN+ L Q L V T GP K + S
Sbjct: 106 GDSSNDSVSPALSRPPEQIFEIVNLTNELLPPLPQGTISLPVSTNFVKGPVVK-KSPAGS 282
Query: 309 SICKFFCHAGIPLQAADSVYFHKMLELAGQYGQGL 343
S+ + + +P +A ++ EL GQ+G L
Sbjct: 283 SVQQEDTNGNVPEISAREKLLNEQPELLGQFGMDL 387
>TC90737
Length = 760
Score = 29.6 bits (65), Expect = 5.0
Identities = 13/25 (52%), Positives = 16/25 (64%), Gaps = 2/25 (8%)
Frame = +2
Query: 194 MKWHRTGKRHRQ--PEAKDLMPFYP 216
+KW R+ KRHRQ P + M FYP
Sbjct: 113 LKWWRSAKRHRQNKPNLVNAMSFYP 187
>TC90610 similar to GP|18766642|gb|AAL79042.1 NIMA-related protein kinase
{Populus x canescens}, partial (46%)
Length = 1507
Score = 28.9 bits (63), Expect = 8.5
Identities = 11/30 (36%), Positives = 18/30 (59%), Gaps = 1/30 (3%)
Frame = +3
Query: 358 EEINSIKN-YLAEYKASWAITGCSIMADSW 386
E I+ ++N ++ EYK SW GC ++ W
Sbjct: 414 ELISKVRNPFIVEYKDSWVEKGCFVLYSHW 503
>AW688516 PIR|C96516|C965 F16N3.15 [imported] - Arabidopsis thaliana, partial
(1%)
Length = 591
Score = 28.9 bits (63), Expect = 8.5
Identities = 22/71 (30%), Positives = 38/71 (52%), Gaps = 3/71 (4%)
Frame = +3
Query: 567 KEVQKIVLNVTFWKKMQSVRNS-VYPILQVFQKVSSGESLSMPYIYNDLYRAKLAIK--S 623
KE QKI+++ W+K++ V+N+ + P Q ++ SG+SL L R L ++
Sbjct: 99 KERQKILIDFDQWEKVEKVQNALLQPQPQPNPQMKSGQSLF------SLTRDTLMLR*GQ 260
Query: 624 IHGDDARKYEP 634
I R+Y+P
Sbjct: 261 IKAKYVRRYQP 293
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.318 0.133 0.399
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 27,400,451
Number of Sequences: 36976
Number of extensions: 394907
Number of successful extensions: 2113
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 2056
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2105
length of query: 835
length of database: 9,014,727
effective HSP length: 104
effective length of query: 731
effective length of database: 5,169,223
effective search space: 3778702013
effective search space used: 3778702013
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)
Medicago: description of AC138579.4