
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148241.11 + phase: 0
(390 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC79182 similar to GP|15028159|gb|AAK76703.1 unknown protein {Ar... 401 e-112
TC79183 similar to GP|15028159|gb|AAK76703.1 unknown protein {Ar... 142 3e-34
TC83098 weakly similar to PIR|T04894|T04894 hypothetical protein... 105 3e-23
BI268769 similar to GP|15028159|gb| unknown protein {Arabidopsis... 80 7e-17
CA917046 similar to PIR|T04894|T048 hypothetical protein F18F4.2... 57 9e-14
TC89235 similar to SP|P57758|CTNS_ARATH Cystinosin homolog. [Mou... 35 0.038
TC83511 similar to GP|18252961|gb|AAL62407.1 unknown protein {Ar... 35 0.050
TC89374 similar to PIR|T05879|T05879 hypothetical protein T29A15... 32 0.55
AJ500787 31 0.72
BG644368 similar to GP|12597768|g DNA ligase I putative {Arabid... 30 1.2
BQ158052 30 2.1
TC79820 similar to GP|21537325|gb|AAM61666.1 unknown {Arabidopsi... 30 2.1
AW682928 similar to GP|22507458|gb| Unknown (protein for MGC:303... 29 2.7
TC82244 similar to GP|22507458|gb|AAH19429.1 Unknown (protein fo... 29 2.7
TC85514 homologue to SP|P32869|PSAD_CUCSA Photosystem I reaction... 29 3.6
BI309637 similar to PIR|D86241|D86 protein T16B5.8 [imported] - ... 28 4.6
NP496346 NP496346|AJ418371.1|CAD10810.1 nod region linked recept... 28 6.1
TC77448 SYMRK; MtSYMRK [Medicago truncatula]; nod region linked ... 28 6.1
TC83218 28 7.9
BF643248 weakly similar to PIR|D96595|D96 probable acetyl-CoA sy... 28 7.9
>TC79182 similar to GP|15028159|gb|AAK76703.1 unknown protein {Arabidopsis
thaliana}, partial (61%)
Length = 1205
Score = 401 bits (1030), Expect = e-112
Identities = 217/360 (60%), Positives = 255/360 (70%), Gaps = 1/360 (0%)
Frame = +2
Query: 30 DDISFSLGLMSLVSWGVAEIPQIITIFRNKSSHGISLAFLLTWVAGDICNLVGCLLEPAT 89
D+ISF+ G +SL+ WGVAEIPQIIT FR KSSHG+S+ FLLTWVAGDI NLVGCLLEPAT
Sbjct: 2 DNISFTFGFISLICWGVAEIPQIITNFRAKSSHGVSIVFLLTWVAGDIFNLVGCLLEPAT 181
Query: 90 LPTQFYTALTVINCSFMQALQSYYYCKLYTMITSSDGANIVRMLRVNCYFYNCIILQDNE 149
LPTQ+YTAL + + +QS+YY +Y NI + E
Sbjct: 182 LPTQYYTALLYTITTIVLVVQSFYYDYIYKWCKRRQKINIE---------------ETYE 316
Query: 150 EEKRPLNPKPSQVYSGIAIPNGTQKEAARGEYYYMSARSLAGSATPPSFTHLRAAKSGPS 209
EEK+PL PK + GI I +G + + EYYY SARSLAG+ TPPS T++R AKSGPS
Sbjct: 317 EEKKPLKPK-ERFELGIPIRSGRHRAIPKPEYYYGSARSLAGNVTPPSRTYMRVAKSGPS 493
Query: 210 ALEFIHDSSDDDEASQVTSNISTTKPWSIPRSVDGRYGTFLATAINLPLKGNSMRYGYIG 269
A+ DSS DDEA V + T+P IPRS G YGTFLA +INLP + N+++ GYI
Sbjct: 494 AMGLNEDSSSDDEAHSVPA----TQPRQIPRSA-GSYGTFLAASINLPHQSNALKVGYIA 658
Query: 270 FTGIKLLKVYVVVHSTYGQYLGWIMAAIYTCSRIPQIWLNIKRGSVEGLNPFMFVFALIA 329
+G KLL V HS GQ+LGW+MAAIYT RIPQIWLNIKRGSVEGLNPFMF+FALIA
Sbjct: 659 LSGRKLLSQEHVTHSALGQWLGWLMAAIYTGGRIPQIWLNIKRGSVEGLNPFMFIFALIA 838
Query: 330 NTSYVGSILVRTTEFESIKANLPWLLDATVCVALDFFIISQYIYYRYFR-SSESSDDGEY 388
N +YVGSILVRTTE+ESIKAN+PWLLDA VCVALD FII QYI YRY R ++ SSD G Y
Sbjct: 839 NATYVGSILVRTTEWESIKANMPWLLDAIVCVALDLFIILQYINYRYHRKTTTSSDYGNY 1018
>TC79183 similar to GP|15028159|gb|AAK76703.1 unknown protein {Arabidopsis
thaliana}, partial (23%)
Length = 931
Score = 142 bits (357), Expect = 3e-34
Identities = 98/253 (38%), Positives = 127/253 (49%), Gaps = 46/253 (18%)
Frame = +2
Query: 93 QFYTALTVINCSFMQALQSYYYCKLYTMITSSDGANIVRMLRVN-----------CYFYN 141
Q YT T++ +QS+YY +Y NI +L +N + +
Sbjct: 140 QLYTITTIV-----LVVQSFYYDYIYKWCKRRQKINIEEVLSLNQI*EEGKYNDTLFLLS 304
Query: 142 CIILQ-----DNEEEKRPLNPKPSQVYSGIAIPNGTQKEAARGEYYY------------- 183
II+ EEEK+PL PK + GI I +G + + EYYY
Sbjct: 305 YIIIVML***TYEEEKKPLKPK-ERFELGIPIRSGRHRAIPKPEYYYG*VLIFN*IVN** 481
Query: 184 -----------------MSARSLAGSATPPSFTHLRAAKSGPSALEFIHDSSDDDEASQV 226
SARSLAG+ TPPS T++R AKSGPSA+ DSS DDEA V
Sbjct: 482 IKQSSELMSFELID*VNRSARSLAGNVTPPSRTYMRVAKSGPSAMGLNEDSSSDDEAHSV 661
Query: 227 TSNISTTKPWSIPRSVDGRYGTFLATAINLPLKGNSMRYGYIGFTGIKLLKVYVVVHSTY 286
+ T+P IPRS G YGTFLA +INLP + N+++ GYI +G KLL V HS
Sbjct: 662 PA----TQPRQIPRSA-GSYGTFLAASINLPHQSNALKVGYIALSGRKLLSQEHVTHSAL 826
Query: 287 GQYLGWIMAAIYT 299
GQ+LGW+MAAIYT
Sbjct: 827 GQWLGWLMAAIYT 865
>TC83098 weakly similar to PIR|T04894|T04894 hypothetical protein F18F4.200
- Arabidopsis thaliana, partial (29%)
Length = 467
Score = 105 bits (262), Expect = 3e-23
Identities = 48/86 (55%), Positives = 64/86 (73%)
Frame = +2
Query: 33 SFSLGLMSLVSWGVAEIPQIITIFRNKSSHGISLAFLLTWVAGDICNLVGCLLEPATLPT 92
S +LG++S++ W +AEIPQ+IT +R KSSHG+S+ FLLTW+ GD+ NL GCLLEPATLPT
Sbjct: 170 SITLGVISVIVWMIAEIPQLITNYREKSSHGLSVTFLLTWIIGDLFNLFGCLLEPATLPT 349
Query: 93 QFYTALTVINCSFMQALQSYYYCKLY 118
Q YTA+ + LQ+ YY +Y
Sbjct: 350 QLYTAVLYTLITLTLCLQATYYGHIY 427
>BI268769 similar to GP|15028159|gb| unknown protein {Arabidopsis thaliana},
partial (13%)
Length = 554
Score = 79.7 bits (195), Expect(2) = 7e-17
Identities = 41/54 (75%), Positives = 45/54 (82%), Gaps = 1/54 (1%)
Frame = +3
Query: 336 SILVRTTEFESIKANLPWLLDATVCVALDFFIISQYIYYRYFR-SSESSDDGEY 388
SILVRTTE+ESIKAN+PWLLDA VCVALD FII QYI YRY R ++ SSD G Y
Sbjct: 222 SILVRTTEWESIKANMPWLLDAIVCVALDLFIILQYINYRYHRKTTTSSDYGNY 383
Score = 25.0 bits (53), Expect(2) = 7e-17
Identities = 9/10 (90%), Positives = 10/10 (100%)
Frame = +2
Query: 317 GLNPFMFVFA 326
GLNPFMF+FA
Sbjct: 62 GLNPFMFIFA 91
>CA917046 similar to PIR|T04894|T048 hypothetical protein F18F4.200 -
Arabidopsis thaliana, partial (19%)
Length = 663
Score = 56.6 bits (135), Expect(3) = 9e-14
Identities = 24/42 (57%), Positives = 33/42 (78%)
Frame = +1
Query: 316 EGLNPFMFVFALIANTSYVGSILVRTTEFESIKANLPWLLDA 357
EG+NP MF+FALI NT+YV SILV + ++ + NLPWL+D+
Sbjct: 538 EGVNPLMFLFALIGNTTYVASILVSSMDWSKLGPNLPWLVDS 663
Score = 33.1 bits (74), Expect(3) = 9e-14
Identities = 40/150 (26%), Positives = 55/150 (36%), Gaps = 19/150 (12%)
Frame = +2
Query: 168 IPNGTQKEAARGEYYYMSARSLAGSATPPSFTHLRAAKS-----------GPSALEFIHD 216
IP QK + YY SAR L+ S TP S R S PS
Sbjct: 65 IPFPAQKSHVETQSYYQSARYLSKSHTPKSELAQRMPSSLILDPIEEPLLVPSVFTKSAP 244
Query: 217 SSDDDEASQVTSNISTTKPWSIPRSVDGRYGTFLATAINLPLKGNSMRYGYIGFTGIKLL 276
S + S ++ ++ S D R + +A R ++ + G KLL
Sbjct: 245 SLKIKNTLCLVSTLTFLGALNLLHSPDTRIHSDVAKP----------RKEFVIYVGRKLL 394
Query: 277 KVY--------VVVHSTYGQYLGWIMAAIY 298
+V V + + G YLGW MA IY
Sbjct: 395 QVSGHKLSDQGVEAYHSIGTYLGWAMAVIY 484
Score = 23.9 bits (50), Expect(3) = 9e-14
Identities = 10/16 (62%), Positives = 13/16 (80%)
Frame = +3
Query: 302 RIPQIWLNIKRGSVEG 317
R+PQI LNI+RG+ G
Sbjct: 495 RLPQICLNIRRGNF*G 542
>TC89235 similar to SP|P57758|CTNS_ARATH Cystinosin homolog. [Mouse-ear
cress] {Arabidopsis thaliana}, partial (92%)
Length = 1103
Score = 35.4 bits (80), Expect = 0.038
Identities = 20/55 (36%), Positives = 31/55 (56%)
Frame = +1
Query: 280 VVVHSTYGQYLGWIMAAIYTCSRIPQIWLNIKRGSVEGLNPFMFVFALIANTSYV 334
V + TY + LGW +++ S PQ+ LN +R SV GLN + L ++SY+
Sbjct: 40 VSLEVTY-EVLGWFAFIVWSISFYPQVILNFRRKSVVGLNFDFVLLNLTKHSSYL 201
Score = 31.2 bits (69), Expect = 0.72
Identities = 13/35 (37%), Positives = 21/35 (59%)
Frame = +1
Query: 36 LGLMSLVSWGVAEIPQIITIFRNKSSHGISLAFLL 70
LG + + W ++ PQ+I FR KS G++ F+L
Sbjct: 67 LGWFAFIVWSISFYPQVILNFRRKSVVGLNFDFVL 171
>TC83511 similar to GP|18252961|gb|AAL62407.1 unknown protein {Arabidopsis
thaliana}, complete
Length = 1014
Score = 35.0 bits (79), Expect = 0.050
Identities = 27/82 (32%), Positives = 41/82 (49%), Gaps = 2/82 (2%)
Frame = +1
Query: 296 AIYTCSRIPQIWLNIKRGSVEGLNPFMFVFALIANTSYVGSILVRTTEFESIKANLP--W 353
AI+ C+RIPQI+ N S L + + + G +VR F +I+ N P
Sbjct: 505 AIFLCARIPQIFQNFSNKSTGEL-------SFLTSFMNFGGSMVRV--FTTIQENAPKSV 657
Query: 354 LLDATVCVALDFFIISQYIYYR 375
LL + VA +F I+SQ + Y+
Sbjct: 658 LLGYGIGVATNFTILSQIVIYQ 723
>TC89374 similar to PIR|T05879|T05879 hypothetical protein T29A15.230 -
Arabidopsis thaliana, partial (71%)
Length = 645
Score = 31.6 bits (70), Expect = 0.55
Identities = 12/25 (48%), Positives = 17/25 (68%)
Frame = -2
Query: 80 LVGCLLEPATLPTQFYTALTVINCS 104
L+GCL+ LPTQ +T +T +N S
Sbjct: 386 LLGCLIRSFVLPTQHFTTITAVNIS 312
>AJ500787
Length = 566
Score = 31.2 bits (69), Expect = 0.72
Identities = 16/42 (38%), Positives = 21/42 (49%)
Frame = +3
Query: 35 SLGLMSLVSWGVAEIPQIITIFRNKSSHGISLAFLLTWVAGD 76
+LG +SL +PQ F+ S G S +LTWV GD
Sbjct: 210 TLGYLSLGIESTVPMPQAYQNFKRHSVSGFSKWIILTWVGGD 335
>BG644368 similar to GP|12597768|g DNA ligase I putative {Arabidopsis
thaliana}, partial (12%)
Length = 641
Score = 30.4 bits (67), Expect = 1.2
Identities = 13/25 (52%), Positives = 16/25 (64%)
Frame = +3
Query: 298 YTCSRIPQIWLNIKRGSVEGLNPFM 322
YT S+ WL +KR VEGLN F+
Sbjct: 567 YTPSKRSDAWLKVKRDYVEGLNDFL 641
>BQ158052
Length = 983
Score = 29.6 bits (65), Expect = 2.1
Identities = 20/84 (23%), Positives = 33/84 (38%)
Frame = +2
Query: 59 KSSHGISLAFLLTWVAGDICNLVGCLLEPATLPTQFYTALTVINCSFMQALQSYYYCKLY 118
K+SH S FLL WV C P F + +N + + + + Y + +
Sbjct: 11 KNSHSGSFLFLLVWV---------CCTGPDMYLILFPRKVPYVNSTMITCMPTSYQNRTF 163
Query: 119 TMITSSDGANIVRMLRVNCYFYNC 142
IVR + +N + +NC
Sbjct: 164 VF--------IVRFILINLFCFNC 211
>TC79820 similar to GP|21537325|gb|AAM61666.1 unknown {Arabidopsis
thaliana}, partial (89%)
Length = 1225
Score = 29.6 bits (65), Expect = 2.1
Identities = 18/56 (32%), Positives = 28/56 (49%), Gaps = 9/56 (16%)
Frame = +3
Query: 80 LVGCLLEPATLPTQFYTALTVIN-----CSFMQALQSYYYCK----LYTMITSSDG 126
L+G LLEP P +F + ++N C F+ A+ YY + LYT ++ G
Sbjct: 375 LIGKLLEPVWGPREFIKFIFIVNILTSLCIFITAIALYYITRQEIYLYTPLSGFHG 542
>AW682928 similar to GP|22507458|gb| Unknown (protein for MGC:30371) {Mus
musculus}, partial (13%)
Length = 641
Score = 29.3 bits (64), Expect = 2.7
Identities = 24/68 (35%), Positives = 31/68 (45%), Gaps = 5/68 (7%)
Frame = +2
Query: 200 HLRAAKSGPSALEFIHDSSDDDEASQ-VTSNISTTKPWS----IPRSVDGRYGTFLATAI 254
H R +A I SSD D A V S ISTT PW+ + T +A+A
Sbjct: 125 HYRRTAEAAAAASSIRYSSDFDIAPPGVASRISTTNPWANAYPSAAAAVAAAATNVASAA 304
Query: 255 NLPLKGNS 262
+L LK +S
Sbjct: 305 SLDLKRSS 328
>TC82244 similar to GP|22507458|gb|AAH19429.1 Unknown (protein for
MGC:30371) {Mus musculus}, partial (13%)
Length = 847
Score = 29.3 bits (64), Expect = 2.7
Identities = 24/68 (35%), Positives = 31/68 (45%), Gaps = 5/68 (7%)
Frame = +3
Query: 200 HLRAAKSGPSALEFIHDSSDDDEASQ-VTSNISTTKPWS----IPRSVDGRYGTFLATAI 254
H R +A I SSD D A V S ISTT PW+ + T +A+A
Sbjct: 360 HYRRTAEAAAAASSIRYSSDFDIAPPGVASRISTTNPWANAFPSAAAAVAAAATNVASAA 539
Query: 255 NLPLKGNS 262
+L LK +S
Sbjct: 540 SLDLKRSS 563
>TC85514 homologue to SP|P32869|PSAD_CUCSA Photosystem I reaction center
subunit II chloroplast precursor (Photosystem I 20 kDa
subunit), partial (77%)
Length = 870
Score = 28.9 bits (63), Expect = 3.6
Identities = 14/46 (30%), Positives = 24/46 (51%)
Frame = +3
Query: 197 SFTHLRAAKSGPSALEFIHDSSDDDEASQVTSNISTTKPWSIPRSV 242
+ TH S S+ FI++ + +AS T +S+ KPW P ++
Sbjct: 15 TLTHTNKHHSSSSSSTFINNMAMATQASLFTPPLSSPKPWKQPSTL 152
>BI309637 similar to PIR|D86241|D86 protein T16B5.8 [imported] - Arabidopsis
thaliana, partial (28%)
Length = 714
Score = 28.5 bits (62), Expect = 4.6
Identities = 20/69 (28%), Positives = 28/69 (39%)
Frame = +2
Query: 181 YYYMSARSLAGSATPPSFTHLRAAKSGPSALEFIHDSSDDDEASQVTSNISTTKPWSIPR 240
Y S SLAG PS +HL E IH+S + N+ T K W +
Sbjct: 311 YGLASWLSLAG----PSLSHLELRMDNLGDNEIIHESPSKLDCIGAAVNVETLKLWGVLI 478
Query: 241 SVDGRYGTF 249
+ ++ TF
Sbjct: 479 KLIPKWETF 505
>NP496346 NP496346|AJ418371.1|CAD10810.1 nod region linked receptor kinase
[Medicago truncatula]
Length = 2706
Score = 28.1 bits (61), Expect = 6.1
Identities = 15/41 (36%), Positives = 22/41 (53%)
Frame = +1
Query: 182 YYMSARSLAGSATPPSFTHLRAAKSGPSALEFIHDSSDDDE 222
+ +S L S TPP L+ A + P LEF+HD + D+
Sbjct: 682 FNVSNVDLKDSVTPP-LQVLQTALTHPERLEFVHDGLETDD 801
>TC77448 SYMRK; MtSYMRK [Medicago truncatula]; nod region linked receptor
kinase [Medicago truncatula]
Length = 3568
Score = 28.1 bits (61), Expect = 6.1
Identities = 15/41 (36%), Positives = 22/41 (53%)
Frame = +1
Query: 182 YYMSARSLAGSATPPSFTHLRAAKSGPSALEFIHDSSDDDE 222
+ +S L S TPP L+ A + P LEF+HD + D+
Sbjct: 1057 FNVSNVDLKDSVTPP-LQVLQTALTHPERLEFVHDGLETDD 1176
>TC83218
Length = 1170
Score = 27.7 bits (60), Expect = 7.9
Identities = 18/54 (33%), Positives = 24/54 (44%)
Frame = +2
Query: 60 SSHGISLAFLLTWVAGDICNLVGCLLEPATLPTQFYTALTVINCSFMQALQSYY 113
S H FLLT C L C+ E +TLP + L+ + SF+ A Y
Sbjct: 512 SQHRTKRCFLLT*HLEAACLLRSCVFEASTLPYNSISDLSWSSRSFLPAYAPAY 673
>BF643248 weakly similar to PIR|D96595|D96 probable acetyl-CoA synthetase
45051-31547 [imported] - Arabidopsis thaliana, partial
(2%)
Length = 690
Score = 27.7 bits (60), Expect = 7.9
Identities = 19/72 (26%), Positives = 36/72 (49%), Gaps = 1/72 (1%)
Frame = +2
Query: 31 DISFSLGLMSLVSWGVAEIPQ-IITIFRNKSSHGISLAFLLTWVAGDICNLVGCLLEPAT 89
D+ +SL L L S E P+ I+ + S HG W++G + N+ C L+P++
Sbjct: 446 DVYWSLVLKEL-SISFIEPPKCILDTSSDPSKHGGK------WLSGSVLNIADCCLQPSS 604
Query: 90 LPTQFYTALTVI 101
P + ++ ++
Sbjct: 605 HPNKPDDSIAIV 640
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.322 0.137 0.421
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,058,458
Number of Sequences: 36976
Number of extensions: 217323
Number of successful extensions: 1143
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 1124
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1137
length of query: 390
length of database: 9,014,727
effective HSP length: 98
effective length of query: 292
effective length of database: 5,391,079
effective search space: 1574195068
effective search space used: 1574195068
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (27.3 bits)
Medicago: description of AC148241.11