
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146940.1 - phase: 0
(285 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC80327 similar to GP|3540182|gb|AAC34332.1| Unknown protein {Ar... 453 e-128
TC83500 similar to PIR|T04231|T04231 hypothetical protein F14M19... 191 4e-49
TC80019 similar to PIR|G86446|G86446 unknown protein [imported] ... 158 2e-39
AJ501822 similar to GP|20258913|gb unknown protein {Arabidopsis ... 155 2e-38
BE248404 similar to GP|20260538|gb unknown protein {Arabidopsis ... 151 3e-37
BE316120 similar to GP|20258913|gb unknown protein {Arabidopsis ... 139 2e-33
TC83808 similar to PIR|T04231|T04231 hypothetical protein F14M19... 99 3e-33
TC82635 similar to GP|20260538|gb|AAM13167.1 unknown protein {Ar... 92 3e-19
TC91256 similar to GP|20258913|gb|AAM14150.1 unknown protein {Ar... 72 3e-13
BQ140879 similar to GP|20260538|gb unknown protein {Arabidopsis ... 48 4e-06
AW687325 similar to GP|21689821|gb unknown protein {Arabidopsis ... 33 0.096
TC93826 similar to PIR|H86366|H86366 protein F26F24.12 [imported... 32 0.36
TC88543 similar to GP|20466782|gb|AAM20708.1 unknown protein {Ar... 31 0.62
TC85802 weakly similar to PIR|T47837|T47837 beta-glucosidase-lik... 29 1.8
CB065387 weakly similar to GP|20257161|gb| 2-dehydro-3-deoxygala... 29 1.8
TC84460 similar to PIR|T47960|T47960 hypothetical protein F15G16... 28 4.0
TC88031 similar to PIR|E84706|E84706 hypothetical protein At2g30... 27 6.9
>TC80327 similar to GP|3540182|gb|AAC34332.1| Unknown protein {Arabidopsis
thaliana}, partial (34%)
Length = 822
Score = 453 bits (1166), Expect = e-128
Identities = 220/221 (99%), Positives = 220/221 (99%)
Frame = +1
Query: 1 MESQQIPKVQHSVVDVQINNKNIKKLKFPKFGCFRIQHDATGDGFDIEVVDASGHRSNPT 60
MESQQIPKVQHSVVDVQINNKNIKKLKFPKFGCFRIQHDATGDGFDIEVVDASGHRSNPT
Sbjct: 154 MESQQIPKVQHSVVDVQINNKNIKKLKFPKFGCFRIQHDATGDGFDIEVVDASGHRSNPT 333
Query: 61 HLIIMVNGLIGSAHNWKYAAKQFLKRYPYDVIVHCSECNSSTLTFDGVDVTGNRLAEEVI 120
HLIIMVNGLIGSAHNWKYAAKQFLKRYPYDVIVHCSECNSSTLTFDGVDVTGNRLAEEVI
Sbjct: 334 HLIIMVNGLIGSAHNWKYAAKQFLKRYPYDVIVHCSECNSSTLTFDGVDVTGNRLAEEVI 513
Query: 121 SVIKRHPSVRKISFIAHSLGGLIARYAIAKLYERDISKELSQGNVHCEGQISNQECHVRK 180
SVIKRHPSVRKISFIAHSLGGLIARYAIAKLYERDISKELSQGNVHCEGQISNQECHVRK
Sbjct: 514 SVIKRHPSVRKISFIAHSLGGLIARYAIAKLYERDISKELSQGNVHCEGQISNQECHVRK 693
Query: 181 YEGKIAGLEPINFITSATPHLGCRGHKQVPLLCGFHSLEKT 221
YEGKIAGLEPINFITSATPHLGCRGHKQVPL CGFHSLEKT
Sbjct: 694 YEGKIAGLEPINFITSATPHLGCRGHKQVPLXCGFHSLEKT 816
>TC83500 similar to PIR|T04231|T04231 hypothetical protein F14M19.50 -
Arabidopsis thaliana, partial (29%)
Length = 994
Score = 191 bits (484), Expect = 4e-49
Identities = 100/184 (54%), Positives = 131/184 (70%), Gaps = 6/184 (3%)
Frame = +2
Query: 106 DGVDVTGNRLAEEVISVIKRHPSVRKISFIAHSLGGLIARYAIAKLYERDISKELSQG-- 163
DGVD EEV+S+++ P ++KISF+AHSLGGL+ARYAIA+L+ D SK L G
Sbjct: 2 DGVDTWVRG*PEEVLSIVRCWPGLQKISFVAHSLGGLVARYAIARLF--DYSKTLEAGVT 175
Query: 164 --NVHCEGQIS-NQECHVRKYEGKIAGLEPINFITSATPHLGCRGHKQVPLLCGFHSLEK 220
N C+ + + C + YE +IAGLEP+NFIT ATPHLG RGH+Q+P LCG LE+
Sbjct: 176 CRNCDCKEEAECTKNCTEQHYEARIAGLEPMNFITFATPHLGSRGHRQLPFLCGIPFLER 355
Query: 221 TASRLSRFL-GKTGKHLFLTDGKNEKPPLLLQMVRDSEDIKFMSALRSFKRRVAYANIRY 279
AS+ + + G+TGKHLFL D + KPPLLL+M+ DS+D+KFMSAL FKRRVAYAN +
Sbjct: 356 RASQTAHLIVGRTGKHLFLMDNDDGKPPLLLRMIEDSDDLKFMSALCVFKRRVAYANANF 535
Query: 280 DRIL 283
D ++
Sbjct: 536 DHMV 547
>TC80019 similar to PIR|G86446|G86446 unknown protein [imported] -
Arabidopsis thaliana, partial (35%)
Length = 1518
Score = 158 bits (400), Expect = 2e-39
Identities = 77/85 (90%), Positives = 80/85 (93%)
Frame = +2
Query: 128 SVRKISFIAHSLGGLIARYAIAKLYERDISKELSQGNVHCEGQISNQECHVRKYEGKIAG 187
SV+KISFIAHSLGGLIARYAIAKLYERDISKELSQGNVH E QISNQECH+RKYEGKIAG
Sbjct: 674 SVQKISFIAHSLGGLIARYAIAKLYERDISKELSQGNVHSESQISNQECHIRKYEGKIAG 853
Query: 188 LEPINFITSATPHLGCRGHKQVPLL 212
LEPINFITS PHLGCRGHKQ+ LL
Sbjct: 854 LEPINFITSTMPHLGCRGHKQLILL 928
Score = 42.7 bits (99), Expect = 2e-04
Identities = 20/26 (76%), Positives = 21/26 (79%)
Frame = +3
Query: 209 VPLLCGFHSLEKTASRLSRFLGKTGK 234
VPL+CGF SLEKT SRLSRF GK K
Sbjct: 1167 VPLVCGFDSLEKTTSRLSRFFGKNIK 1244
>AJ501822 similar to GP|20258913|gb unknown protein {Arabidopsis thaliana},
partial (41%)
Length = 597
Score = 155 bits (391), Expect = 2e-38
Identities = 81/154 (52%), Positives = 107/154 (68%)
Frame = +3
Query: 53 SGHRSNPTHLIIMVNGLIGSAHNWKYAAKQFLKRYPYDVIVHCSECNSSTLTFDGVDVTG 112
S S+ HL++MVNG++GS+ +WK+A++QF+K P V VHCSE N S T DGVDV G
Sbjct: 162 SSDSSSADHLVVMVNGILGSSTDWKFASEQFVKELPDKVFVHCSERNVSKHTLDGVDVMG 341
Query: 113 NRLAEEVISVIKRHPSVRKISFIAHSLGGLIARYAIAKLYERDISKELSQGNVHCEGQIS 172
RLAEEVI VI+R P++RK+SFI+HS+GGL+ARYAI KLY R E Q + + E ++
Sbjct: 342 ERLAEEVIEVIRRKPNMRKVSFISHSVGGLVARYAIGKLY-RPPGNEPIQDSGNKESKVD 518
Query: 173 NQECHVRKYEGKIAGLEPINFITSATPHLGCRGH 206
+ G I GLE +NF+T ATPHLG RG+
Sbjct: 519 S--------IGTICGLEAMNFVTVATPHLGSRGN 596
>BE248404 similar to GP|20260538|gb unknown protein {Arabidopsis thaliana},
partial (32%)
Length = 571
Score = 151 bits (381), Expect = 3e-37
Identities = 87/190 (45%), Positives = 116/190 (60%), Gaps = 3/190 (1%)
Frame = +1
Query: 72 SAHNWKYAAKQFLKRYPYDVIVHCSECNSSTLTFDGVDVTGNRLAEEVISVIKRHPSVRK 131
S +W YA ++ ++H S N+ T TF G+D G RLA+EV+ V+K++ S+++
Sbjct: 10 SPGDWSYAEEELKMNLGKSFLIHASSSNAYTKTFTGIDEAGKRLADEVMQVVKKNQSLKR 189
Query: 132 ISFIAHSLGGLIARYAIAKLYERD-ISKELSQGNVHCEGQISNQECHVRKYEGKIAGLEP 190
ISF+AHSLGGL ARYAIA LY D + V+CE + S + R G IAGLEP
Sbjct: 190 ISFLAHSLGGLFARYAIAVLYSPDTYNSGQPDDPVNCEMENSQKTDFSR---GMIAGLEP 360
Query: 191 INFITSATPHLGCRGHKQVPLLCGFHSLEKTASRLS-RFLGKTGKHLFLTDGKNEKPPLL 249
+NFIT ATPHLG RG Q+P L G LEK + ++ F+G+TG LFLTD K KP L
Sbjct: 361 MNFITLATPHLGVRGKNQLPFLFGVPILEKLVAPVAPLFIGRTGSQLFLTDDKPNKPSLS 540
Query: 250 L-QMVRDSED 258
+ + D ED
Sbjct: 541 F*EWLSDCED 570
>BE316120 similar to GP|20258913|gb unknown protein {Arabidopsis thaliana},
partial (43%)
Length = 625
Score = 139 bits (349), Expect = 2e-33
Identities = 79/162 (48%), Positives = 105/162 (64%)
Frame = +2
Query: 53 SGHRSNPTHLIIMVNGLIGSAHNWKYAAKQFLKRYPYDVIVHCSECNSSTLTFDGVDVTG 112
S S+ HL++MVNG++GS+ +WK+A++QF+K P V VHCSE N S T DGVDV G
Sbjct: 158 SSDSSSADHLVVMVNGILGSSTDWKFASEQFVKELPDKVFVHCSERNVSKHTLDGVDVMG 337
Query: 113 NRLAEEVISVIKRHPSVRKISFIAHSLGGLIARYAIAKLYERDISKELSQGNVHCEGQIS 172
RLAEEVI VI+R P++RK+SFI+HS+GGL+ARYAI KLY R E Q + + E ++
Sbjct: 338 ERLAEEVIEVIRRKPNMRKVSFISHSVGGLVARYAIGKLY-RPPGNEPIQDSGNKESKVD 514
Query: 173 NQECHVRKYEGKIAGLEPINFITSATPHLGCRGHKQVPLLCG 214
+ +Y + G E N T LG G+KQVP L G
Sbjct: 515 S-----NRYNMRPGGNEFRN--CCYTSSLGQGGNKQVPFLFG 619
>TC83808 similar to PIR|T04231|T04231 hypothetical protein F14M19.50 -
Arabidopsis thaliana, partial (24%)
Length = 685
Score = 99.0 bits (245), Expect(2) = 3e-33
Identities = 51/99 (51%), Positives = 69/99 (69%)
Frame = +2
Query: 22 NIKKLKFPKFGCFRIQHDATGDGFDIEVVDASGHRSNPTHLIIMVNGLIGSAHNWKYAAK 81
N K++ P+ +++ + +G+ F +AS + P HL+IMVNG+ GSA +W+YAA+
Sbjct: 176 NSIKMQLPRL---KVEAEISGEDF----FNASTSKRIPRHLVIMVNGITGSASDWRYAAE 334
Query: 82 QFLKRYPYDVIVHCSECNSSTLTFDGVDVTGNRLAEEVI 120
QF+KR P VIVH SECNSS LTFDGVD G RLAEEV+
Sbjct: 335 QFVKRLPDKVIVHRSECNSSRLTFDGVDTMGERLAEEVL 451
Score = 60.1 bits (144), Expect(2) = 3e-33
Identities = 31/70 (44%), Positives = 45/70 (64%), Gaps = 3/70 (4%)
Frame = +1
Query: 119 VISVIKRHPSVRKISFIAHSLGGLIARYAIAKLYERDISKE---LSQGNVHCEGQISNQE 175
V+SV++R P V KISF+AHSLGGL+ARYAI +LY+ E S+ + E +++
Sbjct: 454 VLSVVRRWPEVHKISFVAHSLGGLVARYAIGRLYDNSSKLEHVGNSRNHFKEEKTEYSKQ 633
Query: 176 CHVRKYEGKI 185
C + YE K+
Sbjct: 634 CLTQSYEAKL 663
>TC82635 similar to GP|20260538|gb|AAM13167.1 unknown protein {Arabidopsis
thaliana}, partial (21%)
Length = 611
Score = 91.7 bits (226), Expect = 3e-19
Identities = 40/90 (44%), Positives = 64/90 (70%)
Frame = +2
Query: 56 RSNPTHLIIMVNGLIGSAHNWKYAAKQFLKRYPYDVIVHCSECNSSTLTFDGVDVTGNRL 115
R++P HL+++V+G++ S +W YA + KR + +++ S N+ T TF G+D G RL
Sbjct: 341 RNDPDHLLVLVHGILASTADWTYAEAELKKRLGKNFLIYVSSSNAYTKTFTGIDGAGKRL 520
Query: 116 AEEVISVIKRHPSVRKISFIAHSLGGLIAR 145
A+EV+ V+K+ S+++ISF+AHSLGGL AR
Sbjct: 521 ADEVLQVVKKTESLKRISFLAHSLGGLFAR 610
>TC91256 similar to GP|20258913|gb|AAM14150.1 unknown protein {Arabidopsis
thaliana}, partial (48%)
Length = 675
Score = 71.6 bits (174), Expect = 3e-13
Identities = 37/73 (50%), Positives = 50/73 (67%), Gaps = 1/73 (1%)
Frame = +2
Query: 212 LCGFHSLEKTASRLSRFL-GKTGKHLFLTDGKNEKPPLLLQMVRDSEDIKFMSALRSFKR 270
L G + EK AS + ++ +TG+HLFLTD KPPLL +M+ D + FMSALR+FKR
Sbjct: 5 LFGVTAFEKLASVVIHWIFRRTGRHLFLTDDDEGKPPLLKRMIEDYDGYYFMSALRTFKR 184
Query: 271 RVAYANIRYDRIL 283
RV Y+N+ YD I+
Sbjct: 185 RVIYSNVGYDHIV 223
>BQ140879 similar to GP|20260538|gb unknown protein {Arabidopsis thaliana},
partial (36%)
Length = 618
Score = 48.1 bits (113), Expect = 4e-06
Identities = 22/39 (56%), Positives = 29/39 (73%)
Frame = +1
Query: 245 KPPLLLQMVRDSEDIKFMSALRSFKRRVAYANIRYDRIL 283
KP LLL+M D ED KF+SAL +F+ RV YAN+ YD ++
Sbjct: 4 KPSLLLRMASDCEDGKFISALGAFRSRVVYANVSYDHMV 120
>AW687325 similar to GP|21689821|gb unknown protein {Arabidopsis thaliana},
partial (19%)
Length = 505
Score = 33.5 bits (75), Expect = 0.096
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 9/45 (20%)
Frame = +3
Query: 114 RLAEEVISVIKRHPSVR---------KISFIAHSLGGLIARYAIA 149
RLA+EVIS +K+ ++SF+ HS+G LI R AIA
Sbjct: 6 RLAQEVISFVKKKMDKESRCGNLRDIRLSFVGHSIGNLIIRTAIA 140
>TC93826 similar to PIR|H86366|H86366 protein F26F24.12 [imported] -
Arabidopsis thaliana, partial (56%)
Length = 672
Score = 31.6 bits (70), Expect = 0.36
Identities = 19/64 (29%), Positives = 29/64 (44%), Gaps = 3/64 (4%)
Frame = +2
Query: 164 NVHCEGQI---SNQECHVRKYEGKIAGLEPINFITSATPHLGCRGHKQVPLLCGFHSLEK 220
N+ EG+I S+Q C K GK++G +++ T + CR V +C S
Sbjct: 29 NLKTEGKILKISSQRCSTMKLYGKLSGTNHCSYMAKITTGIFCRNPYNVTGICNRSSCPL 208
Query: 221 TASR 224
SR
Sbjct: 209 ANSR 220
>TC88543 similar to GP|20466782|gb|AAM20708.1 unknown protein {Arabidopsis
thaliana}, partial (46%)
Length = 2101
Score = 30.8 bits (68), Expect = 0.62
Identities = 20/67 (29%), Positives = 33/67 (48%), Gaps = 2/67 (2%)
Frame = +1
Query: 130 RKISFIAHSLGGLIARYAI--AKLYERDISKELSQGNVHCEGQISNQECHVRKYEGKIAG 187
+K + HSL +I R ++ A+ E + K L+ HCE + V + GKIA
Sbjct: 844 KKQYLLLHSLKEVIVRQSVDKAEFQESSVEKILNLLFNHCESEEEGVRNVVAECLGKIAL 1023
Query: 188 LEPINFI 194
+EP+ +
Sbjct: 1024IEPVKLV 1044
>TC85802 weakly similar to PIR|T47837|T47837 beta-glucosidase-like protein -
Arabidopsis thaliana, partial (43%)
Length = 1119
Score = 29.3 bits (64), Expect = 1.8
Identities = 11/26 (42%), Positives = 17/26 (65%)
Frame = -1
Query: 59 PTHLIIMVNGLIGSAHNWKYAAKQFL 84
PT+L I +N LI + HNW + Q++
Sbjct: 1116 PTNLFIDINLLISALHNWYHQLSQYI 1039
>CB065387 weakly similar to GP|20257161|gb| 2-dehydro-3-deoxygalactonate
kinase {Bradyrhizobium japonicum}, partial (16%)
Length = 633
Score = 29.3 bits (64), Expect = 1.8
Identities = 17/46 (36%), Positives = 24/46 (51%), Gaps = 1/46 (2%)
Frame = -3
Query: 36 IQHDATGDGFDIEVVDASGHRSNPT-HLIIMVNGLIGSAHNWKYAA 80
I A DGF++ DA G + L ++ G+IGSA W+ AA
Sbjct: 553 IAGQACSDGFELAFDDACGDWLDAQPELPVIACGMIGSAQGWREAA 416
>TC84460 similar to PIR|T47960|T47960 hypothetical protein F15G16.70 -
Arabidopsis thaliana, partial (23%)
Length = 843
Score = 28.1 bits (61), Expect = 4.0
Identities = 11/27 (40%), Positives = 17/27 (62%)
Frame = +2
Query: 118 EVISVIKRHPSVRKISFIAHSLGGLIA 144
E++ +KRH K+ F HSLGG ++
Sbjct: 578 EIMDHLKRHGDRAKLQFTGHSLGGSLS 658
>TC88031 similar to PIR|E84706|E84706 hypothetical protein At2g30280
[imported] - Arabidopsis thaliana, partial (11%)
Length = 1443
Score = 27.3 bits (59), Expect = 6.9
Identities = 15/55 (27%), Positives = 23/55 (41%), Gaps = 4/55 (7%)
Frame = -3
Query: 166 HCEGQISNQECHVRK----YEGKIAGLEPINFITSATPHLGCRGHKQVPLLCGFH 216
H Q N C + ++ ++ NF+ + L CRGHK PL+ H
Sbjct: 853 HLSRQFHNLNCQAHQNNLLHQLELVDKNVKNFLQRLSHPLQCRGHKHSPLVWNDH 689
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.322 0.138 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,777,676
Number of Sequences: 36976
Number of extensions: 120502
Number of successful extensions: 603
Number of sequences better than 10.0: 34
Number of HSP's better than 10.0 without gapping: 595
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 597
length of query: 285
length of database: 9,014,727
effective HSP length: 95
effective length of query: 190
effective length of database: 5,502,007
effective search space: 1045381330
effective search space used: 1045381330
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.9 bits)
Medicago: description of AC146940.1