
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149209.7 - phase: 0
(384 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CB895143 214 4e-62
AW689408 186 1e-47
BG581317 homologue to OMNI|NTL01EC003 IS2 hypothetical protein {... 43 2e-04
TC76418 similar to GP|16226228|gb|AAL16109.1 At1g20110/T20H2_10 ... 43 2e-04
BG451025 similar to GP|20907905|gb| Transporter {Methanosarcina ... 39 0.003
CB891664 similar to GP|3341693|gb| unknown protein {Arabidopsis ... 37 0.010
TC85765 37 0.017
TC91851 weakly similar to PIR|F96673|F96673 hypothetical protein... 35 0.037
TC81629 similar to GP|15081654|gb|AAK82482.1 AT4g26750/F10M23_90... 35 0.037
TC81087 homologue to GP|530207|gb|AAA66338.1|| heat shock protei... 35 0.037
TC89118 weakly similar to PIR|T06098|T06098 hypothetical protein... 35 0.063
AW773761 similar to GP|21618133|gb| bcnt-like protein {Arabidops... 34 0.11
TC89634 similar to GP|11994370|dbj|BAB02329. gb|AAF13084.1~gene_... 33 0.14
TC86330 similar to SP|Q43644|NUAM_SOLTU NADH-ubiquinone oxidored... 33 0.14
TC76827 similar to GP|17154773|gb|AAL35979.1 extensin-like prote... 33 0.18
TC92131 similar to GP|4115937|gb|AAD03447.1| contains similarity... 33 0.24
TC81124 homologue to GP|14597044|emb|CAC43563. unnamed protein p... 33 0.24
TC88610 homologue to PIR|G96755|G96755 developmental protein hom... 33 0.24
BG457880 weakly similar to GP|1063689|gb|A ORF 66; this sequence... 33 0.24
TC80054 SP|Q28009|FUS_BOVIN RNA-binding protein FUS (Pigpen prot... 32 0.31
>CB895143
Length = 770
Score = 214 bits (545), Expect(2) = 4e-62
Identities = 105/168 (62%), Positives = 126/168 (74%), Gaps = 6/168 (3%)
Frame = +2
Query: 143 SWNTEQNLVLISGWIKYGTCSVVGRNQTSESYWGKIAEYCNEHCSFDSPRDIVACRNRFN 202
+WN +QNL+LISGWIKYGT SVVGRNQ +SY GKI +Y NEHCSF+ P D CRN +N
Sbjct: 206 NWNIDQNLILISGWIKYGTNSVVGRNQKCDSY*GKIVDYFNEHCSFNPPCDAAPCRNHYN 385
Query: 203 YMNKLINKWVGAYDSAKRMQGSGWSEDDVFKKAQELYGCGKNVQFTLMEEWRALRDQPRY 262
YMNK++NKW+ AYD AKRMQ SG S+ DV KA ELY GK+ F L+ W +RDQPRY
Sbjct: 386 YMNKILNKWIDAYDGAKRMQQSG*SKIDVLAKAHELYSSGKSEHFILISGWFVVRDQPRY 565
Query: 263 GSQVGGNVDSGSSGSKRSYE------DSVGSSARPMDRDAAKKKGKKK 304
SQVGGN SGSSGSKR+++ +SVGSS+ PM RDAAKK+ K+K
Sbjct: 566 DSQVGGNTGSGSSGSKRAHKSDASDSNSVGSSSCPMGRDAAKKR*KEK 709
Score = 42.0 bits (97), Expect(2) = 4e-62
Identities = 18/27 (66%), Positives = 22/27 (80%)
Frame = +3
Query: 299 KKGKKKSKGETLEKVEKEWVQFKELKE 325
KKG+KKSKG TLE V +EW +FK+ KE
Sbjct: 690 KKGEKKSKGATLESVNEEWNEFKQCKE 770
>AW689408
Length = 583
Score = 186 bits (472), Expect = 1e-47
Identities = 92/133 (69%), Positives = 95/133 (71%), Gaps = 8/133 (6%)
Frame = +2
Query: 1 MDPTNNQSNTQNSSDYPFSYQN--------QFPNSHPQNPNQFPNQHPQNPNQFPNQNPQ 52
MDP NNQ NTQNSS Y FSYQN QFPN H NPNQF N HPQN NQFPNQ+PQ
Sbjct: 185 MDPNNNQFNTQNSSHYTFSYQNPNNYQHPNQFPNQHLLNPNQFSN*HPQNSNQFPNQHPQ 364
Query: 53 NMSNFGFASNFNHPSFVPNNFNPYFRSMMGYSSQTPPFNGYMPMVNENFQSVGEYPDFSS 112
NMSNFGFA NFN S PNNFNPY RSMMGY SQTP FNGYM M N NF SVGEYP F
Sbjct: 365 NMSNFGFALNFNQSSSXPNNFNPYHRSMMGYPSQTPQFNGYMSMXNANFXSVGEYPXFPX 544
Query: 113 QINRGGMTRDNEV 125
+ GMTR NE+
Sbjct: 545 K*IGXGMTRANEI 583
>BG581317 homologue to OMNI|NTL01EC003 IS2 hypothetical protein {Escherichia
coli K12-MG1655}, partial (55%)
Length = 808
Score = 43.1 bits (100), Expect = 2e-04
Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 8/117 (6%)
Frame = +2
Query: 5 NNQSNTQNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQFPN---QNPQNMSNFGFAS 61
N+ +T + DY + QFP+ QNPN PN P + PN P N+ ++S
Sbjct: 53 NSLRSTMHQGDYTSAPYYQFPHMQ-QNPNPIPNPTPPQSDPIPNHYASAPPFTPNYDYSS 229
Query: 62 NF-----NHPSFVPNNFNPYFRSMMGYSSQTPPFNGYMPMVNENFQSVGEYPDFSSQ 113
+ ++P VP++ NP F PPF P+ + Q YP + Q
Sbjct: 230 TYSPYPPHNPDHVPSS-NPSF--------NPPPFESSNPLYQQPSQPY--YPPYDQQ 367
>TC76418 similar to GP|16226228|gb|AAL16109.1 At1g20110/T20H2_10
{Arabidopsis thaliana}, partial (54%)
Length = 2020
Score = 43.1 bits (100), Expect = 2e-04
Identities = 33/117 (28%), Positives = 50/117 (42%), Gaps = 8/117 (6%)
Frame = +3
Query: 5 NNQSNTQNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQFPN---QNPQNMSNFGFAS 61
N+ +T + DY + QFP+ QNPN PN P + PN P N+ ++S
Sbjct: 9 NSLRSTMHQGDYTSAPYYQFPHMQ-QNPNPIPNPTPPQSDPIPNHYASAPPFTPNYDYSS 185
Query: 62 NF-----NHPSFVPNNFNPYFRSMMGYSSQTPPFNGYMPMVNENFQSVGEYPDFSSQ 113
+ ++P VP++ NP F PPF P+ + Q YP + Q
Sbjct: 186 TYSPYPPHNPDHVPSS-NPSF--------NPPPFESSNPLYQQPSQPY--YPPYDQQ 323
>BG451025 similar to GP|20907905|gb| Transporter {Methanosarcina mazei Goe1},
partial (3%)
Length = 422
Score = 39.3 bits (90), Expect = 0.003
Identities = 22/89 (24%), Positives = 48/89 (53%), Gaps = 5/89 (5%)
Frame = +3
Query: 299 KKGKKKSKGETLEKVEKEWVQFKELKEQEIEQLKELTLVKQQKNKLLQEKTQAKKMKMYL 358
+KGK+KSK E V E ++ KE+ + ++ + E+ V+ ++NKL +++ ++ + L
Sbjct: 39 EKGKEKSKSEETSSVNVELIEVKEIPKTKVLVMAEMARVQDEQNKLKEKELNLREREFEL 218
Query: 359 K-----LRDEEHLDDRKNELLGKLERELF 382
+ + + R +E+L + RE +
Sbjct: 219 EFLFKDISGMKARQQRDHEMLCNVIREKY 305
>CB891664 similar to GP|3341693|gb| unknown protein {Arabidopsis thaliana},
partial (26%)
Length = 580
Score = 37.4 bits (85), Expect = 0.010
Identities = 20/81 (24%), Positives = 38/81 (46%), Gaps = 1/81 (1%)
Frame = +1
Query: 127 PTPEDTTPKSKRNQQPSWNTEQNLVLISGWIKYGTCSVVGRNQTSESYWGKIAEYCNEHC 186
P P P ++R P W+ E+ L LI + + +GR ++W ++A+ + C
Sbjct: 109 PAPLPIPPSTRRLPPPCWSPEETLALIDSYRE--KWYSLGRGNLKATHWQEVADSVSNRC 282
Query: 187 SFDSP-RDIVACRNRFNYMNK 206
+P + V CR++ + K
Sbjct: 283 PNVTPAKTPVQCRHKMEKLRK 345
>TC85765
Length = 862
Score = 36.6 bits (83), Expect = 0.017
Identities = 23/84 (27%), Positives = 45/84 (53%), Gaps = 1/84 (1%)
Frame = +1
Query: 285 VGSSARPMDRDAAKKK-GKKKSKGETLEKVEKEWVQFKELKEQEIEQLKELTLVKQQKNK 343
+G S P+ +K +KK K + +E +++ + K+ KEQ++E + L +K K
Sbjct: 157 MGPSRLPLPPQMERKLLHEKKRKEQEVESTKEDLLHGKKPKEQKVESTEGDLLYDGEKPK 336
Query: 344 LLQEKTQAKKMKMYLKLRDEEHLD 367
+ A+ +K+Y KL+D + +D
Sbjct: 337 EFECPVYAETLKVYNKLKDIDTID 408
>TC91851 weakly similar to PIR|F96673|F96673 hypothetical protein F13O11.30
[imported] - Arabidopsis thaliana, partial (12%)
Length = 981
Score = 35.4 bits (80), Expect = 0.037
Identities = 38/132 (28%), Positives = 63/132 (46%), Gaps = 11/132 (8%)
Frame = +3
Query: 260 PRYGSQVGGNVDSGSSGSKRSYE-DSVGSSARPMDRDAAKK-KGKKKSKG-----ETLEK 312
P S++ S SK + E S ++A P D+ A + KG + E L+K
Sbjct: 228 PLQNSRLSVEKSPRSVNSKPAVERKSAKATATPPDKQAPRAAKGSELQNQLNVAQEDLKK 407
Query: 313 VEKEWVQFKELKEQEIEQLKELTLVKQQKNKLLQE----KTQAKKMKMYLKLRDEEHLDD 368
+++ +Q ++ K + I++LKE V ++ N+ LQE + QAK+ K R E L+
Sbjct: 408 AKEQILQAEKEKVKAIDELKEAQRVAEEANEKLQEALVAQKQAKEESEIEKFRAVE-LEQ 584
Query: 369 RKNELLGKLERE 380
E + K E E
Sbjct: 585 AGIETVNKKEEE 620
>TC81629 similar to GP|15081654|gb|AAK82482.1 AT4g26750/F10M23_90
{Arabidopsis thaliana}, partial (19%)
Length = 824
Score = 35.4 bits (80), Expect = 0.037
Identities = 38/159 (23%), Positives = 58/159 (35%), Gaps = 27/159 (16%)
Frame = +2
Query: 3 PTNNQSNTQNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASN 62
P+ + S DYP + HP P+Q + P P+Q +Q P S+ ++
Sbjct: 50 PSQDYHPPPPSQDYPPPPSQDY---HPPPPSQ--DYQPPPPSQDYHQPPPARSDSSYSEP 214
Query: 63 FNHPSFVPN---NFNPYFRSMMGYSSQTPP------FNGYMPMVNENFQSV--------G 105
+NH + P+ N P + S +TPP F Y + SV G
Sbjct: 215 YNHQQYSPDQSQNLGPNYP-----SHETPPPYSLPHFQSYPSFTESSLPSVPVNHTYYQG 379
Query: 106 EYPDFSSQI----------NRGGMTRDNEVAPTPEDTTP 134
+SSQ + + N P P+ TTP
Sbjct: 380 PDASYSSQSAPLTTNHSLNTQNSSSSRNGTVPEPKPTTP 496
>TC81087 homologue to GP|530207|gb|AAA66338.1|| heat shock protein {Glycine
max}, partial (41%)
Length = 1318
Score = 35.4 bits (80), Expect = 0.037
Identities = 24/60 (40%), Positives = 27/60 (45%), Gaps = 4/60 (6%)
Frame = -1
Query: 7 QSNTQNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQF----PNQNPQNMSNFGFASN 62
QSN N S + + P S P N + FPN N N P QNP N SN ASN
Sbjct: 904 QSN*PNISKIARNITSNNPLSQPFNNSSFPNTRLTNKNWIILSPP*QNPHNSSNLFVASN 725
>TC89118 weakly similar to PIR|T06098|T06098 hypothetical protein T5J17.90 -
Arabidopsis thaliana, partial (82%)
Length = 1208
Score = 34.7 bits (78), Expect = 0.063
Identities = 22/65 (33%), Positives = 28/65 (42%)
Frame = +2
Query: 25 PNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASNFNHPSFVPNNFNPYFRSMMGYS 84
PN P PN PNQ P + PN NP SN P+ PN+F P + ++
Sbjct: 188 PNQLPPPPNHPPNQLPHSSPASPNLNP--------PSNLISPN--PNSFPPLIHLISNHT 337
Query: 85 SQTPP 89
S P
Sbjct: 338 STKSP 352
>AW773761 similar to GP|21618133|gb| bcnt-like protein {Arabidopsis
thaliana}, partial (35%)
Length = 622
Score = 33.9 bits (76), Expect = 0.11
Identities = 24/73 (32%), Positives = 40/73 (53%)
Frame = -1
Query: 275 SGSKRSYEDSVGSSARPMDRDAAKKKGKKKSKGETLEKVEKEWVQFKELKEQEIEQLKEL 334
S SK + E + + P DA ++ KKK+K L+K +K+W +FKE + + +E EL
Sbjct: 601 SDSKEAIERA--KAPAPSAVDAVLEQIKKKAKLSVLDKTKKDWGEFKE-ENKGLE--NEL 437
Query: 335 TLVKQQKNKLLQE 347
K+ N+ L +
Sbjct: 436 DTYKKSSNQYLDK 398
>TC89634 similar to GP|11994370|dbj|BAB02329.
gb|AAF13084.1~gene_id:MDC16.11~similar to unknown
protein {Arabidopsis thaliana}, partial (5%)
Length = 898
Score = 33.5 bits (75), Expect = 0.14
Identities = 24/104 (23%), Positives = 39/104 (37%)
Frame = +1
Query: 2 DPTNNQSNTQNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFAS 61
D + D+ S +++ P S + Q P+ N +QN N N G
Sbjct: 349 DDVSGSKQVDLRDDHVRSSESESPASGAASEQQLPD----NKESSSSQNLDNYGNIGLVR 516
Query: 62 NFNHPSFVPNNFNPYFRSMMGYSSQTPPFNGYMPMVNENFQSVG 105
+ + PS+ P +M G+S+ PP +P N G
Sbjct: 517 DTS-PSYAPAAQQQDSHNMPGFSAYDPPTGYDIPYFRPNMDETG 645
>TC86330 similar to SP|Q43644|NUAM_SOLTU NADH-ubiquinone oxidoreductase 75
kDa subunit mitochondrial precursor (EC 1.6.5.3) (EC
1.6.99.3), partial (95%)
Length = 2619
Score = 33.5 bits (75), Expect = 0.14
Identities = 20/64 (31%), Positives = 32/64 (49%)
Frame = +3
Query: 26 NSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASNFNHPSFVPNNFNPYFRSMMGYSS 85
+S Q+P PNQ+ NPN N NP ++S+ HPS + +P +++ +SS
Sbjct: 144 SSSVQSP---PNQNSANPNPPLNPNPNSLSSISLPV---HPSLALESISPIPMTLLRFSS 305
Query: 86 QTPP 89
P
Sbjct: 306 MVIP 317
>TC76827 similar to GP|17154773|gb|AAL35979.1 extensin-like protein {Cucumis
sativus}, partial (60%)
Length = 1093
Score = 33.1 bits (74), Expect = 0.18
Identities = 24/74 (32%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Frame = +2
Query: 2 DPTNNQSNTQNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFAS 61
+ TN ++ NSS+ +S + S P NP+ F + +P+ P N SNF +++
Sbjct: 236 ETTNCETTCYNSSNIAYSSSS----STPSNPSNFA--YSSSPS-----TPSNPSNFAYST 382
Query: 62 NF-NHPSFV-PNNF 73
NF N +F P+NF
Sbjct: 383 NFANSSNFAYPSNF 424
>TC92131 similar to GP|4115937|gb|AAD03447.1| contains similarity to human
PCF11p homolog (GB:AF046935) {Arabidopsis thaliana},
partial (24%)
Length = 1003
Score = 32.7 bits (73), Expect = 0.24
Identities = 31/120 (25%), Positives = 46/120 (37%)
Frame = +3
Query: 11 QNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASNFNHPSFVP 70
Q+ + F+Y+ P P N PN P P P Q N ++F F P P
Sbjct: 102 QDPAASQFTYRPSLPGHGPAMSNPLPNVRPVMPLPLPGQRIGN-NSFHFQGGALRPLPPP 278
Query: 71 NNFNPYFRSMMGYSSQTPPFNGYMPMVNENFQSVGEYPDFSSQINRGGMTRDNEVAPTPE 130
P SQ PP P V +VG +S + +G ++ N+ AP+ +
Sbjct: 279 GPHAP---------SQMPPHPNPSPFVPNQQPTVGYSNLINSLMAQGVISLTNQ-APSQD 428
>TC81124 homologue to GP|14597044|emb|CAC43563. unnamed protein product
{Arabidopsis thaliana}, partial (21%)
Length = 1609
Score = 32.7 bits (73), Expect = 0.24
Identities = 31/114 (27%), Positives = 49/114 (42%), Gaps = 7/114 (6%)
Frame = +1
Query: 22 NQFPNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASNFNHPSFVPNNFNPYFRSMM 81
N F N++ PNQ N P +P+Q QN N ++ FN P+F+ +N
Sbjct: 463 NSFFNNNIHEPNQ--NFVPHDPSQQQGLMMQNSHNSNNSNLFNMPNFLSSN--------- 609
Query: 82 GYSSQTPPFNGYMPMVNEN---FQSVGEYPD----FSSQINRGGMTRDNEVAPT 128
+S F+ ++ + NE F G FS+ ++GG + N A T
Sbjct: 610 STNSSNNSFSEHLNVKNEGTNFFNGSGGTSSAPSLFSNVFSQGGNSNINAAAGT 771
>TC88610 homologue to PIR|G96755|G96755 developmental protein homolog DG1118
[imported] - Arabidopsis thaliana, partial (79%)
Length = 689
Score = 32.7 bits (73), Expect = 0.24
Identities = 18/58 (31%), Positives = 28/58 (48%)
Frame = -3
Query: 27 SHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASNFNHPSFVPNNFNPYFRSMMGYS 84
+HP P+Q P+ P+ Q P ++ Q + N NHP V +N+ P + G S
Sbjct: 465 NHPHQPSQHPSP*PEQHEQPPPED*QKL------PNHNHPPPVASNY*PPLPELKGKS 310
>BG457880 weakly similar to GP|1063689|gb|A ORF 66; this sequence overlaps at
the HindIII site with the sequence reported for the
OpMNPV lef-3, partial (5%)
Length = 669
Score = 32.7 bits (73), Expect = 0.24
Identities = 33/124 (26%), Positives = 49/124 (38%)
Frame = +3
Query: 19 SYQNQFPNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASNFNHPSFVPNNFNPYFR 78
S+QN+ P++ PQ+P Q Q PQ PN PN QN + P+ N + Y
Sbjct: 12 SWQNRAPSA-PQDPYQQQQQPPQQPND-PNLQTQN------SQQPQVPTASDPNSSYY-- 161
Query: 79 SMMGYSSQTPPFNGYMPMVNENFQSVGEYPDFSSQINRGGMTRDNEVAPTPEDTTPKSKR 138
+ + Q P P N++ YP+ G + PTP T + +
Sbjct: 162 -LTPHQPQQTPLGPSAPDTNQS-----PYPNLQQPSQYPGGSVSAPSQPTPGQTPAQVAQ 323
Query: 139 NQQP 142
QP
Sbjct: 324 PVQP 335
>TC80054 SP|Q28009|FUS_BOVIN RNA-binding protein FUS (Pigpen protein).
[Bovine] {Bos taurus}, partial (4%)
Length = 988
Score = 32.3 bits (72), Expect = 0.31
Identities = 36/146 (24%), Positives = 52/146 (34%), Gaps = 10/146 (6%)
Frame = +1
Query: 7 QSNTQNSSDYPFSYQNQFPNSHPQNPNQFPNQHPQNPNQFPNQNPQNMSNFGFASNFNHP 66
Q Q S P + Q + S PQ + + Q P+ P Q+ S A + P
Sbjct: 394 QGAAQPSYGVPPTSQTAY-GSQPQTQSGYGPPQTQKPSGTPPAYGQSQSP-NAAGGYGQP 567
Query: 67 SFVPNNFNPYFRSMMGYSSQTPPFNGY-MPMVNENFQSV--GEY-------PDFSSQINR 116
+ P+ P S GY PP +GY P + QS G Y P +S+ N
Sbjct: 568 GYPPSQPPPSGYSQPGYPPSQPPPSGYSQPGYGQAPQSYNSGSYGAGYPQAPAYSADGNA 747
Query: 117 GGMTRDNEVAPTPEDTTPKSKRNQQP 142
G R P ++ + P
Sbjct: 748 SGNARGGSYDGAPAQGAQQASVAKSP 825
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.311 0.129 0.385
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,650,651
Number of Sequences: 36976
Number of extensions: 212341
Number of successful extensions: 1412
Number of sequences better than 10.0: 107
Number of HSP's better than 10.0 without gapping: 1278
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1366
length of query: 384
length of database: 9,014,727
effective HSP length: 98
effective length of query: 286
effective length of database: 5,391,079
effective search space: 1541848594
effective search space used: 1541848594
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (27.3 bits)
Medicago: description of AC149209.7