
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC139854.4 + phase: 0
(299 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC80014 weakly similar to GP|21536784|gb|AAM61116.1 unknown {Ara... 388 e-108
BI312288 similar to PIR|T49048|T490 hypothetical protein T5P19.1... 355 9e-99
BI269628 40 8e-09
TC90162 33 0.13
TC79739 homologue to PIR|G86423|G86423 probable hydrophilic prot... 29 1.9
TC79398 similar to PIR|A86316|A86316 protein T10O22.3 [imported]... 29 2.5
TC90702 similar to GP|15028243|gb|AAK76710.1 unknown protein {Ar... 28 3.3
TC87813 similar to SP|P49299|CYSZ_CUCMA Citrate synthase glyoxy... 28 4.3
CA917688 weakly similar to GP|9294325|dbj| gene_id:K24M9.13~unkn... 27 7.3
TC88942 similar to GP|16649083|gb|AAL24393.1 Unknown protein {Ar... 27 9.5
TC86956 similar to GP|18252955|gb|AAL62404.1 putative protein {A... 27 9.5
BQ137800 weakly similar to GP|4494984|gb|A latent nuclear antige... 27 9.5
AW257550 homologue to GP|17064976|gb putative Na+-dependent inor... 27 9.5
TC78912 similar to GP|4587553|gb|AAD25784.1| F15I1.20 {Arabidops... 27 9.5
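The summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output, assuming each line has the shape "ID [description] score E-value" with single spaces as shown here. The names `SUMMARY_RE` and `parse_summary_line` are hypothetical. Note that this BLAST version prints some E-values without a leading mantissa (`e-108`), which Python's `float()` rejects unless a `1` is prepended.

```python
import re

# Hypothetical pattern: sequence ID, optional description, bit score, E-value.
SUMMARY_RE = re.compile(r"^(\S+)\s*(.*?)\s+(\d+)\s+(\S+)$")


def parse_summary_line(line):
    """Split one hit-summary line into (id, description, bits, evalue)."""
    match = SUMMARY_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    seq_id, description, bits, evalue = match.groups()
    # E-values such as "e-108" lack a mantissa; float() needs "1e-108".
    if evalue.startswith("e"):
        evalue = "1" + evalue
    return seq_id, description, int(bits), float(evalue)


if __name__ == "__main__":
    # Lines copied from the summary table in this report.
    for text in (
        "TC80014 weakly similar to GP|21536784|gb|AAM61116.1 unknown {Ara... 388 e-108",
        "BI269628 40 8e-09",
        "TC90162 33 0.13",
    ):
        print(parse_summary_line(text))
```

Hits without a description (for example BI269628) come back with an empty description string.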
>TC80014 weakly similar to GP|21536784|gb|AAM61116.1 unknown {Arabidopsis
thaliana}, partial (25%)
Length = 1587
Score = 388 bits (996), Expect = e-108
Identities = 189/300 (63%), Positives = 223/300 (74%), Gaps = 2/300 (0%)
Frame = +2
Query: 1 MYFPKKGNCYDFYDPVQRKTYSLELPELDGCRVCYTKDGWLLLNRQDWRRLDGNHIFSLF 60
MYFPK G Y+FYDPVQRKTYS+E PEL+G RVCYTKDGWLLL R R+ F
Sbjct: 308 MYFPKFGQWYEFYDPVQRKTYSIEFPELNGSRVCYTKDGWLLLYRPRTDRV------FFF 469
Query: 61 NPFTRDLITLPKFDRTYQIAAFSCAPTSTGCVILIFRRVGSSLVAISTCYPGEKEWTTVY 120
NPFTR+ I +P+F+ TYQI AFSCAPTS CV+ + V ++VAISTC+PG EW TV
Sbjct: 470 NPFTRETIKMPRFEMTYQIVAFSCAPTSPDCVLFTVKHVSPTIVAISTCHPGATEWVTVN 649
Query: 121 YDAELS--CSMCDKLVFSNGLFYCLSDRGWLGVFDPLERTWTVFKVPPPKCLAESSTAKN 178
Y L S+ +KLVF NGLFYCLS GWLGVFDP ERTW+V VPPPKC E+ AKN
Sbjct: 650 YQNRLPFVSSIWNKLVFCNGLFYCLSLTGWLGVFDPSERTWSVLSVPPPKC-PENFFAKN 826
Query: 179 WSKGKFMIEHKGNIFVVHICCGEDPIIFKLDLTLMEWKEVRSLNGVTLFASFLSSHSRTY 238
W KGKFM E +G++ V++ C E+PIIFKLD MEW+E+++L+G TLFASFLSSHSRT
Sbjct: 827 WWKGKFMTEQEGDVIVMYTCSSENPIIFKLDQASMEWEELKTLDGATLFASFLSSHSRTD 1006
Query: 239 ATGIMRNSVYFPKVRFYGKRCISFSLDDRRYYPSEQCRDKVEPNTFENFWIEPPKDFTGW 298
G MRNS+YF KVRFYGKRCISFSLDD RYYP +Q D E + FE+ WIEPPKDF+G+
Sbjct: 1007 LLGNMRNSIYFSKVRFYGKRCISFSLDDYRYYPRKQWHDWGEQDPFESIWIEPPKDFSGF 1186
>BI312288 similar to PIR|T49048|T490 hypothetical protein T5P19.120 -
Arabidopsis thaliana, partial (18%)
Length = 705
Score = 355 bits (912), Expect = 9e-99
Identities = 164/165 (99%), Positives = 164/165 (99%)
Frame = +2
Query: 1 MYFPKKGNCYDFYDPVQRKTYSLELPELDGCRVCYTKDGWLLLNRQDWRRLDGNHIFSLF 60
MYFPKKGNCYDFYDPVQRKTYSLELPELDGCRVCYTKDGWLLLNRQDWRRLDGNHIFSLF
Sbjct: 209 MYFPKKGNCYDFYDPVQRKTYSLELPELDGCRVCYTKDGWLLLNRQDWRRLDGNHIFSLF 388
Query: 61 NPFTRDLITLPKFDRTYQIAAFSCAPTSTGCVILIFRRVGSSLVAISTCYPGEKEWTTVY 120
NPFTRDLITLPKFDRTYQIAAFSCAPTSTGCVILIFRRVGSSLVAISTCYPGEKEWTTV
Sbjct: 389 NPFTRDLITLPKFDRTYQIAAFSCAPTSTGCVILIFRRVGSSLVAISTCYPGEKEWTTVN 568
Query: 121 YDAELSCSMCDKLVFSNGLFYCLSDRGWLGVFDPLERTWTVFKVP 165
YDAELSCSMCDKLVFSNGLFYCLSDRGWLGVFDPLERTWTVFKVP
Sbjct: 569 YDAELSCSMCDKLVFSNGLFYCLSDRGWLGVFDPLERTWTVFKVP 703
>BI269628
Length = 376
Score = 40.0 bits (92), Expect(3) = 8e-09
Identities = 16/29 (55%), Positives = 21/29 (72%)
Frame = +2
Query: 270 YPSEQCRDKVEPNTFENFWIEPPKDFTGW 298
YP +Q D E + FE+ WIEPPKDF+G+
Sbjct: 77 YPRKQWHDWGEQDPFESIWIEPPKDFSGF 163
Score = 29.3 bits (64), Expect(3) = 8e-09
Identities = 14/22 (63%), Positives = 14/22 (63%)
Frame = +1
Query: 251 KVRFYGKRCISFSLDDRRYYPS 272
K F RCISFSLDD RY S
Sbjct: 19 KFVFMESRCISFSLDDYRYLSS 84
Score = 26.6 bits (57), Expect(3) = 8e-09
Identities = 10/12 (83%), Positives = 11/12 (91%)
Frame = +3
Query: 246 SVYFPKVRFYGK 257
S+YF KVRFYGK
Sbjct: 3 SIYFSKVRFYGK 38
>TC90162
Length = 937
Score = 33.1 bits (74), Expect = 0.13
Identities = 44/208 (21%), Positives = 81/208 (38%), Gaps = 13/208 (6%)
Frame = +2
Query: 54 NHIFSLFNPFTR--DLITLPKFDRTYQIAAFSCAPTSTGCVILIFRRVGSSLVAISTC-- 109
N+ F L NPFTR +I F + A+ V+L F + V ++ C
Sbjct: 89 NNSFLLMNPFTRRKKVINTSTFKVNFSYFAYR--------VLLAFDKGSKDFVLVALCKS 244
Query: 110 ------YPGEKEWTTVYYDAELSCSMCDKLVFSNGLFYCLSDRGWLGVFDPLERTWTVFK 163
Y Y + D +V N + Y ++D+ +G+ L K
Sbjct: 245 SNSLHVYQSRNFGWVTYSTMGYPWMIVDFVVLHNTI-YVVNDKANIGI---LNLNSANIK 412
Query: 164 VPPPKCLAESSTAKNWSKGKFMIEHKGNIFVVHICCGEDPIIFKLDLTLMEWKEVRSLNG 223
KC+ ++ + ++ +FVVH G ++K+D + M++ ++++L
Sbjct: 413 FLEMKCIPSVTSLSHLR----LVSCDEQLFVVHTKPGVVFNVYKIDFSTMKYVKLKTLGD 580
Query: 224 VTLFASFLSSH---SRTYATGIMRNSVY 248
+ L + ++ S G NSVY
Sbjct: 581 IALLYAPGGNYYALSNPNRWGYESNSVY 664
>TC79739 homologue to PIR|G86423|G86423 probable hydrophilic protein
29542-30030 [imported] - Arabidopsis thaliana, partial
(95%)
Length = 676
Score = 29.3 bits (64), Expect = 1.9
Identities = 12/20 (60%), Positives = 13/20 (65%)
Frame = -3
Query: 259 CISFSLDDRRYYPSEQCRDK 278
C SFSLDD YP E C D+
Sbjct: 158 CTSFSLDDSSLYPFEAC*DR 99
>TC79398 similar to PIR|A86316|A86316 protein T10O22.3 [imported] -
Arabidopsis thaliana, partial (74%)
Length = 1110
Score = 28.9 bits (63), Expect = 2.5
Identities = 13/46 (28%), Positives = 26/46 (56%)
Frame = +2
Query: 153 DPLERTWTVFKVPPPKCLAESSTAKNWSKGKFMIEHKGNIFVVHIC 198
+P ++WT ++ST + W+KG+ ++KG+IF + +C
Sbjct: 554 EPSNKSWTF----------KTSTNQPWTKGRQGTKYKGSIFYLVLC 661
>TC90702 similar to GP|15028243|gb|AAK76710.1 unknown protein {Arabidopsis
thaliana}, partial (32%)
Length = 1039
Score = 28.5 bits (62), Expect = 3.3
Identities = 17/48 (35%), Positives = 24/48 (49%), Gaps = 7/48 (14%)
Frame = +2
Query: 232 SSHSRTYATG-------IMRNSVYFPKVRFYGKRCISFSLDDRRYYPS 272
S HS T +T I R S++ +V G RC + LD R ++PS
Sbjct: 383 SPHSNTVSTFNQR*TKFIQRRSIFNLRVNPRGSRCATCQLDRRLFHPS 526
>TC87813 similar to SP|P49299|CYSZ_CUCMA Citrate synthase glyoxysomal
precursor (EC 4.1.3.7) (GCS). [Pumpkin Winter squash],
partial (57%)
Length = 1235
Score = 28.1 bits (61), Expect = 4.3
Identities = 23/85 (27%), Positives = 34/85 (39%)
Frame = -2
Query: 73 FDRTYQIAAFSCAPTSTGCVILIFRRVGSSLVAISTCYPGEKEWTTVYYDAELSCSMCDK 132
F T + S AP G +R +A+ T P + W T + +SCS C
Sbjct: 1201 FQFTQHLRTASLAPP*RGP----YRAPTPPAIAVYTSTPLDARWRTADVEQFISCSACRM 1034
Query: 133 LVFSNGLFYCLSDRGWLGVFDPLER 157
+ S+ R LG++D L R
Sbjct: 1033 KMMSSARV-----RRGLGLYDRLPR 974
>CA917688 weakly similar to GP|9294325|dbj| gene_id:K24M9.13~unknown protein
{Arabidopsis thaliana}, partial (5%)
Length = 873
Score = 27.3 bits (59), Expect = 7.3
Identities = 8/26 (30%), Positives = 16/26 (60%)
Frame = -2
Query: 124 ELSCSMCDKLVFSNGLFYCLSDRGWL 149
EL+C + ++F + +++C D WL
Sbjct: 389 ELTCDKPESIIFGSPVYFCSVDTSWL 312
>TC88942 similar to GP|16649083|gb|AAL24393.1 Unknown protein {Arabidopsis
thaliana}, partial (43%)
Length = 1879
Score = 26.9 bits (58), Expect = 9.5
Identities = 14/39 (35%), Positives = 22/39 (55%)
Frame = -1
Query: 138 GLFYCLSDRGWLGVFDPLERTWTVFKVPPPKCLAESSTA 176
G + L +R +F+ + +WT+F V PPK + S TA
Sbjct: 1594 G*LHILRNRQAQEMFNFEQTSWTIFPVTPPKDILFSVTA 1478
>TC86956 similar to GP|18252955|gb|AAL62404.1 putative protein {Arabidopsis
thaliana}, partial (64%)
Length = 1078
Score = 26.9 bits (58), Expect = 9.5
Identities = 13/32 (40%), Positives = 17/32 (52%)
Frame = +2
Query: 245 NSVYFPKVRFYGKRCISFSLDDRRYYPSEQCR 276
N+ F +V +C S S DD+RYY S R
Sbjct: 119 NNRIFRRVILNQTKCSSLSDDDKRYYSSSSSR 214
>BQ137800 weakly similar to GP|4494984|gb|A latent nuclear antigen {Macaca
mulatta rhadinovirus 17577}, partial (4%)
Length = 1127
Score = 26.9 bits (58), Expect = 9.5
Identities = 12/44 (27%), Positives = 20/44 (45%)
Frame = +2
Query: 244 RNSVYFPKVRFYGKRCISFSLDDRRYYPSEQCRDKVEPNTFENF 287
R+ ++ RC+S D RR Y S +CR + P+ +
Sbjct: 347 RDHIHAHTQHLTSSRCVSHMSDVRRRYAS*RCRLRTSPSALTRY 478
>AW257550 homologue to GP|17064976|gb putative Na+-dependent inorganic
phosphate cotransporter {Arabidopsis thaliana}, partial
(20%)
Length = 723
Score = 26.9 bits (58), Expect = 9.5
Identities = 17/76 (22%), Positives = 35/76 (45%), Gaps = 4/76 (5%)
Frame = -3
Query: 103 LVAISTCYPGEKEWTTVYYDAELS---CSMCDKLVFSNGLFYCLSDRGWLGVFD-PLERT 158
++ + YP E+ V+ + + C+M ++FS C+ W G F P++R+
Sbjct: 589 ILGYCSVYPIPLEYWLVFLEQQQQVTFCNMAHGMMFSRFQLGCIWSEPWCGTFSRPVKRS 410
Query: 159 WTVFKVPPPKCLAESS 174
T +P L+ ++
Sbjct: 409 *TNILLPTIDTLSHNT 362
>TC78912 similar to GP|4587553|gb|AAD25784.1| F15I1.20 {Arabidopsis
thaliana}, partial (72%)
Length = 1071
Score = 26.9 bits (58), Expect = 9.5
Identities = 11/23 (47%), Positives = 17/23 (73%)
Frame = -3
Query: 3 FPKKGNCYDFYDPVQRKTYSLEL 25
FPKK Y+FY+ +RK Y++E+
Sbjct: 973 FPKKTA*YNFYNLNRRKIYNMEI 905
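In the alignments above, the Sbjct coordinates are nucleotide positions in the database sequence, and the Frame value can be recovered from them. The sketch below is an illustration under that assumption. The function names are hypothetical, and the frame rule used is the one that is consistent with every hit in this report: plus-strand frames are counted from the first base of the sequence, minus-strand frames from the last.

```python
def tblastn_frame(sbjct_start, sbjct_end, subject_length):
    """Infer the reading frame of a TBLASTN HSP from 1-based coordinates."""
    if sbjct_start <= sbjct_end:
        # Plus strand: frame is the offset of the first codon from base 1.
        return (sbjct_start - 1) % 3 + 1
    # Minus strand: coordinates run downwards; offset is taken from the 3' end.
    return -((subject_length - sbjct_start) % 3 + 1)


def translated_residues(sbjct_start, sbjct_end):
    """Number of codons spanned by the subject coordinates."""
    return (abs(sbjct_end - sbjct_start) + 1) // 3


if __name__ == "__main__":
    # (id, start, end, length) taken from hits in this report.
    hits = [
        ("TC80014", 308, 1186, 1587),
        ("BI312288", 209, 703, 705),
        ("TC79739", 158, 99, 676),
        ("TC87813", 1201, 974, 1235),
    ]
    for name, start, end, length in hits:
        print(name, tblastn_frame(start, end, length),
              translated_residues(start, end))
```

For TC80014 the span 308 to 1186 covers 293 codons, which matches the 300 alignment columns minus the 7 gap positions shown in the subject line.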
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.326 0.142 0.476
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,127,970
Number of Sequences: 36976
Number of extensions: 230670
Number of successful extensions: 1196
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 1189
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1192
length of query: 299
length of database: 9,014,727
effective HSP length: 96
effective length of query: 203
effective length of database: 5,465,031
effective search space: 1109401293
effective search space used: 1109401293
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 58 (26.9 bits)
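The figures in the statistics block above are related by simple arithmetic, which the following sketch reproduces as a consistency check. It assumes the usual Karlin-Altschul conversion from raw score S to bit score, (lambda * S - ln K) / ln 2, and uses the rounded Lambda and K values printed in this report, so results agree only to about one decimal place. The variable and function names are illustrative. The "length of database" of 9,014,727 is the 27,044,181 nucleotide letters divided by 3.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul form)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the statistics block of this report.
query_length = 299
database_length = 27_044_181 // 3   # nucleotide letters as codons
num_sequences = 36_976
effective_hsp_length = 96

effective_query = query_length - effective_hsp_length
effective_database = database_length - num_sequences * effective_hsp_length
search_space = effective_query * effective_database

if __name__ == "__main__":
    print("effective query length:", effective_query)
    print("effective database length:", effective_database)
    print("effective search space:", search_space)
    # Gapped parameters (Lambda 0.267, K 0.0410) apply to alignment scores.
    print("raw 996 in bits:", round(bit_score(996, 0.267, 0.0410), 1))
    print("raw 58 (S2) in bits:", round(bit_score(58, 0.267, 0.0410), 1))
    # Ungapped parameters (Lambda 0.326, K 0.142) apply to S1.
    print("raw 40 (S1) in bits:", round(bit_score(40, 0.326, 0.142), 1))
```

The three integer results match the reported effective lengths and search space exactly; the bit scores match the reported 388, 26.9 and 21.6 up to rounding of the printed parameters.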