
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144618.8 + phase: 0 /pseudo
(558 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC89308 similar to GP|9758091|dbj|BAB08535.1 dbj|BAA95714.1~gene... 286 1e-77
TC82900 weakly similar to PIR|D85437|D85437 hypothetical protein... 180 1e-45
CA858891 similar to PIR|T06626|T06 hypothetical protein T20K18.3... 177 7e-45
TC89309 weakly similar to GP|9758091|dbj|BAB08535.1 dbj|BAA95714... 109 3e-24
TC84608 weakly similar to GP|9758091|dbj|BAB08535.1 dbj|BAA95714... 53 3e-07
TC82096 similar to PIR|D85437|D85437 hypothetical protein AT4g37... 47 3e-05
BE941840 38 0.009
AW690560 similar to GP|11993482|gb| ubiquitin-specific protease ... 31 1.1
TC78933 similar to GP|17473880|gb|AAL38360.1 putative protein {A... 31 1.4
CA917730 similar to PIR|C86310|C863 protein F1L3.2 [imported] - ... 31 1.4
TC82256 similar to GP|20042913|gb|AAM08741.1 Unknown protein {Or... 31 1.4
TC91391 weakly similar to GP|21427473|gb|AAM53249.1 actin-relate... 30 1.9
TC85533 similar to SP|P42043|HMZ1_ARATH Ferrochelatase I chloro... 29 5.4
TC88525 pyrroline-5-carboxylate synthetase 1 [Medicago truncatula] 29 5.4
TC78713 similar to GP|21310093|gb|AAM46142.1 terminal flower-lik... 28 7.1
BE325693 similar to GP|18087608|gb AT3g51050/F24M12_90 {Arabidop... 28 9.3
AJ501232 homologue to GP|16186265|gb ring box-1-like protein {Ar... 28 9.3
BQ137274 GP|12053007|e hypothetical protein {Homo sapiens}, part... 28 9.3
>TC89308 similar to GP|9758091|dbj|BAB08535.1
dbj|BAA95714.1~gene_id:MNF13.19~strong similarity to
unknown protein {Arabidopsis thaliana}, partial (29%)
Length = 1052
Score = 286 bits (733), Expect = 1e-77
Identities = 150/333 (45%), Positives = 216/333 (64%)
Frame = +1
Query: 200 DVVLITSIAIWKSPYMLFRGWKRLLEDLVGRRGPFLETECVPFAGLAIILWPLAVLGAVL 259
DV +I+++A++K PYMLF+GW RL DL+GR GPFLET CVPFAGLAI+LWPLAV GAV+
Sbjct: 67 DVPVISTVALFKCPYMLFKGWHRLFHDLIGREGPFLETICVPFAGLAILLWPLAVAGAVV 246
Query: 260 AAIIVSFFLSLYGGVVVHQEDSLKMGFAYIVSVVSLFDEYVNDLLYLREGSCLPRPIYRK 319
++ +F+ Y GVV +QE S G Y ++ +S++DEY+ND+L + EG+C PRP YR+
Sbjct: 247 VSMFSGYFIGAYAGVVAYQESSFSFGINYAIAALSIYDEYINDILDMPEGTCFPRPQYRR 426
Query: 320 NVKHAVERKKLEGIDHNLKNRRDSSQNSKHTLQQSRSMKWKIQQYKPVQVWDWFFKSCEV 379
NV A+ + R+ S++S +++ + I + K +++ D FK C +
Sbjct: 427 NVDSALRTSDSASVSRPSSFRKAPSRSS--SIKNN-----NIVELKSLELLDGLFKECCI 585
Query: 380 NGRIVLRDGLISVKEIEECIFKGNCKKLSIKLPAWSLLQCLLTSAKSNSDGLVISDDIEL 439
G ++ +GLI+ K+IEE ++I LPA+ LLQ LL SAK NS+G++I+DD EL
Sbjct: 586 VGGKMISEGLITGKDIEEAKSSKGSNVITIGLPAYCLLQGLLRSAKVNSEGILINDDTEL 765
Query: 440 TRMNGPKDRVFEWFIGPLLIMKEQLKNLELEESEETCLKELVMRSKNDIPEDWDSTGFPS 499
T N P+++ FEWF+ PLLI+KEQ+K L SEE L +LV+ + G
Sbjct: 766 TPSNRPREKFFEWFLNPLLIIKEQIKAENLSASEEDYLCKLVLLRGDAERIKNSCIGPGP 945
Query: 500 KDNVRRAQLQAIIRRLQGIGSSMSRMPTFRRKF 532
V+RA+L A+ RRLQGI SMSR+PTF+R+F
Sbjct: 946 XSEVKRAELDALARRLQGITRSMSRVPTFKRRF 1044
>TC82900 weakly similar to PIR|D85437|D85437 hypothetical protein AT4g37030
[imported] - Arabidopsis thaliana, partial (41%)
Length = 1079
Score = 180 bits (456), Expect = 1e-45
Identities = 101/277 (36%), Positives = 161/277 (57%), Gaps = 12/277 (4%)
Frame = +2
Query: 287 AYIVSVVSLFDEYVNDLLYLREGSCLPRPIYRKNVKHAVERKKLEGIDHNLKNRRDSSQN 346
AYI+++V+ FDEY ND LYLREGS P+P YRK + G N +S N
Sbjct: 2 AYIMAMVAEFDEYTNDWLYLREGSFFPKPQYRKKMVSQSSEFSTRG-----NNTSESRSN 166
Query: 347 SKH--------TLQQSRSMKWKIQQYKPVQVWDWFFKSCEVNGRIVLRDGLISVKEIEEC 398
+ L SRS++ IQ+ K VQ+W + CE+ G+ +L +++ ++ E
Sbjct: 167 TTMEPPAMFMPNLAPSRSVRETIQEVKMVQIWGNMMRDCEIRGKELLDANVLTAADLYEW 346
Query: 399 IFKGNCKKLSIK---LPAWSLLQCLLTSAKSNSDGLVISDDIELTRMNGPKDRVFEWFIG 455
+ N + SI LP +SLLQ LL S K+NS G+++ DD E+T N PKD++ +WF
Sbjct: 347 LRGKNVNEASIVGVGLPCYSLLQTLLFSIKANSSGVLLLDDFEITHFNRPKDKLLDWFFN 526
Query: 456 PLLIMKEQLKNLELEESEETCLKELVMRSKN-DIPEDWDSTGFPSKDNVRRAQLQAIIRR 514
P++++KEQ++ ++L E E L+++V+ N + WD+ D +R AQ++ I RR
Sbjct: 527 PVMVLKEQIRVIKLVEGEVRYLEKVVLFGVNKQRLKTWDNGSLLIPDGLRAAQIEGISRR 706
Query: 515 LQGIGSSMSRMPTFRRKFRNLVKILYIEALQASASAK 551
+ G+ +S++PT+RRKFR ++K L+ +L+ AS K
Sbjct: 707 MIGMIRGVSKLPTYRRKFRQVLKALFTHSLEKDASEK 817
>CA858891 similar to PIR|T06626|T06 hypothetical protein T20K18.30 -
Arabidopsis thaliana, partial (26%)
Length = 763
Score = 177 bits (450), Expect = 7e-45
Identities = 113/187 (60%), Positives = 119/187 (63%), Gaps = 18/187 (9%)
Frame = +3
Query: 146 NFIIASLMVVGQQFKQAVQWFRMSQTFVFTPIFHTWMN*EKI*TLKKSHLISTFDVVLIT 205
NFIIASLMVVGQQFKQAVQWFRMSQTFVFTPIFHTWMN*EKI*TLKKSHLIS +
Sbjct: 6 NFIIASLMVVGQQFKQAVQWFRMSQTFVFTPIFHTWMN*EKI*TLKKSHLISNCRYYHVA 185
Query: 206 SIAIWKS-PYMLF--------RGWKRLLED---------LVGRRGPFLETECVPFAGLAI 247
W P M F R LED G L + +
Sbjct: 186 CW*FWLGCPLMWF*SHQLLYGRALTCCLEDGRDY*KIWLEEGDLS*KLNVSHLLVLPSSF 365
Query: 248 ILWPLAVLGAVLAAIIVSFFLSLYGGVVVHQEDSLKMGFAYIVSVVSLFDEYVNDLLYLR 307
LW VL L+ + FFL+ GGVVVHQEDSLKMGFAYIVSVVSLFDEYVNDLLYLR
Sbjct: 366 GLWLF*VLY*QLSLSV--FFLACMGGVVVHQEDSLKMGFAYIVSVVSLFDEYVNDLLYLR 539
Query: 308 EGSCLPR 314
EGSCLPR
Sbjct: 540 EGSCLPR 560
Score = 152 bits (384), Expect = 3e-37
Identities = 98/187 (52%), Positives = 109/187 (57%), Gaps = 58/187 (31%)
Frame = +2
Query: 199 FDVVLITSIAIWKSPYMLFRGWKRLLEDLVGRRGPFLETECVPFAGLAIILWPLAVLGAV 258
FDVVLITSIAIWKSPYMLFRGWKRLLEDLVGRRGPFLETECVPFAGLAIILWPLAVLGAV
Sbjct: 212 FDVVLITSIAIWKSPYMLFRGWKRLLEDLVGRRGPFLETECVPFAGLAIILWPLAVLGAV 391
Query: 259 LAAIIVSFFLSLY-------GGVVVHQEDSLKM----GFAY--------IVSVVSLFDEY 299
LAAIIVSFFLSLY GG++ ED + + GF + ++ L +
Sbjct: 392 LAAIIVSFFLSLYGWSRCPSGGLI---EDGVCLHCICGFTF**ICE*FTLLERRILPSQV 562
Query: 300 VNDLLYLREGSCLP---------------------------------------RPIYRKN 320
+++ LY LP RPIYRKN
Sbjct: 563 LSNFLYHAISFFLPCLKIIIVEKGCFF*YSSY*VSLFSVANIGH*FQVVLVTARPIYRKN 742
Query: 321 VKHAVER 327
VKHAVER
Sbjct: 743 VKHAVER 763
>TC89309 weakly similar to GP|9758091|dbj|BAB08535.1
dbj|BAA95714.1~gene_id:MNF13.19~strong similarity to
unknown protein {Arabidopsis thaliana}, partial (7%)
Length = 876
Score = 109 bits (272), Expect = 3e-24
Identities = 61/121 (50%), Positives = 83/121 (68%), Gaps = 1/121 (0%)
Frame = +3
Query: 421 LTSAKSNSDGLVISDDIELTRMNGPKDRVFEWFIGPLLIMKEQLKNLELEESEETCLKEL 480
L SAK NS+G++I+DD ELT N P+++ FEWF+ PLLI+KEQ+K L SEE L +L
Sbjct: 42 LRSAKVNSEGILINDDTELTPSNRPREKFFEWFLNPLLIIKEQIKAENLSASEEDYLCKL 221
Query: 481 VMRSKNDIPEDWDSTGFPSKDN-VRRAQLQAIIRRLQGIGSSMSRMPTFRRKFRNLVKIL 539
V+ + D +S P ++ V+RA+L A+ RRLQGI SMSR PTF+R+F LV+ L
Sbjct: 222 VL-LRGDAERIKNSCIGPGPESEVKRAELDALARRLQGITRSMSRFPTFKRRFDELVRTL 398
Query: 540 Y 540
Y
Sbjct: 399 Y 401
>TC84608 weakly similar to GP|9758091|dbj|BAB08535.1
dbj|BAA95714.1~gene_id:MNF13.19~strong similarity to
unknown protein {Arabidopsis thaliana}, partial (12%)
Length = 614
Score = 53.1 bits (126), Expect = 3e-07
Identities = 28/67 (41%), Positives = 37/67 (54%), Gaps = 2/67 (2%)
Frame = +1
Query: 21 LAGALIGPIAFAIMVVGNSAVIIGLWTAHVFWTYYCVARLYFL--SKFFVICFICSNSIS 78
+ G +I P+A I+ +GN +IIGLW HV W+YYCV R L S VIC
Sbjct: 409 IKGVIICPLACLIISIGNFGIIIGLWPVHVIWSYYCVLRAKQLGPSLKAVICIFVLP--V 582
Query: 79 LINCWPL 85
L+ WP+
Sbjct: 583 LLTLWPI 603
>TC82096 similar to PIR|D85437|D85437 hypothetical protein AT4g37030
[imported] - Arabidopsis thaliana, partial (39%)
Length = 740
Score = 46.6 bits (109), Expect = 3e-05
Identities = 18/40 (45%), Positives = 29/40 (72%)
Frame = +1
Query: 21 LAGALIGPIAFAIMVVGNSAVIIGLWTAHVFWTYYCVARL 60
+ G ++GPIA I+++GN VI+GL+ AHV WT Y + ++
Sbjct: 163 IKGFIVGPIAALILIIGNVGVILGLFPAHVAWTVYTLLKI 282
Score = 34.3 bits (77), Expect = 0.13
Identities = 14/21 (66%), Positives = 18/21 (85%)
Frame = +1
Query: 200 DVVLITSIAIWKSPYMLFRGW 220
+V L T+IAI KSPY+LF+GW
Sbjct: 661 EVPLFTAIAIVKSPYLLFKGW 723
>BE941840
Length = 614
Score = 38.1 bits (87), Expect = 0.009
Identities = 17/40 (42%), Positives = 28/40 (69%)
Frame = +3
Query: 512 IRRLQGIGSSMSRMPTFRRKFRNLVKILYIEALQASASAK 551
I R+ G+ +S++PT+RRKFR ++K L+ +L+ AS K
Sbjct: 471 IVRMIGMIRGVSKLPTYRRKFRQVLKALFTHSLEKDASEK 590
>AW690560 similar to GP|11993482|gb| ubiquitin-specific protease 21
{Arabidopsis thaliana}, partial (6%)
Length = 639
Score = 31.2 bits (69), Expect = 1.1
Identities = 10/31 (32%), Positives = 17/31 (54%)
Frame = +3
Query: 70 CFICSNSISLINCWPLKTEQRGLDWFSRLRR 100
C+ C + I+CW +K+E +W R +R
Sbjct: 387 CY*CFSLPQTISCWAIKSENSSFEWIGRYQR 479
>TC78933 similar to GP|17473880|gb|AAL38360.1 putative protein {Arabidopsis
thaliana}, partial (14%)
Length = 710
Score = 30.8 bits (68), Expect = 1.4
Identities = 22/80 (27%), Positives = 36/80 (44%), Gaps = 1/80 (1%)
Frame = +1
Query: 464 LKNLELEESEETCLKELVMRSKNDIPEDWDST-GFPSKDNVRRAQLQAIIRRLQGIGSSM 522
+ + E+ E EE C +EL S+N D T + K R +++ G+G S
Sbjct: 208 ISDSEIGEYEEKCYEELKSGSQNVKTSDEKFTCPYCPKKRKRDYLYNELLQHASGVGQSS 387
Query: 523 SRMPTFRRKFRNLVKILYIE 542
S+ R K +L + Y+E
Sbjct: 388 SQKRKPREKATHLALVKYLE 447
>CA917730 similar to PIR|C86310|C863 protein F1L3.2 [imported] - Arabidopsis
thaliana, partial (2%)
Length = 786
Score = 30.8 bits (68), Expect = 1.4
Identities = 19/67 (28%), Positives = 33/67 (48%), Gaps = 5/67 (7%)
Frame = +2
Query: 36 VGNSAVIIGLWTAHVFWTYYCVARLYFLSKFFV-----ICFICSNSISLINCWPLKTEQR 90
VG+S G++T + +T+ C ++ S F V F+C+ +++ CWP
Sbjct: 455 VGSSDKTRGIYTLILIYTFLCHKTSHYWSIFEVQQHCLFLFMCNPKVTISICWPNLGYVI 634
Query: 91 GLDWFSR 97
GL FS+
Sbjct: 635 GLLSFSK 655
>TC82256 similar to GP|20042913|gb|AAM08741.1 Unknown protein {Oryza sativa
(japonica cultivar-group)}, partial (9%)
Length = 897
Score = 30.8 bits (68), Expect = 1.4
Identities = 18/50 (36%), Positives = 33/50 (66%), Gaps = 4/50 (8%)
Frame = +3
Query: 296 FDEYVNDLLYLREGSCL--PRPIYRK-NVKHAVERKKLE-GIDHNLKNRR 341
FDE+V+ + LR+GS +YR+ ++KH++ RK +E + HN+++ R
Sbjct: 384 FDEFVSRIFSLRDGSTAIHTLDLYRRHSMKHSLLRKIIEYAVSHNVQHLR 533
>TC91391 weakly similar to GP|21427473|gb|AAM53249.1 actin-related protein
8A {Arabidopsis thaliana}, partial (17%)
Length = 666
Score = 30.4 bits (67), Expect = 1.9
Identities = 18/61 (29%), Positives = 29/61 (47%)
Frame = +1
Query: 218 RGWKRLLEDLVGRRGPFLETECVPFAGLAIILWPLAVLGAVLAAIIVSFFLSLYGGVVVH 277
R W +L+ + +RGP + T C PF W L + + FF ++YG + V+
Sbjct: 496 RRWFWILQIWLEQRGPSIGTSCYPF-------WNLVMWKRQYILRLRHFFATVYGRMKVN 654
Query: 278 Q 278
Q
Sbjct: 655 Q 657
>TC85533 similar to SP|P42043|HMZ1_ARATH Ferrochelatase I
chloroplast/mitochondrial precursor (EC 4.99.1.1)
(Protoheme ferro-lyase), partial (76%)
Length = 1667
Score = 28.9 bits (63), Expect = 5.4
Identities = 17/69 (24%), Positives = 32/69 (45%)
Frame = -2
Query: 369 VWDWFFKSCEVNGRIVLRDGLISVKEIEECIFKGNCKKLSIKLPAWSLLQCLLTSAKSNS 428
+W++F +V I + L+ + ++ K + + L LTS KSNS
Sbjct: 1594 LWEYFQADIQVLIFIKYNNSLLLYISVTYVLYPERSKTKEKNMKYTNFLSIALTSMKSNS 1415
Query: 429 DGLVISDDI 437
G++++D I
Sbjct: 1414 GGILLNDMI 1388
>TC88525 pyrroline-5-carboxylate synthetase 1 [Medicago truncatula]
Length = 2520
Score = 28.9 bits (63), Expect = 5.4
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 9/41 (21%)
Frame = -3
Query: 49 HVFWTYYCVARLYFL---------SKFFVICFICSNSISLI 80
HV Y V + Y L FFV CF+CSN++S I
Sbjct: 1207 HVCNNSYVVGKTYDLLTL*NQPSNQGFFVSCFLCSNNVSFI 1085
>TC78713 similar to GP|21310093|gb|AAM46142.1 terminal flower-like protein 1
{Vitis vinifera}, partial (95%)
Length = 931
Score = 28.5 bits (62), Expect = 7.1
Identities = 10/28 (35%), Positives = 16/28 (56%)
Frame = +1
Query: 55 YCVARLYFLSKFFVICFICSNSISLINC 82
YC Y+ + FFV+C IC + ++C
Sbjct: 808 YCPLDKYYCTVFFVMCRICIKVVIFLSC 891
>BE325693 similar to GP|18087608|gb AT3g51050/F24M12_90 {Arabidopsis
thaliana}, partial (17%)
Length = 445
Score = 28.1 bits (61), Expect = 9.3
Identities = 14/32 (43%), Positives = 20/32 (61%)
Frame = +3
Query: 237 TECVPFAGLAIILWPLAVLGAVLAAIIVSFFL 268
T+CV G ++LWPLA+L A+ +S FL
Sbjct: 339 TKCVSCLGDDLLLWPLALLTAIELDNHISRFL 434
>AJ501232 homologue to GP|16186265|gb ring box-1-like protein {Arabidopsis
thaliana}, partial (85%)
Length = 574
Score = 28.1 bits (61), Expect = 9.3
Identities = 16/43 (37%), Positives = 25/43 (57%)
Frame = +2
Query: 263 IVSFFLSLYGGVVVHQEDSLKMGFAYIVSVVSLFDEYVNDLLY 305
+VS+FL + +++ QE SL GF+Y + F Y N L+Y
Sbjct: 443 VVSYFLLIVYWILLFQESSLFNGFSYTPTANVKF--YFNVLIY 565
>BQ137274 GP|12053007|e hypothetical protein {Homo sapiens}, partial (1%)
Length = 863
Score = 28.1 bits (61), Expect = 9.3
Identities = 29/134 (21%), Positives = 56/134 (41%), Gaps = 2/134 (1%)
Frame = -3
Query: 2 IIINVECIVKVNIHY--KVIHLAGALIGPIAFAIMVVGNSAVIIGLWTAHVFWTYYCVAR 59
+I +V + ++++ +++L G LIG + + V ++++ ++F YY +
Sbjct: 465 VIYSVFILYMIDVYMICDLLYLVGILIGFLNCFLFVYFYKSMVV---FCYLFMFYYLLL* 295
Query: 60 LYFLSKFFVICFICSNSISLINCWPLKTEQRGLDWFSRLRR*YVCRFLCYFCR*LVLLVA 119
+Y + +F+I F C LI + RF CY LV
Sbjct: 294 IYKI*IYFLI*FFCLFFFILI----------------------IMRFFCY-----SFLVN 196
Query: 120 F*VELDMAFLLLFL 133
F L + F L+F+
Sbjct: 195 FFCNLTIIFFLMFM 154
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.333 0.144 0.458
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 19,491,565
Number of Sequences: 36976
Number of extensions: 317118
Number of successful extensions: 2223
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 2191
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2216
length of query: 558
length of database: 9,014,727
effective HSP length: 101
effective length of query: 457
effective length of database: 5,280,151
effective search space: 2413029007
effective search space used: 2413029007
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 61 (28.1 bits)
Medicago: description of AC144618.8