
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= TM0098b.1
(696 letters)
Database: LJGI
28,460 sequences; 14,692,800 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC12130 similar to UP|Q93YF5 (Q93YF5) SET-domain-containing prot... 354 3e-98
BP042354 347 4e-96
TC8734 similar to UP|Q93YF5 (Q93YF5) SET-domain-containing prote... 296 6e-81
AW720071 126 1e-29
AV409769 110 1e-24
TC18991 similar to UP|SUV9_ARATH (Q9T0G7) Probable histone-lysin... 72 4e-13
TC9091 similar to UP|Q9Y7R4 (Q9Y7R4) Set domain protein, transcr... 64 1e-10
BP062550 58 4e-09
AV406621 50 9e-07
TC11218 similar to UP|Q941H0 (Q941H0) Trithorax 4 (Fragment), pa... 50 9e-07
TC16823 similar to UP|Q8LGA2 (Q8LGA2) Seed maturation-like prote... 38 0.006
TC19535 similar to UP|Q80V17 (Q80V17) LOC233904 protein, partial... 38 0.006
TC14775 PIR|S14181|S14181 DNA-directed RNA polymerase largest c... 36 0.024
BI420111 35 0.040
BP065960 35 0.053
TC18688 weakly similar to UP|Q9LP77 (Q9LP77) T1N15.9, partial (13%) 35 0.053
TC7897 weakly similar to UP|ENL2_ARATH (Q9T076) Early nodulin-li... 34 0.090
TC15958 similar to GB|AAP37838.1|30725632|BT008479 At4g24220 {Ar... 32 0.26
CN825244 32 0.34
TC18080 similar to GB|AAL90935.1|19699138|AY090274 AT3g18790/MVE... 32 0.45
>TC12130 similar to UP|Q93YF5 (Q93YF5) SET-domain-containing protein,
partial (16%)
Length = 499
Score = 354 bits (908), Expect = 3e-98
Identities = 164/165 (99%), Positives = 164/165 (99%)
Frame = +3
Query: 391 KYKLVRLPGQPQAYMIWKSILQWTDKSASRVGVILPDLTSGAEKLPVCLVNDVDNEKGPA 450
KYKLVRLPGQPQAYMIWKSILQWTDKSASRVGVILPDLTSGAEKLPVCLVNDVDNEKGPA
Sbjct: 3 KYKLVRLPGQPQAYMIWKSILQWTDKSASRVGVILPDLTSGAEKLPVCLVNDVDNEKGPA 182
Query: 451 YFTYSPTLKNLNRLAPVESSEGCTCNGGCQPGSHKCGCTQKNGGYLPYSAAGLLADLKSV 510
YFTYSPTLKNLNRLAPVESSEGCTCNGGCQPGSHKCGCTQKNGGYLPYSAAGLLADLKSV
Sbjct: 183 YFTYSPTLKNLNRLAPVESSEGCTCNGGCQPGSHKCGCTQKNGGYLPYSAAGLLADLKSV 362
Query: 511 VYECGPSCHCPPSCRNRVSQGGLKLRLEVFRTKGKGWGLRSWDPI 555
VYECGPSCHCPPSCRNRVSQGGLKLRLE FRTKGKGWGLRSWDPI
Sbjct: 363 VYECGPSCHCPPSCRNRVSQGGLKLRLEXFRTKGKGWGLRSWDPI 497
>BP042354
Length = 535
Score = 347 bits (890), Expect = 4e-96
Identities = 173/174 (99%), Positives = 173/174 (99%)
Frame = -3
Query: 113 NGDVGSSRRKSRGQLPEDDNFVDLSEVDGEGGTGDGKRRKPQKRIREKRCSSDVDPDAVA 172
NGDVGSSRRKSRGQLPEDDNFVDLSEVDGEGGTGDGKRRKPQKRIREKRCSSDVDPDAVA
Sbjct: 533 NGDVGSSRRKSRGQLPEDDNFVDLSEVDGEGGTGDGKRRKPQKRIREKRCSSDVDPDAVA 354
Query: 173 NEILKTINPGVFEILNQPDGSRDAVAYTLMIYEVMRRKLGQIDEKAKGSHSGAKRPDLKA 232
NEILKTIN GVFEILNQPDGSRDAVAYTLMIYEVMRRKLGQIDEKAKGSHSGAKRPDLKA
Sbjct: 353 NEILKTINXGVFEILNQPDGSRDAVAYTLMIYEVMRRKLGQIDEKAKGSHSGAKRPDLKA 174
Query: 233 GTLMNTKGIRANSRKRIGVVPGVEVGDIFFFRFELCLVGLHAPSMAGIDYLGTK 286
GTLMNTKGIRANSRKRIGVVPGVEVGDIFFFRFELCLVGLHAPSMAGIDYLGTK
Sbjct: 173 GTLMNTKGIRANSRKRIGVVPGVEVGDIFFFRFELCLVGLHAPSMAGIDYLGTK 12
>TC8734 similar to UP|Q93YF5 (Q93YF5) SET-domain-containing protein,
partial (13%)
Length = 804
Score = 296 bits (759), Expect = 6e-81
Identities = 141/141 (100%), Positives = 141/141 (100%)
Frame = +1
Query: 556 RAGTFICEYAGEVIDNARVEELSGENEDDYIFDSTRIYQQLEVFSSDVEAPKIPSPLYIT 615
RAGTFICEYAGEVIDNARVEELSGENEDDYIFDSTRIYQQLEVFSSDVEAPKIPSPLYIT
Sbjct: 1 RAGTFICEYAGEVIDNARVEELSGENEDDYIFDSTRIYQQLEVFSSDVEAPKIPSPLYIT 180
Query: 616 ARNEGNVARFMNHSCTPNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPL 675
ARNEGNVARFMNHSCTPNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPL
Sbjct: 181 ARNEGNVARFMNHSCTPNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPL 360
Query: 676 KVGQKKKKCLCGSVKCRGYFC 696
KVGQKKKKCLCGSVKCRGYFC
Sbjct: 361 KVGQKKKKCLCGSVKCRGYFC 423
>AW720071
Length = 377
Score = 126 bits (316), Expect = 1e-29
Identities = 62/129 (48%), Positives = 83/129 (64%), Gaps = 2/129 (1%)
Frame = +3
Query: 438 CLVNDVDNEKGPAYFTYSPTLKNLN--RLAPVESSEGCTCNGGCQPGSHKCGCTQKNGGY 495
C VN +D+EK P FTY ++ + LAP EGC C GC S KC C KNGG
Sbjct: 3 CAVNTIDDEKPPP-FTYITSMMYPDGCNLAP---PEGCYCTNGCSD-SEKCPCVVKNGGE 167
Query: 496 LPYSAAGLLADLKSVVYECGPSCHCPPSCRNRVSQGGLKLRLEVFRTKGKGWGLRSWDPI 555
+P++ + + K +VYECGP C CP +C NRVSQ G+K +LE+F+T +GWG+RS + I
Sbjct: 168 IPFNHDEAIVEAKPLVYECGPCCKCPSTCHNRVSQLGIKFQLEIFKTPTRGWGVRSLNSI 347
Query: 556 RAGTFICEY 564
+G+FICEY
Sbjct: 348 PSGSFICEY 374
>AV409769
Length = 424
Score = 110 bits (274), Expect = 1e-24
Identities = 52/52 (100%), Positives = 52/52 (100%)
Frame = +3
Query: 1 MDHNLGQDPAPAAGSFDKSRVLNVKPLRTLVPVFPSPSNPSSSATPQGGAPF 52
MDHNLGQDPAPAAGSFDKSRVLNVKPLRTLVPVFPSPSNPSSSATPQGGAPF
Sbjct: 267 MDHNLGQDPAPAAGSFDKSRVLNVKPLRTLVPVFPSPSNPSSSATPQGGAPF 422
>TC18991 similar to UP|SUV9_ARATH (Q9T0G7) Probable histone-lysine
N-methyltransferase, H3 lysine-9 specific 9 (Histone
H3-K9 methyltransferase 9) (H3-K9-HMTase 9) (Suppressor
of variegation 3-9 homolog 9) (Su(var)3-9 homolog 9) ,
partial (22%)
Length = 716
Score = 71.6 bits (174), Expect = 4e-13
Identities = 48/132 (36%), Positives = 71/132 (53%), Gaps = 9/132 (6%)
Frame = +3
Query: 549 LRSWDPIRAGTFICEYAGEVIDNARVEELSGENEDDYIF-----DSTRIYQQLEVFSSDV 603
+RS D I+AG FICEY G V+ + + L+ N D I+ + + L + S
Sbjct: 9 VRSLDLIQAGAFICEYTGVVLTTEQAKILT-MNGDTLIYPGRFSERWAEWGDLSLIDSTY 185
Query: 604 EAPKIPS--PLYIT--ARNEGNVARFMNHSCTPNVLWRPVVRENKNEADLHVAFYAIRHI 659
E P PS PL NVA +++HS TPNVL + V+ ++ N H+ +A+ +I
Sbjct: 186 ERPXYPSIPPLXFAMDVSRMRNVACYISHSSTPNVLVQYVLFDHNNLMFPHLMLFAMENI 365
Query: 660 PPMMELTYDYGI 671
PPM EL+ DYG+
Sbjct: 366 PPMRELSLDYGV 401
>TC9091 similar to UP|Q9Y7R4 (Q9Y7R4) Set domain protein, transcriptional
silencing, partial (4%)
Length = 607
Score = 63.5 bits (153), Expect = 1e-10
Identities = 35/80 (43%), Positives = 47/80 (58%)
Frame = +2
Query: 614 ITARNEGNVARFMNHSCTPNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVL 673
I A GNV+RF+NHSC+P+++ V+ E+ + H+ YA R I ELTYDY L
Sbjct: 2 IDASKYGNVSRFINHSCSPHLVSHQVLIESMDCERTHIGLYASRDIALGEELTYDYQYEL 181
Query: 674 PLKVGQKKKKCLCGSVKCRG 693
V + CLC S+KCRG
Sbjct: 182 ---VPGEGSPCLCESLKCRG 232
>BP062550
Length = 525
Score = 58.2 bits (139), Expect = 4e-09
Identities = 33/83 (39%), Positives = 43/83 (51%), Gaps = 4/83 (4%)
Frame = -3
Query: 614 ITARNEGNVARFMNHSCTPNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVL 673
I A++ GNV+RF+NHSC PN+ + V+ + V +A I P ELTYDY L
Sbjct: 439 IDAQSLGNVSRFINHSCEPNLFVQCVLSSYDDIRLARVVLFAGDDIYPYQELTYDYNYKL 260
Query: 674 PLKVGQKKK----KCLCGSVKCR 692
+G KK C CG CR
Sbjct: 259 DSVIGPDKKIKQLPCHCGETTCR 191
>AV406621
Length = 126
Score = 50.4 bits (119), Expect = 9e-07
Identities = 24/41 (58%), Positives = 31/41 (75%), Gaps = 1/41 (2%)
Frame = +3
Query: 301 SGGYEDNVEDGDVLIYSGQGGTSR-EKGASDQKLERGNLAL 340
SGGYED+V++GDV+IYSG GG + + QKLE GNLA+
Sbjct: 3 SGGYEDDVDEGDVIIYSGHGGQDKHSRQVFHQKLEGGNLAM 125
>TC11218 similar to UP|Q941H0 (Q941H0) Trithorax 4 (Fragment), partial (26%)
Length = 535
Score = 50.4 bits (119), Expect = 9e-07
Identities = 27/76 (35%), Positives = 39/76 (50%)
Frame = +3
Query: 619 EGNVARFMNHSCTPNVLWRPVVRENKNEADLHVAFYAIRHIPPMMELTYDYGIVLPLKVG 678
+GN+AR +NHSC PN R ++ E + + A H+ ELTYDY + +
Sbjct: 6 QGNIARLINHSCMPNCYAR-IMSLGDQEEESRIVLIAQTHVSAGEELTYDYKFEVDER-E 179
Query: 679 QKKKKCLCGSVKCRGY 694
+ K CLC + CR Y
Sbjct: 180 EPKVPCLCKASNCRKY 227
>TC16823 similar to UP|Q8LGA2 (Q8LGA2) Seed maturation-like protein, partial
(29%)
Length = 711
Score = 37.7 bits (86), Expect = 0.006
Identities = 30/99 (30%), Positives = 43/99 (43%)
Frame = +1
Query: 3 HNLGQDPAPAAGSFDKSRVLNVKPLRTLVPVFPSPSNPSSSATPQGGAPFVCVSPSGPFP 62
H+L A AA S + P+ + P+ P P SS P G P SPS P
Sbjct: 118 HHLSSSAAAAASSPPAT------PMGSRRPLLPHPLTMSS---PPGNPPSTAASPSSPTC 270
Query: 63 SGVAPFYPFFVSPESQRLSEQNAQTPTAQRAAPISAAVP 101
S + ++P SQR+S +TP ++ + P S P
Sbjct: 271 SAASSPS---ITPSSQRVSPTPPETP*SRPSPPCSVCSP 378
>TC19535 similar to UP|Q80V17 (Q80V17) LOC233904 protein, partial (4%)
Length = 541
Score = 37.7 bits (86), Expect = 0.006
Identities = 20/43 (46%), Positives = 23/43 (52%)
Frame = +1
Query: 651 VAFYAIRHIPPMMELTYDYGIVLPLKVGQKKKKCLCGSVKCRG 693
+ YA RHI E+TY Y L +KK C CGS KCRG
Sbjct: 7 IFLYAXRHIASGEEITYXYKFPLE----EKKIPCNCGSQKCRG 123
>TC14775 PIR|S14181|S14181 DNA-directed RNA polymerase largest chain
(isoform B1) - soybean (fragment) {Glycine
max;} , partial (22%)
Length = 848
Score = 35.8 bits (81), Expect = 0.024
Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Frame = +1
Query: 36 SPSNPSSSATPQGGAPFV-CVSPSGPFPSGVAPFYPFFVSPESQRLSEQNAQTPTAQRAA 94
SP++P+ S T G +P SP+ P S +P Y +P+S + S A +P++ R +
Sbjct: 10 SPTSPAYSPTSPGYSPTSPSYSPTSPSYSPTSPSY----NPQSAKYSPSLAYSPSSPRLS 177
Query: 95 PISAAVPINSFRTPT 109
P S P + +PT
Sbjct: 178 PSSPYSPTSPNYSPT 222
Score = 35.4 bits (80), Expect = 0.031
Identities = 23/65 (35%), Positives = 33/65 (50%), Gaps = 1/65 (1%)
Frame = +1
Query: 36 SPSNPSSSATPQGGAPFV-CVSPSGPFPSGVAPFYPFFVSPESQRLSEQNAQTPTAQRAA 94
SP++PS S T +P SPS P+ SGV+P Y SP S + S +P+ +
Sbjct: 214 SPTSPSYSPTSPSYSPSSPTYSPSSPYNSGVSPDY----SPSSPQFSPSTGYSPSQPGYS 381
Query: 95 PISAA 99
P S +
Sbjct: 382 PSSTS 396
>BI420111
Length = 528
Score = 35.0 bits (79), Expect = 0.040
Identities = 33/102 (32%), Positives = 43/102 (41%), Gaps = 1/102 (0%)
Frame = +3
Query: 28 RTLVPVFPSPSNPSSSATPQGGAPFVCVSPSGPFPSGVAPFYPFFVSPESQRLSEQNAQT 87
RTLV FPS S S A+P A + PS + S A S + RLS +
Sbjct: 240 RTLVATFPSASMVSVVASPFAAASLDPIFPSSIWSSSSAVRPRSMESRTTSRLSASDPSV 419
Query: 88 PTAQRAAPISAAVPINSFRTPTGATNG-DVGSSRRKSRGQLP 128
PT SA ++ RT +T+ SSRR+ R P
Sbjct: 420 PTR------SARCSTSTRRTTLASTSSVAASSSRRRQRKASP 527
>BP065960
Length = 491
Score = 34.7 bits (78), Expect = 0.053
Identities = 20/44 (45%), Positives = 22/44 (49%)
Frame = -1
Query: 650 HVAFYAIRHIPPMMELTYDYGIVLPLKVGQKKKKCLCGSVKCRG 693
H+ YA R I ELT+DY V P K CLCGS C G
Sbjct: 449 HIGLYASRDIALGEELTFDYHKVGP----GKGTPCLCGSSNCGG 330
>TC18688 weakly similar to UP|Q9LP77 (Q9LP77) T1N15.9, partial (13%)
Length = 514
Score = 34.7 bits (78), Expect = 0.053
Identities = 23/84 (27%), Positives = 35/84 (41%)
Frame = +3
Query: 9 PAPAAGSFDKSRVLNVKPLRTLVPVFPSPSNPSSSATPQGGAPFVCVSPSGPFPSGVAPF 68
P PA GS + R+ P PSP+N +++P + VS S P P+ P
Sbjct: 204 PPPATGSASTATPTPPTSSRSASPPSPSPANSHMASSPPSPTSALSVSVSTPSPAHSPPT 383
Query: 69 YPFFVSPESQRLSEQNAQTPTAQR 92
P +P S + +P + R
Sbjct: 384 SP--PAPRSATSTSSRISSPVSSR 449
>TC7897 weakly similar to UP|ENL2_ARATH (Q9T076) Early nodulin-like protein
2 precursor (Phytocyanin-like protein), partial (31%)
Length = 1482
Score = 33.9 bits (76), Expect = 0.090
Identities = 27/118 (22%), Positives = 43/118 (35%)
Frame = +3
Query: 11 PAAGSFDKSRVLNVKPLRTLVPVFPSPSNPSSSATPQGGAPFVCVSPSGPFPSGVAPFYP 70
P+A + ++ P + P SPS+ +S +P GG SP+G P+G P
Sbjct: 702 PSASTSPSPSPVSAPPAASPAPGAESPSSSPTSNSPAGGPTATSPSPAGDSPAGGPP--- 872
Query: 71 FFVSPESQRLSEQNAQTPTAQRAAPISAAVPINSFRTPTGATNGDVGSSRRKSRGQLP 128
A +PT P ++ ++ TP+ SS S P
Sbjct: 873 --------------APSPTTGDTPPSASGPDVSPSGTPSSLPADTPSSSSNNSTAPPP 1004
>TC15958 similar to GB|AAP37838.1|30725632|BT008479 At4g24220 {Arabidopsis
thaliana;}, partial (52%)
Length = 800
Score = 32.3 bits (72), Expect = 0.26
Identities = 23/82 (28%), Positives = 39/82 (47%), Gaps = 3/82 (3%)
Frame = +3
Query: 35 PSPSNPSSSATPQGGA---PFVCVSPSGPFPSGVAPFYPFFVSPESQRLSEQNAQTPTAQ 91
PSP++ SSSA+P A P + P+ P +G + P + P ++ Q+ T+
Sbjct: 237 PSPASDSSSASPASSATASPRFSLFPTPPAAAGKSTASPAALDPPGTPIT----QSTTSS 404
Query: 92 RAAPISAAVPINSFRTPTGATN 113
+PI +S +PT T+
Sbjct: 405 ATSPIPTTPKPSSPSSPTSPTS 470
>CN825244
Length = 646
Score = 32.0 bits (71), Expect = 0.34
Identities = 23/74 (31%), Positives = 35/74 (47%), Gaps = 9/74 (12%)
Frame = +1
Query: 427 DLTSGAEKLPVCLVNDVDNEKGPAYFTYSPTLKNLN--------RLAPVESSEGC-TCNG 477
DLT G E++ + VN+ ++ P+ F Y P +NL L+ + + + C TC G
Sbjct: 433 DLTKGEERVKISWVNNTTDDHPPS-FHYIP--RNLVFRDAHLDISLSRIGNEDCCFTCMG 603
Query: 478 GCQPGSHKCGCTQK 491
C S C C K
Sbjct: 604 NCVLSSKPCRCANK 645
>TC18080 similar to GB|AAL90935.1|19699138|AY090274 AT3g18790/MVE11_17
{Arabidopsis thaliana;}, partial (24%)
Length = 585
Score = 31.6 bits (70), Expect = 0.45
Identities = 21/66 (31%), Positives = 31/66 (46%)
Frame = -1
Query: 32 PVFPSPSNPSSSATPQGGAPFVCVSPSGPFPSGVAPFYPFFVSPESQRLSEQNAQTPTAQ 91
P SPS+P+++ PQ + SPS P+PS +P P P + S Q P +
Sbjct: 219 PSSQSPSHPATARAPQTPSHSPPASPSSPYPSS-SPPQPHPPPPPLESPSPQPTPQPPPR 43
Query: 92 RAAPIS 97
+ P S
Sbjct: 42 FSHPSS 25
Database: LJGI
Posted date: Jul 30, 2004 11:16 AM
Number of letters in database: 14,692,800
Number of sequences in database: 28,460
Lambda K H
0.317 0.136 0.413
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,397,958
Number of Sequences: 28460
Number of extensions: 193344
Number of successful extensions: 1276
Number of sequences better than 10.0: 89
Number of HSP's better than 10.0 without gapping: 1236
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1260
length of query: 696
length of database: 4,897,600
effective HSP length: 97
effective length of query: 599
effective length of database: 2,136,980
effective search space: 1280051020
effective search space used: 1280051020
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 58 (26.9 bits)
Lotus: description of TM0098b.1