
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140721.4 + phase: 0
(101 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC211896 UP|P93165 (P93165) Late embryogenesis-abundant protein ... 169 2e-43
TC228814 homologue to UP|O82464 (O82464) Late embryogenic abunda... 153 1e-38
TC206917 weakly similar to UP|O23238 (O23238) Ribosomal protein ... 32 0.049
TC233121 similar to GB|AAP13385.1|30023704|BT006277 At3g56360 {A... 30 0.14
BM567735 30 0.14
BF424213 weakly similar to GP|21322711|em pherophorin-dz1 protei... 28 0.71
AW705627 weakly similar to GP|30017239|gb| At5g49350 {Arabidopsi... 28 0.93
AW100271 weakly similar to GP|15724206|gb| AT5g04420/T32M21_20 {... 28 0.93
TC233516 similar to UP|Q9SBM1 (Q9SBM1) Hydroxyproline-rich glyco... 28 0.93
TC217596 weakly similar to UP|Q8K357 (Q8K357) Procr protein, par... 27 1.2
BG791134 GP|27261013|db P0434A03.29 {Oryza sativa (japonica cult... 27 1.6
AW620463 27 1.6
BE803715 27 2.1
TC229729 similar to GB|AAO63276.1|28950705|BT005212 At2g42260 {A... 27 2.1
TC217535 26 2.7
BF068994 similar to GP|15010768|gb| AT3g06130/F28L1_7 {Arabidops... 26 2.7
CF921682 26 3.5
TC215142 similar to GB|AAL32703.1|17065098|AY062625 nucleotide s... 26 3.5
BM309222 26 3.5
>TC211896 UP|P93165 (P93165) Late embryogenesis-abundant protein (Em
protein), complete
Length = 789
Score = 169 bits (429), Expect = 2e-43
Identities = 81/101 (80%), Positives = 94/101 (92%)
Frame = +1
Query: 1 MASKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEM 60
MAS+Q N+QEL+E+A+QGETVVPGGTGGKSLEAQ+HLAEGRSKGGQTRKEQLGTEGYQEM
Sbjct: 64 MASRQNNKQELDERARQGETVVPGGTGGKSLEAQQHLAEGRSKGGQTRKEQLGTEGYQEM 243
Query: 61 GRKGGLSTMEKSGGERAEEEGIDIDESKFKTGGGGGRSQNK 101
GRKGGLST++KSG ERA+EEGI IDESKF+TG ++QN+
Sbjct: 244 GRKGGLSTVDKSGEERAQEEGIGIDESKFRTGNNKNQNQNE 366
>TC228814 homologue to UP|O82464 (O82464) Late embryogenic abundant protein,
partial (98%)
Length = 688
Score = 153 bits (387), Expect = 1e-38
Identities = 80/110 (72%), Positives = 87/110 (78%), Gaps = 20/110 (18%)
Frame = +3
Query: 1 MASKQQNRQELEEKAKQGETVVPGGTGGKSLEAQEHLAEGRS------------------ 42
M S+Q NR+EL+EKA+QGETVVPGGTGGKSLEAQEHLAEGRS
Sbjct: 57 MESQQANREELDEKARQGETVVPGGTGGKSLEAQEHLAEGRSRGGQTRKQQLGSEGYHEM 236
Query: 43 --KGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDESKFK 90
KGGQTRKEQ+G EGYQEMGRKGGLSTM+KSGGERAEEEGI+IDESKFK
Sbjct: 237 GTKGGQTRKEQMGREGYQEMGRKGGLSTMDKSGGERAEEEGIEIDESKFK 386
>TC206917 weakly similar to UP|O23238 (O23238) Ribosomal protein
(At4g36420/C7A10_940), partial (69%)
Length = 1186
Score = 32.0 bits (71), Expect = 0.049
Identities = 21/70 (30%), Positives = 30/70 (42%)
Frame = +3
Query: 24 GGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGID 83
G GG + EA+ A R G EG+ E R+G +SGGE ++G+
Sbjct: 336 GPGGGHAREARRQRASDRDAHGARNGR---AEGHSEGSREGRRRCHRRSGGEGGGKDGVR 506
Query: 84 IDESKFKTGG 93
+ F GG
Sbjct: 507 CEARGF*RGG 536
>TC233121 similar to GB|AAP13385.1|30023704|BT006277 At3g56360 {Arabidopsis
thaliana;} , partial (36%)
Length = 990
Score = 30.4 bits (67), Expect = 0.14
Identities = 32/94 (34%), Positives = 44/94 (46%), Gaps = 8/94 (8%)
Frame = +1
Query: 12 EEKAKQGETVVPGGTGGKSLEAQEHLAEG---RSKGGQTRK---EQLGTEGYQEMGRKGG 65
E +A++ E P GTG ++ + E + G +GG Q+G +G E G GG
Sbjct: 310 EARARRSEKRRPRGTG-EAGQVDERVGGGDREEFEGGAAAGAGFSQIGDDGDGEEGGGGG 486
Query: 66 LSTMEKSGGER--AEEEGIDIDESKFKTGGGGGR 97
L+ E + G R A GIDI G GGGR
Sbjct: 487 LAGGEGAVGIRRDAGSGGIDI-RGGVGRGHGGGR 585
>BM567735
Length = 428
Score = 30.4 bits (67), Expect = 0.14
Identities = 20/70 (28%), Positives = 32/70 (45%)
Frame = -3
Query: 28 GKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDES 87
G +L+A RS G R G G + G +E++GG ++ EG+ ++E
Sbjct: 408 GHALDATVQ*VNARSGNGMQRALGNGDLGLERCDGGIGEEEVERTGGSGSDNEGL-LEEV 232
Query: 88 KFKTGGGGGR 97
+ GG GR
Sbjct: 231 RGLEGGNAGR 202
>BF424213 weakly similar to GP|21322711|em pherophorin-dz1 protein {Volvox
carteri f. nagariensis}, partial (6%)
Length = 424
Score = 28.1 bits (61), Expect = 0.71
Identities = 16/42 (38%), Positives = 22/42 (52%)
Frame = -3
Query: 24 GGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGG 65
GG G E + EG+ KGG +++ G EG GR+GG
Sbjct: 278 GGEGRVEREGGVRVGEGKRKGGGFKEDGGGWEGVWR-GREGG 156
Score = 25.4 bits (54), Expect = 4.6
Identities = 16/35 (45%), Positives = 18/35 (50%)
Frame = -3
Query: 40 GRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGG 74
GR KGG+ R E+ G E RKGG E GG
Sbjct: 290 GRGKGGEGRVEREGGVRVGEGKRKGG-GFKEDGGG 189
>AW705627 weakly similar to GP|30017239|gb| At5g49350 {Arabidopsis thaliana},
partial (10%)
Length = 391
Score = 27.7 bits (60), Expect = 0.93
Identities = 26/78 (33%), Positives = 33/78 (41%), Gaps = 5/78 (6%)
Frame = -2
Query: 23 PGG-----TGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERA 77
PGG +GG+ L G S G LG G + G +GG T+ SGG +
Sbjct: 384 PGGCGGSVSGGRGGSVSGGLGSGMSGPGGD-SGGLGASGSK--GVEGGSVTI--SGGIKG 220
Query: 78 EEEGIDIDESKFKTGGGG 95
G+ F TGGGG
Sbjct: 219 SISGVGSTGCSFGTGGGG 166
>AW100271 weakly similar to GP|15724206|gb| AT5g04420/T32M21_20
{Arabidopsis thaliana}, partial (16%)
Length = 426
Score = 27.7 bits (60), Expect = 0.93
Identities = 14/23 (60%), Positives = 14/23 (60%)
Frame = +2
Query: 76 RAEEEGIDIDESKFKTGGGGGRS 98
RA GI IDES F GGG RS
Sbjct: 41 RAGHAGITIDESWFIVGGGDNRS 109
>TC233516 similar to UP|Q9SBM1 (Q9SBM1) Hydroxyproline-rich glycoprotein
DZ-HRGP precursor, partial (7%)
Length = 486
Score = 27.7 bits (60), Expect = 0.93
Identities = 21/55 (38%), Positives = 25/55 (45%), Gaps = 2/55 (3%)
Frame = +3
Query: 24 GGTGGKSLEAQEHLAEGRSKGGQTR--KEQLGTEGYQEMGRKGGLSTMEKSGGER 76
GG GG+ E Q H + GGQ E+LG G +GR G SGG R
Sbjct: 243 GGGGGEPRELQVHAHQDAVGGGQAHGGDEELGDGGGLGVGR-WGTRWRGVSGGSR 404
>TC217596 weakly similar to UP|Q8K357 (Q8K357) Procr protein, partial (9%)
Length = 988
Score = 27.3 bits (59), Expect = 1.2
Identities = 24/90 (26%), Positives = 38/90 (41%), Gaps = 9/90 (10%)
Frame = +3
Query: 16 KQGETVVPGGTGGKSLEAQ---------EHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGL 66
++ E+ G GG+ E++ E G GG+ ++ + G+ GY GRK
Sbjct: 141 RKEESEYGSGYGGRKEESEYGSGYDGRKEESEYGSGYGGRKQESEYGS-GYG--GRKQES 311
Query: 67 STMEKSGGERAEEEGIDIDESKFKTGGGGG 96
GG EE S +++ GGGG
Sbjct: 312 EYGSGYGGRSEYEEKPSYGRSNYESQGGGG 401
>BG791134 GP|27261013|db P0434A03.29 {Oryza sativa (japonica
cultivar-group)}, partial (9%)
Length = 415
Score = 26.9 bits (58), Expect = 1.6
Identities = 12/34 (35%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Frame = -3
Query: 69 MEKSGGERAEEEGID-IDESKFKTGGGGGRSQNK 101
++K+ ++ +EG+ ++E K + GGGGGR + K
Sbjct: 335 VKKNWCKKVVDEGVRRVNEKKDEEGGGGGRGREK 234
>AW620463
Length = 398
Score = 26.9 bits (58), Expect = 1.6
Identities = 20/48 (41%), Positives = 25/48 (51%), Gaps = 4/48 (8%)
Frame = -2
Query: 55 EGYQEMGRKGGLSTMEKSGGERAE----EEGIDIDESKFKTGGGGGRS 98
EG + G GG+ E GGE E EEG + E+ + GGGGG S
Sbjct: 388 EGEEGAGPGGGVGG-EVIGGELGEGVEGEEGEEGQEAGEEGGGGGGGS 248
>BE803715
Length = 410
Score = 26.6 bits (57), Expect = 2.1
Identities = 16/56 (28%), Positives = 26/56 (45%), Gaps = 5/56 (8%)
Frame = -2
Query: 51 QLGTEGYQEMGRKGGLSTMEKSGGERAEEEGIDIDESKFKTGGG-----GGRSQNK 101
QL + + R+ G+ + G EEEG D + + +GGG GGR + +
Sbjct: 319 QLEVTRVEALDRERGIRVWGEGSGGFVEEEGADSELERDVSGGGDEGVPGGRGREE 152
>TC229729 similar to GB|AAO63276.1|28950705|BT005212 At2g42260 {Arabidopsis
thaliana;} , partial (12%)
Length = 492
Score = 26.6 bits (57), Expect = 2.1
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 1/37 (2%)
Frame = -1
Query: 61 GRKGGLSTMEKSGG-ERAEEEGIDIDESKFKTGGGGG 96
GR ST + G R+E ID+DE G GGG
Sbjct: 273 GRTASSSTADGGGSLRRSERRRIDVDE*GAAAGDGGG 163
>TC217535
Length = 522
Score = 26.2 bits (56), Expect = 2.7
Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 3/48 (6%)
Frame = -3
Query: 53 GTEGYQEM---GRKGGLSTMEKSGGERAEEEGIDIDESKFKTGGGGGR 97
GT+G +++ GG E R +E + S + GGGGGR
Sbjct: 283 GTDGGEDIDGDAENGGERGEEAEASARYQERVCSVSYSGRRRGGGGGR 140
>BF068994 similar to GP|15010768|gb| AT3g06130/F28L1_7 {Arabidopsis
thaliana}, partial (4%)
Length = 240
Score = 26.2 bits (56), Expect = 2.7
Identities = 13/31 (41%), Positives = 17/31 (53%)
Frame = +1
Query: 71 KSGGERAEEEGIDIDESKFKTGGGGGRSQNK 101
KSGG ++++ K GGGGG SQ K
Sbjct: 136 KSGGGGGNHNNKGQNQNQPKGGGGGGNSQAK 228
>CF921682
Length = 513
Score = 25.8 bits (55), Expect = 3.5
Identities = 18/56 (32%), Positives = 23/56 (40%)
Frame = +2
Query: 23 PGGTGGKSLEAQEHLAEGRSKGGQTRKEQLGTEGYQEMGRKGGLSTMEKSGGERAE 78
PGG G H G KGG+ R G ++GG +K GGE+ E
Sbjct: 317 PGGKKG-------HFPGGNPKGGKPRGG-----GGPPQEKRGGGKKKKKGGGEKGE 448
>TC215142 similar to GB|AAL32703.1|17065098|AY062625 nucleotide sugar
epimerase-like protein {Arabidopsis thaliana;} , partial
(15%)
Length = 681
Score = 25.8 bits (55), Expect = 3.5
Identities = 24/84 (28%), Positives = 35/84 (41%), Gaps = 8/84 (9%)
Frame = -2
Query: 21 VVPGGTGGKSLEAQEHLAEGRSKG----GQTRKEQL----GTEGYQEMGRKGGLSTMEKS 72
V P G G+ L GRS+G G+ R+ + G+ G + R+ + +
Sbjct: 503 VSPEGVXGRVL------GRGRSRG*SCRGRGRRRRAS*GRGSSGSRRSPRRR*QAPTCRW 342
Query: 73 GGERAEEEGIDIDESKFKTGGGGG 96
GGE G D + GGGGG
Sbjct: 341 GGEGGRSGGPDSPRRRRGGGGGGG 270
>BM309222
Length = 428
Score = 25.8 bits (55), Expect = 3.5
Identities = 14/32 (43%), Positives = 20/32 (61%)
Frame = +3
Query: 57 YQEMGRKGGLSTMEKSGGERAEEEGIDIDESK 88
Y +G K S E+ G+R E+EG+D +ESK
Sbjct: 183 YSSLGEKM-TSKREREAGDRVEDEGLD-NESK 272
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.299 0.126 0.331
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,450,988
Number of Sequences: 63676
Number of extensions: 26498
Number of successful extensions: 197
Number of sequences better than 10.0: 58
Number of HSP's better than 10.0 without gapping: 193
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 194
length of query: 101
length of database: 12,639,632
effective HSP length: 77
effective length of query: 24
effective length of database: 7,736,580
effective search space: 185677920
effective search space used: 185677920
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (22.0 bits)
S2: 51 (24.3 bits)
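
Note on the search-space figures above: the effective lengths in this footer can be reproduced from the other reported values. The sketch below is only an illustration, not BLAST's own code. It assumes that the "length of database" line is the nucleotide letter count divided by 3 (which matches the numbers in this TBLASTN report) and that the effective HSP length is subtracted once from the query and once per database sequence. The function and constant names are hypothetical.

```python
# Values copied from the statistics footer of the report above.
QUERY_LENGTH = 101          # "length of query" (amino acids)
DB_LETTERS = 37_918_896     # "Number of letters in database" (nucleotides)
DB_SEQUENCES = 63_676       # "Number of sequences in database"
EFFECTIVE_HSP_LENGTH = 77   # "effective HSP length"


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (db_len, eff_query_len, eff_db_len, search_space).

    Assumption: the translated database length is the nucleotide letter
    count divided by 3, as the figures in this report suggest.
    """
    db_len = db_letters // 3                 # length of database
    eff_query = query_len - hsp_len          # effective length of query
    eff_db = db_len - db_seqs * hsp_len      # effective length of database
    return db_len, eff_query, eff_db, eff_query * eff_db


if __name__ == "__main__":
    result = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print(result)  # (12639632, 24, 7736580, 185677920)
```

The four values agree with the "length of database", "effective length of query", "effective length of database" and "effective search space" lines reported above.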