
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146856.2 + phase: 0
(88 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
NP595172 polyprotein [Glycine max] 55 5e-09
TC212592 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 51 1e-07
TC213413 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 44 2e-05
TC204929 similar to GB|AAD39665.1|5103835|F9L1 ESTs gb|T22508, g... 43 2e-05
TC211469 37 0.001
BE211654 37 0.002
TC211627 37 0.002
TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, parti... 36 0.004
BU549069 35 0.005
TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, parti... 32 0.055
CA784736 32 0.055
BU578017 29 0.35
TC228261 similar to UP|DCL_LYCES (Q42463) DCL protein, chloropla... 27 1.3
TC204422 27 1.3
TC212522 weakly similar to UP|O82138 (O82138) High affinity nitr... 26 3.9
TC215000 homologue to GB|AAP13371.1|30023676|BT006263 At3g07430 ... 25 5.1
TC208982 similar to UP|Q8S976 (Q8S976) Auxin response factor 10 ... 25 5.1
BG651978 25 5.1
BF425356 similar to PIR|T08400|T08 late embryonic abundant prote... 25 6.7
CK606262 25 6.7
>NP595172 polyprotein [Glycine max]
Length = 4659
Score = 55.5 bits (132), Expect = 5e-09
Identities = 31/87 (35%), Positives = 53/87 (60%), Gaps = 9/87 (10%)
Frame = +1
Query: 4 QPLKVLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKVV-EGDG 62
QP+K+L +R + Q+LVQW+ +EATWED ++++ S+P+ LEDKVV +G+G
Sbjct: 4324 QPVKILASRIIIRGHNQIEQILVQWENGLQDEATWEDIEDIKASYPTFNLEDKVVFKGEG 4503
Query: 63 NDTYSLD--------SKHATEPGIRNE 81
N T + ++ ++E G+ N+
Sbjct: 4504 NVTNGMSRGEKVNNTAESSSERGLHNK 4584
>TC212592 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%)
Length = 664
Score = 50.8 bits (120), Expect = 1e-07
Identities = 33/91 (36%), Positives = 55/91 (60%), Gaps = 5/91 (5%)
Frame = +2
Query: 2 DHQ----PLKVLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKV 57
DHQ PL +L TR++ ++LVQW+GLSP++ATWE E+ + LEDKV
Sbjct: 251 DHQSVIAPLVILATRTVNDDD---IEVLVQWQGLSPDDATWEKWTELCKEFH---LEDKV 412
Query: 58 V-EGDGNDTYSLDSKHATEPGIRNEQLNTKK 87
+ G NDT ++ +T+ I+N++ ++++
Sbjct: 413 LPHGPWNDTG--ETNTSTKTAIQNQEFSSRE 499
>TC213413 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (5%)
Length = 761
Score = 43.5 bits (101), Expect = 2e-05
Identities = 27/80 (33%), Positives = 42/80 (51%), Gaps = 1/80 (1%)
Frame = +1
Query: 5 PLKVLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKVV-EGDGN 63
PL V+ ++ + G + +LVQW S ++A+WED +R + LEDKV+ E G+
Sbjct: 271 PLTVIDSKLVPADNGPRRMVLVQWPSASRQDASWEDWQVLRERYN---LEDKVLSEERGD 441
Query: 64 DTYSLDSKHATEPGIRNEQL 83
DT+ D R +QL
Sbjct: 442 DTHVEDEAMHQRHSARTKQL 501
>TC204929 similar to GB|AAD39665.1|5103835|F9L1 ESTs gb|T22508, gb|H36196 and
gb|AI100134 come from this gene. {Arabidopsis thaliana;}
, partial (53%)
Length = 823
Score = 43.1 bits (100), Expect = 2e-05
Identities = 25/58 (43%), Positives = 35/58 (60%)
Frame = -2
Query: 23 QMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKVVEGDGNDTYSLDSKHATEPGIRN 80
++LVQW+GL PEEA+WE D++R L DKVV +G +DS A +P + N
Sbjct: 576 RVLVQWEGLPPEEASWELWDDLRDLHN---LADKVVFDEG----GVDSNIADQPSVTN 424
>TC211469
Length = 480
Score = 37.4 bits (85), Expect = 0.001
Identities = 21/56 (37%), Positives = 33/56 (58%)
Frame = +2
Query: 5 PLKVLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKVVEG 60
PL +L + + + +LVQW GL PE+ +WE D+++ S+ LED+VV G
Sbjct: 77 PLTILDWKLDSSVTPPRRLVLVQWMGLPPEDTSWELWDDLQQSYN---LEDEVVFG 235
>BE211654
Length = 454
Score = 37.0 bits (84), Expect = 0.002
Identities = 18/40 (45%), Positives = 27/40 (67%), Gaps = 1/40 (2%)
Frame = -3
Query: 24 MLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKV-VEGDG 62
+L++W+GL E +WE DEM+ +P LE+KV +EG G
Sbjct: 452 VLIKWRGLPHHEDSWELADEMQAVFPQLDLENKVHLEGRG 333
>TC211627
Length = 1034
Score = 37.0 bits (84), Expect = 0.002
Identities = 26/84 (30%), Positives = 40/84 (46%)
Frame = +3
Query: 4 QPLKVLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKVVEGDGN 63
QPL+ L + + Q+LVQW L+PE+ TWE +++ + LEDKV G
Sbjct: 687 QPLQFLDWKMDESTTPPIPQVLVQWTNLAPEDTTWESWTQLKDIYD---LEDKVCFQTG- 854
Query: 64 DTYSLDSKHATEPGIRNEQLNTKK 87
+DS P ++ + N K
Sbjct: 855 ---GIDSISTM*PTLQVPRTNVDK 917
>TC211973 weakly similar to UP|Q84ZV5 (Q84ZV5) Polyprotein, partial (4%)
Length = 730
Score = 35.8 bits (81), Expect = 0.004
Identities = 27/72 (37%), Positives = 39/72 (53%), Gaps = 1/72 (1%)
Frame = +3
Query: 8 VLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKV-VEGDGNDTY 66
VL +R L+ P +K +L+QWK L P E +WE +++ + LEDKV + G G D
Sbjct: 402 VLDSRELQ-PGNVK--VLIQWKNLPPSENSWESVAKLQEIFSIYHLEDKVSLLGGGID-- 566
Query: 67 SLDSKHATEPGI 78
KH +P I
Sbjct: 567 ----KHKHKPPI 590
>BU549069
Length = 615
Score = 35.4 bits (80), Expect = 0.005
Identities = 17/46 (36%), Positives = 26/46 (55%)
Frame = -1
Query: 5 PLKVLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSWPS 50
PL++ R+ + + V W G S E+ATWE E +MR ++PS
Sbjct: 363 PLRIEDRRTKHLRRKENPLVKVIWGGTSGEDATWELESQMRVAYPS 226
>TC213114 weakly similar to UP|Q8W150 (Q8W150) Polyprotein, partial (7%)
Length = 810
Score = 32.0 bits (71), Expect = 0.055
Identities = 15/35 (42%), Positives = 23/35 (64%)
Frame = +2
Query: 23 QMLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKV 57
++L+Q + L EATWE + ++ +PS LEDKV
Sbjct: 359 EVLIQLEDLPDFEATWESVEVIKEQFPSFHLEDKV 463
>CA784736
Length = 438
Score = 32.0 bits (71), Expect = 0.055
Identities = 16/29 (55%), Positives = 21/29 (72%)
Frame = +2
Query: 34 EEATWEDEDEMRTSWPSPILEDKVVEGDG 62
+EATWEDED + +++P LE K VE DG
Sbjct: 47 KEATWEDEDSIGSTYPDLNLEHK-VEVDG 130
>BU578017
Length = 425
Score = 29.3 bits (64), Expect = 0.35
Identities = 13/31 (41%), Positives = 18/31 (57%)
Frame = +2
Query: 5 PLKVLGTRSLRTPQGIKTQMLVQWKGLSPEE 35
PL +L R + ++LVQW GLSP+E
Sbjct: 122 PLAILDYRRSSSAANAPWEVLVQWHGLSPDE 214
>TC228261 similar to UP|DCL_LYCES (Q42463) DCL protein, chloroplast precursor
(Defective chloroplasts and leaves protein), partial
(54%)
Length = 997
Score = 27.3 bits (59), Expect = 1.3
Identities = 12/29 (41%), Positives = 16/29 (54%)
Frame = +2
Query: 29 KGLSPEEATWEDEDEMRTSWPSPILEDKV 57
KG+ E A +D+D+ W ILED V
Sbjct: 305 KGILEEHAFEDDDDDKWVDWEDQILEDTV 391
>TC204422
Length = 1820
Score = 27.3 bits (59), Expect = 1.3
Identities = 14/41 (34%), Positives = 21/41 (51%)
Frame = -3
Query: 24 MLVQWKGLSPEEATWEDEDEMRTSWPSPILEDKVVEGDGND 64
M +W+ EE + EDE R S P DK +E +G++
Sbjct: 1146 MAAEWRKGEREETPLQGEDESRRSSPP*EAMDKSLEEEGDE 1024
>TC212522 weakly similar to UP|O82138 (O82138) High affinity nitrate
transporter, partial (41%)
Length = 784
Score = 25.8 bits (55), Expect = 3.9
Identities = 11/31 (35%), Positives = 14/31 (44%)
Frame = -2
Query: 18 QGIKTQMLVQWKGLSPEEATWEDEDEMRTSW 48
QG V+W PE+ TW RT+W
Sbjct: 558 QGFPIHRRVRWPNRPPEQRTWSSS---RTAW 475
>TC215000 homologue to GB|AAP13371.1|30023676|BT006263 At3g07430 {Arabidopsis
thaliana;} , partial (47%)
Length = 742
Score = 25.4 bits (54), Expect = 5.1
Identities = 15/51 (29%), Positives = 22/51 (42%)
Frame = -1
Query: 28 WKGLSPEEATWEDEDEMRTSWPSPILEDKVVEGDGNDTYSLDSKHATEPGI 78
W G E +E + R +W P+L +K G+G T +H GI
Sbjct: 199 WAG*RREGLRFEAQASGRRTWGVPVLSNKGKRGNGVLTQ*QRHRHLNPLGI 47
>TC208982 similar to UP|Q8S976 (Q8S976) Auxin response factor 10 (Fragment),
partial (19%)
Length = 675
Score = 25.4 bits (54), Expect = 5.1
Identities = 14/52 (26%), Positives = 23/52 (43%), Gaps = 6/52 (11%)
Frame = +2
Query: 3 HQPLKVLGTRSLRTPQ------GIKTQMLVQWKGLSPEEATWEDEDEMRTSW 48
+QP +V+ TP+ ++ M +QW + +E ED R SW
Sbjct: 158 NQPFEVVYYPRANTPEFCIRTSAVRGAMRIQWSSGMRFKMPFETEDSSRISW 313
>BG651978
Length = 421
Score = 25.4 bits (54), Expect = 5.1
Identities = 7/20 (35%), Positives = 13/20 (65%)
Frame = -1
Query: 20 IKTQMLVQWKGLSPEEATWE 39
+ MLV W L+P +++W+
Sbjct: 226 VSAMMLVNWVALAPNDSSWD 167
>BF425356 similar to PIR|T08400|T08 late embryonic abundant protein EMB8
homolog F18B3.70 - Arabidopsis thaliana, partial (31%)
Length = 390
Score = 25.0 bits (53), Expect = 6.7
Identities = 9/26 (34%), Positives = 15/26 (57%)
Frame = -1
Query: 37 TWEDEDEMRTSWPSPILEDKVVEGDG 62
TW+ ++E R + ++ VEGDG
Sbjct: 135 TWQQDEEWRIGGQATVVAGDPVEGDG 58
>CK606262
Length = 609
Score = 25.0 bits (53), Expect = 6.7
Identities = 14/48 (29%), Positives = 21/48 (43%), Gaps = 1/48 (2%)
Frame = -2
Query: 2 DHQ-PLKVLGTRSLRTPQGIKTQMLVQWKGLSPEEATWEDEDEMRTSW 48
DH PLK T +G TQ+ + +KG + + D+ E W
Sbjct: 536 DHSHPLKTTTTTKTVKMKGAATQVKIHYKGSDDDYLVFVDDLETYKKW 393
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.307 0.128 0.379
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,875,010
Number of Sequences: 63676
Number of extensions: 45444
Number of successful extensions: 184
Number of sequences better than 10.0: 43
Number of HSP's better than 10.0 without gapping: 184
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 184
length of query: 88
length of database: 12,639,632
effective HSP length: 64
effective length of query: 24
effective length of database: 8,564,368
effective search space: 205544832
effective search space used: 205544832
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 52 (24.6 bits)
Medicago: description of AC146856.2