
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147006.6 + phase: 0 /pseudo
(797 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC218847 65 2e-10
TC206673 54 4e-07
BI321779 51 2e-06
CA851030 46 6e-05
BM891241 39 0.012
TC211561 weakly similar to UP|O22675 (O22675) Reverse transcript... 37 0.026
TC204062 homologue to UP|Q09083 (Q09083) Hydroxyproline-rich gly... 35 0.13
TC213164 35 0.17
TC214098 UP|PRP1_SOYBN (P08012) Repetitive proline-rich cell wal... 35 0.17
TC204046 UP|Q39835 (Q39835) Extensin, complete 34 0.22
BG405621 33 0.38
TC204070 similar to UP|Q39835 (Q39835) Extensin, partial (34%) 33 0.38
CA783280 weakly similar to PIR|T01871|T01 RNA-directed DNA polym... 33 0.49
TC216314 similar to GB|AAP68269.1|31711826|BT008830 At1g70320 {A... 33 0.64
BF424929 similar to GP|9663767|emb| RNA Binding Protein 45 {Nico... 31 2.4
TC224830 similar to UP|Q93WZ6 (Q93WZ6) Abscisic stress ripening-... 31 2.4
TC214359 homologue to UP|PRP1_SOYBN (P08012) Repetitive proline-... 31 2.4
CF922212 31 2.4
TC225237 similar to UP|Q9LEB4 (Q9LEB4) RNA Binding Protein 45, p... 31 2.4
CA800510 28 3.1
>TC218847
Length = 521
Score = 64.7 bits (156), Expect = 2e-10
Identities = 29/94 (30%), Positives = 54/94 (56%)
Frame = -3
Query: 65 WRLLKAKDTLITQRSRSRWLKKGDANSKFFHKCIKLRKSINSIKALEENEGWVVSPFEVC 124
W +A ++L+ Q++R++W+++ D NS +FHK I N++ + WV P V
Sbjct: 282 WTAAQAHESLLRQKARTKWIRERDCNSSYFHKLINYIHRTNAVNGVMIGGSWVEEPNRV- 106
Query: 125 RKVVNYFTNHFAEDRWDRPRLDRVDFESLTEVEN 158
+ V N+F F+E +DRP ++ +F S+ + +N
Sbjct: 105 KAVRNFFQIRFSESDYDRP*INGANFRSIGQQQN 4
>TC206673
Length = 742
Score = 53.5 bits (127), Expect = 4e-07
Identities = 24/81 (29%), Positives = 43/81 (52%)
Frame = +3
Query: 40 LDEKGEEGTLTE*EVGLHKGKLVELWRLLKAKDTLITQRSRSRWLKKGDANSKFFHKCIK 99
+++ +L+ E+ K +LW A ++L+ Q+SR +WLK+GD NS +FH I
Sbjct: 51 VEDLASNRSLSVDEIKAKKELQQQLWEASTAYESLLRQKSRDKWLKEGDNNSAYFHTVIN 230
Query: 100 LRKSINSIKALEENEGWVVSP 120
R+ N ++ + W+ P
Sbjct: 231 FRRHYNGLQGILIQGEWIALP 293
>BI321779
Length = 421
Score = 51.2 bits (121), Expect = 2e-06
Identities = 22/55 (40%), Positives = 37/55 (67%)
Frame = +2
Query: 135 FAEDRWDRPRLDRVDFESLTEVENGLLVAPFSLLEIEEVVRDSDGGKSPGPNGYN 189
F E + RP+L+ V+F S + +N +L+ F + EI++V+ D + KSPGP+GY+
Sbjct: 5 FQEQIYGRPKLNGVEFLSFSNEDNAMLIQDFEVEEIKKVI*DCESSKSPGPDGYS 169
>CA851030
Length = 358
Score = 46.2 bits (108), Expect = 6e-05
Identities = 25/58 (43%), Positives = 38/58 (65%), Gaps = 2/58 (3%)
Frame = +1
Query: 60 KLVE--LWRLLKAKDTLITQRSRSRWLKKGDANSKFFHKCIKLRKSINSIKALEENEG 115
K+VE L LLK ++ QRSR+ W K+ D N++FFH+ RK+ NSI ++++EG
Sbjct: 133 KVVEESLDSLLKQEEEYWQQRSRAIWFKERDKNTEFFHRKTLQRKASNSITKIQDDEG 306
>BM891241
Length = 407
Score = 38.5 bits (88), Expect = 0.012
Identities = 15/26 (57%), Positives = 18/26 (68%)
Frame = -2
Query: 559 KKISWVKWELVCQNKRNGSLGVNSLS 584
KKI WVKWE+VC K G LG+ +S
Sbjct: 388 KKIPWVKWEVVCLPKTKGGLGIKDIS 311
>TC211561 weakly similar to UP|O22675 (O22675) Reverse transcriptase
(Fragment) , partial (59%)
Length = 1077
Score = 37.4 bits (85), Expect = 0.026
Identities = 59/189 (31%), Positives = 79/189 (41%)
Frame = +2
Query: 290 RRKSKNVLFLRWTSRRRMIRLIGVSWSI**LGWECVPSGWLG*KLVFFVGVCPFL*MVVQ 349
R + + F +W +R IR G+ I* W V +G G K V + + F *MV +
Sbjct: 44 RGATNHASFSKWIMKRHTIRFHGIF*CI*CGEWISVRNG*NGLKSVSNLHLFLF**MVAR 223
Query: 350 RRRFAFIEALNKGIPLPLFCFSWRRKDLAA**GMRWTVVCLRALRLVIVVCWFLTCNMLM 409
+ L K LF K **G +W +C LV +VC + +MLM
Sbjct: 224 QLSSYRRGVLGKETH*HLFYSILWLKV*MV**GEQWRKICTSPT*LVQMVCPSASYSMLM 403
Query: 410 MLCVLVNQRWIIFGLLNLSLEALRWRWGSRLTSQRAR**E*MWIMSLWKWLSII*IVVCG 469
L Q + LL SL L W TS RA ++ K II IVVC
Sbjct: 404 TQFFLGKQPRRM*KLLR*SLGHLSWFLI*GSTSLRAVLVCLE*QINGSKRRPIICIVVCW 583
Query: 470 VPVLNIWVF 478
+ L IWV+
Sbjct: 584 LFHLYIWVY 610
Score = 30.4 bits (67), Expect = 3.2
Identities = 10/20 (50%), Positives = 14/20 (70%)
Frame = +1
Query: 559 KKISWVKWELVCQNKRNGSL 578
KKISW++WE +C K G +
Sbjct: 850 KKISWIRWEKLCLPKERGGI 909
>TC204062 homologue to UP|Q09083 (Q09083) Hydroxyproline-rich glycoprotein
precursor, partial (34%)
Length = 722
Score = 35.0 bits (79), Expect = 0.13
Identities = 50/189 (26%), Positives = 78/189 (40%), Gaps = 20/189 (10%)
Frame = -3
Query: 290 RRKSKNVLFLRWTSRRRMIRLIGVSWSI**LGW-ECVPSGWLG*KLVFFVGVCPFL*MVV 348
RR +V++ RW RR RL+ V+W GW V GW + V + +
Sbjct: 603 RRGCIDVVWWRW*MNRRRRRLVLVNW-----GWRRWVLVGWRWWRWVLVWRRWRLVFVDW 439
Query: 349 QRRRFAFIEALNKGIPLPLFCFSWRRKDLAA**GMRWTVVCLRALRLVIVVCWF--LTCN 406
RRRF FI + WRR+ L RW +V R +++ W+ + +
Sbjct: 438 WRRRFVFIN------------WWWRRRVLVWRRRRRWVLVRRRGRFVLVNWGWWRLVLVD 295
Query: 407 MLMMLCVLV---NQRWIIFG------LLN------LSLEALRWRWGSRLTSQRAR**E*M 451
VLV +RW++ G L+N + + LRWRW + R R*
Sbjct: 294 WGWWRWVLVWGRWRRWVLVGRRRRLVLVNWWWRGWVFVGLLRWRWWGLVVVGRRR*VLRW 115
Query: 452 W--IMSLWK 458
W ++ +W+
Sbjct: 114 WWRVIIIWR 88
>TC213164
Length = 446
Score = 34.7 bits (78), Expect = 0.17
Identities = 16/33 (48%), Positives = 19/33 (57%)
Frame = +3
Query: 160 LLVAPFSLLEIEEVVRDSDGGKSPGPNGYNFSF 192
LLVA F E+ + D KSPGP+G NF F
Sbjct: 15 LLVARFEKKEVRATMWDRGNAKSPGPDGLNFKF 113
>TC214098 UP|PRP1_SOYBN (P08012) Repetitive proline-rich cell wall protein 1
precursor, complete
Length = 1144
Score = 34.7 bits (78), Expect = 0.17
Identities = 18/58 (31%), Positives = 25/58 (43%), Gaps = 7/58 (12%)
Frame = -2
Query: 265 LNRLFLN-------VATWWTGSWW*MSWWIM*RRKSKNVLFLRWTSRRRMIRLIGVSW 315
+NR FLN WW WW + WW++ RR L W RR + ++W
Sbjct: 738 VNRRFLNRGLVHRWFVHWWFVHWWLVHWWLLYRR-----LVNWWLVNRRFLNWRLINW 580
>TC204046 UP|Q39835 (Q39835) Extensin, complete
Length = 1627
Score = 34.3 bits (77), Expect = 0.22
Identities = 57/230 (24%), Positives = 83/230 (35%), Gaps = 6/230 (2%)
Frame = -2
Query: 276 WTGSWW*MSWWIM*RRKSKNVLFLRWTSRRRMIRLIGVSWSI**LGWECVPSGWLG*KLV 335
W WW W++ R+ + ++F+ W R R + ++W W W +
Sbjct: 987 WVLVWWGWWRWVLVWRRWR-LVFVDWWRR----RFVFINW-----WWRRRVLVWRRRRR- 841
Query: 336 FFVGVCPFL*MVVQRRRFAFIEALNKGIPLPLFCFSWRRKDLAA**GMRWTVVCLRALRL 395
V+ RRR F+ L L + W R L RW +V R RL
Sbjct: 840 ----------WVLVRRRGRFVLVNWGWWRLVLVDWGWWRWVLVWGRWRRWVLVG-RRRRL 694
Query: 396 VIVVCWFLTCNMLMMLCVLVNQRWIIFGLLNLSLEALRWRWGSRLTSQRAR**E*MWIMS 455
V+V W+ + W+ GL LRWRW + R R*
Sbjct: 693 VLVNWWW--------------RGWVFVGL-------LRWRWWGLVVVGRRR*-------V 598
Query: 456 LWKWLSII------*IVVCGVPVLNIWVFRWVLTRIVCLHGSRWWRT*VV 499
LW W ++ *++ V+ +W R VL RWWR VV
Sbjct: 597 LWWWWRVVVIGRRR*VLRRWWRVIVVWRRR*VL--------RRWWRVIVV 472
>BG405621
Length = 402
Score = 33.5 bits (75), Expect = 0.38
Identities = 14/19 (73%), Positives = 15/19 (78%)
Frame = +2
Query: 77 QRSRSRWLKKGDANSKFFH 95
QRS+ WLK GD NSKFFH
Sbjct: 341 QRSKIFWLKYGDLNSKFFH 397
>TC204070 similar to UP|Q39835 (Q39835) Extensin, partial (34%)
Length = 596
Score = 33.5 bits (75), Expect = 0.38
Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 12/99 (12%)
Frame = -2
Query: 413 VLVNQRWIIFGLLNLSLEALRWRWGSRLTSQRAR**E*MWIMSLWKWLSII*IVV---CG 469
VLVN+ W F +N WRWG * ++ L++W *I V G
Sbjct: 340 VLVNRWWW*FVFVN-------WRWG------------*WVLVGLFRWWWRR*IFVRWWWG 218
Query: 470 VPVLNIW----VFRWVLT-----RIVCLHGSRWWRT*VV 499
+ ++N W VF W+L R+V + RWWR *++
Sbjct: 217 LVLINRWWRRRVFVWLLGWWWWWRVVVVRLLRWWRR*IL 101
>CA783280 weakly similar to PIR|T01871|T01 RNA-directed DNA polymerase
homolog T24M8.8 - Arabidopsis thaliana, partial (4%)
Length = 456
Score = 33.1 bits (74), Expect = 0.49
Identities = 12/25 (48%), Positives = 16/25 (64%)
Frame = +2
Query: 559 KKISWVKWELVCQNKRNGSLGVNSL 583
+KI+WV W+ VC K G LG+ L
Sbjct: 89 RKIAWVNWKTVCLPKAKGGLGIKDL 163
>TC216314 similar to GB|AAP68269.1|31711826|BT008830 At1g70320 {Arabidopsis
thaliana;} , partial (18%)
Length = 759
Score = 32.7 bits (73), Expect = 0.64
Identities = 14/33 (42%), Positives = 19/33 (57%), Gaps = 3/33 (9%)
Frame = +2
Query: 555 WEGVK---KISWVKWELVCQNKRNGSLGVNSLS 584
W G + +I WVKW +C K +G LG+ LS
Sbjct: 125 WGGTQHHNRIPWVKWADICNPKIDGGLGIKDLS 223
>BF424929 similar to GP|9663767|emb| RNA Binding Protein 45 {Nicotiana
plumbaginifolia}, partial (12%)
Length = 419
Score = 30.8 bits (68), Expect = 2.4
Identities = 14/41 (34%), Positives = 18/41 (43%)
Frame = +1
Query: 275 WWTGSWW*MSWWIM*RRKSKNVLFLRWTSRRRMIRLIGVSW 315
WW WW + W R V WT + + +L GVSW
Sbjct: 202 WWCWFWWWILWVCCTRL*KLWVCSCCWTGSQHVWQLSGVSW 324
>TC224830 similar to UP|Q93WZ6 (Q93WZ6) Abscisic stress ripening-like
protein, partial (61%)
Length = 1118
Score = 30.8 bits (68), Expect = 2.4
Identities = 14/32 (43%), Positives = 18/32 (55%), Gaps = 5/32 (15%)
Frame = +2
Query: 275 WWTGSWW*MSWWIM*R-----RKSKNVLFLRW 301
WW WW WWI+*R R++K L+ RW
Sbjct: 452 WWLWRWW---WWIL*RH*WWLRQTKWWLWRRW 538
>TC214359 homologue to UP|PRP1_SOYBN (P08012) Repetitive proline-rich cell
wall protein 1 precursor, partial (52%)
Length = 414
Score = 30.8 bits (68), Expect = 2.4
Identities = 15/56 (26%), Positives = 24/56 (42%)
Frame = -3
Query: 275 WWTGSWW*MSWWIM*RRKSKNVLFLRWTSRRRMIRLIGVSWSI**LGWECVPSGWL 330
WW WW ++WW++ N F+ W +W + + W CV + WL
Sbjct: 226 WWLVHWWLVNWWLI------NGWFVHWWF---------FNWGL--INWRCV-NRWL 113
>CF922212
Length = 445
Score = 30.8 bits (68), Expect = 2.4
Identities = 13/27 (48%), Positives = 15/27 (55%)
Frame = -2
Query: 557 GVKKISWVKWELVCQNKRNGSLGVNSL 583
G KKI+WV W+ VC K LG L
Sbjct: 375 GQKKIAWVNWDSVCLPKEKEGLGNRDL 295
>TC225237 similar to UP|Q9LEB4 (Q9LEB4) RNA Binding Protein 45, partial (79%)
Length = 1560
Score = 30.8 bits (68), Expect = 2.4
Identities = 14/41 (34%), Positives = 18/41 (43%)
Frame = +2
Query: 275 WWTGSWW*MSWWIM*RRKSKNVLFLRWTSRRRMIRLIGVSW 315
WW WW + W R V WT + + +L GVSW
Sbjct: 1175 WWCWFWWWILWVCCTRL*KLWVCSCCWTGSQHVWQLSGVSW 1297
>CA800510
Length = 409
Score = 28.5 bits (62), Expect(2) = 3.1
Identities = 13/23 (56%), Positives = 15/23 (64%)
Frame = -1
Query: 88 DANSKFFHKCIKLRKSINSIKAL 110
D NSK+FH CIK R+ N I L
Sbjct: 97 DYNSKYFHICIKSRQRRNQILEL 29
Score = 20.4 bits (41), Expect(2) = 3.1
Identities = 6/25 (24%), Positives = 13/25 (52%)
Frame = -2
Query: 63 ELWRLLKAKDTLITQRSRSRWLKKG 87
E W + K++ + Q+ +W+ G
Sbjct: 171 EFWAFTRQKESSLFQKWHPKWIN*G 97
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.356 0.160 0.625
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 49,032,430
Number of Sequences: 63676
Number of extensions: 932636
Number of successful extensions: 12244
Number of sequences better than 10.0: 60
Number of HSP's better than 10.0 without gapping: 6383
Number of HSP's successfully gapped in prelim test: 495
Number of HSP's that attempted gapping in prelim test: 4731
Number of HSP's gapped (non-prelim): 7990
length of query: 797
length of database: 12,639,632
effective HSP length: 105
effective length of query: 692
effective length of database: 5,953,652
effective search space: 4119927184
effective search space used: 4119927184
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.7 bits)
S2: 63 (28.9 bits)
Medicago: description of AC147006.6