
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC123573.8 + phase: 0 /pseudo
(548 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC211561 weakly similar to UP|O22675 (O22675) Reverse transcript... 124 1e-28
CO982349 116 3e-26
AW757033 107 1e-23
BI972659 72 5e-13
AI496436 72 8e-13
TC212055 59 6e-09
BU762692 52 7e-07
CD417230 weakly similar to GP|2244915|emb| reverse transcriptase... 47 2e-05
TC233727 similar to UP|Q9FKH8 (Q9FKH8) Similarity to non-LTR ret... 40 2e-05
TC234877 39 0.006
BI315739 29 0.11
TC224760 similar to UP|O04692 (O04692) DNA-binding protein, part... 35 0.11
TC224567 similar to PIR|T51159|T51159 HMG protein [imported] - A... 35 0.11
TC224564 similar to PIR|T51159|T51159 HMG protein [imported] - A... 32 0.56
CF920688 27 1.6
TC214570 homologue to PIR|T05766|T05766 peptidylprolyl isomerase... 30 2.1
TC228105 similar to GB|AAP37692.1|30725340|BT008333 At4g29830 {A... 30 2.1
TC216002 UP|Q9FQD4 (Q9FQD4) Glutathione S-transferase GST 24 , ... 30 2.1
AW348558 homologue to SP|Q9SC88|GCP4_ Gamma-tubulin complex comp... 30 3.6
>TC211561 weakly similar to UP|O22675 (O22675) Reverse transcriptase
(Fragment) , partial (59%)
Length = 1077
Score = 124 bits (310), Expect = 1e-28
Identities = 66/157 (42%), Positives = 92/157 (58%)
Frame = +1
Query: 1 RGLRQGDPFSPFLFLLAAEGFHVLMESLSVNNFYTGYQIGINEPIVMSHLQFADDTLIVG 60
RGLRQGDP +PFLF + AEG + LM N Y Y +G N + +S LQ+ADDT+ G
Sbjct: 244 RGLRQGDPLTPFLFNIVAEGLNGLMRRAVEENLYKPYLVGANG-VPISILQYADDTIFFG 420
Query: 61 EKS*ANVTGMRAALLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFVY 120
E + NV ++ L FE +S L++NF+KS V W A L+C + + PF+Y
Sbjct: 421 EAAKENVEAIKVILRSFELVSNLRINFAKSCFGVFGVTDQWKQEAANYLHCSLLAFPFIY 600
Query: 121 LGMLIGGNVRCLSF*EPIIDRIKSRLSGWKSKHLSFG 157
LG+ IG N R + II + + +LS WK +H+SFG
Sbjct: 601 LGIPIGANSRRCHMWDSIIRKCERKLSKWK*RHISFG 711
>CO982349
Length = 795
Score = 116 bits (290), Expect = 3e-26
Identities = 64/158 (40%), Positives = 92/158 (57%), Gaps = 1/158 (0%)
Frame = +3
Query: 1 RGLRQGDPFSPFLFLLAAEGFHVLMESLSVNNFYTGYQIGIN-EPIVMSHLQFADDTLIV 59
RGLRQGDP +P LF + AEG LM + + +G N EP+ S LQ+ADDT+
Sbjct: 147 RGLRQGDPLAPLLFNIVAEGLTGLMREAVDRKRFNSFLVGKNKEPV--SILQYADDTIFF 320
Query: 60 GEKS*ANVTGMRAALLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFV 119
E + N+ ++ L FE SGLK+NF++S ++ W A LNC + S+PF
Sbjct: 321 EEATMENMKVIKTILRCFELASGLKINFAESRFGAIWKPDHWCKEAAEFLNCSMLSMPFS 500
Query: 120 YLGMLIGGNVRCLSF*EPIIDRIKSRLSGWKSKHLSFG 157
YLG+ I N RC *+PII + +++L+ WK +H+S G
Sbjct: 501 YLGIPIEANPRCREI*DPIIRKCETKLARWKQRHISLG 614
>AW757033
Length = 441
Score = 107 bits (267), Expect = 1e-23
Identities = 62/140 (44%), Positives = 82/140 (58%), Gaps = 1/140 (0%)
Frame = -2
Query: 1 RGLRQGDPFSPFLFLLAAEGFHVLMESLSVNNFYTGYQIG-INEPIVMSHLQFADDTLIV 59
RGLRQGDP +P LF +AAEG LM N Y + +G +P+ S LQ+ADDT+
Sbjct: 419 RGLRQGDPLAPLLFNIAAEGLTGLMREAVARNHYKSFAVGKYKKPV--SILQYADDTIFF 246
Query: 60 GEKS*ANVTGMRAALLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFV 119
GE + NV ++ L FE SGLK+NF+KS +++V W A +NC + SLPF
Sbjct: 245 GEATMENVRVIKTILRGFELASGLKINFAKSRFGAISVPD*WRKEAAEFMNCSLLSLPFS 66
Query: 120 YLGMLIGGNVRCLSF*EPII 139
YLG+ IG N R +PII
Sbjct: 65 YLGIPIGANPRRRETWDPII 6
>BI972659
Length = 453
Score = 72.4 bits (176), Expect = 5e-13
Identities = 39/99 (39%), Positives = 57/99 (57%)
Frame = +1
Query: 59 VGEKS*ANVTGMRAALLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPF 118
VGE S N+ +++ L FE +SGL++N++KS V W A++LNCR +PF
Sbjct: 1 VGEASWDNIIVLKSMLRGFEMVSGLRINYAKSQFGIVGFQPNWAHDAAQLLNCRQLDIPF 180
Query: 119 VYLGMLIGGNVRCLSF*EPIIDRIKSRLSGWKSKHLSFG 157
YLGM I EP+I++ K++LS W K+LS G
Sbjct: 181 HYLGMPIAVKASSRMVWEPLINKFKAKLSKWNQKYLSMG 297
>AI496436
Length = 414
Score = 71.6 bits (174), Expect = 8e-13
Identities = 36/84 (42%), Positives = 51/84 (59%)
Frame = +1
Query: 74 LLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFVYLGMLIGGNVRCLS 133
L FE SGLK+NF+KS ++ +W A LN V S+PFVYLG+ IG N R
Sbjct: 7 LRCFELSSGLKINFAKSQFGTICKSDIWRKEAADYLNSSVLSMPFVYLGIPIGANSRHSD 186
Query: 134 F*EPIIDRIKSRLSGWKSKHLSFG 157
EP++ + + +L+ WK K++SFG
Sbjct: 187 VWEPVVRKCERKLASWKQKYVSFG 258
>TC212055
Length = 776
Score = 58.9 bits (141), Expect = 6e-09
Identities = 35/101 (34%), Positives = 55/101 (53%), Gaps = 1/101 (0%)
Frame = +1
Query: 13 LFLLAAEGFHVLMESLSVNNFYTGYQIGINEPIVMSHLQFADDTLIVGEKS*ANVTGMRA 72
LF + AEG LM + Y+ +G N+ I ++ LQ+ADDT+ +GE + NV +++
Sbjct: 1 LFNIVAEGLTGLMREALDKSLYSSLMVGKNK-IPVNILQYADDTIFLGEATMQNVMTIKS 177
Query: 73 ALLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVA-RVLNCR 112
L +FE SGLK++F+KS ++ W NCR
Sbjct: 178 MLRVFELASGLKIHFAKSSFGAIGKPDHWQKEAGLNTFNCR 300
>BU762692
Length = 423
Score = 52.0 bits (123), Expect = 7e-07
Identities = 25/57 (43%), Positives = 35/57 (60%)
Frame = -3
Query: 74 LLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFVYLGMLIGGNVR 130
L FE +SGLK+NF+KS + + W + A LNC + ++PFVY G+ IG N R
Sbjct: 178 LRTFELVSGLKINFAKSCFGAFGMTDRWKTEAASCLNCSLLAIPFVYQGIPIGANPR 8
>CD417230 weakly similar to GP|2244915|emb| reverse transcriptase like
protein {Arabidopsis thaliana}, partial (7%)
Length = 688
Score = 47.4 bits (111), Expect = 2e-05
Identities = 29/89 (32%), Positives = 44/89 (48%)
Frame = -3
Query: 2 GLRQGDPFSPFLFLLAAEGFHVLMESLSVNNFYTGYQIGINEPIVMSHLQFADDTLIVGE 61
G+RQ DP +P+LF+L E L+ + Q+ +++SHL F DD ++ E
Sbjct: 599 GVRQRDPIAPYLFVLCLERLSHLINLVMNKKL*RSIQVN-RGGLLISHLAFVDDLILFVE 423
Query: 62 KS*ANVTGMRAALLLFETMSGLKVNFSKS 90
+ V + L LF SG KVN K+
Sbjct: 422 ANMDQVDVINLTLELFSDSSGAKVNVDKT 336
>TC233727 similar to UP|Q9FKH8 (Q9FKH8) Similarity to non-LTR retroelement
reverse transcriptase, partial (7%)
Length = 956
Score = 39.7 bits (91), Expect(2) = 2e-05
Identities = 28/111 (25%), Positives = 51/111 (45%)
Frame = -2
Query: 45 IVMSHLQFADDTLIVGEKS*ANVTGMRAALLLFETMSGLKVNFSKSLLVSVNVVGLWLSM 104
+ SH+ ADD +I + + NV + L++ G ++ KS + ++ G L +
Sbjct: 460 LTRSHVMHADDIMIFCKGTSRNVHNLINFFQLYKDSFGQVISKHKSKIFIGSIPGQRLRV 281
Query: 105 VARVLNCRVGSLPFVYLGMLIGGNVRCLSF*EPIIDRIKSRLSGWKSKHLS 155
++ +L +PF Y G+ I + D+IK +L+ WK LS
Sbjct: 280 ISDLLQYFHSDIPFNYFGVPIFKGKPTRRHLYCVADKIKCKLASWKGSILS 128
Score = 26.6 bits (57), Expect(2) = 2e-05
Identities = 10/14 (71%), Positives = 12/14 (85%)
Frame = -3
Query: 1 RGLRQGDPFSPFLF 14
RG+RQGDP SP L+
Sbjct: 585 RGVRQGDPLSPSLY 544
>TC234877
Length = 534
Score = 38.9 bits (89), Expect = 0.006
Identities = 21/49 (42%), Positives = 26/49 (52%)
Frame = +1
Query: 108 VLNCRVGSLPFVYLGMLIGGNVRCLSF*EPIIDRIKSRLSGWKSKHLSF 156
+L VGSLPF+YL + + PI DRIK+ L WK LSF
Sbjct: 13 ILGFSVGSLPFIYLSVPLFQEKPRPVHLRPIADRIKAELQSWKGNLLSF 159
>BI315739
Length = 442
Score = 29.3 bits (64), Expect(2) = 0.11
Identities = 16/42 (38%), Positives = 22/42 (52%)
Frame = +3
Query: 111 CRVGSLPFVYLGMLIGGNVRCLSF*EPIIDRIKSRLSGWKSK 152
C + LP L L+ G + F ++D I +RL GWKSK
Sbjct: 318 CLIEKLPGSLL**LLAGRTKKEDF-AHVVDMINNRLDGWKSK 440
Score = 24.3 bits (51), Expect(2) = 0.11
Identities = 16/47 (34%), Positives = 23/47 (48%)
Frame = +2
Query: 44 PIVMSHLQFADDTLIVGEKS*ANVTGMRAALLLFETMSGLKVNFSKS 90
P +SHL F D+ L+ + + + V +R L F GLK KS
Sbjct: 155 PWYISHLFFPDNCLLFVKATSSQVKLVRDVLQQFCLT*GLKAGLQKS 295
>TC224760 similar to UP|O04692 (O04692) DNA-binding protein, partial (73%)
Length = 654
Score = 34.7 bits (78), Expect = 0.11
Identities = 19/50 (38%), Positives = 25/50 (50%)
Frame = -2
Query: 153 HLSFGVV*FC*SMSCHLCLFMLFPFLRLHQVSFPLLTLYFFFFFFCRGVG 202
H FG+V*F + C CL L F + +SFP +L F + F G G
Sbjct: 368 HFRFGLV*FI--IICRACLLFLISFDKFFILSFPFCSLGFIWSFLSTGQG 225
>TC224567 similar to PIR|T51159|T51159 HMG protein [imported] - Arabidopsis
thaliana {Arabidopsis thaliana;} , partial (71%)
Length = 876
Score = 34.7 bits (78), Expect = 0.11
Identities = 19/50 (38%), Positives = 25/50 (50%)
Frame = -3
Query: 153 HLSFGVV*FC*SMSCHLCLFMLFPFLRLHQVSFPLLTLYFFFFFFCRGVG 202
H FG+V*F + C CL L F + +SFP +L F + F G G
Sbjct: 550 HFRFGLV*FI--IICRACLLFLISFDKFFILSFPFCSLGFIWSFLSTGQG 407
>TC224564 similar to PIR|T51159|T51159 HMG protein [imported] - Arabidopsis
thaliana {Arabidopsis thaliana;} , partial (82%)
Length = 833
Score = 32.3 bits (72), Expect = 0.56
Identities = 18/50 (36%), Positives = 25/50 (50%)
Frame = -3
Query: 153 HLSFGVV*FC*SMSCHLCLFMLFPFLRLHQVSFPLLTLYFFFFFFCRGVG 202
H FG+V*F + C CL + F + +SFP +L F + F G G
Sbjct: 507 HFRFGLV*FF--IICGACLLFIISFDKFFILSFPFRSLGFIWSFLSTGQG 364
>CF920688
Length = 521
Score = 26.6 bits (57), Expect(2) = 1.6
Identities = 10/19 (52%), Positives = 13/19 (67%)
Frame = -1
Query: 191 YFFFFFFCRGVGGEDHKKY 209
+FFFFFF +G +DH Y
Sbjct: 170 FFFFFFFFFXLGPKDHYHY 114
Score = 22.7 bits (47), Expect(2) = 1.6
Identities = 11/26 (42%), Positives = 12/26 (45%)
Frame = -2
Query: 172 FMLFPFLRLHQVSFPLLTLYFFFFFF 197
F F FL F +FFFFFF
Sbjct: 232 FFFFFFLIFFFFFFFFFFFFFFFFFF 155
>TC214570 homologue to PIR|T05766|T05766 peptidylprolyl isomerase M4E13.20 -
Arabidopsis thaliana {Arabidopsis thaliana;} , partial
(13%)
Length = 944
Score = 30.4 bits (67), Expect = 2.1
Identities = 13/34 (38%), Positives = 18/34 (52%)
Frame = -3
Query: 164 SMSCHLCLFMLFPFLRLHQVSFPLLTLYFFFFFF 197
S + LC F F + + L +L+FFFFFF
Sbjct: 597 SSTLSLCFFFFFSVFSSSEEAASLFSLFFFFFFF 496
>TC228105 similar to GB|AAP37692.1|30725340|BT008333 At4g29830 {Arabidopsis
thaliana;} , complete
Length = 1113
Score = 30.4 bits (67), Expect = 2.1
Identities = 12/23 (52%), Positives = 16/23 (69%)
Frame = +2
Query: 357 GCLCGRRSWWGNFVYYFKM*ICR 379
G +CGR+ WG+ Y FK *IC+
Sbjct: 731 GTVCGRQPRWGSHCYRFK**ICK 799
>TC216002 UP|Q9FQD4 (Q9FQD4) Glutathione S-transferase GST 24 , complete
Length = 1062
Score = 30.4 bits (67), Expect = 2.1
Identities = 15/39 (38%), Positives = 24/39 (61%)
Frame = +2
Query: 156 FGVV*FC*SMSCHLCLFMLFPFLRLHQVSFPLLTLYFFF 194
+ V+ FC +CL +++ FL + ++FPL TLYF F
Sbjct: 845 WNVLNFCFLSFSVVCLCLIWIFLVVMLLTFPLFTLYFSF 961
>AW348558 homologue to SP|Q9SC88|GCP4_ Gamma-tubulin complex component 4
homolog. [Barrel medic] {Medicago truncatula}, partial
(8%)
Length = 579
Score = 29.6 bits (65), Expect = 3.6
Identities = 22/69 (31%), Positives = 31/69 (44%), Gaps = 9/69 (13%)
Frame = -2
Query: 151 SKHLSFGVV*------FC*SMSCHLCLFMLFPFLRLHQVSFPLLTL---YFFFFFFCRGV 201
SK L++G++ FC H C L +FPL L +FFFFFF
Sbjct: 299 SKFLTWGILAC*FQFSFCCXFIHHFCFXXXSSVLL*SVCTFPLFILILVWFFFFFFINVS 120
Query: 202 GGEDHKKYL 210
G+ K++L
Sbjct: 119 TGQF*KRFL 93
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.351 0.160 0.556
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 29,042,718
Number of Sequences: 63676
Number of extensions: 500125
Number of successful extensions: 12733
Number of sequences better than 10.0: 54
Number of HSP's better than 10.0 without gapping: 6075
Number of HSP's successfully gapped in prelim test: 224
Number of HSP's that attempted gapping in prelim test: 1520
Number of HSP's gapped (non-prelim): 8831
length of query: 548
length of database: 12,639,632
effective HSP length: 102
effective length of query: 446
effective length of database: 6,144,680
effective search space: 2740527280
effective search space used: 2740527280
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 61 (28.1 bits)
Medicago: description of AC123573.8