
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC123573.8 + phase: 0 /pseudo
(548 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imp... 151 1e-36
BE248682 similar to GP|18568269|gb putative gag-pol polyprotein ... 110 1e-24
BG647708 weakly similar to GP|13786450|gb| putative reverse tran... 60 2e-09
TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non... 57 1e-08
BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-li... 54 1e-07
AW774658 similar to GP|2808681|emb| Hcr9-4B {Lycopersicon hirsut... 38 5e-06
CB891696 48 8e-06
TC82520 38 0.011
BG645385 similar to GP|5101676|emb| cytochrome B {Ceratitis capi... 35 0.097
TC83624 homologue to PIR|G84581|G84581 copia-like retroelement p... 34 0.13
TC91765 weakly similar to GP|19881779|gb|AAM01180.1 Putative ret... 32 0.63
BE187546 SP|Q02735|CAP Phosphoenolpyruvate carboxylase (EC 4.1.1... 31 1.1
TC77288 SP|Q02735|CAPP_MEDSA Phosphoenolpyruvate carboxylase (EC... 31 1.1
TC77286 SP|Q02735|CAPP_MEDSA Phosphoenolpyruvate carboxylase (EC... 31 1.1
TC80555 similar to GP|22137204|gb|AAM91447.1 At2g01320/F10A8.20 ... 31 1.1
BQ141748 homologue to SP|P21997|SSGP_ Sulfated surface glycoprot... 30 2.4
TC92335 similar to GP|14517422|gb|AAK62601.1 AT5g53570/MNC6_11 {... 29 5.3
TC78028 similar to GP|22136524|gb|AAM91340.1 unknown protein {Ar... 28 7.0
TC77981 similar to PIR|T12970|T12970 hypothetical protein T6H20.... 28 7.0
TC85442 similar to GP|13543783|gb|AAH06040.1 Unknown (protein fo... 28 7.0
>TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imported] -
Arabidopsis thaliana, partial (6%)
Length = 951
Score = 151 bits (382), Expect(2) = 1e-36
Identities = 77/136 (56%), Positives = 101/136 (73%)
Frame = +2
Query: 22 HVLMESLSVNNFYTGYQIGINEPIVMSHLQFADDTLIVGEKS*ANVTGMRAALLLFETMS 81
+VLM+SL +T Y G+ P+V+SHLQFA+DTL++ K+ AN+ +RAAL++F MS
Sbjct: 467 NVLMKSLVQTQLFTRYSFGVVNPVVVSHLQFANDTLLLETKNWANIRALRAALVIF*AMS 646
Query: 82 GLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFVYLGMLIGGNVRCLSF*EPIIDR 141
GLKVNF KS LV VN+ WLS A VL+ +VG +PF+YLGM I GN R LSF EPI++R
Sbjct: 647 GLKVNFHKSGLVCVNIAPSWLSEAASVLSWKVGKVPFLYLGMPIEGNSRRLSFWEPIVNR 826
Query: 142 IKSRLSGWKSKHLSFG 157
IK+RL+GW S+ LSFG
Sbjct: 827 IKARLTGWNSRFLSFG 874
Score = 20.0 bits (40), Expect(2) = 1e-36
Identities = 11/17 (64%), Positives = 11/17 (64%)
Frame = +3
Query: 159 V*FC*SMSCHLCLFMLF 175
V*FC*S LCL M F
Sbjct: 879 V*FC*SRF*PLCLSMRF 929
>BE248682 similar to GP|18568269|gb putative gag-pol polyprotein {Zea mays},
partial (1%)
Length = 441
Score = 110 bits (276), Expect = 1e-24
Identities = 59/123 (47%), Positives = 79/123 (63%)
Frame = +3
Query: 1 RGLRQGDPFSPFLFLLAAEGFHVLMESLSVNNFYTGYQIGINEPIVMSHLQFADDTLIVG 60
RGL+QGDP +PFLFLL AEG LM++ N + G+ + V SHLQ+ADDTL +G
Sbjct: 24 RGLKQGDPLAPFLFLLVAEGISGLMKNAVNRNLFQGFDVKRGGTRV-SHLQYADDTLCIG 200
Query: 61 EKS*ANVTGMRAALLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFVY 120
+ N+ ++A L FE SGLKVNF KS L+ +NV ++ R LNCR S+PF+Y
Sbjct: 201 MPTVDNLWTLKALLQGFEMASGLKVNFHKSSLIGINVPRDFMEAACRFLNCREESIPFIY 380
Query: 121 LGM 123
LG+
Sbjct: 381 LGL 389
>BG647708 weakly similar to GP|13786450|gb| putative reverse transcriptase
{Oryza sativa}, partial (9%)
Length = 708
Score = 60.1 bits (144), Expect = 2e-09
Identities = 38/116 (32%), Positives = 60/116 (50%), Gaps = 1/116 (0%)
Frame = +1
Query: 1 RGLRQGDPFSPFLFLLAAEGFHVLMESLSVNNFYTGYQIGINEPIVMSHLQFADDTLIVG 60
+GLRQGDP SP+LF+L A L++ G Q+ ++P + +HL FADD+L+
Sbjct: 10 KGLRQGDPLSPYLFILCANVLSGLLKREGNKQNLHGIQVARSDPKI-THLLFADDSLLFA 186
Query: 61 EKS*ANVTGMRAALLLFETMSGLKVNFSKS-LLVSVNVVGLWLSMVARVLNCRVGS 115
+ + L +++ SG VNF KS + S NV M+ + + + GS
Sbjct: 187 RANLTEAATIMQVLHSYQSASGQLVNFEKSEVSYSQNVPNQEKEMICQQIAIKTGS 354
>TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non-LTR
retroelement reverse transcriptase {Oryza sativa
(japonica cultivar-group)}, partial (2%)
Length = 1262
Score = 57.4 bits (137), Expect = 1e-08
Identities = 53/171 (30%), Positives = 77/171 (44%), Gaps = 1/171 (0%)
Frame = +3
Query: 345 GGRMVT-LGNGIGGCLCGRRSWWGNFVYYFKM*ICRLIRRTCGFGP*NRAMISLFRVLIS 403
GGR+ G+G CLCG+RS G+ V++ CR+ T G G * M++ + S
Sbjct: 399 GGRLTEGRGSGHAACLCGKRSV*GSVVFF*ITLFCRITSMTNGDGC*TL*MVTQ*K---S 569
Query: 404 S*LTIIIIIILTLRFLQRFFGIRISP*RLFCLRGGCYVIGCQLRTTCTVVVLYLQRIHYV 463
S T++ +I FGI I R L G + G +T + +LQ++ +V
Sbjct: 570 SIDTLLPQVIFQTGLWLMMFGINIFLQRFLYLCGASFATGFLQKTIWCIEGFFLQQMRHV 749
Query: 464 LAVVD**NLLLIYFYTVTFLERFGTLFIVGWVFVRFSQVCLRIISINFVLF 514
N L FY TF FG L +G VF CL ++S +LF
Sbjct: 750 YVGALTPNPLHTSFYIATFFVHFGLL*EIGLVF----HQCL-LVSFEHILF 887
>BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-like protein
{Arabidopsis thaliana}, partial (18%)
Length = 789
Score = 54.3 bits (129), Expect = 1e-07
Identities = 31/92 (33%), Positives = 50/92 (53%)
Frame = -3
Query: 1 RGLRQGDPFSPFLFLLAAEGFHVLMESLSVNNFYTGYQIGINEPIVMSHLQFADDTLIVG 60
RGLRQGDP SP+LF+L E L + G ++ N P + +HL FADDT+ G
Sbjct: 388 RGLRQGDPLSPYLFILCTEVLSGLCQQALRKGTLPGVKVARNCPPI-NHLLFADDTMFFG 212
Query: 61 EKS*ANVTGMRAALLLFETMSGLKVNFSKSLL 92
+ + ++ + + + + SG +N +KS +
Sbjct: 211 KSNASSCAILLSIMDKYRAASGRCIN*TKSAI 116
>AW774658 similar to GP|2808681|emb| Hcr9-4B {Lycopersicon hirsutum}, partial
(4%)
Length = 665
Score = 37.7 bits (86), Expect(2) = 5e-06
Identities = 18/28 (64%), Positives = 22/28 (78%)
Frame = -1
Query: 39 IGINEPIVMSHLQFADDTLIVGEKS*AN 66
IG++ V SHLQFADDTL++G KS AN
Sbjct: 410 IGMHSLTVFSHLQFADDTLLLGVKSWAN 327
Score = 30.4 bits (67), Expect(2) = 5e-06
Identities = 12/26 (46%), Positives = 20/26 (76%)
Frame = -3
Query: 70 MRAALLLFETMSGLKVNFSKSLLVSV 95
+R+ L++FE MSGLKVN + +++ V
Sbjct: 321 LRSILVIFENMSGLKVNLREEVIIYV 244
>CB891696
Length = 638
Score = 48.1 bits (113), Expect = 8e-06
Identities = 28/84 (33%), Positives = 47/84 (55%)
Frame = +1
Query: 66 NVTGMRAALLLFETMSGLKVNFSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFVYLGMLI 125
N+ M+ + FE S L VNF KS L+++NV+G + + C+V + F YLG+L+
Sbjct: 10 NILTMKTIVSYFELASSLWVNFLKSGLINLNVIGHF*GW*NIYIKCKVH*VIFKYLGILV 189
Query: 126 GGNVRCLSF*EPIIDRIKSRLSGW 149
G N ++ *E ++ + + L W
Sbjct: 190 GENPCRVNM*ELLLKLLTN*LGSW 261
>TC82520
Length = 833
Score = 37.7 bits (86), Expect = 0.011
Identities = 29/74 (39%), Positives = 36/74 (48%)
Frame = +3
Query: 475 IYFYTVTFLERFGTLFIVGWVFVRFSQVCLRIISINFVLFVEIALWCPNLFCI*FGLLLR 534
I FY V FL FG +F VG++F F + S N +L+ + LFC G L
Sbjct: 342 ICFYIVIFLALFGLMFCVGYIFCWFYLLT*DNFSFNLLLW-RVRQDLLILFCKSCGSPLF 518
Query: 535 GKFGRKEITGYSMI 548
G FGRK I S I
Sbjct: 519 GCFGRKGIIECSKI 560
>BG645385 similar to GP|5101676|emb| cytochrome B {Ceratitis capitata},
partial (3%)
Length = 787
Score = 34.7 bits (78), Expect = 0.097
Identities = 19/39 (48%), Positives = 24/39 (60%), Gaps = 1/39 (2%)
Frame = -2
Query: 176 PF-LRLHQVSFPLLTLYFFFFFFCRGVGGEDHKKYLGLI 213
PF L LH+VSF +L FFF FFCR +KY G++
Sbjct: 108 PFSLSLHKVSFLVLWCVFFFLFFCR-------RKYFGVL 13
>TC83624 homologue to PIR|G84581|G84581 copia-like retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(1%)
Length = 831
Score = 34.3 bits (77), Expect = 0.13
Identities = 16/57 (28%), Positives = 32/57 (56%)
Frame = +1
Query: 87 FSKSLLVSVNVVGLWLSMVARVLNCRVGSLPFVYLGMLIGGNVRCLSF*EPIIDRIK 143
FS+ +++N+ ++ L C V +PF +LG+ IG N + S +P++D ++
Sbjct: 256 FSRVNFMALNLEESFVEASPNFLLCNVNEVPFCFLGLPIGANPKRSSTRKPVLDSLQ 426
>TC91765 weakly similar to GP|19881779|gb|AAM01180.1 Putative retroelement
{Oryza sativa (japonica cultivar-group)}, partial (1%)
Length = 625
Score = 32.0 bits (71), Expect = 0.63
Identities = 12/25 (48%), Positives = 18/25 (72%)
Frame = +2
Query: 1 RGLRQGDPFSPFLFLLAAEGFHVLM 25
RGL+QGD SP++F++ EG L+
Sbjct: 170 RGLQQGDHLSPYIFIICVEGLSFLI 244
>BE187546 SP|Q02735|CAP Phosphoenolpyruvate carboxylase (EC 4.1.1.31)
(PEPCASE). [Alfalfa] {Medicago sativa}, partial (12%)
Length = 604
Score = 31.2 bits (69), Expect = 1.1
Identities = 21/57 (36%), Positives = 31/57 (53%), Gaps = 8/57 (14%)
Frame = -1
Query: 149 WKSKH-LSFGVV*FC*SMSCHLCLFMLFPFLRLHQVSFPLLT-------LYFFFFFF 197
W ++H L+FG + F +C +C FML LR H+ L+ L+FF+FFF
Sbjct: 220 WHAEHWLNFGYIFF----TC-ICFFMLKCVLRFHKY*MNLVANMHFK*VLFFFYFFF 65
>TC77288 SP|Q02735|CAPP_MEDSA Phosphoenolpyruvate carboxylase (EC 4.1.1.31)
(PEPCASE). [Alfalfa] {Medicago sativa}, partial (8%)
Length = 836
Score = 31.2 bits (69), Expect = 1.1
Identities = 21/57 (36%), Positives = 31/57 (53%), Gaps = 8/57 (14%)
Frame = +3
Query: 149 WKSKH-LSFGVV*FC*SMSCHLCLFMLFPFLRLHQVSFPLLT-------LYFFFFFF 197
W ++H L+FG + F +C +C FML LR H+ L+ L+FF+FFF
Sbjct: 567 WHAEHWLNFGYIFF----TC-ICFFMLKCVLRFHKY*MNLVANMHFK*VLFFFYFFF 722
>TC77286 SP|Q02735|CAPP_MEDSA Phosphoenolpyruvate carboxylase (EC 4.1.1.31)
(PEPCASE). [Alfalfa] {Medicago sativa}, partial (70%)
Length = 2283
Score = 31.2 bits (69), Expect = 1.1
Identities = 21/57 (36%), Positives = 31/57 (53%), Gaps = 8/57 (14%)
Frame = +1
Query: 149 WKSKH-LSFGVV*FC*SMSCHLCLFMLFPFLRLHQVSFPLLT-------LYFFFFFF 197
W ++H L+FG + F +C +C FML LR H+ L+ L+FF+FFF
Sbjct: 2035 WHAEHWLNFGYIFF----TC-ICFFMLKCVLRFHKY*MNLVANMHFK*VLFFFYFFF 2190
>TC80555 similar to GP|22137204|gb|AAM91447.1 At2g01320/F10A8.20
{Arabidopsis thaliana}, partial (17%)
Length = 1158
Score = 31.2 bits (69), Expect = 1.1
Identities = 14/31 (45%), Positives = 20/31 (64%)
Frame = -3
Query: 167 CHLCLFMLFPFLRLHQVSFPLLTLYFFFFFF 197
C + +LF FL++H FP ++FFFFFF
Sbjct: 841 CVISSHLLFCFLKMH---FPPFFVFFFFFFF 758
>BQ141748 homologue to SP|P21997|SSGP_ Sulfated surface glycoprotein 185 (SSG
185). {Volvox carteri}, partial (3%)
Length = 1293
Score = 30.0 bits (66), Expect = 2.4
Identities = 15/27 (55%), Positives = 17/27 (62%), Gaps = 2/27 (7%)
Frame = -1
Query: 174 LFPFLRLHQVSF--PLLTLYFFFFFFC 198
LF F+ +SF PL T FFFFFFC
Sbjct: 702 LFSFV*HGNISFIYPLFTTIFFFFFFC 622
>TC92335 similar to GP|14517422|gb|AAK62601.1 AT5g53570/MNC6_11 {Arabidopsis
thaliana}, partial (37%)
Length = 1077
Score = 28.9 bits (63), Expect = 5.3
Identities = 16/33 (48%), Positives = 20/33 (60%), Gaps = 5/33 (15%)
Frame = +1
Query: 170 CLFMLFPFLRLH-----QVSFPLLTLYFFFFFF 197
CLFM+F F + +VS L +YFFFFFF
Sbjct: 793 CLFMIFSFDSVVS*KGIEVS*VLYHVYFFFFFF 891
>TC78028 similar to GP|22136524|gb|AAM91340.1 unknown protein {Arabidopsis
thaliana}, partial (86%)
Length = 1217
Score = 28.5 bits (62), Expect = 7.0
Identities = 12/18 (66%), Positives = 15/18 (82%), Gaps = 1/18 (5%)
Frame = +2
Query: 185 FPL-LTLYFFFFFFCRGV 201
FPL L LY+F+FFFC G+
Sbjct: 977 FPLVLDLYYFYFFFCCGL 1030
>TC77981 similar to PIR|T12970|T12970 hypothetical protein T6H20.190 -
Arabidopsis thaliana, partial (39%)
Length = 1191
Score = 28.5 bits (62), Expect = 7.0
Identities = 15/41 (36%), Positives = 22/41 (53%), Gaps = 4/41 (9%)
Frame = -3
Query: 150 KSKHLSFGVV*FC*SMSC----HLCLFMLFPFLRLHQVSFP 186
K + L F ++ C S +C H CLF+ F FL+ + FP
Sbjct: 616 KLQCLFFQLLSCCSSFACLFSFHSCLFLCFGFLQYFSIGFP 494
>TC85442 similar to GP|13543783|gb|AAH06040.1 Unknown (protein for MGC:7642)
{Mus musculus}, partial (62%)
Length = 585
Score = 28.5 bits (62), Expect = 7.0
Identities = 15/29 (51%), Positives = 17/29 (57%)
Frame = -2
Query: 169 LCLFMLFPFLRLHQVSFPLLTLYFFFFFF 197
L F+LFPF L F LL +F FFFF
Sbjct: 176 LWFFLLFPFFLL----FFLLCFFFLFFFF 102
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.351 0.160 0.556
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 20,334,081
Number of Sequences: 36976
Number of extensions: 351436
Number of successful extensions: 6156
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 3953
Number of HSP's successfully gapped in prelim test: 162
Number of HSP's that attempted gapping in prelim test: 1081
Number of HSP's gapped (non-prelim): 4895
length of query: 548
length of database: 9,014,727
effective HSP length: 101
effective length of query: 447
effective length of database: 5,280,151
effective search space: 2360227497
effective search space used: 2360227497
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 61 (28.1 bits)
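The footer values above are enough to reproduce the effective search space and, for single-HSP hits, the reported bit scores and E-values. The sketch below uses the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, E = search space * 2^-bits) with the gapped Lambda and K from this report. The function and constant names are illustrative and are not part of BLAST. Hits reported with Expect(2) use sum statistics over several HSPs, so this single-HSP arithmetic is not expected to reproduce them.

```python
import math

# Values copied from the statistics footer of this report.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
QUERY_LEN = 548          # "length of query"
DB_LEN = 9_014_727       # "length of database" (equals 27,044,181 / 3)
N_SEQS = 36_976          # "Number of sequences in database"
HSP_LEN = 101            # "effective HSP length"


def bit_score(raw_score):
    """Convert a raw alignment score to bits using the gapped Lambda and K."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def effective_search_space():
    """Effective query length times effective database length."""
    eff_query = QUERY_LEN - HSP_LEN            # 447 in the footer
    eff_db = DB_LEN - N_SEQS * HSP_LEN         # 5,280,151 in the footer
    return eff_query * eff_db


def expect(raw_score):
    """Single-HSP E-value for a raw score against this database."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print("effective search space:", effective_search_space())
    # Raw scores taken from the BG647708 (144) and TC80819 (137) hits.
    for raw in (144, 137):
        print(raw, round(bit_score(raw), 1), f"{expect(raw):.0e}")
```

With the raw scores 144 and 137, the computed values can be compared with the "Score = 60.1 bits (144), Expect = 2e-09" and "Score = 57.4 bits (137), Expect = 1e-08" lines earlier in the report.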