
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC140721.5 - phase: 0
(814 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Sequences producing significant alignments: Score (bits) E Value
TC80615 homologue to GP|10041548|emb|CAC07606. unnamed protein p... 847 0.0
TC83693 homologue to GP|10041548|emb|CAC07606. unnamed protein p... 309 2e-84
AL371498 similar to GP|10041548|em unnamed protein product {Arab... 285 4e-77
TC81833 similar to GP|11034633|dbj|BAB17157. hypothetical protei... 249 4e-66
BQ141706 similar to PIR|E96636|E966 hypothetical protein T7P1.21... 33 0.34
TC76557 similar to PIR|S71779|S71779 glycine-rich RNA-binding pr... 33 0.57
TC76561 similar to PIR|S71779|S71779 glycine-rich RNA-binding pr... 33 0.57
TC79934 similar to GP|20259633|gb|AAM14173.1 putative GAR1 prote... 33 0.57
TC76556 similar to PIR|S71779|S71779 glycine-rich RNA-binding pr... 33 0.57
TC79682 weakly similar to GP|6523547|emb|CAB62280.1 hydroxyproli... 32 0.75
BQ141932 weakly similar to GP|6523547|emb hydroxyproline-rich gl... 32 0.75
AW697002 similar to GP|10728183|gb CG11122 gene product {Drosoph... 31 1.7
BF646181 similar to GP|17104519|gb unknown protein {Arabidopsis ... 31 1.7
TC90933 similar to SP|P27484|GRP2_NICSY Glycine-rich protein 2. ... 31 1.7
BE997786 weakly similar to SP|P80002|PRT2_S Spermatid-specific p... 31 2.2
TC76707 similar to SP|Q07202|CORA_MEDSA Cold and drought-regulat... 31 2.2
TC86951 homologue to GP|2745902|gb|AAB94773.1| ERS-like ethylene... 31 2.2
TC83200 weakly similar to GP|21554083|gb|AAM63164.1 unknown {Ara... 31 2.2
TC88261 similar to PIR|S22697|S22697 extensin - Volvox carteri (... 31 2.2
TC76704 homologue to SP|Q09134|GRPA_MEDFA Abscisic acid and envi... 31 2.2
>TC80615 homologue to GP|10041548|emb|CAC07606. unnamed protein product
{Arabidopsis thaliana}, partial (48%)
Length = 1233
Score = 847 bits (2187), Expect = 0.0
Identities = 410/410 (100%), Positives = 410/410 (100%)
Frame = +3
Query: 237 SPDEIESYVILQECLEMRKRYIFKEAVAPWEKEVISDPSTPKPNLEPFFYAPEGKSDHYF 296
SPDEIESYVILQECLEMRKRYIFKEAVAPWEKEVISDPSTPKPNLEPFFYAPEGKSDHYF
Sbjct: 3 SPDEIESYVILQECLEMRKRYIFKEAVAPWEKEVISDPSTPKPNLEPFFYAPEGKSDHYF 182
Query: 297 EMQDGVIHVYPNKNSNEELFPVADATTFFTDLHQILRVIAAGNIRTLCHHRLNLLEQKFN 356
EMQDGVIHVYPNKNSNEELFPVADATTFFTDLHQILRVIAAGNIRTLCHHRLNLLEQKFN
Sbjct: 183 EMQDGVIHVYPNKNSNEELFPVADATTFFTDLHQILRVIAAGNIRTLCHHRLNLLEQKFN 362
Query: 357 LHLMLNADREFLAQKSAPHRDFYNVRKVDTHVHHSACMNQKHLLRFIKSKLRKEPDEVVI 416
LHLMLNADREFLAQKSAPHRDFYNVRKVDTHVHHSACMNQKHLLRFIKSKLRKEPDEVVI
Sbjct: 363 LHLMLNADREFLAQKSAPHRDFYNVRKVDTHVHHSACMNQKHLLRFIKSKLRKEPDEVVI 542
Query: 417 FRDGTYLTLREVFESLDLTGYDLNVDLLDVHADKSTFHRFDKFNLKYNPCGQSRLREIFL 476
FRDGTYLTLREVFESLDLTGYDLNVDLLDVHADKSTFHRFDKFNLKYNPCGQSRLREIFL
Sbjct: 543 FRDGTYLTLREVFESLDLTGYDLNVDLLDVHADKSTFHRFDKFNLKYNPCGQSRLREIFL 722
Query: 477 KQDNLIQGRFLGELTKQVFSDLEASKYQMAEYRISIYGRKQSEWDQLASWIVNNDLYSEN 536
KQDNLIQGRFLGELTKQVFSDLEASKYQMAEYRISIYGRKQSEWDQLASWIVNNDLYSEN
Sbjct: 723 KQDNLIQGRFLGELTKQVFSDLEASKYQMAEYRISIYGRKQSEWDQLASWIVNNDLYSEN 902
Query: 537 VVWLIQLPRLYNIYKDMGIVTSFQNMLDNIFIPLFEVTVDPDSHPQLHVFLKQVVGLDLV 596
VVWLIQLPRLYNIYKDMGIVTSFQNMLDNIFIPLFEVTVDPDSHPQLHVFLKQVVGLDLV
Sbjct: 903 VVWLIQLPRLYNIYKDMGIVTSFQNMLDNIFIPLFEVTVDPDSHPQLHVFLKQVVGLDLV 1082
Query: 597 DDESKPERRPTKHMPTPAQWTNVFNPAFSYYVYYCYANLYTLNKLRESKG 646
DDESKPERRPTKHMPTPAQWTNVFNPAFSYYVYYCYANLYTLNKLRESKG
Sbjct: 1083 DDESKPERRPTKHMPTPAQWTNVFNPAFSYYVYYCYANLYTLNKLRESKG 1232
>TC83693 homologue to GP|10041548|emb|CAC07606. unnamed protein product
{Arabidopsis thaliana}, partial (19%)
Length = 997
Score = 309 bits (792), Expect = 2e-84
Identities = 149/150 (99%), Positives = 150/150 (99%)
Frame = +1
Query: 665 LAATFLTAHNIAHGINLRKSPVLQYLYYLAQIGLAMSPLSNNSLFLDYHRNPLPVFFLRG 724
LAATFLTAHNIAHGINLRKSPVLQYLYYLAQIGLAMSPLSNNSLFLDYHRNPLPVFFLRG
Sbjct: 1 LAATFLTAHNIAHGINLRKSPVLQYLYYLAQIGLAMSPLSNNSLFLDYHRNPLPVFFLRG 180
Query: 725 LNVSLSTDDPLQIHLTKEPLVEEYSIAASVWKLSSCDLCEIARNSVYQSGFSHALKSHWI 784
LNVSLSTDDPLQIHLTKEPLVEEYSIAASVWKLSSCDLCEIARNSVYQSGFSHALKSHWI
Sbjct: 181 LNVSLSTDDPLQIHLTKEPLVEEYSIAASVWKLSSCDLCEIARNSVYQSGFSHALKSHWI 360
Query: 785 GKEYYKRGPNGNDIHRTNVPHIRLEFRDTV 814
GKEYYKRGPNGNDIHRTNVPHIRLEFRDT+
Sbjct: 361 GKEYYKRGPNGNDIHRTNVPHIRLEFRDTI 450
>AL371498 similar to GP|10041548|em unnamed protein product {Arabidopsis
thaliana}, partial (18%)
Length = 458
Score = 285 bits (730), Expect = 4e-77
Identities = 135/152 (88%), Positives = 143/152 (93%)
Frame = +2
Query: 438 DLNVDLLDVHADKSTFHRFDKFNLKYNPCGQSRLREIFLKQDNLIQGRFLGELTKQVFSD 497
DLNVDLLDVHADKSTFHRFDKFNLKYNPCGQSRLREIFLKQDNLIQGRFL E+TKQV D
Sbjct: 2 DLNVDLLDVHADKSTFHRFDKFNLKYNPCGQSRLREIFLKQDNLIQGRFLAEVTKQVLLD 181
Query: 498 LEASKYQMAEYRISIYGRKQSEWDQLASWIVNNDLYSENVVWLIQLPRLYNIYKDMGIVT 557
LEASKYQMAEYRIS+YGRKQSEWDQLASW VNN LYS+N VWLIQLPRLYNIY+ MGIVT
Sbjct: 182 LEASKYQMAEYRISVYGRKQSEWDQLASWFVNNALYSKNAVWLIQLPRLYNIYRSMGIVT 361
Query: 558 SFQNMLDNIFIPLFEVTVDPDSHPQLHVFLKQ 589
SFQN+LDN+FIPLFE TVDP+SHPQLH+FL Q
Sbjct: 362 SFQNILDNVFIPLFEATVDPNSHPQLHLFLNQ 457
>TC81833 similar to GP|11034633|dbj|BAB17157. hypothetical protein {Oryza
sativa (japonica cultivar-group)}, partial (16%)
Length = 509
Score = 249 bits (635), Expect = 4e-66
Identities = 123/123 (100%), Positives = 123/123 (100%)
Frame = +2
Query: 1 MDAHAVHLAMAALFGASIVAVSAYYMHRKTLTELLEFARTVEPEGDSDGGERRRGGSKRR 60
MDAHAVHLAMAALFGASIVAVSAYYMHRKTLTELLEFARTVEPEGDSDGGERRRGGSKRR
Sbjct: 140 MDAHAVHLAMAALFGASIVAVSAYYMHRKTLTELLEFARTVEPEGDSDGGERRRGGSKRR 319
Query: 61 NGGGGGYRRGSGSLPDVTAIAGGVEGNGLMHDEGIPVGLPRLQTLREGKSANNGSFKRNI 120
NGGGGGYRRGSGSLPDVTAIAGGVEGNGLMHDEGIPVGLPRLQTLREGKSANNGSFKRNI
Sbjct: 320 NGGGGGYRRGSGSLPDVTAIAGGVEGNGLMHDEGIPVGLPRLQTLREGKSANNGSFKRNI 499
Query: 121 IRP 123
IRP
Sbjct: 500 IRP 508
>BQ141706 similar to PIR|E96636|E966 hypothetical protein T7P1.21
[imported] - Arabidopsis thaliana, partial (1%)
Length = 1112
Score = 33.5 bits (75), Expect = 0.34
Identities = 13/18 (72%), Positives = 15/18 (83%)
Frame = -3
Query: 49 GGERRRGGSKRRNGGGGG 66
G ER+RGG +RR GGGGG
Sbjct: 54 GEERKRGGERRRGGGGGG 1
>TC76557 similar to PIR|S71779|S71779 glycine-rich RNA-binding protein GRP1
- wheat, partial (95%)
Length = 912
Score = 32.7 bits (73), Expect = 0.57
Identities = 23/62 (37%), Positives = 23/62 (37%), Gaps = 12/62 (19%)
Frame = +1
Query: 45 GDSDGGERRR------------GGSKRRNGGGGGYRRGSGSLPDVTAIAGGVEGNGLMHD 92
G GGERR GG R GGGGGY RG G GG G G
Sbjct: 553 GGGYGGERRGYGGGGGYGGGGGGGYGERRGGGGGYSRGGG--------GGGYGGGGYSRG 708
Query: 93 EG 94
G
Sbjct: 709 GG 714
Score = 30.0 bits (66), Expect = 3.7
Identities = 14/28 (50%), Positives = 15/28 (53%)
Frame = +1
Query: 45 GDSDGGERRRGGSKRRNGGGGGYRRGSG 72
G GG GG +R GGGGGY G G
Sbjct: 535 GGYGGGGGGYGGERRGYGGGGGYGGGGG 618
>TC76561 similar to PIR|S71779|S71779 glycine-rich RNA-binding protein GRP1
- wheat, partial (95%)
Length = 1054
Score = 32.7 bits (73), Expect = 0.57
Identities = 23/62 (37%), Positives = 23/62 (37%), Gaps = 12/62 (19%)
Frame = +1
Query: 45 GDSDGGERRR------------GGSKRRNGGGGGYRRGSGSLPDVTAIAGGVEGNGLMHD 92
G GGERR GG R GGGGGY RG G GG G G
Sbjct: 649 GGGYGGERRGYGGGGGYGGGGGGGYGERRGGGGGYSRGGG--------GGGYGGGGYSRG 804
Query: 93 EG 94
G
Sbjct: 805 GG 810
Score = 30.0 bits (66), Expect = 3.7
Identities = 14/28 (50%), Positives = 15/28 (53%)
Frame = +1
Query: 45 GDSDGGERRRGGSKRRNGGGGGYRRGSG 72
G GG GG +R GGGGGY G G
Sbjct: 631 GGYGGGGGGYGGERRGYGGGGGYGGGGG 714
>TC79934 similar to GP|20259633|gb|AAM14173.1 putative GAR1 protein
{Arabidopsis thaliana}, partial (87%)
Length = 814
Score = 32.7 bits (73), Expect = 0.57
Identities = 29/87 (33%), Positives = 37/87 (42%), Gaps = 8/87 (9%)
Frame = +1
Query: 10 MAALFGASIVAVSAYYMHRKTLTELLEFARTVEPEGDSDGGER--RRGGSKRRNG--GGG 65
M + S A +Y+ + L L F +P+G + GG R GG R G GGG
Sbjct: 397 MEGIVATSYAAGDKFYIDPRKLLPLARFLP--QPKGQASGGRGGGRGGGGFGRGGGRGGG 570
Query: 66 GYR-RG---SGSLPDVTAIAGGVEGNG 88
G+R RG G P GG G G
Sbjct: 571 GFRGRGPPRGGRGPPRGGRGGGFRGRG 651
>TC76556 similar to PIR|S71779|S71779 glycine-rich RNA-binding protein GRP1
- wheat, partial (95%)
Length = 754
Score = 32.7 bits (73), Expect = 0.57
Identities = 23/62 (37%), Positives = 23/62 (37%), Gaps = 12/62 (19%)
Frame = +2
Query: 45 GDSDGGERRR------------GGSKRRNGGGGGYRRGSGSLPDVTAIAGGVEGNGLMHD 92
G GGERR GG R GGGGGY RG G GG G G
Sbjct: 395 GGGYGGERRGYGGGGGYGGGGGGGYGERRGGGGGYSRGGG--------GGGYGGGGYSRG 550
Query: 93 EG 94
G
Sbjct: 551 GG 556
Score = 30.0 bits (66), Expect = 3.7
Identities = 14/28 (50%), Positives = 15/28 (53%)
Frame = +2
Query: 45 GDSDGGERRRGGSKRRNGGGGGYRRGSG 72
G GG GG +R GGGGGY G G
Sbjct: 377 GGYGGGGGGYGGERRGYGGGGGYGGGGG 460
>TC79682 weakly similar to GP|6523547|emb|CAB62280.1 hydroxyproline-rich
glycoprotein DZ-HRGP {Volvox carteri f. nagariensis},
partial (17%)
Length = 1031
Score = 32.3 bits (72), Expect = 0.75
Identities = 18/45 (40%), Positives = 23/45 (51%)
Frame = -1
Query: 45 GDSDGGERRRGGSKRRNGGGGGYRRGSGSLPDVTAIAGGVEGNGL 89
G + GG+ +G K GGGGG G G P +T G +G GL
Sbjct: 227 GGNSGGKGSKGAGKIDFGGGGG--GGDGVKPGITIGGKGNDGGGL 99
>BQ141932 weakly similar to GP|6523547|emb hydroxyproline-rich glycoprotein
DZ-HRGP {Volvox carteri f. nagariensis}, partial (30%)
Length = 1338
Score = 32.3 bits (72), Expect = 0.75
Identities = 24/62 (38%), Positives = 27/62 (42%), Gaps = 9/62 (14%)
Frame = +3
Query: 42 EPEGDSDGGERRRGGSKRRN--GGGGGYRRG-------SGSLPDVTAIAGGVEGNGLMHD 92
E G GGERR G + GGGGG RRG G TA A G G+ + D
Sbjct: 225 EERGRGVGGERRELGGRMEGEWGGGGGGRRGGESRNGYKGRRGSGTADAAGGRGSMTVGD 404
Query: 93 EG 94
G
Sbjct: 405 RG 410
>AW697002 similar to GP|10728183|gb CG11122 gene product {Drosophila
melanogaster}, partial (0%)
Length = 857
Score = 31.2 bits (69), Expect = 1.7
Identities = 13/22 (59%), Positives = 15/22 (68%)
Frame = +3
Query: 45 GDSDGGERRRGGSKRRNGGGGG 66
GD GGE GG++ R GGGGG
Sbjct: 348 GDRSGGEAGCGGTELREGGGGG 413
>BF646181 similar to GP|17104519|gb unknown protein {Arabidopsis thaliana},
partial (41%)
Length = 656
Score = 31.2 bits (69), Expect = 1.7
Identities = 16/41 (39%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Frame = -3
Query: 593 LDLVDDESKPERRPTK-HMPTPAQWTNVFNPAFSYYVYYCY 632
+DL D+ K R TK H P + + +F FS+Y YY Y
Sbjct: 258 VDL*RDKKKDHREDTKGHRPICHETSYIFQRVFSHYYYYYY 136
>TC90933 similar to SP|P27484|GRP2_NICSY Glycine-rich protein 2. [Wood
tobacco] {Nicotiana sylvestris}, partial (44%)
Length = 368
Score = 31.2 bits (69), Expect = 1.7
Identities = 16/30 (53%), Positives = 17/30 (56%)
Frame = +3
Query: 43 PEGDSDGGERRRGGSKRRNGGGGGYRRGSG 72
P+G S G RR G GGGGGY RG G
Sbjct: 270 PDGASVQGSRRGG------GGGGGYERGGG 341
>BE997786 weakly similar to SP|P80002|PRT2_S Spermatid-specific protein T2
[Contains: Sperm protamine SP2]. [Common cuttlefish],
partial (57%)
Length = 612
Score = 30.8 bits (68), Expect = 2.2
Identities = 13/27 (48%), Positives = 17/27 (62%)
Frame = -2
Query: 40 TVEPEGDSDGGERRRGGSKRRNGGGGG 66
+VE G++D GE GG +GGGGG
Sbjct: 260 SVEGNGENDVGEVEIGGESEESGGGGG 180
>TC76707 similar to SP|Q07202|CORA_MEDSA Cold and drought-regulated protein
CORA. [Alfalfa] {Medicago sativa}, partial (87%)
Length = 780
Score = 30.8 bits (68), Expect = 2.2
Identities = 20/56 (35%), Positives = 21/56 (36%), Gaps = 6/56 (10%)
Frame = +1
Query: 45 GDSDGGERRRGGSKRRNGGGG------GYRRGSGSLPDVTAIAGGVEGNGLMHDEG 94
G +GG GG NGGGG GY G G GG G G H G
Sbjct: 190 GGYNGGGYNHGGGGYNNGGGGYNHGGGGYNNGGGG---YNHGGGGYNGGGYNHGGG 348
>TC86951 homologue to GP|2745902|gb|AAB94773.1| ERS-like ethylene receptor
{Pisum sativum}, complete
Length = 2274
Score = 30.8 bits (68), Expect = 2.2
Identities = 15/36 (41%), Positives = 21/36 (57%)
Frame = +2
Query: 271 ISDPSTPKPNLEPFFYAPEGKSDHYFEMQDGVIHVY 306
++ P TP P+L PFF +S H + DGV+ VY
Sbjct: 83 LAPPITPPPDLPPFF-----RSFHVGVLDDGVLRVY 175
>TC83200 weakly similar to GP|21554083|gb|AAM63164.1 unknown {Arabidopsis
thaliana}, partial (71%)
Length = 565
Score = 30.8 bits (68), Expect = 2.2
Identities = 17/36 (47%), Positives = 18/36 (49%)
Frame = +1
Query: 48 DGGERRRGGSKRRNGGGGGYRRGSGSLPDVTAIAGG 83
DGGER G R GGG G R G G+ T GG
Sbjct: 331 DGGERT*EGGTRIGGGGEGDRDGGGAWEFGTGDCGG 438
>TC88261 similar to PIR|S22697|S22697 extensin - Volvox carteri (fragment),
partial (7%)
Length = 1516
Score = 30.8 bits (68), Expect = 2.2
Identities = 15/35 (42%), Positives = 18/35 (50%)
Frame = -2
Query: 45 GDSDGGERRRGGSKRRNGGGGGYRRGSGSLPDVTA 79
GD D G GG + + G GGG RG+G P A
Sbjct: 405 GDGDDGGGGGGGEEGKGGVGGGESRGNGMEPSGVA 301
>TC76704 homologue to SP|Q09134|GRPA_MEDFA Abscisic acid and environmental
stress inducible protein. [Sickle medic] {Medicago
falcata}, partial (81%)
Length = 771
Score = 30.8 bits (68), Expect = 2.2
Identities = 20/55 (36%), Positives = 21/55 (37%), Gaps = 5/55 (9%)
Frame = +2
Query: 45 GDSDGGERRRGGSKRRNGGG-----GGYRRGSGSLPDVTAIAGGVEGNGLMHDEG 94
G +GG GG NGGG GGY G G GG G G H G
Sbjct: 227 GGYNGGGYNHGGGGYNNGGGYNHGGGGYNNGGG----YNHGGGGYNGGGYNHGGG 379
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.319 0.137 0.408
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 24,992,713
Number of Sequences: 36976
Number of extensions: 371291
Number of successful extensions: 2748
Number of sequences better than 10.0: 57
Number of HSP's better than 10.0 without gapping: 2478
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2660
length of query: 814
length of database: 9,014,727
effective HSP length: 104
effective length of query: 710
effective length of database: 5,169,223
effective search space: 3670148330
effective search space used: 3670148330
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)
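
Note on the statistics above: the length and search-space figures can be reproduced from the other numbers in this report. The sketch below is one reading of the footer, not BLAST's own code. It assumes that the TBLASTN database length is the nucleotide letter count divided by 3, that the effective HSP length is subtracted once from the query and once per database sequence, and that bit scores follow the usual Karlin-Altschul conversion with the gapped Lambda and K as printed (three significant digits, so bit scores agree only approximately). The function names are illustrative.

```python
import math

# Values copied from the report above.
QUERY_LEN = 814             # length of query (amino acids)
DB_LETTERS_NT = 27_044_181  # total letters in MTGI (nucleotides)
NUM_SEQS = 36_976           # number of sequences in database
EFF_HSP_LEN = 104           # effective HSP length
GAPPED_LAMBDA = 0.267       # gapped Lambda, as printed
GAPPED_K = 0.0410           # gapped K, as printed


def effective_search_space(query_len, db_letters_nt, num_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the database length is counted in translated residues,
    i.e. nucleotide letters divided by 3.
    """
    db_len_aa = db_letters_nt // 3
    eff_query = query_len - hsp_len
    eff_db = db_len_aa - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS_NT, NUM_SEQS, EFF_HSP_LEN
    )
    print(eff_q, eff_db, space)
    # Raw score 2187 is the top hit (TC80615), reported as 847 bits.
    print(bit_score(2187, GAPPED_LAMBDA, GAPPED_K))
```

Under these assumptions the three values match the "effective length of query", "effective length of database" and "effective search space" lines exactly, and the top hit's raw score of 2187 converts to roughly 847 bits.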