
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC126012.10 + phase: 0 /pseudo
(498 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC217953 similar to UP|Q9FHH9 (Q9FHH9) DNA-binding protein-like,... 183 6e-70
TC206177 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (G... 39 0.005
TC206176 weakly similar to UP|Q8LPA7 (Q8LPA7) Cold shock protein... 37 0.027
TC219583 weakly similar to UP|GRP2_NICSY (P27484) Glycine-rich p... 36 0.045
CF921229 35 0.078
TC216231 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (G... 34 0.13
TC217412 similar to UP|Q6L724 (Q6L724) ATP-dependent RNA helicas... 34 0.17
TC230908 similar to UP|RED_HUMAN (Q13123) Red protein (RER prote... 34 0.17
TC217260 33 0.29
TC217261 33 0.39
TC221535 weakly similar to GB|AAO39941.1|28372918|BT003713 At3g0... 32 0.50
TC229016 similar to UP|HES1_MOUSE (P35428) Transcription factor ... 32 0.66
TC218632 similar to UP|GRP1_PHAVU (P10495) Glycine-rich cell wal... 32 0.86
BI786951 homologue to GP|7688065|emb constitutively photomorphog... 32 0.86
NP004897 gag-protease polyprotein 31 1.1
TC206872 similar to UP|O04024 (O04024) F7G19.7 protein, partial ... 31 1.5
TC220878 weakly similar to UP|Q84TG3 (Q84TG3) At2g35930, partial... 30 1.9
TC204083 similar to UP|O48567 (O48567) Glycine-rich RNA-binding ... 30 1.9
BG238104 similar to GP|11994336|dbj cytochrome c oxidase subunit... 30 2.5
TC215033 similar to GB|AAL05900.1|15777879|AY055100 AT3g15640/MS... 30 2.5
>TC217953 similar to UP|Q9FHH9 (Q9FHH9) DNA-binding protein-like, partial
(6%)
Length = 1115
Score = 183 bits (464), Expect(3) = 6e-70
Identities = 99/132 (75%), Positives = 107/132 (81%)
Frame = +1
Query: 246 SGNSSTENAGSTYQVQDMLSSRCPQPKIPSPTSSATSKGEPKVSQVNEGMANIQEIVAER 305
SGNSS ENAGST+Q QDM S+RCPQPKIPSPTSSA SKGE KVS VNE NIQE +R
Sbjct: 1 SGNSSAENAGSTFQAQDMESARCPQPKIPSPTSSAASKGELKVSPVNEKTTNIQETTDDR 180
Query: 306 KEVSATQQVSEQVKIPRAAVVSEVTHESMSVKEPEPASQGSAKLVEEEVQQKLVPTDGGK 365
K VSA QQ SEQV+ PRAA +SE THESMSVK EPASQ SA+ VEEEVQQKLVPT+ GK
Sbjct: 181 KAVSAPQQTSEQVRNPRAADISEATHESMSVK--EPASQWSAQQVEEEVQQKLVPTEAGK 354
Query: 366 KKKKKKVRMPAN 377
KKKKKKVR+ N
Sbjct: 355 KKKKKKVRLLPN 390
Score = 72.8 bits (177), Expect(3) = 6e-70
Identities = 40/62 (64%), Positives = 44/62 (70%)
Frame = +3
Query: 380 L*FILEWHATLFGWIYGTICCSNDNDGLWSRPL*HAICRWSASRPIWHAWLRDACSSTS* 439
L*F+LEWHAT GWIYGT+ SN NDGL PL HAI W A+RPIWHA L DA TS*
Sbjct: 462 L*FVLEWHAT-HGWIYGTVW*SNANDGLRPGPLGHAIW-WYATRPIWHARLHDAWFPTS* 635
Query: 440 SN 441
+
Sbjct: 636 GS 641
Score = 48.5 bits (114), Expect(3) = 6e-70
Identities = 26/51 (50%), Positives = 35/51 (67%), Gaps = 1/51 (1%)
Frame = +1
Query: 449 RDFAEFNIGMNVPPPVMNMNREPVMNKNREDFEARNA-ILKKQENERRVER 498
RD A+F++GMN PPPVM +RE+FEAR A + +K+EN+RR ER
Sbjct: 634 RDLADFSMGMNAPPPVM----------SREEFEARKADMRRKRENDRRPER 756
>TC206177 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (GRP2)
(AT4g38680/F20M13_240), partial (22%)
Length = 406
Score = 38.9 bits (89), Expect = 0.005
Identities = 21/50 (42%), Positives = 25/50 (50%)
Frame = +1
Query: 65 GAGRGFRRGAVGGRIGGGRGFGLERKTPPEGYVCHRCKVSGHFIQHCPTN 114
G G G G GGR GGG G G G C+ C SGHF + CP++
Sbjct: 25 GGGGGRYGGGGGGRYGGGGGGG------GGGGSCYSCGESGHFARDCPSS 156
>TC206176 weakly similar to UP|Q8LPA7 (Q8LPA7) Cold shock protein-1, partial
(77%)
Length = 883
Score = 36.6 bits (83), Expect = 0.027
Identities = 23/57 (40%), Positives = 29/57 (50%), Gaps = 3/57 (5%)
Frame = +3
Query: 61 GSDFGAGRGFRR--GAVGGR-IGGGRGFGLERKTPPEGYVCHRCKVSGHFIQHCPTN 114
G +G G G R G GGR +GGG G G G C+ C SGHF + CP++
Sbjct: 492 GDRYGGGGGGGRYGGGGGGRYVGGGGGGG-------GGGSCYSCGESGHFARDCPSS 641
Score = 34.3 bits (77), Expect = 0.13
Identities = 25/92 (27%), Positives = 37/92 (40%), Gaps = 15/92 (16%)
Frame = +3
Query: 39 KADEDSKIKALI----DTPALDWQQQGSDFGAGRGFRRGAVGGRIGGGRGFG-------- 86
+++ D + KA+ D ++ ++G D G G RG GG GGG G+G
Sbjct: 213 ESESDGRAKAVDVTGPDGASVQGTRRGGDGGRSYGGGRGGGGGGYGGGGGYGGGGGYGGG 392
Query: 87 ---LERKTPPEGYVCHRCKVSGHFIQHCPTNG 115
G C+ C SGH + C G
Sbjct: 393 GGYGGGGRGGGGGACYNCGESGHLARDCSGGG 488
>TC219583 weakly similar to UP|GRP2_NICSY (P27484) Glycine-rich protein 2,
partial (45%)
Length = 995
Score = 35.8 bits (81), Expect = 0.045
Identities = 28/74 (37%), Positives = 35/74 (46%), Gaps = 4/74 (5%)
Frame = +2
Query: 42 EDSKIKALIDTPALDWQQQGSDFGAGRGFRRGAVGGRIGG----GRGFGLERKTPPEGYV 97
+D + A+ T A+ + G G G G RG GGR GG GRGFG R PE
Sbjct: 263 DDGRTMAVDVTSAVRSRLPGGFRGGGGG--RGRGGGRYGGGEGRGRGFG-RRGGGPE--- 424
Query: 98 CHRCKVSGHFIQHC 111
C+ C GH + C
Sbjct: 425 CYNCGRIGHLARDC 466
>CF921229
Length = 680
Score = 35.0 bits (79), Expect = 0.078
Identities = 24/86 (27%), Positives = 41/86 (46%), Gaps = 8/86 (9%)
Frame = -3
Query: 285 EPKVSQVNEGMANIQEIVAERKEVSATQQVSE--------QVKIPRAAVVSEVTHESMSV 336
E K +V E + IV E+KEV T++ E +V++ +EVT E V
Sbjct: 546 EKKEVEVTEEKKEAEVIVEEKKEVEVTEEKKEVEVTEGKKEVEVIEEKKETEVTEEKKEV 367
Query: 337 KEPEPASQGSAKLVEEEVQQKLVPTD 362
+ + +++ EEE Q++VP +
Sbjct: 366 EVEVREEKKESEVKEEEKGQEVVPEE 289
Score = 28.9 bits (63), Expect = 5.6
Identities = 25/93 (26%), Positives = 41/93 (43%), Gaps = 4/93 (4%)
Frame = -3
Query: 285 EPKVSQVNEGMANIQEIVAERKEVSATQQVSEQVKIPRAAVVSEVTHESMSVKEPEPASQ 344
E K ++V E E+ E+KEV T++ E I EVT E V+ E +
Sbjct: 600 EKKEAEVKEEKKE-GEVTEEKKEVEVTEEKKEAEVIVEEKKEVEVTEEKKEVEVTEGKKE 424
Query: 345 ----GSAKLVEEEVQQKLVPTDGGKKKKKKKVR 373
K E ++K V + ++KK+ +V+
Sbjct: 423 VEVIEEKKETEVTEEKKEVEVEVREEKKESEVK 325
>TC216231 similar to UP|Q41188 (Q41188) Glycine-rich protein 2 (GRP2)
(AT4g38680/F20M13_240), partial (42%)
Length = 891
Score = 34.3 bits (77), Expect = 0.13
Identities = 17/42 (40%), Positives = 21/42 (49%)
Frame = +3
Query: 73 GAVGGRIGGGRGFGLERKTPPEGYVCHRCKVSGHFIQHCPTN 114
G GGR GGG G G C+ C SGHF + CP++
Sbjct: 282 GGGGGRYGGGGGGGGS---------CYNCGESGHFARDCPSS 380
Score = 30.0 bits (66), Expect = 2.5
Identities = 20/64 (31%), Positives = 25/64 (38%), Gaps = 9/64 (14%)
Frame = +3
Query: 61 GSDFGAGRGFRRGAVGGRIGGGRGFGLERKTPPEGY---------VCHRCKVSGHFIQHC 111
G G G RG GG GGG G+G GY C++C +GH + C
Sbjct: 93 GGGRGGYGGGGRGGRGGYGGGGGGYGGGGYGGGGGYGGGGGGGGGGCYKCGETGHIARDC 272
Query: 112 PTNG 115
G
Sbjct: 273 SQGG 284
>TC217412 similar to UP|Q6L724 (Q6L724) ATP-dependent RNA helicase, partial
(3%)
Length = 868
Score = 33.9 bits (76), Expect = 0.17
Identities = 24/75 (32%), Positives = 27/75 (36%), Gaps = 17/75 (22%)
Frame = +3
Query: 55 LDWQQQGSDFG------AGRGFRRGAVGGR-----------IGGGRGFGLERKTPPEGYV 97
LD + G DFG GR F+ G R IGGGR + G
Sbjct: 132 LDVEDSGDDFGDQSSRRGGRNFKTGNSWSRAAGKSSGDDWLIGGGRRSSRPSSSDRFGGT 311
Query: 98 CHRCKVSGHFIQHCP 112
C C SGH CP
Sbjct: 312 CFNCGESGHRASDCP 356
>TC230908 similar to UP|RED_HUMAN (Q13123) Red protein (RER protein) (IK
factor) (Cytokine IK), partial (4%)
Length = 733
Score = 33.9 bits (76), Expect = 0.17
Identities = 31/103 (30%), Positives = 46/103 (44%), Gaps = 3/103 (2%)
Frame = +2
Query: 175 DLPPELHCPLCNNVMKDAVLTSKCCFKSFCDKCIRNYIMSKSACVCLATNI-LAD--DLL 231
D+P CP+ ++MKD V S ++ + I ++ SK C T + L D DL
Sbjct: 143 DVPSFFLCPISLDIMKDPVTVSTGI--TYDRESIETWLFSKKNTTCPITKLPLIDYTDLT 316
Query: 232 PNKTLRDAIHRILESGNSSTENAGSTYQVQDMLSSRCPQPKIP 274
PN TLR R++++ S + G R P PK P
Sbjct: 317 PNHTLR----RLIQAWCSMNASHG---------IERIPTPKPP 406
>TC217260
Length = 1114
Score = 33.1 bits (74), Expect = 0.29
Identities = 23/82 (28%), Positives = 38/82 (46%), Gaps = 6/82 (7%)
Frame = +3
Query: 169 STRSVGDLPPEL--HCPLCNNVMKDAVLTSKCCFKSFCDKCIRNYIMSKSACVCLATNIL 226
S + + P EL +CP+C M + T C FC CI+ I ++ C ++
Sbjct: 462 SVKKTPEPPKELVFNCPICMGPMVHEMSTR--CGHIFCKDCIKAAISAQGKCPTCRKKVV 635
Query: 227 ADDL----LPNKTLRDAIHRIL 244
A DL LP+ + R ++ +L
Sbjct: 636 AKDLIRVYLPSTS*RISMEAVL 701
>TC217261
Length = 1186
Score = 32.7 bits (73), Expect = 0.39
Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 6/72 (8%)
Frame = +3
Query: 166 GLSSTRSVGDLPPE------LHCPLCNNVMKDAVLTSKCCFKSFCDKCIRNYIMSKSACV 219
G SS PPE +CP+C M + T C FC CI+ I ++ C
Sbjct: 462 GSSSMEQSFKKPPEPPKELVFNCPICMGPMVHEMSTR--CGHIFCKDCIKAAISAQGKCP 635
Query: 220 CLATNILADDLL 231
++A DL+
Sbjct: 636 TCRKKVVAKDLI 671
>TC221535 weakly similar to GB|AAO39941.1|28372918|BT003713 At3g07200
{Arabidopsis thaliana;} , partial (32%)
Length = 658
Score = 32.3 bits (72), Expect = 0.50
Identities = 13/41 (31%), Positives = 23/41 (55%)
Frame = +2
Query: 178 PELHCPLCNNVMKDAVLTSKCCFKSFCDKCIRNYIMSKSAC 218
P +CP+C + +++ TS C FC CIR + +++ C
Sbjct: 167 PVFNCPICMSALEEE--TSTRCAHIFCKNCIRAALSAQAKC 283
>TC229016 similar to UP|HES1_MOUSE (P35428) Transcription factor HES-1 (Hairy
and enhancer of split 1), partial (6%)
Length = 769
Score = 32.0 bits (71), Expect = 0.66
Identities = 30/128 (23%), Positives = 50/128 (38%), Gaps = 12/128 (9%)
Frame = +1
Query: 129 GIPRSMLMVNPQ--------GSYALQNGSVAVLKPNEAAFDKEMEGLSSTRSVGDLPPEL 180
G+P++ ++N S A +N P E KE P
Sbjct: 274 GVPQNQTIINGDLYVNLANNSSSASENAKKTPEPPKEPEAPKE--------------PVF 411
Query: 181 HCPLCNNVMKDAVLTSKCCFKSFCDKCIRNYIMSKSAC-VC---LATNILADDLLPNKTL 236
+CP+C + + + + T C FC CIR I +++ C C + N L LP +
Sbjct: 412 NCPICMSPLVEEMSTR--CGHIFCKNCIRAAISAQAKCPTCRKKVTKNSLIRVFLPATS* 585
Query: 237 RDAIHRIL 244
R +I ++
Sbjct: 586 RSSIQSLI 609
>TC218632 similar to UP|GRP1_PHAVU (P10495) Glycine-rich cell wall
structural protein 1.0 precursor (GRP 1.0), partial
(27%)
Length = 447
Score = 31.6 bits (70), Expect = 0.86
Identities = 15/27 (55%), Positives = 18/27 (66%)
Frame = +3
Query: 60 QGSDFGAGRGFRRGAVGGRIGGGRGFG 86
+G+ GAG G+ GAVGG GGG G G
Sbjct: 51 EGAGQGAGGGYGGGAVGGGGGGGGGSG 131
>BI786951 homologue to GP|7688065|emb constitutively photomorphogenic 1
protein {Pisum sativum}, partial (13%)
Length = 421
Score = 31.6 bits (70), Expect = 0.86
Identities = 14/38 (36%), Positives = 19/38 (49%)
Frame = +2
Query: 199 CFKSFCDKCIRNYIMSKSACVCLATNILADDLLPNKTL 236
C SFC CI ++ +KS C C + +L PN L
Sbjct: 14 CGHSFCYMCIITHLRNKSDCPCCGDYLTNTNLFPNLLL 127
>NP004897 gag-protease polyprotein
Length = 1923
Score = 31.2 bits (69), Expect = 1.1
Identities = 15/46 (32%), Positives = 24/46 (51%)
Frame = +1
Query: 88 ERKTPPEGYVCHRCKVSGHFIQHCPTNGDSTFDIKKVRQPTGIPRS 133
E+ + +G+ CH C+ GH CPT+ +KK R+ + RS
Sbjct: 874 EKPSHSKGFQCHGCEGYGHIKAECPTH------LKKQRKGLSVCRS 993
>TC206872 similar to UP|O04024 (O04024) F7G19.7 protein, partial (13%)
Length = 1851
Score = 30.8 bits (68), Expect = 1.5
Identities = 14/32 (43%), Positives = 17/32 (52%)
Frame = -2
Query: 263 MLSSRCPQPKIPSPTSSATSKGEPKVSQVNEG 294
+ S RCP P + T+S TS G P S N G
Sbjct: 332 LTSGRCPDPVMLRTTASPTSSGPPNASIENSG 237
>TC220878 weakly similar to UP|Q84TG3 (Q84TG3) At2g35930, partial (22%)
Length = 923
Score = 30.4 bits (67), Expect = 1.9
Identities = 28/110 (25%), Positives = 46/110 (41%), Gaps = 4/110 (3%)
Frame = +3
Query: 175 DLPPELHCPLCNNVMKDAVLTSKCCFKSFCDKCIRNYIMSKSACVCLATN----ILADDL 230
++P CP+ +MKD V T ++ + I +++ C C T + L
Sbjct: 66 EIPQFFLCPISLQIMKDPVTTVTGI--TYDRESIEKWLLKAKDCTCPITKQPLPRSPEFL 239
Query: 231 LPNKTLRDAIHRILESGNSSTENAGSTYQVQDMLSSRCPQPKIPSPTSSA 280
PN TLR R++++ S+ E G + P PK P S+A
Sbjct: 240 TPNHTLR----RLIQAWCSANEANG---------VDQIPTPKSPLSNSNA 350
>TC204083 similar to UP|O48567 (O48567) Glycine-rich RNA-binding protein,
partial (95%)
Length = 1019
Score = 30.4 bits (67), Expect = 1.9
Identities = 15/31 (48%), Positives = 20/31 (64%), Gaps = 1/31 (3%)
Frame = +2
Query: 61 GSDFGAGRGFRRGAVG-GRIGGGRGFGLERK 90
G FG+G G+ RG+ G G GGG G+G R+
Sbjct: 320 GGGFGSGGGYNRGSGGYGGGGGGGGYGGRRE 412
>BG238104 similar to GP|11994336|dbj cytochrome c oxidase subunit Vb
precursor-like protein {Arabidopsis thaliana}, partial
(40%)
Length = 476
Score = 30.0 bits (66), Expect = 2.5
Identities = 14/39 (35%), Positives = 20/39 (50%)
Frame = -3
Query: 58 QQQGSDFGAGRGFRRGAVGGRIGGGRGFGLERKTPPEGY 96
++ G G G G GG GGG GF + R+ PP+ +
Sbjct: 177 RESGESGGGGSRSGSGGGGGGGGGG*GFEVRREKPPQHF 61
>TC215033 similar to GB|AAL05900.1|15777879|AY055100 AT3g15640/MSJ11_4
{Arabidopsis thaliana;} , partial (60%)
Length = 923
Score = 30.0 bits (66), Expect = 2.5
Identities = 14/39 (35%), Positives = 20/39 (50%)
Frame = -2
Query: 58 QQQGSDFGAGRGFRRGAVGGRIGGGRGFGLERKTPPEGY 96
++ G G G G GG GGG GF + R+ PP+ +
Sbjct: 292 RESGESGGGGSRSGSGGGGGGGGGG*GFEVRREKPPQHF 176
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.320 0.135 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,174,182
Number of Sequences: 63676
Number of extensions: 375592
Number of successful extensions: 3021
Number of sequences better than 10.0: 74
Number of HSP's better than 10.0 without gapping: 2826
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2965
length of query: 498
length of database: 12,639,632
effective HSP length: 101
effective length of query: 397
effective length of database: 6,208,356
effective search space: 2464717332
effective search space used: 2464717332
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)
Medicago: description of AC126012.10