
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146805.1 - phase: 0 /pseudo
(1445 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin... 231 2e-60
TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alterna... 214 2e-55
BF003873 similar to GP|14715222|em putative polyprotein {Cicer a... 199 6e-51
AL366725 178 1e-44
BG644740 similar to PIR|A84460|A84 probable retroelement pol pol... 89 2e-17
BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - ... 85 2e-16
TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Ci... 74 3e-13
BG452991 PIR|A25875|A25 histone H4 - Tetrahymena thermophila, pa... 58 3e-08
CB893783 weakly similar to GP|22830935|dbj hypothetical protein~... 34 0.36
>BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin cluster;
Ty3-Gypsy type {Oryza sativa}, partial (15%)
Length = 716
Score = 231 bits (589), Expect = 2e-60
Identities = 124/243 (51%), Positives = 161/243 (66%), Gaps = 5/243 (2%)
Frame = +2
Query: 489 DEIPDVPPEREVEFSINLVPGTKPVSMAPYRMSASELSELKKQLEDLLEKKFVRPSVSPW 548
D + VPPE +++F I+L+P P+ + YR++ +L LK QL+DLLEK F++PS+ P
Sbjct: 2 DHLL*VPPEWKIDFGIDLLPNMNPI*IPSYRINPLKLKVLKLQLKDLLEKGFIQPSIYP* 181
Query: 549 GAPVLLVKKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSGYH 608
G VL +KKKDG +R+ IDY QLN V IK +YPLP ID+L D L G++ F KIDLR G H
Sbjct: 182 GVVVLFLKKKDGFLRMSIDYPQLNNVNIKIKYPLPLIDELFDNLQGSKWFFKIDLRLG*H 361
Query: 609 QIKVKDEDMQKTALRTRYGHYEYKVMPFGVTNAPRVFMEYMNRIFHAFLDRFVVVFIDDI 668
Q +V ED+ KTA R RYGHYE VM FG TN P FME MNR+F +LD V+VF +DI
Sbjct: 362 QHRVIGEDVPKTAFRIRYGHYEILVMSFG*TNPPMAFMELMNRVFQDYLDSLVIVFSNDI 541
Query: 669 LIYSKTEEEHAEHLKIVLQVLKEKKLYAKLSKCEF----WLKEVSFLG-HVISGDGIAVD 723
LIYSK E EH HL++ L+VLK+ + C+ L EV F HVISG+G+ VD
Sbjct: 542 LIYSKNENEHENHLRLALKVLKD------IGLCQISYV*ILVEVGFFSLHVISGEGLKVD 703
Query: 724 PSK 726
+
Sbjct: 704 SKR 712
>TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alternaria
alternata}, partial (21%)
Length = 1540
Score = 214 bits (545), Expect = 2e-55
Identities = 112/264 (42%), Positives = 172/264 (64%), Gaps = 9/264 (3%)
Frame = +1
Query: 479 VVCDFPEVY-PDEIPDVPPEREV-EFSINLVP----GTKPVSMAP-YRMSASELSELKKQ 531
V+ +FP+++ P++ VP R + + +I L+P P+ P Y MS EL LKK
Sbjct: 343 VLEEFPDLFNPEKAYQVPASRGLLDHAIPLIPDKDGNDPPLPWGPLYGMSRQELLVLKKT 522
Query: 532 LEDLLEKKFVRPSVSPWGAPVLLVKKKDGSMRLCIDYRQLNKVTIKNRYPLPRIDDLMDQ 591
LEDLL+K F++ S S GAPVL V+K G +R C+DYR LN +T K+RYPLP I + + +
Sbjct: 523 LEDLLDKGFIKASGSAAGAPVLFVRKPGGGIRFCVDYRALNAITKKDRYPLPLISETLRR 702
Query: 592 LVGARVFSKIDLRSGYHQIKVKDEDMQKTALRTRYGHYEYKVMPFGVTNAPRVFMEYMNR 651
+ GAR F+K+D+ + +H++++KDED +KTA RTRYG +E+ V PFG+T AP F Y+N+
Sbjct: 703 VAGARWFTKLDVVAAFHKMRIKDEDQEKTAFRTRYGLFEWIVCPFGLTGAPATFQRYINK 882
Query: 652 IFHAFLDRFVVVFIDDILIYSK-TEEEHAEHLKIVLQVLKEKKLYAKLSKCEFWLKEVSF 710
H FLD FV +IDD+LIY+ ++++H ++ VL+ L + L KCEF + V +
Sbjct: 883 TLHEFLDDFVTAYIDDVLIYTTGSKKDHEAQVRRVLRRLADAGLSLDPKKCEFSVTTVKY 1062
Query: 711 LGHVI-SGDGIAVDPSKVEAVSQW 733
+G ++ +G G++ DP K+ A+ W
Sbjct: 1063VGFILTAGKGVSCDPLKLAAIRDW 1134
>BF003873 similar to GP|14715222|em putative polyprotein {Cicer arietinum},
partial (82%)
Length = 559
Score = 199 bits (506), Expect = 6e-51
Identities = 104/131 (79%), Positives = 112/131 (85%)
Frame = +1
Query: 1315 WCRTCIEVKEVDPEVYWSVSDIRKSWNGGLSSGFTTASFKFARRFPCVATSEVCSGSISC 1374
W TC EVKEVD E++WSVSDIRKSWNGG+SSG TTASF+FA CVATSEVCSGSISC
Sbjct: 1 WSWTCFEVKEVDCEIHWSVSDIRKSWNGGVSSGITTASFEFA*CLSCVATSEVCSGSISC 180
Query: 1375 DPE*RCAS*RQPYGRDFTGED**S*SEDVERQRDTSRESRLVGSDW*KLDVGAWE*DAGV 1434
DPE* CAS RQPYGRDFTGED**S SED+ERQ DTS ESRL S+W*KLDVGA E*D GV
Sbjct: 181 DPE**CASQRQPYGRDFTGED**SKSEDIERQGDTSCESRLGQSEW*KLDVGA*E*DGGV 360
Query: 1435 LSRIVFLR*IF 1445
LSR+V +R*IF
Sbjct: 361 LSRVVCMR*IF 393
>AL366725
Length = 485
Score = 178 bits (452), Expect = 1e-44
Identities = 106/161 (65%), Positives = 121/161 (74%), Gaps = 1/161 (0%)
Frame = +1
Query: 143 EVLSALCCRDY*VFEMHQVRERFEARH*EGDWIPTD*SFSRFGQ*LQDL*RGY*GSL*DC 202
+VLSALCC D *V EMHQVRE FEARH EG+ IPT SF F + LQ+L GY*GS C
Sbjct: 1 KVLSALCC*DC*VLEMHQVREWFEARHQEGNRIPTAPSFP*FSEYLQNLRGGY*GS*QGC 180
Query: 203 EREEGQGTAESS*AVQCPC**RETEDGR**AA*EEGC-YRDCVFQLWCESPQKQCLS*RD 261
E E QG ES *A+ CPC** +TE+GR**A+*EEGC DC+FQLW E PQ+ CLS*RD
Sbjct: 181 E*AEDQGPIESP*AL*CPC**GQTENGR**AS*EEGCSCGDCLFQLWRERPQE*CLS*RD 360
Query: 262 QEVCPVWQEGSYCS*LQA*GHCVF*LQ*RGTY*FTVYSA*E 302
QE+CPV QEGSYCS*LQA +CV LQ RG+Y F + +A*E
Sbjct: 361 QEMCPV*QEGSYCS*LQAK*YCVLQLQRRGSYWFPMQAA*E 483
>BG644740 similar to PIR|A84460|A84 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 754
Score = 88.6 bits (218), Expect = 2e-17
Identities = 46/93 (49%), Positives = 63/93 (67%), Gaps = 4/93 (4%)
Frame = -1
Query: 518 YRMSASELSELKKQLEDLL----EKKFVRPSVSPWGAPVLLVKKKDGSMRLCIDYRQLNK 573
++ S ++ S ++ +DL K+F +PS+SP GA +L V+KKDG R+CIDYRQ NK
Sbjct: 346 FKPSLADGSNRVEKTKDLT*RFARKRFQQPSISP*GAALLFVRKKDGYFRMCIDYRQFNK 167
Query: 574 VTIKNRYPLPRIDDLMDQLVGARVFSKIDLRSG 606
VT KN+YPLPRID+L D++ F IDLR G
Sbjct: 166 VTTKNKYPLPRIDNLFDKIQEDCYF*NIDLRLG 68
Score = 47.8 bits (112), Expect = 3e-05
Identities = 30/73 (41%), Positives = 43/73 (58%), Gaps = 3/73 (4%)
Frame = -2
Query: 471 QAVIDKLQVVC---DFPEVYPDEIPDVPPEREVEFSINLVPGTKPVSMAPYRMSASELSE 527
++VI +VV F V+PD P +P ERE+ F I+L+ T+ +S P M +EL +
Sbjct: 483 ESVIPLFEVVLVLKGFS*VFPDNFPVIPLEREIFFCIDLLLDTQLISNPP*LMDRTELKK 304
Query: 528 LKKQLEDLLEKKF 540
LK L+D LEK F
Sbjct: 303 LKI*LKDSLEKGF 265
>BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - Arabidopsis
thaliana, partial (13%)
Length = 763
Score = 84.7 bits (208), Expect = 2e-16
Identities = 43/116 (37%), Positives = 67/116 (57%)
Frame = +2
Query: 615 EDMQKTALRTRYGHYEYKVMPFGVTNAPRVFMEYMNRIFHAFLDRFVVVFIDDILIYSKT 674
+D++KTA T G Y YKVMPFG+ NA + +NR+F L + V+IDD+L+ S
Sbjct: 11 DDLEKTAFITDRGTYCYKVMPFGLKNAGSTYQRLVNRMFADKLGNTMEVYIDDMLVKSLR 190
Query: 675 EEEHAEHLKIVLQVLKEKKLYAKLSKCEFWLKEVSFLGHVISGDGIAVDPSKVEAV 730
+H HLK + L E + +KC F + FLG++++ GI V+P ++ A+
Sbjct: 191 ATDHLNHLKE*FKTLDEYIMKLNPAKCTFGVTSGEFLGYIVTQQGIEVNPKQITAI 358
>TC93153 similar to GP|14715220|emb|CAC44106. gag polyprotein {Cicer
arietinum}, partial (8%)
Length = 516
Score = 74.3 bits (181), Expect = 3e-13
Identities = 59/166 (35%), Positives = 82/166 (48%), Gaps = 9/166 (5%)
Frame = +1
Query: 3 CCP---------NCGTTT*RECWC*C*DEDVGDFHQEEPSNFQRKI*P*WSPDVA*RD*E 53
CCP +C T T* W * ++D DF +E +N Q + *W +VA D*+
Sbjct: 19 CCPCCCA*GGSSSCSTAT*G*YW**W-NQDARDFLEESSTNIQG*VCS*WCLEVAEGD*K 195
Query: 54 DFSCYAVH*RSESAVWYASAGRGS**LVG*SIT*PRA*RSCCYLGCVQERVPEKILFGRC 113
+ +A+ * +E AVW A *LV * + A C LG VQE V ++ C
Sbjct: 196 NIPSHAMF*DTEGAVWDAHVS*RGR*LVD*FVACSGAG*CCGNLGHVQEGVSGQVFSRGC 375
Query: 114 SREERDRVP*VEARKYVCD*VCCQVC*VGEVLSALCCRDY*VFEMH 159
ERD + VE +VC VCC+VC +L +L C D *V +++
Sbjct: 376 QG*ERD*ISGVETG*HVCHRVCCKVCGTCHILPSLQCGDS*VLQVY 513
>BG452991 PIR|A25875|A25 histone H4 - Tetrahymena thermophila, partial (33%)
Length = 560
Score = 58.2 bits (139), Expect(2) = 3e-08
Identities = 44/94 (46%), Positives = 54/94 (56%)
Frame = +1
Query: 939 LNSSEI*VWFANGHLRV*NWEC*RLIVNF*KVSRKHRKLI*SLWTCWLLEIRLKTLILKS 998
LNSSE *VWF RV*NW C*R NF VS++ R+ +*S L IRLK +ILK
Sbjct: 70 LNSSET*VWFVKCRHRV*NWGC*RSTTNFWIVSKRLRRWM*SW*I*CLGIIRLKMVILKL 249
Query: 999 MIKVC*DSEEEFVF*T*RD*EDDS*RESQE*LEY 1032
MIK C E+ +DDS R+S +* E+
Sbjct: 250 MIKECCSFGIEYE-------KDDSRRKS*K*CEF 330
Score = 19.2 bits (38), Expect(2) = 3e-08
Identities = 9/12 (75%), Positives = 9/12 (75%)
Frame = +3
Query: 928 CPL*WLESSSYL 939
C L*WLES S L
Sbjct: 36 CLL*WLESWSCL 71
>CB893783 weakly similar to GP|22830935|dbj hypothetical protein~similar to
gag-pol polyprotein {Oryza sativa (japonica
cultivar-group)}, partial (8%)
Length = 853
Score = 34.3 bits (77), Expect = 0.36
Identities = 35/150 (23%), Positives = 69/150 (45%), Gaps = 25/150 (16%)
Frame = +2
Query: 385 RCPLSMF--GRDFEMDLVC--LPLSGMDVILGMNWLEYKHV----HINCFSKSVYF---- 432
+C L F G+ ++ ++ C + + ++LG W +H H N ++ Y
Sbjct: 401 KCCLVSFSIGQKYKDNVWCDVISMDACHMLLGRPWQYDRHALYDGHANTYTFVKYGVKIK 580
Query: 433 -----SSAEEECGAEF-----LSTKQLKQMERDGILMFSLMASLSIENQAVIDKL--QVV 480
+A +E +F L +K+ ++ I SL+ + ++ I K ++
Sbjct: 581 LVPLPPNAFDEGKKDFKPIVSLVSKEPFKVTTKDIQDMSLILLVKSNEESTIQKEVEHLL 760
Query: 481 CDFPEVYPDEIPD-VPPEREVEFSINLVPG 509
DF +V P EIP +PP R+++ +I+ +PG
Sbjct: 761 VDFTDVVPSEIPSGLPPMRDIQHAIDFIPG 850
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.360 0.161 0.597
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 45,600,889
Number of Sequences: 36976
Number of extensions: 658123
Number of successful extensions: 6050
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 1724
Number of HSP's successfully gapped in prelim test: 248
Number of HSP's that attempted gapping in prelim test: 4237
Number of HSP's gapped (non-prelim): 2182
length of query: 1445
length of database: 9,014,727
effective HSP length: 108
effective length of query: 1337
effective length of database: 5,021,319
effective search space: 6713503503
effective search space used: 6713503503
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.8 bits)
S2: 65 (29.6 bits)
Medicago: description of AC146805.1