
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC125481.2 + phase: 0 /pseudo
(1546 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC83528 95 2e-19
BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - ... 93 9e-19
TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alterna... 89 1e-17
AL367832 weakly similar to GP|14091845|gb Putative retroelement ... 84 4e-16
BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin... 79 1e-14
BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana... 70 6e-12
BG587127 weakly similar to PIR|H84506|H84 probable retroelement ... 61 3e-09
CB065285 weakly similar to PIR|A84500|A84 probable retroelement ... 50 9e-06
BG644740 similar to PIR|A84460|A84 probable retroelement pol pol... 45 2e-04
BG586308 weakly similar to PIR|F84528|F8 probable retroelement p... 42 0.002
BG644720 41 0.004
TC77595 weakly similar to PIR|T18350|T18350 probable pol polypro... 38 0.035
BG586289 weakly similar to PIR|H86337|H86 protein F5M15.26 [impo... 33 0.86
TC89211 similar to GP|3668093|gb|AAC61825.1| unknown protein {Ar... 32 1.9
BQ751067 similar to GP|18461169|dbj hypothetical protein~similar... 30 7.3
>TC83528
Length = 555
Score = 95.1 bits (235), Expect = 2e-19
Identities = 50/125 (40%), Positives = 75/125 (60%), Gaps = 4/125 (3%)
Frame = +3
Query: 359 PLVVSFQLLN-WEIKRVLVDTGSSANVLYYDAFSKMGLSEEQLQPFKGTLSGFTGDRVHV 417
P+V+ Q+ N + + RV V+ S ++LY+ AF KM L E L+P +G L G G + V
Sbjct: 180 PMVIKLQINNNFSVLRVFVNPMSKVDILYWSAFLKMKLQESMLKPCQGFLKGTFGKGLPV 359
Query: 418 RGYVTLKTTFGTRDQQKSIKIRYLVFNAPSS---YNAIIGRPSINLLDAFVSTKHLMMKY 474
+GY+ L TTFG + K+IK+RY V +P S YN ++G PS+ L A +S +KY
Sbjct: 360 KGYIDLDTTFGKGENTKTIKVRYFVVESPPSVSIYNVVLGWPSLKDLKAVLSVAEFTIKY 539
Query: 475 PLDNG 479
P+ +G
Sbjct: 540 PVGDG 554
>BG587145 similar to PIR|H86337|H8 protein F5M15.26 [imported] - Arabidopsis
thaliana, partial (13%)
Length = 763
Score = 92.8 bits (229), Expect = 9e-19
Identities = 43/95 (45%), Positives = 66/95 (69%)
Frame = +2
Query: 703 DAPKIVFMTNQKNYHYEVMSFGLRNAGATFQRSMDTIFSAQIGRNLEVYVDDLVVKTSAE 762
D K F+T++ Y Y+VM FGL+NAG+T+QR ++ +F+ ++G +EVY+DD++VK+
Sbjct: 14 DLEKTAFITDRGTYCYKVMPFGLKNAGSTYQRLVNRMFADKLGNTMEVYIDDMLVKSLRA 193
Query: 763 GQHSEDLKEIFQQVRKANMRLNPAKCTFGVHAENF 797
H LKE F+ + + M+LNPAKCTFGV + F
Sbjct: 194 TDHLNHLKE*FKTLDEYIMKLNPAKCTFGVTSGEF 298
>TC86737 weakly similar to GP|6683624|dbj|BAA89272.1 Pol {Alternaria
alternata}, partial (21%)
Length = 1540
Score = 89.0 bits (219), Expect = 1e-17
Identities = 49/157 (31%), Positives = 79/157 (50%), Gaps = 1/157 (0%)
Frame = +1
Query: 637 ANTVLVKKSSGKWRMCVDYTDLNMACPKDPYPLPNIDHLIDNAAGYKTLSFMDAYSDYNQ 696
A + V+K G R CVDY LN KD YPLP I + AG + + +D + +++
Sbjct: 577 APVLFVRKPGGGIRFCVDYRALNAITKKDRYPLPLISETLRRVAGARWFTKLDVVAAFHK 756
Query: 697 MKMDLLDAPKIVFMTNQKNYHYEVMSFGLRNAGATFQRSMDTIFSAQIGRNLEVYVDD-L 755
M++ D K F T + + V FGL A ATFQR ++ + + Y+DD L
Sbjct: 757 MRIKDEDQEKTAFRTRYGLFEWIVCPFGLTGAPATFQRYINKTLHEFLDDFVTAYIDDVL 936
Query: 756 VVKTSAEGQHSEDLKEIFQQVRKANMRLNPAKCTFGV 792
+ T ++ H ++ + +++ A + L+P KC F V
Sbjct: 937 IYTTGSKKDHEAQVRRVLRRLADAGLSLDPKKCEFSV 1047
>AL367832 weakly similar to GP|14091845|gb Putative retroelement {Oryza
sativa}, partial (2%)
Length = 384
Score = 84.0 bits (206), Expect = 4e-16
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 2/127 (1%)
Frame = -1
Query: 523 QQERIEP-VEDLKDIMIGTGPNQT-IKIGTSLERAEEKNLIQLLQENADLFAWSPSDMPG 580
+++ I+P E+++ I +GT N+ IK+G +LE ++ + QLL+E D+FA S DMPG
Sbjct: 381 ERKAIQPHQEEIELINLGTEENKREIKVGAALEEGVKRKIFQLLREYLDIFACSYEDMPG 202
Query: 581 IDIKVACHHLAINPSVKPVVQKKRKMGEEKRKAVDEEVRKLKEARFISEIKYPTWLANTV 640
+D K+ H + P PV K R+ + + EV+K +A F+ ++YP W+AN V
Sbjct: 201 LDPKIVEHRIPTKPECPPVR*KLRRTHPDMALKIKSEVQKQIDAGFLMTVEYPEWVANIV 22
Query: 641 LVKKSSG 647
V K G
Sbjct: 21 PVPKKDG 1
>BG644693 weakly similar to GP|18767374|g Putative 22 kDa kafirin cluster;
Ty3-Gypsy type {Oryza sativa}, partial (15%)
Length = 716
Score = 79.0 bits (193), Expect = 1e-14
Identities = 52/177 (29%), Positives = 82/177 (45%)
Frame = +2
Query: 594 PSVKPVVQKKRKMGEEKRKAVDEEVRKLKEARFISEIKYPTWLANTVLVKKSSGKWRMCV 653
P++ P+ ++ K K + +++ L E FI YP + + +KK G RM +
Sbjct: 59 PNMNPI*IPSYRINPLKLKVLKLQLKDLLEKGFIQPSIYP*GVV-VLFLKKKDGFLRMSI 235
Query: 654 DYTDLNMACPKDPYPLPNIDHLIDNAAGYKTLSFMDAYSDYNQMKMDLLDAPKIVFMTNQ 713
DY LN K YPLP ID L DN G K +D +Q ++ D PK F
Sbjct: 236 DYPQLNNVNIKIKYPLPLIDELFDNLQGSKWFFKIDLRLG*HQHRVIGEDVPKTAFRIRY 415
Query: 714 KNYHYEVMSFGLRNAGATFQRSMDTIFSAQIGRNLEVYVDDLVVKTSAEGQHSEDLK 770
+Y VMSFG N F M+ +F + + V+ +D+++ + E +H L+
Sbjct: 416 GHYEILVMSFG*TNPPMAFMELMNRVFQDYLDSLVIVFSNDILIYSKNENEHENHLR 586
>BG587101 similar to GP|6691191|gb F7F22.15 {Arabidopsis thaliana}, partial
(10%)
Length = 624
Score = 70.1 bits (170), Expect = 6e-12
Identities = 38/108 (35%), Positives = 54/108 (49%), Gaps = 5/108 (4%)
Frame = +2
Query: 1221 LYKRGTVTPLLRCLGEEETKLVLLEVHEGVCGSHISGRSLVAKLLRAGYYWPRMTQDCCE 1280
LYK+ +RC+ EEE +L H H + V+K+ +AG++WP M +D
Sbjct: 41 LYKQCADNIYIRCVAEEEIPGILFHCHGSNYAGHFAVSKTVSKIQQAGFWWPTMFKDAHS 220
Query: 1281 FVKKCDKCQR---FSDKKTAPANEL--TSVFSPWPFHKSGVDIVGPFP 1323
F+ KCD CQR S + P N + VF W G+D +GPFP
Sbjct: 221 FISKCDPCQRQGNIS*RNEMPQNFILEVEVFDVW-----GIDFMGPFP 349
>BG587127 weakly similar to PIR|H84506|H84 probable retroelement pol
polyprotein [imported] - Arabidopsis thaliana, partial
(13%)
Length = 415
Score = 61.2 bits (147), Expect = 3e-09
Identities = 29/68 (42%), Positives = 43/68 (62%)
Frame = +3
Query: 1092 DSQLMTNQISGEYQTKDSQLSKYLSSVRNLAAFFNFFEVIYVPREQNVRADLLAKLASTK 1151
DSQL+ +Q SGEY+ +D + YL V+ LA ++F + +PR +NV+AD LA LAS+
Sbjct: 156 DSQLVASQFSGEYEARDELMDTYLKLVQKLAQKLDYFALTRIPRSENVQADALAALASSS 335
Query: 1152 RPGNNRTV 1159
P R +
Sbjct: 336 DPELKRVI 359
>CB065285 weakly similar to PIR|A84500|A84 probable retroelement gag/pol
polyprotein [imported] - Arabidopsis thaliana, partial
(2%)
Length = 592
Score = 49.7 bits (117), Expect = 9e-06
Identities = 25/61 (40%), Positives = 36/61 (58%)
Frame = -2
Query: 1092 DSQLMTNQISGEYQTKDSQLSKYLSSVRNLAAFFNFFEVIYVPREQNVRADLLAKLASTK 1151
DS L+ NQI GE++T + L Y R L +F E+ ++PR++N AD LA L+S
Sbjct: 333 DSALVINQIKGEWETHHANLIPYRDYARRLLTYFTKVELHHIPRDENQMADALATLSSMF 154
Query: 1152 R 1152
R
Sbjct: 153 R 151
>BG644740 similar to PIR|A84460|A84 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (4%)
Length = 754
Score = 45.4 bits (106), Expect = 2e-04
Identities = 19/41 (46%), Positives = 25/41 (60%)
Frame = -1
Query: 637 ANTVLVKKSSGKWRMCVDYTDLNMACPKDPYPLPNIDHLID 677
A + V+K G +RMC+DY N K+ YPLP ID+L D
Sbjct: 238 AALLFVRKKDGYFRMCIDYRQFNKVTTKNKYPLPRIDNLFD 116
>BG586308 weakly similar to PIR|F84528|F8 probable retroelement pol polyprotein
[imported] - Arabidopsis thaliana, partial (7%)
Length = 686
Score = 41.6 bits (96), Expect = 0.002
Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 2/64 (3%)
Frame = -3
Query: 1485 VLWSYHTTPHSTTGETPFTMVYGADAMLPVEIDTPTWRRENF--NEEANEVGIQCTMDMI 1542
VLWS+ T P T TPF+M + +AM P E++ + R N E N + ++ I
Sbjct: 465 VLWSHRTNPRGATKSTPFSMAHRVEAMAPAEVNVTSL*RSRMPQNIELNNDRLFNALETI 286
Query: 1543 DEIR 1546
+E R
Sbjct: 285 EERR 274
>BG644720
Length = 678
Score = 40.8 bits (94), Expect = 0.004
Identities = 29/88 (32%), Positives = 35/88 (38%)
Frame = -3
Query: 1184 WMAPILKYLTGSFDPVSEEEKQLVRKRAAKFTIVAGKLYKRGTVTPLLRCLGEEETKLVL 1243
W PI+ Y+ + +R+ LY LL CLGEEET
Sbjct: 463 WRQPIIIYMCYGIFLENPRRSTYIRRHTPHLH*YNETLYI*LFEGVLL*CLGEEETIQAF 284
Query: 1244 LEVHEGVCGSHISGRSLVAKLLRAGYYW 1271
H VCGSH S L + R GYYW
Sbjct: 283 Q*AHSRVCGSHKS--KLYFHIKRMGYYW 206
>TC77595 weakly similar to PIR|T18350|T18350 probable pol polyprotein - rice
blast fungus gypsy retroelement (fragment), partial (14%)
Length = 1708
Score = 37.7 bits (86), Expect = 0.035
Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 1/92 (1%)
Frame = +2
Query: 1234 LGEEETKLVLLEVHEGVCGSHISGRSLVAKLLRAGYYWPRMTQDCCEFVKKCDKCQRFSD 1293
L E TKLV E H+ H GR+ +++ ++WP +Q FV+ CD C
Sbjct: 158 LNELRTKLVQ-ESHDSTAAGH-PGRNGTLEIVSRKFFWPGQSQTVRRFVRNCDVCGGIHI 331
Query: 1294 KKTAPANELTSVFSPWPFHKS-GVDIVGPFPP 1324
+ A L + P H +D + PP
Sbjct: 332 WRQAKRGFLKPLPVPNRLHSDLSMDFITSLPP 427
>BG586289 weakly similar to PIR|H86337|H86 protein F5M15.26 [imported] -
Arabidopsis thaliana, partial (2%)
Length = 136
Score = 33.1 bits (74), Expect = 0.86
Identities = 12/37 (32%), Positives = 25/37 (67%)
Frame = +2
Query: 359 PLVVSFQLLNWEIKRVLVDTGSSANVLYYDAFSKMGL 395
P ++ + + ++ RVL+DTGS+ NV++ D ++M +
Sbjct: 26 PFDINLVIRDLKVGRVLIDTGSTVNVIFRDTLNRMSI 136
>TC89211 similar to GP|3668093|gb|AAC61825.1| unknown protein {Arabidopsis
thaliana}, partial (18%)
Length = 579
Score = 32.0 bits (71), Expect = 1.9
Identities = 21/61 (34%), Positives = 30/61 (48%), Gaps = 2/61 (3%)
Frame = +3
Query: 838 HYPDSFLAQGTKLSLSLPQ*RKRKNLNGLRNVTKLFRKSRFFSPR--HLFFIAQHGGRSC 895
H+P++ L TK S ++L LR++ L +RFF R HL F +H RS
Sbjct: 99 HFPNTILHDDTKTSFQTNPNHLLRSLTSLRSLQHLLPTNRFFHLRFLHLLFQLRHFLRSP 278
Query: 896 L 896
L
Sbjct: 279 L 281
>BQ751067 similar to GP|18461169|dbj hypothetical protein~similar to
Arabidopsis thaliana chromosome 5 MCL19.15, partial (3%)
Length = 754
Score = 30.0 bits (66), Expect = 7.3
Identities = 15/46 (32%), Positives = 20/46 (42%), Gaps = 3/46 (6%)
Frame = +1
Query: 1284 KCDKCQRFSDKKTAPANELTSVF---SPWPFHKSGVDIVGPFPPAP 1326
K + +R+ AN+ SV SPWP H+ P PP P
Sbjct: 70 KIESPRRYLSNNKRQANDFLSVPRN*SPWPMHRDPT*ASSPLPPTP 207
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.339 0.147 0.477
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 46,028,644
Number of Sequences: 36976
Number of extensions: 639181
Number of successful extensions: 4403
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 2396
Number of HSP's successfully gapped in prelim test: 181
Number of HSP's that attempted gapping in prelim test: 1954
Number of HSP's gapped (non-prelim): 2690
length of query: 1546
length of database: 9,014,727
effective HSP length: 109
effective length of query: 1437
effective length of database: 4,984,343
effective search space: 7162500891
effective search space used: 7162500891
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.9 bits)
S2: 65 (29.6 bits)
Medicago: description of AC125481.2