
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146758.6 + phase: 0 /pseudo
(515 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imp... 184 7e-47
CB893203 168 2e-44
BF650289 weakly similar to GP|9049283|dbj| orf129a {Beta vulgari... 138 6e-33
BG447966 weakly similar to PIR|G96509|G96 protein F27F5.21 [impo... 119 3e-27
BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-li... 110 1e-24
TC90516 103 2e-22
BG586267 weakly similar to PIR|A84888|A8 hypothetical protein At... 94 1e-19
TC81386 similar to GP|10140689|gb|AAG13524.1 putative non-LTR re... 81 1e-15
AW736247 weakly similar to PIR|T14619|T146 reverse transcriptase... 76 4e-14
BG587108 similar to PIR|G96509|G9 protein F27F5.21 [imported] - ... 76 4e-14
BI310630 similar to GP|18087548|gb AT5g55600/MDF20_4 {Arabidopsi... 29 3.8
TC92645 29 5.0
TC83151 29 5.0
CA923091 weakly similar to GP|9294383|dbj cytochrome P450 {Arabi... 28 6.5
>TC83437 weakly similar to PIR|D86384|D86384 unknown protein [imported] -
Arabidopsis thaliana, partial (6%)
Length = 951
Score = 184 bits (467), Expect = 7e-47
Identities = 107/157 (68%), Positives = 120/157 (76%), Gaps = 2/157 (1%)
Frame = +2
Query: 347 FRFVSEFHRNGKLSKGINSTFIALIPKVDNPQRLNDFRPISLVGSLYKILAKLLANRLRV 406
FRFVSEFHRN KL KGINSTFIALIPKVDNPQRLNDFRPISLVGSLYKIL KLLANRLRV
Sbjct: 2 FRFVSEFHRNRKLFKGINSTFIALIPKVDNPQRLNDFRPISLVGSLYKILGKLLANRLRV 181
Query: 407 VIGSVISDAQSAFVKNRQILDGILIANEAV-DEARKLKKDLLLFKVDFEKAYD-SVDWGY 464
VIGSVISDAQSAFVKNRQIL+ + + + R +K + ++ S+ +
Sbjct: 182 VIGSVISDAQSAFVKNRQILEMVFL*QMRLWMRLRN*RKIFCCLRWILKRLITLSIGLIW 361
Query: 465 LDSVMGRMSFPVLWRKWIKVCVSTATASVLVNGSPTD 501
+ +G MSF VLWRKWIK CVSTAT SVLVNGSPT+
Sbjct: 362 ILF*VG-MSFLVLWRKWIKECVSTATTSVLVNGSPTN 469
>CB893203
Length = 800
Score = 168 bits (425), Expect(2) = 2e-44
Identities = 72/132 (54%), Positives = 98/132 (73%)
Frame = -3
Query: 73 LSEDWCLVWPNCLQIAQMRGLSDHCPLILSVAEENWGPRPSRMLKCWTDIPGYRQFVSNK 132
+SE WC+ WPN +Q A RGLSD CP++L++ E NWGPR MLKCW D+PGY FV +K
Sbjct: 798 VSESWCIKWPNMIQRALFRGLSDRCPIMLTIDEGNWGPRLHHMLKCWADLPGYHLFVKDK 619
Query: 133 WKSFQVSGWGGYVLKEKFKLLKISLKEWHASHS*NLLGKISDLKERLATLDEKGESMLLT 192
W SFQVS WGGYVLKEK K+++ +LK+WH +H+ NL +I+D++E +A LD KGE + L
Sbjct: 618 WNSFQVSRWGGYVLKEK*KMVRNNLKDWHHNHTRNLDARINDIRESIAYLDSKGEDLTLD 439
Query: 193 DEECAEIHGVSS 204
EE ++H +S+
Sbjct: 438 TEEVDDLHTLSA 403
Score = 29.6 bits (65), Expect(2) = 2e-44
Identities = 13/17 (76%), Positives = 16/17 (93%)
Frame = -1
Query: 208 SLSRLNTSICWQQSRIQ 224
SLS+LNTSI WQ+SRI+
Sbjct: 392 SLSKLNTSIQWQKSRIK 342
>BF650289 weakly similar to GP|9049283|dbj| orf129a {Beta vulgaris}, partial
(69%)
Length = 616
Score = 138 bits (347), Expect = 6e-33
Identities = 73/199 (36%), Positives = 117/199 (58%), Gaps = 1/199 (0%)
Frame = +3
Query: 303 LIKPFSIDEVKAAVWDCDSYKSPGPDGINFGFIKEFWLDLKDEIFRFVSEFHRNGKLSKG 362
L F+ EVK A++ DS K+PG DG N F K W + D + + +F + G + K
Sbjct: 12 LCSEFTAVEVKNALFSMDSSKAPGIDGYNVHFFKCSWNIIGDSVIDAILDFFKTGFMPKI 191
Query: 363 INSTFIALIPKVDNPQRLNDFRPISLVGSLYKILAKLLANRLRVVIGSVISDAQSAFVKN 422
IN T++ L+PK N + +FRPI+ +YKI++K+L +R++ V+ SV+S+ QSAFVK
Sbjct: 192 INCTYVTLLPKEVNVTSVKNFRPIACCSVIYKIISKILTSRMQGVLNSVVSENQSAFVKG 371
Query: 423 RQILDGILIANEAVDE-ARKLKKDLLLFKVDFEKAYDSVDWGYLDSVMGRMSFPVLWRKW 481
R I D I++++E V +RK + K+D KAYDS +W ++ +M + FP + W
Sbjct: 372 RVIFDNIILSHELVKSYSRKGISPRCMVKIDLXKAYDSXEWPFIKHLMLELGFPYKFVNW 551
Query: 482 IKVCVSTATASVLVNGSPT 500
+ ++TA+ + NG T
Sbjct: 552 VMAXLTTASYTFNXNGDLT 608
>BG447966 weakly similar to PIR|G96509|G96 protein F27F5.21 [imported] -
Arabidopsis thaliana, partial (7%)
Length = 687
Score = 119 bits (298), Expect = 3e-27
Identities = 75/210 (35%), Positives = 112/210 (52%), Gaps = 2/210 (0%)
Frame = +2
Query: 225 WLREGDANSKFFHSVLASRRRRNSLCTIVVD-GVVVEGVHPIREAVFCHFENHFRSVNVE 283
WL++GD N+KFFHS + RR+ N + + + G +G + + +F N F S N
Sbjct: 5 WLKDGDKNTKFFHSKASQRRKVNEIKKLKDETGNWCKGEENVERLLITYFNNLFTSSNPT 184
Query: 284 RPSLT-NLQF*SLSVAEGVGLIKPFSIDEVKAAVWDCDSYKSPGPDGINFGFIKEFWLDL 342
T + LS V K F+ +EV A+ K+PGPDG+ F +++W +
Sbjct: 185 AIEETCEVVKGKLSHEHIVWCEKEFTEEEVLEAINQMHPVKAPGPDGLPALFFQKYWHIV 364
Query: 343 KDEIFRFVSEFHRNGKLSKGINSTFIALIPKVDNPQRLNDFRPISLVGSLYKILAKLLAN 402
E+ + V + N ++ +N TFI LIPK NP D+RPISL + KI+ K++AN
Sbjct: 365 GKEVQQMVLQVLNNSMETEELNKTFIVLIPKGKNPNTPKDYRPISLCNVVMKIITKVIAN 544
Query: 403 RLRVVIGSVISDAQSAFVKNRQILDGILIA 432
R++ + VI QSAFV+ R I D LIA
Sbjct: 545 RVKQTLPDVIDVEQSAFVQGRLITDNALIA 634
>BG586266 similar to GP|7267666|em RNA-directed DNA polymerase-like protein
{Arabidopsis thaliana}, partial (18%)
Length = 789
Score = 110 bits (275), Expect = 1e-24
Identities = 58/141 (41%), Positives = 88/141 (62%), Gaps = 3/141 (2%)
Frame = -3
Query: 378 QRLNDFRPISLVGSLYKILAKLLANRLRVVIGSVISDAQSAFVKNRQILDGILIANEAVD 437
+R++++R I+ + YKI+AK+L+ R++ ++ S+IS +QSAFV R I D +LI ++ +
Sbjct: 784 KRVSEYRTIAPCNTQYKIIAKILSKRMQPLLRSIISPSQSAFVPGRAISDNVLITHKILH 605
Query: 438 EARK--LKKDL-LLFKVDFEKAYDSVDWGYLDSVMGRMSFPVLWRKWIKVCVSTATASVL 494
R+ KK + + K D KAYD + W +L V+ R+ F +W WI CVST + S L
Sbjct: 604 YLRQSGAKKHVSMAVKTDMTKAYDRIAWNFLREVLTRLGFHGIWISWIMECVSTVSYSFL 425
Query: 495 VNGSPTDEFPFRRGLRQGDPL 515
+NG P RGLRQGDPL
Sbjct: 424 INGGPQGRVLPSRGLRQGDPL 362
>TC90516
Length = 983
Score = 103 bits (256), Expect = 2e-22
Identities = 88/287 (30%), Positives = 136/287 (46%)
Frame = -2
Query: 227 REGDANSKFFHSVLASRRRRNSLCTIVVDGVVVEGVHPIREAVFCHFENHFRSVNVERPS 286
REG N+ +F + + +R RNS+ V+ + + V ++A+ F +HF RP+
Sbjct: 754 REGH-NTTYFLACVKNRGMRNSISAPRVERWM-DDVAKNKQAIVNFFTHHFSDP*TYRPT 581
Query: 287 LTNLQF*SLSVAEGVGLIKPFSIDEVKAAVWDCDSYKSPGPDGINFGFIKEFWLDLKDEI 346
+ ++ F +S + V L F + +++ V D +SP PDG N F LK +I
Sbjct: 580 MGDIDFFHISNLDNVLLSAQFLVSKMELVVSSLDGNESPRPDGFNLNFFIRLRNMLKADI 401
Query: 347 FRFVSEFHRNGKLSKGINSTFIALIPKVDNPQRLNDFRPISLVGSLYKILAKLLANRLRV 406
+F+ L K + F+ LI KV+ P L DF +S +GSL K++AK+LA RL
Sbjct: 400 EIMFEQFYTPANLLKIFS*YFLTLISKVEYPILLGDFSLMSFLGSL*KLMAKVLALRLAH 221
Query: 407 VIGSVISDAQSAFVKNRQILDGILIANEAVDEARKLKKDLLLFKVDFEKAYDSVDWGYLD 466
++ +I QS FV+ RQ +DG++ NE + L D G +D
Sbjct: 220 IMEKIIFVNQSTFVRGRQHVDGVVAINEII----VLMGD-----------------GVVD 104
Query: 467 SVMGRMSFPVLWRKWIKVCVSTATASVLVNGSPTDEFPFRRGLRQGD 513
VCV A +LVN SPT E ++GL+QGD
Sbjct: 103 ---------------*SVCVYK*PA-ILVNCSPT*EIDIQKGLKQGD 11
>BG586267 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
[imported] - Arabidopsis thaliana, partial (16%)
Length = 794
Score = 94.4 bits (233), Expect = 1e-19
Identities = 67/197 (34%), Positives = 102/197 (51%), Gaps = 10/197 (5%)
Frame = +3
Query: 182 LDEKGESM-LLTDEECAEIHGVSSDILSLSRLNTSICW-QQSRIQWLREGDANSKFFHSV 239
L KGE + LL E E H N I W Q+SR+ WLR GD N+KFFH+V
Sbjct: 60 LFNKGEELSLLRSELNEEYH------------NEEIFWMQKSRLNWLRSGDRNTKFFHAV 203
Query: 240 LASRRRRNSLCTIVVDG----VVVEGVHPIREAVFCHFENHFRSVNVERPSLTNLQF*SL 295
+RR +N + +++ D V E + + ++ HF+ + S +V +T + S+
Sbjct: 204 TKNRRAQNRILSLIDDDDKEWFVEEDLGRLADS---HFKLLYSSEDV---GITLEDWNSI 365
Query: 296 SVA----EGVGLIKPFSIDEVKAAVWDCDSYKSPGPDGINFGFIKEFWLDLKDEIFRFVS 351
+ L+ S +EV+ AV+D + +K PGPDG+N F ++FW + D++
Sbjct: 366 PAIVTEEQNAQLMAQISREEVREAVFDINPHKCPGPDGMNVFFFQQFWDTMGDDLTSMAQ 545
Query: 352 EFHRNGKLSKGINSTFI 368
EF R GKL +GIN T I
Sbjct: 546 EFLRTGKLEEGINKTNI 596
Score = 57.8 bits (138), Expect = 1e-08
Identities = 31/77 (40%), Positives = 48/77 (62%)
Frame = +1
Query: 357 GKLSKGINSTFIALIPKVDNPQRLNDFRPISLVGSLYKILAKLLANRLRVVIGSVISDAQ 416
G L + L+PK +RL +FRPISL YKI++K+L+ RL+ V+ +I++ Q
Sbjct: 562 GNLKRESTKQTSGLVPKKLEAKRLVEFRPISLCNVAYKIVSKVLSKRLKSVLPWIITETQ 741
Query: 417 SAFVKNRQILDGILIAN 433
+AF + + I D ILIA+
Sbjct: 742 AAFGRRQLISDNILIAH 792
>TC81386 similar to GP|10140689|gb|AAG13524.1 putative non-LTR retroelement
reverse transcriptase {Oryza sativa (japonica
cultivar-group)}, partial (1%)
Length = 798
Score = 80.9 bits (198), Expect = 1e-15
Identities = 68/250 (27%), Positives = 113/250 (45%), Gaps = 7/250 (2%)
Frame = -1
Query: 7 CVCGDFNAIRSREERRSVSVGQVASDFSHFNSFIDNNVLVDLPLGGRNFTWF---KGDGK 63
C GDFN I E + S F + D N L LP G FTW +G
Sbjct: 738 CCIGDFNTILGSHEHQG-SHTPARLPMLDFQQWSDVNNLFHLPTRGSAFTWTNGRRGRNN 562
Query: 64 SMSRLDRFLLSEDWCLVWPNCLQIAQM---RGLSDHCPLILSVAEENWGPRPS-RMLKCW 119
+ RLDR ++++ L+ NC ++ + SDH P++ + +N S + +K W
Sbjct: 561 TRKRLDRSIVNQ---LMIDNCESLSACTLTKLRSDHFPILFELQTQNIQFSSSFKFMKMW 391
Query: 120 TDIPGYRQFVSNKWKSFQVSGWGGYVLKEKFKLLKISLKEWHASHS*NLLGKISDLKERL 179
+ P + W + QV G +VL +K K LK LK W+ + N+ ++ + + L
Sbjct: 390 SAHPDCINIIKQCWAN-QVVGCPMFVLNQKLKNLKEVLKVWNKNTFGNVHSQVDNAYKEL 214
Query: 180 ATLDEKGESMLLTDEECAEIHGVSSDILSLSRLNTSICWQQSRIQWLREGDANSKFFHSV 239
+ K +S+ +D + ++ S + ++S++ W EGD N+ FFH V
Sbjct: 213 DDIQVKIDSIGYSDVLMDQEKAAQLNLESALNIEEVFWHEKSKVNWHCEGDRNTAFFHRV 34
Query: 240 LASRRRRNSL 249
A +R +SL
Sbjct: 33 -AKIKRTSSL 7
>AW736247 weakly similar to PIR|T14619|T146 reverse transcriptase - beet
retrotransposon (fragment), partial (4%)
Length = 305
Score = 75.9 bits (185), Expect = 4e-14
Identities = 35/58 (60%), Positives = 42/58 (72%), Gaps = 2/58 (3%)
Frame = +1
Query: 307 FSIDEVKAAVWDCDSYKS--PGPDGINFGFIKEFWLDLKDEIFRFVSEFHRNGKLSKG 362
FS +EV+ AVWDCDS S PGPDG+NF F+KE+W +K + R V EFH NGKL KG
Sbjct: 121 FSEEEVRKAVWDCDSSHS*NPGPDGVNFTFVKEYWELIKVDFLRVVMEFHTNGKLGKG 294
>BG587108 similar to PIR|G96509|G9 protein F27F5.21 [imported] - Arabidopsis
thaliana, partial (17%)
Length = 677
Score = 75.9 bits (185), Expect = 4e-14
Identities = 38/87 (43%), Positives = 56/87 (63%)
Frame = +3
Query: 334 FIKEFWLDLKDEIFRFVSEFHRNGKLSKGINSTFIALIPKVDNPQRLNDFRPISLVGSLY 393
F + W +K ++ + V+ F +GKL +N+T I LIPK P R+ + RPISL Y
Sbjct: 411 FFQHSWHIIKMDLLKMVNSFLASGKLDTRLNTTNICLIPKKKRPTRMTELRPISLCNVGY 590
Query: 394 KILAKLLANRLRVVIGSVISDAQSAFV 420
KI++K+L RL+V + S+IS+ QSAFV
Sbjct: 591 KIISKVLCQRLKVCLPSLISETQSAFV 671
Score = 44.7 bits (104), Expect = 9e-05
Identities = 36/148 (24%), Positives = 65/148 (43%), Gaps = 3/148 (2%)
Frame = +2
Query: 218 WQQ-SRIQWLREGDANSKFFHSVLASRRRRNSLCTIV-VDGVVVEGVHPIREAVFCHFEN 275
WQQ SR W GD N KF+H++ R RN + + DG + + + +FE+
Sbjct: 53 WQQKSRNMWHISGDLNKKFYHALTKQRHARNRIVGLYDYDGNWITEEQGVEKVAVDYFED 232
Query: 276 HF-RSVNVERPSLTNLQF*SLSVAEGVGLIKPFSIDEVKAAVWDCDSYKSPGPDGINFGF 334
F R+ + S++ L++ + +EV+ A++ K+PGPDG
Sbjct: 233 LFQRTTPTGFDGFLDEITSSITPQMNQRLLRLATEEEVRLALFIMHPEKAPGPDG----- 397
Query: 335 IKEFWLDLKDEIFRFVSEFHRNGKLSKG 362
+ +F + +++NG + G
Sbjct: 398 -------MTTLLFSTLMAYNKNGSIENG 460
>BI310630 similar to GP|18087548|gb AT5g55600/MDF20_4 {Arabidopsis thaliana},
partial (19%)
Length = 730
Score = 29.3 bits (64), Expect = 3.8
Identities = 17/51 (33%), Positives = 24/51 (46%)
Frame = +1
Query: 125 YRQFVSNKWKSFQVSGWGGYVLKEKFKLLKISLKEWHASHS*NLLGKISDL 175
+RQ NK K F +S GY + L++ SHS +L G+ DL
Sbjct: 70 FRQLKGNKVKPFDLSKLRGYYTQPALSSLRVDTIHNTESHSNSLTGEDEDL 222
>TC92645
Length = 561
Score = 28.9 bits (63), Expect = 5.0
Identities = 26/85 (30%), Positives = 38/85 (44%), Gaps = 5/85 (5%)
Frame = +3
Query: 303 LIKPFSIDEVKAAVWDCDSYKSPGPDGINFGFIKEFWLDLKDEIFRFVSEFHRNG----- 357
L PFS+DE+ A++ K+ G DG+N F +L +V + G
Sbjct: 306 LTAPFSVDEIWTAIFQMHHDKAFGLDGLNLAVYYRF-*NLVGGDTNYVGLYSLVG*R*VA 482
Query: 358 KLSKGINSTFIALIPKVDNPQRLND 382
L G N I L+PK D+P + D
Sbjct: 483 *LYWGTN---IVLVPKCDSPSTMWD 548
>TC83151
Length = 617
Score = 28.9 bits (63), Expect = 5.0
Identities = 13/29 (44%), Positives = 18/29 (61%)
Frame = -3
Query: 407 VIGSVISDAQSAFVKNRQILDGILIANEA 435
V V SDA SA V+ Q++DG+L E+
Sbjct: 585 VTSEVSSDASSAIVQREQVVDGLLSGRES 499
>CA923091 weakly similar to GP|9294383|dbj cytochrome P450 {Arabidopsis
thaliana}, partial (40%)
Length = 813
Score = 28.5 bits (62), Expect = 6.5
Identities = 12/19 (63%), Positives = 13/19 (68%)
Frame = -3
Query: 125 YRQFVSNKWKSFQVSGWGG 143
+R F SNKWKSF S W G
Sbjct: 367 WRNFQSNKWKSFIFSIWMG 311
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.324 0.140 0.439
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 17,414,322
Number of Sequences: 36976
Number of extensions: 258070
Number of successful extensions: 1539
Number of sequences better than 10.0: 28
Number of HSP's better than 10.0 without gapping: 1513
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1533
length of query: 515
length of database: 9,014,727
effective HSP length: 100
effective length of query: 415
effective length of database: 5,317,127
effective search space: 2206607705
effective search space used: 2206607705
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 60 (27.7 bits)
Medicago: description of AC146758.6