
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147010.11 - phase: 0 /pseudo
(2172 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value
CB065285 weakly similar to PIR|A84500|A84 probable retroelement ...   302  1e-81
AW684517 similar to GP|9927273|dbj Similar to Arabidopsis thalia...   258  2e-68
AL367832 weakly similar to GP|14091845|gb Putative retroelement ...   174  3e-43
AL375832                                                              159  1e-38
AL366605                                                              153  8e-37
CB065382 similar to PIR|T09671|T096 RPE15 protein - alfalfa (fra...   152  1e-36
BE942785                                                              133  7e-31
AL375831                                                               90  4e-29
AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis...    91  5e-18
TC86651 similar to PIR|F84555|F84555 similar to prolyl 4-hydroxy...    31  6.0
>CB065285 weakly similar to PIR|A84500|A84 probable retroelement gag/pol
polyprotein [imported] - Arabidopsis thaliana, partial
(2%)
Length = 592
Score = 302 bits (773), Expect = 1e-81
Identities = 164/195 (84%), Positives = 168/195 (86%)
Frame = -3
Query: 1573 NPKTATNR*LMKVQIPIASGV*SLMVLSMLMVKELGQSLYPHRGITFLLPPEFCSNVQTI 1632
N KTA NR*L+KVQIPIA+GV*SLMVLSMLMVKEL QSLYP RGIT LLPP FCSNVQTI
Sbjct: 590 NQKTAKNR*LVKVQIPIANGV*SLMVLSMLMVKELEQSLYPRRGITSLLPPGFCSNVQTI 411
Query: 1633 WPSMKHVSLGSRKQLT*ESNTSISMEILRSSSIR*RVNGRPTMLS*FLIVIMRDVC*HIL 1692
W SMKHVSLGSRKQLT*ESNTS SMEIL SS R RVNGRPTM *FL MR VC* IL
Sbjct: 410 WLSMKHVSLGSRKQLT*ESNTSTSMEILHLSSTRSRVNGRPTMPI*FLTATMRGVC*PIL 231
Query: 1693 QRLSCTIFLVMRTKWLMLLLLCPPCFE*TIGMMCQ*SKCNASKDLRMCLLLGM*SIRLVK 1752
Q+LSCTIFLV RTKWLML LL PPCFE*TIGMMCQ*SKCNA KDL MCLLLGM*SIR VK
Sbjct: 230 QKLSCTIFLVTRTKWLMLWLLYPPCFE*TIGMMCQ*SKCNALKDLHMCLLLGM*SIRPVK 51
Query: 1753 MWLTIDPGIMTSNSS 1767
MWLTI+PG TSNSS
Sbjct: 50 MWLTINPGTTTSNSS 6
>AW684517 similar to GP|9927273|dbj Similar to Arabidopsis thaliana
chromosome II BAC F26H6; putative retroelement pol
polyprotein, partial (1%)
Length = 488
Score = 258 bits (658), Expect = 2e-68
Identities = 130/187 (69%), Positives = 145/187 (77%)
Frame = +1
Query: 737 VTFQVMDINASYSCLLGRPWIHDAGAVTSTLHQKLKFIRNGKLVTVHGEEAYLVSQLSSF 796
+TFQVMDINASYSCLLGRPWIHDAGAVTSTLHQKLKF++NG
Sbjct: 1 ITFQVMDINASYSCLLGRPWIHDAGAVTSTLHQKLKFVKNG------------------- 123
Query: 797 SCIEAGSAEGTAFQGLTIEGAEPKKAGAAMASLKDAQKVIQDGQTAGWGKVIQLCENKRK 856
SAEGTAFQGL++EGAEPKK GAAMASLKDAQK +Q+GQ A WGK+IQLCENKRK
Sbjct: 124 ------SAEGTAFQGLSMEGAEPKKVGAAMASLKDAQKAVQEGQAADWGKLIQLCENKRK 285
Query: 857 EGLGFSPSSKVSSGVFHSAGFVNAISEEANGSGLRPAFVTPGGIARDWDAIDIPSIMHVS 916
EGL FSP+S VS+G FHSAGFVN ++EE RP FV PGGIA+DWDA+D+PSIMHVS
Sbjct: 286 EGLRFSPTSGVSTGTFHSAGFVNTLAEEVARFVPRPLFVIPGGIAKDWDAVDVPSIMHVS 465
Query: 917 E*CVLLL 923
*CVLLL
Sbjct: 466 X*CVLLL 486
>AL367832 weakly similar to GP|14091845|gb Putative retroelement {Oryza
sativa}, partial (2%)
Length = 384
Score = 174 bits (442), Expect = 3e-43
Identities = 100/127 (78%), Positives = 105/127 (81%)
Frame = -2
Query: 1055 KKRKPFNLIRKRLSLST*VPRRTSGKSRSVLL*GKRLKVRSSSSSGNIRIFSHGRMKICQ 1114
KK KPF+L RKRLSLST*VPRRTS +SRSVLL* K LK RSSSSS N IF H RMKICQ
Sbjct: 383 KKGKPFSLTRKRLSLST*VPRRTSERSRSVLL*RKGLKGRSSSSSENTWIFLHARMKICQ 204
Query: 1115 V*TL*LWSIGSPPSLNVLPSGRS*EGLIQIWLSRSRAKFKSRLTRVSS*QLSTLNGLLIL 1174
V* L LW+IGS SLNVLPSGR+*EGLIQIWLSR R +FKSRL RV S*QLS LNGL IL
Sbjct: 203 V*ILKLWNIGSQQSLNVLPSGRN*EGLIQIWLSRLRVRFKSRLMRVFS*QLSILNGLPIL 24
Query: 1175 CLCQRRM 1181
LCQRRM
Sbjct: 23 YLCQRRM 3
>AL375832
Length = 535
Score = 159 bits (402), Expect = 1e-38
Identities = 92/134 (68%), Positives = 95/134 (70%), Gaps = 6/134 (4%)
Frame = +3
Query: 877 FVNAISEEANGSGLRPAFVTPGGIARDWDAIDIPSIMHVSE*CVLLL*RFVFKNNNPLAL 936
FVNAISEEA GSGLRPAFVTPGGIA DWDAIDIPSIMHVSE*CVLLL*RFVFKNNNPLAL
Sbjct: 3 FVNAISEEATGSGLRPAFVTPGGIASDWDAIDIPSIMHVSE*CVLLL*RFVFKNNNPLAL 182
Query: 937 PKARVIFSVRACLFRHCR**IKIVVSFTTAFF*CLFLGKMVMPEKNQNVCIFLL------ 990
PKARVIFSVRACLFRHCR K F FF F + +K FL+
Sbjct: 183 PKARVIFSVRACLFRHCRLMNKNCRFFHDCFFFSAFFLEKW*CQKKPKRLYFLINFKNLH 362
Query: 991 ISKICMKKKKLFLK 1004
K K KKLFLK
Sbjct: 363 EKKKIQKTKKLFLK 404
Score = 70.5 bits (171), Expect = 7e-12
Identities = 44/69 (63%), Positives = 44/69 (63%), Gaps = 5/69 (7%)
Frame = +2
Query: 967 FF*CLFLGKMVMPEKNQNVCIFLLISKICMKKK-----KLFLKIIQSLYAG*TIINPLNT 1021
FF*CLFLGKMVMPEK K KKK K KIIQSLYAG*TIINPLNT
Sbjct: 275 FF*CLFLGKMVMPEKT*TSVFSY*FQKPA*KKKNTKNKKTLSKIIQSLYAG*TIINPLNT 454
Query: 1022 VILRFLQTL 1030
VIL QTL
Sbjct: 455 VILCLFQTL 481
Score = 64.3 bits (155), Expect = 5e-10
Identities = 50/100 (50%), Positives = 54/100 (54%), Gaps = 6/100 (6%)
Frame = +1
Query: 955 **IKIVVSFTTAFF*CLFLGKMVMPEKNQNVCIFLLISKICMKKKKLFLKIIQSLYAG*T 1014
**IKIVVSFTTAFF F K KN NVCIFLLISK CMKKKK + K S
Sbjct: 238 **IKIVVSFTTAFFLVPFSWKNGNARKNLNVCIFLLISKTCMKKKK-YKKQKNSF*NHPI 414
Query: 1015 II------NPLNTVILRFLQTLNSRYMKPKMRRVMTYHMR 1048
II P+ L +SRYM + RVM Y MR
Sbjct: 415 IICRLNHNKPVEHSNSMPLPNFDSRYMNRR*GRVMIYRMR 534
>AL366605
Length = 422
Score = 153 bits (386), Expect = 8e-37
Identities = 84/119 (70%), Positives = 95/119 (79%)
Frame = -1
Query: 5 CQKWMFLKSSRFRTSTGTTG*LVLKITSSNTSGRWAITKTMIPL*STVFRIV*WRMLQSG 64
CQ+WM L++S+ + STGTTG*LV K TSSNTS RWAITKTMIPL*ST RIV*W+MLQSG
Sbjct: 359 CQRWMSLRNSKSQISTGTTG*LVPKTTSSNTSERWAITKTMIPL*STASRIV*WKMLQSG 180
Query: 65 ILV*ARMTSIPLMN*PPLSRAIMGLTLD*GQIGSFSGPYLRRRRKVSVNMRKGGEGRLP 123
ILV*A+M SIP MN*P LSRA MGLT D* + G+ S LRRRRK SVN + G G+LP
Sbjct: 179 ILV*AKMISIPSMN*PLLSRATMGLTPD*SRTGNSSDLSLRRRRKASVNTHRDGGGQLP 3
>CB065382 similar to PIR|T09671|T096 RPE15 protein - alfalfa (fragment),
partial (9%)
Length = 624
Score = 152 bits (385), Expect = 1e-36
Identities = 100/173 (57%), Positives = 113/173 (64%)
Frame = -3
Query: 271 RFNNNNNKISKLDLLSLRYRCYMLSCFQLYSREGIVQLDRASPHLIRCLQGSGLISNVIF 330
RF NN++ ++LDL L Y+CYMLS +LY EG Q DR S HLIR L GSGL SNVI
Sbjct: 517 RFTNNDSS-NRLDLPFL*YQCYMLSSSRLYFSEGTAQSDRGSLHLIRYLPGSGLTSNVIS 341
Query: 331 IRVP*VMMLRGVML*STS*RSLLIKGS*LLRIMFHMSSIILFQIMPL*I*SKCARKLLDL 390
I+V VMM + ML*+T *RS L K S*LLR MSS LFQI L* * + RKL DL
Sbjct: 340 IKVLWVMMSKDAML*NTL*RSSLTKES*LLRTTSLMSSTTLFQITLL*T*LRYMRKLPDL 161
Query: 391 MSATSQLLWYLYTSSCAKLLCSAMIMPSVWDASVILWVVILFKTTSKA**MII 443
S QLLW YTSS AK LC MIMP + S+ILWV LFK TS+A** I
Sbjct: 160 TSVMLQLLWCHYTSSYAKPLCLTMIMPIAKNDSIILWVAALFKMTSRA**TTI 2
>BE942785
Length = 460
Score = 133 bits (335), Expect = 7e-31
Identities = 62/82 (75%), Positives = 73/82 (88%)
Frame = -2
Query: 776 NGKLVTVHGEEAYLVSQLSSFSCIEAGSAEGTAFQGLTIEGAEPKKAGAAMASLKDAQKV 835
+G LVT+HGEEAYL+SQLSSFSCIEAGSAEGTAFQGLT+EG EPK+ G AMASLKDAQ+
Sbjct: 378 SGXLVTIHGEEAYLISQLSSFSCIEAGSAEGTAFQGLTVEGTEPKRDGTAMASLKDAQRA 199
Query: 836 IQDGQTAGWGKVIQLCENKRKE 857
+Q+ Q AGWG++IQL ENK K+
Sbjct: 198 VQESQAAGWGRLIQLRENKHKD 133
>AL375831
Length = 467
Score = 89.7 bits (221), Expect(2) = 4e-29
Identities = 44/44 (100%), Positives = 44/44 (100%)
Frame = +3
Query: 1 TFA*CQKWMFLKSSRFRTSTGTTG*LVLKITSSNTSGRWAITKT 44
TFA*CQKWMFLKSSRFRTSTGTTG*LVLKITSSNTSGRWAITKT
Sbjct: 234 TFA*CQKWMFLKSSRFRTSTGTTG*LVLKITSSNTSGRWAITKT 365
Score = 58.9 bits (141), Expect(2) = 4e-29
Identities = 31/37 (83%), Positives = 33/37 (88%)
Frame = +1
Query: 41 ITKTMIPL*STVFRIV*WRMLQSGILV*ARMTSIPLM 77
+ + MIPL*STVFRIV*WRM QSGILV* RMTSIPLM
Sbjct: 355 LQRQMIPL*STVFRIV*WRMPQSGILV*VRMTSIPLM 465
>AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis thaliana
chromosome II BAC F26H6; putative retroelement pol
polyprotein, partial (1%)
Length = 636
Score = 90.9 bits (224), Expect = 5e-18
Identities = 67/133 (50%), Positives = 75/133 (56%)
Frame = -3
Query: 1477 RLVVL*LGLPNACVIIW*IIQLG*YPEWIRSSISLRKLLLLERLHAGRCFCLNMILCSKL 1536
R VVL* GLP+A VII I LG Y +WI+SS LR L E LH GRC+ NMI +
Sbjct: 634 RHVVL*PGLPSASVII*LITLLGWYLKWIQSSTYLRSPL*QEGLHGGRCYYRNMISSTVP 455
Query: 1537 KRQSKVAFLPIILLTNLLMIINQLSSTSPMKRSCI*NPKTATNR*LMKVQIPIASGV*SL 1596
+RQ K FL I LTN L I+ S T MKR CI* K T+ KV I I GV* L
Sbjct: 454 RRQLKAVFLLITWLTNHLKTIDLSSLTFLMKRLCI*R*KIVTSHYSEKVLIQIQCGV*YL 275
Query: 1597 MVLSMLMVKELGQ 1609
M LSM GQ
Sbjct: 274 MGLSMYTATA*GQ 236
>TC86651 similar to PIR|F84555|F84555 similar to prolyl 4-hydroxylase alpha
subunit [imported] - Arabidopsis thaliana, partial (85%)
Length = 1329
Score = 30.8 bits (68), Expect = 6.0
Identities = 15/37 (40%), Positives = 23/37 (61%)
Frame = -3
Query: 1519 RLHAGRCFCLNMILCSKLKRQSKVAFLPIILLTNLLM 1555
++H RC+ L ILC K + KV + +ILL+N +M
Sbjct: 727 KVHQYRCYALTPILCVKFIK--KVVIVRLILLSNFIM 623
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.354 0.154 0.541
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,892,804
Number of Sequences: 36976
Number of extensions: 1173893
Number of successful extensions: 14326
Number of sequences better than 10.0: 20
Number of HSP's better than 10.0 without gapping: 4433
Number of HSP's successfully gapped in prelim test: 640
Number of HSP's that attempted gapping in prelim test: 9416
Number of HSP's gapped (non-prelim): 6181
length of query: 2172
length of database: 9,014,727
effective HSP length: 111
effective length of query: 2061
effective length of database: 4,910,391
effective search space: 10120315851
effective search space used: 10120315851
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 37 (21.6 bits)
S2: 66 (30.0 bits)
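Note on the statistics above: the reported bit scores and E-values can be approximately reproduced from the gapped Lambda and K and the effective search space, using the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'). The sketch below is only an illustration, not part of the BLAST output. The function and constant names are hypothetical, and it assumes the gapped parameters are the ones applied to these alignments. Because Lambda and K are printed rounded, the results agree with the report only approximately.

```python
import math

# Values copied from the parameter block of this report (gapped statistics).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_QUERY_LENGTH = 2061
EFFECTIVE_DB_LENGTH = 4_910_391

# The report's "effective search space" is the product of the two lengths.
EFFECTIVE_SEARCH_SPACE = EFFECTIVE_QUERY_LENGTH * EFFECTIVE_DB_LENGTH


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE):
    """Approximate E-value for a single HSP with the given raw score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores are the parenthesised numbers on the "Score =" lines.
    for name, raw in [("CB065285", 773), ("AW684517", 658), ("TC86651", 68)]:
        print(f"{name}: {bit_score(raw):.1f} bits, E ~ {expect_value(raw):.1e}")
```

For CB065285 (raw score 773) this gives roughly 302 bits and an E-value close to 1e-81, in line with the report. For the weakest hit the computed E-value differs slightly from the printed 6.0, which is expected given the rounded parameters. Hits reported with Expect(2), such as AL375831, use sum statistics over several HSPs and are not covered by this single-HSP formula.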