
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC148359.6 + phase: 0 /pseudo
(2066 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
                                                                      Score  E
Sequences producing significant alignments:                          (bits)  Value
CB065285 weakly similar to PIR|A84500|A84 probable retroelement ...     321  2e-87
AL367832 weakly similar to GP|14091845|gb Putative retroelement ...     191  3e-48
AL366605                                                                152  1e-36
CB065382 similar to PIR|T09671|T096 RPE15 protein - alfalfa (fra...     149  1e-35
AW684517 similar to GP|9927273|dbj Similar to Arabidopsis thalia...     116  1e-25
AL375832                                                                109  1e-23
AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis...     104  4e-22
BE943337                                                                 81  5e-15
AL375831                                                                 80  1e-14
BE942785                                                                 66  2e-10
TC77844 similar to PIR|T49049|T49049 hypothetical protein T5P19....      34  0.67
BE203360                                                                 34  0.67
BF648401 weakly similar to PIR|D96781|D967 cytochrome P450 prob...       33  0.88
BG587062 similar to GP|19881001|gb chemiosmotic efflux system B ...      26  1.8
AL387996                                                                 32  2.0
TC83069                                                                  32  3.3
BQ150350                                                                 32  3.3
TC82083                                                                  31  4.4
TC85624 homologue to SP|P26969|GCSP_PEA Glycine dehydrogenase [d...      31  4.4
TC80585 weakly similar to GP|9759164|dbj|BAB09720.1 receptor kin...      31  5.7
>CB065285 weakly similar to PIR|A84500|A84 probable retroelement gag/pol
polyprotein [imported] - Arabidopsis thaliana, partial
(2%)
Length = 592
Score = 321 bits (822), Expect = 2e-87
Identities = 162/196 (82%), Positives = 169/196 (85%)
Frame = -1
Query: 1571 EIQRLRRTVD**RSRSQ*QVGFSL*WCCQCLW*GNWGSHCIPAGASHSFYRQNLVRMYQQ 1630
+I+RLRRTVD *RSRSQ*Q+GFSL*WCCQCLW* NW SHCIPAGA H FYR + VRMY+Q
Sbjct: 592 KIKRLRRTVDW*RSRSQ*QMGFSL*WCCQCLW*RNWSSHCIPAGALHPFYRPDSVRMYKQ 413
Query: 1631 YG*V*SMYLWDRGSN*HENQTPRHLWRFRARHQSDKG*MGDPPC*FDSLS*LCETFADIF 1690
YG*V*SMYLWDRGSN*HENQTPRHLWRF HQ D+G*MGDPPC DSL LCE FAD+F
Sbjct: 412 YG*V*SMYLWDRGSN*HENQTPRHLWRFCTCHQPDQG*MGDPPCQSDSLPRLCEAFADLF 233
Query: 1691 YKG*AAPYSS**EPNG*CSCYSILHVSSKPLE*CANN*SATP*KTFACVCYWGCDRSGW* 1750
YK *AAPYSS* EPNG*CS YSILHVSSKPLE*CANN SATP*KTF CVCYW CDRS *
Sbjct: 232 YKS*AAPYSS*REPNG*CSGYSILHVSSKPLE*CANNQSATP*KTFTCVCYWECDRSDR* 53
Query: 1751 KCGWL*TLVL*HQAVL 1766
KCG L*TLVL HQ VL
Sbjct: 52 KCG*L*TLVLRHQTVL 5
>AL367832 weakly similar to GP|14091845|gb Putative retroelement {Oryza
sativa}, partial (2%)
Length = 384
Score = 191 bits (484), Expect = 3e-48
Identities = 98/127 (77%), Positives = 105/127 (82%)
Frame = -3
Query: 1054 RKESHSASPGRD*AYQHRYRGKQARNQDWCYSRGRG*TEDHPTPPGISGYFCMVL*RYAR 1113
RKESHSASPGRD*AYQ RYRG+QAR+Q C SRGRG* ED P PP I GYFCM++*RYAR
Sbjct: 382 RKESHSASPGRD*AYQPRYRGEQARDQGRCCSRGRG*KEDLPAPPRIPGYFCMLV*RYAR 203
Query: 1114 SRPYDRGASDPYQA*MSSRQAEIEKDSS*YGSQNQE*SSKAD*CGFPHDS*VSRMGCQYC 1173
SR + G SDP +A*MSSRQ EIEKDSS YGSQ+ E* SKAD*CGF HDS*VS MGCQYC
Sbjct: 202 SRS*NCGTSDPNKA*MSSRQVEIEKDSSRYGSQD*E*GSKAD*CGFSHDS*VS*MGCQYC 23
Query: 1174 ACTKEGW 1180
C KEGW
Sbjct: 22 TCAKEGW 2
>AL366605
Length = 422
Score = 152 bits (384), Expect = 1e-36
Identities = 84/119 (70%), Positives = 95/119 (79%)
Frame = -1
Query: 5 CQKWMFLKSSKFRISTGTTG*LALKTTSSNTSERWAITKTMIPL*STVFRIV*WRMPLSG 64
CQ+WM L++SK +ISTGTTG*L KTTSSNTSERWAITKTMIPL*ST RIV*W+M SG
Sbjct: 359 CQRWMSLRNSKSQISTGTTG*LVPKTTSSNTSERWAITKTMIPL*STASRIV*WKMLQSG 180
Query: 65 ILV*ARMTSIPLMS*PPLSRAIMGLTLD*GQIGSFSGPYLRRRRKVSVNMRKGGEGRLP 123
ILV*A+M SIP M+*P LSRA MGLT D* + G+ S LRRRRK SVN + G G+LP
Sbjct: 179 ILV*AKMISIPSMN*PLLSRATMGLTPD*SRTGNSSDLSLRRRRKASVNTHRDGGGQLP 3
>CB065382 similar to PIR|T09671|T096 RPE15 protein - alfalfa (fragment),
partial (9%)
Length = 624
Score = 149 bits (375), Expect = 1e-35
Identities = 97/172 (56%), Positives = 110/172 (63%)
Frame = -3
Query: 271 RFNNNNKISKLGLLSLRYRCYMLSCFQLYSREGIVQLDRVSPHLIRCPQGSGLISNATFI 330
RF NN+ ++L L L Y+CYMLS +LY EG Q DR S HLIR GSGL SN I
Sbjct: 517 RFTNNDSSNRLDLPFL*YQCYMLSSSRLYFSEGTAQSDRGSLHLIRYLPGSGLTSNVISI 338
Query: 331 KVP*VMMSRGVML*SIS*RSSSIKGS*LLRIMSHTSSTILFQIMPL*I*SKCVKKLRDLM 390
KV VMMS+ ML*+ *RSS K S*LLR S SST LFQI L* * + ++KL DL
Sbjct: 337 KVLWVMMSKDAML*NTL*RSSLTKES*LLRTTSLMSSTTLFQITLL*T*LRYMRKLPDLT 158
Query: 391 SATSQLLWYLYTSSCAKLLCSIMIMPSVWDVSVILWVVILFKKTSKA**MII 442
S QLLW YTSS AK LC MIMP + S+ILWV LFK TS+A** I
Sbjct: 157 SVMLQLLWCHYTSSYAKPLCLTMIMPIAKNDSIILWVAALFKMTSRA**TTI 2
>AW684517 similar to GP|9927273|dbj Similar to Arabidopsis thaliana
chromosome II BAC F26H6; putative retroelement pol
polyprotein, partial (1%)
Length = 488
Score = 116 bits (290), Expect = 1e-25
Identities = 82/181 (45%), Positives = 98/181 (53%)
Frame = +2
Query: 741 WTLMHPTVACWEDRGYTMQVRLLPPYIRS*NL*KMENSLQSMVKKRT*SASCLLFLVLKQ 800
W M WED GY R PY RS*NL*+M+ +L++
Sbjct: 17 WISMLRIAVSWEDLGYMTLAR*PLPYTRS*NL*RMD--------------------LLRE 136
Query: 801 GLLKGLLSRV*PLKVQSLRKLGLLWLH*RMLGEPSKKVRLPAGAR*YSSVRTSVKKVSGS 860
L KG L KVQS R+L L WL *R ++ RLP GA * +SVR S +KVS S
Sbjct: 137 LLFKGCL-----WKVQSPRRLELQWLL*RTPRRLFRRDRLPTGAS*SNSVRISERKVSDS 301
Query: 861 PRHQGFPRESSTVLGLLMLSLKKPLDLVSGLYLLHQEALRQIGMPLTFLQSCMFLSNVSC 920
P+H+ PRE V GL + LK+ LDL G L +QEALR+IGMPL F QSCM NVSC
Sbjct: 302 PQHRVSPRELFIVPGLSIHLLKRWLDLFLGPCL*YQEALRKIGMPLMFPQSCMCPXNVSC 481
Query: 921 C 921
C
Sbjct: 482 C 484
>AL375832
Length = 535
Score = 109 bits (273), Expect = 1e-23
Identities = 62/85 (72%), Positives = 65/85 (75%), Gaps = 6/85 (7%)
Frame = +3
Query: 953 LINKILSFLPRLLFFSAFFPWKQW*CQKKPKRLYFLINFKNLHEK------KKLFLKSSN 1006
L+NK F FFSAFF ++W*CQKKPKRLYFLINFKNLHEK KKLFLKSSN
Sbjct: 237 LMNKNCRFFHDCFFFSAFF-LEKW*CQKKPKRLYFLINFKNLHEKKKIQKTKKLFLKSSN 413
Query: 1007 HYMQVEP**TR*TQ*SYGSSKFRFP 1031
HYMQVEP**TR*TQ* Y SSK FP
Sbjct: 414 HYMQVEP**TR*TQ*FYASSKL*FP 488
Score = 100 bits (249), Expect = 6e-21
Identities = 68/136 (50%), Positives = 77/136 (56%), Gaps = 2/136 (1%)
Frame = +1
Query: 875 GLLMLSLKKPLDLVSGLYLLHQEALRQIGMPLTFLQSCMFLSNVSCCFNALFSK*QSSRS 934
GLLMLSLKK LDLVSGL+LLHQEALR IG PLTFLQSCMF SN SCCF+ALFSK
Sbjct: 1 GLLMLSLKKQLDLVSGLHLLHQEALRVIGTPLTFLQSCMFPSNASCCFSALFSKITILSL 180
Query: 935 AQSESDIFCKG--MFVSASPLINKILSFLPRLLFFSAFFPWKQW*CQKKPKRLYFLINFK 992
F G F A I ++SF FF F WK +K FL+ K
Sbjct: 181 CPKRE*YFL*GRVCFGIAV**IKIVVSF--TTAFFLVPFSWKNGNARKNLNVCIFLLISK 354
Query: 993 NLHEKKKLFLKSSNHY 1008
+KKK + K N +
Sbjct: 355 TCMKKKK-YKKQKNSF 399
>AJ497987 weakly similar to GP|9927273|dbj Similar to Arabidopsis thaliana
chromosome II BAC F26H6; putative retroelement pol
polyprotein, partial (1%)
Length = 636
Score = 104 bits (259), Expect = 4e-22
Identities = 70/164 (42%), Positives = 100/164 (60%)
Frame = -1
Query: 1475 EDLLCFSLGCQTPASLFGESYNLVDIQNGSDKVYL*ESCCHWKDCTLADAFVRI*YCVQN 1534
ED+LC SLGCQ P SL+ S+ L I NGS++V++*E+ KDCT+ADA + I*Y V
Sbjct: 636 EDMLCSSLGCQAPPSLYD*SHYLAGI*NGSNQVHI*EARFDRKDCTVADATIGI*YRVPF 457
Query: 1535 SKGNQR*HSCRSPCLPTS**LPTN*IRFPR*RDHVFEIQRLRRTVD**RSRSQ*QVGFSL 1594
+GN R +SC S PT+* L T + *RD+V + +RL R RS S+ VGF +
Sbjct: 456 PEGN*RQYSC*SLGSPTT*RLSTYQV*LS**RDYVSKDERL*RATIRRRS*SRFSVGFDI 277
Query: 1595 *WCCQCLW*GNWGSHCIPAGASHSFYRQNLVRMYQQYG*V*SMY 1638
*W CQC+ + GS SH F+ + V +++Q+ *+ S++
Sbjct: 276 *WGCQCIRQRHRGSTPYS*RCSHPFHSKTPV*LHEQHR*IRSLH 145
>BE943337
Length = 496
Score = 80.9 bits (198), Expect = 5e-15
Identities = 43/76 (56%), Positives = 50/76 (65%)
Frame = -3
Query: 1946 WSFHIKFWHGEVTLKYLI*TIGSF*SVLSVLFLFHFTFGQILFFIQILFFLHFGPSIEVP 2005
WSFH KFWH EVTLKYLI* IGSF*SVLS+LF+F FTFGQI FI I + +
Sbjct: 233 WSFHTKFWHREVTLKYLI*AIGSF*SVLSILFVFRFTFGQITNFIFIFIAFIIFAFLVLE 54
Query: 2006 KLHFCPNFFFYFISSF 2021
++ C + F F SF
Sbjct: 53 EVQNCISVQFLFQFSF 6
>AL375831
Length = 467
Score = 79.7 bits (195), Expect = 1e-14
Identities = 39/44 (88%), Positives = 40/44 (90%)
Frame = +3
Query: 1 TFA*CQKWMFLKSSKFRISTGTTG*LALKTTSSNTSERWAITKT 44
TFA*CQKWMFLKSS+FR STGTTG*L LK TSSNTS RWAITKT
Sbjct: 234 TFA*CQKWMFLKSSRFRTSTGTTG*LVLKITSSNTSGRWAITKT 365
Score = 60.5 bits (145), Expect = 7e-09
Identities = 34/52 (65%), Positives = 38/52 (72%)
Frame = +1
Query: 26 LALKTTSSNTSERWAITKTMIPL*STVFRIV*WRMPLSGILV*ARMTSIPLM 77
L+ K+ E + + MIPL*STVFRIV*WRMP SGILV* RMTSIPLM
Sbjct: 310 LSSKSHHQIRPEDGQLQRQMIPL*STVFRIV*WRMPQSGILV*VRMTSIPLM 465
>BE942785
Length = 460
Score = 65.9 bits (159), Expect = 2e-10
Identities = 43/77 (55%), Positives = 52/77 (66%)
Frame = -3
Query: 779 LQSMVKKRT*SASCLLFLVLKQGLLKGLLSRV*PLKVQSLRKLGLLWLH*RMLGEPSKKV 838
L M +K T* AS L L KQGL +G L + * KV+S R + L WL *RM EPSK+V
Sbjct: 365 LPFMERKHT*LASYHLSLA*KQGLQRGPLFKD*LSKVRSPRGMELQWLL*RMHREPSKRV 186
Query: 839 RLPAGAR*YSSVRTSVK 855
+LPAGA *YS VRTS++
Sbjct: 185 KLPAGAG*YSFVRTSIR 135
>TC77844 similar to PIR|T49049|T49049 hypothetical protein T5P19.130 -
Arabidopsis thaliana, partial (70%)
Length = 1969
Score = 33.9 bits (76), Expect = 0.67
Identities = 24/66 (36%), Positives = 32/66 (48%), Gaps = 9/66 (13%)
Frame = +2
Query: 1981 FTFGQILFFIQILFFLHFGPSIEVP-------KLHFCPNFFFY--FISSFFNFAV*SSSS 2031
F+F FF IL FL F P +V +H +FF+Y F+ SFFNF S
Sbjct: 2 FSFTIFNFFYFILIFLFF*PFEKVTL*ILLLHHIHHLSSFFYYVLFLLSFFNFNPVFFSL 181
Query: 2032 YFCRKV 2037
FC ++
Sbjct: 182 SFCTQI 199
>BE203360
Length = 442
Score = 33.9 bits (76), Expect = 0.67
Identities = 24/59 (40%), Positives = 30/59 (50%), Gaps = 1/59 (1%)
Frame = +1
Query: 1966 IGSF*SVLSVLFLFHFTFGQILFFIQILFFLHFGPSIEVPK-LHFCPNFFFYFISSFFN 2023
I F S+ SV F+ L FI I FF P ++P L F P+F FYF S+ FN
Sbjct: 142 IRQFNSIQSVKLNLLFSS*PPLVFICICFFQLPNPDPKLPTFLSFTPDFVFYFTSTSFN 318
>BF648401 weakly similar to PIR|D96781|D967 cytochrome P450 probable
64213-66051 [imported] - Arabidopsis thaliana, partial
(10%)
Length = 660
Score = 33.5 bits (75), Expect = 0.88
Identities = 22/64 (34%), Positives = 32/64 (49%)
Frame = -2
Query: 908 FLQSCMFLSNVSCCFNALFSK*QSSRSAQSESDIFCKGMFVSASPLINKILSFLPRLLFF 967
FLQ + SN S C + L + * A S++FC +S + LI+ + F +LF
Sbjct: 203 FLQLSIISSNPSLCVSLLINL*GDINKASKFSELFCVLWLLSINTLISSRVFF--SILFL 30
Query: 968 SAFF 971
S FF
Sbjct: 29 SLFF 18
>BG587062 similar to GP|19881001|gb chemiosmotic efflux system B protein A
{Legionella pneumophila}, partial (1%)
Length = 780
Score = 26.2 bits (56), Expect(2) = 1.8
Identities = 11/24 (45%), Positives = 17/24 (70%)
Frame = -3
Query: 1971 SVLSVLFLFHFTFGQILFFIQILF 1994
+++ +LFLF F F + FF+ ILF
Sbjct: 337 ALIMLLFLFLFLFFFLFFFVAILF 266
Score = 24.6 bits (52), Expect(2) = 1.8
Identities = 12/26 (46%), Positives = 16/26 (61%)
Frame = -1
Query: 2015 FYFISSFFNFAV*SSSSYFCRKVQSS 2040
F+ SSFF+ SSSS +C +Q S
Sbjct: 165 FH*FSSFFSIYTSSSSSSYCYHLQDS 88
>AL387996
Length = 378
Score = 32.3 bits (72), Expect = 2.0
Identities = 18/49 (36%), Positives = 26/49 (52%)
Frame = +3
Query: 1974 SVLFLFHFTFGQILFFIQILFFLHFGPSIEVPKLHFCPNFFFYFISSFF 2022
S+LF HF I+ILFF+ I +P+L P F ++I +FF
Sbjct: 168 SILFKSHFPNFCSFCLIKILFFI----KISIPRLFLLPYEFLHYIHNFF 302
>TC83069
Length = 894
Score = 31.6 bits (70), Expect = 3.3
Identities = 19/66 (28%), Positives = 35/66 (52%), Gaps = 4/66 (6%)
Frame = -3
Query: 1971 SVLSVLFLFHFTFGQILFF----IQILFFLHFGPSIEVPKLHFCPNFFFYFISSFFNFAV 2026
+V++++ FHF+F + F I I+F +F S+ FFF+ ++SF++F +
Sbjct: 757 AVINIIISFHFSFQFVCSFADVKIIIIFCFYFLLSL---------GFFFFNLNSFYSFFI 605
Query: 2027 *SSSSY 2032
S Y
Sbjct: 604 CRSEDY 587
>BQ150350
Length = 1064
Score = 31.6 bits (70), Expect = 3.3
Identities = 15/31 (48%), Positives = 20/31 (64%)
Frame = -2
Query: 219 LLSMQLKCLRHTHICRILSIRSSHRFIISTL 249
LL++QL LRH I + I + H +IISTL
Sbjct: 946 LLNLQLHLLRHHFIATVFRITNIHSYIISTL 854
>TC82083
Length = 946
Score = 31.2 bits (69), Expect = 4.4
Identities = 19/55 (34%), Positives = 31/55 (55%), Gaps = 1/55 (1%)
Frame = -2
Query: 884 PLDLV-SGLYLLHQEALRQIGMPLTFLQSCMFLSNVSCCFNALFSK*QSSRSAQS 937
P+ +V S L+L+ + R G ++ + LS++SCCF +FS SSR+ S
Sbjct: 921 PIGVVPSDLFLM*YSSRRVGGCSISLTNTFSLLSSISCCFG*IFSVPFSSRNVLS 757
>TC85624 homologue to SP|P26969|GCSP_PEA Glycine dehydrogenase
[decarboxylating] mitochondrial precursor (EC 1.4.4.2),
partial (98%)
Length = 3772
Score = 31.2 bits (69), Expect = 4.4
Identities = 21/78 (26%), Positives = 35/78 (43%)
Frame = +3
Query: 893 LLHQEALRQIGMPLTFLQSCMFLSNVSCCFNALFSK*QSSRSAQSESDIFCKGMFVSASP 952
LLH L+ G+P+ L CM + S F+ ++ Q S I + +S P
Sbjct: 3186 LLHGSVLQSFGLPMDVLIMCMVTAT*SAPFSQHHRLLKNQLLPQHNSVIPLLPVLISRIP 3365
Query: 953 LINKILSFLPRLLFFSAF 970
+++KI F+ + L F
Sbjct: 3366 VVHKIFLFIIQSLQLLTF 3419
>TC80585 weakly similar to GP|9759164|dbj|BAB09720.1 receptor kinase-like
protein {Arabidopsis thaliana}, partial (24%)
Length = 900
Score = 30.8 bits (68), Expect = 5.7
Identities = 20/64 (31%), Positives = 28/64 (43%)
Frame = +1
Query: 1973 LSVLFLFHFTFGQILFFIQILFFLHFGPSIEVPKLHFCPNFFFYFISSFFNFAV*SSSSY 2032
L LFLFH+ +FF++ L F +V K H F + S NF * ++
Sbjct: 118 LLFLFLFHYYIISSVFFLEPLCLA*FVSVFKVEKSHQILGFSVFAAKSSLNFVA*IEAAE 297
Query: 2033 FCRK 2036
RK
Sbjct: 298 LTRK 309
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.364 0.161 0.615
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 70,622,328
Number of Sequences: 36976
Number of extensions: 1112111
Number of successful extensions: 18591
Number of sequences better than 10.0: 44
Number of HSP's better than 10.0 without gapping: 4314
Number of HSP's successfully gapped in prelim test: 793
Number of HSP's that attempted gapping in prelim test: 13533
Number of HSP's gapped (non-prelim): 6725
length of query: 2066
length of database: 9,014,727
effective HSP length: 111
effective length of query: 1955
effective length of database: 4,910,391
effective search space: 9599814405
effective search space used: 9599814405
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 36 (21.5 bits)
S2: 66 (30.0 bits)
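
Note on the statistics above: the bit scores and E-values in this report can be approximately reproduced from the footer values. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, and E = search space * 2^(-bit score)) and uses the gapped Lambda and K printed above. The function names are hypothetical, and because Lambda and K are rounded in the report, small differences from the printed values are expected. The "length of database" printed above (9,014,727) equals the total letters (27,044,181) divided by 3, which is consistent with a translated (TBLASTN) search.

```python
import math

# Values copied from the report footer (gapped Karlin-Altschul parameters).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 9599814405  # "effective search space" line


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S into a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE,
                 lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Product of the effective query and effective database lengths.

    Each sequence is shortened by the effective HSP length, as the
    'effective length of ...' footer lines suggest.
    """
    return (query_len - hsp_len) * (db_len - n_seqs * hsp_len)


if __name__ == "__main__":
    # First hit (CB065285): raw score 822.
    print(round(bit_score(822)), expect_value(822))
    # Footer values: query 2066, database 9,014,727, 36,976 sequences, HSP length 111.
    print(effective_search_space(2066, 9014727, 36976, 111))
```

With the footer values, effective_search_space gives the same number as the "effective search space" line, and the raw scores 822 and 484 map to bit scores that round to the 321 and 191 reported for the first two hits.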