
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC147537.1 + phase: 0 /pseudo
(1265 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsi... 42 0.001
TC81230 39 0.010
TC85718 homologue to SP|Q01899|HS7M_PHAVU Heat shock 70 kDa prot... 37 0.063
CB893805 similar to GP|10177935|d copia-type polyprotein {Arabid... 36 0.11
AJ501138 similar to GP|14021481|dbj unknown protein {Mesorhizobi... 34 0.31
AW690594 similar to GP|23498163|emb hypothetical protein {Plasmo... 34 0.31
TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-inf... 34 0.31
AL387019 similar to GP|13277927|gb| acidic ribosomal phosphoprot... 34 0.41
BG452212 33 0.54
AJ388885 weakly similar to GP|20268789|gb|A unknown protein {Ara... 33 0.70
TC82195 homologue to GP|19310475|gb|AAL84972.1 AT3g26744/MLJ15_1... 33 0.91
CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect... 32 1.2
TC91426 weakly similar to PIR|T47647|T47647 hypothetical protein... 32 2.0
TC86022 homologue to GP|19423939|gb|AAL87312.1 putative RNA heli... 32 2.0
BE324536 similar to GP|7299263|gb| CG9461 gene product {Drosophi... 31 2.7
BF643460 similar to PIR|T09640|T09 protein phosphatase 2C - alfa... 31 2.7
BQ135562 31 2.7
TC90274 similar to GP|10177219|dbj|BAB10294. contains similarity... 31 2.7
TC89981 similar to GP|9957218|gb|AAG09270.1| sucrose transporter... 31 2.7
TC80915 similar to PIR|S26025|S26025 NADH dehydrogenase (ubiquin... 31 2.7
>AW689768 weakly similar to GP|10177485|d polyprotein {Arabidopsis thaliana},
partial (9%)
Length = 675
Score = 42.4 bits (98), Expect = 0.001
Identities = 44/127 (34%), Positives = 61/127 (47%)
Frame = +2
Query: 782 LTPFGSKLWLKKFKL*KRHILGT*LIFLQTRFLLDANGSTKLKLVLMEV*SVIKLAY*PK 841
L P G KL KL* G +FL TR LL A+G +LK M++ + +K +
Sbjct: 86 LIPGGCKL*KLSTKL*LITKHGILFLFLHTRKLLAASGFIELKKTQMDLLTNLKPD*WLR 265
Query: 842 GTLKSMG*TTKRHLLR*LE*HLFVVSWPLLQFVNGSFFRWMSKMPSFTVI*QRKYICTLL 901
++K + T + L * L PLL +NG F + +S MP V +RK IC L
Sbjct: 266 VSVKHLDVITLKLSLL**NLSLSGSFSPLLSLINGKFNKLISTMPF*MVFFKRKCICLNL 445
Query: 902 *VITLLI 908
+ LLI
Sbjct: 446 RDLKLLI 466
>TC81230
Length = 958
Score = 39.3 bits (90), Expect = 0.010
Identities = 50/157 (31%), Positives = 66/157 (41%), Gaps = 13/157 (8%)
Frame = +2
Query: 4 MNLLQNLLIALRIGIAKVTRSSLGCATHLYRLFIFSLITMILQKKFGISWKTAIRPLDLL 63
M L LLI R GIA+ + G AT + +FI +L + QK++GI L L
Sbjct: 350 MILQMLLLIN*RNGIARTIKLLPGFATPPFLVFICNLGVLRTQKRYGII*NKGTLFLIYL 529
Query: 64 TIISCGPHFSISSKNLVNQ*MIFLLKSNQFGIK----------YLWPKSMKTI---FISF 110
I+C I + NLVN M + +GI L K M+ I F+ F
Sbjct: 530 ININC*RILVI*NNNLVNLFMSS*HRWKLYGIS*HHVSLH*KMQLI*KHMRLIAIVFVLF 709
Query: 111 KFSWLFDLNMKP*EPHCCIEIRFHRWILLFKKSFLKK 147
F WL +NM E I +LF S LKK
Sbjct: 710 NF*WLLRMNMNQSEHLRYTRILCLHLRMLFLVSSLKK 820
>TC85718 homologue to SP|Q01899|HS7M_PHAVU Heat shock 70 kDa protein
mitochondrial precursor. [Kidney bean French bean]
{Phaseolus vulgaris}, complete
Length = 2371
Score = 36.6 bits (83), Expect = 0.063
Identities = 44/164 (26%), Positives = 71/164 (42%), Gaps = 39/164 (23%)
Frame = +3
Query: 155 LLKWMLLLQLLDSHTRNRIIIHVRIA----------IVLVMYLLIVLL*RVGTVTVLAIL 204
L+K ++L++ L +R++ +V ++ + V+ L +V L T T+L +L
Sbjct: 1311 LVKGLILMRQLQWEQHSRVVSYVEMSKSSYYWMLHHSLWVLRLWVVSLQG*STATLLFLL 1490
Query: 205 LKIVPRDLQNKIVA---------------LPNPRMFQSLDLLLSLLLP---LRVRLPLL* 246
++ R Q ++ LP + ++L L LLLP LR+R PL
Sbjct: 1491 KRV--RSFQRQLTIKHK*VSKCYKVSGKWLPTTKCLENLS*LAFLLLPEVCLRLRSPLT- 1661
Query: 247 VTLKL*SNRLYLPTPLLPCL-----------LLTVILVGFLILR 279
+PT LLPCL LL LVGFL++R
Sbjct: 1662 ----------LMPTVLLPCLPRTSPLVKSSKLLLSRLVGFLMMR 1763
>CB893805 similar to GP|10177935|d copia-type polyprotein {Arabidopsis
thaliana}, partial (14%)
Length = 778
Score = 35.8 bits (81), Expect = 0.11
Identities = 25/58 (43%), Positives = 33/58 (56%)
Frame = +1
Query: 798 KRHILGT*LIFLQTRFLLDANGSTKLKLVLMEV*SVIKLAY*PKGTLKSMG*TTKRHL 855
K ILG+* + Q + LD+NGS K + ME IKL P+GTL +M TT + L
Sbjct: 115 KEIILGS*QTYDQVQKPLDSNGSLKQS*MRMERLRSIKLDLWPRGTLNNMVLTTPKFL 288
>AJ501138 similar to GP|14021481|dbj unknown protein {Mesorhizobium loti},
partial (16%)
Length = 544
Score = 34.3 bits (77), Expect = 0.31
Identities = 29/93 (31%), Positives = 50/93 (53%), Gaps = 7/93 (7%)
Frame = -1
Query: 697 LLALNHF-ISLIPQLSCFQVLM---RITLMGSIFVHPHLLL---LNLFRMLILYLQLVIP 749
L LNHF + QL ++ L+ ++ L + ++ L+L L LFR L+LY QLV+
Sbjct: 511 LQLLNHFQVCFRRQLVLYRQLVLFRQLVLYRQLVLYRQLVLYRQLVLFRQLVLYRQLVLY 332
Query: 750 PE*VIHPLILMITIVILLLKFFMSLLLIEKRVL 782
+ V+ ++++ L+L F L+L+ R L
Sbjct: 331 RQLVLFRQLVLVLFRQLVLVLFRQLVLVLCRQL 233
>AW690594 similar to GP|23498163|emb hypothetical protein {Plasmodium
falciparum 3D7}, partial (10%)
Length = 633
Score = 34.3 bits (77), Expect = 0.31
Identities = 26/79 (32%), Positives = 38/79 (47%)
Frame = -1
Query: 676 SHVMLLFRKTPCSPLFLHFIILLALNHFISLIPQLSCFQVLMRITLMGSIFVHPHLLLLN 735
SH + L P PL L ++LL L H + L L+ + L+ + H HLLL
Sbjct: 252 SHPLPLLHHHPPPPLLL-LLLLLLLRHHVHL---------LLLLLLLLLLPHHVHLLLPR 103
Query: 736 LFRMLILYLQLVIPPE*VI 754
+L+L L L +PP V+
Sbjct: 102 HVHLLLLLLHLRLPPHYVL 46
>TC87237 similar to GP|160409|gb|AAA29651.1|| mature-parasite-infected
erythrocyte surface antigen {Plasmodium falciparum},
partial (2%)
Length = 2007
Score = 34.3 bits (77), Expect = 0.31
Identities = 44/165 (26%), Positives = 75/165 (44%), Gaps = 2/165 (1%)
Frame = -1
Query: 625 FLNLLDMLVLFS-CSLMSAQNYNRALACVAFLVMTWNTKDIVVGIQSPTDYASHVMLLFR 683
FLNL+ + L S CS+ L C F ++ + + I S A ++ +F
Sbjct: 534 FLNLIMLFFLSSFCSI--------GLLCPLFTILLVIFSHLFLRIASVL-VAFFILFIF- 385
Query: 684 KTPCSPLFLHFIILLALNHFISLIPQLSCFQVLMRITLMGSIFVHP-HLLLLNLFRMLIL 742
T C F ++ L L +F S LS F L I L+ F+ +LLL LF + +L
Sbjct: 384 -TICFTCFFILLLTLFLGNFSSFF--LSPFIFLQFILLLTLFFIFVLFILLLTLFFIFVL 214
Query: 743 YLQLVIPPE*VIHPLILMITIVILLLKFFMSLLLIEKRVLTPFGS 787
++ L++ + + I ++T+ + + F + + L LT F S
Sbjct: 213 FI-LLLNLFSIFNLFIFLLTLFFIFVLFIL*MTLF----LTTFSS 94
>AL387019 similar to GP|13277927|gb| acidic ribosomal phosphoprotein PO {Mus
musculus}, partial (16%)
Length = 433
Score = 33.9 bits (76), Expect = 0.41
Identities = 21/62 (33%), Positives = 35/62 (55%)
Frame = +3
Query: 729 PHLLLLNLFRMLILYLQLVIPPE*VIHPLILMITIVILLLKFFMSLLLIEKRVLTPFGSK 788
P+LLLLN+ +L L+L++ ++ PL L + + +LLL SL + K+ L K
Sbjct: 48 PYLLLLNILSLLPKRLKLILQ---ILVPLRLPLLLPLLLLMKLKSLHQLRKKNLMKKPXK 218
Query: 789 LW 790
+W
Sbjct: 219 IW 224
>BG452212
Length = 317
Score = 33.5 bits (75), Expect = 0.54
Identities = 23/63 (36%), Positives = 29/63 (45%), Gaps = 2/63 (3%)
Frame = +1
Query: 677 HVMLLFRKTPCSPL--FLHFIILLALNHFISLIPQLSCFQVLMRITLMGSIFVHPHLLLL 734
H LL +TP FLHFI L+ +HFIS FQVL+ + H L+ L
Sbjct: 118 HQNLLRIETPIFTCAHFLHFIFLIQFHHFISFFAHYFQFQVLLERLQFHLM*THSFLVKL 297
Query: 735 NLF 737
F
Sbjct: 298 TRF 306
>AJ388885 weakly similar to GP|20268789|gb|A unknown protein {Arabidopsis
thaliana}, partial (54%)
Length = 410
Score = 33.1 bits (74), Expect = 0.70
Identities = 19/54 (35%), Positives = 31/54 (57%)
Frame = +1
Query: 720 TLMGSIFVHPHLLLLNLFRMLILYLQLVIPPE*VIHPLILMITIVILLLKFFMS 773
T G +L+L+L + IL L LV+PP L+L++ + I+LL FF++
Sbjct: 106 TNSGGCMFRYSVLILSLLALSILVLPLVMPPLPPPPLLLLLVPVFIMLLLFFIA 267
>TC82195 homologue to GP|19310475|gb|AAL84972.1 AT3g26744/MLJ15_15
{Arabidopsis thaliana}, partial (17%)
Length = 1189
Score = 32.7 bits (73), Expect = 0.91
Identities = 31/103 (30%), Positives = 53/103 (51%), Gaps = 12/103 (11%)
Frame = -1
Query: 154 KLLKW--MLLLQLLDSHTRNRIIIHVRIAIVLVMYLL----IVLL*RVGTVT------VL 201
++L+W +LLL LL S + N I ++I +L+ ++ I+ + T T +L
Sbjct: 961 QILRWKPLLLLSLLISTSNNSPI*IIQIIPILIRIIIQLRKIIHIITTTTTTSFFFPALL 782
Query: 202 AILLKIVPRDLQNKIVALPNPRMFQSLDLLLSLLLPLRVRLPL 244
+ L +VP LQ + + P + Q LLL LL+ + +RL L
Sbjct: 781 HLPLLLVPHSLQPRNLQRPTTLLKQRRWLLLLLLMMMLLRLRL 653
>CA859251 similar to GP|21305823|gb DNA polymerase I {Hz-1 insect virus}
[Heliothis zea virus 1], partial (2%)
Length = 803
Score = 32.3 bits (72), Expect = 1.2
Identities = 27/121 (22%), Positives = 68/121 (55%), Gaps = 2/121 (1%)
Frame = +3
Query: 165 LDSHTRNRIIIHVRIAIVLVMYLLIVLL*RVGTVTVLAILLKIVPRDLQNKIVALPNP-- 222
L+ + + +II V +L + ++ +++ + + + +++ ++ ++Q ++ L N
Sbjct: 279 LNVNLQTHVIIQVHHLYLLYLKIINIIIIYLIKIIQVIVIIILINHNIQ*FMLHLTNHHY 458
Query: 223 RMFQSLDLLLSLLLPLRVRLPLL*VTLKL*SNRLYLPTPLLPCLLLTVILVGFLILRVVI 282
RM + LLL+ L+ + + + +T+ +* ++ + P+L L+L +IL+ LIL ++I
Sbjct: 459 RMLKQHHLLLTELVKVVIEVKTPPLTMGV*I-KVIIIQPILLLLILILILILILILLLLI 635
Query: 283 I 283
I
Sbjct: 636 I 638
>TC91426 weakly similar to PIR|T47647|T47647 hypothetical protein T15C9.90 -
Arabidopsis thaliana, partial (19%)
Length = 701
Score = 31.6 bits (70), Expect = 2.0
Identities = 12/28 (42%), Positives = 20/28 (70%)
Frame = +3
Query: 1079 HLISLQFFVLSAMLKVLFFMVYASLLIH 1106
H++ FF+ S+ML+VL F++YA +H
Sbjct: 492 HIVVSHFFMRSSMLQVLQFLIYARCFLH 575
>TC86022 homologue to GP|19423939|gb|AAL87312.1 putative RNA helicase
{Arabidopsis thaliana}, partial (89%)
Length = 1982
Score = 31.6 bits (70), Expect = 2.0
Identities = 19/47 (40%), Positives = 30/47 (63%)
Frame = -2
Query: 253 SNRLYLPTPLLPCLLLTVILVGFLILRVVII*RLILNHCLLRHLFLL 299
+ +L P PLL LLL ++L+ L++ +V++ LIL+H L L LL
Sbjct: 346 TTQLVPPQPLLLLLLLLILLLILLLILIVLL-MLILHHVPLHILLLL 209
Score = 29.6 bits (65), Expect = 7.7
Identities = 29/115 (25%), Positives = 51/115 (44%), Gaps = 14/115 (12%)
Frame = -2
Query: 677 HVMLLFRKTPCSPLFLHFIILLALNHFISLIPQLSCFQVLMRITLMG-------SIFVHP 729
+ L R P PL ++ L L I + LSC VL I + +
Sbjct: 589 NAFFLDRGRPFKPLLVYSH*QLTLQKVILKLISLSCCHVLCSIACISWWKL*SCLPIL*T 410
Query: 730 HLLLLNLFRMLILYLQL-------VIPPE*VIHPLILMITIVILLLKFFMSLLLI 777
++ L+ +L L + ++PP+ ++ L+L+I ++ILLL + L+LI
Sbjct: 409 RVICFRLYSLLNLLDNVGVGATTQLVPPQPLLLLLLLLILLLILLLILIVLLMLI 245
>BE324536 similar to GP|7299263|gb| CG9461 gene product {Drosophila
melanogaster}, partial (1%)
Length = 361
Score = 31.2 bits (69), Expect = 2.7
Identities = 32/123 (26%), Positives = 48/123 (39%)
Frame = +2
Query: 522 LRFYAQTMRWNIVILPCFSFSVNKARRFNAHVLTPPNKMVELNENIVTSLTQFVPFSSQP 581
L+ Y Q IVI P +F ++ F+ + TPP N +++ F SS P
Sbjct: 20 LKLYHQIQLILIVIHPNLNFGCSETHLFHNQIFTPPT-------NFSSTVLFFPSTSSPP 178
Query: 582 RVQKSFGGKQLLMLSIQLIYFPP*IFKTYLLLKNCLVSLLNIPFLNLLDMLVLFSCSLMS 641
V+ + K L + F L +L+IP L+LL +L L
Sbjct: 179 LVKPTHHPKHQLQTHHLITRF----------LNQTQNPILHIPNLHLL*QNLLLLLQLSQ 328
Query: 642 AQN 644
QN
Sbjct: 329 VQN 337
>BF643460 similar to PIR|T09640|T09 protein phosphatase 2C - alfalfa, partial
(33%)
Length = 565
Score = 31.2 bits (69), Expect = 2.7
Identities = 17/46 (36%), Positives = 28/46 (59%)
Frame = +3
Query: 509 LLLVWLKHNFLVLLRFYAQTMRWNIVILPCFSFSVNKARRFNAHVL 554
+LL W + +FL ++R + + W IVIL F VN+ R F A+++
Sbjct: 204 MLLRWREMDFLCIVRKEVENI-WRIVILLLLIFMVNQNRLFLAYLM 338
>BQ135562
Length = 870
Score = 31.2 bits (69), Expect = 2.7
Identities = 28/97 (28%), Positives = 44/97 (44%), Gaps = 12/97 (12%)
Frame = +2
Query: 671 PTDYASHVMLLFRKTPCS-------PLFLHFII---LLALNHFISLI--PQLSCFQVLMR 718
PT ++ H L F TP + FLH +I LL++ F S+I P + C
Sbjct: 302 PTYFS*HNFLFFYHTPSTFQPPPILSFFLHILIMFSLLSIAPFCSIIHNPFIIC------ 463
Query: 719 ITLMGSIFVHPHLLLLNLFRMLILYLQLVIPPE*VIH 755
+L I+ + HLL L F ++ QL + ++H
Sbjct: 464 -SLFPIIYYYSHLLFLFTFSHFFIFTQLSLLFVIIVH 571
>TC90274 similar to GP|10177219|dbj|BAB10294. contains similarity to AP2
domain transcription factor~gene_id:MPF21.9 {Arabidopsis
thaliana}, partial (32%)
Length = 750
Score = 31.2 bits (69), Expect = 2.7
Identities = 29/85 (34%), Positives = 43/85 (50%), Gaps = 3/85 (3%)
Frame = +1
Query: 228 LDLLLSLLLPLRVRLPLL*VT---LKL*SNRLYLPTPLLPCLLLTVILVGFLILRVVII* 284
L LLL LL L + L LL + + + S R++L + ++ +L+ LI +
Sbjct: 421 LQLLLILLFILDLFLTLLSIRQTFMIVFSFRVFLLQGMFMMIM*LGLLLWLLIYNLHHRR 600
Query: 285 RLILNHCLLRHLFLLCHQSTQLVAI 309
+L+L H LL HLFLLC Q I
Sbjct: 601 QLLLCHHLLLHLFLLCRAILQFPRI 675
>TC89981 similar to GP|9957218|gb|AAG09270.1| sucrose transporter
{Lycopersicon esculentum}, partial (39%)
Length = 717
Score = 31.2 bits (69), Expect = 2.7
Identities = 27/77 (35%), Positives = 39/77 (50%)
Frame = +1
Query: 230 LLLSLLLPLRVRLPLL*VTLKL*SNRLYLPTPLLPCLLLTVILVGFLILRVVII*RLILN 289
+LL +LL + V L L + + R+ L P+ PCL L V +G + V R +L+
Sbjct: 490 MLLIMLLKVLVELYSLILLAMMLEGRV-LQMPISPCLWLLVTFLGMQLDHTVAGTRFLLS 666
Query: 290 HCLLRHLFLLCHQSTQL 306
H LL L LL Q + L
Sbjct: 667 HLLL--LALLXVQXSSL 711
>TC80915 similar to PIR|S26025|S26025 NADH dehydrogenase (ubiquinone) (EC
1.6.5.3) chain 5 - pig roundworm mitochondrion, partial
(5%)
Length = 799
Score = 31.2 bits (69), Expect = 2.7
Identities = 35/132 (26%), Positives = 59/132 (44%), Gaps = 11/132 (8%)
Frame = +2
Query: 430 LSLLIVYIANLANSWLYPLIVLF-LFQIVLLI*SIQTFGGLLLFLQ*--------MAFVI 480
LS ++ ++ L W L+VLF LF++ L G + ++ + FV
Sbjct: 173 LS*IVFFVLCLLGYWGVFLLVLFDLFKVTLTATPFNVKIGCCVKIEPR*CRCRLWVVFVC 352
Query: 481 LYCLLTIFLVLHGYIF*KIALNYHKFIFLLLVWLKHNF--LVLLRFYAQTMRWNIVILPC 538
+ +T+FL F + + + +IFLLL H LV++ F T ++ L
Sbjct: 353 *FFYITLFLAFTTNFFVFLFVFFSSWIFLLLSISSHKLVNLVVVLFSLVTSLVFLISLVL 532
Query: 539 FSFSVNKARRFN 550
F FSV+ R F+
Sbjct: 533 F*FSVSVLRFFD 568
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.351 0.158 0.516
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 46,772,403
Number of Sequences: 36976
Number of extensions: 830163
Number of successful extensions: 10374
Number of sequences better than 10.0: 55
Number of HSP's better than 10.0 without gapping: 4472
Number of HSP's successfully gapped in prelim test: 707
Number of HSP's that attempted gapping in prelim test: 5291
Number of HSP's gapped (non-prelim): 6028
length of query: 1265
length of database: 9,014,727
effective HSP length: 107
effective length of query: 1158
effective length of database: 5,058,295
effective search space: 5857505610
effective search space used: 5857505610
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 14 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 38 (21.9 bits)
S2: 64 (29.3 bits)
Medicago: description of AC147537.1