
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC149206.10 - phase: 0
(427 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC86742 similar to PIR|T01541|T01541 hypothetical protein A_IG00... 152 3e-37
BG456581 134 7e-32
CA921218 98 5e-21
TC83780 56 8e-17
TC83226 weakly similar to PIR|G86419|G86419 probable reverse tra... 74 1e-13
BG586862 67 1e-11
TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non... 59 2e-09
CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [i... 55 7e-08
BE124667 53 2e-07
BI273170 53 3e-07
AW686588 50 1e-06
AL382788 weakly similar to GP|11322386|emb glycine receptor beta... 50 2e-06
BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At... 44 9e-05
TC78513 homologue to GP|5679340|emb|CAB51773.1 hypothetical prot... 44 9e-05
BF520135 42 3e-04
AW690000 42 3e-04
TC82948 41 8e-04
TC93136 40 0.001
TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarot... 40 0.002
BF640104 40 0.002
>TC86742 similar to PIR|T01541|T01541 hypothetical protein A_IG005I10.16 -
Arabidopsis thaliana, partial (19%)
Length = 2073
Score = 152 bits (383), Expect = 3e-37
Identities = 74/133 (55%), Positives = 94/133 (70%), Gaps = 4/133 (3%)
Frame = -1
Query: 290 VIFRASQTMFCGAFAQNIGYATALEAEYSACMFAIEKAKELHLTNIWIETDSVNVIRAFH 349
++ R SQ F GA + NIG+AT LEAE+ ACM AIEKA EL L NI +ETDS+ V+ AFH
Sbjct: 471 LVIRDSQFGFLGALSCNIGHATPLEAEFCACMIAIEKAMELGLNNICLETDSLKVVNAFH 292
Query: 350 FNTGVPWKMHIRWHNCLLFCRSIRSLCTHVNREGNLVADALAKNGQGLALYSSH----PL 405
G+PW+M +RWHNC+ FC SI +C H+ REGNLVADALA++GQGL+L+ P
Sbjct: 291 KIVGIPWQMRVRWHNCIRFCHSIACVCVHIPREGNLVADALARHGQGLSLFFLQWWPAPP 112
Query: 406 AFISSFYVRDCLG 418
+FI SF +D G
Sbjct: 111 SFIQSFLAQDRYG 73
>BG456581
Length = 683
Score = 134 bits (337), Expect = 7e-32
Identities = 78/222 (35%), Positives = 111/222 (49%), Gaps = 8/222 (3%)
Frame = +2
Query: 69 GDLSNKLAFSFLPGHGPTVHWAKMLWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGCYIV 128
G L+ KL +SFL H WA +WN+ PP+ +FI WR H++LPTDDNL RGC +V
Sbjct: 8 GKLTIKLVYSFLTSHTSCAPWASTIWNSCIPPSHSFICWRLAHDRLPTDDNLSSRGCALV 187
Query: 129 SICCCFCRKQAETSSHIFLQCPVTLQLWDWLLKATDQHLDFSSI--------LNISRMVQ 180
S+ C FC +Q ETS H+FL+C + LW WL LDFSS + S V+
Sbjct: 188 SM-CSFCLEQVETSDHLFLRCKFVVTLWSWLCSQLRVGLDFSSFKALLSSLPRHCSSQVR 364
Query: 181 HVMNSAIVHIMWSIWLECNNKYFDGVQKPMSTLFNTILAEVLRLSFMLDIVKGASSMQDF 240
+ +A+VH++ SIW NN F K + + ++ LS + G D
Sbjct: 365 DLYVAAVVHMVHSIWWARNNVRFSSA-KVSAHAVQVRVHALIGLSGA--VSTGKCIAADA 535
Query: 241 KLARLFSIPFKTNRVNPCREIIWVPPHGGCMKINCDGSVVGS 282
+ +F IP + + W PP +K N DGS + +
Sbjct: 536 AILDVFRIPPHRRSMXEMVSVCWKPPSAPWVKGNTDGSXLNN 661
>CA921218
Length = 707
Score = 98.2 bits (243), Expect = 5e-21
Identities = 57/153 (37%), Positives = 79/153 (51%), Gaps = 2/153 (1%)
Frame = -3
Query: 273 INCDGSVVGSPSCGSIGVIFRASQTMFCGAFAQNIGYATALEAEYSACMFAIEKAKELHL 332
INC+ + + + + G FR S F FA+N G AL A+ S M AIE A +
Sbjct: 579 INCNTNGSANINTSACGGTFRNSNAYFLLCFAENTGNGNALHAKLSGAMRAIEIAAARNW 400
Query: 333 TNIWIETDSVNVIRAFHFNTGVPWKMHIRWHNCLLFCRSIRSLCTHVNREGNLVADALAK 392
+++W+E DS V+ AF + + W + RW+NCL S+ +HV REGN AD LA
Sbjct: 399 SHLWLELDSSLVVNAFKSKSVIHWNLRNRWNNCLFLISSMNFFVSHVFREGNQCADGLAN 220
Query: 393 NGQGL--ALYSSHPLAFISSFYVRDCLGLPFSR 423
G L Y +H FI FYV + +G P R
Sbjct: 219 FGLSLDHLTYWNHVPPFIHRFYVENKIGWPMFR 121
>TC83780
Length = 400
Score = 55.8 bits (133), Expect(2) = 8e-17
Identities = 26/42 (61%), Positives = 29/42 (68%)
Frame = +3
Query: 127 IVSICCCFCRKQAETSSHIFLQCPVTLQLWDWLLKATDQHLD 168
IVSICC FC QA +S HIFL CPVT+ L DWL T Q +D
Sbjct: 3 IVSICC-FCHNQAVSSHHIFLSCPVTMVLCDWLSSGTGQQID 125
Score = 48.9 bits (115), Expect(2) = 8e-17
Identities = 27/80 (33%), Positives = 42/80 (51%)
Frame = +1
Query: 170 SSILNISRMVQHVMNSAIVHIMWSIWLECNNKYFDGVQKPMSTLFNTILAEVLRLSFMLD 229
S + S +++ VM A++H +W IW+E NN G NT+ + + LSF L+
Sbjct: 148 SCLSTSSSLLRKVMLFAVLHTIWIIWIERNNITLLGQTHXA*IT*NTVF*QNIHLSFSLN 327
Query: 230 IVKGASSMQDFKLARLFSIP 249
++ G SSM FK+ F P
Sbjct: 328 LMMGTSSMXKFKVTLFFQHP 387
>TC83226 weakly similar to PIR|G86419|G86419 probable reverse transcriptase
100033-105622 [imported] - Arabidopsis thaliana, partial
(2%)
Length = 885
Score = 73.9 bits (180), Expect = 1e-13
Identities = 68/276 (24%), Positives = 127/276 (45%), Gaps = 15/276 (5%)
Frame = +3
Query: 86 TVHWAKMLWNAYTPPTGAFITWRFLHNKLPTDDNLRKRG--CYIVSICCCFCRKQAETSS 143
T+ W K +W+ +T P + WR L++ LP +LRKRG CY + C C + ET +
Sbjct: 111 TLIWKK-IWSLHTIPRHKVLLWRILNDSLPVRSSLRKRGIQCYPL---CPRCHSKTETIT 278
Query: 144 HIFLQCPVTLQLWDWLLKATDQHLDFSSILN---ISRMVQHVM------NSAIVHIMWSI 194
H+F+ CP++ ++W ++ ++F ++ N I+ + + ++ I I++++
Sbjct: 279 HLFMSCPLSKRVW----FGSNLCINFDNLPNPNFIN*LYEAIL*KDECITI*IAAIIYNL 446
Query: 195 WLECNNKYFDGVQKPMSTLFNTILAE---VLRLSFMLDIVKGASSMQDFKLARLFSIPFK 251
W N +S L + + E + R S + K A++ +AR P
Sbjct: 447 WHARN----------LSVLEDQTILEMDIIQRASNCISDYKQANTQAPPSMARTGYDPRS 596
Query: 252 TNRVNPCREIIWVPPHGGCMKINCDGSVVGSPSCGSIGVIFRASQTMFCGAFA-QNIGYA 310
+R P + W P+ G +K+N D ++ G +G+I R + A + G
Sbjct: 597 QHR--PAKNTKWKRPNLGLVKVNTDANLQNHGKWG-LGIIIRDEVGLVMAASTWETDGND 767
Query: 311 TALEAEYSACMFAIEKAKELHLTNIWIETDSVNVIR 346
ALEAE A + + AK+ + E D+ +++
Sbjct: 768 RALEAEAYALLTGMRFAKDCGFXKVXFEGDNEKLMK 875
>BG586862
Length = 804
Score = 67.4 bits (163), Expect = 1e-11
Identities = 50/197 (25%), Positives = 82/197 (41%), Gaps = 7/197 (3%)
Frame = -1
Query: 93 LWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGCYIVSICCCFCRKQAETSSHIFLQCPVT 152
+W T P WR LHN LP D L KRG S+ C C + ET H+FL C VT
Sbjct: 654 VWGIKTIPRHKSFLWRLLHNALPVKDELHKRGIR-CSLLCPRCESKIETVQHLFLNCEVT 478
Query: 153 LQLWDWLLKATDQHLDFSSILNISRMVQHVMNS-------AIVHIMWSIWLECNNKYFDG 205
+ +W + S +L+ + + + A+ +++SIW N K F+
Sbjct: 477 QK--EWFGSQLGINFHSSGVLHFHDWITNFILKNDEETIIALTALLYSIWHARNQKVFEN 304
Query: 206 VQKPMSTLFNTILAEVLRLSFMLDIVKGASSMQDFKLARLFSIPFKTNRVNPCREIIWVP 265
+ P + I + +SS+ FK+A++ +N + P + +
Sbjct: 303 IDVPGDVV----------------IQRASSSLHSFKMAQVSDSVLPSNAI-PSYSLWGI- 178
Query: 266 PHGGCMKINCDGSVVGS 282
G + NC+G + S
Sbjct: 177 ---GVVARNCEGLAMAS 136
>TC80819 weakly similar to GP|10140689|gb|AAG13524.1 putative non-LTR
retroelement reverse transcriptase {Oryza sativa
(japonica cultivar-group)}, partial (2%)
Length = 1262
Score = 58.5 bits (140), Expect(2) = 2e-09
Identities = 37/113 (32%), Positives = 55/113 (47%), Gaps = 8/113 (7%)
Frame = +2
Query: 55 DETLDKLIWT-DSVDGDLSNKLAFSFLP--GHGPTVHWAKMLWNAYTPPTGAFITWRFLH 111
D DK W D V+G S K+ + ++ GH +W+ + P + WR L
Sbjct: 506 DNVNDKWRWLLDPVNG-YSVKVFYRYITSTGHISDRSLVDDVWHKHIPSKVSLFVWRLLR 682
Query: 112 NKLPTDDNLRKRGCYIVSICCCFCR-KQAETSSHIFLQCPVTLQLW----DWL 159
N+LPT DNL RG + + C C +E+++H+FL C V LW +WL
Sbjct: 683 NRLPTKDNLVHRGVLLATNAACVCGCVDSESTTHLFLHCNVFCSLWSLVRNWL 841
Score = 21.2 bits (43), Expect(2) = 2e-09
Identities = 7/12 (58%), Positives = 8/12 (66%)
Frame = +1
Query: 192 WSIWLECNNKYF 203
W IW E NN+ F
Sbjct: 946 WVIWKERNNRLF 981
>CA919100 homologue to PIR|G90291|G902 endoglucanase precursor [imported] -
Sulfolobus solfataricus, partial (3%)
Length = 789
Score = 54.7 bits (130), Expect = 7e-08
Identities = 36/141 (25%), Positives = 58/141 (40%), Gaps = 12/141 (8%)
Frame = -2
Query: 89 WAKMLWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGCYIVSICCCFCRKQA-ETSSHIFL 147
+A+M+W+ P + WR L ++LPT NL RG C A E++ H+FL
Sbjct: 596 YAEMIWHRQVPLKVSVFAWRLLRDRLPTKSNLIYRGVIPTEAGLCVSGCGALESAQHLFL 417
Query: 148 QCPVTLQLW----DWL-------LKATDQHLDFSSILNISRMVQHVMNSAIVHIMWSIWL 196
C LW DW+ +D + F ++ Q + + W +W
Sbjct: 416 SCSYFASLWSLVRDWIGFVGVDTNVLSDHFVQFVHSTGGNKASQSFLQLIWLLCAWVLWT 237
Query: 197 ECNNKYFDGVQKPMSTLFNTI 217
E NN F+ P+ L + +
Sbjct: 236 ERNNMCFNDSITPLPRLLDKV 174
>BE124667
Length = 540
Score = 53.1 bits (126), Expect = 2e-07
Identities = 33/128 (25%), Positives = 59/128 (45%), Gaps = 8/128 (6%)
Frame = -1
Query: 132 CCFCRKQAETSSHIFLQCPVTLQLWDWLLKATDQHLDFSSI--------LNISRMVQHVM 183
C C+ +ET+ H+F +C ++W WL + L FS+ LN + + V+
Sbjct: 384 CSSCQASSETTFHLFFECNFATKMWSWLASLLNTPLQFSTSADMWSPLQLNWTPQCKVVI 205
Query: 184 NSAIVHIMWSIWLECNNKYFDGVQKPMSTLFNTILAEVLRLSFMLDIVKGASSMQDFKLA 243
+ I++++ IW NN F T N I+++V LS L A++M +F +
Sbjct: 204 TACIINLLNVIWFRRNNIRFQDKVIDWKTAINMIISKV-SLSGNLTKKTSAANMLEFTIF 28
Query: 244 RLFSIPFK 251
+ + K
Sbjct: 27 KACKVSIK 4
>BI273170
Length = 391
Score = 52.8 bits (125), Expect = 3e-07
Identities = 29/76 (38%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Frame = +1
Query: 354 VPWKMHIRWHNCLLFCRSIRSLCTHVNREGNLVADALAKNGQG---LALYSSHPLAFISS 410
VPWK+ RW NC+L R++ + +HV REGN AD LA G L ++ P I S
Sbjct: 85 VPWKLRNRWENCILATRNMNFIVSHVFREGNECADMLANIGLSLNCLTIWLELP-DCIKS 261
Query: 411 FYVRDCLGLPFSRLSF 426
++++ LG P +R +
Sbjct: 262 IFIKNKLGWPNNRFVY 309
>AW686588
Length = 567
Score = 50.4 bits (119), Expect = 1e-06
Identities = 31/114 (27%), Positives = 52/114 (45%), Gaps = 12/114 (10%)
Frame = +1
Query: 103 AFITWRFLHNKLPTDDNLRKRGCYIVSICCCFCR-KQAETSSHIFLQCPVTLQLWD---- 157
+ + WR + ++LPT NL +R C V C AET++H+FL C +W
Sbjct: 157 SILAWRLIRDRLPTKANLVRRRCLAVEAAGCVVGCGIAETANHLFLHCATFGAVWQHIRA 336
Query: 158 WL-LKATDQH------LDFSSILNISRMVQHVMNSAIVHIMWSIWLECNNKYFD 204
W+ + D H + F + +R + M + +W +W E NN+ F+
Sbjct: 337 WIGVSGADPHDLSDHFIQFITCTGHTRARRSFMQLIWLLCVWMVWNERNNRLFN 498
>AL382788 weakly similar to GP|11322386|emb glycine receptor betaZ subunit
{Danio rerio}, partial (4%)
Length = 353
Score = 50.1 bits (118), Expect = 2e-06
Identities = 33/117 (28%), Positives = 49/117 (41%)
Frame = +2
Query: 268 GGCMKINCDGSVVGSPSCGSIGVIFRASQTMFCGAFAQNIGYATALEAEYSACMFAIEKA 327
GG +K D + GS +G +FR G F G A+ AE+ A +A+E
Sbjct: 5 GGWIKAKIDNATCGSSDPSIVGSLFRDQFGSNFGCFIIFFGVNFAIHAEFYAAAYAVEIV 184
Query: 328 KELHLTNIWIETDSVNVIRAFHFNTGVPWKMHIRWHNCLLFCRSIRSLCTHVNREGN 384
+ N W+ D V+ AFH N V W + + FC + H+ +E N
Sbjct: 185 YLCNWHNFWLACDLKLVVDAFHSNYKVLWSLAT*F-----FCNQMNF*AFHIYKERN 340
>BG587113 weakly similar to PIR|A84888|A8 hypothetical protein At2g45230
[imported] - Arabidopsis thaliana, partial (10%)
Length = 767
Score = 44.3 bits (103), Expect = 9e-05
Identities = 22/64 (34%), Positives = 29/64 (44%)
Frame = -3
Query: 93 LWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGCYIVSICCCFCRKQAETSSHIFLQCPVT 152
+W T P WR + N LPT N+R R C C ++ET +HI QCP
Sbjct: 288 VWKYNTSPKVRHFLWRCISNSLPTAANMRSRHISKDG-SCSRCGMESETVNHILFQCPYA 112
Query: 153 LQLW 156
+W
Sbjct: 111 RLIW 100
>TC78513 homologue to GP|5679340|emb|CAB51773.1 hypothetical protein {Galega
orientalis}, partial (20%)
Length = 765
Score = 44.3 bits (103), Expect = 9e-05
Identities = 20/50 (40%), Positives = 32/50 (64%)
Frame = +2
Query: 354 VPWKMHIRWHNCLLFCRSIRSLCTHVNREGNLVADALAKNGQGLALYSSH 403
+PW + RW+N + + ++ + +H+ REGN VAD LA + GL+L S H
Sbjct: 437 IPWNL*NRWNNVKVILQGLKCIISHIYREGNQVADTLANH--GLSLDSIH 580
>BF520135
Length = 202
Score = 42.4 bits (98), Expect = 3e-04
Identities = 22/65 (33%), Positives = 30/65 (45%), Gaps = 2/65 (3%)
Frame = +3
Query: 89 WAKMLWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGC--YIVSICCCFCRKQAETSSHIF 146
W +LW + WR H +LPT N+ KRG + +C CR E+ H+F
Sbjct: 3 WCLLLWRKEDLSKVSIFAWRLFHGRLPTKANVFKRGIVHHDAHMCVTRCR-LIESDVHLF 179
Query: 147 LQCPV 151
L C V
Sbjct: 180 LHCDV 194
>AW690000
Length = 652
Score = 42.4 bits (98), Expect = 3e-04
Identities = 16/41 (39%), Positives = 23/41 (56%)
Frame = +2
Query: 132 CCFCRKQAETSSHIFLQCPVTLQLWDWLLKATDQHLDFSSI 172
C C KQAE+S H+F +C + LW W ++ L F S+
Sbjct: 14 CSLCCKQAESSLHLFFECSYAVNLWCWFASVLNKTLHFQSL 136
Score = 41.6 bits (96), Expect = 6e-04
Identities = 34/115 (29%), Positives = 50/115 (42%), Gaps = 1/115 (0%)
Frame = +3
Query: 234 ASSMQDFKLARLFSIPFKTNRVNPCREIIWVPPHGGCMKINCDGSVVG-SPSCGSIGVIF 292
++S+ +F + + F++ + E+IW PP +K N DGS S +CG IF
Sbjct: 324 SASISNFLILKKFNVTIHPPKAPKIIEVIWRPPIPHWIKCNTDGSSRSHSSACGG---IF 494
Query: 293 RASQTMFCGAFAQNIGYATALEAEYSACMFAIEKAKELHLTNIWIETDSVNVIRA 347
R T FA+N G A AE + K+ TNI +N I A
Sbjct: 495 RNHDTDLLLCFAENTGECNAFHAELLXSL----KSHXACQTNIXGRISGLNXILA 647
>TC82948
Length = 705
Score = 41.2 bits (95), Expect = 8e-04
Identities = 15/49 (30%), Positives = 28/49 (56%)
Frame = +3
Query: 128 VSICCCFCRKQAETSSHIFLQCPVTLQLWDWLLKATDQHLDFSSILNIS 176
+ C C K E++ H+F +C V++Q+WDW FS+++N++
Sbjct: 432 IDFMCSLCFKSFESTHHLFFECSVSVQIWDW----------FSNLINLN 548
>TC93136
Length = 722
Score = 40.4 bits (93), Expect = 0.001
Identities = 23/83 (27%), Positives = 39/83 (46%), Gaps = 5/83 (6%)
Frame = +1
Query: 140 ETSSHIFLQCPVTLQLWDWLLKATDQHLDFSSILNISRMVQHVMNSAIVHIMWS-----I 194
ETSSH+FL CP +W +L L +S + + R+++ + V ++W I
Sbjct: 262 ETSSHLFLHCPFLSSVWSKIL----GWLYYSKFVCVFRLLERRSRTKGVWLIWHATIWVI 429
Query: 195 WLECNNKYFDGVQKPMSTLFNTI 217
W NN+ F + K + + I
Sbjct: 430 WKGINNRIFKNISKAIDEIVEEI 498
>TC77455 similar to GP|22335695|dbj|BAC10549. nine-cis-epoxycarotenoid
dioxygenase1 {Pisum sativum}, partial (43%)
Length = 1865
Score = 39.7 bits (91), Expect = 0.002
Identities = 23/61 (37%), Positives = 29/61 (46%), Gaps = 2/61 (3%)
Frame = -2
Query: 93 LWNAYTPPTGAFITWRFLHNKLPTDDNLRKRGCYIV--SICCCFCRKQAETSSHIFLQCP 150
LW + P +W +++PT NL KR V S C FC Q ET H+FL C
Sbjct: 322 LWKSKAPAKVLAFSWTLFLDRIPTMVNLGKRRLLRVEDSKRCVFCGCQDETVVHLFLHCD 143
Query: 151 V 151
V
Sbjct: 142 V 140
>BF640104
Length = 344
Score = 39.7 bits (91), Expect = 0.002
Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 5/88 (5%)
Frame = -1
Query: 263 WVPPHGGCMKINCDGSVVGSPSCGSIGVIFRASQTMFCGAFAQN-IGYATALEAE----Y 317
W+ PH G +KINCD ++ G IGVI R + + N G+ + AE Y
Sbjct: 338 WIKPHQGVIKINCDANLTSEDVWG-IGVITRNDNGIVMASGTWNRPGFMCPITAEAWGVY 162
Query: 318 SACMFAIEKAKELHLTNIWIETDSVNVI 345
A +FA+++ N+ E D+ +I
Sbjct: 161 QAALFALDQG----FQNVLFENDNEKLI 90
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.327 0.139 0.458
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,023,318
Number of Sequences: 36976
Number of extensions: 313399
Number of successful extensions: 2576
Number of sequences better than 10.0: 74
Number of HSP's better than 10.0 without gapping: 2493
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2555
length of query: 427
length of database: 9,014,727
effective HSP length: 99
effective length of query: 328
effective length of database: 5,354,103
effective search space: 1756145784
effective search space used: 1756145784
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 60 (27.7 bits)
Medicago: description of AC149206.10