
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC146721.2 - phase: 1 /pseudo
(407 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
CA921489 weakly similar to PIR|T47566|T475 hypothetical protein ... 181 5e-46
TC85019 weakly similar to GP|9293890|dbj|BAB01793.1 emb|CAB70981... 179 2e-45
AJ548369 weakly similar to PIR|T48469|T484 hypothetical protein ... 122 2e-28
BE942775 77 2e-14
AW696166 GP|15025223|gb Uncharacterized conserved protein {Clost... 72 5e-13
AW687065 similar to GP|9293890|dbj| emb|CAB70981.1~gene_id:MVE11... 59 3e-09
BG586161 similar to GP|21554129|gb| putative thaumatin {Arabidop... 44 9e-05
BG587051 42 4e-04
TC83706 weakly similar to PIR|T48283|T48283 ankyrin-like protein... 41 0.001
TC91163 similar to PIR|T44832|T44832 probable emulsan repeating ... 35 0.040
TC88966 similar to GP|21305941|gb|AAM45816.1 NADH dehydrogenase ... 33 0.15
TC83751 weakly similar to GP|21311555|gb|AAM46778.1 15.8 kDa ole... 31 0.76
TC87068 similar to PIR|G96703|G96703 unknown protein 30164-3299... 30 1.3
TC84611 similar to PIR|T48213|T48213 hypothetical protein T20L15... 29 2.9
TC77007 similar to PIR|T10804|T10804 tonoplast intrinsic protein... 29 3.8
TC84209 similar to GP|4204311|gb|AAD10692.1| lcl|prt_seq No defi... 28 6.4
TC86560 similar to GP|8843737|dbj|BAA97285.1 myosin heavy chain-... 28 8.4
BQ140905 28 8.4
AL383749 similar to GP|11907611|gb| Cdc24 {Eremothecium gossypii... 28 8.4
>CA921489 weakly similar to PIR|T47566|T475 hypothetical protein F24B22.30 -
Arabidopsis thaliana, partial (10%)
Length = 809
Score = 181 bits (459), Expect = 5e-46
Identities = 97/207 (46%), Positives = 133/207 (63%), Gaps = 1/207 (0%)
Frame = -3
Query: 194 AALQMQGELQWFKAVESTVPPMCKEAKNADGLTPHELFTKNHEHLLNEGRQWAKDIASSF 253
+A QMQ E FK VE P+ KE KN +G T ++F + H+ LL EG+ W KD ++S
Sbjct: 729 SAFQMQREYSGFKEVEKWDHPLHKEVKNQEGKTAWQVFKEEHKALLEEGKNWMKDTSNSC 550
Query: 254 TIVGTLIITIMFAAAFTVPGGNNQDKGTPIFLGKNAFSFFIVTDSLSLIASASSVLMFIG 313
+V TLI TI FAAA TVPGGNNQDKG PIFL N F F+V+D+L+L +S +S+LMF+
Sbjct: 549 MLVATLIATIAFAAAITVPGGNNQDKGIPIFLSDNTFMVFVVSDALALFSSMASLLMFLA 370
Query: 314 ILTSRYAEEDFNTSLPAKLLFGLFTIFLSVVFMMCSFCSALALMLK-GYRWIIITAIASS 372
IL +RY EEDF +LP +L+ G+ ++F +VV M +F +AL+++LK W I +
Sbjct: 369 ILNARYTEEDFMMALPERLILGMASLFFAVVTTMVAFGAALSMLLKERLTWAPIPIALLA 190
Query: 373 VIPILVFMFSLLRLFSEVCISFLRSYF 399
+PI +F L LF E+ IS S F
Sbjct: 189 CVPIALFAKLQLPLFIEMVISTYESQF 109
>TC85019 weakly similar to GP|9293890|dbj|BAB01793.1
emb|CAB70981.1~gene_id:MVE11.3~similar to unknown
protein {Arabidopsis thaliana}, partial (10%)
Length = 663
Score = 179 bits (453), Expect = 2e-45
Identities = 101/215 (46%), Positives = 135/215 (61%), Gaps = 1/215 (0%)
Frame = +2
Query: 143 AILNRQDKVFKLIYEMEGQKELKTTK-DIFENNLLHLAAELGPSSYRGCRSNAALQMQGE 201
AI NRQ+KVF L+ EM L D N HLAA + +S + AA QM+ E
Sbjct: 23 AIKNRQEKVFNLLREMPIICNLLVLALDESNNTTSHLAARV--ASQAESIACAAFQMKRE 196
Query: 202 LQWFKAVESTVPPMCKEAKNADGLTPHELFTKNHEHLLNEGRQWAKDIASSFTIVGTLII 261
L WFK VE P+ K+ KN DG T ++F + H+ LL EG+ W KD ++S +V TLI
Sbjct: 197 LHWFKEVEKLDHPLHKDVKNNDGKTAWQVFKEEHKTLLEEGKNWMKDTSNSCMLVATLIA 376
Query: 262 TIMFAAAFTVPGGNNQDKGTPIFLGKNAFSFFIVTDSLSLIASASSVLMFIGILTSRYAE 321
TI FAAA TVPGGNNQDKG PIFL F FIV+D+L+L +S S+LMF+ I+ RYA+
Sbjct: 377 TITFAAAITVPGGNNQDKGIPIFLSDKTFMLFIVSDALALFSSMVSLLMFLSIIHGRYAK 556
Query: 322 EDFNTSLPAKLLFGLFTIFLSVVFMMCSFCSALAL 356
EDF +LP +L+ G+ +F +V M +F +AL++
Sbjct: 557 EDFVVALPKRLILGMAALFFAVGTTMIAFGAALSM 661
>AJ548369 weakly similar to PIR|T48469|T484 hypothetical protein T1E3.90 -
Arabidopsis thaliana, partial (18%)
Length = 579
Score = 122 bits (306), Expect = 2e-28
Identities = 64/175 (36%), Positives = 99/175 (56%), Gaps = 1/175 (0%)
Frame = +2
Query: 109 AAKYGIIEFINSMREANPDLLWAMDKYKRGIFAHAILNRQDKVFKLIYEMEGQKE-LKTT 167
A K G F+ + + PDL+ +D R IF A+ + +F LI+E+ K+ +
Sbjct: 50 ATKVGNFHFVAELLRSEPDLIRDVDDKNRSIFHIAVQHCHSSIFSLIHELGSFKDSIIDL 229
Query: 168 KDIFENNLLHLAAELGPSSYRGCRSNAALQMQGELQWFKAVESTVPPMCKEAKNADGLTP 227
+D NN+LH AA+L P S S AALQM E+ WF+ V+ + P+ + KN++G TP
Sbjct: 230 EDDERNNILHYAAKLAPPSQLNLISGAALQMTHEILWFEEVKELMSPIEIKKKNSNGKTP 409
Query: 228 HELFTKNHEHLLNEGRQWAKDIASSFTIVGTLIITIMFAAAFTVPGGNNQDKGTP 282
E+F + H+ LL + W + I + ++ TLI T +F A F +PGG N++ GTP
Sbjct: 410 DEIFAEEHKELLTKAESWIESITNYCILISTLIFTGVFTATFNIPGGFNKNTGTP 574
>BE942775
Length = 111
Score = 76.6 bits (187), Expect = 2e-14
Identities = 36/36 (100%), Positives = 36/36 (100%)
Frame = +3
Query: 73 NHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQ 108
NHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQ
Sbjct: 3 NHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQ 110
>AW696166 GP|15025223|gb Uncharacterized conserved protein {Clostridium
acetobutylicum}, partial (1%)
Length = 661
Score = 71.6 bits (174), Expect = 5e-13
Identities = 45/151 (29%), Positives = 72/151 (46%), Gaps = 5/151 (3%)
Frame = +3
Query: 107 LQAAKYGIIEFINSMREANPDLLWAMDKYKRGIFAHAILNRQDKVFKLIYEMEGQK---- 162
L AAK GI+E +N + P + K + A+ RQ + + + ++ K
Sbjct: 117 LVAAKNGIVEMVNEILIKVPSAIHNTTSRKENVLLVAVKYRQPLIVETLRMIKHSKPELW 296
Query: 163 -ELKTTKDIFENNLLHLAAELGPSSYRGCRSNAALQMQGELQWFKAVESTVPPMCKEAKN 221
L D EN +LHLAAE + +ALQM +++WF+ ++S VP N
Sbjct: 297 NNLILAMDEDENTVLHLAAEALGGDKPWQIAGSALQMMWDIKWFQYIKSLVPQHFIFRNN 476
Query: 222 ADGLTPHELFTKNHEHLLNEGRQWAKDIASS 252
+ G T E+F K H+ L+ + +W KD + S
Sbjct: 477 SSGKTSREIFKKTHKGLIKDSSEWLKDTSES 569
>AW687065 similar to GP|9293890|dbj| emb|CAB70981.1~gene_id:MVE11.3~similar
to unknown protein {Arabidopsis thaliana}, partial (2%)
Length = 650
Score = 58.9 bits (141), Expect = 3e-09
Identities = 29/61 (47%), Positives = 39/61 (63%)
Frame = +3
Query: 208 VESTVPPMCKEAKNADGLTPHELFTKNHEHLLNEGRQWAKDIASSFTIVGTLIITIMFAA 267
VE P+ KE KN +G T ++F + H+ LL EG+ W KD ++S +V TLI TI FAA
Sbjct: 465 VEKLDHPLHKEVKNQEGKTAWQVFKEEHKALLEEGKNWMKDTSNSCMLVATLIATIAFAA 644
Query: 268 A 268
A
Sbjct: 645 A 647
Score = 31.2 bits (69), Expect = 0.76
Identities = 18/35 (51%), Positives = 21/35 (59%)
Frame = +3
Query: 172 ENNLLHLAAELGPSSYRGCRSNAALQMQGELQWFK 206
+N HLAA L +S S +A QMQ ELQWFK
Sbjct: 3 QNTTSHLAARL--ASQVESISGSAFQMQRELQWFK 101
>BG586161 similar to GP|21554129|gb| putative thaumatin {Arabidopsis
thaliana}, partial (6%)
Length = 754
Score = 44.3 bits (103), Expect = 9e-05
Identities = 38/142 (26%), Positives = 65/142 (45%), Gaps = 4/142 (2%)
Frame = +1
Query: 247 KDIASSFTIVGTLIITIMFAAAFTVPGGNNQDKGTPIFLGKNAFSFFIVTDSLSLIASAS 306
KD+ + +V TLIIT AA F VPG + G L F FI+ ++SL +S S
Sbjct: 235 KDMVETLILVSTLIITASVAACFAVPG---EADGKANNLCHAMFQAFIIFITISLFSSIS 405
Query: 307 SVLMFIGILTSRYAEEDFNTSLPAKLLFGLFTIFLSVVFMMCSFCSALALMLKGYRWI-- 364
S+++ F+ + +L G+ I LS+ F+ + L ++ W+
Sbjct: 406 SIIILFWATLGLTELVKFSLKIVMPIL-GIALISLSLAFI-----AGLYTVISELTWLAN 567
Query: 365 --IITAIASSVIPILVFMFSLL 384
++ + V+ IL++M L
Sbjct: 568 VFLVMTLIFVVVEILLYMLLFL 633
>BG587051
Length = 759
Score = 42.0 bits (97), Expect = 4e-04
Identities = 25/65 (38%), Positives = 36/65 (54%), Gaps = 2/65 (3%)
Frame = +1
Query: 144 ILNRQDKVFKLIYEMEGQKELKTT--KDIFENNLLHLAAELGPSSYRGCRSNAALQMQGE 201
+L+R +F L++++ K + T D N LLHLAA+L P + S AA QM E
Sbjct: 541 VLHRHASIFNLVHQIGHIKGIIVTYENDDDRNTLLHLAAKLAPRNQLELVSGAAFQMCVE 720
Query: 202 LQWFK 206
L WF+
Sbjct: 721 LLWFE 735
>TC83706 weakly similar to PIR|T48283|T48283 ankyrin-like protein -
Arabidopsis thaliana, partial (34%)
Length = 746
Score = 40.8 bits (94), Expect = 0.001
Identities = 30/109 (27%), Positives = 52/109 (47%), Gaps = 9/109 (8%)
Frame = +1
Query: 251 SSFTIVGTLIITIMFAAAFTVPGGNNQD---------KGTPIFLGKNAFSFFIVTDSLSL 301
+S T+V LI T+ FAA FTVPG Q+ G F F++ DS +L
Sbjct: 100 NSNTVVAVLIATVAFAAIFTVPGQYPQNTKNLAPGMSPGEANIAPNIEFLIFVIFDSTAL 279
Query: 302 IASASSVLMFIGILTSRYAEEDFNTSLPAKLLFGLFTIFLSVVFMMCSF 350
S + V++ ++ + T++ KL++ + + +SV F+ S+
Sbjct: 280 FISLAVVIVQTSVVVIEREAKKQMTAVINKLMW-IACVLISVAFLAMSY 423
>TC91163 similar to PIR|T44832|T44832 probable emulsan repeating unit
polymerase [imported] - Acinetobacter lwoffii, partial
(3%)
Length = 606
Score = 35.4 bits (80), Expect = 0.040
Identities = 27/71 (38%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Frame = +1
Query: 332 LLFGLFTIFLSVV--FMMCSFCSALALMLKGYRWIIITAIASSVIPILVFMFSLLRLFSE 389
+LF F F VV F+ SFCSAL L R +++A+ S+ +LV S LF
Sbjct: 313 VLFSFFVPFFIVVSFFLFFSFCSALLLC----RRFVVSAVGSAAARLLVCGSSASCLFLV 480
Query: 390 VCISFLRSYFL 400
V + L+S FL
Sbjct: 481 VLLCRLKSGFL 513
>TC88966 similar to GP|21305941|gb|AAM45816.1 NADH dehydrogenase subunit 2
{Neoheterandria umbratilis}, partial (4%)
Length = 915
Score = 33.5 bits (75), Expect = 0.15
Identities = 23/92 (25%), Positives = 42/92 (45%), Gaps = 23/92 (25%)
Frame = -2
Query: 244 QWAKDIASSFTIVGTLIITIMFAAAFTVPGGNNQDK---------------GTPIFLGK- 287
+W KD+ SS ++ +LI T+ F+ A PGG Q T I +G+
Sbjct: 821 EWLKDMKSSISLTASLIATLTFSLATNPPGGVVQASVGDSNECGKILISTINTTICVGEA 642
Query: 288 -------NAFSFFIVTDSLSLIASASSVLMFI 312
+ + F++ +++ IAS S +L+ +
Sbjct: 641 ILATRSHDKYLAFLICNTICFIASLSVILVLV 546
>TC83751 weakly similar to GP|21311555|gb|AAM46778.1 15.8 kDa oleosin
{Theobroma cacao}, partial (29%)
Length = 860
Score = 31.2 bits (69), Expect = 0.76
Identities = 17/55 (30%), Positives = 30/55 (53%)
Frame = +1
Query: 333 LFGLFTIFLSVVFMMCSFCSALALMLKGYRWIIITAIASSVIPILVFMFSLLRLF 387
LFGLFT+F+ + C + + G+ I+ + + PILV +F++L +F
Sbjct: 91 LFGLFTLFIVCTISLFLTCLTFVVTIMGF--ILFAPMIILLSPILVPVFAVLFVF 249
>TC87068 similar to PIR|G96703|G96703 unknown protein 30164-32998
[imported] - Arabidopsis thaliana, partial (24%)
Length = 711
Score = 30.4 bits (67), Expect = 1.3
Identities = 20/61 (32%), Positives = 29/61 (46%), Gaps = 12/61 (19%)
Frame = -3
Query: 333 LFGLFTIFLSVVFMMCSFCSALALMLKG------------YRWIIITAIASSVIPILVFM 380
L G+ IF +++F M SF AL L+ Y W +I I + PI+VF+
Sbjct: 412 LNGIVFIF*ALIFFMTSFSQP*ALFLQQIIQNSWNIILCIY*WQVIWLIIQTSFPIIVFL 233
Query: 381 F 381
F
Sbjct: 232 F 230
>TC84611 similar to PIR|T48213|T48213 hypothetical protein T20L15.190 -
Arabidopsis thaliana, partial (52%)
Length = 946
Score = 29.3 bits (64), Expect = 2.9
Identities = 17/66 (25%), Positives = 34/66 (50%)
Frame = +1
Query: 54 NLIKFTMLSGIKKIYGIKRNHYLVREIMRRLCEKIEKISSESELHQCSIHDAMLQAAKYG 113
NL + ++ I KRN ++++IMR++ ++KI +H+ + + L K G
Sbjct: 79 NLESVMFRNALQGIDSSKRNALIIKQIMRQIITSLKKIHDTGIVHR-DVKPSNLVVTKKG 255
Query: 114 IIEFIN 119
I+ I+
Sbjct: 256 QIKLID 273
>TC77007 similar to PIR|T10804|T10804 tonoplast intrinsic protein delta
type - upland cotton, complete
Length = 1041
Score = 28.9 bits (63), Expect = 3.8
Identities = 15/60 (25%), Positives = 34/60 (56%), Gaps = 3/60 (5%)
Frame = +3
Query: 331 KLLFGLFTIFLSVVFMMCSFCSALALMLKGYRWIIITAIASSVIP---ILVFMFSLLRLF 387
+LLF + FL ++ ++ +F ++ +L W+++ + SS++ +L F+ LL +F
Sbjct: 249 QLLFAMVLHFLLLLQLVPTFLVVMSTLLSPLDWLLVDRLPSSLVSSTGLLSFLDPLLHVF 428
>TC84209 similar to GP|4204311|gb|AAD10692.1| lcl|prt_seq No definition line
found {Arabidopsis thaliana}, partial (36%)
Length = 701
Score = 28.1 bits (61), Expect = 6.4
Identities = 17/57 (29%), Positives = 29/57 (50%), Gaps = 8/57 (14%)
Frame = -1
Query: 44 SS*LGSLICV--------NLIKFTMLSGIKKIYGIKRNHYLVREIMRRLCEKIEKIS 92
+S GS +C+ LI + K+ + KRNH +VRE+ R+ K +K++
Sbjct: 416 NSLFGSSVCIIAFSQKNHELIGMNQTARKKRCFP-KRNHDIVRELFRKFLNKADKVA 249
>TC86560 similar to GP|8843737|dbj|BAA97285.1 myosin heavy chain-like
{Arabidopsis thaliana}, partial (28%)
Length = 2450
Score = 27.7 bits (60), Expect = 8.4
Identities = 18/49 (36%), Positives = 26/49 (52%), Gaps = 4/49 (8%)
Frame = -3
Query: 289 AFSFFIVTDSLSLIASA----SSVLMFIGILTSRYAEEDFNTSLPAKLL 333
+FSFF + + SL+ASA SSVL FI + R + L K++
Sbjct: 1608 SFSFFSASSNASLVASASFFNSSVLFFISSTSRRASSVSDVNRLSVKII 1462
>BQ140905
Length = 308
Score = 27.7 bits (60), Expect = 8.4
Identities = 19/53 (35%), Positives = 24/53 (44%), Gaps = 2/53 (3%)
Frame = -2
Query: 276 NQDKGTPIFLGKNAFSFFIVTDSLSL--IASASSVLMFIGILTSRYAEEDFNT 326
+ DKG PIF N FS F ++ L L + I +L R ED NT
Sbjct: 298 HDDKGAPIFYTNNNFSTFFCSNRLDLFVVLYEHKKSPKIILLAKRRKRED*NT 140
>AL383749 similar to GP|11907611|gb| Cdc24 {Eremothecium gossypii}, partial
(2%)
Length = 305
Score = 27.7 bits (60), Expect = 8.4
Identities = 12/27 (44%), Positives = 15/27 (55%), Gaps = 5/27 (18%)
Frame = +3
Query: 6 CRCCN-----KSCKVLSSNNDVVSSIW 27
C CCN KSCK+ S N+ S+W
Sbjct: 222 CTCCNNCKLFKSCKLFSIYNNCYYSLW 302
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.328 0.139 0.417
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,616,198
Number of Sequences: 36976
Number of extensions: 193841
Number of successful extensions: 1550
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 1537
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 1547
length of query: 407
length of database: 9,014,727
effective HSP length: 98
effective length of query: 309
effective length of database: 5,391,079
effective search space: 1665843411
effective search space used: 1665843411
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 59 (27.3 bits)
Medicago: description of AC146721.2