
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC125477.10 - phase: 0 /pseudo
(355 letters)
Database: MTGI
36,976 sequences; 27,044,181 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
TC82182 similar to GP|22136508|gb|AAM91332.1 unknown protein {Ar... 305 1e-83
AL370234 similar to PIR|T14076|T14 probable villin [imported] - ... 230 5e-61
AW256886 similar to GP|22136508|gb unknown protein {Arabidopsis ... 202 2e-52
AW697159 similar to GP|22136508|gb unknown protein {Arabidopsis ... 115 2e-26
TC88298 weakly similar to GP|20198093|gb|AAD23629.2 putative vil... 57 1e-08
AW694471 similar to PIR|T14076|T14 probable villin [imported] - ... 52 3e-07
BG584838 weakly similar to PIR|T14076|T14 probable villin [impor... 52 3e-07
AW736221 similar to GP|22136974|gb putative villin 2 protein {Ar... 34 0.075
TC80171 similar to GP|14334824|gb|AAK59590.1 putative protein tr... 31 0.83
TC77302 similar to GP|1354207|gb|AAB82061.1| rof1 {Arabidopsis t... 30 1.4
BQ140834 28 5.4
AW683866 similar to GP|15217302|gb| Hypothetical protein {Oryza ... 28 5.4
TC83174 28 5.4
AW256777 weakly similar to PIR|T01269|T01 serine/threonine-speci... 28 5.4
TC78695 similar to GP|4115563|dbj|BAA36423.1 UDP-glucose:anthocy... 28 7.0
TC90283 similar to GP|14517426|gb|AAK62603.1 AT5g40660/MNF13_180... 28 7.0
TC89046 weakly similar to GP|21740816|emb|CAD41006. OSJNBa0042L1... 28 7.0
TC93161 weakly similar to GP|21536619|gb|AAM60951.1 unknown {Ara... 27 9.2
>TC82182 similar to GP|22136508|gb|AAM91332.1 unknown protein {Arabidopsis
thaliana}, partial (40%)
Length = 750
Score = 305 bits (782), Expect = 1e-83
Identities = 145/251 (57%), Positives = 194/251 (76%), Gaps = 1/251 (0%)
Frame = +1
Query: 67 IPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVYVKEVPFARSSLNHDDIFILDTESKIFQ 126
IP EGG ASGF E EE +TRL+VC+GK VV +K+VPFARSSLNHDD+FILDT+ KI+Q
Sbjct: 1 IPLEGGVASGFXKPEEEEFETRLYVCRGKRVVRLKQVPFARSSLNHDDVFILDTDDKIYQ 180
Query: 127 FNGSNSSIQERAKALEVVQYIKDTYHDGKCEVASIEDGRLMADSESGEFWGLFGGFAPLP 186
FNG+NS+IQERAKALEVVQ++K+ YH+GKC+VA ++DG+L +S+SGEFW LFGGFAP+
Sbjct: 181 FNGANSNIQERAKALEVVQFLKEEYHEGKCDVAIVDDGKLDTESDSGEFWLLFGGFAPIA 360
Query: 187 RKTVSDDDKTIDSHPPKLLCVEKGKAEPFETDSLTKELLDTNKCYILDCGLEVFVWIGRN 246
+K +S+DD ++ P +L + + + E L+K LL+ NKCY+LDCG EVF+W GR
Sbjct: 361 KKVISEDDIIPEAIPAQLYSIIDREVKS-EEGELSKSLLENNKCYLLDCGAEVFIWFGRG 537
Query: 247 TSLDERKSASGSTDELVSSTNRPKS-QIIRVMEGFETVMFRSKFDSWPQTTNAAMPEDGR 305
T ++ERK+A + +E VS NRPKS +I R+ +G+ET F+S FDSWP + + + EDG+
Sbjct: 538 TQVEERKAACQAAEEFVSGQNRPKSTRITRITQGYETRSFKSNFDSWPSGSASTVAEDGK 717
Query: 306 GKVAALLKRQG 316
GKV ALLK+QG
Sbjct: 718 GKVTALLKQQG 750
>AL370234 similar to PIR|T14076|T14 probable villin [imported] - Arabidopsis
thaliana, partial (15%)
Length = 551
Score = 230 bits (587), Expect = 5e-61
Identities = 110/141 (78%), Positives = 125/141 (88%)
Frame = -3
Query: 204 LLCVEKGKAEPFETDSLTKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSASGSTDELV 263
LL VEKG+AEP E DSL +E LDTNKCYILDCGLE+FVW+GRNTSLDERKSASG DELV
Sbjct: 480 LLSVEKGQAEPVEADSLRREFLDTNKCYILDCGLEIFVWMGRNTSLDERKSASGVADELV 301
Query: 264 SSTNRPKSQIIRVMEGFETVMFRSKFDSWPQTTNAAMPEDGRGKVAALLKRQGLDVKGLV 323
S ++ K QI+RV+EGFETV+F+SKFDSWPQT + + EDGRGKVAALLKRQG++VKGL+
Sbjct: 300 SGIDQLKPQIVRVIEGFETVLFKSKFDSWPQTPDVTVSEDGRGKVAALLKRQGVNVKGLL 121
Query: 324 KADPVKEEPQPYIDCTGHLQV 344
KAD VKEEPQPYIDCTGHLQV
Sbjct: 120 KADAVKEEPQPYIDCTGHLQV 58
>AW256886 similar to GP|22136508|gb unknown protein {Arabidopsis thaliana},
partial (28%)
Length = 557
Score = 202 bits (514), Expect = 2e-52
Identities = 94/127 (74%), Positives = 108/127 (85%)
Frame = +2
Query: 1 TTASKSGALRHDIHYWLGKDTSQDEAGAAAIKTVELDAVLGGRAVQYREVQGHETQKFLS 60
TT K G+ DIH+W+GKDTSQDEAG AAIKT+ELDA LGGRAVQ+RE+QGHE+ KFLS
Sbjct: 176 TTQGKGGSYLFDIHFWIGKDTSQDEAGTAAIKTIELDAALGGRAVQWREIQGHESDKFLS 355
Query: 61 YFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVYVKEVPFARSSLNHDDIFILDT 120
YFKPCIIP EGG ASGFK E EE +TRL+VCKGK VV +K++PFARSSLNHDD+FILDT
Sbjct: 356 YFKPCIIPLEGGVASGFKKPEEEEFETRLYVCKGKRVVRIKQIPFARSSLNHDDVFILDT 535
Query: 121 ESKIFQF 127
+ K FQF
Sbjct: 536 QDKNFQF 556
>AW697159 similar to GP|22136508|gb unknown protein {Arabidopsis thaliana},
partial (21%)
Length = 651
Score = 115 bits (289), Expect = 2e-26
Identities = 52/72 (72%), Positives = 60/72 (83%)
Frame = +3
Query: 1 TTASKSGALRHDIHYWLGKDTSQDEAGAAAIKTVELDAVLGGRAVQYREVQGHETQKFLS 60
TT K A +D+H+W+GKDTSQDEAG AAIK VELDA LGGRAVQ+RE+QGHE+ FLS
Sbjct: 387 TTQGKGAAYFYDLHFWIGKDTSQDEAGTAAIKAVELDAALGGRAVQHREIQGHESDNFLS 566
Query: 61 YFKPCIIPQEGG 72
YF+PCIIP EGG
Sbjct: 567 YFRPCIIPLEGG 602
>TC88298 weakly similar to GP|20198093|gb|AAD23629.2 putative villin
{Arabidopsis thaliana}, partial (25%)
Length = 1430
Score = 56.6 bits (135), Expect = 1e-08
Identities = 58/236 (24%), Positives = 103/236 (43%), Gaps = 8/236 (3%)
Frame = +1
Query: 84 EHKTRLFV--CKGKHVVYVKEVPFARSSLNHDDIFILDTESKIFQFNGSNSSIQERAKAL 141
E+ LF+ C + +V SSLN +I+ TE+ ++ + GS SS ++
Sbjct: 43 ENLVALFLVHCTSPDNMQAIQVNQVSSSLNSSYCYIVQTEAAMYTWIGSLSSARDHTLLD 222
Query: 142 EVVQYIKDTYHDGKCEVASIEDGRLMADSESGEFWGLFGGFAPLPRKTVSDDDKTIDSHP 201
+V+ + T S+ +G +E FW + GG A P++ + ID
Sbjct: 223 RMVELLNPTQLP-----VSVREG-----NEPDIFWDVLGGKAEYPKE--KEIQGFIDDPH 366
Query: 202 PKLLCVEKGKAEPFETDSLTKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSASG---- 257
L + KG + E + T++ L T +LDC E+++W+G ++ + ++ A
Sbjct: 367 LFALKITKGDFKVKEIYNYTQDDLITEDVLLLDCQREIYIWVGLHSVVKSKQEALNLGLK 546
Query: 258 --STDELVSSTNRPKSQIIRVMEGFETVMFRSKFDSWPQTTNAAMPEDGRGKVAAL 311
D LV + + I VMEG+E F ++F W + + K+A L
Sbjct: 547 FLEMDVLVEGLSL-EVPIYVVMEGYEPPFF-TRFFLWDHSKANIIGNSFERKLAIL 708
>AW694471 similar to PIR|T14076|T14 probable villin [imported] - Arabidopsis
thaliana, partial (21%)
Length = 648
Score = 52.4 bits (124), Expect = 3e-07
Identities = 50/203 (24%), Positives = 93/203 (45%), Gaps = 12/203 (5%)
Frame = +3
Query: 16 WLGKDTSQDEAGAAAIKTVELDAVLGGRAVQYREVQGHETQKFLSYFKPCIIPQEGGAAS 75
W+GK++ ++E +A ++ + A Q R +G+E +F S + I+ +GG +
Sbjct: 30 WIGKNSVEEERASANSLASKMVESMKFLASQARIYEGNEPIQFHSILQTFIV-FKGGLSD 206
Query: 76 GFKHVEAE---------EHKTRLFVCKGK---HVVYVKEVPFARSSLNHDDIFILDTESK 123
G+K AE E LF +G ++ ++ P A SSLN +IL
Sbjct: 207 GYKTYIAEKEIPDETYNEDSVALFRIQGTGPDNMQAIQVEPVA-SSLNSSYCYILHNGPA 383
Query: 124 IFQFNGSNSSIQERAKALEVVQYIKDTYHDGKCEVASIEDGRLMADSESGEFWGLFGGFA 183
IF ++GSN++ +++ ++ IK +++ +ES +FW L GG +
Sbjct: 384 IFTWSGSNTTAEDQELIERMLDLIK----------PNLQSKPQREGTESEQFWDLLGGKS 533
Query: 184 PLPRKTVSDDDKTIDSHPPKLLC 206
P + +S + ++ P L C
Sbjct: 534 EYPSQKISREAES----DPHLFC 590
Score = 33.1 bits (74), Expect = 0.17
Identities = 18/46 (39%), Positives = 26/46 (56%)
Frame = +3
Query: 242 WIGRNTSLDERKSASGSTDELVSSTNRPKSQIIRVMEGFETVMFRS 287
WIG+N+ +ER SA+ ++V S SQ R+ EG E + F S
Sbjct: 30 WIGKNSVEEERASANSLASKMVESMKFLASQ-ARIYEGNEPIQFHS 164
Score = 31.6 bits (70), Expect = 0.49
Identities = 18/61 (29%), Positives = 30/61 (48%)
Frame = +3
Query: 225 LDTNKCYILDCGLEVFVWIGRNTSLDERKSASGSTDELVSSTNRPKSQIIRVMEGFETVM 284
L+++ CYIL G +F W G NT+ ++++ D + +P Q EG E+
Sbjct: 342 LNSSYCYILHNGPAIFTWSGSNTTAEDQELIERMLDLI-----KPNLQSKPQREGTESEQ 506
Query: 285 F 285
F
Sbjct: 507 F 509
>BG584838 weakly similar to PIR|T14076|T14 probable villin [imported] -
Arabidopsis thaliana, partial (13%)
Length = 811
Score = 52.0 bits (123), Expect = 3e-07
Identities = 38/148 (25%), Positives = 71/148 (47%)
Frame = +2
Query: 108 SSLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGKCEVASIEDGRLM 167
SSLN +I+ TE+ ++ + GS SS ++ +V+ + T S+ +G
Sbjct: 134 SSLNSSYCYIVQTEAAMYTWIGSLSSARDHTLLDRMVELLNPTQLP-----VSVREG--- 289
Query: 168 ADSESGEFWGLFGGFAPLPRKTVSDDDKTIDSHPPKLLCVEKGKAEPFETDSLTKELLDT 227
+E FW + GG A P++ + ID L + KG + E + T++ L T
Sbjct: 290 --NEPDIFWDVLGGKAEYPKE--KEIQGFIDDPHLFALKITKGDFKVKEIYNYTQDDLIT 457
Query: 228 NKCYILDCGLEVFVWIGRNTSLDERKSA 255
+LDC E+++W+G ++ + ++ A
Sbjct: 458 EDVLLLDCQREIYIWVGLHSVVKSKQEA 541
>AW736221 similar to GP|22136974|gb putative villin 2 protein {Arabidopsis
thaliana}, partial (13%)
Length = 420
Score = 34.3 bits (77), Expect = 0.075
Identities = 33/131 (25%), Positives = 59/131 (44%), Gaps = 13/131 (9%)
Frame = +3
Query: 165 RLMADSESGEFWGLF-------GGFAPLPRKTVSDD---DKTIDSHPPKLLCVEKG---K 211
R+ ES EF LF GG + +K ++D D+T + L+ +
Sbjct: 45 RIFDGKESPEFVALFQPIVVLKGGVSSGYKKLIADKGLPDETYTAESIALIRISGTAIHN 224
Query: 212 AEPFETDSLTKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSASGSTDELVSSTNRPKS 271
++ + D++ L T +C++L G VF W G +S+++++ A+ + L RP
Sbjct: 225 SKTMQVDAVAASLNST-ECFLLQSGSTVFTWHGNQSSVEQQQLATKVAEFL-----RPGI 386
Query: 272 QIIRVMEGFET 282
+ EG ET
Sbjct: 387 ALKYSKEGTET 419
>TC80171 similar to GP|14334824|gb|AAK59590.1 putative protein transport
factor {Arabidopsis thaliana}, partial (36%)
Length = 1340
Score = 30.8 bits (68), Expect = 0.83
Identities = 17/60 (28%), Positives = 30/60 (49%), Gaps = 5/60 (8%)
Frame = +2
Query: 225 LDTNKCYILDCGLEVFVWIGRNTSLDERKSASG-----STDELVSSTNRPKSQIIRVMEG 279
+ ++ +LD G +VF+W+G +E KSAS + E ++ P +I+ EG
Sbjct: 581 MQSDTAVVLDHGTDVFIWLGAELVANEGKSASALAACRTLAEELTEFRFPAPRILAFKEG 760
>TC77302 similar to GP|1354207|gb|AAB82061.1| rof1 {Arabidopsis thaliana},
partial (56%)
Length = 1221
Score = 30.0 bits (66), Expect = 1.4
Identities = 22/56 (39%), Positives = 31/56 (55%), Gaps = 3/56 (5%)
Frame = +1
Query: 159 ASIEDGRLMADSESGEFWGLFGGFAP-LPR--KTVSDDDKTIDSHPPKLLCVEKGK 211
A ++DG L+A S+ EF G F P LP+ KT+ +K I + P+ EKGK
Sbjct: 730 ARLDDGTLVAKSDGVEFTVKEGYFCPALPKAVKTMKKGEKVILTVKPQYGFDEKGK 897
>BQ140834
Length = 560
Score = 28.1 bits (61), Expect = 5.4
Identities = 12/33 (36%), Positives = 20/33 (60%)
Frame = -2
Query: 226 DTNKCYILDCGLEVFVWIGRNTSLDERKSASGS 258
D++KC + +CG E+ V+ +T L R+ GS
Sbjct: 316 DSSKCNLANCGSEIIVFQIMHTHLQMRRDFMGS 218
>AW683866 similar to GP|15217302|gb| Hypothetical protein {Oryza sativa}
[Oryza sativa (japonica cultivar-group)], partial (4%)
Length = 552
Score = 28.1 bits (61), Expect = 5.4
Identities = 19/70 (27%), Positives = 32/70 (45%)
Frame = +3
Query: 93 KGKHVVYVKEVPFARSSLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYH 152
K ++ V K++ FAR L FIL + K+ + GS ER + + + K+ H
Sbjct: 231 KSRNTVMKKKIQFAREKLXMKKAFILYYQLKLLEAWGS----MER-QHVSTITATKECLH 395
Query: 153 DGKCEVASIE 162
C + +E
Sbjct: 396 SAVCRIPLLE 425
>TC83174
Length = 1323
Score = 28.1 bits (61), Expect = 5.4
Identities = 12/33 (36%), Positives = 20/33 (60%)
Frame = -1
Query: 226 DTNKCYILDCGLEVFVWIGRNTSLDERKSASGS 258
D++KC + +CG E+ V+ +T L R+ GS
Sbjct: 336 DSSKCNLANCGSEIIVFQIMHTHLQMRRDFMGS 238
>AW256777 weakly similar to PIR|T01269|T01 serine/threonine-specific protein
kinase (EC 2.7.1.-) F27F23.1 - Arabidopsis thaliana,
partial (13%)
Length = 373
Score = 28.1 bits (61), Expect = 5.4
Identities = 26/88 (29%), Positives = 37/88 (41%), Gaps = 20/88 (22%)
Frame = +3
Query: 201 PPKLLCVEKGKAEPFETDSLTKELLDT----NKCYIL-------DCGLEVFVWIGRNTSL 249
PP L VE K + F ++ +DT K Y + CG ++W G N SL
Sbjct: 48 PPILNAVEVYKVKNFSQSETQQDDVDTMRNIKKAYGVARNWQGDPCGPVNYMWEGLNCSL 227
Query: 250 DERK---------SASGSTDELVSSTNR 268
D S+SG T E+ SS ++
Sbjct: 228 DGNNIPRITSLNLSSSGLTGEISSSISK 311
>TC78695 similar to GP|4115563|dbj|BAA36423.1 UDP-glucose:anthocyanin
5-O-glucosyltransferase {Verbena x hybrida}, partial
(49%)
Length = 1741
Score = 27.7 bits (60), Expect = 7.0
Identities = 33/139 (23%), Positives = 53/139 (37%), Gaps = 28/139 (20%)
Frame = +1
Query: 220 LTKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSASGSTDEL---VSSTNRPKSQIIRV 276
L+K ++ +LD G F+W+ R+ L ++K DEL N +I++
Sbjct: 943 LSKRQMEEIARALLDSGFS-FLWVIRDKKLQQQKEEEVDDDELSCREELENNMNGKIVKW 1119
Query: 277 MEGFETVMFRS------------------------KFDSW-PQTTNAAMPEDGRGKVAAL 311
E + RS F W QTTNA + ED V
Sbjct: 1120 CSQVEVLSHRSLGCFMTHCGWNSTLESLGSGVPMVAFPQWTDQTTNAKLIED----VWKT 1287
Query: 312 LKRQGLDVKGLVKADPVKE 330
R D +G+VK + +++
Sbjct: 1288 GLRMEHDEEGMVKVEEIRK 1344
>TC90283 similar to GP|14517426|gb|AAK62603.1 AT5g40660/MNF13_180
{Arabidopsis thaliana}, partial (69%)
Length = 1101
Score = 27.7 bits (60), Expect = 7.0
Identities = 17/55 (30%), Positives = 28/55 (50%)
Frame = -3
Query: 57 KFLSYFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVYVKEVPFARSSLN 111
+ L+ F PC G+AS + HK CKG + +Y+ +PF+ +SL+
Sbjct: 544 QMLNNFGPCHRNSFQGSAS-------KSHKRH---CKGSYAIYLLIIPFSGNSLS 410
>TC89046 weakly similar to GP|21740816|emb|CAD41006. OSJNBa0042L16.4 {Oryza
sativa}, partial (35%)
Length = 1399
Score = 27.7 bits (60), Expect = 7.0
Identities = 18/66 (27%), Positives = 35/66 (52%)
Frame = +1
Query: 130 SNSSIQERAKALEVVQYIKDTYHDGKCEVASIEDGRLMADSESGEFWGLFGGFAPLPRKT 189
++SS +R +EVV+++KD K E+ ++ + + D E+ E + F K
Sbjct: 997 TDSSPDKRPSMIEVVEWLKDGVSKRKKEIPNLSNNK-GHDEENDENYEEFVTMQSNNLKI 1173
Query: 190 VSDDDK 195
+SD+D+
Sbjct: 1174 LSDNDR 1191
>TC93161 weakly similar to GP|21536619|gb|AAM60951.1 unknown {Arabidopsis
thaliana}, partial (50%)
Length = 823
Score = 27.3 bits (59), Expect = 9.2
Identities = 15/50 (30%), Positives = 23/50 (46%)
Frame = -3
Query: 89 LFVCKGKHVVYVKEVPFARSSLNHDDIFILDTESKIFQFNGSNSSIQERA 138
+FVCK VVY+ + ++ F LDT+ Q N + QE +
Sbjct: 761 IFVCKTYVVVYITSFCVQTHTYSYLIFFFLDTKDAFIQLNHVKYAPQENS 612
Database: MTGI
Posted date: Oct 22, 2004 3:39 PM
Number of letters in database: 27,044,181
Number of sequences in database: 36,976
Lambda K H
0.316 0.135 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,314,731
Number of Sequences: 36976
Number of extensions: 133344
Number of successful extensions: 617
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 605
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 613
length of query: 355
length of database: 9,014,727
effective HSP length: 97
effective length of query: 258
effective length of database: 5,428,055
effective search space: 1400438190
effective search space used: 1400438190
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (27.3 bits)
Medicago: description of AC125477.10