
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC125477.10 - phase: 0 /pseudo
(355 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

BU548249                                                              204   5e-53
AW598284                                                              183   9e-47
TC219145 similar to UP|VIL2_ARATH (O81644) Villin 2, partial (35%)     68   6e-12
TC227838 similar to UP|VIL4_ARATH (O65570) Villin 4, partial (38%)     60   2e-09
TC216626 similar to UP|VIL2_ARATH (O81644) Villin 2, partial (36%)     59   3e-09
TC227706 similar to UP|VIL1_ARATH (O81643) Villin 1, partial (11%)     40   0.001
TC227708 weakly similar to UP|Q9LVC6 (Q9LVC6) Villin, partial (17%)    39   0.005
BG507436 similar to GP|8777365|dbj| villin {Arabidopsis thaliana...    33   0.26
TC229162                                                               32   0.57
TC227707 similar to UP|VIL1_ARATH (O81643) Villin 1, partial (5%)      31   0.98
TC226462 homologue to UP|PCNA_PEA (O82134) Proliferating cell nu...    30   1.7
TC220538 similar to UP|Q9FHM5 (Q9FHM5) Similarity to DNA-binding...    28   4.8
BM086603                                                               28   4.8
TC206651 UP|Q949H4 (Q949H4) Leaf ubiquitous urease , complete          28   4.8
TC230851 weakly similar to UP|Q7Z5Q7 (Q7Z5Q7) Lung cancer oncoge...    28   6.3
TC234144                                                               28   8.3
TC206031 similar to UP|Q38949 (Q38949) Rof1 , partial (68%)            28   8.3

>BU548249
Length = 537
Score = 204 bits (519), Expect = 5e-53
Identities = 94/136 (69%), Positives = 116/136 (85%)
Frame = -2
Query: 1 TTASKSGALRHDIHYWLGKDTSQDEAGAAAIKTVELDAVLGGRAVQYREVQGHETQKFLS 60
TT K +D+H+W+GK TSQDEAG AAIKTVELDA +GGRAVQ+RE+QGHE+ KFLS
Sbjct: 410 TTQGKGSTYFYDLHFWIGKHTSQDEAGTAAIKTVELDAAIGGRAVQHREIQGHESDKFLS 231
Query: 61 YFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVYVKEVPFARSSLNHDDIFILDT 120
YFKPCIIP EGG ASGFK E E+ +T L+VC+GK VV +++VPFARSSLNH+D+FILDT
Sbjct: 230 YFKPCIIPLEGGVASGFKKPEEEKFETCLYVCRGKRVVRLRQVPFARSSLNHEDVFILDT 51
Query: 121 ESKIFQFNGSNSSIQE 136
++KI+QFNG+NS+IQE
Sbjct: 50 QNKIYQFNGANSNIQE 3
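
In a TBLASTN record the query coordinates count amino acids, while the Sbjct coordinates count nucleotides on the translated database sequence. The sketch below is illustrative only and not part of the BLAST output. It shows how the Sbjct coordinates of an ungapped HSP relate to the aligned length and to the reported Frame. The function names are hypothetical, and the frame rule is an assumed convention that agrees with the records in this report, not a quotation of the BLAST source.

```python
def aligned_codons(sbjct_start: int, sbjct_end: int) -> int:
    """Number of codons spanned by the subject coordinates of an HSP.

    Coordinates are 1-based and inclusive. For a minus-strand hit the
    start is larger than the end, so the absolute difference is used.
    """
    span = abs(sbjct_end - sbjct_start) + 1
    if span % 3 != 0:
        raise ValueError("subject span is not a whole number of codons")
    return span // 3


def reading_frame(sbjct_start: int, subject_length: int, minus_strand: bool) -> int:
    """Reading frame implied by the first subject coordinate.

    Assumed convention: plus-strand frames are counted from the 5' end of
    the sequence, minus-strand frames from the 3' end (hence the need for
    the subject length).
    """
    if minus_strand:
        return -((subject_length - sbjct_start) % 3 + 1)
    return (sbjct_start - 1) % 3 + 1


if __name__ == "__main__":
    # BU548249: Length = 537, Sbjct 410 -> 3, reported Frame = -2
    print(aligned_codons(410, 3), reading_frame(410, 537, minus_strand=True))
    # AW598284: Length = 440, Sbjct 95 -> 439, reported Frame = +2
    print(aligned_codons(95, 439), reading_frame(95, 440, minus_strand=False))
```

For a gapped HSP the subject span counts only the translated residues and excludes gap characters in the Sbjct rows, so it can be shorter than the alignment length shown on the Identities line.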
>AW598284
Length = 440
Score = 183 bits (465), Expect = 9e-47
Identities = 86/115 (74%), Positives = 98/115 (84%)
Frame = +2
Query: 1 TTASKSGALRHDIHYWLGKDTSQDEAGAAAIKTVELDAVLGGRAVQYREVQGHETQKFLS 60
TT K A +DIH+W+GKDTSQDEAG AAIKTVELDA LGGRAVQ+RE+QGHE+ KFLS
Sbjct: 95 TTQGKGSAYLYDIHFWIGKDTSQDEAGTAAIKTVELDASLGGRAVQHREIQGHESDKFLS 274
Query: 61 YFKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVYVKEVPFARSSLNHDDI 115
YFKPCIIP EGG ASGFK E EE +TRL+VC+GK VV +K+VPFARSSLNHDD+
Sbjct: 275 YFKPCIIPLEGGVASGFKKPEEEEFETRLYVCRGKRVVRIKQVPFARSSLNHDDV 439
>TC219145 similar to UP|VIL2_ARATH (O81644) Villin 2, partial (35%)
Length = 1261
Score = 68.2 bits (165), Expect = 6e-12
Identities = 71/277 (25%), Positives = 121/277 (43%), Gaps = 16/277 (5%)
Frame = +2
Query: 52 GHETQKFLSYFKPCIIPQEGGAASGFKHVEAE-----EHKTRLFVC------KGKHVVYV 100
G E +F+ F P ++ +GG +SG+K + A+ E T V H V
Sbjct: 11 GKEPPQFIVLFHPMVV-LKGGLSSGYKKLIADKGLPDETYTAESVAFIRISGTSTHNNKV 187
Query: 101 KEVPFARSSLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGKCEVAS 160
+V + LN + F+L + S +F ++G+ S++++ A +V ++++ +
Sbjct: 188 VQVDAVAALLNSTECFVLQSGSAVFTWHGNQCSLEQQQLAAKVAEFLRPGV-----SLKL 352
Query: 161 IEDGRLMADSESGEFWGLFGGFAPLPRKTVSDDDKTIDSHPPKLLCVEKGKAEPFETDSL 220
++G +E+ FW GG K V++D D H L +GK + E +
Sbjct: 353 AKEG-----TETSTFWFALGGKQSYTSKNVTNDIVR-DPHL-FTLSFNRGKLQVEEVYNF 511
Query: 221 TKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSASGSTDELVSSTN-----RPKSQIIR 275
+++ L T ILD EVFVWIG+ E++ A + + P + +
Sbjct: 512 SQDDLLTEDILILDTHTEVFVWIGQCVDPKEKQKAFEIAQKYIDKAASLEGLSPHVPLYK 691
Query: 276 VMEGFETVMFRSKFDSWPQTTNAAMPEDGRGKVAALL 312
V EG E F + F SW T A +P + K LL
Sbjct: 692 VTEGNEPCFFTTYF-SWDH-TKAMVPGNSFQKKVTLL 796
>TC227838 similar to UP|VIL4_ARATH (O65570) Villin 4, partial (38%)
Length = 1235
Score = 60.1 bits (144), Expect = 2e-09
Identities = 69/294 (23%), Positives = 124/294 (41%), Gaps = 18/294 (6%)
Frame = +2
Query: 44 AVQYREVQGHETQKFLSYFKPCIIPQEGGAASGFKHVEAE---------EHKTRLFVCKG 94
A Q R +G+E +F S I+ +GG + G+K A+ E+ LF +G
Sbjct: 8 ASQARIYEGNEPIQFHSILHRFIV-FKGGLSEGYKTYIAQKEIPDDTYNENGVALFRIQG 184
Query: 95 K---HVVYVKEVPFARSSLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTY 151
++ ++ P A SSLN +IL +F G+++S + + ++ IK
Sbjct: 185 SGPDNMQAIQVEPVA-SSLNSSYCYILHNGPAVFTCFGNSTSAENQELVERMLDLIKP-- 355
Query: 152 HDGKCEVASIEDGRLMADSESGEFWGLFGGFAPLPRKTVSDDDKTIDSHPPKLLC-VEKG 210
+++ SES +FW GG + P + + + +S P C KG
Sbjct: 356 --------NLQSKPQREGSESEQFWDFLGGKSEYPSQKILREP---ESDPHLFSCHFSKG 502
Query: 211 KAEPFETDSLTKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSASGSTDELVS-----S 265
+ E + +++ L T +ILDC E+FVW+G+ R A ++ +
Sbjct: 503 NLKVTEVYNFSQDDLMTEDIFILDCHSEIFVWVGQQVDSKSRMQALTIGEKFLEHDFLLE 682
Query: 266 TNRPKSQIIRVMEGFETVMFRSKFDSWPQTTNAAMPEDGRGKVAALLKRQGLDV 319
+ + VMEG E F ++F W ++ + + K+ ++K G V
Sbjct: 683 KLSHVAPVYVVMEGSEPPFF-TRFFKWDSAKSSMLGNSFQRKL-TIVKSGGAPV 838
>TC216626 similar to UP|VIL2_ARATH (O81644) Villin 2, partial (36%)
Length = 1777
Score = 59.3 bits (142), Expect = 3e-09
Identities = 54/208 (25%), Positives = 91/208 (42%), Gaps = 5/208 (2%)
Frame = +3
Query: 109 SLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGKCEVASIEDGRLMA 168
SLN + F+L + S IF ++G+ S +++ A +V +++ A+++ +
Sbjct: 3 SLNSTECFVLQSGSTIFTWHGNQCSFEQQQLAAKVADFLRPG--------ATLKHAK--E 152
Query: 169 DSESGEFWGLFGGFAPLPRKTVSDDDKTIDSHPPKLLCVEKGKAEPFETDSLTKELLDTN 228
+ES FW GG K V ++ + L KGK E + +++ L
Sbjct: 153 GTESSAFWSALGGKQSYTSKKVVNE--VVRDPHLFTLSFNKGKFNVEEVYNFSQDDLLPE 326
Query: 229 KCYILDCGLEVFVWIGRNTSLDERKSA---SGSTDELVSSTN--RPKSQIIRVMEGFETV 283
ILD EVF+WIG + E+++A +LV+S P + +V EG E
Sbjct: 327 DILILDTHAEVFIWIGHSVEPKEKRNAFEIGQKYIDLVASLEGLSPHVPLYKVTEGNEPC 506
Query: 284 MFRSKFDSWPQTTNAAMPEDGRGKVAAL 311
F + F SW M + KV+ L
Sbjct: 507 FFTTYF-SWDHAKAMVMGNSFQKKVSLL 587
>TC227706 similar to UP|VIL1_ARATH (O81643) Villin 1, partial (11%)
Length = 515
Score = 40.4 bits (93), Expect = 0.001
Identities = 35/142 (24%), Positives = 63/142 (43%)
Frame = +1
Query: 102 EVPFARSSLNHDDIFILDTESKIFQFNGSNSSIQERAKALEVVQYIKDTYHDGKCEVASI 161
+V +SLN +IL +++ I+ + GS SS ++ +V+ T+ S+
Sbjct: 73 QVDQVSTSLNSSYCYILQSKASIYTWIGSLSSARDHNLLDRMVELSNPTWLP-----VSV 237
Query: 162 EDGRLMADSESGEFWGLFGGFAPLPRKTVSDDDKTIDSHPPKLLCVEKGKAEPFETDSLT 221
+G +E FW G A P+ + ID L + +G + E + T
Sbjct: 238 REG-----NEPDIFWDALSGKAEYPKG--KEIQGFIDDPHLFALKITRGDFKVKEIYNYT 396
Query: 222 KELLDTNKCYILDCGLEVFVWI 243
++ L T +LDC E++VW+
Sbjct: 397 QDDLITEDVLLLDCQREIYVWV 462
>TC227708 weakly similar to UP|Q9LVC6 (Q9LVC6) Villin, partial (17%)
Length = 1262
Score = 38.5 bits (88), Expect = 0.005
Identities = 44/168 (26%), Positives = 75/168 (44%), Gaps = 11/168 (6%)
Frame = +1
Query: 178 LFGGFAPLPRKTVSDDDKTIDS---HPPKLLCVEKGKAEPFETDSLTKELLDTNKCYILD 234
+F G + R+++ K DS H +C+ E E + T++ L T +LD
Sbjct: 124 IFSGMLSVERQSIQRAKKFKDS*MIH----ICLH*K*REVKEIYNYTQDDLITEDILLLD 291
Query: 235 CGLEVFVWIGRNTSLDERKSASG------STDELVS--STNRPKSQIIRVMEGFETVMFR 286
C E++VW+G ++++ ++ A D LV S N P I V EG E F
Sbjct: 292 CQREIYVWVGLHSAIKSKQEALNLGLKFLEMDVLVEGLSMNIP---IYIVTEGHEPPFF- 459
Query: 287 SKFDSWPQTTNAAMPEDGRGKVAALLKRQGLDVKGLVKADPVKEEPQP 334
++F SW + K+ A+LK + ++G + P+K +P
Sbjct: 460 TRFFSWDHSKENIFGNSFERKL-AILKGKPKSLEGHNRT-PLKANSRP 597
>BG507436 similar to GP|8777365|dbj| villin {Arabidopsis thaliana}, partial
(7%)
Length = 426
Score = 32.7 bits (73), Expect = 0.26
Identities = 14/40 (35%), Positives = 21/40 (52%)
Frame = +2
Query: 216 ETDSLTKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSA 255
E + +++ L T Y LDC E+FVW+G+ R A
Sbjct: 221 EIHNFSQDDLMTEDIYTLDCHSEIFVWVGQQVDSKSRMQA 340
>TC229162
Length = 1027
Score = 31.6 bits (70), Expect = 0.57
Identities = 16/60 (26%), Positives = 31/60 (51%), Gaps = 5/60 (8%)
Frame = +1
Query: 225 LDTNKCYILDCGLEVFVWIGRNTSLDERKSASG-----STDELVSSTNRPKSQIIRVMEG 279
+ ++ +LD G +VF+W+G + DE +SA+ + E ++ P +I+ EG
Sbjct: 331 MQSDAAVVLDHGTDVFIWLGAELAADEGRSAAALAACRTLAEELTEYRFPAPRILAFKEG 510
>TC227707 similar to UP|VIL1_ARATH (O81643) Villin 1, partial (5%)
Length = 457
Score = 30.8 bits (68), Expect = 0.98
Identities = 12/40 (30%), Positives = 25/40 (62%)
Frame = +1
Query: 216 ETDSLTKELLDTNKCYILDCGLEVFVWIGRNTSLDERKSA 255
E + T++ L T +LDC E++VW+G ++++ ++ A
Sbjct: 328 EIYNYTQDDLITEDVLLLDCQREIYVWVGLHSAVKSKQEA 447
>TC226462 homologue to UP|PCNA_PEA (O82134) Proliferating cell nuclear
antigen, complete
Length = 1202
Score = 30.0 bits (66), Expect = 1.7
Identities = 25/80 (31%), Positives = 40/80 (49%), Gaps = 1/80 (1%)
Frame = -2
Query: 17 LGKDTSQDEAGAAAIKTVELDAVLGGRAVQYREVQGHETQKFLSYFKPCIIPQEGGA-AS 75
LG + DE A + +E + GG AV+ R V HE L F+ + QEG +
Sbjct: 280 LGSEEESDEGDVAGVHGLEGEP--GGGAVEVRVV--HE---LLDGFQNLL--QEGALHET 128
Query: 76 GFKHVEAEEHKTRLFVCKGK 95
F+H++ ++ R+F C G+
Sbjct: 127 EFQHLDDDDRTKRVFFCDGE 68
>TC220538 similar to UP|Q9FHM5 (Q9FHM5) Similarity to DNA-binding protein,
partial (41%)
Length = 825
Score = 28.5 bits (62), Expect = 4.8
Identities = 20/51 (39%), Positives = 27/51 (52%), Gaps = 1/51 (1%)
Frame = +3
Query: 164 GRLMADSESGEFW-GLFGGFAPLPRKTVSDDDKTIDSHPPKLLCVEKGKAE 213
GR + ++ E W GL GG V+D D +DS +LL +EKGK E
Sbjct: 108 GREEEEGQA*EVWTGLQGG-------AVADADIGVDSVDRRLLSLEKGKRE 239
>BM086603
Length = 434
Score = 28.5 bits (62), Expect = 4.8
Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 3/59 (5%)
Frame = +2
Query: 79 HVEAEEHKTRLFVCKGKHVVYVKEVPFARSSLNH--DDIFIL-DTESKIFQFNGSNSSI 134
H+E+ + V +G+H+V VK + LN+ D++ +L TE KI FN S +
Sbjct: 218 HLESVLANEPVVVIRGQHIVEVKPQVYLN*GLNN*TDNVCLLPQTERKIVHFNWDYSCL 394
>TC206651 UP|Q949H4 (Q949H4) Leaf ubiquitous urease , complete
Length = 2884
Score = 28.5 bits (62), Expect = 4.8
Identities = 31/140 (22%), Positives = 52/140 (37%)
Frame = +1
Query: 2 TASKSGALRHDIHYWLGKDTSQDEAGAAAIKTVELDAVLGGRAVQYREVQGHETQKFLSY 61
T ++SG + H I + G+ I T + GG A +V G +
Sbjct: 1732 TLNESGFVEHTIAAFKGR----------TIHTYHSEGAGGGHAPDIIKVCGEKN------ 1863
Query: 62 FKPCIIPQEGGAASGFKHVEAEEHKTRLFVCKGKHVVYVKEVPFARSSLNHDDIFILDTE 121
++P + H +EH L VC + ++V FA S + + I D
Sbjct: 1864 ----VLPSSTNPTRPYTHNTIDEHLDMLMVCHHLNKNIPEDVAFAESRIRAETIAAED-- 2025
Query: 122 SKIFQFNGSNSSIQERAKAL 141
I G+ S I ++A+
Sbjct: 2026 --ILHDKGAISIISSDSQAM 2079
>TC230851 weakly similar to UP|Q7Z5Q7 (Q7Z5Q7) Lung cancer oncogene 5,
partial (20%)
Length = 591
Score = 28.1 bits (61), Expect = 6.3
Identities = 23/79 (29%), Positives = 34/79 (42%), Gaps = 2/79 (2%)
Frame = +3
Query: 85 HKTRLFVCKGKHVVYVKE--VPFARSSLNHDDIFILDTESKIFQFNGSNSSIQERAKALE 142
H +FVC G HV Y+ E + + R I +L + + GSN++I+ E
Sbjct: 45 HNLTIFVCVGMHVCYIGEFYLGYCRRPKTSFLISVLKAQE---EKRGSNTNIKMDEATRE 215
Query: 143 VVQYIKDTYHDGKCEVASI 161
K TY CE S+
Sbjct: 216 -----KTTYKRITCEEKSV 257
>TC234144
Length = 468
Score = 27.7 bits (60), Expect = 8.3
Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Frame = +1
Query: 58 FLSYFKPCII---PQEGGAASGFKHVEAEEHKTRLFVCK 93
FL+YF PC+I + A G KH+ A H RL++ K
Sbjct: 238 FLNYFNPCLI*RWSRLEHHAGGIKHL-AA*HSDRLYISK 351
>TC206031 similar to UP|Q38949 (Q38949) Rof1 , partial (68%)
Length = 1350
Score = 27.7 bits (60), Expect = 8.3
Identities = 20/56 (35%), Positives = 28/56 (49%), Gaps = 3/56 (5%)
Frame = +2
Query: 159 ASIEDGRLMADSESGEFWGLFGGFAPL---PRKTVSDDDKTIDSHPPKLLCVEKGK 211
A +EDG L+A S+ EF G F P KT+ +K + + P+ EKGK
Sbjct: 29 AHLEDGTLVAKSDGVEFTVNDGHFCPAFSKAVKTMKKGEKVLLTVKPQYGFGEKGK 196
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.316 0.135 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 14,793,473
Number of Sequences: 63676
Number of extensions: 190865
Number of successful extensions: 775
Number of sequences better than 10.0: 34
Number of HSP's better than 10.0 without gapping: 767
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 770
length of query: 355
length of database: 12,639,632
effective HSP length: 98
effective length of query: 257
effective length of database: 6,399,384
effective search space: 1644641688
effective search space used: 1644641688
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (27.3 bits)
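
The trailer values above are enough to reproduce the scores in the hit list approximately. The following sketch is illustrative only and not part of the BLAST output. It applies the standard Karlin-Altschul relations (bit score S' = (lambda*S - ln K) / ln 2 and E = m*n*2^(-S')) using the gapped Lambda and K and the effective lengths printed in this report. The constant and function names are hypothetical. Because the printed Lambda and K are rounded, the results match the report only approximately.

```python
import math

# Values copied from the report trailer (gapped parameters, BLOSUM62 11/1).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
EFFECTIVE_QUERY_LENGTH = 257
EFFECTIVE_DB_LENGTH = 6_399_384


def effective_search_space(query_len: int, db_len: int) -> int:
    """Effective search space: product of the two effective lengths."""
    return query_len * db_len


def bit_score(raw_score: int, lam: float = LAMBDA_GAPPED, k: float = K_GAPPED) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score: int, search_space: int) -> float:
    """Expected number of chance HSPs scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    space = effective_search_space(EFFECTIVE_QUERY_LENGTH, EFFECTIVE_DB_LENGTH)
    print("effective search space:", space)
    # Raw scores taken from the HSP headers of BU548249 and TC219145.
    for raw in (519, 165):
        print(raw, f"{bit_score(raw):.1f} bits", f"E = {expect_value(raw, space):.1e}")
```

The product of the effective lengths reproduces the "effective search space" line exactly. The bit scores and E-values agree with the reported figures to within the rounding of Lambda and K.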