
BLAST2 result
TBLASTN 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC126782.9 + phase: 1 /pseudo
(871 letters)
Database: GMGI
63,676 sequences; 37,918,896 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
BE661681 310 2e-84
TC223260 249 4e-66
CO982525 214 1e-55
TC229985 110 3e-24
TC209437 52 9e-07
TC207639 similar to UP|O48592 (O48592) GT2 protein, partial (24%) 48 2e-05
TC208893 similar to UP|O48592 (O48592) GT2 protein, partial (11%) 46 6e-05
NP426645 GT-2 factor [Glycine max] 45 2e-04
TC222513 42 9e-04
BE607585 42 0.002
TC233765 similar to UP|Q8WZS6 (Q8WZS6) Related to BRR5 (Componen... 42 0.002
TC211199 similar to UP|Q6Z1Z7 (Q6Z1Z7) PWWP domain protein-like,... 39 0.008
AW596787 similar to GP|18182311|gb| GT-2 factor {Glycine max}, p... 37 0.049
BU549108 36 0.064
BQ253306 35 0.11
TC210182 similar to UP|Q9AVE4 (Q9AVE4) DNA-binding protein DF1, ... 35 0.14
TC226738 weakly similar to UP|Q9SEE9 (Q9SEE9) Arginine/serine-ri... 35 0.19
TC207029 weakly similar to UP|Q7R8E7 (Q7R8E7) Ran-binding protei... 33 0.70
TC216149 weakly similar to UP|MNN4_YEAST (P36044) MNN4 protein, ... 32 0.92
TC218783 similar to UP|Q40363 (Q40363) NuM1 protein, partial (19%) 32 1.2
>BE661681
Length = 495
Score = 310 bits (794), Expect = 2e-84
Identities = 153/165 (92%), Positives = 162/165 (97%)
Frame = +1
Query: 330 VITTQFASNLHRIGSVKAAADLTGRKLVFVGMSLRTYLEAAWKDGKAPFDPSTLVKAEDI 389
VITTQFASNLHR+GSVKAAADLTGRKLVFVGMSLRTYL+AAWKDGKAP DPSTLVKAEDI
Sbjct: 1 VITTQFASNLHRLGSVKAAADLTGRKLVFVGMSLRTYLDAAWKDGKAPIDPSTLVKAEDI 180
Query: 390 DAYAPKDLLIVTTGSQAEPRAALNLASFGSSHAFKLTKEDIVLYSAKVIPGNESRVMEMM 449
DAYAPKDLLIVTTGSQAEPRAALNLAS+GSSHAFKL KED+VLYSAKVIPGNESRVM+M+
Sbjct: 181 DAYAPKDLLIVTTGSQAEPRAALNLASYGSSHAFKLAKEDVVLYSAKVIPGNESRVMKML 360
Query: 450 NRISEIGSTIVMGRNENLHTSGHAYRGELEEVLRIVKPQHFLPVH 494
NRISEIGSTIVMG+NE LHTSGHAYRGELEE+LRIVKPQHFLP+H
Sbjct: 361 NRISEIGSTIVMGKNEGLHTSGHAYRGELEEILRIVKPQHFLPIH 495
>TC223260
Length = 424
Score = 249 bits (636), Expect = 4e-66
Identities = 122/141 (86%), Positives = 133/141 (93%)
Frame = +2
Query: 489 HFLPVHGEYLFLKEHESLGKSTGIRHTAVIKNGEMLGVSHLRNRRVLSNGFISLGKENLQ 548
HFLP+HGE LFLKEHE LGKSTGIRHT VIKNGEMLGVSHLRNRRVLSNGFISLGKENLQ
Sbjct: 2 HFLPIHGELLFLKEHELLGKSTGIRHTTVIKNGEMLGVSHLRNRRVLSNGFISLGKENLQ 181
Query: 549 LKYSDGDKAFGTSGELFLDERMRIALDGIIVVSMEIFRPKNLESLAGNTLKGKIRITTRC 608
L YSDG+KAFGTS +LF+DER++IALDGIIVV+MEIFRP+NL+S NTLKGKIRITTRC
Sbjct: 182 LMYSDGEKAFGTSSDLFIDERLKIALDGIIVVNMEIFRPQNLDSPVENTLKGKIRITTRC 361
Query: 609 LWLDKGKLLDALYKAAHAALS 629
LWLDKGKLLDAL+KAAHAAL+
Sbjct: 362 LWLDKGKLLDALHKAAHAALA 424
>CO982525
Length = 773
Score = 214 bits (546), Expect = 1e-55
Identities = 107/142 (75%), Positives = 121/142 (84%), Gaps = 1/142 (0%)
Frame = -2
Query: 728 ETITTAAEGDLSDSGESDEFWKPFIAS-SVEKSIKANNGYVSRKEHKSNTKQDDSEDIDE 786
+ + AEGDL +S +SDEFWKPFI S VEKSI A+N YVS+KE KSN K+DDSEDIDE
Sbjct: 772 DNTASGAEGDLXESEDSDEFWKPFITSLPVEKSISADNSYVSQKEQKSNLKKDDSEDIDE 593
Query: 787 AKSEEMSDSEPESSKSEKKNKWKTEEVKKLIDLRSDLRDRFKVVKGRMALWEEISQSLLA 846
AKSEE S+SEP+ SKS K+NKWKTEEVKKLI +R +L DRF+VVKGRMALWEEISQ LLA
Sbjct: 592 AKSEETSNSEPKLSKSVKRNKWKTEEVKKLIGMRGELSDRFQVVKGRMALWEEISQKLLA 413
Query: 847 DGISRSPGQCKSLWTSLALKYE 868
DGISRSPGQCKSLWTSL +KYE
Sbjct: 412 DGISRSPGQCKSLWTSLVVKYE 347
>TC229985
Length = 777
Score = 110 bits (275), Expect = 3e-24
Identities = 52/72 (72%), Positives = 60/72 (83%)
Frame = +2
Query: 797 PESSKSEKKNKWKTEEVKKLIDLRSDLRDRFKVVKGRMALWEEISQSLLADGISRSPGQC 856
P +++ WK EEVKKLID+R +L DRF+VVKGRMALWEEISQ+LLA+GISRSPGQC
Sbjct: 77 PNLQNQQREITWKHEEVKKLIDMRGELNDRFQVVKGRMALWEEISQNLLANGISRSPGQC 256
Query: 857 KSLWTSLALKYE 868
KSLWTSL KYE
Sbjct: 257 KSLWTSLLQKYE 292
Score = 40.8 bits (94), Expect = 0.003
Identities = 20/38 (52%), Positives = 25/38 (65%)
Frame = +1
Query: 773 KSNTKQDDSEDIDEAKSEEMSDSEPESSKSEKKNKWKT 810
+S K D SED +E S SDSEP+SSKS K+N +T
Sbjct: 4 RSPLKDDCSEDTEECNSVNTSDSEPKSSKSAKRNNMET 117
>TC209437
Length = 649
Score = 52.4 bits (124), Expect = 9e-07
Identities = 23/28 (82%), Positives = 25/28 (89%)
Frame = +2
Query: 841 SQSLLADGISRSPGQCKSLWTSLALKYE 868
+Q LLADGI RSPGQCKSLWTSL +KYE
Sbjct: 8 AQKLLADGICRSPGQCKSLWTSLVVKYE 91
>TC207639 similar to UP|O48592 (O48592) GT2 protein, partial (24%)
Length = 1689
Score = 47.8 bits (112), Expect = 2e-05
Identities = 23/67 (34%), Positives = 36/67 (53%)
Frame = +1
Query: 797 PESSKSEKKNKWKTEEVKKLIDLRSDLRDRFKVVKGRMALWEEISQSLLADGISRSPGQC 856
P S S ++W EV LI LR+ L +++ + LWE+IS +L G +RS +C
Sbjct: 463 PSSLSSLSSSRWPKTEVHALIRLRTSLETKYQENGPKTPLWEDISAGMLRLGYNRSAKRC 642
Query: 857 KSLWTSL 863
K W ++
Sbjct: 643 KEKWENI 663
>TC208893 similar to UP|O48592 (O48592) GT2 protein, partial (11%)
Length = 853
Score = 46.2 bits (108), Expect = 6e-05
Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 10/103 (9%)
Frame = +3
Query: 771 EHKSNTKQDDSEDIDEAKSEEMSDSEP---------ESSKSEKKNKWKTEEVKKLIDLR- 820
EH ++ + +I +S +S +P E+SK + +W +EV LI+LR
Sbjct: 3 EHNPSSSLNSHNNIIPLESNSVSTYKPTSTTPMASSENSKDDIGRRWPRDEVLALINLRC 182
Query: 821 SDLRDRFKVVKGRMALWEEISQSLLADGISRSPGQCKSLWTSL 863
+ L + + LWE ISQ + A G RS +CK W ++
Sbjct: 183 TSLSSNEEKEGNKGPLWERISQGMSALGYKRSAKRCKEKWENI 311
>NP426645 GT-2 factor [Glycine max]
Length = 767
Score = 44.7 bits (104), Expect = 2e-04
Identities = 26/82 (31%), Positives = 40/82 (48%)
Frame = +1
Query: 789 SEEMSDSEPESSKSEKKNKWKTEEVKKLIDLRSDLRDRFKVVKGRMALWEEISQSLLADG 848
S D E KS N+W +E L+ +RSD+ F+ + LWEE+S+ L G
Sbjct: 76 SNSGDDRVEEGDKSFGGNRWPRQETLALLKIRSDMDVAFRDASVKGPLWEEVSRKLAELG 255
Query: 849 ISRSPGQCKSLWTSLALKYEVR 870
R+ +CK + ++ KY R
Sbjct: 256 YHRNAKKCKEKFENV-YKYHKR 318
>TC222513
Length = 455
Score = 42.4 bits (98), Expect = 9e-04
Identities = 23/98 (23%), Positives = 46/98 (46%), Gaps = 11/98 (11%)
Frame = +3
Query: 782 EDIDEAKSEEMSDSEPESSKSEKKNKWKTEEVKKLIDLRSDLRDRFKVVKGRMA------ 835
+++ + ++ D E K+ + +W +E+ LI + D ++F+ +GR A
Sbjct: 84 QEVGQNRNTNGVDGGDEGGKAPRLPRWTRQEILVLIQGKRDAENKFR--RGRTAGLAFGS 257
Query: 836 -----LWEEISQSLLADGISRSPGQCKSLWTSLALKYE 868
W +S G++R P QC+ W++LA Y+
Sbjct: 258 GQVEPKWASVSSYCRKHGVNRGPVQCRKRWSNLAGDYK 371
>BE607585
Length = 443
Score = 41.6 bits (96), Expect = 0.002
Identities = 22/69 (31%), Positives = 36/69 (51%), Gaps = 4/69 (5%)
Frame = +1
Query: 799 SSKSEKKNKWKTEEVKKLIDLR----SDLRDRFKVVKGRMALWEEISQSLLADGISRSPG 854
+ K + +W +EV LI+LR ++ + K ++ LWE ISQ +L G RS
Sbjct: 148 NEKDDVGRRWPKDEVLALINLRCTSVNNNNNEEKEGNNKVPLWERISQGMLELGYKRSAK 327
Query: 855 QCKSLWTSL 863
+CK W ++
Sbjct: 328 RCKEKWENI 354
>TC233765 similar to UP|Q8WZS6 (Q8WZS6) Related to BRR5 (Component of
pre-mRNA polyadenylation factor PF I), partial (9%)
Length = 423
Score = 41.6 bits (96), Expect = 0.002
Identities = 26/94 (27%), Positives = 42/94 (44%)
Frame = +2
Query: 106 DGPPLRVLPIGGLGEIGMNCMLVGNHDRYILIDAGIMFPDYDDLGVQKIIPDTTFIRKWS 165
D + V P+G E+G +C+ + + IL D GI LG + F
Sbjct: 23 DDEVMIVTPLGAGNEVGRSCVYMSYKGKSILFDCGI------HLGFSGMSALPYFDEIDP 184
Query: 166 HKIEALVITHGHEDHIGALPWVIPALDSNTPIFA 199
++ L+ITH H DH +LP+ + + P F+
Sbjct: 185 STLDVLLITHFHLDHAASLPYFLEKTNLRRPAFS 286
>TC211199 similar to UP|Q6Z1Z7 (Q6Z1Z7) PWWP domain protein-like, partial
(8%)
Length = 914
Score = 39.3 bits (90), Expect = 0.008
Identities = 20/69 (28%), Positives = 34/69 (48%), Gaps = 7/69 (10%)
Frame = +3
Query: 807 KWKTEEVKKLIDLRSDLRDRFKVVKGRMAL-------WEEISQSLLADGISRSPGQCKSL 859
+W +E+ LI +SD RF+ +G + W +S G++R P QC+
Sbjct: 264 RWTRQEILVLIQGKSDAESRFRPGRGSGSAFGSSEPKWALVSSYCKKHGVNREPVQCRKR 443
Query: 860 WTSLALKYE 868
W++LA Y+
Sbjct: 444 WSNLAGDYK 470
>AW596787 similar to GP|18182311|gb| GT-2 factor {Glycine max}, partial (34%)
Length = 411
Score = 36.6 bits (83), Expect = 0.049
Identities = 20/64 (31%), Positives = 34/64 (52%)
Frame = +1
Query: 807 KWKTEEVKKLIDLRSDLRDRFKVVKGRMALWEEISQSLLADGISRSPGQCKSLWTSLALK 866
+W +E L+ +RSD+ F+ + LWEE+++ L G RS +CK + ++ K
Sbjct: 1 RWPRQETLALLKIRSDMDAVFRDSSLKGPLWEEVARKLSELGYHRSAKKCKEKFENV-YK 177
Query: 867 YEVR 870
Y R
Sbjct: 178 YHKR 189
>BU549108
Length = 615
Score = 36.2 bits (82), Expect = 0.064
Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 9/75 (12%)
Frame = -1
Query: 798 ESSKSEKKNKWKTEEVKKLIDLR---------SDLRDRFKVVKGRMALWEEISQSLLADG 848
E+SK + +W +EV LI R ++ + K + LWE ISQ + G
Sbjct: 612 ENSKDDIGRRWPRDEVLALIXXRCTXXSSSNNNNNNNEEKEGNNKGPLWERISQGMSELG 433
Query: 849 ISRSPGQCKSLWTSL 863
RS +CK W ++
Sbjct: 432 YKRSAKRCKEKWENI 388
>BQ253306
Length = 422
Score = 35.4 bits (80), Expect = 0.11
Identities = 16/48 (33%), Positives = 29/48 (60%)
Frame = +1
Query: 816 LIDLRSDLRDRFKVVKGRMALWEEISQSLLADGISRSPGQCKSLWTSL 863
LI LR+ + ++++ + LWEEIS S+ G +R+ +CK W ++
Sbjct: 7 LIKLRTSMDEKYQENGPKGPLWEEISASMKKLGYNRNAKRCKEKWENI 150
>TC210182 similar to UP|Q9AVE4 (Q9AVE4) DNA-binding protein DF1, partial
(13%)
Length = 918
Score = 35.0 bits (79), Expect = 0.14
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 7/65 (10%)
Frame = +2
Query: 806 NKWKTEEVKKLIDLRSDL-------RDRFKVVKGRMALWEEISQSLLADGISRSPGQCKS 858
++W +EV+ LI LR+ + + + LWEEIS ++ + G RS +CK
Sbjct: 101 SRWPKDEVEALIRLRTQIDVQAQWNNNNNNNDGSKGPLWEEISSAMKSLGYDRSAKRCKE 280
Query: 859 LWTSL 863
W ++
Sbjct: 281 KWENI 295
>TC226738 weakly similar to UP|Q9SEE9 (Q9SEE9) Arginine/serine-rich protein,
partial (24%)
Length = 486
Score = 34.7 bits (78), Expect = 0.19
Identities = 20/63 (31%), Positives = 29/63 (45%), Gaps = 1/63 (1%)
Frame = +3
Query: 50 SKPTRLSVSASALSASVGNDGSTSRVPQKRRRRIEGPRKSMEDSVQRRMEQFY-EGNDGP 108
S P R + + V + R P RRR + PR+ + V+RR++ Y G D P
Sbjct: 114 SSPRRKPPPSPRRRSPVPRRAGSPRRPDSPRRRADSPRRRADSPVRRRLDSPYRRGGDTP 293
Query: 109 PLR 111
P R
Sbjct: 294 PRR 302
>TC207029 weakly similar to UP|Q7R8E7 (Q7R8E7) Ran-binding protein, partial
(19%)
Length = 712
Score = 32.7 bits (73), Expect = 0.70
Identities = 29/95 (30%), Positives = 44/95 (45%), Gaps = 7/95 (7%)
Frame = +1
Query: 728 ETITTAAEGDLSDSGESDEFWKPFIASSVEKSIKANNGYVSRKEHKSNTKQDDSEDIDEA 787
E T E +D GE DE K K + K+HK K+D E+ ++
Sbjct: 7 EETDTLKEKGKNDEGEDDE---------ENKKNKKKDKKEKEKDHKKEKKEDGKEEGEKE 159
Query: 788 KSE-EMS----DSEPESSKSEKKNKWKTE--EVKK 815
KS+ E+S D E + + EK++K K + EVK+
Sbjct: 160 KSKVEVSVRDIDIEETTKEGEKEDKEKDDGKEVKE 264
>TC216149 weakly similar to UP|MNN4_YEAST (P36044) MNN4 protein, partial (4%)
Length = 2040
Score = 32.3 bits (72), Expect = 0.92
Identities = 18/47 (38%), Positives = 27/47 (57%)
Frame = +3
Query: 769 RKEHKSNTKQDDSEDIDEAKSEEMSDSEPESSKSEKKNKWKTEEVKK 815
++E K K+++ + EAK EE EP+ + EKK + K EE KK
Sbjct: 1245 KEEPKKEEKKEEPKKEGEAKKEEEKKEEPK-KEEEKKEEPKKEEEKK 1382
>TC218783 similar to UP|Q40363 (Q40363) NuM1 protein, partial (19%)
Length = 870
Score = 32.0 bits (71), Expect = 1.2
Identities = 18/76 (23%), Positives = 36/76 (46%), Gaps = 1/76 (1%)
Frame = +3
Query: 733 AAEGDLSDSGESDEFWKPFIASSVEKSIKANNGYVSRKEHKSNTKQDDSE-DIDEAKSEE 791
+++ + S+S + D KP + + + S +A S S++ +D + DID+
Sbjct: 441 SSDDESSESSDEDNDAKPAVTAVSKPSARAQKKVESSDSDDSSSDEDKGKMDIDDDDDSS 620
Query: 792 MSDSEPESSKSEKKNK 807
EP+ K+ K +K
Sbjct: 621 DESEEPQKKKAVKNSK 668
Database: GMGI
Posted date: Oct 22, 2004 4:58 PM
Number of letters in database: 37,918,896
Number of sequences in database: 63,676
Lambda K H
0.318 0.135 0.387
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,329,064
Number of Sequences: 63676
Number of extensions: 426615
Number of successful extensions: 2380
Number of sequences better than 10.0: 58
Number of HSP's better than 10.0 without gapping: 2361
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 2377
length of query: 871
length of database: 12,639,632
effective HSP length: 106
effective length of query: 765
effective length of database: 5,889,976
effective search space: 4505831640
effective search space used: 4505831640
frameshift window, decay const: 50, 0.1
T: 13
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)
Medicago: description of AC126782.9