Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002435A_C01 KMC002435A_c01
(1031 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_199096.1| splicing factor U2AF small subunit, putative; p... 244 1e-82
gb|AAL06332.1|AF409140_1 U2 auxiliary factor small subunit [Arab... 237 1e-80
ref|NP_174086.1| splicing factor U2AF small subunit, putative; p... 236 2e-79
gb|AAL06331.1|AF409139_1 U2 auxiliary factor small subunit [Arab... 230 1e-77
emb|CAA77132.1| U2 snRNP auxiliary factor, small subunit [Oryza ... 225 3e-76
>ref|NP_199096.1| splicing factor U2AF small subunit, putative; protein id:
At5g42820.1, supported by cDNA: gi_15723292 [Arabidopsis
thaliana] gi|22531195|gb|AAM97101.1| U2 snRNP auxiliary
factor small subunit [Arabidopsis thaliana]
gi|23198022|gb|AAN15538.1| U2 snRNP auxiliary factor
small subunit [Arabidopsis thaliana]
Length = 283
Score = 244 bits (623), Expect(2) = 1e-82
Identities = 136/189 (71%), Positives = 147/189 (76%), Gaps = 12/189 (6%)
Frame = -3
Query: 903 LRTLNVCDNLADHMIGNVYVQFREEDEAAKALAALHGRFYSGRPIIAEFSPVTDFREATC 724
+ +LNVCDNLADHMIGNVYV F+EED AA AL AL GRFYSGRPIIA+FSPVTDFREATC
Sbjct: 95 VESLNVCDNLADHMIGNVYVLFKEEDHAAAALQALQGRFYSGRPIIADFSPVTDFREATC 154
Query: 723 RQYEENSCNRGGYCNFMHVKLIGRDLRRKLFSSH--SFRIRSRSPVRRSRSRSPPRRR-G 553
RQYEENSCNRGGYCNFMHVK I R+LRRKLF + S+R SRS RSRS SP R+R
Sbjct: 155 RQYEENSCNRGGYCNFMHVKQISRELRRKLFGRYRRSYRRGSRS---RSRSISPRRKREH 211
Query: 552 SMDRERRH-RDRDYDSRGRRSSDR-RSSDRDGGGRRRHGG-----SP--AREGSEERRAR 400
S +RER RDRD G+RSSDR DRDGGGRRRHG SP REGSEERRAR
Sbjct: 212 SRERERGDVRDRDRHGNGKRSSDRSERHDRDGGGRRRHGSPKRSRSPRNVREGSEERRAR 271
Query: 399 IEQWNRERE 373
IEQWNRER+
Sbjct: 272 IEQWNRERD 280
Score = 85.9 bits (211), Expect(2) = 1e-82
Identities = 37/45 (82%), Positives = 42/45 (93%)
Frame = -2
Query: 1030 QRPDMITPGVDPQGQPIDPREIQQHFEDFYEDIFLELSKFGEIEN 896
QRPDMITPGVDPQGQP+DP +IQ HFEDFYEDIF EL+KFGE+E+
Sbjct: 53 QRPDMITPGVDPQGQPLDPSKIQDHFEDFYEDIFEELNKFGEVES 97
>gb|AAL06332.1|AF409140_1 U2 auxiliary factor small subunit [Arabidopsis thaliana]
Length = 283
Score = 237 bits (605), Expect(2) = 1e-80
Identities = 134/189 (70%), Positives = 145/189 (75%), Gaps = 12/189 (6%)
Frame = -3
Query: 903 LRTLNVCDNLADHMIGNVYVQFREEDEAAKALAALHGRFYSGRPIIAEFSPVTDFREATC 724
+ +LNVC NLADHMIGNVYV F+EED AA AL AL GRFYSGRPIIA+FSPVTDFREATC
Sbjct: 95 VESLNVCVNLADHMIGNVYVLFKEEDHAAAALQALQGRFYSGRPIIADFSPVTDFREATC 154
Query: 723 RQYEENSCNRGGYCNFMHVKLIGRDLRRKLFSSH--SFRIRSRSPVRRSRSRSPPRRR-G 553
RQYEENSCNRGG CNFMHVK I R+LRRKLF + S+R SRS RSRS SP R+R
Sbjct: 155 RQYEENSCNRGGCCNFMHVKQISRELRRKLFGRYRRSYRRGSRS---RSRSISPRRKREH 211
Query: 552 SMDRERRH-RDRDYDSRGRRSSDRRSS-DRDGGGRRRHGG-----SP--AREGSEERRAR 400
S +RER RDRD G+RSSDR DRDGGGRRRHG SP REGSEERRAR
Sbjct: 212 SRERERGDVRDRDRHGNGKRSSDRSERYDRDGGGRRRHGSPKRSRSPRNVREGSEERRAR 271
Query: 399 IEQWNRERE 373
IEQWNRER+
Sbjct: 272 IEQWNRERD 280
Score = 85.9 bits (211), Expect(2) = 1e-80
Identities = 37/45 (82%), Positives = 42/45 (93%)
Frame = -2
Query: 1030 QRPDMITPGVDPQGQPIDPREIQQHFEDFYEDIFLELSKFGEIEN 896
QRPDMITPGVDPQGQP+DP +IQ HFEDFYEDIF EL+KFGE+E+
Sbjct: 53 QRPDMITPGVDPQGQPLDPSKIQDHFEDFYEDIFEELNKFGEVES 97
>ref|NP_174086.1| splicing factor U2AF small subunit, putative; protein id:
At1g27650.1, supported by cDNA: 7697., supported by
cDNA: gi_12744990, supported by cDNA: gi_15723290,
supported by cDNA: gi_17528935, supported by cDNA:
gi_19699274, supported by cDNA: gi_20465942 [Arabidopsis
thaliana] gi|5668775|gb|AAD46002.1|AC005916_14 Strong
similarity to gb|Y18349 U2 snRNP auxiliary factor, small
subunit from Oryza sativa. ESTs gb|AA586295 and
gb|AA597332 come from this gene. [Arabidopsis thaliana]
gi|6693017|gb|AAF24943.1|AC012375_6 T22C5.10
[Arabidopsis thaliana]
gi|12744991|gb|AAK06875.1|AF344324_1 putative U2 snRNP
auxiliary factor [Arabidopsis thaliana]
gi|17528936|gb|AAL38678.1| putative U2 snRNP auxiliary
factor [Arabidopsis thaliana] gi|19699275|gb|AAL91249.1|
At1g27650/T22C5_2 [Arabidopsis thaliana]
gi|20465943|gb|AAM20157.1| putative U2 snRNP auxiliary
factor protein [Arabidopsis thaliana]
gi|21595106|gb|AAM66073.1| putative U2 snRNP auxiliary
factor [Arabidopsis thaliana] gi|21689611|gb|AAM67427.1|
At1g27650/T22C5_2 [Arabidopsis thaliana]
Length = 296
Score = 236 bits (602), Expect(2) = 2e-79
Identities = 135/202 (66%), Positives = 151/202 (73%), Gaps = 21/202 (10%)
Frame = -3
Query: 903 LRTLNVCDNLADHMIGNVYVQFREEDEAAKALAALHGRFYSGRPIIAEFSPVTDFREATC 724
+ +LN+CDNLADHMIGNVYVQF+EED+AA AL AL GRFYSGRPIIA+FSPVTDFREATC
Sbjct: 95 IESLNICDNLADHMIGNVYVQFKEEDQAAAALQALQGRFYSGRPIIADFSPVTDFREATC 154
Query: 723 RQYEENSCNRGGYCNFMHVKLIGRDLRRKLFSSH--SFRIRSRSPVRRSRSRS-PPR--- 562
RQYEEN+CNRGGYCNFMHVKL+ R+LRRKLF + S+R SRS RSRSRS PR
Sbjct: 155 RQYEENNCNRGGYCNFMHVKLVSRELRRKLFGRYRRSYRRGSRS---RSRSRSISPRNKR 211
Query: 561 ---RRGSMDRERRHRDRDYD----SRGRRSSDR-RSSDRDGG-GRR----RHGGSP--AR 427
RR RE HRDRD + G+RSS+R +RDG GRR + GGSP R
Sbjct: 212 DNDRRDPSHREFSHRDRDREFYRHGSGKRSSERSERQERDGSRGRRQASPKRGGSPGGGR 271
Query: 426 EGSEERRARIEQWNREREGKLE 361
EGSEERRARIEQWNRERE K E
Sbjct: 272 EGSEERRARIEQWNREREEKEE 293
Score = 83.6 bits (205), Expect(2) = 2e-79
Identities = 36/45 (80%), Positives = 42/45 (93%)
Frame = -2
Query: 1030 QRPDMITPGVDPQGQPIDPREIQQHFEDFYEDIFLELSKFGEIEN 896
QRPDMITPGVD QGQP+DPR+IQ+HFEDF+ED+F EL KFGEIE+
Sbjct: 53 QRPDMITPGVDAQGQPLDPRKIQEHFEDFFEDLFEELGKFGEIES 97
>gb|AAL06331.1|AF409139_1 U2 auxiliary factor small subunit [Arabidopsis thaliana]
Length = 296
Score = 230 bits (586), Expect(2) = 1e-77
Identities = 133/202 (65%), Positives = 149/202 (72%), Gaps = 21/202 (10%)
Frame = -3
Query: 903 LRTLNVCDNLADHMIGNVYVQFREEDEAAKALAALHGRFYSGRPIIAEFSPVTDFREATC 724
+ +LN+CDNLADHMIGNVYVQF+EED+AA AL AL GRFYSGRPIIA+FSPVTDFREATC
Sbjct: 95 IESLNICDNLADHMIGNVYVQFKEEDQAAAALQALQGRFYSGRPIIADFSPVTDFREATC 154
Query: 723 RQYEENSCNRGGYCNFMHVKLIGRDLRRKLFSSH--SFRIRSRSPVRRSRSRS-PPR--- 562
RQYEEN+C RGGYCNFMHVKL+ R+LRRKL + S+R SRS RSRSRS PR
Sbjct: 155 RQYEENNCYRGGYCNFMHVKLVSRELRRKLSGRYRRSYRRGSRS---RSRSRSISPRNKR 211
Query: 561 ---RRGSMDRERRHRDRDYD----SRGRRSSDR-RSSDRDGG-GRR----RHGGSP--AR 427
RR RE HRDRD + G+RSS+R +RDG GRR + GGSP R
Sbjct: 212 DNDRRDPSHREFSHRDRDREFYRHGSGKRSSERSERQERDGSRGRRQASPKRGGSPGGGR 271
Query: 426 EGSEERRARIEQWNREREGKLE 361
EGSEERRARIEQWNRERE K E
Sbjct: 272 EGSEERRARIEQWNREREEKEE 293
Score = 83.6 bits (205), Expect(2) = 1e-77
Identities = 36/45 (80%), Positives = 42/45 (93%)
Frame = -2
Query: 1030 QRPDMITPGVDPQGQPIDPREIQQHFEDFYEDIFLELSKFGEIEN 896
QRPDMITPGVD QGQP+DPR+IQ+HFEDF+ED+F EL KFGEIE+
Sbjct: 53 QRPDMITPGVDAQGQPLDPRKIQEHFEDFFEDLFEELGKFGEIES 97
>emb|CAA77132.1| U2 snRNP auxiliary factor, small subunit [Oryza sativa]
Length = 301
Score = 225 bits (574), Expect(2) = 3e-76
Identities = 126/212 (59%), Positives = 136/212 (63%), Gaps = 35/212 (16%)
Frame = -3
Query: 903 LRTLNVCDNLADHMIGNVYVQFREEDEAAKALAALHGRFYSGRPIIAEFSPVTDFREATC 724
+ TLNVCDNLADHMIGNVYVQFREE++A A AL GRFYSGRPII E+SPVTDFREATC
Sbjct: 95 VETLNVCDNLADHMIGNVYVQFREEEQAVAAHNALQGRFYSGRPIIVEYSPVTDFREATC 154
Query: 723 RQYEENSCNRGGYCNFMHVKLIGRDLRRKLFSSHSFRIRSRSPVRRSRSRSPPRRRGSMD 544
RQ+EENSCNRGGYCNFMHVK IGR+LRRKL+ RSR RSRS SP RRG+ D
Sbjct: 155 RQFEENSCNRGGYCNFMHVKQIGRELRRKLYGG-----RSRRSHGRSRSPSPRHRRGNRD 209
Query: 543 RE--RRHRD---------------------------RDYDSRGRRSSDRRSSDRDGGGRR 451
R+ RR RD R GRR R D GGRR
Sbjct: 210 RDDFRRERDGYRGGGDGYRGGGGGGGGDGYRGGDSYRGGGGGGRRGGGSRYDRYDDGGRR 269
Query: 450 RHGG------SPAREGSEERRARIEQWNRERE 373
RHG SP RE SEERRA+IEQWNRERE
Sbjct: 270 RHGSPPRRARSPVRESSEERRAKIEQWNRERE 301
Score = 83.2 bits (204), Expect(2) = 3e-76
Identities = 36/44 (81%), Positives = 41/44 (92%)
Frame = -2
Query: 1030 QRPDMITPGVDPQGQPIDPREIQQHFEDFYEDIFLELSKFGEIE 899
QRPDMITPGVD QGQPIDP ++Q+HFEDFYEDI+ ELSKFGE+E
Sbjct: 53 QRPDMITPGVDAQGQPIDPEKMQEHFEDFYEDIYEELSKFGEVE 96
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 954,085,950
Number of Sequences: 1393205
Number of extensions: 24773512
Number of successful extensions: 180183
Number of sequences better than 10.0: 3986
Number of HSP's better than 10.0 without gapping: 101865
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 146984
length of database: 448,689,247
effective HSP length: 124
effective length of database: 275,931,827
effective search space used: 60429070113
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)