KMC012948A_c01
[Fasta Sequence]
[Nr Search]
[EST assemble image]
Fasta Sequence
>KMC012948A_C01 KMC012948A_c01
ctggagagtattgttctcactcgtcgctgccatcgCCACGATCTCCGATCCGATTCAGTT
TCTCGGGTACGCTATTCATCACTAGCTCACCGTGGAATACTTGATTTGAATTTCAACTCT
GATCTGAATGCATTTGGTTGCTTCTCTGTTACTAAGTTTTTGATTGAATTGAATCGAATT
ACCAGTGATTTTGATTGTTGTTCTATAAATGGGGACTGAAAATCCCGGTCGTCCAACCTT
CCCTGCAAGACCTGTTTCCTCACCATTCGCTGCAGCTCCTGCACCACAGACTGTAACACC
TTTTGCCTCCATGCCTGGATCAGACCCTCCTCCTCCTTTTCGACCTACGCCTCCGGCTCC
ACCGCAAGCATCCTCGCCGTTTTCATCGTTTGGACCGCCGGTTAGACCTGGGGCACCGAG
TTTTAGACCTGCCCCGCCTGGTAGATTCAATGACCCATCCGTGCCTCCACCTCCTCCTCC
CCAATCATCCAATTTCCTTCCTGCTGGAAATTTCCAGCAATTCTCAGC
Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC012948A_C01 KMC012948A_c01
(528 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAM97042.1| putative Sec24-like COPII protein [Arabidopsis th... 84 1e-15
ref|NP_187366.1| putative Sec24-like COPII protein; protein id: ... 84 1e-15
gb|EAA35680.1| predicted protein [Neurospora crassa] 70 2e-11
sp|P21997|SSGP_VOLCA SULFATED SURFACE GLYCOPROTEIN 185 (SSG 185)... 66 2e-10
ref|NP_176464.1| hypothetical protein; protein id: At1g62763.1 [... 66 2e-10
>gb|AAM97042.1| putative Sec24-like COPII protein [Arabidopsis thaliana]
gi|23197930|gb|AAN15492.1| putative Sec24-like COPII
protein [Arabidopsis thaliana]
Length = 1038
Score = 83.6 bits (205), Expect = 1e-15
Identities = 43/99 (43%), Positives = 50/99 (50%)
Frame = +2
Query: 209 MGTENPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPTPPAPPQASSPFSS 388
MGTEN G P FPARP SSPFA+AP P P FRP P+ Q + PF++
Sbjct: 1 MGTENQGYPNFPARPASSPFASAPPPGIPPQSGGPPTGSEAVGFRPFTPSASQPTRPFTA 60
Query: 389 FGPPVRPGAPSFRPAPPGRFNDPSVPPPPPPQSSNFLPA 505
GPP P + RP P F PPP SSN P+
Sbjct: 61 SGPPPAPPVGTMRPGQPSPFVSQIPGSRPPPPSSNSFPS 99
Score = 43.1 bits (100), Expect = 0.002
Identities = 36/114 (31%), Positives = 47/114 (40%), Gaps = 11/114 (9%)
Frame = +2
Query: 212 GTENPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPP----FRPTPPAPPQASSP 379
G +P P +P++ P A P PQ+ T + +PG++ PP P P Q+ P
Sbjct: 136 GHLSPPMSLRPQQPMA-PVAMGPPPQSTT--SGLPGANAYPPATDYHMPARPGFQQSMPP 192
Query: 380 FSSFGPPVRPGAPSFRPAP-------PGRFNDPSVPPPPPPQSSNFLPAGNFQQ 520
+ P V PSF P P F PP PPP SS G F Q
Sbjct: 193 VTPSYPGVGGSQPSFPGYPSKQVLQAPTPFQTSQGPPGPPPVSSYPPHTGGFAQ 246
>ref|NP_187366.1| putative Sec24-like COPII protein; protein id: At3g07100.1
[Arabidopsis thaliana] gi|13124526|sp|Q9SFU0|S24A_ARATH
Putative protein transport protein Sec24-like At3g07100
gi|6642655|gb|AAF20236.1|AC012395_23 putative Sec24-like
COPII protein [Arabidopsis thaliana]
Length = 1054
Score = 83.6 bits (205), Expect = 1e-15
Identities = 43/99 (43%), Positives = 50/99 (50%)
Frame = +2
Query: 209 MGTENPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPTPPAPPQASSPFSS 388
MGTEN G P FPARP SSPFA+AP P P FRP P+ Q + PF++
Sbjct: 1 MGTENQGYPNFPARPASSPFASAPPPGIPPQSGGPPTGSEAVGFRPFTPSASQPTRPFTA 60
Query: 389 FGPPVRPGAPSFRPAPPGRFNDPSVPPPPPPQSSNFLPA 505
GPP P + RP P F PPP SSN P+
Sbjct: 61 SGPPPAPPVGTMRPGQPSPFVSQIPGSRPPPPSSNSFPS 99
Score = 43.1 bits (100), Expect = 0.002
Identities = 36/114 (31%), Positives = 47/114 (40%), Gaps = 11/114 (9%)
Frame = +2
Query: 212 GTENPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPP----FRPTPPAPPQASSP 379
G +P P +P++ P A P PQ+ T + +PG++ PP P P Q+ P
Sbjct: 136 GHLSPPMSLRPQQPMA-PVAMGPPPQSTT--SGLPGANAYPPATDYHMPARPGFQQSMPP 192
Query: 380 FSSFGPPVRPGAPSFRPAP-------PGRFNDPSVPPPPPPQSSNFLPAGNFQQ 520
+ P V PSF P P F PP PPP SS G F Q
Sbjct: 193 VTPSYPGVGGSQPSFPGYPSKQVLQAPTPFQTSQGPPGPPPVSSYPPHTGGFAQ 246
>gb|EAA35680.1| predicted protein [Neurospora crassa]
Length = 283
Score = 70.1 bits (170), Expect = 2e-11
Identities = 41/95 (43%), Positives = 47/95 (49%), Gaps = 2/95 (2%)
Frame = +2
Query: 227 GRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPTPPAPPQASS--PFSSFGPP 400
G P P P SS + P PQ+ P AS G PPP P+PPA A S P SS PP
Sbjct: 104 GTPPPPPPPASSG-SPPPPPQSSVPPASSGGPSAPPPPPPSPPASSSAPSAPPASSGAPP 162
Query: 401 VRPGAPSFRPAPPGRFNDPSVPPPPPPQSSNFLPA 505
P + PAPP PS PPPP P +S P+
Sbjct: 163 APPASSGAPPAPPASSGSPSPPPPPHPPASTGAPS 197
Score = 57.0 bits (136), Expect = 1e-07
Identities = 37/95 (38%), Positives = 43/95 (44%), Gaps = 7/95 (7%)
Frame = +2
Query: 242 PARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRP-TPPAPPQASSPFSSFGPPVRPGAP 418
PA P TV P +S PPPP +PP PPQ+S P +S G P P P
Sbjct: 81 PAHSTVVTTVTIPVSTTVCPVSSGTPPPPPPPASSGSPPPPPQSSVPPASSGGPSAPPPP 140
Query: 419 SFRPAPPGRFNDPSVP------PPPPPQSSNFLPA 505
P+PP + PS P PP PP SS PA
Sbjct: 141 P--PSPPASSSAPSAPPASSGAPPAPPASSGAPPA 173
Score = 52.0 bits (123), Expect = 4e-06
Identities = 40/94 (42%), Positives = 46/94 (48%), Gaps = 6/94 (6%)
Frame = +2
Query: 227 GRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRP-TPPAPPQAS---SPFSSFG 394
G P+ P P SP A++ AP P AS G+ P PP PPAPP +S SP
Sbjct: 132 GGPSAPPPPPPSPPASSSAPSA--PPASS-GAPPAPPASSGAPPAPPASSGSPSPPPPPH 188
Query: 395 PPVRPGAPSFRPA--PPGRFNDPSVPPPPPPQSS 490
PP GAPS PA P N S PPPP +S
Sbjct: 189 PPASTGAPSAPPASTQPSPSNPASSAAPPPPHNS 222
Score = 37.4 bits (85), Expect = 0.11
Identities = 27/76 (35%), Positives = 33/76 (42%), Gaps = 2/76 (2%)
Frame = +2
Query: 221 NPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSD--PPPPFRPTPPAPPQASSPFSSFG 394
+P P P P S+ +AP P + P S P S PPPP P A ++S SS G
Sbjct: 180 SPSPPPPPHPPASTGAPSAP-PASTQPSPSNPASSAAPPPPHNSGPSATEPSASGSSSGG 238
Query: 395 PPVRPGAPSFRPAPPG 442
GAP PG
Sbjct: 239 GGGGGGAPPTATTSPG 254
Score = 31.6 bits (70), Expect = 6.0
Identities = 19/73 (26%), Positives = 31/73 (42%)
Frame = +2
Query: 224 PGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPTPPAPPQASSPFSSFGPPV 403
P P P +S + +P P P ++ S PP +P+P P +++P PP
Sbjct: 165 PASSGAPPAPPASSGSPSPPPPPHPPASTGAPSAPPASTQPSPSNPASSAAP----PPPH 220
Query: 404 RPGAPSFRPAPPG 442
G + P+ G
Sbjct: 221 NSGPSATEPSASG 233
>sp|P21997|SSGP_VOLCA SULFATED SURFACE GLYCOPROTEIN 185 (SSG 185) gi|99441|pir||A33647
sulfated surface glycoprotein 185 - Volvox carteri
gi|1405821|emb|CAA35953.1| SULFATED SURFACE GLYCOPROTEIN
185 [Volvox carteri]
Length = 485
Score = 66.2 bits (160), Expect = 2e-10
Identities = 38/92 (41%), Positives = 45/92 (48%)
Frame = +2
Query: 230 RPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPTPPAPPQASSPFSSFGPPVRP 409
+PT +RP S P + P P P S P PPPP P PP PP + P PP P
Sbjct: 234 QPTASSRPPSPPPSPRP-PSPPPPSPSPPPPPPPPP--PPPPPPPPSPPPPPPPPPPPPP 290
Query: 410 GAPSFRPAPPGRFNDPSVPPPPPPQSSNFLPA 505
P P+PP + PS P PPPP + LPA
Sbjct: 291 PPPPPSPSPPRKPPSPSPPVPPPPSPPSVLPA 322
Score = 57.0 bits (136), Expect = 1e-07
Identities = 35/90 (38%), Positives = 38/90 (41%), Gaps = 4/90 (4%)
Frame = +2
Query: 233 PTFPARPVSSPFAAAP----APQTVTPFASMPGSDPPPPFRPTPPAPPQASSPFSSFGPP 400
PT + P +P AP P + P AS PPP RP P PP S P PP
Sbjct: 209 PTGLSGPNVNPIGPAPNNSPLPPSPQPTASSRPPSPPPSPRPPSPPPPSPSPPPPPPPPP 268
Query: 401 VRPGAPSFRPAPPGRFNDPSVPPPPPPQSS 490
P P P PP P PPPPPP S
Sbjct: 269 PPPPPPPPSPPPPPPPPPPPPPPPPPPSPS 298
Score = 40.0 bits (92), Expect = 0.017
Identities = 26/74 (35%), Positives = 30/74 (40%)
Frame = +2
Query: 221 NPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPTPPAPPQASSPFSSFGPP 400
+P P P P P P+P P P PPPP P+PP P + SP P
Sbjct: 257 SPSPPPPPPPPPPPPPPPPPSPPPPPPPPPPPPPPPPPP-SPSPPRKPPSPSP----PVP 311
Query: 401 VRPGAPSFRPAPPG 442
P PS PA G
Sbjct: 312 PPPSPPSVLPAATG 325
Score = 35.4 bits (80), Expect = 0.42
Identities = 22/67 (32%), Positives = 24/67 (34%)
Frame = +2
Query: 224 PGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPTPPAPPQASSPFSSFGPPV 403
P P P P P P P P P P PP +P P+PP P P V
Sbjct: 263 PPPPPPPPPPPPPPSPPPPPPPPPPPPPPPPPPSPSPPRKPPSPSPPVPPPPSP---PSV 319
Query: 404 RPGAPSF 424
P A F
Sbjct: 320 LPAATGF 326
>ref|NP_176464.1| hypothetical protein; protein id: At1g62763.1 [Arabidopsis
thaliana] gi|6630459|gb|AAF19547.1|AC007190_15 F23N19.12
[Arabidopsis thaliana]
Length = 312
Score = 66.2 bits (160), Expect = 2e-10
Identities = 40/116 (34%), Positives = 55/116 (46%), Gaps = 12/116 (10%)
Frame = +2
Query: 179 LPVILIVVL*MGTENPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRPT-PP 355
L + + ++L + T + + P SP + +P + P + P S PP P+ PP
Sbjct: 10 LHLSIAILLFITTSSSSLSPSSSSPSLSPSPPSSSPSSAPPSSLSPSSPPPLSLSPSSPP 69
Query: 356 APPQASSPFSSFGPPVRPGAPSFRP--APPGRFN---------DPSVPPPPPPQSS 490
PP +SSP SS P + P PS P APP + PS PPPPPP SS
Sbjct: 70 PPPPSSSPLSSLSPSLSPSPPSSSPSSAPPSSLSPSSPPPLSLSPSSPPPPPPSSS 125
Score = 39.7 bits (91), Expect = 0.022
Identities = 24/70 (34%), Positives = 32/70 (45%), Gaps = 11/70 (15%)
Frame = +2
Query: 221 NPGRPTFPARPVSSPFAAAPAPQTVTPFASMPGSDPPPPFRP-----------TPPAPPQ 367
+P P P P SSP ++ + +P +S P S PP P +PP PP
Sbjct: 64 SPSSPP-PPPPSSSPLSSLSPSLSPSPPSSSPSSAPPSSLSPSSPPPLSLSPSSPPPPPP 122
Query: 368 ASSPFSSFGP 397
+SSP SS P
Sbjct: 123 SSSPLSSLSP 132
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 529,968,825
Number of Sequences: 1393205
Number of extensions: 15804805
Number of successful extensions: 384962
Number of sequences better than 10.0: 11902
Number of HSP's better than 10.0 without gapping: 98557
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 222808
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 17308240320
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
EST assemble image
|
|
|
|
clone |
accession |
position |
1 |
MPDL006d08_f |
AV776820 |
1 |
229 |
2 |
SPDL034h08_f |
BP054160 |
36 |
528 |
|
Lotus japonicus
Kazusa DNA Research Institute