Jatropha Genome Database
- Jcr4S00006.270
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Jcr4S00006.270
(262 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value
AT1G48780.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     109  2e-24
AT3G18300.1 | Symbols: | unknown protein; BEST Arabidopsis thal...     101  5e-22
AT1G68330.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      95  5e-20
AT1G67050.1 | Symbols: | unknown protein; BEST Arabidopsis thal...      76  2e-14
>AT1G48780.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT3G18300.1); Has 89 Blast hits to 89 proteins in
11 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi
- 0; Plants - 86; Viruses - 0; Other Eukaryotes - 1
(source: NCBI BLink). | chr1:18041989-18042744 FORWARD
LENGTH=251
Length = 251
Score = 109 bits (272), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 122/262 (46%), Gaps = 35/262 (13%)
Query: 1 MCSETSPRISFSNDLAHGDGT-----ENEHVPRRDTTLLES-NSDFEFNISSRLLDYESS 54
+C+E RISFS+DL D E + RRD TLL+S NSDFEF+IS+ +SS
Sbjct: 2 ICTEALQRISFSSDLGQSDKAPPPVIEPSGLIRRDETLLDSSNSDFEFHISNSFDPGDSS 61
Query: 55 LADELFSNGMLLPFQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXXXXXX 114
ADE+F++GM+LPF T P R +
Sbjct: 62 PADEIFADGMILPFHVTAASTVPK------RLYKYELPPITSSLSPSPLSPQPLPTKHSE 115
Query: 115 XXXEIRVSNSELEEKAEPKS-TFWGFKRSNSLNCDLKKGSIFSLPVLSRSNSTGTAAPNS 173
R S + + +AE S +FW FKRS+SLNCD+KK I S P L+RSNSTG+ NS
Sbjct: 116 KETNGRASGANSDSEAEKSSKSFWSFKRSSSLNCDIKKSLICSFPRLTRSNSTGSVT-NS 174
Query: 174 KRSSMAKKLLPSTSSGSSTNYVYTFPHXXXXXXXXKPPLKKKYGGVPNCNNGVKISPILN 233
KR+ + SS SS Y F K KK GG + P+LN
Sbjct: 175 KRAMLRDVNNHRPSSRSSCCNAYQF-------RPQKHTGKKGEGG-----GSFSVIPVLN 222
Query: 234 VPPPYIAKGAANLFGLGSLLRN 255
P + FGLGS+LR+
Sbjct: 223 GP---------STFGLGSILRH 235
>AT3G18300.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G48780.1); Has 69 Blast hits to 69 proteins in
7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
- 0; Plants - 69; Viruses - 0; Other Eukaryotes - 0
(source: NCBI BLink). | chr3:6283396-6284220 FORWARD
LENGTH=274
Length = 274
Score = 101 bits (252), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 101/291 (34%), Positives = 130/291 (44%), Gaps = 47/291 (16%)
Query: 1 MCSETS-PRISFSNDLAHGD-GTENEHVP----RRDTTLLES-NSDFEFNISSRLLDYES 53
+C+E++ R SF+ DL D GT E P RRDTTLL+S NSDFEF+ISS +S
Sbjct: 2 ICTESNHQRFSFAGDLGQSDKGTPMEQQPSGPVRRDTTLLDSSNSDFEFHISSNFDPGDS 61
Query: 54 SLADELFSNGMLLP---FQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXX 110
S ADE+F++GM+LP FQ T P + + V
Sbjct: 62 SPADEIFADGMILPVLPFQVTATSTMPK--RLYKYELPPIVSAPTLSSYLPPLPLPLPEH 119
Query: 111 XXXXXXXEIRVS--------NSELEEKAEPKSTFWGFKRSNSLNCDLKKGSIFSLPVLSR 162
E R S NS+ E + KS FW FKRS+SLNCD+KK I S P L+R
Sbjct: 120 SRKYSVKETRGSLNGRGSGANSDSEAEKSSKS-FWSFKRSSSLNCDIKKSLICSFPRLTR 178
Query: 163 SNSTGTAAPNSKRSSMAKKLLPSTSSGSSTNYVYTFPHXXXXXXXXKP-----------P 211
SNSTG+ A SKR ++L + SS + P P P
Sbjct: 179 SNSTGSVA-ISKR-----EMLRDINKHSSQRHGVPRPGVNPSSHMRPPSSFCCSSYQFRP 232
Query: 212 LKKKYGGVPNCNNGVKISPILNVPPPYIAKGAANLFGLGSLLRNGKEKNRK 262
K I+P++ P P FGLGS+LR KEK +K
Sbjct: 233 QKHAGKNGGGRGGSFWIAPVIGGPSP---------FGLGSILRLTKEKKKK 274
>AT1G68330.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT1G48780.1); Has 155 Blast hits to 147 proteins
in 23 species: Archae - 0; Bacteria - 0; Metazoa - 19;
Fungi - 3; Plants - 126; Viruses - 0; Other Eukaryotes -
7 (source: NCBI BLink). | chr1:25611242-25612048 FORWARD
LENGTH=268
Length = 268
Score = 94.7 bits (234), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/282 (35%), Positives = 127/282 (45%), Gaps = 53/282 (18%)
Query: 2 CSETS-----PRISFSNDLAHGDGTENEHVPRRDTTLLESNSDFEFNISSRLLDYESSLA 56
CSE S PRISFS DL D T++ V R D+TLL+S S+F+F S E S A
Sbjct: 7 CSEASGSGISPRISFSYDL---DSTDDGEV-RLDSTLLDSGSEFDFCFGSSCSVQEVSPA 62
Query: 57 DELFSNGMLLPFQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXXXXXXXX 116
DELFS G +LP Q K+E+ P Q ++ V R
Sbjct: 63 DELFSEGKILPVQIKKEESLP---QTVTFRVPRSASLSSSSSSSSSSSSSSRAPEKKMRL 119
Query: 117 XEIRVSNSELEEKAEPKSTFWGFKRSNSLNCDL---KKGSIFSLPVLSRSNSTGTAAPNS 173
E+ + N E + + +P+ F FKRS SLN D KG I S LSRSNST PN
Sbjct: 120 KELLL-NPESDFEDKPRGLFLQFKRSISLNYDKSRNSKGLIRSFHFLSRSNST----PNP 174
Query: 174 KRSSMAKKLLPSTSSGSSTNYVYTFPHXXXXXXXXKPPL---------------KKKYG- 217
+ K+ T++ PH KPPL KK G
Sbjct: 175 NLDLLPKE----------THH----PHKTHNLPKHKPPLRRSSSLSSSSVPFYSKKPLGR 220
Query: 218 -GVPNCNNGVKISPILNVPPP-YIAKGAANLFGLGSLLRNGK 257
N N GV++SP+LN PPP +I+ A F +GSL NGK
Sbjct: 221 NSFGNGNGGVRVSPVLNFPPPAFISNVADGFFSIGSLC-NGK 261
>AT1G67050.1 | Symbols: | unknown protein; BEST Arabidopsis
thaliana protein match is: unknown protein
(TAIR:AT5G38320.1); Has 617 Blast hits to 318 proteins
in 80 species: Archae - 0; Bacteria - 16; Metazoa - 141;
Fungi - 62; Plants - 128; Viruses - 2; Other Eukaryotes
- 268 (source: NCBI BLink). | chr1:25028862-25029656
REVERSE LENGTH=264
Length = 264
Score = 76.3 bits (186), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/279 (34%), Positives = 130/279 (46%), Gaps = 44/279 (15%)
Query: 3 SETSPRISFSNDLAHGDGTENEHVPRRDT----TLLESNSDFEFNI-----SSRLLDYES 53
S SPRISFS D D E P R + + L S+ DF+F I S D S
Sbjct: 10 SNMSPRISFSRDFCQSDAIPIEKRPLRSSNSKPSSLNSSIDFDFCIPGGVNSGESFDQGS 69
Query: 54 SLADELFSNGMLLPFQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXXXXX 113
ADELFSNG +LP + K+ S+
Sbjct: 70 WSADELFSNGKILPTEIKKKPEPGKKEPEPKPVKSK-----------PDSRKQRKQPNEE 118
Query: 114 XXXXEIRVSNSELEEKAEPKSTFWGFKRSNSLNCDLKKG-SIFSLPVLSRSNSTGTAAPN 172
++ ++ EEK KS FWGFKRS+SLNC G S+ LP+L+RSNSTG+ +
Sbjct: 119 QQEDDVIITT---EEKTNTKS-FWGFKRSSSLNCGSTYGRSLCPLPLLNRSNSTGSTSSK 174
Query: 173 SKRSSMAK-----KLLPSTSSGSSTNYVYTFPHXXXXXXXXKPPLKK---KYGGVPNCNN 224
K+SS K KL S+S SS++ + + KPPLKK Y +
Sbjct: 175 QKQSSSRKHNEHVKLQQSSSLSSSSSASSSLSN----NGFSKPPLKKSYGGYSYGSHGGG 230
Query: 225 GVKISPILNVPPPYIAKGAANLFGLGSLLR-NGKEKNRK 262
G+++SP++NV P + NLFG GS+ NG++KN+K
Sbjct: 231 GIRVSPVINVVP------SGNLFGFGSMFSGNGRDKNKK 263