Jatropha Genome Database

Jcr4S00006.270
Show Alignment: 
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Jcr4S00006.270
         (262 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G48780.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   109   2e-24
AT3G18300.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...   101   5e-22
AT1G68330.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    95   5e-20
AT1G67050.1 | Symbols:  | unknown protein; BEST Arabidopsis thal...    76   2e-14

>AT1G48780.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT3G18300.1); Has 89 Blast hits to 89 proteins in
           11 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi
           - 0; Plants - 86; Viruses - 0; Other Eukaryotes - 1
           (source: NCBI BLink). | chr1:18041989-18042744 FORWARD
           LENGTH=251
          Length = 251

 Score =  109 bits (272), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 94/262 (35%), Positives = 122/262 (46%), Gaps = 35/262 (13%)

Query: 1   MCSETSPRISFSNDLAHGDGT-----ENEHVPRRDTTLLES-NSDFEFNISSRLLDYESS 54
           +C+E   RISFS+DL   D       E   + RRD TLL+S NSDFEF+IS+     +SS
Sbjct: 2   ICTEALQRISFSSDLGQSDKAPPPVIEPSGLIRRDETLLDSSNSDFEFHISNSFDPGDSS 61

Query: 55  LADELFSNGMLLPFQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXXXXXX 114
            ADE+F++GM+LPF      T P       R     +                       
Sbjct: 62  PADEIFADGMILPFHVTAASTVPK------RLYKYELPPITSSLSPSPLSPQPLPTKHSE 115

Query: 115 XXXEIRVSNSELEEKAEPKS-TFWGFKRSNSLNCDLKKGSIFSLPVLSRSNSTGTAAPNS 173
                R S +  + +AE  S +FW FKRS+SLNCD+KK  I S P L+RSNSTG+   NS
Sbjct: 116 KETNGRASGANSDSEAEKSSKSFWSFKRSSSLNCDIKKSLICSFPRLTRSNSTGSVT-NS 174

Query: 174 KRSSMAKKLLPSTSSGSSTNYVYTFPHXXXXXXXXKPPLKKKYGGVPNCNNGVKISPILN 233
           KR+ +        SS SS    Y F          K   KK  GG         + P+LN
Sbjct: 175 KRAMLRDVNNHRPSSRSSCCNAYQF-------RPQKHTGKKGEGG-----GSFSVIPVLN 222

Query: 234 VPPPYIAKGAANLFGLGSLLRN 255
            P         + FGLGS+LR+
Sbjct: 223 GP---------STFGLGSILRH 235


>AT3G18300.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G48780.1); Has 69 Blast hits to 69 proteins in
           7 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi
           - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 0
           (source: NCBI BLink). | chr3:6283396-6284220 FORWARD
           LENGTH=274
          Length = 274

 Score =  101 bits (252), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 101/291 (34%), Positives = 130/291 (44%), Gaps = 47/291 (16%)

Query: 1   MCSETS-PRISFSNDLAHGD-GTENEHVP----RRDTTLLES-NSDFEFNISSRLLDYES 53
           +C+E++  R SF+ DL   D GT  E  P    RRDTTLL+S NSDFEF+ISS     +S
Sbjct: 2   ICTESNHQRFSFAGDLGQSDKGTPMEQQPSGPVRRDTTLLDSSNSDFEFHISSNFDPGDS 61

Query: 54  SLADELFSNGMLLP---FQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXX 110
           S ADE+F++GM+LP   FQ     T P   +     +   V                   
Sbjct: 62  SPADEIFADGMILPVLPFQVTATSTMPK--RLYKYELPPIVSAPTLSSYLPPLPLPLPEH 119

Query: 111 XXXXXXXEIRVS--------NSELEEKAEPKSTFWGFKRSNSLNCDLKKGSIFSLPVLSR 162
                  E R S        NS+ E +   KS FW FKRS+SLNCD+KK  I S P L+R
Sbjct: 120 SRKYSVKETRGSLNGRGSGANSDSEAEKSSKS-FWSFKRSSSLNCDIKKSLICSFPRLTR 178

Query: 163 SNSTGTAAPNSKRSSMAKKLLPSTSSGSSTNYVYTFPHXXXXXXXXKP-----------P 211
           SNSTG+ A  SKR     ++L   +  SS  +    P          P           P
Sbjct: 179 SNSTGSVA-ISKR-----EMLRDINKHSSQRHGVPRPGVNPSSHMRPPSSFCCSSYQFRP 232

Query: 212 LKKKYGGVPNCNNGVKISPILNVPPPYIAKGAANLFGLGSLLRNGKEKNRK 262
            K              I+P++  P P         FGLGS+LR  KEK +K
Sbjct: 233 QKHAGKNGGGRGGSFWIAPVIGGPSP---------FGLGSILRLTKEKKKK 274


>AT1G68330.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT1G48780.1); Has 155 Blast hits to 147 proteins
           in 23 species: Archae - 0; Bacteria - 0; Metazoa - 19;
           Fungi - 3; Plants - 126; Viruses - 0; Other Eukaryotes -
           7 (source: NCBI BLink). | chr1:25611242-25612048 FORWARD
           LENGTH=268
          Length = 268

 Score = 94.7 bits (234), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/282 (35%), Positives = 127/282 (45%), Gaps = 53/282 (18%)

Query: 2   CSETS-----PRISFSNDLAHGDGTENEHVPRRDTTLLESNSDFEFNISSRLLDYESSLA 56
           CSE S     PRISFS DL   D T++  V R D+TLL+S S+F+F   S     E S A
Sbjct: 7   CSEASGSGISPRISFSYDL---DSTDDGEV-RLDSTLLDSGSEFDFCFGSSCSVQEVSPA 62

Query: 57  DELFSNGMLLPFQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXXXXXXXX 116
           DELFS G +LP Q  K+E+ P   Q ++  V R                           
Sbjct: 63  DELFSEGKILPVQIKKEESLP---QTVTFRVPRSASLSSSSSSSSSSSSSSRAPEKKMRL 119

Query: 117 XEIRVSNSELEEKAEPKSTFWGFKRSNSLNCDL---KKGSIFSLPVLSRSNSTGTAAPNS 173
            E+ + N E + + +P+  F  FKRS SLN D     KG I S   LSRSNST    PN 
Sbjct: 120 KELLL-NPESDFEDKPRGLFLQFKRSISLNYDKSRNSKGLIRSFHFLSRSNST----PNP 174

Query: 174 KRSSMAKKLLPSTSSGSSTNYVYTFPHXXXXXXXXKPPL---------------KKKYG- 217
               + K+          T++    PH        KPPL               KK  G 
Sbjct: 175 NLDLLPKE----------THH----PHKTHNLPKHKPPLRRSSSLSSSSVPFYSKKPLGR 220

Query: 218 -GVPNCNNGVKISPILNVPPP-YIAKGAANLFGLGSLLRNGK 257
               N N GV++SP+LN PPP +I+  A   F +GSL  NGK
Sbjct: 221 NSFGNGNGGVRVSPVLNFPPPAFISNVADGFFSIGSLC-NGK 261


>AT1G67050.1 | Symbols:  | unknown protein; BEST Arabidopsis
           thaliana protein match is: unknown protein
           (TAIR:AT5G38320.1); Has 617 Blast hits to 318 proteins
           in 80 species: Archae - 0; Bacteria - 16; Metazoa - 141;
           Fungi - 62; Plants - 128; Viruses - 2; Other Eukaryotes
           - 268 (source: NCBI BLink). | chr1:25028862-25029656
           REVERSE LENGTH=264
          Length = 264

 Score = 76.3 bits (186), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 95/279 (34%), Positives = 130/279 (46%), Gaps = 44/279 (15%)

Query: 3   SETSPRISFSNDLAHGDGTENEHVPRRDT----TLLESNSDFEFNI-----SSRLLDYES 53
           S  SPRISFS D    D    E  P R +    + L S+ DF+F I     S    D  S
Sbjct: 10  SNMSPRISFSRDFCQSDAIPIEKRPLRSSNSKPSSLNSSIDFDFCIPGGVNSGESFDQGS 69

Query: 54  SLADELFSNGMLLPFQDNKQETSPNVSQNISRNVSRHVXXXXXXXXXXXXXXXXXXXXXX 113
             ADELFSNG +LP +  K+              S+                        
Sbjct: 70  WSADELFSNGKILPTEIKKKPEPGKKEPEPKPVKSK-----------PDSRKQRKQPNEE 118

Query: 114 XXXXEIRVSNSELEEKAEPKSTFWGFKRSNSLNCDLKKG-SIFSLPVLSRSNSTGTAAPN 172
               ++ ++    EEK   KS FWGFKRS+SLNC    G S+  LP+L+RSNSTG+ +  
Sbjct: 119 QQEDDVIITT---EEKTNTKS-FWGFKRSSSLNCGSTYGRSLCPLPLLNRSNSTGSTSSK 174

Query: 173 SKRSSMAK-----KLLPSTSSGSSTNYVYTFPHXXXXXXXXKPPLKK---KYGGVPNCNN 224
            K+SS  K     KL  S+S  SS++   +  +        KPPLKK    Y    +   
Sbjct: 175 QKQSSSRKHNEHVKLQQSSSLSSSSSASSSLSN----NGFSKPPLKKSYGGYSYGSHGGG 230

Query: 225 GVKISPILNVPPPYIAKGAANLFGLGSLLR-NGKEKNRK 262
           G+++SP++NV P      + NLFG GS+   NG++KN+K
Sbjct: 231 GIRVSPVINVVP------SGNLFGFGSMFSGNGRDKNKK 263