Miyakogusa Predicted Gene

Lj0g3v0141589.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj0g3v0141589.1 Non Characterized Hit- tr|I1NLP1|I1NLP1_ORYGL
Uncharacterized protein OS=Oryza glaberrima PE=3
SV=1,37.42,6e-18,INTEGRAL MEMBRANE FAMILY PROTEIN,NULL; NITRATE,
FORMATE, IRON DEHYDROGENASE,NULL; A_tha_TIGR01569: p,CUFF.8642.1
         (163 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G03700.1 | Symbols:  | Uncharacterised protein family (UPF049...   209   7e-55
AT4G03540.1 | Symbols:  | Uncharacterised protein family (UPF049...   197   3e-51
AT5G44550.1 | Symbols:  | Uncharacterised protein family (UPF049...    96   8e-21
AT5G15290.1 | Symbols:  | Uncharacterised protein family (UPF049...    82   1e-16
AT1G14160.1 | Symbols:  | Uncharacterised protein family (UPF049...    77   3e-15
AT4G25040.1 | Symbols:  | Uncharacterised protein family (UPF049...    69   2e-12
AT2G36100.1 | Symbols:  | Uncharacterised protein family (UPF049...    69   2e-12
AT4G20390.1 | Symbols:  | Uncharacterised protein family (UPF049...    68   2e-12
AT4G15610.1 | Symbols:  | Uncharacterised protein family (UPF049...    68   3e-12
AT2G27370.1 | Symbols:  | Uncharacterised protein family (UPF049...    64   3e-11
AT5G06200.1 | Symbols:  | Uncharacterised protein family (UPF049...    59   1e-09
AT3G06390.1 | Symbols:  | Uncharacterised protein family (UPF049...    57   7e-09
AT4G15630.1 | Symbols:  | Uncharacterised protein family (UPF049...    55   2e-08
AT3G11550.1 | Symbols:  | Uncharacterised protein family (UPF049...    53   9e-08
AT1G17200.1 | Symbols:  | Uncharacterised protein family (UPF049...    50   1e-06
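
The summary table above has a regular layout: a hit identifier, a free-text description, then the bit score and E-value as the last two whitespace-separated fields. A minimal sketch of reading it is shown below, assuming the report text is already available as a list of lines. The function name `parse_hit_table` is hypothetical and not part of BLAST or any library.

```python
def parse_hit_table(lines):
    """Return (identifier, bit_score, e_value) tuples from BLAST summary lines.

    Assumes each hit line contains a '|' in its description and ends with
    two numeric fields (bit score, then E-value), as in the table above.
    """
    hits = []
    for line in lines:
        # Split off the last two fields; the description may contain spaces.
        fields = line.rsplit(None, 2)
        if len(fields) != 3 or "|" not in fields[0]:
            continue  # header, blank, or otherwise unrelated line
        try:
            score, e_value = float(fields[1]), float(fields[2])
        except ValueError:
            continue  # last two fields are not numeric
        identifier = fields[0].split()[0]
        hits.append((identifier, score, e_value))
    return hits


# Two lines copied from the table above, plus its header line.
sample = [
    "Sequences producing significant alignments:                      (bits) Value",
    "AT1G03700.1 | Symbols:  | Uncharacterised protein family (UPF049...   209   7e-55",
    "AT1G17200.1 | Symbols:  | Uncharacterised protein family (UPF049...    50   1e-06",
]
hits = parse_hit_table(sample)
```

Splitting from the right avoids having to interpret the description, which is truncated with `...` in this table and may contain arbitrary spacing.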

>AT1G03700.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr1:921038-921844 FORWARD LENGTH=164
          Length = 164

 Score =  209 bits (532), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 99/160 (61%), Positives = 127/160 (79%)

Query: 4   TRWICHLLVRFLAFAATLSAVIVMATSHERASFLAISFEAKYTNTPAFKYFVIANSVVTV 63
           T+ +  L++RF AF A L AVI M TS ER+SF  IS  AKY++  AFKYFVIAN++VTV
Sbjct: 5   TQRLGGLVLRFAAFCAALGAVIAMITSRERSSFFVISLVAKYSDLAAFKYFVIANAIVTV 64

Query: 64  YGFLVFFLPAESLLWRLVVAMDLVFTMLLISSISAALTIAEVGKKGNSYAAWLPICDSVP 123
           Y FLV FLP ESLLW+ VV +DL+ TMLL SS+SAA+ +A+VGK+GN+ A WLPIC  VP
Sbjct: 65  YSFLVLFLPKESLLWKFVVVLDLMVTMLLTSSLSAAVAVAQVGKRGNANAGWLPICGQVP 124

Query: 124 KFCDQVTGALIAGFIAVIVYMILLLHSIHSVLDPLLLRKT 163
           +FCDQ+TGALIAG +A+++Y+ LL+ SIH V+DP LLRK+
Sbjct: 125 RFCDQITGALIAGLVALVLYVFLLIFSIHHVVDPFLLRKS 164
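
Each alignment in this report carries an "Identities = a/b (p%)" line, optionally with Positives and Gaps in the same form. The percentages shown here are consistent with truncation rather than rounding (99/160 is 61.875% and is reported as 61%). The sketch below recomputes them under that assumption. The function name `alignment_percents` is hypothetical.

```python
import re

# Matches "Identities = 99/160", "Positives = 127/160", "Gaps = 18/168".
PAIR = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+)")


def alignment_percents(line):
    """Map each statistic name on the line to its truncated integer percentage."""
    # Floor division reproduces the truncated values printed in this report.
    return {name: 100 * int(a) // int(b) for name, a, b in PAIR.findall(line)}


first = alignment_percents(" Identities = 99/160 (61%), Positives = 127/160 (79%)")
gapped = alignment_percents(
    " Identities = 64/168 (38%), Positives = 96/168 (57%), Gaps = 18/168 (10%)"
)
```

Both calls give the same percentages that BLAST prints in parentheses, so the counts can be treated as the primary data and the percentages as derived.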


>AT4G03540.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr4:1570042-1571483 FORWARD LENGTH=164
          Length = 164

 Score =  197 bits (500), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 108/160 (67%), Positives = 135/160 (84%)

Query: 4   TRWICHLLVRFLAFAATLSAVIVMATSHERASFLAISFEAKYTNTPAFKYFVIANSVVTV 63
           T+ I  L++R  AF A L+A+IVM TS ERASFLAIS EAKYT+  AFKYFVIAN+VV+V
Sbjct: 5   TKRIGGLVLRLAAFGAALAALIVMITSRERASFLAISLEAKYTDMAAFKYFVIANAVVSV 64

Query: 64  YGFLVFFLPAESLLWRLVVAMDLVFTMLLISSISAALTIAEVGKKGNSYAAWLPICDSVP 123
           Y FLV FLP ESLLW+ VV +DLV TMLL SS+SAAL +A+VGKKGN+ A WLPIC  VP
Sbjct: 65  YSFLVLFLPKESLLWKFVVVLDLVMTMLLTSSLSAALAVAQVGKKGNANAGWLPICGQVP 124

Query: 124 KFCDQVTGALIAGFIAVIVYMILLLHSIHSVLDPLLLRKT 163
           KFCDQ+TGALIAGF+A+++Y++LLL+S+H+V+DP LL+K+
Sbjct: 125 KFCDQITGALIAGFVALVLYVLLLLYSLHAVVDPFLLQKS 164


>AT5G44550.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr5:17942100-17943174 REVERSE LENGTH=197
          Length = 197

 Score = 96.3 bits (238), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 64/168 (38%), Positives = 96/168 (57%), Gaps = 18/168 (10%)

Query: 8   CHLLV--RFLAFAATLSAVIVMATSHERASFL---------AISFEAKYTNTPAFKYFVI 56
           C +L+  R LAF+ATLSA IVM  + E  +F+           +F AK+ +TPAF +FV+
Sbjct: 16  CKILLGLRLLAFSATLSAAIVMGLNKETKTFIVGKVGNTPIQATFTAKFDHTPAFVFFVV 75

Query: 57  ANSVVTVYGFL-----VFFLPAESLLWRL--VVAMDLVFTMLLISSISAALTIAEVGKKG 109
           AN++V+ +  L     +F    E   +RL  V  +D++   L+ ++ +AA  +AEVGK G
Sbjct: 76  ANAMVSFHNLLMIALQIFGGKMEFTGFRLLSVAILDMLNVTLISAAANAAAFMAEVGKNG 135

Query: 110 NSYAAWLPICDSVPKFCDQVTGALIAGFIAVIVYMILLLHSIHSVLDP 157
           N +A W  ICD    +CD   GALIA F  VI+ +I+   SI  ++ P
Sbjct: 136 NKHARWDKICDRFATYCDHGAGALIAAFAGVILMLIISAASISRLVQP 183


>AT5G15290.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr5:4967094-4967846 FORWARD LENGTH=187
          Length = 187

 Score = 82.0 bits (201), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 9/146 (6%)

Query: 1   MAKTRWICHLLVRFLAFAATLSAVIVMATSHERASFLA--ISFEAKYTNTPAFKYFVIAN 58
           M++   I   ++R +AF  T+ + I+M T+HE   F    I F+A+Y + PA  +FV+AN
Sbjct: 21  MSRRIAILEFILRIVAFFNTIGSAILMGTTHETLPFFTQFIRFQAEYNDLPALTFFVVAN 80

Query: 59  SVVTVYGFLVFFLPAESLLWR-------LVVAMDLVFTMLLISSISAALTIAEVGKKGNS 111
           +VV+ Y  L   L    ++ R       L++ +D+    LL S  S+A  I  +   GN+
Sbjct: 81  AVVSGYLILSLTLAFVHIVKRKTQNTRILLIILDVAMLGLLTSGASSAAAIVYLAHNGNN 140

Query: 112 YAAWLPICDSVPKFCDQVTGALIAGF 137
              W  IC     FC++++G+LI  F
Sbjct: 141 KTNWFAICQQFNSFCERISGSLIGSF 166


>AT1G14160.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr1:4840798-4841660 REVERSE LENGTH=209
          Length = 209

 Score = 77.4 bits (189), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 83/151 (54%), Gaps = 13/151 (8%)

Query: 11  LVRFLAFAATLSAVIVMATSHERASFLA--ISFEAKYTNTPAFKYFVIANSVVTVYGFLV 68
           ++R  A   T+ + + M T+HE    L+  +  + KY++ P   +FV+AN++    G+LV
Sbjct: 54  VLRLFAVFGTIGSALAMGTTHESVVSLSQLVLLKVKYSDLPTLMFFVVANAISG--GYLV 111

Query: 69  FFLP--------AESLLWRLVV-AMDLVFTMLLISSISAALTIAEVGKKGNSYAAWLPIC 119
             LP         ++   R+++  +D V   L+ S  SAA     +  +GN+ A W PIC
Sbjct: 112 LSLPVSIFHIFSTQAKTSRIILLVVDTVMLALVSSGASAATATVYLAHEGNTTANWPPIC 171

Query: 120 DSVPKFCDQVTGALIAGFIAVIVYMILLLHS 150
                FC++++G+LI  F AVI+ M+++++S
Sbjct: 172 QQFDGFCERISGSLIGSFCAVILLMLIVINS 202


>AT4G25040.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr4:12868320-12869319 FORWARD LENGTH=170
          Length = 170

 Score = 68.6 bits (166), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/139 (38%), Positives = 74/139 (53%), Gaps = 4/139 (2%)

Query: 12  VRFLAFAATLSAVIVMATSHERASFLAISFEAKYTNTPAFKYFVIANSVVTVYGFLVFFL 71
           +R L   A ++++ VM T+ E AS   I+FEAKY+ + AF+Y V A   V          
Sbjct: 21  MRVLTIGAAMASMWVMITNREVASVYGIAFEAKYSYSSAFRYLVYAQIAVCAATLFTLVW 80

Query: 72  PAESLLWR-LVVAM---DLVFTMLLISSISAALTIAEVGKKGNSYAAWLPICDSVPKFCD 127
              ++  R LV A+   DL+ T+  IS+ SAA     VGK GN  A WLPIC  V  +C 
Sbjct: 81  ACLAVRRRGLVFALFFFDLLTTLTAISAFSAAFAEGYVGKYGNKQAGWLPICGYVHGYCS 140

Query: 128 QVTGALIAGFIAVIVYMIL 146
           +VT +L   F + I+  IL
Sbjct: 141 RVTISLAMSFASFILLFIL 159


>AT2G36100.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr2:15159744-15160669 REVERSE LENGTH=206
          Length = 206

 Score = 68.6 bits (166), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/144 (36%), Positives = 73/144 (50%), Gaps = 13/144 (9%)

Query: 7   ICHLLVRFLAFAATLSAVIVMATSHERASFLA--ISFEAKYTNTPAFKYFVIANSVVTVY 64
           I   L+R  A A T+ A  VM T+ E   F    + F+A Y + PAF+YFVIA +VV  Y
Sbjct: 47  IFDFLLRLAAIAVTIGAASVMYTAEETLPFFTQFLQFQAGYDDLPAFQYFVIAVAVVASY 106

Query: 65  GFLVFFLP--------AESLLWRLVVAM-DLVFTMLLISSISAALTIAEVGKKGNSYAAW 115
             LV  LP          ++  RL++ + D +   L  S+ +AA +I  +   GN    W
Sbjct: 107 --LVLSLPFSIVSIVRPHAVAPRLILLICDTLVVTLNTSAAAAAASITYLAHNGNQSTNW 164

Query: 116 LPICDSVPKFCDQVTGALIAGFIA 139
           LPIC     FC  V+ A++A  IA
Sbjct: 165 LPICQQFGDFCQNVSTAVVADSIA 188


>AT4G20390.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr4:11007068-11007869 FORWARD LENGTH=197
          Length = 197

 Score = 68.2 bits (165), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/139 (33%), Positives = 74/139 (53%), Gaps = 16/139 (11%)

Query: 6   WICHLLVRFLAFAATLSAVIVMATSHERASF---------LAISFEAKYTNTPAFKYFVI 56
           W   L +R  AF ATL+A IVM+ + E  +          +  +  AK+ +TPAF +FVI
Sbjct: 16  WKLLLGLRIFAFMATLAAAIVMSLNKETKTLVVATIGTVPIKATLTAKFQHTPAFVFFVI 75

Query: 57  ANSVVTVYGFL-----VFFLPAESLLWRL--VVAMDLVFTMLLISSISAALTIAEVGKKG 109
           AN +V+ +  L     +F    E    RL  +  +D++   L+ ++ +AA+ +AE+GK G
Sbjct: 76  ANVMVSFHNLLMIVVQIFSRKLEYKGLRLLSIAILDMLNATLVSAAANAAVFVAELGKNG 135

Query: 110 NSYAAWLPICDSVPKFCDQ 128
           N +A W  +CD    +CD 
Sbjct: 136 NKHAKWNKVCDRFTTYCDH 154


>AT4G15610.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr4:8909162-8910641 FORWARD LENGTH=193
          Length = 193

 Score = 68.2 bits (165), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/145 (33%), Positives = 76/145 (52%), Gaps = 15/145 (10%)

Query: 3   KTRWICHLLVRFLAFAATLSAVIVMATSHERAS-FLAISF----EAKYTNTPAFKYFVIA 57
           K+  +  +++RF+ FAATL++++VM TS +  + FL  +      A++TN+PA  YFV+A
Sbjct: 24  KSCSMTQVVLRFVLFAATLTSIVVMVTSKQTKNIFLPGTPIRIPAAEFTNSPALIYFVVA 83

Query: 58  NSVVTVYGFLVFFLPAES---------LLWRLVVAMDLVFTMLLISSISAALTIAEVGKK 108
            SV   Y  +  F+   +         LL  L + MD V   ++ S+  A   +A +G K
Sbjct: 84  LSVACFYSIVSTFVTVSAFKKHSCSAVLLLNLAI-MDAVMVGIVASATGAGGGVAYLGLK 142

Query: 109 GNSYAAWLPICDSVPKFCDQVTGAL 133
           GN    W  IC    KFC  V GA+
Sbjct: 143 GNKEVRWGKICHIYDKFCRHVGGAI 167


>AT2G27370.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr2:11708628-11709905 REVERSE LENGTH=221
          Length = 221

 Score = 64.3 bits (155), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 42/121 (34%), Positives = 60/121 (49%), Gaps = 11/121 (9%)

Query: 39  ISFEAKYTNTPAFKYFVIANSVVTVYGFLVFFLPAE--SLLWRLVVA-------MDLVFT 89
           + F+A YT+ P    FVI NS+V   G+L   LP     +L  L V         D V  
Sbjct: 95  LQFQADYTDLPTMSSFVIVNSIVG--GYLTLSLPFSIVCILRPLAVPPRLFLILCDTVMM 152

Query: 90  MLLISSISAALTIAEVGKKGNSYAAWLPICDSVPKFCDQVTGALIAGFIAVIVYMILLLH 149
            L + + SA+  I  +   GNS + WLP+C     FC   +GA++A FIA  + M L++ 
Sbjct: 153 GLTLMAASASAAIVYLAHNGNSSSNWLPVCQQFGDFCQGTSGAVVASFIAATLLMFLVIL 212

Query: 150 S 150
           S
Sbjct: 213 S 213


>AT5G06200.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr5:1877333-1878116 FORWARD LENGTH=202
          Length = 202

 Score = 58.9 bits (141), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/142 (33%), Positives = 72/142 (50%), Gaps = 13/142 (9%)

Query: 27  MATSHERASFLA--ISFEAKYTNTPAFKYFVIANSVVTVYGFLVFFLPAE--SLLWRLVV 82
           M TS E   F    + FEA Y + P F++FV+A ++V   G+LV  LP    +++  L V
Sbjct: 63  MGTSDETLPFFTQFLQFEASYDDLPTFQFFVVAIAIVA--GYLVLSLPFSVVTIVRPLAV 120

Query: 83  AMDLVF-------TMLLISSISAALTIAEVGKKGNSYAAWLPICDSVPKFCDQVTGALIA 135
           A  L+          L  ++ SAA  I  +   GN+   WLPIC     FC + +GA+++
Sbjct: 121 APRLLLLVLDTAALALDTAAASAAAAIVYLAHNGNTNTNWLPICQQFGDFCQKTSGAVVS 180

Query: 136 GFIAVIVYMILLLHSIHSVLDP 157
            F +V    IL++ S  S+  P
Sbjct: 181 AFASVTFLAILVVISGVSLKRP 202


>AT3G06390.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr3:1938913-1939707 REVERSE LENGTH=199
          Length = 199

 Score = 56.6 bits (135), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 68/133 (51%), Gaps = 16/133 (12%)

Query: 9   HLLVRFLAFAATLSAVIVMATSH--ERASFLAIS----FEAKYTNTPAFKYFVIANSVVT 62
            ++ R L F+ATL+A+IVM TS   E      +S      A++ ++PAF YFV+A  V +
Sbjct: 38  DIITRVLLFSATLTALIVMVTSDQTEMTQLPGVSSPAPVSAEFNDSPAFIYFVVALVVAS 97

Query: 63  VYGFLVFFLPAESLLWR---------LVVAMDLVFTMLLISSISAALTIAEVGKKGNSYA 113
            Y  L+  L + SLL +          + ++D+V   +L S+   A  +A +  KGN   
Sbjct: 98  FYA-LISTLVSISLLLKPEFTAQFSIYLASLDMVMLGILASATGTAGGVAYIALKGNEEV 156

Query: 114 AWLPICDSVPKFC 126
            W  IC+   KFC
Sbjct: 157 GWNKICNVYDKFC 169


>AT4G15630.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr4:8917527-8918683 FORWARD LENGTH=190
          Length = 190

 Score = 55.1 bits (131), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 49/174 (28%), Positives = 79/174 (45%), Gaps = 24/174 (13%)

Query: 4   TRWICHLLVRFLAFAATLSAVIVMATSHERASF----------LAISFEAKYTNTPAFKY 53
           +R    L +R LA   T+ A  V+  + +              L +S  AK +   AF Y
Sbjct: 23  SRKGLELTMRVLALVLTMVAATVLGVAKQTKVVPIKLIPTLPPLNVSTTAKASYLSAFVY 82

Query: 54  FVIANSVVTVYGFLVFFL-------PAESLLWRLVVAMDLVFTMLLISSISAALTIAEVG 106
            + AN++   Y  +   +        ++SLL  +++  DL+   LL SS  AA  I  +G
Sbjct: 83  NISANAIACGYTAISIVIVMISKGKRSKSLLMAVLIG-DLMMVALLFSSTGAAGAIGLMG 141

Query: 107 KKGNSYAAWLPICDSVPKFCDQVTGALIAGFIAVIVYMILLLHSIHSVLDPLLL 160
           + GN +  W  +C    KFC+Q   ++    IA +V+M+L+      VLD L L
Sbjct: 142 RHGNKHVMWKKVCGVFGKFCNQAAVSVAITLIASVVFMLLV------VLDALKL 189


>AT3G11550.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr3:3638262-3639052 FORWARD LENGTH=204
          Length = 204

 Score = 52.8 bits (125), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 38/117 (32%), Positives = 56/117 (47%), Gaps = 13/117 (11%)

Query: 26  VMATSHERASFLA--ISFEAKYTNTPAFKYFVIANSVVTVYGFLVFFLPAE--SLLWRLV 81
            M TS E   F    + FEA Y + P F++FVIA ++V   G+LV  LP    ++L  L 
Sbjct: 64  TMGTSDETLPFFTQFLQFEASYDDLPTFQFFVIAMALVG--GYLVLSLPISVVTILRPLA 121

Query: 82  VAMDLVFTMLLISSIS-------AALTIAEVGKKGNSYAAWLPICDSVPKFCDQVTG 131
            A  L+  +L    ++       +A  I+ +   GN    WLPIC     FC + +G
Sbjct: 122 TAPRLLLLVLDTGVLALNTAAASSAAAISYLAHSGNQNTNWLPICQQFGDFCQKSSG 178


>AT1G17200.1 | Symbols:  | Uncharacterised protein family (UPF0497)
           | chr1:5878493-5879871 FORWARD LENGTH=204
          Length = 204

 Score = 49.7 bits (117), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 72/157 (45%), Gaps = 15/157 (9%)

Query: 11  LVRFLAFAATLSAVIVMATSHERASFLAISFEAKYTNTPAFKYFVIANSVVTVYGFL--- 67
           ++R       ++A++VM    E   F +IS    Y+N  AF+Y V AN +   Y  L   
Sbjct: 36  MLRLAPVGLCVAALVVMLKDSETNEFGSIS----YSNLTAFRYLVHANGICAGYSLLSAA 91

Query: 68  VFFLPAES----LLWRLVVAMDLVFTMLLISSISAALTIAEVGKKGNSYAAWLPICDSVP 123
           +  +P  S     +W     +D + T L++++ + +  +  +   G+S   W   C S  
Sbjct: 92  IAAMPRSSSTMPRVWTFFC-LDQLLTYLVLAAGAVSAEVLYLAYNGDSAITWSDACSSYG 150

Query: 124 KFCDQVTGALIAGFIAVIVYMILLL---HSIHSVLDP 157
            FC + T ++I  F  V  Y++L L   + + +  DP
Sbjct: 151 GFCHRATASVIITFFVVCFYIVLSLISSYKLFTRFDP 187