Miyakogusa Predicted Gene

Lj1g3v4515780.1
BLASTP 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Lj1g3v4515780.1 Non Characterized Hit- tr|C6SZB1|C6SZB1_SOYBN
Putative uncharacterized protein OS=Glycine max PE=2
S,39.75,3e-17,RETINOIC ACID INDUCED 1/TRANSCRIPTION FACTOR 20,NULL;
zf-HC5HC2H,NULL,gene.g36666.t1.1
         (407 letters)

Database: TAIR10_pep 
           35,386 sequences; 14,482,855 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

AT1G04020.2 | Symbols: ATBARD1, BARD1 | breast cancer associated...   211   6e-55
AT1G04020.1 | Symbols: ATBARD1, BARD1, ROW1 | breast cancer asso...   210   1e-54
AT4G21070.1 | Symbols: ATBRCA1, BRCA1 | breast cancer susceptibi...   150   2e-36
AT3G15120.1 | Symbols:  | P-loop containing nucleoside triphosph...    61   1e-09

>AT1G04020.2 | Symbols: ATBARD1, BARD1 | breast cancer associated
           RING 1 | chr1:1036610-1040045 FORWARD LENGTH=713
          Length = 713

 Score =  211 bits (538), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 163/471 (34%), Positives = 215/471 (45%), Gaps = 112/471 (23%)

Query: 13  LMNPWMLHFQKLALE---------------LKCP--LC-SCLVDSSLSGSECAVCKTKYA 54
           LMNPW+LH QKL LE               L C    C SC+  SS   S C VCK+K+ 
Sbjct: 8   LMNPWVLHLQKLELELKCPLCLKLLNRPVLLPCDHVFCDSCVHKSSQVESGCPVCKSKHP 67

Query: 55  QTD--DSRVLQ------------------QCQTFRDSSYSNIKKADNFSQSSPNSNGFGV 94
           +    D R ++                  Q Q   D +Y N         +  NSN    
Sbjct: 68  KKGKRDLRFMESVISIYKSLNAAVSVHLPQLQIPNDCNYKN--------DALNNSNSPKH 119

Query: 95  GENRKSMITMHVKPEELEMSSGGRAGFRNDVKPYPMQRS---RVEIGDYVE--------- 142
           GE+  S +T     +++   SGG      D  P P       R +  D+ E         
Sbjct: 120 GESEDSEMT----DKDVSKRSGGTDSSSRDGSPLPTSEESDPRPKHQDWTEKQLSDHLLL 175

Query: 143 MDVNQVTQAAVYSPPFCDTKGSDNDCSELDSDHRASTGKGNLKERK---SQFRSESSAS- 198
            +      AA ++P     + + N       D  AS    N   ++     F  ESS + 
Sbjct: 176 YEFESEYDAANHTPESYTEQAAKN-----VRDITASEQPSNAARKRICGDSFIQESSPNP 230

Query: 199 ETDKPTRDLKRKKYLTKGDDHIQHVSTHHSKLVDSHCGLD----------------LKSG 242
           +T  PT  L R     + DD   +V   + +L  SH   D                LK  
Sbjct: 231 KTQDPT--LLRLMESLRSDDPTDYVKAQNHQLPKSHTEQDSKRKRDITASDAMENHLKVP 288

Query: 243 KEPGELLPANIPIDLN-----------------------PSTSICSFCQSSETSEATGPM 279
           K    L+  +  ID N                        + +IC FCQS+  SEATG M
Sbjct: 289 KRENNLMQKSADIDCNGKCSANSDDQLSEKISKALEQTSSNITICGFCQSARVSEATGEM 348

Query: 280 LHYANGKSVIGDAAMQPNVIHVHRCCVDWAPQVYFVDETCKNLKAEVARGAKLKCSTCGL 339
           LHY+ G+ V GD   + NVIHVH  C++WAPQVY+  +T KNLKAE+ARG K+KC+ C L
Sbjct: 349 LHYSRGRPVDGDDIFRSNVIHVHSACIEWAPQVYYEGDTVKNLKAELARGMKIKCTKCSL 408

Query: 340 KGAALGCYVKSCKRTYHVPCAMDVSTCRWDQEKYLLLCPVHSNAKFPHEKS 390
           KGAALGC+VKSC+R+YHVPCA ++S CRWD E +LLLCP HS+ KFP+EKS
Sbjct: 409 KGAALGCFVKSCRRSYHVPCAREISRCRWDYEDFLLLCPAHSSVKFPNEKS 459


>AT1G04020.1 | Symbols: ATBARD1, BARD1, ROW1 | breast cancer
           associated RING 1 | chr1:1036610-1040045 FORWARD
           LENGTH=714
          Length = 714

 Score =  210 bits (535), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 164/472 (34%), Positives = 217/472 (45%), Gaps = 113/472 (23%)

Query: 13  LMNPWMLHFQKLALE---------------LKCP--LC-SCLVDSSLSGSECAVCKTKYA 54
           LMNPW+LH QKL LE               L C    C SC+  SS   S C VCK+K+ 
Sbjct: 8   LMNPWVLHLQKLELELKCPLCLKLLNRPVLLPCDHVFCDSCVHKSSQVESGCPVCKSKHP 67

Query: 55  QT--DDSRVLQ------------------QCQTFRDSSYSNIKKADNFSQSSPNSNGFGV 94
           +    D R ++                  Q Q   D +Y N         +  NSN    
Sbjct: 68  KKARRDLRFMESVISIYKSLNAAVSVHLPQLQIPNDCNYKN--------DALNNSNSPKH 119

Query: 95  GENRKSMITMHVKPEELEMSSGGRAGFRNDVKPYPMQRS---RVEIGDYVE--------- 142
           GE+  S +T     +++   SGG      D  P P       R +  D+ E         
Sbjct: 120 GESEDSEMT----DKDVSKRSGGTDSSSRDGSPLPTSEESDPRPKHQDWTEKQLSDHLLL 175

Query: 143 MDVNQVTQAAVYSPPFCDTKGSDNDCSELDSDHRASTGKGNLKERK---SQFRSESSAS- 198
            +      AA ++P     + + N       D  AS    N   ++     F  ESS + 
Sbjct: 176 YEFESEYDAANHTPESYTEQAAKN-----VRDITASEQPSNAARKRICGDSFIQESSPNP 230

Query: 199 ETDKPTRDLKRKKYLTKGDDHIQHV-STHHSKLVDSHCGLD----------------LKS 241
           +T  PT  L R     + DD   +V + +H +L  SH   D                LK 
Sbjct: 231 KTQDPT--LLRLMESLRSDDPTDYVKAQNHQQLPKSHTEQDSKRKRDITASDAMENHLKV 288

Query: 242 GKEPGELLPANIPIDLN-----------------------PSTSICSFCQSSETSEATGP 278
            K    L+  +  ID N                        + +IC FCQS+  SEATG 
Sbjct: 289 PKRENNLMQKSADIDCNGKCSANSDDQLSEKISKALEQTSSNITICGFCQSARVSEATGE 348

Query: 279 MLHYANGKSVIGDAAMQPNVIHVHRCCVDWAPQVYFVDETCKNLKAEVARGAKLKCSTCG 338
           MLHY+ G+ V GD   + NVIHVH  C++WAPQVY+  +T KNLKAE+ARG K+KC+ C 
Sbjct: 349 MLHYSRGRPVDGDDIFRSNVIHVHSACIEWAPQVYYEGDTVKNLKAELARGMKIKCTKCS 408

Query: 339 LKGAALGCYVKSCKRTYHVPCAMDVSTCRWDQEKYLLLCPVHSNAKFPHEKS 390
           LKGAALGC+VKSC+R+YHVPCA ++S CRWD E +LLLCP HS+ KFP+EKS
Sbjct: 409 LKGAALGCFVKSCRRSYHVPCAREISRCRWDYEDFLLLCPAHSSVKFPNEKS 460


>AT4G21070.1 | Symbols: ATBRCA1, BRCA1 | breast cancer
           susceptibility1 | chr4:11248174-11252633 FORWARD
           LENGTH=941
          Length = 941

 Score =  150 bits (378), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 67/138 (48%), Positives = 89/138 (64%)

Query: 264 CSFCQSSETSEATGPMLHYANGKSVIGDAAMQPNVIHVHRCCVDWAPQVYFVDETCKNLK 323
           C+FCQ SE +EA+G M HY  G+ V  D      VIHVH+ C +WAP VYF D T  NL 
Sbjct: 564 CAFCQCSEDTEASGEMTHYYRGEPVSADFNGGSKVIHVHKNCAEWAPNVYFNDLTIVNLD 623

Query: 324 AEVARGAKLKCSTCGLKGAALGCYVKSCKRTYHVPCAMDVSTCRWDQEKYLLLCPVHSNA 383
            E+ R  ++ CS CGLKGAALGCY KSCK ++HV CA  +  CRWD  K+++LCP+ ++ 
Sbjct: 624 VELTRSRRISCSCCGLKGAALGCYNKSCKNSFHVTCAKLIPECRWDNVKFVMLCPLDASI 683

Query: 384 KFPHEKSRPKKQATQEHP 401
           K P E++  K +  +  P
Sbjct: 684 KLPCEEANSKDRKCKRTP 701


>AT3G15120.1 | Symbols:  | P-loop containing nucleoside triphosphate
           hydrolases superfamily protein | chr3:5088487-5095482
           REVERSE LENGTH=1954
          Length = 1954

 Score = 61.2 bits (147), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 12/88 (13%)

Query: 301 VHRCCVDWAPQVYFVDETC-KNLKAEVARGAKLKCSTCGLKGAALGCYVKSCKRTYHVPC 359
           VH+ C  W+P+VYF    C KN++A + RG  LKC+ C   GA  GC           PC
Sbjct: 560 VHQNCAVWSPEVYFAGVGCLKNIRAALFRGRSLKCTRCDRPGATTGCR----------PC 609

Query: 360 AMDVSTCRWDQEKYLLLCPVHSNAKFPH 387
           A   + C +D  K+L+ C  H +   PH
Sbjct: 610 AR-ANGCIFDHRKFLIACTDHRHHFQPH 636