Miyakogusa Predicted Gene
- Lj1g3v4515780.1
BLASTP 2.2.25 [Feb-01-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Lj1g3v4515780.1 Non Chatacterized Hit- tr|C6SZB1|C6SZB1_SOYBN
Putative uncharacterized protein OS=Glycine max PE=2
S,39.75,3e-17,RETINOIC ACID INDUCED 1/TRANSCRIPTION FACTOR 20,NULL;
zf-HC5HC2H,NULL,gene.g36666.t1.1
(407 letters)
Database: TAIR10_pep
35,386 sequences; 14,482,855 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
AT1G04020.2 | Symbols: ATBARD1, BARD1 | breast cancer associated... 211 6e-55
AT1G04020.1 | Symbols: ATBARD1, BARD1, ROW1 | breast cancer asso... 210 1e-54
AT4G21070.1 | Symbols: ATBRCA1, BRCA1 | breast cancer susceptibi... 150 2e-36
AT3G15120.1 | Symbols: | P-loop containing nucleoside triphosph... 61 1e-09
>AT1G04020.2 | Symbols: ATBARD1, BARD1 | breast cancer associated
RING 1 | chr1:1036610-1040045 FORWARD LENGTH=713
Length = 713
Score = 211 bits (538), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 163/471 (34%), Positives = 215/471 (45%), Gaps = 112/471 (23%)
Query: 13 LMNPWMLHFQKLALE---------------LKCP--LC-SCLVDSSLSGSECAVCKTKYA 54
LMNPW+LH QKL LE L C C SC+ SS S C VCK+K+
Sbjct: 8 LMNPWVLHLQKLELELKCPLCLKLLNRPVLLPCDHVFCDSCVHKSSQVESGCPVCKSKHP 67
Query: 55 QTD--DSRVLQ------------------QCQTFRDSSYSNIKKADNFSQSSPNSNGFGV 94
+ D R ++ Q Q D +Y N + NSN
Sbjct: 68 KKGKRDLRFMESVISIYKSLNAAVSVHLPQLQIPNDCNYKN--------DALNNSNSPKH 119
Query: 95 GENRKSMITMHVKPEELEMSSGGRAGFRNDVKPYPMQRS---RVEIGDYVE--------- 142
GE+ S +T +++ SGG D P P R + D+ E
Sbjct: 120 GESEDSEMT----DKDVSKRSGGTDSSSRDGSPLPTSEESDPRPKHQDWTEKQLSDHLLL 175
Query: 143 MDVNQVTQAAVYSPPFCDTKGSDNDCSELDSDHRASTGKGNLKERK---SQFRSESSAS- 198
+ AA ++P + + N D AS N ++ F ESS +
Sbjct: 176 YEFESEYDAANHTPESYTEQAAKN-----VRDITASEQPSNAARKRICGDSFIQESSPNP 230
Query: 199 ETDKPTRDLKRKKYLTKGDDHIQHVSTHHSKLVDSHCGLD----------------LKSG 242
+T PT L R + DD +V + +L SH D LK
Sbjct: 231 KTQDPT--LLRLMESLRSDDPTDYVKAQNHQLPKSHTEQDSKRKRDITASDAMENHLKVP 288
Query: 243 KEPGELLPANIPIDLN-----------------------PSTSICSFCQSSETSEATGPM 279
K L+ + ID N + +IC FCQS+ SEATG M
Sbjct: 289 KRENNLMQKSADIDCNGKCSANSDDQLSEKISKALEQTSSNITICGFCQSARVSEATGEM 348
Query: 280 LHYANGKSVIGDAAMQPNVIHVHRCCVDWAPQVYFVDETCKNLKAEVARGAKLKCSTCGL 339
LHY+ G+ V GD + NVIHVH C++WAPQVY+ +T KNLKAE+ARG K+KC+ C L
Sbjct: 349 LHYSRGRPVDGDDIFRSNVIHVHSACIEWAPQVYYEGDTVKNLKAELARGMKIKCTKCSL 408
Query: 340 KGAALGCYVKSCKRTYHVPCAMDVSTCRWDQEKYLLLCPVHSNAKFPHEKS 390
KGAALGC+VKSC+R+YHVPCA ++S CRWD E +LLLCP HS+ KFP+EKS
Sbjct: 409 KGAALGCFVKSCRRSYHVPCAREISRCRWDYEDFLLLCPAHSSVKFPNEKS 459
>AT1G04020.1 | Symbols: ATBARD1, BARD1, ROW1 | breast cancer
associated RING 1 | chr1:1036610-1040045 FORWARD
LENGTH=714
Length = 714
Score = 210 bits (535), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 164/472 (34%), Positives = 217/472 (45%), Gaps = 113/472 (23%)
Query: 13 LMNPWMLHFQKLALE---------------LKCP--LC-SCLVDSSLSGSECAVCKTKYA 54
LMNPW+LH QKL LE L C C SC+ SS S C VCK+K+
Sbjct: 8 LMNPWVLHLQKLELELKCPLCLKLLNRPVLLPCDHVFCDSCVHKSSQVESGCPVCKSKHP 67
Query: 55 QT--DDSRVLQ------------------QCQTFRDSSYSNIKKADNFSQSSPNSNGFGV 94
+ D R ++ Q Q D +Y N + NSN
Sbjct: 68 KKARRDLRFMESVISIYKSLNAAVSVHLPQLQIPNDCNYKN--------DALNNSNSPKH 119
Query: 95 GENRKSMITMHVKPEELEMSSGGRAGFRNDVKPYPMQRS---RVEIGDYVE--------- 142
GE+ S +T +++ SGG D P P R + D+ E
Sbjct: 120 GESEDSEMT----DKDVSKRSGGTDSSSRDGSPLPTSEESDPRPKHQDWTEKQLSDHLLL 175
Query: 143 MDVNQVTQAAVYSPPFCDTKGSDNDCSELDSDHRASTGKGNLKERK---SQFRSESSAS- 198
+ AA ++P + + N D AS N ++ F ESS +
Sbjct: 176 YEFESEYDAANHTPESYTEQAAKN-----VRDITASEQPSNAARKRICGDSFIQESSPNP 230
Query: 199 ETDKPTRDLKRKKYLTKGDDHIQHV-STHHSKLVDSHCGLD----------------LKS 241
+T PT L R + DD +V + +H +L SH D LK
Sbjct: 231 KTQDPT--LLRLMESLRSDDPTDYVKAQNHQQLPKSHTEQDSKRKRDITASDAMENHLKV 288
Query: 242 GKEPGELLPANIPIDLN-----------------------PSTSICSFCQSSETSEATGP 278
K L+ + ID N + +IC FCQS+ SEATG
Sbjct: 289 PKRENNLMQKSADIDCNGKCSANSDDQLSEKISKALEQTSSNITICGFCQSARVSEATGE 348
Query: 279 MLHYANGKSVIGDAAMQPNVIHVHRCCVDWAPQVYFVDETCKNLKAEVARGAKLKCSTCG 338
MLHY+ G+ V GD + NVIHVH C++WAPQVY+ +T KNLKAE+ARG K+KC+ C
Sbjct: 349 MLHYSRGRPVDGDDIFRSNVIHVHSACIEWAPQVYYEGDTVKNLKAELARGMKIKCTKCS 408
Query: 339 LKGAALGCYVKSCKRTYHVPCAMDVSTCRWDQEKYLLLCPVHSNAKFPHEKS 390
LKGAALGC+VKSC+R+YHVPCA ++S CRWD E +LLLCP HS+ KFP+EKS
Sbjct: 409 LKGAALGCFVKSCRRSYHVPCAREISRCRWDYEDFLLLCPAHSSVKFPNEKS 460
>AT4G21070.1 | Symbols: ATBRCA1, BRCA1 | breast cancer
susceptibility1 | chr4:11248174-11252633 FORWARD
LENGTH=941
Length = 941
Score = 150 bits (378), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 67/138 (48%), Positives = 89/138 (64%)
Query: 264 CSFCQSSETSEATGPMLHYANGKSVIGDAAMQPNVIHVHRCCVDWAPQVYFVDETCKNLK 323
C+FCQ SE +EA+G M HY G+ V D VIHVH+ C +WAP VYF D T NL
Sbjct: 564 CAFCQCSEDTEASGEMTHYYRGEPVSADFNGGSKVIHVHKNCAEWAPNVYFNDLTIVNLD 623
Query: 324 AEVARGAKLKCSTCGLKGAALGCYVKSCKRTYHVPCAMDVSTCRWDQEKYLLLCPVHSNA 383
E+ R ++ CS CGLKGAALGCY KSCK ++HV CA + CRWD K+++LCP+ ++
Sbjct: 624 VELTRSRRISCSCCGLKGAALGCYNKSCKNSFHVTCAKLIPECRWDNVKFVMLCPLDASI 683
Query: 384 KFPHEKSRPKKQATQEHP 401
K P E++ K + + P
Sbjct: 684 KLPCEEANSKDRKCKRTP 701
>AT3G15120.1 | Symbols: | P-loop containing nucleoside triphosphate
hydrolases superfamily protein | chr3:5088487-5095482
REVERSE LENGTH=1954
Length = 1954
Score = 61.2 bits (147), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/88 (37%), Positives = 45/88 (51%), Gaps = 12/88 (13%)
Query: 301 VHRCCVDWAPQVYFVDETC-KNLKAEVARGAKLKCSTCGLKGAALGCYVKSCKRTYHVPC 359
VH+ C W+P+VYF C KN++A + RG LKC+ C GA GC PC
Sbjct: 560 VHQNCAVWSPEVYFAGVGCLKNIRAALFRGRSLKCTRCDRPGATTGCR----------PC 609
Query: 360 AMDVSTCRWDQEKYLLLCPVHSNAKFPH 387
A + C +D K+L+ C H + PH
Sbjct: 610 AR-ANGCIFDHRKFLIACTDHRHHFQPH 636