FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7806, 593 aa 1>>>pF1KB7806 593 - 593 aa - 593 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 9.0934+/-0.00088; mu= 6.5039+/- 0.054 mean_var=323.8910+/-65.712, 0's: 0 Z-trim(117.9): 24 B-trim: 194 in 1/52 Lambda= 0.071265 statistics sampled from 18680 (18703) to 18680 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.82), E-opt: 0.2 (0.575), width: 16 Scan time: 4.740 The best scores are: opt bits E(32554) CCDS12050.1 ARID3A gene_id:1820|Hs108|chr19 ( 593) 3939 418.4 1.3e-116 CCDS76777.1 ARID3B gene_id:10620|Hs108|chr15 ( 561) 1179 134.6 3.3e-31 CCDS10264.1 ARID3B gene_id:10620|Hs108|chr15 ( 560) 1170 133.7 6.2e-31 CCDS35006.1 ARID3C gene_id:138715|Hs108|chr9 ( 412) 841 99.7 7.7e-21 >>CCDS12050.1 ARID3A gene_id:1820|Hs108|chr19 (593 aa) initn: 3939 init1: 3939 opt: 3939 Z-score: 2207.1 bits: 418.4 E(32554): 1.3e-116 Smith-Waterman score: 3939; 99.8% identity (100.0% similar) in 593 aa overlap (1-593:1-593) 10 20 30 40 50 60 pF1KB7 MKLQAVMETLLQRQQRARQELEARQQLPPDPPAAPPGRARAAPDEDREPESARMQRAQMA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 MKLQAVMETLLQRQQRARQELEARQQLPPDPPAAPPGRARAAPDEDREPESARMQRAQMA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 ALAAMRAAAAGLGHPASPGGSEDGPPGSEEEDAAREGTPGSPGRGREGPGEEHFEDMASD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 ALAAMRAAAAGLGHPASPGGSEDGPPGSEEEDAAREGTPGSPGRGREGPGEEHFEDMASD 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 EDMKPKWEEEEMEEDLGEDEEEEEEDYEDEEEEEDEEGLGPPGPASLGTTALFPRKAQPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 EDMKPKWEEEEMEEDLGEDEEEEEEDYEDEEEEEDEEGLGPPGPASLGTTALFPRKAQPP 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 QAFRGDGVPRVLGGQERPGPGPAHPGGAAHVAPQLQPPDHGDWTYEEQFKQLYELDGDPK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 QAFRGDGVPRVLGGQERPGPGPAHPGGAAHVAPQLQPPDHGDWTYEEQFKQLYELDGDPK 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 RKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLFMLYVLVTEKGGLVEVINKKLWREITKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 RKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLFMLYVLVTEKGGLVEVINKKLWREITKG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 LNLPTSITSAAFTLRTQYMKYLYPYECEKRGLSNPNELQAAIDSNRREGRRQSFGGSLFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 LNLPTSITSAAFTLRTQYMKYLYPYECEKRGLSNPNELQAAIDSNRREGRRQSFGGSLFA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 YSPGGAHGMLSSPKLPVSSLGLAASTNGSSITPAPKIKKEEDSAIPITVPGRLPVSLAGH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 YSPGGAHGMLSSPKLPVSSLGLAASTNGSSITPAPKIKKEEDSAIPITVPGRLPVSLAGH 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB7 PVVAAQAAAVQAAAAQAAVAAQAAALEQLREKLESAEPPEKKMALVADEQQRLMQRALQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 PVVAAQAAAVQAAAAQAAVAAQAAALEQLREKLESAEPPEKKMALVADEQQRLMQRALQQ 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB7 NFLAMAAQLPMSIRINSQASESRQDSAVNLTGTNGSNSISMSVEINGIMYTGVLFAQPPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS12 NFLAMAAQLPMSIRINSQASESRQDSAVNLTGTNGSNSISMSVEINGIMYTGVLFAQPPA 490 500 510 520 530 540 550 560 570 580 590 pF1KB7 PTPTSAPNKGGGGGGSSSSNAGGRGGNTGTSGGQAGPAGLSTPSTSTSNNSLP :::::::::::::::.::::::::::::::::::::::::::::::::::::: CCDS12 PTPTSAPNKGGGGGGGSSSNAGGRGGNTGTSGGQAGPAGLSTPSTSTSNNSLP 550 560 570 580 590 >>CCDS76777.1 ARID3B gene_id:10620|Hs108|chr15 (561 aa) initn: 1157 init1: 778 opt: 1179 Z-score: 673.8 bits: 134.6 E(32554): 3.3e-31 Smith-Waterman score: 1231; 41.9% identity (63.2% similar) in 627 aa overlap (7-588:1-560) 10 20 30 40 50 60 pF1KB7 MKLQAVMETLLQRQQRARQELEARQQLPPDPPAAPPGRARAAPDEDREPESARMQRAQMA :: : :.::. .:: .: :: . :: .. .:..::. CCDS76 MEPLQQQQQQ-------QQQQQKQPHLAP------LQMDAREKQGQQMREAQF- 10 20 30 40 70 80 90 100 110 pF1KB7 ALAAMRAAAAGLGHPASPGGSEDGPPGSEEEDAAREGTPGSPGRGREGPGE---EHFEDM : :.. .. .:. ... : :: :: .: .: : . :: CCDS76 -LYAQKLVT----QPTLLSATAGRPSGS---------TPLGP-LARVPPTAAVAQVFER- 50 60 70 80 120 130 140 150 160 170 pF1KB7 ASDEDMKPKWEEEEMEEDLGEDEEEEEEDYEDEEEEE--DEEGLGPPGPASLGTTALFPR .. . .:. :. .:.. :.:: : . : . . . .. : . :.: CCDS76 -GNMNSEPEEEDGGLEDEDGDDEVAEVAEKETQAASKYFHVQKVARQDPRVAPMSNLLPA 90 100 110 120 130 140 180 190 200 210 220 230 pF1KB7 KAQPPQAFRGDGVPRVLGGQERPGPGPAHPGGAAHVAPQLQPPDHGDWTYEEQFKQ---- . ::. : : . : :....:... . .:. .::.:: CCDS76 PGLPPH-----------GQQAKED----HTKDASKASPSVSTAGQPNWNLDEQLKQNGGL 150 160 170 180 240 250 260 270 pF1KB7 --------------------LYELDGDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLD ::::::::.::::::::: ::::::::.:::::::::.:: CCDS76 AWSDDADGGRGREISRDFAKLYELDGDPERKEFLDDLFVFMQKRGTPINRIPIMAKQILD 190 200 210 220 230 240 280 290 300 310 320 330 pF1KB7 LFMLYVLVTEKGGLVEVINKKLWREITKGLNLPTSITSAAFTLRTQYMKYLYPYECEKRG :.::: ::::::::::.::::.:::::::::::::::::::::::::::::: :::::.. CCDS76 LYMLYKLVTEKGGLVEIINKKIWREITKGLNLPTSITSAAFTLRTQYMKYLYAYECEKKA 250 260 270 280 290 300 340 350 360 370 380 pF1KB7 LSNPNELQAAIDSNRREGRRQSFGGSLFAYSPG--------GAHGMLSSPKL--PVSSLG ::.: :::::::.::::::: :...:::.:::. :: ..:: ::. :. .:: CCDS76 LSSPAELQAAIDGNRREGRRPSYSSSLFGYSPAAATAAAAAGAPALLSPPKIRFPILGLG 310 320 330 340 350 360 390 400 410 420 430 pF1KB7 LAASTNGSS--ITPAPKIKKEEDSAIP-ITVPGRLPVSLAGHPVVAAQAAAVQAAAAQAA ...:: :: :.:: ..: . . . . ::.:: : ::. :. : CCDS76 SSSGTNTSSPRISPATTLRKGDGAPVTTVPVPNRLAV-----PVTLASQQA--------- 370 380 390 400 410 440 450 460 470 480 490 pF1KB7 VAAQAAALEQLREKLESAEPPEKKMALVADEQQRLMQRALQQNFLAMAAQLPMSIRINSQ ....::::::::.:::.:: ::: . ...:.:::.:.:.:.::..:: ::::.::::.. CCDS76 -GTRTAALEQLRERLESGEPAEKKASRLSEEEQRLVQQAFQRNFFSMARQLPMKIRINGR 420 430 440 450 460 470 500 510 520 530 540 550 pF1KB7 ASESRQDSAVNLTGTNGS-NSISMSVEINGIMYTGVLFAQPPAP--TPTSAPNKGGGGGG : . . ::. :. :..: .::.:::.:.: :.:::::: :. ::: .. :... CCDS76 AEDRAEASAAALNLTTSSIGSINMSVDIDGTTYAGVLFAQKPVVHLITGSAP-QSLGSSA 480 490 500 510 520 530 560 570 580 590 pF1KB7 SSSSNAGGRGGNTGTSGGQAGPAGLSTPSTSTSNNSLP ::::.. . :.. : :. . :::: : CCDS76 SSSSSSHCSPSPTSSRGT---PS--AEPSTSWSL 540 550 560 >>CCDS10264.1 ARID3B gene_id:10620|Hs108|chr15 (560 aa) initn: 1145 init1: 778 opt: 1170 Z-score: 668.8 bits: 133.7 E(32554): 6.2e-31 Smith-Waterman score: 1222; 41.8% identity (63.5% similar) in 627 aa overlap (7-588:1-559) 10 20 30 40 50 60 pF1KB7 MKLQAVMETLLQRQQRARQELEARQQLPPDPPAAPPGRARAAPDEDREPESARMQRAQMA :: : :.::. .:: .: :: . :: .. .:..::. CCDS10 MEPLQQQQQQ-------QQQQQKQPHLAP------LQMDAREKQGQQMREAQF- 10 20 30 40 70 80 90 100 110 pF1KB7 ALAAMRAAAAGLGHPASPGGSEDGPPGSEEEDAAREGTPGSPGRGREGPGE---EHFEDM : :.. .. .:. ... : :: :: .: .: : . :: CCDS10 -LYAQKLVT----QPTLLSATAGRPSGS---------TPLGP-LARVPPTAAVAQVFER- 50 60 70 80 120 130 140 150 160 170 pF1KB7 ASDEDMKPKWEEEEMEEDLGEDEEEEEEDYEDEEEEE--DEEGLGPPGPASLGTTALFPR .. . .:. :. .:.. :.:: : . : . . . .. : . :.: CCDS10 -GNMNSEPEEEDGGLEDEDGDDEVAEVAEKETQAASKYFHVQKVARQDPRVAPMSNLLPA 90 100 110 120 130 140 180 190 200 210 220 230 pF1KB7 KAQPPQAFRGDGVPRVLGGQERPGPGPAHPGGAAHVAPQLQPPDHGDWTYEEQFKQ---- . ::. : : . : :....:... . .:. .::.:: CCDS10 PGLPPH-----------GQQAKED----HTKDASKASPSVSTAGQPNWNLDEQLKQNGGL 150 160 170 180 240 250 260 270 pF1KB7 --------------------LYELDGDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLD ::::::::.::::::::: ::::::::.:::::::::.:: CCDS10 AWSDDADGGRGREISRDFAKLYELDGDPERKEFLDDLFVFMQKRGTPINRIPIMAKQILD 190 200 210 220 230 240 280 290 300 310 320 330 pF1KB7 LFMLYVLVTEKGGLVEVINKKLWREITKGLNLPTSITSAAFTLRTQYMKYLYPYECEKRG :.::: ::::::::::.::::.:::::::::::::::::::::::::::::: :::::.. CCDS10 LYMLYKLVTEKGGLVEIINKKIWREITKGLNLPTSITSAAFTLRTQYMKYLYAYECEKKA 250 260 270 280 290 300 340 350 360 370 380 pF1KB7 LSNPNELQAAIDSNRREGRRQSFGGSLFAYSPG--------GAHGMLSSPKL--PVSSLG ::.: :::::::.::::::: :...:::.:::. :: ..:: ::. :. .:: CCDS10 LSSPAELQAAIDGNRREGRRPSYSSSLFGYSPAAATAAAAAGAPALLSPPKIRFPILGLG 310 320 330 340 350 360 390 400 410 420 430 pF1KB7 LAASTNGSS--ITPAPKIKKEEDSAIP-ITVPGRLPVSLAGHPVVAAQAAAVQAAAAQAA ...:: :: :.:: ..: . . . . ::.:: : ::. :. : CCDS10 SSSGTNTSSPRISPATTLRKGDGAPVTTVPVPNRLAV-----PVTLASQQA--------- 370 380 390 400 410 440 450 460 470 480 490 pF1KB7 VAAQAAALEQLREKLESAEPPEKKMALVADEQQRLMQRALQQNFLAMAAQLPMSIRINSQ ....::::::::.:::.:: ::: . ...:.:::.:.:.:.::..:: ::::.::::.. CCDS10 -GTRTAALEQLRERLESGEPAEKKASRLSEEEQRLVQQAFQRNFFSMARQLPMKIRINGR 420 430 440 450 460 470 500 510 520 530 540 550 pF1KB7 ASESRQDSA-VNLTGTNGSNSISMSVEINGIMYTGVLFAQPPAP--TPTSAPNKGGGGGG .... ..: .::: :.. .::.:::.:.: :.:::::: :. ::: .. :... CCDS10 EDRAEASAAALNLT-TSSIGSINMSVDIDGTTYAGVLFAQKPVVHLITGSAP-QSLGSSA 480 490 500 510 520 530 560 570 580 590 pF1KB7 SSSSNAGGRGGNTGTSGGQAGPAGLSTPSTSTSNNSLP ::::.. . :.. : :. . :::: : CCDS10 SSSSSSHCSPSPTSSRGT---PS--AEPSTSWSL 540 550 560 >>CCDS35006.1 ARID3C gene_id:138715|Hs108|chr9 (412 aa) initn: 1145 init1: 779 opt: 841 Z-score: 487.6 bits: 99.7 E(32554): 7.7e-21 Smith-Waterman score: 1007; 46.3% identity (63.5% similar) in 447 aa overlap (145-547:2-406) 120 130 140 150 160 170 pF1KB7 EDMASDEDMKPKWEEEEMEEDLGEDEEEEEEDYEDEEEEEDEEGLGPPGPAS--LGTTAL : . .. . .:.:: .:: : CCDS35 MEALQKQQAARLAQGVGPLAPACPLLPPQPP 10 20 30 180 190 200 210 pF1KB7 FP--RKAQPPQAFRG------------DGVPRVLGGQERPGPGPAHPGGAAHVAPQLQPP .: : : :.. : : : .: :. . ..::. . .:. ::: CCDS35 LPDHRTLQAPEGALGNVGAEEEEDAEEDEEKREEAGAEEEAAEESRPGAQGPSSPSSQPP 40 50 60 70 80 90 220 230 240 250 260 270 pF1KB7 D-HG-DWTYEEQFKQLYELDGDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLFMLY : .::::::::::::::.::::::::::::::::::::::::.::::::::::. :. CCDS35 GLHPHEWTYEEQFKQLYELDADPKRKEFLDDLFSFMQKRGTPVNRVPIMAKQVLDLYALF 100 110 120 130 140 150 280 290 300 310 320 330 pF1KB7 VLVTEKGGLVEVINKKLWREITKGLNLPTSITSAAFTLRTQYMKYLYPYECEKRGLSNPN ::: :::::::::.:.:::.:.::.:::.:::::::::::::::::::::: :.::.:. CCDS35 RLVTAKGGLVEVINRKVWREVTRGLSLPTTITSAAFTLRTQYMKYLYPYECETRALSSPG 160 170 180 190 200 210 340 350 360 370 380 pF1KB7 ELQAAIDSNRREGRRQSFGGS-LFAYS---PGGAH----GMLSSPKLPVSSLGLA-ASTN ::::::::::::::::.. .. ::. . : ::. : .: :: : : .::. CCDS35 ELQAAIDSNRREGRRQAYTATPLFGLAGPPPRGAQDPALGPGPAPPATQSSPGPAQGSTS 220 230 240 250 260 270 390 400 410 420 430 440 pF1KB7 G------SSITPAPKIKKEEDSAIPITVPGRLPVSLAGHPVVAAQAAAVQAAAAQAAVAA : ....:.: ::::: :.:: . :::.:: :. CCDS35 GLPAHACAQLSPSP-IKKEE-SGIPNPCLA-LPVGLALGPT------------------- 280 290 300 450 460 470 480 490 500 pF1KB7 QAAALEQLREKLESAEPPEKKMALVADEQQRLMQRALQQNFLAMAAQLPMSIRINSQASE :::: :::::. .:.. . . . .:: . ..:. : CCDS35 --------REKLAPEEPPEKRAVLMGPMDPP--RPCMPPSFLPRG-KVPLR--------E 310 320 330 340 350 510 520 530 540 550 pF1KB7 SRQDSAVNLTGTNGSNSISMSVEINGIMYTGVLFA--QP---------PAPTPTSAPNKG : :. .::.:. : .::.:..::::..::::::: :: ::: :...: CCDS35 ERLDGPLNLAGS-GISSINMALEINGVVYTGVLFARRQPVPASQGPTNPAPPPSTGPPSS 360 370 380 390 400 560 570 580 590 pF1KB7 GGGGGSSSSNAGGRGGNTGTSGGQAGPAGLSTPSTSTSNNSLP CCDS35 ILP 410 593 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 06:13:53 2016 done: Sun Nov 6 06:13:53 2016 Total Scan time: 4.740 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]