FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4536, 603 aa 1>>>pF1KE4536 603 - 603 aa - 603 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.3478+/-0.000845; mu= 19.0262+/- 0.051 mean_var=67.4513+/-13.513, 0's: 0 Z-trim(106.6): 17 B-trim: 253 in 1/54 Lambda= 0.156163 statistics sampled from 9055 (9067) to 9055 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.649), E-opt: 0.2 (0.279), width: 16 Scan time: 3.050 The best scores are: opt bits E(32554) CCDS6977.2 DBH gene_id:1621|Hs108|chr9 ( 617) 4137 941.2 0 CCDS5152.2 MOXD1 gene_id:26002|Hs108|chr6 ( 613) 1053 246.4 8.1e-65 >>CCDS6977.2 DBH gene_id:1621|Hs108|chr9 (617 aa) initn: 4137 init1: 4137 opt: 4137 Z-score: 5032.2 bits: 941.2 E(32554): 0 Smith-Waterman score: 4137; 100.0% identity (100.0% similar) in 603 aa overlap (1-603:15-617) 10 20 30 40 pF1KE4 MREAAFMYSTAVAIFLVILVAALQGSAPRESPLPYHIPLDPEGSLE :::::::::::::::::::::::::::::::::::::::::::::: CCDS69 MPALSRWASLPGPSMREAAFMYSTAVAIFLVILVAALQGSAPRESPLPYHIPLDPEGSLE 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE4 LSWNVSYTQEAIHFQLLVRRLKAGVLFGMSDRGELENADLVVLWTDGDTAYFADAWSDQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 LSWNVSYTQEAIHFQLLVRRLKAGVLFGMSDRGELENADLVVLWTDGDTAYFADAWSDQK 70 80 90 100 110 120 110 120 130 140 150 160 pF1KE4 GQIHLDPQQDYQLLQVQRTPEGLTLLFKRPFGTCDPKDYLIEDGTVHLVYGILEEPFRSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 GQIHLDPQQDYQLLQVQRTPEGLTLLFKRPFGTCDPKDYLIEDGTVHLVYGILEEPFRSL 130 140 150 160 170 180 170 180 190 200 210 220 pF1KE4 EAINGSGLQMGLQRVQLLKPNIPEPELPSDACTMEVQAPNIQIPSQETTYWCYIKELPKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 EAINGSGLQMGLQRVQLLKPNIPEPELPSDACTMEVQAPNIQIPSQETTYWCYIKELPKG 190 200 210 220 230 240 230 240 250 260 270 280 pF1KE4 FSRHHIIKYEPIVTKGNEALVHHMEVFQCAPEMDSVPHFSGPCDSKMKPDRLNYCRHVLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 FSRHHIIKYEPIVTKGNEALVHHMEVFQCAPEMDSVPHFSGPCDSKMKPDRLNYCRHVLA 250 260 270 280 290 300 290 300 310 320 330 340 pF1KE4 AWALGAKAFYYPEEAGLAFGGPGSSRYLRLEVHYHNPLVIEGRNDSSGIRLYYTAKLRRF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 AWALGAKAFYYPEEAGLAFGGPGSSRYLRLEVHYHNPLVIEGRNDSSGIRLYYTAKLRRF 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE4 NAGIMELGLVYTPVMAIPPRETAFILTGYCTDKCTQLALPPSGIHIFASQLHTHLTGRKV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 NAGIMELGLVYTPVMAIPPRETAFILTGYCTDKCTQLALPPSGIHIFASQLHTHLTGRKV 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE4 VTVLVRDGREWEIVNQDNHYSPHFQEIRMLKKVVSVHPGDVLITSCTYNTEDRELATVGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 VTVLVRDGREWEIVNQDNHYSPHFQEIRMLKKVVSVHPGDVLITSCTYNTEDRELATVGG 430 440 450 460 470 480 470 480 490 500 510 520 pF1KE4 FGILEEMCVNYVHYYPQTQLELCKSAVDAGFLQKYFHLINRFNNEDVCTCPQASVSQQFT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 FGILEEMCVNYVHYYPQTQLELCKSAVDAGFLQKYFHLINRFNNEDVCTCPQASVSQQFT 490 500 510 520 530 540 530 540 550 560 570 580 pF1KE4 SVPWNSFNRDVLKALYSFAPISMHCNKSSAVRFQGEWNLQPLPKVISTLEEPTPQCPTSQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS69 SVPWNSFNRDVLKALYSFAPISMHCNKSSAVRFQGEWNLQPLPKVISTLEEPTPQCPTSQ 550 560 570 580 590 600 590 600 pF1KE4 GRSPAGPTVVSIGGGKG ::::::::::::::::: CCDS69 GRSPAGPTVVSIGGGKG 610 >>CCDS5152.2 MOXD1 gene_id:26002|Hs108|chr6 (613 aa) initn: 663 init1: 343 opt: 1053 Z-score: 1277.1 bits: 246.4 E(32554): 8.1e-65 Smith-Waterman score: 1069; 33.1% identity (59.7% similar) in 595 aa overlap (16-589:6-584) 10 20 30 40 50 pF1KE4 MREAAFMYSTAVAIFLVILVAALQGSAPRES--PLPYHIPLDPEGSLELSWNVSYTQEAI :..: . : :.: : :.. :: ::. :.:. .: : CCDS51 MCCWPLLLLWGLLPGTAAGGSGRTYPHRTLLDSEGKYWLGWSQRGSQ--I 10 20 30 40 60 70 80 90 100 110 pF1KE4 HFQLLVRRLKAG-VLFGMSDRGELENADLVVLWTDGDTAYFADAWSDQKGQIHLDPQQDY :.: :: :: : ::.: : . .::.:: . :. : ... . ... : :::: CCDS51 AFRLQVRT--AGYVGFGFSPTGAMASADIVVGGVAHGRPYLQDYFTNANRELKKDAQQDY 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE4 QLLQVQRTPEGLTLLFKRPFGTCDPKDYLIEDGTVHLVYGILEEPFRSLEA-INGSGLQM .: .... . : : . ::: .: : :.::..... .: . :: . . CCDS51 HLEYAMENSTHTIIEFTRELHTCDINDKSITDSTVRVIWAYHHED--AGEAGPKYHDSNR 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE4 GLQRVQLLKPNIPEPELPSDACTMEVQAPNIQIPSQETTYWCYIKELPKGFSRHHIIKYE : . ..::.:. : . ... .. ::...::::: . ..: .::.:: : CCDS51 GTKSLRLLNPE-KTSVLSTALPYFDLVNQDVPIPNKDTTYWCQMFKIPVFQEKHHVIKVE 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE4 PIVTKGNEALVHHMEVFQCAPEM-DSVPHFSGPCDSKMKPDRLNYCRHVLAAWALGAKAF :.. .:.:.::::. ..::. .. ::: . . : :: . :. :. :::.:...: CCDS51 PVIQRGHESLVHHILLYQCSNNFNDSVLESGHECYHPNMPDAFLTCETVIFAWAIGGEGF 230 240 250 260 270 280 300 310 320 330 340 350 pF1KE4 YYPEEAGLAFGGPGSSRYLRLEVHYHNPLVIEGRNDSSGIRLYYTAKLRRFNAGIMELGL :: ..::..: : . .:. ::::: :: :: :.::.::.:: .:...::..: :: CCDS51 SYPPHVGLSLGTPLDPHYVLLEVHYDNPTYEEGLIDNSGLRLFYTMDIRKYDAGVIEAGL 290 300 310 320 330 340 360 370 380 390 400 410 pF1KE4 VYTPVMAIPPRETAFILTGYCTDKCTQLALP---PSGIHIFASQLHTHLTGRKVVTVLVR . .::: : :.:: .: . :: :::::.:: ::.::.:: . : CCDS51 WVSLFHTIPPGMPEFQSEGHCTLECLEEALEAEKPSGIHVFAVLLHAHLAGRGIRLRHFR 350 360 370 380 390 400 420 430 440 450 460 470 pF1KE4 DGREWEIVNQDNHYSPHFQEIRMLKKVVSVHPGDVLITSCTYNTEDRELATVGGFGILEE :.: ... :. .. .:::...::. .. ::: ::: : :::.:: : ::.. : CCDS51 KGKEMKLLAYDDDFDFNFQEFQYLKEEQTILPGDNLITECRYNTKDRAEMTWGGLSTRSE 410 420 430 440 450 460 480 490 500 510 520 pF1KE4 MCVNYVHYYPQTQLELCKSAVDAGFLQKYFHLINRFNNED----VCTCPQASVSQQFTSV ::..:. :::. .: : : : ... . . . . :. . .: .. CCDS51 MCLSYLLYYPRINLTRCASIPDIMEQLQFIGVKEIYRPVTTWPFIIKSPKQYKNLSFMDA 470 480 490 500 510 520 530 540 550 560 570 pF1KE4 ----PWN-----SFNRDVLKALYSFAPISMHCNKSSAVRFQGEWNLQPLPKVISTLEEPT :. :::. ::. :....:.:.. ..::..: . . .:.: CCDS51 MNKFKWTKKEGLSFNKLVLS-----LPVNVRCSKTD----NAEWSIQGMTALPPDIERPY 530 540 550 560 570 580 590 600 pF1KE4 PQCPTSQGRSPAGPTVVSIGGGKG : : : CCDS51 KAEPLVCGTSSSSSLHRDFSINLLVCLLLLSCTLSTKSL 580 590 600 610 603 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:09:37 2016 done: Sun Nov 6 00:09:37 2016 Total Scan time: 3.050 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]