FASTA searches a protein or DNA sequence data bank
36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE6665, 290 aa
1>>>pF1KE6665 290 - 290 aa - 290 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.8578+/-0.000918; mu= 8.2090+/- 0.055
mean_var=96.5806+/-19.042, 0's: 0 Z-trim(107.6): 13 B-trim: 0 in 0/51
Lambda= 0.130506
statistics sampled from 9662 (9668) to 9662 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.68), E-opt: 0.2 (0.297), width: 16
Scan time: 2.050

The best scores are:                                  opt  bits E(32554)
CCDS47877.1 GDAP1 gene_id:54332|Hs108|chr8    ( 290) 1902 368.2 4e-102
CCDS34911.1 GDAP1 gene_id:54332|Hs108|chr8    ( 358) 1902 368.2 4.8e-102
CCDS74727.1 GDAP1L1 gene_id:78997|Hs108|chr20 ( 278) 1022 202.5 2.9e-52
CCDS13328.1 GDAP1L1 gene_id:78997|Hs108|chr20 ( 367) 1022 202.5 3.7e-52
CCDS74725.1 GDAP1L1 gene_id:78997|Hs108|chr20 ( 386)  895 178.6 6.1e-45
CCDS74726.1 GDAP1L1 gene_id:78997|Hs108|chr20 ( 296)  415  88.2 7.8e-18

>>CCDS47877.1 GDAP1 gene_id:54332|Hs108|chr8 (290 aa)
initn: 1902 init1: 1902 opt: 1902 Z-score: 1946.9 bits: 368.2 E(32554): 4e-102
Smith-Waterman score: 1902; 100.0% identity (100.0% similar) in 290 aa overlap (1-290:1-290)

               10        20        30        40        50        60
pF1KE6 MRLNSTGEVPVLIHGENIICEATQIIDYLEQTFLDERTPRLMPDKESMYYPRVQHYRELL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MRLNSTGEVPVLIHGENIICEATQIIDYLEQTFLDERTPRLMPDKESMYYPRVQHYRELL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE6 DSLPMDAYTHGCILHPELTVDSMIPAYATTRIRSQIGNTESELKKLAEENPDLQEAYIAK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DSLPMDAYTHGCILHPELTVDSMIPAYATTRIRSQIGNTESELKKLAEENPDLQEAYIAK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE6 QKRLKSKLLDHDNVKYLKKILDELEKVLDQVETELQRRNEETPEEGQQPWLCGESFTLAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 QKRLKSKLLDHDNVKYLKKILDELEKVLDQVETELQRRNEETPEEGQQPWLCGESFTLAD
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE6 VSLAVTLHRLKFLGFARRNWGNGKRPNLETYYERVLKRKTFNKVLGHVNNILISAVLPTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VSLAVTLHRLKFLGFARRNWGNGKRPNLETYYERVLKRKTFNKVLGHVNNILISAVLPTA
              190       200       210       220       230       240

              250       260       270       280       290
pF1KE6 FRVAKKRAPKVLGTTLVVGLLAGVGYFAFMLFRKRLGSMILAFRPRPNYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 FRVAKKRAPKVLGTTLVVGLLAGVGYFAFMLFRKRLGSMILAFRPRPNYF
              250       260       270       280       290

>>CCDS34911.1 GDAP1 gene_id:54332|Hs108|chr8 (358 aa)
initn: 1902 init1: 1902 opt: 1902 Z-score: 1945.5 bits: 368.2 E(32554): 4.8e-102
Smith-Waterman score: 1902; 100.0% identity (100.0% similar) in 290 aa overlap (1-290:69-358)

                                             10        20        30
pF1KE6                               MRLNSTGEVPVLIHGENIICEATQIIDYLE
                                     ::::::::::::::::::::::::::::::
CCDS34 KVRLVIAEKALKCEEHDVSLPLSEHNEPWFMRLNSTGEVPVLIHGENIICEATQIIDYLE
       40        50        60        70        80        90

               40        50        60        70        80        90
pF1KE6 QTFLDERTPRLMPDKESMYYPRVQHYRELLDSLPMDAYTHGCILHPELTVDSMIPAYATT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 QTFLDERTPRLMPDKESMYYPRVQHYRELLDSLPMDAYTHGCILHPELTVDSMIPAYATT
      100       110       120       130       140       150

              100       110       120       130       140       150
pF1KE6 RIRSQIGNTESELKKLAEENPDLQEAYIAKQKRLKSKLLDHDNVKYLKKILDELEKVLDQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 RIRSQIGNTESELKKLAEENPDLQEAYIAKQKRLKSKLLDHDNVKYLKKILDELEKVLDQ
      160       170       180       190       200       210

              160       170       180       190       200       210
pF1KE6 VETELQRRNEETPEEGQQPWLCGESFTLADVSLAVTLHRLKFLGFARRNWGNGKRPNLET
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 VETELQRRNEETPEEGQQPWLCGESFTLADVSLAVTLHRLKFLGFARRNWGNGKRPNLET
      220       230       240       250       260       270

              220       230       240       250       260       270
pF1KE6 YYERVLKRKTFNKVLGHVNNILISAVLPTAFRVAKKRAPKVLGTTLVVGLLAGVGYFAFM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 YYERVLKRKTFNKVLGHVNNILISAVLPTAFRVAKKRAPKVLGTTLVVGLLAGVGYFAFM
      280       290       300       310       320       330

              280       290
pF1KE6 LFRKRLGSMILAFRPRPNYF
       ::::::::::::::::::::
CCDS34 LFRKRLGSMILAFRPRPNYF
      340       350

>>CCDS74727.1 GDAP1L1 gene_id:78997|Hs108|chr20 (278 aa)
initn: 1046 init1: 616 opt: 1022 Z-score: 1051.8 bits: 202.5 E(32554): 2.9e-52
Smith-Waterman score: 1022; 55.4% identity (83.1% similar) in 278 aa overlap (1-275:1-276)

10 20 30 40 50 60
pF1KE6 MRLNSTGEVPVLIHGENIICEATQIIDYLEQTFLDERTPRLMPDKESMYYPRVQHYRELL
:::: ::::.:: .::: . :::::.:.:: :.. :::. :. . :: .:::::
CCDS74 MRLNLGEEVPVIIHRDNIISDYDQIIDYVERTFTGEHVVALMPEVGSLQHARVLQYRELL
10 20 30 40 50 60

70 80 90 100 110
pF1KE6 DSLPMDAYTHGCILHPELTVDSMIPAYATTRIRSQIGNTESELKKLA-EENPDLQEAYIA
:.:::::::::::::::::.::::: :::..:: ...:. ..: :: ::.:.:.: :..
CCDS74 DALPMDAYTHGCILHPELTTDSMIPKYATAEIRRHLANATTDLMKLDHEEEPQLSEPYLS
70 80 90 100 110 120

120 130 140 150 160 170
pF1KE6 KQKRLKSKLLDHDNVKYLKKILDELEKVLDQVETELQRRNEETPEEGQQP--WLCGESFT
:::.: .:.:.::.:.:::::: :: ::::.:.::..:. :. :::. :::: .::
CCDS74 KQKKLMAKILEHDDVSYLKKILGELAMVLDQIEAELEKRKLEN--EGQKCELWLCGCAFT
130 140 150 160 170

180 190 200 210 220 230
pF1KE6 LADVSLAVTLHRLKFLGFARRNWGNGKRPNLETYYERVLKRKTFNKVLGHVNNILISAVL
:::: :..:::::::::.... : .:.::::....::: .: .: :::: ... :.:::.
CCDS74 LADVLLGATLHRLKFLGLSKKYWEDGSRPNLQSFFERVQRRFAFRKVLGDIHTTLLSAVI
180 190 200 210 220 230

240 250 260 270 280 290
pF1KE6 PTAFRVAKKRAPKVLGTTLVVGLLAGVGYFAFMLFRKRLGSMILAFRPRPNYF
:.:::..:.. :. .:.....: :.:.::::. ..:.
CCDS74 PNAFRLVKRKPPSFFGASFLMGSLGGMGYFAYWYLKKKYI
240 250 260 270

>>CCDS13328.1 GDAP1L1 gene_id:78997|Hs108|chr20 (367 aa)
initn: 1026 init1: 616 opt: 1022 Z-score: 1049.8 bits: 202.5 E(32554): 3.7e-52
Smith-Waterman score: 1022; 55.4% identity (83.1% similar) in 278 aa overlap (1-275:90-365)

10 20 30
pF1KE6 MRLNSTGEVPVLIHGENIICEATQIIDYLE
:::: ::::.:: .::: . :::::.:
CCDS13 KVRLVIAEKGLVCEERDVSLPQSEHKEPWFMRLNLGEEVPVIIHRDNIISDYDQIIDYVE
60 70 80 90 100 110

40 50 60 70 80 90
pF1KE6 QTFLDERTPRLMPDKESMYYPRVQHYRELLDSLPMDAYTHGCILHPELTVDSMIPAYATT
.:: :.. :::. :. . :: .::::::.:::::::::::::::::.::::: :::.
CCDS13 RTFTGEHVVALMPEVGSLQHARVLQYRELLDALPMDAYTHGCILHPELTTDSMIPKYATA
120 130 140 150 160 170

100 110 120 130 140
pF1KE6 RIRSQIGNTESELKKLA-EENPDLQEAYIAKQKRLKSKLLDHDNVKYLKKILDELEKVLD
.:: ...:. ..: :: ::.:.:.: :..:::.: .:.:.::.:.:::::: :: :::
CCDS13 EIRRHLANATTDLMKLDHEEEPQLSEPYLSKQKKLMAKILEHDDVSYLKKILGELAMVLD
180 190 200 210 220 230

150 160 170 180 190 200
pF1KE6 QVETELQRRNEETPEEGQQP--WLCGESFTLADVSLAVTLHRLKFLGFARRNWGNGKRPN
:.:.::..:. :. :::. :::: .:::::: :..:::::::::.... : .:.:::
CCDS13 QIEAELEKRKLEN--EGQKCELWLCGCAFTLADVLLGATLHRLKFLGLSKKYWEDGSRPN
240 250 260 270 280 290

210 220 230 240 250 260
pF1KE6 LETYYERVLKRKTFNKVLGHVNNILISAVLPTAFRVAKKRAPKVLGTTLVVGLLAGVGYF
:....::: .: .: :::: ... :.:::.:.:::..:.. :. .:.....: :.:.:::
CCDS13 LQSFFERVQRRFAFRKVLGDIHTTLLSAVIPNAFRLVKRKPPSFFGASFLMGSLGGMGYF
300 310 320 330 340 350

270 280 290
pF1KE6 AFMLFRKRLGSMILAFRPRPNYF
:. ..:.
CCDS13 AYWYLKKKYI
360

>>CCDS74725.1 GDAP1L1 gene_id:78997|Hs108|chr20 (386 aa)
initn: 1013 init1: 616 opt: 895 Z-score: 920.3 bits: 178.6 E(32554): 6.1e-45
Smith-Waterman score: 975; 52.2% identity (77.4% similar) in 297 aa overlap (1-275:90-384)

10 20 30
pF1KE6 MRLNSTGEVPVLIHGENIICEATQIIDYLE
:::: ::::.:: .::: . :::::.:
CCDS74 KVRLVIAEKGLVCEERDVSLPQSEHKEPWFMRLNLGEEVPVIIHRDNIISDYDQIIDYVE
60 70 80 90 100 110

40 50 60 70
pF1KE6 QTFLDE---RTPR----------------LMPDKESMYYPRVQHYRELLDSLPMDAYTHG
.:: : : :::. :. . :: .::::::.:::::::::
CCDS74 RTFTGGGRGRCPSGFPAQPLAVPTEHVVALMPEVGSLQHARVLQYRELLDALPMDAYTHG
120 130 140 150 160 170

80 90 100 110 120 130
pF1KE6 CILHPELTVDSMIPAYATTRIRSQIGNTESELKKLA-EENPDLQEAYIAKQKRLKSKLLD
::::::::.::::: :::..:: ...:. ..: :: ::.:.:.: :..:::.: .:.:.
CCDS74 CILHPELTTDSMIPKYATAEIRRHLANATTDLMKLDHEEEPQLSEPYLSKQKKLMAKILE
180 190 200 210 220 230

140 150 160 170 180
pF1KE6 HDNVKYLKKILDELEKVLDQVETELQRRNEETPEEGQQP--WLCGESFTLADVSLAVTLH
::.:.:::::: :: ::::.:.::..:. :. :::. :::: .:::::: :..:::
CCDS74 HDDVSYLKKILGELAMVLDQIEAELEKRKLEN--EGQKCELWLCGCAFTLADVLLGATLH
240 250 260 270 280 290

190 200 210 220 230 240
pF1KE6 RLKFLGFARRNWGNGKRPNLETYYERVLKRKTFNKVLGHVNNILISAVLPTAFRVAKKRA
::::::.... : .:.::::....::: .: .: :::: ... :.:::.:.:::..:..
CCDS74 RLKFLGLSKKYWEDGSRPNLQSFFERVQRRFAFRKVLGDIHTTLLSAVIPNAFRLVKRKP
300 310 320 330 340 350

250 260 270 280 290
pF1KE6 PKVLGTTLVVGLLAGVGYFAFMLFRKRLGSMILAFRPRPNYF
:. .:.....: :.:.::::. ..:.
CCDS74 PSFFGASFLMGSLGGMGYFAYWYLKKKYI
360 370 380

>>CCDS74726.1 GDAP1L1 gene_id:78997|Hs108|chr20 (296 aa)
initn: 799 init1: 413 opt: 415 Z-score: 433.7 bits: 88.2 E(32554): 7.8e-18
Smith-Waterman score: 664; 42.9% identity (62.5% similar) in 275 aa overlap (1-275:90-294)

10 20 30
pF1KE6 MRLNSTGEVPVLIHGENIICEATQIIDYLE
:::: ::::.:: .::: . :::::.:
CCDS74 KVRLVIAEKGLVCEERDVSLPQSEHKEPWFMRLNLGEEVPVIIHRDNIISDYDQIIDYVE
60 70 80 90 100 110

40 50 60 70 80 90
pF1KE6 QTFLDERTPRLMPDKESMYYPRVQHYRELLDSLPMDAYTHGCILHPELTVDSMIPAYATT
.:: :.. :::. :. . :: .::::::.:::::::::::::::::.::::: :::.
CCDS74 RTFTGEHVVALMPEVGSLQHARVLQYRELLDALPMDAYTHGCILHPELTTDSMIPKYATA
120 130 140 150 160 170

100 110 120 130 140 150
pF1KE6 RIRSQIGNTESELKKLAEENPDLQEAYIAKQKRLKSKLLDHDNVKYLKKILDELEKVLDQ
.:: : . ::
CCDS74 EIRRQ----KCEL-----------------------------------------------
180

160 170 180 190 200 210
pF1KE6 VETELQRRNEETPEEGQQPWLCGESFTLADVSLAVTLHRLKFLGFARRNWGNGKRPNLET
:::: .:::::: :..:::::::::.... : .:.::::..
CCDS74 -------------------WLCGCAFTLADVLLGATLHRLKFLGLSKKYWEDGSRPNLQS
190 200 210 220

220 230 240 250 260 270
pF1KE6 YYERVLKRKTFNKVLGHVNNILISAVLPTAFRVAKKRAPKVLGTTLVVGLLAGVGYFAFM
..::: .: .: :::: ... :.:::.:.:::..:.. :. .:.....: :.:.::::.
CCDS74 FFERVQRRFAFRKVLGDIHTTLLSAVIPNAFRLVKRKPPSFFGASFLMGSLGGMGYFAYW
230 240 250 260 270 280

280 290
pF1KE6 LFRKRLGSMILAFRPRPNYF
..:.
CCDS74 YLKKKYI
290

290 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Tue Nov 8 15:14:28 2016 done: Tue Nov 8 15:14:29 2016
Total Scan time: 2.050 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]