FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1890, 418 aa 1>>>pF1KE1890 418 - 418 aa - 418 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0426+/-0.000853; mu= 14.7346+/- 0.051 mean_var=100.7444+/-19.632, 0's: 0 Z-trim(109.8): 24 B-trim: 0 in 0/50 Lambda= 0.127780 statistics sampled from 11125 (11147) to 11125 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.706), E-opt: 0.2 (0.342), width: 16 Scan time: 2.990 The best scores are: opt bits E(32554) CCDS33211.1 SPRED2 gene_id:200734|Hs108|chr2 ( 418) 3006 564.5 6.6e-161 CCDS46308.1 SPRED2 gene_id:200734|Hs108|chr2 ( 415) 2950 554.2 8.5e-158 CCDS42560.1 SPRED3 gene_id:399473|Hs108|chr19 ( 410) 990 192.9 4.9e-49 CCDS32193.1 SPRED1 gene_id:161742|Hs108|chr15 ( 444) 937 183.1 4.6e-46 >>CCDS33211.1 SPRED2 gene_id:200734|Hs108|chr2 (418 aa) initn: 3006 init1: 3006 opt: 3006 Z-score: 3002.2 bits: 564.5 E(32554): 6.6e-161 Smith-Waterman score: 3006; 100.0% identity (100.0% similar) in 418 aa overlap (1-418:1-418) 10 20 30 40 50 60 pF1KE1 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE1 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE1 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE1 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE1 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD 310 320 330 340 350 360 370 380 390 400 410 pF1KE1 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA 370 380 390 400 410 >>CCDS46308.1 SPRED2 gene_id:200734|Hs108|chr2 (415 aa) initn: 2950 init1: 2950 opt: 2950 Z-score: 2946.5 bits: 554.2 E(32554): 8.5e-158 Smith-Waterman score: 2950; 99.5% identity (99.8% similar) in 412 aa overlap (7-418:4-415) 10 20 30 40 50 60 pF1KE1 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH : .::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 MASPGSDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIH 10 20 30 40 50 70 80 90 100 110 120 pF1KE1 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 GERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIE 60 70 80 90 100 110 130 140 150 160 170 180 pF1KE1 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 DLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTL 120 130 140 150 160 170 190 200 210 220 230 240 pF1KE1 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 GHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKY 180 190 200 210 220 230 250 260 270 280 290 300 pF1KE1 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 PDPSEDADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRR 240 250 260 270 280 290 310 320 330 340 350 360 pF1KE1 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 KEDGERSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGD 300 310 320 330 340 350 370 380 390 400 410 pF1KE1 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS46 YTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA 360 370 380 390 400 410 >>CCDS42560.1 SPRED3 gene_id:399473|Hs108|chr19 (410 aa) initn: 909 init1: 368 opt: 990 Z-score: 993.8 bits: 192.9 E(32554): 4.9e-49 Smith-Waterman score: 990; 39.4% identity (63.2% similar) in 419 aa overlap (13-418:1-409) 10 20 30 40 50 pF1KE1 MTEETHPDDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKV--MHPEGNGRSG-F .:::.::::.::::::::.: :::.:.:.::.: .:::..:.: . CCDS42 MVRVRAVVMARDDSSGGWLPVGGGGLSQVSVCRVRGARPEGGARQGHY 10 20 30 40 60 70 80 90 100 110 pF1KE1 LIHGERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRK .::::: .:. ..::: .. :::.:.:: ::::.. . ::::::::::.: :.... CCDS42 VIHGERLRDQKTTLECTLKPGLVYNKVNPIFHHWSLGDCKFGLTFQSPAEADEFQKSLLA 50 60 70 80 90 100 120 130 140 150 160 170 pF1KE1 AIEDLIEGSTTSSSTIHNEAELGDDDV----FTTATDSSSNSSQKR-EQPTRTISSPTSC :. : .:: : ::. . . : .:. .::.:.::..: : : . ..: CCDS42 ALAALGRGSLTPSSSSSSSSPSQDTAETPCPLTSHVDSDSSSSHSRQETPPSAAAAPIIT 110 120 130 140 150 160 180 190 200 210 220 230 pF1KE1 EHRRIYTLGHLHDSYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWM-TGYEDY . . : . : . : .: ...:. .: .. . : ::::: CCDS42 MES---ASGFGPTTPPQRRRSSAQSYPPLLPFTGIPEPSEPLAGAGGLG--WGGRGYEDY 170 180 190 200 210 220 240 250 260 270 280 290 pF1KE1 RHAPVRGKYPDPSEDADSSYVRFAK-GEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKT : :. : : .. ::::: : . . : . . . . : . CCDS42 R----RSGPPAPLA-LSTCVVRFAKTGALRGAALGPPAALPAPLTEAAPPAPPARPPPGP 230 240 250 260 270 300 310 320 330 340 pF1KE1 QPSRGKSRRRKEDGERSRCVYCRDMFNHE-ENRRGHCQDAPDSVRTCIRRVSCMWCADSM :: . .. : : .:::.:: .: .. ..: :.: .::: : .::.::.:::.:. CCDS42 GPSSAPAKASPEAEEAARCVHCRALFRRRADGRGGRCAEAPDPGRLLVRRLSCLWCAESL 280 290 300 310 320 330 350 360 370 380 390 400 pF1KE1 LYHCMSDPEGDYTDPCSCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRC ::::.:: :::..:::.:. . . :: :: :::. .::.::: :::::. .. : : CCDS42 LYHCLSDAEGDFSDPCACEPGHPRPAARWAALAALSLAVPCLCCYAPLRACHWVAARCGC 340 350 360 370 380 390 410 pF1KE1 --CGGKHKAAA :::.:. :: CCDS42 AGCGGRHEEAAR 400 410 >>CCDS32193.1 SPRED1 gene_id:161742|Hs108|chr15 (444 aa) initn: 1368 init1: 769 opt: 937 Z-score: 940.5 bits: 183.1 E(32554): 4.6e-46 Smith-Waterman score: 1484; 53.1% identity (70.5% similar) in 458 aa overlap (1-417:1-443) 10 20 30 40 50 pF1KE1 MTEETHP-DDDSYIVRVKAVVMTRDDSSGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLI :.::: :.:. .::.:::::::::::::.: :.:.: : : :: : : :: . :.: CCDS32 MSEETATSDNDNSYARVRAVVMTRDDSSGGWLPLGGSGLSSVTVFKVPHQEENGCADFFI 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 HGERQKDKLVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAI .::: .::.:::::...:::.:.:..:::::::.:..::::::::::::::::::.:.:: CCDS32 RGERLRDKMVVLECMLKKDLIYNKVTPTFHHWKIDDKKFGLTFQSPADARAFDRGIRRAI 70 80 90 100 110 120 120 130 140 150 160 pF1KE1 EDLIEGSTTSSSTIHNEAELGDDDVFTTATDSSSNSSQ----KRE-----QPTRTIS-SP ::. .: :. :::: : ::. .. ::::. . ..: .: :. . : CCDS32 EDISQGCPESK----NEAE-GADDLQANEEDSSSSLVKDHLFQQETVVTSEPYRSSNIRP 130 140 150 160 170 170 180 190 200 pF1KE1 TSCEH---RRIY--------TLGH--LHDSYPTDHYHLDQ--------------PMPRPY . : ::.: :.:. : . . .: : :. . CCDS32 SPFEDLNARRVYMQSQANQITFGQPGLDIQSRSMEYVQRQISKECGSLKSQNRVPL-KSI 180 190 200 210 220 230 210 220 230 240 250 260 pF1KE1 RQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKYPDPSEDADSSYVRFAKGEVPKH :.::: :.:: :::::::. : . : :::: : : .::::: ..:.: . : CCDS32 RHVSFQDEDE-IVRINPRD-ILIRRYADYRH-PDMWKNDLERDDADSS-IQFSKPDSKKS 240 250 260 270 280 290 270 280 290 300 310 pF1KE1 DYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSR---GKSRRRKEDGERSRCVYCRDMFNHE :: : : . .. .:: . :.::::: ::.::::::::::::::.. :::: CCDS32 DYLYSCGDETKLS---SPKD--SVVFKTQPSSLKIKKSKRRKEDGERSRCVYCQERFNHE 300 310 320 330 340 320 330 340 350 360 370 pF1KE1 ENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGDYTDPCSCDTSDEKFCLRWM :: ::.:::::: .. :: .:::: ::.::::::::: :::..:::::::::.::::::. CCDS32 ENVRGKCQDAPDPIKRCIYQVSCMLCAESMLYHCMSDSEGDFSDPCSCDTSDDKFCLRWL 350 360 370 380 390 400 380 390 400 410 pF1KE1 ALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA ::.::::..::::::.::: :..:: : ::::::::: CCDS32 ALVALSFIVPCMCCYVPLRMCHRCGEACGCCGGKHKAAG 410 420 430 440 418 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 12:07:46 2016 done: Sun Nov 6 12:07:47 2016 Total Scan time: 2.990 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]