FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5363, 299 aa 1>>>pF1KE5363 299 - 299 aa - 299 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.6037+/-0.000728; mu= 11.2674+/- 0.044 mean_var=139.0076+/-26.718, 0's: 0 Z-trim(115.2): 26 B-trim: 52 in 1/49 Lambda= 0.108781 statistics sampled from 15717 (15739) to 15717 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.808), E-opt: 0.2 (0.483), width: 16 Scan time: 2.910 The best scores are: opt bits E(32554) CCDS47296.1 SPRY4 gene_id:81848|Hs108|chr5 ( 299) 2170 351.2 5.5e-97 CCDS4274.1 SPRY4 gene_id:81848|Hs108|chr5 ( 322) 2170 351.2 5.8e-97 CCDS9463.1 SPRY2 gene_id:10253|Hs108|chr13 ( 315) 830 140.9 1.2e-33 CCDS3731.1 SPRY1 gene_id:10252|Hs108|chr4 ( 319) 825 140.2 2e-33 CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrX ( 288) 769 131.3 8.2e-31 CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrY ( 288) 769 131.3 8.2e-31 >>CCDS47296.1 SPRY4 gene_id:81848|Hs108|chr5 (299 aa) initn: 2170 init1: 2170 opt: 2170 Z-score: 1854.7 bits: 351.2 E(32554): 5.5e-97 Smith-Waterman score: 2170; 100.0% identity (100.0% similar) in 299 aa overlap (1-299:1-299) 10 20 30 40 50 60 pF1KE5 MEPPIPQSAPLTPNSVMVQPLLDSRMSHSRLQHPLTILPIDQVKTSHVENDYIDNPSLAL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MEPPIPQSAPLTPNSVMVQPLLDSRMSHSRLQHPLTILPIDQVKTSHVENDYIDNPSLAL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 TTGPKRTRGGAPELAPTPARCDQDVTHHWISFSGRPSSVSSSSSTSSDQRLLDHMAPPPV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 TTGPKRTRGGAPELAPTPARCDQDVTHHWISFSGRPSSVSSSSSTSSDQRLLDHMAPPPV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ADQASPRAVRIQPKVVHCQPLDLKGPAVPPELDKHFLLCEACGKCKCKECASPRTLPSCW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ADQASPRAVRIQPKVVHCQPLDLKGPAVPPELDKHFLLCEACGKCKCKECASPRTLPSCW 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 VCNQECLCSAQTLVNYGTCMCLVQGIFYHCTNEDDEGSCADHPCSCSRSNCCARWSFMGA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 VCNQECLCSAQTLVNYGTCMCLVQGIFYHCTNEDDEGSCADHPCSCSRSNCCARWSFMGA 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 LSVVLPCLLCYLPATGCVKLAQRGYDRLRRPGCRCKHTNSVICKAASGDAKTSRPDKPF ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LSVVLPCLLCYLPATGCVKLAQRGYDRLRRPGCRCKHTNSVICKAASGDAKTSRPDKPF 250 260 270 280 290 >>CCDS4274.1 SPRY4 gene_id:81848|Hs108|chr5 (322 aa) initn: 2170 init1: 2170 opt: 2170 Z-score: 1854.3 bits: 351.2 E(32554): 5.8e-97 Smith-Waterman score: 2170; 100.0% identity (100.0% similar) in 299 aa overlap (1-299:24-322) 10 20 30 pF1KE5 MEPPIPQSAPLTPNSVMVQPLLDSRMSHSRLQHPLTI ::::::::::::::::::::::::::::::::::::: CCDS42 MLSPLPTGPLEACFSVQSRTSSPMEPPIPQSAPLTPNSVMVQPLLDSRMSHSRLQHPLTI 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 LPIDQVKTSHVENDYIDNPSLALTTGPKRTRGGAPELAPTPARCDQDVTHHWISFSGRPS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LPIDQVKTSHVENDYIDNPSLALTTGPKRTRGGAPELAPTPARCDQDVTHHWISFSGRPS 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 SVSSSSSTSSDQRLLDHMAPPPVADQASPRAVRIQPKVVHCQPLDLKGPAVPPELDKHFL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SVSSSSSTSSDQRLLDHMAPPPVADQASPRAVRIQPKVVHCQPLDLKGPAVPPELDKHFL 130 140 150 160 170 180 160 170 180 190 200 210 pF1KE5 LCEACGKCKCKECASPRTLPSCWVCNQECLCSAQTLVNYGTCMCLVQGIFYHCTNEDDEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 LCEACGKCKCKECASPRTLPSCWVCNQECLCSAQTLVNYGTCMCLVQGIFYHCTNEDDEG 190 200 210 220 230 240 220 230 240 250 260 270 pF1KE5 SCADHPCSCSRSNCCARWSFMGALSVVLPCLLCYLPATGCVKLAQRGYDRLRRPGCRCKH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SCADHPCSCSRSNCCARWSFMGALSVVLPCLLCYLPATGCVKLAQRGYDRLRRPGCRCKH 250 260 270 280 290 300 280 290 pF1KE5 TNSVICKAASGDAKTSRPDKPF :::::::::::::::::::::: CCDS42 TNSVICKAASGDAKTSRPDKPF 310 320 >>CCDS9463.1 SPRY2 gene_id:10253|Hs108|chr13 (315 aa) initn: 838 init1: 401 opt: 830 Z-score: 717.9 bits: 140.9 E(32554): 1.2e-33 Smith-Waterman score: 854; 45.5% identity (68.1% similar) in 279 aa overlap (31-287:34-305) 10 20 30 40 50 pF1KE5 MEPPIPQSAPLTPNSVMVQPLLDSRMSHSRLQHPLTILPIDQVKTSHVENDYIDNPSLA- : . . .: .::... . :.: ..:... CCDS94 RAQSGNGSQPLLQTPRDGGRQRGEPDPRDALTQQVHVLSLDQIRAIRNTNEYTEGPTVVP 10 20 30 40 50 60 60 70 80 90 pF1KE5 ---LTTGP------KRTR-GGAPELAPTP----------ARCDQDVTHHWISFSGRPSSV : .: :. : : :: : :: . . .: ..: :. CCDS94 RPGLKPAPRPSTQHKHERLHGLPEHRQPPRLQHSQVHSSARAPLSRSISTVSSGSRSSTR 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 SSSSSTSSDQRLL-DHMAPPPVADQASPRAVRIQPKVVHCQPLDLKGPAVPPELDKHFLL .:.::.::.:::: . .. :::: .:.::: . .: .:: : .: : CCDS94 TSTSSSSSEQRLLGSSFSSGPVADGI----IRVQPKS-ELKPGELK-PLSKEDLGLHAYR 130 140 150 160 170 160 170 180 190 200 210 pF1KE5 CEACGKCKCKECASPRTLPSCWVCNQECLCSAQTLVNYGTCMCLVQGIFYHCTNEDDEGS :: :::::::::. :: ::: :.:...::::::....::::.: :.:.::::.: ::: . CCDS94 CEDCGKCKCKECTYPRPLPSDWICDKQCLCSAQNVIDYGTCVCCVKGLFYHCSN-DDEDN 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE5 CADHPCSCSRSNCCARWSFMGALSVVLPCLLCYLPATGCVKLAQRGYDRLRRPGCRCKHT :::.:::::.:.::.::: ::..:. :::: ::::: ::.:: : :::. :::::::.. CCDS94 CADNPCSCSQSHCCTRWSAMGVMSLFLPCLWCYLPAKGCLKLCQGCYDRVNRPGCRCKNS 240 250 260 270 280 290 280 290 pF1KE5 NSVICKAASGDAKTSRPDKPF :.: ::. . CCDS94 NTVCCKVPTVPPRNFEKPT 300 310 >>CCDS3731.1 SPRY1 gene_id:10252|Hs108|chr4 (319 aa) initn: 889 init1: 726 opt: 825 Z-score: 713.6 bits: 140.2 E(32554): 2e-33 Smith-Waterman score: 894; 44.7% identity (68.8% similar) in 311 aa overlap (6-287:3-309) 10 20 30 40 50 pF1KE5 MEPPIPQSAPLTPNSVMV--QPLLDSR--MSHSRLQHPLTILPIDQVKTSHVENDYIDNP ::. . .:..: :: :::: ... : .: .:: .::.:. . :.: ..: CCDS37 MDPQNQHGSGSSLVVIQQPSLDSRQRLDYEREIQPTAILSLDQIKAIRGSNEYTEGP 10 20 30 40 50 60 70 80 90 pF1KE5 SL----ALTTGPKRTRGG-APELAPTPARCDQDVTH-----HWI-------------SFS :. : :.:.. . . :. : . . . : : . . . CCDS37 SVVKRPAPRTAPRQEKHERTHEIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRSTST 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE5 GRPSSVSSSSSTSSDQRLLDHMAPP--PVADQASPRAVRIQPKVVHCQPLDLKGPAVPPE : .: .:.::.::.: :: . .:: :: . : ::.: ::: . . :::: .. . CCDS37 GSAASSGSNSSASSEQGLLGR-SPPTRPVPGHRSERAIRTQPKQLIVD--DLKG-SLKED 120 130 140 150 160 170 160 170 180 190 200 210 pF1KE5 LDKHFLLCEACGKCKCKECASPRTLPSCWVCNQECLCSAQTLVNYGTCMCLVQGIFYHCT : .: ..:: :::::: ::..::::::: .::..:::::...:.::::::::.::::::. CCDS37 LTQHKFICEQCGKCKCGECTAPRTLPSCLACNRQCLCSAESMVEYGTCMCLVKGIFYHCS 180 190 200 210 220 230 220 230 240 250 260 270 pF1KE5 NEDDEGSCADHPCSCSRSNCCARWSFMGALSVVLPCLLCYLPATGCVKLAQRGYDRLRRP :.:. : .:.:::::.:.::.:. :::.:. ::::::: :: ::.:: .: :: ..:: CCDS37 NDDEGDSYSDNPCSCSQSHCCSRYLCMGAMSLFLPCLLCYPPAKGCLKLCRRCYDWIHRP 240 250 260 270 280 290 280 290 pF1KE5 GCRCKHTNSVICKAASGDAKTSRPDKPF :::::..:.: :: : CCDS37 GCRCKNSNTVYCKLESCPSRGQGKPS 300 310 >>CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrX (288 aa) initn: 796 init1: 379 opt: 769 Z-score: 666.6 bits: 131.3 E(32554): 8.2e-31 Smith-Waterman score: 819; 46.2% identity (70.5% similar) in 275 aa overlap (37-290:12-278) 10 20 30 40 50 60 pF1KE5 QSAPLTPNSVMVQPLLDSRMSHSRLQHPLTILPIDQVKTSHVENDYIDNP----SLALTT ::::.:....:. :::.. : . ::.. CCDS14 MDAAVTDDFQQILPIEQLRSTHASNDYVERPPAPCKQALSS 10 20 30 40 70 80 90 100 110 pF1KE5 GPK---RTRGGAPELAPTP-------ARCDQ--DVTHHWISFSGRPSSVSSSSSTSSDQR :. .:. . :: : ..: : . .: .: :. ::.: :. :.:::: CCDS14 -PSLIVQTHKSDWSLATMPTSLPRSLSQCHQLQPLPQH-LSQSSIASSMSHST-TASDQR 50 60 70 80 90 120 130 140 150 160 pF1KE5 LLDHMAPPPVADQASPRAVRIQPKV-VHCQPLD-LKGPAVPP--ELDKHFLLCEACGKCK :: ..: : .. .: :: . :: . ::: : . ..:...:: ::.:: CCDS14 LLASITPSP----SGQSIIRTQPGAGVHPKADGALKGEAEQSAGHPSEHLFICEECGRCK 100 110 120 130 140 150 170 180 190 200 210 220 pF1KE5 CKECASPRTLPSCWVCNQECLCSAQTLVNYGTCMCLVQGIFYHCTNEDDEGSCADHPCSC : :.. : :::::.:::.:::::..:..::::.: :.:.::::.. ::: .:::.:::: CCDS14 CVPCTAARPLPSCWLCNQRCLCSAESLLDYGTCLCCVKGLFYHCST-DDEDNCADEPCSC 160 170 180 190 200 210 230 240 250 260 270 280 pF1KE5 SRSNCCARWSFMGALSVVLPCLLCYLPATGCVKLAQRGYDRLRRPGCRCK-HTNSVICKA . :.: .::. :. .:. :::: ::::. ::..: :.::: ::::::::: :::.: : CCDS14 GPSSCFVRWAAMSLISLFLPCLCCYLPTRGCLHLCQQGYDSLRRPGCRCKRHTNTVCRKI 220 230 240 250 260 270 290 pF1KE5 ASGDAKTSRPDKPF .::.: CCDS14 SSGSAPFPKAQEKSV 280 >>CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrY (288 aa) initn: 796 init1: 379 opt: 769 Z-score: 666.6 bits: 131.3 E(32554): 8.2e-31 Smith-Waterman score: 819; 46.2% identity (70.5% similar) in 275 aa overlap (37-290:12-278) 10 20 30 40 50 60 pF1KE5 QSAPLTPNSVMVQPLLDSRMSHSRLQHPLTILPIDQVKTSHVENDYIDNP----SLALTT ::::.:....:. :::.. : . ::.. CCDS14 MDAAVTDDFQQILPIEQLRSTHASNDYVERPPAPCKQALSS 10 20 30 40 70 80 90 100 110 pF1KE5 GPK---RTRGGAPELAPTP-------ARCDQ--DVTHHWISFSGRPSSVSSSSSTSSDQR :. .:. . :: : ..: : . .: .: :. ::.: :. :.:::: CCDS14 -PSLIVQTHKSDWSLATMPTSLPRSLSQCHQLQPLPQH-LSQSSIASSMSHST-TASDQR 50 60 70 80 90 120 130 140 150 160 pF1KE5 LLDHMAPPPVADQASPRAVRIQPKV-VHCQPLD-LKGPAVPP--ELDKHFLLCEACGKCK :: ..: : .. .: :: . :: . ::: : . ..:...:: ::.:: CCDS14 LLASITPSP----SGQSIIRTQPGAGVHPKADGALKGEAEQSAGHPSEHLFICEECGRCK 100 110 120 130 140 150 170 180 190 200 210 220 pF1KE5 CKECASPRTLPSCWVCNQECLCSAQTLVNYGTCMCLVQGIFYHCTNEDDEGSCADHPCSC : :.. : :::::.:::.:::::..:..::::.: :.:.::::.. ::: .:::.:::: CCDS14 CVPCTAARPLPSCWLCNQRCLCSAESLLDYGTCLCCVKGLFYHCST-DDEDNCADEPCSC 160 170 180 190 200 210 230 240 250 260 270 280 pF1KE5 SRSNCCARWSFMGALSVVLPCLLCYLPATGCVKLAQRGYDRLRRPGCRCK-HTNSVICKA . :.: .::. :. .:. :::: ::::. ::..: :.::: ::::::::: :::.: : CCDS14 GPSSCFVRWAAMSLISLFLPCLCCYLPTRGCLHLCQQGYDSLRRPGCRCKRHTNTVCRKI 220 230 240 250 260 270 290 pF1KE5 ASGDAKTSRPDKPF .::.: CCDS14 SSGSAPFPKAQEKSV 280 299 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 00:08:03 2016 done: Tue Nov 8 00:08:03 2016 Total Scan time: 2.910 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]