FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2207, 319 aa 1>>>pF1KE2207 319 - 319 aa - 319 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.9297+/-0.000782; mu= 10.4375+/- 0.048 mean_var=146.6145+/-28.344, 0's: 0 Z-trim(114.2): 39 B-trim: 44 in 1/52 Lambda= 0.105922 statistics sampled from 14749 (14788) to 14749 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.79), E-opt: 0.2 (0.454), width: 16 Scan time: 2.650 The best scores are: opt bits E(32554) CCDS3731.1 SPRY1 gene_id:10252|Hs108|chr4 ( 319) 2282 359.6 1.8e-99 CCDS9463.1 SPRY2 gene_id:10253|Hs108|chr13 ( 315) 1131 183.7 1.6e-46 CCDS47296.1 SPRY4 gene_id:81848|Hs108|chr5 ( 299) 825 137.0 1.8e-32 CCDS4274.1 SPRY4 gene_id:81848|Hs108|chr5 ( 322) 825 137.0 1.9e-32 CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrY ( 288) 739 123.8 1.6e-28 CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrX ( 288) 739 123.8 1.6e-28 >>CCDS3731.1 SPRY1 gene_id:10252|Hs108|chr4 (319 aa) initn: 2282 init1: 2282 opt: 2282 Z-score: 1899.2 bits: 359.6 E(32554): 1.8e-99 Smith-Waterman score: 2282; 100.0% identity (100.0% similar) in 319 aa overlap (1-319:1-319) 10 20 30 40 50 60 pF1KE2 MDPQNQHGSGSSLVVIQQPSLDSRQRLDYEREIQPTAILSLDQIKAIRGSNEYTEGPSVV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 MDPQNQHGSGSSLVVIQQPSLDSRQRLDYEREIQPTAILSLDQIKAIRGSNEYTEGPSVV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 KRPAPRTAPRQEKHERTHEIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRSTSTGSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 KRPAPRTAPRQEKHERTHEIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRSTSTGSA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE2 ASSGSNSSASSEQGLLGRSPPTRPVPGHRSERAIRTQPKQLIVDDLKGSLKEDLTQHKFI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 ASSGSNSSASSEQGLLGRSPPTRPVPGHRSERAIRTQPKQLIVDDLKGSLKEDLTQHKFI 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE2 CEQCGKCKCGECTAPRTLPSCLACNRQCLCSAESMVEYGTCMCLVKGIFYHCSNDDEGDS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 CEQCGKCKCGECTAPRTLPSCLACNRQCLCSAESMVEYGTCMCLVKGIFYHCSNDDEGDS 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE2 YSDNPCSCSQSHCCSRYLCMGAMSLFLPCLLCYPPAKGCLKLCRRCYDWIHRPGCRCKNS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 YSDNPCSCSQSHCCSRYLCMGAMSLFLPCLLCYPPAKGCLKLCRRCYDWIHRPGCRCKNS 250 260 270 280 290 300 310 pF1KE2 NTVYCKLESCPSRGQGKPS ::::::::::::::::::: CCDS37 NTVYCKLESCPSRGQGKPS 310 >>CCDS9463.1 SPRY2 gene_id:10253|Hs108|chr13 (315 aa) initn: 1013 init1: 461 opt: 1131 Z-score: 948.7 bits: 183.7 E(32554): 1.6e-46 Smith-Waterman score: 1152; 55.2% identity (73.0% similar) in 330 aa overlap (1-319:1-315) 10 20 30 40 50 pF1KE2 MDPQNQHGSGSSLVVIQQPSLDSRQRLDYE-REI--QPTAILSLDQIKAIRGSNEYTEGP :. . : :.::. ..: : .::: . . :. : . .::::::.:::..::::::: CCDS94 MEARAQSGNGSQ-PLLQTPRDGGRQRGEPDPRDALTQQVHVLSLDQIRAIRNTNEYTEGP 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 SVVKRPAPRTAPR---QEKHERTHEIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRS .:: ::. . ::: :.:::: : . : :::. .: :. . :.::.: :::: CCDS94 TVVPRPGLKPAPRPSTQHKHERLHGL-P-------EHRQPPRLQHSQVHSSARAP-LSRS 60 70 80 90 100 110 120 130 140 150 160 pF1KE2 TST---GSAASS-GSNSSASSEQGLLGRSPPTRPVPGHRSERAIRTQPK-QLIVDDLKGS :: :: .:. :.::.:::: ::: : . :: .. ::.::: .: .:: CCDS94 ISTVSSGSRSSTRTSTSSSSSEQRLLGSSFSSGPV----ADGIIRVQPKSELKPGELKPL 120 130 140 150 160 170 180 190 200 210 220 pF1KE2 LKEDLTQHKFICEQCGKCKCGECTAPRTLPSCLACNRQCLCSAESMVEYGTCMCLVKGIF :::: : . ::.:::::: ::: :: ::: :..::::::.....::::.: :::.: CCDS94 SKEDLGLHAYRCEDCGKCKCKECTYPRPLPSDWICDKQCLCSAQNVIDYGTCVCCVKGLF 170 180 190 200 210 220 230 240 250 260 270 280 pF1KE2 YHCSNDDEGDSYSDNPCSCSQSHCCSRYLCMGAMSLFLPCLLCYPPAKGCLKLCRRCYDW :::::::: :. .::::::::::::.:. ::.:::::::: :: :::::::::. ::: CCDS94 YHCSNDDE-DNCADNPCSCSQSHCCTRWSAMGVMSLFLPCLWCYLPAKGCLKLCQGCYDR 230 240 250 260 270 280 290 300 310 pF1KE2 IHRPGCRCKNSNTVYCKLESCPSRGQGKPS ..:::::::::::: ::. . : :. ::. CCDS94 VNRPGCRCKNSNTVCCKVPTVPPRNFEKPT 290 300 310 >>CCDS47296.1 SPRY4 gene_id:81848|Hs108|chr5 (299 aa) initn: 889 init1: 726 opt: 825 Z-score: 696.3 bits: 137.0 E(32554): 1.8e-32 Smith-Waterman score: 894; 44.7% identity (68.8% similar) in 311 aa overlap (3-309:6-287) 10 20 30 40 50 pF1KE2 MDPQNQHGSGSSLVVIQQPSLDSRQRLDYEREIQPTAILSLDQIKAIRGSNEYTEGP ::. . .:..: :: ::: :... : .: .:: .::.:. . :.: ..: CCDS47 MEPPIPQSAPLTPNSVMV--QPLLDS--RMSHSRLQHPLTILPIDQVKTSHVENDYIDNP 10 20 30 40 50 60 70 80 90 100 110 pF1KE2 SVVKRPAPRTAPRQEKHERTHEIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRSTST :. : :.:.. . . :. : . . . : : . . . CCDS47 SL----ALTTGPKRTRGG-APELAPTPARCDQDVTH-----HWI-------------SFS 60 70 80 90 120 130 140 150 160 170 pF1KE2 GSAASSGSNSSASSEQGLLGR-SPPTRPVPGHRSERAIRTQPKQLIVD--DLKG-SLKED : .: .:.::.::.: :: . .:: :: . : ::.: ::: . . :::: .. . CCDS47 GRPSSVSSSSSTSSDQRLLDHMAPP--PVADQASPRAVRIQPKVVHCQPLDLKGPAVPPE 100 110 120 130 140 150 180 190 200 210 220 230 pF1KE2 LTQHKFICEQCGKCKCGECTAPRTLPSCLACNRQCLCSAESMVEYGTCMCLVKGIFYHCS : .: ..:: :::::: ::..::::::: .::..:::::...:.::::::::.::::::. CCDS47 LDKHFLLCEACGKCKCKECASPRTLPSCWVCNQECLCSAQTLVNYGTCMCLVQGIFYHCT 160 170 180 190 200 210 240 250 260 270 280 290 pF1KE2 NDDEGDSYSDNPCSCSQSHCCSRYLCMGAMSLFLPCLLCYPPAKGCLKLCRRCYDWIHRP :.:. : .:.:::::.:.::.:. :::.:. ::::::: :: ::.:: .: :: ..:: CCDS47 NEDDEGSCADHPCSCSRSNCCARWSFMGALSVVLPCLLCYLPATGCVKLAQRGYDRLRRP 220 230 240 250 260 270 300 310 pF1KE2 GCRCKNSNTVYCKLESCPSRGQGKPS :::::..:.: :: : CCDS47 GCRCKHTNSVICKAASGDAKTSRPDKPF 280 290 >>CCDS4274.1 SPRY4 gene_id:81848|Hs108|chr5 (322 aa) initn: 889 init1: 726 opt: 825 Z-score: 695.8 bits: 137.0 E(32554): 1.9e-32 Smith-Waterman score: 894; 44.7% identity (68.8% similar) in 311 aa overlap (3-309:29-310) 10 20 30 pF1KE2 MDPQNQHGSGSSLVVIQQPSLDSRQRLDYEREIQ ::. . .:..: :: ::: :... : . CCDS42 MLSPLPTGPLEACFSVQSRTSSPMEPPIPQSAPLTPNSVMV--QPLLDS--RMSHSRLQH 10 20 30 40 50 40 50 60 70 80 90 pF1KE2 PTAILSLDQIKAIRGSNEYTEGPSVVKRPAPRTAPRQEKHERTHEIIPINVNNNYEHRHT : .:: .::.:. . :.: ..::. : :.:.. . . :. : . . . : CCDS42 PLTILPIDQVKTSHVENDYIDNPSL----ALTTGPKRTRGG-APELAPTPARCDQDVTH- 60 70 80 90 100 110 100 110 120 130 140 150 pF1KE2 SHLGHAVLPSNARGPILSRSTSTGSAASSGSNSSASSEQGLLGR-SPPTRPVPGHRSERA : . . .: .: .:.::.::.: :: . .:: :: . : :: CCDS42 ----HWI-------------SFSGRPSSVSSSSSTSSDQRLLDHMAPP--PVADQASPRA 120 130 140 150 160 170 180 190 200 210 pF1KE2 IRTQPKQLIVD--DLKG-SLKEDLTQHKFICEQCGKCKCGECTAPRTLPSCLACNRQCLC .: ::: . . :::: .. .: .: ..:: :::::: ::..::::::: .::..::: CCDS42 VRIQPKVVHCQPLDLKGPAVPPELDKHFLLCEACGKCKCKECASPRTLPSCWVCNQECLC 160 170 180 190 200 210 220 230 240 250 260 270 pF1KE2 SAESMVEYGTCMCLVKGIFYHCSNDDEGDSYSDNPCSCSQSHCCSRYLCMGAMSLFLPCL ::...:.::::::::.::::::.:.:. : .:.:::::.:.::.:. :::.:. :::: CCDS42 SAQTLVNYGTCMCLVQGIFYHCTNEDDEGSCADHPCSCSRSNCCARWSFMGALSVVLPCL 220 230 240 250 260 270 280 290 300 310 pF1KE2 LCYPPAKGCLKLCRRCYDWIHRPGCRCKNSNTVYCKLESCPSRGQGKPS ::: :: ::.:: .: :: ..:::::::..:.: :: : CCDS42 LCYLPATGCVKLAQRGYDRLRRPGCRCKHTNSVICKAASGDAKTSRPDKPF 280 290 300 310 320 >>CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrY (288 aa) initn: 665 init1: 369 opt: 739 Z-score: 625.4 bits: 123.8 E(32554): 1.6e-28 Smith-Waterman score: 746; 42.3% identity (64.5% similar) in 279 aa overlap (38-306:12-271) 10 20 30 40 50 60 pF1KE2 GSGSSLVVIQQPSLDSRQRLDYEREIQPTAILSLDQIKAIRGSNEYTEGPSVVKRPA--- :: ..:... ..::.:.: : . . : CCDS14 MDAAVTDDFQQILPIEQLRSTHASNDYVERPPAPCKQALSS 10 20 30 40 70 80 90 100 110 120 pF1KE2 PRTAPRQEKHERTHEIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRSTSTGSAASSG : . .: . . .: .. : :. : . : : . : .: ::: CCDS14 PSLIVQTHKSDWSLATMPTSLP-----RSLSQC-HQLQP-------LPQHLSQSSIASSM 50 60 70 80 130 140 150 160 170 pF1KE2 SNSSASSEQGLLGRSPPTRPVPGHRSERAIRTQPKQLIVDDLKGSLKEDLTQ-------H :.:...:.: ::. :. :. .: ::::: . :.:: . : : CCDS14 SHSTTASDQRLLASITPS---PSGQS--IIRTQPGAGVHPKADGALKGEAEQSAGHPSEH 90 100 110 120 130 140 180 190 200 210 220 230 pF1KE2 KFICEQCGKCKCGECTAPRTLPSCLACNRQCLCSAESMVEYGTCMCLVKGIFYHCSNDDE ::::.::.::: ::: : :::: ::..:::::::...::::.: :::.:::::.::: CCDS14 LFICEECGRCKCVPCTAARPLPSCWLCNQRCLCSAESLLDYGTCLCCVKGLFYHCSTDDE 150 160 170 180 190 200 240 250 260 270 280 290 pF1KE2 GDSYSDNPCSCSQSHCCSRYLCMGAMSLFLPCLLCYPPAKGCLKLCRRCYDWIHRPGCRC :. .:.::::. : : :. :. .::::::: :: :..:::.::.. :: ..:::::: CCDS14 -DNCADEPCSCGPSSCFVRWAAMSLISLFLPCLCCYLPTRGCLHLCQQGYDSLRRPGCRC 210 220 230 240 250 260 300 310 pF1KE2 KNSNTVYCKLESCPSRGQGKPS : ... :. CCDS14 KRHTNTVCRKISSGSAPFPKAQEKSV 270 280 >>CCDS14769.4 SPRY3 gene_id:10251|Hs108|chrX (288 aa) initn: 665 init1: 369 opt: 739 Z-score: 625.4 bits: 123.8 E(32554): 1.6e-28 Smith-Waterman score: 746; 42.3% identity (64.5% similar) in 279 aa overlap (38-306:12-271) 10 20 30 40 50 60 pF1KE2 GSGSSLVVIQQPSLDSRQRLDYEREIQPTAILSLDQIKAIRGSNEYTEGPSVVKRPA--- :: ..:... ..::.:.: : . . : CCDS14 MDAAVTDDFQQILPIEQLRSTHASNDYVERPPAPCKQALSS 10 20 30 40 70 80 90 100 110 120 pF1KE2 PRTAPRQEKHERTHEIIPINVNNNYEHRHTSHLGHAVLPSNARGPILSRSTSTGSAASSG : . .: . . .: .. : :. : . : : . : .: ::: CCDS14 PSLIVQTHKSDWSLATMPTSLP-----RSLSQC-HQLQP-------LPQHLSQSSIASSM 50 60 70 80 130 140 150 160 170 pF1KE2 SNSSASSEQGLLGRSPPTRPVPGHRSERAIRTQPKQLIVDDLKGSLKEDLTQ-------H :.:...:.: ::. :. :. .: ::::: . :.:: . : : CCDS14 SHSTTASDQRLLASITPS---PSGQS--IIRTQPGAGVHPKADGALKGEAEQSAGHPSEH 90 100 110 120 130 140 180 190 200 210 220 230 pF1KE2 KFICEQCGKCKCGECTAPRTLPSCLACNRQCLCSAESMVEYGTCMCLVKGIFYHCSNDDE ::::.::.::: ::: : :::: ::..:::::::...::::.: :::.:::::.::: CCDS14 LFICEECGRCKCVPCTAARPLPSCWLCNQRCLCSAESLLDYGTCLCCVKGLFYHCSTDDE 150 160 170 180 190 200 240 250 260 270 280 290 pF1KE2 GDSYSDNPCSCSQSHCCSRYLCMGAMSLFLPCLLCYPPAKGCLKLCRRCYDWIHRPGCRC :. .:.::::. : : :. :. .::::::: :: :..:::.::.. :: ..:::::: CCDS14 -DNCADEPCSCGPSSCFVRWAAMSLISLFLPCLCCYLPTRGCLHLCQQGYDSLRRPGCRC 210 220 230 240 250 260 300 310 pF1KE2 KNSNTVYCKLESCPSRGQGKPS : ... :. CCDS14 KRHTNTVCRKISSGSAPFPKAQEKSV 270 280 319 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 10:00:56 2016 done: Sun Nov 6 10:00:57 2016 Total Scan time: 2.650 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]