FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE1330, 132 aa 1>>>pF1KE1330 132 - 132 aa - 132 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1803+/-0.000753; mu= 11.5908+/- 0.045 mean_var=67.7341+/-14.054, 0's: 0 Z-trim(108.8): 94 B-trim: 358 in 1/50 Lambda= 0.155837 statistics sampled from 10339 (10450) to 10339 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.725), E-opt: 0.2 (0.321), width: 16 Scan time: 1.220 The best scores are: opt bits E(32554) CCDS30928.1 SH2D1B gene_id:117157|Hs108|chr1 ( 132) 888 207.9 1.5e-54 CCDS48162.1 SH2D1A gene_id:4068|Hs108|chrX ( 125) 355 88.1 1.7e-18 CCDS14608.1 SH2D1A gene_id:4068|Hs108|chrX ( 128) 350 86.9 3.7e-18 CCDS8213.1 INPPL1 gene_id:3636|Hs108|chr11 (1258) 285 73.0 5.8e-13 CCDS77543.1 INPP5D gene_id:3635|Hs108|chr2 (1188) 260 67.3 2.7e-11 CCDS74672.1 INPP5D gene_id:3635|Hs108|chr2 (1189) 260 67.3 2.7e-11 >>CCDS30928.1 SH2D1B gene_id:117157|Hs108|chr1 (132 aa) initn: 888 init1: 888 opt: 888 Z-score: 1093.0 bits: 207.9 E(32554): 1.5e-54 Smith-Waterman score: 888; 100.0% identity (100.0% similar) in 132 aa overlap (1-132:1-132) 10 20 30 40 50 60 pF1KE1 MDLPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCVSFKNIVYTYRIFREKH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 MDLPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCVSFKNIVYTYRIFREKH 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 GYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMVVHLLKPIKRTSPSLRWRGLKLELETF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS30 GYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMVVHLLKPIKRTSPSLRWRGLKLELETF 70 80 90 100 110 120 130 pF1KE1 VNSNSDYVDVLP :::::::::::: CCDS30 VNSNSDYVDVLP 130 >>CCDS48162.1 SH2D1A gene_id:4068|Hs108|chrX (125 aa) initn: 376 init1: 346 opt: 355 Z-score: 445.7 bits: 88.1 E(32554): 1.7e-18 Smith-Waterman score: 355; 43.5% identity (73.0% similar) in 115 aa overlap (1-114:1-115) 10 20 30 40 50 pF1KE1 MD-LPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCVSFKNIVYTYRIFREK :: . :::..... : ::: :.::..:::::::.::: :::: ... .::::. . . CCDS48 MDAVAVYHGKISRETGEKLLLATGLDGSYLLRDSESVPGVYCLCVLYHGYIYTYRVSQTE 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 HGYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMVVHLLKPIKRTSPSLRWRGLKLELET : . .:: : :. : ..:.::: :.::.::.:. : :... : . .:.. CCDS48 TGSWSAETAPGVHKRYFRKIKNLISAFQKPDQGIVIPLQYPVEKKSSARSTQGIREDPDV 70 80 90 100 110 120 120 130 pF1KE1 FVNSNSDYVDVLP CCDS48 CLKAP >>CCDS14608.1 SH2D1A gene_id:4068|Hs108|chrX (128 aa) initn: 376 init1: 346 opt: 350 Z-score: 439.5 bits: 86.9 E(32554): 3.7e-18 Smith-Waterman score: 350; 44.2% identity (72.6% similar) in 113 aa overlap (1-112:1-113) 10 20 30 40 50 pF1KE1 MD-LPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCVSFKNIVYTYRIFREK :: . :::..... : ::: :.::..:::::::.::: :::: ... .::::. . . CCDS14 MDAVAVYHGKISRETGEKLLLATGLDGSYLLRDSESVPGVYCLCVLYHGYIYTYRVSQTE 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE1 HGYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMVVHLLKPIKRTSPSLRWRGLKLELET : . .:: : :. : ..:.::: :.::.::.:. : :... : . .: CCDS14 TGSWSAETAPGVHKRYFRKIKNLISAFQKPDQGIVIPLQYPVEKKSSARSTQGTTGIRED 70 80 90 100 110 120 120 130 pF1KE1 FVNSNSDYVDVLP CCDS14 PDVCLKAP >>CCDS8213.1 INPPL1 gene_id:3636|Hs108|chr11 (1258 aa) initn: 283 init1: 283 opt: 285 Z-score: 346.1 bits: 73.0 E(32554): 5.8e-13 Smith-Waterman score: 285; 42.9% identity (71.4% similar) in 98 aa overlap (5-102:21-118) 10 20 30 40 pF1KE1 MDLPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCV .:: :.. : :: . : ::.::.:::::. :.. ::: CCDS82 MASACGAPGPGGALGSQAPSWYHRDLSRAAAEELLARAGRDGSFLVRDSESVAGAFALCV 10 20 30 40 50 60 50 60 70 80 90 100 pF1KE1 SFKNIVYTYRIFREKHGYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMVVHLLKPIKRT ... :.::::. . . . .::..: : . : .: :::. . .::::.: :: :.. CCDS82 LYQKHVHTYRILPDGEDFLAVQTSQGVPVRRFQTLGELIGLYAQPNQGLVCALLLPVEGE 70 80 90 100 110 120 110 120 130 pF1KE1 SPSLRWRGLKLELETFVNSNSDYVDVLP CCDS82 REPDPPDDRDASDGEDEKPPLPPRSGSTSISAPTGPSSPLPAPETPTAPAAESAPNGLST 130 140 150 160 170 180 >>CCDS77543.1 INPP5D gene_id:3635|Hs108|chr2 (1188 aa) initn: 260 init1: 260 opt: 260 Z-score: 316.1 bits: 67.3 E(32554): 2.7e-11 Smith-Waterman score: 260; 42.1% identity (68.4% similar) in 95 aa overlap (7-101:7-101) 10 20 30 40 50 60 pF1KE1 MDLPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCVSFKNIVYTYRIFREKH :: .:.. : :: . : ::.::.: :::: . ::: ..: ::::::. .. CCDS77 MVPCWNHGNITRSKAEELLSRTGKDGSFLVRASESISRAYALCVLYRNCVYTYRILPNED 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 GYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMVVHLLKPIKRTSPSLRWRGLKLELETF . .:..:: . : .: .:: ..: :.:.:.:: :. CCDS77 DKFTVQASEGVSMRFFTKLDQLIEFYKKENMGLVTHLQYPVPLEEEDTGDDPEEDTESVV 70 80 90 100 110 120 130 pF1KE1 VNSNSDYVDVLP CCDS77 SPPELPPRNIPLTASSCEAKEVPFSNENPRATETSRPSLSETLFQRLQSMDTSGLPEEHL 130 140 150 160 170 180 >>CCDS74672.1 INPP5D gene_id:3635|Hs108|chr2 (1189 aa) initn: 260 init1: 260 opt: 260 Z-score: 316.1 bits: 67.3 E(32554): 2.7e-11 Smith-Waterman score: 260; 42.1% identity (68.4% similar) in 95 aa overlap (7-101:7-101) 10 20 30 40 50 60 pF1KE1 MDLPYYHGRLTKQDCETLLLKEGVDGNFLLRDSESIPGVLCLCVSFKNIVYTYRIFREKH :: .:.. : :: . : ::.::.: :::: . ::: ..: ::::::. .. CCDS74 MVPCWNHGNITRSKAEELLSRTGKDGSFLVRASESISRAYALCVLYRNCVYTYRILPNED 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE1 GYYRIQTAEGSPKQVFPSLKELISKFEKPNQGMVVHLLKPIKRTSPSLRWRGLKLELETF . .:..:: . : .: .:: ..: :.:.:.:: :. CCDS74 DKFTVQASEGVSMRFFTKLDQLIEFYKKENMGLVTHLQYPVPLEEEDTGDDPEEDTVESV 70 80 90 100 110 120 130 pF1KE1 VNSNSDYVDVLP CCDS74 VSPPELPPRNIPLTASSCEAKEVPFSNENPRATETSRPSLSETLFQRLQSMDTSGLPEEH 130 140 150 160 170 180 132 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 22:42:32 2016 done: Sun Nov 6 22:42:32 2016 Total Scan time: 1.220 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]