FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KA1435, 410 aa 1>>>pF1KA1435 410 - 410 aa - 410 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.8959+/-0.00115; mu= 4.3994+/- 0.067 mean_var=174.2759+/-37.858, 0's: 0 Z-trim(105.9): 132 B-trim: 54 in 1/50 Lambda= 0.097153 statistics sampled from 8572 (8706) to 8572 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.622), E-opt: 0.2 (0.267), width: 16 Scan time: 2.530 The best scores are: opt bits E(32554) CCDS33387.1 WDFY1 gene_id:57590|Hs108|chr2 ( 410) 2885 417.2 1.4e-116 CCDS9429.1 WDFY2 gene_id:115825|Hs108|chr13 ( 400) 1925 282.6 4.5e-76 >>CCDS33387.1 WDFY1 gene_id:57590|Hs108|chr2 (410 aa) initn: 2885 init1: 2885 opt: 2885 Z-score: 2206.3 bits: 417.2 E(32554): 1.4e-116 Smith-Waterman score: 2885; 100.0% identity (100.0% similar) in 410 aa overlap (1-410:1-410) 10 20 30 40 50 60 pF1KA1 MAAEIHSRPQSSRPVLLSKIEGHQDAVTAALLIPKEDGVITASEDRTIRVWLKRDSGQYW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MAAEIHSRPQSSRPVLLSKIEGHQDAVTAALLIPKEDGVITASEDRTIRVWLKRDSGQYW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KA1 PSIYHTMASPCSAMAYHHDSRRIFVGQDNGAVMEFHVSEDFNKMNFIKTYPAHQNRVSAI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PSIYHTMASPCSAMAYHHDSRRIFVGQDNGAVMEFHVSEDFNKMNFIKTYPAHQNRVSAI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KA1 IFSLATEWVISTGHDKCVSWMCTRSGNMLGRHFFTSWASCLQYDFDTQYAFVGDYSGQIT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 IFSLATEWVISTGHDKCVSWMCTRSGNMLGRHFFTSWASCLQYDFDTQYAFVGDYSGQIT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KA1 LLKLEQNTCSVITTLKGHEGSVACLWWDPIQRLLFSGASDNSIIMWDIGGRKGRTLLLQG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LLKLEQNTCSVITTLKGHEGSVACLWWDPIQRLLFSGASDNSIIMWDIGGRKGRTLLLQG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KA1 HHDKVQSLCYLQLTRQLVSCSSDGGIAVWNMDVSREEAPQWLESDSCQKCEQPFFWNIKQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 HHDKVQSLCYLQLTRQLVSCSSDGGIAVWNMDVSREEAPQWLESDSCQKCEQPFFWNIKQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KA1 MWDTKTLGLRQHHCRKCGQAVCGKCSSKRSSYPVMGFEFQVRVCDSCYDSIKDEDRTSLA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MWDTKTLGLRQHHCRKCGQAVCGKCSSKRSSYPVMGFEFQVRVCDSCYDSIKDEDRTSLA 310 320 330 340 350 360 370 380 390 400 410 pF1KA1 TFHEGKHNISHMSMDIARGLMVTCGTDRIVKIWDMTPVVGCSLATGFSPH :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 TFHEGKHNISHMSMDIARGLMVTCGTDRIVKIWDMTPVVGCSLATGFSPH 370 380 390 400 410 >>CCDS9429.1 WDFY2 gene_id:115825|Hs108|chr13 (400 aa) initn: 1925 init1: 1925 opt: 1925 Z-score: 1479.3 bits: 282.6 E(32554): 4.5e-76 Smith-Waterman score: 1925; 61.9% identity (88.2% similar) in 399 aa overlap (1-399:1-399) 10 20 30 40 50 60 pF1KA1 MAAEIHSRPQSSRPVLLSKIEGHQDAVTAALLIPKEDGVITASEDRTIRVWLKRDSGQYW :::::. .: . .:.::...:: :..:. :...:::.:::..:::::.:::::::::::: CCDS94 MAAEIQPKPLTRKPILLQRMEGSQEVVNMAVIVPKEEGVISVSEDRTVRVWLKRDSGQYW 10 20 30 40 50 60 70 80 90 100 110 120 pF1KA1 PSIYHTMASPCSAMAYHHDSRRIFVGQDNGAVMEFHVSEDFNKMNFIKTYPAHQNRVSAI ::.::.: :::: :... ..::. .: :::.. :: .:::.:::. .:.: :::.::. : CCDS94 PSVYHAMPSPCSCMSFNPETRRLSIGLDNGTISEFILSEDYNKMTPVKNYQAHQSRVTMI 70 80 90 100 110 120 130 140 150 160 170 180 pF1KA1 IFSLATEWVISTGHDKCVSWMCTRSGNMLGRHFFTSWASCLQYDFDTQYAFVGDYSGQIT .: : :::.:::.:: .: :..::. :: . .. :: ::.: .:...:.::.:::.: CCDS94 LFVLELEWVLSTGQDKQFAWHCSESGQRLGGYRTSAVASGLQFDVETRHVFIGDHSGQVT 130 140 150 160 170 180 190 200 210 220 230 240 pF1KA1 LLKLEQNTCSVITTLKGHEGSVACLWWDPIQRLLFSGASDNSIIMWDIGGRKGRTLLLQG .:::::..:...::..:: :.:. : :::.::.::::.::.:.:::::::::: .. ::: CCDS94 ILKLEQENCTLVTTFRGHTGGVTALCWDPVQRVLFSGSSDHSVIMWDIGGRKGTAIELQG 190 200 210 220 230 240 250 260 270 280 290 300 pF1KA1 HHDKVQSLCYLQLTRQLVSCSSDGGIAVWNMDVSREEAPQWLESDSCQKCEQPFFWNIKQ :.:.::.: : : ::::.::..::::.:::::: :.:.:.::.:::::::.::::::.:: CCDS94 HNDRVQALSYAQHTRQLISCGGDGGIVVWNMDVERQETPEWLDSDSCQKCDQPFFWNFKQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KA1 MWDTKTLGLRQHHCRKCGQAVCGKCSSKRSSYPVMGFEFQVRVCDSCYDSIKDEDRTSLA :::.: .:::::::::::.:::::::::::: :.:::::.:::::::...: ::.:. : CCDS94 MWDSKKIGLRQHHCRKCGKAVCGKCSSKRSSIPLMGFEFEVRVCDSCHEAITDEERAPTA 310 320 330 340 350 360 370 380 390 400 410 pF1KA1 TFHEGKHNISHMSMDIARGLMVTCGTDRIVKIWDMTPVVGCSLATGFSPH :::..:::: :. .: .:: ..: :::...:.::::::: CCDS94 TFHDSKHNIVHVHFDATRGWLLTSGTDKVIKLWDMTPVVS 370 380 390 400 410 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 06:15:26 2016 done: Thu Nov 3 06:15:27 2016 Total Scan time: 2.530 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]