FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7040, 462 aa 1>>>pF1KB7040 462 - 462 aa - 462 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.5708+/-0.00117; mu= 2.2839+/- 0.070 mean_var=153.4840+/-31.502, 0's: 0 Z-trim(106.9): 38 B-trim: 117 in 1/50 Lambda= 0.103524 statistics sampled from 9265 (9280) to 9265 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.632), E-opt: 0.2 (0.285), width: 16 Scan time: 3.050 The best scores are: opt bits E(32554) CCDS13345.1 SEMG1 gene_id:6406|Hs108|chr20 ( 462) 3078 471.9 6.3e-133 CCDS13346.1 SEMG2 gene_id:6407|Hs108|chr20 ( 582) 2133 330.8 2.4e-90 >>CCDS13345.1 SEMG1 gene_id:6406|Hs108|chr20 (462 aa) initn: 3078 init1: 3078 opt: 3078 Z-score: 2500.0 bits: 471.9 E(32554): 6.3e-133 Smith-Waterman score: 3078; 100.0% identity (100.0% similar) in 462 aa overlap (1-462:1-462) 10 20 30 40 50 60 pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS13 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH 370 380 390 400 410 420 430 440 450 460 pF1KB7 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT :::::::::::::::::::::::::::::::::::::::::: CCDS13 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT 430 440 450 460 >>CCDS13346.1 SEMG2 gene_id:6407|Hs108|chr20 (582 aa) initn: 3619 init1: 2115 opt: 2133 Z-score: 1735.6 bits: 330.8 E(32554): 2.4e-90 Smith-Waterman score: 2133; 75.0% identity (89.2% similar) in 436 aa overlap (1-436:1-436) 10 20 30 40 50 60 pF1KB7 MKPNIIFVLSLLLILEKQAAVMGQKGGSKGRLPSEFSQFPHGQKGQHYSGQKGKQQTESK :: :.::::::::::::::::::::::::.::: :::::::::::: ::: .:.:.:: CCDS13 MKSIILFVLSLLLILEKQAAVMGQKGGSKGQLPSGSSQFPHGQKGQHYFGQKDQQHTKSK 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 GSFSIQYTYHVDANDHDQSRKSQQYDLNALHKTTKSQRHLGGSQQLLHNKQEGRDHDKSK ::::::.::::: :::: .:::::::::::::.:::..:::::::::. ::::::::::: CCDS13 GSFSIQHTYHVDINDHDWTRKSQQYDLNALHKATKSKQHLGGSQQLLNYKQEGRDHDKSK 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 GHFHRVVIHHKGGKAHRGTQNPSQDQGNSPSGKGISSQYSNTEERLWVHGLSKEQTSVSG :::: .:::::::.::.:::::::::::::::::.::: ::::.:::::::::::.:.:: CCDS13 GHFHMIVIHHKGGQAHHGTQNPSQDQGNSPSGKGLSSQCSNTEKRLWVHGLSKEQASASG 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 AQKGRKQGGSQSSYVLQTEELVANKQQRETKNSHQNKGHYQNVVEVREEHSSKVQTSLCP ::::: ::::::::::::::::.:::::::::::::::::::::.::::::::.:::: : CCDS13 AQKGRTQGGSQSSYVLQTEELVVNKQQRETKNSHQNKGHYQNVVDVREEHSSKLQTSLHP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 AHQDKLQHGSKDIFSTQDELLVYNKNQHQTKNLNQDQQHGRKANKISYQSSSTEERRLHY ::::.:::: ::::.::::::::::::::::::.:::.:::::.:::: :: ::::.::. CCDS13 AHQDRLQHGPKDIFTTQDELLVYNKNQHQTKNLSQDQEHGRKAHKISYPSSRTEERQLHH 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 GENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKISYQSSSTEERRLHY ::..::::::..:: ::::: .::::.:.:: ::.:::..: :::::::::::::.:. CCDS13 GEKSVQKDVSKGSISIQTEEKIHGKSQNQVTIHSQDQEHGHKENKISYQSSSTEERHLNC 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 GENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGESGQSTNREQDLLSH ::.:.:: ::. :: :::. . :::: :. :.: .:.. . : ::.. :. :. CCDS13 GEKGIQKGVSKGSISIQTEEQIHGKSQNQVRIPSQAQEYGHKENKISYQSSSTEERRLNS 370 380 390 400 410 420 430 440 450 460 pF1KB7 EQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT .: .. :.:...: CCDS13 GEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKMSYQSSSTEERRLNY 430 440 450 460 470 480 >-- initn: 962 init1: 618 opt: 618 Z-score: 512.7 bits: 104.5 E(32554): 3.1e-22 Smith-Waterman score: 618; 65.1% identity (85.6% similar) in 146 aa overlap (317-462:437-582) 290 300 310 320 330 340 pF1KB7 SYQSSSTEERRLHYGENGVQKDVSQSSIYSQTEEKAQGKSQKQITIPSQEQEHSQKANKI ::::: .::::.:.:::::.:::..: ::. CCDS13 SYQSSSTEERRLNSGEKDVQKGVSKGSISIQTEEKIHGKSQNQVTIPSQDQEHGHKENKM 410 420 430 440 450 460 350 360 370 380 390 400 pF1KB7 SYQSSSTEERRLHYGENGVQKDVSQRSIYSQTEKLVAGKSQIQAPNPKQEPWHGENAKGE ::::::::::::.:: ...:::::: :: : :::: ::::::.:::.:. : :.::::. CCDS13 SYQSSSTEERRLNYGGKSTQKDVSQSSISFQIEKLVEGKSQIQTPNPNQDQWSGQNAKGK 470 480 490 500 510 520 410 420 430 440 450 460 pF1KB7 SGQSTNREQDLLSHEQKGRHQHGSHGGLDIVIIEQEDDSDRHLAQHLNNDRNPLFT ::::.. .:::::::::::... : . .::: :.: .: ::.:. :.::::. : CCDS13 SGQSADSKQDLLSHEQKGRYKQESSESHNIVITEHEVAQDDHLTQQYNEDRNPIST 530 540 550 560 570 580 462 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 13:03:25 2016 done: Sun Nov 6 13:03:26 2016 Total Scan time: 3.050 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]