FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB8930, 281 aa 1>>>pF1KB8930 281 - 281 aa - 281 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.9778+/-0.000686; mu= 10.8984+/- 0.042 mean_var=159.6020+/-32.888, 0's: 0 Z-trim(116.3): 170 B-trim: 919 in 1/51 Lambda= 0.101521 statistics sampled from 16739 (16927) to 16739 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.52), width: 16 Scan time: 2.830 The best scores are: opt bits E(32554) CCDS31270.1 LBX1 gene_id:10660|Hs108|chr10 ( 281) 1859 283.0 1.7e-76 CCDS62938.1 LBX2 gene_id:85474|Hs108|chr2 ( 198) 515 86.0 2.4e-17 CCDS33228.1 LBX2 gene_id:85474|Hs108|chr2 ( 194) 381 66.3 1.9e-11 >>CCDS31270.1 LBX1 gene_id:10660|Hs108|chr10 (281 aa) initn: 1859 init1: 1859 opt: 1859 Z-score: 1486.8 bits: 283.0 E(32554): 1.7e-76 Smith-Waterman score: 1859; 100.0% identity (100.0% similar) in 281 aa overlap (1-281:1-281) 10 20 30 40 50 60 pF1KB8 MTSKEDGKAAPGEERRRSPLDHLPPPANSNKPLTPFSIEDILNKPSVRRSYSLCGAAHLL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MTSKEDGKAAPGEERRRSPLDHLPPPANSNKPLTPFSIEDILNKPSVRRSYSLCGAAHLL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB8 AAADKHAQGGLPLAGRALLSQTSPLCALEELASKTFKGLEVSVLQAAEGRDGMTIFGQRQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 AAADKHAQGGLPLAGRALLSQTSPLCALEELASKTFKGLEVSVLQAAEGRDGMTIFGQRQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB8 TPKKRRKSRTAFTNHQIYELEKRFLYQKYLSPADRDQIAQQLGLTNAQVITWFQNRRAKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 TPKKRRKSRTAFTNHQIYELEKRFLYQKYLSPADRDQIAQQLGLTNAQVITWFQNRRAKL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB8 KRDLEEMKADVESAKKLGPSGQMDIVALAELEQNSEATAGGGGGCGRAKSRPGSPVLPPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 KRDLEEMKADVESAKKLGPSGQMDIVALAELEQNSEATAGGGGGCGRAKSRPGSPVLPPG 190 200 210 220 230 240 250 260 270 280 pF1KB8 APKAPGAGALQLSPASPLTDQPASSQDCSEDEEDEEIDVDD ::::::::::::::::::::::::::::::::::::::::: CCDS31 APKAPGAGALQLSPASPLTDQPASSQDCSEDEEDEEIDVDD 250 260 270 280 >>CCDS62938.1 LBX2 gene_id:85474|Hs108|chr2 (198 aa) initn: 591 init1: 513 opt: 515 Z-score: 424.9 bits: 86.0 E(32554): 2.4e-17 Smith-Waterman score: 523; 46.7% identity (61.6% similar) in 229 aa overlap (59-281:13-198) 30 40 50 60 70 80 pF1KB8 SNKPLTPFSIEDILNKPSVRRSYSLCGAAHLLAAADKHAQGGLPLAGRA-LLSQ-----T ::. :: : .: : : : . : CCDS62 MNSGREPRTPRTLLSIADILAPRMVPRAPSAPQLPESGPGPT 10 20 30 40 90 100 110 120 130 140 pF1KB8 SPLCALEELASKTFKGLEVSVLQAAEGRDGMTIFGQRQTPKKRRKSRTAFTNHQIYELEK :::::::::.::::.::.. .:: .::: : .: .:::::::::: .:. :::. CCDS62 SPLCALEELTSKTFRGLDARALQPSEGRAGPDALGPGPFGRKRRKSRTAFTAQQVLELER 50 60 70 80 90 100 150 160 170 180 190 200 pF1KB8 RFLYQKYLSPADRDQIAQQLGLTNAQVITWFQNRRAKLKRDLEEMKADVESAKKLGPSGQ ::..::::.:..:: .: .:::.::::.:::::::::::::.:::.::: : . :.: CCDS62 RFVFQKYLAPSERDGLATRLGLANAQVVTWFQNRRAKLKRDVEEMRADVASLRALSP--- 110 120 130 140 150 210 220 230 240 250 260 pF1KB8 MDIVALAELEQNSEATAGGGGGCGRAKSRPGSPVLPPGAPKAPGAGALQLSPASPLTDQP ... :. : :: ::: :: : :.::.: CCDS62 -EVL------------------CSLA--------LPEGAPD-PG---LCLGPAGP----- 160 170 180 270 280 pF1KB8 ASSQDCSEDEEDEEIDVDD : ::::.::: CCDS62 ----DSRPHLSDEEIQVDD 190 >>CCDS33228.1 LBX2 gene_id:85474|Hs108|chr2 (194 aa) initn: 426 init1: 370 opt: 381 Z-score: 319.0 bits: 66.3 E(32554): 1.9e-11 Smith-Waterman score: 389; 45.9% identity (61.6% similar) in 172 aa overlap (110-281:66-194) 80 90 100 110 120 130 pF1KB8 SQTSPLCALEELASKTFKGLEVSVLQAAEGRDGMTIFGQRQTPKKRRKSRTAFTNHQIYE : : .: .:::::::::: .:. : CCDS33 RPGGWRWARRDLCKTASRAENNSQACRPQRRAGPDALGPGPFGRKRRKSRTAFTAQQVLE 40 50 60 70 80 90 140 150 160 170 180 190 pF1KB8 LEKRFLYQKYLSPADRDQIAQQLGLTNAQVITWFQNRRAKLKRDLEEMKADVESAKKLGP ::.::..::::.:..:: .: .:::.::::.:::::::::::::.:::.::: : . :.: CCDS33 LERRFVFQKYLAPSERDGLATRLGLANAQVVTWFQNRRAKLKRDVEEMRADVASLRALSP 100 110 120 130 140 150 200 210 220 230 240 250 pF1KB8 SGQMDIVALAELEQNSEATAGGGGGCGRAKSRPGSPVLPPGAPKAPGAGALQLSPASPLT ... :. : :: ::: :: : :.::.: CCDS33 ----EVL------------------CSLA--------LPEGAPD-PG---LCLGPAGP-D 160 170 180 260 270 280 pF1KB8 DQPASSQDCSEDEEDEEIDVDD ..: : ::::.::: CCDS33 SRPHLS--------DEEIQVDD 190 281 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:21:00 2016 done: Tue Nov 8 04:21:01 2016 Total Scan time: 2.830 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]