FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7578, 284 aa 1>>>pF1KB7578 284 - 284 aa - 284 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.3897+/-0.000815; mu= 5.0406+/- 0.050 mean_var=289.6243+/-59.381, 0's: 0 Z-trim(117.7): 146 B-trim: 172 in 1/54 Lambda= 0.075363 statistics sampled from 18336 (18493) to 18336 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.841), E-opt: 0.2 (0.568), width: 16 Scan time: 3.210 The best scores are: opt bits E(32554) CCDS1947.1 TLX2 gene_id:3196|Hs108|chr2 ( 284) 1981 227.6 7.9e-60 CCDS34288.1 TLX3 gene_id:30012|Hs108|chr5 ( 291) 1016 122.7 3.1e-28 CCDS7510.1 TLX1 gene_id:3195|Hs108|chr10 ( 330) 866 106.5 2.7e-23 CCDS55725.1 TLX1 gene_id:3195|Hs108|chr10 ( 257) 547 71.7 6.3e-13 >>CCDS1947.1 TLX2 gene_id:3196|Hs108|chr2 (284 aa) initn: 1981 init1: 1981 opt: 1981 Z-score: 1187.6 bits: 227.6 E(32554): 7.9e-60 Smith-Waterman score: 1981; 100.0% identity (100.0% similar) in 284 aa overlap (1-284:1-284) 10 20 30 40 50 60 pF1KB7 MEPGMLGPHNLPHHEPISFGIDQILSGPETPGGGLGLGRGGQGHGENGAFSGGYHGASGY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 MEPGMLGPHNLPHHEPISFGIDQILSGPETPGGGLGLGRGGQGHGENGAFSGGYHGASGY 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 GPAGSLAPLPGSSGVGPGGVIRVPAHRPLPVPPPAGGAPAVPGPSGLGGAGGLAGLTFPW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 GPAGSLAPLPGSSGVGPGGVIRVPAHRPLPVPPPAGGAPAVPGPSGLGGAGGLAGLTFPW 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 MDSGRRFAKDRLTAALSPFSGTRRIGHPYQNRTPPKRKKPRTSFSRSQVLELERRFLRQK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 MDSGRRFAKDRLTAALSPFSGTRRIGHPYQNRTPPKRKKPRTSFSRSQVLELERRFLRQK 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 YLASAERAALAKALRMTDAQVKTWFQNRRTKWRRQTAEEREAERHRAGRLLLHLQQDALP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS19 YLASAERAALAKALRMTDAQVKTWFQNRRTKWRRQTAEEREAERHRAGRLLLHLQQDALP 190 200 210 220 230 240 250 260 270 280 pF1KB7 RPLRPPLPPDPLCLHNSSLFALQNLQPWAEDNKVASVSGLASVV :::::::::::::::::::::::::::::::::::::::::::: CCDS19 RPLRPPLPPDPLCLHNSSLFALQNLQPWAEDNKVASVSGLASVV 250 260 270 280 >>CCDS34288.1 TLX3 gene_id:30012|Hs108|chr5 (291 aa) initn: 1063 init1: 770 opt: 1016 Z-score: 620.5 bits: 122.7 E(32554): 3.1e-28 Smith-Waterman score: 1102; 64.1% identity (80.1% similar) in 287 aa overlap (12-280:10-290) 10 20 30 40 50 pF1KB7 MEPGMLGPHNLPH-HEPISFGIDQILSGPETPGGGLGLGRGGQGHGENGAFSGGYHGASG :: ::::::::::::..:. .. :: .: . :. :: ::. CCDS34 MEAPASAQTPHPHEPISFGIDQILNSPDQDSAPA--PRGPDGASYLGGPPGGRPGATY 10 20 30 40 50 60 70 80 90 100 pF1KB7 YG-PA---GSLAPLP--GSSGVG----PGGVIRVPAHRPLP--VPPPAGGA-PAVPG-PS . :: : ::. :: .:. :.:::::::::::: :::: .: ::.:. :. CCDS34 PSLPASFAGLGAPFEDAGSYSVNLSLAPAGVIRVPAHRPLPGAVPPPLPSALPAMPSVPT 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB7 GLGGAGGLAGLTFPWMDSGRRFAKDRLTAA--LSPFSGTRRIGHPYQNRTPPKRKKPRTS ...:.::.::::.:.:::.:::.::: :.::. :::::::::::::::::::::: CCDS34 ----VSSLGGLNFPWMESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTS 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB7 FSRSQVLELERRFLRQKYLASAERAALAKALRMTDAQVKTWFQNRRTKWRRQTAEEREAE ::: :. :::.:: :::::::::::::::.:.:::::::::::::::::::::::::::: CCDS34 FSRVQICELEKRFHRQKYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAE 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB7 RHRAGRLLLHLQQDALPRPLRPPLPPDPLCLHNSSLFALQNLQPWAEDN-KVASVSGLAS :..:.::.:.::.::. . : . :::::::::::::::::::: ::. :: .:..: CCDS34 RQQASRLMLQLQHDAFQKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV 240 250 260 270 280 290 pF1KB7 VV >>CCDS7510.1 TLX1 gene_id:3195|Hs108|chr10 (330 aa) initn: 1021 init1: 715 opt: 866 Z-score: 531.7 bits: 106.5 E(32554): 2.7e-23 Smith-Waterman score: 1018; 55.2% identity (68.6% similar) in 328 aa overlap (13-282:13-327) 10 20 30 40 pF1KB7 MEPGMLGPHNLPHHEPISFGIDQILSGPETPG----------GGLGLG--------RGGQ : :::::::::::..:. : : ::: :: CCDS75 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG 10 20 30 40 50 60 50 60 70 pF1KB7 GH------GENGAFSGGYHGASGYGPAG-----SLAPLPGS--------SGVGPGG---- : : ::.. : :. : :::: :..:: :: .: :::: CCDS75 GSAAATGAGGAGAYGTGGPGGPG-GPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGS 70 80 90 100 110 80 90 100 110 120 pF1KB7 -----------VIRVPAHRPLPV----PPP-AGGAPAVPGPSGLGGAGGLAGLTFPWMDS :::::::::: : : : : :.::. .. :...:.:::::::.: CCDS75 SGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMES 120 130 140 150 160 170 130 140 150 160 170 180 pF1KB7 GRRFAKDRLTAALSPFSGTRRIGHPYQNRTPPKRKKPRTSFSRSQVLELERRFLRQKYLA .::..:::.: :::::::::::.:::::::.: :. :::.:: :::::: CCDS75 NRRYTKDRFT------------GHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLA 180 190 200 210 220 190 200 210 220 230 240 pF1KB7 SAERAALAKALRMTDAQVKTWFQNRRTKWRRQTAEEREAERHRAGRLLLHLQQDALPRPL :::::::::::.:::::::::::::::::::::::::::::..:.:.::.:::.:. . : CCDS75 SAERAALAKALKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAFQKSL 230 240 250 260 270 280 250 260 270 280 pF1KB7 RPPLPPDPLCLHNSSLFALQNLQPWAEDN-KVASVSGLASVV ::: ::::.::::::::::::::..:. :..::...:: CCDS75 AQPLPADPLCVHNSSLFALQNLQPWSDDSTKITSVTSVASACE 290 300 310 320 330 >>CCDS55725.1 TLX1 gene_id:3195|Hs108|chr10 (257 aa) initn: 580 init1: 423 opt: 547 Z-score: 345.5 bits: 71.7 E(32554): 6.3e-13 Smith-Waterman score: 699; 51.9% identity (62.4% similar) in 258 aa overlap (13-213:13-257) 10 20 30 40 pF1KB7 MEPGMLGPHNLPHHEPISFGIDQILSGPETPG----------GGLGLG--------RGGQ : :::::::::::..:. : : ::: :: CCDS55 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGGG 10 20 30 40 50 60 50 60 70 pF1KB7 GH------GENGAFSGGYHGASGYGPAG-----SLAPLPGS--------SGVGPGG---- : : ::.. : :. : :::: :..:: :: .: :::: CCDS55 GSAAATGAGGAGAYGTGGPGGPG-GPAGGGGACSMGPLTGSYNVNMALAGGPGPGGGGGS 70 80 90 100 110 80 90 100 110 120 pF1KB7 -----------VIRVPAHRPLPV----PPP-AGGAPAVPGPSGLGGAGGLAGLTFPWMDS :::::::::: : : : : :.::. .. :...:.:::::::.: CCDS55 SGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWMES 120 130 140 150 160 170 130 140 150 160 170 180 pF1KB7 GRRFAKDRLTAALSPFSGTRRIGHPYQNRTPPKRKKPRTSFSRSQVLELERRFLRQKYLA .::..:::.: :::::::::::.:::::::.: :. :::.:: :::::: CCDS55 NRRYTKDRFT------------GHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQKYLA 180 190 200 210 220 190 200 210 220 230 240 pF1KB7 SAERAALAKALRMTDAQVKTWFQNRRTKWRRQTAEEREAERHRAGRLLLHLQQDALPRPL :::::::::::.:::::::::::::::::: CCDS55 SAERAALAKALKMTDAQVKTWFQNRRTKWR 230 240 250 284 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 18:25:15 2016 done: Sun Nov 6 18:25:16 2016 Total Scan time: 3.210 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]