FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7594, 291 aa 1>>>pF1KB7594 291 - 291 aa - 291 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.5463+/-0.000821; mu= 4.2043+/- 0.050 mean_var=270.5164+/-54.335, 0's: 0 Z-trim(116.6): 147 B-trim: 0 in 0/53 Lambda= 0.077979 statistics sampled from 17057 (17221) to 17057 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.817), E-opt: 0.2 (0.529), width: 16 Scan time: 3.080 The best scores are: opt bits E(32554) CCDS34288.1 TLX3 gene_id:30012|Hs108|chr5 ( 291) 1984 235.4 3.8e-62 CCDS1947.1 TLX2 gene_id:3196|Hs108|chr2 ( 284) 1016 126.5 2.3e-29 CCDS7510.1 TLX1 gene_id:3195|Hs108|chr10 ( 330) 852 108.1 9e-24 CCDS55725.1 TLX1 gene_id:3195|Hs108|chr10 ( 257) 512 69.7 2.5e-12 >>CCDS34288.1 TLX3 gene_id:30012|Hs108|chr5 (291 aa) initn: 1984 init1: 1984 opt: 1984 Z-score: 1229.2 bits: 235.4 E(32554): 3.8e-62 Smith-Waterman score: 1984; 100.0% identity (100.0% similar) in 291 aa overlap (1-291:1-291) 10 20 30 40 50 60 pF1KB7 MEAPASAQTPHPHEPISFGIDQILNSPDQDSAPAPRGPDGASYLGGPPGGRPGATYPSLP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 MEAPASAQTPHPHEPISFGIDQILNSPDQDSAPAPRGPDGASYLGGPPGGRPGATYPSLP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 ASFAGLGAPFEDAGSYSVNLSLAPAGVIRVPAHRPLPGAVPPPLPSALPAMPSVPTVSSL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 ASFAGLGAPFEDAGSYSVNLSLAPAGVIRVPAHRPLPGAVPPPLPSALPAMPSVPTVSSL 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 GGLNFPWMESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFSRVQICE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 GGLNFPWMESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFSRVQICE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 LEKRFHRQKYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQASRLM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 LEKRFHRQKYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQASRLM 190 200 210 220 230 240 250 260 270 280 290 pF1KB7 LQLQHDAFQKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV ::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 LQLQHDAFQKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV 250 260 270 280 290 >>CCDS1947.1 TLX2 gene_id:3196|Hs108|chr2 (284 aa) initn: 1063 init1: 770 opt: 1016 Z-score: 640.8 bits: 126.5 E(32554): 2.3e-29 Smith-Waterman score: 1102; 64.1% identity (80.1% similar) in 287 aa overlap (10-290:12-280) 10 20 30 40 50 pF1KB7 MEAPASAQTPHPHEPISFGIDQILNSPDQDSAPAP--RGPDGASYLGGPPGGRPGATY :: ::::::::::::..:. .. :: .: . :. :: ::. CCDS19 MEPGMLGPHNLPH-HEPISFGIDQILSGPETPGGGLGLGRGGQGHGENGAFSGGYHGASG 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 PSLPASFAGLGAPFEDAGSYSVNLSLAPAGVIRVPAHRPLPGAVPPPLPSALPAMPSVPT . :: : ::. :: .:. :.:::::::::::: :::: .: ::.:. :. CCDS19 YG-PA---GSLAPL--PGSSGVG----PGGVIRVPAHRPLP--VPPPAGGA-PAVPG-PS 60 70 80 90 100 120 130 140 150 160 170 pF1KB7 ----VSSLGGLNFPWMESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTS ...:.::.::::.:.:::.:::.::: :.::. :::::::::::::::::::::: CCDS19 GLGGAGGLAGLTFPWMDSGRRFAKDRLTAA--LSPFSGTRRIGHPYQNRTPPKRKKPRTS 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB7 FSRVQICELEKRFHRQKYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAE ::: :. :::.:: :::::::::::::::.:.:::::::::::::::::::::::::::: CCDS19 FSRSQVLELERRFLRQKYLASAERAALAKALRMTDAQVKTWFQNRRTKWRRQTAEEREAE 170 180 190 200 210 220 240 250 260 270 280 290 pF1KB7 RQQASRLMLQLQHDAFQKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV :..:.::.:.::.::. . : . :::::::::::::::::::: ::. :: .:..: CCDS19 RHRAGRLLLHLQQDALPRPLRPPLPPDPLCLHNSSLFALQNLQPWAEDN-KVASVSGLAS 230 240 250 260 270 280 CCDS19 VV >>CCDS7510.1 TLX1 gene_id:3195|Hs108|chr10 (330 aa) initn: 1111 init1: 792 opt: 852 Z-score: 540.3 bits: 108.1 E(32554): 9e-24 Smith-Waterman score: 1127; 59.6% identity (71.7% similar) in 332 aa overlap (11-290:10-325) 10 20 30 40 pF1KB7 MEAPASAQTPHP-H-EPISFGIDQILNSPDQDS--APAPRGPDGASYLG----------- :: : :::::::::::::::: . .:: : :: :: CCDS75 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGG 10 20 30 40 50 50 60 70 80 pF1KB7 -------GPPG-GRPGATYPSLPASFAGLGA-----PFEDAGSYSVNLSLA--P------ : : : :. :. :.. :: :. :. .:::.::..:: : CCDS75 GGSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPL--TGSYNVNMALAGGPGPGGGG 60 70 80 90 100 110 90 100 110 120 pF1KB7 -----------AGVIRVPAHRPLPGAV--PPPLPSALPAMPSVPT---VSSLGGLNFPWM :::::::::::: ::: : :: ..::..::::. :..: ::.:::: CCDS75 GSSGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWM 120 130 140 150 160 170 130 140 150 160 170 180 pF1KB7 ESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFSRVQICELEKRFHRQ ::.::..::::: :::::::::::.:::::::.:.:::::::::::: CCDS75 ESNRRYTKDRFT--------------GHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQ 180 190 200 210 220 190 200 210 220 230 240 pF1KB7 KYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQASRLMLQLQHDAF :::::::::::::.::::::::::::::::::::::::::::::::::.:..::::..:: CCDS75 KYLASAERAALAKALKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQANRILLQLQQEAF 230 240 250 260 270 280 250 260 270 280 290 pF1KB7 QKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV :::: . . ::::.:::::::::::::: .::.:. .:::. CCDS75 QKSLAQPLPADPLCVHNSSLFALQNLQPWSDDSTKITSVTSVASACE 290 300 310 320 330 >>CCDS55725.1 TLX1 gene_id:3195|Hs108|chr10 (257 aa) initn: 687 init1: 460 opt: 512 Z-score: 334.9 bits: 69.7 E(32554): 2.5e-12 Smith-Waterman score: 787; 56.4% identity (67.8% similar) in 264 aa overlap (11-222:10-257) 10 20 30 40 pF1KB7 MEAPASAQTPHP-H-EPISFGIDQILNSPDQDS--APAPRGPDGA-----------SYLG :: : :::::::::::::::: . .:: : :: .: : CCDS55 MEHLGPHHLHPGHAEPISFGIDQILNSPDQGGCMGPASRLQDGEYGLGCLVGGAYTYGG 10 20 30 40 50 50 60 70 80 pF1KB7 GPPGGRPGA----TY----PSLPASFAGLGA-----PFEDAGSYSVNLSLA--P------ : .. :: .: :. :.. :: :. :. .:::.::..:: : CCDS55 GGSAAATGAGGAGAYGTGGPGGPGGPAGGGGACSMGPL--TGSYNVNMALAGGPGPGGGG 60 70 80 90 100 110 90 100 110 120 pF1KB7 -----------AGVIRVPAHRPLPGAV--PPPLPSALPAMPSVPT---VSSLGGLNFPWM :::::::::::: ::: : :: ..::..::::. :..: ::.:::: CCDS55 GSSGGAGALSAAGVIRVPAHRPLAGAVAHPQPLATGLPTVPSVPAMPGVNNLTGLTFPWM 120 130 140 150 160 170 130 140 150 160 170 180 pF1KB7 ESSRRFVKDRFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFSRVQICELEKRFHRQ ::.::..::::: :::::::::::.:::::::.:.:::::::::::: CCDS55 ESNRRYTKDRFT--------------GHPYQNRTPPKKKKPRTSFTRLQICELEKRFHRQ 180 190 200 210 220 190 200 210 220 230 240 pF1KB7 KYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQQASRLMLQLQHDAF :::::::::::::.:::::::::::::::::::: CCDS55 KYLASAERAALAKALKMTDAQVKTWFQNRRTKWR 230 240 250 250 260 270 280 290 pF1KB7 QKSLNDSIQPDPLCLHNSSLFALQNLQPWEEDSSKVPAVTSLV 291 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 01:03:40 2016 done: Mon Nov 7 01:03:41 2016 Total Scan time: 3.080 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]