FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3030, 156 aa 1>>>pF1KE3030 156 - 156 aa - 156 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.9871+/-0.000852; mu= 9.0019+/- 0.051 mean_var=96.5281+/-19.809, 0's: 0 Z-trim(108.7): 114 B-trim: 0 in 0/50 Lambda= 0.130541 statistics sampled from 10282 (10413) to 10282 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.695), E-opt: 0.2 (0.32), width: 16 Scan time: 1.720 The best scores are: opt bits E(32554) CCDS44534.1 LMO1 gene_id:4004|Hs108|chr11 ( 156) 1124 221.3 1.9e-58 CCDS58118.1 LMO1 gene_id:4004|Hs108|chr11 ( 155) 1113 219.3 7.8e-58 CCDS58211.1 LMO3 gene_id:55885|Hs108|chr12 ( 156) 962 190.8 2.8e-49 CCDS58212.1 LMO3 gene_id:55885|Hs108|chr12 ( 163) 951 188.8 1.2e-48 CCDS8678.1 LMO3 gene_id:55885|Hs108|chr12 ( 145) 948 188.2 1.7e-48 CCDS44567.1 LMO2 gene_id:4005|Hs108|chr11 ( 158) 578 118.5 1.7e-27 CCDS7888.2 LMO2 gene_id:4005|Hs108|chr11 ( 227) 578 118.6 2.2e-27 CCDS713.1 LMO4 gene_id:8543|Hs108|chr1 ( 165) 554 114.0 4e-26 CCDS58210.1 LMO3 gene_id:55885|Hs108|chr12 ( 167) 469 98.0 2.7e-21 CCDS30962.1 LHX9 gene_id:56956|Hs108|chr1 ( 388) 440 92.8 2.3e-19 CCDS1338.1 LHX4 gene_id:89884|Hs108|chr1 ( 390) 440 92.8 2.3e-19 CCDS1393.1 LHX9 gene_id:56956|Hs108|chr1 ( 397) 440 92.8 2.3e-19 CCDS6853.1 LHX2 gene_id:9355|Hs108|chr9 ( 406) 430 90.9 8.7e-19 CCDS6994.1 LHX3 gene_id:8022|Hs108|chr9 ( 397) 406 86.4 2e-17 CCDS6995.1 LHX3 gene_id:8022|Hs108|chr9 ( 402) 406 86.4 2e-17 CCDS11316.1 LHX1 gene_id:3975|Hs108|chr17 ( 406) 379 81.3 6.7e-16 CCDS10290.1 ISL2 gene_id:64843|Hs108|chr15 ( 359) 370 79.6 2e-15 CCDS1247.1 LMX1A gene_id:4009|Hs108|chr1 ( 382) 370 79.6 2.1e-15 CCDS6866.2 LMX1B gene_id:4010|Hs108|chr9 ( 395) 367 79.1 3.2e-15 CCDS55342.1 LMX1B gene_id:4010|Hs108|chr9 ( 402) 367 79.1 3.2e-15 CCDS55343.1 LMX1B gene_id:4010|Hs108|chr9 ( 406) 367 79.1 3.2e-15 CCDS9171.1 LHX5 gene_id:64211|Hs108|chr12 ( 402) 365 78.7 4.2e-15 CCDS43314.1 ISL1 gene_id:3670|Hs108|chr5 ( 349) 358 77.3 9.3e-15 CCDS56583.1 LHX6 gene_id:26468|Hs108|chr9 ( 363) 349 75.7 3.1e-14 CCDS56584.1 LHX6 gene_id:26468|Hs108|chr9 ( 366) 349 75.7 3.1e-14 CCDS6837.2 LHX6 gene_id:26468|Hs108|chr9 ( 377) 349 75.7 3.2e-14 CCDS6838.2 LHX6 gene_id:26468|Hs108|chr9 ( 392) 349 75.7 3.3e-14 CCDS58008.1 LHX8 gene_id:431707|Hs108|chr1 ( 346) 347 75.3 3.9e-14 CCDS30756.1 LHX8 gene_id:431707|Hs108|chr1 ( 356) 347 75.3 4e-14 >>CCDS44534.1 LMO1 gene_id:4004|Hs108|chr11 (156 aa) initn: 1124 init1: 1124 opt: 1124 Z-score: 1163.0 bits: 221.3 E(32554): 1.9e-58 Smith-Waterman score: 1124; 100.0% identity (100.0% similar) in 156 aa overlap (1-156:1-156) 10 20 30 40 50 60 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCN 70 80 90 100 110 120 130 140 150 pF1KE3 QRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ :::::::::::::::::::::::::::::::::::: CCDS44 QRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ 130 140 150 >>CCDS58118.1 LMO1 gene_id:4004|Hs108|chr11 (155 aa) initn: 1113 init1: 1113 opt: 1113 Z-score: 1151.8 bits: 219.3 E(32554): 7.8e-58 Smith-Waterman score: 1113; 99.4% identity (100.0% similar) in 155 aa overlap (2-156:1-155) 10 20 30 40 50 60 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE ::::.:::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 MVLDQEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 VGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS58 VGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCN 60 70 80 90 100 110 130 140 150 pF1KE3 QRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ :::::::::::::::::::::::::::::::::::: CCDS58 QRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ 120 130 140 150 >>CCDS58211.1 LMO3 gene_id:55885|Hs108|chr12 (156 aa) initn: 1052 init1: 948 opt: 962 Z-score: 998.1 bits: 190.8 E(32554): 2.8e-49 Smith-Waterman score: 962; 85.3% identity (91.0% similar) in 156 aa overlap (1-156:1-156) 10 20 30 40 50 60 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE :. .: :. :::::: : ::::::::::::::::::::::::::::::::::::::: CCDS58 MQKKEKSFGIQMLSVQPDTKPKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 VGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCN :::::::::::::::::::::::.::::::::::::::::::::.::::::::::::::: CCDS58 VGSTLYTKANLILCRRDYLRLFGVTGNCAACSKLIPAFEMVMRAKDNVYHLDCFACQLCN 70 80 90 100 110 120 130 140 150 pF1KE3 QRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ ::::::::::::::::::: ::::: .. . ::. CCDS58 QRFCVGDKFFLKNNMILCQTDYEEGLMKEGYAPQVR 130 140 150 >>CCDS58212.1 LMO3 gene_id:55885|Hs108|chr12 (163 aa) initn: 1040 init1: 948 opt: 951 Z-score: 986.6 bits: 188.8 E(32554): 1.2e-48 Smith-Waterman score: 951; 88.4% identity (93.2% similar) in 147 aa overlap (10-156:17-163) 10 20 30 40 50 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCAC . :::::: : :::::::::::::::::::::::::::::::: CCDS58 MLMEGGRAAAARNVSSIQMLSVQPDTKPKGCAGCNRKIKDRYLLKALDKYWHEDCLKCAC 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 CDCRLGEVGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDC ::::::::::::::::::::::::::::::.::::::::::::::::::::.:::::::: CCDS58 CDCRLGEVGSTLYTKANLILCRRDYLRLFGVTGNCAACSKLIPAFEMVMRAKDNVYHLDC 70 80 90 100 110 120 120 130 140 150 pF1KE3 FACQLCNQRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ :::::::::::::::::::::::::: ::::: .. . ::. CCDS58 FACQLCNQRFCVGDKFFLKNNMILCQTDYEEGLMKEGYAPQVR 130 140 150 160 >>CCDS8678.1 LMO3 gene_id:55885|Hs108|chr12 (145 aa) initn: 1052 init1: 948 opt: 948 Z-score: 984.3 bits: 188.2 E(32554): 1.7e-48 Smith-Waterman score: 948; 89.7% identity (93.8% similar) in 145 aa overlap (12-156:1-145) 10 20 30 40 50 60 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE :::::: : ::::::::::::::::::::::::::::::::::::::: CCDS86 MLSVQPDTKPKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE 10 20 30 40 70 80 90 100 110 120 pF1KE3 VGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCN :::::::::::::::::::::::.::::::::::::::::::::.::::::::::::::: CCDS86 VGSTLYTKANLILCRRDYLRLFGVTGNCAACSKLIPAFEMVMRAKDNVYHLDCFACQLCN 50 60 70 80 90 100 130 140 150 pF1KE3 QRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ ::::::::::::::::::: ::::: .. . ::. CCDS86 QRFCVGDKFFLKNNMILCQTDYEEGLMKEGYAPQVR 110 120 130 140 >>CCDS44567.1 LMO2 gene_id:4005|Hs108|chr11 (158 aa) initn: 653 init1: 565 opt: 578 Z-score: 607.2 bits: 118.5 E(32554): 1.7e-27 Smith-Waterman score: 578; 59.1% identity (81.9% similar) in 127 aa overlap (24-149:30-156) 10 20 30 40 50 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACC :.::...: :::.:::.:.:::::::.: : CCDS44 MSSAIERKSLDPSEEPVDEVLQIPPSLLTCGGCQQNIGDRYFLKAIDQYWHEDCLSCDLC 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE3 DCRLGEVGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCF ::::::: :: : . ::::::::::: : ::.:.: : :.::.::..:.::::.:: CCDS44 GCRLGEVGRRLYYKLGRKLCRRDYLRLFGQDGLCASCDKRIRAYEMTMRVKDKVYHLECF 70 80 90 100 110 120 120 130 140 150 pF1KE3 ACQLCNQRFCVGDKFFLKNNMILCQMD-YEEGQLNGTFESQVQ : :...:::::...: :. :.:..: :: ..:: CCDS44 KCAACQKHFCVGDRYLLINSDIVCEQDIYEWTKINGMI 130 140 150 >>CCDS7888.2 LMO2 gene_id:4005|Hs108|chr11 (227 aa) initn: 653 init1: 565 opt: 578 Z-score: 605.0 bits: 118.6 E(32554): 2.2e-27 Smith-Waterman score: 578; 59.1% identity (81.9% similar) in 127 aa overlap (24-149:99-225) 10 20 30 40 50 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCAC :.::...: :::.:::.:.:::::::.: CCDS78 PMSSAIERKSLDPSEEPVDEVLQIPPSLLTCGGCQQNIGDRYFLKAIDQYWHEDCLSCDL 70 80 90 100 110 120 60 70 80 90 100 110 pF1KE3 CDCRLGEVGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDC : ::::::: :: : . ::::::::::: : ::.:.: : :.::.::..:.::::.: CCDS78 CGCRLGEVGRRLYYKLGRKLCRRDYLRLFGQDGLCASCDKRIRAYEMTMRVKDKVYHLEC 130 140 150 160 170 180 120 130 140 150 pF1KE3 FACQLCNQRFCVGDKFFLKNNMILCQMD-YEEGQLNGTFESQVQ : : :...:::::...: :. :.:..: :: ..:: CCDS78 FKCAACQKHFCVGDRYLLINSDIVCEQDIYEWTKINGMI 190 200 210 220 >>CCDS713.1 LMO4 gene_id:8543|Hs108|chr1 (165 aa) initn: 638 init1: 554 opt: 554 Z-score: 582.5 bits: 114.0 E(32554): 4e-26 Smith-Waterman score: 554; 52.3% identity (78.8% similar) in 132 aa overlap (22-153:21-152) 10 20 30 40 50 60 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE : ::::. :: ::.:: :.:.::: ::::.::. .::. CCDS71 MVNPGSSSQPPPVTAGSLSWKRCAGCGGKIADRFLLYAMDSYWHSRCLKCSCCQAQLGD 10 20 30 40 50 70 80 90 100 110 120 pF1KE3 VGSTLYTKANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCN .:.. :::...:::: ::.::::..: :.::.. ::: :.::::. ::::: ::.:. : CCDS71 IGTSCYTKSGMILCRNDYIRLFGNSGACSACGQSIPASELVMRAQGNVYHLKCFTCSTCR 60 70 80 90 100 110 130 140 150 pF1KE3 QRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ .:. ::.: :. ..:. : . .:: ..: CCDS71 NRLVPGDRFHYINGSLFCEHDRPTALINGHLNSLQSNPLLPDQKVC 120 130 140 150 160 >>CCDS58210.1 LMO3 gene_id:55885|Hs108|chr12 (167 aa) initn: 861 init1: 469 opt: 469 Z-score: 495.9 bits: 98.0 E(32554): 2.7e-21 Smith-Waterman score: 766; 69.0% identity (76.8% similar) in 168 aa overlap (12-156:1-167) 10 20 30 40 50 60 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE :::::: : ::::::::::::::::::::::::::::::::::::::: CCDS58 MLSVQPDTKPKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGE 10 20 30 40 70 80 90 pF1KE3 ----VGSTLYTKA---NLILCRR----------DYLR------LFGTTGNCAACSKLIPA :: .: .: .. .: :. . :::.::::::::::::: CCDS58 HLKRCGS-IYIHAAHTEIGICVTLEWPLPFVIIDFSQQITIAWLFGVTGNCAACSKLIPA 50 60 70 80 90 100 100 110 120 130 140 150 pF1KE3 FEMVMRARDNVYHLDCFACQLCNQRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQVQ :::::::.:::::::::::::::::::::::::::::::::: ::::: .. . ::. CCDS58 FEMVMRAKDNVYHLDCFACQLCNQRFCVGDKFFLKNNMILCQTDYEEGLMKEGYAPQVR 110 120 130 140 150 160 >>CCDS30962.1 LHX9 gene_id:56956|Hs108|chr1 (388 aa) initn: 376 init1: 234 opt: 440 Z-score: 461.2 bits: 92.8 E(32554): 2.3e-19 Smith-Waterman score: 440; 45.0% identity (68.2% similar) in 151 aa overlap (6-155:47-190) 10 20 30 pF1KE3 MMVLDKEDGVPMLSVQPKGKQKGCAGCNRKIKDRY .. :.: :: :. : ::::. ::.::: CCDS30 SGGHIQGIMEEMERRSKTEARLAKGAQLNGRDAGMPPLS--PE-KPALCAGCGGKISDRY 20 30 40 50 60 70 40 50 60 70 80 90 pF1KE3 LLKALDKYWHEDCLKCACCDCRLG-EVGSTLYTKANLILCRRDYLRLFGTTGNCAACSKL : :.:: :: :::: :.:.:. : : ..: . : :..:: : : .. :: : CCDS30 YLLAVDKQWHLRCLKC--CECKLALESELTCFAKDGSIYCKEDYYRRF-SVQRCARCHLG 80 90 100 110 120 130 100 110 120 130 140 150 pF1KE3 IPAFEMVMRARDNVYHLDCFACQLCNQRFCVGDKFFLKNNMILCQMDYEEGQLNGTFESQ : : ::::::::.::::.::.:. ::. . .::.: .:.... :. .: :.: . : CCDS30 ISASEMVMRARDSVYHLSCFTCSTCNKTLTTGDHFGMKDSLVYCRAHFET-LLQGEYPPQ 140 150 160 170 180 pF1KE3 VQ . CCDS30 LSYTELAAKSGGLALPYFNGTGTVQKGRPRKRKSPALGVDIVNYNSGCNENEADHLDRDQ 190 200 210 220 230 240 156 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 04:56:52 2016 done: Sun Nov 6 04:56:52 2016 Total Scan time: 1.720 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]