FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3880, 379 aa 1>>>pF1KE3880 379 - 379 aa - 379 aa Library: human.CCDS.faa 18921897 residues in 33420 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.6642+/-0.000791; mu= 2.7794+/- 0.048 mean_var=188.7254+/-38.503, 0's: 0 Z-trim(114.8): 11 B-trim: 163 in 2/48 Lambda= 0.093360 statistics sampled from 15524 (15533) to 15524 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.789), E-opt: 0.2 (0.465), width: 16 Scan time: 1.540 The best scores are: opt bits E(33420) CCDS45268.1 MINDY2 gene_id:54629|Hs109|chr15 ( 620) 2258 316.0 5.8e-86 CCDS42046.1 MINDY2 gene_id:54629|Hs109|chr15 ( 621) 2258 316.0 5.8e-86 CCDS976.1 MINDY1 gene_id:55793|Hs109|chr1 ( 469) 461 73.9 3.3e-13 CCDS53361.1 MINDY1 gene_id:55793|Hs109|chr1 ( 517) 461 73.9 3.6e-13 CCDS55635.1 MINDY1 gene_id:55793|Hs109|chr1 ( 374) 448 72.1 9.3e-13 >>CCDS45268.1 MINDY2 gene_id:54629|Hs109|chr15 (620 aa) initn: 2283 init1: 2258 opt: 2258 Z-score: 1657.0 bits: 316.0 E(33420): 5.8e-86 Smith-Waterman score: 2401; 91.7% identity (91.7% similar) in 409 aa overlap (1-375:1-409) 10 20 30 40 50 60 pF1KE3 MESSPESLQPLEHGVAAGPASGTGSSQEGLQETRLAAGDGPGVWAAETSGGNGLGAAAAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 MESSPESLQPLEHGVAAGPASGTGSSQEGLQETRLAAGDGPGVWAAETSGGNGLGAAAAR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 RSLPDSASPAGSPEVPGPCSSSAGLDLKDSGLESPAAAEAPLRGQYKVTASPETAVAGVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 RSLPDSASPAGSPEVPGPCSSSAGLDLKDSGLESPAAAEAPLRGQYKVTASPETAVAGVG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 HELGTAGDAGARPDLAGTCQAELTAAGSEEPSSAGGLSSSCSDPSPPGESPSLDSLESFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 HELGTAGDAGARPDLAGTCQAELTAAGSEEPSSAGGLSSSCSDPSPPGESPSLDSLESFS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 NLHSFPSSCEFNSEEGAENRVPEEEEGAAVLPGAVPLCKEEEGEETAQVLAASKERFPGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 NLHSFPSSCEFNSEEGAENRVPEEEEGAAVLPGAVPLCKEEEGEETAQVLAASKERFPGQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 SVYHIKWIQWKEENTPIITQNENGPCPLLAILNVLLLAWKVKLPPMMEIITAEQLMEYLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS45 SVYHIKWIQWKEENTPIITQNENGPCPLLAILNVLLLAWKVKLPPMMEIITAEQLMEYLG 250 260 270 280 290 300 310 320 330 340 pF1KE3 DYMLDAKPKEISEIQRLNYEQNMSDAMAILHKLQTGLDVN-------------------- :::::::::::::::::::::::::::::::::::::::: CCDS45 DYMLDAKPKEISEIQRLNYEQNMSDAMAILHKLQTGLDVNVRFTGVRVFEYTPECIVFDL 310 320 330 340 350 360 350 360 370 pF1KE3 --------------IDDIVKAVGNCSYNQLVEKIISCKQSDNSELVSEGGLCS ::::::::::::::::::::::::::::::::::: CCDS45 LDIPLYHGWLVDPQIDDIVKAVGNCSYNQLVEKIISCKQSDNSELVSEGFVAEQFLNNTA 370 380 390 400 410 420 CCDS45 TQLTYHGLCELTSTVQEGELCVFFRNNHFSTMTKYKGQLYLLVTDQGFLTEEKVVWESLH 430 440 450 460 470 480 >>CCDS42046.1 MINDY2 gene_id:54629|Hs109|chr15 (621 aa) initn: 2283 init1: 2258 opt: 2258 Z-score: 1657.0 bits: 316.0 E(33420): 5.8e-86 Smith-Waterman score: 2401; 91.7% identity (91.7% similar) in 409 aa overlap (1-375:1-409) 10 20 30 40 50 60 pF1KE3 MESSPESLQPLEHGVAAGPASGTGSSQEGLQETRLAAGDGPGVWAAETSGGNGLGAAAAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 MESSPESLQPLEHGVAAGPASGTGSSQEGLQETRLAAGDGPGVWAAETSGGNGLGAAAAR 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 RSLPDSASPAGSPEVPGPCSSSAGLDLKDSGLESPAAAEAPLRGQYKVTASPETAVAGVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 RSLPDSASPAGSPEVPGPCSSSAGLDLKDSGLESPAAAEAPLRGQYKVTASPETAVAGVG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 HELGTAGDAGARPDLAGTCQAELTAAGSEEPSSAGGLSSSCSDPSPPGESPSLDSLESFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 HELGTAGDAGARPDLAGTCQAELTAAGSEEPSSAGGLSSSCSDPSPPGESPSLDSLESFS 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 NLHSFPSSCEFNSEEGAENRVPEEEEGAAVLPGAVPLCKEEEGEETAQVLAASKERFPGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 NLHSFPSSCEFNSEEGAENRVPEEEEGAAVLPGAVPLCKEEEGEETAQVLAASKERFPGQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 SVYHIKWIQWKEENTPIITQNENGPCPLLAILNVLLLAWKVKLPPMMEIITAEQLMEYLG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS42 SVYHIKWIQWKEENTPIITQNENGPCPLLAILNVLLLAWKVKLPPMMEIITAEQLMEYLG 250 260 270 280 290 300 310 320 330 340 pF1KE3 DYMLDAKPKEISEIQRLNYEQNMSDAMAILHKLQTGLDVN-------------------- :::::::::::::::::::::::::::::::::::::::: CCDS42 DYMLDAKPKEISEIQRLNYEQNMSDAMAILHKLQTGLDVNVRFTGVRVFEYTPECIVFDL 310 320 330 340 350 360 350 360 370 pF1KE3 --------------IDDIVKAVGNCSYNQLVEKIISCKQSDNSELVSEGGLCS ::::::::::::::::::::::::::::::::::: CCDS42 LDIPLYHGWLVDPQIDDIVKAVGNCSYNQLVEKIISCKQSDNSELVSEGFVAEQFLNNTA 370 380 390 400 410 420 CCDS42 TQLTYHGLCELTSTVQEGELCVFFRNNHFSTMTKYKGQLYLLVTDQGFLTEEKVVWESLH 430 440 450 460 470 480 >>CCDS976.1 MINDY1 gene_id:55793|Hs109|chr1 (469 aa) initn: 619 init1: 429 opt: 461 Z-score: 350.7 bits: 73.9 E(33420): 3.3e-13 Smith-Waterman score: 520; 42.1% identity (63.6% similar) in 228 aa overlap (185-375:58-280) 160 170 180 190 200 210 pF1KE3 GGLSSSCSDPSPPGESPSLDSLESFSNLHSFPSSCEFNSEEGAENRVPEEEEGAAVLP-- .::.: .. :. .:: .: : CCDS97 LAGPDEHPQDTDARDADGEAREREPADQALLPSQCG----DNLESPLPEAS-SAPPGPTL 30 40 50 60 70 80 220 230 240 250 260 270 pF1KE3 GAVPLCKEEEGEETAQVLAAS-KERFPGQSVYHIKWIQWKEENTPIITQNENGPCPLLAI :..: . .. : : : . : : . : .::: :: :.::::::. ::::::::: CCDS97 GTLPEVETIRACSMPQELPQSPRTRQPEPDFYCVKWIPWKGEQTPIITQSTNGPCPLLAI 90 100 110 120 130 140 280 290 300 310 320 330 pF1KE3 LNVLLLAWKVKLPPMMEIITAEQLMEYLGDYMLDAKPKEISEIQRLNYEQNMSDAMAILH .:.:.: :::::::. :.::...:: .::. .:. ::.: :: .::..::..:::..: CCDS97 MNILFLQWKVKLPPQKEVITSDELMAHLGNCLLSIKPQEKSEGLQLNFQQNVDDAMTVLP 150 160 170 180 190 200 340 350 pF1KE3 KLQTGLDVNI----------------------------------DDIVKAVGNCSYNQLV :: ::::::. . :.:::. :::::: CCDS97 KLATGLDVNVRFTGVSDFEYTPECSVFDLLGIPLYHGWLVDPQSPEAVRAVGKLSYNQLV 210 220 230 240 250 260 360 370 pF1KE3 EKIISCKQSDNSELVSEGGLCS :.::.::.:....::.:: CCDS97 ERIITCKHSSDTNLVTEGLIAEQFLETTAAQLTYHGLCELTAAAKEGELSVFFRNNHFST 270 280 290 300 310 320 >>CCDS53361.1 MINDY1 gene_id:55793|Hs109|chr1 (517 aa) initn: 619 init1: 429 opt: 461 Z-score: 350.1 bits: 73.9 E(33420): 3.6e-13 Smith-Waterman score: 520; 42.1% identity (63.6% similar) in 228 aa overlap (185-375:106-328) 160 170 180 190 200 210 pF1KE3 GGLSSSCSDPSPPGESPSLDSLESFSNLHSFPSSCEFNSEEGAENRVPEEEEGAAVLP-- .::.: .. :. .:: .: : CCDS53 LAGPDEHPQDTDARDADGEAREREPADQALLPSQCG----DNLESPLPEAS-SAPPGPTL 80 90 100 110 120 130 220 230 240 250 260 270 pF1KE3 GAVPLCKEEEGEETAQVLAAS-KERFPGQSVYHIKWIQWKEENTPIITQNENGPCPLLAI :..: . .. : : : . : : . : .::: :: :.::::::. ::::::::: CCDS53 GTLPEVETIRACSMPQELPQSPRTRQPEPDFYCVKWIPWKGEQTPIITQSTNGPCPLLAI 140 150 160 170 180 190 280 290 300 310 320 330 pF1KE3 LNVLLLAWKVKLPPMMEIITAEQLMEYLGDYMLDAKPKEISEIQRLNYEQNMSDAMAILH .:.:.: :::::::. :.::...:: .::. .:. ::.: :: .::..::..:::..: CCDS53 MNILFLQWKVKLPPQKEVITSDELMAHLGNCLLSIKPQEKSEGLQLNFQQNVDDAMTVLP 200 210 220 230 240 250 340 350 pF1KE3 KLQTGLDVNI----------------------------------DDIVKAVGNCSYNQLV :: ::::::. . :.:::. :::::: CCDS53 KLATGLDVNVRFTGVSDFEYTPECSVFDLLGIPLYHGWLVDPQSPEAVRAVGKLSYNQLV 260 270 280 290 300 310 360 370 pF1KE3 EKIISCKQSDNSELVSEGGLCS :.::.::.:....::.:: CCDS53 ERIITCKHSSDTNLVTEGLIAEQFLETTAAQLTYHGLCELTAAAKEGELSVFFRNNHFST 320 330 340 350 360 370 >>CCDS55635.1 MINDY1 gene_id:55793|Hs109|chr1 (374 aa) initn: 597 init1: 429 opt: 448 Z-score: 342.7 bits: 72.1 E(33420): 9.3e-13 Smith-Waterman score: 507; 47.2% identity (68.2% similar) in 176 aa overlap (234-375:10-185) 210 220 230 240 250 260 pF1KE3 EEEGAAVLPGAVPLCKEEEGEETAQVLAASKERFPGQSVYHIKWIQWKEENTPIITQNEN . : : . : .::: :: :.::::::. : CCDS55 MPQELPQSPRTRQPEPDFYCVKWIPWKGEQTPIITQSTN 10 20 30 270 280 290 300 310 320 pF1KE3 GPCPLLAILNVLLLAWKVKLPPMMEIITAEQLMEYLGDYMLDAKPKEISEIQRLNYEQNM ::::::::.:.:.: :::::::. :.::...:: .::. .:. ::.: :: .::..::. CCDS55 GPCPLLAIMNILFLQWKVKLPPQKEVITSDELMAHLGNCLLSIKPQEKSEGLQLNFQQNV 40 50 60 70 80 90 330 340 pF1KE3 SDAMAILHKLQTGLDVNI----------------------------------DDIVKAVG .:::..: :: ::::::. . :.::: CCDS55 DDAMTVLPKLATGLDVNVRFTGVSDFEYTPECSVFDLLGIPLYHGWLVDPQSPEAVRAVG 100 110 120 130 140 150 350 360 370 pF1KE3 NCSYNQLVEKIISCKQSDNSELVSEGGLCS . :::::::.::.::.:....::.:: CCDS55 KLSYNQLVERIITCKHSSDTNLVTEGLIAEQFLETTAAQLTYHGLCELTAAAKEGELSVF 160 170 180 190 200 210 379 residues in 1 query sequences 18921897 residues in 33420 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Wed Aug 4 20:37:17 2021 done: Wed Aug 4 20:37:17 2021 Total Scan time: 1.540 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]