FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3132, 336 aa 1>>>pF1KE3132 336 - 336 aa - 336 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.2590+/-0.000964; mu= 8.5702+/- 0.058 mean_var=133.1378+/-26.330, 0's: 0 Z-trim(109.6): 9 B-trim: 272 in 1/52 Lambda= 0.111154 statistics sampled from 11030 (11035) to 11030 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.339), width: 16 Scan time: 2.610 The best scores are: opt bits E(32554) CCDS47121.1 AIMP1 gene_id:9255|Hs108|chr4 ( 336) 2161 357.6 8.4e-99 CCDS3674.1 AIMP1 gene_id:9255|Hs108|chr4 ( 312) 2007 332.9 2.2e-91 CCDS368.1 YARS gene_id:8565|Hs108|chr1 ( 528) 597 106.9 3.8e-23 >>CCDS47121.1 AIMP1 gene_id:9255|Hs108|chr4 (336 aa) initn: 2161 init1: 2161 opt: 2161 Z-score: 1887.3 bits: 357.6 E(32554): 8.4e-99 Smith-Waterman score: 2161; 100.0% identity (100.0% similar) in 336 aa overlap (1-336:1-336) 10 20 30 40 50 60 pF1KE3 MLPAVAVSEPVVLRFMIFCRLLAKMANNDAVLKRLEQKGAEADQIIEYLKQQVSLLKEKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 MLPAVAVSEPVVLRFMIFCRLLAKMANNDAVLKRLEQKGAEADQIIEYLKQQVSLLKEKA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 ILQATLREEKKLRVENAKLKKEIEELKQELIQAEIQNGVKQIPFPSGTPLHANSMVSENV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 ILQATLREEKKLRVENAKLKKEIEELKQELIQAEIQNGVKQIPFPSGTPLHANSMVSENV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 IQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIAGSADSKPIDVSRLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 IQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIAGSADSKPIDVSRLD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 LRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVILLCNLKPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 LRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVILLCNLKPA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 KMRGVLSQAMVMCASSPEKIEILAPPNGSVPGDRITFDAFPGEPDKELNPKKKIWEQIQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS47 KMRGVLSQAMVMCASSPEKIEILAPPNGSVPGDRITFDAFPGEPDKELNPKKKIWEQIQP 250 260 270 280 290 300 310 320 330 pF1KE3 DLHTNDECVATYKGVPFEVKGKGVCRAQTMSNSGIK :::::::::::::::::::::::::::::::::::: CCDS47 DLHTNDECVATYKGVPFEVKGKGVCRAQTMSNSGIK 310 320 330 >>CCDS3674.1 AIMP1 gene_id:9255|Hs108|chr4 (312 aa) initn: 2007 init1: 2007 opt: 2007 Z-score: 1754.3 bits: 332.9 E(32554): 2.2e-91 Smith-Waterman score: 2007; 100.0% identity (100.0% similar) in 312 aa overlap (25-336:1-312) 10 20 30 40 50 60 pF1KE3 MLPAVAVSEPVVLRFMIFCRLLAKMANNDAVLKRLEQKGAEADQIIEYLKQQVSLLKEKA :::::::::::::::::::::::::::::::::::: CCDS36 MANNDAVLKRLEQKGAEADQIIEYLKQQVSLLKEKA 10 20 30 70 80 90 100 110 120 pF1KE3 ILQATLREEKKLRVENAKLKKEIEELKQELIQAEIQNGVKQIPFPSGTPLHANSMVSENV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 ILQATLREEKKLRVENAKLKKEIEELKQELIQAEIQNGVKQIPFPSGTPLHANSMVSENV 40 50 60 70 80 90 130 140 150 160 170 180 pF1KE3 IQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIAGSADSKPIDVSRLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 IQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIAGSADSKPIDVSRLD 100 110 120 130 140 150 190 200 210 220 230 240 pF1KE3 LRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVILLCNLKPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 LRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVILLCNLKPA 160 170 180 190 200 210 250 260 270 280 290 300 pF1KE3 KMRGVLSQAMVMCASSPEKIEILAPPNGSVPGDRITFDAFPGEPDKELNPKKKIWEQIQP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 KMRGVLSQAMVMCASSPEKIEILAPPNGSVPGDRITFDAFPGEPDKELNPKKKIWEQIQP 220 230 240 250 260 270 310 320 330 pF1KE3 DLHTNDECVATYKGVPFEVKGKGVCRAQTMSNSGIK :::::::::::::::::::::::::::::::::::: CCDS36 DLHTNDECVATYKGVPFEVKGKGVCRAQTMSNSGIK 280 290 300 310 >>CCDS368.1 YARS gene_id:8565|Hs108|chr1 (528 aa) initn: 564 init1: 391 opt: 597 Z-score: 529.0 bits: 106.9 E(32554): 3.8e-23 Smith-Waterman score: 597; 48.7% identity (76.4% similar) in 195 aa overlap (147-332:332-526) 120 130 140 150 160 170 pF1KE3 SENVIQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIA-GSA-DSKPI : .:. . . .::. .: : : .:.: CCDS36 EVVHPGDLKNSVEVALNKLLDPIREKFNTPALKKLASAAYPDPSKQKPMAKGPAKNSEPE 310 320 330 340 350 360 180 190 200 210 220 230 pF1KE3 DV--SRLDLRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVI .: ::::.:.: :::..:::::::::::..:::: :::::::::. :: :..:.:.:. CCDS36 EVIPSRLDIRVGKIITVEKHPDADSLYVEKIDVGEAEPRTVVSGLVQFVPKEELQDRLVV 370 380 390 400 410 420 240 250 260 270 280 pF1KE3 LLCNLKPAKMRGVLSQAMVMCASSP---EKIEILAPPNGSVPGDRITFDAFP-GEPDKEL .:::::: ::::: ::.:..::: ...: : :: ::.::... .. :.::.:: CCDS36 VLCNLKPQKMRGVESQGMLLCASIEGINRQVEPLDPPAGSAPGEHVFVKGYEKGQPDEEL 430 440 450 460 470 480 290 300 310 320 330 pF1KE3 NPKKKIWEQIQPDLHTNDECVATYKGVPFEVK-GKGVCRAQTMSNSGIK .::::..:..: :.. ..::.: .: . : .: :. :.. .: CCDS36 KPKKKVFEKLQADFKISEECIAQWKQTNFMTKLGSISCKSLKGGNIS 490 500 510 520 336 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 17:19:54 2016 done: Sun Nov 6 17:19:55 2016 Total Scan time: 2.610 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]