FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE4504, 528 aa 1>>>pF1KE4504 528 - 528 aa - 528 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.0757+/-0.000913; mu= 14.7644+/- 0.055 mean_var=78.1726+/-15.662, 0's: 0 Z-trim(106.7): 14 B-trim: 105 in 1/50 Lambda= 0.145060 statistics sampled from 9154 (9161) to 9154 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.656), E-opt: 0.2 (0.281), width: 16 Scan time: 3.460 The best scores are: opt bits E(32554) CCDS368.1 YARS gene_id:8565|Hs108|chr1 ( 528) 3456 733.0 2.1e-211 CCDS3674.1 AIMP1 gene_id:9255|Hs108|chr4 ( 312) 597 134.5 1.7e-31 CCDS47121.1 AIMP1 gene_id:9255|Hs108|chr4 ( 336) 597 134.6 1.8e-31 >>CCDS368.1 YARS gene_id:8565|Hs108|chr1 (528 aa) initn: 3456 init1: 3456 opt: 3456 Z-score: 3908.9 bits: 733.0 E(32554): 2.1e-211 Smith-Waterman score: 3456; 100.0% identity (100.0% similar) in 528 aa overlap (1-528:1-528) 10 20 30 40 50 60 pF1KE4 MGDAPSPEEKLHLITRNLQEVLGEEKLKEILKERELKIYWGTATTGKPHVAYFVPMSKIA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 MGDAPSPEEKLHLITRNLQEVLGEEKLKEILKERELKIYWGTATTGKPHVAYFVPMSKIA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE4 DFLKAGCEVTILFADLHAYLDNMKAPWELLELRVSYYENVIKAMLESIGVPLEKLKFIKG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 DFLKAGCEVTILFADLHAYLDNMKAPWELLELRVSYYENVIKAMLESIGVPLEKLKFIKG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE4 TDYQLSKEYTLDVYRLSSVVTQHDSKKAGAEVVKQVEHPLLSGLLYPGLQALDEEYLKVD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 TDYQLSKEYTLDVYRLSSVVTQHDSKKAGAEVVKQVEHPLLSGLLYPGLQALDEEYLKVD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE4 AQFGGIDQRKIFTFAEKYLPALGYSKRVHLMNPMVPGLTGSKMSSSEEESKIDLLDRKED :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 AQFGGIDQRKIFTFAEKYLPALGYSKRVHLMNPMVPGLTGSKMSSSEEESKIDLLDRKED 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE4 VKKKLKKAFCEPGNVENNGVLSFIKHVLFPLKSEFVILRDEKWGGNKTYTAYVDLEKDFA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 VKKKLKKAFCEPGNVENNGVLSFIKHVLFPLKSEFVILRDEKWGGNKTYTAYVDLEKDFA 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE4 AEVVHPGDLKNSVEVALNKLLDPIREKFNTPALKKLASAAYPDPSKQKPMAKGPAKNSEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 AEVVHPGDLKNSVEVALNKLLDPIREKFNTPALKKLASAAYPDPSKQKPMAKGPAKNSEP 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE4 EEVIPSRLDIRVGKIITVEKHPDADSLYVEKIDVGEAEPRTVVSGLVQFVPKEELQDRLV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 EEVIPSRLDIRVGKIITVEKHPDADSLYVEKIDVGEAEPRTVVSGLVQFVPKEELQDRLV 370 380 390 400 410 420 430 440 450 460 470 480 pF1KE4 VVLCNLKPQKMRGVESQGMLLCASIEGINRQVEPLDPPAGSAPGEHVFVKGYEKGQPDEE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 VVLCNLKPQKMRGVESQGMLLCASIEGINRQVEPLDPPAGSAPGEHVFVKGYEKGQPDEE 430 440 450 460 470 480 490 500 510 520 pF1KE4 LKPKKKVFEKLQADFKISEECIAQWKQTNFMTKLGSISCKSLKGGNIS :::::::::::::::::::::::::::::::::::::::::::::::: CCDS36 LKPKKKVFEKLQADFKISEECIAQWKQTNFMTKLGSISCKSLKGGNIS 490 500 510 520 >>CCDS3674.1 AIMP1 gene_id:9255|Hs108|chr4 (312 aa) initn: 564 init1: 391 opt: 597 Z-score: 678.9 bits: 134.5 E(32554): 1.7e-31 Smith-Waterman score: 597; 48.7% identity (76.4% similar) in 195 aa overlap (332-526:123-308) 310 320 330 340 350 360 pF1KE4 EVVHPGDLKNSVEVALNKLLDPIREKFNTPALKKLASAAYPDPSKQKPMAKGPAKNSEPE : .:. . . .::. .: : : .:.: CCDS36 SENVIQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIA-GSA-DSKPI 100 110 120 130 140 150 370 380 390 400 410 420 pF1KE4 EVIPSRLDIRVGKIITVEKHPDADSLYVEKIDVGEAEPRTVVSGLVQFVPKEELQDRLVV .: ::::.:.: :::..:::::::::::..:::: :::::::::. :: :..:.:.:. CCDS36 DV--SRLDLRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVI 160 170 180 190 200 430 440 450 460 470 480 pF1KE4 VLCNLKPQKMRGVESQGMLLCASIEGINRQVEPLDPPAGSAPGEHVFVKGYEKGQPDEEL .:::::: ::::: ::.:..::: ...: : :: ::.::... .. :.::.:: CCDS36 LLCNLKPAKMRGVLSQAMVMCASSP---EKIEILAPPNGSVPGDRITFDAFP-GEPDKEL 210 220 230 240 250 260 490 500 510 520 pF1KE4 KPKKKVFEKLQADFKISEECIAQWKQTNFMTKLGSISCKSLKGGNIS .::::..:..: :.. ..::.: .: . : .: :. :.. .: CCDS36 NPKKKIWEQIQPDLHTNDECVATYKGVPFEVK-GKGVCRAQTMSNSGIK 270 280 290 300 310 >>CCDS47121.1 AIMP1 gene_id:9255|Hs108|chr4 (336 aa) initn: 564 init1: 391 opt: 597 Z-score: 678.4 bits: 134.6 E(32554): 1.8e-31 Smith-Waterman score: 597; 48.7% identity (76.4% similar) in 195 aa overlap (332-526:147-332) 310 320 330 340 350 360 pF1KE4 EVVHPGDLKNSVEVALNKLLDPIREKFNTPALKKLASAAYPDPSKQKPMAKGPAKNSEPE : .:. . . .::. .: : : .:.: CCDS47 SENVIQSTAVTTVSSGTKEQIKGGTGDEKKAKEKIEKKGEKKEKKQQSIA-GSA-DSKPI 120 130 140 150 160 170 370 380 390 400 410 420 pF1KE4 EVIPSRLDIRVGKIITVEKHPDADSLYVEKIDVGEAEPRTVVSGLVQFVPKEELQDRLVV .: ::::.:.: :::..:::::::::::..:::: :::::::::. :: :..:.:.:. CCDS47 DV--SRLDLRIGCIITARKHPDADSLYVEEVDVGEIAPRTVVSGLVNHVPLEQMQNRMVI 180 190 200 210 220 230 430 440 450 460 470 480 pF1KE4 VLCNLKPQKMRGVESQGMLLCASIEGINRQVEPLDPPAGSAPGEHVFVKGYEKGQPDEEL .:::::: ::::: ::.:..::: ...: : :: ::.::... .. :.::.:: CCDS47 LLCNLKPAKMRGVLSQAMVMCASSP---EKIEILAPPNGSVPGDRITFDAFP-GEPDKEL 240 250 260 270 280 490 500 510 520 pF1KE4 KPKKKVFEKLQADFKISEECIAQWKQTNFMTKLGSISCKSLKGGNIS .::::..:..: :.. ..::.: .: . : .: :. :.. .: CCDS47 NPKKKIWEQIQPDLHTNDECVATYKGVPFEVK-GKGVCRAQTMSNSGIK 290 300 310 320 330 528 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 00:24:00 2016 done: Sun Nov 6 00:24:01 2016 Total Scan time: 3.460 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]