FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8296, 422 aa
  1>>>pF1KB8296 422 - 422 aa - 422 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.7228+/-0.000856; mu= 7.2115+/- 0.051
 mean_var=151.1530+/-29.856, 0's: 0 Z-trim(112.0): 20 B-trim: 36 in 1/51
 Lambda= 0.104320
 statistics sampled from 12849 (12866) to 12849 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.746), E-opt: 0.2 (0.395), width: 16
 Scan time: 2.680

The best scores are:                                opt  bits E(32554)
CCDS6969.1 REXO4 gene_id:57109|Hs108|chr9   ( 422) 2796 432.3 4.2e-121
CCDS65179.1 REXO4 gene_id:57109|Hs108|chr9  ( 250)  782 129.0 4.9e-30
CCDS10344.1 AEN gene_id:64782|Hs108|chr15   ( 325)  515  88.9 7.5e-18
CCDS1153.1 ISG20L2 gene_id:81875|Hs108|chr1 ( 353)  458  80.4 3.1e-15
CCDS10345.1 ISG20 gene_id:3669|Hs108|chr15  ( 181)  389  69.8 2.4e-12

>>CCDS6969.1 REXO4 gene_id:57109|Hs108|chr9 (422 aa)
 initn: 2796 init1: 2796 opt: 2796 Z-score: 2287.6 bits: 432.3 E(32554): 4.2e-121
Smith-Waterman score: 2796; 100.0% identity (100.0% similar) in 422 aa overlap (1-422:1-422)

               10        20        30        40        50        60
pF1KB8 MGKAKVPASKRAPSSPVAKPGPVKTLTRKKNKKKKRFWKSKAREVSKKPASGPGAVVRPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 MGKAKVPASKRAPSSPVAKPGPVKTLTRKKNKKKKRFWKSKAREVSKKPASGPGAVVRPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 KAPEDFSQNWKALQEWLLKQKSQAPEKPLVISQMGSKKKPKIIQQNKKETSPQVKGEEMP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 KAPEDFSQNWKALQEWLLKQKSQAPEKPLVISQMGSKKKPKIIQQNKKETSPQVKGEEMP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 AGKDQEASRGSVPSGSKMDRRAPVPRTKASGTEHNKKGTKERTNGDIVPERGDIEHKKRK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 AGKDQEASRGSVPSGSKMDRRAPVPRTKASGTEHNKKGTKERTNGDIVPERGDIEHKKRK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 AKEAAPAPPTEEDIWFDDVDPADIEAAIGPEAAKIARKQLGQSEGSVSLSLVKEQAFGGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 AKEAAPAPPTEEDIWFDDVDPADIEAAIGPEAAKIARKQLGQSEGSVSLSLVKEQAFGGL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 TRALALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKPTEPVTDYRTAVSGIRPEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 TRALALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKPTEPVTDYRTAVSGIRPEN
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 LKQGEELEVVQKEVAEMLKGRILVGHALHNDLKVLFLDHPKKKIRDTQKYKPFKSQVKSG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 LKQGEELEVVQKEVAEMLKGRILVGHALHNDLKVLFLDHPKKKIRDTQKYKPFKSQVKSG
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB8 RPSLRLLSEKILGLQVQQAEHCSIQDAQAAMRLYVMVKKEWESMARDRRPLLTAPDHCSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS69 RPSLRLLSEKILGLQVQQAEHCSIQDAQAAMRLYVMVKKEWESMARDRRPLLTAPDHCSD
              370       380       390       400       410       420

pF1KB8 DA
       ::
CCDS69 DA

>>CCDS65179.1 REXO4 gene_id:57109|Hs108|chr9 (250 aa)
 initn: 1277 init1: 778 opt: 782 Z-score: 652.7 bits: 129.0 E(32554): 4.9e-30
Smith-Waterman score: 990; 50.1% identity (53.9% similar) in 425 aa overlap (1-422:1-250)

               10        20        30        40        50        60
pF1KB8 MGKAKVPASKRAPSSPVAKPGPVKTLTRKKNKKKKRFWKSKAREVSKKPASGPGAVVRPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 MGKAKVPASKRAPSSPVAKPGPVKTLTRKKNKKKKRFWKSKAREVSKKPASGPGAVVRPP
               10        20        30        40        50        60

               70           80        90       100       110
pF1KB8 KAPEDFSQNWKALQE---WLLKQKSQAPEKPLVISQMGSKKKPKIIQQNKKETSPQVKGE
       :::::::::::::::   :   ..  :: .:  . :   .:.:.             ::.
CCDS65 KAPEDFSQNWKALQEVPRWTGGRQYLAP-RP--VEQSTIRKEPR-------------KGQ
               70        80         90         100

       120       130       140       150       160       170
pF1KB8 EMPAGKDQEASRGSVPSGSKMDRRAPVPRTKASGTEHNKKGTKERTNGDIVPERGDIEHK
        .   ... .:  :. :: :. :: : :.
CCDS65 MVILFQNEGTS--SIRSG-KL-RRQPQPH-------------------------------
          110         120

       180       190       200       210       220       230
pF1KB8 KRKAKEAAPAPPTEEDIWFDDVDPADIEAAIGPEAAKIARKQLGQSEGSVSLSLVKEQAF
                 :: ::
CCDS65 ----------PPREE---------------------------------------------
               130

       240       250       260       270       280       290
pF1KB8 GGLTRALALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKPTEPVTDYRTAVSGIR

CCDS65 ------------------------------------------------------------

       300       310       320       330       340       350
pF1KB8 PENLKQGEELEVVQKEVAEMLKGRILVGHALHNDLKVLFLDHPKKKIRDTQKYKPFKSQV
                :::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 ---------LEVVQKEVAEMLKGRILVGHALHNDLKVLFLDHPKKKIRDTQKYKPFKSQV
                   140       150       160       170       180

       360       370       380       390       400       410
pF1KB8 KSGRPSLRLLSEKILGLQVQQAEHCSIQDAQAAMRLYVMVKKEWESMARDRRPLLTAPDH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS65 KSGRPSLRLLSEKILGLQVQQAEHCSIQDAQAAMRLYVMVKKEWESMARDRRPLLTAPDH
         190       200       210       220       230       240

       420
pF1KB8 CSDDA
       :::::
CCDS65 CSDDA
         250

>>CCDS10344.1 AEN gene_id:64782|Hs108|chr15 (325 aa)
 initn: 498 init1: 385 opt: 515 Z-score: 433.9 bits: 88.9 E(32554): 7.5e-18
Smith-Waterman score: 515; 39.1% identity (67.8% similar) in 230 aa overlap (201-422:65-292)

              180       190       200       210       220       230
pF1KB8 RGDIEHKKRKAKEAAPAPPTEEDIWFDDVDPADIEAAIGPEAAKIARKQLGQSEGSVSLS
                                     :. . :: . :::. ... :  . ::.  :
CCDS10 RQHQRFMARKALLQEQGLLSMPPEPGSSPLPTPFGAATATEAASSGKQCLRAGSGSAPCS
           40        50        60        70        80        90

                240        250       260       270       280
pF1KB8 L--VKEQAFGGL-TRALALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKPTEPVT
          .  .: : : .. .:.::::::.::.:. :  :: :::. .:. .::::..:  :..
CCDS10 RRPAPGKASGPLPSKCVAIDCEMVGTGPRGRVSELARCSIVSYHGNVLYDKYIRPEMPIA
          100       110       120       130       140       150

       290       300       310       320       330       340
pF1KB8 DYRTAVSGIRPENLKQGEELEVVQKEVAEMLKGRILVGHALHNDLKVLFLDHPKKKIRDT
       ::::  :::  ......  ..:.:::. ..:::...::::::::...:   ::... :::
CCDS10 DYRTRWSGITRQHMRKAVPFQVAQKEILKLLKGKVVVGHALHNDFQALKYVHPRSQTRDT
          160       170       180       190       200       210

       350          360       370         380       390       400
pF1KB8 QKYKPFKSQV---KSGRPSLRLLSEKILG--LQVQQAEHCSIQDAQAAMRLYVMVKKEWE
            : :.      .: ::. :. ..:   .:: :  : :..:: .::.:: .:. .::
CCDS10 TYVPNFLSEPGLHTRARVSLKDLALQLLHKKIQVGQHGHSSVEDATTAMELYRLVEVQWE
          220       230       240       250       260       270

            410       420
pF1KB8 SMARDRRPLLTAPDHCSDDA
       .  .. : : : :.    :.
CCDS10 Q--QEARSLWTCPEDREPDSSTDMEQYMEDQYWPDDLAHGSRGGAREAQDRRN
            280       290       300       310       320

>>CCDS1153.1 ISG20L2 gene_id:81875|Hs108|chr1 (353 aa)
 initn: 478 init1: 338 opt: 458 Z-score: 387.0 bits: 80.4 E(32554): 3.1e-15
Smith-Waterman score: 481; 31.8% identity (61.1% similar) in 321 aa overlap (98-407:50-349)

        70        80        90       100        110        120
pF1KB8 QNWKALQEWLLKQKSQAPEKPLVISQMGSKKKPKIIQQ-NKKETSPQVKGE-EMPAGKDQ
                                     : ::. .. .::  .: : :  . :.   .
CCDS11 EGNAKHRNFVKKRRLLERRGFLSKKNQPPSKAPKLHSEPSKKGETPTVDGTWKTPSFPKK
      20        30        40        50        60        70

         130       140        150       160       170        180
pF1KB8 EASRGSVPSGSKMDRRAPVPR-TKASGTEHNKKGTKERTNGDIVPERGDIE-HKKRKAKE
       ... .:  ::. .:..: :   : : . . .. ..:    :..      :. :  :. :.
CCDS11 KTAASSNGSGQPLDKKAAVSWLTPAPSKKADSVAAKVDLLGEFQSALPKINSHPTRSQKK
      80        90       100       110       120       130

           190       200       210       220       230       240
pF1KB8 AAPAPPTEEDIWFDDVDPADIEAAIGPEAAKIARKQLGQSEGSVSLSLVKEQAFGGLTRA
       ..    ....                :.      .  ..::.. : .  :      : :
CCDS11 SSQKKSSKKN---------------HPQKNAPQNSTQAHSENKCSGASQK------LPRK
     140                      150       160       170

            250       260       270       280       290       300
pF1KB8 L-ALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKPTEPVTDYRTAVSGIRPENLK
       . :.::::::.::::. :  :: ::::  :  .::.:. :   ..::::  :::: ...
CCDS11 MVAIDCEMVGTGPKGHVSSLARCSIVNYNGDVLYDEYILPPCHIVDYRTRWSGIRKQHMV
      180       190       200       210       220       230

            310       320       330       340       350
pF1KB8 QGEELEVVQKEVAEMLKGRILVGHALHNDLKVLFLDHPKKKIRDTQKYKPFKSQV---KS
       ..  ..... .. ..: :.:.::::.:::.:.:   :::.  :::..  :.. ..   ..
CCDS11 NATPFKIARGQILKILTGKIVVGHAIHNDFKALQYFHPKSLTRDTSHIPPLNRKADCPEN
      240       250       260       270       280       290

     360       370         380       390       400        410
pF1KB8 GRPSLRLLSEKILG--LQVQQAEHCSIQDAQAAMRLYVMVKKEWES-MARDRRPLLTAPD
       .  ::. :..:.:.  .:: .. : :..::::.:.:: .:. :::  .::.
CCDS11 ATMSLKHLTKKLLNRDIQVGKSGHSSVEDAQATMELYKLVEVEWEEHLARNPPTD
      300       310       320       330       340       350

        420
pF1KB8 HCSDDA

>>CCDS10345.1 ISG20 gene_id:3669|Hs108|chr15 (181 aa)
 initn: 297 init1: 240 opt: 389 Z-score: 335.1 bits: 69.8 E(32554): 2.4e-12
Smith-Waterman score: 389; 37.8% identity (67.6% similar) in 185 aa overlap (239-416:3-181)

      210       220       230       240       250       260
pF1KB8 GPEAAKIARKQLGQSEGSVSLSLVKEQAFGGLTRALALDCEMVGVGPKGEESMAARVSIV
                                     :  ...:.::::::.::. .::  :: :.:
CCDS10                             MAGSREVVAMDCEMVGLGPH-RESGLARCSLV
                                           10        20         30

      270       280       290       300       310       320
pF1KB8 NQYGKCVYDKYVKPTEPVTDYRTAVSGIRPENLKQGEELEVVQKEVAEMLKGRILVGHAL
       : .:  .:::...:   .::::: :::. :...  .  . :.. :. ..:::...::: :
CCDS10 NVHGAVLYDKFIRPEGEITDYRTRVSGVTPQHMVGATPFAVARLEILQLLKGKLVVGHDL
              40        50        60        70        80        90

      330       340       350       360          370       380
pF1KB8 HNDLKVLFLDHPKKKIRDTQKYKPFKSQVKSG---RPSLRLLSEKILGLQVQQAE--HCS
       ..:...:  :     : ::.  . .  ..:     : :::.:::..:  ..:..   : :
CCDS10 KHDFQALKEDMSGYTIYDTSTDRLLWREAKLDHCRRVSLRVLSERLLHKSIQNSLLGHSS
             100       110       120       130       140       150

           390       400         410       420
pF1KB8 IQDAQAAMRLYVMVKKEWESMARDRR--PLLTAPDHCSDDA
       ..::.:.:.:: . ..      : ::  : :.. :
CCDS10 VEDARATMELYQISQR-----IRARRGLPRLAVSD
             160            170       180


422 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Mon Nov 7 03:27:35 2016 done: Mon Nov 7 03:27:35 2016
 Total Scan time: 2.680 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]