FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9976, 707 aa
  1>>>pF1KB9976 707 - 707 aa - 707 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 17.2750+/-0.00136; mu= -28.0967+/- 0.083
 mean_var=968.6740+/-198.181, 0's: 0 Z-trim(119.2): 129 B-trim: 57 in 2/53
 Lambda= 0.041208
 statistics sampled from 20185 (20311) to 20185 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.825), E-opt: 0.2 (0.624), width: 16
 Scan time: 5.330

The best scores are:                                       opt bits E(32554)
CCDS388.1 SFPQ gene_id:6421|Hs108|chr1             ( 707) 5111 319.2 1.3e-86
CCDS41870.1 PSPC1 gene_id:55269|Hs108|chr13        ( 523) 1783 121.3 3.8e-27
CCDS14410.1 NONO gene_id:4841|Hs108|chrX           ( 471) 1716 117.2 5.6e-26
CCDS55445.1 NONO gene_id:4841|Hs108|chrX           ( 382) 1463 102.1 1.6e-21

>>CCDS388.1 SFPQ gene_id:6421|Hs108|chr1                (707 aa)
 initn: 5111 init1: 5111 opt: 5111 Z-score: 1668.5 bits: 319.2 E(32554): 1.3e-86
Smith-Waterman score: 5111; 100.0% identity (100.0% similar) in 707 aa overlap (1-707:1-707)

               10        20        30        40        50        60
pF1KB9 MSRDRFRSRGGGGGGFHRRGGGGGRGGLHDFRSPPPGMGLNQNRGPMGPGPGQSGPKPPI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MSRDRFRSRGGGGGGFHRRGGGGGRGGLHDFRSPPPGMGLNQNRGPMGPGPGQSGPKPPI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 PPPPPHQQQQQPPPQQPPPQQPPPHQPPPHPQPHQQQQPPPPPQDSSKPVVAQGPGPAPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 PPPPPHQQQQQPPPQQPPPQQPPPHQPPPHPQPHQQQQPPPPPQDSSKPVVAQGPGPAPG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 VGSAPPASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTPPSSGVPTTPPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 VGSAPPASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTPPSSGVPTTPPQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 AGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPGGHPKPPHRGGGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPGGHPKPPHRGGGE
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 PRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRPGEKTYTQRCRLF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 PRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRPGEKTYTQRCRLF
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 VGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIAKAELDDTPMRGR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 VGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIAKAELDDTPMRGR
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB9 QLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRGRSTGKGIVEFAS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 QLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRGRSTGKGIVEFAS
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB9 KPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNPMYQKERETPPRF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 KPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNPMYQKERETPPRF
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB9 AQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYHEHQANLLRQDLM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYHEHQANLLRQDLM
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB9 RRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQMRRQREESYSRM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 RRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQMRRQREESYSRM
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KB9 GYMDPRERDMRMGGGGAMNMGDPYGSGGQKFPPLGGGGGIGYEANPGVPPATMSGSMMGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 GYMDPRERDMRMGGGGAMNMGDPYGSGGQKFPPLGGGGGIGYEANPGVPPATMSGSMMGS
              610       620       630       640       650       660

              670       680       690       700
pF1KB9 DMRTERFGQGGAGPVGGQGPRGMGPGTPAGYGRGREEYEGPNKKPRF
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS38
DMRTERFGQGGAGPVGGQGPRGMGPGTPAGYGRGREEYEGPNKKPRF 670 680 690 700 >>CCDS41870.1 PSPC1 gene_id:55269|Hs108|chr13 (523 aa) initn: 1710 init1: 1586 opt: 1783 Z-score: 600.8 bits: 121.3 E(32554): 3.8e-27 Smith-Waterman score: 1836; 60.5% identity (81.0% similar) in 484 aa overlap (259-707:45-523) 230 240 250 260 270 280 pF1KB9 GHPKPPHRGGGEPRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRP : : .:. :.. .. :: ... . .: CCDS41 NPARLRALESAVGESEPAAAAAMALALAGEPAPPAPAP-PEDHPDEEMGFTIDIKSFLKP 20 30 40 50 60 70 290 300 310 320 330 340 pF1KB9 GEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIA :::::::::::::::::.::::..::::: .::::.:::::. .:::::.::::.::::: CCDS41 GEKTYTQRCRLFVGNLPTDITEEDFKRLFERYGEPSEVFINRDRGFGFIRLESRTLAEIA 80 90 100 110 120 130 350 360 370 380 390 400 pF1KB9 KAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDRG ::::: : ...: ::.:::::.:::.:.:::: :::::::.:::::::.:.:::.::::: CCDS41 KAELDGTILKSRPLRIRFATHGAALTVKNLSPVVSNELLEQAFSQFGPVEKAVVVVDDRG 140 150 160 170 180 190 410 420 430 440 450 460 pF1KB9 RSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQKNP :.::::.::::.:: ::::.:::..:.::::::::::::::.::.:::::::::: ::. CCDS41 RATGKGFVEFAAKPPARKALERCGDGAFLLTTTPRPVIVEPMEQFDDEDGLPEKLMQKTQ 200 210 220 230 240 250 470 480 490 500 510 520 pF1KB9 MYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEMEDAYH .:.:::: :::::: ::::.::..:::.::::::::::::..:...::.:::.::: : : CCDS41 QYHKEREQPPRFAQPGTFEFEYASRWKALDEMEKQQREQVDRNIREAKEKLEAEMEAARH 260 270 280 290 300 310 530 540 550 560 570 580 pF1KB9 EHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQREMEEQ ::: :.:::::::::::::.:::.:::.::::..:::.:::.::::::: ::.::.:: CCDS41 EHQLMLMRQDLMRRQEELRRLEELRNQELQKRKQIQLRHEEEHRRREEEM-IRHREQEE- 320 330 340 350 360 370 590 600 610 620 630 640 pF1KB9 MRRQREESYSRMGYMDPRERDMRMGG-G--GAMNMGD---PYGSGGQKFPPLGGGGGIGY .::: .:.. . .::. ::..:::: : ::.:::: : .:.: ::. : . . 
CCDS41 LRRQ-QEGF-KPNYMENREQEMRMGDMGPRGAINMGDAFSPAPAGNQGPPPMMGMNMNNR 380 390 400 410 420 650 660 670 pF1KB9 EANPGVP--PATMSG----SMMGSDM-------RTERFGQGG----AGPVGG-------Q . :: : :. : . ::. : ...:: :: ..:.:. : CCDS41 ATIPGPPMGPGPAMGPEGAANMGTPMMPDNGAVHNDRFPQGPPSQMGSPMGSRTGSETPQ 430 440 450 460 470 480 680 690 700 pF1KB9 GP-RGMGP--GTPAGYGRGRE--EYEGPNKKPRF .: :.:: : :.:.::: . ..:::::. :. CCDS41 APMSGVGPVSGGPGGFGRGSQGGNFEGPNKRRRY 490 500 510 520 >>CCDS14410.1 NONO gene_id:4841|Hs108|chrX (471 aa) initn: 1765 init1: 1622 opt: 1716 Z-score: 579.9 bits: 117.2 E(32554): 5.6e-26 Smith-Waterman score: 1778; 60.0% identity (79.9% similar) in 473 aa overlap (241-707:16-471) 220 230 240 250 260 pF1KB9 KMPGGPKPGGGPGLSTPGGHPKPPHRGGGEPRGGRQHH--PPYHQQHHQGPPPGGPGGRS :: .::: .:::..: ::: . . CCDS14 MQSNKTFNLEKQNHTPRKHHQHHHQQQHHQQQQQQPPPPPIPANG 10 20 30 40 270 280 290 300 310 320 pF1KB9 EEKISDSEGFKANLSLLRRPGEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFI .. :..::. .:. .:.:::::.::: :::::::: ::::.:...:: :::. ::::: CCDS14 QQASSQNEGLTIDLKNFRKPGEKTFTQRSRLFVGNLPPDITEEEMRKLFEKYGKAGEVFI 50 60 70 80 90 100 330 340 350 360 370 380 pF1KB9 NKGKGFGFIKLESRALAEIAKAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLE .: ::::::.::.:.::::::.:::. :.::.::::::: :.:.:.:::: :::::::: CCDS14 HKDKGFGFIRLETRTLAEIAKVELDNMPLRGKQLRVRFACHSASLTVRNLPQYVSNELLE 110 120 130 140 150 160 390 400 410 420 430 440 pF1KB9 EAFSQFGPIERAVVIVDDRGRSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVE :::: :: .:::::::::::: .:::::::..:::::::..::::: ::::: :::: :: CCDS14 EAFSVFGQVERAVVIVDDRGRPSGKGIVEFSGKPAARKALDRCSEGSFLLTTFPRPVTVE 170 180 190 200 210 220 450 460 470 480 490 500 pF1KB9 PLEQLDDEDGLPEKLAQKNPMYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQV :..:::::.::::::. :: ...:::: :::::: :.:::::..:::.: ::::::..:: CCDS14 PMDQLDDEEGLPEKLVIKNQQFHKEREQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQV 230 240 250 260 270 280 510 520 530 540 550 560 pF1KB9 EKNMKDAKDKLESEMEDAYHEHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQE ..:.:.:..::: ::: : ::::. 
:.:::::::::::::::::::::.::::...:::: CCDS14 DRNIKEAREKLEMEMEAARHEHQVMLMRQDLMRRQEELRRMEELHNQEVQKRKQLELRQE 290 300 310 320 330 340 570 580 590 600 610 620 pF1KB9 EERRRREEEMMIRQREMEEQMRRQREESYSRMGYMDPRERDMRMGGGGAMNMGDPYGSGG :::::::::: .:..::.:::: .:.. . . : ::...::: : :: .: .. CCDS14 EERRRREEEM---RRQQEEMMRRQ-QEGF-KGTFPDAREQEIRMG---QMAMGGAMGINN 350 360 370 380 390 630 640 650 660 670 680 pF1KB9 Q-KFPPLGGGGGIGYEANPGVPPATM--SGSMMGSDMRTERFGQGGAGPVGGQGPRGMGP . .:: . : : :: :::: .:.. . :::::: :. . : : : CCDS14 RGAMPP--APVPAGTPAPPG--PATMMPDGTLGLTPPTTERFGQ--AATMEGIGAIG--- 400 410 420 430 440 690 700 pF1KB9 GTPAGYGRGREEYE-GPNKKPRF ::: ...:. : .:::. :. CCDS14 GTPPAFNRAAPGAEFAPNKRRRY 450 460 470 >>CCDS55445.1 NONO gene_id:4841|Hs108|chrX (382 aa) initn: 1574 init1: 1389 opt: 1463 Z-score: 499.8 bits: 102.1 E(32554): 1.6e-21 Smith-Waterman score: 1525; 61.8% identity (80.9% similar) in 398 aa overlap (314-707:2-382) 290 300 310 320 330 340 pF1KB9 LLRRPGEKTYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRA ..:: :::. :::::.: ::::::.::.:. CCDS55 MRKLFEKYGKAGEVFIHKDKGFGFIRLETRT 10 20 30 350 360 370 380 390 400 pF1KB9 LAEIAKAELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVI ::::::.:::. :.::.::::::: :.:.:.:::: :::::::::::: :: .:::::: CCDS55 LAEIAKVELDNMPLRGKQLRVRFACHSASLTVRNLPQYVSNELLEEAFSVFGQVERAVVI 40 50 60 70 80 90 410 420 430 440 450 460 pF1KB9 VDDRGRSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKL :::::: .:::::::..:::::::..::::: ::::: :::: :::..:::::.:::::: CCDS55 VDDRGRPSGKGIVEFSGKPAARKALDRCSEGSFLLTTFPRPVTVEPMDQLDDEEGLPEKL 100 110 120 130 140 150 470 480 490 500 510 520 pF1KB9 AQKNPMYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEM . :: ...:::: :::::: :.:::::..:::.: ::::::..::..:.:.:..::: :: CCDS55 VIKNQQFHKEREQPPRFAQPGSFEYEYAMRWKALIEMEKQQQDQVDRNIKEAREKLEMEM 160 170 180 190 200 210 530 540 550 560 570 580 pF1KB9 EDAYHEHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIRQR : : ::::. 
:.:::::::::::::::::::::.::::...:::::::::::::: .: CCDS55 EAARHEHQVMLMRQDLMRRQEELRRMEELHNQEVQKRKQLELRQEEERRRREEEM---RR 220 230 240 250 260 590 600 610 620 630 640 pF1KB9 EMEEQMRRQREESYSRMGYMDPRERDMRMGGGGAMNMGDPYGSGGQ-KFPPLGGGGGIGY ..::.:::: .:.. . . : ::...::: : :: .: ... .:: . : CCDS55 QQEEMMRRQ-QEGF-KGTFPDAREQEIRMG---QMAMGGAMGINNRGAMPP--APVPAGT 270 280 290 300 310 320 650 660 670 680 690 pF1KB9 EANPGVPPATM--SGSMMGSDMRTERFGQGGAGPVGGQGPRGMGPGTPAGYGRGREEYE- : :: :::: .:.. . :::::: :. . : : : ::: ...:. : CCDS55 PAPPG--PATMMPDGTLGLTPPTTERFGQ--AATMEGIGAIG---GTPPAFNRAAPGAEF 330 340 350 360 370 700 pF1KB9 GPNKKPRF .:::. :. CCDS55 APNKRRRY 380 707 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Mon Nov 7 14:38:06 2016 done: Mon Nov 7 14:38:06 2016 Total Scan time: 5.330 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]