FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE5349, 297 aa 1>>>pF1KE5349 297 - 297 aa - 297 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.8286+/-0.000886; mu= 12.2846+/- 0.053 mean_var=62.3034+/-12.605, 0's: 0 Z-trim(104.8): 23 B-trim: 3 in 1/48 Lambda= 0.162487 statistics sampled from 8093 (8103) to 8093 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.628), E-opt: 0.2 (0.249), width: 16 Scan time: 2.050 The best scores are: opt bits E(32554) CCDS55518.1 FMR1 gene_id:2332|Hs108|chrX ( 537) 1957 467.4 1e-131 CCDS76039.1 FMR1 gene_id:2332|Hs108|chrX ( 586) 1957 467.4 1.1e-131 CCDS55519.1 FMR1 gene_id:2332|Hs108|chrX ( 611) 1957 467.4 1.2e-131 CCDS14682.1 FMR1 gene_id:2332|Hs108|chrX ( 632) 1957 467.4 1.2e-131 CCDS46965.1 FXR1 gene_id:8087|Hs108|chr3 ( 539) 1478 355.1 6.5e-98 CCDS3238.1 FXR1 gene_id:8087|Hs108|chr3 ( 621) 1478 355.1 7.4e-98 CCDS45604.1 FXR2 gene_id:9513|Hs108|chr17 ( 673) 1411 339.4 4.3e-93 CCDS33894.1 FXR1 gene_id:8087|Hs108|chr3 ( 536) 1002 243.5 2.5e-64 >>CCDS55518.1 FMR1 gene_id:2332|Hs108|chrX (537 aa) initn: 1957 init1: 1957 opt: 1957 Z-score: 2478.2 bits: 467.4 E(32554): 1e-131 Smith-Waterman score: 1957; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294) 10 20 30 40 50 60 pF1KE5 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGLKI :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGKVIGKN 250 260 270 280 290 300 CCDS55 GKLIQEIVDKSGVVRVRIEAENEKNVPQEEEIMPPNSLPSNNSRVGPNAPEEKKHLDIKE 310 320 330 340 350 360 >>CCDS76039.1 FMR1 gene_id:2332|Hs108|chrX (586 aa) initn: 1957 init1: 1957 opt: 1957 Z-score: 2477.6 bits: 467.4 E(32554): 1.1e-131 Smith-Waterman score: 1957; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294) 10 20 30 40 50 60 pF1KE5 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGLKI :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGKVIGKN 250 260 270 280 290 300 CCDS76 GKLIQEIVDKSGVVRVRIEAENEKNVPQEEEIMPPNSLPSNNSRVGPNAPEEKKHLDIKE 310 320 330 340 350 360 >>CCDS55519.1 FMR1 gene_id:2332|Hs108|chrX (611 aa) initn: 1957 init1: 1957 opt: 1957 Z-score: 2477.2 bits: 467.4 E(32554): 1.2e-131 Smith-Waterman score: 1957; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294) 10 20 30 40 50 60 pF1KE5 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGLKI :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS55 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGKVIGKN 250 260 270 280 290 300 CCDS55 GKLIQEIVDKSGVVRVRIEAENEKNVPQEEEIMPPNSLPSNNSRVGPNAPEEKKHLDIKE 310 320 330 340 350 360 >>CCDS14682.1 FMR1 gene_id:2332|Hs108|chrX (632 aa) initn: 1957 init1: 1957 opt: 1957 Z-score: 2477.0 bits: 467.4 E(32554): 1.2e-131 Smith-Waterman score: 1957; 100.0% identity (100.0% similar) in 294 aa overlap (1-294:1-294) 10 20 30 40 50 60 pF1KE5 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGLKI :::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS14 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGKVIGKN 250 260 270 280 290 300 CCDS14 GKLIQEIVDKSGVVRVRIEAENEKNVPQEEEIMPPNSLPSNNSRVGPNAPEEKKHLDIKE 310 320 330 340 350 360 >>CCDS46965.1 FXR1 gene_id:8087|Hs108|chr3 (539 aa) initn: 1478 init1: 1478 opt: 1478 Z-score: 1871.3 bits: 355.1 E(32554): 6.5e-98 Smith-Waterman score: 1478; 74.1% identity (90.1% similar) in 294 aa overlap (1-294:1-294) 10 20 30 40 50 60 pF1KE5 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN : ::.::::::::::::.:.:::::::.::.:::::::.::.::..::.::: .:.:. CCDS46 MAELTVEVRGSNGAFYKGFIKDVHEDSLTVVFENNWQPERQVPFNEVRLPPPPDIKKEIS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP :.::::::::::..::: ::::::::.:::::::::::::::::::::.:::: :: :: CCDS46 EGDEVEVYSRANDQEPCGWWLAKVRMMKGEFYVIEYAACDATYNEIVTFERLRPVNQNKT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR . :.:: : .:::::::. ::.: ::::::::::: . : ::. ::.::: .:.: :: CCDS46 VKKNTFFKCTVDVPEDLREACANENAHKDFKKAVGACRIFYHPETTQLMILSASEATVKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN ...: :::.::.:::: :. :::::.:.:: ..:::. :::.:.::::::::::::::.: CCDS46 VNILSDMHLRSIRTKLMLMSRNEEATKHLECTKQLAAAFHEEFVVREDLMGLAIGTHGSN 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGLKI :::::::::::::.::::: ::.::::. :::::::.::::.:: ::::::::: CCDS46 IQQARKVPGVTAIELDEDTGTFRIYGESADAVKKARGFLEFVEDFIQVPRNLVGKVIGKN 250 260 270 280 290 300 CCDS46 GKVIQEIVDKSGVVRVRIEGDNENKLPREDGMVPFVFVGTKESIGNVQVLLEYHIAYLKE 310 320 330 340 350 360 >>CCDS3238.1 FXR1 gene_id:8087|Hs108|chr3 (621 aa) initn: 1478 init1: 1478 opt: 1478 Z-score: 1870.3 bits: 355.1 E(32554): 7.4e-98 Smith-Waterman score: 1478; 74.1% identity (90.1% similar) in 294 aa overlap (1-294:1-294) 10 20 30 40 50 60 pF1KE5 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFPPPVGYNKDIN : ::.::::::::::::.:.:::::::.::.:::::::.::.::..::.::: .:.:. CCDS32 MAELTVEVRGSNGAFYKGFIKDVHEDSLTVVFENNWQPERQVPFNEVRLPPPPDIKKEIS 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE5 ESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSVNPNKP :.::::::::::..::: ::::::::.:::::::::::::::::::::.:::: :: :: CCDS32 EGDEVEVYSRANDQEPCGWWLAKVRMMKGEFYVIEYAACDATYNEIVTFERLRPVNQNKT 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE5 ATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINEVTSKR . :.:: : .:::::::. ::.: ::::::::::: . : ::. ::.::: .:.: :: CCDS32 VKKNTFFKCTVDVPEDLREACANENAHKDFKKAVGACRIFYHPETTQLMILSASEATVKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE5 AHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIGTHGAN ...: :::.::.:::: :. :::::.:.:: ..:::. :::.:.::::::::::::::.: CCDS32 VNILSDMHLRSIRTKLMLMSRNEEATKHLECTKQLAAAFHEEFVVREDLMGLAIGTHGSN 190 200 210 220 230 240 250 260 270 280 290 pF1KE5 IQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGLKI :::::::::::::.::::: ::.::::. :::::::.::::.:: ::::::::: CCDS32 IQQARKVPGVTAIELDEDTGTFRIYGESADAVKKARGFLEFVEDFIQVPRNLVGKVIGKN 250 260 270 280 290 300 CCDS32 GKVIQEIVDKSGVVRVRIEGDNENKLPREDGMVPFVFVGTKESIGNVQVLLEYHIAYLKE 310 320 330 340 350 360 >>CCDS45604.1 FXR2 gene_id:9513|Hs108|chr17 (673 aa) initn: 1411 init1: 1411 opt: 1411 Z-score: 1784.8 bits: 339.4 E(32554): 4.3e-93 Smith-Waterman score: 1411; 71.5% identity (87.3% similar) in 291 aa overlap (4-294:14-304) 10 20 30 40 50 pF1KE5 MEELVVEVRGSNGAFYKAFVKDVHEDSITVAFENNWQPDRQIPFHDVRFP : ::::::::::::.:::::::::.:. :::::: .::::: :::.: CCDS45 MGGLASGGDVEPGLPVEVRGSNGAFYKGFVKDVHEDSVTIFFENNWQSERQIPFGDVRLP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE5 PPVGYNKDINESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIE ::. :::.:.:.:::::::::::.::: ::::.:::.::.::::::::::::::::::.: CCDS45 PPADYNKEITEGDEVEVYSRANEQEPCGWWLARVRMMKGDFYVIEYAACDATYNEIVTLE 70 80 90 100 110 120 120 130 140 150 160 170 pF1KE5 RLRSVNPNKPATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVI ::: :::: ::: .: :. . ::::::. :..: .::.::::.:: . . : .: : CCDS45 RLRPVNPNPLATKGSFFKVTMAVPEDLREACSNENVHKEFKKALGANCIFLNITNSELFI 130 140 150 160 170 180 180 190 200 210 220 230 pF1KE5 LSINEVTSKRAHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLM :: .:. ::: .: ::::::::::: :. :::::.:.::.:.:::. :.:.: :::::: CCDS45 LSTTEAPVKRASLLGDMHFRSLRTKLLLMSRNEEATKHLETSKQLAAAFQEEFTVREDLM 190 200 210 220 230 240 240 250 260 270 280 290 pF1KE5 GLAIGTHGANIQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPR :::::::::::::::::::::::.: :.::::.:::: .: ..:::.:::.:: .:::: CCDS45 GLAIGTHGANIQQARKVPGVTAIELGEETCTFRIYGETPEACRQARSYLEFSEDSVQVPR 250 260 270 280 290 300 pF1KE5 NLVGLKI :::: CCDS45 NLVGKVIGKNGKVIQEIVDKSGVVRVRVEGDNDKKNPREEGMVPFIFVGTRENISNAQAL 310 320 330 340 350 360 >>CCDS33894.1 FXR1 gene_id:8087|Hs108|chr3 (536 aa) initn: 1002 init1: 1002 opt: 1002 Z-score: 1268.3 bits: 243.5 E(32554): 2.5e-64 Smith-Waterman score: 1002; 73.7% identity (88.5% similar) in 209 aa overlap (86-294:1-209) 60 70 80 90 100 110 pF1KE5 NKDINESDEVEVYSRANEKEPCCWWLAKVRMIKGEFYVIEYAACDATYNEIVTIERLRSV :.:::::::::::::::::::::.:::: : CCDS33 MMKGEFYVIEYAACDATYNEIVTFERLRPV 10 20 30 120 130 140 150 160 170 pF1KE5 NPNKPATKDTFHKIKLDVPEDLRQMCAKEAAHKDFKKAVGAFSVTYDPENYQLVILSINE : :: . :.:: : .:::::::. ::.: ::::::::::: . : ::. ::.::: .: CCDS33 NQNKTVKKNTFFKCTVDVPEDLREACANENAHKDFKKAVGACRIFYHPETTQLMILSASE 40 50 60 70 80 90 180 190 200 210 220 230 pF1KE5 VTSKRAHMLIDMHFRSLRTKLSLIMRNEEASKQLESSRQLASRFHEQFIVREDLMGLAIG .: ::...: :::.::.:::: :. :::::.:.:: ..:::. :::.:.::::::::::: CCDS33 ATVKRVNILSDMHLRSIRTKLMLMSRNEEATKHLECTKQLAAAFHEEFVVREDLMGLAIG 100 110 120 130 140 150 240 250 260 270 280 290 pF1KE5 THGANIQQARKVPGVTAIDLDEDTCTFHIYGEDQDAVKKARSFLEFAEDVIQVPRNLVGL :::.::::::::::::::.::::: ::.::::. :::::::.::::.:: ::::::::: CCDS33 THGSNIQQARKVPGVTAIELDEDTGTFRIYGESADAVKKARGFLEFVEDFIQVPRNLVGK 160 170 180 190 200 210 pF1KE5 KI CCDS33 VIGKNGKVIQEIVDKSGVVRVRIEGDNENKLPREDGMVPFVFVGTKESIGNVQVLLEYHI 220 230 240 250 260 270 297 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 00:00:54 2016 done: Tue Nov 8 00:00:54 2016 Total Scan time: 2.050 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]