FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6436, 460 aa 1>>>pF1KE6436 460 - 460 aa - 460 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.1920+/-0.000831; mu= 17.9800+/- 0.050 mean_var=59.1113+/-11.743, 0's: 0 Z-trim(105.2): 20 B-trim: 0 in 0/52 Lambda= 0.166816 statistics sampled from 8301 (8309) to 8301 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.638), E-opt: 0.2 (0.255), width: 16 Scan time: 2.970 The best scores are: opt bits E(32554) CCDS375.1 AZIN2 gene_id:113451|Hs108|chr1 ( 460) 3114 757.9 4.7e-219 CCDS76138.1 AZIN2 gene_id:113451|Hs108|chr1 ( 480) 2009 492.0 5.6e-139 CCDS1672.1 ODC1 gene_id:4953|Hs108|chr2 ( 461) 1569 386.1 4.1e-107 CCDS6295.1 AZIN1 gene_id:51582|Hs108|chr8 ( 448) 1271 314.4 1.5e-85 >>CCDS375.1 AZIN2 gene_id:113451|Hs108|chr1 (460 aa) initn: 3114 init1: 3114 opt: 3114 Z-score: 4046.1 bits: 757.9 E(32554): 4.7e-219 Smith-Waterman score: 3114; 100.0% identity (100.0% similar) in 460 aa overlap (1-460:1-460) 10 20 30 40 50 60 pF1KE6 MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKCL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIAQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 PRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIAQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IKYAAKHGIQLLSFDNEMELAKVVKSHPSAKMVLCIATDDSHSLSCLSLKFGVSLKSCRH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 IKYAAKHGIQLLSFDNEMELAKVVKSHPSAKMVLCIATDDSHSLSCLSLKFGVSLKSCRH 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 LLENAKKHHVEVVGVSFHIGSGCPDPQAYAQSIADARLVFEMGTELGHKMHVLDLGGGFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 LLENAKKHHVEVVGVSFHIGSGCPDPQAYAQSIADARLVFEMGTELGHKMHVLDLGGGFP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 GTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAFTVAVSIIAKKEVLLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 GTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAFTVAVSIIAKKEVLLD 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE6 QPGREEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPILQKKPSTEQPLYSSSLWGPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 QPGREEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPILQKKPSTEQPLYSSSLWGPA 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE6 VDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWGTQACHITYAMSRVAWEALRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS37 VDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWGTQACHITYAMSRVAWEALRR 370 380 390 400 410 420 430 440 450 460 pF1KE6 QLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIM :::::::::::::::::::::::::::::::::::::::: CCDS37 QLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIM 430 440 450 460 >>CCDS76138.1 AZIN2 gene_id:113451|Hs108|chr1 (480 aa) initn: 3100 init1: 2009 opt: 2009 Z-score: 2608.6 bits: 492.0 E(32554): 5.6e-139 Smith-Waterman score: 3064; 95.8% identity (95.8% similar) in 480 aa overlap (1-460:1-480) 10 20 30 40 50 60 pF1KE6 MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKCL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKCL 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 PRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIAQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 PRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIAQ 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 IKYAAKHGIQLLSFDNEMELAKVVKSHPSAKMVLCIATDDSHSLSCLSLKFGVSLKSCRH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 IKYAAKHGIQLLSFDNEMELAKVVKSHPSAKMVLCIATDDSHSLSCLSLKFGVSLKSCRH 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE6 LLENAKKHHVEVVGVSFHIGSGCPDPQAYAQSIADARLVFEMGTELGHKMHVLDLGGGFP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 LLENAKKHHVEVVGVSFHIGSGCPDPQAYAQSIADARLVFEMGTELGHKMHVLDLGGGFP 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE6 GTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAFTVAVSIIAKKEVLLD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 GTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAFTVAVSIIAKKEVLLD 250 260 270 280 290 300 310 320 330 340 pF1KE6 QPGRE--------------------EENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPI ::::: ::::::::::::::::::::::::::::::::::: CCDS76 QPGREAPLPPPHIATCAASEPSPPAEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPI 310 320 330 340 350 360 350 360 370 380 390 400 pF1KE6 LQKKPSTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 LQKKPSTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWG 370 380 390 400 410 420 410 420 430 440 450 460 pF1KE6 TQACHITYAMSRVAWEALRRQLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 TQACHITYAMSRVAWEALRRQLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIM 430 440 450 460 470 480 >>CCDS1672.1 ODC1 gene_id:4953|Hs108|chr2 (461 aa) initn: 1533 init1: 1497 opt: 1569 Z-score: 2036.6 bits: 386.1 E(32554): 4.1e-107 Smith-Waterman score: 1569; 52.5% identity (80.2% similar) in 440 aa overlap (7-443:8-444) 10 20 30 40 50 pF1KE6 MAGYLSESDFVMVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLKC : : ...:::...:.: . .: ..:. ::.::::: :..::. .:: CCDS16 MNNFGNEEFDCHFLDEGFTAKDILDQKINEVS--SSDDKDAFYVADLGDILKKHLRWLKA 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 LPRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQIA :::: ::::::::.: ...:.:: : ::.::.:.:..::: .:.: .:: ::::::.. CCDS16 LPRVTPFYAVKCNDSKAIVKTLAATGTGFDCASKTEIQLVQSLGVPPERIIYANPCKQVS 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 QIKYAAKHGIQLLSFDNEMELAKVVKSHPSAKMVLCIATDDSHSLSCLSLKFGVSLKSCR ::::::..:.:...::.:.:: ::...::.::.:: ::::::... ::.:::..:.. : CCDS16 QIKYAANNGVQMMTFDSEVELMKVARAHPKAKLVLRIATDDSKAVCRLSVKFGATLRTSR 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 HLLENAKKHHVEVVGVSFHIGSGCPDPQAYAQSIADARLVFEMGTELGHKMHVLDLGGGF ::: ::. ...:::::::.:::: ::....:.:.::: ::.::.:.: .:..::.:::: CCDS16 LLLERAKELNIDVVGVSFHVGSGCTDPETFVQAISDARCVFDMGAEVGFSMYLLDIGGGF 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 PGTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAFTVAVSIIAKKEVLL ::.: .:..::::..::: ::: ::: :: :.:: :::::.::::.::.::::: :: CCDS16 PGSEDVKLKFEEITGVINPALDKYFPSDSGVRIIAEPGRYYVASAFTLAVNIIAKKIVLK 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 DQPGREEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPILQKKPSTEQPLYSSSLWGP .: : ..:. :. .:..:....:::: :: .:.:. :.:::.:. .. ::::.::: CCDS16 EQTGSDDEDESSEQTFMYYVNDGVYGSFNCILYDHAHVKPLLQKRPKPDEKYYSSSIWGP 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 AVDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWGTQACHITYAMSRVAWEALR . :: : ..: ::..:::::..:.:::::::. .: : : : : :.:: ::. : CCDS16 TCDGLDRIVERCDLPEMHVGDWMLFENMGAYTVAAASTFNGFQRPTIYYVMSGPAWQ-LM 360 370 380 390 400 410 420 430 440 450 460 pF1KE6 RQLMAAEQEDDVE---GVCKPLSCGWEITDTLCVGPVFTPASIM .:.. . .:: . :.::.:: CCDS16 QQFQNPDFPPEVEEQDASTLPVSCAWESGMKRHRAACASASINV 420 430 440 450 460 >>CCDS6295.1 AZIN1 gene_id:51582|Hs108|chr8 (448 aa) initn: 1244 init1: 776 opt: 1271 Z-score: 1649.2 bits: 314.4 E(32554): 1.5e-85 Smith-Waterman score: 1271; 45.5% identity (74.4% similar) in 422 aa overlap (1-419:1-416) 10 20 30 40 50 pF1KE6 MAGYLSESDFV--MVEEGFSTRDLLKELTLGASQATTDEVAAFFVADLGAIVRKHFCFLK : :....... ...:: . ... . . . : ::::.::: ::.:: . . CCDS62 MKGFIDDANYSVGLLDEGTNLGNVIDNYVY---EHTLTGKNAFFVGDLGKIVKKHSQWQN 10 20 30 40 50 60 70 80 90 100 110 pF1KE6 CLPRVRPFYAVKCNSSPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICANPCKQI . ...:::.:::::.:.::..:: :: ::.:..: :: :::..:.: .:: .::::. CCDS62 VVAQIKPFYTVKCNSAPAVLEILAALGTGFACSSKNEMALVQELGVPPENIIYISPCKQV 60 70 80 90 100 110 120 130 140 150 160 170 pF1KE6 AQIKYAAKHGIQLLSFDNEMELAKVVKSHPSAKMVLCIATDDSHSLSCLSLKFGVSLKSC .::::::: :...:. :::.:: :....::.::..: :::.:. . ..:::..::.: CCDS62 SQIKYAAKVGVNILTCDNEIELKKIARNHPNAKVLLHIATEDNIGGEEGNMKFGTTLKNC 120 130 140 150 160 170 180 190 200 210 220 230 pF1KE6 RHLLENAKKHHVEVVGVSFHIGSGCPDPQAYAQSIADARLVFEMGTELGHKMHVLDLGGG ::::: ::. :...::.::..:.: . :.:.....::: ::.:. :.: :..::.::: CCDS62 RHLLECAKELDVQIIGVKFHVSSACKESQVYVHALSDARCVFDMAGEIGFTMNMLDIGGG 180 190 200 210 220 230 240 250 260 270 280 290 pF1KE6 FPGTEGAKVRFEEIASVINSALDLYFPEGCGVDIFAELGRYYVTSAFTVAVSIIAKKEVL : ::: ..::. ::. ::.::::: :: :..: : :::.::::.::.::::: : CCDS62 FTGTE---FQLEEVNHVISPLLDIYFPEGSGVKIISEPGSYYVSSAFTLAVNIIAKKVVE 240 250 260 270 280 290 300 310 320 330 340 350 pF1KE6 LDQ-PGREEENGSTSKTIVYHLDEGVYGIFNSVLFDNICPTPILQKKPSTEQPLYSSSLW :. :. :..:: ...:....:::: : : : ... : ..:: . ..::..:::: CCDS62 NDKFPSGVEKTGSDEPAFMYYMNDGVYGSFASKLSEDLNTIPEVHKKYKEDEPLFTSSLW 300 310 320 330 340 350 360 370 380 390 400 410 pF1KE6 GPAVDGCDCVAEGLWLPQLHVGDWLVFDNMGAYTVGMGSPFWGTQACHITYAMSRVAWEA ::. : : ..:. ::.:.:::::.:::::: . : : : : : :: : CCDS62 GPSCDELDQIVESCLLPELNVGDWLIFDNMGADSFHEPSAFNDFQRPAIYYMMSFSDWYE 360 370 380 390 400 410 420 430 440 450 460 pF1KE6 LRRQLMAAEQEDDVEGVCKPLSCGWEITDTLCVGPVFTPASIM .. CCDS62 MQDAGITSDSMMKNFFFVPSCIQLSQEDSFSAEA 420 430 440 460 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 03:23:30 2016 done: Tue Nov 8 03:23:30 2016 Total Scan time: 2.970 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]