FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE3905, 445 aa 1>>>pF1KE3905 445 - 445 aa - 445 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.8343+/-0.00106; mu= 20.2031+/- 0.064 mean_var=60.6797+/-12.109, 0's: 0 Z-trim(102.7): 32 B-trim: 2 in 1/50 Lambda= 0.164647 statistics sampled from 7047 (7054) to 7047 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.588), E-opt: 0.2 (0.217), width: 16 Scan time: 2.110 The best scores are: opt bits E(32554) CCDS7071.1 GDI2 gene_id:2665|Hs108|chr10 ( 445) 2937 706.5 1.4e-203 CCDS35452.1 GDI1 gene_id:2664|Hs108|chrX ( 447) 2642 636.4 1.7e-182 CCDS44352.1 GDI2 gene_id:2665|Hs108|chr10 ( 400) 2094 506.2 2.4e-143 CCDS31073.1 CHML gene_id:1122|Hs108|chr1 ( 656) 438 113.0 9.2e-25 CCDS14454.1 CHM gene_id:1121|Hs108|chrX ( 653) 408 105.9 1.3e-22 >>CCDS7071.1 GDI2 gene_id:2665|Hs108|chr10 (445 aa) initn: 2937 init1: 2937 opt: 2937 Z-score: 3768.5 bits: 706.5 E(32554): 1.4e-203 Smith-Waterman score: 2937; 100.0% identity (100.0% similar) in 445 aa overlap (1-445:1-445) 10 20 30 40 50 60 pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS70 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI 370 380 390 400 410 420 430 440 pF1KE3 YKRMTGSEFDFEEMKRKKNDIYGED ::::::::::::::::::::::::: CCDS70 YKRMTGSEFDFEEMKRKKNDIYGED 430 440 >>CCDS35452.1 GDI1 gene_id:2664|Hs108|chrX (447 aa) initn: 2642 init1: 2642 opt: 2642 Z-score: 3389.8 bits: 636.4 E(32554): 1.7e-182 Smith-Waterman score: 2642; 86.5% identity (96.8% similar) in 444 aa overlap (1-444:1-444) 10 20 30 40 50 60 pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG :.:::::::::::::::::::::::::::::::::::::::::.::::::.:::::.. CCDS35 MDEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESSSITPLEELYKRFQLLE 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP .:::::::::::::::::::::::::::::::::::::::::::.::::::::::::::: CCDS35 GPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVVEGSFVYKGGKIYKVP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE3 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD :::.:::::.:::.:::::::::::.::::::.::.::::.::. :.:::::.::::::: CCDS35 STETEALASNLMGMFEKRRFRKFLVFVANFDENDPKTFEGVDPQTTSMRDVYRKFDLGQD 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE3 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR :::::::::::::::::::::: ::.:::::::::::::::::::::::::::::::::: CCDS35 VIDFTGHALALYRTDDYLDQPCLETVNRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE3 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV ::::::::::::::...::..::::.:::::::.::::::::::::. :::.:.:::::. CCDS35 LSAIYGGTYMLNKPVDDIIMENGKVVGVKSEGEVARCKQLICDPSYIPDRVRKAGQVIRI 250 260 270 280 290 300 310 320 330 340 350 360 pF1KE3 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK ::::::::::::::::::::::::::::::::::::::.:::::::::::::.:::::: CCDS35 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISYAHNVAAQGKYIAIASTTVETT 310 320 330 340 350 360 370 380 390 400 410 420 pF1KE3 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI .::::..:::::::::.::::.:::: : : : :::.: : .:::::::::::.:::.: CCDS35 DPEKEVEPALELLEPIDQKFVAISDLYEPIDDGCESQVFCSCSYDATTHFETTCNDIKDI 370 380 390 400 410 420 430 440 pF1KE3 YKRMTGSEFDFEEMKRKKNDIYGED ::::.:. ::::.::::.::..:: CCDS35 YKRMAGTAFDFENMKRKQNDVFGEAEQ 430 440 >>CCDS44352.1 GDI2 gene_id:2665|Hs108|chr10 (400 aa) initn: 2091 init1: 2091 opt: 2094 Z-score: 2687.0 bits: 506.2 E(32554): 2.4e-143 Smith-Waterman score: 2562; 89.9% identity (89.9% similar) in 445 aa overlap (1-445:1-400) 10 20 30 40 50 60 pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRNPYYGGESASITPLEDLYKRFKIPG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE3 SPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTEGSFVYKGGKIYKVP ::::::::::::::::::::::::: CCDS44 SPPESMGRGRDWNVDLIPKFLMANG----------------------------------- 70 80 130 140 150 160 170 180 pF1KE3 STEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ----------LMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKTTMRDVYKKFDLGQD 90 100 110 120 130 190 200 210 220 230 240 pF1KE3 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 VIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLYPLYGLGELPQGFAR 140 150 160 170 180 190 250 260 270 280 290 300 pF1KE3 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 LSAIYGGTYMLNKPIEEIIVQNGKVIGVKSEGEIARCKQLICDPSYVKDRVEKVGQVIRV 200 210 220 230 240 250 310 320 330 340 350 360 pF1KE3 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ICILSHPIKNTNDANSCQIIIPQNQVNRKSDIYVCMISFAHNVAAQGKYIAIVSTTVETK 260 270 280 290 300 310 370 380 390 400 410 420 pF1KE3 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 EPEKEIRPALELLEPIEQKFVSISDLLVPKDLGTESQIFISRTYDATTHFETTCDDIKNI 320 330 340 350 360 370 430 440 pF1KE3 YKRMTGSEFDFEEMKRKKNDIYGED ::::::::::::::::::::::::: CCDS44 YKRMTGSEFDFEEMKRKKNDIYGED 380 390 400 >>CCDS31073.1 CHML gene_id:1122|Hs108|chr1 (656 aa) initn: 455 init1: 204 opt: 438 Z-score: 558.0 bits: 113.0 E(32554): 9.2e-25 Smith-Waterman score: 438; 24.3% identity (62.0% similar) in 334 aa overlap (69-392:226-549) 40 50 60 70 80 90 pF1KE3 YGGESASITPLEDLYKRFKIPGSPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTR :: .:.::. :.:...: :. .:. ..:.: CCDS31 GDKDESKSTVEDKADEPIRNRITYSQIVKEGRRFNIDLVSKLLYSQGLLIDLLIKSDVSR 200 210 220 230 240 250 100 110 120 130 140 150 pF1KE3 YLDFKVTEGSFVYKGGKIYKVPSTEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTF :..:: . .... ::. .:: ..:... :. . . ::: . :::.. .. :. : . CCDS31 YVEFKNVTRILAFREGKVEQVPCSRADVFNSKELTMVEKRMLMKFLTFCLEY-EQHPDEY 260 270 280 290 300 310 160 170 180 190 200 210 pF1KE3 EGIDPKKTTMRDVYKKFDLGQDVIDFTGHALALYRTDDYLDQPC--YETINRIKLYSESL ... .. .. . : : .. :. :..:. .. : . .: : . . : CCDS31 QAF--RQCSFSEYLKTKKLTPNLQHFVLHSIAMTS-----ESSCTTIDGLNATKNFLQCL 320 330 340 350 360 220 230 240 250 260 270 pF1KE3 ARYGKSPYLYPLYGLGELPQGFARLSAIYGGTYMLNKPIEEIIV--QNGKVIGVKSE-GE .:.:..:.:.:::: ::.:::: :. :..:: : : . .. ..: ..:. .. .. :. CCDS31 GRFGNTPFLFPLYGQGEIPQGFCRMCAVFGGIYCLRHKVQCFVVDKESGRCKAIIDHFGQ 370 380 390 400 410 420 280 290 300 310 320 pF1KE3 IARCKQLICDPSYVKDRVE---KVGQVIRVICILSHPIKNTN-DANSCQIIIPQNQVNRK : .: . ::..... . :. :.. : .. : .:. : .. .:.: . . CCDS31 RINAKYFIVEDSYLSEETCSNVQYKQISRAVLITDQSILKTDLDQQTSILIVPPAEPG-A 430 440 450 460 470 480 330 340 350 360 370 380 pF1KE3 SDIYVCMISFAHNVAAQGKYIAIVSTTVETKEPEKEIRPALE-LLEPIEQKFVSISDLLV . : . . . . :.. .. . .: ..... ... :. : . .. .: CCDS31 CAVRVTELCSSTMTCMKDTYLVHLTCS-SSKTAREDLESVVKKLFTPYTETEINEEELTK 490 500 510 520 530 540 390 400 410 420 430 440 pF1KE3 PKDLGTESQIFISRTYDATTHFETTCDDIKNIYKRMTGSEFDFEEMKRKKNDIYGED :. : CCDS31 PRLLWALYFNMRDSSGISRSSYNGLPSNVYVCSGPDCGLGNEHAVKQAETLFQEIFPTEE 550 560 570 580 590 600 >>CCDS14454.1 CHM gene_id:1121|Hs108|chrX (653 aa) initn: 532 init1: 195 opt: 408 Z-score: 519.5 bits: 105.9 E(32554): 1.3e-22 Smith-Waterman score: 408; 24.6% identity (58.9% similar) in 418 aa overlap (25-421:174-573) 10 20 30 40 pF1KE3 MNEEYDVIVLGTGLTECILSGIMSVNGKKVLHMDRN---PYYGGESAS-ITPL- :.:.: : : . : ..:. : .:. CCDS14 ESLSTMSCEMLTEQTPSSDPENALEVNGAEVTGEKENHCDDKTCVPSTSAEDMSENVPIA 150 160 170 180 190 200 50 60 70 80 90 100 pF1KE3 EDLY---KRFKIPGSPPESMGRGRDWNVDLIPKFLMANGQLVKMLLYTEVTRYLDFKVTE :: :. .: : . . .:: .:.::. :.:.. : :. .:. ..:.:: .:: CCDS14 EDTTEQPKKNRITYS--QIIKEGRRFNIDLVSKLLYSRGLLIDLLIKSNVSRYAEFKNIT 210 220 230 240 250 260 110 120 130 140 150 160 pF1KE3 GSFVYKGGKIYKVPSTEAEALASSLMGLFEKRRFRKFLVYVANFDEKDPRTFEGIDPKKT .... :.. .:: ..:... :. . . ::: . :::.. .. :: : ..: . . CCDS14 RILAFREGRVEQVPCSRADVFNSKQLTMVEKRMLMKFLTFCMEY-EKYPDEYKGYE--EI 270 280 290 300 310 170 180 190 200 210 220 pF1KE3 TMRDVYKKFDLGQDVIDFTGHALALYRTDDYLDQPCYETINRIKLYSESLARYGKSPYLY :. . : : .. .. :..:. :.. . . .. : . . :.:::..:.:. CCDS14 TFYEYLKTQKLTPNLQYIVMHSIAM--TSE-TASSTIDGLKATKNFLHCLGRYGNTPFLF 320 330 340 350 360 370 230 240 250 260 270 280 pF1KE3 PLYGLGELPQGFARLSAIYGGTYMLNKPIEEIIV--QNGKVIGVKSE-GEIARCKQLICD :::: ::::: : :. :..:: : : . .. ..: .. : .. .. :. .... . CCDS14 PLYGQGELPQCFCRMCAVFGGIYCLRHSVQCLVVDKESRKCKAIIDQFGQRIISEHFLVE 380 390 400 410 420 430 290 300 310 320 330 pF1KE3 PSYVKD----RVEKVGQVIRVICILSHPIKNTNDANSCQII-IPQNQVNRKSDIYVCMIS :: . ::. :. :.. : .. . .:.. .. .:. .: .. . . . : . CCDS14 DSYFPENMCSRVQ-YRQISRAVLITDRSVLKTDSDQQISILTVPAEEPGTFA-VRVIELC 440 450 460 470 480 490 340 350 360 370 380 390 pF1KE3 FAHNVAAQGKYIAIVSTTVETKEPEKEIRPALELLEPIEQK-FVSISDLLVPKDLGTESQ . . .: :.. .. : . : : :: . :: :: ... . .. . . CCDS14 SSTMTCMKGTYLVHLTCTSS--------KTAREDLESVVQKLFVPYTEMEIENEQVEKPR 500 510 520 530 540 400 410 420 430 440 pF1KE3 IFISRTYDA--TTHFETTC-DDI-KNIYKRMTGSEFDFEEMKRKKNDIYGED :. . .. .. . .: .:. .:.: CCDS14 ILWALYFNMRDSSDISRSCYNDLPSNVYVCSGPDCGLGNDNAVKQAETLFQEICPNEDFC 550 560 570 580 590 600 445 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 09:05:30 2016 done: Sun Nov 6 09:05:30 2016 Total Scan time: 2.110 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]