FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE5460, 334 aa
  1>>>pF1KE5460 334 - 334 aa - 334 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4287+/-0.00109; mu= 14.9684+/- 0.065
 mean_var=58.9560+/-12.018, 0's: 0 Z-trim(102.2): 27 B-trim: 274 in 1/50
 Lambda= 0.167036
 statistics sampled from 6840 (6862) to 6840 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.571), E-opt: 0.2 (0.211), width: 16
 Scan time: 2.290

The best scores are:                                  opt  bits E(32554)
CCDS1874.1 MDH1 gene_id:4190|Hs108|chr2       ( 334) 2155 528.0 4.3e-150
CCDS56121.1 MDH1 gene_id:4190|Hs108|chr2      ( 352) 2148 526.3 1.4e-149
CCDS56122.1 MDH1 gene_id:4190|Hs108|chr2      ( 245) 1594 392.7 1.6e-109
CCDS82561.1 MDH1B gene_id:130752|Hs108|chr2   ( 420)  453 117.8 1.5e-26
CCDS63102.1 MDH1B gene_id:130752|Hs108|chr2   ( 517)  453 117.9 1.9e-26
CCDS33365.1 MDH1B gene_id:130752|Hs108|chr2   ( 518)  453 117.9 1.9e-26

>>CCDS1874.1 MDH1 gene_id:4190|Hs108|chr2 (334 aa)
 initn: 2155 init1: 2155 opt: 2155 Z-score: 2808.2 bits: 528.0 E(32554): 4.3e-150
Smith-Waterman score: 2155; 100.0% identity (100.0% similar) in 334 aa overlap (1-334:1-334)

               10        20        30        40        50        60
pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLDITPMMGVLDGVLMELQDC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLDITPMMGVLDGVLMELQDC
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE5 ALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLLKANVKIFKSQGAALDKYA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 ALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLLKANVKIFKSQGAALDKYA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE5 KKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVKN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 KKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVKN
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE5 VIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 VIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARKL
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE5 SSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 SSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFV
              250       260       270       280       290       300

              310       320       330
pF1KE5 EGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
       ::::::::::::::::::::::::::::::::::
CCDS18 EGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
              310       320       330

>>CCDS56121.1 MDH1 gene_id:4190|Hs108|chr2 (352 aa)
 initn: 2148 init1: 2148 opt: 2148 Z-score: 2798.7 bits: 526.3 E(32554): 1.4e-149
Smith-Waterman score: 2148; 100.0% identity (100.0% similar) in 333 aa overlap (2-334:20-352)

                                 10        20        30        40
pF1KE5                   MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLD
                          :::::::::::::::::::::::::::::::::::::::::
CCDS56 MRRCSYFPKDVTVFDKDDKSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKDQPIILVLLD
               10        20        30        40        50        60

             50        60        70        80        90       100
pF1KE5 ITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLL
               70        80        90       100       110       120

            110       120       130       140       150       160
pF1KE5 KANVKIFKSQGAALDKYAKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 KANVKIFKSQGAALDKYAKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNR
              130       140       150       160       170       180

            170       180       190       200       210       220
pF1KE5 AKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGE
              190       200       210       220       230       240

            230       240       250       260       270       280
pF1KE5 FVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 FVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPD
              250       260       270       280       290       300

            290       300       310       320       330
pF1KE5 DLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
              310       320       330       340       350

>>CCDS56122.1 MDH1 gene_id:4190|Hs108|chr2 (245 aa)
 initn: 1594 init1: 1594 opt: 1594 Z-score: 2079.7 bits: 392.7 E(32554): 1.6e-109
Smith-Waterman score: 1594; 100.0% identity (100.0% similar) in 245 aa overlap (90-334:1-245)

      60        70        80        90       100       110
pF1KE5 CALPLLKDVIATDKEDVAFKDLDVAILVGSMPRREGMERKDLLKANVKIFKSQGAALDKY
                                     ::::::::::::::::::::::::::::::
CCDS56                               MPRREGMERKDLLKANVKIFKSQGAALDKY
                                             10        20        30

     120       130       140       150       160       170
pF1KE5 AKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 AKKSVKVIVVGNPANTNCLTASKSAPSIPKENFSCLTRLDHNRAKAQIALKLGVTANDVK
               40        50        60        70        80        90

     180       190       200       210       220       230
pF1KE5 NVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 NVIIWGNHSSTQYPDVNHAKVKLQGKEVGVYEALKDDSWLKGEFVTTVQQRGAAVIKARK
              100       110       120       130       140       150

     240       250       260       270       280       290
pF1KE5 LSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LSSAMSAAKAICDHVRDIWFGTPEGEFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKF
              160       170       180       190       200       210

     300       310       320       330
pF1KE5 VEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
       :::::::::::::::::::::::::::::::::::
CCDS56 VEGLPINDFSREKMDLTAKELTEEKESAFEFLSSA
              220       230       240

>>CCDS82561.1 MDH1B gene_id:130752|Hs108|chr2 (420 aa)
 initn: 364 init1: 212 opt: 453 Z-score: 589.9 bits: 117.8 E(32554): 1.5e-26
Smith-Waterman score: 453; 28.4% identity (62.0% similar) in 334 aa overlap (4-328:33-360)

10 20 30 pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKD :..: .:.:.. :.:. . 
.: ::: CCDS82 TELMMVIAQENLGAHIEKEQEEEALKTCINPLQVWITSASAPACYNLIPILTSGEVFGMH 10 20 30 40 50 60 40 50 60 70 80 90 pF1KE5 QPIILVLLDITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRR : ..:.: : ....: :: : :.:..: : . ::.. : ... . . CCDS82 TEISITLFDNKQAEEHLKSLVVETQDLASPVLRSVSICTKVEEAFRQAHVIVVLDDSTNK 70 80 90 100 110 120 100 110 120 130 140 150 pF1KE5 EGMERKDLLKANVKIFKSQGAALDKYAKKSVKVIVVGNP-ANTNCLTASKSAPSIPKENF : . .: :.. : . . : ..: :..::.::: : .: . . . :: : .. . CCDS82 EVFTLEDCLRSRVPLCRLYGYLIEKNAHESVRVIVGGRTFVNLKTVLLMRYAPRIAHNII 130 140 150 160 170 180 160 170 180 190 200 pF1KE5 SCLTRLDHNRAKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKV-KLQGKEVG--- . .. ..::: .: :: .. . .:.:::::: :...: :. ...: . .. : CCDS82 AVALGVE-GEAKAILARKLKTAPSYIKDVIIWGNISGNNYVDLRKTRVYRYESAIWGPLH 190 200 210 220 230 240 210 220 230 240 250 260 pF1KE5 ----VYEALKDDSWLKGEFVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEG : . . :. :.: :::. ... . .:.... . ::..: .. . :.: : CCDS82 YSRPVLNLIFDSEWVKREFVAILKN---LTTTGRQFGGIL-AAHSIATTLKYWYHGSPPG 250 260 270 280 290 270 280 290 300 310 320 pF1KE5 EFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEK :.::.:..:.:. .:.: ...:.:: ..: :: . : ..:.. : ...: .:: CCDS82 EIVSLGILSEGQ-FGIPKGIVFSMPVKFENGTWVVLTDLKDVEISEQIMTRMTSDLIQEK 300 310 320 330 340 350 330 pF1KE5 ESAFEFLSSA :. CCDS82 LVALGDKIHFQPYQSGHKDLVPDEEKNLAMSDAAEFPNQIPQTTFEKPQSLEFLNEFEGK 360 370 380 390 400 410 >>CCDS63102.1 MDH1B gene_id:130752|Hs108|chr2 (517 aa) initn: 364 init1: 212 opt: 453 Z-score: 588.4 bits: 117.9 E(32554): 1.9e-26 Smith-Waterman score: 453; 28.4% identity (62.0% similar) in 334 aa overlap (4-328:131-458) 10 20 30 pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKD :..: .:.:.. :.:. . .: ::: CCDS63 TELMMVIAQENLGAHIEKEQEEEALKTCINPLQVWITSASAPACYNLIPILTSGEVFGMH 110 120 130 140 150 160 40 50 60 70 80 90 pF1KE5 QPIILVLLDITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRR : ..:.: : ....: :: : :.:..: : . ::.. : ... . . 
CCDS63 TEISITLFDNKQAEEHLKSLVVETQDLASPVLRSVSICTKVEEAFRQAHVIVVLDDSTNK 170 180 190 200 210 220 100 110 120 130 140 150 pF1KE5 EGMERKDLLKANVKIFKSQGAALDKYAKKSVKVIVVGNP-ANTNCLTASKSAPSIPKENF : . .: :.. : . . : ..: :..::.::: : .: . . . :: : .. . CCDS63 EVFTLEDCLRSRVPLCRLYGYLIEKNAHESVRVIVGGRTFVNLKTVLLMRYAPRIAHNII 230 240 250 260 270 280 160 170 180 190 200 pF1KE5 SCLTRLDHNRAKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKV-KLQGKEVG--- . .. ..::: .: :: .. . .:.:::::: :...: :. ...: . .. : CCDS63 AVALGVE-GEAKAILARKLKTAPSYIKDVIIWGNISGNNYVDLRKTRVYRYESAIWGPLH 290 300 310 320 330 210 220 230 240 250 260 pF1KE5 ----VYEALKDDSWLKGEFVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEG : . . :. :.: :::. ... . .:.... . ::..: .. . :.: : CCDS63 YSRPVLNLIFDSEWVKREFVAILKN---LTTTGRQFGGIL-AAHSIATTLKYWYHGSPPG 340 350 360 370 380 390 270 280 290 300 310 320 pF1KE5 EFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEK :.::.:..:.:. .:.: ...:.:: ..: :: . : ..:.. : ...: .:: CCDS63 EIVSLGILSEGQ-FGIPKGIVFSMPVKFENGTWVVLTDLKDVEISEQIMTRMTSDLIQEK 400 410 420 430 440 450 330 pF1KE5 ESAFEFLSSA :. CCDS63 LVALGDKIHFQPYQSGHKDLVPDEEKNLAMSDAEFPNQIPQTTFEKPQSLEFLNEFEGKT 460 470 480 490 500 510 >>CCDS33365.1 MDH1B gene_id:130752|Hs108|chr2 (518 aa) initn: 292 init1: 212 opt: 453 Z-score: 588.4 bits: 117.9 E(32554): 1.9e-26 Smith-Waterman score: 453; 28.4% identity (62.0% similar) in 334 aa overlap (4-328:131-458) 10 20 30 pF1KE5 MSEPIRVLVTGAAGQIAYSLLYSIGNGSVFGKD :..: .:.:.. :.:. . .: ::: CCDS33 TELMMVIAQENLGAHIEKEQEEEALKTCINPLQVWITSASAPACYNLIPILTSGEVFGMH 110 120 130 140 150 160 40 50 60 70 80 90 pF1KE5 QPIILVLLDITPMMGVLDGVLMELQDCALPLLKDVIATDKEDVAFKDLDVAILVGSMPRR : ..:.: : ....: :: : :.:..: : . ::.. : ... . . CCDS33 TEISITLFDNKQAEEHLKSLVVETQDLASPVLRSVSICTKVEEAFRQAHVIVVLDDSTNK 170 180 190 200 210 220 100 110 120 130 140 150 pF1KE5 EGMERKDLLKANVKIFKSQGAALDKYAKKSVKVIVVGNP-ANTNCLTASKSAPSIPKENF : . .: :.. : . . : ..: :..::.::: : .: . . . :: : .. . 
CCDS33 EVFTLEDCLRSRVPLCRLYGYLIEKNAHESVRVIVGGRTFVNLKTVLLMRYAPRIAHNII 230 240 250 260 270 280 160 170 180 190 200 pF1KE5 SCLTRLDHNRAKAQIALKLGVTANDVKNVIIWGNHSSTQYPDVNHAKV-KLQGKEVG--- . .. ..::: .: :: .. . .:.:::::: :...: :. ...: . .. : CCDS33 AVALGVE-GEAKAILARKLKTAPSYIKDVIIWGNISGNNYVDLRKTRVYRYESAIWGPLH 290 300 310 320 330 210 220 230 240 250 260 pF1KE5 ----VYEALKDDSWLKGEFVTTVQQRGAAVIKARKLSSAMSAAKAICDHVRDIWFGTPEG : . . :. :.: :::. ... . .:.... . ::..: .. . :.: : CCDS33 YSRPVLNLIFDSEWVKREFVAILKN---LTTTGRQFGGIL-AAHSIATTLKYWYHGSPPG 340 350 360 370 380 390 270 280 290 300 310 320 pF1KE5 EFVSMGVISDGNSYGVPDDLLYSFPVVIKNKTWKFVEGLPINDFSREKMDLTAKELTEEK :.::.:..:.:. .:.: ...:.:: ..: :: . : ..:.. : ...: .:: CCDS33 EIVSLGILSEGQ-FGIPKGIVFSMPVKFENGTWVVLTDLKDVEISEQIMTRMTSDLIQEK 400 410 420 430 440 450 330 pF1KE5 ESAFEFLSSA :. CCDS33 LVALGDKIHFQPYQSGHKDLVPDEEKNLAMSDAAEFPNQIPQTTFEKPQSLEFLNEFEGK 460 470 480 490 500 510

334 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Tue Nov 8 00:59:30 2016 done: Tue Nov 8 00:59:30 2016
 Total Scan time: 2.290 Total Display time: 0.010

Function used was FASTA [36.3.4 Apr, 2011]