GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:39:43 Sequence gi568815595f:14903915_15143005 : 239091 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3726 3797 72 2 0 114 96 72 0.996 10.20 1.02 Intr + 6947 7015 69 1 0 136 119 48 0.999 12.08 1.03 Intr + 13335 13418 84 2 0 91 94 176 0.574 18.52 1.04 Intr + 14436 14645 210 2 0 67 92 54 0.735 2.81 1.05 Intr + 14840 14919 80 1 2 65 80 82 0.546 3.45 1.06 Intr + 15776 15825 50 2 2 46 81 49 0.422 -1.68 1.07 Intr + 17758 17952 195 2 0 84 39 101 0.550 4.19 1.08 Intr + 17991 18103 113 1 2 70 105 84 0.773 8.40 1.09 Intr + 18497 18634 138 1 0 49 113 53 0.683 4.56 1.10 Intr + 19132 19261 130 0 1 23 76 266 0.999 19.17 1.11 Intr + 20088 20224 137 1 2 83 88 141 0.745 13.89 1.12 Intr + 22156 22284 129 0 0 114 52 175 0.985 17.39 1.13 Intr + 28663 28817 155 0 2 140 80 89 0.876 12.17 1.14 Intr + 43473 43992 520 0 1 38 83 286 0.249 16.06 1.15 Intr + 44410 44501 92 0 2 79 54 23 0.116 -3.21 1.16 Intr + 55243 55282 40 1 1 128 94 44 0.295 7.23 1.17 Intr + 99962 100072 111 1 0 107 82 51 0.217 6.98 1.18 Intr + 109675 109875 201 0 0 67 110 165 0.946 16.08 1.19 Intr + 112238 112340 103 1 1 74 96 127 0.999 11.85 1.20 Intr + 116839 117018 180 2 0 93 82 144 0.999 14.14 1.21 Intr + 119286 119433 148 1 1 24 86 196 0.999 12.29 1.22 Intr + 120201 120294 94 0 1 50 61 93 0.993 2.77 1.23 Intr + 124672 124805 134 0 2 93 76 73 0.993 6.04 1.24 Intr + 126361 126538 178 1 1 85 113 91 0.990 11.22 1.25 Intr + 128465 128586 122 1 2 119 106 85 0.998 12.69 1.26 Intr + 130756 130895 140 1 2 81 97 177 0.999 18.01 1.27 Intr + 134086 134223 138 2 0 66 111 183 0.999 18.84 1.28 Intr + 135208 135313 106 2 1 87 93 100 0.515 9.67 1.29 Term + 138920 139094 175 2 1 69 43 102 0.382 1.03 1.30 PlyA + 143753 143758 6 1.05 2.05 PlyA - 144623 144618 6 1.05 2.04 Term - 148719 148527 193 2 1 73 48 199 0.900 11.29 2.03 Intr - 149553 149466 88 1 1 105 61 131 0.999 11.13 2.02 Intr - 155561 155455 107 1 2 105 95 21 0.653 4.36 2.01 Init - 161280 161147 134 0 2 111 82 216 0.981 20.91 2.00 Prom - 165660 165621 40 -5.96 3.15 PlyA - 166177 166172 6 1.05 3.14 Term - 171016 169868 1149 1 0 120 47 1192 0.996 110.22 3.13 Intr - 171796 171692 105 1 0 90 89 67 0.993 7.41 3.12 Intr - 173250 173148 103 2 1 63 103 23 0.923 1.48 3.11 Intr - 174247 174161 87 0 0 78 116 92 0.999 10.09 3.10 Intr - 176888 176818 71 2 2 91 80 65 0.985 3.98 3.09 Intr - 178694 178453 242 0 2 71 45 445 0.936 35.77 3.08 Intr - 180982 180821 162 2 0 60 58 176 0.906 11.75 3.07 Intr - 181131 181086 46 0 1 62 78 12 0.929 -4.42 3.06 Intr - 182047 181947 101 2 2 120 94 47 0.934 8.33 3.05 Intr - 186625 186485 141 2 0 72 91 108 0.999 9.82 3.04 Intr - 192225 192059 167 2 2 81 69 127 0.504 9.70 3.03 Intr - 195372 195186 187 1 1 53 1 154 0.246 1.95 3.02 Intr - 220559 220386 174 0 0 20 84 128 0.277 5.51 3.01 Intr - 229921 229892 30 2 0 108 65 23 0.068 0.10 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:14903915_15143005|GENSCAN_predicted_peptide_1|1347_aa ENLQKLVHIEHSVRGQGDLLQPGREFLKEGTLMKVTGKNRRPRHLFLMNDVLLYTYPQKD GKYRLKNTLAVANMKALYHGEGEGGSTFLSMEVCSLLEPKAPPRSLLEKGMGDVVTGRYL SNMTVHLGLPGLGPEHDALQPSQRWVSRPVMEKVPYALKIETSESCLMLSARLQVRKSKV KALTDSVSAALGVRGISLFQCKKKQTQGQLMDQWSARKPSLAGDLFFAGGSGQCERCRLK GHLSENLIHAEMEAHARSSCAERDEWYGCLSRALPEDYKAQALAAFHHSVEIRERLGVSL GERPPTLVPVTHVMMCMNCGCDFSLTLRRHHCHACGKIVCRNCSRNKYPLKYLKDRMAKV CDGCFGELKKRGRAVPGLMRVTERPVSMSFPLSSPRFSGSAFSSVFQSINPSTFKKQKKV PSALTEVAASGEGSAISGYLSRCKRGKRHWKKLWFVIKGKVLYTYMASEDKVALESMPLL GFTIAPEKEEGSSEVGPIFHLYHKKTLFYSFKAEDTNSAQSHISARPAAAKAGGSYVAAA ARGPTAHGPDGPSAGCGVPVRARASANRVLARPPPRRRGWRWEEEEEHSPSSYQRAGGGG GGTGVTNSDLSLTKTITNPPFSATPAAPPGPSPPPRSHLGVSSLARCPASPRPPGSRHPP TPGARAPRALPSAPGAARRRHGTRFEAALAPNPESAFGGRRSPTAVFSEGEGKRKGGRRV RGATSTAYGCGSSERSSQVTRTQTSRPESPGMTSPSPRIQIISTDSAVASPQRIQIVTDQ QTGQKIQIVTAVDASGSPKQQFILTSPDGAGTGKVILASPETSSAKQLIFTTSDNLVPGR IQIVTDSASVERLLGKTDVQRPQVVEYCVVCGDKASGRHYGAVSCEGCKGFFKRSVRKNL TYSCRSNQDCIINKHHRNRCQFCRLKKCLEMGMKMESVQSERKPFDVQREKPSNCAASTE KIYIRKDLRSPLIATPTFVADKDGARQTGLLDPGMLVNIQQPLIREDGTVLLATDSKAET SQGALGTLANVVTSLANLSESLNNGDTSEIQPEDQSASEITRAFDTLAKALNTTDSSSSP SLADGIDTSGGGSIHVISRDQSTPIIEVEGPLLSDTHVTFKLTMPSPMPEYLNVHYICES ASRLLFLSMHWARSIPAFQALGQDCNTSLVRACWNELFTLGLAQCAQVMSLSTILAAIVN HLQNSIQEDKLSGDRIKQVMEHIWKLQEFCNSMAKLDIDGYEYAYLKAIVLFSPDHPGLT STSQIEKFQEKAQMELQDYVQKTYSEDTYRLARILVRLPALRLMSSNITEELFFTGLIGN VSIDSIIPYILKMETAEYNGQITGASL >gi568815595f:14903915_15143005|GENSCAN_predicted_CDS_1|4044_bp gaaaacctgcagaagctggtccacattgagcacagcgtccggggccaaggggatctcctc cagccaggaagggagtttctgaaggaagggacgctgatgaaagtaacagggaaaaacaga cggccccggcacctatttctgatgaacgatgtgctcctgtacacctatccccagaaggat gggaagtaccggctgaagaacacattggctgtggccaacatgaaggctctttaccatggg gaaggggaaggaggaagcacctttctcagcatggaggtttgttcccttttggaaccaaag gctccaccgaggagcctgttagaaaaaggcatgggagacgtggtcactggcaggtacttg tccaacatgacagtgcacctggggttgcccgggctgggccctgagcatgacgctctgcag ccttcccagcggtgggtcagccgccctgtgatggagaaagtgccctacgctctaaagatt gagacttccgagtcctgcctgatgctgtctgcgaggctgcaggtcaggaagtccaaggtc aaggcactgactgattcggtgtctgcagccctgggagttaggggaatatcattattccag tgtaagaagaaacagacccaaggacagctaatggaccagtggtctgctcgtaaacctagt ctggcaggtgatctcttctttgctggtggttctgggcagtgtgagaggtgcaggctcaag gggcatctgagtgagaacctcatccatgccgagatggaggcccatgcccgcagctcctgt gcagagagggacgagtggtatggctgtctgagcagagccctccctgaggactacaaggcc caggcgctggctgcattccaccatagcgtggagatacgagagaggctgggggttagcctt ggggagaggccccccaccctggtgcctgtcacacacgtcatgatgtgcatgaactgcggc tgcgacttctccctcaccctgcggcgtcatcactgtcacgcctgtggcaagatcgtgtgc cggaactgttcgcggaacaagtacccgctgaagtacctgaaggacaggatggccaaggtc tgcgacggctgcttcggggagctgaagaagcggggcagggctgtcccgggcctgatgaga gttacagagcggcctgtgagcatgagcttcccgctgtcttcaccccgcttctcgggcagt gccttttcatccgtcttccagagcattaacccctcgaccttcaagaagcagaagaaagtc ccttcagccctgacagaggtggctgcctctggagagggctctgccatcagtggctatctc agccggtgtaagaggggcaagcggcactggaagaagctctggtttgtcatcaaaggcaaa gttctctacacctacatggccagtgaggacaaagtggccttggagagtatgcctctgcta ggcttcaccattgctccagaaaaggaagagggcagcagtgaagtaggacctatttttcac ctttaccacaagaaaaccctattttatagcttcaaagcagaagataccaattcagctcag agccacatatccgcccgcccggccgcggccaaggccggcggcagctacgtagccgccgcc gcccggggccccacagcccacgggccggatggaccgagcgccggctgcggcgttccggtg cgcgcgcgcgcctcagccaatcgcgttctcgcccgccccccgccgcggcggcgaggctgg cgctgggaagaggaagaagaacattcgccctcctcctaccagcgcgcgggcggcggcggc ggcggcaccggggtcacgaactctgacctttcactgacaaaaacaataacaaaccccccc ttctccgcgaccccggcggcgcccccgggcccgtccccgccaccccgctcccacctcggc gtctcgtctctcgcccgctgccccgcgagcccgcggcccccgggctcccgccatccgccg acaccgggagcccgggctccccgcgccctgccctccgcgccgggggccgcccgccgcaga cacgggacccgcttcgaggccgctttggcgccaaatcctgagtccgcctttggcggccgt cgcagcccaaccgccgttttttctgaaggggagggtaaaagaaaaggaggaaggcgggtg cgaggcgcgacctcgaccgcctatggctgcggtagcagtgagagaagctctcaggtaaca cgtacacagacctctcggccggaatctccagggatgaccagcccctccccacgcatccag ataatctccaccgactctgctgtagcctcacctcagcgcattcagattgtcacagaccag cagacaggacagaaaatccagatagtcaccgcagtggacgcctccggatcccccaaacag cagttcatcctgaccagcccagatggagctggaactgggaaggtgatcctggcttcccca gagacatccagcgccaagcaactcatattcaccacctcagacaacctcgtccctggcagg atccagattgtcacggattctgcctctgtggagcgtttactggggaagacggacgtccag cggccccaggtggtagagtactgtgtggtctgtggcgacaaagcctccggccgtcactat ggggctgtcagttgtgaaggttgcaaaggtttcttcaaaaggagtgtgaggaaaaatttg acctacagctgccggagcaaccaagactgcatcatcaataaacatcaccggaaccgctgt cagttttgccggctgaaaaaatgcttagagatgggcatgaaaatggaatctgtgcagagt gaacggaagcccttcgatgtgcaacgggagaaaccaagcaattgtgctgcttcaactgag aaaatctatatccggaaagacctgagaagtcccctgatagctactcccacgtttgtggca gacaaagatggagcaagacaaacaggtcttcttgatccagggatgcttgtgaacatccag cagcctttgatacgtgaggatggtacagttctcctggccacggattctaaggctgaaaca agccagggagctctgggcacactggcaaatgtagtgacctcccttgccaacctaagtgaa tctttgaacaacggtgacacttcagaaatccagccagaggaccagtctgcaagtgagata actcgggcatttgataccttagctaaagcacttaataccacagacagctcctcttctcca agcttggcagatgggatagacaccagtggaggagggagcatccacgtcatcagcagagac cagtcgacacccatcattgaggttgaaggccccctcctttcagacacacacgtcacattt aagctaacaatgcccagtccaatgccagagtacctcaacgtgcactacatctgtgagtct gcatcccgtctgcttttcctctcaatgcactgggctcggtcaatcccagcctttcaggca cttgggcaggactgcaacaccagccttgtgcgggcctgctggaatgagctcttcaccctc ggcctggcccagtgtgcccaggtcatgagtctctccaccatcctggctgccattgtcaac cacctgcagaacagcatccaggaagataaactttctggtgaccggataaagcaagtcatg gagcacatctggaagctgcaggagttctgtaacagcatggcgaagctggatatagatggc tatgagtatgcataccttaaagctatagttctctttagccccgatcatccaggtttgacc agcacaagccagattgaaaaattccaagaaaaggcacagatggagttgcaggactatgtt cagaaaacctactcagaagacacctaccgattggcccggatcctcgttcgcctgccggca ctcaggctgatgagctccaacataacagaagaacttttttttactggtctcattggcaat gtttcgatagacagcataatcccctacatcctcaagatggagacagcagagtataatggc cagatcaccggagccagtctatag >gi568815595f:14903915_15143005|GENSCAN_predicted_peptide_2|173_aa MPMKGRFPIRRTLQYLSQGNVVFKDSVKVMTVNYNTHGELGEGARKFVFFNIPQIQYKNP WVQIMMFKNMTPSPFLRFYLDSGEQVLVDVETKSNKEIMEHIRKILGKNEETLREEEEEK KQLSHPANFGPRKYCLRECICEVEGQVPCPSLVPLPKEMRGKYKAALKADAQD >gi568815595f:14903915_15143005|GENSCAN_predicted_CDS_2|522_bp atgcccatgaagggccgcttccccatccgccgcaccctgcaatatctgagccaggggaac gtggtgttcaaggactccgtgaaggtcatgacagtgaattacaacacgcatggggagctg ggcgagggcgccaggaagtttgtgtttttcaacatacctcagattcaatacaaaaaccct tgggtgcagatcatgatgtttaagaacatgacgccgtcacccttcctgcgattctactta gattctggggagcaggtcctggtggatgtggagaccaagagcaataaggagatcatggag cacatcagaaaaatcttggggaagaatgaggaaaccctcagggaagaggaggaggagaaa aagcagctttctcacccagccaacttcggccctcgaaagtactgcctgcgggagtgcatc tgtgaagtggaagggcaggtgccctgccccagcctggtgccattacccaaggagatgagg gggaagtacaaagccgctctgaaagccgatgcccaggactaa >gi568815595f:14903915_15143005|GENSCAN_predicted_peptide_3|921_aa XSPEKEQRDRATSANESNVDFLSLPDTTSPEIRLDHSPPVPDRSISALEHIPQTFPKPGT GLHTSTPSSPAQKPPARPPLFESVSALFAPSKTRRSTGVYQLVFRFRRRRRRTCRVPSDR WRWRLRLSDGRVESGTAMASLDDPGEVREGFLCPLCLKDLQSFYQLHSHYEEEHSGEDRD VKGQIKSLVQKAKKAKDRLLKREGDDRAESGTQGYESFSYGGVDPYMWEPQELGAVRSHL SDFKKHRAARIDHYVVEVNKLIIRLEKLTAFDRTNTESAKIRAIEKSVVPWVNDQDVPFC PDCGNKFSIRNRRHHCRLCGSIMCKKCMELISLPLANKLTSASKESLSTHTSPSQSPNSV HGSRRGSISSMSSVSSVLDEKDDDRIRCCTHCKDTLLKREQQIDEKEHTPDIVKLYEKLR LCMEKVDQKAPEYIRMAASLNAGETTYSLEHASDLRVEVQKVYELIDALSKKILTLGLNQ DPPPHPSNLRLQRMIRYSATLFVQEKLLGLMSLPTKEQFEELKKKRKEEMERKRAVERQA ALESQRRLEERQSGLASRAANGEVASLRRGPAPLRKAEGWLPLSGGQGQSEDSDPLLQQI HNITSFIRQAKAAGRMDEVRTLQENLRQLQDEYDQQQTEKAIELSRRQAEEEDLQREQLQ MLRERELEREREQFRVASLHTRTRSLDFREIGPFQLEPSREPRTHLAYALDLGSSPVPSS TAPKTPSLSSTQPTRVWSGPPAVGQERLPQSSMPQQHEGPSLNPFDEEDLSSPMEEATTG PPAAGVSLDPSARILKEYNPFEEEDEEEEAVAGNPFIQPDSPAPNPFSEEDEHPQQRLSS PLVPGNPFEEPTCINPFEMDSDSGPEAEEPIEEELLLQQIDNIKAYIFDAKQCGRLDEVE VLTENLRELKHTLAKQKGGTD >gi568815595f:14903915_15143005|GENSCAN_predicted_CDS_3|2766_bp ngttctccagagaaagagcagagagatagagcaacctcagccaatgagtccaatgtcgac ttcctgtccttgcctgacactaccagccctgagatcagacttgaccattcacctccagta cctgataggtccatcagtgctttggaacatatcccacaaacattccccaaaccaggcacc ggactccacacatcaacaccgtcatcgcctgcgcagaagccgccggcccgccctccacta ttcgaatccgtatccgcccttttcgcgccctcgaagactcggcgctcgacaggcgtctac cagcttgtcttccggtttcgtcgccgccgcaggcgcacctgccgagttccgagcgaccga tggagatggcggctgcggctgagtgacggacgggttgagagcggcactgccatggcttct ctggacgacccaggggaagtgagggagggcttcctctgccctctgtgcctgaaggatctg cagtctttctatcagcttcactcacattacgaggaagaacactcaggggaagaccgtgat gtcaaagggcaaattaaaagtcttgtccagaaggctaaaaaagcaaaggacaggttgttg aaacgagaaggggatgatcgagcagagtcagggacccaaggatatgagtctttcagctat ggaggggttgatccttacatgtgggaaccccaggagcttggtgctgtgagaagccatctt tccgacttcaaaaaacaccgagctgctagaattgaccactatgttgtggaagtcaataaa ctaataatcaggttagagaagctcactgcatttgacagaacaaatactgagtctgcaaag attcgagcaatagaaaagtctgtggtgccttgggtcaacgaccaggatgtccctttctgt ccagactgtgggaataagttcagcatccggaaccgccgccaccactgccgcctctgcggg tctattatgtgcaagaagtgtatggagctcatcagccttcccttggcaaacaagctcacc agtgccagcaaggagtccctgagcacccacaccagccccagccagtcacccaacagtgtc catggctcccgccgaggcagcatcagcagcatgagcagtgtcagctcggtcctggatgag aaggacgatgaccggatccgctgctgtacacactgcaaggacacgctgctcaagagagag cagcagattgatgagaaggagcacacacctgacatcgtgaagctctacgagaaattacga ctttgcatggagaaagttgaccagaaagctccagaatacatcaggatggcagcatcatta aatgctggggagacaacctacagtctggaacatgccagtgaccttcgagtggaagtgcag aaagtgtatgagctgatagacgctttaagtaagaagatcttaaccttgggcttgaaccag gaccctccaccacatccaagcaatttgcggctgcagagaatgatcagatactcagctaca ctttttgtgcaggaaaagttgcttggtttgatgtcactgccaaccaaagaacagtttgag gaactgaaaaagaaaaggaaggaggaaatggagaggaagagggccgtggagagacaagct gccctggagtcccagcgaaggcttgaggaaaggcagagtggcctggcttctcgagcggcc aacggggaggtggcatctctccgcaggggccctgcccccttgagaaaggctgagggctgg ctcccactgtcaggaggtcaggggcagagtgaggactcagacccgctcctccagcagatc cacaacatcacatcattcatcaggcaggccaaggccgcgggccgcatggatgaagtgcgc actctgcaggagaacctgcggcagctgcaggacgagtatgaccagcagcagacagagaag gccatcgagctgtcccggaggcaggctgaggaggaggacctgcagcgggaacagctgcag atgttgcgtgaacgggagttggaacgagaaagggagcagtttcgggtggcatccctgcac acacggactcggtccctggacttcagagaaatcggcccttttcagctggagcccagcaga gagcctcgcacccaccttgcttatgctttggatctaggctcttccccagttccaagcagc acagctcccaagaccccttcacttagctcaactcaacccaccagagtgtggtctgggccc ccagccgttggccaggagcgcttaccccagagcagcatgccacagcaacatgaggggccc tccttaaacccctttgatgaggaagacctctccagccccatggaagaggccactactggt cctcctgctgcaggggtttccttagacccttcagcccgcatcctgaaagagtacaatcct ttcgaggaagaggacgaggaggaggaagcagtggcagggaatccattcattcagccagac agcccagctcctaaccccttcagtgaggaagacgaacatccccagcagaggctctcaagc cctctggttcctggtaacccctttgaggaacccacctgtatcaacccctttgagatggac agtgacagtgggccagaggctgaggagcccatagaggaagagctcctcctgcagcagatc gataacatcaaggcatacatctttgatgccaagcagtgcggccgcctggatgaggtagag gtgctgacagagaatctgcgggagctgaagcacaccctggccaagcagaaggggggcact gactga