GENSCAN 1.0 Date run: 5-Nov-116 Time: 16:59:23 Sequence gi568815591f:74408081_74702430 : 294350 bp : 50.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 883 922 40 -2.06 1.01 Init + 42761 42877 117 1 0 84 80 83 0.227 7.30 1.02 Intr + 99995 100123 129 1 0 141 101 199 0.905 27.39 1.03 Intr + 104750 104891 142 1 1 99 78 215 0.998 21.53 1.04 Intr + 107361 107516 156 1 0 119 55 124 0.997 12.18 1.05 Intr + 110059 110242 184 2 1 105 105 295 0.900 31.75 1.06 Intr + 111329 111639 311 2 2 69 37 389 0.807 27.86 1.07 Intr + 113128 113217 90 2 0 99 91 131 0.962 14.47 1.08 Intr + 115991 116074 84 0 0 83 101 114 0.982 11.99 1.09 Intr + 118604 118908 305 0 2 64 27 228 0.165 10.61 1.10 Intr + 120458 120494 37 1 1 81 64 5 0.147 -4.76 1.11 Intr + 121654 121837 184 2 1 97 64 325 0.922 29.85 1.12 Intr + 127033 127058 26 1 2 146 91 42 0.999 7.97 1.13 Intr + 128087 128195 109 0 1 145 85 90 0.996 13.74 1.14 Intr + 130056 130093 38 0 2 131 94 44 0.990 7.21 1.15 Intr + 130600 130680 81 2 0 119 110 95 0.998 14.41 1.16 Intr + 131799 131888 90 1 0 46 110 62 0.942 4.17 1.17 Intr + 136675 136722 48 2 0 100 72 63 0.937 4.55 1.18 Intr + 137664 137729 66 1 0 92 101 119 0.726 12.48 1.19 Intr + 139023 139206 184 1 1 19 94 380 0.914 30.55 1.20 Intr + 147094 147143 50 1 2 97 75 52 0.891 3.12 1.21 Intr + 147358 147414 57 2 0 107 116 87 0.971 12.26 1.22 Intr + 149559 149642 84 1 0 90 115 117 0.955 14.39 1.23 Intr + 150781 150964 184 2 1 105 78 359 0.828 35.45 1.24 Intr + 151547 151575 29 2 2 156 99 0 0.723 5.66 1.25 Intr + 181771 181848 78 2 0 118 73 168 0.892 17.82 1.26 Intr + 182745 182937 193 1 1 124 81 446 0.975 46.05 1.27 Intr + 186934 186971 38 1 2 135 109 66 0.989 11.31 1.28 Intr + 192964 193100 137 2 2 83 48 113 0.968 7.09 1.29 Term + 194285 194353 69 1 0 99 41 81 0.965 2.44 1.30 PlyA + 199326 199331 6 1.05 2.00 Prom + 206322 206361 40 -3.26 2.01 Sngl + 242239 242787 549 0 0 76 38 337 0.856 23.61 2.02 PlyA + 243783 243788 6 1.05 3.00 Prom + 246736 246775 40 -7.26 3.01 Init + 248472 248672 201 2 0 88 31 190 0.489 12.18 3.02 Intr + 249176 249341 166 1 1 72 61 70 0.340 2.23 3.03 Intr + 249880 249988 109 2 1 92 100 33 0.610 4.24 3.04 Intr + 250897 250937 41 1 2 92 75 36 0.382 0.57 3.05 Intr + 281044 281147 104 2 2 107 77 98 0.761 10.49 3.06 Intr + 282893 283031 139 1 1 83 61 120 0.994 8.84 3.07 Intr + 290881 291015 135 2 0 104 53 134 0.999 12.04 3.08 Term + 292167 292354 188 1 2 101 45 135 0.932 8.15 3.09 PlyA + 292718 292723 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 118604 118953 350 0 2 64 36 233 0.818 10.35 S.002 Term + 164779 164904 126 0 0 122 45 66 0.824 3.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:74408081_74702430|GENSCAN_predicted_peptide_1|1099_aa MGATLWSNEKLQEEGSGDSQRKTDLLKVMQVIRQRGRGQATMALLGKRCDVPTNGCGPDR WNSAFTRKDEIITSLVSALDSMCSALSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKE LQSDFLRFCRGPPWKDPEAEHPKKVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVL YSEALGRASVVPLPYERLLREPGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFK LKRPLEDGGRDSKALVELNGVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYS TALPNHAIRELKQEAPSCPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQN VHASKRILFSIVHDKSEKWDAFIKETEDINTLRECVQILFNSRYVLKRQAPINQMNQTET NSQALKGPIAQGLGLGVAADVGGGGEVGRDALPAVEISKVSFDNELSVTSQWDLDAKAAN ADLGRCINRKTVSEAPKGGPKQGWRETVHIEMSVDVVSAEALGLDHMVPVPYRKIACDPE AVEIVGIPDKIPFKRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLE PASPPEDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGGLRPIKIEPEDLDIIQ VTVPDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPL RKQVELLFNTRYAKAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLRK ILEASNSIQFVIKRPELLTEGVKEPIMDSQERDSGDPLVDESLKRQGFQENYDARLSRID IANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGSVIIEGLPPGIPFRKPCTFGS QNLERILAVADKIKFTVTRPFQGLIPKPDEDDANRLGEKVILREQVKELFNEKYGEALGL NRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYDIHRLEKILKAREHVRMVIINQLQP FAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSSSSSSSSSSSNPDSVASANQISLVQWP MYMVDYAGLNVQLPGPLNY >gi568815591f:74408081_74702430|GENSCAN_predicted_CDS_1|3300_bp atgggggccacactgtggtccaatgagaagctgcaggaggagggaagtggagactcacag aggaaaactgacttgctcaaggtcatgcaggtgattagacaaagaggcaggggccaggcg accatggccttgctgggtaagcgctgtgacgtccccaccaacggctgcggacccgaccgc tggaactccgcgttcacccgcaaagacgagatcatcaccagcctcgtgtctgccttagac tccatgtgctcagcgctgtccaaactgaacgccgaggtggcctgtgtcgccgtgcacgat gagagcgcctttgtggtgggcacagagaaggggagaatgttcctgaatgcccggaaggag ctacagtcagacttcctcaggttctgccgagggcccccgtggaaggatccggaggcagag caccccaagaaggtgcagcggggcgagggtggaggccgtagcctccctcggtcctccctg gaacatggctcagatgtgtaccttctgcggaagatggtagaggaggtgtttgatgttctt tatagcgaggccctgggaagggccagtgtggtgccactgccctatgagaggctgctcagg gagccagggctgctggccgtgcaggggctgcccgaaggcctggccttccgaaggccagcc gagtatgaccccaaggccctcatggccatcctggaacacagccaccgcatccgcttcaag ctcaagaggccacttgaggatggcgggcgggactcgaaggccctggtggagctgaacggt gtctccctgattcccaaggggtcacgggactgtggcctgcatggccaggcccccaaggtg ccaccccaggacctgcccccaaccgccacctcctcctccatggccagcttcctgtacagc acggcgctccccaaccacgccatccgagagctcaagcaggaagcaccttcctgccccctt gcccccagcgacctgggcctgagtcggcccatgccagagcccaaggccaccggtgcccaa gacttctccgactgttgtggacagaagcccactgggcctggtgggcctctcatccagaac gtccatgcctccaagcgcattctcttctccatcgtccatgacaagtcagagaagtgggac gccttcataaaggaaaccgaggacatcaacacgctccgggagtgtgtgcagatcctgttt aacagcagatatgtgctgaaaagacaagccccaataaaccagatgaaccagacggagaca aattcacaggcactcaaaggaccaattgctcaagggctggggctgggggtggctgctgat gtgggaggtgggggggaagtggggagagatgctttgccagctgttgagatttcgaaggtt tcatttgacaacgagctaagtgtgacatctcagtgggatctggatgccaaagccgccaac gcagacctgggccgctgcatcaacaggaaaacagtgtcagaggcacccaagggagggccc aagcagggctggagggagacagtacacatagagatgagtgtggatgtagtgtcagcggaa gccctgggcctggaccacatggtccccgtgccctaccggaagattgcctgtgacccggag gctgtggagatcgtgggcatcccggacaagatccccttcaagcgcccctgcacttatgga gtccccaagctgaagcggatcctggaggagcgccatagtatccacttcatcattaagagg atgtttgatgagcgaattttcacagggaacaagtttaccaaagacaccacgaagctggag ccagccagcccgccagaggacacctctgcagaggtctctagggccaccgtccttgacctt gctgggaatgctcggtcagacaagggcagcatgtctgaagactgtgggccaggaacctcc ggggagctgggcgggctgaggccgatcaaaattgagccagaggatctggacatcattcag gtcaccgtcccagacccctcgccaacctctgaggaaatgacagactcgatgcctgggcac ctgccatcggaggattctggttatgggatggagatgctgacagacaaaggtctgagtgag gacgcgcggcccgaggagaggcccgtggaggacagccacggtgacgtgatccggcccctg cggaagcaggtggagctgctcttcaacacacgatacgccaaggccattggcatctcggag cccgtcaaggtgccgtactccaagtttctgatgcacccggaggagctgtttgtggtggga ctgcctgaaggcatctccctccgcaggcccaactgcttcgggatcgccaagctccggaag attctggaggccagcaacagcatccagtttgtcatcaagaggcccgagctgctcactgag ggagtcaaagagcccatcatggatagtcaagagagggattccggggaccctctggtggac gagagcctgaagagacagggctttcaagaaaattatgacgcgaggctctcacggatcgac atcgccaacacactaagggagcaggtccaggaccttttcaataagaaatacggggaagcc ttgggcatcaagtacccggtccaggtcccctacaagcggatcaagagtaaccccggctcc gtgatcatcgaggggctgcccccaggaatcccgttccgaaagccctgtaccttcggctcc cagaacctggagaggattcttgctgtggctgacaagatcaagttcacagtcaccaggcct ttccaaggactcatcccaaagcctgatgaagatgacgccaacagactcggggagaaggtg atcctgcgggagcaggtgaaggaactcttcaacgagaaatacggtgaggccctgggcctg aaccggccggtgctggtcccttataaactaatccgggacagcccagacgccgtggaggtc acgggtctgcctgatgacatccccttccggaaccccaacacgtacgacatccaccggctg gagaagatcctgaaggcccgagagcatgtccgcatggtcatcattaaccagctccaaccc tttgcagaaatctgcaatgatgccaaggtgccagccaaagacagcagcattcccaagcgc aagagaaagcgggtctcggaaggaaattccgtctcctcttcctcctcgtcttcctcttcc tcgtcctctaacccggattcagtggcatcggccaaccagatctcactcgtgcaatggcca atgtacatggtggactatgccggcctgaacgtgcagctcccgggacctcttaattactag >gi568815591f:74408081_74702430|GENSCAN_predicted_peptide_2|182_aa MAPKHKSSDAGNLDRPKRSRKVLPLSEKVKVLDLIRKDKKSYAEVAKIYGKNESSIREIV KKEKEIRASFAVSPPTAKVTATVRDKCLVKMEQALHLWVEEMNRKRVPIDSNMLRQKALS LYQDFSKGCSETDTKPFTASKGWLHRFRHRFSHHYKKKKGEYSTSYFEKEEREREMDHIR LT >gi568815591f:74408081_74702430|GENSCAN_predicted_CDS_2|549_bp atggccccaaagcacaagagtagtgatgctgggaatttggataggccaaagagaagccgt aaagtgcttcctctaagtgaaaaggtgaaagttctcgacttaatcaggaaagacaaaaaa tcctatgctgaggttgctaagatctacgggaagaatgaatcttccatccgtgaaattgtg aagaaggaaaaagaaattcgtgctagttttgctgtctcacctccaactgctaaagtgacg gccacagtgcgtgataagtgcttagttaagatggaacaggcactgcatttgtgggtggaa gagatgaacagaaaacgtgtccccattgacagcaacatgttgcgccagaaagccttgagc ctataccaagacttcagcaagggatgctctgaaactgacaccaagccatttactgcgagt aagggatggttacacagattcaggcatagattctcacatcattacaagaagaagaagggt gagtacagtacaagttattttgagaaagaggagagagagagagagatggatcacattcgc ctaacttag >gi568815591f:74408081_74702430|GENSCAN_predicted_peptide_3|360_aa MGNANRRTQIERKGFWNPHQKTGCSESFLASDKGLEAGKSVVVATYREAAGGGRMAVVAR IPSGWYWLPCLAICKECGVQEQVADLDGGWRSVGTTPEPPPVQISVSAPRLPSPTRAAAQ APAPLSRGCSAAAAARARPTLASPRHPRPRSCLEAAALGLWICADLRLGMFTGIMAQVAM STLPVEDEESSESRMVVTFLMSALESMCKELAKSKAEVACIAVYETDVFVVGTERGRAFV NTRKDFQKDFVKYCVEEEEKAAEMHKMKSTTQANRMSVDAVEIETLRKTVEDYFCFCYGK ALGKSTVVPVPYEKMLRDQSAVVVQGLPEGVAFKHPENYDLATLKWILENKAGISFIIKR >gi568815591f:74408081_74702430|GENSCAN_predicted_CDS_3|1083_bp atggggaatgccaatcggagaacccagattgaaaggaaaggtttttggaacccacatcaa aagactggctgctcggaaagttttctcgctagcgacaaaggcttggaggctggaaaaagc gtcgtggtcgccacctaccgagaagcggcgggtggggggaggatggccgtcgtggcccga atccccagtgggtggtactggcttccctgcctcgcaatttgcaaggaatgtggggtgcaa gagcaagtggcggacttggacggcggctggcgctcggtggggacgacgccggagccacca cccgtccagatttccgtttctgcaccccggctcccctcccccacccgggcggccgcgcag gccccagctcccctcagccgaggctgctccgcggcggccgcagcccgcgcgcggcccaca ctcgcctcccctcggcacccccggccccggagctgcctggaggcggccgcactcgggtta tggatttgcgctgatttgcggctggggatgttcacagggatcatggcccaagttgcaatg tccaccctccccgttgaagatgaggagtcctcggagagcaggatggtggtgacattcctc atgtcagctctcgagtccatgtgtaaagaactggccaagtccaaagccgaagtggcctgc attgcagtgtatgaaacagacgtgtttgtcgtcggaactgaaagaggacgtgcttttgtc aataccagaaaggattttcaaaaagattttgtaaaatattgtgttgaagaagaagaaaaa gctgcagagatgcataaaatgaaatctacaacccaggcaaatcggatgagtgtagatgct gtagaaattgaaacactcagaaaaacagttgaggactatttctgcttttgctatgggaaa gctttaggcaaatccacagtggtacctgtaccatatgagaagatgctgcgagaccagtcg gctgtggtagtgcaggggcttccggaaggtgttgcctttaaacaccccgagaactatgat cttgcaaccctgaaatggattttggagaacaaagcagggatttcattcatcattaagagg tga