GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:07:21 Sequence gi568815586r:110437040_110683031 : 245992 bp : 44.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.14 Intr - 113 45 69 2 0 120 86 -20 0.194 0.38 1.13 Intr - 3349 3273 77 2 2 77 94 140 0.578 12.73 1.12 Intr - 8512 8413 100 1 1 77 89 21 0.515 0.78 1.11 Intr - 16832 16704 129 2 0 38 86 139 0.995 9.59 1.10 Intr - 18643 18547 97 0 1 70 107 61 0.997 6.21 1.09 Intr - 18891 18776 116 0 2 96 81 104 0.991 9.75 1.08 Intr - 20595 20471 125 1 2 47 47 74 0.890 -0.50 1.07 Intr - 22823 22656 168 0 0 76 68 210 0.929 17.72 1.06 Intr - 28175 28067 109 2 1 83 119 163 0.619 18.76 1.05 Intr - 31948 31688 261 1 0 58 52 225 0.455 13.58 1.04 Intr - 55083 54996 88 2 1 104 9 66 0.008 0.27 1.03 Intr - 56192 55957 236 2 2 60 92 121 0.911 6.19 1.02 Intr - 59164 58973 192 1 0 23 49 151 0.135 4.39 1.01 Init - 65309 65250 60 2 0 81 49 86 0.059 5.48 1.00 Prom - 67624 67585 40 -2.06 2.00 Prom + 68253 68292 40 -3.86 2.01 Init + 81177 81184 8 2 2 40 103 0 0.317 -2.87 2.02 Intr + 82755 82877 123 0 0 69 116 36 0.793 4.20 2.03 Intr + 85138 85372 235 1 1 73 93 67 0.825 3.39 2.04 Term + 90333 90482 150 2 0 103 53 77 0.759 3.61 2.05 PlyA + 90604 90609 6 1.05 3.07 PlyA - 93948 93943 6 1.05 3.06 Term - 100056 99998 59 1 2 97 37 61 0.700 -0.25 3.05 Intr - 101234 101105 130 2 1 72 99 114 0.997 11.17 3.04 Intr - 102906 102783 124 2 1 106 86 119 0.779 14.09 3.03 Intr - 109039 108841 199 2 1 59 89 177 0.997 13.31 3.02 Intr - 114929 114750 180 0 0 101 123 22 0.880 6.84 3.01 Init - 145992 145770 223 0 1 68 109 563 0.886 53.32 3.00 Prom - 166170 166131 40 -2.86 4.00 Prom + 175103 175142 40 -4.96 4.01 Init + 177144 177363 220 2 1 65 76 249 0.977 18.19 4.02 Intr + 182797 182917 121 2 1 95 113 77 0.811 10.45 4.03 Intr + 189323 189453 131 2 2 91 93 -12 0.974 -0.06 4.04 Intr + 191728 191879 152 2 2 59 83 128 0.480 9.28 4.05 Intr + 195433 195520 88 0 1 97 110 -7 0.452 1.94 4.06 Intr + 197631 197740 110 1 2 18 90 86 0.399 1.80 4.07 Intr + 199442 199462 21 1 0 95 116 5 0.397 1.74 4.08 Intr + 203344 203478 135 0 0 74 92 127 0.279 12.46 4.09 Intr + 203985 204110 126 2 0 71 110 29 0.943 4.28 4.10 Intr + 207953 208090 138 1 0 35 98 149 0.810 11.26 4.11 Intr + 210172 210297 126 0 0 78 80 40 0.807 3.08 4.12 Term + 210710 210853 144 1 0 89 45 59 0.893 -0.39 4.13 PlyA + 211541 211546 6 -0.45 5.07 PlyA - 211763 211758 6 1.05 5.06 Term - 212709 212563 147 0 0 104 42 29 0.517 -2.20 5.05 Intr - 213241 213129 113 2 2 71 116 65 0.859 7.70 5.04 Intr - 214409 214178 232 2 1 147 88 314 0.999 34.65 5.03 Intr - 218299 218195 105 1 0 123 54 217 0.961 22.21 5.02 Intr - 224409 224125 285 0 0 99 96 188 0.773 18.24 5.01 Init - 240399 240253 147 0 0 76 100 56 0.484 5.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 23341 23314 28 1 1 34 89 1 0.855 -5.44 S.002 Sngl + 23949 24197 249 2 0 75 41 356 0.997 24.68 S.003 Term - 27500 27490 11 0 2 108 40 12 0.925 -3.24 S.004 Intr + 49286 49387 102 1 0 95 32 101 0.836 5.57 S.005 Term + 49495 49665 171 0 0 58 54 93 0.825 0.73 S.006 Term - 55083 54966 118 2 1 104 42 71 0.966 2.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:110437040_110683031|GENSCAN_predicted_peptide_1|609_aa MAAIRPKKQARKARLSSTAQLVLVLGDLHIPHRCNSLPAKFKKLLVPGKIQHILCTGNLC TKESYDYLKTLAGDVHIVRGDFDENLNYPEQKVVTVGQFKIGLIHGHQVIPWGDMASLAL LQRQFDVDILISGHTHKFEAFEHENKFYINPGSATGAYNALETNIIPSFVLMDIQASTVV TYVYQLIGDDVKRAALQKSYAPSSRTPDPGRASPRRRDREPCAGAAVPASLPRQHACVRL LVFRRGGSDAGFSPLRSRSDTARTASGERGGQIRDPGVTSTYCATMVQHCEALNRSVQVV NLDPAAEHFNYSVMADIRELIEVDDVMEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGH VEDDYILFDCPGQIELYTHLPVMKQLVQQLEQWEFRVCGVFLVDSQFMVESFKFISGILA ALSAMISLEIPQVNIMTKMDLLSKKAKKEIEKFLDPDMYSLLEDSTSDLRSKKFKKLTKA ICGLIDDYSMVRFLPYDQSDEESMNIVLQHIDFAIQYGEDLEFKEPKAYHSSLMDPDTKL IGNMALLPIRSQFKGPAPRETKDTDIVDEAIYYFKANVFFKNYEIKNEADRTLIYITLYI SECLKKLQK >gi568815586r:110437040_110683031|GENSCAN_predicted_CDS_1|1827_bp atggctgccatccgccctaaaaagcaggctcggaaggcgcgcctcagctcgacggcccag ttggtgttggtattaggagatctgcacatcccacaccggtgcaacagtttgccagctaaa ttcaaaaaactcctggtgccaggaaaaattcagcacattctctgcacaggaaacctttgc accaaagagagttatgactatctcaagactctggctggtgatgttcatattgtgagagga gacttcgatgagaatctgaattatccagaacagaaagttgtgactgttggacagttcaaa attggtctgatccatggacatcaagttattccatggggagatatggccagcttagccctg ttgcagaggcaatttgatgtggacattcttatctcgggacacacacacaaatttgaagca tttgagcatgaaaataaattctacattaatccaggttctgccactggggcatataatgcc ttggaaacaaacattattccatcatttgtgttgatggatatccaggcttctacagtggtc acctatgtgtatcagctaattggagatgatgtgaaacgggcggctctgcagaagagctac gctccgtccagtcggaccccggaccctggccgggcatctccgcggcgccgagaccgcgag ccgtgtgcgggagcagctgtcccagcatccctcccccgacagcacgcgtgcgtcaggctg ctggtattccggagaggcggctcggatgctggcttctcgcccctccgaagccgttcggac actgcccgaacagcttcgggagagcggggcgggcagatccgcgaccccggagttacgagc acctactgtgccaccatggtccagcactgtgaagccctcaaccggtctgtccaagttgta aacctggatccagcagcagaacacttcaactactccgtgatggctgacatccgggaactg atcgaggtggatgatgtaatggaggatgattctctgcgattcggtcccaacggaggattg gtattttgcatggagtactttgccaataattttgactggctggagaactgtcttggccat gtagaggacgactatatcctttttgattgtccaggtcagattgagttgtacactcacctg cctgtgatgaaacagctggtccagcagctcgagcagtgggagttccgagtctgtggagtt tttcttgttgattctcagttcatggtggagtcattcaagtttatttctggcatcttggca gccctgagtgccatgatctctctagaaattccgcaagtcaacatcatgacaaaaatggat ctgctgagtaaaaaagcaaaaaaggaaattgagaaatttttagatccagacatgtattct ttattagaagattctacaagtgacttaagaagcaaaaaattcaagaaactgactaaagct atatgtggactgattgatgactacagcatggttcgatttttaccttacgatcagtcagat gaagaaagcatgaacattgtattgcagcatattgattttgccattcaatatggagaagac ctagaatttaaagaaccaaaggcttaccactcttctctcatggatcctgataccaaactc atcggaaacatggcactgttgcctatcagaagtcaattcaaaggacctgcccccagagag acaaaagatacagatattgtggatgaagccatctattacttcaaggccaatgtcttcttc aaaaactatgaaattaagaatgaagctgataggaccttgatatatataactctctacatt tctgaatgtctgaagaaactgcaaaag >gi568815586r:110437040_110683031|GENSCAN_predicted_peptide_2|171_aa MCIPLALSIDDMLVEANFILATLADEQSRASSPQSLCLSQKRKRSDLIEKKAGKNVTGQA LECISKKAAPRRLYPKETLTNISALENCGSPAMKRVDGDVSEVSESSVSNTEEVPGSLCL RKCDITSSEVHGLSTWTSGNHFPACHSQYHGYLESRFNSCKASLAEGIFTF >gi568815586r:110437040_110683031|GENSCAN_predicted_CDS_2|516_bp atgtgcatacctctggctttgagtattgatgatatgttagtggaagctaactttattttg gccacattagctgatgaacaaagtagagcatcttcaccacagtcactgtgtctttcacag aaacgaaaaaggtcagatctgattgaaaaaaaggctggcaaaaatgtaactggccaggcc ctggaatgtatttcaaaaaaagcagcaccaagaaggctttatcctaaggagactctcaca aacatatctgcattggaaaactgtggcagccctgcaatgaaaagagtggatggagatgtc agtgaagtatcagaaagcagtgtcagcaacacagaggaagtgccagggtctctgtgtctc agaaagtgtgacataacatcttcagaggttcatgggcttagtacatggacctctgggaac cattttcctgcctgccacagccagtatcacggctatctggagagcaggttcaacagctgc aaggcaagccttgcagagggcatcttcaccttctga >gi568815586r:110437040_110683031|GENSCAN_predicted_peptide_3|304_aa MFSVLSYGRLVARAVLGGLSQTDPRAGGGGGGDYGLVTAGCGFGKDFRKGLLKKGACYGD DACFVARHRSADVLGVADGVGGWRDYGVDPSQFSGTLMRTCERLVKEGRFVPSNPIGILT TSYCELLQNKVPLLGSSTACIVVLDRTSHRLHTANLGDSGFLVVRGGEVVHRSDEQQHYF NTPFQLSIAPPEAEGVVLSDSPDAADSTSFDVQLGDIILTATDGLFDNMPDYMILQELKK LKNSNYESIQQTARSIAEQAHELAYDPNYMSPFAQFACDNGLNVRGGKPDDITVLLSIVA EYTD >gi568815586r:110437040_110683031|GENSCAN_predicted_CDS_3|915_bp atgttctcggtcctctcgtacgggcggctggtggcccgcgccgtgctcggcggcctctcg cagaccgaccccagggccggcggcggcggcggcggcgactacggactggtgacggccggc tgcggcttcgggaaggacttccgtaagggcctcctcaagaagggcgcgtgctacggggac gacgcgtgcttcgtggcccggcaccgttccgcggacgtgctcggggttgcagatggtgta ggaggctggagagactatggagttgatccatctcaattctcagggactttaatgcggacg tgtgaacgtttagtaaaagaaggacggttcgtacctagtaatcccattggaattctcacc acaagctactgtgagttgctgcaaaataaagtccctttgctcggtagcagcaccgcctgc attgtggtgctggacagaaccagccaccgcttacacacagcaaacctgggcgattcaggc ttcctggttgtcaggggtggtgaagtcgtgcaccgatcagatgagcagcagcattacttc aacactccattccagctctcaatcgctccccctgaagccgagggagtcgtcttgagcgac agtccggatgctgctgatagcacgtctttcgatgtccagctaggagacattatcctgacg gcaacagatggactctttgacaacatgcctgattatatgattcttcaggagctaaaaaag ttaaagaattcaaattatgagagtatacaacagactgccagaagcattgctgagcaagct catgagctggcctatgacccaaattatatgtcaccttttgcacagtttgcatgtgacaat ggattgaatgtgagaggtggaaagccagatgacatcaccgtccttctttcaatagtggct gagtatacagactag >gi568815586r:110437040_110683031|GENSCAN_predicted_peptide_4|503_aa MRPRGLPPLLVVLLGCWASVSAQTDATPAVTTEGLNSTEAALATFGTFPSTRPPGTPRAP GPSSGPRPTPVTDVAVLCVCDLSPAQCDINCCCDPDCSSVDFSVFSACSVPVVTGDSQFC SQKAVIYSLNFTANPPQRVFELVDQINPSIFCIHITNYKPALSFINPEVPDENNFDTLMK TSDGFTLNAESYVSFTTKLDIPTAAKYEYGVPLQTSDSFLRFPSSLTSSLCTDNNPAAFL VNQAVKCTRKINLEQCEEIEALSMAFYSSPEILRVPDSRKKVPITVQSIVIQSLNKTLTR REDTDVLQPTLVNAGHFSLCVNVVLEVKYSLTYTDAGEVTKADLSFVLGTVSSVVVPLQQ KFEIHFLQLVAQKVKSLLWGQGFPDYVAPFGNSQAQDMLDWVPIHFITQSFNRKDSCQLP GALVIEVKWTKYGSLLNPQAKIVNVTANLISSSFPEANSGNERTILISTAVTFVDVSAPA EAGFRAPPAINARLPFNFFFPFV >gi568815586r:110437040_110683031|GENSCAN_predicted_CDS_4|1512_bp atgaggccgcgaggtctcccgccgctcctggtggtgctcctgggctgctgggcctccgtg agcgcccagaccgatgccaccccggcggtgacgacagagggcctcaactccaccgaggca gccctggccaccttcggaactttcccgtcgaccaggccccccgggactcccagggctcca gggccctcctccggccccaggcctaccccagtcacggacgttgctgttctctgtgtctgt gacttatccccagcacagtgtgacatcaactgctgctgtgatcccgactgcagctccgtg gatttcagtgtcttttctgcctgctcagttccagttgtcacgggcgacagccagttttgt agtcaaaaagcagtcatctattcattgaattttacagcaaacccacctcaaagagtattt gaacttgttgaccagattaatccatctattttctgcattcatattacaaactataaacct gcattatcctttattaatccagaagtacctgatgaaaacaattttgatacattgatgaaa acatctgatggttttacattgaatgctgaatcatatgtttccttcacaaccaaactggat attcctactgctgctaaatatgagtatggggttcctctgcagacttcagattcgtttctg agatttccttcgtccctgacatcatctctgtgcactgataataaccctgcagcgtttctg gtgaaccaggctgttaagtgcaccagaaaaataaatttagaacagtgtgaagaaattgaa gccctcagcatggctttttacagcagcccggaaattctgagggtacctgattcaagaaaa aaggtccctatcactgttcagtccatcgtcattcagtctctaaataaaacgctcacccga cgggaggacactgatgtgctgcagccgactctcgtcaacgctggacactttagcctttgc gtgaatgttgttcttgaggtaaagtacagcctcacatacacagatgcaggtgaagtcacc aaagctgatctctcattcgttctggggacagttagcagcgtagtggtcccactgcagcaa aagtttgaaattcattttcttcagctcgtagcacagaaggtgaagagcctgctgtggggc cagggcttcccagattacgtggccccttttggaaattcccaggcccaggacatgctggac tgggtgcccatccacttcatcacccagtcattcaacaggaaggattcctgccagctccca ggggctttggttatagaagtgaagtggactaaatacggatccctgctgaatccacaggcc aaaatagtcaatgtaactgcaaatctaatttcatcctcctttcctgaggccaactcagga aatgaaaggacgattcttatttccactgcggttacttttgtggatgtgtctgcacctgca gaggcaggcttcagagctccaccagccatcaatgccaggctgccctttaacttcttcttc ccgtttgtttga >gi568815586r:110437040_110683031|GENSCAN_predicted_peptide_5|342_aa MKKKIEDRVKGGCTFYAKEVTCAKVLWLEKPAKYERLNVARVGTEVRGKAVTRRAKVAPA ERMSKFLRHFTVVGDDYHAWNINYKKWENEEEEEEEEQPPPTPVSGEEGRAAAPDVAPAP GPAPRAPLDFRGMLRKLFSSHRFQVIIICLVVLDALLVLAELILDLKIIQPDKNNYAAMV FHYMSITILVFFMMEIIFKLFVFRLEFFHHKFEILDAVVVVVSFILDIVLLFQEHQFEAL GLLILLRLWRVARIINGIIISVKTRSERQLLRLKQMNVQLAAKIQHLEFSCSEKIQPRRH ILPEAVPTPTSQGSARSPTFVFSLLSLLLSEAYLIVAVSSAD >gi568815586r:110437040_110683031|GENSCAN_predicted_CDS_5|1029_bp atgaagaagaagatagaggacagggtgaaagggggttgcacattctacgcaaaagaagtc acgtgtgcaaaggtcctgtggttggagaaaccagccaagtatgaaaggctcaatgtggcc agagtggggacagaggtgagagggaaggcagtcacccgcagggccaaggtggctcccgct gagaggatgagcaagttcttaaggcacttcacggtcgtgggagacgactaccatgcctgg aacatcaactacaagaaatgggagaatgaagaggaggaggaggaggaggagcagccacca cccacaccagtctcaggcgaggaaggcagagctgcagcccctgacgttgcccctgcccct ggccccgcacccagggccccccttgacttcaggggcatgttgaggaaactgttcagctcc cacaggtttcaggtcatcatcatctgcttggtggttctggatgccctcctggtgcttgct gagctcatcctggacctgaagatcatccagcccgacaagaataactatgctgccatggta ttccactacatgagcatcaccatcttggtcttttttatgatggagatcatctttaaatta tttgtcttccgcctggagttctttcaccacaagtttgagatcctggatgccgtcgtggtg gtggtctcattcatcctcgacattgtcctcctgttccaggagcaccagtttgaggctctg ggcctgctgattctgctccggctgtggcgggtggcccggatcatcaatgggattatcatc tcagttaagacacgttcagaacggcaactcttaaggttaaaacagatgaatgtacaattg gccgccaagattcaacaccttgagttcagctgctctgagaagatccagcccaggcgtcac atcctccctgaagctgtgcccacccccacatcccagggctctgccaggtcccccacgttt gtcttctcactgctgtcactgctgctcagtgaagcctatctgattgtagcagtctcctca gcagactga