GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:39:52 Sequence gi568815595f:170122670_170403124 : 280455 bp : 40.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 6883 6015 869 2 2 115 91 536 0.127 46.73 1.05 Intr - 13996 13750 247 1 1 77 75 138 0.927 7.51 1.04 Intr - 22852 22754 99 1 0 77 99 55 0.754 4.89 1.03 Intr - 26575 26417 159 1 0 92 95 108 0.765 11.16 1.02 Intr - 50043 49888 156 0 0 53 110 151 0.910 13.09 1.01 Init - 56247 56104 144 0 0 66 68 136 0.964 9.57 1.00 Prom - 57241 57202 40 -9.25 2.00 Prom + 58069 58108 40 -12.72 2.01 Init + 58150 58338 189 0 0 75 98 131 0.890 11.86 2.02 Term + 66627 66767 141 2 0 57 43 209 0.985 10.25 2.03 PlyA + 66838 66843 6 1.05 3.00 Prom + 96938 96977 40 -2.95 3.01 Init + 99719 99884 166 1 1 55 96 159 0.080 11.26 3.02 Intr + 99978 100101 124 1 1 35 100 169 0.641 11.52 3.03 Intr + 112561 112682 122 1 2 117 113 102 0.746 14.82 3.04 Intr + 137300 137389 90 0 0 105 95 79 0.988 9.35 3.05 Intr + 140710 140760 51 2 0 124 73 15 0.705 1.56 3.06 Intr + 145246 145331 86 2 2 87 59 84 0.963 4.02 3.07 Intr + 147752 147892 141 1 0 77 110 100 0.980 10.73 3.08 Intr + 157558 157734 177 0 0 77 99 145 0.988 13.69 3.09 Intr + 158497 158594 98 0 2 40 110 18 0.917 -2.91 3.10 Intr + 159213 159299 87 0 0 90 109 72 0.986 7.67 3.11 Intr + 161792 161927 136 2 1 73 100 95 0.956 8.85 3.12 Intr + 174635 174724 90 1 0 90 80 88 0.994 7.47 3.13 Intr + 176326 176441 116 0 2 82 91 93 0.985 7.33 3.14 Intr + 197261 197358 98 2 2 30 96 82 0.030 1.93 3.15 Term + 199144 199241 98 2 2 93 42 84 0.791 1.45 3.16 PlyA + 199284 199289 6 1.05 4.06 PlyA - 199386 199381 6 1.05 4.05 Term - 205914 205712 203 1 2 46 38 222 0.757 9.57 4.04 Intr - 206080 205942 139 1 1 62 80 43 0.545 0.02 4.03 Intr - 206237 206151 87 2 0 83 101 46 0.532 4.65 4.02 Intr - 208005 207863 143 1 2 47 19 95 0.463 -2.35 4.01 Init - 209923 209806 118 1 1 78 77 105 0.673 8.81 4.00 Prom - 212191 212152 40 -7.05 5.00 Prom + 232684 232723 40 -2.75 5.01 Init + 237663 238790 1128 2 0 66 50 543 0.065 42.49 5.02 Intr + 258575 258672 98 1 2 106 109 34 0.519 5.19 5.03 Intr + 261864 262096 233 0 2 86 97 110 0.829 8.19 5.04 Intr + 267554 267795 242 0 2 82 53 203 0.970 12.55 5.05 Intr + 268367 268591 225 1 0 87 110 204 0.999 19.76 5.06 Term + 269590 269748 159 0 0 21 43 138 0.956 -0.44 5.07 PlyA + 270471 270476 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 6883 5928 956 2 2 115 36 524 0.860 41.13 S.002 Intr + 173242 173321 80 2 2 80 70 80 0.982 3.75 S.003 Term + 180371 180458 88 2 1 78 42 115 0.886 2.05 S.004 Term + 239286 239434 149 1 2 111 47 124 0.923 7.68 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:170122670_170403124|GENSCAN_predicted_peptide_1|558_aa MDTEPNPGTSSVSTTTSSTTTTTITTSSSRMQQPQISVYSGSDRHAVQVIQQALHRPPSS AAQYLQQMYAAQQQHLMLHTAALQQQHLSSSQLQSLAAVQINLSTSPTPAQLISRSQASS STSGSITQQTMLLGSTSPTLTASQAQMYLRAQMLIFTPATTVAAVQSDIPVVSSSSSSSC QSAATQVQNLTLRSQKLGVLSSSQNGPPKSTSQTQSLTICHNKTTVTSSKISQRDPSPES NKKGESPSLESRSTAVTRTSSIHQLIAPASYSPIQPHSLIKHQQIPLHSPPSKVSHHQLI LQQQQQQIQPITLQNSTQDPPPSQHCIPLQNHGLPPAPSNAQSQHCSPIQSHPSPLTVSP NQSQSAQQSVVVSPPPPHSPSQSPTIIIHPQALIQPHPLVSSALQPGPNLQQSTANQVQA TAQLNLPSHLPLPASPVVHIGPVQQSALVSPGQQIVSPSHQQYSSLQSSPIPIASPPQMS TSPPAQIPPLPLQSMQSLQVQPEILSQGQVLVQNALVSEEELPAAEALVQLPFQTLPPPQ TVAVNLQVQPPAPVDPPV >gi568815595f:170122670_170403124|GENSCAN_predicted_CDS_1|1674_bp atggatactgaaccaaacccgggaacatcttctgtgtcaacaacaaccagcagtaccacc accaccaccatcaccacttcctcctctcgaatgcagcagccacagatctctgtctacagt ggttcagaccgacatgctgtacaggtaattcaacaggcattgcatcggccccccagctca gctgctcagtaccttcagcaaatgtatgcagcccaacaacagcacttgatgctgcatact gcagctcttcagcagcagcatttaagcagctcccagcttcagagccttgctgctgttcag atcaacctctccacttctcctacacctgcacagttaataagccgttcccaggcttccagt tctaccagcggcagtattacccaacagactatgttactagggagtacttcccctacccta acggcaagccaagctcaaatgtatctccgagctcaaatgctgattttcacacccgctacc actgtggctgctgtacagtctgacattcctgttgtctcgtcgtcatcgtcatcttcctgt cagtctgcagctactcaggttcagaatttaacattacgcagccagaagttgggtgtatta tctagctcacagaatggtccaccaaaaagcactagtcaaactcagtcattgacaatttgt cataacaaaacaacagtgaccagttctaaaatcagccaacgagatccttctccagaaagt aataagaaaggagagagcccaagcctggaatcacgaagcacagctgtcacccggacatca agtattcaccagttaatagcaccagcttcatattctccaattcagcctcattctctaata aaacatcagcagattcctcttcattcaccaccttccaaagtttcccatcatcagctgata ttacaacagcagcaacagcaaattcagccaatcacacttcagaattcaactcaagaccca cccccatcccagcactgtataccactccagaaccatggccttcctccagctcccagtaat gcccagtcacagcattgttcaccgattcagagtcatccctctcctttaacagtgtctcct aatcagtcacagtcagcacagcagtctgtagtggtgtctcctccaccacctcattcacca agtcagtctcctactataattattcatccacaagcacttattcagccacaccctcttgtg tcatcagctctccagccagggccaaatttgcagcagtccactgctaatcaggtgcaagct acagcacagttgaatcttccatcccatcttccacttccagcttcccctgttgtacacatt ggcccagttcagcagtctgccttggtatccccaggccagcagattgtctctccatcacac cagcaatattcatccctgcagtcctctccaatcccaattgcaagtcctccacagatgtcg acatctcctccagctcagattccaccactgcccttgcagtctatgcagtctttacaagtg cagcctgaaattctgtcccagggccaggttttggtgcagaatgctttggtgtcagaagag gaacttccagctgcagaagctttggtccagttgccatttcagactcttcctcctccacag actgttgcggtaaacctacaagtgcaaccaccagcacctgttgatccaccagtg >gi568815595f:170122670_170403124|GENSCAN_predicted_peptide_2|109_aa MVTENPNSRHKSKHRLSGTGQTIALHFEAWHKRTTLVTPPSRHTNTCKIQPAVLVHKRPT QETWVALYTVGCISLGAEEWKQSIKVGRDFRIDVTHTEEEFMAKAESKL >gi568815595f:170122670_170403124|GENSCAN_predicted_CDS_2|330_bp atggtcacagagaatccaaattctcggcataagtctaaacatcggttgagcggaactggc caaaccatcgcactgcatttcgaggcctggcacaaaagaacaactttggtaactccaccg tctaggcatactaatacctgcaaaatacagccagcagttctggtgcacaagaggcctacc caggaaacatgggtggcactgtacacagttggctgtatctccctcggtgctgaagagtgg aaacagagcattaaagttggaagggactttcgaattgatgtgacccatactgaggaagaa ttcatggcaaaagcagaatctaaactctga >gi568815595f:170122670_170403124|GENSCAN_predicted_peptide_3|559_aa MWAFGASEVGGTDAGGVLGPGGCRGGGAYGQWEEPRGSGCSGEATLGSALRARWAALRRG SEEMPTQRDSSTMSHTVAGGGSGDHSHQVRVKAYYRGDIMITHFEPSISFEGLCNEVRDM CSFDNEQLFTMKWIDEEGDPCTVSSQLELEEAFRLYELNKDSELLIHVFPCVPERPGMPC PGEDKSIYRRGARRWRKLYCANGHTFQAKRFNRRAHCAICTDRIWGLGRQGYKCINCKLL VHKKCHKLVTIECGRHSLPQAMNTRESGKASSSLGLQDFDLLRVIGRGSYAKVLLVRLKK TDRIYAMKVVKKELVNDDEDIDWVQTEKHVFEQASNHPFLVGLHSCFQTESRLFFVIEYV NGGDLMFHMQRQRKLPEEHARFYSAEISLALNYLHERGIIYRDLKLDNVLLDSEGHIKLT DYGMCKDPKERLGCHPQTGFADIQGHPFFRNVDWDMMEQKQVVPPFKPNISGEFGLDNFD SQFTNEPVQLTPDDDRQKYLTGQKNMLQEWGPNPDPERVLGSRTRIQATCPQECGSLVTA ALIIEKQVEWKKMNYVNLH >gi568815595f:170122670_170403124|GENSCAN_predicted_CDS_3|1680_bp atgtgggcctttggagcgagcgaagtgggagggaccgacgcaggaggtgtcttgggcccg ggcggctgtagaggcggcggcgcctacgggcagtgggaggagccgcgcggttccggctgc tccggcgaggcgacccttgggtcggcgctgcgggcgaggtgggcagcgttgaggcggggg agtgaggagatgccgacccagagggacagcagcaccatgtcccacacggtcgcaggcggc ggcagcggggaccattcccaccaggtccgggtgaaagcctactaccgcggggatatcatg ataacacattttgaaccttccatctcctttgagggcctttgcaatgaggttcgagacatg tgttcttttgacaacgaacagctcttcaccatgaaatggatagatgaggaaggagacccg tgtacagtatcatctcagttggagttagaagaagcctttagactttatgagctaaacaag gattctgaactcttgattcatgtgttcccttgtgtaccagaacgtcctgggatgccttgt ccaggagaagataaatccatctaccgtagaggtgcacgccgctggagaaagctttattgt gccaatggccacactttccaagccaagcgtttcaacaggcgtgctcactgtgccatctgc acagaccgaatatggggacttggacgccaaggatataagtgcatcaactgcaaactcttg gttcataagaagtgccataaactcgtcacaattgaatgtgggcggcattctttgccacag gcaatgaacaccagggaaagtggcaaagcttcatccagtctaggtcttcaggattttgat ttgctccgggtaataggaagaggaagttatgccaaagtactgttggttcgattaaaaaaa acagatcgtatttatgcaatgaaagttgtgaaaaaagagcttgttaatgatgatgaggat attgattgggtacagacagagaagcatgtgtttgagcaggcatccaatcatcctttcctt gttgggctgcattcttgctttcagacagaaagcagattgttctttgttatagagtatgta aatggaggagacctaatgtttcatatgcagcgacaaagaaaacttcctgaagaacatgcc agattttactctgcagaaatcagtctagcattaaattatcttcatgagcgagggataatt tatagagatttgaaactggacaatgtattactggactctgaaggccacattaaactcact gactacggcatgtgtaaggaccctaaggaacgattgggttgtcatcctcaaacaggattt gctgatattcagggacacccgttcttccgaaatgttgattgggatatgatggagcaaaaa caggtggtacctccctttaaaccaaatatttctggggaatttggtttggacaactttgat tctcagtttactaatgaacctgtccagctcactccagatgacgatcgccagaaatatctt acaggacagaaaaatatgttacaagaatggggtcccaatccagaccccgagagagttctt ggatctcgcacaagaattcaggctacctgcccacaagaatgtggctccctggtgacagct gcactaattatagaaaaacaagttgagtggaagaagatgaattatgtaaatttacattga >gi568815595f:170122670_170403124|GENSCAN_predicted_peptide_4|229_aa MAIIKKSGNNRSLKSQAIIKKETTDAGEDVEKYECFYTIGLQLEHTLQFTMAPARPITGP GKHFHTQQVLLFPVTSYPGSTLALLTPDSEHPPNIVSRIGGFLVSLTSRMKLQTLAFLPS GGFVVSVASGVKLQTFMVSIAALKGSVSGVVLSFHPELFLPPGVNLQTFKMSVTAHKGSA DPNSEQQQELLQGAKEQSFDIAEGGRQLVAAASLGSLLLFPYLTPPTSC >gi568815595f:170122670_170403124|GENSCAN_predicted_CDS_4|690_bp atggcgatcattaaaaagtcaggaaacaacagatcattaaaaagtcaggcgatcattaaa aaggaaacaacagatgctggagaggatgtggagaaatacgaatgcttttatactattggc cttcagctagagcatactcttcagtttaccatggccccagcacgtcctatcactggacca gggaaacattttcacactcagcaagtactactattcccagtgaccagctaccctggatct accttggctctgctgacgccggatagtgaacatcctccaaacattgtgtccagaattggt gggttcttggtctccctgacttcaagaatgaagctgcagacccttgcgtttcttccttct ggtggattcgtggtctcggtggcttcaggagtgaagctgcagacctttatggtgagcatt gcagctctcaaaggcagtgtgtctggagttgttctttccttccatccggagttgttcctc cctcccggagtgaacctgcagaccttcaagatgagtgttacagctcataaaggtagcgca gacccaaacagtgagcagcagcaagagttactgcaaggagcaaaagaacaaagcttcgac attgcagaagggggacgccagctggttgctgctgctagcttgggcagcctgcttttattc ccttatctgaccccacccacatcctgctga >gi568815595f:170122670_170403124|GENSCAN_predicted_peptide_5|694_aa MENLQTNFSLVQGSTKKLNGMGDDGSPPAKKMITDIHANGKTINKVPTVKKEHLDDYGEA PVETDGEHVKRTCTSVPETLHLNPSLKHTLAQFHLSSQSSLGGPAAFSARHSQESMSPTV FLPLPSPQVLPGPLLIPSDSSTELTQTVLEGESISCFQVGGEKRLCLPQVLNSVLREFTL QQINTVCDELYIYCSRCTSDQLHILKVLGILPFNAPSCGLITLTDAQRLCNALLRPRTFP QNGSVLPAKSSLAQLKETGSAFEVEHECLGKCQGLFAPQFYVQPDAPCIQCLECCGMFAP QTFVMHSHRSPDKRTCHWGFESAKWHCYLHVNQKYLGTPEEKKLKIILEEMKEKFSMRSG KRNQSKASFLYQFLIMTDAPSGMELQSWYPVIKQEGDHVSQTHSFLHPSYYLYMCDKVVA PNVSLTSAVSQSKELTKTEASKSISRQSEKAHSSGKLQKTVSYPDVSLEEQEKMDLKTSR ELCSRLDASISNNSTSKRKSESATCNLVRDINKVGIGLVAAASSPLLVKDVICEDDKGKI MEEVMRTYLKQQEKLNLILQKKQQLQMEVKMLSSSKSMKELTEEQQNLQKELESLQNEHA QRMEEFYVEQKDLEKKLEQIMKQKCTCDSNLEKDKEAEYAGQLAELRQRLDHAEADRQEL QDELRQEREARQKLEMMIKELKLQILKSSKTAKE >gi568815595f:170122670_170403124|GENSCAN_predicted_CDS_5|2085_bp atggaaaacctccagacaaatttctccttggttcagggctcaactaaaaaactgaatggg atgggagatgatggcagccccccagcgaaaaaaatgataacggacattcatgcaaatgga aaaacgataaacaaggtgccaacagttaagaaggaacacttggatgactatggagaagca ccagtggaaactgatggagagcatgttaagcgaacctgtacttctgttcctgaaactttg catttaaatcccagtttgaaacacacattggcacaattccatttaagtagtcagagctcg ctgggtggaccagcagcattttctgctcggcattcccaagaaagcatgtcgcctactgta tttctgcctcttccatcacctcaggttcttcctggcccattgctcatcccttcagatagc tccacagaactcactcagactgtgttggaaggggaatctatttcttgttttcaagttgga ggagaaaagagactctgtttgccccaagtcttaaattctgttctccgagaatttacactc cagcaaataaatacagtgtgtgatgaactgtacatatattgttcaaggtgtacttcagac cagcttcatatcttaaaggtactgggcatacttccattcaatgccccatcctgtgggctg attacattaactgatgcacaaagattatgtaatgctttattgcggccacgaacttttcct caaaatggtagcgtacttcctgctaaaagctcattggcccagttaaaggaaactggcagt gcctttgaagtggagcatgaatgcctaggcaaatgtcagggtttatttgcaccccagttt tatgttcagcctgatgctccgtgtattcaatgtctggagtgttgtggaatgtttgcaccc cagacgtttgtgatgcattctcacagatcacctgacaaaagaacttgccactggggcttt gaatcagctaaatggcattgctatcttcatgtgaaccaaaaatacttaggaacacctgaa gaaaagaaactgaagataattttagaagaaatgaaggagaagtttagcatgagaagtgga aagagaaatcaatccaaggcaagttttttatatcaatttttaataatgacagatgcacca tcaggaatggaattacagtcatggtatcctgttataaagcaggaaggtgaccatgtttct cagacacattcatttttacaccccagctactacttatacatgtgtgataaagtggttgcc ccaaatgtgtcacttacttctgctgtatcccagtctaaagagctcacaaagacagaggca agtaagtccatatcaagacagtcagagaaggctcacagtagtggtaaacttcaaaaaaca gtgtcttatccagatgtctcacttgaggaacaggagaaaatggatttaaaaacaagtaga gaattatgtagccgtttagatgcatcaatctcaaataattctacaagtaaaaggaaatct gagtctgccacttgcaacttagtcagagacataaacaaagtgggaattggccttgttgct gccgcttcatctccgcttcttgtgaaagatgtcatttgtgaggatgataagggaaaaatc atggaagaagtaatgagaacttatttaaaacaacaggaaaaactaaacttgattttgcaa aagaagcaacaacttcagatggaagtaaaaatgttgagtagttcaaaatctatgaaggaa ctcactgaagaacagcagaatttacagaaagagcttgaatctttgcagaatgaacatgct caaagaatggaagaattttatgttgaacagaaagacttagagaaaaaattggagcagata atgaagcaaaaatgtacctgtgactcaaatttagaaaaagacaaagaggctgaatatgca ggacagttggcagaactgaggcagagattggaccatgctgaggccgataggcaagaactc caagatgaactcagacaggaacgggaagcaagacagaagttagagatgatgataaaagag ctaaagctgcaaattctgaaatcatcaaagactgctaaagaatag