GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:04:05 Sequence gi568815576f:39775296_39976112 : 200817 bp : 40.99% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 24486 24640 155 2 2 84 71 119 0.111 7.31 1.02 Intr + 30997 31094 98 1 2 72 93 31 0.057 0.73 1.03 Intr + 35680 35760 81 2 0 99 50 112 0.294 7.19 1.04 Intr + 35987 36027 41 0 2 41 42 41 0.105 -8.18 1.05 Term + 37279 37413 135 0 0 58 37 188 0.758 7.74 1.06 PlyA + 39381 39386 6 1.05 2.05 PlyA - 40882 40877 6 1.05 2.04 Term - 45108 45077 32 1 2 75 54 26 0.309 -5.06 2.03 Intr - 45818 45698 121 2 1 98 99 104 0.697 11.65 2.02 Intr - 57535 57398 138 1 0 100 33 73 0.365 2.74 2.01 Init - 60598 60545 54 1 0 45 107 22 0.762 1.13 2.00 Prom - 66149 66110 40 -6.15 3.02 PlyA - 66318 66313 6 1.05 3.01 Sngl - 67202 66546 657 2 0 65 43 365 0.912 25.62 3.00 Prom - 71322 71283 40 -6.75 4.05 PlyA - 71786 71781 6 1.05 4.04 Term - 74663 74520 144 2 0 34 45 155 0.557 2.73 4.03 Intr - 83869 83708 162 1 0 12 86 146 0.534 6.05 4.02 Intr - 86712 86435 278 1 2 62 69 146 0.812 6.41 4.01 Init - 88338 88329 10 0 1 99 115 2 0.929 4.65 4.00 Prom - 97837 97798 40 -8.05 5.00 Prom + 98790 98829 40 -4.25 5.01 Sngl + 99969 100820 852 2 0 26 44 865 0.576 71.44 5.02 PlyA + 100957 100962 6 1.05 6.00 Prom + 101542 101581 40 -6.05 6.01 Init + 102863 103037 175 1 1 72 39 156 0.039 8.66 6.02 Intr + 118038 118112 75 1 0 51 53 119 0.890 3.27 6.03 Intr + 118449 118595 147 1 0 90 115 58 0.905 7.99 6.04 Intr + 120699 120791 93 1 0 59 77 51 0.487 0.12 6.05 Term + 121941 122050 110 1 2 107 37 68 0.702 1.29 6.06 PlyA + 122053 122058 6 1.05 7.04 PlyA - 122067 122062 6 1.05 7.03 Term - 132195 132029 167 1 2 47 45 103 0.632 -1.00 7.02 Intr - 132740 132534 207 0 0 -12 83 150 0.453 2.63 7.01 Init - 135644 135587 58 2 1 75 111 16 0.591 4.18 7.00 Prom - 141821 141782 40 -4.25 8.07 PlyA - 142914 142909 6 1.05 8.06 Term - 153958 153863 96 1 0 138 49 44 0.762 2.49 8.05 Intr - 155895 155737 159 0 0 85 44 76 0.463 2.16 8.04 Intr - 157746 157701 46 2 1 111 82 50 0.647 4.19 8.03 Intr - 163503 163423 81 2 0 109 62 115 0.764 8.83 8.02 Intr - 173820 173389 432 2 0 -34 60 302 0.017 7.03 8.01 Init - 174508 174453 56 1 2 79 1 67 0.030 -2.09 8.00 Prom - 174848 174809 40 -6.45 9.00 Prom + 175023 175062 40 -5.05 9.01 Init + 180284 180343 60 1 0 71 92 72 0.932 7.20 9.02 Intr + 180524 180615 92 1 2 86 108 58 0.923 5.47 9.03 Intr + 182189 182356 168 2 0 59 94 52 0.512 1.04 9.04 Intr + 184387 184454 68 1 2 101 73 78 0.337 5.23 9.05 Intr + 184760 184879 120 0 0 116 84 50 0.304 6.95 9.06 Term + 185125 185282 158 2 2 85 35 53 0.206 -3.19 9.07 PlyA + 186555 186560 6 1.05 10.00 Prom + 188833 188872 40 -8.15 10.01 Init + 189080 189245 166 1 1 94 44 319 0.851 27.94 10.02 Intr + 190821 190996 176 1 2 33 55 91 0.678 -0.96 10.03 Intr + 192747 192977 231 2 0 81 70 270 0.646 21.55 10.04 Intr + 194116 194238 123 0 0 106 39 88 0.324 5.56 10.05 Term + 195610 195789 180 0 0 82 37 127 0.250 3.63 10.06 PlyA + 197843 197848 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 23367 23245 123 0 0 54 46 120 0.832 1.80 S.002 Init - 28032 27961 72 0 0 66 96 80 0.862 7.72 S.003 Init + 171495 171848 354 2 0 78 94 175 0.849 14.28 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_1|169_aa MTWAAFGCLLSLSLGTLSLLLGPTVAHCCNLLGSWGLEDSQDVEMQGLLGPRLSLANVNN QVSTRDEESSTKFQKTALLVGNLAEQHTGYHIDDTVLSGLMQEVASTLDATFLSEDQSKQ KALQQSFQWFFSGPCSKDDLNRQGLNNMTAPHQGDLAATDAEHPECPEW >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_1|510_bp atgacctgggcagcattcggttgtctcctgtctctatcactggggaccctgtcacttctg ctgggtccaacagtggcacactgctgcaaccttctgggtagttgggggttggaggactcc caggatgtggagatgcaggggctgttaggtcccagactctctttagctaatgttaacaat caagtgtcaactagggatgaggaaagttctacaaaatttcaaaagacagccctgctggtg gggaacctcgctgagcaacacactggctatcacattgatgatacagtgctaagtggactt atgcaagaagtggcaagcactcttgatgctacatttttgagtgaggaccaaagcaaacaa aaggccctgcagcagtcattccagtggttcttcagtgggccatgtagcaaggatgacctc aacaggcaggggctcaacaacatgactgctcctcaccaaggtgacctggctgccacagat gcggagcatccagaatgcccggagtggtag >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_2|114_aa MLSQETLPLKIHGWKSTEDKASTDTPTHSHTHLPVQKSSVSPRGPFSFLLILMSFLYSTS WVDLDLMTFLDDDPELPLLATPPSIVSPITCLSEAEEVCNLSGAESRMEDLLPP >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_2|345_bp atgttgtcccaggaaacacttccactcaagattcatggttggaaatcaacagaggataaa gccagcaccgacacccctactcactctcacacacatttaccagttcaaaagagctctgtg tctccccgggggcctttcagttttcttctcattttgatgtcgttcctttatagtacctcc tgggttgatcttgacctgatgacatttttggatgatgatccagagctgcctttactagca acacctccttccattgtctctccaatcacttgcttgtcagaagcagaagaagtttgtaat ctttcgggtgcagaaagtagaatggaagatcttctgccaccatga >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_3|218_aa MEDEMNEMKREGKSREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQADVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGWV TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRNSYPAKLSFISEGEIKYFTDK QMLRDFVTTRPALKELLKEALNMERNNQYQPLQNHAKM >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_3|657_bp atggaagatgaaatgaatgaaatgaagcgagaagggaagtctagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgtctg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccgacgttcagattcaggaaata cagagaacaccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggttgggtt accctcaaagggaagcccatcagactaacagcagatctctcggcagaaaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaaat tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag caaatgctgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagca ctaaacatggaaaggaacaaccagtaccagccgctgcaaaatcatgccaaaatgtaa >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_4|197_aa MEKGYYIREKSKQVITLLMDEPLLCKEREVACRTRQRTSHSILFSKRQLGSSNSLTACTS APTPDISASEKKYKLPKFGRLHNKSTYNEHWPKMQQTEREEETAEENLEASRGGFMRFKE SNCHHNTQVQSEAASTDGEAVASYPEDLAKNKVEEKDISSPCYVSFVVEVLAITVKQARE IKNVSIKMEETSIICKR >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_4|594_bp atggagaaaggttattatatccgggaaaaatctaagcaagtcatcacattgctgatggat gaaccattgctgtgtaaagagagggaagtggcatgtcggactagacagcgtacctcccac tctatattgttttctaaaagacaacttggttcaagtaactcactgacagcgtgcacttct gcccccacaccggatatttctgcttcagagaagaagtataagcttcctaagtttggaagg ttacataataaaagtacgtataatgaacattggcctaaaatgcaacagactgagagagaa gaggaaactgcagaagaaaatcttgaagctagcagaggtgggttcatgaggttcaaggaa agcaactgtcaccataacacacaagtacaaagtgaagcagcaagtactgatggagaagct gtagcaagttatccagaagatctagctaagaataaggtggaagaaaaggacatcagttct ccttgctacgtcagctttgtagtggaggtcctagccatcacagtaaaacaagcaagagaa attaaaaatgtcagtatcaaaatggaagaaacgagcatcatttgtaaacgatag >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_5|283_aa MQQIYTVKEIRSVAARSGPFAPVLSATSRGVAGALRPLVQATVPATPEQPVLDLKRPFLS RESLSGQAVRRPLVASVGLNVPASVCYSHTDVKVPDFSEYRRLEVLDSTKSSRESTEARK GFSYLVTGVTTVGVAYAAKNAVTQFVSSMSASADVLALAKIEIKLSDIPEGKNMAFKWRG KPLFVRHRTQKEIKQEAAVELSQLRDPQHDLDRVKKPEWVILIGVCTHLGCVPIANAGDF GGYYCPCHGSHYDASGRIRLGPATLNLEVPTYEFTSDDMVIVG >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_5|852_bp atgcaacaaatctatacagttaaagaaattcggtcggtcgcagcccgctcgggcccgttc gcgcccgtcctgtcggccacgtcccgcggggtggcgggcgcgctgcggcccttggtgcag gccacggtgcccgccaccccggagcagcctgtgttggacctgaagcggcccttcctcagc cgggagtcgctgagcggccaggccgtgcgccggcctttggtcgcctccgtgggcctcaat gtccctgcttctgtttgttattcccacacagacgtcaaggtgcctgacttctctgaatac cgccgccttgaagttttagatagtacgaagtcttcaagagaaagcaccgaggctaggaaa ggtttctcctatttggtaactggagtaactactgtgggtgtcgcatatgctgccaagaat gccgtcacccagttcgtttccagcatgagtgcttctgctgatgtgttggccctggcgaaa atcgaaatcaagttatccgatattccagaaggcaagaacatggctttcaaatggagaggc aaacccctgtttgtacgtcatagaacccagaaggaaattaagcaggaagctgcagttgaa ttatcacagttgagggacccacagcatgatctagatcgagtaaagaaacctgaatgggtt atcctgataggtgtttgcactcatcttggttgtgtacccattgcaaatgcaggagatttt ggtggttattactgcccttgccatgggtcacactatgatgcatctggcaggatcagattg ggtcctgctactctcaaccttgaagtccccacgtatgagttcaccagtgacgatatggtg attgttggttag >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_6|199_aa MKNAFGRLSGRLDTAEERIFELEDISIETTKTEKQREKPNDNNNNKNRISKNYGTTIKGP FLGILVFIHTTPTSDDDCVPDEPGYSTYSSQAQGPRPVPTSASSLSGLVQFLSGRQARWL QIPELEGTKGNEALSPTPRTNLWSLILSSISNSVTSGLVMFGEGSQAMNLGVTLTPVFVS YSSHDIHELTFEIQGGYPV >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_6|600_bp atgaagaatgcctttggcaggctttctgggagactggacacagctgaggaaagaatcttt gagctggaggacatctcaatagaaaccactaaaactgaaaagcaaagagaaaaaccgaat gacaacaacaataacaagaacagaatatccaagaactatgggacaactataaaagggcct ttcctcggcatcctcgtcttcattcatacaacccccacttcagatgacgactgtgtacca gatgagccaggatattccacctactccagccaggctcaagggcccaggccggtccccact tccgcttcctcactttctggcctggtgcagttcctgtcagggcgtcaagcaagatggttg cagatacctgagctggagggtaccaaggggaatgaagctctttcccctactccaaggacc aatctttggtccttgattttaagctctatcagcaactcagtcacaagtggacttgttatg tttggagaaggttctcaggccatgaacctgggagtcaccttaactcctgtgtttgtctct tatagctctcatgatatccatgagcttacctttgaaatacaaggtggctacccagtctag >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_7|143_aa MKFWPFRGQRSIMRLAAPAALVEPEFSFPEAHRGLLHLQNPVFGPVLWKGPSLQRDRLGA TSRQTLANYLEQVPETLQALLFPLCRMRVNEPVRQLFLKLHQDQCTITLEKDNFSYSGYM GGHQVIYDGWTPEFPTEILYKVA >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_7|432_bp atgaagttctggccctttagaggccagagatccatcatgagattagctgcaccagcagct ctggtagagccagaattcagctttccagaagcccatcgagggctcctgcatctccagaat cctgtgtttggaccagtgttgtggaaggggcccagcctacagcgtgacaggcttggtgct acgtcccggcagacattagcaaattaccttgagcaagtcccggaaactctccaagccttg cttttcccactctgcagaatgagagtcaatgagcctgtacgacaactctttttaaagctc catcaggatcagtgtacaataactctagagaaagacaatttttcctattcaggatatatg ggtggacaccaggtgatttatgatggatggacaccagaatttcccacggaaattctgtac aaagttgcgtag >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_8|289_aa MDDPDNAGHHLLGVYVGTGGWSSCGDIQQTIGSLGLEFKSKARARTRATDLRVFGTEVIV AAEEVDEVAQSEKKNQKAKGSTFREVYISGSREEKECLEMRPVGVVRKGGEEPGEGGAAK AREKGASRRGPVSSVNPTDISIGSLYYTLRRSLVNFKRMLQGGCVPRITIHPLAASLFLK AANRAVSDDRGLEVIYPLLSAAAIWEYPYALCIRSTNISCAPLCVKPCVEHEDPVWTRQP ILRPQTVQSCRKNKSAKPEIAKIVQRSPVLFTPLPLMGTSYITIVQYQS >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_8|870_bp atggatgatcccgataatgcgggccaccatttattgggcgtctacgttgggacagggggc tggagttcatgtggggacatccagcagacaattggaagtctgggtctggagttcaagagc aaggccagagctagaacacgagccacagacttgagggtctttggcacagaggtgatagtt gcagctgaagaagtggatgaggtggctcagagtgagaagaaaaatcagaaagccaaaggc agcaccttcagggaagtgtatatttcaggatcaagagaggaaaaagaatgcttggaaatg agacctgtaggagtggtcaggaagggaggagaagaacccggagagggtggtgctgccaaa gccagggagaaaggggcttccaggagagggccagttagcagtgtcaatcccacagacatt tccatagggtcattatactacacattaaggagatcactggtgaacttcaagagaatgttg cagggagggtgtgttccccgaataacgattcatcctctggctgcttcgctttttctgaaa gccgccaacagagctgtctccgatgacaggggacttgaagttatctaccccttgctcagt gctgccgccatctgggagtatccatatgctttgtgtattcgttcaacaaacatttcctgt gccccactctgtgtcaagccctgtgtagagcatgaagatccagtgtggactagacagccc atcttaagacctcagacagttcagtcttgtaggaaaaacaagtcagccaaaccagaaatt gcaaaaatagtacagaggtcccctgtactcttcaccccacttcccctaatggggacatct tacatcaccatagtacagtatcaaagctag >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_9|221_aa MSGRHKWTEITWGQEKLTPKILSNQEEWFKAELGSQEGYVPKNFIDIQFPNWLEGTVTAS SRNSPLSLFHVNLLSLSAGSFLSANTHAAISPNLPKHKQTKPNPEASASSVLAALVYLFR SCRQAAASCNGFTKASLDTRQRTYSWARRLASSSSGPARAPQGTSPSLSAQVLLEGQDFP LLHLYAFNSVLHKRETLSEYLWKEGGIGGGELPLVQIHLGV >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_9|666_bp atgagtggacggcacaagtggacagaaataacctggggtcaggagaagctgactccaaag attttaagtaaccaagaggagtggtttaaggcggagcttgggagccaggaaggatatgtg cccaagaatttcatagacatccagtttcccaactggctggaaggaactgtcactgcttct agcagaaattctcctctctctctcttccacgtcaacctcctttccctctctgctggatca ttcctgtcagcaaacacacacgctgcaatttctcccaacttaccaaaacacaaacaaaca aaaccaaacccagaggccagtgcatcctctgtcctcgcagctctggtctatctcttccgc agctgccggcaggcggctgcttcctgcaatggtttcacgaaggcctctctcgacaccagg cagagaacttactcatgggcaaggaggttggcttcttcatcatccgggccagccagagct ccccaggggacttctccatctctgtcagcccaggtgcttctggaagggcaggattttcct ctacttcatctttatgcctttaacagtgtcttgcataaaagagagacactcagtgagtat ttgtggaaggaaggagggataggaggaggagaactcccacttgtccagatccatctgggg gtttaa >gi568815576f:39775296_39976112|GENSCAN_predicted_peptide_10|291_aa MSSHEGGKKKALKQPKKQAKEMDEEEKAFKQKQKEEQKKLEVLKAKVVGKGPLATDRSRS SLETEPEKTRYAPDPVDPNLEILGSLSSHMNHQEKCIGWVDPLSILKDRAQSSEGHRGNS LDRRSQGGPHLSGAVGEEIRPSMNRKLSDHPPTLPLQQHQHQPQPPQYAPAPQQLQQPPQ QRYLQHHHFHQERRGGSLDINDGHCGTGLGSEMNAALMHRRHTDPVQLQAAGRVRWARAL YDFEALEDDELGFHSGEVVEVLDSSNPSWWTGRLHNKLGLFPANYVAPMTR >gi568815576f:39775296_39976112|GENSCAN_predicted_CDS_10|876_bp atgtccagccacgaaggtggcaagaagaaggcactgaaacagcccaagaagcaggccaag gagatggacgaggaagagaaggctttcaagcagaaacaaaaagaggagcagaagaaactc gaggtgctaaaagcgaaggtcgtggggaaggggcctctggccacagacagaagcagatct tccttagagacagaacccgagaagaccaggtatgctccagatccagtcgaccccaatcta gagattttaggcagcctgagttctcacatgaaccaccaggaaaaatgcataggatgggta gatcctctgagtatactgaaagacagagctcagagctctgagggtcaccggggcaacagc ctggaccggaggtcccagggaggcccacacctcagtggggctgtgggagaagaaatccga ccttcgatgaaccggaagctgtcggatcaccccccgacccttcccctgcagcagcaccag caccagccacagcctccgcaatatgccccagcgccccagcagctgcagcagcccccacag cagcgatatctgcagcaccaccatttccaccaggaacgccgaggaggcagccttgacata aatgatgggcattgtggcaccggcttgggcagtgaaatgaatgcggccctcatgcatcgg agacacacagacccagtgcagctccaggcggcagggcgagtgcggtgggcccgggcgctg tatgactttgaggccctggaggatgacgagctggggttccacagcggggaggtggtggag gtcctggatagctccaacccatcctggtggaccggccgcctgcacaacaagctgggcctc ttccctgccaactacgtggcacccatgacccgataa