GENSCAN 1.0    Date run: 5-Nov-116    Time: 23:10:34

Sequence gi568815596r:70731053_70935776 : 204724 bp : 45.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 Intr -  20215  20058  158  0  2   86  100   107 0.341  11.35
 1.06 Intr -  36417  36278  140  0  2   31   68    92 0.007   0.76
 1.05 Intr -  42316  42247   70  0  1   76   74    28 0.003  -0.62
 1.04 Intr -  46619  46585   35  2  2   98   96    20 0.013   0.82
 1.03 Intr -  48361  48326   36  1  0   92   98    33 0.026   3.16
 1.02 Intr -  56749  56597  153  1  0  121   98     6 0.784   5.17
 1.01 Init -  59586  59356  231  0  0   85   76   481 0.979  42.86
 1.00 Prom -  62643  62604   40                              -4.06

 2.00 Prom +  66275  66314   40                              -7.46
 2.01 Init +  71749  71751    3  0  0  108   81     0 0.744   1.30
 2.02 Term +  72072  72374  303  2  0   10   45   344 0.758  17.47
 2.03 PlyA +  73165  73170    6                               1.05

 3.07 PlyA -  73501  73496    6                               1.05
 3.06 Term -  78042  77877  166  2  1  -77   55   226 0.725   0.19
 3.05 Intr -  78805  78687  119  1  2   99   75    89 0.773   7.96
 3.04 Intr -  81546  81395  152  1  2  100   71    56 0.730   4.98
 3.03 Intr -  86060  84942 1119  0  0  118  106   760 0.600  70.42
 3.02 Intr -  88839  88723  117  1  0  102   94    30 0.877   5.44
 3.01 Init -  89471  89411   61  2  1   94  100    48 0.931   8.01
 3.00 Prom -  96866  96827   40                              -4.76

 4.08 PlyA -  96903  96898    6                               1.05
 4.07 Term -  97231  97163   69  1  0  112   44    55 0.316   1.44
 4.06 Intr - 100767 100654  114  0  0   97   37    56 0.280   2.04
 4.05 Intr - 101999 101848  152  0  2  110  109    55 0.995   9.68
 4.04 Intr - 102968 102594  375  0  0  123  106   441 0.995  44.39
 4.03 Intr - 104555 104439  117  0  0   98   78   102 0.999  10.64
 4.02 Intr - 104792 104738   55  2  1  107   84    33 0.940   3.35
 4.01 Init - 118407 118405    3  0  0  108   81     0 0.003   1.30
 4.00 Prom - 121553 121514   40                              -4.96

 5.00 Prom + 127801 127840   40                              -1.56
 5.01 Init + 134846 134848    3  1  0   92  101     0 0.066   1.70
 5.02 Intr + 140537 140604   68  1  2   71   87    66 0.077   2.50
 5.03 Term + 148802 149399  598  2  1   44   48   206 0.249   6.20
 5.04 PlyA + 149790 149795    6                               1.05

 6.00 Prom + 150554 150593   40                              -2.96
 6.01 Init + 165417 165514   98  2  2   54   79   130 0.883   8.48
 6.02 Intr + 168647 168884  238  2  1  113  100   177 0.991  19.02
 6.03 Intr + 169115 169216  102  1  0   55  105    61 0.924   4.87
 6.04 Intr + 169531 169816  286  0  1  -23   94   267 0.532  13.11
 6.05 Intr + 174659 174820  162  0  0   32  107    42 0.111   0.45
 6.06 Intr + 190046 190233  188  0  2   91  110   342 0.454  36.11
 6.07 Term + 201715 202152  438  0  0   99   47   424 0.969  34.68
 6.08 PlyA + 202364 202369    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  49876  49957   82  0  1   90   86   113 0.933  12.43
S.002 Sngl + 112148 112486  339  1  0   87   37   181 0.876   8.93

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:70731053_70935776|GENSCAN_predicted_peptide_1|275_aa
MDPAPGVLDPRAAPPALLGTPQAEVLEDVLREQFGPLPQLAAVCRLKRLPSGGYSSTENL
QLVLERRRVANAKERERIKNLNRGFARLKALVPFLPQSRKPSKVDILKGATEYIQVLSDL
LEGAKDSKVVALYDGKTSTTDRFPEVELLSHRTFQGIGSGIISFECGIVMELGLGRCEAG
LAGLGASPGASRSLAARRSAAVAGRAGAASAAAYIEIGATGTFLSNKRQLLYTETFPLEF
QSVKVEGIKQIFQSNLFIIEIRHMKLRKDKALIQX

>gi568815596r:70731053_70935776|GENSCAN_predicted_CDS_1|825_bp
atggaccccgcgcccggcgtcctagatccccgcgccgcgccgcccgcgctcctgggcacc
ccgcaagccgaggtgctggaggacgtgttgcgggagcagttcgggccgctgccccagctg
gccgctgtctgccggctcaagcggctgccctcgggcggctactcgtccactgaaaacctc
cagttggtgctggagcggcggcgtgtggccaacgccaaggagcgtgagcggataaaaaat
ctcaaccgtggttttgccagattgaaggcacttgtgccatttcttccccaaagcaggaag
cccagcaaagttgatatccttaaaggtgcgactgaatatatacaggttctcagtgatctt
ttggaaggagccaaagactcaaaggtagtggctctatacgatggtaaaacttctaccaca
gatagattcccagaagtagaactgctgagtcacagaacttttcaaggcattggcagtggg
attatttcttttgaatgtggcattgtaatggaactgggtctcgggcggtgcgaggcgggc
ttggcggggctgggcgcgtcccccggggcctcgcgatcgctggctgcgcggcgctcagcc
gcagtggccggccgagcaggtgcggcgtcggcagccgcctacattgagatcggggcaacc
ggcacgtttctgagtaacaagcgtcaactgttatatactgaaacattccctctggaattc
cagagtgtcaaagtggaagggatcaaacagatcttccagtccaacctcttcattatagaa
atcagacacatgaagctgagaaaagataaggcactgatccaagnn

>gi568815596r:70731053_70935776|GENSCAN_predicted_peptide_2|101_aa
MHLSPTAVTAATLPKRKTEGDVKGDKAKVDTPQRRSTRLSAKPAPPKPEPKPKKAPAKKG
EKVPKGKKGKADAGKEGNYPAEKGDAITDQAREAEGDGEAK

>gi568815596r:70731053_70935776|GENSCAN_predicted_CDS_2|306_bp
atgcatctaagtcccactgctgtcaccgccgccaccttgcccaagaggaagaccgaaggg
gatgttaaaggagataaagccaaggtggacacacctcagagaagatccacaaggttgtct
gctaaacctgctcctccaaagccagagcccaagcctaaaaaggcccctgcaaagaaggga
gagaaggtacctaaagggaaaaagggaaaagctgatgctggcaaggaggggaattaccct
gcagaaaaaggagatgccataacagaccaggccagggaagctgaaggtgatggagaggcc
aagtga

>gi568815596r:70731053_70935776|GENSCAN_predicted_peptide_3|577_aa
MDGEAVRFCTDNQCVSLHPQEVDSVAMAPAAPKIPRLVQATPAFMAVTLVFSLVTLFVVD
HHHFGREAEMRELIQTFKGHMENSSAWVVEIQMLKCRVDNVNSQLQVLGDHLGNTNADIQ
MVKGVLKDATTLSLQTQMLRSSLEGTNAEIQRLKEDLEKADALTFQTLNFLKSSLENTSI
ELHVLSRGLENANSEIQMLNASLETANTQAQLANSSLKNANAEIYVLRGHLDSVNDLRTQ
NQVLRNSLEGANAEIQGLKENLQNTNALNSQTQAFIKSSFDNTSAEIQFLRGHLERAGDE
IHVLKRDLKMVTAQTQKANGRLDQTDTQIQVFKSEMENVNTLNAQIQVLNGHMKNASREI
QTLKQGMKNASALTSQTQMLDSNLQKASAEIQRLRGDLENTKALTMEIQQEQSRLKTLHV
VITSQEQLQRTQSQLLQMVLQGWKFNGGSLYYFSSVKKSWHEAEQFCVSQGAHLASVASK
EEQAFLVEFTSKVYYWIGLTDRGTEGSWRWTDGTPFNAAQNKAPVVFGFWEKNQSDNWRH
KNGQTEDCVQIQQKWNDMTCDTPYQWVCKKPMGQGVA

>gi568815596r:70731053_70935776|GENSCAN_predicted_CDS_3|1734_bp
atggacggtgaggcagtccgcttctgcacagataaccagtgtgtctccctgcacccccaa
gaggtggactctgtggcaatggctcctgcagcccccaagataccgaggctcgttcaggct
accccggcatttatggctgtgaccttggtcttctctcttgtgactctctttgtagtggat
catcaccactttggcagggaggcagaaatgcgagagcttatccagacatttaaaggccac
atggagaattccagtgcctgggtagtagaaatccagatgttgaagtgcagagtggacaat
gtcaattcgcagctccaggtgctcggtgatcatctgggaaacaccaatgctgacatccag
atggtaaaaggagttctaaaggatgccactacattgagtttgcagacacagatgttaagg
agttccctggagggaaccaatgctgagatccagaggctcaaggaagaccttgaaaaggca
gatgctttaactttccagacgctgaatttcttaaaaagcagtttagaaaacaccagcatt
gagctccacgtgctaagcagaggcttagaaaatgcaaactctgaaattcagatgttgaat
gccagtttggaaacggcaaatacccaggctcagttagccaatagcagtttaaagaacgct
aatgctgagatctatgttttgagaggccatctagatagtgtcaatgacttgaggacccag
aaccaggttttaagaaatagtttggaaggagccaatgctgagatccagggactaaaggaa
aatttgcagaacacaaatgctttaaactcccagacccaggcctttataaaaagcagtttt
gacaacactagtgctgagatccagttcttaagaggtcatttggaaagagctggtgatgaa
attcacgtgttaaaaagggatttgaaaatggtcacagcccagacccaaaaagcaaatggc
cgtctggaccagacagatactcagattcaggtattcaagtcagagatggaaaatgtgaat
accttaaatgcccagattcaggtcttaaatggtcatatgaaaaatgccagcagagagata
cagaccctaaaacaaggaatgaagaatgcttcagccttaacttcccagacccagatgtta
gacagcaatctgcagaaggccagtgccgagatccagaggttaagaggggatctagagaac
accaaagctctaaccatggaaatccagcaggagcagagtcgcctgaagaccctccatgtg
gtcattacttcacaggaacagctacaaagaacccaaagtcagcttctccagatggtcctg
caaggctggaagttcaatggtggaagcttatattatttttctagtgtcaagaagtcttgg
catgaggctgagcagttctgcgtgtcccagggagcccatctggcatctgtggcctccaag
gaggagcaggcatttctggtagagttcacaagtaaagtgtactactggatcggtctcact
gacaggggcacagagggctcctggcgctggacagatgggacaccattcaacgccgcccag
aacaaagcccctgttgtcttcgggttttgggaaaagaatcagtctgacaactggcggcac
aagaatgggcagactgaagactgtgtccaaattcagcagaagtggaatgacatgacctgt
gacaccccctatcagtgggtgtgcaagaagcccatgggccagggtgtggcctga

>gi568815596r:70731053_70935776|GENSCAN_predicted_peptide_4|294_aa
MIRALGETGSQKHLCSQDKEPPPKSGPSLVPGKTPTVRAALICLTLVLVASVLLQAVLYP
RFMGTISDVKTNVQLLKGRVDNISTLDSEIKKNSDGMEAAGVQIQMVNESLGYVRSQFLK
LKTSVEKANAQIQILTRSWEEVSTLNAQIPELKSDLEKASALNTKIRALQGSLENMSKLL
KRQNDILQVVSQGWKYFKGNFYYFSLIPKTWYSAEQFCVSRNSHLTSVTSESEQEFLYKT
AGGLIYWIGLTKAGMEGDWSWVDDTPFNKVQSGGRYMGTCPSAGSVASPRKSHH

>gi568815596r:70731053_70935776|GENSCAN_predicted_CDS_4|885_bp
atgataagagcccttggagagacaggcagccagaagcacctgtgctcccaggataaggag
cctcctcccaagtccggtccatctctggtcccggggaaaacacccacagtccgtgctgca
ttaatctgcctgacgctggtcctggtcgcctccgtcctgctgcaggccgtcctttatccc
cggtttatgggcaccatatcagatgtaaagaccaatgtccagttgctgaaaggtcgtgtg
gacaacatcagcaccctggattctgaaattaaaaagaatagtgacggcatggaggcagct
ggcgttcagatccagatggtgaatgagagcctgggttatgtgcgttctcagttcctgaag
ttaaaaaccagtgtggagaaggccaacgcacagatccagatcttaacaagaagttgggaa
gaagtcagtaccttaaatgcccaaatcccagagttaaaaagtgatttggagaaagccagt
gctttaaatacaaagatccgggcactccagggcagcttggagaatatgagcaagttgctc
aaacgacaaaatgatattctacaggtggtttctcaaggctggaagtacttcaaggggaac
ttctattacttttctctcattccaaagacctggtatagtgccgagcagttctgtgtgtcc
aggaattcacacctgacctcggtgacctcagagagtgagcaggagtttctgtataaaaca
gcggggggactcatctactggattggcctgactaaagcagggatggaaggggactggtcc
tgggtggatgacacgccattcaacaaggtccaaagtggaggtcgctacatgggcacctgc
ccttctgctggcagtgtggcctcgcccaggaagtcccaccattga

>gi568815596r:70731053_70935776|GENSCAN_predicted_peptide_5|222_aa
MGAIAHHPGCLRKEVGEERNPSVLCRKAFDKIQQCFMLKTLNKLDVDGTYLKIIRAIYDK
LTANIILNGQKLEAFPLKTGTRQGCPLSPPLFNIVLEVLARAIRQEKEIKGIQLGEEEVK
LSLFADDMIVYLENSIISAQNLLKLINNFTKVSGYKINVQKPQAFLYTINRQTESQIMSE
LPFTIAAKRIKFLGIQLTRDVKDLFKENYKPQLNEIKEDTNK

>gi568815596r:70731053_70935776|GENSCAN_predicted_CDS_5|669_bp
atgggagccatagcccaccacccggggtgcctgcgcaaggaagtcggcgaagagcgaaac
ccctcggtgctatgcagaaaggcctttgacaaaattcaacagtgcttcatgctaaaaact
ctcaataaactagatgttgatggaacatatctcaaaataataagagctatttatgacaaa
ctcacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaacaggc
acaagacaaggatgccctctctcaccacccctattcaacatagtgttggaagttctggcc
agggcaattaggcaagagaaagaaataaagggtattcaattaggagaagaggaagtcaaa
ttgtccctgtttgcagatgacatgattgtatatttagaaaactccatcatctcagcccaa
aatctccttaagctgataaacaacttcaccaaagtctcaggatacaaaatcaatgtgcaa
aaaccacaggcattcctctacaccattaacagacaaacagagagccaaatcatgagtgaa
ctcccattcacaattgctgcaaagagaataaaattcctaggaatccaacttacaagggat
gtgaaggacctcttcaaggagaactacaaaccacagctcaatgaaataaaagaggacaca
aacaaatga

>gi568815596r:70731053_70935776|GENSCAN_predicted_peptide_6|503_aa
MLDTGSEHLNRILKALPALQSAGSEGQNGSAESLGEGGTRDSDRARRKLRGGNKEIPTFY
PCLVVRSPVTASDLRGTQDFAAYHGLSLILEPLGACNRLSVCVPVHSPPGMRVSPRSPSL
RTLVIDPAEPAGAQRLRFSGKERSGEAGSAVEGLAVAVSMGDGGAERDRGPARRAESGGG
GGRCGDRSGAGDLRADGGGHSPTEVAGTSASSPAGSRESGADSDGQPGPGEADHCRRILV
RGSFWGVRGARRPNPLEARTRSGYLGLQEREEMAVACTRSQSKPPGSSLPLLRSTDAKGT
IREIVLPKGLDLDRPKRTRTSFTAEQLYRLEMEFQRCQYVVGRERTELARQLNLSETQVK
VWFQNRRTKQKKDQSRDLEKRASSSASEAFATSNILRLLEQGRLLSVPRAPSLLALTPSL
PGLPASHRGTSLGDPRNSSPRLNPLSSASASPPLPPPLPAVCFSSAPLLDLPAGYELGSS
AFEPYSWLERKVGSASSCKKANT

>gi568815596r:70731053_70935776|GENSCAN_predicted_CDS_6|1512_bp
atgctggacacaggttctgagcacctgaaccggatactgaaggctctgccagccctccag
agcgctggcagtgaaggacagaacggctccgcagaaagcctcggagaaggcgggacccgg
gacagcgacagagcacggagaaagctccgcgggggaaacaaggaaatccccaccttctat
ccctgtctcgtggtgcggtcccctgtgacggcctcagacctgcggggaacccaggacttt
gccgcctaccacggtctcagccttattctcgagcctctgggcgcctgcaaccggctgtct
gtctgtgtgcctgttcactctcccccggggatgcgggtctctccgaggtcgccttctttg
cgcactttggttattgatcctgctgaaccggccggagcgcagcggctgcgcttctctggg
aaagaacggagtggggaggctggctccgccgtagaggggttggcagtggcggtcagcatg
ggcgatgggggcgccgagcgcgaccggggccccgcgcgccgggcggagtctggtggcggc
ggtgggcgctgcggagaccgcagcggagcgggggacttgcgagctgatggcggtggccac
agcccaacggaggtggccgggacctcagcctccagtcccgcaggctccagggagagtgga
gccgacagcgacgggcagcccgggcccggcgaggcagaccactgccgccgcatactggtg
cgaggtagcttctggggagtccgaggggcaaggaggccaaatcctttggaagcccggaca
aggagtgggtacctggggctacaggaaagggaggagatggcagtggcatgcacccgcagt
cagtcaaagccaccgggctcaagtcttcctcttttgcgcagcaccgatgccaaagggaca
attcgggaaattgtcctgcctaagggcctggacctggaccggcccaagcggacacgtaca
tccttcactgccgagcagctgtaccgcctggagatggagttccagcgctgccagtatgtg
gtgggccgcgagcgcactgagctggcccgccagctgaacctctccgagacccaggtgaag
gtctggttccagaaccgccgcaccaagcagaagaaagaccagagcagagacctggagaag
cgggcgtcctcctcagcctccgaggcctttgccacctccaacattctgcggctgctggag
cagggccggctgctctctgtgcccagggcccctagcctcctggcgctgacccctagcctg
ccaggcctacctgccagccacaggggcacctccttaggtgaccccaggaactcctcccca
cgcctcaacccgctgtcctcggcctcagcgtcccccccactgccgccccctctgccagct
gtctgcttttcctcggccccgctcctggatctgcctgccggctacgaactgggttcctcg
gccttcgagccatacagctggctagaacggaaagtgggcagcgccagcagctgcaagaaa
gctaacacttaa