GENSCAN 1.0 Date run: 8-Nov-116 Time: 17:13:18 Sequence gi568815590r:65619293_65888941 : 269649 bp : 39.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 3062 2971 92 2 2 86 100 94 0.146 8.27 1.01 Init - 8085 7924 162 0 0 88 89 220 0.903 21.88 1.00 Prom - 19990 19951 40 -5.05 2.05 PlyA - 20434 20429 6 1.05 2.04 Term - 25572 25296 277 2 1 94 54 223 0.687 13.45 2.03 Intr - 25952 25808 145 0 1 7 58 126 0.249 -0.08 2.02 Intr - 59826 59754 73 0 1 80 77 54 0.414 1.56 2.01 Init - 63620 63507 114 2 0 62 70 72 0.510 3.06 2.00 Prom - 70524 70485 40 -3.45 3.00 Prom + 73514 73553 40 -6.55 3.01 Init + 75819 75916 98 2 2 48 52 68 0.179 -0.97 3.02 Intr + 85402 85637 236 1 2 126 62 268 0.968 24.41 3.03 Intr + 87718 87964 247 2 1 107 78 229 0.887 19.40 3.04 Intr + 88551 88719 169 0 1 52 97 271 0.896 23.43 3.05 Term + 89684 89752 69 1 0 93 38 56 0.856 -1.94 3.06 PlyA + 91237 91242 6 1.05 4.04 PlyA - 91261 91256 6 1.05 4.03 Term - 97344 97114 231 0 0 94 39 117 0.061 2.89 4.02 Intr - 108009 107878 132 0 0 65 96 47 0.090 3.12 4.01 Init - 115600 115502 99 1 0 50 116 125 0.894 11.81 4.00 Prom - 116752 116713 40 -3.65 5.05 PlyA - 118902 118897 6 1.05 5.04 Term - 120305 120294 12 2 0 138 39 3 0.008 -2.37 5.03 Intr - 128511 128360 152 1 2 100 93 77 0.173 8.36 5.02 Intr - 160511 160428 84 0 0 72 93 55 0.324 3.27 5.01 Init - 176206 176077 130 1 1 77 82 64 0.182 5.06 5.00 Prom - 194900 194861 40 -4.25 6.00 Prom + 202165 202204 40 -4.85 6.01 Init + 204981 205150 170 2 2 91 -11 119 0.017 1.35 6.02 Intr + 222462 222821 360 0 0 40 -16 343 0.173 12.51 6.03 Intr + 223138 223476 339 1 0 24 44 203 0.173 3.16 6.04 Intr + 225449 225585 137 2 2 106 85 37 0.347 4.49 6.05 Intr + 245424 245917 494 1 2 63 31 284 0.231 11.99 6.06 Intr + 246123 246526 404 2 2 32 53 228 0.460 6.00 6.07 Intr + 248614 248803 190 1 1 48 68 106 0.482 3.27 6.08 Intr + 248883 248931 49 2 1 98 80 19 0.564 -0.57 6.09 Intr + 249073 249175 103 2 1 73 82 49 0.455 1.11 6.10 Term + 249246 249576 331 0 1 78 53 126 0.218 1.24 6.11 PlyA + 250573 250578 6 1.05 7.03 PlyA - 251795 251790 6 1.05 7.02 Term - 252693 252509 185 1 2 39 42 191 0.304 6.32 7.01 Intr - 255501 255449 53 2 2 73 74 58 0.020 0.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 41194 41058 137 2 2 93 40 92 0.837 2.10 S.002 Init + 119613 119736 124 2 1 58 86 54 0.875 2.61 S.003 Term + 121639 121745 107 2 2 125 32 88 0.886 4.49 S.004 Sngl + 204981 205196 216 2 0 91 42 143 0.849 4.92 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:65619293_65888941|GENSCAN_predicted_peptide_1|85_aa MSEEPDALSVVNQLRDLAADPLNRRAIVQDQGCLPGLILFMDHPNPPVVHSALLALRYLA ECRANREKMKGELGMMLSLQNVIQN >gi568815590r:65619293_65888941|GENSCAN_predicted_CDS_1|255_bp atgagtgaagagcctgacgctctatcggtagttaaccagttacgggatctagcagcagat ccgttaaacagaagagccatcgtccaggatcagggatgtctgcctggccttattttattt atggaccatcccaaccctccagtcgtccactccgctttgcttgctcttcgatacttggca gaatgccgtgcaaacagagaaaagatgaaaggagaactgggtatgatgttgagcttacaa aatgttatacagaan >gi568815590r:65619293_65888941|GENSCAN_predicted_peptide_2|202_aa MDPIINADLETNDGKQLIWHPQRSKNKAGEQSGYPKRKQAKVLSGDEEDKKLTKKNLERI CQGRLAVHPRSPAPEPVKGAAPKGAAAKGGLPRTAAWRPAREPGVRKMDRRASHRRQACS RLHHPLANRFSEFPPYSPRPRGSQHPLRLLRLLAWLSRGVAANQPIPAPSLSQADQSAVS SGSEAGTRSSAEDAATAPQAGC >gi568815590r:65619293_65888941|GENSCAN_predicted_CDS_2|609_bp atggatcctataattaatgctgatcttgaaaccaatgacggaaagcaacttatatggcat ccacagagatctaaaaataaagcaggggaacagagtggctatccaaagaggaagcaagca aaggtcttgtctggagatgaagaggacaaaaagctcacaaagaaaaatttagaaaggatt tgccaaggccggctggcggtgcacccgcgcagtccggctcctgagcccgttaagggcgca gcacccaagggagccgccgcgaagggcggtctccccaggaccgccgcctggcggccggcg cgggaaccgggggtccgaaagatggaccgaagggcatcccatcgccgtcaggcctgcagc cgacttcatcacccgctagccaaccgcttttccgagttcccaccctactcacctcggccg cgcggcagccagcaccctctccggctcctcaggcttctagcttggctgtcccgtggcgtt gctgccaatcagccaatccccgccccaagcctgagccaggctgatcaatcagctgtttcc tcaggctcggaggcgggcacaagaagctcggcagaggacgcggcaaccgctccgcaagcc ggctgctga >gi568815590r:65619293_65888941|GENSCAN_predicted_peptide_3|272_aa MKNRKEPDQEIDQLGDDQNLTELRNWQKRFKAKTEVRSRPPLQDDLLFFEKAPSRQISLP DLSQEEPQLKTPALANEEALQKICALENELAALRAQIAKIVTQQEQQNLTAGDLDSTTFG TIPPHPPPPPPPLPPPALGLHQSTSAVDLIKERREKRANAGKTLVKNNPKKPEMPNMLEI LKEMNSVKLRSVKRSEQDVKPKPVDATDPAALIAEALKKKFAYRYRSDSQDEVEKGIPKS ESEATSERVLFGPHMLKPTGKMKALIENVSDS >gi568815590r:65619293_65888941|GENSCAN_predicted_CDS_3|819_bp atgaagaatagaaaagagccagaccaagaaatagaccaactaggagatgatcagaattta actgaattaagaaactggcagaagaggttcaaagccaagacagaggtcagatcaaggcca ccccttcaggatgaccttcttttctttgagaaggccccaagcagacagatttccttacca gacttgtctcaagaagagcctcagctgaagaccccagcgctggcaaatgaggaagcactg cagaagatttgcgctctcgaaaatgaacttgctgctctcagagctcagattgccaaaatt gtgacccagcaggagcagcaaaatctcactgcaggtgacttagattctaccacatttggt accataccaccacaccctccacctcccccaccgcccctgcctccccctgcactggggctc caccaaagtacatctgctgttgatctgattaaagaacgaagagagaaaagagccaatgct ggaaagactttggttaagaacaatccaaagaaacctgaaatgccaaatatgctagagatc cttaaagagatgaacagtgtaaaacttcggtcagtgaagaggtcagagcaagatgtgaag cccaagccagtggatgctactgaccctgctgccctcatagcagaggctctgaaaaagaaa tttgcttatcggtatcgaagtgatagccaagatgaagttgaaaaaggaattccaaagtct gaatcagaggccacctcagagagagtgttgtttgggccacacatgttgaagccaacagga aaaatgaaggctttaattgaaaatgtatcagactcctaa >gi568815590r:65619293_65888941|GENSCAN_predicted_peptide_4|153_aa MIQEDYHSQNPYHNAVHAADVTQAMHCYLKEPKLANSVTPWDILLSLIAAATHDLDHPGV NQPFLIKTNHYLATLYKANLPGKPNYEQLVSMDSEQGKQVASCTHVPLGAQKTHVFCDIH VCGPGYPLPLEGARDSLPVVASLHQLLIGTCIK >gi568815590r:65619293_65888941|GENSCAN_predicted_CDS_4|462_bp atgattcaagaagattaccacagtcaaaatccttaccataacgcagtccacgctgcggat gttactcaggccatgcactgttacttaaaggaacctaagcttgccaattctgtaactcct tgggatatcttgctgagcttaattgcagctgccactcatgatctggatcatccaggtgtt aatcaacctttccttattaaaactaaccattacttggcaactttatacaaggcaaatctt cctgggaagccaaattatgagcagctggtgagcatggattctgagcaagggaagcaagtt gcctcgtgcacccatgtccctttgggtgcacagaagacccatgtcttctgtgatattcat gtttgtggaccagggtaccctcttcccctggaaggtgctagagattctctgccagtagtt gcttctctacatcagcttctgattggcacatgcattaaataa >gi568815590r:65619293_65888941|GENSCAN_predicted_peptide_5|125_aa MESNPTHTQLSIQPKIQGDPPSDFWSSLSSQLPPLWHSALQIPGDVRVRSRAGFESERRG SHPYIDFRIFHSQSEIEVSVSARNIRRLLSFQRYLRSSRFFRGTAVSNSLNILDDDYNGQ AKEIV >gi568815590r:65619293_65888941|GENSCAN_predicted_CDS_5|378_bp atggagtctaatcctacacatactcagcttagtattcagccaaagattcaaggggatcca ccatcagacttctggagctccttgtcctcacagctccctcctctctggcattcagccctg caaattccaggagatgtacgtgtaaggagccgagcaggatttgaatcagaaagaagaggt tctcacccatatattgattttcgtattttccactctcaatctgaaattgaagtgtctgtc tctgcaaggaatatcagaaggctactaagtttccagcgatatcttagatcttcacgcttt tttcgtggtactgcggtttcaaattccctaaacattttagatgatgattataatggacaa gccaaggaaatagtctag >gi568815590r:65619293_65888941|GENSCAN_predicted_peptide_6|858_aa MKLVHITLKFNQEFLLTDEQRKWFPELKFTPGEDAVNTAEMTTKNSGCYINLVDKAGRGG TGCKRGQHRQSRKGNPAALGLLGREEQVPGLQSSSAARTEQSSARKDSWSQRPDPGRAGL RSSDQLGPPPPPPPPPPESCSSPPPEPDFTPPPARARRNSHFAAADLAPTTAAATVTGGV PACRPQKPKAPGAVAVLSPDRGVTAGFSLLEFPFMGFARDRCGSDVSGPREHRPPGYVTA SPSVGPGSSARLADAAPGPSRARGAERGAENSEVRRGPGDKGKKPRLRKKPPCHQFSNHI PSEVQTIYPSSQILLMNQYRPPHVAIPLSRQNEQPGAIFGPWPTLTSASVLECPLQEVAG TDGIVSVHVPFSLTDFSQINKRLGSFPEDPTFYIREFQYLTQSYEQTWHDLYVILSSTLT PEHQDHIWTLAQAHADTIHHQAPVRPTGAEVVPNEDPHWDYQDGSPGHHHQDHMIVCLLA GLKKGALKQSTMKNFQKSPKPSKSSTIMMKKAKGKNRQSFKCLPPPSGALQAHRAAAPHR SLLDIHLHLAPVSSAAMKATGPDNAQTQVSPPGRAPSTEDPTGSQTVGGPHKDRPHPFLS QPNPPTQISSALPLKTDGALERTPQQLPSLPLSQGALTLIHLERMRFPNYKNTPVINGSL ISKLLQATRLPQKVAIIHCRGHQTPDNPISAGNALADQKKRRTSEPQTFKSKDHAVFEKS PSPALSAIQCHPRSPSGHGLFLPTKPRASLCLYFLWVVEVFPTTSEVANVVTQTLTMHIL PRFGLQTSIQSDNGPAFISQITQGVSTSLGIKWVLHTPYRPQSSGKVEKINSVLKAQLTK LALETCQSWTKKISFSPS >gi568815590r:65619293_65888941|GENSCAN_predicted_CDS_6|2577_bp atgaaacttgtccacataacactcaaatttaatcaggaattccttcttactgatgagcaa agaaagtggtttccggagttgaaatttactcctggtgaagatgctgtgaacactgctgaa atgacaacaaagaattcaggatgttacataaacttggttgataaagcagggcgagggggg acggggtgcaagcggggccagcacaggcaaagtcggaaagggaaccccgccgccctgggg ctcctcggccgagaggagcaggtacccggactgcagagttcgagcgcagcacgaacggag caaagcagtgcaagaaaagacagctggagccagcggccagacccagggcgcgcgggactc aggagcagcgaccagctcgggccgccgccgccgccgccgccgccgccgccggagtcctgc tcctcccctcccccggagcctgacttcactccgccgccggcgcgagcccggcgtaattca cactttgcagccgcggaccttgcgccaaccacggctgccgccaccgtgacaggcggcgtt ccggcctgccgaccccagaaacccaaagcacccggcgcagtcgccgtcctcagccctgac cgcggggtcactgctggcttttccctccttgaattcccattcatgggtttcgctagagac cgctgcggcagtgacgtcagcgggccgcgcgagcaccgtcctcctggctacgtcactgcc agtccctcggtggggccaggctccagcgctcgcctggcagatgcggccccggggccatcc cgagcccgaggggcggagaggggagccgagaattcggaagtgagaaggggacccggagac aaagggaaaaaaccaagactgagaaagaaacctccttgtcaccagttttctaatcatatt ccttccgaggtccaaaccatctatccaagcagtcagatactgctcatgaatcagtataga cctccacatgtagctattcccctttccagacaaaatgaacaaccaggcgccatcttcggc ccatggcccacccttacttcagcatctgtgctagagtgcccccttcaggaagtagcagga actgacggtattgttagcgttcatgttcccttctccctcactgatttctctcaaattaac aaaagacttggttcatttccagaagaccctaccttttatattagagagtttcaatacctc acccagtcttatgaacaaacttggcatgacctctacgttatcctctcttccaccctcacc ccagaacaccaggaccatatctggaccctagctcaggcacatgctgatacaattcatcac caagctcctgtccggcctactggtgcagaggtagtccctaacgaggacccccactgggat tatcaagacgggtcccctggacaccaccatcaagaccacatgattgtgtgtctccttgca ggactcaaaaagggtgccctaaagcagtcaactatgaaaaactttcagaaatcacccaag ccttcaaagtcttcaacaatcatgatgaagaaagcaaaaggcaaaaacaggcagagtttc aaatgcttgcctccgccatcaggggccctgcaggctcacagggctgcagctccacacaga agcctcctagacatccacctccacctggcgcctgtttcaagtgcggcaatgaaggccact ggtccagacaatgcccaaacccaggtaagcccaccaggccgtgccccctctacagaagac cccactggaagtcagactgtgggtggcccccacaaggaccgcccccatcccttcctgagc cagccaaatcctcctactcagatctcatcagccttgccactgaagactgacggtgccctg gaacggacgccccagcaactaccatctcttcctctgagccaaggtgctctcacactcatc catctggaaagaatgaggttccctaactacaaaaacactcctgtcataaatggctctctc atcagcaaactccttcaagctaccaggctcccacagaaagttgccatcattcattgcagg ggccaccaaaccccagacaatcctatatcagcaggaaatgcgctagcagatcagaagaaa aggaggacttctgagccccaaaccttcaaaagcaaagaccatgccgtgttcgagaaatca cccagtcctgctctatctgccattcagtgtcatcccaggtctccctctggccacggcctt ttcctacccaccaagcccagggccagtctttgtctgtactttctctgggtggtagaagtg ttcccaacaacttcagaagttgcaaatgtcgtcacacaaactctcaccatgcatatactt ccccgctttggactccaaacatccatccagtctgataacgggcccgccttcatcagtcaa attactcaaggtgtctctacatcattaggtataaaatgggttctccacacaccctacagg cctcaatcttcaggcaaagttgaaaaaattaactctgtccttaaagcccaactcaccaag ctggctctagaaacttgccagtcatggacaaaaaaaatctccttttcgccctcatga >gi568815590r:65619293_65888941|GENSCAN_predicted_peptide_7|79_aa XGVSSGALKKQATPSATGVRSKGTLNGSKQVHYQIKSNPGLSPVSGTFKPYLDQKFAQRN WESLKQKSMEIQNPREDLP >gi568815590r:65619293_65888941|GENSCAN_predicted_CDS_7|240_bp nnaggtgtgagctctggggcactgaagaagcaagccactccctctgccacaggggtcaga agtaaggggactttaaatggatccaagcaagttcattaccagatcaaatccaatcctggc ctcagtccagtttctgggactttcaaaccctacttggatcagaaatttgctcagagaaac tgggagagcttaaaacagaaatccatggagattcagaatccgagagaagacttaccatga