GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:48:00 Sequence gi568815586r:106877560_107078714 : 201155 bp : 39.40% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 8345 8456 112 2 1 113 42 162 0.687 11.15 1.02 PlyA + 8878 8883 6 1.05 2.00 Prom + 9602 9641 40 -6.25 2.01 Sngl + 13632 14126 495 2 0 42 41 216 0.209 8.12 2.02 PlyA + 14336 14341 6 -0.45 3.05 PlyA - 14624 14619 6 1.05 3.04 Term - 16750 16611 140 2 2 23 54 106 0.355 -2.16 3.03 Intr - 18276 18107 170 2 2 47 85 242 0.441 18.47 3.02 Intr - 24606 24434 173 0 2 57 27 242 0.247 12.82 3.01 Init - 24779 24681 99 2 0 96 13 87 0.153 2.32 3.00 Prom - 24885 24846 40 -8.35 4.00 Prom + 25092 25131 40 -6.35 4.01 Sngl + 25488 25934 447 2 0 65 42 327 0.475 21.77 4.02 PlyA + 26439 26444 6 1.05 5.04 PlyA - 26482 26477 6 1.05 5.03 Term - 36065 35819 247 1 1 67 50 155 0.767 3.88 5.02 Intr - 36339 36223 117 2 0 109 27 150 0.687 9.66 5.01 Init - 38235 38078 158 0 2 53 97 53 0.484 2.16 5.00 Prom - 61663 61624 40 -4.95 6.00 Prom + 71828 71867 40 -4.25 6.01 Sngl + 77800 77997 198 0 0 69 38 281 0.879 16.12 6.02 PlyA + 79102 79107 6 1.05 7.00 Prom + 82964 83003 40 -7.55 7.01 Init + 86527 86582 56 0 2 77 100 30 0.460 3.91 7.02 Intr + 87525 87606 82 0 1 28 115 54 0.728 0.82 7.03 Intr + 89552 89621 70 1 1 86 115 78 0.794 8.14 7.04 Term + 93546 93832 287 1 2 79 44 323 0.991 21.58 7.05 PlyA + 96403 96408 6 1.05 8.02 PlyA - 97116 97111 6 1.05 8.01 Sngl - 101155 99998 1158 1 0 83 44 442 0.535 33.25 8.00 Prom - 106888 106849 40 -8.05 9.00 Prom + 107135 107174 40 -6.55 9.01 Init + 107726 107816 91 1 1 84 7 61 0.570 -3.18 9.02 Term + 109486 109844 359 2 2 98 43 344 0.871 24.59 9.03 PlyA + 109855 109860 6 -0.45 10.12 PlyA - 110208 110203 6 1.05 10.11 Term - 110560 110394 167 2 2 32 41 140 0.032 0.80 10.10 Intr - 119827 119735 93 2 0 105 100 86 0.926 10.52 10.09 Intr - 120131 119929 203 1 2 82 77 136 0.979 9.91 10.08 Intr - 120420 120356 65 0 2 68 99 7 0.567 -3.60 10.07 Intr - 122303 121992 312 2 0 85 115 92 0.984 7.06 10.06 Intr - 122523 122383 141 0 0 111 61 93 0.996 8.53 10.05 Intr - 123809 123721 89 0 2 101 16 55 0.980 -1.63 10.04 Intr - 124389 124205 185 2 2 79 95 184 0.998 16.71 10.03 Intr - 127689 127547 143 0 2 88 89 129 0.965 11.23 10.02 Intr - 144633 144525 109 2 1 110 86 78 0.899 9.17 10.01 Init - 159013 158925 89 1 2 23 64 86 0.135 -0.14 10.00 Prom - 162574 162535 40 -4.65 11.03 PlyA - 163713 163708 6 1.05 11.02 Term - 165619 165455 165 1 0 93 37 89 0.703 1.23 11.01 Init - 165734 165657 78 2 0 92 98 50 0.973 7.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_1|37_aa XEELLKPMGLKPDGTITPLEEALNQYSVIEETSSDTD >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_1|114_bp nnagaggagttgcttaaaccaatgggactaaaacctgatgggacaataacgcctttggag gaagcactcaaccagtactctgtcatcgaagagaccagctctgacacagactaa >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_2|164_aa MGKRFTEHKLLSMMETAIPCLVIGHMSSSLSGMKRTPPGRALWLPYREWIDAGTDRRDQT ESCCLYLGGASRAWIRTLAVDVEKGMDSRNISKIKETHFDLGERQERAHLTHENENVRRK DHLNHWLLTCELNNSGFGSRRIELRVNSRSLQMCMVTKHYLYTK >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_2|495_bp atgggcaaaaggtttacagaacacaagttgttgagtatgatggaaacagctatcccatgt ctcgtgattggacacatgagcagcagcctttcaggcatgaaaaggaccccaccaggaaga gcactctggctgccgtatagagaatggattgatgcagggacagacagaagagaccagaca gaaagctgttgcctctacctgggaggggcaagtagggcctggattagaacactggcagta gacgtagaaaaggggatggactcaagaaatatttcaaagataaaagaaacacactttgat cttggagaaaggcaagaaagagctcatttaactcatgagaatgagaatgttcggagaaaa gatcatctgaaccattggctcctaacctgtgaactaaacaactctgggtttgggagtagg agaatagagttaagagttaattcaaggagtctgcaaatgtgtatggtcaccaaacattat ttatataccaaataa >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_3|193_aa MGFRDLKSPASLQVLNDYLVDKSYIEGMCHHKHIKSYEKEKASLPGVKKAVGRYGPAKVE DTTGSGATNSKDDDEIDLLGSDEEESEEAKSSVPAVTALSDDDDDDDNDDGYDDSKPFCA SLEKLQAVLYQLEKNSLSTPLLNTVKTGSRSPVSLSSASTSCPTGRSSGAMACMELSSPV IAMPSGIPPEGPA >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_3|582_bp atgggtttcagagatctgaaaagcccggctagcctccaggtgctcaacgattacctggtg gacaagagctacatcgagggtatgtgccatcacaagcacatcaagtcttatgaaaaggaa aaggccagcctgccaggagtgaagaaagctgtgggcaggtatggtcctgctaaggtggaa gacactacaggaagtggagctacaaatagtaaagatgatgatgaaattgatctccttgga tctgatgaggaggaaagtgaagaagcaaagagttcagttcctgctgttacggctttgtct gatgatgatgatgatgatgataacgatgatggttatgatgatagcaagcctttttgtgca agcttagaaaaactccaggccgttttgtaccagttagaaaaaaactccctttccacacct ctcctaaacacagtcaagactgggtcaagatccccagtatcactgtcttccgcctccaca tcctgtcccactggaaggtcttcaggggcaatggcatgcatggagctgtcatctcctgtg atagcaatgccttctggaataccccctgaaggacctgcctga >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_4|148_aa MPMMTVMMLPVRLRERKDERTTRDPRGLEAAEKGPQEIPIPCKLSFVIRHRALLDHTDGI LIQPRFCKRSKKFLQGVGIEGSSINKNKCTFQEAAKALTFPDYALEVPVYTRLDSRATWI REPMVQLDHPACSGHPPREWSARIQVQI >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_4|447_bp atgccgatgatgacggtgatgatgctgccagtgcgtctcagagagagaaaggacgaacgc acaacccgggatccaaggggcctggaagctgctgagaaaggccctcaggaaatcccaata ccgtgcaaattgagctttgtaataaggcacagggcacttttggatcacacggacgggatt cttatccagcctaggttctgcaagagatcaaaaaagttcctccaaggagtgggcatagaa gggagcagtataaataaaaataaatgcacattccaagaagctgctaaagctctaaccttt ccggactacgctttggaagtgccagtttacaccaggctcgacagtcgagccacttggatc cgagagcccatggtccagctggaccacccagcctgctcaggacacccgcctcgcgagtgg agcgctaggattcaggtccagatctga >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_5|173_aa MSLSRNSRESGQSLVMSVVPADRGVGATVPEAGMALYVPGQHNWRAGSWGARRTGDSLRL SVSNKRGFVRRTVGSEGGPGHGVTCICAKQQGHLRAGASWQVRSSEENGAFLRLFPGTTS LDIWGDEKRKRQGIACSSGSISVSVVERQKGSSCGSATYQPYIRSLTYSRHCE >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_5|522_bp atgtcattgagcagaaactccagggagagtgggcagagccttgtcatgtcagtggtccca gcagacagaggagtaggagcaacagtcccagaggcgggaatggccttgtatgttcctggc cagcacaactggagagcagggagctggggggccagaaggactggcgactctctgagactc agcgtttccaacaagcggggctttgtcagaagaactgtgggctccgagggaggccctggt catggggtcacctgcatttgtgcgaagcagcaggggcacctgcgggccggggcttcttgg caggtgagaagcagcgaggagaatggggcttttctcagactgttccctggaacaacatct cttgacatctggggagatgagaagcggaaaagacaaggcattgcgtgttcatctgggtcc ataagcgtctcggtggtggagaggcagaaagggagctcatgtggaagcgctacttaccag ccttacatccggagcttaacatattccagacactgtgaatga >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_6|65_aa MQGRLTSLDEESAGETGDRPTFPKAKALNSEDRYSARNAGTADVKTPGLGAAAYASPGLR APRPI >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_6|198_bp atgcaagggaggttgacgtccctggacgaagaaagtgctggagaaactggcgaccgtccc accttccctaaggcgaaggctctgaattccgaagaccgttactcggcgcggaacgcgggc accgccgacgtcaaaacaccgggcctgggggccgcggcctacgcgtctcccgggcttcgc gcccctcgcccgatttga >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_7|164_aa MNKLDKGVCPLEINCPAGKPHSLSSSPTALDATLPHFTGCSFLLIQEIMNQTDKNQQEIP SYLNDEPPEGSMKDHPQQQPGMLSRVTGGIFSVTKGAVGATIGGVAWIGGKSLEVTKTAV TTVPSMGIGLVKGGVSAVAGGVTAVGSAVVNKVPLTGKKKDKSD >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_7|495_bp atgaacaaattggacaaaggtgtctgtcctcttgaaattaattgtccagcagggaaacct cactctttgtcttcctctcctactgcccttgatgcgaccctgccacactttactggctgt tccttcttgctgatccaggagatcatgaatcagacagataaaaatcaacaagaaatccca tcataccttaatgatgaaccaccagaaggttcaatgaaagatcacccacagcagcagcca ggcatgttgtcccgtgtgactgggggtatcttcagtgttacaaagggagctgttggtgcc accattggtggtgtggcttggattggtggaaagagtctggaagtgaccaaaacagctgtt acaactgtgccttccatgggaatagggctggtgaaagggggtgtctctgctgtggctgga ggtgttacagctgttgggtctgctgttgtaaacaaagtgcccttaacaggaaagaagaaa gacaaatctgactga >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_8|385_aa MLWKLLLRSQSCRLCSFRKMRSPPKYRPFLACFTYTTDKQSSKENTRTVEKLYKCSVDIR KIRRLKGWVLLEDETYVEEIANILQELGADETAVASILERCPEAIVCSPTAVNTQRKLWQ LVCKNEEELIKLIEQFPESFFTIKDQENQKLNVQFFQELGLKNVVISRLLTAAPNVFHNP VEKNKQMVRILQESYLDVGGSEANMKVWLLKLLSQNPFILLNSPTAIKETLEFLQEQGFT SFEILQLLSKLKGFLFQLCPRSIQNSISFSKNAFKCTDHDLKQLVLKCPALLYYSVPVLE ERMQGLLREGISIAQIRETPMVLELTPQIVQYRIRKLNSSGYRIKDGHLANLNGSKKEFE ANFGKIQAKKVRPLFNPVAPLNVEE >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_8|1158_bp atgttgtggaagctgctgctgagatcccagtcctgcaggctgtgttctttcagaaagatg cgatcacctccaaaatacagacctttcttagcatgcttcacctatacaactgataaacag tcgagcaaagaaaatacaagaacagtggaaaagctctataaatgttcagttgacattagg aaaattcgtagattaaaaggatgggtacttttagaggatgaaacctatgttgaagaaatt gcgaatattttacaagaactaggtgccgatgagactgctgtagccagtattttggaacgc tgcccggaagcaattgtctgtagtccaaccgctgttaacacccagagaaaactctggcag ttggtctgcaaaaatgaggaagagttaatcaagttaatagagcagtttccagaatctttc tttactattaaagaccaagagaaccagaagctgaatgttcagttctttcaagagttggga ctaaaaaatgtggtcattagcagacttttgacagctgcacctaatgtttttcataatcct gttgagaagaataagcaaatggtaagaattctccaagagagttatctagatgtaggtggc tctgaggccaacatgaaagtttggctactaaaattgttaagccaaaacccatttattttg ttaaattctcccacagctataaaggaaacactagaatttctccaggagcaaggtttcacc agctttgaaattctccagcttctatccaaactcaaaggatttctttttcaactttgccca agaagtatacagaatagtatttccttctctaaaaatgcttttaaatgcacagatcatgac ctgaagcaattagttttgaaatgtcctgcccttttatattattctgttccagttttagaa gagagaatgcaaggattattgagagaaggaatttccatagctcagataagagagacgcca atggttcttgaattaacaccacagatagtacagtacaggataaggaaactgaattcctca ggctacagaataaaggatggacatctagcaaatctaaatggatcaaaaaaagagtttgaa gctaattttggcaaaattcaggccaaaaaagtaaggccattatttaaccctgtggcacca ttaaatgttgaagaatga >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_9|149_aa MAYKASLGSATTLIPISLSCTSCICFLSVPASPQSAVPEAWSRDFGPRSQEAPDSSQQRQ PGHQSLRPWPIRAREMTANPRPPKTRPRVPTQPVRARDTTTRPSPHHEGESEWESAFQAA QPVLAGASGIFTPFPSQHFRKRSSSSFDA >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_9|450_bp atggcctacaaggcctctttgggctctgccaccaccctaattcccatttccctctcctgc accagctgcatttgcttcctgtcagtccctgcctccccacagtccgcagtccccgaggcc tggagccgggatttcggtcctcggtctcaagaagctcccgacagttctcagcagcgccaa ccaggccatcagtcccttcgcccatggccaatcagggctcgggagatgaccgccaacccc cgcccccccaagacacgcccccgcgtcccaacgcagccagtcagagcgcgggacacaact acgcgtccctccccgcaccacgagggcgagtcggagtgggaatcggctttccaggcagcc cagcctgtgctggcgggggcatctggcattttcacaccctttccttcccagcattttagg aagcggagttccagtagctttgacgcttag >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_10|531_aa MDWEVIAVGDFVHRDDMTTWHLGEDWTGWQFLLQCLEDLDANLRKLNSRLFVIRGQPADV FPRLFKEWNITKLSIEYDSEPFGKERDAAIKKLATEAGVEVIVRISHTLYDLDKIIELNG GQPPLTYKRFQTLISKMEPLEIPVETITSEVIEKCTTPLSDDHDEKYGVPSLEELGFDTD GLSSAVWPGGETEALTRLERHLERKAWVANFERPRMNANSLLASPTGLSPYLRFGCLSCR LFYFKLTDLYKKVKKNSSPPLSLYGQLLWREFFYTAATNNPRFDKMEGNPICVQIPWDKN PEALAKWAEGRTGFPWIDAIMTQLRQEGWIHHLARHAVACFLTRGDLWISWEEGMKFFHC YCPVGFGRRTDPNGDYIRRYLPVLRGFPAKYIYDPWNAPEGIQKVAKCLIGVNYPKPMVN HAEASRLNIERMKQIYQQLSRYRGLGLLASVPSNPNGNGGFMGYSAENIPGCSSSGRCEG SLYASATDGRTVHSVLRTPDSRHLQLGKPVFLTPTPHKQNHQAQLRAPCPR >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_10|1596_bp atggactgggaagtcattgcagttggagactttgttcatagagatgacatgaccacttgg catctgggtgaagactggactggatggcaatttttgcttcagtgtcttgaggatcttgat gccaatctacgaaaattaaactcccgtctgtttgtgattcgtggacaaccagcagatgtg tttcccaggcttttcaaggaatggaacattactaaactttcaattgagtatgattctgag ccctttggaaaggaacgagacgcagctattaagaaactggcaactgaagctggagtagaa gtcattgtaagaatttcacatacattatatgacctagacaagatcatagaactcaatggt ggacaaccgcctctaacttataaaagattccagactctcatcagcaaaatggaaccacta gagataccagtagagacaattacttcagaagtgatagaaaagtgcacaactcctctgtct gatgaccatgatgagaaatatggagtcccttcactggaagagctaggttttgatacagat ggcttatcctctgcagtgtggccaggtggagaaactgaagcacttactcgtttggaaagg catttggaaagaaaagcttgggtggcaaattttgaaagacctcgaatgaatgcgaattct ctgcttgcaagccctactggacttagtccttatctccgatttggttgtttgtcatgtcga ctgttttacttcaaactaacagatctctacaaaaaggtaaagaagaacagttcccctccc ctttccctttatgggcaactgttatggcgtgaatttttctatacagcagcaacaaataat ccacgctttgataaaatggaaggaaaccctatctgtgttcagattccttgggataaaaat cctgaggctttagccaaatgggcggaaggccggacaggctttccatggattgatgccatc atgacacagcttcgtcaggagggttggattcatcatctagccaggcatgcagttgcttgc ttcctgacacgaggggacctgtggattagttgggaagaaggaatgaagttttttcactgc tattgccctgttggttttggtaggagaacagatcccaatggagactatatcaggcgttat ttgcctgtcctaagaggcttccctgcaaaatatatctatgatccctggaatgcaccagaa ggtatccaaaaggtagccaaatgtttgataggagttaattatcctaaaccaatggtgaac catgctgaggcaagccgtttgaatatcgaaaggatgaaacagatctatcagcagctttca cgatatagaggactaggtcttctggcatcagtaccttctaatcctaatgggaatggaggc ttcatgggatattctgcagaaaatatcccaggttgtagcagcagtggaaggtgcgagggc agtctgtatgccagtgccaccgacggcagaactgtgcacagtgtgctacgaacacctgac agcagacacctccagctgggaaaaccagttttcctgactccaactccacacaaacagaat caccaggcccagctcagggcaccctgtcctcgataa >gi568815586r:106877560_107078714|GENSCAN_predicted_peptide_11|80_aa MTAQSGQGTVFARGKILLQLKHKGGGERKLGLGMLDHLAGAVVQLHLSLRDEGESRLLTA RARHPPAIVPILRWHSVVAL >gi568815586r:106877560_107078714|GENSCAN_predicted_CDS_11|243_bp atgactgctcagagtggccaaggtactgtttttgcaagaggcaagatactacttcagctc aagcacaagggtggaggggaaaggaagcttggcttggggatgttggaccacttggctggg gcggtggttcagttgcatcttagtctcagagatgaaggagagtcacggctactcaccgcc agagcaagacaccctccagccatagttccaattctacgatggcatagcgtagtagccctg tag