GENSCAN 1.0    Date run: 5-Nov-116    Time: 08:05:55

Sequence gi568815584r:56701738_56905456 : 203719 bp : 41.98% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1035   1244  210  2  0   97   84   114 0.916  10.09
 1.02 Intr +   6411   6571  161  2  2  119   -6   146 0.022   6.16
 1.03 Intr +  13811  13918  108  2  0   62   91    53 0.076   1.48
 1.04 Intr +  16566  16687  122  0  2   -2   40   176 0.079   3.02
 1.05 Intr +  18373  18618  246  2  0   71   81   118 0.060   5.91
 1.06 Intr +  31749  31849  101  1  2  107   42   111 0.011   7.31
 1.07 Intr +  34000  34052   53  0  2   34  105     2 0.005  -6.71
 1.08 Intr +  37676  37871  196  2  1   30  113   183 0.139  13.50
 1.09 Intr +  45907  46016  110  0  2   80   56    39 0.006  -1.94
 1.10 Intr +  57697  57828  132  1  0   74   91    38 0.019   1.54
 1.11 Term +  63680  63815  136  2  1  105   33   107 0.625   3.41
 1.12 PlyA +  63902  63907    6                                 1.05

 2.03 PlyA -  64369  64364    6                                 1.05
 2.02 Term -  74006  73935   72  2  0   62   41   121 0.457   1.73
 2.01 Init -  75433  75311  123  1  0   90   80    32 0.410   2.82
 2.00 Prom -  91293  91254   40                                -2.95

 3.06 PlyA -  92203  92198    6                                 1.05
 3.05 Term -  93135  92538  598  2  1   65   49   389 0.343  25.71
 3.04 Intr -  94210  94111  100  2  1   65   36    56 0.040  -3.65
 3.03 Intr -  97008  96889  120  1  0   44   52   103 0.076   1.85
 3.02 Intr -  97244  97178   67  2  1   96   64    44 0.106   0.36
 3.01 Init -  98360  98277   84  2  0   71   98    84 0.171   8.57
 3.00 Prom -  98998  98959   40                                -8.45

 4.06 PlyA -  99703  99698    6                                 1.05
 4.05 Term - 100661  99998  664  1  1   41   49   426 0.928  26.35
 4.04 Intr - 101732 101371  362  2  2   57  103   133 0.914   4.79
 4.03 Intr - 102602 102451  152  0  2   97   91   184 0.964  18.56
 4.02 Intr - 103486 103421   66  2  0   98   87    14 0.544   0.26
 4.01 Init - 103716 103623   94  0  1   60  114   101 0.544  10.49
 4.00 Prom - 105164 105125   40                                -6.15

 5.00 Prom + 105291 105330   40                               -10.25
 5.01 Init + 106576 106788  213  0  0   65   -5   277 0.687  14.89
 5.02 Intr + 107509 107651  143  0  2   92   -1   100 0.275  -0.27
 5.03 Term + 107853 108285  433  0  1   56   48   183 0.202   4.88
 5.04 PlyA + 108289 108294    6                                -0.45

 6.08 PlyA - 108681 108676    6                                 1.05
 6.07 Term - 110495 110251  245  0  2   77   32   200 0.353   8.38
 6.06 Intr - 110760 110549  212  2  2   68   23   139 0.323   3.03
 6.05 Intr - 115908 115574  335  0  2   20   62   262 0.294  10.24
 6.04 Intr - 122250 122210   41  1  2   81   74    12 0.014  -3.88
 6.03 Intr - 133073 132893  181  2  1   55   85   107 0.160   5.62
 6.02 Intr - 143142 142967  176  1  2   48   79    92 0.804   3.04
 6.01 Init - 144822 144762   61  0  1   55  116    85 0.974   9.46
 6.00 Prom - 146429 146390   40                                -8.15

 7.03 PlyA - 147455 147450    6                                 1.05
 7.02 Term - 147840 147706  135  0  0   61   48   148 0.607   5.14
 7.01 Init - 155684 155541  144  2  0   94   37    70 0.434   2.68
 7.00 Prom - 159276 159237   40                                -6.05

 8.03 PlyA - 159351 159346    6                                 1.05
 8.02 Term - 173863 173384  480  1  0   39   42   252 0.819   9.51
 8.01 Init - 191259 191107  153  0  0   79   44   163 0.624  11.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_1|524_aa
CGPVANRPRTSTGLGTSFIRDLRGDENIAYLHRINVNILVLTLFYGPARYYYWGKLGKGY
TGSFYFLQLHACARRLISCVGNEAESGSGRATVISEPHCPPSMVWALRTANWHLCSPGQY
LKVAIKLTSLSNEAAPPSQVACGCPFHTVSHLEALSMFFRNEEKKKRKNKNKQEEEEEEA
KKEKSKDLEVLGPHSHLAAIVGESSQQNWRVVGPPQPMVVVDMLLVLKYPAKFKCRMSSM
SFLGPHSSCWSSEKAQGSRAYYYLNQVAFSETEESCGLPPGKGVPGQLWSLTVGAAAWVL
LALHGPSYTLEAKPLQKSCGDGVRSFTQNAWCGCKKLMSLGALVNWNWGLGAAGLVELPL
LGHLNPQQADDRLQEKLHSDSKGPSPAPFHDALNDPASKTLPCSEETNHSSAIFTFKKLP
IHILSSPCLSVIKEQGFSHLKSTHTNKEKEKTQTNHTHSLEMRAGITKIKRTAFASRLPC
ALCAQATTDYLGQKNISASSALAGGSRTHIRIALKPSCHEKLIT

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_1|1575_bp
tgtggccccgttgctaacaggccacgaaccagtactgggttggggacttcttttataagg
gatctccgtggtgatgaaaatattgcgtatcttcaccgtatcaatgtcaacatcctggtt
ctgactttgttctatggtcctgcaagatattactattgggggaaactaggcaaaggatac
acaggatctttttatttcctacaactgcatgcctgtgcaagaagacttatttcctgtgtg
ggaaatgaagcagaaagtggcagtggacgagccaccgtcatttctgagccgcactgccca
cctagcatggtttgggctctcagaacagctaactggcacctttgctccccggggcagtat
ttgaaagtagcgattaaactcacatccctcagtaacgaagcggcacccccttcccaggta
gcatgtggttgtcctttccacacagtgtcacatttggaagctttgtccatgtttttcaga
aacgaagaaaagaagaagaggaagaataagaataagcaggaggaggaggaagaagaagca
aagaaagaaaagtctaaagatttggaagtcctgggcccacattcccacttggcagcaatt
gtgggtgagagttcccagcagaactggcgcgtggtaggtcctcctcaaccaatggtagtt
gttgatatgttattggtgctcaagtatccggccaagttcaagtgcagaatgagcagcatg
agcttcctggggccccacagcagctgctggtcctcagagaaagctcaaggcagccgtgct
tattattatttaaatcaggtagccttttcagaaactgaagaaagctgtggactccctcct
ggaaaaggagtcccagggcagctctggagcctgaccgtcggagcagctgcgtgggtcctg
ctcgctctccacgggccatcgtacacacttgaagccaaaccactgcagaagagttgtggg
gatggagtgagatcattcacgcagaatgcttggtgtggctgtaagaaactgatgtctcta
ggagccctggtcaactggaattggggacttggtgctgctggtttggttgagctgccgctg
ctaggccatctgaacccacagcaagctgatgatagactgcaggaaaagctccattcagat
agcaagggtccttcaccagccccattccatgacgcactcaacgaccctgcctccaagaca
ttgccctgctctgaagaaaccaaccactcatctgctatattcacctttaagaagttgcct
attcacatcctgtcatctccctgtctctcagtaataaaagagcaaggcttctcacacctg
aagagcacccacaccaacaaggagaaggagaaaacccaaacaaaccatacccacagcttg
gagatgagggctggtattacgaagataaaaagaacagcctttgcatctagactcccttgt
gccctctgtgcacaagccaccactgactacctggggcagaaaaatattagtgccagctca
gcactagctggcggcagtaggactcatatcaggatagccctgaagcccagctgtcatgag
aagctgataacataa

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_2|64_aa
MTSACRFRKLNLIGDASLSSQTWKRAVSQIGEHVLCGAIIQPIRGPDEAVKLPAVPIRKD
CSGE

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_2|195_bp
atgactagtgcctgcaggttcagaaaacttaatttgattggagatgctagtttgtcttca
caaacatggaaaagagctgtaagtcagatcggagaacatgtcctatgtggagccattatc
cagcctatacgtggacctgatgaagctgtgaagctgccggctgtacctatcaggaaagac
tgcagtggagagtag

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_3|322_aa
MGPFSVVPAFCKPVTRKDKWANDHGAPQIRREIQSTQLSSGFGNLQVLDNGALAKELAFL
TPQCPVGVTWEQHSHLCPEGPQLGQTPDSPGPSNAAFRSHRPAPPLSQFPQVKAGKYPLE
TGPCKSLGESRETTGGGRELLKRLRGSFLASCGEHPAHDPPTQASDGRRPPEAGLKLAAA
RGGAAGSPPGRRPVVSAGPVDAKARAPEHRASALRGGGGLGSTRAALFLLLCGLCGGQSV
VIVGVTTPGGDVGVLPAALRDRGPRQGGLRSPGFLLTWELRQAWPQGSRPGVPASRNRVC
SCPQDLAGAVPGPSRKCAHPGT

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_3|969_bp
atgggtcccttttcagttgtacctgcattttgtaaaccagtcaccagaaaagacaaatgg
gccaatgatcatggtgctcctcagataagaagagaaattcaatcaactcaattaagttcc
ggctttggaaacctccaagttttagataatggcgctctggctaaagagctggcttttcta
actccccagtgcccagtcggagttacctgggaacagcactcacatctctgccccgagggc
ccccagctggggcagacgccggactccccaggaccttcaaatgctgcctttaggagccac
aggcctgctccccctctatctcagtttccccaggtaaaagctggaaagtaccctttagag
acaggtccttgcaagtcacttggagagtccagagagactaccggaggaggcagggagttg
ctcaagaggctcagaggcagctttctggccagctgcggggaacatccggctcacgacccc
ccaacccaggcctccgacggccgccgccccccggaagcgggtctgaaactcgcagccgca
cgtggcggggccgccgggtcccctccagggcggcggccggtggttagcgctggcccggtg
gatgcgaaggctcgagccccagaacatcgtgcctccgcgctcagaggcgggggtggcttg
gggagtacgagggcagccctcttcctactcctctgtggtctttgcgggggtcagagtgtg
gtgattgtgggtgtaacaacaccgggaggtgacgtgggcgttcttcccgccgcgctccgg
gaccgcggccccaggcagggtgggctccgaagcccaggcttcctcttgacgtgggaactg
agacaggcttggcctcagggctcccgacctggagtacctgcctcccggaaccgagtgtgc
agttgcccgcaagacttggccggcgctgtccccgggccctcgcggaagtgtgcccacccg
ggcacctag

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_4|445_aa
MSYLKQPPYAVNGLSLTTSGMDLLHPSVGYPGLPSEEAVPGHLLTKQRTGVGKTTPRKQR
RERTTFTRAQLDVLEALFAKTRYPDIFMREEVALKINLPESRVQNMFMKATDISVSCKLG
KGLFQSRLEEVLALARLEPKSSLGENKWLQMVGGLELGLGGLTFTTVTAPIQRAHSKVQL
RICPPENASSSSSKPPRTPIIAPSLQNMNRAGKENCQDCSRGLKWNYQNRVKEFSFPSKV
WFKNRRAKCRQQQQQQQNGGQNKVRPAKKKTSPAREVSSESGTSGQFTPPSSTSVPTIAS
SSAPVSIWSPASISPLSDPLSTSSSCMQRSYPMTYTQASGYSQGYAGSTSYFGGMDCGSY
LTPMHHQLPGPGATLSPMGTNAVTSHLNQSPASLSTQGYGASSLGFNSTTDCLDYKDQTA
SWKLNFNADCLDYKDQTSSWKFQVL

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_4|1338_bp
atgtcttatcttaagcaaccgccttacgcagtcaatgggctgagtctgaccacttcgggt
atggacttgctgcacccctccgtgggctacccgggtttgccttctgaggaagcagtccca
gggcatttactgaccaagcagagaacaggggttgggaaaaccaccccccggaaacagcgc
cgggagaggacgacgttcactcgggcgcagctagatgtgctggaagcactgtttgccaag
acccggtacccagacatcttcatgcgagaggaggtggcactgaaaatcaacttgcccgag
tcgagggtgcagaatatgtttatgaaagcgacagacatcagtgtcagctgcaaattggga
aaggggctttttcaaagtaggttagaggaagttttagcacttgcaaggcttgaaccaaag
tcctcactaggagaaaacaaatggctccaaatggttggtggtcttgagcttggccttggt
ggcctcacttttaccacagttacggcacccatacagagagctcattcaaaagttcagcta
aggatttgtcctccagaaaatgccagtagttcctcttctaagccccctcgtacaccaatc
atagctccttcattgcagaacatgaacagggctggtaaagagaattgtcaagattgcagc
agggggttgaagtggaactatcaaaaccgagttaaagaattttctttcccttccaaggta
tggtttaagaatcgaagagctaagtgccgccaacaacagcaacaacagcagaatggaggt
caaaacaaagtgagacctgccaaaaagaagacatctccagctcgggaagtgagttcagag
agtggaacaagtggccaattcactcccccctctagcacctcagtcccgaccattgccagc
agcagtgctcctgtgtctatctggagcccagcttccatctccccactgtcagatcccttg
tccacctcctcttcctgcatgcagaggtcctatcccatgacctatactcaggcttcaggt
tatagtcaaggatatgctggctcaacttcctactttgggggcatggactgtggatcatat
ttgacccctatgcatcaccagcttcccggaccaggggccacactcagtcccatgggtacc
aatgcagtcaccagccatctcaatcagtccccagcttctctttccacccagggatatgga
gcttcaagcttgggttttaactcaaccactgattgcttggattataaggaccaaactgcc
tcctggaagcttaacttcaatgctgactgcttggattataaagatcagacatcctcgtgg
aaattccaggttttgtga

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_5|262_aa
MERKQNYADWRLACGWEDDEEEERKKKETCGHRGKRPALASLRRTRVKVSGLGESNNSAT
REGGRKKRAKRVRAEGYVGRAATRQSHVEKTLFTFARPGDQGRGPRQPGVGRLRNYNPGG
RQGLERRPPAQADPQANRASGARAQGGAWLFIIVVSIHPSSYICIFLSRTLKRSLGSIQT
TPTKSGGQNAKIAGEAQNGVDIKTKPQTKPSKTPQITFAAFRRRFAEATTPRSTLSASLP
LPLSSLDHIFLFATIVITPQNL

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_5|789_bp
atggaaaggaaacaaaactacgcggactggcgactggcctgcggctgggaagacgacgaa
gaggaggaaagaaagaaaaaggagacgtgtgggcaccgcggaaaacggccggcgctggcc
tctctccggcgaactcgagtgaaagtttctggcctcggggaatcaaataactctgccacc
cgcgagggagggaggaagaaacgtgccaaaagggtgcgcgcggagggctacgtggggcgg
gccgcgacccggcaaagtcatgttgaaaaaacactcttcacgttcgctcggcctggtgac
cagggtcggggaccacgacaaccgggggttgggaggctgcgtaattacaacccagggggc
cgacaggggctcgagcggcggcccccggcccaggccgacccgcaagcgaaccgagcttcc
ggcgcgcgggcccaaggaggcgcctggctttttattattgttgtttcaatccatccatct
agttacatctgcatctttttgtctcggactctaaaaaggtccctgggatccatccaaacg
accccaaccaaatctgggggccaaaacgcaaagatcgcgggagaagcccagaacggcgtt
gacataaaaacaaaaccacaaacaaaaccttccaaaacaccccagattacattcgcagcg
tttcgacgacgttttgcagaagcgacgacccccaggagcacgctctctgcctctctccca
ctaccgctctcatctctagatcacatttttctttttgcaacgatcgttattacacctcaa
aatttgtaa

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_6|416_aa
MCVETAQEKGLSNPMQNLAEVKVQGSGRGVRTVQSKPPHLINFSQLLLLATLLPPQLPLE
LTTESGTQRLSLYSRKRKKVGNCSHGCTSHLVFMGSTSELLNMILMAVGKEVECLKKGTK
VKGAPEPPPLQRVSLRPAAEGTFLLCPHSKMEGLGDCAGEEAGAEPVPRGVGPQARGAYP
FANIEALRGDQTEFAPSPSCHGPGDLRTAGANRVLESWRPGGPAASRTRPGTKGCHSFRY
EGGIPGVPQSRAVSPSSAHWVRGGRGRKETLLQVIEVDLPFLHRKHHDSDQAHPTSASDS
QRHEGHNARGICYPLLTTIPIKGFGGGRQVFQYSGETLDSWERDSQRVSGLELLQRAGKE
LGGLRAPRGLEGPPGPKALGTPVTGAAAAESPAPGRARAGARLPRACCLVRTPVIR

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_6|1251_bp
atgtgtgtggaaacggcacaagaaaagggcttgtccaatccaatgcagaatttagcagaa
gttaaagtacaagggagtggccggggagtgaggacggtgcagtccaaaccgcctcactta
attaatttcagccaactcctcctgctggccacgctgctgccgcctcaattacccttggag
ttaacaacggaaagcggcacacaaagactttcactttatagcaggaaaaggaaaaaggta
ggtaattgttctcacggttgtacttctcatttggtgtttatgggaagcacttcagaatta
cttaatatgattttaatggcagtgggaaaagaagttgaatgcttaaagaaaggaacaaag
gtaaaaggagccccggagccacctccgctgcagcgagtgtcactgaggccagctgcagag
ggcaccttcttgctgtgtcctcacagcaagatggaagggctgggagactgtgctggggaa
gaggcaggggcagagccagttccgaggggtgtaggaccccaagcaagaggggcctaccct
tttgcgaacatcgaagcacttcgtggagaccaaacagagtttgccccttccccaagctgt
cacggccccggggatcttaggactgcgggggcaaaccgcgtcctcgagtcctggcgcccc
ggagggccggcagcctcaaggaccaggcccggcaccaagggctgccactccttccgctac
gagggcgggatcccaggggtcccccagtcccgcgccgtctcccctagctcggcgcactgg
gtccgaggtggccggggtaggaaggagactctgcttcaggtgatcgaggtggatttgcct
ttccttcatcgaaagcatcatgactctgaccaagcgcacccaaccagtgcctccgacagc
cagcgccacgagggtcataacgcccggggcatttgttaccccctcctcaccaccatcccc
attaaaggcttcgggggcggccgccaggtgtttcagtacagtggggaaactcttgactcc
tgggaacgcgacagccagcgggtctccggactggagctgctgcagagggccgggaaggag
ctgggcgggctccgggcccctcgggggctcgaggggccgccagggcccaaagctctaggg
acacctgtgacaggggccgctgccgcggagtccccagccccaggccgagcccgagccgga
gcccggctgccgagagcttgctgcctcgtgcgcactcccgtaattcgttaa

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_7|92_aa
MGRLWATKRQGRIGKHRKLFCQRCAGQFAVNRCGRQKRQEEGNFTNLSVSENLVGRRRRK
KKEEEKEEENEEKEEEQEDKDDLWLLCTAQRN

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_7|279_bp
atggggaggctgtgggcaacgaaaagacagggtcgaattggcaagcatagaaaactattt
tgccagcgatgtgcaggacagtttgcagtcaatcgatgtggaagacagaaaaggcaagaa
gaaggaaattttactaatctgagtgtttctgaaaaccttgtagggaggaggaggaggaag
aaaaaagaagaggagaaagaggaggagaatgaagagaaagaggaagaacaagaagacaag
gatgacttgtggttattgtgcacagcccagagaaattga

>gi568815584r:56701738_56905456|GENSCAN_predicted_peptide_8|210_aa
MGREKKEKKLDKEKQVHDAVKNSRDCQHSGCYVEPASCIVLGFRHPVLSGQPGQQSETPP
KKRKKRKDKTRKEKKRKEKRKKKKKKERKGRKEERKKKKERQRKKKERKRKKEEGKTKKE
ERKKERQRKKKERRKKERRRKEGGKEGRGERQKASEKQKERRKKEKGRKEGIKKEREKPE
NRKNWLLPPGLLIVKGCPFGLQEWKGQRAQ

>gi568815584r:56701738_56905456|GENSCAN_predicted_CDS_8|633_bp
atgggtagggaaaagaaggaaaagaagttggacaaagagaaacaagtacatgatgctgtg
aaaaacagcagagactgtcaacactcaggctgctacgttgaaccagcttcttgtattgtc
ttgggcttccggcatcctgttttatctggacagcctgggcaacagagtgagactccgcca
aaaaaaagaaagaaaagaaaagacaagacaagaaaagaaaagaaaagaaaagaaaaaaga
aagaaaaagaaaaagaaagaaagaaaaggaaggaaggaagaaagaaagaaaaagaaggaa
agacaaagaaagaagaaagaaagaaaaagaaagaaagaagaaggaaagacaaagaaagaa
gaaagaaagaaggaaagacaaagaaagaagaaagaaagaagaaagaaagaaagacgaagg
aaggaaggagggaaggaaggaaggggagagagacagaaagcaagcgagaaacagaaagaa
agaagaaagaaagaaaaaggaaggaaggaaggaataaagaaagagagagagaaaccagaa
aataggaaaaactggctgcttccccctggactgctgatagtaaaggggtgtccctttggc
cttcaggagtggaaaggccaaagagcgcaatga