GENSCAN 1.0    Date run: 8-Nov-116    Time: 01:03:48

Sequence gi568815591r:17926829_18127780 : 200952 bp : 38.08% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 PlyA -    829    824    6                               1.05
 1.01 Sngl -  13913  13497  417  2  0   54   50   343 0.868  21.15
 1.00 Prom -  20115  20076   40                              -5.15

 2.00 Prom +  24523  24562   40                              -7.75
 2.01 Init +  25882  26254  373  0  1   31  105   239 0.824  17.07
 2.02 Intr +  27532  27616   85  2  1   59   95   -22 0.456  -6.44
 2.03 Intr +  27964  28114  151  1  1   72   86    68 0.446   4.24
 2.04 Intr +  31842  31859   18  2  0  134   82     5 0.188   0.39
 2.05 Intr +  35873  36007  135  1  0   29   12   175 0.306   3.84
 2.06 Intr +  36050  36222  173  1  2   22   87   147 0.252   5.82
 2.07 Intr +  36275  36344   70  2  1   21  115    64 0.261   0.67
 2.08 Term +  47257  47631  375  0  0   37   45   170 0.282   1.35
 2.09 PlyA +  49070  49075    6                               1.05

 3.03 PlyA -  49252  49247    6                               1.05
 3.02 Term -  56549  56311  239  0  2   35   43   428 0.525  28.35
 3.01 Init -  74459  74132  328  2  1   63  -19   167 0.032   1.03
 3.00 Prom -  80380  80341   40                              -3.65

 4.02 PlyA -  81355  81350    6                               1.05
 4.01 Sngl -  82863  82429  435  0  0   70   32   211 0.977   9.73
 4.00 Prom -  83813  83774   40                              -5.05

 5.03 PlyA -  83952  83947    6                               1.05
 5.02 Term -  84730  84211  520  0  1    1   42   261 0.763   5.58
 5.01 Init -  85208  85063  146  2  2   88   98   138 0.930  14.44
 5.00 Prom -  85697  85658   40                              -7.15

 6.00 Prom +  91780  91819   40                              -4.35
 6.01 Sngl +  94453  94833  381  0  0   52   38   198 0.197   7.12
 6.02 PlyA +  98447  98452    6                               1.05

 7.03 PlyA -  98617  98612    6                               1.05
 7.02 Term - 100963  99998  966  1  0  -13   42   981 0.954  74.46
 7.01 Init - 103594 103511   84  1  0   47   96    68 0.731   4.37
 7.00 Prom - 112457 112418   40                              -4.45

 8.00 Prom + 116344 116383   40                              -7.15
 8.01 Init + 118535 118610   76  1  1   50   94   104 0.964   8.50
 8.02 Term + 123667 123911  245  2  2   59   45   150 0.485   2.88
 8.03 PlyA + 124778 124783    6                               1.05

 9.03 PlyA - 126646 126641    6                               1.05
 9.02 Term - 155631 155352  280  2  1   90   42   257 0.672  15.23
 9.01 Init - 156939 156932    8  0  2   40  103     0 0.182  -2.82
 9.00 Prom - 158260 158221   40                              -2.45

10.04 PlyA - 158414 158409    6                               1.05
10.03 Term - 161151 160969  183  0  0   73   37   206 0.759  10.56
10.02 Intr - 165815 165617  199  1  1   90   46   100 0.497   4.43
10.01 Init - 177154 177072   83  1  2   52   78   105 0.278   4.57
10.00 Prom - 189677 189638   40                              -4.45

11.02 PlyA - 189818 189813    6                               1.05
11.01 Term - 197465 197358  108  2  0   87   45   116 0.796   4.73

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 160253 160132  122  2  2   32   70   214 0.807  11.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_1|138_aa
MRRPQGRARPFPEGSGAGAPLHSQRSQVGPGARSRGPTSQRQADCRQPTSASPRAGAMET
VTRRQQPLRPPMGTHVAARRGRRSSKMAARPSRATGPRGGQRSRVKPPPGRRLKEQLPPL
AAARAVFAAATAVAMRRG

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_1|417_bp
atgcgccggccccaagggcgcgcacgtccattcccggaaggctcgggcgctggggcccct
ctgcactcccagcggtcccaagttggcccaggcgcccggtcccggggtcctacaagtcag
cgccaagccgactgccgacagcctacctccgcttctccgcgcgccggcgctatggagacg
gtcacgagacgccagcagcccctgcgcccgccaatgggaacgcacgttgcagcgagacgc
ggacgtcgctcttccaagatggcggcgcgtccgtcgcgagcgaccgggccgaggggaggc
cagcgaagccgagtaaaaccgccgcccgggagaagactgaaggagcagttgccgccgttg
gcggcggcccgagcagttttcgctgctgctacggctgttgccatgaggcgaggctag

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_2|459_aa
MHSKALTLFSSIKAERSKETAEERLEASRGWFRRFKERSYLYNMKVQGEATRADVEAAAS
YPEDLAKIKDESGYIKQYIFKVDKTAFYWKKMSSRIFTAREEKTMPGFRASKDRLTLLLE
ANATGSQKICVTCFIEIFAPLLWSGIEPTISLREQIYQRSTDTVLPPSHCLSLSINCQLV
SFFDILTFVWLKLEIVVRQKLRKVTLKLKDWRFLKFHKYEEKQGLETGASKGLLPVIEVQ
SYDLDWTEAPAAVMFAESPAGKRLLIPVAPGVDSLNSAMAARILFFEGKRQLRVRAEHLS
RDRSYHGEGTEKRLAESETVASQLVTVASMFRRQQPHVKKVGENYSSIARYSHYQGLNMF
PHCLSSKGQTSYGAIQVPTAPGSRLPLQVYLSSISFHNPHRDAVFPPHFQMPKSYMVQMQ
LSLHSPDSSNRNESPPFLEHLSLCMAILMFYNNLNLGVN

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_2|1380_bp
atgcatagtaaggccctaactctcttcagttctatcaaggctgagagaagtaaagaaact
gcagaagaaaggcttgaagctagcagaggttggttcaggaggtttaaagaaagaagctat
ctctataacatgaaagtacaaggtgaagcaacaagggctgatgtagaagctgcagcaagt
tatccagaagatctagccaagatcaaagatgaaagtggctatattaaacaatatattttc
aaagtagataaaacggctttctattggaagaagatgtcatctaggatatttacagctaga
gaagagaagacaatgcctggcttcagagcttcaaaggacaggctgactctcttattagag
gctaatgcaactggaagccaaaaaatttgtgtgacttgctttattgagatatttgctcca
ttgctgtggtctggaattgaacccacaatatctctaagagagcaaatttatcagcgcagc
actgacactgttttacccccttcacattgccttagtttatccattaattgccagctcgtt
tccttctttgacattctcacatttgtttggctgaaacttgaaatagttgtcagacagaag
ttgagaaaggtaactcttaaactaaaggattggcgattcctgaagtttcacaagtatgaa
gaaaagcaaggtctagaaactggagccagtaaaggtttgctgcctgttattgaggttcag
agttatgacttggactggacagaggcaccagcagctgtaatgtttgccgagagccctgcg
ggcaagaggctgctgatccctgttgcacctggagtggacagcctcaactcagccatggct
gcacgcatcctgttttttgaagggaaaagacaactgcgggtgagggcggaacacttgagc
agggacaggagttaccacggagaggggacagaaaagaggctggcagagtcggaaactgtg
gcctctcagttggtgactgtggcctccatgttcaggaggcaacagcctcatgtgaaaaaa
gttggagagaactattctagcattgccaggtattctcattatcaaggtctgaatatgttc
ccccattgcctgtcgagcaaaggccaaacttcttatggtgctattcaagtccctacagca
cctggttccaggctgcccttgcaggtctatttgtcatcaatatccttccacaacccacat
agggatgccgtgtttccacctcatttccaaatgcccaaatcctacatggttcaaatgcaa
ctgtctctacacagtcctgattcctccaaccggaatgaatcccctccttttctagagcat
ttgtcactctgtatggctattctcatgttctacaataatttaaatcttggtgttaactag

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_3|188_aa
MRHRKGHEKIKVMKKQKAFVKCGISEALQYLWRVFRASLMQKTYLGPVSLRPSGTPNRVV
KTSTMHDGESLSPVLRSAEVKDSKYQEGKLRRQKPRKQTLIKSHEETGRGESSFCFWIPK
RRRRRKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKKQLRKLIKRNPRQAKR
AMWRKMLD

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_3|567_bp
atgaggcacagaaagggtcatgaaaaaataaaagtaatgaagaagcagaaagcgttcgta
aaatgtggcataagtgaagctctgcaatacttgtggagggtcttcagggcttccctgatg
cagaagacctacctgggcccagtctccttgaggccaagtggaaccccaaacagagttgtc
aagacaagcactatgcatgacggggaaagtctcagtccagttttgaggagtgcagaagtt
aaagactccaaataccaagagggcaagttacgaaggcagaaaccaaggaaacagacgttg
ataaaaagccacgaagagaccgggcgcggtgagtcatcattctgcttttggataccaaaa
agaagaagaagaagaaaagaagaagaagaagaagaagaagaagaagaggaagaggaagag
gaagaggaagaggaggaggaggaggaggaggaggaagaggaagaggaagaggaagaggaa
gaggaagaaaagaagcagctcaggaaactgataaaaagaaaccctagacaagcaaagagg
gctatgtggagaaaaatgttagattag

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_4|144_aa
MDKFLDTYTLSSLSQEEVESLNKLITSSEIEAVINSLSTKKRPGPDGFTAEFYQRYKQEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKGNFRPISLMNVDAKILNKILA
NRIQQHIKKLIHHDQVGFIPGMQG

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_4|435_bp
atggataaattcctggacacatacacactctcaagtctaagccaggaagaagttgaatcc
ctgaataaactaataacaagttctgaaatagaggcagtaattaatagcctatcaaccaaa
aaaaggccaggaccagatggattcacagccgaattctaccagaggtacaaacaggagctg
gtaccattcctcctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcatcctcataccaaaacctggcagagacacaacaaaaaaagga
aatttcaggccaatatccctgatgaacgttgatgcaaaaatcctcaataaaatactggca
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagttggcttcatacct
gggatgcaaggctag

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_5|221_aa
MGKNQHKKPENSKNQNASFPPKDHNSLPAREQNWMENEFDKLPAVGFRSDGENGTKLENT
LQDIIQENFPTLARHANIQIQETQRTPQKYSLRRATQRHIIIRFTKVEIKEKMLRAAKEK
GRVIHKRKPIRLTADLPAETLQARREWRPIFNVLKEKKFQPPRISYPAKLSFISKGEIKS
FTDKQMLRDFVTRPALQELLREALNVERNSQYQPLQKHTKL

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_5|666_bp
atggggaaaaaccagcacaaaaagcctgaaaattccaaaaaccagaatgcctcttttcct
ccaaaggatcacaactccttgccagcaagggaacaaaactggatggagaatgagtttgac
aaactgccagcagtaggcttcagaagtgacggggagaatggaaccaagttggaaaacact
cttcaggatattatccaggagaacttccccaccctagcaagacatgccaacattcaaatt
caggaaacacaaagaacaccacaaaaatactccttgagaagagcaacccaaaggcacata
atcatcagattcaccaaggttgaaattaaggaaaaaatgttaagggcagccaaagagaaa
ggtcgggttatccacaaacggaagcccatcagactaacagcagatctgcctgcagaaacg
ctacaagccagaagagaatggaggccaatattcaacgttctcaaagaaaagaagtttcaa
ccacccagaatttcatatccagccaaactaagtttcataagcaaaggagaaataaaatcc
tttacagacaagcaaatgctgagggattttgtcaccagacctgccttacaagagctcctg
agggaagcactaaatgtggaaaggaacagccagtaccaaccactgcaaaaacataccaaa
ttgtaa

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_6|126_aa
MFVSPQNSCVEILTPTMMVLGGGNFGGRLGHKGGAPMNGISALIKETPKSSLTPLSYEDS
EKTAVHEPVSGPSPGTESSSTLILHFSSFRTVRRKFLLLISYAVYGSLSQQSGWTKMMLH
LLGPQL

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_6|381_bp
atgtttgtgtctccccaaaattcctgtgttgaaatcctaacccctactatgatggtatta
ggaggtggaaactttggtgggcgattaggtcataagggtggagcccccatgaatgggatt
agtgcccttataaaagagacccccaagagttctctaacccctctgtcatatgaagacagt
gaaaagacagctgtccatgaaccagtaagtgggccctcaccaggcacagaatcttctagc
accttgatcttgcacttttcatcctttagaactgtcagacgtaaatttctgttgcttata
agctacgcagtctatggtagtttatcacagcagtctgggtggactaagatgatgctccac
ttgctggggccacagctctag

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_7|349_aa
MQPMIKQMSLSTYRDQSKQTLNPLAIDQLAKTPNIKIFSGSSHQDLSQKIADRLGLELGK
VVTKKFSNQETCVEIDESVRGEDVYIVQSGCGEINDSLMELLIMINACKIASASRVTAVI
PCFPYARQDKKDKSRSPISAKLVANMLSIAGADHIITMDLHASQIQGFFDIPVDNLYAEP
TVLKWIRENIPEWKNCIIVSPDAGGAKRVTSIADQLNVDFALIHKERKKANEVDCIVLVG
DVNDRVAILVDDMADTCVTICLAADKLLSAGATRVYAILTHGIFSGPAISRINTACFEAV
VVTNTIPQDEKMKHCSKIRVIDISMILAEAIRRTHNGESVSYLFSHVPL

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_7|1050_bp
atgcagccaatgataaaacagatgtccctctcaacctacagagaccaaagtaaacagacg
ctaaatccccttgcaatagaccagttggccaagacgccgaatatcaaaatcttcagcggc
agctcccaccaggacttatcccagaaaattgctgaccgcctgggcctggagctaggcaag
gtggtgactaagaaattcagcaaccaggagacctgcgtggaaattgatgagagtgtgcgt
ggagaggatgtctacatcgttcagagtggttgtggcgaaatcaacgacagtctaatggag
cttttgatcatgattaatgcctgcaagattgcttcagctagccgagttactgcagtcatc
ccatgcttcccttatgcccgacaggataagaaggataagagccggtccccaatctctgcc
aagcttgttgcaaatatgctctctatagcaggtgcggatcatatcatcaccatggaccta
catgcttctcaaattcagggcttttttgatatcccagtagacaacttgtatgcagagcca
actgtcctgaagtggataagggagaatatccctgagtggaagaactgcattattgtctcg
ccagatgctggtggagctaaaagagtgacctccattgcagaccagttgaatgtggacttt
gctttgattcataaagaacggaagaaggccaatgaagtggactgcatagtgctagtggga
gatgtgaatgatcgtgtggctatccttgtagatgacatggcagacacttgtgttacaatc
tgcctcgcagctgacaaacttctctcagctggagcaaccagagtttatgctatcttgact
catggaatcttttctggcccagccatttctcgcatcaacactgcatgctttgaagcagtg
gtagtcaccaataccatacctcaagatgagaagatgaagcattgctccaaaatacgagta
attgacatctccatgatccttgcagaagccataaggagaactcataatggggaatctgtt
tcctacctgttcagccatgttcctttataa

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_8|106_aa
MQLAEGEIKQALTDSVTVTFKLQETAKAEAQDLAGGQWSGYIGLMVGDAKRGPHGADMEK
PDTAYPSGNGLKASWKGQLRELLESLAGLSTYPQIKPISKAPWQYA

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_8|321_bp
atgcaacttgcagaaggagaaatcaagcaggctctcacagactcagtcacagtgacgttt
aagttacaggaaacagctaaagccgaagctcaggacctagcaggtgggcaatggtcaggg
tacattggattaatggtgggtgacgcaaagagagggccacatggagcagatatggagaaa
ccagacacagcttatccttcaggaaatgggctgaaggcctcctggaaagggcagttgcga
gagttgttggagtcactggctggtctctcaacctatccccaaataaaacctatctccaaa
gctccctggcagtatgcctga

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_9|95_aa
MCILKLKAHSSNRSVPIDPGLGQDPMDTGCRNTLMNTGSRATPVDSINMSTPVDADRRYN
PIDLGTGAASLGNPAVTLPVNHTRKHARISRQTDW

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_9|288_bp
atgtgcatattgaagctcaaagcacacagcagtaacaggtcagtccctatagacccagga
ttggggcaggatcccatggacacaggctgtaggaacaccctcatgaacacaggctccagg
gccacccccgtggactcaatcaacatgtccaccccagtggatgcagaccgcaggtacaac
cccatagacctgggcactggggcagcctccctgggaaatccagcagtaactctacctgtg
aaccacaccagaaagcatgccagaatctctagacagactgactggtaa

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_10|154_aa
MASCPGLVPNWCPELLGSALATQDHELVSGHLSFFLAMDNGTPLLYVENPLFYDIEPTFN
CRSYMPAATAAVPHRTNSAEGFGALFLLEWLQSQATSPDNRASDLERAQSRSVTSQTHRP
DTSTSHCGQGDNGFKAKQTQPARRSGNAPVLRSL

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_10|465_bp
atggcatcctgtccggggctggttcccaactggtgccctgaactgctgggatcggctctg
gccacccaagaccatgaattggtcagtggacacctgtcattctttttggcaatggacaat
gggaccccattattatatgtggagaatcctctcttttatgatatagagcccacattcaac
tgtagaagctatatgccagcagcgacagcagcagtacctcacaggacaaattctgcagag
ggatttggagccttgtttctacttgagtggcttcaaagtcaggccacctctcctgacaac
cgcgcctcggatctggagagagctcagtcccgctctgtcacatcacagacacacaggcct
gacaccagcacctctcattgcggacaaggggataacggcttcaaagccaaacaaacgcag
cctgctaggcgctcagggaacgcccccgtcctccgttccctctag

>gi568815591r:17926829_18127780|GENSCAN_predicted_peptide_11|35_aa
AIAVWKQTVKDIDNIHPTDIPPSYLSTDLQMSSAK

>gi568815591r:17926829_18127780|GENSCAN_predicted_CDS_11|108_bp
gctattgctgtatggaagcagaccgtaaaagacatagataacattcacccaacagatatt
ccaccttcttatttgtctacagatctccaaatgagctcggcaaagtga