GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:48:44 Sequence gi568815594f:105384696_105586077 : 201382 bp : 37.06% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 545 540 6 1.05 1.03 Term - 1941 1921 21 0 0 132 44 16 0.128 -1.07 1.02 Intr - 14469 14342 128 1 2 112 67 107 0.955 10.48 1.01 Init - 15243 15201 43 0 1 85 68 43 0.815 2.63 1.00 Prom - 19647 19608 40 -7.15 2.00 Prom + 20559 20598 40 -6.75 2.01 Sngl + 25254 25493 240 2 0 88 36 192 0.492 8.83 2.02 PlyA + 25498 25503 6 1.05 3.00 Prom + 26671 26710 40 -6.15 3.01 Init + 27124 27324 201 0 0 60 86 92 0.310 5.12 3.02 Intr + 30766 31092 327 0 0 24 39 200 0.180 3.67 3.03 Term + 42605 42964 360 1 0 72 48 345 0.892 22.45 3.04 PlyA + 43791 43796 6 1.05 4.00 Prom + 47854 47893 40 -5.65 4.01 Init + 51014 51118 105 1 0 46 55 96 0.063 2.37 4.02 Intr + 78723 79040 318 2 0 -13 87 188 0.141 3.83 4.03 Intr + 86051 86123 73 1 1 38 116 49 0.349 0.76 4.04 Term + 87375 87499 125 1 2 83 54 154 0.650 9.07 4.05 PlyA + 87723 87728 6 -3.94 5.04 PlyA - 87736 87731 6 1.05 5.03 Term - 88656 88483 174 0 0 47 41 203 0.895 8.28 5.02 Intr - 89197 88908 290 2 2 82 49 179 0.902 9.44 5.01 Init - 89355 89199 157 0 1 91 83 292 0.997 27.12 5.00 Prom - 92382 92343 40 -7.75 6.00 Prom + 93529 93568 40 -6.75 6.01 Init + 93672 93735 64 2 1 68 62 49 0.752 1.66 6.02 Intr + 99992 100774 783 0 0 -4 12 766 0.436 50.32 6.03 Term + 100844 101385 542 0 2 29 36 726 0.704 54.93 6.04 PlyA + 101681 101686 6 1.05 7.05 PlyA - 101928 101923 6 1.05 7.04 Term - 104956 104831 126 1 0 75 52 88 0.200 1.20 7.03 Intr - 109301 109123 179 0 2 47 106 71 0.062 3.52 7.02 Intr - 115999 115746 254 0 2 44 8 217 0.119 5.45 7.01 Init - 137021 136963 59 2 2 62 111 38 0.101 4.33 7.00 Prom - 140884 140845 40 -3.05 8.03 PlyA - 140956 140951 6 1.05 8.02 Term - 142202 141835 368 0 2 72 38 264 0.056 13.58 8.01 Init - 152494 152176 319 1 1 100 87 119 0.762 10.55 8.00 Prom - 157291 157252 40 -3.05 9.00 Prom + 162335 162374 40 -3.95 9.01 Sngl + 168071 168430 360 1 0 61 50 188 0.363 8.12 9.02 PlyA + 168433 168438 6 -0.45 10.04 PlyA - 168445 168440 6 1.05 10.03 Term - 169188 169040 149 1 2 43 44 119 0.582 0.08 10.02 Intr - 174681 174542 140 2 2 47 97 118 0.046 7.79 10.01 Init - 181302 181184 119 0 2 35 105 83 0.712 4.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 131091 130765 327 0 0 84 41 116 0.810 2.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_1|63_aa MEQDSARFHQATQNDIDDVKKFKPGYLEATLNWFRLYKVPDGKPENQFAFNGEFKNKHKR ADI >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_1|192_bp atggagcaggacagtgcaagatttcatcaggctactcagaatgatattgatgatgttaag aagttcaaaccgggttacctggaagctactcttaattggtttagattatataaggtacca gatggaaaaccagaaaaccagtttgcttttaatggagaattcaaaaacaagcacaaacgt gcagatatctga >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_2|79_aa MGRKQNRKAEKSKNQSTSPLPKNCSSSPAMEQSWMENEFDEWTEVGFRRSVTTNFSELKE HVLTHRKEAKNLEKGYTNG >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_2|240_bp atggggagaaaacagaatagaaaagctgaaaaatcaaaaaaccagagcacctctcctctt ccaaagaattgcagctcctcgccagcaatggaacaaagctggatggagaatgagtttgat gagtggacagaagtaggcttcagaaggtcagtaacaacaaacttctccgagcttaaggaa catgttctaacccatcgcaaagaagctaaaaaccttgaaaaaggttacacaaatggataa >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_3|295_aa MSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLNKIKEATNKWKNIPCSWIGRISIV KMAILPKHPGSASTSLQNQSGERPGSSSRHFRACRGGWGLLRPPRVQGCPGPQLWLCGCS CVQEGGDPTLPTRKGVGLPPVPGSCSPVECAALAVPPRLQPVFAAAAPDGPLLPTIDCSS SPATEQNCMKNDFDELTKVGFRRSIITNFSELEEHVLTLCKEAKNFEKKLDEWLTRINSV EKSLNDLMELKTTARELREAYKSFNSQFNQAQDRISVTEDQINEIKREDKIREKE >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_3|888_bp atgagtgaactcccattcacaattgctacaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactataaaccactgctcaacaaaataaaa gaagccacaaacaaatggaaaaacattccatgctcatggataggaagaatcagtattgtg aaaatggccatactgcccaagcacccaggatcggcctcgacttcactccaaaatcagagt ggggagaggccaggcagcagtagcaggcacttccgagcctgcagaggtgggtgggggctt ctcaggcccccgagagtgcagggatgcccaggtccacagctgtggctgtgtggctgcagc tgtgtccaggaaggtggtgatcccaccctgccaactcggaagggggtggggctcccacct gttcctggctcctgtagcccggtggagtgtgcagccttggctgtgcctccccggctgcag ccagtctttgcagcagctgctccagacgggccactgctgccaacaatagattgcagctcc tcaccagcaacagaacaaaactgcatgaagaatgactttgatgagttgacaaaagtaggc ttcagaaggtcgataataacaaacttctccgagctagaggagcacgttctaaccctttgc aaggaagctaaaaactttgaaaaaaagttagacgaatggctaactagaatcaacagtgta gagaagagcttaaatgacctgatggagctgaaaaccacagcacgagaacttcgtgaagca tacaaaagcttcaacagccaattcaatcaagcacaagacaggatatcagtgactgaagat caaattaatgaaataaagcgagaagacaagattagagaaaaagagtga >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_4|206_aa MIISIDTEKARDKIQHSFMIKTFNELGTEGTYLKIHSEHKSLENLWPDNVREKKIPFFEK KFKPAAEICISNKKPNVRPQDSDENVSSACQRPSRQPFPSQALRPSRKRWFRWAQGPSAV CSLGSWCLASQVVQPWLKGAKQFYKITSAIPILFTEKLSYRMSSEETVRISVIHPMPSGN TSAEKDLEDHRFKVKSFILQLKKLAL >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_4|621_bp atgatcatctcaatagatacagaaaaagcacgtgataaaattcaacattccttcatgata aaaaccttcaacgaactaggcactgaaggaacatacctcaaaatacattcagagcataaa agtttggaaaatttgtggcctgacaatgtgagagaaaagaaaatcccattttttgagaag aaattcaagccagctgcagaaatttgcataagtaacaagaagccaaatgttcgtccccaa gacagtgatgaaaatgtctctagcgcatgtcagaggccttcacggcagcccttcccatca caggccctgaggcctagtaggaaaaggtggtttcgctgggctcagggtccctctgctgtg tgcagcctagggagttggtgccttgcgtcccaggtggtccagccatggctaaaaggggcc aagcaattctataaaattactagtgctatccccattttatttacggagaaactgagctac cggatgtcatctgaagagactgttagaatctccgttattcatccaatgccttcagggaat accagtgctgagaaggaccttgaggatcatcgtttcaaggtcaaaagcttcattttacag ctgaagaaactggctctgtga >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_5|206_aa MSALLRLLRTGAPAAACLRLGTSAGTGSRRAMALYHTEERGQPCSQNYRLFFISSSRRFA ERRTGETWVGDGMGDTWDAPRGLPSLFLGPSAVAARSRPPYFPRERAPSHENCQRGGGGS RPQSGEQRRRAAGWGVDSALCPASRPPRGKLDLVRTSLHPECDPPSKASLVPEQPLFLPK LNKPRRLSDFGGVPELPSSEKGPRSS >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_5|621_bp atgagcgcgctgctgcggctgctgcgcacgggtgccccagccgctgcgtgcctgcggttg gggaccagtgcagggaccgggtcgcgccgtgctatggccctgtaccacactgaggagcgc ggccagccctgctcgcagaattaccgcctcttctttataagtagctcccggaggttcgcc gagcggcgcaccggagaaacctgggtgggggatggaatgggggacacttgggacgccccg cgcggtctcccttctctctttttggggccctcagcggtagccgcgcggtccaggccacct tacttcccgcgggagcgcgcaccgagtcacgagaactgccagcgaggaggcggcggatca agaccccaaagcggagagcagaggcgcagggctgccggctggggggtagatagcgccctg tgccctgccagccgccctccccgagggaagcttgacttggtacgcacaagtttacaccct gagtgtgatccaccatccaaagcctcgttggttccggagcagccccttttcctccccaag ctgaacaagcccagaaggctaagcgacttcgggggggtccctgagttgccgagctcagag aaaggacccaggtcttcgtga >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_6|462_aa MDIKKGITDISASLRVESGWEARTRKEKTHINTVIIGHVDSGKSTTTGHLIYKCGGVDKR TIEKFEKEAAEMGKCSFKYAWVLDKLKAEREHGITIDISLWKFETSKYYVTIIDAPGHRD LIKNMITGTSQADCAVLIVAAGFGEFEAGISKNGQTREHALLAYTLGVKQLIVGVNKMDS TEPPYSHKRYEEIVKEVSTYIKKIGHNTDTVAFVPVSGWNGDNTLEPSANMPWFKGWKVT RKDGNASGTTLLEALDCILPPTRPTDKPLCLPLQDVYKIGGIVNVATEVKSVEMHHEALS EVLPGDNVGFNVKNVSVKDVRRGNVAGDSKKDPPMEAAGFTAQVIILNHPGQISAGYAPV LDCHTAHIACKFAELKEKTDRHSGKKLEDGPKFLKSGDAAIVDMVPGKPVYVESFSDYPP LGRFAVHDIRQTVAVGVIKAVDKKAAGAGKVTKSAQKAQKAK >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_6|1389_bp atggacataaagaagggaataacagacatcagcgcctctttgagagtggagagtggatgg gaagccagaacgagaaaggaaaagactcatatcaacactgtgatcatcggacacgtagat tcgggcaagtccaccactactggccatctgatctacaaatgcggtggcgtcgacaaaaga accatcgaaaaatttgagaaggaggctgctgagatgggaaagtgctccttcaagtatgcc tgggtcttggataaactgaaagctgagcgtgaacatggtatcaccattgatatctctttg tggaaatttgagaccagcaagtactatgtgactatcattgatgccccaggacacagagac ctcatcaaaaacatgattacagggacatctcaggctgactgtgctgtcttgattgttgct gctggttttggtgaatttgaagctggtatctccaagaatgggcagacccgagagcacgcc cttctggcttacacactgggtgtgaaacaactaattgttggtgttaacaaaatggattcc actgagccaccctacagccataagagatatgaggaaattgttaaggaagtcagcacttac attaagaaaattggccacaacaccgacacagtagcatttgtgccagtttctggttggaat ggtgacaacacgctggagccaagtgctaacatgccttggttcaagggatggaaagtcacc cgtaaggatggcaatgccagtggaaccacgctgcttgaggctcttgactgcatcctacca ccaactcgtccaactgacaagcccttgtgcctgcctctccaggatgtctacaaaattggt ggtattgtcaacgttgcaacagaagtaaaatctgttgaaatgcaccatgaagctttgagt gaagttcttcctggggacaatgtgggcttcaatgtcaagaatgtgtctgtcaaggatgtt cgtcgtggcaacgttgctggtgacagcaaaaaggacccaccaatggaagcagctggcttc actgctcaggtgattatcctgaaccatccaggccaaataagtgctggctatgcccctgta ttggattgccacacggctcacattgcatgcaagtttgctgagctgaaggaaaagactgat cgccattctggtaaaaagctggaagatggccctaaattcttgaagtctggtgatgctgcc attgttgatatggttcctggcaagcccgtgtatgttgagagcttctcagactatccacct ttgggtcgctttgctgttcatgatatcagacagacagttgcggtgggtgtcatcaaagca gtggacaagaaggctgctggagctggcaaggtcaccaagtctgcccagaaagctcagaag gctaaataa >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_7|205_aa MDTNMETIDTGYSKSKEGASPILEVRSASAWLPPHLGGEERLCLATVQSSKCEVTAFVQV YPTAPKRQRPSRMGHDDDGGFVEKKRGKCGEKKERSDCYCVCVESQKLLKLISHFIKDSG YKINVQKSLRFLYTNNSQAESQIINELPFAIATKKIIPRNTDNKPWRQEHIVCIGPYARM EMGIKESATPFKDKEHQRLMTGESL >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_7|618_bp atggacacgaatatggaaacaatagatactgggtactccaaaagtaaggagggagccagc cccatcttggaagtgaggagtgcctctgcctggctgccaccccatctgggaggtgaggag cgcctctgcctggccactgtgcaatcctccaagtgtgaagtgacagcctttgtgcaggtg tacccaacagctccgaagagacagcgaccatcgagaatgggccacgatgacgatggcggt tttgttgaaaagaaaagggggaaatgtggggaaaagaaagagagatcagattgttactgt gtctgtgtagaaagccaaaagcttcttaagctgataagccacttcatcaaagactcagga tacaaaataaacgtgcaaaagtccctaagattcctatataccaacaacagtcaagcagag agccaaatcatcaatgaacttccatttgcaattgccacaaagaagataatacctaggaat acagataacaaaccctggagacaagagcatatagtttgcattggaccatatgctagaatg gaaatggggataaaagaatcagcaaccccattcaaagataaggagcaccagagactgatg actggggaatcactgtga >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_8|228_aa MVLASAQLLGRPQETYNHGGRQRESWHFTWPEQEEESGEKVLHTFKQPHLMRTNSRHENS TEWNGVKPFTRNHLHDPITTHQAPPPTLGFTIQHEIWIGTQIQTISGSPAHPVRLSRREA AFRSPYSSTEPLCSPSESDSDRGTSGAGTQHLQKLSQELDEAIMAEESDDMTVSLIHHGG SALQFCSAGNKTCLTTEASYPRMSVLPMSFLEEPPRSHQYPWAMLTRT >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_8|687_bp atggtgctggcatctgctcagcttctggggaggcctcaggaaacttacaatcatggtgga aggcaaagggagagctggcacttcacatggccagagcaggaggaagagagtggtgagaag gtgctacacacttttaaacaaccacatctcatgagaacgaactcacgtcatgagaacagc actgagtggaatggtgttaaaccattcacgagaaaccacctccatgatccaatcaccacc caccaggctccacctccaacactggggtttacaattcaacatgagatttggatagggaca cagatccaaacaatatcaggctcacctgcccacccagtaaggttgtcgaggcgggaggct gccttccggagcccctactcctcaacagagcccctctgctctcccagtgagtctgacagt gaccgagggacgtcgggggcgggaactcagcatctccagaagctctcccaagagctggac gaagctattatggcggaagagagtgatgacatgaccgtctctctcattcaccacggagga agtgccctgcagttctgttctgcaggaaacaagacctgtctgaccaccgaggcttcatac ccaaggatgtctgtgcttccaatgagcttcctggaggaaccccccaggagtcatcagtac ccctgggccatgctaacaaggacctga >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_9|119_aa MEPKEATGKENMVTKKKNLAFLRSRLYMLERRKTDTVVESSVSGDHSGTLRRSQSDRTEY NQKLQGNQKEINCPSYCTPSSISATFPIAKKTHKAEGKIEYHLSFPQNKKMKVNVTYKL >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_9|360_bp atggagcccaaagaagccactgggaaagaaaacatggtcaccaagaaaaagaatctggcc ttcttgaggtctagactctatatgctggagagaaggaagactgacactgtggttgagagc agtgtttctggggaccactctggcaccttgaggaggagccaatctgacaggaccgaatac aaccagaaattacaaggtaaccaaaaagaaatcaattgtcccagttactgcactcccagc tccatttctgctacgtttccaattgcaaagaaaacacacaaggcagaggggaaaattgaa taccacctctccttcccacaaaataagaagatgaaagtaaatgtgacctataaattatag >gi568815594f:105384696_105586077|GENSCAN_predicted_peptide_10|135_aa MDSPVTRGKKEGVKLSLIAKKIERTEQIKGNSRGEISMNWPKSMHTSNHSLPLRAWDKKR SKQMLEPVLRPVEQKIPQSQKQSRSEVFKSKTTAHSGMMRVQTHLVLKAGEASSGGPHNK YRRKDLKCCPISTSK >gi568815594f:105384696_105586077|GENSCAN_predicted_CDS_10|408_bp atggacagtccagtcactaggggaaagaaggagggagttaagcttagtctaattgccaag aagatagaaagaacagagcagatcaagggaaattcaagaggtgaaattagcatgaactgg cctaagagcatgcacaccagcaaccacagtctgcccttgagagcatgggataagaagcgg tccaagcaaatgttggagccagtactgaggcctgttgaacagaaaattccacagtctcag aaacagtctagaagtgaagtttttaaatccaaaacgacagctcacagtgggatgatgaga gttcagacacaccttgtgctaaaagcaggagaggcaagttctggaggaccacataacaag taccgaagaaaagacttaaaatgctgcccaatatcaacatcaaaatga