GENSCAN 1.0    Date run: 8-Nov-116    Time: 04:04:03

Sequence gi568815584r:63276012_63639685 : 363674 bp : 41.04% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   2678   2717   40                              -3.65
 1.01 Init +   4795   5124  330  0  0   41   37   259 0.858  13.46
 1.02 Intr +   7110   7235  126  2  0   69   22   171 0.776   8.56
 1.03 Intr +   8446   8605  160  0  1   16   45   117 0.705  -1.16
 1.04 Intr +  10205  10380  176  0  2   31   92    88 0.160   2.24
 1.05 Term +  14867  15013  147  1  0   94   51   136 0.358   7.42
 1.06 PlyA +  17186  17191    6                               1.05

 2.07 PlyA -  18205  18200    6                               1.05
 2.06 Term -  20910  20888   23  1  2  105   43    16 0.098  -3.70
 2.05 Intr -  25939  25791  149  0  2   88   89    78 0.498   6.86
 2.04 Intr -  27938  27910   29  2  2   71   97    26 0.414  -2.20
 2.03 Intr -  33060  32946  115  2  1   84   86    99 0.603   8.83
 2.02 Intr -  37105  36933  173  1  2   43  -26   187 0.006   0.62
 2.01 Init -  41838  41635  204  0  0   84  105   284 0.970  26.60
 2.00 Prom -  62133  62094   40                              -5.35

 3.00 Prom +  62224  62263   40                              -6.95
 3.01 Init +  72411  72524  114  2  0   77   10   181 0.525   9.46
 3.02 Term +  90388  90456   69  0  0  119   41    81 0.546   3.46
 3.03 PlyA +  90878  90883    6                               1.05

 4.17 PlyA -  93285  93280    6                               1.05
 4.16 Term - 108371 108264  108  2  0  129   42   111 0.872   8.13
 4.15 Intr - 113720 113601  120  2  0   89   92    46 0.945   4.87
 4.14 Intr - 116014 115961   54  1  0   66  100    36 0.633   0.86
 4.13 Intr - 117917 117809  109  1  1  111   91    24 0.983   4.37
 4.12 Intr - 119274 119215   60  2  0  130  105    66 0.998   9.43
 4.11 Intr - 120705 120575  131  0  2   54  115   111 0.435   8.97
 4.10 Intr - 139221 139129   93  0  0   80  115    19 0.142   3.04
 4.09 Intr - 146083 145982  102  1  0  123   80    85 0.817  10.65
 4.08 Intr - 162293 162121  173  0  2   -7   41   162 0.368   0.74
 4.07 Intr - 163866 163835   32  2  2  127   63    49 0.481   3.16
 4.06 Intr - 164111 163990  122  2  2   87   54    90 0.525   3.87
 4.05 Intr - 177959 177678  282  2  0   75  101   157 0.217  12.39
 4.04 Intr - 198332 198245   88  1  1  100    8    68 0.000  -0.95
 4.03 Intr - 207689 207513  177  1  0   69   37   180 0.024   9.11
 4.02 Intr - 222494 222345  150  1  0   77   88    28 0.020   0.06
 4.01 Init - 235383 235346   38  0  2   62   59    46 0.027  -1.33
 4.00 Prom - 239475 239436   40                              -4.75

 5.00 Prom + 252319 252358   40                              -1.45
 5.01 Sngl + 266796 267212  417  2  0   72   49   375 0.860  26.05
 5.02 PlyA + 268612 268617    6                               1.05

 6.00 Prom + 279077 279116   40                              -6.15
 6.01 Init + 284505 284530   26  2  2   77   58    43 0.237  -0.75
 6.02 Intr + 287436 287551  116  0  2   21   91   115 0.649   4.27
 6.03 Intr + 291871 292247  377  2  2   69   65   329 0.481  22.51
 6.04 Term + 300376 300474   99  0  0   93   32    84 0.428   0.45
 6.05 PlyA + 300910 300915    6                               1.05

 7.03 PlyA - 303235 303230    6                               1.05
 7.02 Term - 318971 318347  625  1  1  -11   36   760 0.249  53.77
 7.01 Init - 319263 319052  212  0  2   66   -3   318 0.999  16.81
 7.00 Prom - 321372 321333   40                              -6.65

 8.02 PlyA - 321658 321653    6                               1.05
 8.01 Sngl - 323931 322768 1164  0  0   70   44   806 0.999  70.28
 8.00 Prom - 333341 333302   40                              -5.45

 9.00 Prom + 337125 337164   40                              -6.45
 9.01 Sngl + 343040 343237  198  1  0   78   49   184 0.930   8.42
 9.02 PlyA + 344125 344130    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  37105  36917  189  1  0   43   54   200 0.921   8.57
S.002 Sngl + 325498 325719  222  0  0   86   38   275 0.989  17.20

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_1|312_aa
MAPLVLVEKELPDSRVEEETGSQCAVVDSYLKSDLGTTWDTPYTGCTNPCLLFPQEDYNQ
LRPLSYPNTDVFLICFSVVNPASYHNVQEEWVPELKDCMPHVPYVLIGTQIDLRDDPKTL
ARLLYMKEKPLTYEHGVKLAKAVQSDLNFILNLTDEETVLEDKHIANNKVKNFQSSSVDI
TTRVPDRHVNDLQLCIFPWVKWVYAFTSPLVSGLQRPEEEGDRQTKACHLDKHSNTHLGN
VPGSTEERETNTFTEGGFRDFFNQIGAQCYLECSALTQKGLKAVFDEAILTIFHPKKKKK
RCSEGHSCCSII

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_1|939_bp
atggctcctttagttctagtagagaaagaactaccagattccagggttgaggaagaaacg
ggatcccagtgcgctgtagtggactcttacctgaaatctgaccttggaactacatgggac
acaccttacacaggctgtacaaacccttgtctgctcttcccacaggaggactacaaccag
ctgaggccactctcctaccccaacacggatgtgtttttgatctgcttctctgtcgtaaac
cctgcctcttaccacaatgtccaggaggaatgggtccccgagctcaaggactgcatgcct
cacgtgccttatgtcctcatagggacccagattgatctccgtgatgacccaaaaaccttg
gcccgtttgctgtatatgaaagagaaacctctcacttacgagcatggtgtgaagctcgca
aaagcggtacagtcagatttgaatttcattttaaatcttacagatgaggaaacagtctta
gaagacaagcacatagctaataacaaagtcaaaaacttccaatccagctctgtggacatc
accactcgtgtacctgaccgtcacgtaaatgacctccaactgtgtatctttccttgggtg
aaatgggtttatgccttcacttcaccactggtgtctgggttacaaaggccagaagaggag
ggagacaggcaaacaaaagcatgccacctggataaacacagtaacacacatctgggcaat
gtaccaggtagcacagaagagagagagactaatacattcacggaaggaggtttcagagat
ttctttaaccagatcggagcacagtgctacttggaatgttcagctctgactcagaaaggt
ctcaaagcggtttttgatgaagcaatcctcaccattttccaccccaagaaaaagaagaaa
cgctgttctgagggtcacagctgctgttcaattatctga

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_2|230_aa
MKLAFLFLGPMALLLLAGYGCVLGASSGNLRTFVGCAVREFTFLAKKPGCRGLRITTDAC
WGRCETWEKPILEPPYIEAHHRVCTYNETKQVTVKLPNCAPGVDPFYTYPVAIRCDCGAC
STATTELRLMPGEAAVALGFWCQRRRQGSRTTGTRWRHAAVRDKVSLLKAVDGWNGLLGD
PASSQGLSASSCTPVFPLAFQIDSASGKVGNFSSKQTFIFSSAEITLGGT

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_2|693_bp
atgaagctggcattcctcttccttggccccatggccctcctccttctggctggctatggc
tgtgtcctcggtgcctccagtgggaacctgcgcacctttgtgggctgtgccgtgagggag
tttactttcctggccaagaagccaggctgcaggggccttcggatcaccacggatgcctgc
tggggtcgctgtgagacctgggagaaacccattctggaacccccctatattgaagcccat
catcgagtctgtacctacaacgagaccaaacaggtgactgtcaagctgcccaactgtgcc
ccgggagtcgaccccttctacacctatcccgtggccatccgctgtgactgcggagcctgc
tccactgccaccacggagctgaggttgatgccaggggaagctgctgtggcactgggcttc
tggtgtcagcgtaggagacagggatctaggacaacagggaccaggtggcgacatgcagct
gtgagagacaaggtgagtctcctgaaggcagtagatggttggaatgggctgcttggggac
ccagcgagctcccagggcctttctgcttcttcctgtacccctgtatttcccttggctttc
caaattgactcagcttctggtaaagttggaaacttttccagcaaacagaccttcatcttc
tccagtgcagagattacattaggaggaacatga

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_3|60_aa
MVPGAVAAILQAGGYKPVNENQCDEDSRVVTVKSLALWPTQREDDKDEGLYANPFLLSEQ

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_3|183_bp
atggtgcccggagctgtggcagccatcttgcaagcaggaggatacaagcctgtgaatgaa
aatcagtgcgacgaggacagcagagtggtgacagtaaagtccctggccctttggcctact
caacgtgaagatgacaaagatgaaggcctttatgctaatccatttctacttagtgaacag
taa

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_4|612_aa
MTVGKGYRDAWAFDINEDPRHLKTFQRFIHTKLEKHPFLKPCQAKHDVDLKRKKEKKRND
VKSVAPVDPPFMELPAGAGSVLPWTPQQKMEALGSEPAQLFSSGALFPPPSSWGAACREC
AVCSNQTTRLPPTERGEVGQVPRVTFDPSLSLAFEVTFTENSCYLEIDDRLTPFPFFFLD
VPSSEQPELFLKKLQQCCVIFDFMDTLSDLKMKEYKRSTLNELVDYITISRGCLTEQTYP
EVVRMKFQKSGSSARWCSLCGNIGVAIPYLCIIEGPHLAALFVLNWQAMRAAKESLGDTE
MKERDLLADCLEEAGYTTVVHDEQEKAEKMSVLNVEFTETGVPEARIWQADNEQVSCNIF
RTLPPSDSNEFDPEEDEPTLEASWPHLQLVYEFFIRFLESQEFQPSIAKKYIDQKFVLQL
LELFDSEDPRERDYLKTVLHRIYGKFLGLRAFIRKQINNIFLRFVYETEHFNGVAELLEI
LGSIINGFALPLKAEHKQFLVKVLIPLHTVRSLSLFHAQLAYCIVQFLEKDPSLTEPVMF
LGELEEILDVIEPSQFVKIQEPLFKQIAKCVSSPHFQPYYVPDAVLVAEDVCEKIKSLPS
GDSLTGENKQIL

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_4|1839_bp
atgacggttggcaagggctacagggatgcctgggcttttgatattaatgaagatccaaga
cacttaaaaacattccaaaggttcatacacacaaaacttgaaaaacatccttttctaaaa
ccatgtcaagcaaagcatgatgtagacttaaaaagaaaaaaagaaaagaaaagaaatgat
gtcaagagcgtggcccctgtggacccacctttcatggagctgcctgcaggagctggtagt
gtgctcccttggaccccacagcagaaaatggaagctctcggatctgaacctgcccagctt
ttctctagtggcgccctctttcctcctcccagttcatggggagctgcctgccgtgaatgc
gctgtgtgctcaaaccagacaacccggctcccacccactgaacgtggagaagtggggcaa
gtgccaagagtcacctttgatccctctctgtctcttgcctttgaagtaactttcacagag
aattcctgttacctggaaattgatgacagacttacaccatttcctttttttttcttagac
gttccatcctcagagcagcctgaactgttcctaaagaaacttcagcagtgctgtgtcatt
tttgacttcatggacacgctatctgatcttaaaatgaaagaatacaagcgctccactctt
aatgaactggtggactacattacaataagcagaggctgtttgacagagcagacttaccct
gaagtagttagaatgaagttccaaaaatcaggctcctctgcccgttggtgctccttatgt
ggtaacattggcgtggcgataccttacctgtgcatcatagaggggccccaccttgcagcc
ctgtttgtgttgaattggcaagctatgcgagctgccaaggaatccttaggagacacagaa
atgaaagagagggatttattggctgattgtttggaggaagctggttacaccacagtggtg
catgatgagcaggaaaaagctgagaaaatgagtgtcctcaatgtggaattcacagagaca
ggagtgcctgaggcaagaatctggcaagcagacaatgagcaggtatcttgcaatatattc
agaactctccctcctagtgacagcaatgaatttgatccagaagaagatgaacctaccctt
gaggcatcgtggccacacttacagcttgtatatgaatttttcatacgatttttggaaagc
caagaattccaacccagcattgccaaaaaatatatagatcagaaatttgtattacagctt
ctggagctatttgacagcgaagaccctcgggaacgggactacttaaaaacagtcttacac
agaatttatggcaagtttcttggtcttagagcatttatccgaaaacagattaacaatatt
tttctaaggtttgtttatgaaacagaacacttcaatggtgtagctgaactgctggaaata
ttaggaagtattatcaatggctttgctttacctcttaaggcagaacacaaacagtttctg
gtgaaagtattgatccctttacacactgtcaggagcttatcactcttccatgcacagctg
gcatattgtatagtacagtttctggagaaagatccttcactcacagaaccagtcatgttc
cttggggaactggaagaaatattggatgtgattgaaccttcacaatttgttaaaatccaa
gaacctttgtttaaacaaatcgccaagtgtgtatctagcccccattttcagccttattat
gtgccagatgctgtccttgttgctgaagacgtatgtgaaaagataaaatccttaccttct
ggggattcactgactggagaaaataaacagatactttaa

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_5|138_aa
MICGLEGRETGEVAAAGAGGGTGTCRGRSGQLRGAWGDGCPVRGPSGSAAAEPLGLTAGP
EPQRSGGGGGLTILPRGPVPSNRHRPRCGSAGGSSLTFLAGGASARCPGPGGGGDDVGRG
GREGGRLDLQRQQEKGTE

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_5|417_bp
atgatctgtggcttggaggggcgggagacgggcgaggtggccgcagcgggggcgggcggc
gggaccgggacgtgcagggggaggtccgggcagctgcggggagcctggggcgacggctgt
ccggtacggggtccctccggttccgcggcggcggagcctctaggcctcacggccggaccc
gaacctcagcgttccggtggcggcggcggcctcacgatcctcccccgcgggcccgtccca
tcaaatcggcaccgaccccgttgcggctccgccggtggctcctcgctcacattcctggcg
ggcggcgcctcagctcgttgccccggacccggcggcggcggcgacgacgtggggaggggg
ggaagggagggaggaagactggatctgcagcggcagcaagagaaggggacagaataa

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_6|205_aa
MLPVQPAEPTLKTLTKIVKINFFGTLYNNKRFAASPGAFIREKWLNLVFQEIPSGNRCFL
SPTICSSLAFQLGPGRIALAKKDGKKKSLSAISIHKRIHGVGFKKCAPRALEEIQKCAMK
EMGTPDVHIDTRLNKAVWAKGIRIVPYRIPVWLSRKRNEDENSTNKLYTLVTHEPHYLAM
TDKDRVRVLNLDINLAMQMYSLIDA

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_6|618_bp
atgcttcctgtacaacctgcagaaccaacactgaaaacactgacaaaaattgtcaaaatc
aatttctttggaacactgtacaataacaaacgttttgcagcaagtccaggagcatttatt
cgagaaaaatggctgaatcttgtgtttcaagaaataccttcagggaaccgctgctttctt
tctcccacaatctgtagttctcttgctttccaacttgggcccggcaggatcgctcttgca
aagaaggatggcaagaagaagagcctttctgccatcagcattcacaagcgcatccatggt
gtgggcttcaagaagtgtgcccctcgggcactcgaagagatccaaaaatgtgccatgaag
gagatgggcactccagatgtgcacattgatactaggctcaacaaagctgtttgggccaaa
ggaataaggattgtcccataccgtatccctgtgtggttgtccagaaaacgtaacgaggat
gaaaattcaacaaacaagctctatactttggttacccatgagcctcattacttggccatg
actgataaggaccgagttagagtgctgaaccttgacattaatctggccatgcaaatgtat
tcattaatagatgcttaa

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_7|278_aa
MWAGNAWRAALSGVPCGRSAQSVLAQLRGILEGELEGIRGAGTWKSERVITSRQGPHIHV
DGVSGGILNLTSVRFIRGTQSIHKNLEAKIARFHQREDAILYPSCCDANAGLFEVLLRPE
DAVLSDELNCASIIHGICLCKAHKYHYCHLDVAYLETKLQEAQKHRLFLVATDGAFSMDG
DIVPLQKICRLASRYGALVFVDECHATGFLGLTGQGTDELLGVMGQVTIINSTLGKALGG
ASGGYTTGPGPLVSLLLPSPISSPTVCHLLLLAAPLRP

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_7|837_bp
atgtgggctggaaacgcctggcgcgccgcgctttccggggtgccgtgcggccgcagcgcg
cagtcagtactggcccagctgcgcggcattctggagggggagctggaagggatccgcgga
gctggcacctggaagagtgagcgggtcatcacgtcccgtcaggggccgcacatccacgtg
gacggcgtctccggaggaatcctcaacttaacctcggtccgcttcatccgtggaacccaa
agcatccacaagaatctagaagcaaaaatagcccgcttccaccagcgggaggatgccatc
ctctatcccagctgttgtgacgccaacgccggcctctttgaggtcctgctgagacccgag
gacgcagtcctgtcggacgagctgaactgtgcctccatcatccacggcatctgcctgtgc
aaggcccacaagtaccactattgccacctggacgtggcctacctagaaaccaagcttcag
gaggcccagaagcatcggctgttcctggtggccaccgatggggccttttccatggatggc
gacatcgtgcccctgcagaagatctgccgcctcgcctctagatatggcgccctggtcttt
gtggatgaatgccatgccactggtttcctgggactcacaggacagggcacagatgagctg
ctgggtgtgatgggccaggtcaccatcatcaactccaccctggggaaggccctgggtggt
gcatcagggggctacacgacagggcctgggcccctggtgtcactgctgctgcccagccct
atctcttctccaacagtctgccacctgctgttgttggctgcacctctaaggccctag

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_8|387_aa
MEKIEEQFANLHIVKCSLGTKEPTYLLGIDTSKTVQAGKENLVAVLCSNGSIRIYDKERL
NVLREFSGYPGLLNGVRFANSCDSVYSACTDGTVKCWDARVAREKPVQLFKGYPSNIFIS
FDINCNDHIICAGTEKVDDDALLVFWDARMNSQNLSTTKDSLGAYSETHSDDVTQVRFHP
SNPNMVVSGSSDGLVNVFDINIDNEEDALVTTCNSISSVSCIGWSGKGYKQIYCMTHDEG
FYWWDLNHLDTDEPVTRLNIQDVREVVNMKEDALDYLIGGLYHEKTDTLHVIGGTNKGRI
HLMNCSMSGLTHVTSLQGGHAATVRSFCWNVQDDSLLTGGEDAQLLLWKPGAIEKTFTKK
ESMKIASSVHQRVRVHSNDSYKRRKKQ

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_8|1164_bp
atggaaaagattgaggaacaatttgctaatctgcacattgttaaatgttccttaggaacc
aaagagcccacttaccttcttggtatagacacatcaaagactgtccaagcaggaaaggaa
aacttggttgctgttttatgttctaatggatcaatcagaatatatgataaagaaaggtta
aatgtactacgagaatttagtggatatcctggacttcttaatggagtcagatttgcaaat
tcctgtgacagtgtatattcagcatgtactgatggcactgtgaaatgctgggatgctcga
gtagccagagaaaaacctgttcagctcttcaagggttacccttccaatatttttatcagt
tttgatattaattgtaatgatcatattatttgtgctggtacagaaaaagttgatgatgat
gcattgttggtgttttgggatgcaaggatgaattctcagaatttatctacaactaaagac
tcacttggtgcatattcagagacacatagtgatgatgtcactcaagtacgtttccatccc
agcaatcccaacatggtagtctcaggttcatctgatggcctggtaaatgtatttgatatt
aatattgataatgaggaggatgcactggttacaacctgtaactcaatttcatcagtaagc
tgtattggttggtctgggaaaggttataaacagatttactgcatgacacatgatgaagga
ttttattggtgggatcttaatcatctggacactgatgaaccagttacacgtttgaacatc
caggatgtcagagaagtagttaacatgaaagaagatgctttggactatttgattggtggc
ctatatcatgaaaagacagacacattgcatgttattggaggaacaaacaaaggaaggatt
catttgatgaactgcagcatgtcaggactgacccatgtgactagccttcagggagggcat
gctgctacagtccgttctttctgttggaatgtgcaagatgattctttgttgactggagga
gaagatgcacagttgttactttggaaacctggagctatagagaagacctttacaaagaaa
gagagtatgaaaatagcatcctctgtgcaccaacgagtacgagttcatagtaatgattct
tataaaagaaggaaaaagcagtga

>gi568815584r:63276012_63639685|GENSCAN_predicted_peptide_9|65_aa
MERMPPYFCQALRCGKSTNGRQVVGKSMERNLLLGPAGTEWSSNKEKTLQHPRPDAKHKI
VTAHS

>gi568815584r:63276012_63639685|GENSCAN_predicted_CDS_9|198_bp
atggagcgaatgcctccctacttctgccaagccttacgctgtggaaagagcaccaacggt
cggcaggtggtggggaagagcatggagagaaacctcctgttggggcctgctgggactgag
tggagcagcaacaaggagaaaactctccagcatcccaggcctgacgctaagcacaagata
gtaacagctcatagctga